This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/
-
include/llvm/IR/
-
llvm/
-
IR/
-
ConstantRange.h
-
lib/IR/
-
IR/
2/6
ConstantRange.cpp
-
test/Analysis/LazyValueAnalysis/
-
Analysis/
-
LazyValueAnalysis/
1/9
lvi-for-ctlz.ll
-
unittests/IR/
-
IR/
-
ConstantRangeTest.cpp

Differential D142234

[ConstantRange] Handle `Intrinsic::ctlz`
ClosedPublic

Authored by antoniofrighetto on Jan 20 2023, 9:36 AM.

Download Raw Diff

Details

Reviewers

nikic
reames

Commits

rG65898e526060: [ConstantRange] Handle `Intrinsic::ctlz`

Summary

Introduce support for ctlz intrinsic, including exhaustive testing. Among other things, LVI may now be able to propagate information about constant ranges lattice values. This change makes sure to provide a precise result where possible, or a more conservative one (e.g., should the first argument value be zero and is_zero_poison true), but always correct.

Diff Detail

Event Timeline

antoniofrighetto created this revision.Jan 20 2023, 9:36 AM

Herald added a project: Restricted Project. · View Herald TranscriptJan 20 2023, 9:36 AM

Herald added subscribers: StephenFan, hiraditya. · View Herald Transcript

antoniofrighetto requested review of this revision.Jan 20 2023, 9:36 AM

Herald added a project: Restricted Project. · View Herald TranscriptJan 20 2023, 9:36 AM

Herald added a subscriber: llvm-commits. · View Herald Transcript

arsenm added a subscriber: arsenm.Jan 20 2023, 9:39 AM

arsenm added inline comments.

llvm/test/Analysis/LazyValueAnalysis/lvi-for-ctlz.ll
33	Test looks pretty small for all the different paths in the patch. You don't test the i1 false case for example

antoniofrighetto added inline comments.Jan 20 2023, 9:41 AM

llvm/test/Analysis/LazyValueAnalysis/lvi-for-ctlz.ll
33	For the second argument `i1 false` the result should always be more precise than the current one, but I'll make sure to test it soon as well.

This needs to be implemented inside ConstantRange and tested exhaustively inside ConstantRangeTest. See the handling of Intrinsic::abs as a possible sample to follow (it also has a semantics-controlling argument).

This revision now requires changes to proceed.Jan 20 2023, 9:45 AM

Harbormaster completed remote builds in B209020: Diff 490893.Jan 20 2023, 10:56 AM

Moved the implementation in ConstantRange.cpp, and tested exhaustively. For the sake of completeness and rigor, I left the test in LVI, further improved: it shouldn’t harm, and there are similar unit tests in ConstantRangeTest.cpp which are also in test/Analysis.

Harbormaster completed remote builds in B209837: Diff 492056.Jan 25 2023, 4:46 AM

antoniofrighetto updated this revision to Diff 492086.Jan 25 2023, 6:25 AM

antoniofrighetto edited the summary of this revision. (Show Details)

Harbormaster completed remote builds in B209852: Diff 492086.Jan 25 2023, 8:28 AM

antoniofrighetto updated this revision to Diff 492168.Jan 25 2023, 10:16 AM

antoniofrighetto updated this revision to Diff 492170.Jan 25 2023, 10:19 AM

Harbormaster completed remote builds in B209914: Diff 492170.Jan 25 2023, 12:26 PM

The unit test currently fails both for correctness and optimality checks:

/var/lib/buildkite-agent/builds/llvm-project/llvm/unittests/IR/ConstantRangeTest.cpp:137
Value of: rangeContains(CR, APInt(BitWidth, Elem), Inputs)
  Actual: false (empty-set does not contain 0 for inputs: full-set, )
Expected: true
/var/lib/buildkite-agent/builds/llvm-project/llvm/unittests/IR/ConstantRangeTest.cpp:137
Value of: rangeContains(CR, APInt(BitWidth, Elem), Inputs)
  Actual: false (empty-set does not contain -1 for inputs: full-set, )
Expected: true
/var/lib/buildkite-agent/builds/llvm-project/llvm/unittests/IR/ConstantRangeTest.cpp:145
Value of: CR.isEmptySet()
  Actual: false
Expected: true
/var/lib/buildkite-agent/builds/llvm-project/llvm/unittests/IR/ConstantRangeTest.cpp:177
Value of: NotPreferred(PossibleCR)
  Actual: false (Inputs = full-set, CR = full-set, BetterCR = [0,-1))
Expected: true
/var/lib/buildkite-agent/builds/llvm-project/llvm/unittests/IR/ConstantRangeTest.cpp:145
Value of: CR.isEmptySet()
  Actual: false
Expected: true
/var/lib/buildkite-agent/builds/llvm-project/llvm/unittests/IR/ConstantRangeTest.cpp:177
Value of: NotPreferred(PossibleCR)
  Actual: false (Inputs = [0,2), CR = full-set, BetterCR = [3,4))
Expected: true
/var/lib/buildkite-agent/builds/llvm-project/llvm/unittests/IR/ConstantRangeTest.cpp:177
Value of: NotPreferred(PossibleCR)
  Actual: false (Inputs = [0,3), CR = full-set, BetterCR = [2,4))
Expected: true
/var/lib/buildkite-agent/builds/llvm-project/llvm/unittests/IR/ConstantRangeTest.cpp:177
Value of: NotPreferred(PossibleCR)
  Actual: false (Inputs = [0,4), CR = full-set, BetterCR = [2,4))
Expected: true
/var/lib/buildkite-agent/builds/llvm-project/llvm/unittests/IR/ConstantRangeTest.cpp:177
Value of: NotPreferred(PossibleCR)
  Actual: false (Inputs = [0,5), CR = full-set, BetterCR = [1,4))
Expected: true
[...]

You can run the unit tests using ninja -Cbuild check-llvm-unit. Or ninja -Cbuild IRTests && build/unittests/IR/IRTests to run just this test suite.

llvm/include/llvm/ADT/APInt.h
1741 ↗	(On Diff #492170)	I'd rather not have this API on APInt. Getting leading zeros as an APInt is pretty unusual.
llvm/lib/IR/ConstantRange.cpp
1684	Constant ranges are understood modulo poison. The value is in the range or it is poison. As such, if ZeroIsPoison is set, we should return a better range that doesn't have to account for zero input, not a worse range.
1700	Can Lower be larger than Upper for a non-wrapped set?
llvm/test/Analysis/LazyValueAnalysis/lvi-for-ctlz.ll
2	Having an IR test is fine, but please do not test LVI debug output. Just check the resulting IR change using update_test_checks.py.

This revision now requires changes to proceed.Jan 26 2023, 4:08 AM

@nikic, thank you, I'm having a look at these now. Just a few considerations.

I'm not entirely sure about the unit test failures: should the returned range simply not include zero, if the incoming interval may contain zero and ZeroIsPoison is true, as you noted? I reckoned that being more conservative (unknown/overdefined) could be better here. Same applies for isEmptySet, we may want to return overdefined, I guess (I'm looking at Value of: CR.isEmptySet(), Actual: false, Expected: true)?.
If we don't want to have ctlz API on APInt, could it be fine to overload TestUnaryOpExhaustive so that it accepts a callable reference to std::optional<unsigned>(const APInt &) too?
range.ll now fails as the metadata's range is no longer being used now. I'm gradually realizing that this needs to be handled properly in CVP as well. Could it be fine to revert the ret i1 true to this change, considering that the range is now [0,16] (verification here) for this time, and postpone the handling in CVP for a separate dedicated patch?

In D142234#4082670, @antoniofrighetto wrote:

@nikic, thank you, I'm having a look at these now. Just a few considerations.

I'm not entirely sure about the unit test failures: should the returned range simply not include zero, if the incoming interval may contain zero and ZeroIsPoison is true, as you noted? I reckoned that being more conservative (unknown/overdefined) could be better here. Same applies for isEmptySet, we may want to return overdefined, I guess (I'm looking at Value of: CR.isEmptySet(), Actual: false, Expected: true)?.

It's not necessary to make any conservative choices here. If ZeroIsPoison is set, you can assume that the input range does not contain zero, e.g. by intersecting it out.

If we don't want to have ctlz API on APInt, could it be fine to overload TestUnaryOpExhaustive so that it accepts a callable reference to std::optional<unsigned>(const APInt &) too?

I think we should just create the APInt in the two places where it's needed.

range.ll now fails as the metadata's range is no longer being used now. I'm gradually realizing that this needs to be handled properly in CVP as well. Could it be fine to revert the ret i1 true to this change, considering that the range is now [0,16] (verification here) for this time, and postpone the handling in CVP for a separate dedicated patch?

This issue should already be fixed by https://github.com/llvm/llvm-project/commit/2e9bc1b8614c9422573cf2f4728525787b0cb0cb.

antoniofrighetto updated this revision to Diff 493846.Jan 31 2023, 11:53 PM

antoniofrighetto marked an inline comment as done.

antoniofrighetto added inline comments.

llvm/lib/IR/ConstantRange.cpp
1700	It seems it is possible with negative numbers as well (`[0,-7)`), non-wrapped set.

antoniofrighetto marked an inline comment as not done.Feb 1 2023, 12:00 AM

antoniofrighetto retitled this revision from [LVI] Handle Intrinsic::ctlz to [ConstantRange] Handle Intrinsic::ctlz.Feb 1 2023, 1:06 AM

Harbormaster completed remote builds in B211157: Diff 493846.Feb 1 2023, 1:12 AM

This looks on the right track now.

llvm/lib/IR/ConstantRange.cpp

1737

This looks basically fine, but a bit overly complicated. This should be sufficient:

ConstantRange ConstantRange::ctlz(bool ZeroIsPoison) const {
  if (isEmptySet())
    return getEmpty();

  APInt Zero = APInt::getZero(getBitWidth());
  if (ZeroIsPoison && contains(Zero)) {
    // ZeroIsPoison is set, and zero is contained. We discern three cases, in
    // which a zero can appear:
    // 1) Lower is zero, handling cases of kind [0, 1), [0, 2), etc.
    // 2) Upper is zero, wrapped set, handling cases of kind [3, 0], etc.
    // 3) Zero contained in a wrapped set, e.g., [3, 2), [3, 1), etc.
    
    if (getLower().isZero()) {
      if ((getUpper() - 1).isZero()) {
        // We have in input interval of kind [0, 1). In this case we cannot
        // really help but return empty-set.
        return getEmpty();
      }

      // Compute the resulting range by excluding zero from Lower.
      return ConstantRange(
          APInt(getBitWidth(), (getUpper() - 1).countLeadingZeros()), 
          APInt(getBitWidth(), (getLower() + 1).countLeadingZeros() + 1));
    } else if ((getUpper() - 1).isZero()) {
      // Compute the resulting range by excluding zero from Upper.
      return ConstantRange(
          Zero, APInt(getBitWidth(), getLower().countLeadingZeros() + 1));
    } else { 
      return ConstantRange(Zero, APInt(getBitWidth(), getBitWidth()));
    }   
  } 
  
  // Zero is either safe or not in the range. The output range is composed by
  // the result of countLeadingZero of the two extremes.
  return getNonEmpty(
      APInt(getBitWidth(), getUnsignedMax().countLeadingZeros()), 
      APInt(getBitWidth(), getUnsignedMin().countLeadingZeros() + 1));
}

llvm/test/Analysis/LazyValueAnalysis/lvi-for-ctlz.ll

I don't think that as written, these tests really test anything. There needs to be a comparison involving the ctlz that can be folded away, or similar.

This revision now requires changes to proceed.Feb 1 2023, 2:50 AM

antoniofrighetto updated this revision to Diff 494409.Feb 2 2023, 12:32 PM

antoniofrighetto marked an inline comment as done.

antoniofrighetto added inline comments.Feb 2 2023, 12:39 PM

llvm/test/Analysis/LazyValueAnalysis/lvi-for-ctlz.ll
2	I feel like a suitable test could be the following one: int lol(int b) { if (b < 65536) { int n = __builtin_clz(b); if (n < 8) return 0; else return 1; } return 2; } which now could be simplified in a `return (b < 65536) ? 1 : 2;`. However, as far as I understand this, CVP needs to be extended as well in order to be able to obtain the above constant folding. Is that correct?

antoniofrighetto marked an inline comment as not done.Feb 2 2023, 12:40 PM

Harbormaster completed remote builds in B211561: Diff 494409.Feb 2 2023, 2:04 PM

StephenFan added inline comments.Feb 2 2023, 11:27 PM

llvm/lib/IR/ConstantRange.cpp
1686	IIUC, is `[3, 0]` the same as the `[3, 1)` in case 3?
llvm/test/Analysis/LazyValueAnalysis/lvi-for-ctlz.ll
2	Is the simplification to `return (b < 65536) ? 1:2;` correct? Since if `b` is negative, `clz` returns 0.

antoniofrighetto added inline comments.Feb 2 2023, 11:44 PM

llvm/lib/IR/ConstantRange.cpp
1686	Correct.
llvm/test/Analysis/LazyValueAnalysis/lvi-for-ctlz.ll
2	True, sorry, I originally intended the first argument to be `unsigned`, whose optimization seems to occur successfully in this case.

antoniofrighetto added inline comments.Feb 7 2023, 1:12 AM

llvm/test/Analysis/LazyValueAnalysis/lvi-for-ctlz.ll
2	@nikic, perhaps a better example could be the following one? That would still require a change in CVP if I'm not mistaken though, correct? int test_ctlz(int b) { if (b < 65536) { int n = __builtin_clz(b); if (n < 8 && n > 2) return 0; else return 1; } return 2; }

nikic added inline comments.Feb 15 2023, 2:34 AM

llvm/test/Analysis/LazyValueAnalysis/lvi-for-ctlz.ll
2	Wouldn't something like this work as a test? int test_ctlz(unsigned b) { if (b < 65536) { unsigned n = __builtin_clz(b); return n >= 16; // Fold to 1 } return 2; }

Can you please rebase over https://github.com/llvm/llvm-project/commit/14fcdd7f9d7b3973661efc5a426da18e077155bf? I think that should work as a CVP test for this functionality, if I got the numbers right.

antoniofrighetto updated this revision to Diff 497611.Feb 15 2023, 2:59 AM

antoniofrighetto retitled this revision from [ConstantRange] Handle Intrinsic::ctlz to [ConstantRange] Handle `Intrinsic::ctlz`.

@nikic, rebased and removed the old test. Might be wrong, but it seems like O2 is already handling the test case you suggested; whereas the latter one I previously suggested (the one with if (n < 8 && n > 2) return 0;, which can never happen) does not seem to be reduced to a icmp, select, ret as of now.

Harbormaster completed remote builds in B213850: Diff 497611.Feb 15 2023, 3:48 AM

Can you please rerun update_test_checks.py on the CorrelatedPropagation/range.ll test? There should be some diffs there.

In D142234#4128565, @antoniofrighetto wrote:

@nikic, rebased and removed the old test. Might be wrong, but it seems like O2 is already handling the test case you suggested; whereas the latter one I previously suggested (the one with if (n < 8 && n > 2) return 0;, which can never happen) does not seem to be reduced to a icmp, select, ret as of now.

Right. It's not really important that individual pass tests aren't folded by -O2. One construct more complex cases that LVI can handle that InstCombine can't. It's just more straightforward to test the basic case.

antoniofrighetto updated this revision to Diff 497674.Feb 15 2023, 7:43 AM

Harbormaster completed remote builds in B213896: Diff 497674.Feb 15 2023, 8:33 AM

nikic added inline comments.Feb 16 2023, 2:48 AM

llvm/test/Transforms/CorrelatedValuePropagation/range.ll
954 ↗	(On Diff #497674)	Did you use the right build to produce this? This is not the expected diff, and it still fails in pre-merge checks.

antoniofrighetto updated this revision to Diff 497952.Feb 16 2023, 3:17 AM

LGTM

This revision is now accepted and ready to land.Feb 16 2023, 3:19 AM

antoniofrighetto added inline comments.Feb 16 2023, 3:20 AM

llvm/test/Transforms/CorrelatedValuePropagation/range.ll
954 ↗	(On Diff #497674)	Sorry, my bad, confused builds and must have accidentally passed the wrong `--opt-bin` path. Checked twice, we should be good now.

Harbormaster completed remote builds in B214113: Diff 497952.Feb 16 2023, 5:31 AM

Closed by commit rG65898e526060: [ConstantRange] Handle `Intrinsic::ctlz` (authored by antoniofrighetto, committed by nikic). · Explain WhyFeb 17 2023, 12:57 AM

This revision was automatically updated to reflect the committed changes.

nikic added a commit: rG65898e526060: [ConstantRange] Handle `Intrinsic::ctlz`.

Revision Contents

Path

Size

llvm/

include/

llvm/

IR/

ConstantRange.h

5 lines

lib/

IR/

ConstantRange.cpp

46 lines

test/

Analysis/

LazyValueAnalysis/

lvi-for-ctlz.ll

108 lines

unittests/

IR/

ConstantRangeTest.cpp

15 lines

Diff 494409

llvm/include/llvm/IR/ConstantRange.h

Show First 20 Lines • Show All 520 Lines • ▼ Show 20 Lines	public:
/// Return a new range that is the logical not of the current set.		/// Return a new range that is the logical not of the current set.
ConstantRange inverse() const;		ConstantRange inverse() const;

/// Calculate absolute value range. If the original range contains signed		/// Calculate absolute value range. If the original range contains signed
/// min, then the resulting range will contain signed min if and only if		/// min, then the resulting range will contain signed min if and only if
/// \p IntMinIsPoison is false.		/// \p IntMinIsPoison is false.
ConstantRange abs(bool IntMinIsPoison = false) const;		ConstantRange abs(bool IntMinIsPoison = false) const;

		/// Calculate absolute value range. If the original range contains zero, then
		/// the resulting range will be bounded if and only if \p ZeroIsPoison is
		/// false.
		ConstantRange ctlz(bool ZeroIsPoison = false) const;

/// Represents whether an operation on the given constant range is known to		/// Represents whether an operation on the given constant range is known to
/// always or never overflow.		/// always or never overflow.
enum class OverflowResult {		enum class OverflowResult {
/// Always overflows in the direction of signed/unsigned min value.		/// Always overflows in the direction of signed/unsigned min value.
AlwaysOverflowsLow,		AlwaysOverflowsLow,
/// Always overflows in the direction of signed/unsigned max value.		/// Always overflows in the direction of signed/unsigned max value.
AlwaysOverflowsHigh,		AlwaysOverflowsHigh,
/// May or may not overflow.		/// May or may not overflow.
▲ Show 20 Lines • Show All 43 Lines • Show Last 20 Lines

llvm/lib/IR/ConstantRange.cpp

Show First 20 Lines • Show All 939 Lines • ▼ Show 20 Lines	bool ConstantRange::isIntrinsicSupported(Intrinsic::ID IntrinsicID) {
case Intrinsic::usub_sat:		case Intrinsic::usub_sat:
case Intrinsic::sadd_sat:		case Intrinsic::sadd_sat:
case Intrinsic::ssub_sat:		case Intrinsic::ssub_sat:
case Intrinsic::umin:		case Intrinsic::umin:
case Intrinsic::umax:		case Intrinsic::umax:
case Intrinsic::smin:		case Intrinsic::smin:
case Intrinsic::smax:		case Intrinsic::smax:
case Intrinsic::abs:		case Intrinsic::abs:
		case Intrinsic::ctlz:
return true;		return true;
default:		default:
return false;		return false;
}		}
}		}

ConstantRange ConstantRange::intrinsic(Intrinsic::ID IntrinsicID,		ConstantRange ConstantRange::intrinsic(Intrinsic::ID IntrinsicID,
ArrayRef<ConstantRange> Ops) {		ArrayRef<ConstantRange> Ops) {
Show All 15 Lines	ConstantRange ConstantRange::intrinsic(Intrinsic::ID IntrinsicID,
case Intrinsic::smax:		case Intrinsic::smax:
return Ops[0].smax(Ops[1]);		return Ops[0].smax(Ops[1]);
case Intrinsic::abs: {		case Intrinsic::abs: {
const APInt *IntMinIsPoison = Ops[1].getSingleElement();		const APInt *IntMinIsPoison = Ops[1].getSingleElement();
assert(IntMinIsPoison && "Must be known (immarg)");		assert(IntMinIsPoison && "Must be known (immarg)");
assert(IntMinIsPoison->getBitWidth() == 1 && "Must be boolean");		assert(IntMinIsPoison->getBitWidth() == 1 && "Must be boolean");
return Ops[0].abs(IntMinIsPoison->getBoolValue());		return Ops[0].abs(IntMinIsPoison->getBoolValue());
}		}
		case Intrinsic::ctlz: {
		const APInt *ZeroIsPoison = Ops[1].getSingleElement();
		assert(ZeroIsPoison && "Must be known (immarg)");
		assert(ZeroIsPoison->getBitWidth() == 1 && "Must be boolean");
		return Ops[0].ctlz(ZeroIsPoison->getBoolValue());
		}
default:		default:
assert(!isIntrinsicSupported(IntrinsicID) && "Shouldn't be supported");		assert(!isIntrinsicSupported(IntrinsicID) && "Shouldn't be supported");
llvm_unreachable("Unsupported intrinsic");		llvm_unreachable("Unsupported intrinsic");
}		}
}		}

ConstantRange		ConstantRange
ConstantRange::add(const ConstantRange &Other) const {		ConstantRange::add(const ConstantRange &Other) const {
▲ Show 20 Lines • Show All 675 Lines • ▼ Show 20 Lines	ConstantRange ConstantRange::abs(bool IntMinIsPoison) const {
if (SMax.isNegative())		if (SMax.isNegative())
return ConstantRange(-SMax, -SMin + 1);		return ConstantRange(-SMax, -SMin + 1);

// Range crosses zero.		// Range crosses zero.
return ConstantRange::getNonEmpty(APInt::getZero(getBitWidth()),		return ConstantRange::getNonEmpty(APInt::getZero(getBitWidth()),
APIntOps::umax(-SMin, SMax) + 1);		APIntOps::umax(-SMin, SMax) + 1);
}		}

		ConstantRange ConstantRange::ctlz(bool ZeroIsPoison) const {
		if (isEmptySet())
		return getEmpty();

		APInt Zero = APInt::getZero(getBitWidth());
		if (ZeroIsPoison && contains(Zero)) {
		// ZeroIsPoison is set, and zero is contained. We discern three cases, in
		// which a zero can appear:
		nikicUnsubmitted Not Done Reply Inline Actions Constant ranges are understood modulo poison. The value is in the range or it is poison. As such, if ZeroIsPoison is set, we should return a better range that doesn't have to account for zero input, not a worse range. nikic: Constant ranges are understood modulo poison. The value is in the range or it is poison. As…
		// 1) Lower is zero, handling cases of kind [0, 1), [0, 2), etc.
		// 2) Upper is zero, wrapped set, handling cases of kind [3, 0], etc.
		StephenFanUnsubmitted Not Done Reply Inline Actions IIUC, is `[3, 0]` the same as the `[3, 1)` in case 3? StephenFan: IIUC, is `[3, 0]` the same as the `[3, 1)` in case 3?
		antoniofrighettoAuthorUnsubmitted Not Done Reply Inline Actions Correct. antoniofrighetto: Correct.
		// 3) Zero contained in a wrapped set, e.g., [3, 2), [3, 1), etc.

		if (getLower().isZero()) {
		if ((getUpper() - 1).isZero()) {
		// We have in input interval of kind [0, 1). In this case we cannot
		// really help but return empty-set.
		return getEmpty();
		}

		// Compute the resulting range by excluding zero from Lower.
		return ConstantRange(
		APInt(getBitWidth(), (getUpper() - 1).countLeadingZeros()),
		APInt(getBitWidth(), (getLower() + 1).countLeadingZeros() + 1));
		} else if ((getUpper() - 1).isZero()) {
		nikicUnsubmitted Not Done Reply Inline Actions Can Lower be larger than Upper for a non-wrapped set? nikic: Can Lower be larger than Upper for a non-wrapped set?
		antoniofrighettoAuthorUnsubmitted Done Reply Inline Actions It seems it is possible with negative numbers as well (`[0,-7)`), non-wrapped set. antoniofrighetto: It seems it is possible with negative numbers as well (`[0,-7)`), non-wrapped set.
		// Compute the resulting range by excluding zero from Upper.
		return ConstantRange(
		Zero, APInt(getBitWidth(), getLower().countLeadingZeros() + 1));
		} else {
		return ConstantRange(Zero, APInt(getBitWidth(), getBitWidth()));
		}
		}

		// Zero is either safe or not in the range. The output range is composed by
		// the result of countLeadingZero of the two extremes.
		return getNonEmpty(
		APInt(getBitWidth(), getUnsignedMax().countLeadingZeros()),
		APInt(getBitWidth(), getUnsignedMin().countLeadingZeros() + 1));
		}

ConstantRange::OverflowResult ConstantRange::unsignedAddMayOverflow(		ConstantRange::OverflowResult ConstantRange::unsignedAddMayOverflow(
const ConstantRange &Other) const {		const ConstantRange &Other) const {
if (isEmptySet() \|\| Other.isEmptySet())		if (isEmptySet() \|\| Other.isEmptySet())
return OverflowResult::MayOverflow;		return OverflowResult::MayOverflow;

APInt Min = getUnsignedMin(), Max = getUnsignedMax();		APInt Min = getUnsignedMin(), Max = getUnsignedMax();
APInt OtherMin = Other.getUnsignedMin(), OtherMax = Other.getUnsignedMax();		APInt OtherMin = Other.getUnsignedMin(), OtherMax = Other.getUnsignedMax();

// a u+ b overflows high iff a u> ~b.		// a u+ b overflows high iff a u> ~b.
if (Min.ugt(~OtherMin))		if (Min.ugt(~OtherMin))
return OverflowResult::AlwaysOverflowsHigh;		return OverflowResult::AlwaysOverflowsHigh;
if (Max.ugt(~OtherMax))		if (Max.ugt(~OtherMax))
return OverflowResult::MayOverflow;		return OverflowResult::MayOverflow;
return OverflowResult::NeverOverflows;		return OverflowResult::NeverOverflows;
}		}

ConstantRange::OverflowResult ConstantRange::signedAddMayOverflow(		ConstantRange::OverflowResult ConstantRange::signedAddMayOverflow(
const ConstantRange &Other) const {		const ConstantRange &Other) const {
if (isEmptySet() \|\| Other.isEmptySet())		if (isEmptySet() \|\| Other.isEmptySet())
return OverflowResult::MayOverflow;		return OverflowResult::MayOverflow;

APInt Min = getSignedMin(), Max = getSignedMax();		APInt Min = getSignedMin(), Max = getSignedMax();
		nikicUnsubmitted Done Reply Inline Actions This looks basically fine, but a bit overly complicated. This should be sufficient: ConstantRange ConstantRange::ctlz(bool ZeroIsPoison) const { if (isEmptySet()) return getEmpty(); APInt Zero = APInt::getZero(getBitWidth()); if (ZeroIsPoison && contains(Zero)) { // ZeroIsPoison is set, and zero is contained. We discern three cases, in // which a zero can appear: // 1) Lower is zero, handling cases of kind [0, 1), [0, 2), etc. // 2) Upper is zero, wrapped set, handling cases of kind [3, 0], etc. // 3) Zero contained in a wrapped set, e.g., [3, 2), [3, 1), etc. if (getLower().isZero()) { if ((getUpper() - 1).isZero()) { // We have in input interval of kind [0, 1). In this case we cannot // really help but return empty-set. return getEmpty(); } // Compute the resulting range by excluding zero from Lower. return ConstantRange( APInt(getBitWidth(), (getUpper() - 1).countLeadingZeros()), APInt(getBitWidth(), (getLower() + 1).countLeadingZeros() + 1)); } else if ((getUpper() - 1).isZero()) { // Compute the resulting range by excluding zero from Upper. return ConstantRange( Zero, APInt(getBitWidth(), getLower().countLeadingZeros() + 1)); } else { return ConstantRange(Zero, APInt(getBitWidth(), getBitWidth())); } } // Zero is either safe or not in the range. The output range is composed by // the result of countLeadingZero of the two extremes. return getNonEmpty( APInt(getBitWidth(), getUnsignedMax().countLeadingZeros()), APInt(getBitWidth(), getUnsignedMin().countLeadingZeros() + 1)); } nikic: This looks basically fine, but a bit overly complicated. This should be sufficient: ```…
APInt OtherMin = Other.getSignedMin(), OtherMax = Other.getSignedMax();		APInt OtherMin = Other.getSignedMin(), OtherMax = Other.getSignedMax();

APInt SignedMin = APInt::getSignedMinValue(getBitWidth());		APInt SignedMin = APInt::getSignedMinValue(getBitWidth());
APInt SignedMax = APInt::getSignedMaxValue(getBitWidth());		APInt SignedMax = APInt::getSignedMaxValue(getBitWidth());

// a s+ b overflows high iff a s>=0 && b s>= 0 && a s> smax - b.		// a s+ b overflows high iff a s>=0 && b s>= 0 && a s> smax - b.
// a s+ b overflows low iff a s< 0 && b s< 0 && a s< smin - b.		// a s+ b overflows low iff a s< 0 && b s< 0 && a s< smin - b.
if (Min.isNonNegative() && OtherMin.isNonNegative() &&		if (Min.isNonNegative() && OtherMin.isNonNegative() &&
▲ Show 20 Lines • Show All 118 Lines • Show Last 20 Lines

llvm/test/Analysis/LazyValueAnalysis/lvi-for-ctlz.ll

This file was added.

				; NOTE: Assertions have been autogenerated by utils/update_test_checks.py
				; RUN: opt < %s -passes=jump-threading -S \| FileCheck %s
				nikicUnsubmitted Done Reply Inline Actions Having an IR test is fine, but please do not test LVI debug output. Just check the resulting IR change using update_test_checks.py. nikic: Having an IR test is fine, but please do not test LVI debug output. Just check the resulting IR…
				nikicUnsubmitted Not Done Reply Inline Actions I don't think that as written, these tests really test anything. There needs to be a comparison involving the ctlz that can be folded away, or similar. nikic: I don't think that as written, these tests really test anything. There needs to be a comparison…
				antoniofrighettoAuthorUnsubmitted Not Done Reply Inline Actions I feel like a suitable test could be the following one: int lol(int b) { if (b < 65536) { int n = __builtin_clz(b); if (n < 8) return 0; else return 1; } return 2; } which now could be simplified in a `return (b < 65536) ? 1 : 2;`. However, as far as I understand this, CVP needs to be extended as well in order to be able to obtain the above constant folding. Is that correct? antoniofrighetto: I feel like a suitable test could be the following one: ``` int lol(int b) { if (b < 65536)…
				StephenFanUnsubmitted Not Done Reply Inline Actions Is the simplification to `return (b < 65536) ? 1:2;` correct? Since if `b` is negative, `clz` returns 0. StephenFan: Is the simplification to `return (b < 65536) ? 1:2;` correct? Since if `b` is negative, `clz`…
				antoniofrighettoAuthorUnsubmitted Not Done Reply Inline Actions True, sorry, I originally intended the first argument to be `unsigned`, whose optimization seems to occur successfully in this case. antoniofrighetto: True, sorry, I originally intended the first argument to be `unsigned`, whose optimization…
				antoniofrighettoAuthorUnsubmitted Not Done Reply Inline Actions @nikic, perhaps a better example could be the following one? That would still require a change in CVP if I'm not mistaken though, correct? int test_ctlz(int b) { if (b < 65536) { int n = __builtin_clz(b); if (n < 8 && n > 2) return 0; else return 1; } return 2; } antoniofrighetto: @nikic, perhaps a better example could be the following one? That would still require a change…
				nikicUnsubmitted Not Done Reply Inline Actions Wouldn't something like this work as a test? int test_ctlz(unsigned b) { if (b < 65536) { unsigned n = __builtin_clz(b); return n >= 16; // Fold to 1 } return 2; } nikic: Wouldn't something like this work as a test? ``` int test_ctlz(unsigned b) { if (b < 65536)…

				; Ensures that LazyValueInfo correctly propagates the constant range lattice values to ctlz intrinsic.

				; Constant range value may be bounded on [28,31] interval,
				; as forward.0 is taken only if %val is within [1,9].
				define i32 @test_ctlz_1(i32 %val) {
				; CHECK-LABEL: @test_ctlz_1(
				; CHECK-NEXT: entry:
				; CHECK-NEXT: [[ADD:%.]] = add i32 [[VAL:%.]], -1
				; CHECK-NEXT: [[COND_0:%.*]] = icmp ult i32 [[ADD]], 9
				; CHECK-NEXT: br i1 [[COND_0]], label [[FORWARD_0:%.]], label [[FORWARD_1:%.]]
				; CHECK: forward.0:
				; CHECK-NEXT: [[RET_VAL_0:%.*]] = tail call i32 @llvm.ctlz.i32(i32 [[VAL]], i1 true), !range [[RNG0:![0-9]+]]
				; CHECK-NEXT: ret i32 [[RET_VAL_0]]
				; CHECK: forward.1:
				; CHECK-NEXT: [[COND_1:%.*]] = icmp ugt i32 [[VAL]], 20
				; CHECK-NEXT: [[RET_VAL_1:%.*]] = select i1 [[COND_1]], i32 10, i32 0
				; CHECK-NEXT: ret i32 [[RET_VAL_1]]
				;
				entry:
				%add = add i32 %val, -1
				%cond.0 = icmp ult i32 %add, 9
				br i1 %cond.0, label %forward.0, label %forward.1

				forward.0: ; preds = %entry
				%ret_val.0 = tail call i32 @llvm.ctlz.i32(i32 %val, i1 true), !range !0
				ret i32 %ret_val.0

				forward.1: ; preds = %entry
				%cond.1 = icmp ugt i32 %val, 20
				%ret_val.1 = select i1 %cond.1, i32 10, i32 0
				arsenmUnsubmitted Not Done Reply Inline Actions Test looks pretty small for all the different paths in the patch. You don't test the i1 false case for example arsenm: Test looks pretty small for all the different paths in the patch. You don't test the i1 false…
				antoniofrighettoAuthorUnsubmitted Not Done Reply Inline Actions For the second argument `i1 false` the result should always be more precise than the current one, but I'll make sure to test it soon as well. antoniofrighetto: For the second argument `i1 false` the result should always be more precise than the current…
				ret i32 %ret_val.1
				}

				@g_unsigned_char = global i8 0

				; Constant range value may be bounded on [23,32] interval, as it takes
				; the sum of %val, which is within [0, 10], and g_unsigned_char, which is
				; within [0, 255], and ZeroIsPoison is set to false.
				define i32 @test_ctlz_2(i32 %val) {
				; CHECK-LABEL: @test_ctlz_2(
				; CHECK-NEXT: entry:
				; CHECK-NEXT: [[COND_0:%.]] = icmp ult i32 [[VAL:%.]], 10
				; CHECK-NEXT: br i1 [[COND_0]], label [[FORWARD_0:%.]], label [[FORWARD_1:%.]]
				; CHECK: forward.0:
				; CHECK-NEXT: [[LOAD:%.*]] = load i8, ptr @g_unsigned_char, align 1
				; CHECK-NEXT: [[ZEXT:%.*]] = zext i8 [[LOAD]] to i32
				; CHECK-NEXT: [[ADD:%.*]] = add nuw nsw i32 [[ZEXT]], [[VAL]]
				; CHECK-NEXT: [[RES:%.*]] = tail call i32 @llvm.ctlz.i32(i32 [[ADD]], i1 false), !range [[RNG0]]
				; CHECK-NEXT: br label [[FORWARD_1]]
				; CHECK: forward.1:
				; CHECK-NEXT: [[RET_VAL:%.]] = phi i32 [ [[RES]], [[FORWARD_0]] ], [ 0, [[ENTRY:%.]] ]
				; CHECK-NEXT: ret i32 [[RET_VAL]]
				;
				entry:
				%cond.0 = icmp ult i32 %val, 10
				br i1 %cond.0, label %forward.0, label %forward.1

				forward.0: ; preds = %entry
				%load = load i8, i8* @g_unsigned_char
				%zext = zext i8 %load to i32
				%add = add nuw nsw i32 %zext, %val
				%res = tail call i32 @llvm.ctlz.i32(i32 %add, i1 false), !range !0
				br label %forward.1

				forward.1: ; preds = %entry, %forward.0
				%ret_val = phi i32 [ %res, %forward.0 ], [ 0, %entry ]
				ret i32 %ret_val
				}

				; Same as test_ctlz_2, this time with ZeroIsPoison set to true.
				; Hence, constant range value may be unknown.
				define i32 @test_ctlz_3(i32 %val) {
				; CHECK-LABEL: @test_ctlz_3(
				; CHECK-NEXT: entry:
				; CHECK-NEXT: [[COND_0:%.]] = icmp ult i32 [[VAL:%.]], 10
				; CHECK-NEXT: br i1 [[COND_0]], label [[FORWARD_0:%.]], label [[FORWARD_1:%.]]
				; CHECK: forward.0:
				; CHECK-NEXT: [[LOAD:%.*]] = load i8, ptr @g_unsigned_char, align 1
				; CHECK-NEXT: [[ZEXT:%.*]] = zext i8 [[LOAD]] to i32
				; CHECK-NEXT: [[ADD:%.*]] = add nuw nsw i32 [[ZEXT]], [[VAL]]
				; CHECK-NEXT: [[RES:%.*]] = tail call i32 @llvm.ctlz.i32(i32 [[ADD]], i1 true), !range [[RNG0]]
				; CHECK-NEXT: br label [[FORWARD_1]]
				; CHECK: forward.1:
				; CHECK-NEXT: [[RET_VAL:%.]] = phi i32 [ [[RES]], [[FORWARD_0]] ], [ 0, [[ENTRY:%.]] ]
				; CHECK-NEXT: ret i32 [[RET_VAL]]
				;
				entry:
				%cond.0 = icmp ult i32 %val, 10
				br i1 %cond.0, label %forward.0, label %forward.1

				forward.0: ; preds = %entry
				%load = load i8, i8* @g_unsigned_char
				%zext = zext i8 %load to i32
				%add = add nuw nsw i32 %zext, %val
				%res = tail call i32 @llvm.ctlz.i32(i32 %add, i1 true), !range !0
				br label %forward.1

				forward.1: ; preds = %entry, %forward.0
				%ret_val = phi i32 [ %res, %forward.0 ], [ 0, %entry ]
				ret i32 %ret_val
				}

				declare i32 @llvm.ctlz.i32(i32, i1 immarg) nounwind willreturn

				!0 = !{i32 0, i32 33}

llvm/unittests/IR/ConstantRangeTest.cpp

Show First 20 Lines • Show All 2,391 Lines • ▼ Show 20 Lines	TestUnaryOpExhaustive(
[](const ConstantRange &CR) { return CR.abs(/IntMinIsPoison=/true); },		[](const ConstantRange &CR) { return CR.abs(/IntMinIsPoison=/true); },
[](const APInt &N) -> std::optional<APInt> {		[](const APInt &N) -> std::optional<APInt> {
if (N.isMinSignedValue())		if (N.isMinSignedValue())
return std::nullopt;		return std::nullopt;
return N.abs();		return N.abs();
});		});
}		}

		TEST_F(ConstantRangeTest, Ctlz) {
		TestUnaryOpExhaustive([](const ConstantRange &CR) { return CR.ctlz(); },
		[](const APInt &N) {
		return APInt(N.getBitWidth(), N.countLeadingZeros());
		});

		TestUnaryOpExhaustive(
		[](const ConstantRange &CR) { return CR.ctlz(/ZeroIsPoison=/true); },
		[](const APInt &N) -> std::optional<APInt> {
		if (N.isZero())
		return std::nullopt;
		return APInt(N.getBitWidth(), N.countLeadingZeros());
		});
		}

TEST_F(ConstantRangeTest, castOps) {		TEST_F(ConstantRangeTest, castOps) {
ConstantRange A(APInt(16, 66), APInt(16, 128));		ConstantRange A(APInt(16, 66), APInt(16, 128));
ConstantRange FpToI8 = A.castOp(Instruction::FPToSI, 8);		ConstantRange FpToI8 = A.castOp(Instruction::FPToSI, 8);
EXPECT_EQ(8u, FpToI8.getBitWidth());		EXPECT_EQ(8u, FpToI8.getBitWidth());
EXPECT_TRUE(FpToI8.isFullSet());		EXPECT_TRUE(FpToI8.isFullSet());

ConstantRange FpToI16 = A.castOp(Instruction::FPToSI, 16);		ConstantRange FpToI16 = A.castOp(Instruction::FPToSI, 16);
EXPECT_EQ(16u, FpToI16.getBitWidth());		EXPECT_EQ(16u, FpToI16.getBitWidth());
▲ Show 20 Lines • Show All 211 Lines • Show Last 20 Lines

This is an archive of the discontinued LLVM Phabricator instance.

[ConstantRange] Handle `Intrinsic::ctlz`ClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 494409

llvm/include/llvm/IR/ConstantRange.h

llvm/lib/IR/ConstantRange.cpp

llvm/test/Analysis/LazyValueAnalysis/lvi-for-ctlz.ll

llvm/unittests/IR/ConstantRangeTest.cpp

[ConstantRange] Handle `Intrinsic::ctlz`
ClosedPublic