Details

Reviewers

nikic
reames

Commits

rG65898e526060: [ConstantRange] Handle `Intrinsic::ctlz`

Summary

Introduce support for the ctlz intrinsic, including exhaustive testing. Among other things, LVI may now be able to propagate constant-range lattice values through ctlz. This change returns a precise result where possible and a more conservative one otherwise (e.g., when the first argument may be zero and is_zero_poison is true), but the result is always correct.

Diff Detail

Event Timeline

antoniofrighetto created this revision. Jan 20 2023, 9:36 AM

Herald added a project: Restricted Project. Jan 20 2023, 9:36 AM

Herald added subscribers: StephenFan, hiraditya.

antoniofrighetto requested review of this revision. Jan 20 2023, 9:36 AM

Herald added a project: Restricted Project. Jan 20 2023, 9:36 AM

Herald added a subscriber: llvm-commits.

arsenm added a subscriber: arsenm. Jan 20 2023, 9:39 AM

arsenm added inline comments.

llvm/test/Analysis/LazyValueAnalysis/lvi-for-ctlz.ll
32	Test looks pretty small for all the different paths in the patch. You don't test the i1 false case for example

antoniofrighetto added inline comments. Jan 20 2023, 9:41 AM

llvm/test/Analysis/LazyValueAnalysis/lvi-for-ctlz.ll
32	For the second argument `i1 false` the result should always be more precise than the current one, but I'll make sure to test it soon as well.

This needs to be implemented inside ConstantRange and tested exhaustively inside ConstantRangeTest. See the handling of Intrinsic::abs as a possible sample to follow (it also has a semantics-controlling argument).

This revision now requires changes to proceed. Jan 20 2023, 9:45 AM

Harbormaster completed remote builds in B209020: Diff 490893. Jan 20 2023, 10:56 AM

Moved the implementation into ConstantRange.cpp and tested it exhaustively. For the sake of completeness and rigor, I kept the LVI test and improved it further: it shouldn't hurt, and other operations covered by unit tests in ConstantRangeTest.cpp also have tests in test/Analysis.

Harbormaster completed remote builds in B209837: Diff 492056. Jan 25 2023, 4:46 AM

antoniofrighetto updated this revision to Diff 492086. Jan 25 2023, 6:25 AM

antoniofrighetto edited the summary of this revision.

Harbormaster completed remote builds in B209852: Diff 492086. Jan 25 2023, 8:28 AM

antoniofrighetto updated this revision to Diff 492168. Jan 25 2023, 10:16 AM

antoniofrighetto updated this revision to Diff 492170. Jan 25 2023, 10:19 AM

Harbormaster completed remote builds in B209914: Diff 492170. Jan 25 2023, 12:26 PM

The unit test currently fails both for correctness and optimality checks:

/var/lib/buildkite-agent/builds/llvm-project/llvm/unittests/IR/ConstantRangeTest.cpp:137
Value of: rangeContains(CR, APInt(BitWidth, Elem), Inputs)
  Actual: false (empty-set does not contain 0 for inputs: full-set, )
Expected: true
/var/lib/buildkite-agent/builds/llvm-project/llvm/unittests/IR/ConstantRangeTest.cpp:137
Value of: rangeContains(CR, APInt(BitWidth, Elem), Inputs)
  Actual: false (empty-set does not contain -1 for inputs: full-set, )
Expected: true
/var/lib/buildkite-agent/builds/llvm-project/llvm/unittests/IR/ConstantRangeTest.cpp:145
Value of: CR.isEmptySet()
  Actual: false
Expected: true
/var/lib/buildkite-agent/builds/llvm-project/llvm/unittests/IR/ConstantRangeTest.cpp:177
Value of: NotPreferred(PossibleCR)
  Actual: false (Inputs = full-set, CR = full-set, BetterCR = [0,-1))
Expected: true
/var/lib/buildkite-agent/builds/llvm-project/llvm/unittests/IR/ConstantRangeTest.cpp:145
Value of: CR.isEmptySet()
  Actual: false
Expected: true
/var/lib/buildkite-agent/builds/llvm-project/llvm/unittests/IR/ConstantRangeTest.cpp:177
Value of: NotPreferred(PossibleCR)
  Actual: false (Inputs = [0,2), CR = full-set, BetterCR = [3,4))
Expected: true
/var/lib/buildkite-agent/builds/llvm-project/llvm/unittests/IR/ConstantRangeTest.cpp:177
Value of: NotPreferred(PossibleCR)
  Actual: false (Inputs = [0,3), CR = full-set, BetterCR = [2,4))
Expected: true
/var/lib/buildkite-agent/builds/llvm-project/llvm/unittests/IR/ConstantRangeTest.cpp:177
Value of: NotPreferred(PossibleCR)
  Actual: false (Inputs = [0,4), CR = full-set, BetterCR = [2,4))
Expected: true
/var/lib/buildkite-agent/builds/llvm-project/llvm/unittests/IR/ConstantRangeTest.cpp:177
Value of: NotPreferred(PossibleCR)
  Actual: false (Inputs = [0,5), CR = full-set, BetterCR = [1,4))
Expected: true
[...]

You can run the unit tests using `ninja -Cbuild check-llvm-unit`, or use `ninja -Cbuild IRTests && build/unittests/IR/IRTests` to run just this test suite.

llvm/include/llvm/ADT/APInt.h
1741 ↗	(On Diff #492170)	I'd rather not have this API on APInt. Getting leading zeros as an APInt is pretty unusual.
llvm/lib/IR/ConstantRange.cpp
1684 ↗	(On Diff #492170)	Constant ranges are understood modulo poison. The value is in the range or it is poison. As such, if ZeroIsPoison is set, we should return a better range that doesn't have to account for zero input, not a worse range.
1700 ↗	(On Diff #492170)	Can Lower be larger than Upper for a non-wrapped set?
llvm/test/Analysis/LazyValueAnalysis/lvi-for-ctlz.ll
2	Having an IR test is fine, but please do not test LVI debug output. Just check the resulting IR change using update_test_checks.py.

This revision now requires changes to proceed. Jan 26 2023, 4:08 AM

@nikic, thank you, I'm having a look at these now. Just a few considerations.

I'm not entirely sure about the unit test failures: should the returned range simply not include zero, if the incoming interval may contain zero and ZeroIsPoison is true, as you noted? I reckoned that being more conservative (unknown/overdefined) could be better here. The same applies to isEmptySet: I guess we may want to return overdefined (I'm looking at Value of: CR.isEmptySet(), Actual: false, Expected: true)?
If we don't want a ctlz API on APInt, would it be fine to overload TestUnaryOpExhaustive so that it also accepts a callable of type std::optional<unsigned>(const APInt &)?
range.ll now fails because the range metadata is no longer being used. I'm gradually realizing that this needs to be handled properly in CVP as well. Would it be fine to revert the ret i1 true in this change for the time being, considering that the range is now [0,16] (verification here), and postpone the handling in CVP to a separate dedicated patch?

In D142234#4082670, @antoniofrighetto wrote:

@nikic, thank you, I'm having a look at these now. Just a few considerations.

I'm not entirely sure about the unit test failures: should the returned range simply not include zero, if the incoming interval may contain zero and ZeroIsPoison is true, as you noted? I reckoned that being more conservative (unknown/overdefined) could be better here. The same applies to isEmptySet: I guess we may want to return overdefined (I'm looking at Value of: CR.isEmptySet(), Actual: false, Expected: true)?

It's not necessary to make any conservative choices here. If ZeroIsPoison is set, you can assume that the input range does not contain zero, e.g. by intersecting it out.

If we don't want a ctlz API on APInt, would it be fine to overload TestUnaryOpExhaustive so that it also accepts a callable of type std::optional<unsigned>(const APInt &)?

I think we should just create the APInt in the two places where it's needed.

range.ll now fails because the range metadata is no longer being used. I'm gradually realizing that this needs to be handled properly in CVP as well. Would it be fine to revert the ret i1 true in this change for the time being, considering that the range is now [0,16] (verification here), and postpone the handling in CVP to a separate dedicated patch?
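To make the "intersect zero out" advice concrete, here is a minimal sketch on a toy model rather than on LLVM's ConstantRange. It assumes 8-bit values and closed, non-wrapped intervals only; `clz8`, `Interval`, and `ctlzInterval` are hypothetical names invented for illustration and are not part of the patch.

```cpp
#include <cassert>
#include <cstdint>
#include <optional>
#include <utility>

// Leading-zero count of an 8-bit value; returns 8 for zero.
unsigned clz8(uint8_t X) {
  unsigned N = 0;
  for (uint8_t Mask = 0x80; Mask != 0 && (X & Mask) == 0; Mask >>= 1)
    ++N;
  return N;
}

// Hypothetical toy model: a closed, non-wrapped unsigned interval [first, second].
using Interval = std::pair<unsigned, unsigned>;

// Returns the interval of possible ctlz results, or nullopt for the empty set.
std::optional<Interval> ctlzInterval(uint8_t Lo, uint8_t Hi, bool ZeroIsPoison) {
  assert(Lo <= Hi && "toy model only handles non-wrapped intervals");
  if (ZeroIsPoison && Lo == 0) {
    if (Hi == 0)
      return std::nullopt; // only zero was possible, so every result is poison
    Lo = 1;                // intersect zero out of the input interval
  }
  // clz never increases as the input grows, so the largest input yields the
  // smallest count and the smallest input yields the largest count.
  return Interval{clz8(Hi), clz8(Lo)};
}
```

In this model the input [0, 4] gives results in [5, 8] when zero is a defined input, and the tighter [5, 7] when zero is poison, which is the "better range, not a worse range" behaviour asked for in the review.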

This issue should already be fixed by https://github.com/llvm/llvm-project/commit/2e9bc1b8614c9422573cf2f4728525787b0cb0cb.

antoniofrighetto updated this revision to Diff 493846. Jan 31 2023, 11:53 PM

antoniofrighetto marked an inline comment as done.

antoniofrighetto added inline comments.

llvm/lib/IR/ConstantRange.cpp
1700 ↗	(On Diff #492170)	It seems it is possible with negative numbers as well (`[0,-7)`), non-wrapped set.

antoniofrighetto marked an inline comment as not done. Feb 1 2023, 12:00 AM

antoniofrighetto retitled this revision from [LVI] Handle Intrinsic::ctlz to [ConstantRange] Handle Intrinsic::ctlz. Feb 1 2023, 1:06 AM

Harbormaster completed remote builds in B211157: Diff 493846. Feb 1 2023, 1:12 AM

This looks on the right track now.

llvm/lib/IR/ConstantRange.cpp

1737 ↗

(On Diff #493846)

This looks basically fine, but a bit overly complicated. This should be sufficient:

ConstantRange ConstantRange::ctlz(bool ZeroIsPoison) const {
  if (isEmptySet())
    return getEmpty();

  APInt Zero = APInt::getZero(getBitWidth());
  if (ZeroIsPoison && contains(Zero)) {
    // ZeroIsPoison is set, and zero is contained. We discern three cases, in
    // which a zero can appear:
    // 1) Lower is zero, handling cases of kind [0, 1), [0, 2), etc.
    // 2) Upper is zero, wrapped set, handling cases of kind [3, 0], etc.
    // 3) Zero contained in a wrapped set, e.g., [3, 2), [3, 1), etc.
    
    if (getLower().isZero()) {
      if ((getUpper() - 1).isZero()) {
        // We have an input interval of kind [0, 1). In this case we cannot
        // really help but return empty-set.
        return getEmpty();
      }

      // Compute the resulting range by excluding zero from Lower.
      return ConstantRange(
          APInt(getBitWidth(), (getUpper() - 1).countLeadingZeros()), 
          APInt(getBitWidth(), (getLower() + 1).countLeadingZeros() + 1));
    } else if ((getUpper() - 1).isZero()) {
      // Compute the resulting range by excluding zero from Upper.
      return ConstantRange(
          Zero, APInt(getBitWidth(), getLower().countLeadingZeros() + 1));
    } else { 
      return ConstantRange(Zero, APInt(getBitWidth(), getBitWidth()));
    }   
  } 
  
  // Zero is either safe or not in the range. The output range is composed by
  // the result of countLeadingZero of the two extremes.
  return getNonEmpty(
      APInt(getBitWidth(), getUnsignedMax().countLeadingZeros()), 
      APInt(getBitWidth(), getUnsignedMin().countLeadingZeros() + 1));
}
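The final fall-through in the code above relies on ctlz never increasing as its input grows, so the unsigned maximum and minimum of the input bound every result. The sketch below checks that property by brute force on small bit widths. It is a standalone model, not LLVM code: `clz` and `endpointsBoundResults` are hypothetical helpers, and only non-wrapped closed intervals are enumerated.

```cpp
#include <cassert>
#include <cstdint>

// Leading-zero count of X viewed as a Width-bit value (Width <= 32);
// returns Width when X is zero.
unsigned clz(uint32_t X, unsigned Width) {
  unsigned N = 0;
  for (unsigned Bit = Width; Bit-- > 0 && ((X >> Bit) & 1u) == 0;)
    ++N;
  return N;
}

// For every non-wrapped interval [Lo, Hi] of Width-bit values (Width < 32),
// check that each result lies in [clz(Hi), clz(Lo)].
bool endpointsBoundResults(unsigned Width) {
  const uint32_t Max = (1u << Width) - 1;
  for (uint32_t Lo = 0; Lo <= Max; ++Lo)
    for (uint32_t Hi = Lo; Hi <= Max; ++Hi)
      for (uint32_t X = Lo; X <= Hi; ++X)
        if (clz(X, Width) < clz(Hi, Width) || clz(X, Width) > clz(Lo, Width))
          return false;
  return true;
}
```

For instance, with a 32-bit input in [1, 9], `clz(9, 32)` is 28 and `clz(1, 32)` is 31, matching the [28, 31] interval described in the LVI test for this revision.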

llvm/test/Analysis/LazyValueAnalysis/lvi-for-ctlz.ll

I don't think that as written, these tests really test anything. There needs to be a comparison involving the ctlz that can be folded away, or similar.

This revision now requires changes to proceed. Feb 1 2023, 2:50 AM

antoniofrighetto updated this revision to Diff 494409. Feb 2 2023, 12:32 PM

antoniofrighetto marked an inline comment as done.

antoniofrighetto added inline comments. Feb 2 2023, 12:39 PM

llvm/test/Analysis/LazyValueAnalysis/lvi-for-ctlz.ll
2	I feel like a suitable test could be the following one: int lol(int b) { if (b < 65536) { int n = __builtin_clz(b); if (n < 8) return 0; else return 1; } return 2; } which could now be simplified to `return (b < 65536) ? 1 : 2;`. However, as far as I understand, CVP needs to be extended as well in order to obtain the above constant folding. Is that correct?

antoniofrighetto marked an inline comment as not done. Feb 2 2023, 12:40 PM

Harbormaster completed remote builds in B211561: Diff 494409. Feb 2 2023, 2:04 PM

StephenFan added inline comments. Feb 2 2023, 11:27 PM

llvm/lib/IR/ConstantRange.cpp
1686 ↗	(On Diff #494409)	IIUC, is `[3, 0]` the same as the `[3, 1)` in case 3?
llvm/test/Analysis/LazyValueAnalysis/lvi-for-ctlz.ll
2	Is the simplification to `return (b < 65536) ? 1:2;` correct? Since if `b` is negative, `clz` returns 0.

antoniofrighetto added inline comments. Feb 2 2023, 11:44 PM

llvm/lib/IR/ConstantRange.cpp
1686 ↗	(On Diff #494409)	Correct.
llvm/test/Analysis/LazyValueAnalysis/lvi-for-ctlz.ll
2	True, sorry, I originally intended the first argument to be `unsigned`, whose optimization seems to occur successfully in this case.
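The signed-versus-unsigned point can be checked directly. The following is a sketch under stated assumptions: `countLeadingZeros32` is a hypothetical portable stand-in for `__builtin_clz` (it returns 32 for zero, whereas the builtin's result is undefined there), and the three functions restate the proposed test, its proposed simplification, and the unsigned variant.

```cpp
#include <cassert>
#include <cstdint>

// Portable leading-zero count, an assumption standing in for __builtin_clz;
// returns 32 for zero.
unsigned countLeadingZeros32(uint32_t X) {
  unsigned N = 0;
  for (uint32_t Mask = 0x80000000u; Mask != 0 && (X & Mask) == 0; Mask >>= 1)
    ++N;
  return N;
}

// The originally proposed test, with a signed argument.
int lolSigned(int32_t B) {
  if (B < 65536) {
    unsigned N = countLeadingZeros32(static_cast<uint32_t>(B));
    return N < 8 ? 0 : 1;
  }
  return 2;
}

// The proposed simplification of the signed version.
int lolSimplified(int32_t B) { return B < 65536 ? 1 : 2; }

// The same body with an unsigned argument.
int lolUnsigned(uint32_t B) {
  if (B < 65536) {
    unsigned N = countLeadingZeros32(B);
    return N < 8 ? 0 : 1;
  }
  return 2;
}
```

With a signed argument, `lolSigned(-1)` takes the first branch, sees zero leading zeros, and returns 0, while `lolSimplified(-1)` returns 1, so the two disagree. With an unsigned argument every value below 65536 has at least 16 leading zeros, so the inner comparison is always false and the simplification holds.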

antoniofrighetto added inline comments. Feb 7 2023, 1:12 AM

llvm/test/Analysis/LazyValueAnalysis/lvi-for-ctlz.ll
2	@nikic, perhaps a better example could be the following one? That would still require a change in CVP if I'm not mistaken though, correct? int test_ctlz(int b) { if (b < 65536) { int n = __builtin_clz(b); if (n < 8 && n > 2) return 0; else return 1; } return 2; }

nikic added inline comments. Feb 15 2023, 2:34 AM

llvm/test/Analysis/LazyValueAnalysis/lvi-for-ctlz.ll
2	Wouldn't something like this work as a test? int test_ctlz(unsigned b) { if (b < 65536) { unsigned n = __builtin_clz(b); return n >= 16; // Fold to 1 } return 2; }
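As a sanity check of the numbers in the suggested test, the sketch below brute-forces every input on the guarded path. It assumes a portable `clz32` helper (hypothetical, returning 32 for zero, unlike `__builtin_clz`, whose result is undefined for zero) in place of the builtin; `comparisonAlwaysTrue` is likewise a name introduced only for this illustration.

```cpp
#include <cassert>
#include <cstdint>

// Portable 32-bit leading-zero count, an assumption of this sketch;
// returns 32 for zero.
unsigned clz32(uint32_t X) {
  unsigned N = 0;
  for (uint32_t Mask = 0x80000000u; Mask != 0 && (X & Mask) == 0; Mask >>= 1)
    ++N;
  return N;
}

// The suggested test, with __builtin_clz swapped for clz32.
int testCtlz(uint32_t B) {
  if (B < 65536) {
    unsigned N = clz32(B);
    return N >= 16; // expected to fold to 1
  }
  return 2;
}

// Brute-force confirmation that the comparison is true for every input
// that reaches it.
bool comparisonAlwaysTrue() {
  for (uint32_t B = 0; B < 65536; ++B)
    if (testCtlz(B) != 1)
      return false;
  return true;
}
```

Any value below 65536 has its upper 16 bits clear, so the count is at least 16 on that path; this is the fact a range-based analysis would need to derive in order to fold the comparison.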

Can you please rebase over https://github.com/llvm/llvm-project/commit/14fcdd7f9d7b3973661efc5a426da18e077155bf? I think that should work as a CVP test for this functionality, if I got the numbers right.

antoniofrighetto updated this revision to Diff 497611. Feb 15 2023, 2:59 AM

antoniofrighetto retitled this revision from [ConstantRange] Handle Intrinsic::ctlz to [ConstantRange] Handle `Intrinsic::ctlz`.

@nikic, rebased and removed the old test. I might be wrong, but it seems like O2 already handles the test case you suggested, whereas the one I previously suggested (the one with if (n < 8 && n > 2) return 0;, which can never happen) does not seem to be reduced to an icmp, select, ret as of now.

Harbormaster completed remote builds in B213850: Diff 497611. Feb 15 2023, 3:48 AM

Can you please rerun update_test_checks.py on the CorrelatedPropagation/range.ll test? There should be some diffs there.

In D142234#4128565, @antoniofrighetto wrote:

@nikic, rebased and removed the old test. Might be wrong, but it seems like O2 is already handling the test case you suggested; whereas the latter one I previously suggested (the one with if (n < 8 && n > 2) return 0;, which can never happen) does not seem to be reduced to a icmp, select, ret as of now.

Right. It's not really important that individual pass tests aren't folded by -O2. One can construct more complex cases that LVI can handle and InstCombine can't. It's just more straightforward to test the basic case.

antoniofrighetto updated this revision to Diff 497674. Feb 15 2023, 7:43 AM

Harbormaster completed remote builds in B213896: Diff 497674. Feb 15 2023, 8:33 AM

nikic added inline comments. Feb 16 2023, 2:48 AM

llvm/test/Transforms/CorrelatedValuePropagation/range.ll
954 ↗	(On Diff #497674)	Did you use the right build to produce this? This is not the expected diff, and it still fails in pre-merge checks.

antoniofrighetto updated this revision to Diff 497952. Feb 16 2023, 3:17 AM

LGTM

This revision is now accepted and ready to land. Feb 16 2023, 3:19 AM

antoniofrighetto added inline comments. Feb 16 2023, 3:20 AM

llvm/test/Transforms/CorrelatedValuePropagation/range.ll
954 ↗	(On Diff #497674)	Sorry, my bad, I confused builds and must have accidentally passed the wrong `--opt-bin` path. Checked twice, we should be good now.

Harbormaster completed remote builds in B214113: Diff 497952. Feb 16 2023, 5:31 AM

Closed by commit rG65898e526060: [ConstantRange] Handle `Intrinsic::ctlz` (authored by antoniofrighetto, committed by nikic). Feb 17 2023, 12:57 AM

This revision was automatically updated to reflect the committed changes.

nikic added a commit: rG65898e526060: [ConstantRange] Handle `Intrinsic::ctlz`.

Diff 490893

llvm/lib/Analysis/LazyValueInfo.cpp

@@ ... @@ std::optional<ValueLatticeElement> solveBlockValueBinaryOpImpl(
       std::function<ConstantRange(const ConstantRange &, const ConstantRange &)>
           OpFn);
   std::optional<ValueLatticeElement>
   solveBlockValueBinaryOp(BinaryOperator *BBI, BasicBlock *BB);
   std::optional<ValueLatticeElement> solveBlockValueCast(CastInst *CI,
                                                          BasicBlock *BB);
   std::optional<ValueLatticeElement>
   solveBlockValueOverflowIntrinsic(WithOverflowInst *WO, BasicBlock *BB);
+  std::optional<ValueLatticeElement>
+  solveBlockValueCtlzIntrinsic(IntrinsicInst *II, BasicBlock *BB);
   std::optional<ValueLatticeElement> solveBlockValueIntrinsic(IntrinsicInst *II,
                                                               BasicBlock *BB);
   std::optional<ValueLatticeElement>
   solveBlockValueExtractValue(ExtractValueInst *EVI, BasicBlock *BB);
   bool isNonNullAtEndOfBlock(Value *Val, BasicBlock *BB);
   void intersectAssumeOrGuardBlockValueConstantRange(Value *Val,
                                                      ValueLatticeElement &BBLV,
                                                      Instruction *BBI);
@@ ... @@ LazyValueInfoImpl::solveBlockValueOverflowIntrinsic(WithOverflowInst *WO,
                                                     BasicBlock *BB) {
   return solveBlockValueBinaryOpImpl(
       WO, BB, [WO](const ConstantRange &CR1, const ConstantRange &CR2) {
         return CR1.binaryOp(WO->getBinaryOp(), CR2);
       });
 }

 std::optional<ValueLatticeElement>
+LazyValueInfoImpl::solveBlockValueCtlzIntrinsic(IntrinsicInst *II,
+                                                BasicBlock *BB) {
+  Value *Argument = II->getArgOperand(0);
+
+  using VLE = ValueLatticeElement;
+  std::optional<VLE> BlockVal = getBlockValue(Argument, BB);
+  if (!BlockVal)
+    return std::nullopt;
+
+  VLE V = BlockVal.getValue();
+
+  const unsigned OperandBitWidth = DL.getTypeSizeInBits(II->getType());
+  auto GetAPInt = [OperandBitWidth](uint64_t V) {
+    return APInt(OperandBitWidth, V);
+  };
+  auto GetRange = [&GetAPInt](uint64_t Lower, uint64_t Upper) {
+    return VLE::getRange(ConstantRange(GetAPInt(Lower), GetAPInt(Upper)));
+  };
+
+  Constant *C = nullptr;
+  if (V.isConstant())
+    C = V.getConstant();
+  else if (V.isNotConstant())
+    C = V.getNotConstant();
+  ConstantInt *CI = dyn_cast_or_null<ConstantInt>(C);
+  const APInt *NV = CI != nullptr ? &CI->getValue() : nullptr;
+
+  bool ZeroIsPoison = cast<ConstantInt>(II->getArgOperand(1))->isOne();
+  std::optional<ValueLatticeElement> Res = std::nullopt;
+
+  if (V.isUnknownOrUndef()) {
+    // No valid values
+    Res = V;
+  } else if (V.isOverdefined()) {
+    if (ZeroIsPoison) {
+      // It might be zero, the result is overdefined
+      Res = VLE::getOverdefined();
+    } else {
+      // From 0 to the bit width
+      Res = GetRange(0, OperandBitWidth + 1);
+    }
+  } else if (V.isConstant()) {
+    if (ZeroIsPoison && (CI == nullptr || CI->isZero())) {
+      // If we have an explicit zero (or we cannot tell), the result is
+      // undefined
+      Res = VLE::getOverdefined();
+    } else if (NV != nullptr) {
+      // Zero is safe, and we have the constant, return the exact result
+      Res = VLE::get(ConstantInt::get(II->getType(), NV->countLeadingZeros()));
+    } else {
+      // Zero is safe but the constant is not known, get the range of bits
+      Res = GetRange(0, OperandBitWidth + 1);
+    }
+  } else if (V.isNotConstant()) {
+    if (CI != nullptr && CI->isZero()) {
+      // We can explicitly exclude zero, valid results are from 0 to bit width
+      // minus one
+      Res = GetRange(0, OperandBitWidth);
+    } else if (ZeroIsPoison) {
+      // Zero is not safe, and we can't explicitly exclude it
+      Res = VLE::getOverdefined();
+    } else {
+      // Zero is safe, but we can't say much. We could say "not one" but we
+      // cannot express the disjoint range
+      Res = GetRange(0, OperandBitWidth + 1);
+    }
+  } else if (V.isConstantRange()) {
+    const ConstantRange &Range = V.getConstantRange();
+    if (ZeroIsPoison && Range.contains(GetAPInt(0))) {
+      // Zero is not safe and it's not excluded by the range
+      Res = VLE::getOverdefined();
+    } else if (Range.isWrappedSet() || Range.isFullSet()) {
+      // The range wraps, therefore it includes the two extreme encodings, all
+      // zeros and all ones. The only way we can express this [0, BitWidth + 1)
+      Res = GetRange(0, OperandBitWidth + 1);
+    } else {
+      // Zero is either safe or not in the range. The output range is composed
+      // by the result of countLeadingZero of the two extremes, sorted
+      APInt Lower = GetAPInt(Range.getLower().countLeadingZeros());
+      APInt Last = Range.getUpper() - 1;
+      APInt Upper = GetAPInt(Last.countLeadingZeros());
+
+      if (Lower.eq(Upper)) {
+        Res = VLE::get(ConstantInt::get(II->getType(), Lower));
+      } else {
+        if (Lower.ugt(Upper))
+          std::swap(Lower, Upper);
+        ++Upper;
+        Res = VLE::getRange(ConstantRange(Lower, Upper));
+      }
+    }
+  }
+
+  return Res;
+}
+
+std::optional<ValueLatticeElement>
 LazyValueInfoImpl::solveBlockValueIntrinsic(IntrinsicInst *II, BasicBlock *BB) {
+  if (II->getIntrinsicID() == Intrinsic::ctlz)
+    return solveBlockValueCtlzIntrinsic(II, BB);
+
   if (!ConstantRange::isIntrinsicSupported(II->getIntrinsicID())) {
     LLVM_DEBUG(dbgs() << " compute BB '" << BB->getName()
                       << "' - unknown intrinsic.\n");
     return getFromRangeMetadata(II);
   }

   SmallVector<ConstantRange, 2> OpRanges;
   for (Value *Op : II->args()) {
[...]

llvm/test/Analysis/LazyValueAnalysis/lvi-for-ctlz.ll

This file was added.

				; RUN: opt < %s -passes=jump-threading -print-lvi-after-jump-threading -disable-output 2>&1 | FileCheck %s

				; Test LazyValueInfo to correctly propagate the constant range lattice values to ctlz intrinsic.
				; In the following example, forward.0 basic block, which counts the leading zeroes of %val, is
				; taken only if %val is within [1,9]. This allows LVI to infer that ctlz.i32 may return a number
				; of zeroes that is bounded on [28, 31] interval.
				define i32 @test_ctlz(i32 %val) {
				; CHECK-LABEL: LVI for function 'test_ctlz':
				entry:
				%add = add i32 %val, -1
				%cond.0 = icmp ult i32 %add, 9
				br i1 %cond.0, label %forward.0, label %forward.1

				forward.0: ; preds = %entry
				; CHECK-LABEL: forward.0:
				; CHECK: ; LatticeVal for: 'i32 %val' is: constantrange<1, 10>
				; CHECK: ; LatticeVal for: ' %ret_val.0 = tail call i32 @llvm.ctlz.i32(i32 %val, i1 true), !range !0' in BB: '%forward.0' is: constantrange<28, 32>
				; CHECK-NOT: ; LatticeVal for: ' %ret_val.0 = tail call i32 @llvm.ctlz.i32(i32 %val, i1 true), !range !0' in BB: '%forward.0' is: constantrange<0, 33>
				; CHECK: %ret_val.0 = tail call i32 @llvm.ctlz.i32(i32 %val, i1 true), !range !0
				%ret_val.0 = tail call i32 @llvm.ctlz.i32(i32 %val, i1 true), !range !0
				ret i32 %ret_val.0

				forward.1: ; preds = %entry
				; CHECK-LABEL: forward.1
				; CHECK: ; LatticeVal for: 'i32 %val' is: constantrange<10, 1>
				; CHECK: ; LatticeVal for: ' %cond.1 = icmp ugt i32 %val, 20' in BB: '%forward.1' is: overdefined
				; CHECK: %cond.1 = icmp ugt i32 %val, 20
				%cond.1 = icmp ugt i32 %val, 20
				%ret_val.1 = select i1 %cond.1, i32 10, i32 0
				ret i32 %ret_val.1
				}

				declare i32 @llvm.ctlz.i32(i32, i1 immarg) nounwind willreturn

				!0 = !{i32 0, i32 33}

This is an archive of the discontinued LLVM Phabricator instance.

[ConstantRange] Handle `Intrinsic::ctlz`
ClosedPublic
