This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/trunk/
-
trunk/
-
lib/Transforms/InstCombine/
-
Transforms/
-
InstCombine/
-
InstCombineCompares.cpp
-
test/Transforms/InstCombine/
-
Transforms/
-
InstCombine/
-
gep-inbounds-null.ll

Differential D66608

[InstCombine] icmp eq/ne (gep inbounds P, Idx..), null -> icmp eq/ne P, null
ClosedPublic

Authored by reames on Aug 22 2019, 10:58 AM.

Download Raw Diff

Details

Reviewers

jdoerfert
grandinj
nikic

Commits

rG5b02cfa0b3c2: [InstCombine] icmp eq/ne (gep inbounds P, Idx..), null -> icmp eq/ne P, null
rL369789: [InstCombine] icmp eq/ne (gep inbounds P, Idx..), null -> icmp eq/ne P, null

Summary

This generalizes the isGEPKnownNonNull rule from ValueTracking to apply when we do not know if the base is non-null, and thus need to replace one condition with another.

This is an alternative, much more aggressive, approach to the same problem as https://reviews.llvm.org/D64533.

The core notion is that since an inbounds GEP can only form null if the base pointer is null and the offset is zero. However, if the offset is non-zero, the the "inbounds" marker makes the result poison. Thus, we're free to ignore the case where the offset is non-zero. Similarly, there's no case under which a non-null base can result in a null result without generating poison.

Reviewers - I'd appreciate careful review of the reasoning here. It's subtle, and I found several bugs in early versions of this patch. I'm not at all certain there aren't some left.

Diff Detail

Repository: rL LLVM

Event Timeline

reames created this revision.Aug 22 2019, 10:58 AM

Herald added a project: Restricted Project. · View Herald TranscriptAug 22 2019, 10:58 AM

Herald added subscribers: bollu, mcrosier. · View Herald Transcript

Fix a typo in one of the tests and add clarifying comments.

reames retitled this revision from [InstCombine] icmp eq/ne (gep P, Idx..), null -> icmp eq/ne P, null to [InstCombine] icmp eq/ne (gep inbounds P, Idx..), null -> icmp eq/ne P, null.Aug 22 2019, 11:10 AM

jdoerfert added inline comments.Aug 22 2019, 1:12 PM

lib/Transforms/InstCombine/InstCombineCompares.cpp

882 ↗

(On Diff #216674)

Side note: This should probably be RHS->stripPointerCastsSameRepresentation() as we below argue about nullness of values and address spaces.

915 ↗

(On Diff #216674)

I drew up the following table to convince myself this is valid given the restrictions checked in the conditional above: inbounds, rhs = zero, null is not valid.

`b`ase/offset	zero	not zero
zero	`b` = zero	poison
not zero	`b`	not zero or poison

As mentioned on the previous review, do we need to handle getelementptr inbounds {}, {}* %p, i32 %idx? That is, an inbounds GEP on a zero-sized type, which may be null for a non-zero index. It looks like this case is not handled in the existing code either though, so possibly this construction isn't legal (though I don't see anything to that effect in langref)?

In D66608#1641754, @nikic wrote:

As mentioned on the previous review, do we need to handle getelementptr inbounds {}, {}* %p, i32 %idx? That is, an inbounds GEP on a zero-sized type, which may be null for a non-zero index. It looks like this case is not handled in the existing code either though, so possibly this construction isn't legal (though I don't see anything to that effect in langref)?

Here it could cause problems, different other places we actually accumulate the index and decide based on the value, e.g., Offset != 0, which should avoid the problem.
Long story short, I guess we need another check in the if condition here.

jdoerfert mentioned this in D64533: [IndVars] Special case the problematic (gep inbounds p, iv == nullptr) problem (pr42357) .Aug 22 2019, 2:19 PM

In D66608#1641754, @nikic wrote:

As mentioned on the previous review, do we need to handle getelementptr inbounds {}, {}* %p, i32 %idx? That is, an inbounds GEP on a zero-sized type, which may be null for a non-zero index. It looks like this case is not handled in the existing code either though, so possibly this construction isn't legal (though I don't see anything to that effect in langref)?

I'm happy to add another bailout. To be honest, the exact semantics of unsized types are not all clear to me.

When I went to add the suggested size-0 type bailout, I realized the existing code was correct for this case. I added tests and a comment to show this.

In D66608#1642250, @reames wrote:

When I went to add the suggested size-0 type bailout, I realized the existing code was correct for this case. I added tests and a comment to show this.

The comment is helpful, thanks!

I'm good with this but I'm also tired. Let's wait for others to voice their opinion or at least till I can take another look with fresh eyes.

LGTM

lib/Transforms/InstCombine/InstCombineCompares.cpp
911 ↗	(On Diff #216764)	I'd suggest changing Index -> Offset in these comments.
test/Transforms/InstCombine/gep-inbounds-null.ll
36 ↗	(On Diff #216764)	Should be eq?

This revision is now accepted and ready to land.Aug 23 2019, 12:33 AM

xbolva00 added a subscriber: xbolva00.Aug 23 2019, 8:27 AM

xbolva00 added inline comments.

lib/Transforms/InstCombine/InstCombineCompares.cpp
907 ↗	(On Diff #216764)	Four?

xbolva00 added inline comments.Aug 23 2019, 8:29 AM

lib/Transforms/InstCombine/InstCombineCompares.cpp
900 ↗	(On Diff #216764)	isa<ConstantPointerNull>(...)

Closed by commit rL369789: [InstCombine] icmp eq/ne (gep inbounds P, Idx..), null -> icmp eq/ne P, null (authored by reames). · Explain WhyAug 23 2019, 11:03 AM

This revision was automatically updated to reflect the committed changes.

I noticed a bug in what got committed, and fixed that in rL369795.

A vector extension of this is now posted for review: https://reviews.llvm.org/D66671

lib/Transforms/InstCombine/InstCombineCompares.cpp
900 ↗	(On Diff #216764)	Left as is for the vector patch now posted.

Thanks for taking the time doing this! Now the compile times for glew are back to more tolerable levels (down to 1 minute in my case, from 6 minutes before), despite the excessive unrolling.

lebedev.ri mentioned this in D67122: [UBSan][clang][compiler-rt] Applying non-zero offset to nullptr is undefined behaviour.Sep 3 2019, 11:43 AM

Diffusion mentioned this in rL374293: [UBSan][clang][compiler-rt] Applying non-zero offset to nullptr is undefined….Oct 10 2019, 2:31 AM

lebedev.ri mentioned this in rG536b0ee40ab9: [UBSan][clang][compiler-rt] Applying non-zero offset to nullptr is undefined….Oct 10 2019, 2:31 AM

Revision Contents

Path

Size

llvm/

trunk/

lib/

Transforms/

InstCombine/

InstCombineCompares.cpp

21 lines

test/

Transforms/

InstCombine/

gep-inbounds-null.ll

184 lines

Diff 216903

llvm/trunk/lib/Transforms/InstCombine/InstCombineCompares.cpp

Show First 20 Lines • Show All 888 Lines • ▼ Show 20 Lines	if (PtrBase == RHS && GEPLHS->isInBounds()) {
// output an optimized form.		// output an optimized form.
Value Offset = evaluateGEPOffsetExpression(GEPLHS, this, DL);		Value Offset = evaluateGEPOffsetExpression(GEPLHS, this, DL);

// If not, synthesize the offset the hard way.		// If not, synthesize the offset the hard way.
if (!Offset)		if (!Offset)
Offset = EmitGEPOffset(GEPLHS);		Offset = EmitGEPOffset(GEPLHS);
return new ICmpInst(ICmpInst::getSignedPredicate(Cond), Offset,		return new ICmpInst(ICmpInst::getSignedPredicate(Cond), Offset,
Constant::getNullValue(Offset->getType()));		Constant::getNullValue(Offset->getType()));
		} else if (GEPLHS->isInBounds() && ICmpInst::isEquality(Cond) &&
		GEPLHS->getType()->isPointerTy() && // TODO: extend to vector geps
		isa<Constant>(RHS) && cast<Constant>(RHS)->isNullValue() &&
		!NullPointerIsDefined(I.getFunction(),
		RHS->getType()->getPointerAddressSpace())) {
		// For most address spaces, an allocation can't be placed at null, but null
		// itself is treated as a 0 size allocation in the in bounds rules. Thus,
		// the only valid inbounds address derived from null, is null itself.
		// Thus, we have four cases to consider:
		// 1) Base == nullptr, Offset == 0 -> inbounds, null
		// 2) Base == nullptr, Offset != 0 -> poison as the result is out of bounds
		// 3) Base != nullptr, Offset == (-base) -> poison (crossing allocations)
		// 4) Base != nullptr, Offset != (-base) -> nonnull (and possibly poison)
		//
		// (Note if we're indexing a type of size 0, that simply collapses into one
		// of the buckets above.)
		//
		// In general, we're allowed to make values less poison (i.e. remove
		// sources of full UB), so in this case, we just select between the two
		// non-poison cases (1 and 4 above).
		return new ICmpInst(Cond, GEPLHS->getPointerOperand(), RHS);
} else if (GEPOperator *GEPRHS = dyn_cast<GEPOperator>(RHS)) {		} else if (GEPOperator *GEPRHS = dyn_cast<GEPOperator>(RHS)) {
// If the base pointers are different, but the indices are the same, just		// If the base pointers are different, but the indices are the same, just
// compare the base pointer.		// compare the base pointer.
if (PtrBase != GEPRHS->getOperand(0)) {		if (PtrBase != GEPRHS->getOperand(0)) {
bool IndicesTheSame = GEPLHS->getNumOperands()==GEPRHS->getNumOperands();		bool IndicesTheSame = GEPLHS->getNumOperands()==GEPRHS->getNumOperands();
IndicesTheSame &= GEPLHS->getOperand(0)->getType() ==		IndicesTheSame &= GEPLHS->getOperand(0)->getType() ==
GEPRHS->getOperand(0)->getType();		GEPRHS->getOperand(0)->getType();
if (IndicesTheSame)		if (IndicesTheSame)
▲ Show 20 Lines • Show All 4,938 Lines • Show Last 20 Lines

llvm/trunk/test/Transforms/InstCombine/gep-inbounds-null.ll

				; NOTE: Assertions have been autogenerated by utils/update_test_checks.py
				; RUN: opt -S < %s -instcombine \| FileCheck %s

				;; Start by showing the results of constant folding (which doesn't use
				;; the poison implied by gep for the nonnull cases).

				define i1 @test_ne_constants_null() {
				; CHECK-LABEL: @test_ne_constants_null(
				; CHECK-NEXT: entry:
				; CHECK-NEXT: ret i1 false
				;
				entry:
				%gep = getelementptr inbounds i8, i8* null, i64 0
				%cnd = icmp ne i8* %gep, null
				ret i1 %cnd
				}

				define i1 @test_ne_constants_nonnull() {
				; CHECK-LABEL: @test_ne_constants_nonnull(
				; CHECK-NEXT: entry:
				; CHECK-NEXT: ret i1 true
				;
				entry:
				%gep = getelementptr inbounds i8, i8* null, i64 1
				%cnd = icmp ne i8* %gep, null
				ret i1 %cnd
				}

				define i1 @test_eq_constants_null() {
				; CHECK-LABEL: @test_eq_constants_null(
				; CHECK-NEXT: entry:
				; CHECK-NEXT: ret i1 true
				;
				entry:
				%gep = getelementptr inbounds i8, i8* null, i64 0
				%cnd = icmp eq i8* %gep, null
				ret i1 %cnd
				}

				define i1 @test_eq_constants_nonnull() {
				; CHECK-LABEL: @test_eq_constants_nonnull(
				; CHECK-NEXT: entry:
				; CHECK-NEXT: ret i1 false
				;
				entry:
				%gep = getelementptr inbounds i8, i8* null, i64 1
				%cnd = icmp eq i8* %gep, null
				ret i1 %cnd
				}

				;; Then show the results for non-constants. These use the inbounds provided
				;; UB fact to ignore the possible overflow cases.

				define i1 @test_ne(i8* %base, i64 %idx) {
				; CHECK-LABEL: @test_ne(
				; CHECK-NEXT: entry:
				; CHECK-NEXT: [[CND:%.]] = icmp ne i8 [[BASE:%.*]], null
				; CHECK-NEXT: ret i1 [[CND]]
				;
				entry:
				%gep = getelementptr inbounds i8, i8* %base, i64 %idx
				%cnd = icmp ne i8* %gep, null
				ret i1 %cnd
				}

				define i1 @test_eq(i8* %base, i64 %idx) {
				; CHECK-LABEL: @test_eq(
				; CHECK-NEXT: entry:
				; CHECK-NEXT: [[CND:%.]] = icmp eq i8 [[BASE:%.*]], null
				; CHECK-NEXT: ret i1 [[CND]]
				;
				entry:
				%gep = getelementptr inbounds i8, i8* %base, i64 %idx
				%cnd = icmp eq i8* %gep, null
				ret i1 %cnd
				}

				;; TODO: vectors not yet handled
				define <2 x i1> @test_vector_base(<2 x i8*> %base, i64 %idx) {
				; CHECK-LABEL: @test_vector_base(
				; CHECK-NEXT: entry:
				; CHECK-NEXT: [[GEP:%.]] = getelementptr inbounds i8, <2 x i8> [[BASE:%.]], i64 [[IDX:%.]]
				; CHECK-NEXT: [[CND:%.]] = icmp eq <2 x i8> [[GEP]], zeroinitializer
				; CHECK-NEXT: ret <2 x i1> [[CND]]
				;
				entry:
				%gep = getelementptr inbounds i8, <2 x i8*> %base, i64 %idx
				%cnd = icmp eq <2 x i8*> %gep, zeroinitializer
				ret <2 x i1> %cnd
				}

				define <2 x i1> @test_vector_index(i8* %base, <2 x i64> %idx) {
				; CHECK-LABEL: @test_vector_index(
				; CHECK-NEXT: entry:
				; CHECK-NEXT: [[GEP:%.]] = getelementptr inbounds i8, i8 [[BASE:%.]], <2 x i64> [[IDX:%.]]
				; CHECK-NEXT: [[CND:%.]] = icmp eq <2 x i8> [[GEP]], zeroinitializer
				; CHECK-NEXT: ret <2 x i1> [[CND]]
				;
				entry:
				%gep = getelementptr inbounds i8, i8* %base, <2 x i64> %idx
				%cnd = icmp eq <2 x i8*> %gep, zeroinitializer
				ret <2 x i1> %cnd
				}

				;; These two show instsimplify's reasoning getting to the non-zero offsets
				;; before instcombine does.

				define i1 @test_eq_pos_idx(i8* %base) {
				; CHECK-LABEL: @test_eq_pos_idx(
				; CHECK-NEXT: entry:
				; CHECK-NEXT: ret i1 false
				;
				entry:
				%gep = getelementptr inbounds i8, i8* %base, i64 1
				%cnd = icmp eq i8* %gep, null
				ret i1 %cnd
				}

				define i1 @test_eq_neg_idx(i8* %base) {
				; CHECK-LABEL: @test_eq_neg_idx(
				; CHECK-NEXT: entry:
				; CHECK-NEXT: ret i1 false
				;
				entry:
				%gep = getelementptr inbounds i8, i8* %base, i64 -1
				%cnd = icmp eq i8* %gep, null
				ret i1 %cnd
				}

				;; Show an example with a zero sized type since that's
				;; a cornercase which keeps getting mentioned. The GEP
				;; produces %base regardless of the value of the index
				;; expression.
				define i1 @test_size0({}* %base, i64 %idx) {
				; CHECK-LABEL: @test_size0(
				; CHECK-NEXT: entry:
				; CHECK-NEXT: [[CND:%.]] = icmp ne {} [[BASE:%.*]], null
				; CHECK-NEXT: ret i1 [[CND]]
				;
				entry:
				%gep = getelementptr inbounds {}, {}* %base, i64 %idx
				%cnd = icmp ne {}* %gep, null
				ret i1 %cnd
				}
				define i1 @test_size0_nonzero_offset({}* %base) {
				; CHECK-LABEL: @test_size0_nonzero_offset(
				; CHECK-NEXT: entry:
				; CHECK-NEXT: [[CND:%.]] = icmp ne {} [[BASE:%.*]], null
				; CHECK-NEXT: ret i1 [[CND]]
				;
				entry:
				%gep = getelementptr inbounds {}, {}* %base, i64 15
				%cnd = icmp ne {}* %gep, null
				ret i1 %cnd
				}


				;; Finally, some negative tests for sanity checking.

				define i1 @neq_noinbounds(i8* %base, i64 %idx) {
				; CHECK-LABEL: @neq_noinbounds(
				; CHECK-NEXT: entry:
				; CHECK-NEXT: [[GEP:%.]] = getelementptr i8, i8 [[BASE:%.]], i64 [[IDX:%.]]
				; CHECK-NEXT: [[CND:%.]] = icmp ne i8 [[GEP]], null
				; CHECK-NEXT: ret i1 [[CND]]
				;
				entry:
				%gep = getelementptr i8, i8* %base, i64 %idx
				%cnd = icmp ne i8* %gep, null
				ret i1 %cnd
				}

				define i1 @neg_objectatnull(i8 addrspace(2)* %base, i64 %idx) {
				; CHECK-LABEL: @neg_objectatnull(
				; CHECK-NEXT: entry:
				; CHECK-NEXT: [[GEP:%.]] = getelementptr inbounds i8, i8 addrspace(2) [[BASE:%.]], i64 [[IDX:%.]]
				; CHECK-NEXT: [[CND:%.]] = icmp eq i8 addrspace(2) [[GEP]], null
				; CHECK-NEXT: ret i1 [[CND]]
				;
				entry:
				%gep = getelementptr inbounds i8, i8 addrspace(2)* %base, i64 %idx
				%cnd = icmp eq i8 addrspace(2)* %gep, null
				ret i1 %cnd
				}