This is an archive of the discontinued LLVM Phabricator instance.

Differential D55449

[InstCombine] Fix negative GEP offset evaluation for 32-bit pointers
Closed · Public

Authored by nikic on Dec 7 2018, 11:59 AM.

Details

Reviewers

spatel
RKSimon
majnemer
efriedma

Commits

rG36e03ac6ee91: [InstCombine] Fix negative GEP offset evaluation for 32-bit pointers
rL348987: [InstCombine] Fix negative GEP offset evaluation for 32-bit pointers

Summary

This fixes https://bugs.llvm.org/show_bug.cgi?id=39908.

The evaluateGEPOffsetExpression() function simplifies GEP offsets for use in comparisons against zero, basically by converting X*Scale+Offset==0 to X+Offset/Scale==0 if Scale divides Offset. However, before this is done, Offset is masked down to the pointer size. This produces incorrect results for negative Offsets, because we end up dividing the 32-bit offset *zero*-extended to 64 bits (rather than sign-extended).

The masking code could be fixed to properly replicate sign bits. However, as we are operating on inbounds GEPs here, I would expect that we do not have to care about address space overflows, because these would result in poison anyway. If that understanding is correct, then we can just drop this code entirely, which is what this patch does.
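
A standalone sketch of the arithmetic described above, written outside of LLVM. The helper signExtend64 is a local stand-in for this example and is only assumed to behave like llvm::SignExtend64; maskedQuotient and signExtendedQuotient are hypothetical names, not functions from InstCombine. Both assume a nonzero Scale.

```cpp
#include <cassert>
#include <cstdint>

// Local stand-in (assumption): sign-extend the low `Bits` bits of X to 64 bits.
// Relies on arithmetic right shift of negative values, which is guaranteed
// since C++20 and is what mainstream compilers do in earlier standards.
int64_t signExtend64(uint64_t X, unsigned Bits) {
  assert(Bits > 0 && Bits <= 64 && "bit width out of range");
  return int64_t(X << (64 - Bits)) >> (64 - Bits);
}

// Behaviour before the patch: mask both values to the pointer width.
// Masking zero-extends, so a negative 32-bit offset becomes a large
// positive 64-bit number before the division.
int64_t maskedQuotient(int64_t Offset, int64_t Scale, unsigned PtrBits) {
  uint64_t PtrSizeMask = ~0ULL >> (64 - PtrBits);
  int64_t MaskedOffset = int64_t(uint64_t(Offset) & PtrSizeMask);
  int64_t MaskedScale = int64_t(uint64_t(Scale) & PtrSizeMask);
  return MaskedOffset / MaskedScale;
}

// Behaviour with a signed truncation: the sign of the pointer-width value
// is preserved, so the division sees the intended negative offset.
int64_t signExtendedQuotient(int64_t Offset, int64_t Scale, unsigned PtrBits) {
  int64_t SignedOffset = signExtend64(uint64_t(Offset), PtrBits);
  int64_t SignedScale = signExtend64(uint64_t(Scale), PtrBits);
  return SignedOffset / SignedScale;
}
```

With Offset = -8, Scale = 8 and a 32-bit pointer width, maskedQuotient divides 4294967288 by 8 and returns 536870911, while signExtendedQuotient returns -1.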

Diff Detail

Repository: rL LLVM

Event Timeline

nikic created this revision. Dec 7 2018, 11:59 AM

Herald added a subscriber: llvm-commits. Dec 7 2018, 11:59 AM

RKSimon added a reviewer: majnemer. Dec 7 2018, 1:50 PM

mati865 added a subscriber: mati865. Dec 8 2018, 3:44 AM

nikic added a reviewer: efriedma. Dec 12 2018, 11:56 AM

I think I would rather SignExtend64(X, 32) or something like that... yes, it's effectively the same for well-defined code, but I'd rather explicitly truncate the offset rather than implicitly truncate it in ConstantInt::get.

In D55449#1328633, @efriedma wrote:

I think I would rather SignExtend64(X, 32) or something like that... yes, it's effectively the same for well-defined code, but I'd rather explicitly truncate the offset rather than implicitly truncate it in ConstantInt::get.

Another possibility would be to not apply the simplification if the offsets are too large, i.e. make isIntN(IntPtrWidth, Offset) && isIntN(IntPtrWidth, VariableScale) a precondition for this transform. What do you think?

Checking for overflow would also be fine, but I think you would want to check overflow on each operation that modifies the offset, as opposed to the final result (which might have overflowed the 64-bit integer anyway).
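
The two overflow-checking options discussed here can be sketched as follows. This is an illustration only: fitsInSignedBits is a local stand-in assumed to mirror what llvm::isIntN checks, and checkedAdd and accumulateOffset are hypothetical helpers that do not exist in InstCombine. The sketch checks after every addition rather than only on the final value.

```cpp
#include <cassert>
#include <cstdint>
#include <limits>
#include <optional>
#include <vector>

// Local stand-in (assumption): does X fit in an N-bit signed integer?
bool fitsInSignedBits(unsigned N, int64_t X) {
  assert(N > 0 && "bit width must be positive");
  if (N >= 64)
    return true;
  int64_t Min = -(int64_t(1) << (N - 1));
  int64_t Max = (int64_t(1) << (N - 1)) - 1;
  return Min <= X && X <= Max;
}

// Hypothetical helper: 64-bit signed addition that reports overflow
// instead of wrapping.
std::optional<int64_t> checkedAdd(int64_t A, int64_t B) {
  constexpr int64_t Max = std::numeric_limits<int64_t>::max();
  constexpr int64_t Min = std::numeric_limits<int64_t>::min();
  if ((B > 0 && A > Max - B) || (B < 0 && A < Min - B))
    return std::nullopt;
  return A + B;
}

// Hypothetical helper: sum constant offset terms, giving up as soon as any
// intermediate result overflows 64 bits or no longer fits the pointer width.
std::optional<int64_t> accumulateOffset(const std::vector<int64_t> &Terms,
                                        unsigned PtrBits) {
  int64_t Offset = 0;
  for (int64_t Term : Terms) {
    std::optional<int64_t> Sum = checkedAdd(Offset, Term);
    if (!Sum || !fitsInSignedBits(PtrBits, *Sum))
      return std::nullopt; // Caller would skip the transform.
    Offset = *Sum;
  }
  return Offset;
}
```

Under this scheme an offset such as 8589934592 is rejected for a 32-bit pointer width, whereas the SignExtend64 approach would wrap it modulo 2^32.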

Use SignExtend64 for a signed truncation.

I went with the SignExtend64 approach you suggested. Looking at the code directly above (for the zero offset case), it even explicitly inserts a truncate into IR, so it seems appropriate to just follow suit.

LGTM

This revision is now accepted and ready to land. Dec 12 2018, 2:19 PM

Closed by commit rL348987: [InstCombine] Fix negative GEP offset evaluation for 32-bit pointers (authored by nikic). Dec 12 2018, 3:22 PM

This revision was automatically updated to reflect the committed changes.

Revision Contents

Path                                                           Size
llvm/trunk/lib/Transforms/InstCombine/InstCombineCompares.cpp  8 lines
llvm/trunk/test/Transforms/InstCombine/pr39908.ll              49 lines

Diff 177951

llvm/trunk/lib/Transforms/InstCombine/InstCombineCompares.cpp

  if (Offset == 0) {
    ...
    // computation crosses zero.
    if (VariableIdx->getType()->getPrimitiveSizeInBits() > IntPtrWidth) {
      VariableIdx = IC.Builder.CreateTrunc(VariableIdx, IntPtrTy);
    }
    return VariableIdx;
  }

  // Otherwise, there is an index. The computation we will do will be modulo
- // the pointer size, so get it.
- uint64_t PtrSizeMask = ~0ULL >> (64-IntPtrWidth);
-
- Offset &= PtrSizeMask;
- VariableScale &= PtrSizeMask;
+ // the pointer size.
+ Offset = SignExtend64(Offset, IntPtrWidth);
+ VariableScale = SignExtend64(VariableScale, IntPtrWidth);

  // To do this transformation, any constant index must be a multiple of the
  // variable scale factor. For example, we can evaluate "12 + 4*i" as "3 + i",
  // but we can't evaluate "10 + 3*i" in terms of i. Check that the offset is a
  // multiple of the variable scale.
  int64_t NewOffs = Offset / (int64_t)VariableScale;
  if (Offset != NewOffs*(int64_t)VariableScale)
    return nullptr;

llvm/trunk/test/Transforms/InstCombine/pr39908.ll

				; NOTE: Assertions have been autogenerated by utils/update_test_checks.py
				; RUN: opt < %s -instcombine -S | FileCheck %s

				target datalayout = "p:32:32"

				%S = type { [2 x i32] }

				define i1 @test([0 x %S]* %p, i32 %n) {
				; CHECK-LABEL: @test(
				; CHECK-NEXT: [[CMP:%.*]] = icmp eq i32 [[N:%.*]], 1
				; CHECK-NEXT: ret i1 [[CMP]]
				;
				%start.cast = bitcast [0 x %S]* %p to %S*
				%end = getelementptr inbounds [0 x %S], [0 x %S]* %p, i32 0, i32 %n, i32 0, i32 0
				%end.cast = bitcast i32* %end to %S*
				%last = getelementptr inbounds %S, %S* %end.cast, i32 -1
				%cmp = icmp eq %S* %last, %start.cast
				ret i1 %cmp
				}

				; Same test using 64-bit indices.
				define i1 @test64([0 x %S]* %p, i64 %n) {
				; CHECK-LABEL: @test64(
				; CHECK-NEXT: [[TMP1:%.*]] = trunc i64 [[N:%.*]] to i32
				; CHECK-NEXT: [[CMP:%.*]] = icmp eq i32 [[TMP1]], 1
				; CHECK-NEXT: ret i1 [[CMP]]
				;
				%start.cast = bitcast [0 x %S]* %p to %S*
				%end = getelementptr inbounds [0 x %S], [0 x %S]* %p, i64 0, i64 %n, i32 0, i64 0
				%end.cast = bitcast i32* %end to %S*
				%last = getelementptr inbounds %S, %S* %end.cast, i64 -1
				%cmp = icmp eq %S* %last, %start.cast
				ret i1 %cmp
				}

				; Here the offset overflows and is treated modulo 2^32. This is UB.
				define i1 @test64_overflow([0 x %S]* %p, i64 %n) {
				; CHECK-LABEL: @test64_overflow(
				; CHECK-NEXT: [[TMP1:%.*]] = trunc i64 [[N:%.*]] to i32
				; CHECK-NEXT: [[CMP:%.*]] = icmp eq i32 [[TMP1]], 1
				; CHECK-NEXT: ret i1 [[CMP]]
				;
				%start.cast = bitcast [0 x %S]* %p to %S*
				%end = getelementptr inbounds [0 x %S], [0 x %S]* %p, i64 0, i64 %n, i32 0, i64 8589934592
				%end.cast = bitcast i32* %end to %S*
				%last = getelementptr inbounds %S, %S* %end.cast, i64 -1
				%cmp = icmp eq %S* %last, %start.cast
				ret i1 %cmp
				}