This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/
-
lib/Analysis/
-
Analysis/
1/2
BasicAliasAnalysis.cpp
-
test/Analysis/BasicAA/
-
Analysis/
-
BasicAA/
-
struct-geps.ll

Differential D91027

[BasicAA] Generalize base offset modulus handling
ClosedPublic

Authored by nikic on Nov 8 2020, 1:49 AM.

Download Raw Diff

Details

Reviewers

asbirlea
fhahn
jdoerfert
hfinkel

Commits

rGcd3c22c47e4b: [BasicAA] Generalize base offset modulus handling

Summary

The GEP aliasing implementation currently has two pieces of code that solve two different subsets of the same basic problem: If you have GEPs with offsets 4*x + 0 and 4*y + 1 (assuming access size 1), then they do not alias regardless of whether x and y are the same.

One implementation is in aliasSameBasePointerGEPs(), which looks at this in a limited structural way. It requires both GEP base pointers to be exactly the same, then (optionally) a number of equal indexes, then an unknown index, then a non-equal index into a struct. This set of limitations works, but it's overly restrictive and hides the core property we're trying to exploit.

The second implementation is part of aliasGEP() itself and tries to find a common modulus in the scales, so it can then check that the constant offset doesn't overlap under modular arithmetic. The second implementation has the right idea of what the general problem is, but effectively only considers power of two factors in the scales (while aliasSameBasePointerGEPs also works with non-pow2 struct sizes.)

What this patch does is to adjust the aliasGEP() implementation to instead find the largest common factor in all the scales (i.e. the GCD) and use that as the modulus.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

nikic created this revision.Nov 8 2020, 1:49 AM

Herald added a project: Restricted Project. · View Herald TranscriptNov 8 2020, 1:49 AM

Herald added subscribers: llvm-commits, hiraditya. · View Herald Transcript

nikic requested review of this revision.Nov 8 2020, 1:49 AM

Harbormaster completed remote builds in B78028: Diff 303699.Nov 8 2020, 2:34 AM

jdoerfert added inline comments.Nov 9 2020, 7:43 AM

llvm/lib/Analysis/BasicAliasAnalysis.cpp

1402

Maybe a comment above this to explain, I don't get the reasoning TBH.

Here is my idea of what is happening but it is not complete:

// X[0:N] is a vector of N+1 unknowns
GEP1 = GEP1BaseOffset + X[0:N] * GEP1.scale[0:N]
GCD = gcd(GEP1.scale[0], ..., GEP1.scale[N])
ModOffset = GEP1BaseOffset % GCD
// Every value of GEP1 looks like: GEP1 := Y * GCD + ModOffset
// and ModOffset < GCD.
Diff = GCD - ModOffset
if (ModOffset >=u V2Size  // I guess V2 fits in the size between Y * GCD and GEP1
 &&      Diff >=u V1Size  // this I don't understand.
  return NoAlias;

Update comment.

nikic added inline comments.Nov 9 2020, 10:05 AM

llvm/lib/Analysis/BasicAliasAnalysis.cpp
1402	Does the new comment make sense to you? We have the second access at [0..V2Size) with the remaining [V2Size..GCD) not being accessed. We don't alias if the first access fits in there, i.e. ModOffset >= V2Size and ModOffset+V1Size <= GCD.

Yes, now I get it. I think I was also missing the "V2 has offset 0" precondition.

I like this, nice cleanup that adds more power. I think it is correct and good to go, would prefer a second opinion though.

This makes sense to me too. The additional comment was helpful with the gcd/difference condition.

This revision is now accepted and ready to land.Nov 17 2020, 5:10 PM

Closed by commit rGcd3c22c47e4b: [BasicAA] Generalize base offset modulus handling (authored by nikic). · Explain WhyNov 18 2020, 12:49 PM

This revision was automatically updated to reflect the committed changes.

nikic added a commit: rGcd3c22c47e4b: [BasicAA] Generalize base offset modulus handling.

nikic mentioned this in D92723: [BasicAA] Migrate "same base pointer" logic to decomposed GEPs.Dec 5 2020, 12:39 PM

nikic mentioned this in rG5e69e2ebad9d: [BasicAA] Migrate "same base pointer" logic to decomposed GEPs.Dec 6 2020, 1:31 AM

Unfortunately we are seeing mis-compiles caused by this patch. I don't think think the use of modulo arithmetic here is safe, unless the index computation is guaranteed to not overflow.

For the example below, this patch determines no-alias for %gep.idx and %gep.6. But if %mul overflows (e.g. %idx == 52), %add == 6 and they are must alias. (https://godbolt.org/z/bWjz86K97)

I pushed a set of test cases in bcc8d80192f1 and put up a candidate fix D99424

; %gep.idx and %gep.6 must-alias if %mul overflows (e.g. %idx == 52).
define void @may_overflow_mul_add_i8([16 x i8]* %ptr, i8 %idx) {
  %mul = mul i8 %idx, 5
  %add = add i8 %mul, 2
  %gep.idx = getelementptr [16 x i8], [16 x i8]* %ptr, i32 0, i8 %add
  store i8 0, i8* %gep.idx, align 1
  %gep.6 = getelementptr [16 x i8], [16 x i8]* %ptr, i32 0, i32 6
  store i8 1, i8* %gep.6, align 1
  ret void
}

Revision Contents

Path

Size

llvm/

lib/

Analysis/

BasicAliasAnalysis.cpp

70 lines

test/

Analysis/

BasicAA/

struct-geps.ll

24 lines

Diff 306191

llvm/lib/Analysis/BasicAliasAnalysis.cpp

Show First 20 Lines • Show All 1,082 Lines • ▼ Show 20 Lines	for (unsigned i = 1, e = GEP1->getNumIndices() - 1; i != e; ++i) {
if (!isa<ArrayType>(GetElementPtrInst::getIndexedType(		if (!isa<ArrayType>(GetElementPtrInst::getIndexedType(
GEP1->getSourceElementType(), IntermediateIndices)))		GEP1->getSourceElementType(), IntermediateIndices)))
return MayAlias;		return MayAlias;
IntermediateIndices.push_back(GEP1->getOperand(i + 1));		IntermediateIndices.push_back(GEP1->getOperand(i + 1));
}		}

auto *Ty = GetElementPtrInst::getIndexedType(		auto *Ty = GetElementPtrInst::getIndexedType(
GEP1->getSourceElementType(), IntermediateIndices);		GEP1->getSourceElementType(), IntermediateIndices);
StructType *LastIndexedStruct = dyn_cast<StructType>(Ty);

if (isa<ArrayType>(Ty) \|\| isa<VectorType>(Ty)) {		if (isa<ArrayType>(Ty) \|\| isa<VectorType>(Ty)) {
// We know that:		// We know that:
// - both GEPs begin indexing from the exact same pointer;		// - both GEPs begin indexing from the exact same pointer;
// - the last indices in both GEPs are constants, indexing into a sequential		// - the last indices in both GEPs are constants, indexing into a sequential
// type (array or vector);		// type (array or vector);
// - both GEPs only index through arrays prior to that.		// - both GEPs only index through arrays prior to that.
//		//
// Because array indices greater than the number of elements are valid in		// Because array indices greater than the number of elements are valid in
Show All 37 Lines	if (C1 && C2)
KnownBits Known2 = computeKnownBits(GEP2LastIdx, DL);		KnownBits Known2 = computeKnownBits(GEP2LastIdx, DL);
if (Known1.Zero.intersects(Known2.One) \|\|		if (Known1.Zero.intersects(Known2.One) \|\|
Known1.One.intersects(Known2.Zero))		Known1.One.intersects(Known2.Zero))
return NoAlias;		return NoAlias;
}		}
} else if (isKnownNonEqual(GEP1LastIdx, GEP2LastIdx, DL))		} else if (isKnownNonEqual(GEP1LastIdx, GEP2LastIdx, DL))
return NoAlias;		return NoAlias;
}		}
return MayAlias;
} else if (!LastIndexedStruct \|\| !C1 \|\| !C2) {
return MayAlias;
}		}

if (C1->getValue().getActiveBits() > 64 \|\|
C2->getValue().getActiveBits() > 64)
return MayAlias;

// We know that:
// - both GEPs begin indexing from the exact same pointer;
// - the last indices in both GEPs are constants, indexing into a struct;
// - said indices are different, hence, the pointed-to fields are different;
// - both GEPs only index through arrays prior to that.
//
// This lets us determine that the struct that GEP1 indexes into and the
// struct that GEP2 indexes into must either precisely overlap or be
// completely disjoint. Because they cannot partially overlap, indexing into
// different non-overlapping fields of the struct will never alias.

// Therefore, the only remaining thing needed to show that both GEPs can't
// alias is that the fields are not overlapping.
const StructLayout *SL = DL.getStructLayout(LastIndexedStruct);
const uint64_t StructSize = SL->getSizeInBytes();
const uint64_t V1Off = SL->getElementOffset(C1->getZExtValue());
const uint64_t V2Off = SL->getElementOffset(C2->getZExtValue());

auto EltsDontOverlap = [StructSize](uint64_t V1Off, uint64_t V1Size,
uint64_t V2Off, uint64_t V2Size) {
return V1Off < V2Off && V1Off + V1Size <= V2Off &&
((V2Off + V2Size <= StructSize) \|\|
(V2Off + V2Size - StructSize <= V1Off));
};

if (EltsDontOverlap(V1Off, V1Size, V2Off, V2Size) \|\|
EltsDontOverlap(V2Off, V2Size, V1Off, V1Size))
return NoAlias;

return MayAlias;		return MayAlias;
}		}

// If a we have (a) a GEP and (b) a pointer based on an alloca, and the		// If a we have (a) a GEP and (b) a pointer based on an alloca, and the
// beginning of the object the GEP points would have a negative offset with		// beginning of the object the GEP points would have a negative offset with
// repsect to the alloca, that means the GEP can not alias pointer (b).		// repsect to the alloca, that means the GEP can not alias pointer (b).
// Note that the pointer based on the alloca may not be a GEP. For		// Note that the pointer based on the alloca may not be a GEP. For
// example, it may be the alloca itself.		// example, it may be the alloca itself.
▲ Show 20 Lines • Show All 195 Lines • ▼ Show 20 Lines	if (GEP1BaseOffset.sge(0)) {
if ((-GEP1BaseOffset).ult(V1Size.getValue()))		if ((-GEP1BaseOffset).ult(V1Size.getValue()))
return PartialAlias;		return PartialAlias;
return NoAlias;		return NoAlias;
}		}
}		}
}		}

if (!DecompGEP1.VarIndices.empty()) {		if (!DecompGEP1.VarIndices.empty()) {
APInt Modulo(MaxPointerSize, 0);		APInt GCD;
bool AllNonNegative = GEP1BaseOffset.isNonNegative();		bool AllNonNegative = GEP1BaseOffset.isNonNegative();
bool AllNonPositive = GEP1BaseOffset.isNonPositive();		bool AllNonPositive = GEP1BaseOffset.isNonPositive();
for (unsigned i = 0, e = DecompGEP1.VarIndices.size(); i != e; ++i) {		for (unsigned i = 0, e = DecompGEP1.VarIndices.size(); i != e; ++i) {
		const APInt &Scale = DecompGEP1.VarIndices[i].Scale;
// Try to distinguish something like &A[i][1] against &A[42][0].		if (i == 0)
// Grab the least significant bit set in any of the scales. We		GCD = Scale.abs();
// don't need std::abs here (even if the scale's negative) as we'll		else
// be ^'ing Modulo with itself later.		GCD = APIntOps::GreatestCommonDivisor(GCD, Scale.abs());
Modulo \|= DecompGEP1.VarIndices[i].Scale;

if (AllNonNegative \|\| AllNonPositive) {		if (AllNonNegative \|\| AllNonPositive) {
// If the Value could change between cycles, then any reasoning about		// If the Value could change between cycles, then any reasoning about
// the Value this cycle may not hold in the next cycle. We'll just		// the Value this cycle may not hold in the next cycle. We'll just
// give up if we can't determine conditions that hold for every cycle:		// give up if we can't determine conditions that hold for every cycle:
const Value *V = DecompGEP1.VarIndices[i].V;		const Value *V = DecompGEP1.VarIndices[i].V;

KnownBits Known =		KnownBits Known =
computeKnownBits(V, DL, 0, &AC, dyn_cast<Instruction>(GEP1), DT);		computeKnownBits(V, DL, 0, &AC, dyn_cast<Instruction>(GEP1), DT);
bool SignKnownZero = Known.isNonNegative();		bool SignKnownZero = Known.isNonNegative();
bool SignKnownOne = Known.isNegative();		bool SignKnownOne = Known.isNegative();

// Zero-extension widens the variable, and so forces the sign		// Zero-extension widens the variable, and so forces the sign
// bit to zero.		// bit to zero.
bool IsZExt = DecompGEP1.VarIndices[i].ZExtBits > 0 \|\| isa<ZExtInst>(V);		bool IsZExt = DecompGEP1.VarIndices[i].ZExtBits > 0 \|\| isa<ZExtInst>(V);
SignKnownZero \|= IsZExt;		SignKnownZero \|= IsZExt;
SignKnownOne &= !IsZExt;		SignKnownOne &= !IsZExt;

APInt Scale = DecompGEP1.VarIndices[i].Scale;
AllNonNegative &= (SignKnownZero && Scale.isNonNegative()) \|\|		AllNonNegative &= (SignKnownZero && Scale.isNonNegative()) \|\|
(SignKnownOne && Scale.isNonPositive());		(SignKnownOne && Scale.isNonPositive());
AllNonPositive &= (SignKnownZero && Scale.isNonPositive()) \|\|		AllNonPositive &= (SignKnownZero && Scale.isNonPositive()) \|\|
(SignKnownOne && Scale.isNonNegative());		(SignKnownOne && Scale.isNonNegative());
}		}
}		}

Modulo = Modulo ^ (Modulo & (Modulo - 1));		// We now have accesses at two offsets from the same base:
		// 1. (...)*GCD + GEP1BaseOffset with size V1Size
// We can compute the difference between the two addresses		// 2. 0 with size V2Size
// mod Modulo. Check whether that difference guarantees that the		// Using arithmetic modulo GCD, the accesses are at
// two locations do not alias.		// [ModOffset..ModOffset+V1Size) and [0..V2Size). If the first access fits
APInt ModOffset = GEP1BaseOffset & (Modulo - 1);		// into the range [V2Size..GCD), then we know they cannot overlap.
		APInt ModOffset = GEP1BaseOffset.srem(GCD);
		if (ModOffset.isNegative())
		ModOffset += GCD; // We want mod, not rem.
if (V1Size != LocationSize::unknown() &&		if (V1Size != LocationSize::unknown() &&
V2Size != LocationSize::unknown() && ModOffset.uge(V2Size.getValue()) &&		V2Size != LocationSize::unknown() && ModOffset.uge(V2Size.getValue()) &&
(Modulo - ModOffset).uge(V1Size.getValue()))		(GCD - ModOffset).uge(V1Size.getValue()))
return NoAlias;		return NoAlias;
		jdoerfertUnsubmitted Not Done Reply Inline Actions Maybe a comment above this to explain, I don't get the reasoning TBH. Here is my idea of what is happening but it is not complete: // X[0:N] is a vector of N+1 unknowns GEP1 = GEP1BaseOffset + X[0:N] * GEP1.scale[0:N] GCD = gcd(GEP1.scale[0], ..., GEP1.scale[N]) ModOffset = GEP1BaseOffset % GCD // Every value of GEP1 looks like: GEP1 := Y * GCD + ModOffset // and ModOffset < GCD. Diff = GCD - ModOffset if (ModOffset >=u V2Size // I guess V2 fits in the size between Y * GCD and GEP1 && Diff >=u V1Size // this I don't understand. return NoAlias; jdoerfert: Maybe a comment above this to explain, I don't get the reasoning TBH. Here is my idea of what…
		nikicAuthorUnsubmitted Done Reply Inline Actions Does the new comment make sense to you? We have the second access at [0..V2Size) with the remaining [V2Size..GCD) not being accessed. We don't alias if the first access fits in there, i.e. ModOffset >= V2Size and ModOffset+V1Size <= GCD. nikic: Does the new comment make sense to you? We have the second access at [0..V2Size) with the…

// If we know all the variables are non-negative, then the total offset is		// If we know all the variables are non-negative, then the total offset is
// also non-negative and >= GEP1BaseOffset. We have the following layout:		// also non-negative and >= GEP1BaseOffset. We have the following layout:
// [0, V2Size) ... [TotalOffset, TotalOffer+V1Size]		// [0, V2Size) ... [TotalOffset, TotalOffer+V1Size]
// If GEP1BaseOffset >= V2Size, the accesses don't alias.		// If GEP1BaseOffset >= V2Size, the accesses don't alias.
if (AllNonNegative && V2Size != LocationSize::unknown() &&		if (AllNonNegative && V2Size != LocationSize::unknown() &&
GEP1BaseOffset.uge(V2Size.getValue()))		GEP1BaseOffset.uge(V2Size.getValue()))
return NoAlias;		return NoAlias;
▲ Show 20 Lines • Show All 585 Lines • Show Last 20 Lines

llvm/test/Analysis/BasicAA/struct-geps.ll

	Show First 20 Lines • Show All 100 Lines • ▼ Show 20 Lines
	}			}

	; CHECK-LABEL: test_same_underlying_object_same_indices			; CHECK-LABEL: test_same_underlying_object_same_indices

	; CHECK-DAG: NoAlias: i32* %x, i32* %x2			; CHECK-DAG: NoAlias: i32* %x, i32* %x2
	; CHECK-DAG: NoAlias: i32* %y, i32* %y2			; CHECK-DAG: NoAlias: i32* %y, i32* %y2
	; CHECK-DAG: NoAlias: i32* %z, i32* %z2			; CHECK-DAG: NoAlias: i32* %z, i32* %z2

	; CHECK-DAG: MayAlias: i32* %x, i32* %y2			; CHECK-DAG: NoAlias: i32* %x, i32* %y2
	; CHECK-DAG: MayAlias: i32* %x, i32* %z2			; CHECK-DAG: NoAlias: i32* %x, i32* %z2

	; CHECK-DAG: MayAlias: i32* %x2, i32* %y			; CHECK-DAG: NoAlias: i32* %x2, i32* %y
	; CHECK-DAG: MayAlias: i32* %y, i32* %z2			; CHECK-DAG: NoAlias: i32* %y, i32* %z2

	; CHECK-DAG: MayAlias: i32* %x2, i32* %z			; CHECK-DAG: NoAlias: i32* %x2, i32* %z
	; CHECK-DAG: MayAlias: i32* %y2, i32* %z			; CHECK-DAG: NoAlias: i32* %y2, i32* %z

	define void @test_same_underlying_object_same_indices(%struct* %st, i64 %i, i64 %j, i64 %k) {			define void @test_same_underlying_object_same_indices(%struct* %st, i64 %i, i64 %j, i64 %k) {
	%st2 = getelementptr %struct, %struct* %st, i32 10			%st2 = getelementptr %struct, %struct* %st, i32 10
	%x2 = getelementptr %struct, %struct* %st2, i64 %i, i32 0			%x2 = getelementptr %struct, %struct* %st2, i64 %i, i32 0
	%y2 = getelementptr %struct, %struct* %st2, i64 %j, i32 1			%y2 = getelementptr %struct, %struct* %st2, i64 %j, i32 1
	%z2 = getelementptr %struct, %struct* %st2, i64 %k, i32 2			%z2 = getelementptr %struct, %struct* %st2, i64 %k, i32 2
	%x = getelementptr %struct, %struct* %st, i64 %i, i32 0			%x = getelementptr %struct, %struct* %st, i64 %i, i32 0
	%y = getelementptr %struct, %struct* %st, i64 %j, i32 1			%y = getelementptr %struct, %struct* %st, i64 %j, i32 1
	%z = getelementptr %struct, %struct* %st, i64 %k, i32 2			%z = getelementptr %struct, %struct* %st, i64 %k, i32 2
	ret void			ret void
	}			}

	; CHECK-LABEL: test_same_underlying_object_different_indices			; CHECK-LABEL: test_same_underlying_object_different_indices

	; CHECK-DAG: MayAlias: i32* %x, i32* %x2			; CHECK-DAG: MayAlias: i32* %x, i32* %x2
	; CHECK-DAG: MayAlias: i32* %y, i32* %y2			; CHECK-DAG: MayAlias: i32* %y, i32* %y2
	; CHECK-DAG: MayAlias: i32* %z, i32* %z2			; CHECK-DAG: MayAlias: i32* %z, i32* %z2

	; CHECK-DAG: MayAlias: i32* %x, i32* %y2			; CHECK-DAG: NoAlias: i32* %x, i32* %y2
	; CHECK-DAG: MayAlias: i32* %x, i32* %z2			; CHECK-DAG: NoAlias: i32* %x, i32* %z2

	; CHECK-DAG: MayAlias: i32* %x2, i32* %y			; CHECK-DAG: NoAlias: i32* %x2, i32* %y
	; CHECK-DAG: MayAlias: i32* %y, i32* %z2			; CHECK-DAG: NoAlias: i32* %y, i32* %z2

	; CHECK-DAG: MayAlias: i32* %x2, i32* %z			; CHECK-DAG: NoAlias: i32* %x2, i32* %z
	; CHECK-DAG: MayAlias: i32* %y2, i32* %z			; CHECK-DAG: NoAlias: i32* %y2, i32* %z

	define void @test_same_underlying_object_different_indices(%struct* %st, i64 %i1, i64 %j1, i64 %k1, i64 %i2, i64 %k2, i64 %j2) {			define void @test_same_underlying_object_different_indices(%struct* %st, i64 %i1, i64 %j1, i64 %k1, i64 %i2, i64 %k2, i64 %j2) {
	%st2 = getelementptr %struct, %struct* %st, i32 10			%st2 = getelementptr %struct, %struct* %st, i32 10
	%x2 = getelementptr %struct, %struct* %st2, i64 %i2, i32 0			%x2 = getelementptr %struct, %struct* %st2, i64 %i2, i32 0
	%y2 = getelementptr %struct, %struct* %st2, i64 %j2, i32 1			%y2 = getelementptr %struct, %struct* %st2, i64 %j2, i32 1
	%z2 = getelementptr %struct, %struct* %st2, i64 %k2, i32 2			%z2 = getelementptr %struct, %struct* %st2, i64 %k2, i32 2
	%x = getelementptr %struct, %struct* %st, i64 %i1, i32 0			%x = getelementptr %struct, %struct* %st, i64 %i1, i32 0
	%y = getelementptr %struct, %struct* %st, i64 %j1, i32 1			%y = getelementptr %struct, %struct* %st, i64 %j1, i32 1
	Show All 23 Lines

This is an archive of the discontinued LLVM Phabricator instance.

[BasicAA] Generalize base offset modulus handlingClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 306191

llvm/lib/Analysis/BasicAliasAnalysis.cpp

llvm/test/Analysis/BasicAA/struct-geps.ll

[BasicAA] Generalize base offset modulus handling
ClosedPublic