Download Raw Diff

Details

Reviewers

sebpop
hfinkel
• dberlin
fhahn
grosser

Commits

rG5ef933b02ca5: [DA] Improve alias checking in dependence analysis
rL329692: [DA] Improve alias checking in dependence analysis

Summary

We were supplying the size to AA as DL.getTypeStoreSize(AObj->getType()),
which causes us to incorrectly presume noalias between loads/stores
with dependencies. I believe BasicAA was returning noalias because it
is undefined behaviour to access objects with a larger access than the
object size.

Diff Detail

Event Timeline

dmgreen created this revision.Jan 22 2018, 9:28 AM

dmgreen mentioned this in D41953: [LoopUnroll] Unroll and Jam.Jan 23 2018, 2:37 AM

Rebase and simplify test.

It's possible that the size here should be the size of the thing that AObj points to, as opposed to unknown. I've added a number of reviewers who may know what they are talking about.

ping :)

hfinkel added inline comments.Apr 4 2018, 2:55 PM

lib/Analysis/DependenceAnalysis.cpp
630–653	It's possible that the size here should be the size of the thing that AObj points to, as opposed to unknown There are two problems here, and I think that we can fix them both. What we're really trying to do here is to identify disjoint underlying objects. First, GetUnderlyingObject might not always return the underlying object (because it has a recursion-depth cutoff). Thus, we need to validate the fact that GetUnderlyingObject actually did return such a thing. So we really want to do: if (!isIdentifiedObject(AObj) \|\| !isIdentifiedObject(BObj)) return true; At that point, the sizes are essentially irrelevant. and we should pass MemoryLocation::UnknownSize as the size. However, at that point, since we have unique underlying objects, we can just compare them, so we can just do (there's no further value in calling into AA): return AObj == BObj;

nlopes added a subscriber: nlopes.Apr 5 2018, 2:59 AM

nlopes added inline comments.

lib/Analysis/DependenceAnalysis.cpp
630–653	I agree with Hal's comment. Seems like the way to go.

dmgreen updated this revision to Diff 141139.Apr 5 2018, 5:56 AM

dmgreen added inline comments.

lib/Analysis/DependenceAnalysis.cpp
630–653	This sounds good. I like the simplification. I've updated things to something that may still be wrong. As far as I understand, this is returning a tristate: MustAlias - Do dependency analysis. NoAlias - No dependence is possible. MayAlias - Don't know anything - a confused dependence. A lot of the tests use function arguments as input (as will real code). If AObj and BObj are the same argument, we are obviously mustalias, even if that isn't an identified object. So I think this is now still correct even if we hit the recursion limit, the mustalias will still be valid. Also (and this might be something that doesn't come up very often) what if one argument has a noalias attribute, but another doesn't (or one is a alloca, the other an argument). We can still prove noalias there? Also I was hoping to get TBAA working here. See D42382. As in - if the loads/stores of the Src and Dst are different types, we know there is no alias, so no dependency. Any ideas what the best bet there would be? Call alias on the Src and Dst? Work with the TBAA metadata somehow? (I'm not sure that's possible)

hfinkel added inline comments.Apr 6 2018, 5:19 AM

lib/Analysis/DependenceAnalysis.cpp
630–653	Also (and this might be something that doesn't come up very often) what if one argument has a noalias attribute, but another doesn't (or one is a alloca, the other an argument). We can still prove noalias there? No, although you could add that as a special case (i.e., one value is an argument (or global value) and for the other isIdentifiedFunctionLocal returns true. Also I was hoping to get TBAA working here. Good point. This will require a slightly larger change, but seems worthwhile. You'll want to collect the AA metadata and then construct a memory location using that metadata but using MemoryLocation::UnknownSize, and then to an AA query using those memory locations (this will effectively check the underlying objects, but also make use of the metadata). I recommend that you change this function to accept to MemoryLocation references. Get these from the original accesses using MemoryLocation::get, and then inside this function form two new MemoryLocation objects, one from each original, MemoryLocation(LocA.Ptr, MemoryLocation::UnknownSize, LocA.AATags) (or something equivalent).

Thanks for the pointers. I've folded the tbaa changes into this patch as they are related. This now checks the alias on the original locations, on the assumption that if they don't alias for any reason then there is no dependency. Then falls back to underlying object.

Minus size...

In D42381#1061022, @dmgreen wrote:

Thanks for the pointers. I've folded the tbaa changes into this patch as they are related. This now checks the alias on the original locations, on the assumption that if they don't alias for any reason then there is no dependency. Then falls back to underlying object.

Does the fallback add anything? I think that the AA query should catch all relevant cases (i.e., you can just return the result of the AA query and be done).

Does the fallback add anything?

We need to detect things like this as mustalias (to find the flow dependence), not mayalias (confused):

for (int i = 0; i < n; i++) {
  A[i + 2] = i;
  ... = A[i];

Unless you mean just do the alias analysis like this:?

return AA->alias(GetUnderlyingObject(LocA.Ptr), MemoryLocation::UnknownSize, LocA.AATags, GetUnderlyingObject(LocB.Ptr), MemoryLocation::UnknownSize, LocB.AATags)

In D42381#1061379, @dmgreen wrote:
Does the fallback add anything?

We need to detect things like this as mustalias (to find the flow dependence), not mayalias (confused):
for (int i = 0; i < n; i++) {
  A[i + 2] = i;
  ... = A[i];
Unless you mean just do the alias analysis like this:?
return AA->alias(GetUnderlyingObject(LocA.Ptr), MemoryLocation::UnknownSize, LocA.AATags, GetUnderlyingObject(LocB.Ptr), MemoryLocation::UnknownSize, LocB.AATags)

Just do:

return AA->alias(LocA.Ptr, MemoryLocation::UnknownSize, LocA.AATags, LocB.Ptr, MemoryLocation::UnknownSize, LocB.AATags)

there's no need to call GetUnderlyingObject here explicitly. When you pass an unknown size, that's essentially what BasicAA is doing anyway (since it has no size information, AA can only look at the metadata and the underlying objects for aliasing information).

But in here:

for (int i = 0; i < n; i++) {
  A[i + 2] = i;
  ... = A[i];

the load and store are mayalias. We need the fact that the underlying obects are mustalias (otherwise a number of tests are failing in a way that looks like it's discovering less dependencies, more are confused)

In D42381#1061394, @dmgreen wrote:
But in here:
for (int i = 0; i < n; i++) {
  A[i + 2] = i;
  ... = A[i];
the load and store are mayalias. We need the fact that the underlying obects are mustalias (otherwise a number of tests are failing in a way that looks like it's discovering less dependencies, more are confused)

Ah, okay. LGTM.

This revision is now accepted and ready to land.Apr 9 2018, 4:08 AM

Closed by commit rL329692: [DA] Improve alias checking in dependence analysis (authored by dmgreen). · Explain WhyApr 10 2018, 4:40 AM

This revision was automatically updated to reflect the committed changes.

dmgreen mentioned this in D42382: [DA] Pass TBAA info to the AA in dependency analysis.Apr 12 2018, 1:45 PM

Diff 141548

lib/Analysis/DependenceAnalysis.cpp

Show First 20 Lines • Show All 615 Lines • ▼ Show 20 Lines	if (isLoopIndependent())
OS << "\|<";		OS << "\|<";
OS << "]";		OS << "]";
if (Splitable)		if (Splitable)
OS << " splitable";		OS << " splitable";
}		}
OS << "!\n";		OS << "!\n";
}		}

		// Returns NoAlias/MayAliass/MustAlias for two memory locations based upon their
		// underlaying objects. If LocA and LocB are known to not alias (for any reason:
		// tbaa, non-overlapping regions etc), then it is known there is no dependecy.
		// Otherwise the underlying objects are checked to see if they point to
		// different identifiable objects.
static AliasResult underlyingObjectsAlias(AliasAnalysis *AA,		static AliasResult underlyingObjectsAlias(AliasAnalysis *AA,
const DataLayout &DL, const Value *A,		const DataLayout &DL,
const Value *B) {		const MemoryLocation &LocA,
const Value *AObj = GetUnderlyingObject(A, DL);		const MemoryLocation &LocB) {
const Value *BObj = GetUnderlyingObject(B, DL);		// Check the original locations for noalias, which can happen for
return AA->alias(AObj, DL.getTypeStoreSize(AObj->getType()),		// tbaa, incompatible underlying object locations, etc.
BObj, DL.getTypeStoreSize(BObj->getType()));		if (AA->alias(LocA, LocB) == NoAlias)
		return NoAlias;

		// Check the underlying objects are the same
		const Value *AObj = GetUnderlyingObject(LocA.Ptr, DL);
		const Value *BObj = GetUnderlyingObject(LocB.Ptr, DL);

		// If the underlying objects are the same, they must alias
		if (AObj == BObj)
		return MustAlias;

		// We may have hit the recursion limit for underlying objects, or have
		// underlying objects where we don't know they will alias.
		if (!isIdentifiedObject(AObj) \|\| !isIdentifiedObject(BObj))
		return MayAlias;

		// Otherwise we know the objects are different and both identified objects so
		// must not alias.
		return NoAlias;
		hfinkelUnsubmitted Not Done Reply Inline Actions It's possible that the size here should be the size of the thing that AObj points to, as opposed to unknown There are two problems here, and I think that we can fix them both. What we're really trying to do here is to identify disjoint underlying objects. First, GetUnderlyingObject might not always return the underlying object (because it has a recursion-depth cutoff). Thus, we need to validate the fact that GetUnderlyingObject actually did return such a thing. So we really want to do: if (!isIdentifiedObject(AObj) \|\| !isIdentifiedObject(BObj)) return true; At that point, the sizes are essentially irrelevant. and we should pass MemoryLocation::UnknownSize as the size. However, at that point, since we have unique underlying objects, we can just compare them, so we can just do (there's no further value in calling into AA): return AObj == BObj; hfinkel: > It's possible that the size here should be the size of the thing that AObj points to, as…
		nlopesUnsubmitted Not Done Reply Inline Actions I agree with Hal's comment. Seems like the way to go. nlopes: I agree with Hal's comment. Seems like the way to go.
		dmgreenAuthorUnsubmitted Not Done Reply Inline Actions This sounds good. I like the simplification. I've updated things to something that may still be wrong. As far as I understand, this is returning a tristate: MustAlias - Do dependency analysis. NoAlias - No dependence is possible. MayAlias - Don't know anything - a confused dependence. A lot of the tests use function arguments as input (as will real code). If AObj and BObj are the same argument, we are obviously mustalias, even if that isn't an identified object. So I think this is now still correct even if we hit the recursion limit, the mustalias will still be valid. Also (and this might be something that doesn't come up very often) what if one argument has a noalias attribute, but another doesn't (or one is a alloca, the other an argument). We can still prove noalias there? Also I was hoping to get TBAA working here. See D42382. As in - if the loads/stores of the Src and Dst are different types, we know there is no alias, so no dependency. Any ideas what the best bet there would be? Call alias on the Src and Dst? Work with the TBAA metadata somehow? (I'm not sure that's possible) dmgreen: This sounds good. I like the simplification. I've updated things to something that may still be…
		hfinkelUnsubmitted Not Done Reply Inline Actions Also (and this might be something that doesn't come up very often) what if one argument has a noalias attribute, but another doesn't (or one is a alloca, the other an argument). We can still prove noalias there? No, although you could add that as a special case (i.e., one value is an argument (or global value) and for the other isIdentifiedFunctionLocal returns true. Also I was hoping to get TBAA working here. Good point. This will require a slightly larger change, but seems worthwhile. You'll want to collect the AA metadata and then construct a memory location using that metadata but using MemoryLocation::UnknownSize, and then to an AA query using those memory locations (this will effectively check the underlying objects, but also make use of the metadata). I recommend that you change this function to accept to MemoryLocation references. Get these from the original accesses using MemoryLocation::get, and then inside this function form two new MemoryLocation objects, one from each original, MemoryLocation(LocA.Ptr, MemoryLocation::UnknownSize, LocA.AATags) (or something equivalent). hfinkel: > Also (and this might be something that doesn't come up very often) what if one argument has a…
}		}


// Returns true if the load or store can be analyzed. Atomic and volatile		// Returns true if the load or store can be analyzed. Atomic and volatile
// operations have properties which this analysis does not understand.		// operations have properties which this analysis does not understand.
static		static
bool isLoadOrStore(const Instruction *I) {		bool isLoadOrStore(const Instruction *I) {
if (const LoadInst *LI = dyn_cast<LoadInst>(I))		if (const LoadInst *LI = dyn_cast<LoadInst>(I))
▲ Show 20 Lines • Show All 2,654 Lines • ▼ Show 20 Lines	if (!isLoadOrStore(Src) \|\| !isLoadOrStore(Dst)) {
return make_unique<Dependence>(Src, Dst);		return make_unique<Dependence>(Src, Dst);
}		}

assert(isLoadOrStore(Src) && "instruction is not load or store");		assert(isLoadOrStore(Src) && "instruction is not load or store");
assert(isLoadOrStore(Dst) && "instruction is not load or store");		assert(isLoadOrStore(Dst) && "instruction is not load or store");
Value *SrcPtr = getLoadStorePointerOperand(Src);		Value *SrcPtr = getLoadStorePointerOperand(Src);
Value *DstPtr = getLoadStorePointerOperand(Dst);		Value *DstPtr = getLoadStorePointerOperand(Dst);

switch (underlyingObjectsAlias(AA, F->getParent()->getDataLayout(), DstPtr,		switch (underlyingObjectsAlias(AA, F->getParent()->getDataLayout(),
SrcPtr)) {		MemoryLocation::get(Dst),
		MemoryLocation::get(Src))) {
case MayAlias:		case MayAlias:
case PartialAlias:		case PartialAlias:
// cannot analyse objects if we don't understand their aliasing.		// cannot analyse objects if we don't understand their aliasing.
DEBUG(dbgs() << "can't analyze may or partial alias\n");		DEBUG(dbgs() << "can't analyze may or partial alias\n");
return make_unique<Dependence>(Src, Dst);		return make_unique<Dependence>(Src, Dst);
case NoAlias:		case NoAlias:
// If the objects noalias, they are distinct, accesses are independent.		// If the objects noalias, they are distinct, accesses are independent.
DEBUG(dbgs() << "no alias\n");		DEBUG(dbgs() << "no alias\n");
▲ Show 20 Lines • Show All 399 Lines • ▼ Show 20 Lines	const SCEV *DependenceInfo::getSplitIteration(const Dependence &Dep,
Instruction *Src = Dep.getSrc();		Instruction *Src = Dep.getSrc();
Instruction *Dst = Dep.getDst();		Instruction *Dst = Dep.getDst();
assert(Src->mayReadFromMemory() \|\| Src->mayWriteToMemory());		assert(Src->mayReadFromMemory() \|\| Src->mayWriteToMemory());
assert(Dst->mayReadFromMemory() \|\| Dst->mayWriteToMemory());		assert(Dst->mayReadFromMemory() \|\| Dst->mayWriteToMemory());
assert(isLoadOrStore(Src));		assert(isLoadOrStore(Src));
assert(isLoadOrStore(Dst));		assert(isLoadOrStore(Dst));
Value *SrcPtr = getLoadStorePointerOperand(Src);		Value *SrcPtr = getLoadStorePointerOperand(Src);
Value *DstPtr = getLoadStorePointerOperand(Dst);		Value *DstPtr = getLoadStorePointerOperand(Dst);
assert(underlyingObjectsAlias(AA, F->getParent()->getDataLayout(), DstPtr,		assert(underlyingObjectsAlias(AA, F->getParent()->getDataLayout(),
SrcPtr) == MustAlias);		MemoryLocation::get(Dst),
		MemoryLocation::get(Src)) == MustAlias);

// establish loop nesting levels		// establish loop nesting levels
establishNestingLevels(Src, Dst);		establishNestingLevels(Src, Dst);

FullDependence Result(Src, Dst, false, CommonLevels);		FullDependence Result(Src, Dst, false, CommonLevels);

unsigned Pairs = 1;		unsigned Pairs = 1;
SmallVector<Subscript, 2> Pair(Pairs);		SmallVector<Subscript, 2> Pair(Pairs);
▲ Show 20 Lines • Show All 155 Lines • Show Last 20 Lines

test/Analysis/DependenceAnalysis/AA.ll

This file was added.

				; RUN: opt < %s -analyze -basicaa -tbaa -da \| FileCheck %s

				; CHECK-LABEL: 'Dependence Analysis' for function 'test_no_noalias'
				; CHECK: da analyze - none!
				; CHECK: da analyze - confused!
				; CHECK: da analyze - none!
				define void @test_no_noalias(i32* %A, i32* %B) {
				store i32 1, i32* %A
				store i32 2, i32* %B
				ret void
				}

				; CHECK-LABEL: test_one_noalias
				; CHECK: da analyze - none!
				; CHECK: da analyze - none!
				; CHECK: da analyze - none!
				define void @test_one_noalias(i32* noalias %A, i32* %B) {
				store i32 1, i32* %A
				store i32 2, i32* %B
				ret void
				}

				; CHECK-LABEL: test_two_noalias
				; CHECK: da analyze - none!
				; CHECK: da analyze - none!
				; CHECK: da analyze - none!
				define void @test_two_noalias(i32* noalias %A, i32* noalias %B) {
				store i32 1, i32* %A
				store i32 2, i32* %B
				ret void
				}

				; CHECK-LABEL: test_global_alias
				; CHECK: da analyze - none!
				; CHECK: da analyze - confused!
				; CHECK: da analyze - none!
				@g = global i32 5
				define void @test_global_alias(i32* %A) {
				store i32 1, i32* %A
				store i32 2, i32* @g
				ret void
				}

				; CHECK-LABEL: test_global_noalias
				; CHECK: da analyze - none!
				; CHECK: da analyze - none!
				; CHECK: da analyze - none!
				define void @test_global_noalias(i32* noalias %A) {
				store i32 1, i32* %A
				store i32 2, i32* @g
				ret void
				}

				; CHECK-LABEL: test_global_size
				; CHECK: da analyze - none!
				; CHECK: da analyze - confused!
				; CHECK: da analyze - none!
				; CHECK: da analyze - none!
				; CHECK: da analyze - confused!
				; CHECK: da analyze - none!

				@a = global i16 5, align 2
				@b = global i16* @a, align 4
				define void @test_global_size() {
				%l0 = load i16, i16* @b, align 4
				%l1 = load i16, i16* %l0, align 2
				store i16 1, i16* @a, align 2
				ret void
				}

				; CHECK-LABEL: test_tbaa_same
				; CHECK: da analyze - none!
				; CHECK: da analyze - confused!
				; CHECK: da analyze - none!
				define void @test_tbaa_same(i32* %A, i32* %B) {
				store i32 1, i32* %A, !tbaa !5
				store i32 2, i32* %B, !tbaa !5
				ret void
				}

				; CHECK-LABEL: test_tbaa_diff
				; CHECK: da analyze - none!
				; CHECK: da analyze - none!
				; CHECK: da analyze - none!
				define void @test_tbaa_diff(i32* %A, i16* %B) {
				store i32 1, i32* %A, !tbaa !5
				store i16 2, i16* %B, !tbaa !9
				ret void
				}

				; CHECK-LABEL: tbaa_loop
				; CHECK: da analyze - input
				; CHECK: da analyze - none
				; CHECK: da analyze - output
				define void @tbaa_loop(i32 %I, i32 %J, i32* nocapture %A, i16* nocapture readonly %B) {
				entry:
				%cmp = icmp ne i32 %J, 0
				%cmp122 = icmp ne i32 %I, 0
				%or.cond = and i1 %cmp, %cmp122
				br i1 %or.cond, label %for.outer.preheader, label %for.end

				for.outer.preheader:
				br label %for.outer

				for.outer:
				%i.us = phi i32 [ %add8.us, %for.latch ], [ 0, %for.outer.preheader ]
				br label %for.inner

				for.inner:
				%j.us = phi i32 [ 0, %for.outer ], [ %inc.us, %for.inner ]
				%sum1.us = phi i32 [ 0, %for.outer ], [ %add.us, %for.inner ]
				%arrayidx.us = getelementptr inbounds i16, i16* %B, i32 %j.us
				%0 = load i16, i16* %arrayidx.us, align 4, !tbaa !9
				%sext = sext i16 %0 to i32
				%add.us = add i32 %sext, %sum1.us
				%inc.us = add nuw i32 %j.us, 1
				%exitcond = icmp eq i32 %inc.us, %J
				br i1 %exitcond, label %for.latch, label %for.inner

				for.latch:
				%add.us.lcssa = phi i32 [ %add.us, %for.inner ]
				%arrayidx6.us = getelementptr inbounds i32, i32* %A, i32 %i.us
				store i32 %add.us.lcssa, i32* %arrayidx6.us, align 4, !tbaa !5
				%add8.us = add nuw i32 %i.us, 1
				%exitcond25 = icmp eq i32 %add8.us, %I
				br i1 %exitcond25, label %for.end.loopexit, label %for.outer

				for.end.loopexit:
				br label %for.end

				for.end:
				ret void
				}

				!5 = !{!6, !6, i64 0}
				!6 = !{!"int", !7, i64 0}
				!7 = !{!"omnipotent char", !8, i64 0}
				!8 = !{!"Simple C/C++ TBAA"}
				!9 = !{!10, !10, i64 0}
				!10 = !{!"short", !7, i64 0}

This is an archive of the discontinued LLVM Phabricator instance.

[DA] Correct size parameter from dependency analysis to AA
ClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 141548

lib/Analysis/DependenceAnalysis.cpp

test/Analysis/DependenceAnalysis/AA.ll

This is an archive of the discontinued LLVM Phabricator instance.

[DA] Correct size parameter from dependency analysis to AAClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 141548

lib/Analysis/DependenceAnalysis.cpp

test/Analysis/DependenceAnalysis/AA.ll

[DA] Correct size parameter from dependency analysis to AA
ClosedPublic