This is an archive of the discontinued LLVM Phabricator instance.

lib/Analysis/IPA/GlobalsModRef.cpp
654 ↗	(On Diff #32240)	It seems like we could make this more general in two ways: (1) call GetUnderlyingObjects and check that all of the results are GlobalValues; (2) also check that the underlying objects are not arguments or call/invoke return values. If I'm reading this code correctly, one possible way of handling this is just to do: Inputs.push_back(LI->getPointerOperand()); continue; which seems to accomplish both things and keeps an integrated Depth constraint.

chandlerc added inline comments. Aug 16 2015, 3:38 PM

lib/Analysis/IPA/GlobalsModRef.cpp
654 ↗	(On Diff #32240)	You're saying a pointer loaded from an escaping pointer is itself escaping? I think I agree, but it makes this surprisingly more powerful. This will chase a chain of loads until it hits control flow or a known underlying object. Nifty, but maybe surprising. However, unless I'm mistaken, we still need to call GetUnderlyingObject? The worklist doesn't do that. I'm a bit scared about using GetUnderlyingObjects making this significantly more slow...

hfinkel added inline comments. Aug 16 2015, 4:24 PM

lib/Analysis/IPA/GlobalsModRef.cpp
654 ↗	(On Diff #32240)	Responses below, but actually, it seems that we can do even better than this: a non-addr-taken global does not have its address stored anywhere, ever. Thus, if we get a pointer from a load, it cannot be a non-addr-taken global (except for those in AllocsForIndirectGlobals). No?

"You're saying a pointer loaded from an escaping pointer is itself escaping?"
Yes, I think that's correct.

"I think I agree, but it makes this surprisingly more powerful. This will chase a chain of loads until it hits control flow or a known underlying object. Nifty, but maybe surprising."
With a depth limit of 4, it is not going to do very much chasing ;) - but, yes, that's the idea.

"However, unless I'm mistaken, we still need to call GetUnderlyingObject? The worklist doesn't do that."
Ah, yes, correct. The existing worklist does not look through GEPs. Should it?

"I'm a bit scared about using GetUnderlyingObjects making this significantly more slow..."
Indeed; this was my motivation for suggesting using the existing worklist (to get the unified depth limit).

Thanks for the idea, Hal.
But, unless I'm missing something, this doesn't quite work, because the base condition is different.

Let's say you have:
%v = load i32, i32* @g1

isNonEscapingGlobalNoAlias(@g1, @g1) should return false, but isNonEscapingGlobalNoAlias(@g1, %v) should return true.
So we can't just push @g1 onto the worklist.
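
The difference in base conditions can be sketched outside LLVM with a toy model. Everything below (the `Kind` enum, the `Value` struct, `toyNoAlias`, and the `pushLoadOperand` switch) is hypothetical and only imitates the shape of the worklist in isNonEscapingGlobalNoAlias(); it is not the real implementation.

```cpp
#include <vector>

// Hypothetical stand-ins for LLVM values; not the real LLVM classes.
enum class Kind { Global, Argument, Load };

struct Value {
  Kind kind;
  const Value *operand; // pointer operand of a Load, otherwise unused
  Value(Kind k, const Value *op = nullptr) : kind(k), operand(op) {}
};

// Toy query: can V be shown not to alias the non-escaping global GV?
// pushLoadOperand selects the variant proposed in the review, where the
// load's pointer operand is pushed back onto the worklist.
bool toyNoAlias(const Value *GV, const Value *V, bool pushLoadOperand) {
  std::vector<const Value *> inputs;
  inputs.push_back(V);
  while (!inputs.empty()) {
    const Value *input = inputs.back();
    inputs.pop_back();
    if (input->kind == Kind::Global) {
      if (input == GV)
        return false; // the global itself obviously may alias itself
      continue;       // a different global: treated as no alias in this toy
    }
    if (input->kind == Kind::Argument)
      continue; // arguments are escaping, so they cannot be GV
    if (input->kind == Kind::Load) {
      if (pushLoadOperand) {
        inputs.push_back(input->operand); // re-examines @g1 as an input
        continue;
      }
      if (input->operand->kind == Kind::Global)
        continue; // value loaded from a global: cannot point at GV
      return false; // a load from anywhere else: bail
    }
  }
  return true;
}
```

With `g1` modelled as a `Global` and `v` as a `Load` whose operand is `g1`, the as-is check answers "no alias" for `(g1, v)`, while the worklist-push variant ends up evaluating `(g1, g1)` and answers "may alias". That is the mismatch described above.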

Let's start with the patch as-is. I'm sure that this at least is correct and unlikely to be problematic for compile time, while it is sufficient to solve real world performance problems.

We can revisit other designs as follow-up patches.

Michael, feel free to submit, and we can keep poking at what the exact right model here is.

This revision is now accepted and ready to land. Aug 17 2015, 1:30 AM

In D12064#225516, @mkuper wrote:

Thanks for the idea, Hal.
But, unless I'm missing something, this doesn't quite work, because the base condition is different.

Let's say you have:
%v = load i32, i32* @g1

isNonEscapingGlobalNoAlias(@g1, @g1) should return false, but isNonEscapingGlobalNoAlias(@g1, %v) should return true.
So we can't just push @g1 onto the worklist.

Good point.

Closed by commit rL245207: [GMR] isNonEscapingGlobalNoAlias() should look through Bitcasts/GEPs when… (authored by mkuper). Aug 17 2015, 3:07 AM

This revision was automatically updated to reflect the committed changes.

Revision Contents

Path	Size
llvm/trunk/lib/Analysis/IPA/GlobalsModRef.cpp	2 lines
llvm/trunk/test/Analysis/GlobalsModRef/nonescaping-noalias.ll	3 lines

Diff 32281

llvm/trunk/lib/Analysis/IPA/GlobalsModRef.cpp

     if (isa<Argument>(Input) || isa<CallInst>(Input) ||
       // Arguments to functions or returns from functions are inherently
       // escaping, so we can immediately classify those as not aliasing any
       // non-addr-taken globals.
       continue;
     }
     if (auto *LI = dyn_cast<LoadInst>(Input)) {
       // A pointer loaded from a global would have been captured, and we know
       // that the global is non-escaping, so no alias.
-      if (isa<GlobalValue>(LI->getPointerOperand()))
+      if (isa<GlobalValue>(GetUnderlyingObject(LI->getPointerOperand(), *DL)))
         continue;

       // Otherwise, a load could come from anywhere, so bail.
       return false;
     }

     // Recurse through a limited number of selects and PHIs. This is an
     // arbitrary depth of 4, lower numbers could be used to fix compile time
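
The effect of wrapping the load's pointer operand in GetUnderlyingObject can be illustrated with a small standalone model. The `Node` type, `toyUnderlyingObject`, `loadedFromGlobal`, and the lookup limit of 6 are assumptions made for this sketch; they only imitate the idea of stripping GEPs and bitcasts and are not LLVM's API.

```cpp
// Hypothetical stand-ins for LLVM values; not the real LLVM classes.
enum class NodeKind { Global, GEP, Bitcast, Load };

struct Node {
  NodeKind kind;
  const Node *operand; // base pointer of a GEP/Bitcast, pointer operand of a Load
  Node(NodeKind k, const Node *op = nullptr) : kind(k), operand(op) {}
};

// Strip GEPs and bitcasts, giving up after maxLookup steps (illustrative limit).
const Node *toyUnderlyingObject(const Node *V, unsigned maxLookup = 6) {
  for (unsigned i = 0; i < maxLookup; ++i) {
    if (V->kind != NodeKind::GEP && V->kind != NodeKind::Bitcast)
      return V;
    V = V->operand;
  }
  return V;
}

// Old check: the pointer operand must itself be a global.
// New check: the pointer operand's underlying object must be a global.
bool loadedFromGlobal(const Node *load, bool lookThrough) {
  const Node *ptr = load->operand;
  if (lookThrough)
    ptr = toyUnderlyingObject(ptr);
  return ptr->kind == NodeKind::Global;
}
```

A load whose address is a GEP into a global, as in the updated test below, is recognised only when `lookThrough` is true; a load directly from a global is recognised either way.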

llvm/trunk/test/Analysis/GlobalsModRef/nonescaping-noalias.ll

 entry:
   store i32 42, i32* @g1
   %ptr1 = load i32*, i32** @g2
   store i32 7, i32* %ptr1
   %v = load i32, i32* @g1
   ret i32 %v
 }

 @g3 = internal global i32 1
+@g4 = internal global [10 x i32*] zeroinitializer

 define i32 @test4(i32* %param, i32 %n, i1 %c1, i1 %c2, i1 %c3) {
 ; Ensure that we can fold a store to a load of a global across a store to
 ; the pointer loaded from that global even when the load is behind PHIs and
 ; selects, and there is a mixture of a load and another global or argument.
 ; Note that we can't eliminate the load here because it is used in a PHI and
 ; GVN doesn't try to do real DCE. The store is still forwarded by GVN though.
 ;
 ; CHECK-LABEL: @test4(
 ; CHECK: store i32 42, i32* @g1
 ; CHECK: store i32 7, i32*
 ; CHECK: ret i32 42
 entry:
   %call = call i32* @f()
   store i32 42, i32* @g1
   %ptr1 = load i32*, i32** @g2
   %ptr2 = select i1 %c1, i32* %ptr1, i32* %param
   %ptr3 = select i1 %c3, i32* %ptr2, i32* @g3
   br label %loop

 loop:
   %iv = phi i32 [ 0, %entry ], [ %inc, %loop ]
   %ptr = phi i32* [ %ptr3, %entry ], [ %ptr5, %loop ]
   store i32 7, i32* %ptr
-  %ptr4 = load i32*, i32** @g2
+  %ptr4 = load i32*, i32** getelementptr ([10 x i32*], [10 x i32*]* @g4, i32 0, i32 1)
   %ptr5 = select i1 %c2, i32* %ptr4, i32* %call
   %inc = add i32 %iv, 1
   %test = icmp slt i32 %inc, %n
   br i1 %test, label %loop, label %exit

 exit:
   %v = load i32, i32* @g1
   ret i32 %v
 }

[GMR] isNonEscapingGlobalNoAlias() should look through Bitcasts/GEPs when looking at loads. Closed, Public
