This is an archive of the discontinued LLVM Phabricator instance.

Differential D32006

Mark invariant.group.barrier as inaccessiblememonly
Abandoned, Public

Authored by Prazek on Apr 13 2017, 4:38 AM.

Details

Reviewers

chandlerc
sanjoy
hfinkel
nlewycky
mehdi_amini
kuhar
Prazek

Summary

It turned out that readonly argmemonly is not enough.

store 42, %p
%b = barrier(%p)
store 43, %b

the first store is dead, but because barrier was marked as
reading argument memory, it was considered alive. With
inaccessiblememonly it doesn't read the argument, but
it also can't be CSEd.
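
The dead-store argument above can be sketched with a toy C++ model. This is not LLVM code: `Kind`, `Inst`, and `is_dead_store` are hypothetical names, and the model assumes a single straight-line block in which every instruction refers to the same memory location.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical toy IR: every instruction touches the same location.
enum class Kind { Store, Load, Call };

struct Inst {
    Kind kind;
    bool call_reads_pointee;  // only meaningful when kind == Kind::Call
};

// Returns true if the store at index i is overwritten by a later store
// before anything may read the stored value.
bool is_dead_store(const std::vector<Inst>& block, std::size_t i) {
    for (std::size_t j = i + 1; j < block.size(); ++j) {
        const Inst& in = block[j];
        if (in.kind == Kind::Load) return false;  // value is observed
        if (in.kind == Kind::Call && in.call_reads_pointee) return false;
        if (in.kind == Kind::Store) return true;  // overwritten unread
    }
    return false;  // reached the end of the block: conservatively live
}
```

With a barrier modelled as a call that reads its argument (readonly argmemonly), the first store in `store; barrier; store` stays live. With a barrier that does not read the argument, the same store is reported dead.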

Diff Detail

Repository

rL LLVM

Build Status

Buildable 17596
Build 17596: arc lint + arc unit

Event Timeline

Prazek created this revision. Apr 13 2017, 4:38 AM

Prazek added a child revision: D31581: perform DSE through invariant.group.barrier. Apr 13 2017, 4:39 AM

Prazek set the repository for this revision to rL LLVM. Apr 16 2017, 1:38 PM

ping

Prazek added a reviewer: nlewycky. Apr 24 2017, 3:43 AM

Doesn't this conflict with the test case you had for D31531:

void foo() {
    Base *x = new Base{};
    new (x) Derived1{};
    int a = std::launder(x)->foo();
    new (x) Derived2{};
    int b = std::launder(x)->foo();
}

Won't it now be OK to CSE the two launders since the only memory changed between the two is visible to the module (and thus not read by an inaccessiblememonly readonly function)?
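
A self-contained version of the test case above, as a sketch. The class definitions, the explicit storage handling, and the names `Calls` and `launder_twice` are assumptions added for illustration. The sketch also assumes the common ABI behaviour that the `Base` subobject of each derived class sits at the start of the object. It requires C++17 for `std::launder`.

```cpp
#include <cassert>
#include <new>

struct Base {
    virtual ~Base() = default;
    virtual int foo() { return 0; }
};
struct Derived1 : Base { int foo() override { return 1; } };
struct Derived2 : Base { int foo() override { return 2; } };

// The same storage is reused for all three types.
static_assert(sizeof(Derived1) == sizeof(Base), "storage reuse needs equal size");
static_assert(sizeof(Derived2) == sizeof(Base), "storage reuse needs equal size");

struct Calls { int a; int b; };

Calls launder_twice() {
    void *mem = ::operator new(sizeof(Base));
    Base *x = new (mem) Base{};

    new (x) Derived1{};              // dynamic type is now Derived1
    int a = std::launder(x)->foo();

    new (x) Derived2{};              // dynamic type is now Derived2
    int b = std::launder(x)->foo();

    std::launder(x)->~Base();        // virtual, destroys the Derived2 object
    ::operator delete(mem);
    return {a, b};
}
```

If the two `std::launder` calls were merged into one, the second virtual call could be resolved through the first object's vtable, which is the miscompile this comment is worried about.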

This revision now requires changes to proceed. Apr 25 2017, 12:57 AM

In D32006#736448, @sanjoy wrote:
Doesn't this conflict with the test case you had for D31531:
void foo() {
    Base *x = new Base{};
    new (x) Derived1{};
    int a = std::launder(x)->foo();
    new (x) Derived2{};
    int b = std::launder(x)->foo();
}
Won't it now be OK to CSE the two launders since the only memory changed between the two is visible to the module (and thus not read by an inaccessiblememonly readonly function)?

I have to double check that. If this is the case then we can probably remove readonly from the barrier. This way we won't be able to CSE barriers together, but we will be able to DSE through the barrier.

Yes, you are right. I missed that in the tests. I will change it to have only inaccessiblememonly. It would be good to figure out a way to CSE two barriers when there is no write to memory between them, because I don't see how that
is possible with the current set of attributes. Fortunately it is not that important; we should not miss any important optimizations without it.

Add nocapture

Because the barrier is now considered to be writing memory, I marked its argument as nocapture. I am not sure if that is correct, since the barrier returns its pointer. Does it mean that the pointer outlives the barrier?

Remove nocapture

ping

hfinkel added inline comments. May 7 2017, 2:17 PM

include/llvm/IR/Intrinsics.td
Line 620 (on Diff #97115): You'd need to update the comment.

Fix comment

Is it possible to add "writeonly"? I am not sure if it will help in any way, but the tests seem to work with it.

In D32006#748602, @Prazek wrote:

Is it possible to add "writeonly"? I am not sure if it will help in any way, but the tests seem to work with it.

Yes, I think this makes sense. The model is that there's some side table holding the object type of all memory, and this barrier represents places where we might be updating that table to assign a different type. Right?

In D32006#748700, @hfinkel wrote:

In D32006#748602, @Prazek wrote:

Is it possible to add "writeonly"? I am not sure if it will help in any way, but the tests seem to work with it.

Yes, I think this makes sense. The model is that there's some side table holding the object type of all memory, and this barrier represents places where we might be updating that table to assign a different type. Right?

If that is the model then barrier can't be both writeonly *and* argmemonly, since the side table is disjoint from the object.

In D32006#748700, @hfinkel wrote:

In D32006#748602, @Prazek wrote:

Is it possible to add "writeonly"? I am not sure if it will help in any way, but the tests seem to work with it.

Yes, I think this makes sense. The model is that there's some side table holding the object type of all memory, and this barrier represents places where we might be updating that table to assign a different type. Right?

I am not sure about this because the barrier is inserted before changing the dynamic type, not after.
Even Chandler's model was invalid in that way (the model was that the barrier returns a pointer that aliases the argument, but
it has some special bits that represent the dynamic type).

Maybe we can think about this as returning a pointer aliasing the argument, but with some bits set in a "unique" way, so that two barrier calls on the same pointer
return different values?

I can give a brief summary of my thinking about which attributes we can have on the barrier and which we can't.
Argument:
+ readonly (alone): we can't read the argument because we won't be able to perform DSE

store 42, %p ; dead store
%b = barrier(%p)
store 43, %b

+ writeonly (alone): we can't write through the argument because we won't be able to get values through the barrier:

store 42, %p
%b = barrier(%p)
load %b

+ the only possible way is to have it as readnone (equivalent or stronger)

Function attributes (considering that the argument is readnone):
+ Can't be readonly alone, because it will be possible to remove barrier like:

%b1 = barrier(%p)
store 42, %b1, !invariant.group
%b2 = barrier(%p) ; We could replace it with %b1 because  
; %p didn't escape, and %p aliases with %b1, so
; the barrier would read the same memory as %b1

I haven't checked that with the capture tracking patch https://reviews.llvm.org/D32673,
but if my thinking is correct then one day it could break.

+ Can't be writeonly alone, because if the barrier argument escaped before it,
we would consider the barrier to be changing the value.

+ Can't be inaccessiblememonly & readonly, because we would be able to CSE 2 barriers like:

%ptr2 = call i8* @llvm.invariant.group.barrier(i8* nonnull %ptr)
store i8 43, i8* %ptr2, align 1, !invariant.group !0
%ptr3 = call i8* @llvm.invariant.group.barrier(i8* nonnull %ptr)

Because the store would not change inaccessible memory

+ Can be marked as inaccessiblememonly & writeonly, but
unfortunately we will lose the ability to CSE barriers like:

%ptr2 = call i8* @llvm.invariant.group.barrier(i8* %ptr)
%ptr3 = call i8* @llvm.invariant.group.barrier(i8* %ptr)
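
The function-attribute cases in this summary can be condensed into a toy decision function. This is only a sketch of the reasoning in the comment, not LLVM's actual CSE logic: `Access`, `Location`, `Effects`, and `cse_allowed` are hypothetical names, and "store between" means a store to module-visible memory that may alias the argument.

```cpp
#include <cassert>

// Hypothetical model of an intrinsic's memory effects.
enum class Access { None, ReadOnly, ReadWrite };   // readnone / readonly / may write
enum class Location { ArgMem, InaccessibleMem };   // argmemonly / inaccessiblememonly

struct Effects {
    Access access;
    Location where;  // ignored when access == Access::None
};

// Would an optimizer be allowed to replace the second of two identical
// calls with the first, given whether a store sits between them?
bool cse_allowed(Effects e, bool store_between) {
    switch (e.access) {
    case Access::None:
        return true;   // result depends on the arguments only
    case Access::ReadOnly:
        // A reader is only disturbed by writes to the memory it reads;
        // ordinary stores never touch inaccessible memory.
        return e.where == Location::InaccessibleMem || !store_between;
    case Access::ReadWrite:
        return false;  // each call has its own side effect
    }
    return false;
}
```

Under this model, readnone and inaccessiblememonly & readonly both permit merging two barriers across a store, which the thread identifies as incorrect. Plain inaccessiblememonly never permits it, including the harmless case with no store in between.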

In D32006#748725, @sanjoy wrote:

If that is the model then barrier can't be both writeonly *and* argmemonly, since the side table is disjoint from the object.

I am not sure if I follow, because I changed argmemonly to inaccessiblememonly

ping

ping2

Prazek added a child revision: D33235: Mark invariant.group as experimental. May 21 2017, 1:16 AM

Prazek added a reviewer: mehdi_amini. May 25 2017, 3:09 AM

friendly ping

friendly ping2

friendly ping 3

Rebase. This patch will not be committed, but will be used as a base for new development.

Prazek added a child revision: D45111: Rename invariant.group.barrier to launder.invariant.group. Mar 30 2018, 2:52 PM

LGTM

ping Sanjoy/Richard

LGTM

Based on your document, I think it's correct for this to additionally be marked writeonly. I think the key question here is: is the side-channel information in the notional "fat pointer" established at the point where the barrier is executed, or at the first !invariant.group load through the pointer after the barrier? If it's at the point of the barrier, then the barrier notionally performs a read through its argument and shouldn't be writeonly, but your document says that the intrinsic merely establishes a new invariant group, and that only future !invariant.group loads in that group actually determine the invariant value. As well as being more optimizable, this also seems like a nicer model for frontend code generation.

Herald added a subscriber: george.burgess.iv. Apr 30 2018, 1:56 PM

I don't see how writeonly would help us, and I am worried that in a case like:

%a = launder(%p)
%b = launder(%p)

%a could potentially be optimized away even if there were a load of %a in between (because of inaccessiblememonly). So I will leave it like this for now.

In D32006#1084474, @Prazek wrote:

I don't see how writeonly would help us, and I am worried that in a case like:

%a = launder(%p)
%b = launder(%p)

%a could potentially be optimized away even if there were a load of %a in between (because of inaccessiblememonly). So I will leave it like this for now.

I agree (and, in fact, @homerdin is working on that optimization, so this likely won't be theoretical for very long).

Thanks, pushed to master. Trying to close the revision right now.

pushed

Harbormaster completed remote builds in B17596: Diff 144838. May 2 2018, 1:46 AM

Can't close it somehow.

Revision Contents

Path	Size
llvm/include/llvm/IR/Intrinsics.td	11 lines
llvm/test/Analysis/MemorySSA/invariant-groups.ll	141 lines
llvm/test/Other/invariant.group.barrier.ll	29 lines

Diff 144838

llvm/include/llvm/IR/Intrinsics.td

def int_invariant_start : Intrinsic<[llvm_descriptor_ty],		def int_invariant_start : Intrinsic<[llvm_descriptor_ty],
[IntrArgMemOnly, NoCapture<1>]>;		[IntrArgMemOnly, NoCapture<1>]>;
def int_invariant_end : Intrinsic<[],		def int_invariant_end : Intrinsic<[],
[llvm_descriptor_ty, llvm_i64_ty,		[llvm_descriptor_ty, llvm_i64_ty,
llvm_anyptr_ty],		llvm_anyptr_ty],
[IntrArgMemOnly, NoCapture<2>]>;		[IntrArgMemOnly, NoCapture<2>]>;

// invariant.group.barrier can't be marked with 'readnone' (IntrNoMem),		// invariant.group.barrier can't be marked with 'readnone' (IntrNoMem),
// because it would cause CSE of two barriers with the same argument.		// because it would cause CSE of two barriers with the same argument.
// Readonly and argmemonly says that barrier only reads its argument and		// Inaccessiblememonly says that the barrier doesn't read the argument,
// it can be CSE only if memory didn't change between 2 barriers call,		// but it changes state not accessible to this module. This way
// which is valid.		// we can DSE through the barrier because it doesn't read the value
		// after store. Although the barrier doesn't modify any memory it
		// can't be marked as readonly, because it would be possible to
		// CSE 2 barriers with store in between.
// The argument also can't be marked with 'returned' attribute, because		// The argument also can't be marked with 'returned' attribute, because
// it would remove barrier.		// it would remove barrier.
// Note that it is still experimental, which means that its semantics		// Note that it is still experimental, which means that its semantics
// might change in the future.		// might change in the future.
def int_invariant_group_barrier : Intrinsic<[llvm_anyptr_ty],		def int_invariant_group_barrier : Intrinsic<[llvm_anyptr_ty],
[LLVMMatchType<0>],		[LLVMMatchType<0>],
[IntrReadMem, IntrArgMemOnly]>;		[IntrInaccessibleMemOnly]>;

//===------------------------ Stackmap Intrinsics -------------------------===//		//===------------------------ Stackmap Intrinsics -------------------------===//
//		//
def int_experimental_stackmap : Intrinsic<[],		def int_experimental_stackmap : Intrinsic<[],
[llvm_i64_ty, llvm_i32_ty, llvm_vararg_ty],		[llvm_i64_ty, llvm_i32_ty, llvm_vararg_ty],
[Throws]>;		[Throws]>;
def int_experimental_patchpoint_void : Intrinsic<[],		def int_experimental_patchpoint_void : Intrinsic<[],
[llvm_i64_ty, llvm_i32_ty,		[llvm_i64_ty, llvm_i32_ty,

llvm/test/Analysis/MemorySSA/invariant-groups.ll

	Show All 10 Lines
	; CHECK-NEXT: store i32 0			; CHECK-NEXT: store i32 0
	store i32 0, i32* %a, align 4, !invariant.group !0			store i32 0, i32* %a, align 4, !invariant.group !0

	; CHECK: 2 = MemoryDef(1)			; CHECK: 2 = MemoryDef(1)
	; CHECK-NEXT: store i32 1			; CHECK-NEXT: store i32 1
	store i32 1, i32* @g, align 4			store i32 1, i32* @g, align 4

	%1 = bitcast i32* %a to i8*			%1 = bitcast i32* %a to i8*
	; CHECK: MemoryUse(2)			; CHECK: 3 = MemoryDef(2)
	; CHECK-NEXT: %a8 = call i8* @llvm.invariant.group.barrier.p0i8(i8* %1)			; CHECK-NEXT: %a8 = call i8* @llvm.invariant.group.barrier.p0i8(i8* %1)
	%a8 = call i8* @llvm.invariant.group.barrier.p0i8(i8* %1)			%a8 = call i8* @llvm.invariant.group.barrier.p0i8(i8* %1)
	%a32 = bitcast i8* %a8 to i32*			%a32 = bitcast i8* %a8 to i32*

	; This have to be MemoryUse(2), because we can't skip the barrier based on			; This have to be MemoryUse(2), because we can't skip the barrier based on
	; invariant.group.			; invariant.group.
	; CHECK: MemoryUse(2)			; CHECK: MemoryUse(2)
	; CHECK-NEXT: %2 = load i32			; CHECK-NEXT: %2 = load i32
	%2 = load i32, i32* %a32, align 4, !invariant.group !0			%2 = load i32, i32* %a32, align 4, !invariant.group !0
	ret i32 %2			ret i32 %2
	}			}

	define i32 @skipBarrier(i32* %a) {			define i32 @skipBarrier(i32* %a) {
	; CHECK: 1 = MemoryDef(liveOnEntry)			; CHECK: 1 = MemoryDef(liveOnEntry)
	; CHECK-NEXT: store i32 0			; CHECK-NEXT: store i32 0
	store i32 0, i32* %a, align 4, !invariant.group !0			store i32 0, i32* %a, align 4, !invariant.group !0

	%1 = bitcast i32* %a to i8*			%1 = bitcast i32* %a to i8*
	; CHECK: MemoryUse(1)			; CHECK: 2 = MemoryDef(1)
	; CHECK-NEXT: %a8 = call i8* @llvm.invariant.group.barrier.p0i8(i8* %1)			; CHECK-NEXT: %a8 = call i8* @llvm.invariant.group.barrier.p0i8(i8* %1)
	%a8 = call i8* @llvm.invariant.group.barrier.p0i8(i8* %1)			%a8 = call i8* @llvm.invariant.group.barrier.p0i8(i8* %1)
	%a32 = bitcast i8* %a8 to i32*			%a32 = bitcast i8* %a8 to i32*

	; We can skip the barrier only if the "skip" is not based on !invariant.group.			; We can skip the barrier only if the "skip" is not based on !invariant.group.
	; CHECK: MemoryUse(1)			; CHECK: MemoryUse(1)
	; CHECK-NEXT: %2 = load i32			; CHECK-NEXT: %2 = load i32
	%2 = load i32, i32* %a32, align 4, !invariant.group !0			%2 = load i32, i32* %a32, align 4, !invariant.group !0
	ret i32 %2			ret i32 %2
	}			}

	define i32 @skipBarrier2(i32* %a) {			define i32 @skipBarrier2(i32* %a) {

	; CHECK: MemoryUse(liveOnEntry)			; CHECK: MemoryUse(liveOnEntry)
	; CHECK-NEXT: %v = load i32			; CHECK-NEXT: %v = load i32
	%v = load i32, i32* %a, align 4, !invariant.group !0			%v = load i32, i32* %a, align 4, !invariant.group !0

	%1 = bitcast i32* %a to i8*			%1 = bitcast i32* %a to i8*
	; CHECK: MemoryUse(liveOnEntry)			; CHECK: 1 = MemoryDef(liveOnEntry)
	; CHECK-NEXT: %a8 = call i8* @llvm.invariant.group.barrier.p0i8(i8* %1)			; CHECK-NEXT: %a8 = call i8* @llvm.invariant.group.barrier.p0i8(i8* %1)
	%a8 = call i8* @llvm.invariant.group.barrier.p0i8(i8* %1)			%a8 = call i8* @llvm.invariant.group.barrier.p0i8(i8* %1)
	%a32 = bitcast i8* %a8 to i32*			%a32 = bitcast i8* %a8 to i32*

	; We can skip the barrier only if the "skip" is not based on !invariant.group.			; We can skip the barrier only if the "skip" is not based on !invariant.group.
	; CHECK: MemoryUse(liveOnEntry)			; CHECK: MemoryUse(liveOnEntry)
	; CHECK-NEXT: %v2 = load i32			; CHECK-NEXT: %v2 = load i32
	%v2 = load i32, i32* %a32, align 4, !invariant.group !0			%v2 = load i32, i32* %a32, align 4, !invariant.group !0
	; CHECK: 1 = MemoryDef(liveOnEntry)			; CHECK: 2 = MemoryDef(1)
	; CHECK-NEXT: store i32 1			; CHECK-NEXT: store i32 1
	store i32 1, i32* @g, align 4			store i32 1, i32* @g, align 4

	; FIXME: based on invariant.group it should be MemoryUse(liveOnEntry)			; FIXME: based on invariant.group it should be MemoryUse(liveOnEntry)
	; CHECK: MemoryUse(1)			; CHECK: MemoryUse(2)
	; CHECK-NEXT: %v3 = load i32			; CHECK-NEXT: %v3 = load i32
	%v3 = load i32, i32* %a32, align 4, !invariant.group !0			%v3 = load i32, i32* %a32, align 4, !invariant.group !0
	%add = add nsw i32 %v2, %v3			%add = add nsw i32 %v2, %v3
	%add2 = add nsw i32 %add, %v			%add2 = add nsw i32 %add, %v
	ret i32 %add2			ret i32 %add2
	}			}

	define i32 @handleInvariantGroups(i32* %a) {			define i32 @handleInvariantGroups(i32* %a) {
	; CHECK: 1 = MemoryDef(liveOnEntry)			; CHECK: 1 = MemoryDef(liveOnEntry)
	; CHECK-NEXT: store i32 0			; CHECK-NEXT: store i32 0
	store i32 0, i32* %a, align 4, !invariant.group !0			store i32 0, i32* %a, align 4, !invariant.group !0

	; CHECK: 2 = MemoryDef(1)			; CHECK: 2 = MemoryDef(1)
	; CHECK-NEXT: store i32 1			; CHECK-NEXT: store i32 1
	store i32 1, i32* @g, align 4			store i32 1, i32* @g, align 4
	%1 = bitcast i32* %a to i8*			%1 = bitcast i32* %a to i8*
	; CHECK: MemoryUse(2)			; CHECK: 3 = MemoryDef(2)
	; CHECK-NEXT: %a8 = call i8* @llvm.invariant.group.barrier.p0i8(i8* %1)			; CHECK-NEXT: %a8 = call i8* @llvm.invariant.group.barrier.p0i8(i8* %1)
	%a8 = call i8* @llvm.invariant.group.barrier.p0i8(i8* %1)			%a8 = call i8* @llvm.invariant.group.barrier.p0i8(i8* %1)
	%a32 = bitcast i8* %a8 to i32*			%a32 = bitcast i8* %a8 to i32*

	; CHECK: MemoryUse(2)			; CHECK: MemoryUse(2)
	; CHECK-NEXT: %2 = load i32			; CHECK-NEXT: %2 = load i32
	%2 = load i32, i32* %a32, align 4, !invariant.group !0			%2 = load i32, i32* %a32, align 4, !invariant.group !0

	; CHECK: 3 = MemoryDef(2)			; CHECK: 4 = MemoryDef(3)
	; CHECK-NEXT: store i32 2			; CHECK-NEXT: store i32 2
	store i32 2, i32* @g, align 4			store i32 2, i32* @g, align 4

	; FIXME: This can be changed to MemoryUse(2)			; FIXME: This can be changed to MemoryUse(2)
	; CHECK: MemoryUse(3)			; CHECK: MemoryUse(4)
	; CHECK-NEXT: %3 = load i32			; CHECK-NEXT: %3 = load i32
	%3 = load i32, i32* %a32, align 4, !invariant.group !0			%3 = load i32, i32* %a32, align 4, !invariant.group !0
	%add = add nsw i32 %2, %3			%add = add nsw i32 %2, %3
	ret i32 %add			ret i32 %add
	}			}

	define i32 @loop(i1 %a) {			define i32 @loop(i1 %a) {
	entry:			entry:
	Show All 28 Lines
	entry:			entry:
	; CHECK: 1 = MemoryDef(liveOnEntry)			; CHECK: 1 = MemoryDef(liveOnEntry)
	; CHECK-NEXT: store i8			; CHECK-NEXT: store i8
	store i8 4, i8* %p, !invariant.group !0			store i8 4, i8* %p, !invariant.group !0
	; CHECK: 2 = MemoryDef(1)			; CHECK: 2 = MemoryDef(1)
	; CHECK-NEXT: call void @clobber			; CHECK-NEXT: call void @clobber
	call void @clobber8(i8* %p)			call void @clobber8(i8* %p)

	; CHECK: MemoryUse(2)			; CHECK: 3 = MemoryDef(2)
	; CHECK-NEXT: %after = call i8* @llvm.invariant.group.barrier.p0i8(i8* %p)			; CHECK-NEXT: %after = call i8* @llvm.invariant.group.barrier.p0i8(i8* %p)
	%after = call i8* @llvm.invariant.group.barrier.p0i8(i8* %p)			%after = call i8* @llvm.invariant.group.barrier.p0i8(i8* %p)
	br i1 undef, label %Loop.Body, label %Loop.End			br i1 undef, label %Loop.Body, label %Loop.End

	Loop.Body:			Loop.Body:
	; 4 = MemoryPhi({entry,2},{Loop.Body,3},{Loop.End,5})			; 5 = MemoryPhi({entry,3},{Loop.Body,4},{Loop.End,6})
	; CHECK: MemoryUse(4)			; CHECK: MemoryUse(5)
	; CHECK-NEXT: %0 = load i8			; CHECK-NEXT: %0 = load i8
	%0 = load i8, i8* %after, !invariant.group !0			%0 = load i8, i8* %after, !invariant.group !0

	; FIXME: MemoryUse(1)			; FIXME: MemoryUse(1)
	; CHECK: MemoryUse(4)			; CHECK: MemoryUse(5)
	; CHECK-NEXT: %1 = load i8			; CHECK-NEXT: %1 = load i8
	%1 = load i8, i8* %p, !invariant.group !0			%1 = load i8, i8* %p, !invariant.group !0

	; CHECK: 3 = MemoryDef(4)			; CHECK: 4 = MemoryDef(5)
	store i8 4, i8* %after, !invariant.group !0			store i8 4, i8* %after, !invariant.group !0

	br i1 undef, label %Loop.End, label %Loop.Body			br i1 undef, label %Loop.End, label %Loop.Body

	Loop.End:			Loop.End:
	; 5 = MemoryPhi({entry,2},{Loop.Body,3})			; 6 = MemoryPhi({entry,3},{Loop.Body,4})
	; CHECK: MemoryUse(5)			; CHECK: MemoryUse(6)
	; CHECK-NEXT: %2 = load			; CHECK-NEXT: %2 = load
	%2 = load i8, i8* %after, align 4, !invariant.group !0			%2 = load i8, i8* %after, align 4, !invariant.group !0

	; FIXME: MemoryUse(1)			; FIXME: MemoryUse(1)
	; CHECK: MemoryUse(5)			; CHECK: MemoryUse(6)
	; CHECK-NEXT: %3 = load			; CHECK-NEXT: %3 = load
	%3 = load i8, i8* %p, align 4, !invariant.group !0			%3 = load i8, i8* %p, align 4, !invariant.group !0
	br i1 undef, label %Ret, label %Loop.Body			br i1 undef, label %Ret, label %Loop.Body

	Ret:			Ret:
	ret i8 %3			ret i8 %3
	}			}


	define i8 @loop3(i8* %p) {			define i8 @loop3(i8* %p) {
	entry:			entry:
	; CHECK: 1 = MemoryDef(liveOnEntry)			; CHECK: 1 = MemoryDef(liveOnEntry)
	; CHECK-NEXT: store i8			; CHECK-NEXT: store i8
	store i8 4, i8* %p, !invariant.group !0			store i8 4, i8* %p, !invariant.group !0
	; CHECK: 2 = MemoryDef(1)			; CHECK: 2 = MemoryDef(1)
	; CHECK-NEXT: call void @clobber			; CHECK-NEXT: call void @clobber
	call void @clobber8(i8* %p)			call void @clobber8(i8* %p)

	; CHECK: MemoryUse(2)			; CHECK: 3 = MemoryDef(2)
	; CHECK-NEXT: %after = call i8* @llvm.invariant.group.barrier.p0i8(i8* %p)			; CHECK-NEXT: %after = call i8* @llvm.invariant.group.barrier.p0i8(i8* %p)
	%after = call i8* @llvm.invariant.group.barrier.p0i8(i8* %p)			%after = call i8* @llvm.invariant.group.barrier.p0i8(i8* %p)
	br i1 undef, label %Loop.Body, label %Loop.End			br i1 undef, label %Loop.Body, label %Loop.End

	Loop.Body:			Loop.Body:
	; CHECK: 6 = MemoryPhi({entry,2},{Loop.Body,3},{Loop.next,4},{Loop.End,5})			; CHECK: 7 = MemoryPhi({entry,3},{Loop.Body,4},{Loop.next,5},{Loop.End,6})
	; CHECK: MemoryUse(6)			; CHECK: MemoryUse(7)
	; CHECK-NEXT: %0 = load i8			; CHECK-NEXT: %0 = load i8
	%0 = load i8, i8* %after, !invariant.group !0			%0 = load i8, i8* %after, !invariant.group !0

	; CHECK: 3 = MemoryDef(6)			; CHECK: 4 = MemoryDef(7)
	; CHECK-NEXT: call void @clobber8			; CHECK-NEXT: call void @clobber8
	call void @clobber8(i8* %after)			call void @clobber8(i8* %after)

	; FIXME: MemoryUse(6)			; FIXME: MemoryUse(7)
	; CHECK: MemoryUse(3)			; CHECK: MemoryUse(4)
	; CHECK-NEXT: %1 = load i8			; CHECK-NEXT: %1 = load i8
	%1 = load i8, i8* %after, !invariant.group !0			%1 = load i8, i8* %after, !invariant.group !0

	br i1 undef, label %Loop.next, label %Loop.Body			br i1 undef, label %Loop.next, label %Loop.Body
	Loop.next:			Loop.next:
	; CHECK: 4 = MemoryDef(3)			; CHECK: 5 = MemoryDef(4)
	; CHECK-NEXT: call void @clobber8			; CHECK-NEXT: call void @clobber8
	call void @clobber8(i8* %after)			call void @clobber8(i8* %after)

	; FIXME: MemoryUse(6)			; FIXME: MemoryUse(7)
	; CHECK: MemoryUse(4)			; CHECK: MemoryUse(5)
	; CHECK-NEXT: %2 = load i8			; CHECK-NEXT: %2 = load i8
	%2 = load i8, i8* %after, !invariant.group !0			%2 = load i8, i8* %after, !invariant.group !0

	br i1 undef, label %Loop.End, label %Loop.Body			br i1 undef, label %Loop.End, label %Loop.Body

	Loop.End:			Loop.End:
	; CHECK: 7 = MemoryPhi({entry,2},{Loop.next,4})			; CHECK: 8 = MemoryPhi({entry,3},{Loop.next,5})
	; CHECK: MemoryUse(7)			; CHECK: MemoryUse(8)
	; CHECK-NEXT: %3 = load			; CHECK-NEXT: %3 = load
	%3 = load i8, i8* %after, align 4, !invariant.group !0			%3 = load i8, i8* %after, align 4, !invariant.group !0

	; CHECK: 5 = MemoryDef(7)			; CHECK: 6 = MemoryDef(8)
	; CHECK-NEXT: call void @clobber8			; CHECK-NEXT: call void @clobber8
	call void @clobber8(i8* %after)			call void @clobber8(i8* %after)

	; FIXME: MemoryUse(7)			; FIXME: MemoryUse(8)
	; CHECK: MemoryUse(5)			; CHECK: MemoryUse(6)
	; CHECK-NEXT: %4 = load			; CHECK-NEXT: %4 = load
	%4 = load i8, i8* %after, align 4, !invariant.group !0			%4 = load i8, i8* %after, align 4, !invariant.group !0
	br i1 undef, label %Ret, label %Loop.Body			br i1 undef, label %Ret, label %Loop.Body

	Ret:			Ret:
	ret i8 %3			ret i8 %3
	}			}

	define i8 @loop4(i8* %p) {			define i8 @loop4(i8* %p) {
	entry:			entry:
	; CHECK: 1 = MemoryDef(liveOnEntry)			; CHECK: 1 = MemoryDef(liveOnEntry)
	; CHECK-NEXT: store i8			; CHECK-NEXT: store i8
	store i8 4, i8* %p, !invariant.group !0			store i8 4, i8* %p, !invariant.group !0
	; CHECK: 2 = MemoryDef(1)			; CHECK: 2 = MemoryDef(1)
	; CHECK-NEXT: call void @clobber			; CHECK-NEXT: call void @clobber
	call void @clobber8(i8* %p)			call void @clobber8(i8* %p)
	; CHECK: MemoryUse(2)			; CHECK: 3 = MemoryDef(2)
	; CHECK-NEXT: %after = call i8* @llvm.invariant.group.barrier.p0i8(i8* %p)			; CHECK-NEXT: %after = call i8* @llvm.invariant.group.barrier.p0i8(i8* %p)
	%after = call i8* @llvm.invariant.group.barrier.p0i8(i8* %p)			%after = call i8* @llvm.invariant.group.barrier.p0i8(i8* %p)
	br i1 undef, label %Loop.Pre, label %Loop.End			br i1 undef, label %Loop.Pre, label %Loop.End

	Loop.Pre:			Loop.Pre:
	; CHECK: MemoryUse(2)			; CHECK: MemoryUse(2)
	; CHECK-NEXT: %0 = load i8			; CHECK-NEXT: %0 = load i8
	%0 = load i8, i8* %after, !invariant.group !0			%0 = load i8, i8* %after, !invariant.group !0
	br label %Loop.Body			br label %Loop.Body
	Loop.Body:			Loop.Body:
	; CHECK: 4 = MemoryPhi({Loop.Pre,2},{Loop.Body,3},{Loop.End,5})			; CHECK: 5 = MemoryPhi({Loop.Pre,3},{Loop.Body,4},{Loop.End,6})
	; CHECK-NEXT: MemoryUse(4)			; CHECK-NEXT: MemoryUse(5)
	; CHECK-NEXT: %1 = load i8			; CHECK-NEXT: %1 = load i8
	%1 = load i8, i8* %after, !invariant.group !0			%1 = load i8, i8* %after, !invariant.group !0

	; FIXME: MemoryUse(2)			; FIXME: MemoryUse(2)
	; CHECK: MemoryUse(4)			; CHECK: MemoryUse(5)
	; CHECK-NEXT: %2 = load i8			; CHECK-NEXT: %2 = load i8
	%2 = load i8, i8* %p, !invariant.group !0			%2 = load i8, i8* %p, !invariant.group !0

	; CHECK: 3 = MemoryDef(4)			; CHECK: 4 = MemoryDef(5)
	store i8 4, i8* %after, !invariant.group !0			store i8 4, i8* %after, !invariant.group !0
	br i1 undef, label %Loop.End, label %Loop.Body			br i1 undef, label %Loop.End, label %Loop.Body

	Loop.End:			Loop.End:
	; CHECK: 5 = MemoryPhi({entry,2},{Loop.Body,3})			; CHECK: 6 = MemoryPhi({entry,3},{Loop.Body,4})
	; CHECK-NEXT: MemoryUse(5)			; CHECK-NEXT: MemoryUse(6)
	; CHECK-NEXT: %3 = load			; CHECK-NEXT: %3 = load
	%3 = load i8, i8* %after, align 4, !invariant.group !0			%3 = load i8, i8* %after, align 4, !invariant.group !0

	; FIXME: MemoryUse(2)			; FIXME: MemoryUse(2)
	; CHECK: MemoryUse(5)			; CHECK: MemoryUse(6)
	; CHECK-NEXT: %4 = load			; CHECK-NEXT: %4 = load
	%4 = load i8, i8* %p, align 4, !invariant.group !0			%4 = load i8, i8* %p, align 4, !invariant.group !0
	br i1 undef, label %Ret, label %Loop.Body			br i1 undef, label %Ret, label %Loop.Body

	Ret:			Ret:
	ret i8 %3			ret i8 %3
	}			}

				; In the future we would like to CSE barriers if there is no clobber between.
				; CHECK-LABEL: define i8 @optimizable()
				define i8 @optimizable() {
				entry:
				%ptr = alloca i8
				; CHECK: 1 = MemoryDef(liveOnEntry)
				; CHECK-NEXT: store i8 42, i8* %ptr, !invariant.group !0
				store i8 42, i8* %ptr, !invariant.group !0
				; CHECK: 2 = MemoryDef(1)
				; CHECK-NEXT: call i8* @llvm.invariant.group.barrier
				%ptr2 = call i8* @llvm.invariant.group.barrier.p0i8(i8* %ptr)
				; FIXME: This one could be CSEd.
				; CHECK: 3 = MemoryDef(2)
				; CHECK: call i8* @llvm.invariant.group.barrier
				%ptr3 = call i8* @llvm.invariant.group.barrier.p0i8(i8* %ptr)
				; CHECK: 4 = MemoryDef(3)
				; CHECK-NEXT: call void @clobber8(i8* %ptr)
				call void @clobber8(i8* %ptr)
				; CHECK: 5 = MemoryDef(4)
				; CHECK-NEXT: call void @use(i8* %ptr2)
				call void @use(i8* %ptr2)
				; CHECK: 6 = MemoryDef(5)
				; CHECK-NEXT: call void @use(i8* %ptr3)
				call void @use(i8* %ptr3)
				; CHECK: MemoryUse(6)
				; CHECK-NEXT: load i8, i8* %ptr3, {{.*}}!invariant.group
				%v = load i8, i8* %ptr3, !invariant.group !0

				ret i8 %v
				}

				; CHECK-LABEL: define i8 @unoptimizable2()
				define i8 @unoptimizable2() {
				%ptr = alloca i8
				; CHECK: 1 = MemoryDef(liveOnEntry)
				; CHECK-NEXT: store i8 42, i8* %ptr, !invariant.group !0
				store i8 42, i8* %ptr, !invariant.group !0
				; CHECK: 2 = MemoryDef(1)
				; CHECK-NEXT: call i8* @llvm.invariant.group.barrier
				%ptr2 = call i8* @llvm.invariant.group.barrier.p0i8(i8* %ptr)
				; CHECK: 3 = MemoryDef(2)
				store i8 43, i8* %ptr
				; CHECK: 4 = MemoryDef(3)
				; CHECK-NEXT: call i8* @llvm.invariant.group.barrier
				%ptr3 = call i8* @llvm.invariant.group.barrier.p0i8(i8* %ptr)
				; CHECK: 5 = MemoryDef(4)
				; CHECK-NEXT: call void @clobber8(i8* %ptr)
				call void @clobber8(i8* %ptr)
				; 6 = MemoryDef(5)
				; CHECK-NEXT call void @use(i8* %ptr2)
				call void @use(i8* %ptr2)
				; CHECK: 7 = MemoryDef(6)
				; CHECK-NEXT: call void @use(i8* %ptr3)
				call void @use(i8* %ptr3)
				; CHECK: MemoryUse(7)
				; CHECK-NEXT: %v = load i8, i8* %ptr3, !invariant.group !0
				%v = load i8, i8* %ptr3, !invariant.group !0
				ret i8 %v
				}


	declare i8* @llvm.invariant.group.barrier.p0i8(i8*)			declare i8* @llvm.invariant.group.barrier.p0i8(i8*)
	declare void @clobber(i32*)			declare void @clobber(i32*)
	declare void @clobber8(i8*)			declare void @clobber8(i8*)
				declare void @use(i8* readonly)

	!0 = !{!"group1"}			!0 = !{!"group1"}

llvm/test/Other/invariant.group.barrier.ll

	; RUN: opt -S -early-cse < %s | FileCheck %s			; RUN: opt -S -early-cse < %s | FileCheck %s
	; RUN: opt -S -gvn < %s | FileCheck %s			; RUN: opt -S -gvn < %s | FileCheck %s
	; RUN: opt -S -newgvn < %s | FileCheck %s			; RUN: opt -S -newgvn < %s | FileCheck %s
	; RUN: opt -S -O3 < %s | FileCheck %s			; RUN: opt -S -O3 < %s | FileCheck %s

	; These tests checks if passes with CSE functionality can do CSE on			; These tests checks if passes with CSE functionality can do CSE on
	; invariant.group.barrier, that is prohibited if there is a memory clobber			; invariant.group.barrier, that is prohibited if there is a memory clobber
	; between barriers call.			; between barriers call.

	; CHECK-LABEL: define i8 @optimizable()			; CHECK-LABEL: define i8 @optimizable()
	define i8 @optimizable() {			define i8 @optimizable() {
	entry:			entry:
	%ptr = alloca i8			%ptr = alloca i8
	store i8 42, i8* %ptr, !invariant.group !0			store i8 42, i8* %ptr, !invariant.group !0
	; CHECK: call i8* @llvm.invariant.group.barrier.p0i8			; CHECK: call i8* @llvm.invariant.group.barrier.p0i8
	%ptr2 = call i8* @llvm.invariant.group.barrier.p0i8(i8* %ptr)			%ptr2 = call i8* @llvm.invariant.group.barrier.p0i8(i8* %ptr)
	; CHECK-NOT: call i8* @llvm.invariant.group.barrier.p0i8			; FIXME: This one could be CSE
				; CHECK: call i8* @llvm.invariant.group.barrier
	%ptr3 = call i8* @llvm.invariant.group.barrier.p0i8(i8* %ptr)			%ptr3 = call i8* @llvm.invariant.group.barrier.p0i8(i8* %ptr)
	; CHECK: call void @clobber(i8* {{.*}}%ptr)			; CHECK: call void @clobber(i8* {{.*}}%ptr)
	call void @clobber(i8* %ptr)			call void @clobber(i8* %ptr)

	; CHECK: call void @use(i8* {{.*}}%ptr2)			; CHECK: call void @use(i8* {{.*}}%ptr2)
	call void @use(i8* %ptr2)			call void @use(i8* %ptr2)
	; CHECK: call void @use(i8* {{.*}}%ptr2)			; CHECK: call void @use(i8* {{.*}}%ptr3)
	call void @use(i8* %ptr3)			call void @use(i8* %ptr3)
	; CHECK: load i8, i8* %ptr2, {{.*}}!invariant.group			; CHECK: load i8, i8* %ptr3, {{.*}}!invariant.group
	%v = load i8, i8* %ptr3, !invariant.group !0			%v = load i8, i8* %ptr3, !invariant.group !0

	ret i8 %v			ret i8 %v
	}			}

	; CHECK-LABEL: define i8 @unoptimizable()			; CHECK-LABEL: define i8 @unoptimizable()
	define i8 @unoptimizable() {			define i8 @unoptimizable() {
	entry:			entry:
	Show All 11 Lines
	; CHECK: call void @use(i8* {{.*}}%ptr3)			; CHECK: call void @use(i8* {{.*}}%ptr3)
	call void @use(i8* %ptr3)			call void @use(i8* %ptr3)
	; CHECK: load i8, i8* %ptr3, {{.*}}!invariant.group			; CHECK: load i8, i8* %ptr3, {{.*}}!invariant.group
	%v = load i8, i8* %ptr3, !invariant.group !0			%v = load i8, i8* %ptr3, !invariant.group !0

	ret i8 %v			ret i8 %v
	}			}

				; CHECK-LABEL: define i8 @unoptimizable2()
				define i8 @unoptimizable2() {
				%ptr = alloca i8
				store i8 42, i8* %ptr, !invariant.group !0
				; CHECK: call i8* @llvm.invariant.group.barrier
				%ptr2 = call i8* @llvm.invariant.group.barrier.p0i8(i8* %ptr)
				store i8 43, i8* %ptr
				; CHECK: call i8* @llvm.invariant.group.barrier
				%ptr3 = call i8* @llvm.invariant.group.barrier.p0i8(i8* %ptr)
				; CHECK: call void @clobber(i8* {{.*}}%ptr)
				call void @clobber(i8* %ptr)
				; CHECK: call void @use(i8* {{.*}}%ptr2)
				call void @use(i8* %ptr2)
				; CHECK: call void @use(i8* {{.*}}%ptr3)
				call void @use(i8* %ptr3)
				; CHECK: load i8, i8* %ptr3, {{.*}}!invariant.group
				%v = load i8, i8* %ptr3, !invariant.group !0
				ret i8 %v
				}

	declare void @use(i8* readonly)			declare void @use(i8* readonly)

	declare void @clobber(i8*)			declare void @clobber(i8*)
	; CHECK: Function Attrs: argmemonly nounwind readonly			; CHECK: Function Attrs: inaccessiblememonly nounwind{{$}}
	; CHECK-NEXT: declare i8* @llvm.invariant.group.barrier.p0i8(i8*)			; CHECK-NEXT: declare i8* @llvm.invariant.group.barrier.p0i8(i8*)
	declare i8* @llvm.invariant.group.barrier.p0i8(i8*)			declare i8* @llvm.invariant.group.barrier.p0i8(i8*)

	!0 = !{}			!0 = !{}
