
MemorySSA Optimizations: Patch 1 of N
ClosedPublic

Authored by george.burgess.iv on Feb 3 2016, 9:21 PM.

Details

Summary

This patch re-adds two of the optimizations that were taken out of MemorySSA due to sketchiness.

Specifically:

  • We now try to take advantage of memory ordering rules on loads before querying AA
  • We recognize and appropriately react to invariant loads, and loads that AA can prove point to constant memory.

Note that the memory ordering check was strengthened to Monotonic, because that's the equivalent of std::memory_order_relaxed, which guarantees no ordering whatsoever, AFAIK.
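For concreteness, here is a minimal sketch of the two checks the summary describes. This is not the patch itself; the helper names and the enum-class spelling of AtomicOrdering are my assumptions:

#include "llvm/IR/Instructions.h"
#include "llvm/IR/LLVMContext.h"

using namespace llvm;

// Sketch only: a load ordered at or below Monotonic (std::memory_order_relaxed)
// imposes no ordering of its own, so ordering rules alone don't pin it in place.
static bool isOrderedAtMostMonotonic(const LoadInst *LI) {
  AtomicOrdering Ord = LI->getOrdering();
  return Ord == AtomicOrdering::NotAtomic ||
         Ord == AtomicOrdering::Unordered ||
         Ord == AtomicOrdering::Monotonic;
}

// Sketch only: an !invariant.load can never observe a different value, so a
// MemorySSA walker can treat it as liveOnEntry without querying AA.
static bool isInvariantLoad(const LoadInst *LI) {
  return LI->getMetadata(LLVMContext::MD_invariant_load) != nullptr;
}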

Also note that I'm not an expert in memory models. So if anything seems fishy, let me know. :)

Diff Detail

Repository
rL LLVM

Event Timeline

george.burgess.iv retitled this revision from to MemorySSA Optimizations: Patch 1 of N.
george.burgess.iv updated this object.
george.burgess.iv added reviewers: hfinkel, reames.
majnemer added inline comments.
lib/Transforms/Utils/MemorySSA.cpp
940 ↗(On Diff #46869)

TODO(username) is more of a Google-ism; I don't think LLVM does this very often.

942–943 ↗(On Diff #46869)

Is this clang-format'd?

george.burgess.iv marked 2 inline comments as done.

Addressed all feedback + made a comment more concise.

lib/Transforms/Utils/MemorySSA.cpp
940 ↗(On Diff #46869)

Works for me! Thanks for the heads up

942–943 ↗(On Diff #46869)

It is now :)

I'll look when I'm back from getting married :)

davidxl added inline comments. Feb 25 2016, 3:01 PM
lib/Transforms/Utils/MemorySSA.cpp
711 ↗(On Diff #46875)

Unfinished sentence here.

713 ↗(On Diff #46875)

nit: MaybeDef -> MayDef

714 ↗(On Diff #46875)

This needs some explanation: can the alias query be moved here to make this function more general?

724 ↗(On Diff #46875)

Move this comment up and combine with the other volatile related comment.

744 ↗(On Diff #46875)

This does not look correct:

When canUseBeReorderedAboveDef returns false, it is not OK to unconditionally fall through to the AA query, unless the AA query also honors the ordering constraint (I have not checked).
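(In code-shaped terms, the safe pattern being asked for here might look like the following sketch; canUseBeReorderedAboveDef's real signature and the circa-2016 MRI_* names are assumptions on my part, not the patch's actual code.)

#include "llvm/Analysis/AliasAnalysis.h"
#include "llvm/Analysis/MemoryLocation.h"
#include "llvm/IR/Instructions.h"

using namespace llvm;

// Hypothetical clobber check: 'true' means DefInst clobbers UseLoad.
static bool defClobbersUse(AAResults &AA, const Instruction *DefInst,
                           const LoadInst *UseLoad) {
  // Ordering first: if the use can't legally move above the def, it's a
  // clobber regardless of aliasing, and the AA query is never reached.
  if (!canUseBeReorderedAboveDef(UseLoad, DefInst))
    return true;
  // Only now is it safe to let aliasing decide.
  ModRefInfo MRI = AA.getModRefInfo(DefInst, MemoryLocation::get(UseLoad));
  return (MRI & MRI_Mod) != 0;
}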

test/Transforms/Util/MemorySSA/atomic-clobber.ll
16 ↗(On Diff #46875)

What is this change about?

41 ↗(On Diff #46875)

Is this correct? The atomic load cannot be reordered above the previous acquire load.

44 ↗(On Diff #46875)

Is this too conservative?

george.burgess.iv marked 4 inline comments as done.

Addressed feedback

lib/Transforms/Utils/MemorySSA.cpp
714 ↗(On Diff #46875)

Do you still think this should happen (given my response to your comment below)?

724 ↗(On Diff #46875)

I'm assuming you meant just the first sentence -- done. :)

744 ↗(On Diff #46875)

Looking at the things we query, it seems that AAResults (lib/Analysis/AliasAnalysis.cpp) *always* hands back MRI_ModRef for loads/stores if there's any kind of ordering (including volatile) involved, but it only returns MRI_ModRef for an AtomicCmpXchgInst or AtomicRMWInst if the ordering is greater than Monotonic. This seems inconsistent, so I've sent out http://reviews.llvm.org/D17631 to see if that's intentional or not.

If said patch goes in, AAResults should always be conservative in the cases that canUseBeReorderedAboveDef says are no bueno.

If you think it would be better, I can try adding a canUseNeverBeReorderedAboveDef function, or I can have canUseBeReorderedAboveDef return an enum of {Never, IfNoAlias, Always}. Entirely up to you. :)

test/Transforms/Util/MemorySSA/atomic-clobber.ll
16 ↗(On Diff #46875)

Random noise. Reverted.

41 ↗(On Diff #46875)

I believe it is. Even if the load for %2 were non-atomic, it couldn't be hoisted above an acquire, because that would break cases like:

#include <atomic>
#include <cassert>

struct Foo {
  std::atomic<int> a;
  int b;
  Foo() : a(0), b(0) {}
};

Foo f;

void thread1() {
  f.b = 1;                                 // plain store, published by the release below
  f.a.store(1, std::memory_order_release);
}

void thread2() {
  // The acquire load synchronizes-with thread1's release store; if it reads 1,
  // the store to f.b is guaranteed visible. Hoisting the f.b load above the
  // acquire would break that guarantee.
  if (f.a.load(std::memory_order_acquire)) {
    assert(f.b == 1);
  }
}

Wouldn't it? (specifically, if we allowed this, thread2 would be able to load f.b before f.a)

44 ↗(On Diff #46875)

For the same reason as above, I think this is all we can do here.

davidxl added inline comments. Feb 25 2016, 7:25 PM
test/Transforms/Util/MemorySSA/atomic-clobber.ll
42 ↗(On Diff #49139)

Yes -- a load-acquire prevents all following loads/stores from being moved above it. The problem is that the MemorySSA dump confused me a little:

I thought the ID:1 MemoryDef was for the LiveOnEntry, but actually it is the MemoryDef associated with the acquire load.

So indeed this is correct.

45 ↗(On Diff #49139)

you are right -- I misread the dump.

davidxl added inline comments. Feb 27 2016, 3:43 PM
lib/Transforms/Utils/MemorySSA.cpp
752 ↗(On Diff #49139)

MaybeDef --> MayDef

759 ↗(On Diff #49139)

The problem with the current interface design is that 'false' does not really mean 'false' -- it means either 'don't know yet' or 'no, it cannot be reordered'. This either leads to a redundant check later (if it meant 'no') or to the check being mistakenly skipped later (if it meant 'don't know yet'), because the client code cannot tell the difference. A possible solution is to move the check that both accesses are loads outside this function and change the function name to canLoadsBeSafelyReordered(...):

if (isLoad(Def) && isLoad(Use))
  return canLoadsBeSafelyReordered(Def, Use);

// Otherwise, use the AA interface to do the check here.

...
773 ↗(On Diff #49139)

This is too strict.

if (LoadDef->getOrdering() <= Monotonic &&
    LoadUse->getOrdering() <= Monotonic)
  return true;

// Check other cases. For instance, non-acquire loads before an acquire load can be moved after it.

788–789 ↗(On Diff #49139)

Yes -- there should be a 'handshake' with the AA interfaces; otherwise you will need special treatment here for synchronization-related accesses before the AA query. Adding test cases to cover those would be good.

george.burgess.iv marked 6 inline comments as done.

Addressed all feedback

lib/Transforms/Utils/MemorySSA.cpp
752 ↗(On Diff #49139)

Oops. :)

759 ↗(On Diff #49139)

Yeah, I like canLoadsBeReordered better.

773 ↗(On Diff #49139)

Nice catch.

788–789 ↗(On Diff #49139)

Added check_aa_is_sane (which is really tiny) in atomic-clobber.ll. I think it largely covers the behavior that we need, though.

Fix a typo in the tests.

davidxl added inline comments. Mar 4 2016, 11:29 AM
lib/Transforms/Utils/MemorySSA.cpp
864 ↗(On Diff #49548)

The overall structure of the patch looks good; however, this function still needs more scrutiny. Please ping Philip or Hal for more comments.

george.burgess.iv marked an inline comment as done. Mar 8 2016, 3:42 PM

Ping :)

So, I bugged Philip at the social, and he said the patch (as it stood) looks good, with a few comments:

  1. The LLVM spec is ambiguous about whether we can hoist a non-volatile load above a volatile load when the loads alias. It's probably best not to exploit this ambiguity at the moment by unconditionally allowing the motion of non-volatile loads above volatile loads (and vice versa).
  2. It may be good to make pointsToConstantMemory bit check not succeed if the operation is a volatile access
  3. A few style things

Items #3 and #1 have been addressed. #1 required a bit of refactoring of the newly-named getLoadReorderability function, because we now have to care about aliasing in some cases.
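For reference, here is a sketch of the shape getLoadReorderability might take after that refactoring, using the three-valued {Never, IfNoAlias, Always} idea floated earlier in the thread. This is my reconstruction, not the committed code; the helper isAtLeastOrStrongerThan and the exact case analysis are assumptions:

#include "llvm/IR/Instructions.h"
#include "llvm/Support/AtomicOrdering.h"

using namespace llvm;

// Three-valued answer, so 'false' no longer conflates "never reorderable"
// with "reorderable only when the loads don't alias".
enum class Reorderability { Always, IfNoAlias, Never };

// May 'Use' be hoisted above 'MayClobber'? One-way check only.
static Reorderability getLoadReorderability(const LoadInst *Use,
                                            const LoadInst *MayClobber) {
  // Volatile operations never reorder with each other.
  if (Use->isVolatile() && MayClobber->isVolatile())
    return Reorderability::Never;

  // A seq_cst use, or an acquire-or-stronger clobber, pins the use in place.
  if (Use->getOrdering() == AtomicOrdering::SequentiallyConsistent ||
      isAtLeastOrStrongerThan(MayClobber->getOrdering(),
                              AtomicOrdering::Acquire))
    return Reorderability::Never;

  // Mixed volatile/non-volatile is the spec ambiguity from comment #1:
  // don't exploit it; allow the motion only on proven no-alias.
  if (Use->isVolatile() || MayClobber->isVolatile())
    return Reorderability::IfNoAlias;

  return Reorderability::Always;
}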

Item #2 I've thought about, and I'm no longer sure that I agree. Specifically, I believe that MemorySSA's job is to reason about whether one memory operation can cause another memory operation to produce a different result. If MemorySSA determines that memop A doesn't interfere with memop B, it's the *user's* job to determine where it's actually safe to put memop B. For example, consider:

define void @foo(i8* %a) {
  ; 1 = MemoryDef(liveOnEntry)
  store volatile i8 0, i8* %a
  br i1 undef, label %if.then, label %if.end

if.then:
  ; 2 = MemoryDef(1)
  %v = load volatile i8, i8* %a
  br label %if.end

if.end:
  ret void
}

...MemorySSA will happily (and correctly) say that the volatile load is clobbered by the volatile store. However, it's clearly incorrect to hoist the volatile load into the entry block here, so passes that use MemorySSA will need to exercise some amount of caution in cases involving volatile ops anyway. For this reason, I think it's fine if we say that loads of constant memory are always live on entry, regardless of whether the load is volatile or not.
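A concrete rendering of that position (a hedged sketch; the function name and the surrounding walker plumbing are hypothetical, not the patch's code):

#include "llvm/Analysis/AliasAnalysis.h"
#include "llvm/Analysis/MemoryLocation.h"
#include "llvm/IR/Instructions.h"
#include "llvm/Support/Casting.h"

using namespace llvm;

// Sketch only: a load of provably-constant memory can never be clobbered, so
// the walker may treat it as liveOnEntry. Deliberately no isVolatile() test,
// per the argument above that placement legality is the user's job.
static bool isLoadOfConstantMemory(const Instruction *Inst, AAResults &AA) {
  if (const auto *LI = dyn_cast<LoadInst>(Inst))
    return AA.pointsToConstantMemory(MemoryLocation::get(LI));
  return false;
}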

It may be good to make pointsToConstantMemory bit check not succeed if the operation is a volatile access

Wow, that was phrased poorly.

It may be good to make the pointsToConstantMemory check (line 1122) not succeed if the operation is a volatile access. That's what I meant. :)

This revision was automatically updated to reflect the committed changes.