This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
include/llvm/
-
llvm/
-
Analysis/
-
CaptureTracking.h
-
Transforms/Utils/
-
Utils/
-
LoopUtils.h
-
lib/
-
Analysis/
6/6
CaptureTracking.cpp
-
Passes/
-
PassBuilder.cpp
-
PassRegistry.def
-
Transforms/Scalar/
-
Scalar/
-
DeadStoreElimination.cpp
-
LICM.cpp

Differential D27585

[CaptureTracking] Add optimistic capture tracker for stores
AbandonedPublic

Authored by JDevlieghere on Dec 8 2016, 11:43 AM.

Download Raw Diff

Details

Reviewers

reames
sanjoy
hfinkel

Summary

The current capture tracking implementation always considers a store instruction to capture the given use. The example below shows a situation where this is overly pessimistic. The current implementation considers %ptr to be captured because it's the operand of the second store.

define void @sample() {
entry:
  %ptr = alloca i32
  store i32 1, i32* %ptr
  %ptrtoptr = alloca i32*
  store i32* %ptr, i32** %ptrtoptr
  ret void
}

This patch adds a new capture tracker implementation called OptimisticCaptureTracker. It can be used through a slightly modified interface that expects alias analysis results as an additional argument. If storeCaptures is set to true or no AA is provided, the OptimisticCaptureTracker behaves exactly as the current SimpleCaptureTracker. Otherwise, it only consider %ptr to escape in the previous example if it is stored to either a global or to one of the arguments of the function.

Diff Detail

Repository: rL LLVM

Event Timeline

JDevlieghere updated this revision to Diff 80795.Dec 8 2016, 11:43 AM

JDevlieghere retitled this revision from to [CaptureTracking] Add optimistic capture tracker for stores.

JDevlieghere updated this object.

JDevlieghere added reviewers: hfinkel, reames.

JDevlieghere set the repository for this revision to rL LLVM.

JDevlieghere added subscribers: dzn, apilipenko, anna, llvm-commits.

mssimpso added a subscriber: mssimpso.Dec 8 2016, 4:11 PM

Otherwise, it only consider %ptr to escape in the previous example if it is stored to either a global or to one of the arguments of the function.

I suppose this is named "optimistic" because it might return a false negative? If the pointer value is stored into some alloca then read and the read value stored into some global, then the value is captured but your analysis will return false. Is this what you intend?

In D27585#621863, @hfinkel wrote:

Otherwise, it only consider %ptr to escape in the previous example if it is stored to either a global or to one of the arguments of the function.

I suppose this is named "optimistic" because it might return a false negative? If the pointer value is stored into some alloca then read and the read value stored into some global, then the value is captured but your analysis will return false. Is this what you intend?

Does the example below cover what you have in mind?

@global = external global i32*

define void @sample() {
entry:
  %ptr = alloca i32
  store i32 1, i32* %ptr
  %ptrtoptr = alloca i32*
  store i32* %ptr, i32** %ptrtoptr
  %deref = load i32*, i32** %ptrtoptr
  store i32* %deref , i32** @global
  ret void
}

If so, the escape of %ptr is properly detected by my change. I have a small printer pass for capture tracking which enables me to easily verify this kind of stuff. I will update my diff to include it, we can always remove it again later.

Added small printer pass for capture tracking.

In D27585#621980, @JDevlieghere wrote:
In D27585#621863, @hfinkel wrote:

Otherwise, it only consider %ptr to escape in the previous example if it is stored to either a global or to one of the arguments of the function.

I suppose this is named "optimistic" because it might return a false negative? If the pointer value is stored into some alloca then read and the read value stored into some global, then the value is captured but your analysis will return false. Is this what you intend?

Does the example below cover what you have in mind?
@global = external global i32*

define void @sample() {
entry:
  %ptr = alloca i32
  store i32 1, i32* %ptr
  %ptrtoptr = alloca i32*
  store i32* %ptr, i32** %ptrtoptr
  %deref = load i32*, i32** %ptrtoptr
  store i32* %deref , i32** @global
  ret void
}
If so, the escape of %ptr is properly detected by my change. I have a small printer pass for capture tracking which enables me to easily verify this kind of stuff. I will update my diff to include it, we can always remove it again later.

Can you please explain how this is detected? The store is to an alloca, and I don't believe that the system does any kind of memory-based data-flow tracking.

In any cases, some regression test cases will be necessary.

lib/Analysis/CaptureTracking.cpp
81	An Argument is always a Value, so this cast/check should never be necessary.
82	This check won't work correctly if the value is stored into some offset based on the argument. You can make the right kind of query, but only if you construct a Location with an unknown size based on the value.
90	Again, this cast/check is not necessary (a Global is always a Value).
91	Same comment here about the size/offset issue.
101	Remove this blank line.
223	Add a space in between the for and the ( [same for the lines below for for/if].

Addressed @hfinkel's comments
Added a few simple tests

Fixed style comment (not sure why clang-format missed it?)
Only print values that have a name

In D27585#622381, @hfinkel wrote:
In D27585#621980, @JDevlieghere wrote:
In D27585#621863, @hfinkel wrote:

Otherwise, it only consider %ptr to escape in the previous example if it is stored to either a global or to one of the arguments of the function.

I suppose this is named "optimistic" because it might return a false negative? If the pointer value is stored into some alloca then read and the read value stored into some global, then the value is captured but your analysis will return false. Is this what you intend?

Does the example below cover what you have in mind?
@global = external global i32*

define void @sample() {
entry:
  %ptr = alloca i32
  store i32 1, i32* %ptr
  %ptrtoptr = alloca i32*
  store i32* %ptr, i32** %ptrtoptr
  %deref = load i32*, i32** %ptrtoptr
  store i32* %deref , i32** @global
  ret void
}
If so, the escape of %ptr is properly detected by my change. I have a small printer pass for capture tracking which enables me to easily verify this kind of stuff. I will update my diff to include it, we can always remove it again later.
Can you please explain how this is detected? The store is to an alloca, and I don't believe that the system does any kind of memory-based data-flow tracking.

I don't know enough about AA to make a definitive statement about this. Conceptually, everything makes sense up the isNoAlias call. The two pointers do point to the same value, so assuming a perfect AA implementation, the results is not totally unexpected. Maybe the default is very conservative and says that's it can't say for sure that the pointer and the global don't alias. As a result the pointer is considered to be captured. The test case I added for storing to the function argument behaves exactly the same, it reports the pointer being captured as desired. I'll dig deeper into AA, because I want to understand what happens, but for now this is the best explanation I can give.

In any cases, some regression test cases will be necessary.

I've added 3 tests to illustrate the added merit of this change. Since the printer pass makes it easy to test the current CT implementation, I'm planning on adding more tests for the other parts of the algorithm in a separate differential.

In D27585#622547, @JDevlieghere wrote:
In D27585#622381, @hfinkel wrote:
In D27585#621980, @JDevlieghere wrote:
In D27585#621863, @hfinkel wrote:

Otherwise, it only consider %ptr to escape in the previous example if it is stored to either a global or to one of the arguments of the function.

I suppose this is named "optimistic" because it might return a false negative? If the pointer value is stored into some alloca then read and the read value stored into some global, then the value is captured but your analysis will return false. Is this what you intend?

Does the example below cover what you have in mind?
@global = external global i32*

define void @sample() {
entry:
  %ptr = alloca i32
  store i32 1, i32* %ptr
  %ptrtoptr = alloca i32*
  store i32* %ptr, i32** %ptrtoptr
  %deref = load i32*, i32** %ptrtoptr
  store i32* %deref , i32** @global
  ret void
}
If so, the escape of %ptr is properly detected by my change. I have a small printer pass for capture tracking which enables me to easily verify this kind of stuff. I will update my diff to include it, we can always remove it again later.
Can you please explain how this is detected? The store is to an alloca, and I don't believe that the system does any kind of memory-based data-flow tracking.
I don't know enough about AA to make a definitive statement about this. Conceptually, everything makes sense up the isNoAlias call. The two pointers do point to the same value, so assuming a perfect AA implementation, the results is not totally unexpected. Maybe the default is very conservative and says that's it can't say for sure that the pointer and the global don't alias. As a result the pointer is considered to be captured. The test case I added for storing to the function argument behaves exactly the same, it reports the pointer being captured as desired. I'll dig deeper into AA, because I want to understand what happens, but for now this is the best explanation I can give.

Yea, we need to figure this out. Do you have BasicAA enabled? What happens if you add -aa-pipeline=basic-aa to the opt command?

In any cases, some regression test cases will be necessary.

I've added 3 tests to illustrate the added merit of this change. Since the printer pass makes it easy to test the current CT implementation, I'm planning on adding more tests for the other parts of the algorithm in a separate differential.

In D27585#623234, @hfinkel wrote:

Yea, we need to figure this out. Do you have BasicAA enabled? What happens if you add -aa-pipeline=basic-aa to the opt command?

With basic AA it is indeed not working; The results are only as good as the AA method provided, and in a sense it's optimistic in that regard. Would you agree that this is fair behavior, because the interface explicitly expects AA results and a flag indicating whether or not a store should capture? This would need to be documented of course.

In D27585#623431, @JDevlieghere wrote:

In D27585#623234, @hfinkel wrote:

Yea, we need to figure this out. Do you have BasicAA enabled? What happens if you add -aa-pipeline=basic-aa to the opt command?

With basic AA it is indeed not working; The results are only as good as the AA method provided, and in a sense it's optimistic in that regard. Would you agree that this is fair behavior, because the interface explicitly expects AA results and a flag indicating whether or not a store should capture? This would need to be documented of course.

Can you explain the use case for a capture-tracking mode that might return false negatives?

Let's consider the following table:

	No AA	Basic AA	Advanced AA
Current Capture Tracking	false positives (a)	n/a	n/a
Optimistic Capture Tracking	false positives (b)	false negatives (c)	less false positives (d)

Right now, only situation (a) exists, where we get false positives for stores to non-aliasing function arguments and globals, as illustrated by the example in the original description of this patch. Optimistic capture tracking behaves exactly the same if no AA is provided, so situation (a) and (b) are identical.

The interesting cases are (c) and (d), especially in combination with the situation you described, where a "value is stored into some alloca then read and the read value stored into some global".

For (c), basic AA doesn't perform memory-based data-flow tracking and it returns a false negative, which in turn causes the optimistic capture tracker to return a false negative as well. Returning a false negative is less desirable than returning a false positive, i.e. situation (a). So I suggest we enforce that this scenario can not occur, e.g. with an assert.
For (d), it properly detects that the two variables might alias, and we reduce the set of false positives generated by stores to globals and function argument. The lit tests verify this.

The net result is that we ether end up in (a) or (d) which is in my opinion more desirable than what we currently have.

ping

In D27585#623685, @JDevlieghere wrote:

Let's consider the following table:

No AA Basic AA Advanced AA

Current Capture Tracking false positives (a) n/a n/a

Optimistic Capture Tracking false positives (b) false negatives (c) less false positives (d)

Right now, only situation (a) exists, where we get false positives for stores to non-aliasing function arguments and globals, as illustrated by the example in the original description of this patch. Optimistic capture tracking behaves exactly the same if no AA is provided, so situation (a) and (b) are identical.

The interesting cases are (c) and (d), especially in combination with the situation you described, where a "value is stored into some alloca then read and the read value stored into some global".

For (c), basic AA doesn't perform memory-based data-flow tracking and it returns a false negative, which in turn causes the optimistic capture tracker to return a false negative as well. Returning a false negative is less desirable than returning a false positive, i.e. situation (a). So I suggest we enforce that this scenario can not occur, e.g. with an assert.

What would you assert? That is, what would the assert look like?

For (d), it properly detects that the two variables might alias, and we reduce the set of false positives generated by stores to globals and function argument. The lit tests verify this.

Which lit test?

In any case we can't rely on the AA being smart or advanced for correctness. That is, a less aggressive / more conservative AA should not cause us to miscompile code.

The net result is that we ether end up in (a) or (c) which is in my opinion more desirable than what we currently have.

Optimizations based on (c) will generally be incorrect and so (a) or (c) is less desirable than (a). That is, (a) == "always correct", (a) or (c) == "sometimes wrong".

I think Hal's question was more of the tune of: do you have optimizations that will be correct even with false negatives? That is, even if the optimization thinks a value did not escape when it actually did, the transform it does will be be correct? If so, that is interestingly different from what LLVM does today, and worth discussion _before_ we add this new capture tracker to LLVM.

As per previous comment.

This revision now requires changes to proceed.Dec 22 2016, 2:28 PM

In D27585#630030, @sanjoy wrote:

In D27585#623685, @JDevlieghere wrote:

Let's consider the following table:

...

I think Hal's question was more of the tune of: do you have optimizations that will be correct even with false negatives? That is, even if the optimization thinks a value did not escape when it actually did, the transform it does will be be correct? If so, that is interestingly different from what LLVM does today, and worth discussion _before_ we add this new capture tracker to LLVM.

Yes, this is exactly what I'd like to know.

In D27585#630030, @sanjoy wrote:

In D27585#623685, @JDevlieghere wrote:

Let's consider the following table:

No AA Basic AA Advanced AA

Current Capture Tracking false positives (a) n/a n/a

Optimistic Capture Tracking false positives (b) false negatives (c) less false positives (d)

Right now, only situation (a) exists, where we get false positives for stores to non-aliasing function arguments and globals, as illustrated by the example in the original description of this patch. Optimistic capture tracking behaves exactly the same if no AA is provided, so situation (a) and (b) are identical.

The interesting cases are (c) and (d), especially in combination with the situation you described, where a "value is stored into some alloca then read and the read value stored into some global".

For (c), basic AA doesn't perform memory-based data-flow tracking and it returns a false negative, which in turn causes the optimistic capture tracker to return a false negative as well. Returning a false negative is less desirable than returning a false positive, i.e. situation (a). So I suggest we enforce that this scenario can not occur, e.g. with an assert.

What would you assert? That is, what would the assert look like?

Originally I wanted to assert the algorithm used for AA. However, now I understand that unless the AA algorithm is perfect, there will always be a case where this generates a false negative. What I'd need for this to work properly is for AA to provide a guaranteed must not alias result. Unfortunately for me, this is inherently incompatible with what we have now. LLVM is (correctly) conservative in saying that something aliases, while I would need an implementation that is conservative in saying that something does not alias.

For (d), it properly detects that the two variables might alias, and we reduce the set of false positives generated by stores to globals and function argument. The lit tests verify this.

Which lit test?

captureindirectstore.ll

In any case we can't rely on the AA being smart or advanced for correctness. That is, a less aggressive / more conservative AA should not cause us to miscompile code.

Of course, this goes without saying!

The net result is that we ether end up in (a) or (c) which is in my opinion more desirable than what we currently have.

Optimizations based on (c) will generally be incorrect and so (a) or (c) is less desirable than (a). That is, (a) == "always correct", (a) or (c) == "sometimes wrong".

This was a typo on my part, it should've said (a) or (d). However, it doesn't really matter anymore, because of the reason I mentioned above.

I think Hal's question was more of the tune of: do you have optimizations that will be correct even with false negatives? That is, even if the optimization thinks a value did not escape when it actually did, the transform it does will be be correct? If so, that is interestingly different from what LLVM does today, and worth discussion _before_ we add this new capture tracker to LLVM.

No, I don't have an optimization that can deal with a false negative. I originally extended the capture tracker because I ran into a lot of false negatives for my use cases. This extension made sense from a theoretical point of view, and after testing it, I was able to correctly classify every case I could come up with. I think I confused both you and myself by calling it optimistic. I didn't mean for it to be optimistic in the sense that it would misclassify captures, rather that it could do better than what we currently have with the help of AA. Anyway, now I understand that I can't guarantee my desired scenario (d). I don't really see a way around this limitation, without implementing a inverse-conservative variant of AA.

Do you guys have an alternative idea for reducing the amount of false positives? If not I guess I'll have to abandon this change. :-(

In D27585#630064, @hfinkel wrote:

Yes, this is exactly what I'd like to know.

Thank you both for the clarification. I guess I must have misunderstood the question, sorry!

In D27585#631486, @JDevlieghere wrote:

In D27585#630030, @sanjoy wrote:

In D27585#623685, @JDevlieghere wrote:

Let's consider the following table:

...

Do you guys have an alternative idea for reducing the amount of false positives? If not I guess I'll have to abandon this change. :-(

To literally do what you'd like, you'll need to do some kind of data-flow analysis. This is possible, but first we should understand the use case. Specifically, why are the variables in question being stored such that the values don't escape but nevertheless mem2reg is not promoting them to SSA values.

JDevlieghere abandoned this revision.Apr 27 2017, 6:00 AM

Revision Contents

Path

Size

include/

llvm/

Analysis/

CaptureTracking.h

35 lines

Transforms/

Utils/

LoopUtils.h

5 lines

lib/

Analysis/

CaptureTracking.cpp

117 lines

Passes/

PassBuilder.cpp

1 line

PassRegistry.def

1 line

Transforms/

Scalar/

DeadStoreElimination.cpp

5 lines

LICM.cpp

12 lines

Diff 81360

include/llvm/Analysis/CaptureTracking.h

//===----- llvm/Analysis/CaptureTracking.h - Pointer capture ----- C++ --===//		//===----- llvm/Analysis/CaptureTracking.h - Pointer capture ----- C++ --===//
//		//
// The LLVM Compiler Infrastructure		// The LLVM Compiler Infrastructure
//		//
// This file is distributed under the University of Illinois Open Source		// This file is distributed under the University of Illinois Open Source
// License. See LICENSE.TXT for details.		// License. See LICENSE.TXT for details.
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
//		//
// This file contains routines that help determine which pointers are captured.		// This file contains routines that help determine which pointers are captured.
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#ifndef LLVM_ANALYSIS_CAPTURETRACKING_H		#ifndef LLVM_ANALYSIS_CAPTURETRACKING_H
#define LLVM_ANALYSIS_CAPTURETRACKING_H		#define LLVM_ANALYSIS_CAPTURETRACKING_H

		#include "llvm/IR/PassManager.h"
		#include "llvm/Pass.h"

namespace llvm {		namespace llvm {

class Value;		class AAResults;
class Use;
class Instruction;
class DominatorTree;		class DominatorTree;
		class Function;
		class Instruction;
class OrderedBasicBlock;		class OrderedBasicBlock;
		class Use;
		class Value;
		class raw_ostream;

/// PointerMayBeCaptured - Return true if this pointer value may be captured		/// PointerMayBeCaptured - Return true if this pointer value may be captured
/// by the enclosing function (which is required to exist). This routine can		/// by the enclosing function (which is required to exist). This routine can
/// be expensive, so consider caching the results. The boolean ReturnCaptures		/// be expensive, so consider caching the results. The boolean ReturnCaptures
/// specifies whether returning the value (or part of it) from the function		/// specifies whether returning the value (or part of it) from the function
/// counts as capturing it or not. The boolean StoreCaptures specified		/// counts as capturing it or not. The boolean StoreCaptures specified
/// whether storing the value (or part of it) into memory anywhere		/// whether storing the value (or part of it) into memory anywhere
/// automatically counts as capturing it or not.		/// automatically counts as capturing it or not.
bool PointerMayBeCaptured(const Value *V,		bool PointerMayBeCaptured(const Value *V,
bool ReturnCaptures,		bool ReturnCaptures,
bool StoreCaptures);		bool StoreCaptures);

		/// PointerMayBeCaptured - Return true if this pointer value may be captured
		/// by the enclosing function (which is required to exist). This routine can
		/// be expensive, so consider caching the results. The boolean ReturnCaptures
		/// specifies whether returning the value (or part of it) from the function
		/// counts as capturing it or not. The boolean StoreCaptures specified
		/// whether storing the value (or part of it) into memory anywhere
		/// automatically counts as capturing it or not. For the latter to have
		/// effect, Alias Analysis results are required.
		bool PointerMayBeCaptured(const Value *V, bool ReturnCaptures,
		bool StoreCaptures, AAResults *AA);

/// PointerMayBeCapturedBefore - Return true if this pointer value may be		/// PointerMayBeCapturedBefore - Return true if this pointer value may be
/// captured by the enclosing function (which is required to exist). If a		/// captured by the enclosing function (which is required to exist). If a
/// DominatorTree is provided, only captures which happen before the given		/// DominatorTree is provided, only captures which happen before the given
/// instruction are considered. This routine can be expensive, so consider		/// instruction are considered. This routine can be expensive, so consider
/// caching the results. The boolean ReturnCaptures specifies whether		/// caching the results. The boolean ReturnCaptures specifies whether
/// returning the value (or part of it) from the function counts as capturing		/// returning the value (or part of it) from the function counts as capturing
/// it or not. The boolean StoreCaptures specified whether storing the value		/// it or not. The boolean StoreCaptures specified whether storing the value
/// (or part of it) into memory anywhere automatically counts as capturing it		/// (or part of it) into memory anywhere automatically counts as capturing it
Show All 23 Lines	struct CaptureTracker {
virtual bool shouldExplore(const Use *U);		virtual bool shouldExplore(const Use *U);

/// captured - Information about the pointer was captured by the user of		/// captured - Information about the pointer was captured by the user of
/// use U. Return true to stop the traversal or false to continue looking		/// use U. Return true to stop the traversal or false to continue looking
/// for more capturing instructions.		/// for more capturing instructions.
virtual bool captured(const Use *U) = 0;		virtual bool captured(const Use *U) = 0;
};		};

		/// Printer pass for the EscapeInfo results.
		class CaptureTrackingPrinterPass
		: public PassInfoMixin<CaptureTrackingPrinterPass> {

		public:
		explicit CaptureTrackingPrinterPass(raw_ostream &OS);
		PreservedAnalyses run(Function &F, FunctionAnalysisManager &AM);

		private:
		raw_ostream &OS;
		};

/// PointerMayBeCaptured - Visit the value and the values derived from it and		/// PointerMayBeCaptured - Visit the value and the values derived from it and
/// find values which appear to be capturing the pointer value. This feeds		/// find values which appear to be capturing the pointer value. This feeds
/// results into and is controlled by the CaptureTracker object.		/// results into and is controlled by the CaptureTracker object.
void PointerMayBeCaptured(const Value V, CaptureTracker Tracker);		void PointerMayBeCaptured(const Value V, CaptureTracker Tracker);
} // end namespace llvm		} // end namespace llvm

#endif		#endif

include/llvm/Transforms/Utils/LoopUtils.h

Show First 20 Lines • Show All 297 Lines • ▼ Show 20 Lines	public:
/// this induction. If by some other means the caller has a better SCEV		/// this induction. If by some other means the caller has a better SCEV
/// expression for \p Phi than the one returned by the ScalarEvolution		/// expression for \p Phi than the one returned by the ScalarEvolution
/// analysis, it can be passed through \p Expr.		/// analysis, it can be passed through \p Expr.
static bool isInductionPHI(PHINode Phi, const Loop L, ScalarEvolution *SE,		static bool isInductionPHI(PHINode Phi, const Loop L, ScalarEvolution *SE,
InductionDescriptor &D,		InductionDescriptor &D,
const SCEV *Expr = nullptr);		const SCEV *Expr = nullptr);

/// Returns true if \p Phi is a floating point induction in the loop \p L.		/// Returns true if \p Phi is a floating point induction in the loop \p L.
/// If \p Phi is an induction, the induction descriptor \p D will contain		/// If \p Phi is an induction, the induction descriptor \p D will contain
/// the data describing this induction.		/// the data describing this induction.
static bool isFPInductionPHI(PHINode Phi, const Loop L,		static bool isFPInductionPHI(PHINode Phi, const Loop L,
ScalarEvolution *SE, InductionDescriptor &D);		ScalarEvolution *SE, InductionDescriptor &D);

/// Returns true if \p Phi is a loop \p L induction, in the context associated		/// Returns true if \p Phi is a loop \p L induction, in the context associated
/// with the run-time predicate of PSE. If \p Assume is true, this can add		/// with the run-time predicate of PSE. If \p Assume is true, this can add
/// further SCEV predicates to \p PSE in order to prove that \p Phi is an		/// further SCEV predicates to \p PSE in order to prove that \p Phi is an
/// induction.		/// induction.
▲ Show 20 Lines • Show All 112 Lines • ▼ Show 20 Lines
/// loop invariant. It takes AliasSet, Loop exit blocks vector, loop exit blocks		/// loop invariant. It takes AliasSet, Loop exit blocks vector, loop exit blocks
/// insertion point vector, PredIteratorCache, LoopInfo, DominatorTree, Loop,		/// insertion point vector, PredIteratorCache, LoopInfo, DominatorTree, Loop,
/// AliasSet information for all instructions of the loop and loop safety		/// AliasSet information for all instructions of the loop and loop safety
/// information as arguments. It returns changed status.		/// information as arguments. It returns changed status.
bool promoteLoopAccessesToScalars(AliasSet &, SmallVectorImpl<BasicBlock *> &,		bool promoteLoopAccessesToScalars(AliasSet &, SmallVectorImpl<BasicBlock *> &,
SmallVectorImpl<Instruction *> &,		SmallVectorImpl<Instruction *> &,
PredIteratorCache &, LoopInfo *,		PredIteratorCache &, LoopInfo *,
DominatorTree , const TargetLibraryInfo ,		DominatorTree , const TargetLibraryInfo ,
Loop , AliasSetTracker , LoopSafetyInfo *);		AliasAnalysis AA, Loop , AliasSetTracker *,
		LoopSafetyInfo *);

/// \brief Computes safety information for a loop		/// \brief Computes safety information for a loop
/// checks loop body & header for the possibility of may throw		/// checks loop body & header for the possibility of may throw
/// exception, it takes LoopSafetyInfo and loop as argument.		/// exception, it takes LoopSafetyInfo and loop as argument.
/// Updates safety information in LoopSafetyInfo argument.		/// Updates safety information in LoopSafetyInfo argument.
void computeLoopSafetyInfo(LoopSafetyInfo , Loop );		void computeLoopSafetyInfo(LoopSafetyInfo , Loop );

/// Returns true if the instruction in a loop is guaranteed to execute at least		/// Returns true if the instruction in a loop is guaranteed to execute at least
▲ Show 20 Lines • Show All 45 Lines • Show Last 20 Lines

lib/Analysis/CaptureTracking.cpp

Show First 20 Lines • Show All 48 Lines • ▼ Show 20 Lines	bool captured(const Use *U) override {
return true;		return true;
}		}

bool ReturnCaptures;		bool ReturnCaptures;

bool Captured;		bool Captured;
};		};

		struct OptimisticCaptureTracker : public CaptureTracker {

		OptimisticCaptureTracker(bool ReturnCaptures, bool StoreCaptures,
		AliasAnalysis *AA)
		: ReturnCaptures(ReturnCaptures), StoreCaptures(StoreCaptures), AA(AA),
		Captured(false) {}

		void tooManyUses() override { Captured = true; }

		bool storeCaptures(const StoreInst *S) {
		if (StoreCaptures)
		return true;

		if (!AA)
		return true;

		const Function *F = S->getFunction();
		const Value *Ptr = S->getPointerOperand();

		// Check if the pointer is stored to a function argument.
		for (auto &Arg : F->getArgumentList()) {
		if (!isa<PointerType>(Arg.getType()))
		continue;

		if (auto *V = dyn_cast<Value>(&Arg)) {
		hfinkelUnsubmitted Done Reply Inline Actions An Argument is always a Value, so this cast/check should never be necessary. hfinkel: An Argument is always a Value, so this cast/check should never be necessary.
		if (!AA->isNoAlias(Ptr, V)) {
		hfinkelUnsubmitted Done Reply Inline Actions This check won't work correctly if the value is stored into some offset based on the argument. You can make the right kind of query, but only if you construct a Location with an unknown size based on the value. hfinkel: This check won't work correctly if the value is stored into some offset based on the argument.
		return true;
		}
		}
		}

		// Check if the pointer is stored to a global.
		for (auto &Global : F->getParent()->globals()) {
		if (auto *V = dyn_cast<Value>(&Global)) {
		hfinkelUnsubmitted Done Reply Inline Actions Again, this cast/check is not necessary (a Global is always a Value). hfinkel: Again, this cast/check is not necessary (a Global is always a Value).
		if (!AA->isNoAlias(Ptr, V)) {
		hfinkelUnsubmitted Done Reply Inline Actions Same comment here about the size/offset issue. hfinkel: Same comment here about the size/offset issue.
		return true;
		}
		}
		}

		return false;
		}

		bool captured(const Use *U) override {

		hfinkelUnsubmitted Done Reply Inline Actions Remove this blank line. hfinkel: Remove this blank line.
		if (isa<ReturnInst>(U->getUser()) && !ReturnCaptures)
		return false;

		if (auto *S = dyn_cast<StoreInst>(U->getUser()))
		if (!storeCaptures(S))
		return false;

		Captured = true;
		return true;
		}

		bool ReturnCaptures;
		bool StoreCaptures;

		AliasAnalysis *AA;

		bool Captured;
		};

/// Only find pointer captures which happen before the given instruction. Uses		/// Only find pointer captures which happen before the given instruction. Uses
/// the dominator tree to determine whether one instruction is before another.		/// the dominator tree to determine whether one instruction is before another.
/// Only support the case where the Value is defined in the same basic block		/// Only support the case where the Value is defined in the same basic block
/// as the given instruction and the use.		/// as the given instruction and the use.
struct CapturesBefore : public CaptureTracker {		struct CapturesBefore : public CaptureTracker {

CapturesBefore(bool ReturnCaptures, const Instruction I, DominatorTree DT,		CapturesBefore(bool ReturnCaptures, const Instruction I, DominatorTree DT,
bool IncludeI, OrderedBasicBlock *IC)		bool IncludeI, OrderedBasicBlock *IC)
▲ Show 20 Lines • Show All 79 Lines • ▼ Show 20 Lines	struct CapturesBefore : public CaptureTracker {

bool ReturnCaptures;		bool ReturnCaptures;
bool IncludeI;		bool IncludeI;

bool Captured;		bool Captured;
};		};
}		}

		CaptureTrackingPrinterPass::CaptureTrackingPrinterPass(raw_ostream &OS)
		: OS(OS) {}

		PreservedAnalyses CaptureTrackingPrinterPass::run(Function &F,
		FunctionAnalysisManager &AM) {
		OS << "Escape Analysis for function: " << F.getName() << "\n";
		auto& AA = AM.getResult<AAManager>(F);
		for(auto& BB : F) {
		hfinkelUnsubmitted Done Reply Inline Actions Add a space in between the for and the ( [same for the lines below for for/if]. hfinkel: Add a space in between the for and the ( [same for the lines below for for/if].
		for(auto& I : BB) {
		if(auto* V = dyn_cast<Value>(&I)){
		if(V->getType()->isPointerTy() && PointerMayBeCaptured(V, true, false, &AA)) {
		OS << V->getName() << " may be captured\n";
		}
		}
		}
		}

		return PreservedAnalyses::all();
		}

/// PointerMayBeCaptured - Return true if this pointer value may be captured		/// PointerMayBeCaptured - Return true if this pointer value may be captured
/// by the enclosing function (which is required to exist). This routine can		/// by the enclosing function (which is required to exist). This routine can
/// be expensive, so consider caching the results. The boolean ReturnCaptures		/// be expensive, so consider caching the results. The boolean ReturnCaptures
/// specifies whether returning the value (or part of it) from the function		/// specifies whether returning the value (or part of it) from the function
/// counts as capturing it or not. The boolean StoreCaptures specified whether		/// counts as capturing it or not. The boolean StoreCaptures specified whether
/// storing the value (or part of it) into memory anywhere automatically		/// storing the value (or part of it) into memory anywhere automatically
/// counts as capturing it or not.		/// counts as capturing it or not.
bool llvm::PointerMayBeCaptured(const Value *V,		bool llvm::PointerMayBeCaptured(const Value *V,
bool ReturnCaptures, bool StoreCaptures) {		bool ReturnCaptures, bool StoreCaptures) {
assert(!isa<GlobalValue>(V) &&		assert(!isa<GlobalValue>(V) &&
"It doesn't make sense to ask whether a global is captured.");		"It doesn't make sense to ask whether a global is captured.");

// TODO: If StoreCaptures is not true, we could do Fancy analysis		// TODO: If StoreCaptures is not true, we could do Fancy analysis
// to determine whether this store is not actually an escape point.		// to determine whether this store is not actually an escape point.
// In that case, BasicAliasAnalysis should be updated as well to		// In that case, BasicAliasAnalysis should be updated as well to
// take advantage of this.		// take advantage of this.
(void)StoreCaptures;		//(void)StoreCaptures;

		// TODO: We can essentially the SimpleCaptureTracker completely with the
		// Optimistic one and possibly reuse the name?

		// SimpleCaptureTracker SCT(ReturnCaptures);
		// PointerMayBeCaptured(V, &SCT);
		// return SCT.Captured;

		OptimisticCaptureTracker OCT(ReturnCaptures, StoreCaptures, nullptr);
		PointerMayBeCaptured(V, &OCT);
		return OCT.Captured;
		}

		/// PointerMayBeCaptured - Return true if this pointer value may be captured by
		/// the enclosing function (which is required to exist). This routine can be
		/// expensive, so consider caching the results. The boolean ReturnCaptures
		/// specifies whether returning the value (or part of it) from the function
		/// counts as capturing it or not. The boolean StoreCaptures specified whether
		/// storing the value (or part of it) into memory anywhere automatically counts
		/// as capturing it or not. For the latter to have effect, Alias Analysis
		/// results are required.
		bool llvm::PointerMayBeCaptured(const Value *V, bool ReturnCaptures,
		bool StoreCaptures, AliasAnalysis *AA) {
		assert(!isa<GlobalValue>(V) &&
		"It doesn't make sense to ask whether a global is captured.");

SimpleCaptureTracker SCT(ReturnCaptures);		OptimisticCaptureTracker OCT(ReturnCaptures, StoreCaptures, AA);
PointerMayBeCaptured(V, &SCT);		PointerMayBeCaptured(V, &OCT);
return SCT.Captured;		return OCT.Captured;
}		}

/// PointerMayBeCapturedBefore - Return true if this pointer value may be		/// PointerMayBeCapturedBefore - Return true if this pointer value may be
/// captured by the enclosing function (which is required to exist). If a		/// captured by the enclosing function (which is required to exist). If a
/// DominatorTree is provided, only captures which happen before the given		/// DominatorTree is provided, only captures which happen before the given
/// instruction are considered. This routine can be expensive, so consider		/// instruction are considered. This routine can be expensive, so consider
/// caching the results. The boolean ReturnCaptures specifies whether		/// caching the results. The boolean ReturnCaptures specifies whether
/// returning the value (or part of it) from the function counts as capturing		/// returning the value (or part of it) from the function counts as capturing
▲ Show 20 Lines • Show All 179 Lines • Show Last 20 Lines

lib/Passes/PassBuilder.cpp

	Show All 18 Lines
	#include "llvm/ADT/StringSwitch.h"			#include "llvm/ADT/StringSwitch.h"
	#include "llvm/Analysis/AliasAnalysis.h"			#include "llvm/Analysis/AliasAnalysis.h"
	#include "llvm/Analysis/AliasAnalysisEvaluator.h"			#include "llvm/Analysis/AliasAnalysisEvaluator.h"
	#include "llvm/Analysis/AssumptionCache.h"			#include "llvm/Analysis/AssumptionCache.h"
	#include "llvm/Analysis/BasicAliasAnalysis.h"			#include "llvm/Analysis/BasicAliasAnalysis.h"
	#include "llvm/Analysis/BlockFrequencyInfo.h"			#include "llvm/Analysis/BlockFrequencyInfo.h"
	#include "llvm/Analysis/BlockFrequencyInfoImpl.h"			#include "llvm/Analysis/BlockFrequencyInfoImpl.h"
	#include "llvm/Analysis/BranchProbabilityInfo.h"			#include "llvm/Analysis/BranchProbabilityInfo.h"
				#include "llvm/Analysis/CaptureTracking.h"
	#include "llvm/Analysis/CFGPrinter.h"			#include "llvm/Analysis/CFGPrinter.h"
	#include "llvm/Analysis/CFLAndersAliasAnalysis.h"			#include "llvm/Analysis/CFLAndersAliasAnalysis.h"
	#include "llvm/Analysis/CFLSteensAliasAnalysis.h"			#include "llvm/Analysis/CFLSteensAliasAnalysis.h"
	#include "llvm/Analysis/CGSCCPassManager.h"			#include "llvm/Analysis/CGSCCPassManager.h"
	#include "llvm/Analysis/CallGraph.h"			#include "llvm/Analysis/CallGraph.h"
	#include "llvm/Analysis/DemandedBits.h"			#include "llvm/Analysis/DemandedBits.h"
	#include "llvm/Analysis/DependenceAnalysis.h"			#include "llvm/Analysis/DependenceAnalysis.h"
	#include "llvm/Analysis/DominanceFrontier.h"			#include "llvm/Analysis/DominanceFrontier.h"
	▲ Show 20 Lines • Show All 797 Lines • Show Last 20 Lines

lib/Passes/PassRegistry.def

	Show First 20 Lines • Show All 166 Lines • ▼ Show 20 Lines
	FUNCTION_PASS("lcssa", LCSSAPass())			FUNCTION_PASS("lcssa", LCSSAPass())
	FUNCTION_PASS("loop-data-prefetch", LoopDataPrefetchPass())			FUNCTION_PASS("loop-data-prefetch", LoopDataPrefetchPass())
	FUNCTION_PASS("loop-distribute", LoopDistributePass())			FUNCTION_PASS("loop-distribute", LoopDistributePass())
	FUNCTION_PASS("loop-vectorize", LoopVectorizePass())			FUNCTION_PASS("loop-vectorize", LoopVectorizePass())
	FUNCTION_PASS("print", PrintFunctionPass(dbgs()))			FUNCTION_PASS("print", PrintFunctionPass(dbgs()))
	FUNCTION_PASS("print<assumptions>", AssumptionPrinterPass(dbgs()))			FUNCTION_PASS("print<assumptions>", AssumptionPrinterPass(dbgs()))
	FUNCTION_PASS("print<block-freq>", BlockFrequencyPrinterPass(dbgs()))			FUNCTION_PASS("print<block-freq>", BlockFrequencyPrinterPass(dbgs()))
	FUNCTION_PASS("print<branch-prob>", BranchProbabilityPrinterPass(dbgs()))			FUNCTION_PASS("print<branch-prob>", BranchProbabilityPrinterPass(dbgs()))
				FUNCTION_PASS("print<captures>", CaptureTrackingPrinterPass(dbgs()))
	FUNCTION_PASS("print<domtree>", DominatorTreePrinterPass(dbgs()))			FUNCTION_PASS("print<domtree>", DominatorTreePrinterPass(dbgs()))
	FUNCTION_PASS("print<postdomtree>", PostDominatorTreePrinterPass(dbgs()))			FUNCTION_PASS("print<postdomtree>", PostDominatorTreePrinterPass(dbgs()))
	FUNCTION_PASS("print<demanded-bits>", DemandedBitsPrinterPass(dbgs()))			FUNCTION_PASS("print<demanded-bits>", DemandedBitsPrinterPass(dbgs()))
	FUNCTION_PASS("print<domfrontier>", DominanceFrontierPrinterPass(dbgs()))			FUNCTION_PASS("print<domfrontier>", DominanceFrontierPrinterPass(dbgs()))
	FUNCTION_PASS("print<loops>", LoopPrinterPass(dbgs()))			FUNCTION_PASS("print<loops>", LoopPrinterPass(dbgs()))
	FUNCTION_PASS("print<memoryssa>", MemorySSAPrinterPass(dbgs()))			FUNCTION_PASS("print<memoryssa>", MemorySSAPrinterPass(dbgs()))
	FUNCTION_PASS("print<regions>", RegionInfoPrinterPass(dbgs()))			FUNCTION_PASS("print<regions>", RegionInfoPrinterPass(dbgs()))
	FUNCTION_PASS("print<scalar-evolution>", ScalarEvolutionPrinterPass(dbgs()))			FUNCTION_PASS("print<scalar-evolution>", ScalarEvolutionPrinterPass(dbgs()))
	▲ Show 20 Lines • Show All 44 Lines • Show Last 20 Lines

lib/Transforms/Scalar/DeadStoreElimination.cpp

Show First 20 Lines • Show All 1,083 Lines • ▼ Show 20 Lines	while (InstDep.isDef() \|\| InstDep.isClobber()) {
if (DepIndex <= LastThrowingInstIndex) {		if (DepIndex <= LastThrowingInstIndex) {
const Value* Underlying = GetUnderlyingObject(DepLoc.Ptr, DL);		const Value* Underlying = GetUnderlyingObject(DepLoc.Ptr, DL);
bool IsStoreDeadOnUnwind = isa<AllocaInst>(Underlying);		bool IsStoreDeadOnUnwind = isa<AllocaInst>(Underlying);
if (!IsStoreDeadOnUnwind) {		if (!IsStoreDeadOnUnwind) {
// We're looking for a call to an allocation function		// We're looking for a call to an allocation function
// where the allocation doesn't escape before the last		// where the allocation doesn't escape before the last
// throwing instruction; PointerMayBeCaptured		// throwing instruction; PointerMayBeCaptured
// reasonably fast approximation.		// reasonably fast approximation.
IsStoreDeadOnUnwind = isAllocLikeFn(Underlying, TLI) &&		IsStoreDeadOnUnwind =
!PointerMayBeCaptured(Underlying, false, true);		isAllocLikeFn(Underlying, TLI) &&
		!PointerMayBeCaptured(Underlying, false, false, AA);
}		}
if (!IsStoreDeadOnUnwind)		if (!IsStoreDeadOnUnwind)
break;		break;
}		}

// If we find a write that is a) removable (i.e., non-volatile), b) is		// If we find a write that is a) removable (i.e., non-volatile), b) is
// completely obliterated by the store to 'Loc', and c) which we know that		// completely obliterated by the store to 'Loc', and c) which we know that
// 'Inst' doesn't load from, then we can remove it.		// 'Inst' doesn't load from, then we can remove it.
▲ Show 20 Lines • Show All 145 Lines • Show Last 20 Lines

lib/Transforms/Scalar/LICM.cpp

Show First 20 Lines • Show All 244 Lines • ▼ Show 20 Lines	bool LoopInvariantCodeMotion::runOnLoop(Loop L, AliasAnalysis AA,
// memory references to scalars that we can.		// memory references to scalars that we can.
if (!DisablePromotion && (Preheader \|\| L->hasDedicatedExits())) {		if (!DisablePromotion && (Preheader \|\| L->hasDedicatedExits())) {
SmallVector<BasicBlock *, 8> ExitBlocks;		SmallVector<BasicBlock *, 8> ExitBlocks;
SmallVector<Instruction *, 8> InsertPts;		SmallVector<Instruction *, 8> InsertPts;
PredIteratorCache PIC;		PredIteratorCache PIC;

// Loop over all of the alias sets in the tracker object.		// Loop over all of the alias sets in the tracker object.
for (AliasSet &AS : *CurAST)		for (AliasSet &AS : *CurAST)
Changed \|= promoteLoopAccessesToScalars(		Changed \|=
AS, ExitBlocks, InsertPts, PIC, LI, DT, TLI, L, CurAST, &SafetyInfo);		promoteLoopAccessesToScalars(AS, ExitBlocks, InsertPts, PIC, LI, DT,
		TLI, AA, L, CurAST, &SafetyInfo);

// Once we have promoted values across the loop body we have to recursively		// Once we have promoted values across the loop body we have to recursively
// reform LCSSA as any nested loop may now have values defined within the		// reform LCSSA as any nested loop may now have values defined within the
// loop used in the outer loop.		// loop used in the outer loop.
// FIXME: This is really heavy handed. It would be a bit better to use an		// FIXME: This is really heavy handed. It would be a bit better to use an
// SSAUpdater strategy during promotion that was LCSSA aware and reformed		// SSAUpdater strategy during promotion that was LCSSA aware and reformed
// it as it went.		// it as it went.
if (Changed) {		if (Changed) {
▲ Show 20 Lines • Show All 575 Lines • ▼ Show 20 Lines
/// loop and moving loads to before the loop. We do this by looping over		/// loop and moving loads to before the loop. We do this by looping over
/// the stores in the loop, looking for stores to Must pointers which are		/// the stores in the loop, looking for stores to Must pointers which are
/// loop invariant.		/// loop invariant.
///		///
bool llvm::promoteLoopAccessesToScalars(		bool llvm::promoteLoopAccessesToScalars(
AliasSet &AS, SmallVectorImpl<BasicBlock *> &ExitBlocks,		AliasSet &AS, SmallVectorImpl<BasicBlock *> &ExitBlocks,
SmallVectorImpl<Instruction *> &InsertPts, PredIteratorCache &PIC,		SmallVectorImpl<Instruction *> &InsertPts, PredIteratorCache &PIC,
LoopInfo LI, DominatorTree DT, const TargetLibraryInfo *TLI,		LoopInfo LI, DominatorTree DT, const TargetLibraryInfo *TLI,
Loop CurLoop, AliasSetTracker CurAST, LoopSafetyInfo *SafetyInfo) {		AliasAnalysis AA, Loop CurLoop, AliasSetTracker *CurAST,
		LoopSafetyInfo *SafetyInfo) {
// Verify inputs.		// Verify inputs.
assert(LI != nullptr && DT != nullptr && CurLoop != nullptr &&		assert(LI != nullptr && DT != nullptr && CurLoop != nullptr &&
CurAST != nullptr && SafetyInfo != nullptr &&		CurAST != nullptr && SafetyInfo != nullptr &&
"Unexpected Input to promoteLoopAccessesToScalars");		"Unexpected Input to promoteLoopAccessesToScalars");

// We can promote this alias set if it has a store, if it is a "Must" alias		// We can promote this alias set if it has a store, if it is a "Must" alias
// set, if the pointer is loop invariant, and if we are not eliminating any		// set, if the pointer is loop invariant, and if we are not eliminating any
// volatile loads or stores.		// volatile loads or stores.
▲ Show 20 Lines • Show All 147 Lines • ▼ Show 20 Lines	bool llvm::promoteLoopAccessesToScalars(

// Check legality per comment above. Otherwise, we can't promote.		// Check legality per comment above. Otherwise, we can't promote.
bool PromotionIsLegal = GuaranteedToExecute;		bool PromotionIsLegal = GuaranteedToExecute;
if (!PromotionIsLegal && CanSpeculateLoad) {		if (!PromotionIsLegal && CanSpeculateLoad) {
// If this is a thread local location, then we can insert stores along		// If this is a thread local location, then we can insert stores along
// paths which originally didn't have them without violating the memory		// paths which originally didn't have them without violating the memory
// model.		// model.
Value *Object = GetUnderlyingObject(SomePtr, MDL);		Value *Object = GetUnderlyingObject(SomePtr, MDL);
PromotionIsLegal =		PromotionIsLegal = isAllocLikeFn(Object, TLI) &&
isAllocLikeFn(Object, TLI) && !PointerMayBeCaptured(Object, true, true);		!PointerMayBeCaptured(Object, true, false, AA);
}		}
if (!PromotionIsLegal)		if (!PromotionIsLegal)
return Changed;		return Changed;

// Figure out the loop exits and their insertion points, if this is the		// Figure out the loop exits and their insertion points, if this is the
// first promotion.		// first promotion.
if (ExitBlocks.empty()) {		if (ExitBlocks.empty()) {
CurLoop->getUniqueExitBlocks(ExitBlocks);		CurLoop->getUniqueExitBlocks(ExitBlocks);
▲ Show 20 Lines • Show All 155 Lines • Show Last 20 Lines

This is an archive of the discontinued LLVM Phabricator instance.

[CaptureTracking] Add optimistic capture tracker for storesAbandonedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 81360

include/llvm/Analysis/CaptureTracking.h

include/llvm/Transforms/Utils/LoopUtils.h

lib/Analysis/CaptureTracking.cpp

lib/Passes/PassBuilder.cpp

lib/Passes/PassRegistry.def

lib/Transforms/Scalar/DeadStoreElimination.cpp

lib/Transforms/Scalar/LICM.cpp

[CaptureTracking] Add optimistic capture tracker for stores
AbandonedPublic