This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/
-
include/llvm/
-
llvm/
-
Analysis/
-
AliasAnalysis.h
-
CaptureTracking.h
-
OrderedBasicBlock.h
-
OrderedInstructions.h
-
IR/
11
BasicBlock.h
-
Instruction.h
1
SymbolTableListTraits.h
-
lib/
-
Analysis/
-
AliasAnalysis.cpp
-
CMakeLists.txt
-
CaptureTracking.cpp
-
InstructionPrecedenceTracking.cpp
-
MemoryDependenceAnalysis.cpp
-
OrderedBasicBlock.cpp
1/2
OrderedInstructions.cpp
-
IR/
3
BasicBlock.cpp
1/1
Instruction.cpp
3
SymbolTableListTraitsImpl.h
-
Transforms/
-
Scalar/
1/1
DeadStoreElimination.cpp
-
Vectorize/
-
LoadStoreVectorizer.cpp
-
unittests/
-
Analysis/
-
CMakeLists.txt
-
OrderedBasicBlockTest.cpp
-
IR/
1/3
BasicBlockTest.cpp

Differential D51664

[IR] Lazily number instructions for local dominance queries
ClosedPublic

Authored by rnk on Sep 4 2018, 4:45 PM.

Download Raw Diff

Details

Reviewers

efriedma
• dberlin
• gbiv
george.burgess.iv
ahatanak
hfinkel
lattner

Commits

rG7d0b07c08a38: [IR] Lazily number instructions for local dominance queries
rG0c2b09a9b624: [IR] Lazily number instructions for local dominance queries

Summary

Essentially, fold OrderedBasicBlock into BasicBlock, and make it
auto-invalidate the instruction ordering when new instructions are
added. Notably, we don't need to invalidate it when removing
instructions, which is helpful when a pass mostly delete dead
instructions rather than transforming them.

The downside is that Instruction grows from 56 bytes to 64 bytes. The
resulting LLVM code is substantially simpler and automatically handles
invalidation, which makes me think that this is the right speed and size
tradeoff. There's more low-hanging fruit in MemorySSA and DSE, which
maintain their own instruction orderings today.

The important change is in SymbolTableTraitsImpl.h, where the numbering
is invalidated. Everything else should be straightforward.

We probably want to implement a fancier re-numbering scheme so that
local updates don't invalidate the ordering, but I plan for that to be
future work, maybe for someone else.

Diff Detail

Build Status

Buildable 22245
Build 22245: arc lint + arc unit

Event Timeline

rnk created this revision.Sep 4 2018, 4:45 PM

Herald added a reviewer: george.burgess.iv. · View Herald TranscriptSep 4 2018, 4:45 PM

Herald added subscribers: george.burgess.iv, JDevlieghere, hiraditya and 2 others. · View Herald Transcript

Harbormaster completed remote builds in B22245: Diff 163948.Sep 4 2018, 4:45 PM

rnk added inline comments.Sep 4 2018, 5:05 PM

llvm/include/llvm/IR/SymbolTableListTraits.h
61–64	Oops, I'll revert this.

remove stale changes

Harbormaster completed remote builds in B22247: Diff 163955.Sep 4 2018, 5:17 PM

I have zero context on the various ways we can/do add instructions to a BB, so I can't immediately be helpful with the tricky part of this review. :) I've no complaints about the general direction of this patch, though.

When the code's more polished, would it be reasonable to also stick a verifier of this somewhere (that we maybe just run per-query or per-pass on EXPENSIVE_CHECKS builds, or something)?

In D51664#1224779, @george.burgess.iv wrote:

When the code's more polished, would it be reasonable to also stick a verifier of this somewhere (that we maybe just run per-query or per-pass on EXPENSIVE_CHECKS builds, or something)?

We could do it if expensive checks are enabled, but I think checks in NDEBUG that don't affect the asymptotic performance are the most useful. Maybe before inserting an instruction, if the numbering was previously considered valid, we can assert that it actually was? This way, we do linear work the first time we create the ordering, and then do linear work when we invalidate it. We could also check when destroying the BB, since that also does linear work.

add instruction order validation
invalidate order when splicing instructions
call transferNodesFromList for same-list transfers

Herald added a subscriber: dexonsmith. · View Herald TranscriptSep 5 2018, 11:32 AM

Harbormaster completed remote builds in B22277: Diff 164089.Sep 5 2018, 11:34 AM

This is awesome to see!

@ahatanak, you had a prototype of something similar a couple of years ago. Can you take a look?

The downside is that Instruction grows from 56 bytes to 64 bytes,

What's the growth for 32-bit pointers?

and I
don't have a good way to measure what that costs in practice. As the one
who removed the vtable from Value, I will say that this is how I would
like to spend the 8 bytes that I saved a year ago in r303362.

The two memory-sensitive cases I'm aware of are embedded compilers and LTO (with debug info).

It would be good to get someone to comment on the former.

On the latter: we still support users of -flto=full, so it would be interesting to know how big a regression in peak memory this would be. In the past, I've gotten numbers by running a build with -flto, using -save-temps when invoking the ld64, and then running llc on the optimized (but not CodeGen'ed) bitcode file (since the peak is in CodeGen). If you're really curious, that still seems like a decent way to look at the impact. But I doubt this will even register a difference; Instructions are usually malloc-allocated, and malloc is often 16-byte aligned.

dmgreen added a subscriber: dmgreen.Sep 8 2018, 6:43 AM

fhahn mentioned this in D51923: [NFC] Add validation to Ordered Instructions.Sep 11 2018, 4:07 AM

Any opinions on this? I'm eager to get it in so I can release it and get some faster builds, but I know it's a core data structure change.

FWIW, it seems like OrderedBB invalidation is causing bugs at least in LoopSafetyInfo (D50377) which @mkazantsev is currently working on fixing. There might be other places that get this wrong too, so having automatic invalidation seems like another good plus on top of the speedups.

This looks good to me, but like said, I don't have sufficient context to stamp it with great confidence.

Thanks again!

llvm/include/llvm/IR/BasicBlock.h
433	nit: I'd expect a function named this to `assert()`. Can we rename it to something that sounds more boolean-y like `isInstrOrderingValid()` (?) and/or add `LLVM_NODISCARD`?
433	nit * 2: I don't know what precedent is set elsewhere, but personally, I feel that if the intent is for this to only be used in NDEBUG builds, we should wrap this function decl in `#ifndef NDEBUG`. Mostly because I'd prefer the compiler tells me to reconsider my life choices over silently getting potentially-incorrect results. If it turns out that there's a case where the latter is preferable, it's a ~4 line diff. :)

I've run a big corpus of fuzz tests on this patch and it passed OK. So the patch seems good to me in terms of stability. Unfortunately I don't have time to give a proper code review on that. :(

It also makes my work on ICF tracking much easier as @fhahn has mentioned. So I'd also be happy to see it checked in. :)

mkazantsev mentioned this in D52017: [NFC] Introduce surgical invalidation of IPT.Sep 13 2018, 12:59 AM

mkazantsev added a child revision: D52017: [NFC] Introduce surgical invalidation of IPT.

mkazantsev mentioned this in D50377: [LICM] Use ICFLoopSafetyInfo in LICM.Sep 13 2018, 8:59 PM

@gbiv suggested that I test this on a large codebase, so I went ahead and built the Chrome unit_tests target with this, and the validation checks passed.

I'm going to address his comments, and otherwise I think we should go forward with this.

vsk added a subscriber: vsk.Sep 15 2018, 12:48 AM

I don't see any fundamental flaws in the algorithm, it looks pretty robust. I have some nit comments, otherwise it LGTM. (Note that I'm maybe not the most qualified person to approve changes in such fundamental components as BasicBlock and Instruction, but this change seems profitable).

llvm/unittests/IR/BasicBlockTest.cpp
152	`EXPECT_TRUE/FALSE(BB->isInstrOrderValid())` before and after that to make sure that it works at all?

mkazantsev added inline comments.Sep 16 2018, 8:31 PM

llvm/include/llvm/IR/BasicBlock.h
418	I would rather use some named constant instead of `1`; it is widespread use across the code and may be confusing for a reader. Just a suggestion.
433	Agreed here.
llvm/lib/IR/Instruction.cpp
102	Maybe also makes sense to assert that Parent is not nullptr (i.e. instructions not detached).
llvm/unittests/IR/BasicBlockTest.cpp
186	Do you mind adding the similar check for `Instruction->removeFromParent` and `Instruction->eraseFromParent`?

Use private bitfield for subclass data
Change cached order invalidation
Add tests for removal and erasure

Harbormaster completed remote builds in B22836: Diff 166056.Sep 18 2018, 6:18 PM

rnk added inline comments.Sep 18 2018, 6:22 PM

llvm/include/llvm/IR/BasicBlock.h
418	I rewrote this to use a bitfield. I think it's easier to understand now.
433	So, `isInstrOrderingValid` would conflict with `isInstrOrderValid`, which gives you the cached answer. I'll just make it void and assert internally in !NDEBUG builds.
llvm/unittests/IR/BasicBlockTest.cpp
186	Done, but they don't invalidate ordering, so I check for that instead.

Fix one last use of setValueSubclassData

Harbormaster completed remote builds in B22838: Diff 166059.Sep 18 2018, 6:23 PM

kuhar added a subscriber: kuhar.Sep 20 2018, 7:34 AM

aditya_nandakumar added a subscriber: aditya_nandakumar.Sep 20 2018, 9:09 PM

kuhar added inline comments.Sep 21 2018, 1:08 PM

llvm/include/llvm/IR/BasicBlock.h
455	Does it make sense to disable this when EXPENSIVE_CHECKS are set?
llvm/lib/Analysis/OrderedInstructions.cpp
32	Is the separate function `localDominates` still needed? Seems like the body is trivial and could be inlined here?
llvm/lib/IR/BasicBlock.cpp
496	Is it possible to use noncontiguous indices? If the indices are spread apart, you should be able to perform most insertions without renumbering instructions.
llvm/lib/IR/SymbolTableListTraitsImpl.h
96	Isn't it enough to invalidate only the indices of instructions that follow the first inserted one?

smeenai added a subscriber: smeenai.Sep 21 2018, 1:19 PM

tvvikram added a subscriber: tvvikram.Sep 21 2018, 9:38 PM

remove localDominates

Any other thoughts on this?

llvm/include/llvm/IR/BasicBlock.h
455	I would assume EXPENSIVE_CHECKS implies !NDEBUG, and these checks are on when assertions enabled.
llvm/lib/Analysis/OrderedInstructions.cpp
32	Sure, fixed.
llvm/lib/IR/BasicBlock.cpp
496	I want to put that out of scope of the initial change. We can do all kinds of fancy tricks here to avoid invalidating the ordering, but it's hard to provide meaningfully better algorithmic guarantees. And, the more complex code will require more complex testing, and it might have bugs. I'd rather come back and implement a more complex algorithm once profiling shows that there is a bottleneck, especially since it's often easier to remove these bottlenecks by delaying insertion.
llvm/lib/IR/SymbolTableListTraitsImpl.h
96	Yes, but recording that info and leveraging it is complex, and it doesn't change the asymptotic performance. We'd need more than a bit in BasicBlock to do it.

Harbormaster completed remote builds in B22992: Diff 166712.Sep 24 2018, 10:10 AM

rnk added a reviewer: hfinkel.Sep 24 2018, 10:19 AM

kuhar added inline comments.Sep 24 2018, 10:20 AM

llvm/include/llvm/IR/BasicBlock.h
455	I think you can use expensive checks independently of build type
llvm/lib/IR/BasicBlock.cpp
496	Sure, makes perfect sense. I'm not very familiar with the IR part of llvm, but I'd prefer to see a comment that explains that in a relevant place if you believe that this is a good future direction.
llvm/lib/IR/SymbolTableListTraitsImpl.h
96	Makes sense. How expensive is it to add new data members to BasicBlock? Do you know of any attempts to stick some data inside and measure how it affects compilation times?

george.burgess.iv added inline comments.Sep 24 2018, 11:14 AM

llvm/include/llvm/IR/BasicBlock.h
455	Sounds to me like a bug in `EXPENSIVE_CHECKS` if it can be used without assertions. Personally, every time I've written `#if EXPENSIVE_CHECKS` code, I've assumed that assertions are on. Glancing at a handful of users, the most common case I can find by far is: #ifdef EXPENSIVE_CHECKS assert(...); // Or call a function that just does a lot of asserts #endif Moreover, I can't think of a case where I'd say "build an LLVM that spends as much time as it wants verifying itself, but not with assertions."

The downside is that Instruction grows from 56 bytes to 64 bytes, and I don't have a good way to measure what that costs in practice.

In the commit that removed the table you said "Removing the virtual table pointer from Value saves 1% of RSS when doing LTO of llc on Linux."; so I'd expect the regression to be in the same ballpark?

In D51664#1243763, @sanjoy wrote:

The downside is that Instruction grows from 56 bytes to 64 bytes, and I don't have a good way to measure what that costs in practice.

In the commit that removed the table you said "Removing the virtual table pointer from Value saves 1% of RSS when doing LTO of llc on Linux."; so I'd expect the regression to be in the same ballpark?

I've started running a full LTO step of llc, but it's taking quite a while (>20min). I recall I picked LTO of llc last time because it completed in a few minutes, so I could repeat the measurement a few times to build confidence that it wasn't noise. Something may have changed. :( We'll see what comes back soon, I guess.

If I get no results, at least 1% RSS is an upper bound on increased LTO memory usage. I'm happy to trade that for 40% shorter compile time of the slowest TUs in clang.

If I get no results, at least 1% RSS is an upper bound on increased LTO memory usage. I'm happy to trade that for 40% shorter compile time of the slowest TUs in clang.

That I agree with. :)

Though, as I said on the llvm-dev thread, we may be able to get the best of both worlds by using something like the waymarking algorithm.

In D51664#1243954, @sanjoy wrote:

If I get no results, at least 1% RSS is an upper bound on increased LTO memory usage. I'm happy to trade that for 40% shorter compile time of the slowest TUs in clang.

That I agree with. :)

I'm only patient enough to get two runs, before and after, and max RSS before the patch was 4816464 kb, and after, 4867836 kb. That's an increase of 1.06%. In absolute terms, ~50MB of wasted memory for Instruction positions feels high.

Though, as I said on the llvm-dev thread, we may be able to get the best of both worlds by using something like the waymarking algorithm.

That's probably possible, but I'm a little afraid to try to steal bits from ilist next/prev pointers. The complexity cost of stealing those bits from ilist is likely to be more than is really worth it, and it's not clear if we want to use linked lists over the long term. I would rather get right complexity first, and then make a follow-up change to try to reduce the memory usage. I don't want to let the perfect be the enemy of the good.

lattner added a subscriber: lattner.Sep 24 2018, 9:55 PM

As I mentioned on llvmdev, I'm strongly opposed to this patch without a significant amount of analysis and diligence applied to it. I'm concerned about both memory and compile time impact. We don't just add random caches to the core IR to speed up particular clients. We care a lot about sizeof(Instruction) and its subclasses, and this adds bloat to all of them. Similarly, you don't seem interested in evaluating "tricks" to make this efficient in practice (sparse numbering), nor have you evaluated other implementation choices that will probably also work with less impact. Let's continue discussing this on llvm-dev.

To be clear, I'm not necessarily saying that this patch is the wrong thing to do, I would just like more diligence and experimentation with alternate approaches. Thanks :-)

nikic added a subscriber: nikic.Oct 26 2018, 2:08 PM

nikic added inline comments.

llvm/include/llvm/IR/BasicBlock.h
458	This should probably be `BasicBlock::validateInstrOrdering()` rather than `Instruction::validateInstrOrdering()`.

takuto.ikuta added a subscriber: takuto.ikuta.Nov 28 2018, 2:40 PM

rebase

rebase

Harbormaster completed remote builds in B26816: Diff 181660.Jan 14 2019, 3:49 PM

Hi Reid,

After giving you a hard time about this a few months ago, I've come around to believing that this is the right thing to do. Certain classes of algorithms really do benefit from having a lexicographic ordering comparison that is fast, and I think that this general approach is the best way to go.

-Chris

I haven't reviewed the patch in full detail, but the predicate "comesBefore" should probably be something like "isBeforeInBlock".

In D51664#1357417, @lattner wrote:

After giving you a hard time about this a few months ago, I've come around to believing that this is the right thing to do. Certain classes of algorithms really do benefit from having a lexicographic ordering comparison that is fast, and I think that this general approach is the best way to go.

Thanks! I'll bring it up on the dev list. I do know that @chandlerc wants to see a version of this that uses out of line numbers in a hash table, similar to the way we maintain value names out of line in a symbol table, but still done as part of the IR. I wanted to prototype that and compare.

In D51664#1358377, @rnk wrote:

In D51664#1357417, @lattner wrote:

After giving you a hard time about this a few months ago, I've come around to believing that this is the right thing to do. Certain classes of algorithms really do benefit from having a lexicographic ordering comparison that is fast, and I think that this general approach is the best way to go.

Thanks! I'll bring it up on the dev list. I do know that @chandlerc wants to see a version of this that uses out of line numbers in a hash table, similar to the way we maintain value names out of line in a symbol table, but still done as part of the IR. I wanted to prototype that and compare.

To be super clear, I definitely want *some* solution here. I'm completely on board with this being a real problem and we should solve it.

And if even getting reasonable data proves to be tons of work, I think its fine to say that and move on. In our conversation I was worried we weren't even checking to see if a side table was an effective strategy. I don't have any reason to believe this is going to be so important that it is worth *tons* of effort to validate both alternatives, it just seems useful to try a quick prototype.

rebase over r351992

Harbormaster completed remote builds in B27224: Diff 183195.Jan 23 2019, 3:07 PM

Hi Reid,

Thanks for doing this.

Instead of changing the instruction and basic block classes, could we instead provide an enhanced version of ilist that does that?
E.g., something based on https://scholar.google.com/scholar?cluster=5225046542682967685&hl=en&as_sdt=0,5 (and we can add laziness on top if we want)

The rationale is that I wanted to do something similar in the Machine representation as well (and rework the SlotIndexes in the process) and was thinking that an improved version of ilist would do that for us while allowing to share the code in the middle-end and backend.

Bottom line, I was hoping this work would solve the dominance problem instead of basic block for the backend as well :).

Cheers,
-Quentin

Herald added a project: Restricted Project. · View Herald TranscriptFeb 7 2019, 4:15 PM

rebase

Herald added a subscriber: jdoerfert. · View Herald TranscriptMar 1 2019, 4:11 PM

Harbormaster completed remote builds in B28693: Diff 188995.Mar 1 2019, 4:11 PM

For DSE, it seems quite straight-forward to preserve OrderedBB (we just remove instructions or replace existing ones with another one). I've added D59789 sketching that. This could be a stop-gap until this patch gets through.

Herald added a subscriber: asbirlea. · View Herald TranscriptMar 25 2019, 11:23 AM

vsk mentioned this in D64630: [DebugInfo] Address performance regression with r364515.Oct 25 2019, 6:19 PM

Is there still interest in pushing this forward? I like the approach taken here. We also just got a report of clang spending 21 minutes inside of DeadStoreElimination, and the lion's share of that time was within OrderedBasicBlock::comesBefore..

Herald added a subscriber: bmahjour. · View Herald TranscriptFeb 13 2020, 5:26 PM

I still think we should do this. I think @fhahn is reimplementing DSE using MemorySSA, so presumably the DSE cases won't be an issue soon, but setting all that aside, I still think it would be nice if we could say once and for all that Instruction::dominates(Instruction*) is amortized O(1) if you haven't modified the instruction stream. Otherwise this kind of pathology will pop up again. Putting the ordering on the IR saves clients from ferrying around and maintaining OrderedInstructions / OrderedBasicBlock data structures, and that seems like a win.

I meant to add that @rriddle adopted this approach in MLIR:
https://github.com/llvm/llvm-project/blob/master/mlir/include/mlir/IR/Operation.h#L615

In D51664#1877029, @rnk wrote:

I still think we should do this. I think @fhahn is reimplementing DSE using MemorySSA, so presumably the DSE cases won't be an issue soon, but setting all that aside, I still think it would be nice if we could say once and for all that Instruction::dominates(Instruction*) is amortized O(1) if you haven't modified the instruction stream. Otherwise this kind of pathology will pop up again. Putting the ordering on the IR saves clients from ferrying around and maintaining OrderedInstructions / OrderedBasicBlock data structures, and that seems like a win.

I'm strongly in favour of this too. Is there any reviewer you're specifically waiting for? Or should the RFC be bumped to ensure there's consensus before you spend the time rebasing?

In D51664#1877032, @rnk wrote:

I meant to add that @rriddle adopted this approach in MLIR:
https://github.com/llvm/llvm-project/blob/master/mlir/include/mlir/IR/Operation.h#L615

Yes, we've been using this in MLIR for at least a year now. There have been quite a few cases in MLIR, unrelated to DSE, that have really benefited from O(1) dominance checks readily available. There has only been one situation that I can recall where recomputing the block order showed up on a profile, but adding in some basic striding to the order assignment completely fixed it.

rnk mentioned this in D74642: [CodeGenPrepare] Speed up placeDbgValues, NFC.Feb 14 2020, 1:26 PM

rnk edited the summary of this revision. (Show Details)Feb 14 2020, 1:36 PM

rebase

Herald added a subscriber: mgrang. · View Herald TranscriptFeb 14 2020, 1:37 PM

I reposted the RFC to llvm-dev, and I think this time we'll reach a different consensus.

llvm/lib/Transforms/Scalar/DeadStoreElimination.cpp
1265–1267	I don't have an equivalent API for this. I don't have a good way to profile to check if that is or is not the case.

Harbormaster failed remote builds in B46551: Diff 244758!Feb 14 2020, 2:30 PM

I haven't carefully reviewed the patch, but I think this is the right thing to do architecturally for the compiler. Thank you for driving this Reid. I'd appreciate it if someone could scrutinize the patch though!

This revision is now accepted and ready to land.Feb 15 2020, 11:58 AM

Thanks! I was planning to wait a bit more after restarting the RFC to land this, but I think we have a lot of support here, and I don't have a lot of reasons to wait. I'm going to go ahead and push it, and if there are problems or objections, we can revert.

Closed by commit rG0c2b09a9b624: [IR] Lazily number instructions for local dominance queries (authored by rnk). · Explain WhyFeb 18 2020, 2:46 PM

This revision was automatically updated to reflect the committed changes.

WOW! Thanks!

In D51664#1881963, @mkazantsev wrote:

WOW! Thanks!

I think that this is a really nice step forward and will make a lot of things easier in the future.

daltenty mentioned this in D74911: [AIX] Pack BasicBlockBits.Feb 20 2020, 9:38 AM

daltenty mentioned this in rG967eeb109bed: [AIX] Pack BasicBlockBits.Feb 20 2020, 11:33 AM

fhahn mentioned this in D76228: [ValueTracking] Use Inst::comesBefore in isValidAssumeForCtx (NFC)..Mar 16 2020, 5:52 AM

fhahn mentioned this in rG47ee40407562: [ValueTracking] Use Inst::comesBefore in isValidAssumeForCtx (NFC)..Apr 5 2020, 4:47 AM

qcolombet mentioned this in D87297: [GlobalISel] Add bailout thresholds to CSEMIRBuilder::dominates() and the localizer..Sep 9 2020, 12:20 PM

Revision Contents

Path

Size

llvm/

include/

llvm/

Analysis/

AliasAnalysis.h

11 lines

CaptureTracking.h

8 lines

OrderedBasicBlock.h

OrderedInstructions.h

18 lines

IR/

BasicBlock.h

21 lines

Instruction.h

12 lines

SymbolTableListTraits.h

11 lines

lib/

Analysis/

AliasAnalysis.cpp

9 lines

CMakeLists.txt

1 line

CaptureTracking.cpp

26 lines

InstructionPrecedenceTracking.cpp

3 lines

MemoryDependenceAnalysis.cpp

9 lines

OrderedBasicBlock.cpp

OrderedInstructions.cpp

11 lines

IR/

BasicBlock.cpp

14 lines

Instruction.cpp

7 lines

SymbolTableListTraitsImpl.h

6 lines

Transforms/

Scalar/

DeadStoreElimination.cpp

3 lines

Vectorize/

LoadStoreVectorizer.cpp

16 lines

unittests/

Analysis/

CMakeLists.txt

1 line

OrderedBasicBlockTest.cpp

IR/

BasicBlockTest.cpp

32 lines

Diff 163948

llvm/include/llvm/Analysis/AliasAnalysis.h

Show First 20 Lines • Show All 664 Lines • ▼ Show 20 Lines	public:

/// Return information about whether two call sites may refer to the same set		/// Return information about whether two call sites may refer to the same set
/// of memory locations. See the AA documentation for details:		/// of memory locations. See the AA documentation for details:
/// http://llvm.org/docs/AliasAnalysis.html#ModRefInfo		/// http://llvm.org/docs/AliasAnalysis.html#ModRefInfo
ModRefInfo getModRefInfo(ImmutableCallSite CS1, ImmutableCallSite CS2);		ModRefInfo getModRefInfo(ImmutableCallSite CS1, ImmutableCallSite CS2);

/// Return information about whether a particular call site modifies		/// Return information about whether a particular call site modifies
/// or reads the specified memory location \p MemLoc before instruction \p I		/// or reads the specified memory location \p MemLoc before instruction \p I
/// in a BasicBlock. An ordered basic block \p OBB can be used to speed up		/// in a BasicBlock.
/// instruction ordering queries inside the BasicBlock containing \p I.
/// Early exits in callCapturesBefore may lead to ModRefInfo::Must not being		/// Early exits in callCapturesBefore may lead to ModRefInfo::Must not being
/// set.		/// set.
ModRefInfo callCapturesBefore(const Instruction *I,		ModRefInfo callCapturesBefore(const Instruction *I,
const MemoryLocation &MemLoc, DominatorTree *DT,		const MemoryLocation &MemLoc, DominatorTree *DT);
OrderedBasicBlock *OBB = nullptr);

/// A convenience wrapper to synthesize a memory location.		/// A convenience wrapper to synthesize a memory location.
ModRefInfo callCapturesBefore(const Instruction I, const Value P,		ModRefInfo callCapturesBefore(const Instruction I, const Value P,
LocationSize Size, DominatorTree *DT,		LocationSize Size, DominatorTree *DT) {
OrderedBasicBlock *OBB = nullptr) {		return callCapturesBefore(I, MemoryLocation(P, Size), DT);
return callCapturesBefore(I, MemoryLocation(P, Size), DT, OBB);
}		}

/// @}		/// @}
//===--------------------------------------------------------------------===//		//===--------------------------------------------------------------------===//
/// \name Higher level methods for querying mod/ref information.		/// \name Higher level methods for querying mod/ref information.
/// @{		/// @{

/// Check if it is possible for execution of the specified basic block to		/// Check if it is possible for execution of the specified basic block to
▲ Show 20 Lines • Show All 412 Lines • Show Last 20 Lines

llvm/include/llvm/Analysis/CaptureTracking.h

	Show All 14 Lines
	#define LLVM_ANALYSIS_CAPTURETRACKING_H			#define LLVM_ANALYSIS_CAPTURETRACKING_H

	namespace llvm {			namespace llvm {

	class Value;			class Value;
	class Use;			class Use;
	class Instruction;			class Instruction;
	class DominatorTree;			class DominatorTree;
	class OrderedBasicBlock;

	/// PointerMayBeCaptured - Return true if this pointer value may be captured			/// PointerMayBeCaptured - Return true if this pointer value may be captured
	/// by the enclosing function (which is required to exist). This routine can			/// by the enclosing function (which is required to exist). This routine can
	/// be expensive, so consider caching the results. The boolean ReturnCaptures			/// be expensive, so consider caching the results. The boolean ReturnCaptures
	/// specifies whether returning the value (or part of it) from the function			/// specifies whether returning the value (or part of it) from the function
	/// counts as capturing it or not. The boolean StoreCaptures specified			/// counts as capturing it or not. The boolean StoreCaptures specified
	/// whether storing the value (or part of it) into memory anywhere			/// whether storing the value (or part of it) into memory anywhere
	/// automatically counts as capturing it or not.			/// automatically counts as capturing it or not.
	bool PointerMayBeCaptured(const Value *V,			bool PointerMayBeCaptured(const Value *V,
	bool ReturnCaptures,			bool ReturnCaptures,
	bool StoreCaptures);			bool StoreCaptures);

	/// PointerMayBeCapturedBefore - Return true if this pointer value may be			/// PointerMayBeCapturedBefore - Return true if this pointer value may be
	/// captured by the enclosing function (which is required to exist). If a			/// captured by the enclosing function (which is required to exist). If a
	/// DominatorTree is provided, only captures which happen before the given			/// DominatorTree is provided, only captures which happen before the given
	/// instruction are considered. This routine can be expensive, so consider			/// instruction are considered. This routine can be expensive, so consider
	/// caching the results. The boolean ReturnCaptures specifies whether			/// caching the results. The boolean ReturnCaptures specifies whether
	/// returning the value (or part of it) from the function counts as capturing			/// returning the value (or part of it) from the function counts as capturing
	/// it or not. The boolean StoreCaptures specified whether storing the value			/// it or not. The boolean StoreCaptures specified whether storing the value
	/// (or part of it) into memory anywhere automatically counts as capturing it			/// (or part of it) into memory anywhere automatically counts as capturing it
	/// or not. Captures by the provided instruction are considered if the			/// or not. Captures by the provided instruction are considered if the
	/// final parameter is true. An ordered basic block in \p OBB could be used			/// final parameter is true.
	/// to speed up capture-tracker queries.
	bool PointerMayBeCapturedBefore(const Value *V, bool ReturnCaptures,			bool PointerMayBeCapturedBefore(const Value *V, bool ReturnCaptures,
	bool StoreCaptures, const Instruction *I,			bool StoreCaptures, const Instruction *I,
	const DominatorTree *DT, bool IncludeI = false,			const DominatorTree *DT,
	OrderedBasicBlock *OBB = nullptr);			bool IncludeI = false);

	/// This callback is used in conjunction with PointerMayBeCaptured. In			/// This callback is used in conjunction with PointerMayBeCaptured. In
	/// addition to the interface here, you'll need to provide your own getters			/// addition to the interface here, you'll need to provide your own getters
	/// to see whether anything was captured.			/// to see whether anything was captured.
	struct CaptureTracker {			struct CaptureTracker {
	virtual ~CaptureTracker();			virtual ~CaptureTracker();

	/// tooManyUses - The depth of traversal has breached a limit. There may be			/// tooManyUses - The depth of traversal has breached a limit. There may be
	Show All 23 Lines

llvm/include/llvm/Analysis/OrderedBasicBlock.h

This file was deleted.

	//===- llvm/Analysis/OrderedBasicBlock.h --------------------- -- C++ --===//
	//
	// The LLVM Compiler Infrastructure
	//
	// This file is distributed under the University of Illinois Open Source
	// License. See LICENSE.TXT for details.
	//
	//===----------------------------------------------------------------------===//
	//
	// This file defines the OrderedBasicBlock class. OrderedBasicBlock maintains
	// an interface where clients can query if one instruction comes before another
	// in a BasicBlock. Since BasicBlock currently lacks a reliable way to query
	// relative position between instructions one can use OrderedBasicBlock to do
	// such queries. OrderedBasicBlock is lazily built on a source BasicBlock and
	// maintains an internal Instruction -> Position map. A OrderedBasicBlock
	// instance should be discarded whenever the source BasicBlock changes.
	//
	// It's currently used by the CaptureTracker in order to find relative
	// positions of a pair of instructions inside a BasicBlock.
	//
	//===----------------------------------------------------------------------===//

	#ifndef LLVM_ANALYSIS_ORDEREDBASICBLOCK_H
	#define LLVM_ANALYSIS_ORDEREDBASICBLOCK_H

	#include "llvm/ADT/DenseMap.h"
	#include "llvm/IR/BasicBlock.h"

	namespace llvm {

	class Instruction;
	class BasicBlock;

	class OrderedBasicBlock {
	private:
	/// Map a instruction to its position in a BasicBlock.
	SmallDenseMap<const Instruction *, unsigned, 32> NumberedInsts;

	/// Keep track of last instruction inserted into \p NumberedInsts.
	/// It speeds up queries for uncached instructions by providing a start point
	/// for new queries in OrderedBasicBlock::comesBefore.
	BasicBlock::const_iterator LastInstFound;

	/// The position/number to tag the next instruction to be found.
	unsigned NextInstPos;

	/// The source BasicBlock to map.
	const BasicBlock *BB;

	/// Given no cached results, find if \p A comes before \p B in \p BB.
	/// Cache and number out instruction while walking \p BB.
	bool comesBefore(const Instruction A, const Instruction B);

	public:
	OrderedBasicBlock(const BasicBlock *BasicB);

	/// Find out whether \p A dominates \p B, meaning whether \p A
	/// comes before \p B in \p BB. This is a simplification that considers
	/// cached instruction positions and ignores other basic blocks, being
	/// only relevant to compare relative instructions positions inside \p BB.
	/// Returns false for A == B.
	bool dominates(const Instruction A, const Instruction B);
	};

	} // End llvm namespace

	#endif

llvm/include/llvm/Analysis/OrderedInstructions.h

	//===- llvm/Transforms/Utils/OrderedInstructions.h -------------- C++ --===//			//===- llvm/Transforms/Utils/OrderedInstructions.h -------------- C++ --===//
	//			//
	// The LLVM Compiler Infrastructure			// The LLVM Compiler Infrastructure
	//			//
	// This file is distributed under the University of Illinois Open Source			// This file is distributed under the University of Illinois Open Source
	// License. See LICENSE.TXT for details.			// License. See LICENSE.TXT for details.
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//
	//			//
	// This file defines an efficient way to check for dominance relation between 2			// This file defines an efficient way to check for dominance relation between 2
	// instructions.			// instructions.
	//			//
	// This interface dispatches to appropriate dominance check given 2			// FIXME: This is really just a convenience wrapper to check dominance between
	// instructions, i.e. in case the instructions are in the same basic block,			// two arbitrary instructions in different basic blocks. We should fold it into
	// OrderedBasicBlock (with instruction numbering and caching) are used.			// DominatorTree, which is the more widely used interface.
	// Otherwise, dominator tree is used.
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	#ifndef LLVM_ANALYSIS_ORDEREDINSTRUCTIONS_H			#ifndef LLVM_ANALYSIS_ORDEREDINSTRUCTIONS_H
	#define LLVM_ANALYSIS_ORDEREDINSTRUCTIONS_H			#define LLVM_ANALYSIS_ORDEREDINSTRUCTIONS_H

	#include "llvm/ADT/DenseMap.h"			#include "llvm/ADT/DenseMap.h"
	#include "llvm/Analysis/OrderedBasicBlock.h"
	#include "llvm/IR/Dominators.h"			#include "llvm/IR/Dominators.h"
	#include "llvm/IR/Operator.h"			#include "llvm/IR/Operator.h"

	namespace llvm {			namespace llvm {

	class OrderedInstructions {			class OrderedInstructions {
	/// Used to check dominance for instructions in same basic block.
	mutable DenseMap<const BasicBlock *, std::unique_ptr<OrderedBasicBlock>>
	OBBMap;

	/// The dominator tree of the parent function.			/// The dominator tree of the parent function.
	DominatorTree *DT;			DominatorTree *DT;

	/// Return true if the first instruction comes before the second in the			/// Return true if the first instruction comes before the second in the
	/// same basic block. It will create an ordered basic block, if it does			/// same basic block. It will create an ordered basic block, if it does
	/// not yet exist in OBBMap.			/// not yet exist in OBBMap.
	bool localDominates(const Instruction , const Instruction ) const;			bool localDominates(const Instruction , const Instruction ) const;

	public:			public:
	/// Constructor.			/// Constructor.
	OrderedInstructions(DominatorTree *DT) : DT(DT) {}			OrderedInstructions(DominatorTree *DT) : DT(DT) {}

	/// Return true if first instruction dominates the second.			/// Return true if first instruction dominates the second.
	bool dominates(const Instruction , const Instruction ) const;			bool dominates(const Instruction , const Instruction ) const;

	/// Return true if the first instruction comes before the second in the			/// Return true if the first instruction comes before the second in the
	/// dominator tree DFS traversal if they are in different basic blocks,			/// dominator tree DFS traversal if they are in different basic blocks,
	/// or if the first instruction comes before the second in the same basic			/// or if the first instruction comes before the second in the same basic
	/// block.			/// block.
	bool dfsBefore(const Instruction , const Instruction ) const;			bool dfsBefore(const Instruction , const Instruction ) const;

	/// Invalidate the OrderedBasicBlock cache when its basic block changes.
	/// i.e. If an instruction is deleted or added to the basic block, the user
	/// should call this function to invalidate the OrderedBasicBlock cache for
	/// this basic block.
	void invalidateBlock(const BasicBlock *BB) { OBBMap.erase(BB); }
	};			};

	} // end namespace llvm			} // end namespace llvm

	#endif // LLVM_ANALYSIS_ORDEREDINSTRUCTIONS_H			#endif // LLVM_ANALYSIS_ORDEREDINSTRUCTIONS_H

llvm/include/llvm/IR/BasicBlock.h

Show First 20 Lines • Show All 378 Lines • ▼ Show 20 Lines	public:
/// keeping loop information consistent, use the SplitBlock utility function.		/// keeping loop information consistent, use the SplitBlock utility function.
BasicBlock *splitBasicBlock(iterator I, const Twine &BBName = "");		BasicBlock *splitBasicBlock(iterator I, const Twine &BBName = "");
BasicBlock splitBasicBlock(Instruction I, const Twine &BBName = "") {		BasicBlock splitBasicBlock(Instruction I, const Twine &BBName = "") {
return splitBasicBlock(I->getIterator(), BBName);		return splitBasicBlock(I->getIterator(), BBName);
}		}

/// Returns true if there are any uses of this basic block other than		/// Returns true if there are any uses of this basic block other than
/// direct branches, switches, etc. to it.		/// direct branches, switches, etc. to it.
bool hasAddressTaken() const { return getSubclassDataFromValue() != 0; }		bool hasAddressTaken() const {
		return (getSubclassDataFromValue() >> 1) != 0;
		}

/// Update all phi nodes in this basic block's successors to refer to basic		/// Update all phi nodes in this basic block's successors to refer to basic
/// block \p New instead of to it.		/// block \p New instead of to it.
void replaceSuccessorsPhiUsesWith(BasicBlock *New);		void replaceSuccessorsPhiUsesWith(BasicBlock *New);

/// Return true if this basic block is an exception handling block.		/// Return true if this basic block is an exception handling block.
bool isEHPad() const { return getFirstNonPHI()->isEHPad(); }		bool isEHPad() const { return getFirstNonPHI()->isEHPad(); }

Show All 10 Lines	return const_cast<LandingPadInst *>(
static_cast<const BasicBlock *>(this)->getLandingPadInst());		static_cast<const BasicBlock *>(this)->getLandingPadInst());
}		}

/// Return true if it is legal to hoist instructions into this block.		/// Return true if it is legal to hoist instructions into this block.
bool isLegalToHoistInto() const;		bool isLegalToHoistInto() const;

Optional<uint64_t> getIrrLoopHeaderWeight() const;		Optional<uint64_t> getIrrLoopHeaderWeight() const;

		/// Returns true if the Order field of child Instructions is valid.
		bool isInstrOrderValid() {
		return getSubclassDataFromValue() & 1;
		mkazantsevUnsubmitted Not Done Reply Inline Actions I would rather use some named constant instead of `1`; it is widespread use across the code and may be confusing for a reader. Just a suggestion. mkazantsev: I would rather use some named constant instead of `1`; it is widespread use across the code and…
		rnkAuthorUnsubmitted Not Done Reply Inline Actions I rewrote this to use a bitfield. I think it's easier to understand now. rnk: I rewrote this to use a bitfield. I think it's easier to understand now.
		}

		/// Mark instruction ordering invalid. Done on every instruction insert.
		void invalidateOrders() {
		setValueSubclassData(getSubclassDataFromValue() & ~1U);
		}

		/// Renumber instructions and mark the ordering as valid.
		void renumberInstructions();

private:		private:
/// Increment the internal refcount of the number of BlockAddresses		/// Increment the internal refcount of the number of BlockAddresses
/// referencing this BasicBlock by \p Amt.		/// referencing this BasicBlock by \p Amt.
///		///
/// This is almost always 0, sometimes one possibly, but almost never 2, and		/// This is almost always 0, sometimes one possibly, but almost never 2, and
		george.burgess.ivUnsubmitted Not Done Reply Inline Actions nit: I'd expect a function named this to `assert()`. Can we rename it to something that sounds more boolean-y like `isInstrOrderingValid()` (?) and/or add `LLVM_NODISCARD`? george.burgess.iv: nit: I'd expect a function named this to `assert()`. Can we rename it to something that sounds…
		george.burgess.ivUnsubmitted Not Done Reply Inline Actions nit * 2: I don't know what precedent is set elsewhere, but personally, I feel that if the intent is for this to only be used in NDEBUG builds, we should wrap this function decl in `#ifndef NDEBUG`. Mostly because I'd prefer the compiler tells me to reconsider my life choices over silently getting potentially-incorrect results. If it turns out that there's a case where the latter is preferable, it's a ~4 line diff. :) george.burgess.iv: nit * 2: I don't know what precedent is set elsewhere, but personally, I feel that if the…
		mkazantsevUnsubmitted Not Done Reply Inline Actions Agreed here. mkazantsev: Agreed here.
		rnkAuthorUnsubmitted Not Done Reply Inline Actions So, `isInstrOrderingValid` would conflict with `isInstrOrderValid`, which gives you the cached answer. I'll just make it void and assert internally in !NDEBUG builds. rnk: So, `isInstrOrderingValid` would conflict with `isInstrOrderValid`, which gives you the cached…
/// inconceivably 3 or more.		/// inconceivably 3 or more.
void AdjustBlockAddressRefCount(int Amt) {		void AdjustBlockAddressRefCount(int Amt) {
setValueSubclassData(getSubclassDataFromValue()+Amt);		setValueSubclassData(getSubclassDataFromValue() + (Amt << 1));
assert((int)(signed char)getSubclassDataFromValue() >= 0 &&		assert((int)(signed char)(getSubclassDataFromValue() >> 1) >= 0 &&
"Refcount wrap-around");		"Refcount wrap-around");
}		}

/// Shadow Value::setValueSubclassData with a private forwarding method so		/// Shadow Value::setValueSubclassData with a private forwarding method so
/// that any future subclasses cannot accidentally use it.		/// that any future subclasses cannot accidentally use it.
void setValueSubclassData(unsigned short D) {		void setValueSubclassData(unsigned short D) {
Value::setValueSubclassData(D);		Value::setValueSubclassData(D);
}		}
};		};

// Create wrappers for C Binding types (see CBindingWrapping.h).		// Create wrappers for C Binding types (see CBindingWrapping.h).
DEFINE_SIMPLE_CONVERSION_FUNCTIONS(BasicBlock, LLVMBasicBlockRef)		DEFINE_SIMPLE_CONVERSION_FUNCTIONS(BasicBlock, LLVMBasicBlockRef)

/// Advance \p It while it points to a debug instruction and return the result.		/// Advance \p It while it points to a debug instruction and return the result.
/// This assumes that \p It is not at the end of a block.		/// This assumes that \p It is not at the end of a block.
BasicBlock::iterator skipDebugIntrinsics(BasicBlock::iterator It);		BasicBlock::iterator skipDebugIntrinsics(BasicBlock::iterator It);

} // end namespace llvm		} // end namespace llvm
		kuharUnsubmitted Not Done Reply Inline Actions Does it make sense to disable this when EXPENSIVE_CHECKS are set? kuhar: Does it make sense to disable this when EXPENSIVE_CHECKS are set?
		rnkAuthorUnsubmitted Not Done Reply Inline Actions I would assume EXPENSIVE_CHECKS implies !NDEBUG, and these checks are on when assertions enabled. rnk: I would assume EXPENSIVE_CHECKS implies !NDEBUG, and these checks are on when assertions…
		kuharUnsubmitted Not Done Reply Inline Actions I think you can use expensive checks independently of build type kuhar: I think you can use expensive checks independently of build type
		george.burgess.ivUnsubmitted Not Done Reply Inline Actions Sounds to me like a bug in `EXPENSIVE_CHECKS` if it can be used without assertions. Personally, every time I've written `#if EXPENSIVE_CHECKS` code, I've assumed that assertions are on. Glancing at a handful of users, the most common case I can find by far is: #ifdef EXPENSIVE_CHECKS assert(...); // Or call a function that just does a lot of asserts #endif Moreover, I can't think of a case where I'd say "build an LLVM that spends as much time as it wants verifying itself, but not with assertions." george.burgess.iv: Sounds to me like a bug in `EXPENSIVE_CHECKS` if it can be used without assertions. Personally…

#endif // LLVM_IR_BASICBLOCK_H		#endif // LLVM_IR_BASICBLOCK_H
		nikicUnsubmitted Not Done Reply Inline Actions This should probably be `BasicBlock::validateInstrOrdering()` rather than `Instruction::validateInstrOrdering()`. nikic: This should probably be `BasicBlock::validateInstrOrdering()` rather than `Instruction…

llvm/include/llvm/IR/Instruction.h

Show All 40 Lines	template <> struct ilist_alloc_traits<Instruction> {
static inline void deleteNode(Instruction *V);		static inline void deleteNode(Instruction *V);
};		};

class Instruction : public User,		class Instruction : public User,
public ilist_node_with_parent<Instruction, BasicBlock> {		public ilist_node_with_parent<Instruction, BasicBlock> {
BasicBlock *Parent;		BasicBlock *Parent;
DebugLoc DbgLoc; // 'dbg' Metadata cache.		DebugLoc DbgLoc; // 'dbg' Metadata cache.

		/// Relative order of this instruction in its parent basic block. Used for
		/// O(1) local dominance checks between instructions.
		mutable unsigned Order = 0;

enum {		enum {
/// This is a bit stored in the SubClassData field which indicates whether		/// This is a bit stored in the SubClassData field which indicates whether
/// this instruction has metadata attached to it or not.		/// this instruction has metadata attached to it or not.
HasMetadataBit = 1 << 15		HasMetadataBit = 1 << 15
};		};

protected:		protected:
~Instruction(); // Use deleteValue() to delete a generic Instruction.		~Instruction(); // Use deleteValue() to delete a generic Instruction.
▲ Show 20 Lines • Show All 56 Lines • ▼ Show 20 Lines	public:
///		///
/// \pre I is a valid iterator into BB.		/// \pre I is a valid iterator into BB.
void moveBefore(BasicBlock &BB, SymbolTableList<Instruction>::iterator I);		void moveBefore(BasicBlock &BB, SymbolTableList<Instruction>::iterator I);

/// Unlink this instruction from its current basic block and insert it into		/// Unlink this instruction from its current basic block and insert it into
/// the basic block that MovePos lives in, right after MovePos.		/// the basic block that MovePos lives in, right after MovePos.
void moveAfter(Instruction *MovePos);		void moveAfter(Instruction *MovePos);

		/// Given an instruction Other in the same basic block as this instruction,
		/// return true if this instruction comes before Other. In this worst case,
		/// this takes linear time in the number of instructions in the block. The
		/// results are cached, so in common cases when the block remains unmodified,
		/// it takes constant time.
		bool comesBefore(const Instruction *Other) const;

//===--------------------------------------------------------------------===//		//===--------------------------------------------------------------------===//
// Subclass classification.		// Subclass classification.
//===--------------------------------------------------------------------===//		//===--------------------------------------------------------------------===//

/// Returns a member of one of the enums like Instruction::Add.		/// Returns a member of one of the enums like Instruction::Add.
unsigned getOpcode() const { return getValueID() - InstructionVal; }		unsigned getOpcode() const { return getValueID() - InstructionVal; }

const char *getOpcodeName() const { return getOpcodeName(getOpcode()); }		const char *getOpcodeName() const { return getOpcodeName(getOpcode()); }
▲ Show 20 Lines • Show All 557 Lines • ▼ Show 20 Lines
#define FIRST_OTHER_INST(N) OtherOpsBegin = N,		#define FIRST_OTHER_INST(N) OtherOpsBegin = N,
#define HANDLE_OTHER_INST(N, OPC, CLASS) OPC = N,		#define HANDLE_OTHER_INST(N, OPC, CLASS) OPC = N,
#define LAST_OTHER_INST(N) OtherOpsEnd = N+1		#define LAST_OTHER_INST(N) OtherOpsEnd = N+1
#include "llvm/IR/Instruction.def"		#include "llvm/IR/Instruction.def"
};		};

private:		private:
friend class SymbolTableListTraits<Instruction>;		friend class SymbolTableListTraits<Instruction>;
		friend class BasicBlock; // For renumbering.

// Shadow Value::setValueSubclassData with a private forwarding method so that		// Shadow Value::setValueSubclassData with a private forwarding method so that
// subclasses cannot accidentally use it.		// subclasses cannot accidentally use it.
void setValueSubclassData(unsigned short D) {		void setValueSubclassData(unsigned short D) {
Value::setValueSubclassData(D);		Value::setValueSubclassData(D);
}		}

unsigned short getSubclassDataFromValue() const {		unsigned short getSubclassDataFromValue() const {
▲ Show 20 Lines • Show All 41 Lines • Show Last 20 Lines

llvm/include/llvm/IR/SymbolTableListTraits.h

Show First 20 Lines • Show All 52 Lines • ▼ Show 20 Lines
DEFINE_SYMBOL_TABLE_PARENT_TYPE(BasicBlock, Function)		DEFINE_SYMBOL_TABLE_PARENT_TYPE(BasicBlock, Function)
DEFINE_SYMBOL_TABLE_PARENT_TYPE(Argument, Function)		DEFINE_SYMBOL_TABLE_PARENT_TYPE(Argument, Function)
DEFINE_SYMBOL_TABLE_PARENT_TYPE(Function, Module)		DEFINE_SYMBOL_TABLE_PARENT_TYPE(Function, Module)
DEFINE_SYMBOL_TABLE_PARENT_TYPE(GlobalVariable, Module)		DEFINE_SYMBOL_TABLE_PARENT_TYPE(GlobalVariable, Module)
DEFINE_SYMBOL_TABLE_PARENT_TYPE(GlobalAlias, Module)		DEFINE_SYMBOL_TABLE_PARENT_TYPE(GlobalAlias, Module)
DEFINE_SYMBOL_TABLE_PARENT_TYPE(GlobalIFunc, Module)		DEFINE_SYMBOL_TABLE_PARENT_TYPE(GlobalIFunc, Module)
#undef DEFINE_SYMBOL_TABLE_PARENT_TYPE		#undef DEFINE_SYMBOL_TABLE_PARENT_TYPE

template <typename NodeTy> class SymbolTableList;		template <typename ValueSubClass> class SymbolTableListTraits;
		template <typename ValueSubClass,
		typename TraitsTy = SymbolTableListTraits<ValueSubClass>>
		class SymbolTableList;
		rnkAuthorUnsubmitted Not Done Reply Inline Actions Oops, I'll revert this. rnk: Oops, I'll revert this.

// ValueSubClass - The type of objects that I hold, e.g. Instruction.		// ValueSubClass - The type of objects that I hold, e.g. Instruction.
// ItemParentClass - The type of object that owns the list, e.g. BasicBlock.		// ItemParentClass - The type of object that owns the list, e.g. BasicBlock.
//		//
template <typename ValueSubClass>		template <typename ValueSubClass>
class SymbolTableListTraits : public ilist_alloc_traits<ValueSubClass> {		class SymbolTableListTraits : public ilist_alloc_traits<ValueSubClass> {
using ListTy = SymbolTableList<ValueSubClass>;		using ListTy = SymbolTableList<ValueSubClass>;
using iterator = typename simple_ilist<ValueSubClass>::iterator;		using iterator = typename simple_ilist<ValueSubClass>::iterator;
using ItemParentClass =		using ItemParentClass =
typename SymbolTableListParentType<ValueSubClass>::type;		typename SymbolTableListParentType<ValueSubClass>::type;

public:		public:
SymbolTableListTraits() = default;		SymbolTableListTraits() = default;

private:		protected:
/// getListOwner - Return the object that owns this list. If this is a list		/// getListOwner - Return the object that owns this list. If this is a list
/// of instructions, it returns the BasicBlock that owns them.		/// of instructions, it returns the BasicBlock that owns them.
ItemParentClass *getListOwner() {		ItemParentClass *getListOwner() {
size_t Offset(size_t(&((ItemParentClass)nullptr->ItemParentClass::		size_t Offset(size_t(&((ItemParentClass)nullptr->ItemParentClass::
getSublistAccess(static_cast<ValueSubClass*>(nullptr)))));		getSublistAccess(static_cast<ValueSubClass*>(nullptr)))));
ListTy Anchor(static_cast<ListTy >(this));		ListTy Anchor(static_cast<ListTy >(this));
return reinterpret_cast<ItemParentClass>(reinterpret_cast<char>(Anchor)-		return reinterpret_cast<ItemParentClass>(reinterpret_cast<char>(Anchor)-
Offset);		Offset);
Show All 19 Lines	public:
static ValueSymbolTable *toPtr(ValueSymbolTable &R) { return &R; }		static ValueSymbolTable *toPtr(ValueSymbolTable &R) { return &R; }
};		};

/// List that automatically updates parent links and symbol tables.		/// List that automatically updates parent links and symbol tables.
///		///
/// When nodes are inserted into and removed from this list, the associated		/// When nodes are inserted into and removed from this list, the associated
/// symbol table will be automatically updated. Similarly, parent links get		/// symbol table will be automatically updated. Similarly, parent links get
/// updated automatically.		/// updated automatically.
template <class T>		template <class T, class TraitsT>
class SymbolTableList		class SymbolTableList
: public iplist_impl<simple_ilist<T>, SymbolTableListTraits<T>> {};		: public iplist_impl<simple_ilist<T>, TraitsT> {};

} // end namespace llvm		} // end namespace llvm

#endif // LLVM_IR_SYMBOLTABLELISTTRAITS_H		#endif // LLVM_IR_SYMBOLTABLELISTTRAITS_H

llvm/lib/Analysis/AliasAnalysis.cpp

Show First 20 Lines • Show All 533 Lines • ▼ Show 20 Lines	if (AR == MustAlias)
return ModRefInfo::MustModRef;		return ModRefInfo::MustModRef;
}		}

return ModRefInfo::ModRef;		return ModRefInfo::ModRef;
}		}

/// Return information about whether a particular call site modifies		/// Return information about whether a particular call site modifies
/// or reads the specified memory location \p MemLoc before instruction \p I		/// or reads the specified memory location \p MemLoc before instruction \p I
/// in a BasicBlock. An ordered basic block \p OBB can be used to speed up		/// in a BasicBlock.
/// instruction-ordering queries inside the BasicBlock containing \p I.
/// FIXME: this is really just shoring-up a deficiency in alias analysis.		/// FIXME: this is really just shoring-up a deficiency in alias analysis.
/// BasicAA isn't willing to spend linear time determining whether an alloca		/// BasicAA isn't willing to spend linear time determining whether an alloca
/// was captured before or after this particular call, while we are. However,		/// was captured before or after this particular call, while we are. However,
/// with a smarter AA in place, this test is just wasting compile time.		/// with a smarter AA in place, this test is just wasting compile time.
ModRefInfo AAResults::callCapturesBefore(const Instruction *I,		ModRefInfo AAResults::callCapturesBefore(const Instruction *I,
const MemoryLocation &MemLoc,		const MemoryLocation &MemLoc,
DominatorTree *DT,		DominatorTree *DT) {
OrderedBasicBlock *OBB) {
if (!DT)		if (!DT)
return ModRefInfo::ModRef;		return ModRefInfo::ModRef;

const Value *Object =		const Value *Object =
GetUnderlyingObject(MemLoc.Ptr, I->getModule()->getDataLayout());		GetUnderlyingObject(MemLoc.Ptr, I->getModule()->getDataLayout());
if (!isIdentifiedObject(Object) \|\| isa<GlobalValue>(Object) \|\|		if (!isIdentifiedObject(Object) \|\| isa<GlobalValue>(Object) \|\|
isa<Constant>(Object))		isa<Constant>(Object))
return ModRefInfo::ModRef;		return ModRefInfo::ModRef;

ImmutableCallSite CS(I);		ImmutableCallSite CS(I);
if (!CS.getInstruction() \|\| CS.getInstruction() == Object)		if (!CS.getInstruction() \|\| CS.getInstruction() == Object)
return ModRefInfo::ModRef;		return ModRefInfo::ModRef;

if (PointerMayBeCapturedBefore(Object, /* ReturnCaptures */ true,		if (PointerMayBeCapturedBefore(Object, /* ReturnCaptures */ true,
/* StoreCaptures */ true, I, DT,		/* StoreCaptures */ true, I, DT,
/* include Object */ true,		/* include Object */ true))
/* OrderedBasicBlock */ OBB))
return ModRefInfo::ModRef;		return ModRefInfo::ModRef;

unsigned ArgNo = 0;		unsigned ArgNo = 0;
ModRefInfo R = ModRefInfo::NoModRef;		ModRefInfo R = ModRefInfo::NoModRef;
bool IsMustAlias = true;		bool IsMustAlias = true;
// Set flag only if no May found and all operands processed.		// Set flag only if no May found and all operands processed.
for (auto CI = CS.data_operands_begin(), CE = CS.data_operands_end();		for (auto CI = CS.data_operands_begin(), CE = CS.data_operands_end();
CI != CE; ++CI, ++ArgNo) {		CI != CE; ++CI, ++ArgNo) {
▲ Show 20 Lines • Show All 262 Lines • Show Last 20 Lines

llvm/lib/Analysis/CMakeLists.txt

Show First 20 Lines • Show All 59 Lines • ▼ Show 20 Lines	add_llvm_library(LLVMAnalysis
MemorySSAUpdater.cpp		MemorySSAUpdater.cpp
ModuleDebugInfoPrinter.cpp		ModuleDebugInfoPrinter.cpp
ModuleSummaryAnalysis.cpp		ModuleSummaryAnalysis.cpp
MustExecute.cpp		MustExecute.cpp
ObjCARCAliasAnalysis.cpp		ObjCARCAliasAnalysis.cpp
ObjCARCAnalysisUtils.cpp		ObjCARCAnalysisUtils.cpp
ObjCARCInstKind.cpp		ObjCARCInstKind.cpp
OptimizationRemarkEmitter.cpp		OptimizationRemarkEmitter.cpp
OrderedBasicBlock.cpp
OrderedInstructions.cpp		OrderedInstructions.cpp
PHITransAddr.cpp		PHITransAddr.cpp
PhiValues.cpp		PhiValues.cpp
PostDominators.cpp		PostDominators.cpp
ProfileSummaryInfo.cpp		ProfileSummaryInfo.cpp
PtrUseVisitor.cpp		PtrUseVisitor.cpp
RegionInfo.cpp		RegionInfo.cpp
RegionPass.cpp		RegionPass.cpp
Show All 23 Lines

llvm/lib/Analysis/CaptureTracking.cpp

Show All 15 Lines
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#include "llvm/Analysis/CaptureTracking.h"		#include "llvm/Analysis/CaptureTracking.h"
#include "llvm/ADT/SmallSet.h"		#include "llvm/ADT/SmallSet.h"
#include "llvm/ADT/SmallVector.h"		#include "llvm/ADT/SmallVector.h"
#include "llvm/Analysis/AliasAnalysis.h"		#include "llvm/Analysis/AliasAnalysis.h"
#include "llvm/Analysis/CFG.h"		#include "llvm/Analysis/CFG.h"
#include "llvm/Analysis/OrderedBasicBlock.h"
#include "llvm/Analysis/ValueTracking.h"		#include "llvm/Analysis/ValueTracking.h"
#include "llvm/IR/CallSite.h"		#include "llvm/IR/CallSite.h"
#include "llvm/IR/Constants.h"		#include "llvm/IR/Constants.h"
#include "llvm/IR/Dominators.h"		#include "llvm/IR/Dominators.h"
#include "llvm/IR/Instructions.h"		#include "llvm/IR/Instructions.h"
#include "llvm/IR/IntrinsicInst.h"		#include "llvm/IR/IntrinsicInst.h"

using namespace llvm;		using namespace llvm;
Show All 24 Lines	namespace {

/// Only find pointer captures which happen before the given instruction. Uses		/// Only find pointer captures which happen before the given instruction. Uses
/// the dominator tree to determine whether one instruction is before another.		/// the dominator tree to determine whether one instruction is before another.
/// Only support the case where the Value is defined in the same basic block		/// Only support the case where the Value is defined in the same basic block
/// as the given instruction and the use.		/// as the given instruction and the use.
struct CapturesBefore : public CaptureTracker {		struct CapturesBefore : public CaptureTracker {

CapturesBefore(bool ReturnCaptures, const Instruction I, const DominatorTree DT,		CapturesBefore(bool ReturnCaptures, const Instruction I, const DominatorTree DT,
bool IncludeI, OrderedBasicBlock *IC)		bool IncludeI)
: OrderedBB(IC), BeforeHere(I), DT(DT),		: BeforeHere(I), DT(DT),
ReturnCaptures(ReturnCaptures), IncludeI(IncludeI), Captured(false) {}		ReturnCaptures(ReturnCaptures), IncludeI(IncludeI), Captured(false) {}

void tooManyUses() override { Captured = true; }		void tooManyUses() override { Captured = true; }

bool isSafeToPrune(Instruction *I) {		bool isSafeToPrune(Instruction *I) {
BasicBlock *BB = I->getParent();		BasicBlock *BB = I->getParent();
// We explore this usage only if the usage can reach "BeforeHere".		// We explore this usage only if the usage can reach "BeforeHere".
// If use is not reachable from entry, there is no need to explore.		// If use is not reachable from entry, there is no need to explore.
if (BeforeHere != I && !DT->isReachableFromEntry(BB))		if (BeforeHere != I && !DT->isReachableFromEntry(BB))
return true;		return true;

// Compute the case where both instructions are inside the same basic		// Compute the case where both instructions are inside the same basic
// block. Since instructions in the same BB as BeforeHere are numbered in		// block.
// 'OrderedBB', avoid using 'dominates' and 'isPotentiallyReachable'
// which are very expensive for large basic blocks.
if (BB == BeforeHere->getParent()) {		if (BB == BeforeHere->getParent()) {
// 'I' dominates 'BeforeHere' => not safe to prune.		// 'I' dominates 'BeforeHere' => not safe to prune.
//		//
// The value defined by an invoke dominates an instruction only		// The value defined by an invoke dominates an instruction only
// if it dominates every instruction in UseBB. A PHI is dominated only		// if it dominates every instruction in UseBB. A PHI is dominated only
// if the instruction dominates every possible use in the UseBB. Since		// if the instruction dominates every possible use in the UseBB. Since
// UseBB == BB, avoid pruning.		// UseBB == BB, avoid pruning.
if (isa<InvokeInst>(BeforeHere) \|\| isa<PHINode>(I) \|\| I == BeforeHere)		if (isa<InvokeInst>(BeforeHere) \|\| isa<PHINode>(I) \|\| I == BeforeHere)
return false;		return false;
if (!OrderedBB->dominates(BeforeHere, I))		if (!BeforeHere->comesBefore(I))
return false;		return false;

// 'BeforeHere' comes before 'I', it's safe to prune if we also		// 'BeforeHere' comes before 'I', it's safe to prune if we also
// guarantee that 'I' never reaches 'BeforeHere' through a back-edge or		// guarantee that 'I' never reaches 'BeforeHere' through a back-edge or
// by its successors, i.e, prune if:		// by its successors, i.e, prune if:
//		//
// (1) BB is an entry block or have no successors.		// (1) BB is an entry block or have no successors.
// (2) There's no path coming back through BB successors.		// (2) There's no path coming back through BB successors.
Show All 34 Lines	bool captured(const Use *U) override {

if (!shouldExplore(U))		if (!shouldExplore(U))
return false;		return false;

Captured = true;		Captured = true;
return true;		return true;
}		}

OrderedBasicBlock *OrderedBB;
const Instruction *BeforeHere;		const Instruction *BeforeHere;
const DominatorTree *DT;		const DominatorTree *DT;

bool ReturnCaptures;		bool ReturnCaptures;
bool IncludeI;		bool IncludeI;

bool Captured;		bool Captured;
};		};
Show All 25 Lines
/// PointerMayBeCapturedBefore - Return true if this pointer value may be		/// PointerMayBeCapturedBefore - Return true if this pointer value may be
/// captured by the enclosing function (which is required to exist). If a		/// captured by the enclosing function (which is required to exist). If a
/// DominatorTree is provided, only captures which happen before the given		/// DominatorTree is provided, only captures which happen before the given
/// instruction are considered. This routine can be expensive, so consider		/// instruction are considered. This routine can be expensive, so consider
/// caching the results. The boolean ReturnCaptures specifies whether		/// caching the results. The boolean ReturnCaptures specifies whether
/// returning the value (or part of it) from the function counts as capturing		/// returning the value (or part of it) from the function counts as capturing
/// it or not. The boolean StoreCaptures specified whether storing the value		/// it or not. The boolean StoreCaptures specified whether storing the value
/// (or part of it) into memory anywhere automatically counts as capturing it		/// (or part of it) into memory anywhere automatically counts as capturing it
/// or not. A ordered basic block \p OBB can be used in order to speed up		/// or not.
/// queries about relative order among instructions in the same basic block.
bool llvm::PointerMayBeCapturedBefore(const Value *V, bool ReturnCaptures,		bool llvm::PointerMayBeCapturedBefore(const Value *V, bool ReturnCaptures,
bool StoreCaptures, const Instruction *I,		bool StoreCaptures, const Instruction *I,
const DominatorTree *DT, bool IncludeI,		const DominatorTree *DT, bool IncludeI) {
OrderedBasicBlock *OBB) {
assert(!isa<GlobalValue>(V) &&		assert(!isa<GlobalValue>(V) &&
"It doesn't make sense to ask whether a global is captured.");		"It doesn't make sense to ask whether a global is captured.");
bool UseNewOBB = OBB == nullptr;

if (!DT)		if (!DT)
return PointerMayBeCaptured(V, ReturnCaptures, StoreCaptures);		return PointerMayBeCaptured(V, ReturnCaptures, StoreCaptures);
if (UseNewOBB)
OBB = new OrderedBasicBlock(I->getParent());

// TODO: See comment in PointerMayBeCaptured regarding what could be done		// TODO: See comment in PointerMayBeCaptured regarding what could be done
// with StoreCaptures.		// with StoreCaptures.

CapturesBefore CB(ReturnCaptures, I, DT, IncludeI, OBB);		CapturesBefore CB(ReturnCaptures, I, DT, IncludeI);
PointerMayBeCaptured(V, &CB);		PointerMayBeCaptured(V, &CB);

if (UseNewOBB)
delete OBB;
return CB.Captured;		return CB.Captured;
}		}

/// TODO: Write a new FunctionPass AliasAnalysis so that it can keep		/// TODO: Write a new FunctionPass AliasAnalysis so that it can keep
/// a cache. Then we can move the code from BasicAliasAnalysis into		/// a cache. Then we can move the code from BasicAliasAnalysis into
/// that path, and remove this threshold.		/// that path, and remove this threshold.
static int const Threshold = 20;		static int const Threshold = 20;

▲ Show 20 Lines • Show All 150 Lines • Show Last 20 Lines

llvm/lib/Analysis/InstructionPrecedenceTracking.cpp

Show First 20 Lines • Show All 51 Lines • ▼ Show 20 Lines	if (isSpecialInstruction(&I)) {
break;		break;
}		}

// Mark this block as having a known result.		// Mark this block as having a known result.
KnownBlocks.insert(BB);		KnownBlocks.insert(BB);
}		}

void InstructionPrecedenceTracking::invalidateBlock(const BasicBlock *BB) {		void InstructionPrecedenceTracking::invalidateBlock(const BasicBlock *BB) {
OI.invalidateBlock(BB);
FirstSpecialInsts.erase(BB);		FirstSpecialInsts.erase(BB);
KnownBlocks.erase(BB);		KnownBlocks.erase(BB);
}		}

void InstructionPrecedenceTracking::clear() {		void InstructionPrecedenceTracking::clear() {
for (auto It : FirstSpecialInsts)
OI.invalidateBlock(It.first);
FirstSpecialInsts.clear();		FirstSpecialInsts.clear();
KnownBlocks.clear();		KnownBlocks.clear();
}		}

bool ImplicitControlFlowTracking::isSpecialInstruction(		bool ImplicitControlFlowTracking::isSpecialInstruction(
const Instruction *Insn) const {		const Instruction *Insn) const {
// If a block's instruction doesn't always pass the control to its successor		// If a block's instruction doesn't always pass the control to its successor
// instruction, mark the block as having implicit control flow. We use them		// instruction, mark the block as having implicit control flow. We use them
Show All 24 Lines

llvm/lib/Analysis/MemoryDependenceAnalysis.cpp

Show All 18 Lines
#include "llvm/ADT/STLExtras.h"		#include "llvm/ADT/STLExtras.h"
#include "llvm/ADT/SmallPtrSet.h"		#include "llvm/ADT/SmallPtrSet.h"
#include "llvm/ADT/SmallVector.h"		#include "llvm/ADT/SmallVector.h"
#include "llvm/ADT/Statistic.h"		#include "llvm/ADT/Statistic.h"
#include "llvm/Analysis/AliasAnalysis.h"		#include "llvm/Analysis/AliasAnalysis.h"
#include "llvm/Analysis/AssumptionCache.h"		#include "llvm/Analysis/AssumptionCache.h"
#include "llvm/Analysis/MemoryBuiltins.h"		#include "llvm/Analysis/MemoryBuiltins.h"
#include "llvm/Analysis/MemoryLocation.h"		#include "llvm/Analysis/MemoryLocation.h"
#include "llvm/Analysis/OrderedBasicBlock.h"
#include "llvm/Analysis/PHITransAddr.h"		#include "llvm/Analysis/PHITransAddr.h"
#include "llvm/Analysis/PhiValues.h"		#include "llvm/Analysis/PhiValues.h"
#include "llvm/Analysis/TargetLibraryInfo.h"		#include "llvm/Analysis/TargetLibraryInfo.h"
#include "llvm/Analysis/ValueTracking.h"		#include "llvm/Analysis/ValueTracking.h"
#include "llvm/IR/Attributes.h"		#include "llvm/IR/Attributes.h"
#include "llvm/IR/BasicBlock.h"		#include "llvm/IR/BasicBlock.h"
#include "llvm/IR/CallSite.h"		#include "llvm/IR/CallSite.h"
#include "llvm/IR/Constants.h"		#include "llvm/IR/Constants.h"
▲ Show 20 Lines • Show All 448 Lines • ▼ Show 20 Lines	MemDepResult MemoryDependenceResults::getSimplePointerDependencyFrom(
if (isLoad && QueryInst) {		if (isLoad && QueryInst) {
LoadInst *LI = dyn_cast<LoadInst>(QueryInst);		LoadInst *LI = dyn_cast<LoadInst>(QueryInst);
if (LI && LI->getMetadata(LLVMContext::MD_invariant_load) != nullptr)		if (LI && LI->getMetadata(LLVMContext::MD_invariant_load) != nullptr)
isInvariantLoad = true;		isInvariantLoad = true;
}		}

const DataLayout &DL = BB->getModule()->getDataLayout();		const DataLayout &DL = BB->getModule()->getDataLayout();

// Create a numbered basic block to lazily compute and cache instruction
// positions inside a BB. This is used to provide fast queries for relative
// position between two instructions in a BB and can be used by
// AliasAnalysis::callCapturesBefore.
OrderedBasicBlock OBB(BB);

// Return "true" if and only if the instruction I is either a non-simple		// Return "true" if and only if the instruction I is either a non-simple
// load or a non-simple store.		// load or a non-simple store.
auto isNonSimpleLoadOrStore = [](Instruction *I) -> bool {		auto isNonSimpleLoadOrStore = [](Instruction *I) -> bool {
if (auto *LI = dyn_cast<LoadInst>(I))		if (auto *LI = dyn_cast<LoadInst>(I))
return !LI->isSimple();		return !LI->isSimple();
if (auto *SI = dyn_cast<StoreInst>(I))		if (auto *SI = dyn_cast<StoreInst>(I))
return !SI->isSimple();		return !SI->isSimple();
return false;		return false;
▲ Show 20 Lines • Show All 173 Lines • ▼ Show 20 Lines	#endif
if (FenceInst *FI = dyn_cast<FenceInst>(Inst))		if (FenceInst *FI = dyn_cast<FenceInst>(Inst))
if (isLoad && FI->getOrdering() == AtomicOrdering::Release)		if (isLoad && FI->getOrdering() == AtomicOrdering::Release)
continue;		continue;

// See if this instruction (e.g. a call or vaarg) mod/ref's the pointer.		// See if this instruction (e.g. a call or vaarg) mod/ref's the pointer.
ModRefInfo MR = AA.getModRefInfo(Inst, MemLoc);		ModRefInfo MR = AA.getModRefInfo(Inst, MemLoc);
// If necessary, perform additional analysis.		// If necessary, perform additional analysis.
if (isModAndRefSet(MR))		if (isModAndRefSet(MR))
MR = AA.callCapturesBefore(Inst, MemLoc, &DT, &OBB);		MR = AA.callCapturesBefore(Inst, MemLoc, &DT);
switch (clearMust(MR)) {		switch (clearMust(MR)) {
case ModRefInfo::NoModRef:		case ModRefInfo::NoModRef:
// If the call has no effect on the queried pointer, just ignore it.		// If the call has no effect on the queried pointer, just ignore it.
continue;		continue;
case ModRefInfo::Mod:		case ModRefInfo::Mod:
return MemDepResult::getClobber(Inst);		return MemDepResult::getClobber(Inst);
case ModRefInfo::Ref:		case ModRefInfo::Ref:
// If the call is known to never store to the pointer, and if this is a		// If the call is known to never store to the pointer, and if this is a
▲ Show 20 Lines • Show All 1,110 Lines • Show Last 20 Lines

llvm/lib/Analysis/OrderedBasicBlock.cpp

This file was deleted.

	//===- OrderedBasicBlock.cpp --------------------------------- -- C++ --===//
	//
	// The LLVM Compiler Infrastructure
	//
	// This file is distributed under the University of Illinois Open Source
	// License. See LICENSE.TXT for details.
	//
	//===----------------------------------------------------------------------===//
	//
	// This file implements the OrderedBasicBlock class. OrderedBasicBlock
	// maintains an interface where clients can query if one instruction comes
	// before another in a BasicBlock. Since BasicBlock currently lacks a reliable
	// way to query relative position between instructions one can use
	// OrderedBasicBlock to do such queries. OrderedBasicBlock is lazily built on a
	// source BasicBlock and maintains an internal Instruction -> Position map. A
	// OrderedBasicBlock instance should be discarded whenever the source
	// BasicBlock changes.
	//
	// It's currently used by the CaptureTracker in order to find relative
	// positions of a pair of instructions inside a BasicBlock.
	//
	//===----------------------------------------------------------------------===//

	#include "llvm/Analysis/OrderedBasicBlock.h"
	#include "llvm/IR/Instruction.h"
	using namespace llvm;

	OrderedBasicBlock::OrderedBasicBlock(const BasicBlock *BasicB)
	: NextInstPos(0), BB(BasicB) {
	LastInstFound = BB->end();
	}

	/// Given no cached results, find if \p A comes before \p B in \p BB.
	/// Cache and number out instruction while walking \p BB.
	bool OrderedBasicBlock::comesBefore(const Instruction *A,
	const Instruction *B) {
	const Instruction *Inst = nullptr;
	assert(!(LastInstFound == BB->end() && NextInstPos != 0) &&
	"Instruction supposed to be in NumberedInsts");

	// Start the search with the instruction found in the last lookup round.
	auto II = BB->begin();
	auto IE = BB->end();
	if (LastInstFound != IE)
	II = std::next(LastInstFound);

	// Number all instructions up to the point where we find 'A' or 'B'.
	for (; II != IE; ++II) {
	Inst = cast<Instruction>(II);
	NumberedInsts[Inst] = NextInstPos++;
	if (Inst == A \|\| Inst == B)
	break;
	}

	assert(II != IE && "Instruction not found?");
	assert((Inst == A \|\| Inst == B) && "Should find A or B");
	LastInstFound = II;
	return Inst != B;
	}

	/// Find out whether \p A dominates \p B, meaning whether \p A
	/// comes before \p B in \p BB. This is a simplification that considers
	/// cached instruction positions and ignores other basic blocks, being
	/// only relevant to compare relative instructions positions inside \p BB.
	bool OrderedBasicBlock::dominates(const Instruction A, const Instruction B) {
	assert(A->getParent() == B->getParent() &&
	"Instructions must be in the same basic block!");

	// First we lookup the instructions. If they don't exist, lookup will give us
	// back ::end(). If they both exist, we compare the numbers. Otherwise, if NA
	// exists and NB doesn't, it means NA must come before NB because we would
	// have numbered NB as well if it didn't. The same is true for NB. If it
	// exists, but NA does not, NA must come after it. If neither exist, we need
	// to number the block and cache the results (by calling comesBefore).
	auto NAI = NumberedInsts.find(A);
	auto NBI = NumberedInsts.find(B);
	if (NAI != NumberedInsts.end() && NBI != NumberedInsts.end())
	return NAI->second < NBI->second;
	if (NAI != NumberedInsts.end())
	return true;
	if (NBI != NumberedInsts.end())
	return false;

	return comesBefore(A, B);
	}

llvm/lib/Analysis/OrderedInstructions.cpp

	Show All 13 Lines
	#include "llvm/Analysis/OrderedInstructions.h"			#include "llvm/Analysis/OrderedInstructions.h"
	using namespace llvm;			using namespace llvm;

	bool OrderedInstructions::localDominates(const Instruction *InstA,			bool OrderedInstructions::localDominates(const Instruction *InstA,
	const Instruction *InstB) const {			const Instruction *InstB) const {
	assert(InstA->getParent() == InstB->getParent() &&			assert(InstA->getParent() == InstB->getParent() &&
	"Instructions must be in the same basic block");			"Instructions must be in the same basic block");

	const BasicBlock *IBB = InstA->getParent();			return InstA->comesBefore(InstB);
	auto OBB = OBBMap.find(IBB);
	if (OBB == OBBMap.end())
	OBB = OBBMap.insert({IBB, make_unique<OrderedBasicBlock>(IBB)}).first;
	return OBB->second->dominates(InstA, InstB);
	}			}

	/// Given 2 instructions, use OrderedBasicBlock to check for dominance relation			/// Given 2 instructions, check for dominance relation if the instructions are
	/// if the instructions are in the same basic block, Otherwise, use dominator			/// in the same basic block. Otherwise, use dominator tree.
	/// tree.
	bool OrderedInstructions::dominates(const Instruction *InstA,			bool OrderedInstructions::dominates(const Instruction *InstA,
	const Instruction *InstB) const {			const Instruction *InstB) const {
	// Use ordered basic block to do dominance check in case the 2 instructions			// Use ordered basic block to do dominance check in case the 2 instructions
	// are in the same basic block.			// are in the same basic block.
	if (InstA->getParent() == InstB->getParent())			if (InstA->getParent() == InstB->getParent())
	return localDominates(InstA, InstB);			return localDominates(InstA, InstB);
				kuharUnsubmitted Done Reply Inline Actions Is the separate function `localDominates` still needed? Seems like the body is trivial and could be inlined here? kuhar: Is the separate function `localDominates` still needed? Seems like the body is trivial and…
				rnkAuthorUnsubmitted Not Done Reply Inline Actions Sure, fixed. rnk: Sure, fixed.
	return DT->dominates(InstA->getParent(), InstB->getParent());			return DT->dominates(InstA->getParent(), InstB->getParent());
	}			}

	bool OrderedInstructions::dfsBefore(const Instruction *InstA,			bool OrderedInstructions::dfsBefore(const Instruction *InstA,
	const Instruction *InstB) const {			const Instruction *InstB) const {
	// Use ordered basic block in case the 2 instructions are in the same basic			// Use ordered basic block in case the 2 instructions are in the same basic
	// block.			// block.
	if (InstA->getParent() == InstB->getParent())			if (InstA->getParent() == InstB->getParent())
	return localDominates(InstA, InstB);			return localDominates(InstA, InstB);

	DomTreeNode *DA = DT->getNode(InstA->getParent());			DomTreeNode *DA = DT->getNode(InstA->getParent());
	DomTreeNode *DB = DT->getNode(InstB->getParent());			DomTreeNode *DB = DT->getNode(InstB->getParent());
	return DA->getDFSNumIn() < DB->getDFSNumIn();			return DA->getDFSNumIn() < DB->getDFSNumIn();
	}			}

llvm/lib/IR/BasicBlock.cpp

Show All 28 Lines	if (Function *F = getParent())
return F->getValueSymbolTable();		return F->getValueSymbolTable();
return nullptr;		return nullptr;
}		}

LLVMContext &BasicBlock::getContext() const {		LLVMContext &BasicBlock::getContext() const {
return getType()->getContext();		return getType()->getContext();
}		}

		template <> void llvm::notifyParentOfInsertions(BasicBlock *BB) {
		BB->invalidateOrders();
		}

// Explicit instantiation of SymbolTableListTraits since some of the methods		// Explicit instantiation of SymbolTableListTraits since some of the methods
// are not in the public header file...		// are not in the public header file...
template class llvm::SymbolTableListTraits<Instruction>;		template class llvm::SymbolTableListTraits<Instruction>;

BasicBlock::BasicBlock(LLVMContext &C, const Twine &Name, Function *NewParent,		BasicBlock::BasicBlock(LLVMContext &C, const Twine &Name, Function *NewParent,
BasicBlock *InsertBefore)		BasicBlock *InsertBefore)
: Value(Type::getLabelTy(C), Value::BasicBlockVal), Parent(nullptr) {		: Value(Type::getLabelTy(C), Value::BasicBlockVal), Parent(nullptr) {

▲ Show 20 Lines • Show All 435 Lines • ▼ Show 20 Lines	Optional<uint64_t> BasicBlock::getIrrLoopHeaderWeight() const {
return Optional<uint64_t>();		return Optional<uint64_t>();
}		}

BasicBlock::iterator llvm::skipDebugIntrinsics(BasicBlock::iterator It) {		BasicBlock::iterator llvm::skipDebugIntrinsics(BasicBlock::iterator It) {
while (isa<DbgInfoIntrinsic>(It))		while (isa<DbgInfoIntrinsic>(It))
++It;		++It;
return It;		return It;
}		}

		void BasicBlock::renumberInstructions() {
		unsigned Order = 0;
		for (Instruction &I : *this)
		I.Order = Order++;
		kuharUnsubmitted Not Done Reply Inline Actions Is it possible to use noncontiguous indices? If the indices are spread apart, you should be able to perform most insertions without renumbering instructions. kuhar: Is it possible to use noncontiguous indices? If the indices are spread apart, you should be…
		rnkAuthorUnsubmitted Not Done Reply Inline Actions I want to put that out of scope of the initial change. We can do all kinds of fancy tricks here to avoid invalidating the ordering, but it's hard to provide meaningfully better algorithmic guarantees. And, the more complex code will require more complex testing, and it might have bugs. I'd rather come back and implement a more complex algorithm once profiling shows that there is a bottleneck, especially since it's often easier to remove these bottlenecks by delaying insertion. rnk: I want to put that out of scope of the initial change. We can do all kinds of fancy tricks here…
		kuharUnsubmitted Not Done Reply Inline Actions Sure, makes perfect sense. I'm not very familiar with the IR part of llvm, but I'd prefer to see a comment that explains that in a relevant place if you believe that this is a good future direction. kuhar: Sure, makes perfect sense. I'm not very familiar with the IR part of llvm, but I'd prefer to…

		// Set the low bit in the subclass data to mark the instruction ordering as
		// valid.
		setValueSubclassData(getSubclassDataFromValue() \| 1U);
		}

llvm/lib/IR/Instruction.cpp

	Show First 20 Lines • Show All 92 Lines • ▼ Show 20 Lines
	}			}

	void Instruction::moveBefore(BasicBlock &BB,			void Instruction::moveBefore(BasicBlock &BB,
	SymbolTableList<Instruction>::iterator I) {			SymbolTableList<Instruction>::iterator I) {
	assert(I == BB.end() \|\| I->getParent() == &BB);			assert(I == BB.end() \|\| I->getParent() == &BB);
	BB.getInstList().splice(I, getParent()->getInstList(), getIterator());			BB.getInstList().splice(I, getParent()->getInstList(), getIterator());
	}			}

				bool Instruction::comesBefore(const Instruction *Other) const {
				assert(Parent == Other->Parent && "cross-BB instruction order comparison");
				mkazantsevUnsubmitted Done Reply Inline Actions Maybe also makes sense to assert that Parent is not nullptr (i.e. instructions not detached). mkazantsev: Maybe also makes sense to assert that Parent is not nullptr (i.e. instructions not detached).
				if (!Parent->isInstrOrderValid())
				Parent->renumberInstructions();
				return Order < Other->Order;
				}

	void Instruction::setHasNoUnsignedWrap(bool b) {			void Instruction::setHasNoUnsignedWrap(bool b) {
	cast<OverflowingBinaryOperator>(this)->setHasNoUnsignedWrap(b);			cast<OverflowingBinaryOperator>(this)->setHasNoUnsignedWrap(b);
	}			}

	void Instruction::setHasNoSignedWrap(bool b) {			void Instruction::setHasNoSignedWrap(bool b) {
	cast<OverflowingBinaryOperator>(this)->setHasNoSignedWrap(b);			cast<OverflowingBinaryOperator>(this)->setHasNoSignedWrap(b);
	}			}

	▲ Show 20 Lines • Show All 658 Lines • Show Last 20 Lines

llvm/lib/IR/SymbolTableListTraitsImpl.h

	Show All 15 Lines
	#ifndef LLVM_LIB_IR_SYMBOLTABLELISTTRAITSIMPL_H			#ifndef LLVM_LIB_IR_SYMBOLTABLELISTTRAITSIMPL_H
	#define LLVM_LIB_IR_SYMBOLTABLELISTTRAITSIMPL_H			#define LLVM_LIB_IR_SYMBOLTABLELISTTRAITSIMPL_H

	#include "llvm/IR/SymbolTableListTraits.h"			#include "llvm/IR/SymbolTableListTraits.h"
	#include "llvm/IR/ValueSymbolTable.h"			#include "llvm/IR/ValueSymbolTable.h"

	namespace llvm {			namespace llvm {

				/// Notify basic blocks when an instruction is inserted.
				template <typename ParentClass>
				inline void notifyParentOfInsertions(ParentClass *Parent) {}
				template <> void notifyParentOfInsertions(BasicBlock *BB);

	/// setSymTabObject - This is called when (f.e.) the parent of a basic block			/// setSymTabObject - This is called when (f.e.) the parent of a basic block
	/// changes. This requires us to remove all the instruction symtab entries from			/// changes. This requires us to remove all the instruction symtab entries from
	/// the current function and reinsert them into the new function.			/// the current function and reinsert them into the new function.
	template <typename ValueSubClass>			template <typename ValueSubClass>
	template <typename TPtr>			template <typename TPtr>
	void SymbolTableListTraits<ValueSubClass>::setSymTabObject(TPtr *Dest,			void SymbolTableListTraits<ValueSubClass>::setSymTabObject(TPtr *Dest,
	TPtr Src) {			TPtr Src) {
	// Get the old symtab and value list before doing the assignment.			// Get the old symtab and value list before doing the assignment.
	Show All 28 Lines

	}			}

	template <typename ValueSubClass>			template <typename ValueSubClass>
	void SymbolTableListTraits<ValueSubClass>::addNodeToList(ValueSubClass *V) {			void SymbolTableListTraits<ValueSubClass>::addNodeToList(ValueSubClass *V) {
	assert(!V->getParent() && "Value already in a container!!");			assert(!V->getParent() && "Value already in a container!!");
	ItemParentClass *Owner = getListOwner();			ItemParentClass *Owner = getListOwner();
	V->setParent(Owner);			V->setParent(Owner);
				notifyParentOfInsertions(Owner);
	if (V->hasName())			if (V->hasName())
	if (ValueSymbolTable *ST = getSymTab(Owner))			if (ValueSymbolTable *ST = getSymTab(Owner))
	ST->reinsertValue(V);			ST->reinsertValue(V);
	}			}

	template <typename ValueSubClass>			template <typename ValueSubClass>
	void SymbolTableListTraits<ValueSubClass>::removeNodeFromList(			void SymbolTableListTraits<ValueSubClass>::removeNodeFromList(
	ValueSubClass *V) {			ValueSubClass *V) {
	V->setParent(nullptr);			V->setParent(nullptr);
	if (V->hasName())			if (V->hasName())
	if (ValueSymbolTable *ST = getSymTab(getListOwner()))			if (ValueSymbolTable *ST = getSymTab(getListOwner()))
	ST->removeValueName(V->getValueName());			ST->removeValueName(V->getValueName());
	}			}

	template <typename ValueSubClass>			template <typename ValueSubClass>
	void SymbolTableListTraits<ValueSubClass>::transferNodesFromList(			void SymbolTableListTraits<ValueSubClass>::transferNodesFromList(
	SymbolTableListTraits &L2, iterator first, iterator last) {			SymbolTableListTraits &L2, iterator first, iterator last) {
	// We only have to do work here if transferring instructions between BBs			// We only have to do work here if transferring instructions between BBs
	ItemParentClass NewIP = getListOwner(), OldIP = L2.getListOwner();			ItemParentClass NewIP = getListOwner(), OldIP = L2.getListOwner();
	assert(NewIP != OldIP && "Expected different list owners");			assert(NewIP != OldIP && "Expected different list owners");

	// We only have to update symbol table entries if we are transferring the			// We only have to update symbol table entries if we are transferring the
	// instructions to a different symtab object...			// instructions to a different symtab object...
				kuharUnsubmitted Not Done Reply Inline Actions Isn't it enough to invalidate only the indices of instructions that follow the first inserted one? kuhar: Isn't it enough to invalidate only the indices of instructions that follow the first inserted…
				rnkAuthorUnsubmitted Not Done Reply Inline Actions Yes, but recording that info and leveraging it is complex, and it doesn't change the asymptotic performance. We'd need more than a bit in BasicBlock to do it. rnk: Yes, but recording that info and leveraging it is complex, and it doesn't change the asymptotic…
				kuharUnsubmitted Not Done Reply Inline Actions Makes sense. How expensive is it to add new data members to BasicBlock? Do you know of any attempts to stick some data inside and measure how it affects compilation times? kuhar: Makes sense. How expensive is it to add new data members to BasicBlock? Do you know of any…
	ValueSymbolTable *NewST = getSymTab(NewIP);			ValueSymbolTable *NewST = getSymTab(NewIP);
	ValueSymbolTable *OldST = getSymTab(OldIP);			ValueSymbolTable *OldST = getSymTab(OldIP);
	if (NewST != OldST) {			if (NewST != OldST) {
	for (; first != last; ++first) {			for (; first != last; ++first) {
	ValueSubClass &V = *first;			ValueSubClass &V = *first;
	bool HasName = V.hasName();			bool HasName = V.hasName();
	if (OldST && HasName)			if (OldST && HasName)
	OldST->removeValueName(V.getValueName());			OldST->removeValueName(V.getValueName());
	Show All 15 Lines

llvm/lib/Transforms/Scalar/DeadStoreElimination.cpp

Show First 20 Lines • Show All 1,066 Lines • ▼ Show 20 Lines
}		}

static bool eliminateDeadStores(BasicBlock &BB, AliasAnalysis *AA,		static bool eliminateDeadStores(BasicBlock &BB, AliasAnalysis *AA,
MemoryDependenceResults MD, DominatorTree DT,		MemoryDependenceResults MD, DominatorTree DT,
const TargetLibraryInfo *TLI) {		const TargetLibraryInfo *TLI) {
const DataLayout &DL = BB.getModule()->getDataLayout();		const DataLayout &DL = BB.getModule()->getDataLayout();
bool MadeChange = false;		bool MadeChange = false;

// FIXME: Maybe change this to use some abstraction like OrderedBasicBlock?		// FIXME: Don't maintain our own ordering. Use Instruction::comesBefore.
// The current OrderedBasicBlock can't deal with mutation at the moment.
size_t LastThrowingInstIndex = 0;		size_t LastThrowingInstIndex = 0;
DenseMap<Instruction*, size_t> InstrOrdering;		DenseMap<Instruction*, size_t> InstrOrdering;
size_t InstrIndex = 1;		size_t InstrIndex = 1;

// A map of interval maps representing partially-overwritten value parts.		// A map of interval maps representing partially-overwritten value parts.
InstOverlapIntervalsTy IOL;		InstOverlapIntervalsTy IOL;

// Do a top-down walk on the BB.		// Do a top-down walk on the BB.
▲ Show 20 Lines • Show All 172 Lines • ▼ Show 20 Lines	while (InstDep.isDef() \|\| InstDep.isClobber()) {
unsigned MDToKeep[] = {LLVMContext::MD_dbg, LLVMContext::MD_tbaa,		unsigned MDToKeep[] = {LLVMContext::MD_dbg, LLVMContext::MD_tbaa,
LLVMContext::MD_alias_scope,		LLVMContext::MD_alias_scope,
LLVMContext::MD_noalias,		LLVMContext::MD_noalias,
LLVMContext::MD_nontemporal};		LLVMContext::MD_nontemporal};
SI->copyMetadata(*DepWrite, MDToKeep);		SI->copyMetadata(*DepWrite, MDToKeep);
++NumModifiedStores;		++NumModifiedStores;

// Remove earlier, wider, store		// Remove earlier, wider, store
size_t Idx = InstrOrdering.lookup(DepWrite);		size_t Idx = InstrOrdering.lookup(DepWrite);
InstrOrdering.erase(DepWrite);		InstrOrdering.erase(DepWrite);
InstrOrdering.insert(std::make_pair(SI, Idx));		InstrOrdering.insert(std::make_pair(SI, Idx));
rnkAuthorUnsubmitted Done Reply Inline Actions I don't have an equivalent API for this. I don't have a good way to profile to check if that is or is not the case. rnk: I don't have an equivalent API for this. I don't have a good way to profile to check if that is…

// Delete the old stores and now-dead instructions that feed them.		// Delete the old stores and now-dead instructions that feed them.
deleteDeadInstruction(Inst, &BBI, MD, TLI, IOL, &InstrOrdering);		deleteDeadInstruction(Inst, &BBI, MD, TLI, IOL, &InstrOrdering);
deleteDeadInstruction(DepWrite, &BBI, MD, TLI, IOL,		deleteDeadInstruction(DepWrite, &BBI, MD, TLI, IOL,
&InstrOrdering);		&InstrOrdering);
MadeChange = true;		MadeChange = true;

// We erased DepWrite and Inst (Loc); start over.		// We erased DepWrite and Inst (Loc); start over.
▲ Show 20 Lines • Show All 122 Lines • Show Last 20 Lines

llvm/lib/Transforms/Vectorize/LoadStoreVectorizer.cpp

Show First 20 Lines • Show All 44 Lines • ▼ Show 20 Lines
#include "llvm/ADT/PostOrderIterator.h"		#include "llvm/ADT/PostOrderIterator.h"
#include "llvm/ADT/STLExtras.h"		#include "llvm/ADT/STLExtras.h"
#include "llvm/ADT/SmallPtrSet.h"		#include "llvm/ADT/SmallPtrSet.h"
#include "llvm/ADT/SmallVector.h"		#include "llvm/ADT/SmallVector.h"
#include "llvm/ADT/Statistic.h"		#include "llvm/ADT/Statistic.h"
#include "llvm/ADT/iterator_range.h"		#include "llvm/ADT/iterator_range.h"
#include "llvm/Analysis/AliasAnalysis.h"		#include "llvm/Analysis/AliasAnalysis.h"
#include "llvm/Analysis/MemoryLocation.h"		#include "llvm/Analysis/MemoryLocation.h"
#include "llvm/Analysis/OrderedBasicBlock.h"
#include "llvm/Analysis/ScalarEvolution.h"		#include "llvm/Analysis/ScalarEvolution.h"
#include "llvm/Analysis/TargetTransformInfo.h"		#include "llvm/Analysis/TargetTransformInfo.h"
#include "llvm/Transforms/Utils/Local.h"		#include "llvm/Transforms/Utils/Local.h"
#include "llvm/Analysis/ValueTracking.h"		#include "llvm/Analysis/ValueTracking.h"
#include "llvm/Analysis/VectorUtils.h"		#include "llvm/Analysis/VectorUtils.h"
#include "llvm/IR/Attributes.h"		#include "llvm/IR/Attributes.h"
#include "llvm/IR/BasicBlock.h"		#include "llvm/IR/BasicBlock.h"
#include "llvm/IR/Constants.h"		#include "llvm/IR/Constants.h"
▲ Show 20 Lines • Show All 403 Lines • ▼ Show 20 Lines	if (auto *SelectB = dyn_cast<SelectInst>(PtrB)) {
areConsecutivePointers(SelectA->getFalseValue(),		areConsecutivePointers(SelectA->getFalseValue(),
SelectB->getFalseValue(), PtrDelta, Depth);		SelectB->getFalseValue(), PtrDelta, Depth);
}		}
}		}
return false;		return false;
}		}

void Vectorizer::reorder(Instruction *I) {		void Vectorizer::reorder(Instruction *I) {
OrderedBasicBlock OBB(I->getParent());
SmallPtrSet<Instruction *, 16> InstructionsToMove;		SmallPtrSet<Instruction *, 16> InstructionsToMove;
SmallVector<Instruction *, 16> Worklist;		SmallVector<Instruction *, 16> Worklist;

Worklist.push_back(I);		Worklist.push_back(I);
while (!Worklist.empty()) {		while (!Worklist.empty()) {
Instruction *IW = Worklist.pop_back_val();		Instruction *IW = Worklist.pop_back_val();
int NumOperands = IW->getNumOperands();		int NumOperands = IW->getNumOperands();
for (int i = 0; i < NumOperands; i++) {		for (int i = 0; i < NumOperands; i++) {
Instruction *IM = dyn_cast<Instruction>(IW->getOperand(i));		Instruction *IM = dyn_cast<Instruction>(IW->getOperand(i));
if (!IM \|\| IM->getOpcode() == Instruction::PHI)		if (!IM \|\| IM->getOpcode() == Instruction::PHI)
continue;		continue;

// If IM is in another BB, no need to move it, because this pass only		// If IM is in another BB, no need to move it, because this pass only
// vectorizes instructions within one BB.		// vectorizes instructions within one BB.
if (IM->getParent() != I->getParent())		if (IM->getParent() != I->getParent())
continue;		continue;

if (!OBB.dominates(IM, I)) {		if (!IM->comesBefore(I)) {
InstructionsToMove.insert(IM);		InstructionsToMove.insert(IM);
Worklist.push_back(IM);		Worklist.push_back(IM);
}		}
}		}
}		}

// All instructions to move should follow I. Start from I, not from begin().		// All instructions to move should follow I. Start from I, not from begin().
for (auto BBI = I->getIterator(), E = I->getParent()->end(); BBI != E;		for (auto BBI = I->getIterator(), E = I->getParent()->end(); BBI != E;
▲ Show 20 Lines • Show All 99 Lines • ▼ Show 20 Lines	if (isa<LoadInst>(I) \|\| isa<StoreInst>(I)) {
break;		break;
} else if (!IsLoadChain && (I.mayReadOrWriteMemory() \|\| I.mayThrow())) {		} else if (!IsLoadChain && (I.mayReadOrWriteMemory() \|\| I.mayThrow())) {
LLVM_DEBUG(dbgs() << "LSV: Found may-read/write/throw operation: " << I		LLVM_DEBUG(dbgs() << "LSV: Found may-read/write/throw operation: " << I
<< '\n');		<< '\n');
break;		break;
}		}
}		}

OrderedBasicBlock OBB(Chain[0]->getParent());

// Loop until we find an instruction in ChainInstrs that we can't vectorize.		// Loop until we find an instruction in ChainInstrs that we can't vectorize.
unsigned ChainInstrIdx = 0;		unsigned ChainInstrIdx = 0;
Instruction *BarrierMemoryInstr = nullptr;		Instruction *BarrierMemoryInstr = nullptr;

for (unsigned E = ChainInstrs.size(); ChainInstrIdx < E; ++ChainInstrIdx) {		for (unsigned E = ChainInstrs.size(); ChainInstrIdx < E; ++ChainInstrIdx) {
Instruction *ChainInstr = ChainInstrs[ChainInstrIdx];		Instruction *ChainInstr = ChainInstrs[ChainInstrIdx];

// If a barrier memory instruction was found, chain instructions that follow		// If a barrier memory instruction was found, chain instructions that follow
// will not be added to the valid prefix.		// will not be added to the valid prefix.
if (BarrierMemoryInstr && OBB.dominates(BarrierMemoryInstr, ChainInstr))		if (BarrierMemoryInstr && BarrierMemoryInstr->comesBefore(ChainInstr))
break;		break;

// Check (in BB order) if any instruction prevents ChainInstr from being		// Check (in BB order) if any instruction prevents ChainInstr from being
// vectorized. Find and store the first such "conflicting" instruction.		// vectorized. Find and store the first such "conflicting" instruction.
for (Instruction *MemInstr : MemoryInstrs) {		for (Instruction *MemInstr : MemoryInstrs) {
// If a barrier memory instruction was found, do not check past it.		// If a barrier memory instruction was found, do not check past it.
if (BarrierMemoryInstr && OBB.dominates(BarrierMemoryInstr, MemInstr))		if (BarrierMemoryInstr && BarrierMemoryInstr->comesBefore(MemInstr))
break;		break;

auto *MemLoad = dyn_cast<LoadInst>(MemInstr);		auto *MemLoad = dyn_cast<LoadInst>(MemInstr);
auto *ChainLoad = dyn_cast<LoadInst>(ChainInstr);		auto *ChainLoad = dyn_cast<LoadInst>(ChainInstr);
if (MemLoad && ChainLoad)		if (MemLoad && ChainLoad)
continue;		continue;

// We can ignore the alias if the we have a load store pair and the load		// We can ignore the alias if the we have a load store pair and the load
// is known to be invariant. The load cannot be clobbered by the store.		// is known to be invariant. The load cannot be clobbered by the store.
auto IsInvariantLoad = [](const LoadInst *LI) -> bool {		auto IsInvariantLoad = [](const LoadInst *LI) -> bool {
return LI->getMetadata(LLVMContext::MD_invariant_load);		return LI->getMetadata(LLVMContext::MD_invariant_load);
};		};

// We can ignore the alias as long as the load comes before the store,		// We can ignore the alias as long as the load comes before the store,
// because that means we won't be moving the load past the store to		// because that means we won't be moving the load past the store to
// vectorize it (the vectorized load is inserted at the location of the		// vectorize it (the vectorized load is inserted at the location of the
// first load in the chain).		// first load in the chain).
if (isa<StoreInst>(MemInstr) && ChainLoad &&		if (isa<StoreInst>(MemInstr) && ChainLoad &&
(IsInvariantLoad(ChainLoad) \|\| OBB.dominates(ChainLoad, MemInstr)))		(IsInvariantLoad(ChainLoad) \|\| ChainLoad->comesBefore(MemInstr)))
continue;		continue;

// Same case, but in reverse.		// Same case, but in reverse.
if (MemLoad && isa<StoreInst>(ChainInstr) &&		if (MemLoad && isa<StoreInst>(ChainInstr) &&
(IsInvariantLoad(MemLoad) \|\| OBB.dominates(MemLoad, ChainInstr)))		(IsInvariantLoad(MemLoad) \|\| MemLoad->comesBefore(ChainInstr)))
continue;		continue;

if (!AA.isNoAlias(MemoryLocation::get(MemInstr),		if (!AA.isNoAlias(MemoryLocation::get(MemInstr),
MemoryLocation::get(ChainInstr))) {		MemoryLocation::get(ChainInstr))) {
LLVM_DEBUG({		LLVM_DEBUG({
dbgs() << "LSV: Found alias:\n"		dbgs() << "LSV: Found alias:\n"
" Aliasing instruction and pointer:\n"		" Aliasing instruction and pointer:\n"
<< " " << *MemInstr << '\n'		<< " " << *MemInstr << '\n'
Show All 9 Lines	for (Instruction *MemInstr : MemoryInstrs) {
}		}
}		}
// Continue the search only for store chains, since vectorizing stores that		// Continue the search only for store chains, since vectorizing stores that
// precede an aliasing load is valid. Conversely, vectorizing loads is valid		// precede an aliasing load is valid. Conversely, vectorizing loads is valid
// up to an aliasing store, but should not pull loads from further down in		// up to an aliasing store, but should not pull loads from further down in
// the basic block.		// the basic block.
if (IsLoadChain && BarrierMemoryInstr) {		if (IsLoadChain && BarrierMemoryInstr) {
// The BarrierMemoryInstr is a store that precedes ChainInstr.		// The BarrierMemoryInstr is a store that precedes ChainInstr.
assert(OBB.dominates(BarrierMemoryInstr, ChainInstr));		assert(BarrierMemoryInstr->comesBefore(ChainInstr));
break;		break;
}		}
}		}

// Find the largest prefix of Chain whose elements are all in		// Find the largest prefix of Chain whose elements are all in
// ChainInstrs[0, ChainInstrIdx). This is the largest vectorizable prefix of		// ChainInstrs[0, ChainInstrIdx). This is the largest vectorizable prefix of
// Chain. (Recall that Chain is in address order, but ChainInstrs is in BB		// Chain. (Recall that Chain is in address order, but ChainInstrs is in BB
// order.)		// order.)
▲ Show 20 Lines • Show All 550 Lines • Show Last 20 Lines

llvm/unittests/Analysis/CMakeLists.txt

Show All 13 Lines	add_llvm_unittest(AnalysisTests
CFGTest.cpp		CFGTest.cpp
CGSCCPassManagerTest.cpp		CGSCCPassManagerTest.cpp
GlobalsModRefTest.cpp		GlobalsModRefTest.cpp
ValueLatticeTest.cpp		ValueLatticeTest.cpp
LazyCallGraphTest.cpp		LazyCallGraphTest.cpp
LoopInfoTest.cpp		LoopInfoTest.cpp
MemoryBuiltinsTest.cpp		MemoryBuiltinsTest.cpp
MemorySSATest.cpp		MemorySSATest.cpp
OrderedBasicBlockTest.cpp
OrderedInstructionsTest.cpp		OrderedInstructionsTest.cpp
PhiValuesTest.cpp		PhiValuesTest.cpp
ProfileSummaryInfoTest.cpp		ProfileSummaryInfoTest.cpp
ScalarEvolutionTest.cpp		ScalarEvolutionTest.cpp
SparsePropagation.cpp		SparsePropagation.cpp
TargetLibraryInfoTest.cpp		TargetLibraryInfoTest.cpp
TBAATest.cpp		TBAATest.cpp
UnrollAnalyzerTest.cpp		UnrollAnalyzerTest.cpp
ValueTrackingTest.cpp		ValueTrackingTest.cpp
)		)

llvm/unittests/Analysis/OrderedBasicBlockTest.cpp

This file was deleted.

	//===- OrderedBasicBlockTest.cpp - OrderedBasicBlock unit tests -----------===//
	//
	// The LLVM Compiler Infrastructure
	//
	// This file is distributed under the University of Illinois Open Source
	// License. See LICENSE.TXT for details.
	//
	//===----------------------------------------------------------------------===//

	#include "llvm/Analysis/OrderedBasicBlock.h"
	#include "llvm/AsmParser/Parser.h"
	#include "llvm/IR/BasicBlock.h"
	#include "llvm/IR/Function.h"
	#include "llvm/IR/LLVMContext.h"
	#include "llvm/IR/Module.h"
	#include "llvm/Support/DataTypes.h"
	#include "llvm/Support/SourceMgr.h"
	#include "gtest/gtest.h"

	namespace llvm {
	namespace {

	class OrderedBasicBlockTest : public testing::Test {
	protected:
	LLVMContext C;

	std::unique_ptr<Module> makeLLVMModule() {
	const char *ModuleString = R"(define i32 @f(i32 %x) {
	%add = add i32 %x, 42
	ret i32 %add
	})";
	SMDiagnostic Err;
	auto foo = parseAssemblyString(ModuleString, Err, C);
	return foo;
	}
	};

	TEST_F(OrderedBasicBlockTest, Basic) {
	auto M = makeLLVMModule();
	Function *F = M->getFunction("f");
	BasicBlock::iterator I = F->front().begin();
	Instruction Add = &I++;
	Instruction Ret = &I++;

	OrderedBasicBlock OBB(&F->front());
	// Intentionally duplicated to verify cached and uncached are the same.
	EXPECT_FALSE(OBB.dominates(Add, Add));
	EXPECT_FALSE(OBB.dominates(Add, Add));
	EXPECT_TRUE(OBB.dominates(Add, Ret));
	EXPECT_TRUE(OBB.dominates(Add, Ret));
	EXPECT_FALSE(OBB.dominates(Ret, Add));
	EXPECT_FALSE(OBB.dominates(Ret, Add));
	EXPECT_FALSE(OBB.dominates(Ret, Ret));
	EXPECT_FALSE(OBB.dominates(Ret, Ret));
	}

	} // end anonymous namespace
	} // end namespace llvm

llvm/unittests/IR/BasicBlockTest.cpp

//===- llvm/unittest/IR/BasicBlockTest.cpp - BasicBlock unit tests --------===//		//===- llvm/unittest/IR/BasicBlockTest.cpp - BasicBlock unit tests --------===//
//		//
// The LLVM Compiler Infrastructure		// The LLVM Compiler Infrastructure
//		//
// This file is distributed under the University of Illinois Open Source		// This file is distributed under the University of Illinois Open Source
// License. See LICENSE.TXT for details.		// License. See LICENSE.TXT for details.
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#include "llvm/IR/BasicBlock.h"		#include "llvm/IR/BasicBlock.h"
#include "llvm/ADT/STLExtras.h"		#include "llvm/ADT/STLExtras.h"
		#include "llvm/AsmParser/Parser.h"
#include "llvm/IR/Function.h"		#include "llvm/IR/Function.h"
#include "llvm/IR/IRBuilder.h"		#include "llvm/IR/IRBuilder.h"
#include "llvm/IR/LLVMContext.h"		#include "llvm/IR/LLVMContext.h"
#include "llvm/IR/Module.h"		#include "llvm/IR/Module.h"
#include "llvm/IR/NoFolder.h"		#include "llvm/IR/NoFolder.h"
		#include "llvm/Support/SourceMgr.h"
#include "gmock/gmock-matchers.h"		#include "gmock/gmock-matchers.h"
#include "gtest/gtest.h"		#include "gtest/gtest.h"
#include <memory>		#include <memory>

namespace llvm {		namespace llvm {
namespace {		namespace {

TEST(BasicBlockTest, PhiRange) {		TEST(BasicBlockTest, PhiRange) {
▲ Show 20 Lines • Show All 98 Lines • ▼ Show 20 Lines	TEST(BasicBlockTest, TestInstructionsWithoutDebug) {
SmallVector<Instruction *, 4> Exp = {Var, AddInst, MulInst, SubInst};		SmallVector<Instruction *, 4> Exp = {Var, AddInst, MulInst, SubInst};
CHECK_ITERATORS(BB1->instructionsWithoutDebug(), Exp);		CHECK_ITERATORS(BB1->instructionsWithoutDebug(), Exp);
CHECK_ITERATORS(BBConst->instructionsWithoutDebug(), Exp);		CHECK_ITERATORS(BBConst->instructionsWithoutDebug(), Exp);

delete M;		delete M;
delete V;		delete V;
}		}

		TEST(BasicBlockTest, ComesBefore) {
		const char *ModuleString = R"(define i32 @f(i32 %x) {
		%add = add i32 %x, 42
		ret i32 %add
		})";
		LLVMContext Ctx;
		SMDiagnostic Err;
		auto M = parseAssemblyString(ModuleString, Err, Ctx);
		ASSERT_TRUE(M.get());

		Function *F = M->getFunction("f");
		BasicBlock &BB = F->front();
		BasicBlock::iterator I = BB.begin();
		Instruction Add = &I++;
		Instruction Ret = &I++;

		// Intentionally duplicated to verify cached and uncached are the same.
		EXPECT_FALSE(Add->comesBefore(Add));
		EXPECT_FALSE(Add->comesBefore(Add));
		BB.invalidateOrders();
		mkazantsevUnsubmitted Done Reply Inline Actions `EXPECT_TRUE/FALSE(BB->isInstrOrderValid())` before and after that to make sure that it works at all? mkazantsev: `EXPECT_TRUE/FALSE(BB->isInstrOrderValid())` before and after that to make sure that it works…
		EXPECT_TRUE(Add->comesBefore(Ret));
		EXPECT_TRUE(Add->comesBefore(Ret));
		BB.invalidateOrders();
		EXPECT_FALSE(Ret->comesBefore(Add));
		EXPECT_FALSE(Ret->comesBefore(Add));
		BB.invalidateOrders();
		EXPECT_FALSE(Ret->comesBefore(Ret));
		EXPECT_FALSE(Ret->comesBefore(Ret));
		}

} // End anonymous namespace.		} // End anonymous namespace.
} // End llvm namespace.		} // End llvm namespace.
		mkazantsevUnsubmitted Not Done Reply Inline Actions Do you mind adding the similar check for `Instruction->removeFromParent` and `Instruction->eraseFromParent`? mkazantsev: Do you mind adding the similar check for `Instruction->removeFromParent` and `Instruction…
		rnkAuthorUnsubmitted Not Done Reply Inline Actions Done, but they don't invalidate ordering, so I check for that instead. rnk: Done, but they don't invalidate ordering, so I check for that instead.

This is an archive of the discontinued LLVM Phabricator instance.

[IR] Lazily number instructions for local dominance queriesClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 163948

llvm/include/llvm/Analysis/AliasAnalysis.h

llvm/include/llvm/Analysis/CaptureTracking.h

llvm/include/llvm/Analysis/OrderedBasicBlock.h

llvm/include/llvm/Analysis/OrderedInstructions.h

llvm/include/llvm/IR/BasicBlock.h

llvm/include/llvm/IR/Instruction.h

llvm/include/llvm/IR/SymbolTableListTraits.h

llvm/lib/Analysis/AliasAnalysis.cpp

llvm/lib/Analysis/CMakeLists.txt

llvm/lib/Analysis/CaptureTracking.cpp

llvm/lib/Analysis/InstructionPrecedenceTracking.cpp

llvm/lib/Analysis/MemoryDependenceAnalysis.cpp

llvm/lib/Analysis/OrderedBasicBlock.cpp

llvm/lib/Analysis/OrderedInstructions.cpp

llvm/lib/IR/BasicBlock.cpp

llvm/lib/IR/Instruction.cpp

llvm/lib/IR/SymbolTableListTraitsImpl.h

llvm/lib/Transforms/Scalar/DeadStoreElimination.cpp

llvm/lib/Transforms/Vectorize/LoadStoreVectorizer.cpp

llvm/unittests/Analysis/CMakeLists.txt

llvm/unittests/Analysis/OrderedBasicBlockTest.cpp

llvm/unittests/IR/BasicBlockTest.cpp

[IR] Lazily number instructions for local dominance queries
ClosedPublic