This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
lib/Transforms/IPO/
-
Transforms/
-
IPO/
45/55
FunctionAttrs.cpp
-
test/
-
Other/
-
cgscc-devirt-iteration.ll
-
Transforms/
-
FunctionAttrs/
-
2008-09-03-Mutual.ll
-
2008-09-03-ReadNone.ll
-
2008-09-03-ReadOnly.ll
-
2008-09-13-VolatileRead.ll
-
2008-12-29-Constant.ll
-
2009-01-02-LocalStores.ll
-
2010-10-30-volatile.ll
-
assume.ll
-
atomic.ll
-
comdat-ipo.ll
-
convergent.ll
-
int_sideeffect.ll
-
nocapture.ll
-
nonnull-global.ll
-
nonnull.ll
-
norecurse.ll
-
operand-bundles-scc.ll
-
optnone.ll
-
out-of-bounds-iterator-bug.ll
-
readnone.ll
-
returned.ll
-
Inline/
-
cgscc-update.ll
-
PruneEH/
-
2008-06-02-Weak.ll
-
ipo-nounwind.ll
-
operand-bundles.ll
-
pr23971.ll
-
pr26263.ll
-
recursivetest.ll
-
seh-nounwind.ll
-
simpletest.ll

Differential D44415

[PM][FunctionAttrs] add NoUnwind attribute inference to PostOrderFunctionAttrs pass
ClosedPublic

Authored by fedor.sergeev on Mar 13 2018, 4:11 AM.

Download Raw Diff

Details

Reviewers

chandlerc
jlebar

Commits

rG6660fd0f959e: [PM][FunctionAttrs] add NoUnwind attribute inference to PostOrderFunctionAttrs…
rL328377: [PM][FunctionAttrs] add NoUnwind attribute inference to PostOrderFunctionAttrs…

Summary

This was motivated by absence of PrunEH functionality in new PM.
It was decided that a proper way to do PruneEH is to add NoUnwind inference
into PostOrderFunctionAttrs and then perform normal SimplifyCFG on top.

This change generalizes attribute handling implemented for (a removal of)
Convergent attribute, by introducing a generic builder-like class

AttributeInferer

It registers all the attribute inference requests, storing per-attribute
predicates into a vector, and then goes through an SCC Node, scanning all
the instructions for not breaking attribute assumptions.

The main idea is that as soon all the instructions from all the functions
of SCC Node conform to attribute assumptions then we are free to infer
the attribute as set for all the functions of SCC Node.

It handles two distinct cases of attributes:

those that might break due to derefinement of the function code

for these attributes we are allowed to apply inference only if all the functions are "exact definitions". Example - NoUnwind.
those that do not care about derefinement

for these attributes we are allowed to apply inference as soon as we see any function definition. Example - removal of Convergent attribute.

Also in this commit:

Converted all the FunctionAttrs tests to use FileCheck and added new-PM invocations to them

FunctionAttrs/convergent.ll test demonstrates a difference in behavior between new and old PM implementations. Marked with FIXME.

PruneEH tests were converted to new-PM as well, using function-attrs+simplify-cfg combo as intended

some of "other" tests were updated since function-attrs now infers 'nounwind' even for old PM pipeline

-disable-nounwind-inference hidden option added as a possible workaround for a supposedly rare case when nounwind being inferred by default presents a problem

Diff Detail

Repository

rL LLVM

Build Status

Buildable 16256
Build 16256: arc lint + arc unit

Event Timeline

fedor.sergeev created this revision.Mar 13 2018, 4:11 AM

Herald added a subscriber: eraman. · View Herald TranscriptMar 13 2018, 4:11 AM

Harbormaster completed remote builds in B15998: Diff 138148.Mar 13 2018, 4:11 AM

fedor.sergeev edited the summary of this revision. (Show Details)Mar 13 2018, 4:12 AM

cleaning up some tests, still a couple FIXMEs in tests

Harbormaster completed remote builds in B16016: Diff 138210.Mar 13 2018, 9:26 AM

all the tests updated, one FIXME left on convergent.ll test as intended

Harbormaster completed remote builds in B16021: Diff 138219.Mar 13 2018, 10:07 AM

fedor.sergeev edited the summary of this revision. (Show Details)Mar 13 2018, 10:10 AM

Hi, thanks for the patch.

I like the overall approach of making this pass generic. But I have some high-level comments on the patch:

Is it possible to explain in the code itself everything a user needs to know to understand the code? For example, I could not find where we define "derefinable", or derefinement. More broadly, it is not easy for me to follow the overall structure of this code just by reading the code.

It's a nit, but I would prefer to write all comments in proper English, so starting with capital letters, ending with periods (if they are complete sentences), using apostrophes in contractions (e.g. cant -> can't), etc. I think this will make it easier for others to read.

I'm somewhat confused by our notion of some instructions being "invalid". Perhaps that should have a different name.

I'm a bit concerned about the overhead of calling multiple virtual functions (std::function) for every instruction in the module, especially since our loop structure (which is the correct one!) does not have one easily-predicted virtual function call target, but instead iterates between call targets. I know we go through a lot of pain elsewhere in LLVM to avoid this. Since the list of functors is static, I wonder if we shouldn't metaprogram this. Using llvm::integer_sequence would make it not *too* bad.

But maybe that's a premature optimization; I definitely don't want to do it if there's no benefit. Perhaps @chandlerc can comment on this.

Sorry for delay sending these commetns -- i had them typed up and didn't mash send. =[[[

Also, to address (briefly) what Justin mentioned: I think getting cache locality of single visit of instructions is likely to be more valuable than the overhead of the indirect calls. Could be wrong, but we can also look to see if there is a problem.

lib/Transforms/IPO/FunctionAttrs.cpp
1048–1049	This should be a (more expansive I suspect) doxygen comment on the type.
1050–1055	The tuple here should almost certainly be a little struct to simplify the code. At that point, I don't know that you need the type aliases.
1060–1066	I think this should be named more along the lines of registering something as it doesn't actually do inference here. Currently, the doxygen comment doesn't add much value. Instead, I would suggest that the doxygen comment should explain what the semantics of these predicates are, how they are used, and mention what is different about the derefinement.
1081	This routine is only going to be run once over an SCC. I think that suggests a much simpler implementation. As we process the SCC nodes and the instructions within each, we can use a `remove_if` (or better `erase_if`) pattern so that as predicates fail, we simple remove the struct w/ the callbacks for that attribute. Then at the end, we run all the callbacks that remain. This should allow you to not have index based walks or the bit vectors.

Also, to address (briefly) what Justin mentioned: I think getting cache locality of single visit of instructions is likely to be more valuable than the overhead of the indirect calls. Could be wrong, but we can also look to see if there is a problem.

Totally agree, but I think I wasn't clear, that's a different comparison from the one I was trying to make. The question I was trying ask is, given the current loop structure (which I think you're right, is the right structure), is the overhead of indirect calls vs direct (possibly inlined) calls (generated via metaprogramming) significant?

In D44415#1036911, @jlebar wrote:

Also, to address (briefly) what Justin mentioned: I think getting cache locality of single visit of instructions is likely to be more valuable than the overhead of the indirect calls. Could be wrong, but we can also look to see if there is a problem.

Totally agree, but I think I wasn't clear, that's a different comparison from the one I was trying to make. The question I was trying ask is, given the current loop structure (which I think you're right, is the right structure), is the overhead of indirect calls vs direct (possibly inlined) calls (generated via metaprogramming) significant?

Probably not? But maybe?

That said, generating direct calls here seems really, really hard... Maybe you have an idea I don't see...

The only way I see to make them not indirect, we would need to have heterogeneous collection of callbacks for the various attributes and iterate across it heterogeneously.... Essentially, we'd need to take each collection of callbacks in the constructor so we can deduce all of their types and the number of them in one go, build a tuple of them, and then have the 'run' thing actually use tuple-iteration (maybe w/ a variadic function template) instead of a loop. The complexity of that code just didn't seem worth it w/o a benchmark...

generating direct calls here seems really, really hard... Maybe you have an idea I don't see...

It's...not awful. https://godbolt.org/g/D3KMQH

#include <array>
#include <vector>
#include <utility>

struct Instruction;

struct FooDescriptor {
  static bool IsGood(Instruction* I) { return true; }
};

struct BarDescriptor {
  static bool IsGood(Instruction* I) { return false; }
};

template <typename... Descriptors>
struct AttrInferer {
    void Foo(std::vector<Instruction*> Instrs) {
        for (Instruction* Instr : Instrs) {
            bool IsGood[] = {
                [&] { return Descriptors::IsGood(Instr); }()...
            };
            for (int i = 0; i < sizeof...(Descriptors); ++i) {
                
            }
        }
    }
};

void Test() {
    AttrInferer<FooDescriptor, BarDescriptor> a;
    a.Foo({});
}

Anyway if you think the virtual functions aren't a big deal, that's good enough for me.

Thanks everybody for good comments, will see into them.
Will put virtual functions issue at the very end of the queue.

In D44415#1036829, @jlebar wrote:

Is it possible to explain in the code itself everything a user needs to know to understand the code? For example, I could not find where we define "derefinable", or derefinement.

I'm not sure about proper usage of this term either. It came from my discussions with Chandler, but if it sounds confusing ....
Anyway, I tried to explain the term in my commit message.

More broadly, it is not easy for me to follow the overall structure of this code just by reading the code.

You mean overall structure of AttributeInferer::run?

It's a nit, but I would prefer to write all comments in proper English, so starting with capital letters, ending with periods

Yeah, it is my biggest common mistake with the style :-/

fedor.sergeev added inline comments.Mar 14 2018, 4:13 PM

lib/Transforms/IPO/FunctionAttrs.cpp
1081	I can use remove/erase for the purpose of "Valid" attributes tracking. However I still need a couple of deducible boolean facts separately tracked for each attribute -"ScanHere" and "Ready". I can try adding those into Predicates structure and clean/set them accordingly. Lets see how it looks like in implementation suggested by Justin (which, btw, does have an index walk ;) ).

fixing comments, adding struct instead of tuple.
Addressed pretty much everything except of bitvectors and virtual functions,
just to have some ready base for comparison with other solutions.

Harbormaster completed remote builds in B16108: Diff 138512.Mar 15 2018, 2:51 AM

fedor.sergeev marked 3 inline comments as done.Mar 15 2018, 3:02 AM

I like it.

I have a suggestions on the comments, but that's...pretty normal for me. :)

I think that if we don't go with the variadic templates thing -- which is probably a premature optimization, I trust Chandler here -- then going with Chandler's suggestion and getting rid of most or all of the bitsets would be a good simplification.

lib/Transforms/IPO/FunctionAttrs.cpp
1056	Can we expand upon what we mean by an "exact" definition?
1059	Should we say this function can be null (and if so, all functions need to be scanned)?
1069	Perhaps a different name for this vector, now that the struct has a name.
1147	This is kind of confusing right above the `auto &AP =` line, because it's not really describing that line... Perhaps we could have a comment on the loop explaining at a higher level what we're doing?
1151	s/Perhaps,/Perhaps/
1151	This comment is probably obvious from the line above now.
1154	else if (...) { } ?
1157	can't
1214	We should indicate somewhere (on the struct definition?) that if a function is skipped, it's skipped from both scanning and from having the attribute set on it. That was surprising to me.
1219	Perhaps instead of introducing this new "invalid" terminology, we should call it something related to one of the names in InferenceDescriptor?
1229	Perhaps this function would be clearer as: if (I.mayThrow()) return false; if (...) { if (SCCNodes.count(Callee)) { // I is a may-throw call to a function inside our SCC. This doesn't invalidate our working assumption that the SCC is no-throw; we have to scan that other function. return false; } } // This instruction may throw, and it's not a call to another function in our SCC. It therefore invalidates our non-throwing assumption. return true;
1260	Suggest `/RequiresExactDefinition=/false` this way it's not ambiguous whether we're saying that false means it does require an exact definition, or that it doesn't.
1275	s/Note, that/Note that/
1290	Please re-clang-format

addressing most of Justin's comments on comments ;)

Harbormaster completed remote builds in B16196: Diff 138893.Mar 19 2018, 4:27 AM

fedor.sergeev marked 11 inline comments as done.Mar 19 2018, 4:33 AM

fedor.sergeev added inline comments.

lib/Transforms/IPO/FunctionAttrs.cpp
1056	I'm not sure how can I do that w/o duplicating a huge comment for GlobalValue::isDefinitionExact.
1059	SkipFunc is not intended to be null. Comment added below to clarify the intended use.
1290	Err... clang-format behaves badly on this :( It formats two calls of registerAttrInference differently. So I took one format as is and hand-edited another invocation correspondingly. Will repeat the same before integration as well.

fedor.sergeev marked an inline comment as done.Mar 19 2018, 4:50 AM

Btw, I tried Justin's version of "simplified" meta-programmed callbacks... and fully functional version of it
(one that performs early bailouts on ScanAttrHere/ValidAttrs) starts approaching Chandler's suggested tuple/tuple-iteration by complexity.
The problem with bailouts is that you cant just fill the whole array with calculated values of all the predicates,
or else you need to extend all your predicates with shortcut-ting semantics.

getting rid of bitvectors

Harbormaster completed remote builds in B16223: Diff 139023.Mar 19 2018, 4:13 PM

making InstrBreaksNonThrowing a bit more readable, introducing -disable-nounwind-inference option just in case

fedor.sergeev edited the summary of this revision. (Show Details)Mar 20 2018, 3:46 AM

fedor.sergeev edited the summary of this revision. (Show Details)

fedor.sergeev marked 3 inline comments as done.

just to clarify - I believe this is ready to go :)

jlebar added inline comments.Mar 21 2018, 7:55 AM

lib/Transforms/IPO/FunctionAttrs.cpp
1056	Could we just have a pointer to GlobalValue::isDefinitionExact? As in If true, only "exact" definitions can be used to infer this attribute. (See GlobalValue::isDefinitionExact.)
1072	If we're going to mix both "properties" of the attribute and "state" of the algorithm in the same class, can we at least have a comment indicating that this is what we're doing, and all variables below this point are mutable state used by the algorithm?
1073	Capital
1073	s/still hold valid/are still valid/
1080	Capital
1082	I don't know how Chandler feels about it, but personally I'm not wild about the approach of sticking state into this struct. It uses the descriptor class for two quite different purposes, and perhaps worse, it's really easy to have a bug where we forget to reset ScanCurrent at the top of the loop... I think Chandler's suggestion was, instead of tracking the state with a bitset, track it by removing elements from a list (or adding them to a list of "attributes to use when scanning this function"). It looks like the difficulty you face here is that the list you want to remove elements from is also the list you're iterating over? I'd be OK with pretty much any of the obvious solutions to that -- remove and adjust the loop index, use an std::list and remove in place, remove after the loop completes... Dunno if Chandler has feels about this too.
1290	So I took one format as is and hand-edited another invocation correspondingly. That's fine with me, so long as we don't expect that future modifications to this part of the file will preserve the format. People should be able to clang-format their changes and expect it's "good enough".

fedor.sergeev marked an inline comment as done.Mar 21 2018, 2:14 PM

fedor.sergeev added inline comments.

lib/Transforms/IPO/FunctionAttrs.cpp
1082	Well, I dont like merging descriptor with state either. In my original implementation I had those in separate vectors, which forced me to use indices as the way to link descriptors and states. Chandler's suggestion indeed was to remove elements. However it does not play well with the need to have as many early exits as possible (which was a property of implementation before my changes). Namely, there is a single meaningful vector here, a vector of descriptors, and while I can use removal from it to signify a single binary fact (e.g. Valid), there are more facts to track. Should I introduce copies of the main vector? How would it be different from having bit-vectors? I really do not see an ideal solution here. I can introduce a vector< pair<descriptor,state> >, which is rather clumsy. Or I can introduce a more complicated interface to the descriptor class, with read/write accessors as needed, and service functions that do initialization of mutables as needed.
1290	Most recent version is clang-formatted. This particular problem was solved by reordering arguments of InferenceDescriptor constructor.

fedor.sergeev marked an inline comment as done.Mar 21 2018, 2:16 PM

fedor.sergeev added inline comments.

lib/Transforms/IPO/FunctionAttrs.cpp
1080	I wonder if capitalizing "true" which is a boolean value and not an English word here is fine?

jlebar added inline comments.Mar 21 2018, 2:33 PM

lib/Transforms/IPO/FunctionAttrs.cpp
1080	I think of the boolean as being true, which in C++ might mean it has value `true`, in Python might mean it has value `True`, in PHP might mean it has value `"True"` :)... That is, I think you can say "this Python variable is true" with a lower-case "t", and that's just fine. But that's quite a nit, whatever you like is fine. :)
1082	Should I introduce copies of the main vector? That's what I was thinking. How would it be different from having bit-vectors? I think the notion was that you'd iterate over the vector that's being modified, so there's less indirection in the code. In pseudocode: ScannedSomeFunction = false; SCCDescriptors = InterfaceDescriptors; for (F : SCCNodes) { FnDescriptors = [] for (Desc : SCCDescriptors) { if (F->isDeclaration() \|\| (ID.RequiresExactDefinition && !F->hasExactDefinition())) { SCCDescriptors.Remove(Desc); } else if (!Desc.SkipFunction(F)) { FnDescriptors.Append(F); } } for (I : instructions(F)) { if (FnDescriptors.empty()) break; for (Desc : FnDescriptors) { if (Desc.InstrBreaksAttribute(I)) { FnDescriptors.Remove(Desc); SCCDescriptors.Remove(Desc); } } } for (Desc : SCCDescriptors) { if (Desc.SkipFunction(F)) continue; Desc.SetAttribute(F) } } I don't have SuccessfullyScanned here, but I think it's actually not necessary, because SkipFunction is the only reason we wouldn't scan something, and if we skipped it, we also won't set the attr on it.

fedor.sergeev added inline comments.Mar 21 2018, 2:37 PM

lib/Transforms/IPO/FunctionAttrs.cpp
1082	Ahem... that should work, indeed. Lemme try it.

minor cleanup first

Harbormaster completed remote builds in B16327: Diff 139374.Mar 21 2018, 2:53 PM

trying with erase_if

Harbormaster completed remote builds in B16339: Diff 139426.Mar 22 2018, 4:06 AM

fedor.sergeev marked 2 inline comments as done.Mar 22 2018, 4:08 AM

fedor.sergeev added inline comments.

lib/Transforms/IPO/FunctionAttrs.cpp
1082	Done more or less along the lines of your suggestion, Justin (applying Chandler's advice about using erase_if). The only catch is that I had to add AttrKind to the descriptor to be able to match descriptors in different vectors (to implement your : FnDescriptors.Remove + SCCDescriptors.Remove).

I like the algorithm! Just comments on comments, then I think I'm happy. Dunno if @chandlerc wants to have a look or not...

lib/Transforms/IPO/FunctionAttrs.cpp
1054	Remove newline?
1057	Update comment?
1076	s/)/.)/
1103	Remove newline?
1106	... "for each of them. We'll remove attributes which aren't valid for the SCC from InferInSCC."
1147	This comment may be more confusing than not at this point.
1147	s/meeting/visiting/ (idom, sorry)
1148	has an unsuitable definition
1149	Nit, remove outer parens?
1155	Suggest moving the comment up one line and saying something like For each attribute still in InferInSCC that doesn't skip F, check that the instructions in F are valid.
1165	Remove newline?
1169	Perhaps this can get a "because" -- it's kind of the crux of the whole algorithm, and it's quite hidden here.
1178	If you move this check to the top of the loop, then you don't need the check before the loop.

addressing Justin's comments

Harbormaster completed remote builds in B16368: Diff 139528.Mar 22 2018, 3:46 PM

fedor.sergeev marked 22 inline comments as done.Mar 22 2018, 3:51 PM

more comments cleanup

fedor.sergeev marked 5 inline comments as done.Mar 22 2018, 3:55 PM

fedor.sergeev added inline comments.

lib/Transforms/IPO/FunctionAttrs.cpp
1178	Perhaps its a bit of over-optimization, but I dont want to touch instructions(F) (enter the instructions loop) just to determine that we need to break out. I dont feel that having these two checks is detrimental to readability.

Hey, I can almost sign off on this, but two of the comments are not clear. I'm happy to provide suggestions if you like.

lib/Transforms/IPO/FunctionAttrs.cpp
1155	Sorry, I think the reworded comment is still unclear. If you're willing to tell me what you don't like about the suggestion I made earlier, perhaps we can come up with something we both like.
1170	This comment needs to be reworded, sorry.

minor comments cleanup

fedor.sergeev marked an inline comment as not done.Mar 23 2018, 11:19 AM

fedor.sergeev added inline comments.

lib/Transforms/IPO/FunctionAttrs.cpp
1155	Mostly I didnt like "instructions are valid" part, which is not what we are checking for. If you dont like my last variant - please, suggest your own that talks about attribute, not instruction.

\o/

lib/Transforms/IPO/FunctionAttrs.cpp
1155	What you came up with looks great to me. Thank you for being patient with me -- I know I'm not an easy reviewer.
1169	lgtm, thanks again for being patient with me.

This revision is now accepted and ready to land.Mar 23 2018, 12:50 PM

Thanks for following up till the very end of it! :)
Your feeling of language is a bit over my capabilities, but I do like to learn here.

minor semantical update - calculation of return value corrected for AttributeInferer::run,
returning true only when real changes are detected.
Fixes regression introduced by most recent algorithmic update.

Closed by commit rL328377: [PM][FunctionAttrs] add NoUnwind attribute inference to PostOrderFunctionAttrs… (authored by fedor.sergeev). · Explain WhyMar 23 2018, 2:49 PM

This revision was automatically updated to reflect the committed changes.

aeubanks mentioned this in D90012: [PruneEH] Pin tests to legacy PM.Oct 22 2020, 10:34 PM

aeubanks mentioned this in rGd673beee55c5: [PruneEH] Pin tests to legacy PM.Oct 29 2020, 6:25 PM

speryt mentioned this in D134686: [NFC][2/n] Remove PrunePH pass.Sep 26 2022, 5:36 PM

aeubanks mentioned this in rG46fc75ab28b7: [NFC][2/n] Remove PrunePH pass.Sep 26 2022, 6:38 PM

Revision Contents

Path

Size

lib/

Transforms/

IPO/

FunctionAttrs.cpp

252 lines

test/

Other/

cgscc-devirt-iteration.ll

25 lines

Transforms/

FunctionAttrs/

2008-09-03-Mutual.ll

9 lines

2008-09-03-ReadNone.ll

21 lines

2008-09-03-ReadOnly.ll

1 line

2008-09-13-VolatileRead.ll

5 lines

2008-12-29-Constant.ll

6 lines

2009-01-02-LocalStores.ll

1 line

2010-10-30-volatile.ll

8 lines

1 line

5 lines

1 line

10 lines

12 lines

2 lines

1 line

2 lines

48 lines

operand-bundles-scc.ll

8 lines

optnone.ll

3 lines

out-of-bounds-iterator-bug.ll

1 line

readnone.ll

1 line

returned.ll

1 line

Inline/

cgscc-update.ll

16 lines

PruneEH/

10 lines

1 line

1 line

1 line

32 lines

7 lines

1 line

5 lines

Diff 139097

lib/Transforms/IPO/FunctionAttrs.cpp

Show First 20 Lines • Show All 68 Lines • ▼ Show 20 Lines
STATISTIC(NumReadOnly, "Number of functions marked readonly");		STATISTIC(NumReadOnly, "Number of functions marked readonly");
STATISTIC(NumNoCapture, "Number of arguments marked nocapture");		STATISTIC(NumNoCapture, "Number of arguments marked nocapture");
STATISTIC(NumReturned, "Number of arguments marked returned");		STATISTIC(NumReturned, "Number of arguments marked returned");
STATISTIC(NumReadNoneArg, "Number of arguments marked readnone");		STATISTIC(NumReadNoneArg, "Number of arguments marked readnone");
STATISTIC(NumReadOnlyArg, "Number of arguments marked readonly");		STATISTIC(NumReadOnlyArg, "Number of arguments marked readonly");
STATISTIC(NumNoAlias, "Number of function returns marked noalias");		STATISTIC(NumNoAlias, "Number of function returns marked noalias");
STATISTIC(NumNonNullReturn, "Number of function returns marked nonnull");		STATISTIC(NumNonNullReturn, "Number of function returns marked nonnull");
STATISTIC(NumNoRecurse, "Number of functions marked as norecurse");		STATISTIC(NumNoRecurse, "Number of functions marked as norecurse");
		STATISTIC(NumNoUnwind, "Number of functions marked as nounwind");

// FIXME: This is disabled by default to avoid exposing security vulnerabilities		// FIXME: This is disabled by default to avoid exposing security vulnerabilities
// in C/C++ code compiled by clang:		// in C/C++ code compiled by clang:
// http://lists.llvm.org/pipermail/cfe-dev/2017-January/052066.html		// http://lists.llvm.org/pipermail/cfe-dev/2017-January/052066.html
static cl::opt<bool> EnableNonnullArgPropagation(		static cl::opt<bool> EnableNonnullArgPropagation(
"enable-nonnull-arg-prop", cl::Hidden,		"enable-nonnull-arg-prop", cl::Hidden,
cl::desc("Try to propagate nonnull argument attributes from callsites to "		cl::desc("Try to propagate nonnull argument attributes from callsites to "
"caller functions."));		"caller functions."));

		static cl::opt<bool> DisableNoUnwindInference(
		"disable-nounwind-inference", cl::Hidden,
		cl::desc("Stop inferring nounwind attribute during function-attrs pass"));

namespace {		namespace {

using SCCNodeSet = SmallSetVector<Function *, 8>;		using SCCNodeSet = SmallSetVector<Function *, 8>;

} // end anonymous namespace		} // end anonymous namespace

/// Returns the memory access attribute for function F using AAR for AA results,		/// Returns the memory access attribute for function F using AAR for AA results,
/// where SCCNodes is the current SCC.		/// where SCCNodes is the current SCC.
▲ Show 20 Lines • Show All 938 Lines • ▼ Show 20 Lines	for (Function *F : SCCNodes) {
++NumNonNullReturn;		++NumNonNullReturn;
MadeChange = true;		MadeChange = true;
}		}
}		}

return MadeChange;		return MadeChange;
}		}

/// Remove the convergent attribute from all functions in the SCC if every		namespace {
/// callsite within the SCC is not convergent (except for calls to functions
/// within the SCC). Returns true if changes were made.		/// Collects a set of attribute inference requests and performs them all in one
static bool removeConvergentAttrs(const SCCNodeSet &SCCNodes) {		/// go on a single SCC Node. Inference involves scanning function bodies
// For every function in SCC, ensure that either		/// looking for instructions that violate attribute assumptions.
		chandlercUnsubmitted Done Reply Inline Actions This should be a (more expansive I suspect) doxygen comment on the type. chandlerc: This should be a (more expansive I suspect) doxygen comment on the type.
// * it is not convergent, or		/// As soon as all the bodies are fine we are free to set the attribute.
// * we can remove its convergent attribute.		/// Customization of inference for individual attributes is performed by
bool HasConvergentFn = false;		/// providing a handful of predicates for each attribute.
		class AttributeInferer {

		jlebarUnsubmitted Done Reply Inline Actions Remove newline? jlebar: Remove newline?
		public:
		chandlercUnsubmitted Done Reply Inline Actions The tuple here should almost certainly be a little struct to simplify the code. At that point, I don't know that you need the type aliases. chandlerc: The tuple here should almost certainly be a little struct to simplify the code. At that point…
		/// Describes a request for inference of a single attribute.
		jlebarUnsubmitted Done Reply Inline Actions Can we expand upon what we mean by an "exact" definition? jlebar: Can we expand upon what we mean by an "exact" definition?
		fedor.sergeevAuthorUnsubmitted Done Reply Inline Actions I'm not sure how can I do that w/o duplicating a huge comment for GlobalValue::isDefinitionExact. fedor.sergeev: I'm not sure how can I do that w/o duplicating a huge comment for GlobalValue…
		jlebarUnsubmitted Done Reply Inline Actions Could we just have a pointer to GlobalValue::isDefinitionExact? As in If true, only "exact" definitions can be used to infer this attribute. (See GlobalValue::isDefinitionExact.) jlebar: Could we just have a pointer to GlobalValue::isDefinitionExact? As in ```If true, only…
		struct InferenceDescriptor {
		jlebarUnsubmitted Done Reply Inline Actions Update comment? jlebar: Update comment?
		/// Returns true if this function does not have to be handled.
		/// General intent for this predicate is to provide an optimization
		jlebarUnsubmitted Done Reply Inline Actions Should we say this function can be null (and if so, all functions need to be scanned)? jlebar: Should we say this function can be null (and if so, all functions need to be scanned)?
		fedor.sergeevAuthorUnsubmitted Done Reply Inline Actions SkipFunc is not intended to be null. Comment added below to clarify the intended use. fedor.sergeev: SkipFunc is not intended to be null. Comment added below to clarify the intended use.
		/// for functions that do not need this attribute inference at all
		/// (say, for functions that already have the attribute).
		std::function<bool(const Function &)> SkipFunction;

		/// Returns true if this instruction violates attribute assumptions.
		std::function<bool(Instruction &)> InstrBreaksAttribute;

		chandlercUnsubmitted Done Reply Inline Actions I think this should be named more along the lines of registering something as it doesn't actually do inference here. Currently, the doxygen comment doesn't add much value. Instead, I would suggest that the doxygen comment should explain what the semantics of these predicates are, how they are used, and mention what is different about the derefinement. chandlerc: I think this should be named more along the lines of registering something as it doesn't…
		/// Sets the inferred attribute for this function.
		std::function<void(Function &)> SetAttribute;

		jlebarUnsubmitted Done Reply Inline Actions Perhaps a different name for this vector, now that the struct has a name. jlebar: Perhaps a different name for this vector, now that the struct has a name.
		/// Only "exact" definitions can be used to infer this attribute.
		bool RequiresExactDefinition;

		jlebarUnsubmitted Done Reply Inline Actions If we're going to mix both "properties" of the attribute and "state" of the algorithm in the same class, can we at least have a comment indicating that this is what we're doing, and all variables below this point are mutable state used by the algorithm? jlebar: If we're going to mix both "properties" of the attribute and "state" of the algorithm in the…
		/// assumptions of this attribute still hold valid.
		jlebarUnsubmitted Done Reply Inline Actions Capital jlebar: Capital
		jlebarUnsubmitted Done Reply Inline Actions s/still hold valid/are still valid/ jlebar: s/still hold valid/are still valid/
		bool Valid = true;

		/// Set to true for each function if we still need to scan for our
		jlebarUnsubmitted Done Reply Inline Actions s/)/.)/ jlebar: s/)/.)/
		/// attribute.
		bool ScanCurrent = false;

		/// true for "valid" attributes that went through at least one instruction
		jlebarUnsubmitted Done Reply Inline Actions Capital jlebar: Capital
		fedor.sergeevAuthorUnsubmitted Done Reply Inline Actions I wonder if capitalizing "true" which is a boolean value and not an English word here is fine? fedor.sergeev: I wonder if capitalizing "true" which is a boolean value and not an English word here is fine?
		jlebarUnsubmitted Done Reply Inline Actions I think of the boolean as being true, which in C++ might mean it has value `true`, in Python might mean it has value `True`, in PHP might mean it has value `"True"` :)... That is, I think you can say "this Python variable is true" with a lower-case "t", and that's just fine. But that's quite a nit, whatever you like is fine. :) jlebar: I think of the boolean as being true, which in C++ might mean it has value `true`, in Python…
		/// scan
		chandlercUnsubmitted Done Reply Inline Actions This routine is only going to be run once over an SCC. I think that suggests a much simpler implementation. As we process the SCC nodes and the instructions within each, we can use a `remove_if` (or better `erase_if`) pattern so that as predicates fail, we simple remove the struct w/ the callbacks for that attribute. Then at the end, we run all the callbacks that remain. This should allow you to not have index based walks or the bit vectors. chandlerc: This routine is only going to be run once over an SCC. I think that suggests a much simpler…
		fedor.sergeevAuthorUnsubmitted Done Reply Inline Actions I can use remove/erase for the purpose of "Valid" attributes tracking. However I still need a couple of deducible boolean facts separately tracked for each attribute -"ScanHere" and "Ready". I can try adding those into Predicates structure and clean/set them accordingly. Lets see how it looks like in implementation suggested by Justin (which, btw, does have an index walk ;) ). fedor.sergeev: I can use remove/erase for the purpose of "Valid" attributes tracking. However I still need a…
		bool SuccessfullyScanned = false;
		jlebarUnsubmitted Not Done Reply Inline Actions I don't know how Chandler feels about it, but personally I'm not wild about the approach of sticking state into this struct. It uses the descriptor class for two quite different purposes, and perhaps worse, it's really easy to have a bug where we forget to reset ScanCurrent at the top of the loop... I think Chandler's suggestion was, instead of tracking the state with a bitset, track it by removing elements from a list (or adding them to a list of "attributes to use when scanning this function"). It looks like the difficulty you face here is that the list you want to remove elements from is also the list you're iterating over? I'd be OK with pretty much any of the obvious solutions to that -- remove and adjust the loop index, use an std::list and remove in place, remove after the loop completes... Dunno if Chandler has feels about this too. jlebar: I don't know how Chandler feels about it, but personally I'm not wild about the approach of…
		fedor.sergeevAuthorUnsubmitted Not Done Reply Inline Actions Well, I dont like merging descriptor with state either. In my original implementation I had those in separate vectors, which forced me to use indices as the way to link descriptors and states. Chandler's suggestion indeed was to remove elements. However it does not play well with the need to have as many early exits as possible (which was a property of implementation before my changes). Namely, there is a single meaningful vector here, a vector of descriptors, and while I can use removal from it to signify a single binary fact (e.g. Valid), there are more facts to track. Should I introduce copies of the main vector? How would it be different from having bit-vectors? I really do not see an ideal solution here. I can introduce a vector< pair<descriptor,state> >, which is rather clumsy. Or I can introduce a more complicated interface to the descriptor class, with read/write accessors as needed, and service functions that do initialization of mutables as needed. fedor.sergeev: Well, I dont like merging descriptor with state either. In my original implementation I had…
		jlebarUnsubmitted Done Reply Inline Actions Should I introduce copies of the main vector? That's what I was thinking. How would it be different from having bit-vectors? I think the notion was that you'd iterate over the vector that's being modified, so there's less indirection in the code. In pseudocode: ScannedSomeFunction = false; SCCDescriptors = InterfaceDescriptors; for (F : SCCNodes) { FnDescriptors = [] for (Desc : SCCDescriptors) { if (F->isDeclaration() \|\| (ID.RequiresExactDefinition && !F->hasExactDefinition())) { SCCDescriptors.Remove(Desc); } else if (!Desc.SkipFunction(F)) { FnDescriptors.Append(F); } } for (I : instructions(F)) { if (FnDescriptors.empty()) break; for (Desc : FnDescriptors) { if (Desc.InstrBreaksAttribute(I)) { FnDescriptors.Remove(Desc); SCCDescriptors.Remove(Desc); } } } for (Desc : SCCDescriptors) { if (Desc.SkipFunction(F)) continue; Desc.SetAttribute(F) } } I don't have SuccessfullyScanned here, but I think it's actually not necessary, because SkipFunction is the only reason we wouldn't scan something, and if we skipped it, we also won't set the attr on it. jlebar: > Should I introduce copies of the main vector? That's what I was thinking. > How would it be…
		fedor.sergeevAuthorUnsubmitted Done Reply Inline Actions Ahem... that should work, indeed. Lemme try it. fedor.sergeev: Ahem... that should work, indeed. Lemme try it.
		fedor.sergeevAuthorUnsubmitted Done Reply Inline Actions Done more or less along the lines of your suggestion, Justin (applying Chandler's advice about using erase_if). The only catch is that I had to add AttrKind to the descriptor to be able to match descriptors in different vectors (to implement your : FnDescriptors.Remove + SCCDescriptors.Remove). fedor.sergeev: Done more or less along the lines of your suggestion, Justin (applying Chandler's advice about…

		InferenceDescriptor(std::function<bool(const Function &)> SkipFunc,
		std::function<bool(Instruction &)> InstrScan,
		std::function<void(Function &)> SetAttr,
		bool ReqExactDef)
		: SkipFunction(SkipFunc), InstrBreaksAttribute(InstrScan),
		SetAttribute(SetAttr), RequiresExactDefinition(ReqExactDef) {}
		};

		private:
		SmallVector<InferenceDescriptor, 4> InferenceDescriptors;

		public:
		void registerAttrInference(InferenceDescriptor AttrInference) {
		InferenceDescriptors.push_back(AttrInference);
		}

		bool run(const SCCNodeSet &SCCNodes);
		};

		/// Perform all the requested attribute inference actions according to the
		jlebarUnsubmitted Done Reply Inline Actions Remove newline? jlebar: Remove newline?
		/// attribute predicates stored before.
		bool AttributeInferer::run(const SCCNodeSet &SCCNodes) {

		jlebarUnsubmitted Done Reply Inline Actions ... "for each of them. We'll remove attributes which aren't valid for the SCC from InferInSCC." jlebar: ... "for each of them. We'll remove attributes which aren't valid for the SCC from InferInSCC."
		bool ScannedSomeFunc = false;

		// Go through all the functions in SCC and check corresponding attribute
		// assumptions for each of them.
for (Function *F : SCCNodes) {		for (Function *F : SCCNodes) {
if (!F->isConvergent()) continue;
HasConvergentFn = true;

// Can't remove convergent from function declarations.		llvm::erase_if(InferenceDescriptors,
if (F->isDeclaration()) return false;		[](const InferenceDescriptor &ID) { return !ID.Valid; });

		// No attributes whose assumptions are still valid - done.
		if (InferenceDescriptors.empty())
		return false;

		// Check if our attributes ever need scanning/can be scanned.
		bool ScanThisFunc = false;
		for (auto &ID : InferenceDescriptors) {
		if (!ID.Valid) {
		ID.ScanCurrent = false;
		} else if (ID.SkipFunction(*F))
		// This function is explicitly skipped from inference w/o breaking the
		// main per-Function loop. Perhaps it already has the attribute.
		ID.ScanCurrent = false;
		else if (F->isDeclaration() \|\|
		(ID.RequiresExactDefinition && !F->hasExactDefinition())) {
		// No instructions to scan, can't handle this attribute.
		ID.ScanCurrent = false;
		ID.Valid = false;
		} else {
		ID.ScanCurrent = true;
		ScanThisFunc = true;
		}
		}

		if (!ScanThisFunc)
		continue;

// Can't remove convergent if any of our functions has a convergent call to a		// Start instruction scan.
// function not in the SCC.
for (Instruction &I : instructions(*F)) {		for (Instruction &I : instructions(*F)) {
CallSite CS(&I);		bool StillScanning = false;
// Bail if CS is a convergent call to a function not in the SCC.		for (auto &ID : InferenceDescriptors) {
if (CS && CS.isConvergent() &&		if (!ID.ScanCurrent)
		jlebarUnsubmitted Done Reply Inline Actions This is kind of confusing right above the `auto &AP =` line, because it's not really describing that line... Perhaps we could have a comment on the loop explaining at a higher level what we're doing? jlebar: This is kind of confusing right above the `auto &AP =` line, because it's not really describing…
		jlebarUnsubmitted Done Reply Inline Actions This comment may be more confusing than not at this point. jlebar: This comment may be more confusing than not at this point.
		jlebarUnsubmitted Done Reply Inline Actions s/meeting/visiting/ (idom, sorry) jlebar: s/meeting/visiting/ (idom, sorry)
SCCNodes.count(CS.getCalledFunction()) == 0)		continue;
		jlebarUnsubmitted Done Reply Inline Actions has an unsuitable definition jlebar: has an unsuitable definition

		jlebarUnsubmitted Done Reply Inline Actions Nit, remove outer parens? jlebar: Nit, remove outer parens?
		if (ID.InstrBreaksAttribute(I)) {
		ID.ScanCurrent = false;
		jlebarUnsubmitted Done Reply Inline Actions s/Perhaps,/Perhaps/ jlebar: s/Perhaps,/Perhaps/
		jlebarUnsubmitted Done Reply Inline Actions This comment is probably obvious from the line above now. jlebar: This comment is probably obvious from the line above now.
		ID.Valid = false;
		} else {
		StillScanning = true;
		jlebarUnsubmitted Done Reply Inline Actions else if (...) { } ? jlebar: ```else if (...) { }``` ?
		}
		jlebarUnsubmitted Not Done Reply Inline Actions Suggest moving the comment up one line and saying something like For each attribute still in InferInSCC that doesn't skip F, check that the instructions in F are valid. jlebar: Suggest moving the comment up one line and saying something like ```For each attribute still…
		jlebarUnsubmitted Not Done Reply Inline Actions Sorry, I think the reworded comment is still unclear. If you're willing to tell me what you don't like about the suggestion I made earlier, perhaps we can come up with something we both like. jlebar: Sorry, I think the reworded comment is still unclear. If you're willing to tell me what you…
		fedor.sergeevAuthorUnsubmitted Not Done Reply Inline Actions Mostly I didnt like "instructions are valid" part, which is not what we are checking for. If you dont like my last variant - please, suggest your own that talks about attribute, not instruction. fedor.sergeev: Mostly I didnt like "instructions are valid" part, which is not what we are checking for. If…
		jlebarUnsubmitted Not Done Reply Inline Actions What you came up with looks great to me. Thank you for being patient with me -- I know I'm not an easy reviewer. jlebar: What you came up with looks great to me. Thank you for being patient with me -- I know I'm not…
		}

		jlebarUnsubmitted Done Reply Inline Actions can't jlebar: can't
		if (!StillScanning)
		break;
		}

		for (auto &ID : InferenceDescriptors)
		if (ID.ScanCurrent) {
		ScannedSomeFunc = true;
		ID.SuccessfullyScanned = true;
		jlebarUnsubmitted Done Reply Inline Actions Remove newline? jlebar: Remove newline?
		}
		}

		// If the SCC doesn't have functions that were successfully scanned then we
		jlebarUnsubmitted Done Reply Inline Actions Perhaps this can get a "because" -- it's kind of the crux of the whole algorithm, and it's quite hidden here. jlebar: Perhaps this can get a "because" -- it's kind of the crux of the whole algorithm, and it's…
		jlebarUnsubmitted Not Done Reply Inline Actions lgtm, thanks again for being patient with me. jlebar: lgtm, thanks again for being patient with me.
		// have nothing to do.
		jlebarUnsubmitted Not Done Reply Inline Actions This comment needs to be reworded, sorry. jlebar: This comment needs to be reworded, sorry.
		if (!ScannedSomeFunc)
return false;		return false;

		// If we got here, all of the SCC's functions adhere to the attribute
		// assumptions being checked above, so we go and set all the attributes that
		// are still valid.
		for (Function *F : SCCNodes)
		for (auto &ID : InferenceDescriptors) {
		jlebarUnsubmitted Not Done Reply Inline Actions If you move this check to the top of the loop, then you don't need the check before the loop. jlebar: If you move this check to the top of the loop, then you don't need the check before the loop.
		fedor.sergeevAuthorUnsubmitted Not Done Reply Inline Actions Perhaps its a bit of over-optimization, but I dont want to touch instructions(F) (enter the instructions loop) just to determine that we need to break out. I dont feel that having these two checks is detrimental to readability. fedor.sergeev: Perhaps its a bit of over-optimization, but I dont want to touch instructions(F) (enter the…
		if (!ID.SuccessfullyScanned)
		continue;

		if (ID.SkipFunction(*F))
		continue;

		ID.SetAttribute(*F);
}		}
		return true;
}		}

// If the SCC doesn't have any convergent functions, we have nothing to do.		} // end anonymous namespace
if (!HasConvergentFn) return false;

// If we got here, all of the calls the SCC makes to functions not in the SCC		/// Helper for non-Convergent inference predicate InstrBreaksAttribute.
// are non-convergent. Therefore all of the SCC's functions can also be made		static bool InstrBreaksNonConvergent(Instruction &I,
// non-convergent. We'll remove the attr from the callsites in		const SCCNodeSet &SCCNodes) {
// InstCombineCalls.		const CallSite CS(&I);
for (Function *F : SCCNodes) {		// Breaks non-convergent assumption if CS is a convergent call to a function
if (!F->isConvergent()) continue;		// not in the SCC.
		return CS && CS.isConvergent() && SCCNodes.count(CS.getCalledFunction()) == 0;
		}

DEBUG(dbgs() << "Removing convergent attr from fn " << F->getName()		/// Helper for NoUnwind inference predicate InstrBreaksAttribute.
<< "\n");		static bool InstrBreaksNonThrowing(Instruction &I, const SCCNodeSet &SCCNodes) {
F->setNotConvergent();		if (!I.mayThrow())
		return false;
		if (const auto *CI = dyn_cast<CallInst>(&I)) {
		if (Function *Callee = CI->getCalledFunction()) {
		// I is a may-throw call to a function inside our SCC. This doesn't
		// invalidate our current working assumption that the SCC is no-throw; we
		// just have to scan that other function.
		if (SCCNodes.count(Callee) > 0)
		return false;
		}
}		}
return true;		return true;
		jlebarUnsubmitted Done Reply Inline Actions We should indicate somewhere (on the struct definition?) that if a function is skipped, it's skipped from both scanning and from having the attribute set on it. That was surprising to me. jlebar: We should indicate somewhere (on the struct definition?) that if a function is skipped, it's…
}		}

		/// Infer attributes from all functions in the SCC by scanning every
		/// instruction for compliance to the attribute assumptions. Currently it
		/// does:
		jlebarUnsubmitted Done Reply Inline Actions Perhaps instead of introducing this new "invalid" terminology, we should call it something related to one of the names in InferenceDescriptor? jlebar: Perhaps instead of introducing this new "invalid" terminology, we should call it something…
		/// - removal of Convergent attribute
		/// - addition of NoUnwind attribute
		///
		/// Returns true if any changes to function attributes were made.
		static bool inferAttrsFromFunctionBodies(const SCCNodeSet &SCCNodes) {

		AttributeInferer AI;

		// Request to remove the convergent attribute from all functions in the SCC
		// if every callsite within the SCC is not convergent (except for calls
		jlebarUnsubmitted Done Reply Inline Actions Perhaps this function would be clearer as: if (I.mayThrow()) return false; if (...) { if (SCCNodes.count(Callee)) { // I is a may-throw call to a function inside our SCC. This doesn't invalidate our working assumption that the SCC is no-throw; we have to scan that other function. return false; } } // This instruction may throw, and it's not a call to another function in our SCC. It therefore invalidates our non-throwing assumption. return true; jlebar: Perhaps this function would be clearer as: ``` if (I.mayThrow()) return false; if (...) { if…
		// to functions within the SCC).
		// Note: Removal of the attr from the callsites will happen in
		// InstCombineCalls separately.
		AI.registerAttrInference(AttributeInferer::InferenceDescriptor{
		// Skip non-convergent functions.
		[](const Function &F) { return !F.isConvergent(); },
		// Instructions that break non-convergent assumption.
		[SCCNodes](Instruction &I) {
		return InstrBreaksNonConvergent(I, SCCNodes);
		},
		[](Function &F) {
		DEBUG(dbgs() << "Removing convergent attr from fn " << F.getName()
		<< "\n");
		F.setNotConvergent();
		},
		/* RequiresExactDefinition= */ false});

		if (!DisableNoUnwindInference)
		// Request to infer nounwind attribute for all the functions in the SCC if
		// every callsite within the SCC is not throwing (except for calls to
		// functions within the SCC). Note that nounwind attribute suffers from
		// derefinement - results may change depending on how functions are
		// optimized. Thus it can be inferred only from exact definitions.
		AI.registerAttrInference(AttributeInferer::InferenceDescriptor{
		// Skip non-throwing functions.
		[](const Function &F) { return F.doesNotThrow(); },
		// Instructions that break non-throwing assumption.
		[SCCNodes](Instruction &I) {
		return InstrBreaksNonThrowing(I, SCCNodes);
		},
		[](Function &F) {
		jlebarUnsubmitted Done Reply Inline Actions Suggest `/RequiresExactDefinition=/false` this way it's not ambiguous whether we're saying that false means it does require an exact definition, or that it doesn't. jlebar: Suggest `/RequiresExactDefinition=/false` this way it's not ambiguous whether we're saying…
		DEBUG(dbgs() << "Adding nounwind attr to fn " << F.getName() << "\n");
		F.setDoesNotThrow();
		++NumNoUnwind;
		},
		/* RequiresExactDefinition= */ true});

		// Perform all the requested attribute inference actions.
		return AI.run(SCCNodes);
		}

static bool setDoesNotRecurse(Function &F) {		static bool setDoesNotRecurse(Function &F) {
if (F.doesNotRecurse())		if (F.doesNotRecurse())
return false;		return false;
F.setDoesNotRecurse();		F.setDoesNotRecurse();
++NumNoRecurse;		++NumNoRecurse;
		jlebarUnsubmitted Done Reply Inline Actions s/Note, that/Note that/ jlebar: s/Note, that/Note that/
return true;		return true;
}		}

static bool addNoRecurseAttrs(const SCCNodeSet &SCCNodes) {		static bool addNoRecurseAttrs(const SCCNodeSet &SCCNodes) {
// Try and identify functions that do not recurse.		// Try and identify functions that do not recurse.

// If the SCC contains multiple nodes we know for sure there is recursion.		// If the SCC contains multiple nodes we know for sure there is recursion.
if (SCCNodes.size() != 1)		if (SCCNodes.size() != 1)
return false;		return false;

Function F = SCCNodes.begin();		Function F = SCCNodes.begin();
if (!F \|\| F->isDeclaration() \|\| F->doesNotRecurse())		if (!F \|\| F->isDeclaration() \|\| F->doesNotRecurse())
return false;		return false;

// If all of the calls in F are identifiable and are to norecurse functions, F		// If all of the calls in F are identifiable and are to norecurse functions, F
		jlebarUnsubmitted Done Reply Inline Actions Please re-clang-format jlebar: Please re-clang-format
		fedor.sergeevAuthorUnsubmitted Done Reply Inline Actions Err... clang-format behaves badly on this :( It formats two calls of registerAttrInference differently. So I took one format as is and hand-edited another invocation correspondingly. Will repeat the same before integration as well. fedor.sergeev: Err... clang-format behaves badly on this :( It formats two calls of registerAttrInference…
		jlebarUnsubmitted Done Reply Inline Actions So I took one format as is and hand-edited another invocation correspondingly. That's fine with me, so long as we don't expect that future modifications to this part of the file will preserve the format. People should be able to clang-format their changes and expect it's "good enough". jlebar: > So I took one format as is and hand-edited another invocation correspondingly. That's fine…
		fedor.sergeevAuthorUnsubmitted Done Reply Inline Actions Most recent version is clang-formatted. This particular problem was solved by reordering arguments of InferenceDescriptor constructor. fedor.sergeev: Most recent version is clang-formatted. This particular problem was solved by reordering…
// is norecurse. This check also detects self-recursion as F is not currently		// is norecurse. This check also detects self-recursion as F is not currently
// marked norecurse, so any called from F to F will not be marked norecurse.		// marked norecurse, so any called from F to F will not be marked norecurse.
for (Instruction &I : instructions(*F))		for (Instruction &I : instructions(*F))
if (auto CS = CallSite(&I)) {		if (auto CS = CallSite(&I)) {
Function *Callee = CS.getCalledFunction();		Function *Callee = CS.getCalledFunction();
if (!Callee \|\| Callee == F \|\| !Callee->doesNotRecurse())		if (!Callee \|\| Callee == F \|\| !Callee->doesNotRecurse())
// Function calls a potentially recursive function.		// Function calls a potentially recursive function.
return false;		return false;
▲ Show 20 Lines • Show All 52 Lines • ▼ Show 20 Lines	PreservedAnalyses PostOrderFunctionAttrsPass::run(LazyCallGraph::SCC &C,
Changed \|= addReadAttrs(SCCNodes, AARGetter);		Changed \|= addReadAttrs(SCCNodes, AARGetter);
Changed \|= addArgumentAttrs(SCCNodes);		Changed \|= addArgumentAttrs(SCCNodes);

// If we have no external nodes participating in the SCC, we can deduce some		// If we have no external nodes participating in the SCC, we can deduce some
// more precise attributes as well.		// more precise attributes as well.
if (!HasUnknownCall) {		if (!HasUnknownCall) {
Changed \|= addNoAliasAttrs(SCCNodes);		Changed \|= addNoAliasAttrs(SCCNodes);
Changed \|= addNonNullAttrs(SCCNodes);		Changed \|= addNonNullAttrs(SCCNodes);
Changed \|= removeConvergentAttrs(SCCNodes);		Changed \|= inferAttrsFromFunctionBodies(SCCNodes);
Changed \|= addNoRecurseAttrs(SCCNodes);		Changed \|= addNoRecurseAttrs(SCCNodes);
}		}

return Changed ? PreservedAnalyses::none() : PreservedAnalyses::all();		return Changed ? PreservedAnalyses::none() : PreservedAnalyses::all();
}		}

namespace {		namespace {

▲ Show 20 Lines • Show All 61 Lines • ▼ Show 20 Lines	static bool runImpl(CallGraphSCC &SCC, AARGetterT AARGetter) {
Changed \|= addReadAttrs(SCCNodes, AARGetter);		Changed \|= addReadAttrs(SCCNodes, AARGetter);
Changed \|= addArgumentAttrs(SCCNodes);		Changed \|= addArgumentAttrs(SCCNodes);

// If we have no external nodes participating in the SCC, we can deduce some		// If we have no external nodes participating in the SCC, we can deduce some
// more precise attributes as well.		// more precise attributes as well.
if (!ExternalNode) {		if (!ExternalNode) {
Changed \|= addNoAliasAttrs(SCCNodes);		Changed \|= addNoAliasAttrs(SCCNodes);
Changed \|= addNonNullAttrs(SCCNodes);		Changed \|= addNonNullAttrs(SCCNodes);
Changed \|= removeConvergentAttrs(SCCNodes);		Changed \|= inferAttrsFromFunctionBodies(SCCNodes);
Changed \|= addNoRecurseAttrs(SCCNodes);		Changed \|= addNoRecurseAttrs(SCCNodes);
}		}

return Changed;		return Changed;
}		}

bool PostOrderFunctionAttrsLegacyPass::runOnSCC(CallGraphSCC &SCC) {		bool PostOrderFunctionAttrsLegacyPass::runOnSCC(CallGraphSCC &SCC) {
if (skipSCC(SCC))		if (skipSCC(SCC))
▲ Show 20 Lines • Show All 113 Lines • Show Last 20 Lines

test/Other/cgscc-devirt-iteration.ll

; The CGSCC pass manager includes an SCC iteration utility that tracks indirect		; The CGSCC pass manager includes an SCC iteration utility that tracks indirect
; calls that are turned into direct calls (devirtualization) and re-visits the		; calls that are turned into direct calls (devirtualization) and re-visits the
; SCC to expose those calls to the SCC-based IPO passes. We trigger		; SCC to expose those calls to the SCC-based IPO passes. We trigger
; devirtualization here with GVN which forwards a store through a load and to		; devirtualization here with GVN which forwards a store through a load and to
; an indirect call.		; an indirect call.
;		;
; RUN: opt -aa-pipeline=basic-aa -passes='cgscc(function-attrs,function(gvn,instcombine))' -S < %s \| FileCheck %s --check-prefix=CHECK --check-prefix=BEFORE		; RUN: opt -aa-pipeline=basic-aa -passes='cgscc(function-attrs,function(gvn,instcombine))' -S < %s \| FileCheck %s --check-prefix=CHECK --check-prefix=BEFORE
; RUN: opt -aa-pipeline=basic-aa -passes='cgscc(devirt<1>(function-attrs,function(gvn,instcombine)))' -S < %s \| FileCheck %s --check-prefix=CHECK --check-prefix=AFTER --check-prefix=AFTER1		; RUN: opt -aa-pipeline=basic-aa -passes='cgscc(devirt<1>(function-attrs,function(gvn,instcombine)))' -S < %s \| FileCheck %s --check-prefix=CHECK --check-prefix=AFTER --check-prefix=AFTER1
; RUN: opt -aa-pipeline=basic-aa -passes='cgscc(devirt<2>(function-attrs,function(gvn,instcombine)))' -S < %s \| FileCheck %s --check-prefix=CHECK --check-prefix=AFTER --check-prefix=AFTER2		; RUN: opt -aa-pipeline=basic-aa -passes='cgscc(devirt<2>(function-attrs,function(gvn,instcombine)))' -S < %s \| FileCheck %s --check-prefix=CHECK --check-prefix=AFTER --check-prefix=AFTER2
;		;
; We also verify that the real O2 pipeline catches these cases.		; We also verify that the real O2 pipeline catches these cases.
; RUN: opt -aa-pipeline=basic-aa -passes='default<O2>' -S < %s \| FileCheck %s --check-prefix=CHECK --check-prefix=AFTER --check-prefix=AFTER2		; RUN: opt -aa-pipeline=basic-aa -passes='default<O2>' -S < %s \| FileCheck %s --check-prefix=CHECK --check-prefix=AFTER --check-prefix=AFTER2

declare void @readnone() readnone		declare void @readnone() readnone
; CHECK: Function Attrs: readnone		; CHECK: Function Attrs: readnone
; CHECK: declare void @readnone()		; CHECK-NEXT: declare void @readnone()

declare void @unknown()		declare void @unknown()
; CHECK-NOT: Function Attrs		; CHECK-NOT: Function Attrs
; CHECK: declare void @unknown()		; CHECK-LABEL: declare void @unknown(){{ *$}}

; The @test1 function checks that when we refine an indirect call to a direct		; The @test1 function checks that when we refine an indirect call to a direct
; call we revisit the SCC passes to reflect the more precise information. This		; call we revisit the SCC passes to reflect the more precise information. This
; is the basic functionality.		; is the basic functionality.

define void @test1() {		define void @test1() {
; BEFORE-NOT: Function Attrs		; BEFORE-NOT: Function Attrs
; AFTER: Function Attrs: readnone		; AFTER: Function Attrs: readnone
; CHECK: define void @test1()		; CHECK-LABEL: define void @test1()
entry:		entry:
%fptr = alloca void ()*		%fptr = alloca void ()*
store void ()* @readnone, void ()** %fptr		store void ()* @readnone, void ()** %fptr
%f = load void (), void ()* %fptr		%f = load void (), void ()* %fptr
call void %f()		call void %f()
ret void		ret void
}		}

; The @test2_* functions check that when we need multiple (in this case 2)		; The @test2_* functions check that when we need multiple (in this case 2)
; repetitions to compute some state that is incrementally exposed with each		; repetitions to compute some state that is incrementally exposed with each
; one, the limit on repetitions is enforced. So we make progress with		; one, the limit on repetitions is enforced. So we make progress with
; one repetition but not as much as with three.		; one repetition but not as much as with three.
;		;
; This is somewhat awkward to test because we have to contrive to have a state		; This is somewhat awkward to test because we have to contrive to have a state
; repetition triggered and observed with very few passes. The technique here		; repetition triggered and observed with very few passes. The technique here
; is to have one indirect call that can only be resolved when the entire SCC is		; is to have one indirect call that can only be resolved when the entire SCC is
; deduced as readonly, and mark that indirect call at the call site as readonly		; deduced as readonly, and mark that indirect call at the call site as readonly
; to make that possible. This forces us to first deduce readonly, then		; to make that possible. This forces us to first deduce readonly, then
; devirtualize again, and then deduce readnone.		; devirtualize again, and then deduce readnone.

declare void @readnone_with_arg(void ()**) readnone		declare void @readnone_with_arg(void ()**) readnone
; CHECK: Function Attrs: readnone		; CHECK: Function Attrs: readnone
; CHECK: declare void @readnone_with_arg(void ()**)		; CHECK-LABEL: declare void @readnone_with_arg(void ()**)

define void @test2_a(void ()** %ignore) {		define void @test2_a(void ()** %ignore) {
; BEFORE-NOT: Function Attrs		; BEFORE-NOT: Function Attrs
; AFTER1: Function Attrs: readonly		; AFTER1: Function Attrs: readonly
; AFTER2: Function Attrs: readnone		; AFTER2: Function Attrs: readnone
; BEFORE: define void @test2_a(void ()** %ignore)		; BEFORE: define void @test2_a(void ()** %ignore)
; AFTER: define void @test2_a(void ()** readnone %ignore)		; AFTER: define void @test2_a(void ()** readnone %ignore)
entry:		entry:
Show All 10 Lines	; CHECK: call void @readnone_with_arg(void ()** %ignore)

ret void		ret void
}		}

define void @test2_b() {		define void @test2_b() {
; BEFORE-NOT: Function Attrs		; BEFORE-NOT: Function Attrs
; AFTER1: Function Attrs: readonly		; AFTER1: Function Attrs: readonly
; AFTER2: Function Attrs: readnone		; AFTER2: Function Attrs: readnone
; CHECK: define void @test2_b()		; CHECK-LABEL: define void @test2_b()
entry:		entry:
%f2ptr = alloca void ()*		%f2ptr = alloca void ()*
store void ()* @readnone, void ()** %f2ptr		store void ()* @readnone, void ()** %f2ptr
; Call the other function here to prevent forwarding until the SCC has had		; Call the other function here to prevent forwarding until the SCC has had
; function attrs deduced.		; function attrs deduced.
call void @test2_a(void ()** %f2ptr)		call void @test2_a(void ()** %f2ptr)

%f2 = load void (), void ()* %f2ptr		%f2 = load void (), void ()* %f2ptr
; This is the second indirect call to be resolved, and can only be resolved		; This is the second indirect call to be resolved, and can only be resolved
; after we deduce 'readonly' for the rest of the SCC. Once it is		; after we deduce 'readonly' for the rest of the SCC. Once it is
; devirtualized, we can deduce readnone for the SCC.		; devirtualized, we can deduce readnone for the SCC.
call void %f2() readonly		call void %f2() readonly
; BEFORE: call void %f2()		; BEFORE: call void %f2()
; AFTER: call void @readnone()		; AFTER: call void @readnone()

ret void		ret void
}		}

declare i8* @memcpy(i8, i8, i64)		declare i8* @memcpy(i8, i8, i64)
; CHECK: declare i8* @memcpy(		; CHECK-LABEL: declare i8* @memcpy(

; The @test3 function checks that when we refine an indirect call to an		; The @test3 function checks that when we refine an indirect call to an
; intrinsic we still revisit the SCC pass. This also covers cases where the		; intrinsic we still revisit the SCC pass. This also covers cases where the
; value handle itself doesn't persist due to the nature of how instcombine		; value handle itself doesn't persist due to the nature of how instcombine
; creates the memcpy intrinsic call, and we rely on the count of indirect calls		; creates the memcpy intrinsic call, and we rely on the count of indirect calls
; decreasing and the count of direct calls increasing.		; decreasing and the count of direct calls increasing.
define void @test3(i8* %src, i8* %dest, i64 %size) {		; Adding 'noinline' attribute to force attributes for improved matching.
; CHECK-NOT: Function Attrs		define void @test3(i8* %src, i8* %dest, i64 %size) noinline {
; BEFORE: define void @test3(i8* %src, i8* %dest, i64 %size)		; CHECK: Function Attrs
; AFTER: define void @test3(i8* nocapture readonly %src, i8* nocapture %dest, i64 %size)		; CHECK-NOT: read
		; CHECK-SAME: noinline
		; BEFORE-LABEL: define void @test3(i8* %src, i8* %dest, i64 %size)
		; AFTER-LABEL: define void @test3(i8* nocapture readonly %src, i8* nocapture %dest, i64 %size)
%fptr = alloca i8* (i8, i8, i64)*		%fptr = alloca i8* (i8, i8, i64)*
store i8* (i8, i8, i64)* @memcpy, i8* (i8, i8, i64)** %fptr		store i8* (i8, i8, i64)* @memcpy, i8* (i8, i8, i64)** %fptr
%f = load i8* (i8, i8, i64), i8 (i8, i8, i64)** %fptr		%f = load i8* (i8, i8, i64), i8 (i8, i8, i64)** %fptr
call i8* %f(i8* %dest, i8* %src, i64 %size)		call i8* %f(i8* %dest, i8* %src, i64 %size)
; CHECK: call void @llvm.memcpy		; CHECK: call void @llvm.memcpy
ret void		ret void
}		}

; A boring function that just keeps our declarations around.		; A boring function that just keeps our declarations around.
define void @keep(i8** %sink) {		define void @keep(i8** %sink) {
; CHECK-NOT: Function Attrs		; CHECK-NOT: Function Attrs
; CHECK: define void @keep(		; CHECK-LABEL: define void @keep(
entry:		entry:
store volatile i8* bitcast (void ()* @readnone to i8), i8* %sink		store volatile i8* bitcast (void ()* @readnone to i8), i8* %sink
store volatile i8* bitcast (void ()* @unknown to i8), i8* %sink		store volatile i8* bitcast (void ()* @unknown to i8), i8* %sink
store volatile i8* bitcast (i8* (i8, i8, i64)* @memcpy to i8), i8* %sink		store volatile i8* bitcast (i8* (i8, i8, i64)* @memcpy to i8), i8* %sink
call void @unknown()		call void @unknown()
ret void		ret void
}		}

test/Transforms/FunctionAttrs/2008-09-03-Mutual.ll

	; RUN: opt < %s -functionattrs -S \| grep readnone			; RUN: opt < %s -functionattrs -S \| FileCheck %s
				; RUN: opt < %s -passes=function-attrs -S \| FileCheck %s

				; CHECK: Function Attrs
				; CHECK-SAME: readnone
				; CHECK-NEXT: define i32 @a
	define i32 @a() {			define i32 @a() {
	%tmp = call i32 @b( ) ; <i32> [#uses=1]			%tmp = call i32 @b( ) ; <i32> [#uses=1]
	ret i32 %tmp			ret i32 %tmp
	}			}

				; CHECK: Function Attrs
				; CHECK-SAME: readnone
				; CHECK-NEXT: define i32 @b
	define i32 @b() {			define i32 @b() {
	%tmp = call i32 @a( ) ; <i32> [#uses=1]			%tmp = call i32 @a( ) ; <i32> [#uses=1]
	ret i32 %tmp			ret i32 %tmp
	}			}

test/Transforms/FunctionAttrs/2008-09-03-ReadNone.ll

	; RUN: opt < %s -basicaa -functionattrs -S \| FileCheck %s			; RUN: opt < %s -basicaa -functionattrs -S \| FileCheck %s
				; RUN: opt < %s -aa-pipeline=basic-aa -passes=function-attrs -S \| FileCheck %s

	@x = global i32 0			@x = global i32 0

	; CHECK: declare i32 @e() #0			; CHECK: Function Attrs
				; CHECK-SAME: readnone
				; CHECK-NEXT: declare i32 @e
	declare i32 @e() readnone			declare i32 @e() readnone

	; CHECK: define i32 @f() #0			; CHECK: Function Attrs
				; CHECK-SAME: readnone
				; CHECK-NEXT: define i32 @f
	define i32 @f() {			define i32 @f() {
	%tmp = call i32 @e( ) ; <i32> [#uses=1]			%tmp = call i32 @e( ) ; <i32> [#uses=1]
	ret i32 %tmp			ret i32 %tmp
	}			}

	; CHECK: define i32 @g() #1			; CHECK: Function Attrs
				; CHECK-SAME: readnone
				; CHECK-NEXT: define i32 @g
	define i32 @g() readonly {			define i32 @g() readonly {
	ret i32 0			ret i32 0
	}			}

	; CHECK: define i32 @h() #1			; CHECK: Function Attrs
				; CHECK-SAME: readnone
				; CHECK-NEXT: define i32 @h
	define i32 @h() readnone {			define i32 @h() readnone {
	%tmp = load i32, i32* @x ; <i32> [#uses=1]			%tmp = load i32, i32* @x ; <i32> [#uses=1]
	ret i32 %tmp			ret i32 %tmp
	}			}

	; CHECK: attributes #0 = { readnone }
	; CHECK: attributes #1 = { norecurse readnone }

test/Transforms/FunctionAttrs/2008-09-03-ReadOnly.ll

	; RUN: opt < %s -basicaa -functionattrs -S \| FileCheck %s			; RUN: opt < %s -basicaa -functionattrs -S \| FileCheck %s
				; RUN: opt < %s -aa-pipeline=basic-aa -passes=function-attrs -S \| FileCheck %s

	; CHECK: define i32 @f() #0			; CHECK: define i32 @f() #0
	define i32 @f() {			define i32 @f() {
	entry:			entry:
	%tmp = call i32 @e( )			%tmp = call i32 @e( )
	ret i32 %tmp			ret i32 %tmp
	}			}

	; CHECK: declare i32 @e() #0			; CHECK: declare i32 @e() #0
	declare i32 @e() readonly			declare i32 @e() readonly

	; CHECK: attributes #0 = { readonly }			; CHECK: attributes #0 = { readonly }

test/Transforms/FunctionAttrs/2008-09-13-VolatileRead.ll

	; RUN: opt < %s -functionattrs -S \| not grep read			; RUN: opt < %s -functionattrs -S \| FileCheck %s
				; RUN: opt < %s -passes=function-attrs -S \| FileCheck %s
	; PR2792			; PR2792

	@g = global i32 0 ; <i32*> [#uses=1]			@g = global i32 0 ; <i32*> [#uses=1]

	define i32 @f() {			define i32 @f() {
	%t = load volatile i32, i32* @g ; <i32> [#uses=1]			%t = load volatile i32, i32* @g ; <i32> [#uses=1]
	ret i32 %t			ret i32 %t
	}			}

				; CHECK-NOT: attributes #{{.*}} read

test/Transforms/FunctionAttrs/2008-12-29-Constant.ll

	; RUN: opt < %s -basicaa -functionattrs -S \| grep readnone			; RUN: opt < %s -basicaa -functionattrs -S \| FileCheck %s
				; RUN: opt < %s -aa-pipeline=basic-aa -passes=function-attrs -S \| FileCheck %s

	@s = external constant i8 ; <i8*> [#uses=1]			@s = external constant i8 ; <i8*> [#uses=1]

				; CHECK: define i8 @f() #0
	define i8 @f() {			define i8 @f() {
	%tmp = load i8, i8* @s ; <i8> [#uses=1]			%tmp = load i8, i8* @s ; <i8> [#uses=1]
	ret i8 %tmp			ret i8 %tmp
	}			}

				; CHECK: attributes #0 = { {{.*}} readnone

test/Transforms/FunctionAttrs/2009-01-02-LocalStores.ll

	; RUN: opt < %s -functionattrs -S \| FileCheck %s			; RUN: opt < %s -functionattrs -S \| FileCheck %s
				; RUN: opt < %s -passes=function-attrs -S \| FileCheck %s

	; CHECK: define i32* @a(i32** nocapture readonly %p)			; CHECK: define i32* @a(i32** nocapture readonly %p)
	define i32* @a(i32** %p) {			define i32* @a(i32** %p) {
	%tmp = load i32, i32* %p			%tmp = load i32, i32* %p
	ret i32* %tmp			ret i32* %tmp
	}			}

	; CHECK: define i32* @b(i32* %q)			; CHECK: define i32* @b(i32* %q)
	Show All 14 Lines

test/Transforms/FunctionAttrs/2010-10-30-volatile.ll

	; RUN: opt < %s -functionattrs -S \| FileCheck %s			; RUN: opt < %s -functionattrs -S \| FileCheck %s
				; RUN: opt < %s -passes=function-attrs -S \| FileCheck %s
	; PR8279			; PR8279

	@g = constant i32 1			@g = constant i32 1

				; CHECK: Function Attrs
				; CHECK-SAME: norecurse
				; CHECK-NOT: readonly
				; CHECK-NEXT: void @foo()
	define void @foo() {			define void @foo() {
	; CHECK: void @foo() #0 {
	%tmp = load volatile i32, i32* @g			%tmp = load volatile i32, i32* @g
	ret void			ret void
	}			}

	; CHECK: attributes #0 = { norecurse }

test/Transforms/FunctionAttrs/assume.ll

	; RUN: opt -S -o - -functionattrs %s \| FileCheck %s			; RUN: opt -S -o - -functionattrs %s \| FileCheck %s
				; RUN: opt -S -o - -passes=function-attrs %s \| FileCheck %s

	; CHECK-NOT: readnone			; CHECK-NOT: readnone
	declare void @llvm.assume(i1)			declare void @llvm.assume(i1)

test/Transforms/FunctionAttrs/atomic.ll

	; RUN: opt -basicaa -functionattrs -S < %s \| FileCheck %s			; RUN: opt -basicaa -functionattrs -S < %s \| FileCheck %s
				; RUN: opt -aa-pipeline=basic-aa -passes=function-attrs -S < %s \| FileCheck %s

	; Atomic load/store to local doesn't affect whether a function is			; Atomic load/store to local doesn't affect whether a function is
	; readnone/readonly.			; readnone/readonly.
	define i32 @test1(i32 %x) uwtable ssp {			define i32 @test1(i32 %x) uwtable ssp {
	; CHECK: define i32 @test1(i32 %x) #0 {			; CHECK: define i32 @test1(i32 %x) #0 {
	entry:			entry:
	%x.addr = alloca i32, align 4			%x.addr = alloca i32, align 4
	store atomic i32 %x, i32* %x.addr seq_cst, align 4			store atomic i32 %x, i32* %x.addr seq_cst, align 4
	%r = load atomic i32, i32* %x.addr seq_cst, align 4			%r = load atomic i32, i32* %x.addr seq_cst, align 4
	ret i32 %r			ret i32 %r
	}			}

	; A function with an Acquire load is not readonly.			; A function with an Acquire load is not readonly.
	define i32 @test2(i32* %x) uwtable ssp {			define i32 @test2(i32* %x) uwtable ssp {
	; CHECK: define i32 @test2(i32* nocapture readonly %x) #1 {			; CHECK: define i32 @test2(i32* nocapture readonly %x) #1 {
	entry:			entry:
	%r = load atomic i32, i32* %x seq_cst, align 4			%r = load atomic i32, i32* %x seq_cst, align 4
	ret i32 %r			ret i32 %r
	}			}

	; CHECK: attributes #0 = { norecurse readnone ssp uwtable }			; CHECK: attributes #0 = { norecurse nounwind readnone ssp uwtable }
	; CHECK: attributes #1 = { norecurse ssp uwtable }			; CHECK: attributes #1 = { norecurse nounwind ssp uwtable }

test/Transforms/FunctionAttrs/comdat-ipo.ll

	; RUN: opt < %s -functionattrs -S \| FileCheck %s			; RUN: opt < %s -functionattrs -S \| FileCheck %s
				; RUN: opt < %s -passes=function-attrs -S \| FileCheck %s

	; See PR26774			; See PR26774

	; CHECK-LABEL: define void @bar(i8* readonly) {			; CHECK-LABEL: define void @bar(i8* readonly) {
	define void @bar(i8* readonly) {			define void @bar(i8* readonly) {
	call void @foo(i8* %0)			call void @foo(i8* %0)
	ret void			ret void
	}			}


	; CHECK-LABEL: define linkonce_odr void @foo(i8* readonly) {			; CHECK-LABEL: define linkonce_odr void @foo(i8* readonly) {
	define linkonce_odr void @foo(i8* readonly) {			define linkonce_odr void @foo(i8* readonly) {
	call void @bar(i8* %0)			call void @bar(i8* %0)
	ret void			ret void
	}			}

test/Transforms/FunctionAttrs/convergent.ll

	; RUN: opt -functionattrs -S < %s \| FileCheck %s			; FIXME: convert CHECK-INDIRECT into CHECK (and remove -check-prefixes) as soon
				; FIXME: as new-pass-manager's handling of indirect_non_convergent_call is fixed
				;
				; RUN: opt -functionattrs -S < %s \| FileCheck %s --check-prefixes=CHECK,CHECK-INDIRECT
				; RUN: opt -passes=function-attrs -S < %s \| FileCheck %s

	; CHECK: Function Attrs			; CHECK: Function Attrs
	; CHECK-NOT: convergent			; CHECK-NOT: convergent
	; CHECK-NEXT: define i32 @nonleaf()			; CHECK-NEXT: define i32 @nonleaf()
	define i32 @nonleaf() convergent {			define i32 @nonleaf() convergent {
	%a = call i32 @leaf()			%a = call i32 @leaf()
	ret i32 %a			ret i32 %a
	}			}
	Show All 35 Lines
	define i32 @indirect_convergent_call(i32 ()* %f) convergent {			define i32 @indirect_convergent_call(i32 ()* %f) convergent {
	%a = call i32 %f() convergent			%a = call i32 %f() convergent
	ret i32 %a			ret i32 %a
	}			}
	; Give indirect_non_convergent_call the norecurse attribute so we get a			; Give indirect_non_convergent_call the norecurse attribute so we get a
	; "Function Attrs" comment in the output.			; "Function Attrs" comment in the output.
	;			;
	; CHECK: Function Attrs			; CHECK: Function Attrs
	; CHECK-NOT: convergent			; CHECK-INDIRECT-NOT: convergent
	; CHECK-NEXT: define i32 @indirect_non_convergent_call(			; CHECK-INDIRECT-NEXT: define i32 @indirect_non_convergent_call(
	define i32 @indirect_non_convergent_call(i32 ()* %f) convergent norecurse {			define i32 @indirect_non_convergent_call(i32 ()* %f) convergent norecurse {
	%a = call i32 %f()			%a = call i32 %f()
	ret i32 %a			ret i32 %a
	}			}

	; CHECK: Function Attrs			; CHECK: Function Attrs
	; CHECK-SAME: convergent			; CHECK-SAME: convergent
	; CHECK-NEXT: declare void @llvm.nvvm.barrier0()			; CHECK-NEXT: declare void @llvm.nvvm.barrier0()
	▲ Show 20 Lines • Show All 44 Lines • Show Last 20 Lines

test/Transforms/FunctionAttrs/int_sideeffect.ll

	; RUN: opt -S < %s -functionattrs \| FileCheck %s			; RUN: opt -S < %s -functionattrs \| FileCheck %s
				; RUN: opt -S < %s -passes=function-attrs \| FileCheck %s

				; CHECK: Function Attrs
				; CHECK-SAME: inaccessiblememonly
				; CHECK-NEXT: declare void @llvm.sideeffect()
	declare void @llvm.sideeffect()			declare void @llvm.sideeffect()

	; Don't add readnone or similar attributes when an @llvm.sideeffect() intrinsic			; Don't add readnone or similar attributes when an @llvm.sideeffect() intrinsic
	; is present.			; is present.

	; CHECK: define void @test() {			; CHECK: Function Attrs
				; CHECK-NOT: readnone
				; CHECK: define void @test()
	define void @test() {			define void @test() {
	call void @llvm.sideeffect()			call void @llvm.sideeffect()
	ret void			ret void
	}			}

	; CHECK: define void @loop() {			; CHECK: Function Attrs
				; CHECK-NOT: readnone
				; CHECK: define void @loop()
	define void @loop() {			define void @loop() {
	br label %loop			br label %loop

	loop:			loop:
	call void @llvm.sideeffect()			call void @llvm.sideeffect()
	br label %loop			br label %loop
	}			}

test/Transforms/FunctionAttrs/nocapture.ll

	; RUN: opt < %s -functionattrs -S \| FileCheck %s			; RUN: opt < %s -functionattrs -S \| FileCheck %s
				; RUN: opt < %s -passes=function-attrs -S \| FileCheck %s

	@g = global i32* null ; <i32**> [#uses=1]			@g = global i32* null ; <i32**> [#uses=1]

	; CHECK: define i32* @c1(i32* readnone returned %q)			; CHECK: define i32* @c1(i32* readnone returned %q)
	define i32* @c1(i32* %q) {			define i32* @c1(i32* %q) {
	ret i32* %q			ret i32* %q
	}			}

	; CHECK: define void @c2(i32* %q)			; CHECK: define void @c2(i32* %q)
	▲ Show 20 Lines • Show All 211 Lines • Show Last 20 Lines

test/Transforms/FunctionAttrs/nonnull-global.ll

	; RUN: opt -S -functionattrs %s \| FileCheck %s			; RUN: opt -S -functionattrs %s \| FileCheck %s
				; RUN: opt -S -passes=function-attrs %s \| FileCheck %s

	@a = external global i8, !absolute_symbol !0			@a = external global i8, !absolute_symbol !0

	; CHECK-NOT: define nonnull			; CHECK-NOT: define nonnull
	define i8* @foo() {			define i8* @foo() {
	ret i8* @a			ret i8* @a
	}			}

	!0 = !{i64 0, i64 256}			!0 = !{i64 0, i64 256}

test/Transforms/FunctionAttrs/nonnull.ll

	; RUN: opt -S -functionattrs -enable-nonnull-arg-prop %s \| FileCheck %s			; RUN: opt -S -functionattrs -enable-nonnull-arg-prop %s \| FileCheck %s
				; RUN: opt -S -passes=function-attrs -enable-nonnull-arg-prop %s \| FileCheck %s

	declare nonnull i8* @ret_nonnull()			declare nonnull i8* @ret_nonnull()

	; Return a pointer trivially nonnull (call return attribute)			; Return a pointer trivially nonnull (call return attribute)
	define i8* @test1() {			define i8* @test1() {
	; CHECK: define nonnull i8* @test1			; CHECK: define nonnull i8* @test1
	%ret = call i8* @ret_nonnull()			%ret = call i8* @ret_nonnull()
	ret i8* %ret			ret i8* %ret
	}			}
	▲ Show 20 Lines • Show All 220 Lines • Show Last 20 Lines

test/Transforms/FunctionAttrs/norecurse.ll

	; RUN: opt < %s -basicaa -functionattrs -rpo-functionattrs -S \| FileCheck %s			; RUN: opt < %s -basicaa -functionattrs -rpo-functionattrs -S \| FileCheck %s
	; RUN: opt < %s -aa-pipeline=basic-aa -passes='cgscc(function-attrs),rpo-functionattrs' -S \| FileCheck %s			; RUN: opt < %s -aa-pipeline=basic-aa -passes='cgscc(function-attrs),rpo-functionattrs' -S \| FileCheck %s

	; CHECK: define i32 @leaf() #0			; CHECK: Function Attrs
				; CHECK-SAME: norecurse nounwind readnone
				; CHECK-NEXT: define i32 @leaf()
	define i32 @leaf() {			define i32 @leaf() {
	ret i32 1			ret i32 1
	}			}

	; CHECK: define i32 @self_rec() #1			; CHECK: Function Attrs
				; CHECK-SAME: readnone
				; CHECK-NOT: norecurse
				; CHECK-NEXT: define i32 @self_rec()
	define i32 @self_rec() {			define i32 @self_rec() {
	%a = call i32 @self_rec()			%a = call i32 @self_rec()
	ret i32 4			ret i32 4
	}			}

	; CHECK: define i32 @indirect_rec() #1			; CHECK: Function Attrs
				; CHECK-SAME: readnone
				; CHECK-NOT: norecurse
				; CHECK-NEXT: define i32 @indirect_rec()
	define i32 @indirect_rec() {			define i32 @indirect_rec() {
	%a = call i32 @indirect_rec2()			%a = call i32 @indirect_rec2()
	ret i32 %a			ret i32 %a
	}			}
	; CHECK: define i32 @indirect_rec2() #1			; CHECK: Function Attrs
				; CHECK-SAME: readnone
				; CHECK-NOT: norecurse
				; CHECK-NEXT: define i32 @indirect_rec2()
	define i32 @indirect_rec2() {			define i32 @indirect_rec2() {
	%a = call i32 @indirect_rec()			%a = call i32 @indirect_rec()
	ret i32 %a			ret i32 %a
	}			}

	; CHECK: define i32 @extern() #1			; CHECK: Function Attrs
				; CHECK-SAME: readnone
				; CHECK-NOT: norecurse
				; CHECK-NEXT: define i32 @extern()
	define i32 @extern() {			define i32 @extern() {
	%a = call i32 @k()			%a = call i32 @k()
	ret i32 %a			ret i32 %a
	}			}

				; CHECK: Function Attrs
				; CHECK-NEXT: declare i32 @k()
	declare i32 @k() readnone			declare i32 @k() readnone

	; CHECK: define void @intrinsic(i8* nocapture %dest, i8* nocapture readonly %src, i32 %len) {			; CHECK: Function Attrs
				; CHECK-SAME: nounwind
				; CHECK-NOT: norecurse
				; CHECK-NEXT: define void @intrinsic(i8* nocapture %dest, i8* nocapture readonly %src, i32 %len)
	define void @intrinsic(i8* %dest, i8* %src, i32 %len) {			define void @intrinsic(i8* %dest, i8* %src, i32 %len) {
	call void @llvm.memcpy.p0i8.p0i8.i32(i8* %dest, i8* %src, i32 %len, i1 false)			call void @llvm.memcpy.p0i8.p0i8.i32(i8* %dest, i8* %src, i32 %len, i1 false)
	ret void			ret void
	}			}

				; CHECK: Function Attrs
				; CHECK-NEXT: declare void @llvm.memcpy.p0i8.p0i8.i32
	declare void @llvm.memcpy.p0i8.p0i8.i32(i8, i8, i32, i1)			declare void @llvm.memcpy.p0i8.p0i8.i32(i8, i8, i32, i1)

	; CHECK: define internal i32 @called_by_norecurse() #0			; CHECK: Function Attrs
				; CHECK-SAME: norecurse readnone
				; CHECK-NEXT: define internal i32 @called_by_norecurse()
	define internal i32 @called_by_norecurse() {			define internal i32 @called_by_norecurse() {
	%a = call i32 @k()			%a = call i32 @k()
	ret i32 %a			ret i32 %a
	}			}
				; CHECK: Function Attrs
				; CHECK-NEXT: define void @m()
	define void @m() norecurse {			define void @m() norecurse {
	%a = call i32 @called_by_norecurse()			%a = call i32 @called_by_norecurse()
	ret void			ret void
	}			}

	; CHECK: define internal i32 @called_by_norecurse_indirectly() #0			; CHECK: Function Attrs
				; CHECK-SAME: norecurse readnone
				; CHECK-NEXT: define internal i32 @called_by_norecurse_indirectly()
	define internal i32 @called_by_norecurse_indirectly() {			define internal i32 @called_by_norecurse_indirectly() {
	%a = call i32 @k()			%a = call i32 @k()
	ret i32 %a			ret i32 %a
	}			}
	define internal void @o() {			define internal void @o() {
	%a = call i32 @called_by_norecurse_indirectly()			%a = call i32 @called_by_norecurse_indirectly()
	ret void			ret void
	}			}
	define void @p() norecurse {			define void @p() norecurse {
	call void @o()			call void @o()
	ret void			ret void
	}			}

	; CHECK: attributes #0 = { norecurse readnone }
	; CHECK: attributes #1 = { readnone }

test/Transforms/FunctionAttrs/operand-bundles-scc.ll

	; RUN: opt -S -functionattrs < %s \| FileCheck %s			; RUN: opt -S -functionattrs < %s \| FileCheck %s
				; RUN: opt -S -passes=function-attrs < %s \| FileCheck %s

	define void @f() {			define void @f() {
	; CHECK-LABEL: define void @f() {			; CHECK-LABEL: define void @f() #0 {
	call void @g() [ "unknown"() ]			call void @g() [ "unknown"() ]
	ret void			ret void
	}			}

	define void @g() {			define void @g() {
	; CHECK-LABEL: define void @g() {			; CHECK-LABEL: define void @g() #0 {
	call void @f()			call void @f()
	ret void			ret void
	}			}


				; CHECK: attributes #0 = { nounwind }

test/Transforms/FunctionAttrs/optnone.ll

	; RUN: opt < %s -functionattrs -S \| FileCheck %s			; RUN: opt < %s -functionattrs -S \| FileCheck %s
				; RUN: opt < %s -passes=function-attrs -S \| FileCheck %s

	@x = global i32 0			@x = global i32 0

	define void @test_opt(i8* %p) {			define void @test_opt(i8* %p) {
	; CHECK-LABEL: @test_opt			; CHECK-LABEL: @test_opt
	; CHECK: (i8* nocapture readnone %p) #0 {			; CHECK: (i8* nocapture readnone %p) #0 {
	ret void			ret void
	}			}

	define void @test_optnone(i8* %p) noinline optnone {			define void @test_optnone(i8* %p) noinline optnone {
	; CHECK-LABEL: @test_optnone			; CHECK-LABEL: @test_optnone
	; CHECK: (i8* %p) #1 {			; CHECK: (i8* %p) #1 {
	ret void			ret void
	}			}

	declare i8 @strlen(i8*) noinline optnone			declare i8 @strlen(i8*) noinline optnone
	; CHECK-LABEL: @strlen			; CHECK-LABEL: @strlen
	; CHECK: (i8*) #1			; CHECK: (i8*) #1

	; CHECK-LABEL: attributes #0			; CHECK-LABEL: attributes #0
	; CHECK: = { norecurse readnone }			; CHECK: = { norecurse nounwind readnone }
	; CHECK-LABEL: attributes #1			; CHECK-LABEL: attributes #1
	; CHECK: = { noinline optnone }			; CHECK: = { noinline optnone }

test/Transforms/FunctionAttrs/out-of-bounds-iterator-bug.ll

	; RUN: opt -functionattrs -S < %s \| FileCheck %s			; RUN: opt -functionattrs -S < %s \| FileCheck %s
				; RUN: opt -passes=function-attrs -S < %s \| FileCheck %s

	; This checks for an iterator wraparound bug in FunctionAttrs. The previous			; This checks for an iterator wraparound bug in FunctionAttrs. The previous
	; "incorrect" behavior was inferring readonly for the %x argument in @caller.			; "incorrect" behavior was inferring readonly for the %x argument in @caller.
	; Inferring readonly for %x is actually correct, since @va_func is marked			; Inferring readonly for %x is actually correct, since @va_func is marked
	; readonly, but FunctionAttrs was inferring readonly for the wrong reasons (and			; readonly, but FunctionAttrs was inferring readonly for the wrong reasons (and
	; we _need_ the readonly on @va_func to trigger the problematic code path). It			; we _need_ the readonly on @va_func to trigger the problematic code path). It
	; is possible that in the future FunctionAttrs becomes smart enough to infer			; is possible that in the future FunctionAttrs becomes smart enough to infer
	; readonly for %x for the right reasons, and at that point this test will have			; readonly for %x for the right reasons, and at that point this test will have
	Show All 21 Lines

test/Transforms/FunctionAttrs/readnone.ll

	; RUN: opt < %s -functionattrs -S \| FileCheck %s			; RUN: opt < %s -functionattrs -S \| FileCheck %s
				; RUN: opt < %s -passes=function-attrs -S \| FileCheck %s

	; CHECK: define void @bar(i8* nocapture readnone)			; CHECK: define void @bar(i8* nocapture readnone)
	define void @bar(i8* readonly) {			define void @bar(i8* readonly) {
	call void @foo(i8* %0)			call void @foo(i8* %0)
	ret void			ret void
	}			}

	; CHECK: define void @foo(i8* nocapture readnone)			; CHECK: define void @foo(i8* nocapture readnone)
	define void @foo(i8* readonly) {			define void @foo(i8* readonly) {
	call void @bar(i8* %0)			call void @bar(i8* %0)
	ret void			ret void
	}			}

test/Transforms/FunctionAttrs/returned.ll

	; RUN: opt < %s -functionattrs -S \| FileCheck %s			; RUN: opt < %s -functionattrs -S \| FileCheck %s
				; RUN: opt < %s -passes=function-attrs -S \| FileCheck %s

	; CHECK: define i32 @test1(i32 %p, i32 %q)			; CHECK: define i32 @test1(i32 %p, i32 %q)
	define i32 @test1(i32 %p, i32 %q) {			define i32 @test1(i32 %p, i32 %q) {
	entry:			entry:
	%cmp = icmp sgt i32 %p, %q			%cmp = icmp sgt i32 %p, %q
	br i1 %cmp, label %cond.end, label %lor.lhs.false			br i1 %cmp, label %cond.end, label %lor.lhs.false

	lor.lhs.false: ; preds = %entry			lor.lhs.false: ; preds = %entry
	Show All 21 Lines

test/Transforms/Inline/cgscc-update.ll

	; RUN: opt < %s -aa-pipeline=basic-aa -passes='cgscc(function-attrs,inline)' -S \| FileCheck %s			; RUN: opt < %s -aa-pipeline=basic-aa -passes='cgscc(function-attrs,inline)' -S \| FileCheck %s
	; This test runs the inliner and the function attribute deduction. It ensures			; This test runs the inliner and the function attribute deduction. It ensures
	; that when the inliner mutates the call graph it correctly updates the CGSCC			; that when the inliner mutates the call graph it correctly updates the CGSCC
	; iteration so that we can compute refined function attributes. In this way it			; iteration so that we can compute refined function attributes. In this way it
	; is leveraging function attribute computation to observe correct call graph			; is leveraging function attribute computation to observe correct call graph
	; updates.			; updates.

	; Boring unknown external function call.			; Boring unknown external function call.
	; CHECK: declare void @unknown()			; CHECK: declare void @unknown()
	declare void @unknown()			declare void @unknown()

	; Sanity check: this should get annotated as readnone.			; Sanity check: this should get annotated as readnone.
	; CHECK: Function Attrs: readnone			; CHECK: Function Attrs: nounwind readnone
	; CHECK-NEXT: declare void @readnone()			; CHECK-NEXT: declare void @readnone()
	declare void @readnone() readnone			declare void @readnone() readnone nounwind

	; The 'test1_' prefixed functions are designed to trigger forming a new direct			; The 'test1_' prefixed functions are designed to trigger forming a new direct
	; call in the inlined body of the function. After that, we form a new SCC and			; call in the inlined body of the function. After that, we form a new SCC and
	; using that can deduce precise function attrs.			; using that can deduce precise function attrs.

	; This function should no longer exist.			; This function should no longer exist.
	; CHECK-NOT: @test1_f()			; CHECK-NOT: @test1_f()
	define internal void @test1_f(void()* %p) {			define internal void @test1_f(void()* %p) {
	entry:			entry:
	call void %p()			call void %p()
	ret void			ret void
	}			}

	; This function should have had 'readnone' deduced for its SCC.			; This function should have had 'readnone' deduced for its SCC.
	; CHECK: Function Attrs: noinline readnone			; CHECK: Function Attrs: noinline nounwind readnone
	; CHECK-NEXT: define void @test1_g()			; CHECK-NEXT: define void @test1_g()
	define void @test1_g() noinline {			define void @test1_g() noinline {
	entry:			entry:
	call void @test1_f(void()* @test1_h)			call void @test1_f(void()* @test1_h)
	ret void			ret void
	}			}

	; This function should have had 'readnone' deduced for its SCC.			; This function should have had 'readnone' deduced for its SCC.
	; CHECK: Function Attrs: noinline readnone			; CHECK: Function Attrs: noinline nounwind readnone
	; CHECK-NEXT: define void @test1_h()			; CHECK-NEXT: define void @test1_h()
	define void @test1_h() noinline {			define void @test1_h() noinline {
	entry:			entry:
	call void @test1_g()			call void @test1_g()
	call void @readnone()			call void @readnone()
	ret void			ret void
	}			}


	; The 'test2_' prefixed functions are designed to trigger forming a new direct			; The 'test2_' prefixed functions are designed to trigger forming a new direct
	; call due to RAUW-ing the returned value of a called function into the caller.			; call due to RAUW-ing the returned value of a called function into the caller.
	; This too should form a new SCC which can then be reasoned about to compute			; This too should form a new SCC which can then be reasoned about to compute
	; precise function attrs.			; precise function attrs.

	; This function should no longer exist.			; This function should no longer exist.
	; CHECK-NOT: @test2_f()			; CHECK-NOT: @test2_f()
	define internal void()* @test2_f() {			define internal void()* @test2_f() {
	entry:			entry:
	ret void()* @test2_h			ret void()* @test2_h
	}			}

	; This function should have had 'readnone' deduced for its SCC.			; This function should have had 'readnone' deduced for its SCC.
	; CHECK: Function Attrs: noinline readnone			; CHECK: Function Attrs: noinline nounwind readnone
	; CHECK-NEXT: define void @test2_g()			; CHECK-NEXT: define void @test2_g()
	define void @test2_g() noinline {			define void @test2_g() noinline {
	entry:			entry:
	%p = call void()* @test2_f()			%p = call void()* @test2_f()
	call void %p()			call void %p()
	ret void			ret void
	}			}

	; This function should have had 'readnone' deduced for its SCC.			; This function should have had 'readnone' deduced for its SCC.
	; CHECK: Function Attrs: noinline readnone			; CHECK: Function Attrs: noinline nounwind readnone
	; CHECK-NEXT: define void @test2_h()			; CHECK-NEXT: define void @test2_h()
	define void @test2_h() noinline {			define void @test2_h() noinline {
	entry:			entry:
	call void @test2_g()			call void @test2_g()
	call void @readnone()			call void @readnone()
	ret void			ret void
	}			}

	▲ Show 20 Lines • Show All 66 Lines • ▼ Show 20 Lines

	; The 'test4_' prefixed functions are designed to trigger forming a new direct			; The 'test4_' prefixed functions are designed to trigger forming a new direct
	; call in the inlined body of the function similar to 'test1_'. However, after			; call in the inlined body of the function similar to 'test1_'. However, after
	; that we continue to inline another edge of the graph forcing us to do a more			; that we continue to inline another edge of the graph forcing us to do a more
	; interesting call graph update for the new call edge. Eventually, we still			; interesting call graph update for the new call edge. Eventually, we still
	; form a new SCC and should use that can deduce precise function attrs.			; form a new SCC and should use that can deduce precise function attrs.

	; This function should have had 'readnone' deduced for its SCC.			; This function should have had 'readnone' deduced for its SCC.
	; CHECK: Function Attrs: noinline readnone			; CHECK: Function Attrs: noinline nounwind readnone
	; CHECK-NEXT: define void @test4_f1()			; CHECK-NEXT: define void @test4_f1()
	define void @test4_f1() noinline {			define void @test4_f1() noinline {
	entry:			entry:
	call void @test4_h()			call void @test4_h()
	ret void			ret void
	}			}

	; CHECK-NOT: @test4_f2			; CHECK-NOT: @test4_f2
	define internal void @test4_f2() {			define internal void @test4_f2() {
	entry:			entry:
	call void @test4_f1()			call void @test4_f1()
	ret void			ret void
	}			}

	; CHECK-NOT: @test4_g			; CHECK-NOT: @test4_g
	define internal void @test4_g(void()* %p) {			define internal void @test4_g(void()* %p) {
	entry:			entry:
	call void %p()			call void %p()
	ret void			ret void
	}			}

	; This function should have had 'readnone' deduced for its SCC.			; This function should have had 'readnone' deduced for its SCC.
	; CHECK: Function Attrs: noinline readnone			; CHECK: Function Attrs: noinline nounwind readnone
	; CHECK-NEXT: define void @test4_h()			; CHECK-NEXT: define void @test4_h()
	define void @test4_h() noinline {			define void @test4_h() noinline {
	entry:			entry:
	call void @test4_g(void()* @test4_f2)			call void @test4_g(void()* @test4_f2)
	ret void			ret void
	}			}

test/Transforms/PruneEH/2008-06-02-Weak.ll

	; RUN: opt < %s -prune-eh -S \| not grep nounwind			; RUN: opt < %s -prune-eh -S \| FileCheck %s
				; RUN: opt < %s -passes='function-attrs,function(simplify-cfg)' -S \| FileCheck %s

				; We should not infer 'nounwind' for/from a weak function,
				; since it can be overriden by throwing implementation.
				;
				; CHECK-LABEL: define weak void @f()
	define weak void @f() {			define weak void @f() {
	entry:			entry:
	ret void			ret void
	}			}

				; CHECK-LABEL: define void @g()
	define void @g() {			define void @g() {
	entry:			entry:
	call void @f()			call void @f()
	ret void			ret void
	}			}

				; CHECK-NOT: {{^}}attributes #{{[0-9].*}} nounwind

test/Transforms/PruneEH/ipo-nounwind.ll

	; RUN: opt -S -prune-eh < %s \| FileCheck %s			; RUN: opt -S -prune-eh < %s \| FileCheck %s
				; RUN: opt -S -passes='function-attrs,function(simplify-cfg)' < %s \| FileCheck %s

	declare void @may_throw()			declare void @may_throw()

	; @callee below may be an optimized form of this function, which can			; @callee below may be an optimized form of this function, which can
	; throw at runtime (see r265762 for more details):			; throw at runtime (see r265762 for more details):
	;			;
	; define linkonce_odr void @callee(i32* %ptr) noinline {			; define linkonce_odr void @callee(i32* %ptr) noinline {
	; entry:			; entry:
	Show All 34 Lines

test/Transforms/PruneEH/operand-bundles.ll

	; RUN: opt < %s -prune-eh -S \| FileCheck %s			; RUN: opt < %s -prune-eh -S \| FileCheck %s
				; RUN: opt < %s -passes='function-attrs,function(simplify-cfg)' -S \| FileCheck %s

	declare void @nounwind() nounwind			declare void @nounwind() nounwind

	define internal void @foo() {			define internal void @foo() {
	call void @nounwind()			call void @nounwind()
	ret void			ret void
	}			}

	Show All 17 Lines

test/Transforms/PruneEH/pr23971.ll

	; RUN: opt -S -prune-eh < %s \| FileCheck %s			; RUN: opt -S -prune-eh < %s \| FileCheck %s
				; RUN: opt -S -passes='function-attrs,function(simplify-cfg)' < %s \| FileCheck %s

	target datalayout = "e-m:e-i64:64-f80:128-n8:16:32:64-S128"			target datalayout = "e-m:e-i64:64-f80:128-n8:16:32:64-S128"
	target triple = "x86_64-unknown-linux-gnu"			target triple = "x86_64-unknown-linux-gnu"

	define void @f() #0 {			define void @f() #0 {
	entry:			entry:
	call void asm sideeffect "ret\0A\09", "~{dirflag},~{fpsr},~{flags}"()			call void asm sideeffect "ret\0A\09", "~{dirflag},~{fpsr},~{flags}"()
	unreachable			unreachable
	Show All 12 Lines

test/Transforms/PruneEH/pr26263.ll

	; RUN: opt -prune-eh -S < %s \| FileCheck %s			; PruneEH is less powerful than simplify-cfg in terms of cfg simplification,
				; so it leaves some of the unreachable stuff hanging around.
				; Checking it with CHECK-OLD.
				;
				; RUN: opt -prune-eh -S < %s \| FileCheck %s --check-prefix=CHECK --check-prefix=CHECK-OLD
				; RUN: opt -passes='function-attrs,function(simplify-cfg)' -S < %s \| FileCheck %s --check-prefix=CHECK --check-prefix=CHECK-NEW

	target datalayout = "e-m:x-p:32:32-i64:64-f80:32-n8:16:32-a:0:32-S32"			target datalayout = "e-m:x-p:32:32-i64:64-f80:32-n8:16:32-a:0:32-S32"
	target triple = "i386-pc-windows-msvc"			target triple = "i386-pc-windows-msvc"

	declare void @neverthrows() nounwind			declare void @neverthrows() nounwind

	define void @test1() personality i32 (...)* @__CxxFrameHandler3 {			define void @test1() personality i32 (...)* @__CxxFrameHandler3 {
	invoke void @neverthrows()			invoke void @neverthrows()
	to label %try.cont unwind label %cleanuppad			to label %try.cont unwind label %cleanuppad

	try.cont:			try.cont:
	ret void			ret void

	cleanuppad:			cleanuppad:
	%cp = cleanuppad within none []			%cp = cleanuppad within none []
	br label %cleanupret			br label %cleanupret

	cleanupret:			cleanupret:
	cleanupret from %cp unwind to caller			cleanupret from %cp unwind to caller
	}			}

	; CHECK-LABEL: define void @test1(			; CHECK-LABEL: define void @test1(
	; CHECK: call void @neverthrows()			; CHECK: call void @neverthrows()
				; CHECK-NEW-NEXT: ret void
				; CHECK-NEW-NEXT: }
				; CHECK-OLD: ret void

	; CHECK: %[[cp:.*]] = cleanuppad within none []			; CHECK-OLD: %[[cp:.*]] = cleanuppad within none []
	; CHECK-NEXT: unreachable			; CHECK-OLD-NEXT: unreachable

	; CHECK: cleanupret from %[[cp]] unwind to caller			; CHECK-OLD: cleanupret from %[[cp]] unwind to caller

	define void @test2() personality i32 (...)* @__CxxFrameHandler3 {			define void @test2() personality i32 (...)* @__CxxFrameHandler3 {
	invoke void @neverthrows()			invoke void @neverthrows()
	to label %try.cont unwind label %catchswitch			to label %try.cont unwind label %catchswitch

	try.cont:			try.cont:
	ret void			ret void

	catchswitch:			catchswitch:
	%cs = catchswitch within none [label %catchpad] unwind to caller			%cs = catchswitch within none [label %catchpad] unwind to caller

	catchpad:			catchpad:
	%cp = catchpad within %cs []			%cp = catchpad within %cs []
	unreachable			unreachable

	ret:			ret:
	ret void			ret void
	}			}

	; CHECK-LABEL: define void @test2(			; CHECK-LABEL: define void @test2(
	; CHECK: call void @neverthrows()			; CHECK: call void @neverthrows()
				; CHECK-NEW-NEXT: ret void
				; CHECK-NEW-NEXT: }
				; CHECK-OLD: ret void

				; CHECK-OLD: %[[cs:.*]] = catchswitch within none [label

	; CHECK: %[[cs:.*]] = catchswitch within none [label			; CHECK-OLD: catchpad within %[[cs]] []
				; CHECK-OLD-NEXT: unreachable

	; CHECK: catchpad within %[[cs]] []			; CHECK-OLD:ret void
	; CHECK-NEXT: unreachable

	declare i32 @__CxxFrameHandler3(...)			declare i32 @__CxxFrameHandler3(...)

test/Transforms/PruneEH/recursivetest.ll

	; RUN: opt < %s -prune-eh -S \| not grep invoke			; RUN: opt < %s -prune-eh -S \| FileCheck %s
				; RUN: opt < %s -passes='function-attrs,function(simplify-cfg)' -S \| FileCheck %s

				; CHECK-LABEL: define internal i32 @foo()
	define internal i32 @foo() personality i32 (...)* @__gxx_personality_v0 {			define internal i32 @foo() personality i32 (...)* @__gxx_personality_v0 {
				; CHECK-NOT: invoke i32 @foo()
	invoke i32 @foo( )			invoke i32 @foo( )
	to label %Normal unwind label %Except ; <i32>:1 [#uses=0]			to label %Normal unwind label %Except ; <i32>:1 [#uses=0]
	Normal: ; preds = %0			Normal: ; preds = %0
	ret i32 12			ret i32 12
	Except: ; preds = %0			Except: ; preds = %0
	landingpad { i8*, i32 }			landingpad { i8*, i32 }
	catch i8* null			catch i8* null
	ret i32 123			ret i32 123
	}			}

				; CHECK-LABEL: define i32 @caller()
	define i32 @caller() personality i32 (...)* @__gxx_personality_v0 {			define i32 @caller() personality i32 (...)* @__gxx_personality_v0 {
				; CHECK-NOT: invoke i32 @foo()
	invoke i32 @foo( )			invoke i32 @foo( )
	to label %Normal unwind label %Except ; <i32>:1 [#uses=0]			to label %Normal unwind label %Except ; <i32>:1 [#uses=0]
	Normal: ; preds = %0			Normal: ; preds = %0
	ret i32 0			ret i32 0
	Except: ; preds = %0			Except: ; preds = %0
	landingpad { i8*, i32 }			landingpad { i8*, i32 }
	catch i8* null			catch i8* null
	ret i32 1			ret i32 1
	}			}

	declare i32 @__gxx_personality_v0(...)			declare i32 @__gxx_personality_v0(...)

test/Transforms/PruneEH/seh-nounwind.ll

	; RUN: opt -S -prune-eh < %s \| FileCheck %s			; RUN: opt -S -prune-eh < %s \| FileCheck %s
				; RUN: opt -S -passes='function-attrs,function(simplify-cfg)' < %s \| FileCheck %s

	; Don't remove invokes of nounwind functions if the personality handles async			; Don't remove invokes of nounwind functions if the personality handles async
	; exceptions. The @div function in this test can fault, even though it can't			; exceptions. The @div function in this test can fault, even though it can't
	; throw a synchronous exception.			; throw a synchronous exception.

	define i32 @div(i32 %n, i32 %d) nounwind {			define i32 @div(i32 %n, i32 %d) nounwind {
	entry:			entry:
	%div = sdiv i32 %n, %d			%div = sdiv i32 %n, %d
	Show All 22 Lines

test/Transforms/PruneEH/simpletest.ll

	; RUN: opt < %s -prune-eh -S \| not grep invoke			; RUN: opt < %s -prune-eh -S \| FileCheck %s
				; RUN: opt < %s -passes='function-attrs,function(simplify-cfg)' -S \| FileCheck %s

	declare void @nounwind() nounwind			declare void @nounwind() nounwind

	define internal void @foo() {			define internal void @foo() {
	call void @nounwind()			call void @nounwind()
	ret void			ret void
	}			}

				; CHECK-LABEL: define i32 @caller()
	define i32 @caller() personality i32 (...)* @__gxx_personality_v0 {			define i32 @caller() personality i32 (...)* @__gxx_personality_v0 {
				; CHECK-NOT: invoke void @foo
	invoke void @foo( )			invoke void @foo( )
	to label %Normal unwind label %Except			to label %Normal unwind label %Except

	Normal: ; preds = %0			Normal: ; preds = %0
	ret i32 0			ret i32 0

	Except: ; preds = %0			Except: ; preds = %0
	landingpad { i8*, i32 }			landingpad { i8*, i32 }
	catch i8* null			catch i8* null
	ret i32 1			ret i32 1
	}			}

	declare i32 @__gxx_personality_v0(...)			declare i32 @__gxx_personality_v0(...)

This is an archive of the discontinued LLVM Phabricator instance.

[PM][FunctionAttrs] add NoUnwind attribute inference to PostOrderFunctionAttrs passClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 139097

lib/Transforms/IPO/FunctionAttrs.cpp

test/Other/cgscc-devirt-iteration.ll

test/Transforms/FunctionAttrs/2008-09-03-Mutual.ll

test/Transforms/FunctionAttrs/2008-09-03-ReadNone.ll

test/Transforms/FunctionAttrs/2008-09-03-ReadOnly.ll

test/Transforms/FunctionAttrs/2008-09-13-VolatileRead.ll

test/Transforms/FunctionAttrs/2008-12-29-Constant.ll

test/Transforms/FunctionAttrs/2009-01-02-LocalStores.ll

test/Transforms/FunctionAttrs/2010-10-30-volatile.ll

test/Transforms/FunctionAttrs/assume.ll

test/Transforms/FunctionAttrs/atomic.ll

test/Transforms/FunctionAttrs/comdat-ipo.ll

test/Transforms/FunctionAttrs/convergent.ll

test/Transforms/FunctionAttrs/int_sideeffect.ll

test/Transforms/FunctionAttrs/nocapture.ll

test/Transforms/FunctionAttrs/nonnull-global.ll

test/Transforms/FunctionAttrs/nonnull.ll

test/Transforms/FunctionAttrs/norecurse.ll

test/Transforms/FunctionAttrs/operand-bundles-scc.ll

test/Transforms/FunctionAttrs/optnone.ll

test/Transforms/FunctionAttrs/out-of-bounds-iterator-bug.ll

test/Transforms/FunctionAttrs/readnone.ll

test/Transforms/FunctionAttrs/returned.ll

test/Transforms/Inline/cgscc-update.ll

test/Transforms/PruneEH/2008-06-02-Weak.ll

test/Transforms/PruneEH/ipo-nounwind.ll

test/Transforms/PruneEH/operand-bundles.ll

test/Transforms/PruneEH/pr23971.ll

test/Transforms/PruneEH/pr26263.ll

test/Transforms/PruneEH/recursivetest.ll

test/Transforms/PruneEH/seh-nounwind.ll

test/Transforms/PruneEH/simpletest.ll

[PM][FunctionAttrs] add NoUnwind attribute inference to PostOrderFunctionAttrs pass
ClosedPublic