This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
mlir/
-
include/mlir/
-
mlir/
-
Analysis/
6/6
NumberOfExecutions.h
-
Dialect/
-
Async/IR/
-
IR/
-
Async.h
2/2
AsyncOps.td
-
SCF/
1/1
SCFOps.td
-
Interfaces/
-
ControlFlowInterfaces.h
3/3
ControlFlowInterfaces.td
-
Support/
1/1
MathExtras.h
-
lib/
-
Analysis/
-
CMakeLists.txt
36/38
NumberOfExecutions.cpp
-
Dialect/
-
Async/IR/
-
IR/
1/1
Async.cpp
-
SCF/
1/1
SCF.cpp
-
Interfaces/
1/1
ControlFlowInterfaces.cpp
-
test/
-
Analysis/
-
test-number-of-block-executions.mlir
-
test-number-of-operation-executions.mlir
-
lib/Transforms/
-
Transforms/
-
CMakeLists.txt
-
TestNumberOfExecutions.cpp
-
tools/mlir-opt/
-
mlir-opt/
-
mlir-opt.cpp
-
unittests/Support/
-
Support/
-
CMakeLists.txt
-
MathExtrasTest.cpp

Differential D90922

[mlir] Add NumberOfExecutions analysis + update RegionBranchOpInterface interface to query number of region invocations
ClosedPublic

Authored by ezhulenev on Nov 6 2020, 3:02 AM.

Download Raw Diff

Details

Reviewers

silvas
mehdi_amini
ftynse
rriddle

Commits

rGbb0d5f767dd7: [mlir] Add NumberOfExecutions analysis + update RegionBranchOpInterface…

Summary

Implements RFC discussed in: https://llvm.discourse.group/t/rfc-operationinstancesinterface-or-any-better-name/2158/10

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

ezhulenev created this revision.Nov 6 2020, 3:02 AM

Herald added a project: Restricted Project. · View Herald TranscriptNov 6 2020, 3:02 AM

Herald added subscribers: rdzhabarov, tatianashp, msifontes and 14 others. · View Herald Transcript

ezhulenev requested review of this revision.Nov 6 2020, 3:02 AM

Herald added subscribers: stephenneuendorffer, nicolasvasilache. · View Herald TranscriptNov 6 2020, 3:02 AM

ezhulenev edited the summary of this revision. (Show Details)Nov 6 2020, 3:04 AM

ezhulenev added reviewers: silvas, mehdi_amini, ftynse.

ezhulenev edited the summary of this revision. (Show Details)Nov 6 2020, 3:08 AM

Harbormaster completed remote builds in B77853: Diff 303393.Nov 6 2020, 3:21 AM

Follow block parents to compute number of block executions + compute number of executions for ForOp with constant bounds

ezhulenev edited the summary of this revision. (Show Details)Nov 6 2020, 4:12 AM

ftynse added a subscriber: wsmoses.Nov 6 2020, 4:17 AM

ezhulenev retitled this revision from [mlir] Add NumberOfExecutions analysis + RegionInvocationsOpInterface to controlflow interfaces to [mlir] Add NumberOfExecutions analysis + update RegionBranchOpInterface interface to query number of region invocations.Nov 6 2020, 4:23 AM

Harbormaster completed remote builds in B77860: Diff 303402.Nov 6 2020, 4:28 AM

Very nice!

I have a bunch of nits and several generalization suggestions.

mlir/include/mlir/Analysis/NumberOfExecutions.h
38	Nit: explicit?
69	There doesn't seem to be an accessor for this field, so IMO it can be dropped (unless there is an anticipated need for it). At which point, we can also consider `using BlockNumberOfExecutionsInfo = Optional<int64_t>`, which can be nicely constructed from an `int64_t` or from `llvm::None`.
mlir/include/mlir/Dialect/Async/IR/AsyncOps.td
86–88	It should be possible to do `DeclareOpInterfaceMethods<RegionBranchOpInterface, ["getNumRegionInvocations"]>` in the `def` and avoid this explicit declaration.
mlir/include/mlir/Interfaces/ControlFlowInterfaces.td
135	Nit: I'd be explicit and say something like "Populates `countPerRegion` with the number..."
137	Nit: could we have a named constant instead of -1?
149	Nit: countPerRegion.resize(numRegions, -1) avoids the loop
mlir/lib/Analysis/NumberOfExecutions.cpp
45–46	I wonder if this can be generalized to work on all constant-like operations. Currently, we only have a `ConstantLike` trait, so we cannot get the constant value as attribute without knowing the concrete op type. Maybe we can turn ConstantLike into an interface with `value` being an interface method. Out of scope for this commit probably.
53	Could we rather query the op? Hardcoding FuncOp makes this analysis non-applicable to custom function types (we have some, e.g., in LLVM dialect and in Flang).
58	Let's also be future-proof and guard against graph regions (https://mlir.llvm.org/docs/LangRef/#graph-regions). They don't have blocks (yet), but if they do, the logic of following block successors below will not necessarily apply. An assertion could do.
66	Nit: we usually don't declare function stack variables as `const` in MLIR
71	Nit: I would have just used `auto` and kept the original lambda type here, it's not like we are passing it around.
mlir/lib/Dialect/SCF/SCF.cpp
303–306	Could we rather extend the one we already have in MathExtras.h to support negative RHS?

Address PR comments

mlir/include/mlir/Analysis/NumberOfExecutions.h
69	I added a custom non-default constructible type to catch accidental default constructions of None instead of a failure. And maybe more advanced analysis can add some other fields in the future. Added this to class documentation for the next reader and removed unused Block*.
mlir/include/mlir/Dialect/Async/IR/AsyncOps.td
86–88	Cool, learning something new every day :)
mlir/lib/Analysis/NumberOfExecutions.cpp
45–46	Yeah, that would be useful also in the context of TF graphs with tf.Const ops, but I did not find any relevant examples and chose the easiest path.
53	It guess in can just implement RegionBranchOpInterface, I decided to special case it because it's "built in" and not defined in any of the dialect .td files. I expect all other FuncOpLike things to implement RegionBranchOpInterface.
71	Unfortunately lambdas do not support recursion without ugly tricks.

Harbormaster completed remote builds in B77867: Diff 303415.Nov 6 2020, 5:45 AM

LGTM. Please wait for one more reviewer to double-check the logic according to the forum discussion.

mlir/include/mlir/Analysis/NumberOfExecutions.h
69	Ack. (Phabricator doesn't let anybody other than the author to mark comments as done).
mlir/lib/Analysis/NumberOfExecutions.cpp
53	ODS is not mandatory to implement an interface, ops defined in C++ can derive `Traits<InterfaceName::Trait>` and get the same functionality. I'm fine with your current implementation given the explanation provided. We should look into having FuncOp implement the interface separately and consult with @rriddle about future plans regarding it being built-in.

This revision is now accepted and ready to land.Nov 6 2020, 6:25 AM

ezhulenev marked 2 inline comments as done.Nov 6 2020, 8:43 AM

ezhulenev added inline comments.

mlir/lib/Analysis/NumberOfExecutions.cpp
53	Yeah, I know, I just don't know is it "ok" to add it to built in type :) Assuming `FuncOp` is as fundamental as `Region` I think it's ok to have a special case, but I'd like to hear opinion from someone with a better judgement.

mehdi_amini added inline comments.Nov 6 2020, 8:56 AM

mlir/lib/Analysis/NumberOfExecutions.cpp
53	FuncOp isn't as fundamental as Region, I rather treat FuncOp as any other op. For example LLVM, SPIRV, GPU dialects aren't using FuncOp.

ezhulenev added inline comments.Nov 6 2020, 9:10 AM

mlir/lib/Analysis/NumberOfExecutions.cpp
53	I've looked into adding `RegionBranchOpInterface to` FuncOp`, but it feels weird because the `FuncOp` does not execute the region, only the `CallOp` does. For example: `getSuccessorEntryOperands`: "Returns the operands of this operation used as the entry arguments", but the `FuncOp` is `OpTrait::ZeroOperands`. Maybe add a special trait, something like `OpTrait::InvokesRegionsOnce`, an additional piece of information just like `OpTrait::OneRegion`. Or just rely on `OpTrait::FunctionLike` here, assuming that all functions execute attached region once.

Use FunctionLike trait to to get number of region invocations

Harbormaster completed remote builds in B77909: Diff 303488.Nov 6 2020, 10:10 AM

rriddle requested changes to this revision.Nov 6 2020, 11:07 AM

rriddle added inline comments.

mlir/lib/Analysis/NumberOfExecutions.cpp
14	llvm includes go after MLIR includes.
19	Is the dependency on standard ops necessary?
23	Please trim some of these includes, several are unnecessary.
36	Missing static here?
66	This is not correct and seems like it would result in incorrect usages of the analysis. If you want to estimate function executions, you'll need to use the callgraph.
75	Drop trivial braces.
122	Nit: Flip the condition of this if, and remove the trivial braces. if (!...) blockInfo.insert
mlir/lib/Dialect/Async/IR/Async.cpp
125	nit: Seems like push_back(1) is better than an explicit resize and assignment.
mlir/lib/Interfaces/ControlFlowInterfaces.cpp
78	Prefer explicit qualification, i.e. `mlir::kUnknownNumRegionInvocations`

This revision now requires changes to proceed.Nov 6 2020, 11:07 AM

rriddle added inline comments.Nov 6 2020, 11:08 AM

mlir/lib/Analysis/NumberOfExecutions.cpp
66	Why should functions be default to executed once? That doesn't seem like a conservatively safe default.

silvas added inline comments.Nov 6 2020, 12:48 PM

mlir/lib/Analysis/NumberOfExecutions.cpp
66	I agree, there seems to be a code smell here. One solution is for queries to be of the form `Optional<int64_t> getNumberOfExecutions(Block block, Region assumingThisRegionIsEnteredOnce)`. So the analysis could hold a `DenseMap<Block *, NumberOfExecutionsAssumingDirectParentRegionEnteredOnce>`. Then on a query, you would walk up parents, multiplying `NumberOfExecutionsAssumingDirectParentRegionEnteredOnce` at each step until we reach `assumingThisRegionIsEnteredOnce` (and until we implement a call graph analysis, we will return None if `assumingThisRegionIsEnteredOnce` is not an ancestor of `block`) And the body of a module should probably be an illegal Region to pass as `assumingThisRegionIsEnteredOnce`, because it is a "declarative region" and never "executes" (we might need a new IR property to represent this?) test-print-number-of-executions would have some hardcoded policy to print values assuming `assumingThisRegionIsEnteredOnce == enclosingFunc.getBody()`

Address PR comments

mlir/lib/Analysis/NumberOfExecutions.cpp
19	It's needed for dyn_cast<ConstantOp>
66	This is roughly ~= "when a function called it executes its entry block once". And this seems like a reasonable assumption. This is not about how many call operations are in the graph.

silvas added inline comments.Nov 6 2020, 12:59 PM

mlir/lib/Analysis/NumberOfExecutions.cpp
66	what about a function like def f(i): print("here") # How many executions will this be assigned? if i > 0: f(i - 1) I think we need to be more precise about what we mean by "execution" here. Also, does this pass correctly handle a case like? def g(i): if i == 0: abort() print("here") # How many executions?

Harbormaster completed remote builds in B77925: Diff 303528.Nov 6 2020, 1:07 PM

At the moment it is somewhat unclear at what scope this analysis is intended to be constructed and used. It seems like this currently doesn't handle things related to the callgraph, doesn't handle control flow well, and assumes that it only operates locally within a single function? I'm slightly concerned that we are building an analysis that is very tied with the constraints on async tokens as determined by D90716, without much reusability outside of those constraints. Along with Sean's point, I don't think the concept of "Execution" is very well scoped here and leads to some weird code smells with functions/modules/etc.

mlir/include/mlir/Dialect/SCF/SCFOps.td
201	Replace -1 here with kUnknownNumRegionInvocations.
mlir/lib/Analysis/NumberOfExecutions.cpp
54	Use m_Constant here instead, we should avoid hardcoding operation types whenever possible.

Use matchPattern to match constants

In D90922#2379863, @rriddle wrote:

At the moment it is somewhat unclear at what scope this analysis is intended to be constructed and used. It seems like this currently doesn't handle things related to the callgraph ...

I'd say that call graph is irrelevant for this analysis. Call operations "consumes" SSA values and then it is the responsibility of the callee to do the right thing. But I'm speaking from the point of view of async-ref-counting, I'd love to hear what @ftynse can tell about his need for register assignment.

Maybe there is a better terminology then "number of executions" that is less confusing.

mlir/lib/Analysis/NumberOfExecutions.cpp
66	#1. 1 execution of the basic block that contains print operations. Function call is irrelevant here. #2. Unknown, because it involves dynamic control flow CFG will look something like this: ^bb0 %cond = cmp i == 0 cond_br %cond ^abort, ^print ^abort br ^print ^print def g(i): abort() print("here") # is a bit more tricky, but let's assume this is an # estimate given that no operations can terminate a program I think we need to be more precise about what we mean by "execution" here. "How many times this operation will be executed in this region assuming region invoked N times". Call operation is about completely different region (different instance of a region).

FWIW Original async ref counting PR uses "number of operation instances in the region"

silvas added inline comments.Nov 6 2020, 3:23 PM

mlir/lib/Analysis/NumberOfExecutions.cpp
66	I see that the current code actually does walk parents in the way I described. Sorry. I thought somewhat about the definition of "execution" that we mean here. It's really "for each time that Region is entered, how many times will given operation be executed". That makes the definition work for my case `f` above, since it yields `getNumberOfExecutions(thePrintOp, f.getBody()) == 1`. Also, because of the possibility of aborting or calling a function that cannot be proven to terminate in the middle of a block, I think it is best for getNumberOfExecutions to take an `Operation ` instead of a Block, so that cases like this behave as intended, and users don't have to do their own within-a-block check: def h(): print("here1") # Number of executions is 1 per entry to h.getBody() call @weProvedThatItAbortsOrInfiniteLoops() print("here") # Number of executions is 0 per entry to h.getBody() As such, could we do something like an interface to this analysis that looks like: `Optional<IntegerRange> getNumberOfExecutions(Operation op, Region *perEntryOfThisRegion)` IntegerRange is just a pair of integers that represent a lower and upper bound on the number of executions, which composes nicely to handle cases where we can't prove that the statement executes at all (like those tricky abort/infinite-loop cases) by having a lower bound of 0 (we could do something more precise than an interval, such as an arbitrary set of integers, but that might get unwieldy). To handle this correctly, getNumRegionInvocations will need to be wordsmithed to something like "number of times the body will be invoked if the body yields normally (i.e. doesn't abort or invoke an infinite loop)". Then this analysis can do a bottom up traversal of the region tree, doing the op-level analysis within each block looking for possibly aborting/infiniteLooping operations and correctly handling any ops that follow ops that might abort/infiniteLoop, and transitively propagating that up the region tree. We can make this more precise with interprocedural analysis that does the equivalent of inferring attributes like LLVM's norecurse/willreturn/mustprogress (see https://llvm.org/docs/LangRef.html#function-attributes) Under this definition, we have: def f(i): print("here") # Number of executions is 1 per entry to f.getBody() if i > 0: f(i - 1) def g(i): if i == 0: abort() print("here") # Number of executions is 0 or 1 per entry to g.getBody() def h(): print("here1") # Number of executions is 1 per entry to h.getBody() call @weProvedThatItAbortsOrInfiniteLoops() print("here") # Number of executions is 0 per entry to h.getBody() def i(): print("here1") # Number of executions is 1 per entry to i.getBody() call @cannotProveItDoesntAbortOrInfiniteLoop() print("here") # Number of executions is 0 or 1 per entry to i.getBody()

Reworderd NumberOfExecutions documentation

Update getNumRegionInvocations documentation to mention "yields normally" property

ezhulenev added inline comments.Nov 6 2020, 3:48 PM

mlir/lib/Analysis/NumberOfExecutions.cpp
66	I updated documentation for the interface and NumberOfExecutions to mention "yields normally" and "each time" properties. I like this API ... Optional<IntegerRange> getNumberOfExecutions(Operation op, Region perEntryOfThisRegion) ... but for now computing the `IntegerRange` is not really needed and it adds a lot of complexity. I plan to add API with Optional<int64_t> to represent "known fixed number" vs "everything else" Optional<int64_t> getNumberOfExecutions(Operation op, Region perEntryOfThisRegion); and assume that all operations are "normal": do not abort, or go into infinite loop == "completed in finite amount of time".

Harbormaster completed remote builds in B77946: Diff 303564.Nov 6 2020, 4:05 PM

Harbormaster completed remote builds in B77949: Diff 303568.Nov 6 2020, 4:26 PM

Harbormaster completed remote builds in B77952: Diff 303571.Nov 6 2020, 4:45 PM

Implment Optional<int64_t> getNumberOfExecutions(Operation *op, Region *perEntryOfThisRegion) API

Harbormaster completed remote builds in B78006: Diff 303668.Nov 7 2020, 1:48 PM

Check that block lies inside the perEntryOfThisRegion

Harbormaster completed remote builds in B78030: Diff 303704.Nov 8 2020, 3:12 AM

silvas added inline comments.Nov 9 2020, 12:55 PM

mlir/lib/Analysis/NumberOfExecutions.cpp
66	Thanks. I think with the new API, we don't need these special cases. FunctionLike already didn't make sense because an op is not "executed" by the module region (we should probably return "unknown" (or an error) for `getNumberOfExecutions(someOpInAFunction, module.getBody())`. OpTrait::OneRegion doesn't guarantee anything about the number of invocations. E.g. scf::ForOp implements OpTrait::OneRegion.
136	The note that all operates are assumed to not abort/infloop should be in the documentation comment for users of the API; it is not an implementation detail because it makes the analysis incorrect for some potential users.

Do not special case functions for number of executions computation

mlir/lib/Analysis/NumberOfExecutions.cpp
66	Done. It required few more changes, but the special case for functions is gone.
136	Added as a top-level documentation because it is also relevant to blocks.

Thanks Eugene. All I have left are nits. This LGTM.

We probably want to wait on River to give final sign-off. In particular surrounding this analysis not being conservative w.r.t. potentially aborting/nonterminating computations.

mlir/include/mlir/Support/MathExtras.h
32	nit: do we have a unittest you could extend? (or add? :) )

Harbormaster completed remote builds in B78192: Diff 303988.Nov 9 2020, 2:46 PM

Add a unit test for Support/MathExtras

Harbormaster completed remote builds in B78195: Diff 303995.Nov 9 2020, 3:20 PM

The new wording makes more sense to me and clears the confusion I had previously, thanks! LGTM after the remaining comments.

mlir/include/mlir/Analysis/NumberOfExecutions.h
38	nit: in a finite amount
98	nit: Use /// for comments here.
mlir/lib/Analysis/NumberOfExecutions.cpp
16	Can this function include be removed?
39	This seems like it would break when a graph region is nested within a CFG region. Can we avoid the direct assert here and just use `kUnknownNumRegionInvocations` instead? I could see an assert on the top level operation, but is there a reason to assert on nested operations?
51–60
129	nit: in a finite amount of time
209–222

This revision is now accepted and ready to land.Nov 10 2020, 11:06 PM

Address PR comments

mlir/lib/Analysis/NumberOfExecutions.cpp
209–222	I had to walk block and then operations to get sensible output for filecheck, otherwise it prints inner operation before its parent.

Harbormaster completed remote builds in B78420: Diff 304424.Nov 11 2020, 1:28 AM

Rebased

This revision was landed with ongoing or failed builds.Nov 11 2020, 1:43 AM

Closed by commit rGbb0d5f767dd7: [mlir] Add NumberOfExecutions analysis + update RegionBranchOpInterface… (authored by ezhulenev). · Explain Why

This revision was automatically updated to reflect the committed changes.

ezhulenev added a commit: rGbb0d5f767dd7: [mlir] Add NumberOfExecutions analysis + update RegionBranchOpInterface….

Harbormaster completed remote builds in B78421: Diff 304427.Nov 11 2020, 1:59 AM

Revision Contents

Path

Size

mlir/

include/

mlir/

Analysis/

NumberOfExecutions.h

107 lines

Dialect/

Async/

IR/

Async.h

1 line

AsyncOps.td

3 lines

SCF/

SCFOps.td

5 lines

Interfaces/

ControlFlowInterfaces.h

3 lines

ControlFlowInterfaces.td

20 lines

Support/

MathExtras.h

14 lines

lib/

Analysis/

CMakeLists.txt

4 lines

NumberOfExecutions.cpp

243 lines

Dialect/

Async/

IR/

Async.cpp

21 lines

SCF/

SCF.cpp

21 lines

Interfaces/

ControlFlowInterfaces.cpp

3 lines

test/

Analysis/

test-number-of-block-executions.mlir

192 lines

test-number-of-operation-executions.mlir

66 lines

lib/

Transforms/

CMakeLists.txt

1 line

TestNumberOfExecutions.cpp

57 lines

tools/

mlir-opt/

mlir-opt.cpp

4 lines

unittests/

Support/

CMakeLists.txt

1 line

MathExtrasTest.cpp

27 lines

Diff 304430

mlir/include/mlir/Analysis/NumberOfExecutions.h

This file was added.

				//===- NumberOfExecutions.h - Number of executions analysis ------ C++ --===//
				//
				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
				//
				//===----------------------------------------------------------------------===//
				//
				// This file contains an analysis for computing how many times a block within a
				// region is executed each time that region is entered. The analysis
				// iterates over all associated regions that are attached to the given top-level
				// operation.
				//
				// It is possible to query number of executions information on block level.
				//
				//===----------------------------------------------------------------------===//

				#ifndef MLIR_ANALYSIS_NUMBER_OF_EXECUTIONS_H
				#define MLIR_ANALYSIS_NUMBER_OF_EXECUTIONS_H

				#include "mlir/Support/LLVM.h"
				#include "llvm/ADT/DenseMap.h"
				#include "llvm/ADT/Optional.h"

				namespace mlir {

				class Block;
				class BlockNumberOfExecutionsInfo;
				class Operation;
				class Region;

				/// Represents an analysis for computing how many times a block or an operation
				/// within a region is executed each time that region is entered. The analysis
				/// iterates over all associated regions that are attached to the given
				/// top-level operation.
				///
				/// This analysis assumes that all operations complete in a finite amount of
				/// time (do not abort and do not go into the infinite loop).
				ftynseUnsubmitted Done Reply Inline Actions Nit: explicit? ftynse: Nit: explicit?
				rriddleUnsubmitted Done Reply Inline Actions nit: in a finite amount rriddle: nit: in a finite amount
				class NumberOfExecutions {
				public:
				/// Creates a new NumberOfExecutions analysis that computes how many times a
				/// block within a region is executed for all associated regions.
				explicit NumberOfExecutions(Operation *op);

				/// Returns the number of times operations `op` is executed each time the
				/// control flow enters the region `perEntryOfThisRegion`. Returns empty
				/// optional if this is not known statically.
				Optional<int64_t> getNumberOfExecutions(Operation *op,
				Region *perEntryOfThisRegion) const;

				/// Returns the number of times block `block` is executed each time the
				/// control flow enters the region `perEntryOfThisRegion`. Returns empty
				/// optional if this is not known statically.
				Optional<int64_t> getNumberOfExecutions(Block *block,
				Region *perEntryOfThisRegion) const;

				/// Dumps the number of block executions each time the control flow enters
				/// the region `perEntryOfThisRegion` to the given stream.
				void printBlockExecutions(raw_ostream &os,
				Region *perEntryOfThisRegion) const;

				/// Dumps the number of operation executions each time the control flow
				/// enters the region `perEntryOfThisRegion` to the given stream.
				void printOperationExecutions(raw_ostream &os,
				Region *perEntryOfThisRegion) const;

				private:
				/// The operation this analysis was constructed from.
				Operation *operation;
				ftynseUnsubmitted Done Reply Inline Actions There doesn't seem to be an accessor for this field, so IMO it can be dropped (unless there is an anticipated need for it). At which point, we can also consider `using BlockNumberOfExecutionsInfo = Optional<int64_t>`, which can be nicely constructed from an `int64_t` or from `llvm::None`. ftynse: There doesn't seem to be an accessor for this field, so IMO it can be dropped (unless there is…
				ezhulenevAuthorUnsubmitted Done Reply Inline Actions I added a custom non-default constructible type to catch accidental default constructions of None instead of a failure. And maybe more advanced analysis can add some other fields in the future. Added this to class documentation for the next reader and removed unused Block. ezhulenev:* I added a custom non-default constructible type to catch accidental default constructions of…
				ftynseUnsubmitted Done Reply Inline Actions Ack. (Phabricator doesn't let anybody other than the author to mark comments as done). ftynse: Ack. (Phabricator doesn't let anybody other than the author to mark comments as done).

				/// A mapping from blocks to number of executions information.
				DenseMap<Block *, BlockNumberOfExecutionsInfo> blockNumbersOfExecution;
				};

				/// Represents number of block executions information.
				class BlockNumberOfExecutionsInfo {
				public:
				BlockNumberOfExecutionsInfo(Block *block,
				Optional<int64_t> numberOfRegionInvocations,
				Optional<int64_t> numberOfBlockExecutions);

				/// Returns the number of times this block will be executed each time the
				/// parent operation is executed.
				Optional<int64_t> getNumberOfExecutions() const;

				/// Returns the number of times this block will be executed if the parent
				/// region is invoked `numberOfRegionInvocations` times. This can be different
				/// from the number of region invocations by the parent operation.
				Optional<int64_t>
				getNumberOfExecutions(int64_t numberOfRegionInvocations) const;

				Block *getBlock() const { return block; }

				private:
				Block *block;

				/// Number of `block` parent region invocations each time parent operation
				/// is executed.
				rriddleUnsubmitted Done Reply Inline Actions nit: Use /// for comments here. rriddle: nit: Use /// for comments here.
				Optional<int64_t> numberOfRegionInvocations;

				/// Number of `block` executions each time parent region is invoked.
				Optional<int64_t> numberOfBlockExecutions;
				};

				} // end namespace mlir

				#endif // MLIR_ANALYSIS_NUMBER_OF_EXECUTIONS_H

mlir/include/mlir/Dialect/Async/IR/Async.h

	Show All 13 Lines
	#ifndef MLIR_DIALECT_ASYNC_IR_ASYNC_H			#ifndef MLIR_DIALECT_ASYNC_IR_ASYNC_H
	#define MLIR_DIALECT_ASYNC_IR_ASYNC_H			#define MLIR_DIALECT_ASYNC_IR_ASYNC_H

	#include "mlir/IR/Builders.h"			#include "mlir/IR/Builders.h"
	#include "mlir/IR/Dialect.h"			#include "mlir/IR/Dialect.h"
	#include "mlir/IR/OpDefinition.h"			#include "mlir/IR/OpDefinition.h"
	#include "mlir/IR/OpImplementation.h"			#include "mlir/IR/OpImplementation.h"
	#include "mlir/IR/StandardTypes.h"			#include "mlir/IR/StandardTypes.h"
				#include "mlir/Interfaces/ControlFlowInterfaces.h"
	#include "mlir/Interfaces/SideEffectInterfaces.h"			#include "mlir/Interfaces/SideEffectInterfaces.h"

	namespace mlir {			namespace mlir {
	namespace async {			namespace async {

	namespace detail {			namespace detail {
	struct ValueTypeStorage;			struct ValueTypeStorage;
	} // namespace detail			} // namespace detail
	Show All 28 Lines

mlir/include/mlir/Dialect/Async/IR/AsyncOps.td

//===- AsyncOps.td - Async operations definition ------------ tablegen --===//		//===- AsyncOps.td - Async operations definition ------------ tablegen --===//
//		//
// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.		// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
// See https://llvm.org/LICENSE.txt for license information.		// See https://llvm.org/LICENSE.txt for license information.
// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception		// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
//		//
// This is the operation definition file for Async dialect operations.		// This is the operation definition file for Async dialect operations.
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#ifndef ASYNC_OPS		#ifndef ASYNC_OPS
#define ASYNC_OPS		#define ASYNC_OPS

include "mlir/Dialect/Async/IR/AsyncBase.td"		include "mlir/Dialect/Async/IR/AsyncBase.td"
		include "mlir/Interfaces/ControlFlowInterfaces.td"
include "mlir/Interfaces/SideEffectInterfaces.td"		include "mlir/Interfaces/SideEffectInterfaces.td"

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// Async op definitions		// Async op definitions
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

// Base class for the operation in this dialect		// Base class for the operation in this dialect
class Async_Op<string mnemonic, list<OpTrait> traits = []> :		class Async_Op<string mnemonic, list<OpTrait> traits = []> :
Op<AsyncDialect, mnemonic, traits>;		Op<AsyncDialect, mnemonic, traits>;

def Async_ExecuteOp :		def Async_ExecuteOp :
Async_Op<"execute", [SingleBlockImplicitTerminator<"YieldOp">,		Async_Op<"execute", [SingleBlockImplicitTerminator<"YieldOp">,
		DeclareOpInterfaceMethods<RegionBranchOpInterface,
		["getNumRegionInvocations"]>,
AttrSizedOperandSegments]> {		AttrSizedOperandSegments]> {
let summary = "Asynchronous execute operation";		let summary = "Asynchronous execute operation";
let description = [{		let description = [{
The `body` region attached to the `async.execute` operation semantically		The `body` region attached to the `async.execute` operation semantically
can be executed concurrently with the successor operation. In the followup		can be executed concurrently with the successor operation. In the followup
example "compute0" can be executed concurrently with "compute1".		example "compute0" can be executed concurrently with "compute1".

The actual concurrency semantics depends on the dialect lowering to the		The actual concurrency semantics depends on the dialect lowering to the
Show All 38 Lines	let results = (outs Async_TokenType:$token,
Variadic<Async_AnyValueType>:$results);		Variadic<Async_AnyValueType>:$results);
let regions = (region SizedRegion<1>:$body);		let regions = (region SizedRegion<1>:$body);

let printer = [{ return ::print(p, *this); }];		let printer = [{ return ::print(p, *this); }];
let parser = [{ return ::parse$cppClass(parser, result); }];		let parser = [{ return ::parse$cppClass(parser, result); }];
let verifier = [{ return ::verify(*this); }];		let verifier = [{ return ::verify(*this); }];
}		}

def Async_YieldOp :		def Async_YieldOp :
Async_Op<"yield", [HasParent<"ExecuteOp">, NoSideEffect, Terminator]> {		Async_Op<"yield", [HasParent<"ExecuteOp">, NoSideEffect, Terminator]> {
let summary = "terminator for Async execute operation";		let summary = "terminator for Async execute operation";
		ftynseUnsubmitted Done Reply Inline Actions It should be possible to do `DeclareOpInterfaceMethods<RegionBranchOpInterface, ["getNumRegionInvocations"]>` in the `def` and avoid this explicit declaration. ftynse: It should be possible to do `DeclareOpInterfaceMethods<RegionBranchOpInterface…
		ezhulenevAuthorUnsubmitted Done Reply Inline Actions Cool, learning something new every day :) ezhulenev: Cool, learning something new every day :)
let description = [{		let description = [{
The `async.yield` is a special terminator operation for the block inside		The `async.yield` is a special terminator operation for the block inside
`async.execute` operation.		`async.execute` operation.
}];		}];

let arguments = (ins Variadic<AnyType>:$operands);		let arguments = (ins Variadic<AnyType>:$operands);

let assemblyFormat = "attr-dict ($operands^ `:` type($operands))?";		let assemblyFormat = "attr-dict ($operands^ `:` type($operands))?";
▲ Show 20 Lines • Show All 48 Lines • Show Last 20 Lines

mlir/include/mlir/Dialect/SCF/SCFOps.td

Show First 20 Lines • Show All 190 Lines • ▼ Show 20 Lines	unsigned getNumIterOperands() {
return getOperation()->getNumOperands() - getNumControlOperands();		return getOperation()->getNumOperands() - getNumControlOperands();
}		}

/// Return operands used when entering the region at 'index'. These operands		/// Return operands used when entering the region at 'index'. These operands
/// correspond to the loop iterator operands, i.e., those exclusing the		/// correspond to the loop iterator operands, i.e., those exclusing the
/// induction variable. LoopOp only has one region, so 0 is the only valid		/// induction variable. LoopOp only has one region, so 0 is the only valid
/// value for `index`.		/// value for `index`.
OperandRange getSuccessorEntryOperands(unsigned index);		OperandRange getSuccessorEntryOperands(unsigned index);

		/// Returns the number of invocations of the body block if the loop bounds
		/// are constants. Returns `kUnknownNumRegionInvocations` otherwise.
		rriddleUnsubmitted Done Reply Inline Actions Replace -1 here with kUnknownNumRegionInvocations. rriddle: Replace -1 here with kUnknownNumRegionInvocations.
		void getNumRegionInvocations(ArrayRef<Attribute> operands,
		SmallVectorImpl<int64_t> &countPerRegion);
}];		}];

let hasCanonicalizer = 1;		let hasCanonicalizer = 1;
}		}

def IfOp : SCF_Op<"if",		def IfOp : SCF_Op<"if",
[DeclareOpInterfaceMethods<RegionBranchOpInterface>,		[DeclareOpInterfaceMethods<RegionBranchOpInterface>,
SingleBlockImplicitTerminator<"scf::YieldOp">, RecursiveSideEffects,		SingleBlockImplicitTerminator<"scf::YieldOp">, RecursiveSideEffects,
▲ Show 20 Lines • Show All 380 Lines • Show Last 20 Lines

mlir/include/mlir/Interfaces/ControlFlowInterfaces.h

	Show All 35 Lines
	LogicalResult verifyBranchSuccessorOperands(Operation *op, unsigned succNo,			LogicalResult verifyBranchSuccessorOperands(Operation *op, unsigned succNo,
	Optional<OperandRange> operands);			Optional<OperandRange> operands);
	} // namespace detail			} // namespace detail

	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//
	// RegionBranchOpInterface			// RegionBranchOpInterface
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

				// A constant value to represent unknown number of region invocations.
				extern const int64_t kUnknownNumRegionInvocations;

	namespace detail {			namespace detail {
	/// Verify that types match along control flow edges described the given op.			/// Verify that types match along control flow edges described the given op.
	LogicalResult verifyTypesAlongControlFlowEdges(Operation *op);			LogicalResult verifyTypesAlongControlFlowEdges(Operation *op);
	} // namespace detail			} // namespace detail

	/// This class represents a successor of a region. A region successor can either			/// This class represents a successor of a region. A region successor can either
	/// be another region, or the parent operation. If the successor is a region,			/// be another region, or the parent operation. If the successor is a region,
	/// this class represents the destination region, as well as a set of arguments			/// this class represents the destination region, as well as a set of arguments
	▲ Show 20 Lines • Show All 67 Lines • Show Last 20 Lines

mlir/include/mlir/Interfaces/ControlFlowInterfaces.td

Show First 20 Lines • Show All 124 Lines • ▼ Show 20 Lines	InterfaceMethod<[{
operation as a successor. This method allows for describing which		operation as a successor. This method allows for describing which
regions may be executed when entering an operation, and which regions		regions may be executed when entering an operation, and which regions
are executed after having executed another region of the parent op. The		are executed after having executed another region of the parent op. The
successor region must be non-empty.		successor region must be non-empty.
}],		}],
"void", "getSuccessorRegions",		"void", "getSuccessorRegions",
(ins "Optional<unsigned>":$index, "ArrayRef<Attribute>":$operands,		(ins "Optional<unsigned>":$index, "ArrayRef<Attribute>":$operands,
"SmallVectorImpl<RegionSuccessor> &":$regions)		"SmallVectorImpl<RegionSuccessor> &":$regions)
		>,
		InterfaceMethod<[{
		Populates countPerRegion with the number of times this operation will
		ftynseUnsubmitted Done Reply Inline Actions Nit: I'd be explicit and say something like "Populates `countPerRegion` with the number..." ftynse: Nit: I'd be explicit and say something like "Populates `countPerRegion` with the number..."
		invoke the attached regions (assuming the regions yield normally, i.e.
		do not abort or invoke an infinite loop). If the number of region
		ftynseUnsubmitted Done Reply Inline Actions Nit: could we have a named constant instead of -1? ftynse: Nit: could we have a named constant instead of -1?
		invocations is not known statically it will set the number of
		invocations to `kUnknownNumRegionInvocations`.

		`operands` is a set of optional attributes that either correspond to a
		constant values for each operand of this operation, or null if that
		operand is not a constant.
		}],
		"void", "getNumRegionInvocations",
		(ins "ArrayRef<Attribute>":$operands,
		"SmallVectorImpl<int64_t> &":$countPerRegion), [{}],
		/defaultImplementation=/[{
		unsigned numRegions = this->getOperation()->getNumRegions();
		ftynseUnsubmitted Done Reply Inline Actions Nit: countPerRegion.resize(numRegions, -1) avoids the loop ftynse: Nit: countPerRegion.resize(numRegions, -1) avoids the loop
		assert(countPerRegion.empty());
		countPerRegion.resize(numRegions, kUnknownNumRegionInvocations);
		}]
>		>
];		];

let verify = [{		let verify = [{
static_assert(!ConcreteOpType::template hasTrait<OpTrait::ZeroRegion>(),		static_assert(!ConcreteOpType::template hasTrait<OpTrait::ZeroRegion>(),
"expected operation to have non-zero regions");		"expected operation to have non-zero regions");
return success();		return success();
}];		}];
Show All 24 Lines

mlir/include/mlir/Support/MathExtras.h

	Show All 13 Lines
	#define MLIR_SUPPORT_MATHEXTRAS_H_			#define MLIR_SUPPORT_MATHEXTRAS_H_

	#include "mlir/Support/LLVM.h"			#include "mlir/Support/LLVM.h"
	#include "llvm/ADT/APInt.h"			#include "llvm/ADT/APInt.h"

	namespace mlir {			namespace mlir {

	/// Returns the result of MLIR's ceildiv operation on constants. The RHS is			/// Returns the result of MLIR's ceildiv operation on constants. The RHS is
	/// expected to be positive.			/// expected to be non-zero.
	inline int64_t ceilDiv(int64_t lhs, int64_t rhs) {			inline int64_t ceilDiv(int64_t lhs, int64_t rhs) {
	assert(rhs >= 1);			assert(rhs != 0);
	// C/C++'s integer division rounds towards 0.			// C/C++'s integer division rounds towards 0.
	return lhs % rhs > 0 ? lhs / rhs + 1 : lhs / rhs;			int64_t x = (rhs > 0) ? -1 : 1;
				return (lhs * rhs > 0) ? ((lhs + x) / rhs) + 1 : -(-lhs / rhs);
	}			}

	/// Returns the result of MLIR's floordiv operation on constants. The RHS is			/// Returns the result of MLIR's floordiv operation on constants. The RHS is
	/// expected to be positive.			/// expected to be non-zero.
	inline int64_t floorDiv(int64_t lhs, int64_t rhs) {			inline int64_t floorDiv(int64_t lhs, int64_t rhs) {
				silvasUnsubmitted Done Reply Inline Actions nit: do we have a unittest you could extend? (or add? :) ) silvas: nit: do we have a unittest you could extend? (or add? :) )
	assert(rhs >= 1);			assert(rhs != 0);
	// C/C++'s integer division rounds towards 0.			// C/C++'s integer division rounds towards 0.
	return lhs % rhs < 0 ? lhs / rhs - 1 : lhs / rhs;			int64_t x = (rhs < 0) ? 1 : -1;
				return (lhs * rhs < 0) ? -((-lhs + x) / rhs) - 1 : lhs / rhs;
	}			}

	/// Returns MLIR's mod operation on constants. MLIR's mod operation yields the			/// Returns MLIR's mod operation on constants. MLIR's mod operation yields the
	/// remainder of the Euclidean division of 'lhs' by 'rhs', and is therefore not			/// remainder of the Euclidean division of 'lhs' by 'rhs', and is therefore not
	/// C's % operator. The RHS is always expected to be positive, and the result			/// C's % operator. The RHS is always expected to be positive, and the result
	/// is always non-negative.			/// is always non-negative.
	inline int64_t mod(int64_t lhs, int64_t rhs) {			inline int64_t mod(int64_t lhs, int64_t rhs) {
	assert(rhs >= 1);			assert(rhs >= 1);
	Show All 14 Lines

mlir/lib/Analysis/CMakeLists.txt

set(LLVM_OPTIONAL_SOURCES		set(LLVM_OPTIONAL_SOURCES
AffineAnalysis.cpp		AffineAnalysis.cpp
AffineStructures.cpp		AffineStructures.cpp
BufferAliasAnalysis.cpp		BufferAliasAnalysis.cpp
CallGraph.cpp		CallGraph.cpp
Liveness.cpp		Liveness.cpp
LoopAnalysis.cpp		LoopAnalysis.cpp
NestedMatcher.cpp		NestedMatcher.cpp
		NumberOfExecutions.cpp
PresburgerSet.cpp		PresburgerSet.cpp
SliceAnalysis.cpp		SliceAnalysis.cpp
Utils.cpp		Utils.cpp
)		)

add_mlir_library(MLIRAnalysis		add_mlir_library(MLIRAnalysis
BufferAliasAnalysis.cpp		BufferAliasAnalysis.cpp
CallGraph.cpp		CallGraph.cpp
Liveness.cpp		Liveness.cpp
		NumberOfExecutions.cpp
SliceAnalysis.cpp		SliceAnalysis.cpp

ADDITIONAL_HEADER_DIRS		ADDITIONAL_HEADER_DIRS
${MLIR_MAIN_INCLUDE_DIR}/mlir/Analysis		${MLIR_MAIN_INCLUDE_DIR}/mlir/Analysis

DEPENDS		DEPENDS
mlir-headers		mlir-headers

Show All 22 Lines	add_mlir_library(MLIRLoopAnalysis
LINK_LIBS PUBLIC		LINK_LIBS PUBLIC
MLIRAffine		MLIRAffine
MLIRCallInterfaces		MLIRCallInterfaces
MLIRControlFlowInterfaces		MLIRControlFlowInterfaces
MLIRInferTypeOpInterface		MLIRInferTypeOpInterface
MLIRPresburger		MLIRPresburger
MLIRSCF		MLIRSCF
)		)

add_subdirectory(Presburger)		add_subdirectory(Presburger)

mlir/lib/Analysis/NumberOfExecutions.cpp

This file was added.

//===- NumberOfExecutions.cpp - Number of executions analysis -------------===//

// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.

// See https://llvm.org/LICENSE.txt for license information.

// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception

//===----------------------------------------------------------------------===//

// Implementation of the number of executions analysis.

//===----------------------------------------------------------------------===//

#include "mlir/Analysis/NumberOfExecutions.h"

#include "mlir/Dialect/StandardOps/IR/Ops.h"

rriddleUnsubmitted

Done

llvm includes go after MLIR includes.

rriddle: llvm includes go after MLIR includes.

#include "mlir/IR/Matchers.h"

#include "mlir/IR/RegionKindInterface.h"

rriddleUnsubmitted

Done

Can this function include be removed?

rriddle: Can this function include be removed?

#include "mlir/Interfaces/ControlFlowInterfaces.h"

#include "llvm/ADT/FunctionExtras.h"

rriddleUnsubmitted

Done

Is the dependency on standard ops necessary?

rriddle: Is the dependency on standard ops necessary?

ezhulenevAuthorUnsubmitted

Done

It's needed for dyn_cast<ConstantOp>

ezhulenev: It's needed for dyn_cast<ConstantOp>

#include "llvm/ADT/SmallSet.h"

#include "llvm/Support/raw_ostream.h"

#define DEBUG_TYPE "number-of-executions-analysis"

rriddleUnsubmitted

Done

Please trim some of these includes, several are unnecessary.

rriddle: Please trim some of these includes, several are unnecessary.

using namespace mlir;

//===----------------------------------------------------------------------===//

// NumberOfExecutions

//===----------------------------------------------------------------------===//

/// Computes blocks number of executions information for the given region.

static void computeRegionBlockNumberOfExecutions(

Region &region, DenseMap<Block *, BlockNumberOfExecutionsInfo> &blockInfo) {

Operation *parentOp = region.getParentOp();

int regionId = region.getRegionNumber();

rriddleUnsubmitted

Done

Missing static here?

rriddle: Missing static here?

auto regionKindInterface = dyn_cast<RegionKindInterface>(parentOp);

bool isGraphRegion =

regionKindInterface &&

rriddleUnsubmitted

Done

This seems like it would break when a graph region is nested within a CFG region. Can we avoid the direct assert here and just use kUnknownNumRegionInvocations instead? I could see an assert on the top level operation, but is there a reason to assert on nested operations?

rriddle: This seems like it would break when a graph region is nested within a CFG region. Can we avoid…

regionKindInterface.getRegionKind(regionId) == RegionKind::Graph;

// CFG analysis does not make sense for Graph regions, set the number of

// executions for all blocks as unknown.

if (isGraphRegion) {

for (Block &block : region)

blockInfo.insert({&block, {&block, None, None}});

ftynseUnsubmitted

Done

I wonder if this can be generalized to work on all constant-like operations. Currently, we only have a ConstantLike trait, so we cannot get the constant value as attribute without knowing the concrete op type. Maybe we can turn ConstantLike into an interface with value being an interface method. Out of scope for this commit probably.

ftynse: I wonder if this can be generalized to work on all constant-like operations. Currently, we only…

ezhulenevAuthorUnsubmitted

Done

Yeah, that would be useful also in the context of TF graphs with tf.Const ops, but I did not find any relevant examples and chose the easiest path.

ezhulenev: Yeah, that would be useful also in the context of TF graphs with tf.Const ops, but I did not…

return;

}

// Number of region invocations for all attached regions.

SmallVector<int64_t, 4> numRegionsInvocations;

// Query RegionBranchOpInterface interface if it is available.

ftynseUnsubmitted

Done

Could we rather query the op? Hardcoding FuncOp makes this analysis non-applicable to custom function types (we have some, e.g., in LLVM dialect and in Flang).

ftynse: Could we rather query the op? Hardcoding FuncOp makes this analysis non-applicable to custom…

ezhulenevAuthorUnsubmitted

Done

It guess in can just implement RegionBranchOpInterface, I decided to special case it because it's "built in" and not defined in any of the dialect .td files. I expect all other FuncOpLike things to implement RegionBranchOpInterface.

ezhulenev: It guess in can just implement RegionBranchOpInterface, I decided to special case it because…

ftynseUnsubmitted

Not Done

ODS is not mandatory to implement an interface, ops defined in C++ can derive Traits<InterfaceName::Trait> and get the same functionality.

I'm fine with your current implementation given the explanation provided. We should look into having FuncOp implement the interface separately and consult with @rriddle about future plans regarding it being built-in.

ftynse: ODS is not mandatory to implement an interface, ops defined in C++ can derive…

ezhulenevAuthorUnsubmitted

Done

Yeah, I know, I just don't know is it "ok" to add it to built in type :) Assuming FuncOp is as fundamental as Region I think it's ok to have a special case, but I'd like to hear opinion from someone with a better judgement.

ezhulenev: Yeah, I know, I just don't know is it "ok" to add it to built in type :) Assuming `FuncOp` is…

mehdi_aminiUnsubmitted

Not Done

FuncOp isn't as fundamental as Region, I rather treat FuncOp as any other op.

For example LLVM, SPIRV, GPU dialects aren't using FuncOp.

mehdi_amini: FuncOp isn't as fundamental as Region, I rather treat FuncOp as any other op. For example LLVM…

ezhulenevAuthorUnsubmitted

Done

I've looked into adding RegionBranchOpInterface to FuncOp`, but it feels weird because the FuncOp does not execute the region, only the CallOp does.

For example: getSuccessorEntryOperands: "Returns the operands of this operation used as the entry arguments", but the FuncOp is OpTrait::ZeroOperands.

Maybe add a special trait, something like OpTrait::InvokesRegionsOnce, an additional piece of information just like OpTrait::OneRegion.

Or just rely on OpTrait::FunctionLike here, assuming that all functions execute attached region once.

ezhulenev: I've looked into adding `RegionBranchOpInterface to `FuncOp`, but it feels weird because the…

if (auto regionInterface = dyn_cast<RegionBranchOpInterface>(parentOp)) {

rriddleUnsubmitted

Done

Use m_Constant here instead, we should avoid hardcoding operation types whenever possible.

rriddle: Use m_Constant here instead, we should avoid hardcoding operation types whenever possible.

SmallVector<Attribute, 4> operands(parentOp->getNumOperands());

for (auto operandIt : llvm::enumerate(parentOp->getOperands()))

matchPattern(operandIt.value(), m_Constant(&operands[operandIt.index()]));

ftynseUnsubmitted

Done

Let's also be future-proof and guard against graph regions (https://mlir.llvm.org/docs/LangRef/#graph-regions). They don't have blocks (yet), but if they do, the logic of following block successors below will not necessarily apply. An assertion could do.

ftynse: Let's also be future-proof and guard against graph regions (https://mlir.llvm.

regionInterface.getNumRegionInvocations(operands, numRegionsInvocations);

}

rriddleUnsubmitted

Done

if (auto regionInterface = dyn_cast<RegionBranchOpInterface>(parentOp)) {

- SmallVector<Attribute, 2> operands;

- operands.reserve(parentOp->getNumOperands());

- for (Value operand : parentOp->getOperands()) {

- Attribute attr;

- if (matchPattern(operand, m_Constant(&attr)))

- operands.push_back(attr);

- else

- operands.push_back(Attribute());

- }

+ SmallVector<Attribute, 2> operands(parentOp->getNumOperands());

+ for (auto operandIt : llvm::enumerate(parentOp->getOperands()))

+ matchPattern(operandIt.value(), m_Constant(&operands[operandIt.index()]));

regionInterface.getNumRegionInvocations(operands, numRegionsInvocations);

rriddle:

// Number of region invocations *each time* parent operation is invoked.

Optional<int64_t> numRegionInvocations;

if (!numRegionsInvocations.empty() &&

numRegionsInvocations[regionId] != kUnknownNumRegionInvocations) {

ftynseUnsubmitted

Done

Nit: we usually don't declare function stack variables as const in MLIR

ftynse: Nit: we usually don't declare function stack variables as `const` in MLIR

rriddleUnsubmitted

Done

This is not correct and seems like it would result in incorrect usages of the analysis. If you want to estimate function executions, you'll need to use the callgraph.

rriddle: This is not correct and seems like it would result in incorrect usages of the analysis. If you…

rriddleUnsubmitted

Done

Why should functions be default to executed once? That doesn't seem like a conservatively safe default.

rriddle: Why should functions be default to executed once? That doesn't seem like a conservatively safe…

silvasUnsubmitted

Done

I agree, there seems to be a code smell here.

One solution is for queries to be of the form Optional<int64_t> getNumberOfExecutions(Block *block, Region *assumingThisRegionIsEnteredOnce).

So the analysis could hold a DenseMap<Block *, NumberOfExecutionsAssumingDirectParentRegionEnteredOnce>. Then on a query, you would walk up parents, multiplying NumberOfExecutionsAssumingDirectParentRegionEnteredOnce at each step until we reach assumingThisRegionIsEnteredOnce (and until we implement a call graph analysis, we will return None if assumingThisRegionIsEnteredOnce is not an ancestor of block)

And the body of a module should probably be an illegal Region to pass as assumingThisRegionIsEnteredOnce, because it is a "declarative region" and never "executes" (we might need a new IR property to represent this?)

test-print-number-of-executions would have some hardcoded policy to print values assuming assumingThisRegionIsEnteredOnce == enclosingFunc.getBody()

silvas: I agree, there seems to be a code smell here. One solution is for queries to be of the form…

ezhulenevAuthorUnsubmitted

Done

This is roughly ~= "when a function called it executes its entry block once". And this seems like a reasonable assumption. This is not about how many call operations are in the graph.

ezhulenev: This is roughly ~= "when a function called it executes its entry block once". And this seems…

silvasUnsubmitted

Done

what about a function like

def f(i):
  print("here") # How many executions will this be assigned?
  if i > 0:
    f(i - 1)

I think we need to be more precise about what we mean by "execution" here.

Also, does this pass correctly handle a case like?

def g(i):
  if i == 0:
    abort()
  print("here") # How many executions?

silvas: what about a function like ``` def f(i): print("here") # How many executions will this be…

ezhulenevAuthorUnsubmitted

Done

#1. 1 execution of the basic block that contains print operations. Function call is irrelevant here.

#2. Unknown, because it involves dynamic control flow

CFG will look something like this:

^bb0
  %cond = cmp i == 0
  cond_br %cond ^abort, ^print
^abort
  br ^print
^print

def g(i):
  abort()
  print("here") # is a bit more tricky, but let's assume this is an 
                # estimate given that no operations can terminate a program

I think we need to be more precise about what we mean by "execution" here.

"How many times this operation will be executed in this region assuming region invoked N times".

Call operation is about completely different region (different instance of a region).

ezhulenev: #1. 1 execution of the basic block that contains print operations. Function call is irrelevant…

silvasUnsubmitted

Done

I see that the current code actually does walk parents in the way I described. Sorry.

I thought somewhat about the definition of "execution" that we mean here. It's really "for *each* time that Region is entered, how many times will given operation be executed". That makes the definition work for my case f above, since it yields getNumberOfExecutions(thePrintOp, f.getBody()) == 1.

Also, because of the possibility of aborting or calling a function that cannot be proven to terminate in the middle of a block, I think it is best for getNumberOfExecutions to take an Operation * instead of a Block, so that cases like this behave as intended, and users don't have to do their own within-a-block check:

def h():
  print("here1") # Number of executions is 1 per entry to h.getBody()
  call @weProvedThatItAbortsOrInfiniteLoops()
  print("here") # Number of executions is 0 per entry to h.getBody()

As such, could we do something like an interface to this analysis that looks like:

Optional<IntegerRange> getNumberOfExecutions(Operation *op, Region *perEntryOfThisRegion)

IntegerRange is just a pair of integers that represent a lower and upper bound on the number of executions, which composes nicely to handle cases where we can't prove that the statement executes at all (like those tricky abort/infinite-loop cases) by having a lower bound of 0 (we could do something more precise than an interval, such as an arbitrary set of integers, but that might get unwieldy).

To handle this correctly, getNumRegionInvocations will need to be wordsmithed to something like "number of times the body will be invoked if the body yields normally (i.e. doesn't abort or invoke an infinite loop)". Then this analysis can do a bottom up traversal of the region tree, doing the op-level analysis within each block looking for possibly aborting/infiniteLooping operations and correctly handling any ops that follow ops that might abort/infiniteLoop, and transitively propagating that up the region tree.

We can make this more precise with interprocedural analysis that does the equivalent of inferring attributes like LLVM's norecurse/willreturn/mustprogress (see https://llvm.org/docs/LangRef.html#function-attributes)

Under this definition, we have:

def f(i):
  print("here") # Number of executions is 1 per entry to f.getBody()
  if i > 0:
    f(i - 1)

def g(i):
  if i == 0:
    abort()
  print("here") # Number of executions is 0 or 1 per entry to g.getBody()

def h():
  print("here1") # Number of executions is 1 per entry to h.getBody()
  call @weProvedThatItAbortsOrInfiniteLoops()
  print("here") # Number of executions is 0 per entry to h.getBody()

def i():
  print("here1") # Number of executions is 1 per entry to i.getBody()
  call @cannotProveItDoesntAbortOrInfiniteLoop()
  print("here") # Number of executions is 0 or 1 per entry to i.getBody()

silvas: I see that the current code actually does walk parents in the way I described. Sorry. I…

ezhulenevAuthorUnsubmitted

Done

I updated documentation for the interface and NumberOfExecutions to mention "yields normally" and "each time" properties.

I like this API ...

Optional<IntegerRange> getNumberOfExecutions(Operation *op, Region *perEntryOfThisRegion)

... but for now computing the IntegerRange is not really needed and it adds a lot of complexity.

I plan to add API with Optional<int64_t> to represent "known fixed number" vs "everything else"

Optional<int64_t> getNumberOfExecutions(Operation *op, Region *perEntryOfThisRegion);

and assume that all operations are "normal": do not abort, or go into infinite loop == "completed in finite amount of time".

ezhulenev: I updated documentation for the interface and NumberOfExecutions to mention "yields normally"…

silvasUnsubmitted

Done

Thanks.

I think with the new API, we don't need these special cases. FunctionLike already didn't make sense because an op is not "executed" by the module region (we should probably return "unknown" (or an error) for getNumberOfExecutions(someOpInAFunction, module.getBody()). OpTrait::OneRegion doesn't guarantee anything about the number of invocations. E.g. scf::ForOp implements OpTrait::OneRegion.

silvas: Thanks. I think with the new API, we don't need these special cases. FunctionLike already…

ezhulenevAuthorUnsubmitted

Done

Done. It required few more changes, but the special case for functions is gone.

ezhulenev: Done. It required few more changes, but the special case for functions is gone.

numRegionInvocations = numRegionsInvocations[regionId];

}

// DFS traversal looking for loops in the CFG.

llvm::SmallSet<Block *, 4> loopStart;

ftynseUnsubmitted

Done

Nit: I would have just used auto and kept the original lambda type here, it's not like we are passing it around.

ftynse: Nit: I would have just used `auto` and kept the original lambda type here, it's not like we are…

ezhulenevAuthorUnsubmitted

Done

Unfortunately lambdas do not support recursion without ugly tricks.

ezhulenev: Unfortunately lambdas do not support recursion without ugly tricks.

llvm::unique_function<void(Block *, llvm::SmallSet<Block *, 4> &)> dfs =

[&](Block *block, llvm::SmallSet<Block *, 4> &visited) {

// Found a loop in the CFG that starts at the `block`.

rriddleUnsubmitted

Done

Drop trivial braces.

rriddle: Drop trivial braces.

if (visited.contains(block)) {

loopStart.insert(block);

return;

}

// Continue DFS traversal.

visited.insert(block);

for (Block *successor : block->getSuccessors())

dfs(successor, visited);

visited.erase(block);

};

llvm::SmallSet<Block *, 4> visited;

dfs(&region.front(), visited);

// Start from the entry block and follow only blocks with single succesor.

Block *block = &region.front();

while (block && !loopStart.contains(block)) {

// Block will be executed exactly once.

blockInfo.insert(

{block, BlockNumberOfExecutionsInfo(block, numRegionInvocations,

/*numberOfBlockExecutions=*/1)});

// We reached the exit block or block with multiple successors.

if (block->getNumSuccessors() != 1)

break;

// Continue traversal.

block = block->getSuccessor(0);

}

// For all blocks that we did not visit set the executions number to unknown.

for (Block &block : region)

if (blockInfo.count(&block) == 0)

blockInfo.insert({&block, BlockNumberOfExecutionsInfo(

&block, numRegionInvocations,

/*numberOfBlockExecutions=*/None)});

}

/// Creates a new NumberOfExecutions analysis that computes how many times a

/// block within a region is executed for all associated regions.

NumberOfExecutions::NumberOfExecutions(Operation *op) : operation(op) {

operation->walk([&](Region *region) {

computeRegionBlockNumberOfExecutions(*region, blockNumbersOfExecution);

});

}

rriddleUnsubmitted

Done

Nit: Flip the condition of this if, and remove the trivial braces.

if (!...)
  blockInfo.insert

rriddle: Nit: Flip the condition of this if, and remove the trivial braces. ``` if (!...) blockInfo.

Optional<int64_t>

NumberOfExecutions::getNumberOfExecutions(Operation *op,

Region *perEntryOfThisRegion) const {

// Assuming that all operations complete in a finite amount of time (do not

// abort and do not go into the infinite loop), the number of operation

// executions is equal to the number of block executions that contains the

// operation.

rriddleUnsubmitted

Done

nit: in a finite amount of time

rriddle: nit: in a finite amount of time

return getNumberOfExecutions(op->getBlock(), perEntryOfThisRegion);

}

Optional<int64_t>

NumberOfExecutions::getNumberOfExecutions(Block *block,

Region *perEntryOfThisRegion) const {

// Return None if the given `block` does not lie inside the

silvasUnsubmitted

Done

The note that all operates are assumed to not abort/infloop should be in the documentation comment for users of the API; it is not an implementation detail because it makes the analysis incorrect for some potential users.

silvas: The note that all operates are assumed to not abort/infloop should be in the documentation…

ezhulenevAuthorUnsubmitted

Done

Added as a top-level documentation because it is also relevant to blocks.

ezhulenev: Added as a top-level documentation because it is also relevant to blocks.

// `perEntryOfThisRegion` region.

if (!perEntryOfThisRegion->findAncestorBlockInRegion(*block))

return None;

// Find the block information for the given `block.

auto blockIt = blockNumbersOfExecution.find(block);

if (blockIt == blockNumbersOfExecution.end())

return None;

const auto &blockInfo = blockIt->getSecond();

// Override the number of region invocations with `1` if the

// `perEntryOfThisRegion` region owns the block.

auto getNumberOfExecutions = [&](const BlockNumberOfExecutionsInfo &info) {

if (info.getBlock()->getParent() == perEntryOfThisRegion)

return info.getNumberOfExecutions(/*numberOfRegionInvocations=*/1);

return info.getNumberOfExecutions();

};

// Immediately return None if we do not know the block number of executions.

auto blockExecutions = getNumberOfExecutions(blockInfo);

if (!blockExecutions.hasValue())

return None;

// Follow parent operations until we reach the operations that owns the

// `perEntryOfThisRegion`.

int64_t numberOfExecutions = *blockExecutions;

Operation *parentOp = block->getParentOp();

while (parentOp != perEntryOfThisRegion->getParentOp()) {

// Find how many times will be executed the block that owns the parent

// operation.

Block *parentBlock = parentOp->getBlock();

auto parentBlockIt = blockNumbersOfExecution.find(parentBlock);

if (parentBlockIt == blockNumbersOfExecution.end())

return None;

const auto &parentBlockInfo = parentBlockIt->getSecond();

auto parentBlockExecutions = getNumberOfExecutions(parentBlockInfo);

// We stumbled upon an operation with unknown number of executions.

if (!parentBlockExecutions.hasValue())

return None;

// Number of block executions is a product of all parent blocks executions.

numberOfExecutions *= *parentBlockExecutions;

parentOp = parentOp->getParentOp();

assert(parentOp != nullptr);

}

return numberOfExecutions;

}

void NumberOfExecutions::printBlockExecutions(

raw_ostream &os, Region *perEntryOfThisRegion) const {

unsigned blockId = 0;

operation->walk([&](Block *block) {

llvm::errs() << "Block: " << blockId++ << "\n";

llvm::errs() << "Number of executions: ";

if (auto n = getNumberOfExecutions(block, perEntryOfThisRegion))

llvm::errs() << *n << "\n";

else

llvm::errs() << "<unknown>\n";

});

}

void NumberOfExecutions::printOperationExecutions(

raw_ostream &os, Region *perEntryOfThisRegion) const {

operation->walk([&](Block *block) {

block->walk([&](Operation *operation) {

// Skip the operation that was used to build the analysis.

if (operation == this->operation)

return;

llvm::errs() << "Operation: " << operation->getName() << "\n";

llvm::errs() << "Number of executions: ";

if (auto n = getNumberOfExecutions(operation, perEntryOfThisRegion))

llvm::errs() << *n << "\n";

else

llvm::errs() << "<unknown>\n";

});

}

//===----------------------------------------------------------------------===//

rriddleUnsubmitted

Done

raw_ostream &os, Region *perEntryOfThisRegion) const {

- operation->walk([&](Block *block) {

- block->walk([&](Operation *operation) {

- // Skip the operation that was used to build the analysis.

- if (operation == this->operation)

- return;

+ operation->walk([&](Operation *operation) {

+ // Skip the operation that was used to build the analysis.

+ if (operation == this->operation)

+ return;

- llvm::errs() << "Operation: " << operation->getName() << "\n";

- llvm::errs() << "Number of executions: ";

- if (auto n = getNumberOfExecutions(operation, perEntryOfThisRegion))

- llvm::errs() << *n << "\n";

- else

- llvm::errs() << "<unknown>\n";

- });

+ llvm::errs() << "Operation: " << operation->getName() << "\n";

+ llvm::errs() << "Number of executions: ";

+ if (auto n = getNumberOfExecutions(operation, perEntryOfThisRegion))

+ llvm::errs() << *n << "\n";

+ else

+ llvm::errs() << "<unknown>\n";

+ });

}

//===----------------------------------------------------------------------===//

rriddle:

ezhulenevAuthorUnsubmitted

Done

I had to walk block and then operations to get sensible output for filecheck, otherwise it prints inner operation before its parent.

ezhulenev: I had to walk block and then operations to get sensible output for filecheck, otherwise it…

// BlockNumberOfExecutionsInfo

//===----------------------------------------------------------------------===//

BlockNumberOfExecutionsInfo::BlockNumberOfExecutionsInfo(

Block *block, Optional<int64_t> numberOfRegionInvocations,

Optional<int64_t> numberOfBlockExecutions)

: block(block), numberOfRegionInvocations(numberOfRegionInvocations),

numberOfBlockExecutions(numberOfBlockExecutions) {}

Optional<int64_t> BlockNumberOfExecutionsInfo::getNumberOfExecutions() const {

if (numberOfRegionInvocations && numberOfBlockExecutions)

return *numberOfRegionInvocations * *numberOfBlockExecutions;

return None;

}

Optional<int64_t> BlockNumberOfExecutionsInfo::getNumberOfExecutions(

int64_t numberOfRegionInvocations) const {

if (numberOfBlockExecutions)

return numberOfRegionInvocations * *numberOfBlockExecutions;

return None;

}

mlir/lib/Dialect/Async/IR/Async.cpp

	Show First 20 Lines • Show All 112 Lines • ▼ Show 20 Lines
	}			}

	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//
	/// ExecuteOp			/// ExecuteOp
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	constexpr char kOperandSegmentSizesAttr[] = "operand_segment_sizes";			constexpr char kOperandSegmentSizesAttr[] = "operand_segment_sizes";

				void ExecuteOp::getNumRegionInvocations(
				ArrayRef<Attribute> operands, SmallVectorImpl<int64_t> &countPerRegion) {
				(void)operands;
				assert(countPerRegion.empty());
				countPerRegion.push_back(1);
				rriddleUnsubmitted Done Reply Inline Actions nit: Seems like push_back(1) is better than an explicit resize and assignment. rriddle: nit: Seems like push_back(1) is better than an explicit resize and assignment.
				}

				void ExecuteOp::getSuccessorRegions(Optional<unsigned> index,
				ArrayRef<Attribute> operands,
				SmallVectorImpl<RegionSuccessor> &regions) {
				// The `body` region branch back to the parent operation.
				if (index.hasValue()) {
				assert(*index == 0);
				regions.push_back(RegionSuccessor(getResults()));
				return;
				}

				// Otherwise the successor is the body region.
				regions.push_back(RegionSuccessor(&body()));
				}

	static void print(OpAsmPrinter &p, ExecuteOp op) {			static void print(OpAsmPrinter &p, ExecuteOp op) {
	p << op.getOperationName();			p << op.getOperationName();

	// [%tokens,...]			// [%tokens,...]
	if (!op.dependencies().empty())			if (!op.dependencies().empty())
	p << " [" << op.dependencies() << "]";			p << " [" << op.dependencies() << "]";

	// (%value as %unwrapped: !async.value<!arg.type>, ...)			// (%value as %unwrapped: !async.value<!arg.type>, ...)
	▲ Show 20 Lines • Show All 170 Lines • Show Last 20 Lines

mlir/lib/Dialect/SCF/SCF.cpp

//===- SCF.cpp - Structured Control Flow Operations -----------------------===//		//===- SCF.cpp - Structured Control Flow Operations -----------------------===//
//		//
// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.		// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
// See https://llvm.org/LICENSE.txt for license information.		// See https://llvm.org/LICENSE.txt for license information.
// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception		// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#include "mlir/Dialect/SCF/SCF.h"		#include "mlir/Dialect/SCF/SCF.h"
#include "mlir/Dialect/StandardOps/IR/Ops.h"		#include "mlir/Dialect/StandardOps/IR/Ops.h"
#include "mlir/IR/BlockAndValueMapping.h"		#include "mlir/IR/BlockAndValueMapping.h"
#include "mlir/IR/PatternMatch.h"		#include "mlir/IR/PatternMatch.h"
		#include "mlir/Support/MathExtras.h"
#include "mlir/Transforms/InliningUtils.h"		#include "mlir/Transforms/InliningUtils.h"

using namespace mlir;		using namespace mlir;
using namespace mlir::scf;		using namespace mlir::scf;

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// SCFDialect Dialect Interfaces		// SCFDialect Dialect Interfaces
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
▲ Show 20 Lines • Show All 258 Lines • ▼ Show 20 Lines	void ForOp::getSuccessorRegions(Optional<unsigned> index,
}		}

// Otherwise, the loop may branch back to itself or the parent operation.		// Otherwise, the loop may branch back to itself or the parent operation.
assert(index.getValue() == 0 && "expected loop region");		assert(index.getValue() == 0 && "expected loop region");
regions.push_back(RegionSuccessor(&getLoopBody(), getRegionIterArgs()));		regions.push_back(RegionSuccessor(&getLoopBody(), getRegionIterArgs()));
regions.push_back(RegionSuccessor(getResults()));		regions.push_back(RegionSuccessor(getResults()));
}		}

		void ForOp::getNumRegionInvocations(ArrayRef<Attribute> operands,
		SmallVectorImpl<int64_t> &countPerRegion) {
		assert(countPerRegion.empty());
		countPerRegion.resize(1);

		auto lb = operands[0].dyn_cast_or_null<IntegerAttr>();
		auto ub = operands[1].dyn_cast_or_null<IntegerAttr>();
		auto step = operands[2].dyn_cast_or_null<IntegerAttr>();

		// Loop bounds are not known statically.
		if (!lb \|\| !ub \|\| !step \|\| step.getValue().getSExtValue() == 0) {
		countPerRegion[0] = -1;
		return;
		}

		countPerRegion[0] =
		ceilDiv(ub.getValue().getSExtValue() - lb.getValue().getSExtValue(),
		step.getValue().getSExtValue());
		}
		ftynseUnsubmitted Done Reply Inline Actions Could we rather extend the one we already have in MathExtras.h to support negative RHS? ftynse: Could we rather extend the one we already have in MathExtras.h to support negative RHS?

ValueVector mlir::scf::buildLoopNest(		ValueVector mlir::scf::buildLoopNest(
OpBuilder &builder, Location loc, ValueRange lbs, ValueRange ubs,		OpBuilder &builder, Location loc, ValueRange lbs, ValueRange ubs,
ValueRange steps, ValueRange iterArgs,		ValueRange steps, ValueRange iterArgs,
function_ref<ValueVector(OpBuilder &, Location, ValueRange, ValueRange)>		function_ref<ValueVector(OpBuilder &, Location, ValueRange, ValueRange)>
bodyBuilder) {		bodyBuilder) {
assert(lbs.size() == ubs.size() &&		assert(lbs.size() == ubs.size() &&
"expected the same number of lower and upper bounds");		"expected the same number of lower and upper bounds");
assert(lbs.size() == steps.size() &&		assert(lbs.size() == steps.size() &&
▲ Show 20 Lines • Show All 947 Lines • Show Last 20 Lines

mlir/lib/Interfaces/ControlFlowInterfaces.cpp

Show First 20 Lines • Show All 67 Lines • ▼ Show 20 Lines	detail::verifyBranchSuccessorOperands(Operation *op, unsigned succNo,
}		}
return success();		return success();
}		}

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// RegionBranchOpInterface		// RegionBranchOpInterface
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

		// A constant value to represent unknown number of region invocations.
		const int64_t mlir::kUnknownNumRegionInvocations = -1;

		rriddleUnsubmitted Done Reply Inline Actions Prefer explicit qualification, i.e. `mlir::kUnknownNumRegionInvocations` rriddle: Prefer explicit qualification, i.e. `mlir::kUnknownNumRegionInvocations`
/// Verify that types match along all region control flow edges originating from		/// Verify that types match along all region control flow edges originating from
/// `sourceNo` (region # if source is a region, llvm::None if source is parent		/// `sourceNo` (region # if source is a region, llvm::None if source is parent
/// op). `getInputsTypesForRegion` is a function that returns the types of the		/// op). `getInputsTypesForRegion` is a function that returns the types of the
/// inputs that flow from `sourceIndex' to the given region, or llvm::None if		/// inputs that flow from `sourceIndex' to the given region, or llvm::None if
/// the exact type match verification is not necessary (e.g., if the Op verifies		/// the exact type match verification is not necessary (e.g., if the Op verifies
/// the match itself).		/// the match itself).
static LogicalResult		static LogicalResult
verifyTypesAlongAllEdges(Operation *op, Optional<unsigned> sourceNo,		verifyTypesAlongAllEdges(Operation *op, Optional<unsigned> sourceNo,
▲ Show 20 Lines • Show All 133 Lines • Show Last 20 Lines

mlir/test/Analysis/test-number-of-block-executions.mlir

This file was added.

				// RUN: mlir-opt %s \
				// RUN: -test-print-number-of-block-executions \
				// RUN: -split-input-file 2>&1 \
				// RUN: \| FileCheck %s --dump-input=always

				// CHECK-LABEL: Number of executions: empty
				func @empty() {
				// CHECK: Block: 0
				// CHECK-NEXT: Number of executions: 1
				return
				}

				// -----

				// CHECK-LABEL: Number of executions: sequential
				func @sequential() {
				// CHECK: Block: 0
				// CHECK-NEXT: Number of executions: 1
				br ^bb1
				^bb1:
				// CHECK: Block: 1
				// CHECK-NEXT: Number of executions: 1
				br ^bb2
				^bb2:
				// CHECK: Block: 2
				// CHECK-NEXT: Number of executions: 1
				return
				}

				// -----

				// CHECK-LABEL: Number of executions: conditional
				func @conditional(%cond : i1) {
				// CHECK: Block: 0
				// CHECK-NEXT: Number of executions: 1
				br ^bb1
				^bb1:
				// CHECK: Block: 1
				// CHECK-NEXT: Number of executions: 1
				cond_br %cond, ^bb2, ^bb3
				^bb2:
				// CHECK: Block: 2
				// CHECK-NEXT: Number of executions: <unknown>
				br ^bb4
				^bb3:
				// CHECK: Block: 3
				// CHECK-NEXT: Number of executions: <unknown>
				br ^bb4
				^bb4:
				// CHECK: Block: 4
				// CHECK-NEXT: Number of executions: <unknown>
				return
				}

				// -----

				// CHECK-LABEL: Number of executions: loop
				func @loop(%cond : i1) {
				// CHECK: Block: 0
				// CHECK-NEXT: Number of executions: 1
				br ^bb1
				^bb1:
				// CHECK: Block: 1
				// CHECK-NEXT: Number of executions: <unknown>
				br ^bb2
				^bb2:
				// CHECK: Block: 2
				// CHECK-NEXT: Number of executions: <unknown>
				br ^bb3
				^bb3:
				// CHECK: Block: 3
				// CHECK-NEXT: Number of executions: <unknown>
				cond_br %cond, ^bb1, ^bb4
				^bb4:
				// CHECK: Block: 4
				// CHECK-NEXT: Number of executions: <unknown>
				return
				}

				// -----

				// CHECK-LABEL: Number of executions: scf_if_dynamic_branch
				func @scf_if_dynamic_branch(%cond : i1) {
				// CHECK: Block: 0
				// CHECK-NEXT: Number of executions: 1
				scf.if %cond {
				// CHECK: Block: 1
				// CHECK-NEXT: Number of executions: <unknown>
				} else {
				// CHECK: Block: 2
				// CHECK-NEXT: Number of executions: <unknown>
				}
				return
				}

				// -----

				// CHECK-LABEL: Number of executions: async_execute
				func @async_execute() {
				// CHECK: Block: 0
				// CHECK-NEXT: Number of executions: 1
				async.execute {
				// CHECK: Block: 1
				// CHECK-NEXT: Number of executions: 1
				async.yield
				}
				return
				}

				// -----

				// CHECK-LABEL: Number of executions: async_execute_with_scf_if
				func @async_execute_with_scf_if(%cond : i1) {
				// CHECK: Block: 0
				// CHECK-NEXT: Number of executions: 1
				async.execute {
				// CHECK: Block: 1
				// CHECK-NEXT: Number of executions: 1
				scf.if %cond {
				// CHECK: Block: 2
				// CHECK-NEXT: Number of executions: <unknown>
				} else {
				// CHECK: Block: 3
				// CHECK-NEXT: Number of executions: <unknown>
				}
				async.yield
				}
				return
				}

				// -----

				// CHECK-LABEL: Number of executions: scf_for_constant_bounds
				func @scf_for_constant_bounds() {
				// CHECK: Block: 0
				// CHECK-NEXT: Number of executions: 1
				%c0 = constant 0 : index
				%c1 = constant 1 : index
				%c2 = constant 2 : index

				scf.for %i = %c0 to %c2 step %c1 {
				// CHECK: Block: 1
				// CHECK-NEXT: Number of executions: 2
				}

				return
				}

				// -----

				// CHECK-LABEL: Number of executions: propagate_parent_num_executions
				func @propagate_parent_num_executions() {
				// CHECK: Block: 0
				// CHECK-NEXT: Number of executions: 1
				%c0 = constant 0 : index
				%c1 = constant 1 : index
				%c2 = constant 2 : index

				scf.for %i = %c0 to %c2 step %c1 {
				// CHECK: Block: 1
				// CHECK-NEXT: Number of executions: 2
				async.execute {
				// CHECK: Block: 2
				// CHECK-NEXT: Number of executions: 2
				async.yield
				}
				}

				return
				}

				// -----

				// CHECK-LABEL: Number of executions: clear_num_executions
				func @clear_num_executions(%step : index) {
				// CHECK: Block: 0
				// CHECK-NEXT: Number of executions: 1
				%c0 = constant 0 : index
				%c2 = constant 2 : index

				scf.for %i = %c0 to %c2 step %step {
				// CHECK: Block: 1
				// CHECK-NEXT: Number of executions: <unknown>
				async.execute {
				// CHECK: Block: 2
				// CHECK-NEXT: Number of executions: <unknown>
				async.yield
				}
				}

				return
				}

mlir/test/Analysis/test-number-of-operation-executions.mlir

This file was added.

				// RUN: mlir-opt %s \
				// RUN: -test-print-number-of-operation-executions \
				// RUN: -split-input-file 2>&1 \
				// RUN: \| FileCheck %s

				// CHECK-LABEL: Number of executions: empty
				func @empty() {
				// CHECK: Operation: std.return
				// CHECK-NEXT: Number of executions: 1
				return
				}

				// -----

				// CHECK-LABEL: Number of executions: propagate_parent_num_executions
				func @propagate_parent_num_executions() {
				// CHECK: Operation: std.constant
				// CHECK-NEXT: Number of executions: 1
				%c0 = constant 0 : index
				// CHECK: Operation: std.constant
				// CHECK-NEXT: Number of executions: 1
				%c1 = constant 1 : index
				// CHECK: Operation: std.constant
				// CHECK-NEXT: Number of executions: 1
				%c2 = constant 2 : index

				// CHECK-DAG: Operation: scf.for
				// CHECK-NEXT: Number of executions: 1
				scf.for %i = %c0 to %c2 step %c1 {
				// CHECK-DAG: Operation: async.execute
				// CHECK-NEXT: Number of executions: 2
				async.execute {
				// CHECK-DAG: Operation: async.yield
				// CHECK-NEXT: Number of executions: 2
				async.yield
				}
				}

				return
				}

				// -----

				// CHECK-LABEL: Number of executions: clear_num_executions
				func @clear_num_executions(%step : index) {
				// CHECK: Operation: std.constant
				// CHECK-NEXT: Number of executions: 1
				%c0 = constant 0 : index
				// CHECK: Operation: std.constant
				// CHECK-NEXT: Number of executions: 1
				%c2 = constant 2 : index

				// CHECK: Operation: scf.for
				// CHECK-NEXT: Number of executions: 1
				scf.for %i = %c0 to %c2 step %step {
				// CHECK: Operation: async.execute
				// CHECK-NEXT: Number of executions: <unknown>
				async.execute {
				// CHECK: Operation: async.yield
				// CHECK-NEXT: Number of executions: <unknown>
				async.yield
				}
				}

				return
				}

mlir/test/lib/Transforms/CMakeLists.txt

Show All 18 Lines	add_mlir_library(MLIRTestTransforms
TestLinalgCodegenStrategy.cpp		TestLinalgCodegenStrategy.cpp
TestLinalgFusionTransforms.cpp		TestLinalgFusionTransforms.cpp
TestLinalgHoisting.cpp		TestLinalgHoisting.cpp
TestLinalgTransforms.cpp		TestLinalgTransforms.cpp
TestLiveness.cpp		TestLiveness.cpp
TestLoopMapping.cpp		TestLoopMapping.cpp
TestLoopParametricTiling.cpp		TestLoopParametricTiling.cpp
TestLoopUnrolling.cpp		TestLoopUnrolling.cpp
		TestNumberOfExecutions.cpp
TestOpaqueLoc.cpp		TestOpaqueLoc.cpp
TestMemRefBoundCheck.cpp		TestMemRefBoundCheck.cpp
TestMemRefDependenceCheck.cpp		TestMemRefDependenceCheck.cpp
TestMemRefStrideCalculation.cpp		TestMemRefStrideCalculation.cpp
TestSCFUtils.cpp		TestSCFUtils.cpp
TestVectorTransforms.cpp		TestVectorTransforms.cpp

EXCLUDE_FROM_LIBMLIR		EXCLUDE_FROM_LIBMLIR
Show All 32 Lines

mlir/test/lib/Transforms/TestNumberOfExecutions.cpp

This file was added.

				//===- TestNumberOfExecutions.cpp - Test number of executions analysis ----===//
				//
				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
				//
				//===----------------------------------------------------------------------===//
				//
				// This file contains test passes for constructing and resolving number of
				// executions information.
				//
				//===----------------------------------------------------------------------===//

				#include "mlir/Analysis/NumberOfExecutions.h"
				#include "mlir/Pass/Pass.h"

				using namespace mlir;

				namespace {

				struct TestNumberOfBlockExecutionsPass
				: public PassWrapper<TestNumberOfBlockExecutionsPass, FunctionPass> {
				void runOnFunction() override {
				llvm::errs() << "Number of executions: " << getFunction().getName() << "\n";
				getAnalysis<NumberOfExecutions>().printBlockExecutions(
				llvm::errs(), &getFunction().getBody());
				}
				};

				struct TestNumberOfOperationExecutionsPass
				: public PassWrapper<TestNumberOfOperationExecutionsPass, FunctionPass> {
				void runOnFunction() override {
				llvm::errs() << "Number of executions: " << getFunction().getName() << "\n";
				getAnalysis<NumberOfExecutions>().printOperationExecutions(
				llvm::errs(), &getFunction().getBody());
				}
				};

				} // end anonymous namespace

				namespace mlir {
				namespace test {
				void registerTestNumberOfBlockExecutionsPass() {
				PassRegistration<TestNumberOfBlockExecutionsPass>(
				"test-print-number-of-block-executions",
				"Print the contents of a constructed number of executions analysis for "
				"all blocks.");
				}

				void registerTestNumberOfOperationExecutionsPass() {
				PassRegistration<TestNumberOfOperationExecutionsPass>(
				"test-print-number-of-operation-executions",
				"Print the contents of a constructed number of executions analysis for "
				"all operations.");
				}
				} // namespace test
				} // namespace mlir

mlir/tools/mlir-opt/mlir-opt.cpp

Show First 20 Lines • Show All 75 Lines • ▼ Show 20 Lines
void registerTestLinalgHoisting();		void registerTestLinalgHoisting();
void registerTestLinalgTransforms();		void registerTestLinalgTransforms();
void registerTestLivenessPass();		void registerTestLivenessPass();
void registerTestLoopFusion();		void registerTestLoopFusion();
void registerTestLoopMappingPass();		void registerTestLoopMappingPass();
void registerTestLoopUnrollingPass();		void registerTestLoopUnrollingPass();
void registerTestMemRefDependenceCheck();		void registerTestMemRefDependenceCheck();
void registerTestMemRefStrideCalculation();		void registerTestMemRefStrideCalculation();
		void registerTestNumberOfBlockExecutionsPass();
		void registerTestNumberOfOperationExecutionsPass();
void registerTestOpaqueLoc();		void registerTestOpaqueLoc();
void registerTestPreparationPassWithAllowedMemrefResults();		void registerTestPreparationPassWithAllowedMemrefResults();
void registerTestRecursiveTypesPass();		void registerTestRecursiveTypesPass();
void registerTestSCFUtilsPass();		void registerTestSCFUtilsPass();
void registerTestVectorConversions();		void registerTestVectorConversions();
} // namespace test		} // namespace test
} // namespace mlir		} // namespace mlir

▲ Show 20 Lines • Show All 48 Lines • ▼ Show 20 Lines	#endif
test::registerTestLinalgHoisting();		test::registerTestLinalgHoisting();
test::registerTestLinalgTransforms();		test::registerTestLinalgTransforms();
test::registerTestLivenessPass();		test::registerTestLivenessPass();
test::registerTestLoopFusion();		test::registerTestLoopFusion();
test::registerTestLoopMappingPass();		test::registerTestLoopMappingPass();
test::registerTestLoopUnrollingPass();		test::registerTestLoopUnrollingPass();
test::registerTestMemRefDependenceCheck();		test::registerTestMemRefDependenceCheck();
test::registerTestMemRefStrideCalculation();		test::registerTestMemRefStrideCalculation();
		test::registerTestNumberOfBlockExecutionsPass();
		test::registerTestNumberOfOperationExecutionsPass();
test::registerTestOpaqueLoc();		test::registerTestOpaqueLoc();
test::registerTestRecursiveTypesPass();		test::registerTestRecursiveTypesPass();
test::registerTestSCFUtilsPass();		test::registerTestSCFUtilsPass();
test::registerTestVectorConversions();		test::registerTestVectorConversions();
}		}
#endif		#endif

int main(int argc, char **argv) {		int main(int argc, char **argv) {
Show All 13 Lines

mlir/unittests/Support/CMakeLists.txt

	add_mlir_unittest(MLIRSupportTests			add_mlir_unittest(MLIRSupportTests
	IndentedOstreamTest.cpp			IndentedOstreamTest.cpp
				MathExtrasTest.cpp
	)			)

	target_link_libraries(MLIRSupportTests			target_link_libraries(MLIRSupportTests
	PRIVATE MLIRSupportIndentedOstream MLIRSupport)			PRIVATE MLIRSupportIndentedOstream MLIRSupport)

mlir/unittests/Support/MathExtrasTest.cpp

This file was added.

				//===- MathExtrasTest.cpp - MathExtras Tests ------------------------------===//
				//
				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
				//
				//===----------------------------------------------------------------------===//

				#include "mlir/Support/MathExtras.h"
				#include "gmock/gmock.h"

				using namespace mlir;
				using ::testing::Eq;

				TEST(MathExtrasTest, CeilDivTest) {
				EXPECT_THAT(ceilDiv(14, 3), Eq(5));
				EXPECT_THAT(ceilDiv(14, -3), Eq(-4));
				EXPECT_THAT(ceilDiv(-14, -3), Eq(5));
				EXPECT_THAT(ceilDiv(-14, 3), Eq(-4));
				}

				TEST(MathExtrasTest, FloorDivTest) {
				EXPECT_THAT(floorDiv(14, 3), Eq(4));
				EXPECT_THAT(floorDiv(14, -3), Eq(-5));
				EXPECT_THAT(floorDiv(-14, -3), Eq(4));
				EXPECT_THAT(floorDiv(-14, 3), Eq(-5));
				}

This is an archive of the discontinued LLVM Phabricator instance.

[mlir] Add NumberOfExecutions analysis + update RegionBranchOpInterface interface to query number of region invocationsClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 304430

mlir/include/mlir/Analysis/NumberOfExecutions.h

mlir/include/mlir/Dialect/Async/IR/Async.h

mlir/include/mlir/Dialect/Async/IR/AsyncOps.td

mlir/include/mlir/Dialect/SCF/SCFOps.td

mlir/include/mlir/Interfaces/ControlFlowInterfaces.h

mlir/include/mlir/Interfaces/ControlFlowInterfaces.td

mlir/include/mlir/Support/MathExtras.h

mlir/lib/Analysis/CMakeLists.txt

mlir/lib/Analysis/NumberOfExecutions.cpp

mlir/lib/Dialect/Async/IR/Async.cpp

mlir/lib/Dialect/SCF/SCF.cpp

mlir/lib/Interfaces/ControlFlowInterfaces.cpp

mlir/test/Analysis/test-number-of-block-executions.mlir

mlir/test/Analysis/test-number-of-operation-executions.mlir

mlir/test/lib/Transforms/CMakeLists.txt

mlir/test/lib/Transforms/TestNumberOfExecutions.cpp

mlir/tools/mlir-opt/mlir-opt.cpp

mlir/unittests/Support/CMakeLists.txt

mlir/unittests/Support/MathExtrasTest.cpp

[mlir] Add NumberOfExecutions analysis + update RegionBranchOpInterface interface to query number of region invocations
ClosedPublic