This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
mlir/
-
include/mlir/Dialect/Affine/IR/
-
mlir/
-
Dialect/
-
Affine/
-
IR/
8/10
AffineOps.td
-
lib/Dialect/Affine/IR/
-
Dialect/
-
Affine/
-
IR/
26/26
AffineOps.cpp
-
test/Dialect/Affine/
-
Dialect/
-
Affine/
-
execute-region.mlir
-
invalid.mlir

Differential D72223

[MLIR] Introduce affine.execute_region op
Changes PlannedPublic

Authored by bondhugula on Jan 4 2020, 9:41 PM.

Download Raw Diff

Details

Reviewers

mehdi_amini
ftynse
andydavis1
nicolasvasilache

Summary

The affine.execute_region op executes its region exactly once while
defining a new polyhedral scope for its region for analysis and
transformation purposes, i.e., a new symbol context is defined for
operations appearing in its region. This allows the polyhedral form to
be used in a wider context without the need for function outlining.
The op explicitly captures only memrefs and is lowered readily to an
std.execute_region.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

bondhugula created this revision.Jan 4 2020, 9:41 PM

Herald added a reviewer: nicolasvasilache. · View Herald TranscriptJan 4 2020, 9:41 PM

Herald added a project: Restricted Project. · View Herald Transcript

Herald added subscribers: llvm-commits, lucyrfox, mgester and 6 others. · View Herald Transcript

rename getParentAffineScope -> getAffineScope

Herald added a reviewer: nicolasvasilache. · View Herald TranscriptJan 4 2020, 9:42 PM

bondhugula removed reviewers: rriddle, mehdi_amini, ftynse.Jan 4 2020, 9:46 PM

Herald added subscribers: rriddle, mehdi_amini. · View Herald TranscriptJan 4 2020, 9:46 PM

bondhugula added reviewers: mehdi_amini, rriddle, ftynse, dcaballe.Jan 4 2020, 9:47 PM

Update doc comments.

invalid test cases
update affine dialect doc

fix comments

add missed result parsing
fix verifier

mehdi_amini requested changes to this revision.Jan 5 2020, 5:49 AM

mehdi_amini added inline comments.

mlir/docs/Dialects/Affine.md
612 ↗	(On Diff #236236)	I don’t think you addressed my concerns on this topic?

This revision now requires changes to proceed.Jan 5 2020, 5:49 AM

bondhugula marked an inline comment as done.Jan 5 2020, 7:06 AM

bondhugula added inline comments.

mlir/docs/Dialects/Affine.md
612 ↗	(On Diff #236236)	I think I responded to everything and included all of the arguments in the RFC: https://github.com/bondhugula/llvm-project/blob/graybox/mlir/rfc/rfc-graybox.md Could you just provide a summary list of the concerns you still have - either here or on that thread as you prefer?

mehdi_amini added inline comments.Jan 5 2020, 2:35 PM

mlir/docs/Dialects/Affine.md
612 ↗	(On Diff #236236)	I explained my concerns in the original thread https://groups.google.com/a/tensorflow.org/d/msg/mlir/O5PXVbtlSng/3SXmxDiLAAAJ Here is what I wrote: I am trying to not consider affine at all here. I wrote these example to try to illustrate how MLIR region/op interaction are structured opaquely to be able to derive cross-dialect invariants in general. The invariant I am presenting above is independent from any dialect, let me abstract the type further: func @foo(%value : !dialect.type) { op.with_region { any.op(%value) : (!dialect.type) -> () } } If I look at it generically, here is my take on it: a) The `op.with_region` defines the semantic of its immediate region, so it can either accept or reject `any.op`. b) Let's assume that `op.with_region` does not know anything about `any.op` (no traits, no prior knowledge, the `any.op` could be unregistered at this point). c) For this IR to be valid, `op.with_region` must be accepting unknown op (like `any.op`). d) From the perspective of `op.with_region`, the `any.op` is like an opaque call to some code it cannot see. But what if `any.op` has a region? func @foo(%value : !dialect.type) { op.with_region { any.op(%value) ({ ^bb(%value_inside): // do something with %value_inside (explicitly captured) }) : (!dialect.type) -> () }} Here what changes is that: e) any.op has a region now. Unless `op.with_region` forbids unknown operation from having a region in its verifier, and it does not have specific handling for `any.op`, then the IR should be valid. f) Since %value_inside is explicitly captured, without knowing specifically `any.op`, then the uses of %value_inside cannot be restricted by `op.with_region` but only by `any_op`. g) For any practical purpose here, there should not be any difference between this form and the first one above. Finally, what if `any.op` has implicit capture? func @foo(%value : !dialect.type) { op.with_region { any.op() ({ // do something with %value (implicit capture instead of explicit) }) : (!dialect.type) -> () }} Now: h) `any.op()` is implicitly capturing %value. g) Without more information about `any.op` (traits, etc.), this should be equivalent to the explicit capture case: if the IR was valid the first and second case, then it should be valid here. If we don't have these properties, and if `op.with_region` can constrain the validity of the region attached to `any.op`, then `any.op` is not longer in control of the semantics of the enclosed region. No transformation can operate on `any.op` without knowing all of the enclosing operations, since these can add arbitrary restrictions. For example, this is a valid IR (you can pipe this in mlir-opt right now): module { "d1.op1" () ({ "d2.op2" () ({ module { func @bar() { return } func @foo() { call @bar() : () -> () return } } "d2.done" () : () -> () }) : () -> () }) : () -> () } If I get the inner @foo function, and would like to inline the call to @bar, what do I need to check to ensure I can? If the FuncOp defines the semantic of the region, then the FuncOp should control itself whether it allows to inline or not, and I should query FuncOp for @foo, CallOp for the call-site, FuncOp for @bar, and likely the op inside @bar that I am about to inline. If you allow to put restriction on what can happen inside @foo(), based on the enclosing operation, then you can't inline unless you ensure that all the enclosing operation will be happy with it (so you need to check the enclosing modules, but also "d1.op1" and "d2.op2"). Basically, this would be breaking the composability of the IR: you couldn't assemble independent pieces and reason about them independently. I don't know why we would want that, here we really want to reason about the functions in the inner module independently if they are surrounded by "d1.op1"() and "d2.op2"() like here (otherwise none of the current passes in MLIR are correct). I don't think you answered these points that explain why I am not convinced it is OK to have explicit capture just for memref. I don't think it is necessary to have explicit capture of memref either by the way, dropping this may help getting forward right now. You answered the email above in the thread, but you didn't address it I believe, you wrote "I'll respond to the connection to the affine.graybox proposal in another post" but I didn't see another post after that. (I'm on vacation till 1/10, expect some delays in answers)

bondhugula marked an inline comment as done.Jan 5 2020, 8:31 PM

bondhugula added inline comments.

mlir/docs/Dialects/Affine.md
612 ↗	(On Diff #236236)	The downsides of not explicitly capturing memrefs on the graybox are discussed in the RFC here. https://github.com/bondhugula/llvm-project/blob/graybox/mlir/rfc/rfc-graybox.md#rationale-and-design-alternatives-what-to-explicitly-capture Another way to think about this is that: because grayboxes introduce a new symbol context, most polyhedral walks would like to conceptually see the graybox just as a "call" with those memrefs passed to start with. Other arbitrary/unknown ops with regions don't start a symbol context and so the affine walks will just walk through such ops (just like they walk through affine.fors/ifs that are encountered). OTOH, walkAffine will not walk through grayboxes from the top. If you don't explicitly capture, the key is that most polyhedral/affine passes will have to stop/check every op for a graybox and then scan the interior of that op for memrefs if it turns out to be a graybox. (For the future, this would even go against multithreading polyhedral passes to run concurrently on different func's and grayboxes in an isolated way, but I'm not bringing this up now in the RFC). With an explicit capture, things would just be a regular operand scan of ops scanned with walkAffine. For the future cases where you really need more precise information on what's happening to the memrefs inside the graybox (just like you may want to in the case of call's), that can be done as needed. Non-memref values on the other hand just move transparently across the boundaries of grayboxes in the regular SSA passes/canonicalizations or hybrid polyhedral/SSA ones. My concerns with explicit capture are actually very different from yours: that they make it harder to move IR across without actually updating the memrefs being used (either hoist from inside to outside or sink). You'd have to check if you are moving past a graybox and then remap memrefs (consider scalar replacement on affine load/stores, LICM as examples). But I still strongly feel that explicit capture of the memrefs is the right tradeoff to start with (even if perhaps not the right one eventually) - we can reevaluate its impact and drop if necessary. Best, Uday

bondhugula marked an inline comment as done.Jan 6 2020, 12:59 AM

bondhugula added inline comments.

mlir/docs/Dialects/Affine.md
612 ↗	(On Diff #236236)	And finally reg. your points on unknown ops with regions that explicitly capture in addition to perhaps implicitly: that's an issue separate from affine.graybox and that exists in the codebase as is now. If you have an op whose operands bind to its region's arguments in ways that are unknown, the current affine passes/utilities don't check for example if the memrefs inside are shadows, how memory footprints / dependences should be accurate computed, etc. (in fact, they are already incorrect because they'd walk through that op and treat those memrefs as distinct). The solution to this would depend on the pass/utility: some should just do nothing in the presence of unknown ops that have one or more regions (we shouldn't even say 'explicit capture' here because we don't even know if it's really capturing in spite of having operands). Since that same op could also implicitly capture (as you mention in one case), choosing to not look inside may not be an option depending on the pass (unless of course the op is known to be isolated from above). But how is all this related to explicit capture in affine.grayboxes? The latter is a known op where the binding between its operands and region's arguments is well-defined.

bondhugula mentioned this in D75837: [MLIR] Introduce scf.execute_region op.Mar 18 2020, 3:58 AM

bondhugula retitled this revision from [LLVM] [MLIR] Introduce affine graybox op to [MLIR] Introduce affine graybox op.Mar 24 2020, 6:35 AM

Herald added subscribers: Joonsoo, liufengdb, aartbik. · View Herald TranscriptMar 24 2020, 6:35 AM

bondhugula edited reviewers, added: andydavis1; removed: nicolasvasilache, dcaballe.Mar 24 2020, 6:36 AM

Herald added a reviewer: nicolasvasilache. · View Herald TranscriptMar 24 2020, 6:36 AM

Rebase.

Herald added a subscriber: grosul1. · View Herald TranscriptMar 30 2020, 2:24 PM

changes after rebase - unregistered ops

Harbormaster failed remote builds in B51027: Diff 253701!Mar 30 2020, 3:17 PM

Harbormaster failed remote builds in B51028: Diff 253702!Mar 30 2020, 3:51 PM

Rename affine.graybox -> affine.execute_region. Rebase + updates to
context-sensitive valid dim/symbol checking.

Herald added a subscriber: frgossen. · View Herald TranscriptApr 18 2020, 1:28 PM

Harbormaster failed remote builds in B53866: Diff 258551!Apr 18 2020, 1:28 PM

bondhugula retitled this revision from [MLIR] Introduce affine graybox op to [MLIR] Introduce affine.execute_region op.Apr 18 2020, 1:30 PM

bondhugula edited the summary of this revision. (Show Details)

bondhugula removed a reviewer: rriddle.

bondhugula added a parent revision: D75837: [MLIR] Introduce scf.execute_region op.

bondhugula edited the summary of this revision. (Show Details)

I am very sorry, for some reason this diff was not appearing on my phabricator todo list until recently.... Please do not hesitate to ping me by email if I take more than a couple of workdays to iterate.

In general, this change makes sense to me and looks like a proper extension of the affine modeling. My only design concern is that one can circumvent the explicit-capture mechanism for memrefs. A direct solution would be to disallow any values of memref type to be defined in the region, but I have not considered the implications of this on the expressiveness.

There are several places where some action is performed twice (e.g., parsing attribute dict), requesting change for these.

mlir/include/mlir/Dialect/Affine/IR/AffineOps.td
1115–1116	I understand the idea of the restriction, but it looks like it can be circumvented by: %0 = ... : memref<...> %1 = cheat.wrap %0 : memref<...> -> !cheat.opaque affine.execute_region { // This defines the memref inside the region, so seemingly complies with the semantics. %2 = cheat.unwrap %1 : !cheat.opaque -> memref<...> affine.load %2[...] }
1117–1120	I would avoid referring FuncOp, it is not special in any sense, and it actually allows any terminator. Neither would I remind that blocks must have terminators, it's a core IR requirement. "returns to right after the affine.execute_region op" sounds unnecessarily complex to me. "std.return" terminator placed inside blocks of the "affine.execute_region" returns the control flow to the "affine.execute_region". Since we already mentioned that "execution_region" executes the region exactly ones, it is naturally implied that the control flow will be further transferred to the control-flow successor of "execution_region...
1141	Syntax nit: I'd consider omitting empty `[] = ()`, looks distracting
mlir/lib/Dialect/Affine/IR/AffineOps.cpp
234	Nit: wouldn't it be easier to accept the region as argument instead of the operation that contains it?
247	Would variadic template help you? `isKnownIsolatedFromAbove` isn't a class...
254	The assert message is misleading. The op may have parent ops, just none that satisfy the "affine scope" conditions. Also, use `llvm_unreachable` instead of `assert(false)` and drop the return value
319–320	I cannot relate this comment to the code below.
322	Shouldn't this also check for "KnownIsolatedFromAbove" ?
1096	Nit: we tend to use `auto` when it increases readability, `Operation *` would look just fine here
2669	Prefer ValueRange to ArrayRef<Value>
2678	Nit: drop trivial braces
2681	If you have ValueRange, this entire vector manipulation gets replaced by `memrefs.getTypes()`
2682	You already pushed it in line 2295
2718	Avoid capturing `llvm::enumerate` by-reference. An enumerator only keeps iterators, so taking copying it is cheap, and we avoid running into potential problems with implicit lifetime extension
2720	Since you already have an enumerator, you might as well mention the position of the mismatching argument.
2783	You already parser the attr dict 4 lines above.

This revision now requires changes to proceed.Apr 24 2020, 6:50 AM

Herald added a subscriber: Kayjukh. · View Herald TranscriptApr 24 2020, 6:50 AM

In D72223#2001800, @ftynse wrote:

I am very sorry, for some reason this diff was not appearing on my phabricator todo list until recently.... Please do not hesitate to ping me by email if I take more than a couple of workdays to iterate.

In general, this change makes sense to me and looks like a proper extension of the affine modeling. My only design concern is that one can circumvent the explicit-capture mechanism for memrefs. A direct solution would be to disallow any values of memref type to be defined in the region, but I have not considered the implications of this on the expressiveness.

There are several places where some action is performed twice (e.g., parsing attribute dict), requesting change for these.

Thanks for the detailed review! Yes, we need to still converge on the memref explicit capture. But I'll address these other straightforward changes first.

Address review comments.

mlir/include/mlir/Dialect/Affine/IR/AffineOps.td
1115–1116	Independently of affine.execute_region, such a problem exists with memref non-dereferencing ops and it can / has to be tackled. For example, consider this variation of your snippet (no affine.execute_region). %0 = ... : memref<...> %1 = cheat.wrap %0 : memref<...> -> !cheat.opaque %2 = cheat.load %1[0, 0] : !cheat.opaque cheat.store %v, %1[0, 0] : !cheat.opaque call @foo(%1) : (!cheat.opaque) -> () affine.load %0[0, 0] : memref<...> Note that all the current passes/utilities including affine store to load fwd'ing, invariant load hoisting/scalar rep, dependence analysis itself, the affine loop fusion would do the wrong thing here because there is an escape side channel like you show. So, the thing you are pointing to is interesting and has to be tackled, but if we step back and take another look, this is the same as the larger issue of dealing with escaping or aliasing and not specific to execute_region. A solution to deal with these is to actually detect/treat unknown memref non-dereferencing ops that define SSA values (like your `cheat.unwrap`) and bail out in their presence (depending on what we are doing). For eg. the dependence information isn't going to be accurate in their presence. The point of explicit captures is that you know which memrefs are going in, but it isn't free of the problem of side-channel escapes that manifest in straightline code themselves. But whenever we don't have such escapes (and we can always detect that being conservative), which I believe is the common scenario, having the captures does serve its intended purpose -- you still won't have to look inside and do something special with these ops in all passes/utilities.
1117–1120	Sure, dropped these lines.
mlir/lib/Dialect/Affine/IR/AffineOps.cpp
234	Actually, it is - thanks!
247	Right - it won't; the comment is probably stale (it was FuncOp and AffineExecuteRegionOp earlier). Anyway this code will be updated to make use of a new op trait 'PolyhedralScope' in another revision, which I'll submit as this one's parent. AffineExecuteRegion will be marked with this trait and other ops that want to define new scopes can have that trait.
254	Thanks.
322	This was checked as part of isValidSymbol above.
1096	Sure.
2681	Sure, this was really old code - that didn't get updated post ValueRange/TypeRange migrations!
2682	Thanks!

Harbormaster failed remote builds in B54614: Diff 259980!Apr 24 2020, 2:07 PM

bondhugula edited the summary of this revision. (Show Details)Apr 26 2020, 8:43 PM

Once again, this disappeared from my attention list... Is it due to "unresolved" grand-parent diff where I'm not listed as a reviewer?

mlir/include/mlir/Dialect/Affine/IR/AffineOps.td
1115–1116	In the general case, I suppose it can be even worse than that. func @foo(%arg0: !cheat.opaque, %arg1: memref<..>) { // opaque and memref alias cheat.do_cheat %arg0 // this affects the memref affine.load %arg1[...] } which looks like we either need a powerful and abstract enough way to describe the aliasing between objects of different types, or to treat any side-effecting operation conservatively. Anyway, I agree with the argument that the proposed approach is no worse than what we already have in Affine and I would like to make progress on this.

mehdi_amini added inline comments.Apr 27 2020, 1:12 PM

mlir/lib/Dialect/Affine/IR/AffineOps.cpp
2698	You don't need an `if` here.
2702	Seems like you can fix the FIXME and just write: `if(op->isProperAncestor(memref.getDefiningOp()) continue;`

In D72223#2005186, @ftynse wrote:

Once again, this disappeared from my attention list... Is it due to "unresolved" grand-parent diff where I'm not listed as a reviewer?

That shouldn't happen logically - if it helps, I can add you as a reviewer there as well!

Rebase on polyhedral scope trait (simplifies this revision) + address remaining concerns

mlir/include/mlir/Dialect/Affine/IR/AffineOps.td
1115–1116	which looks like we either need a powerful and abstract enough way to describe the aliasing between objects of different types, or to treat any side-effecting operation That's right, thanks.
mlir/lib/Dialect/Affine/IR/AffineOps.cpp
2698	You actually do :-) (Notice the continue above is guarded again inside. ) - although I could better use an else here.
2702	Thanks, but the block arg owner check above also has to be fixed similarly to allow it to be any descendent of 'op'. Done - used a lambda with an all_of.

Harbormaster failed remote builds in B54931: Diff 260562!Apr 28 2020, 2:07 AM

bondhugula removed a parent revision: D75837: [MLIR] Introduce scf.execute_region op.Jun 18 2021, 7:29 PM

Herald added a project: Restricted Project. · View Herald TranscriptJun 18 2021, 7:29 PM

Herald added subscribers: dcaballe, cota, teijeong and 6 others. · View Herald Transcript

bondhugula edited the summary of this revision. (Show Details)Jun 18 2021, 7:29 PM

Rebase on upstream tip. Bring code to date.

Minor update to test cases.

bondhugula added inline comments.Jun 19 2021, 2:35 AM

mlir/include/mlir/Dialect/Affine/IR/AffineOps.td
1141	@ftynse Unfortunately, the parser API won't support this unless we want to disallow `[] = ()` and even there we won't be able to emit the right error message. (Basically we won't know whether to parse the `=`.)

Harbormaster completed remote builds in B110060: Diff 353183.Jun 19 2021, 5:38 PM

ftynse accepted this revision.Jun 21 2021, 4:16 AM

ftynse added inline comments.

mlir/include/mlir/Dialect/Affine/IR/AffineOps.td
120
mlir/lib/Dialect/Affine/IR/AffineOps.cpp
2673	We can now do `builder.createBlock`

Address review comments.

Harbormaster completed remote builds in B111109: Diff 354651.Jun 26 2021, 1:29 AM

Avoid createBlock

Harbormaster completed remote builds in B111122: Diff 354666.Jun 26 2021, 5:32 AM

bondhugula added inline comments.Jun 27 2021, 8:22 PM

mlir/lib/Dialect/Affine/IR/AffineOps.cpp
2673	Unfortunately, using this messes up the insertion point subsequently - outweighs the idiomacy it provides and I'd like to avoid using a scope guard.

@mehdi_amini Could you take a look and let me know if you have any concerns with bringing this in? One thing this unblocks is an inliner into an affine.for/if.

What does this op bring on top of scf.execute_region? Can scf.execute_region carry the AffineScope trait?

Somehow related, in another thread of discussion, we discussed reversing the AffineScope traits: replacing it by a trait that would mean the opposite (let's say NotAnAffineScope as a straw man) and consider every operation that don't define NotAnAffineScope. This would enable to use any op unknown to affine that define a region inside an affine scope, it would just create implicitly a new scope. We could avoid the need to tag operations like FuncOp with AffineScope since that would be the default. Instead only affine operation would need tagging.

mlir/include/mlir/Dialect/Affine/IR/AffineOps.td
120	(formatting still to be fixed here)

Re-read the RFC on "graybox", I am still entirely unconvinced by the current design and the explicit capture of memref used here. I think this is less flexible and not justified all-in-all. believe the more extensible alternative is to support opaque nested regions in general and treat them conservatively as new scope (basically what I wrote above about reversing the trait).
But I'm not willing to die on this hill today, if @ftynse agrees with you that affine should stay a bit more isolated, and that this explicit capture is really worth it right now, then feel free to move forward.

In D72223#2845669, @mehdi_amini wrote:

What does this op bring on top of scf.execute_region? Can scf.execute_region carry the AffineScope trait?

scf.execute_region can carry the affinescope trait (or equivalently can be free of the "ExpandAffineScope" trait which is the complement of AffineScope and which is what we want to replace AffineScope with). The real difference between the two is just going to be memref captures.

Somehow related, in another thread of discussion, we discussed reversing the AffineScope traits: replacing it by a trait that would mean the opposite (let's say NotAnAffineScope as a straw man) and consider every operation that don't define NotAnAffineScope. This would enable to use any op unknown to affine that define a region inside an affine scope, it would just create implicitly a new scope. We could avoid the need to tag operations like FuncOp with AffineScope since that would be the default. Instead only affine operation would need tagging.

Yes, we already discussed all of this. Just for the name, "ExpandsAffineScope" is more meaningful since it's adding to affine scope started by an op above that didn't have that trait. Any region holding op that doesn't have ExpandsAffineScope starts such a scope. Only affine.for, affine.if, and affine.parallel will have this trait. It's the complement of the current AffineScope.

In D72223#2845698, @mehdi_amini wrote:

Re-read the RFC on "graybox", I am still entirely unconvinced by the current design and the explicit capture of memref used here. I think this is less flexible and not justified all-in-all. believe the more extensible alternative is to support opaque nested regions in general and treat them conservatively as new scope (basically what I wrote above about reversing the trait).
But I'm not willing to die on this hill today, if @ftynse agrees with you that affine should stay a bit more isolated, and that this explicit capture is really worth it right now, then feel free to move forward.

Actually, if you don't want explicit memref captures, you can just use scf.execute_region as is! There would be no difference besides the fact that the latter would be from the scf dialect mixed with affine dialect ops and being looked at by affine passes/utilities etc. for transformation purposes, which is fine. I have felt that having a list of memrefs on the operand list of an affine.execute_region simplifies everything (passes/utilities) doing a walk from the top - they deal with that op transparently just like any other non-region holding op that has memref operands. But I'm willing to hold on to this revision until the use case beyond what can already be served by scf.execute_region arrives. We need to perhaps first think about extending affine passes properly in the presence of scf.execute_region ops and reevaluate this.

bondhugula planned changes to this revision.Oct 26 2021, 3:14 AM

Herald added subscribers: Groverkss, wenzhicui, wrengr, Chia-hungDuan. · View Herald TranscriptOct 26 2021, 3:14 AM

Revision Contents

Path

Size

mlir/

include/

mlir/

Dialect/

Affine/

IR/

AffineOps.td

75 lines

lib/

Dialect/

Affine/

IR/

AffineOps.cpp

134 lines

test/

Dialect/

Affine/

execute-region.mlir

119 lines

invalid.mlir

56 lines

Diff 354666

mlir/include/mlir/Dialect/Affine/IR/AffineOps.td

Show First 20 Lines • Show All 107 Lines • ▼ Show 20 Lines let extraClassDeclaration = [{

operand_range getMapOperands() { return getOperands(); } operand_range getMapOperands() { return getOperands(); }

}]; }];

let hasCanonicalizer = 1; let hasCanonicalizer = 1;

let hasFolder = 1; let hasFolder = 1;

} }

def AffineExecuteRegionOp : Affine_Op<"execute_region", [AffineScope]> {

let summary = "execute_region operation";

let description = [{

The `affine.execute_region` op introduces a new symbol context for affine

operations. It holds a single region, which can be a list of one or more

ftynseUnsubmitted

Not Done

The `affine.execute_region` op introduces a new symbol context for affine

- operations. It holds a single region, which can be a list of one or more

+ operations. It holds a single region, which can be a list of one or more

blocks, and its semantics are to execute its region exactly once. The op's

ftynse:

mehdi_aminiUnsubmitted

Not Done

(formatting still to be fixed here)

mehdi_amini: (formatting still to be fixed here)

blocks, and its semantics are to execute its region exactly once. The op's

region can have zero or more arguments, each of which can only be a

memref. The operands bind 1:1 to its region's arguments. The op can't use

any memrefs defined outside of it, but can use any other SSA values that

dominate it. The results of a execute_region op match 1:1 with the return

values from its region's blocks;

Examples:

```mlir

affine.for %i = 0 to 128 {

affine.execute_region [%rI, %rM] = (%I, %M)

: (memref<128xi32>, memref<24xf32>) -> () {

%idx = affine.load %rI[%i] : memref<128xi32>

%index = index_cast %idx : i32 to index

affine.load %rM[%index]: memref<24xf32>

return

}

```

```mlir

affine.for %i = 0 to %n {

affine.execute_region : () -> () {

// %pow can now be used as a loop bound.

%pow = call @powi(%i) : (index) -> index

affine.for %j = 0 to %pow {

"foo"() : () -> ()

}

return

}

```

```mlir

affine.for %i = 0 to %n {

affine.execute_region : () -> () {

// %pow can now be used as a loop bound.

%pow = call @powi(%i) : (index) -> index

affine.for %j = 0 to %pow {

"foo"() : () -> ()

}

return

}

```

}];

let arguments = (ins Variadic<AnyMemRef>:$operands);

let results = (outs Variadic<AnyType>);

let regions = (region AnyRegion:$region);

let skipDefaultBuilders = 1;

let builders = [

OpBuilder<(ins "ValueRange":$memrefs)>

];

// TODO: canonicalizations related to memrefs.

let hasCanonicalizer = 0;

}

def AffineForOp : Affine_Op<"for", def AffineForOp : Affine_Op<"for",

[ImplicitAffineTerminator, RecursiveSideEffects, [ImplicitAffineTerminator, RecursiveSideEffects,

DeclareOpInterfaceMethods<LoopLikeOpInterface>]> { DeclareOpInterfaceMethods<LoopLikeOpInterface>]> {

let summary = "for operation"; let summary = "for operation";

let description = [{ let description = [{

Syntax: Syntax:

``` ```

▲ Show 20 Lines • Show All 754 Lines • ▼ Show 20 Lines def AffineStoreOp : AffineStoreOpBase<"store"> {

]; ];

let extraClassDeclaration = extraClassDeclarationBase; let extraClassDeclaration = extraClassDeclarationBase;

let hasCanonicalizer = 1; let hasCanonicalizer = 1;

let hasFolder = 1; let hasFolder = 1;

} }

def AffineYieldOp : Affine_Op<"yield", [NoSideEffect, Terminator, ReturnLike, def AffineYieldOp : Affine_Op<"yield", [

MemRefsNormalizable]> { NoSideEffect, Terminator, ReturnLike, MemRefsNormalizable,

ParentOneOf <

["AffineExecuteRegionOp, AffineForOp, AffineIfOp, AffineParallelOp"]

>]> {

let summary = "Yield values to parent operation"; let summary = "Yield values to parent operation";

let description = [{ let description = [{

"affine.yield" yields zero or more SSA values from an affine op region and "affine.yield" yields zero or more SSA values from an affine op region and

terminates the region. The semantics of how the values yielded are used terminates the region. The semantics of how the values yielded are used

is defined by the parent operation. is defined by the parent operation.

If "affine.yield" has any operands, the operands must match the parent If "affine.yield" has any operands, the operands must match the parent

operation's results. operation's results.

If the parent operation defines no values, then the "affine.yield" may be If the parent operation defines no values, then the "affine.yield" may be

▲ Show 20 Lines • Show All 134 Lines • ▼ Show 20 Lines let extraClassDeclaration = extraClassDeclarationBase # [{

VectorType getVectorType() { VectorType getVectorType() {

return value().getType().cast<VectorType>(); return value().getType().cast<VectorType>();

} }

}]; }];

let hasCanonicalizer = 1; let hasCanonicalizer = 1;

} }

#endif // AFFINE_OPS #endif // AFFINE_OPS

ftynseUnsubmitted

Done

I understand the idea of the restriction, but it looks like it can be circumvented by:

%0 = ... : memref<...>
%1 = cheat.wrap %0 : memref<...> -> !cheat.opaque
affine.execute_region {
  // This defines the memref inside the region, so seemingly complies with the semantics.
  %2 = cheat.unwrap %1 : !cheat.opaque -> memref<...>
  affine.load %2[...]
}

ftynse: I understand the idea of the restriction, but it looks like it can be circumvented by: ``` %0 =…

bondhugulaAuthorUnsubmitted

Done

Independently of affine.execute_region, such a problem exists with memref non-dereferencing ops and it can / has to be tackled. For example, consider this variation of your snippet (no affine.execute_region).

%0 = ... : memref<...>
%1 = cheat.wrap %0 : memref<...> -> !cheat.opaque
%2 = cheat.load %1[0, 0] : !cheat.opaque
cheat.store %v, %1[0, 0] : !cheat.opaque
call @foo(%1) : (!cheat.opaque) -> ()
affine.load %0[0, 0] : memref<...>

Note that all the current passes/utilities including affine store to load fwd'ing, invariant load hoisting/scalar rep, dependence analysis itself, the affine loop fusion would do the wrong thing here because there is an escape side channel like you show. So, the thing you are pointing to is interesting and has to be tackled, but if we step back and take another look, this is the same as the larger issue of dealing with escaping or aliasing and not specific to execute_region. A solution to deal with these is to actually detect/treat unknown memref non-dereferencing ops that define SSA values (like your cheat.unwrap) and bail out in their presence (depending on what we are doing). For eg. the dependence information isn't going to be accurate in their presence. The point of explicit captures is that you know which memrefs are going in, but it isn't free of the problem of side-channel escapes that manifest in straightline code themselves. But whenever we don't have such escapes (and we can always detect that being conservative), which I believe is the common scenario, having the captures does serve its intended purpose -- you still won't have to look inside and do something special with these ops in all passes/utilities.

bondhugula: Independently of affine.execute_region, such a problem exists with memref non-dereferencing ops…

ftynseUnsubmitted

Done

In the general case, I suppose it can be even worse than that.

func @foo(%arg0: !cheat.opaque, %arg1: memref<..>) { // opaque and memref alias
  cheat.do_cheat %arg0 // this affects the memref
  affine.load %arg1[...]
}

which looks like we either need a powerful and abstract enough way to describe the aliasing between objects of different types, or to treat any side-effecting operation conservatively.

Anyway, I agree with the argument that the proposed approach is no worse than what we already have in Affine and I would like to make progress on this.

ftynse: In the general case, I suppose it can be even worse than that. ``` func @foo(%arg0: !cheat.

bondhugulaAuthorUnsubmitted

Done

which looks like we either need a powerful and abstract enough way to describe the
aliasing between objects of different types, or to treat any side-effecting operation

That's right, thanks.

bondhugula: >which looks like we either need a powerful and abstract enough way to describe the >aliasing…

ftynseUnsubmitted

Done

I would avoid referring FuncOp, it is not special in any sense, and it actually allows any terminator. Neither would I remind that blocks must have terminators, it's a core IR requirement.

"returns to right after the affine.execute_region op" sounds unnecessarily complex to me. "std.return" terminator placed inside blocks of the "affine.execute_region" returns the control flow to the "affine.execute_region". Since we already mentioned that "execution_region" executes the region exactly ones, it is naturally implied that the control flow will be further transferred to the control-flow successor of "execution_region...

ftynse: I would avoid referring FuncOp, it is not special in any sense, and it actually allows any…

bondhugulaAuthorUnsubmitted

Done

Sure, dropped these lines.

bondhugula: Sure, dropped these lines.

ftynseUnsubmitted

Done

Syntax nit: I'd consider omitting empty [] = (), looks distracting

ftynse: Syntax nit: I'd consider omitting empty `[] = ()`, looks distracting

bondhugulaAuthorUnsubmitted

Done

@ftynse Unfortunately, the parser API won't support this unless we want to disallow [] = () and even there we won't be able to emit the right error message. (Basically we won't know whether to parse the =.)

bondhugula: @ftynse Unfortunately, the parser API won't support this unless we want to disallow `[] = ()`…

mlir/lib/Dialect/Affine/IR/AffineOps.cpp

Show First 20 Lines • Show All 225 Lines • ▼ Show 20 Lines	bool mlir::isTopLevelValue(Value value) {
if (auto arg = value.dyn_cast<BlockArgument>()) {		if (auto arg = value.dyn_cast<BlockArgument>()) {
// The block owning the argument may be unlinked, e.g. when the surrounding		// The block owning the argument may be unlinked, e.g. when the surrounding
// region has not yet been attached to an Op, at which point the parent Op		// region has not yet been attached to an Op, at which point the parent Op
// is null.		// is null.
Operation *parentOp = arg.getOwner()->getParentOp();		Operation *parentOp = arg.getOwner()->getParentOp();
return parentOp && parentOp->hasTrait<OpTrait::AffineScope>();		return parentOp && parentOp->hasTrait<OpTrait::AffineScope>();
}		}
// The defining Op may live in an unlinked block so its parent Op may be null.		// The defining Op may live in an unlinked block so its parent Op may be null.
Operation *parentOp = value.getDefiningOp()->getParentOp();		Operation *parentOp = value.getDefiningOp()->getParentOp();
		ftynseUnsubmitted Done Reply Inline Actions Nit: wouldn't it be easier to accept the region as argument instead of the operation that contains it? ftynse: Nit: wouldn't it be easier to accept the region as argument instead of the operation that…
		bondhugulaAuthorUnsubmitted Done Reply Inline Actions Actually, it is - thanks! bondhugula: Actually, it is - thanks!
return parentOp && parentOp->hasTrait<OpTrait::AffineScope>();		return parentOp && parentOp->hasTrait<OpTrait::AffineScope>();
}		}

/// Returns the closest region enclosing `op` that is held by an operation with		/// Returns the closest region enclosing `op` that is held by an operation with
/// trait `AffineScope`; `nullptr` if there is no such region.		/// trait `AffineScope`; `nullptr` if there is no such region.
// TODO: getAffineScope should be publicly exposed for affine passes/utilities.		// TODO: getAffineScope should be publicly exposed for affine passes/utilities.
static Region getAffineScope(Operation op) {		static Region getAffineScope(Operation op) {
auto *curOp = op;		auto *curOp = op;
while (auto *parentOp = curOp->getParentOp()) {		while (auto *parentOp = curOp->getParentOp()) {
if (parentOp->hasTrait<OpTrait::AffineScope>())		if (parentOp->hasTrait<OpTrait::AffineScope>())
return curOp->getParentRegion();		return curOp->getParentRegion();
curOp = parentOp;		curOp = parentOp;
}		}
		ftynseUnsubmitted Done Reply Inline Actions Would variadic template help you? `isKnownIsolatedFromAbove` isn't a class... ftynse: Would variadic template help you? `isKnownIsolatedFromAbove` isn't a class...
		bondhugulaAuthorUnsubmitted Done Reply Inline Actions Right - it won't; the comment is probably stale (it was FuncOp and AffineExecuteRegionOp earlier). Anyway this code will be updated to make use of a new op trait 'PolyhedralScope' in another revision, which I'll submit as this one's parent. AffineExecuteRegion will be marked with this trait and other ops that want to define new scopes can have that trait. bondhugula: Right - it won't; the comment is probably stale (it was FuncOp and AffineExecuteRegionOp…
return nullptr;		return nullptr;
}		}

// A Value can be used as a dimension id iff it meets one of the following		// A Value can be used as a dimension id iff it meets one of the following
// conditions:		// conditions:
// *) It is valid as a symbol.		// *) It is valid as a symbol.
// *) It is an induction variable.		// *) It is an induction variable.
		ftynseUnsubmitted Done Reply Inline Actions The assert message is misleading. The op may have parent ops, just none that satisfy the "affine scope" conditions. Also, use `llvm_unreachable` instead of `assert(false)` and drop the return value ftynse: The assert message is misleading. The op may have parent ops, just none that satisfy the…
		bondhugulaAuthorUnsubmitted Done Reply Inline Actions Thanks. bondhugula: Thanks.
// *) It is the result of affine apply operation with dimension id arguments.		// *) It is the result of affine apply operation with dimension id arguments.
bool mlir::isValidDim(Value value) {		bool mlir::isValidDim(Value value) {
// The value must be an index type.		// The value must be an index type.
if (!value.getType().isIndex())		if (!value.getType().isIndex())
return false;		return false;

if (auto *defOp = value.getDefiningOp())		if (auto *defOp = value.getDefiningOp())
return isValidDim(value, getAffineScope(defOp));		return isValidDim(value, getAffineScope(defOp));
▲ Show 20 Lines • Show All 48 Lines • ▼ Show 20 Lines	static bool isMemRefSizeValidSymbol(AnyMemRefDefOp memrefDefOp, unsigned index,
if (!memRefType.isDynamicDim(index))		if (!memRefType.isDynamicDim(index))
return true;		return true;
// Get the position of the dimension among dynamic dimensions;		// Get the position of the dimension among dynamic dimensions;
unsigned dynamicDimPos = memRefType.getDynamicDimIndex(index);		unsigned dynamicDimPos = memRefType.getDynamicDimIndex(index);
return isValidSymbol(*(memrefDefOp.getDynamicSizes().begin() + dynamicDimPos),		return isValidSymbol(*(memrefDefOp.getDynamicSizes().begin() + dynamicDimPos),
region);		region);
}		}

/// Returns true if the result of the dim op is a valid symbol for `region`.		/// Returns true if the result of the dim op is a valid symbol for `region`.
static bool isDimOpValidSymbol(memref::DimOp dimOp, Region *region) {		static bool isDimOpValidSymbol(memref::DimOp dimOp, Region *region) {
		ftynseUnsubmitted Done Reply Inline Actions I cannot relate this comment to the code below. ftynse: I cannot relate this comment to the code below.
// The dim op is okay if its operand memref is defined at the top level.		// The dim op is okay if its operand memref is defined at the top level.
if (isTopLevelValue(dimOp.memrefOrTensor()))		if (isTopLevelValue(dimOp.memrefOrTensor()))
		ftynseUnsubmitted Done Reply Inline Actions Shouldn't this also check for "KnownIsolatedFromAbove" ? ftynse: Shouldn't this also check for "KnownIsolatedFromAbove" ?
		bondhugulaAuthorUnsubmitted Done Reply Inline Actions This was checked as part of isValidSymbol above. bondhugula: This was checked as part of isValidSymbol above.
return true;		return true;

// Conservatively handle remaining BlockArguments as non-valid symbols.		// Conservatively handle remaining BlockArguments as non-valid symbols.
// E.g. scf.for iterArgs.		// E.g. scf.for iterArgs.
if (dimOp.memrefOrTensor().isa<BlockArgument>())		if (dimOp.memrefOrTensor().isa<BlockArgument>())
return false;		return false;

// The dim op is also okay if its operand memref is a view/subview whose		// The dim op is also okay if its operand memref is a view/subview whose
▲ Show 20 Lines • Show All 757 Lines • ▼ Show 20 Lines	LogicalResult AffineDmaStartOp::verify() {
unsigned numInputsAllMaps = getSrcMap().getNumInputs() +		unsigned numInputsAllMaps = getSrcMap().getNumInputs() +
getDstMap().getNumInputs() +		getDstMap().getNumInputs() +
getTagMap().getNumInputs();		getTagMap().getNumInputs();
if (getNumOperands() != numInputsAllMaps + 3 + 1 &&		if (getNumOperands() != numInputsAllMaps + 3 + 1 &&
getNumOperands() != numInputsAllMaps + 3 + 1 + 2) {		getNumOperands() != numInputsAllMaps + 3 + 1 + 2) {
return emitOpError("incorrect number of operands");		return emitOpError("incorrect number of operands");
}		}

Region scope = getAffineScope(this);		Region scope = getAffineScope(this);
		ftynseUnsubmitted Done Reply Inline Actions Nit: we tend to use `auto` when it increases readability, `Operation ` would look just fine here ftynse:* Nit: we tend to use `auto` when it increases readability, `Operation *` would look just fine…
		bondhugulaAuthorUnsubmitted Done Reply Inline Actions Sure. bondhugula: Sure.
for (auto idx : getSrcIndices()) {		for (auto idx : getSrcIndices()) {
if (!idx.getType().isIndex())		if (!idx.getType().isIndex())
return emitOpError("src index to dma_start must have 'index' type");		return emitOpError("src index to dma_start must have 'index' type");
if (!isValidAffineIndexOperand(idx, scope))		if (!isValidAffineIndexOperand(idx, scope))
return emitOpError("src index must be a dimension or symbol identifier");		return emitOpError("src index must be a dimension or symbol identifier");
}		}
for (auto idx : getDstIndices()) {		for (auto idx : getDstIndices()) {
if (!idx.getType().isIndex())		if (!idx.getType().isIndex())
▲ Show 20 Lines • Show All 1,550 Lines • ▼ Show 20 Lines

LogicalResult AffinePrefetchOp::fold(ArrayRef<Attribute> cstOperands,		LogicalResult AffinePrefetchOp::fold(ArrayRef<Attribute> cstOperands,
SmallVectorImpl<OpFoldResult> &results) {		SmallVectorImpl<OpFoldResult> &results) {
/// prefetch(memrefcast) -> prefetch		/// prefetch(memrefcast) -> prefetch
return foldMemRefCast(*this);		return foldMemRefCast(*this);
}		}

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
		// AffineExecuteRegionOp
		//===----------------------------------------------------------------------===//
		//

		// TODO: missing region body.
		void AffineExecuteRegionOp::build(OpBuilder &builder, OperationState &result,
		ValueRange memrefs) {
		ftynseUnsubmitted Done Reply Inline Actions Prefer ValueRange to ArrayRef<Value> ftynse: Prefer ValueRange to ArrayRef<Value>
		// Create a region and an empty entry block. The arguments of the region are
		// the supplied memrefs.
		Region *region = result.addRegion();
		Block *body = new Block();
		ftynseUnsubmitted Done Reply Inline Actions We can now do `builder.createBlock` ftynse: We can now do `builder.createBlock`
		bondhugulaAuthorUnsubmitted Done Reply Inline Actions Unfortunately, using this messes up the insertion point subsequently - outweighs the idiomacy it provides and I'd like to avoid using a scope guard. bondhugula: Unfortunately, using this messes up the insertion point subsequently - outweighs the idiomacy…
		region->push_back(body);
		body->addArguments(memrefs.getTypes());
		}

		static LogicalResult verify(AffineExecuteRegionOp op) {
		ftynseUnsubmitted Done Reply Inline Actions Nit: drop trivial braces ftynse: Nit: drop trivial braces
		// All memref uses in the execute_region region should be explicitly captured.
		// FIXME: change this walk to an affine walk that doesn't walk inner
		// execute_regions.
		ftynseUnsubmitted Done Reply Inline Actions If you have ValueRange, this entire vector manipulation gets replaced by `memrefs.getTypes()` ftynse: If you have ValueRange, this entire vector manipulation gets replaced by `memrefs.getTypes()`
		bondhugulaAuthorUnsubmitted Done Reply Inline Actions Sure, this was really old code - that didn't get updated post ValueRange/TypeRange migrations! bondhugula: Sure, this was really old code - that didn't get updated post ValueRange/TypeRange migrations!
		DenseSet<Value> memrefsUsed;
		ftynseUnsubmitted Done Reply Inline Actions You already pushed it in line 2295 ftynse: You already pushed it in line 2295
		bondhugulaAuthorUnsubmitted Done Reply Inline Actions Thanks! bondhugula: Thanks!
		op.region().walk([&](Operation *innerOp) {
		for (Value v : innerOp->getOperands())
		if (v.getType().isa<MemRefType>())
		memrefsUsed.insert(v);
		});

		// For each memref use, ensure either an execute_region argument or a local
		// def.
		auto implicitUse = [&](Value memref) {
		Operation *memrefOriginOp;
		if (auto arg = memref.dyn_cast<BlockArgument>())
		memrefOriginOp = arg.getOwner()->getParentOp();
		else
		memrefOriginOp = memref.getDefiningOp();
		return !op.getOperation()->isAncestor(memrefOriginOp);
		};
		mehdi_aminiUnsubmitted Done Reply Inline Actions You don't need an `if` here. mehdi_amini: You don't need an `if` here.
		bondhugulaAuthorUnsubmitted Done Reply Inline Actions You actually do :-) (Notice the continue above is guarded again inside. ) - although I could better use an else here. bondhugula: You actually do :-) (Notice the continue above is guarded again inside. ) - although I could…
		if (llvm::any_of(memrefsUsed, implicitUse))
		return op.emitOpError("used memref not explicitly captured");

		// Verify that the region arguments match operands.
		mehdi_aminiUnsubmitted Done Reply Inline Actions Seems like you can fix the FIXME and just write: `if(op->isProperAncestor(memref.getDefiningOp()) continue;` mehdi_amini: Seems like you can fix the FIXME and just write: `if(op->isProperAncestor(memref.getDefiningOp…
		bondhugulaAuthorUnsubmitted Done Reply Inline Actions Thanks, but the block arg owner check above also has to be fixed similarly to allow it to be any descendent of 'op'. Done - used a lambda with an all_of. bondhugula: Thanks, but the block arg owner check above also has to be fixed similarly to allow it to be…
		auto &entryBlock = op.region().front();
		if (entryBlock.getNumArguments() != op.getNumOperands())
		return op.emitOpError("region argument count does not match operand count");

		for (auto argEn : llvm::enumerate(entryBlock.getArguments())) {
		if (op.getOperand(argEn.index()).getType() != argEn.value().getType())
		return op.emitOpError("region argument ")
		<< argEn.index() << " does not match corresponding operand";
		}

		return success();
		}

		// Custom form syntax.
		//
		// (ssa-id `=`)? `affine.execute_region` (`[` memref-region-arg-list `]`
		ftynseUnsubmitted Done Reply Inline Actions Avoid capturing `llvm::enumerate` by-reference. An enumerator only keeps iterators, so taking copying it is cheap, and we avoid running into potential problems with implicit lifetime extension ftynse: Avoid capturing `llvm::enumerate` by-reference. An enumerator only keeps iterators, so taking…
		// `=` `(` memref-use-list `)`)?
		// `:` memref-type-list-parens `->` function-result-type `{`
		ftynseUnsubmitted Done Reply Inline Actions Since you already have an enumerator, you might as well mention the position of the mismatching argument. ftynse: Since you already have an enumerator, you might as well mention the position of the mismatching…
		// block+
		// `}`
		//
		// Ex:
		//
		// affine.execute_region [%rI, %rM] = (%I, %M)
		// : (memref<128xi32>, memref<1024xf32>) -> () {
		// %idx = affine.load %rI[%i] : memref<128xi32>
		// %index = index_cast %idx : i32 to index
		// affine.load %rM[%index]: memref<1024xf32>
		// return
		// }
		//
		static ParseResult parseAffineExecuteRegionOp(OpAsmParser &parser,
		OperationState &result) {
		// Memref operands.
		SmallVector<OpAsmParser::OperandType, 4> memrefs;

		// Region arguments to be created.
		SmallVector<OpAsmParser::OperandType, 4> regionMemRefs;

		// The execute_region op has the same type signature as a function.
		FunctionType opType;

		// Parse the memref assignments.
		auto argLoc = parser.getCurrentLocation();
		if (parser.parseRegionArgumentList(regionMemRefs,
		OpAsmParser::Delimiter::Square) \|\|
		parser.parseEqual() \|\|
		parser.parseOperandList(memrefs, OpAsmParser::Delimiter::Paren))
		return failure();

		if (memrefs.size() != regionMemRefs.size())
		return parser.emitError(parser.getNameLoc(),
		"incorrect number of memref captures");

		if (parser.parseColonType(opType) \|\|
		parser.addTypesToList(opType.getResults(), result.types))
		return failure();

		auto memrefTypes = opType.getInputs();
		if (parser.resolveOperands(memrefs, memrefTypes, argLoc, result.operands))
		return failure();

		// Introduce and parse body region, and the optional attribute list.
		Region *body = result.addRegion();
		if (parser.parseRegion(*body, regionMemRefs, memrefTypes) \|\|
		parser.parseOptionalAttrDict(result.attributes))
		return failure();

		return success();
		}

		static void print(OpAsmPrinter &p, AffineExecuteRegionOp op) {
		p << AffineExecuteRegionOp::getOperationName() << " [";
		// TODO: consider shadowing region arguments.
		p.printOperands(op.region().front().getArguments());
		p << "] = (";
		auto operands = op.getOperands();
		p.printOperands(operands);
		p << ") ";

		SmallVector<Type, 4> argTypes(op.getOperandTypes());
		ftynseUnsubmitted Done Reply Inline Actions You already parser the attr dict 4 lines above. ftynse: You already parser the attr dict 4 lines above.
		p << " : "
		<< FunctionType::get(op->getContext(), argTypes, op.getResultTypes());

		p.printRegion(op.region(),
		/printEntryBlockArgs=/false,
		/printBlockTerminators=/true);

		p.printOptionalAttrDict(op->getAttrs());
		}

		//===----------------------------------------------------------------------===//
// AffineParallelOp		// AffineParallelOp
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

void AffineParallelOp::build(OpBuilder &builder, OperationState &result,		void AffineParallelOp::build(OpBuilder &builder, OperationState &result,
TypeRange resultTypes,		TypeRange resultTypes,
ArrayRef<AtomicRMWKind> reductions,		ArrayRef<AtomicRMWKind> reductions,
ArrayRef<int64_t> ranges) {		ArrayRef<int64_t> ranges) {
SmallVector<AffineMap> lbs(ranges.size(), builder.getConstantAffineMap(0));		SmallVector<AffineMap> lbs(ranges.size(), builder.getConstantAffineMap(0));
▲ Show 20 Lines • Show All 601 Lines • ▼ Show 20 Lines
// AffineYieldOp		// AffineYieldOp
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

static LogicalResult verify(AffineYieldOp op) {		static LogicalResult verify(AffineYieldOp op) {
auto *parentOp = op->getParentOp();		auto *parentOp = op->getParentOp();
auto results = parentOp->getResults();		auto results = parentOp->getResults();
auto operands = op.getOperands();		auto operands = op.getOperands();

if (!isa<AffineParallelOp, AffineIfOp, AffineForOp>(parentOp))
return op.emitOpError() << "only terminates affine.if/for/parallel regions";
if (parentOp->getNumResults() != op.getNumOperands())		if (parentOp->getNumResults() != op.getNumOperands())
return op.emitOpError() << "parent of yield must have same number of "		return op.emitOpError() << "parent of yield must have same number of "
"results as the yield operands";		"results as the yield operands";
for (auto it : llvm::zip(results, operands)) {		for (auto it : llvm::zip(results, operands)) {
if (std::get<0>(it).getType() != std::get<1>(it).getType())		if (std::get<0>(it).getType() != std::get<1>(it).getType())
return op.emitOpError()		return op.emitOpError()
<< "types mismatch between yield op and its parent";		<< "types mismatch between yield op and its parent";
}		}
▲ Show 20 Lines • Show All 194 Lines • Show Last 20 Lines

mlir/test/Dialect/Affine/execute-region.mlir

This file was added.

				// RUN: mlir-opt %s \| mlir-opt -verify-diagnostics \| FileCheck %s

				// CHECK-LABEL: @arbitrary_bound
				func @arbitrary_bound(%n : index) {
				affine.for %i = 0 to %n {
				affine.execute_region [] = () : () -> () {
				// %pow can now be used as a loop bound.
				%pow = call @powi(%i) : (index) -> index
				affine.for %j = 0 to %pow {
				"test.foo"() : () -> ()
				}
				affine.yield
				}
				// CHECK: affine.execute_region [] = () : () -> () {
				// CHECK-NEXT: call @powi
				// CHECK-NEXT: affine.for
				// CHECK-NEXT: "test.foo"()
				// CHECK-NEXT: }
				// CHECK-NEXT: affine.yield
				// CHECK-NEXT: }
				}
				return
				}

				func private @powi(index) -> index

				// CHECK-LABEL: func @arbitrary_mem_access
				func @arbitrary_mem_access(%I: memref<128xi32>, %M: memref<1024xf32>) {
				affine.for %i = 0 to 128 {
				// CHECK: %{{.}} = affine.execute_region [{{.}}] = ({{.*}}) : (memref<128xi32>, memref<1024xf32>) -> f32
				%ret = affine.execute_region [%rI, %rM] = (%I, %M) : (memref<128xi32>, memref<1024xf32>) -> f32 {
				%idx = affine.load %rI[%i] : memref<128xi32>
				%index = index_cast %idx : i32 to index
				%v = affine.load %rM[%index]: memref<1024xf32>
				affine.yield %v : f32
				}
				}
				return
				}

				// CHECK-LABEL: @symbol_check
				func @symbol_check(%B: memref<100xi32>, %A: memref<100xf32>) {
				%cf1 = constant 1.0 : f32
				affine.for %i = 0 to 100 {
				%v = affine.load %B[%i] : memref<100xi32>
				%vo = index_cast %v : i32 to index
				// CHECK: affine.execute_region [%{{.}}] = (%{{.}}) : (memref<100xf32>) -> () {
				affine.execute_region [%rA] = (%A) : (memref<100xf32>) -> () {
				// %vi is now a symbol here.
				%vi = index_cast %v : i32 to index
				affine.load %rA[%vi] : memref<100xf32>
				// %vo is also a symbol (dominates the execute_region).
				affine.load %rA[%vo] : memref<100xf32>
				affine.yield
				}
				// CHECK: index_cast
				// CHECK-NEXT: affine.load
				// CHECK-NEXT: affine.load
				// CHECK-NEXT: affine.yield
				// CHECK-NEXT: }
				}
				return
				}

				// CHECK-LABEL: func @test_more_symbol_validity
				func @test_more_symbol_validity(%A: memref<100xf32>, %pos : index) {
				%c5 = constant 5 : index
				affine.for %i = 0 to 100 {
				%sym = call @external() : () -> (index)
				affine.execute_region [%rA] = (%A) : (memref<100xf32>) -> () {
				affine.load %rA[symbol(%pos) + symbol(%sym) + %c5] : memref<100xf32>
				affine.yield
				}
				}
				affine.execute_region [%rA] = (%A) : (memref<100xf32>) -> () {
				affine.load %rA[symbol(%pos) + %c5] : memref<100xf32>
				affine.yield
				}
				return
				}

				func private @external() -> (index)

				// CHECK-LABEL: func @search
				func @search(%A : memref<?x?xi32>, %S : memref<?xi32>, %key : i32) {
				%c0 = constant 0 : index
				%c1 = constant 1 : index
				%ni = memref.dim %A, %c0 : memref<?x?xi32>
				// This loop can be parallelized.
				affine.for %i = 0 to %ni {
				// CHECK: affine.execute_region
				affine.execute_region [%rA, %rS] = (%A, %S) : (memref<?x?xi32>, memref<?xi32>) -> () {
				%nj = memref.dim %rA, %c1 : memref<?x?xi32>
				br ^bb1(%c0 : index)

				^bb1(%j: index):
				%p1 = cmpi "slt", %j, %nj : index
				cond_br %p1, ^bb2(%j : index), ^bb5

				^bb2(%j_arg : index):
				%v = affine.load %rA[%i, %j_arg] : memref<?x?xi32>
				%p2 = cmpi "eq", %v, %key : i32
				cond_br %p2, ^bb3(%j_arg : index), ^bb4(%j_arg : index)

				^bb3(%j_arg2: index):
				%j_int = index_cast %j_arg2 : index to i32
				affine.store %j_int, %rS[%i] : memref<?xi32>
				br ^bb5

				^bb4(%j_arg3 : index):
				%jinc = addi %j_arg3, %c1 : index
				br ^bb1(%jinc : index)

				^bb5:
				affine.yield
				}
				}
				return
				}

mlir/test/Dialect/Affine/invalid.mlir

Show First 20 Lines • Show All 136 Lines • ▼ Show 20 Lines	func @affine_store_missing_l_square(%C: memref<4096x4096xf32>) {
%9 = constant 0.0 : f32		%9 = constant 0.0 : f32
// expected-error@+1 {{expected '['}}		// expected-error@+1 {{expected '['}}
affine.store %9, %C : memref<4096x4096xf32>		affine.store %9, %C : memref<4096x4096xf32>
return		return
}		}

// -----		// -----

		// CHECK-LABEL: @affine.execute_region_missing_capture
		func @affine.execute_region_missing_capture(%M : memref<2xi32>) {
		affine.for %i = 0 to 10 {
		affine.execute_region [] = () : () -> () {
		// expected-error@-1 {{used memref not explicitly captured}}
		affine.load %M[%i] : memref<2xi32>
		}
		}
		return
		}

		// -----

		// CHECK-LABEL: @affine.execute_region_wrong_capture
		func @affine.execute_region_wrong_capture(%s : index) {
		affine.execute_region [%rS] = (%s) : (index) -> () {
		// expected-error@-1 {{operand #0 must be memref}}
		"use"(%s) : (index) -> ()
		}
		}

		// -----

		// CHECK-LABEL: @affine.execute_region_wrong_capture
		func @affine.execute_region_wrong_capture(%A : memref<2xi32>) {
		affine.execute_region [] = (%A) : (memref<2xi32>) -> () {
		// expected-error@-1 {{incorrect number of memref captures}}
		}
		return
		}

		// -----

		// CHECK-LABEL: @affine.execute_region_region_type_mismatch
		func @affine.execute_region_region_type_mismatch(%A : memref<2xi32>) {
		"affine.execute_region"(%A) ({
		// expected-error@-1 {{region argument 0 does not match corresponding operand}}
		^bb0(%rA : memref<4xi32>):
		return
		}) : (memref<2xi32>) -> ()
		}

		// -----

		// CHECK-LABEL: @affine.execute_region_region_arg_count_mismatch
		func @affine.execute_region_region_arg_count_mismatch(%A : memref<2xi32>) {
		"affine.execute_region"(%A) ({
		// expected-error@-1 {{region argument count does not match operand count}}
		^bb0:
		return
		}) : (memref<2xi32>) -> ()
		return
		}

		// -----

func @affine_min(%arg0 : index, %arg1 : index, %arg2 : index) {		func @affine_min(%arg0 : index, %arg1 : index, %arg2 : index) {
// expected-error@+1 {{operand count and affine map dimension and symbol count must match}}		// expected-error@+1 {{operand count and affine map dimension and symbol count must match}}
%0 = affine.min affine_map<(d0) -> (d0)> (%arg0, %arg1)		%0 = affine.min affine_map<(d0) -> (d0)> (%arg0, %arg1)

return		return
}		}

// -----		// -----
▲ Show 20 Lines • Show All 221 Lines • Show Last 20 Lines

This is an archive of the discontinued LLVM Phabricator instance.

[MLIR] Introduce affine.execute_region opChanges PlannedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 354666

mlir/include/mlir/Dialect/Affine/IR/AffineOps.td

mlir/lib/Dialect/Affine/IR/AffineOps.cpp

mlir/test/Dialect/Affine/execute-region.mlir

mlir/test/Dialect/Affine/invalid.mlir

[MLIR] Introduce affine.execute_region op
Changes PlannedPublic