This is an archive of the discontinued LLVM Phabricator instance.

Differential D98390

Added static verification for Linalg Ops.
ClosedPublic

Authored by • inho9606 on Mar 10 2021, 9:00 PM.

Details

Reviewers

hanchung
aartbik
nicolasvasilache
mravishankar

Commits

rGf58463345415: Added static verification for Linalg Ops.

Summary

This patch adds a static verifier check that the indices used to access statically shaped operands of Linalg ops stay within bounds. Dynamically shaped operands are not covered here; they could be checked at runtime.
The check found 3 existing test cases with out-of-bounds accesses, and those test cases are fixed in this patch.
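
To make the idea concrete, here is a minimal, self-contained sketch of such a check in plain C++ rather than MLIR. It assumes every indexing-map result is a linear expression with non-negative coefficients, so the largest index is reached at the last iteration of each loop. `LinearExpr`, `evaluate`, and `lastAccessInBounds` are hypothetical names for illustration, not part of the patch or of MLIR.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical stand-in for one result of an affine map:
// constant + sum(coeffs[k] * d_k) over the loop dimensions d_k.
struct LinearExpr {
  std::vector<int64_t> coeffs;
  int64_t constant;
};

// Evaluates the expression at the given loop indices.
int64_t evaluate(const LinearExpr &expr, const std::vector<int64_t> &dims) {
  assert(expr.coeffs.size() == dims.size() && "one coefficient per loop dim");
  int64_t value = expr.constant;
  for (std::size_t k = 0; k < dims.size(); ++k)
    value += expr.coeffs[k] * dims[k];
  return value;
}

// Returns true if the access made at the last iteration of every loop
// (index = range - 1) stays inside the operand's static shape.
bool lastAccessInBounds(const std::vector<LinearExpr> &indexingMap,
                        const std::vector<int64_t> &loopRanges,
                        const std::vector<int64_t> &shape) {
  assert(indexingMap.size() == shape.size() && "one map result per dim");
  std::vector<int64_t> lastIteration;
  for (int64_t range : loopRanges)
    lastIteration.push_back(range - 1);
  for (std::size_t j = 0; j < shape.size(); ++j)
    if (evaluate(indexingMap[j], lastIteration) >= shape[j])
      return false;
  return true;
}
```

With loop ranges 5x7x3 and a 3x35 operand, the map (d0, d1, d2) -> (d2, d1 + d0 * 7) passes under this sketch because its last access is (2, 34), while (d0, d1, d2) -> (d1, d0 + d2 * 7) fails because d1 reaches 6 in a dimension of size 3.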

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

• inho9606 created this revision. Mar 10 2021, 9:00 PM

Herald added a reviewer: aartbik. Mar 10 2021, 9:00 PM

Herald added subscribers: dcaballe, cota, mravishankar and 17 others.

• inho9606 requested review of this revision. Mar 10 2021, 9:00 PM

Herald added a reviewer: nicolasvasilache. Mar 10 2021, 9:00 PM

Herald added a project: Restricted Project.

Herald added subscribers: limo1996, stephenneuendorffer, nicolasvasilache.

hanchung requested changes to this revision. Mar 11 2021, 2:49 AM

hanchung added inline comments.

mlir/lib/Dialect/Linalg/IR/LinalgInterfaces.cpp
436	Check the return value
437	The LLVM style is `for (unsigned i = 0, e = ...; i < e; ...)`
440–441	This can be `getShapedOperandTypes()[i]`. I think it is better to have a `getShapedOperandType(i)`, could you add the interface method?
444	You should check if there are any symbols before calling compose method. Otherwise, it will hit an assertion in the method.
446	Why check with `>=`? Can we check if they are the same, ie with `==`? You can know the exact shape if you have loop boundary.
mlir/test/Dialect/Linalg/reshape_linearization_fusion.mlir
170–173	Below is more straightforward to me:
    #map2 = affine_map<(d0, d1, d2) -> (d2, d0, d1)>
    #map3 = affine_map<(d0, d1, d2) -> (d0, d1, d2)>
Does it work?
171–173	please help fix this typo since you touch this section. s/permultation/permutation

This revision now requires changes to proceed.Mar 11 2021, 2:49 AM

Harbormaster completed remote builds in B93216: Diff 329837.Mar 11 2021, 5:10 AM

@aartbik could you help check if the change in sparse_nd.mlir is correct?

Probably need a test that fails as well and verify the error message. You can use -verify-diagnostics on the command line, and match the error using // expected-error . See other tests that have this.

In D98390#2620390, @mravishankar wrote:

Probably need a test that fails as well and verify the error message. You can use -verify-diagnostics on the command line, and match the error using // expected-error . See other tests that have this.

Yes, totally agree! Please add a test to test/Dialect/Linalg/invalid.mlir.

mehdi_amini added inline comments.Mar 11 2021, 12:51 PM

mlir/lib/Dialect/Linalg/IR/LinalgInterfaces.cpp
437	Alternatively, `for (int i : llvm::seq<int>(0, (*loopRanges).size()))`
But even better any time you don't need the index:
    for (int64_t &range : *loopRanges) range -= 1;
Also good to know the alternative when you need the index but iterate over a range:
    for (auto &indexedRange : llvm::enumerate(*loopRanges)) {
      // indexedRange.index() for the index
      // indexedRange.value() for the value
    }

• inho9606 added inline comments.Mar 11 2021, 6:29 PM

mlir/lib/Dialect/Linalg/IR/LinalgInterfaces.cpp
437	Wow, it's very helpful! Thanks for sharing the tips.
444	I think it was already checked at line 327 when constructing indexingMaps, wasn't it?
446	Could you give me more of a hint? I thought we should compare the upper bound of the loops with the size of the shaped operands, since we have to detect out-of-bounds accesses. Or do you mean the size and the loop range should be the same?
mlir/test/Dialect/Linalg/reshape_linearization_fusion.mlir
170–173	With your suggestion, the indices are less than the sizes, but they do not match. I mean the 3x35 shape is matched with a 3x18 loop range, and the 5x7x3 shape is matched with a 4x2x2 loop range. Referring to the case below that reshapes 3x35 to 5x3x7, map2 matches the input shape and map3 matches the output. In fact, if I modify map2, then it does not work, e.g., 3x35 -> 3x44.

Updating D98390: Added static verification for Linalg Ops. #
Enter a brief description of the changes included in this update.
The first line is used as subject, next lines as comment.

Reflected small code styles

hanchung added inline comments.Mar 11 2021, 11:29 PM

mlir/lib/Dialect/Linalg/IR/LinalgInterfaces.cpp
446	Yes, they should be the same. It does not make sense to me if one is larger; otherwise the behavior is undefined. Say that both affine_maps are (d0) -> (d0), one shape is [10], and the other is [20]. Whether the issue is raised then depends on how you infer the shape. If you get the upper bound from the first operand, the check passes. If you get the upper bound from the second operand, the check fails. Does that make sense?
mlir/test/Dialect/Linalg/reshape_linearization_fusion.mlir
170–173	I do not understand what the issue is here. d0, d1, d2 will map to three numbers. This is just like what symbol maps to what value. I don't see the difference if you swap d1 and d2. I actually don't understand what you are saying like 3x18 loop range and 4x2x2 loop range. Could you explain more? What is the IR if you go with the suggestion?
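
The `[10]` versus `[20]` example from the comment on line 446 can be sketched in plain C++. This is only an illustration under the assumption that both operands use the identity map (d0) -> (d0); `boundsCheckPasses` and `equalityCheckPasses` are hypothetical helpers, not code from the patch.

```cpp
#include <cassert>
#include <cstdint>
#include <vector>

// With the identity map (d0) -> (d0), the last index touched is loopRange - 1.
// Bounds-style check: accept as long as that index is inside every operand.
bool boundsCheckPasses(int64_t loopRange, const std::vector<int64_t> &dimSizes) {
  assert(loopRange > 0 && "expects a static, non-empty loop range");
  for (int64_t size : dimSizes)
    if (loopRange - 1 >= size)
      return false;
  return true;
}

// Equality-style check: the loop range must equal every operand's dim size.
bool equalityCheckPasses(int64_t loopRange, const std::vector<int64_t> &dimSizes) {
  assert(loopRange > 0 && "expects a static, non-empty loop range");
  for (int64_t size : dimSizes)
    if (loopRange != size)
      return false;
  return true;
}
```

With sizes [10] and [20], the bounds-style check accepts a loop range of 10 and rejects a loop range of 20, so its verdict depends on which operand the range was inferred from. The equality-style check rejects the op either way.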

Harbormaster completed remote builds in B93413: Diff 330123.Mar 12 2021, 12:43 AM

• inho9606 added inline comments.Mar 12 2021, 12:57 AM

mlir/test/Dialect/Linalg/reshape_linearization_fusion.mlir
170–173	Okay, I removed the 'range -= 1' statement according to your comment above (not committed yet), and printed out each rank's size of the shapes and loop ranges. Here is the IR with the suggestion.

    inhoseo@inho:~/llvm-project/build/bin$ ./mlir-opt linalg_test.mlir -linalg-fold-reshape-ops-by-linearization -verify-diagnostics
    shaped 3 index 3
    shaped 5 index 5
    shaped 7 index 7
    shaped 5 index 5
    shaped 7 index 7
    shaped 3 index 3
    shaped 3 index 3
    shaped 35 index 26
    shaped 5 index 5
    shaped 7 index 3
    shaped 3 index 3
    #map0 = affine_map<(d0, d1, d2) -> (d1, d0 + d2 * 7)>
    #map1 = affine_map<(d0, d1, d2) -> (d0, d1, d2)>
    module {
      func @generic_op_120_permultation_reshape_producer_fusion(%arg0: tensor<3x35xf32>) -> tensor<5x7x3xf32> {
        %0 = linalg.init_tensor [5, 7, 3] : tensor<5x7x3xf32>
        %1 = linalg.generic {indexing_maps = [#map0, #map1], iterator_types = ["parallel", "parallel", "parallel"]} ins(%arg0 : tensor<3x35xf32>) outs(%0 : tensor<5x7x3xf32>) {
        ^bb0(%arg1: f32, %arg2: f32):  // no predecessors
          linalg.yield %arg1 : f32
        } -> tensor<5x7x3xf32>
        return %1 : tensor<5x7x3xf32>
      }
    }

-convert-linalg-to-loops does not work with other options here.

@inho9606, please add invalid ops to test/Dialect/Linalg/invalid.mlir

mlir/test/Dialect/Linalg/reshape_linearization_fusion.mlir
170–173	Quoting inho9606: "Okay, I removed 'range -=1 statement' according to your comment above (not committed yet), and printed out each rank's size of the shapes and loop ranges. Here is IR with the suggestion."

I don't get the reason to remove the 'range -= 1' statement. But this seems unrelated; I will take a look once you upload the patch.

(The quoted mlir-opt output and IR are identical to the previous comment and are not repeated here.)

I think there are two issues.

(1) The input IR is not valid, if I read correctly.

    #map2 = affine_map<(d0, d1, d2) -> (d1, d2, d0)>
    #map3 = affine_map<(d0, d1, d2) -> (d0, d1, d2)>
    %2 = linalg.generic {indexing_maps = [#map2, #map3], iterator_types = ["parallel", "parallel", "parallel"]} ins(%0 : tensor<3x5x7xf32>) outs(%1 : tensor<5x7x3xf32>) {
    ^bb0(%arg2: f32, %arg3: f32):  // no predecessors
      linalg.yield %arg2 : f32
    } -> tensor<5x7x3xf32>

The indexing maps for the generic op should be

    #map2 = affine_map<(d0, d1, d2) -> (d2, d0, d1)>
    #map3 = affine_map<(d0, d1, d2) -> (d0, d1, d2)>

The indexing maps that you provide are also correct. This is just a matter of loop order.

(2) There is probably a bug in the pass. The indexing maps in the output are

    #map0 = affine_map<(d0, d1, d2) -> (d1, d0 + d2 * 7)>
    #map1 = affine_map<(d0, d1, d2) -> (d0, d1, d2)>
    %1 = linalg.generic {indexing_maps = [#map0, #map1], iterator_types = ["parallel", "parallel", "parallel"]} ins(%arg0 : tensor<3x35xf32>) outs(%0 : tensor<5x7x3xf32>) {
    ^bb0(%arg1: f32, %arg2: f32):  // no predecessors
      linalg.yield %arg1 : f32
    } -> tensor<5x7x3xf32>

while I think they should be

    #map0 = affine_map<(d0, d1, d2) -> (d2, d1 + d0 * 7)>
    #map1 = affine_map<(d0, d1, d2) -> (d0, d1, d2)>

With your fix, you are hiding the issue in (2). I think we can go with your fix. Then Mahesh or I will fix the bug and add one more test for it. @mravishankar fyi, the input of the test case was wrong I think.

This revision now requires changes to proceed.Mar 12 2021, 4:50 AM

• inho9606 added inline comments.Mar 12 2021, 7:45 AM

mlir/lib/Dialect/Linalg/IR/LinalgInterfaces.cpp
446	Yes, if d0 already matched (10) and d0 then tries to match (20), it should fail. My understanding is that the elements of indices and the elements of shapedOperand.getDimSize() match 1:1, so indices[j] accesses shapedOperand.getDimSize(j). And usually the valid range of indices is from 0 to size-1. That's why I thought we could pass if indices[j] < shapedOperand.getDimSize(j). Also, yes, you are right, we still need the 'range -= 1' statement. It is needed to get the exact indices from the compose method, so it is not related to this discussion. I misunderstood.
mlir/test/Dialect/Linalg/reshape_linearization_fusion.mlir
171–173	Could you give me more detail on what you meant by 's/permultation/permutation'?

• inho9606 added inline comments.Mar 12 2021, 7:57 AM

mlir/lib/Dialect/Linalg/IR/LinalgInterfaces.cpp
446	Yes, if d0 already matched (10) and d0 then tries to match (20), it should fail. My understanding is that the elements of indices and the elements of shapedOperand.getDimSize() match 1:1, so indices[j] accesses shapedOperand.getDimSize(j). And usually the valid range of indices is from 0 to size-1. That's why I thought we could pass if indices[j] < shapedOperand.getDimSize(j). Also, yes, you are right, we still need the 'range -= 1' statement. It is needed to get the exact indices from the compose method, so it is not related to the discussion about the test. I misunderstood.

hanchung added inline comments.Mar 12 2021, 10:59 AM

mlir/test/Dialect/Linalg/reshape_linearization_fusion.mlir
171–173	Oops, I meant to replace permultation with permutation.

• inho9606 updated this revision to Diff 330554.Mar 14 2021, 8:18 PM

Added invalid op in invalid.mlir

Harbormaster completed remote builds in B93739: Diff 330554.Mar 14 2021, 8:47 PM

Updated verification condition, and fixed some more testcases

Herald added a subscriber: arphaman. Mar 15 2021, 2:53 AM

Removed comments and changed variable names

Harbormaster completed remote builds in B93767: Diff 330590.Mar 15 2021, 3:32 AM

Harbormaster completed remote builds in B93764: Diff 330587.Mar 15 2021, 3:36 AM

Inho and I had an offline discussion today, and we found that this doesn't work in some cases, e.g., when an indexing map is decreasing, such as affine_map<(i) -> (10 - i)>. I will wait for Inho's fix and review it later.
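
A small sketch of why a decreasing map breaks the "evaluate at the last iteration" approach. It models a one-dimensional affine expression in plain C++; `Linear1D`, `indexAtLastIteration`, and `largestIndex` are hypothetical names, and checking both loop endpoints is shown only as one possible alternative, not as the fix that landed.

```cpp
#include <algorithm>
#include <cassert>
#include <cstdint>

// Hypothetical 1-D affine expression: constant + coeff * i.
struct Linear1D {
  int64_t constant;
  int64_t coeff;
};

int64_t evaluate(const Linear1D &expr, int64_t i) {
  return expr.constant + expr.coeff * i;
}

// Index touched at the last loop iteration (i = range - 1).
int64_t indexAtLastIteration(const Linear1D &expr, int64_t range) {
  assert(range > 0 && "expects a static, non-empty loop range");
  return evaluate(expr, range - 1);
}

// Largest index touched over the whole loop. A linear expression reaches its
// extremes at the loop endpoints, so checking i = 0 and i = range - 1 suffices.
int64_t largestIndex(const Linear1D &expr, int64_t range) {
  assert(range > 0 && "expects a static, non-empty loop range");
  return std::max(evaluate(expr, 0), evaluate(expr, range - 1));
}
```

For (i) -> (10 - i) over 11 iterations, the index at the last iteration is 0 although the access at i = 0 touches index 10, so a size inferred from the last iteration alone would be far too small.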

This revision now requires changes to proceed.Mar 16 2021, 5:39 AM

Ignore the case where the last inferred access index is zero.

Harbormaster completed remote builds in B94405: Diff 331476.Mar 18 2021, 12:35 AM

Applied clang-format to LinalgInterfaces.cpp

Harbormaster completed remote builds in B94407: Diff 331480.Mar 18 2021, 12:53 AM

Applied rebase to top

Harbormaster completed remote builds in B94438: Diff 331531.Mar 18 2021, 6:24 AM

hanchung requested changes to this revision.Mar 18 2021, 9:58 AM

hanchung added inline comments.

mlir/lib/Dialect/Linalg/IR/LinalgInterfaces.cpp
436	nit: rename it to `hasDynamicRange`?
440–446	The logic is mixed in this loop. You can use `llvm::any_of`, and it's better to have a comment on why you subtract from the loop ranges. E.g.,
    bool hasDynamicRange = llvm::any_of(...)
    for (auto ...) range -= 1;
The second line can even be put into the body of the if-statement. In that case, we can write something like:
    if (llvm::any_of(...)) {
      for (auto ...) range -= 1;
      ...
    }
447	Add a comment on why? We actually can verify the snippet below, right?
    linalg.matmul ins(%A, %B: memref<2x?xf32>, memref<?x4xf32>)
                  outs(%C: memref<2x4xf32>)
Is the reason that this is too expensive to check?
448–452	Let's follow LLVM style, use `llvm::enumerate(linalgOp.getShapedOperandTypes())`.
453	style nit: `for (auto j : llvm::seq<unsigned>(0, shapedOperand.getRank()))`
454–457	I feel we should have a more detailed description because this is not well-defined yet. How about: Ignore the case that the inferred last index is zero. The indexing is either increasing or decreasing in Linalg, i.e., the last index will be `0` or `size-1`. We only check the cases that are non-zero because most cases are increasing and it is too expensive to find the shape of decreasing cases.
460–466	How about:
    inferred XXX shaped operand shape's dimension XXX to be XXX but found XXX
This looks shorter and cleaner to me.
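
The matmul example from the comment on line 447 mixes static and dynamic sizes. A rough sketch of a per-dimension check that skips only the dynamic sizes is shown below in plain C++. It assumes the loop ranges are already known, that each operand dimension is indexed by exactly one loop, and that -1 marks a dynamic size; `Operand`, `kDynamic`, and `staticDimsAgree` are hypothetical names, not MLIR API.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Assumed marker for a size that is only known at runtime.
constexpr int64_t kDynamic = -1;

// Hypothetical operand description: its shape, plus for every operand
// dimension the loop that indexes it (as in matmul's indexing maps).
struct Operand {
  std::vector<int64_t> shape;
  std::vector<std::size_t> loopOfDim;
};

// Compares every statically known operand dimension with the statically known
// range of the loop that indexes it. Dynamic sizes on either side are skipped
// instead of disabling the whole check.
bool staticDimsAgree(const std::vector<int64_t> &loopRanges,
                     const std::vector<Operand> &operands) {
  for (const Operand &operand : operands) {
    assert(operand.shape.size() == operand.loopOfDim.size());
    for (std::size_t j = 0; j < operand.shape.size(); ++j) {
      assert(operand.loopOfDim[j] < loopRanges.size());
      int64_t range = loopRanges[operand.loopOfDim[j]];
      if (operand.shape[j] == kDynamic || range == kDynamic)
        continue;
      if (operand.shape[j] != range)
        return false;
    }
  }
  return true;
}
```

For the matmul above, the loops (m, n, k) would have ranges (2, 4, dynamic), and the operands 2x?, ?x4, and 2x4 agree on every static dimension. Changing the output to 2x5 would make the sketch report a mismatch even though k is dynamic.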

This revision now requires changes to proceed.Mar 18 2021, 9:58 AM

Updated some comments and code styles

• inho9606 added inline comments.Mar 18 2021, 9:14 PM

mlir/lib/Dialect/Linalg/IR/LinalgInterfaces.cpp
440–446	Thanks for the useful feature. I think `llvm::none_of` reads more clearly here, so I used it instead of `llvm::any_of`. Please let me know if I got something wrong, since this was my first time using it.
448–452	I think we still need index i to compare loop ranges and shaped sizes, and to indicate which operand has an error in debug messages. So I just made this line like what you suggested for index j below instead.

Harbormaster completed remote builds in B94610: Diff 331755.Mar 18 2021, 9:58 PM

hanchung requested changes to this revision.Mar 18 2021, 11:01 PM

hanchung added inline comments.

mlir/lib/Dialect/Linalg/IR/LinalgInterfaces.cpp
448–452	`llvm::enumerate` will return the index and the object.
    for (auto &en : llvm::enumerate(linalgOp.getShapedOperandTypes())) {
      // en.index() for the index
      // en.value() for the value
    }
In this case `en.value()` will be `i`.
456
462–463	It is weird to me that the message starts with "Detected Invalid shaped operand.". I don't see other verify methods doing this kind of thing. Can we delete it? Also, error messages are supposed to be sentence fragments: errors reported via emitError or emitOpError should start with a lower-case letter.

This revision now requires changes to proceed.Mar 18 2021, 11:01 PM

hanchung added inline comments.Mar 18 2021, 11:05 PM

mlir/lib/Dialect/Linalg/IR/LinalgInterfaces.cpp
448–452	typo... I meant `en.index` will be `i`.

Applied LLVM style and fixed comments

• inho9606 added inline comments.Mar 19 2021, 2:08 AM

mlir/lib/Dialect/Linalg/IR/LinalgInterfaces.cpp
448–452	Oh, I didn't know it has en.index(). Thanks!
456	Sorry, I didn't catch what you meant because my screen reader does not recognize your comment. I think it might be because it starts with //. Could you add the comment again?

Harbormaster completed remote builds in B94641: Diff 331799.Mar 19 2021, 2:47 AM

Check complicated cases; pass if the indices are within the bounds of the shapes.

Harbormaster completed remote builds in B95213: Diff 332587.Mar 23 2021, 5:39 AM

hanchung requested changes to this revision.Mar 23 2021, 11:12 AM

hanchung added inline comments.

mlir/lib/Dialect/Linalg/IR/LinalgInterfaces.cpp
447	I don't think the comment explains my example. You will skip the check when some dims are dynamic and some dims are static. My example can be checked. Is there a reason for not checking it?
447	We don't intend to modify any values right? Let's use `const auto &en`.
451	Let's use `auto j`. LLVM style says to use auto with initializers like cast<Foo>(...) or in other places where the type is already obvious from the context. llvm::seq already spells out the type. https://llvm.org/docs/CodingStandards.html#use-auto-type-deduction-to-make-code-more-readable
453	typo: Replace `inffered` with `inferred`.
456	I actually don't know the English name of this symbol. It is probably a grave accent. You can type it with the key above the `Tab` key, which is also to the left of `1`.
462–467	The walk is not needed here. I think it is a tree-based structure. If the kind of an affine expression is DimId, it won't have any children. If it is not, it is "complicated" in this case. There are a couple of ways to do it:
1. Try to dyn_cast it to AffineDimExpr, e.g.,
    if (dyn_cast<AffineDimExpr>(expr)) {
      // not complicated case
    } else {
      // complicated case
    }
2. Just check if the kind is a DimId by `expr.getKind().isa<>(mlir::AffineExprKind::DimId)`.
The first way is more common I think.
468–482	A couple of issues:
1. 0-base is more common. Let's not go with 1-base.
2. Add comments. People won't understand why it is checked this way.
3. It's better to have a variable for `indices[j] + 1`, maybe `inferredSize`?
mlir/test/Dialect/Linalg/invalid.mlir
707–714	Add one more invalid test case for another case.
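
The point of the comment on lines 462–467 is that only the root node's kind has to be inspected, with no tree walk. The toy expression tree below illustrates that in plain C++; `Expr`, `ExprKind`, `dim`, `constant`, `binary`, and `isPlainDim` are hypothetical stand-ins for illustration and do not reproduce MLIR's `AffineExpr` classes.

```cpp
#include <cassert>
#include <cstdint>
#include <memory>
#include <utility>

enum class ExprKind { Add, Mul, DimId, Constant };

// Hypothetical expression node. Leaves (DimId, Constant) have no children.
struct Expr {
  ExprKind kind;
  int64_t value;             // dim position or constant; unused for Add/Mul
  std::shared_ptr<Expr> lhs; // null for leaves
  std::shared_ptr<Expr> rhs; // null for leaves
};
using ExprPtr = std::shared_ptr<Expr>;

ExprPtr dim(int64_t position) {
  return std::make_shared<Expr>(Expr{ExprKind::DimId, position, nullptr, nullptr});
}

ExprPtr constant(int64_t value) {
  return std::make_shared<Expr>(Expr{ExprKind::Constant, value, nullptr, nullptr});
}

ExprPtr binary(ExprKind kind, ExprPtr lhs, ExprPtr rhs) {
  assert((kind == ExprKind::Add || kind == ExprKind::Mul) && "binary kinds only");
  assert(lhs && rhs && "binary nodes need two children");
  return std::make_shared<Expr>(Expr{kind, 0, std::move(lhs), std::move(rhs)});
}

// A plain dimension is a leaf of kind DimId, so checking the root is enough.
// Anything else counts as the "complicated" case.
bool isPlainDim(const Expr &expr) { return expr.kind == ExprKind::DimId; }
```

Under this model, `d1` is a plain dimension, while `d0 + d2 * 7` has an Add node at its root and is classified as complicated without visiting its children.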

This revision now requires changes to proceed.Mar 23 2021, 11:12 AM

Updated code style and comments..
Added an invalid testcase.

• inho9606 added inline comments.Mar 23 2021, 8:50 PM

mlir/lib/Dialect/Linalg/IR/LinalgInterfaces.cpp
456	Oh grabe is correct! Actually the screen reader says '>' as 'grater than', and says '`' as grabe. It is similar, so my ears thought of them as same things.. LOL

Harbormaster completed remote builds in B95400: Diff 332858.Mar 24 2021, 2:15 AM

hanchung requested changes to this revision.Mar 25 2021, 12:06 AM

hanchung added inline comments.

mlir/lib/Dialect/Linalg/IR/LinalgInterfaces.cpp
466–480	There are four `en.value().getDimSize(j)`, maybe have a variable for it?
474–477	The error message is not reasonable, please fix it. Should it be "greater than"?
mlir/test/Dialect/Linalg/invalid.mlir
719	This error message is weird: `xxx be less than or equal to 4 but found 3`. Isn't 3 less than 4?
mlir/test/Dialect/Linalg/tile-indexed-generic.mlir
86	Accidental change? Please revert it.

This revision now requires changes to proceed.Mar 25 2021, 12:06 AM

Fixed weird comment and minor style issues

• inho9606 added inline comments.Mar 25 2021, 3:53 AM

mlir/test/Dialect/Linalg/invalid.mlir
719	Oops it is definitely my fault.. Thanks
mlir/test/Dialect/Linalg/tile-indexed-generic.mlir
86	Ah, I often use the tab key to explore the screen, but sometimes it is typed as a character and I easily miss it. I need to be more careful. Anyway, thanks for letting me know!

Fixed clang-format issue

Harbormaster completed remote builds in B95657: Diff 333248.Mar 25 2021, 4:14 AM

Harbormaster completed remote builds in B95661: Diff 333252.Mar 25 2021, 4:38 AM

It looks like there is a failure in tile-and-fuse-tensors.mlir. Could you check the file/test, so we can make bots happy?

Rebased the patch to the top of tree, and fixed one test in tile-and-fuse-tensors

I think this version is more reasonable compared to other cases

Harbormaster completed remote builds in B95821: Diff 333479.Mar 25 2021, 7:09 PM

Harbormaster completed remote builds in B95820: Diff 333478.Mar 25 2021, 7:42 PM

Thanks!

This revision was not accepted when it landed; it landed in state Needs Review.Mar 30 2021, 7:11 AM

Closed by commit rGf58463345415: Added static verification for Linalg Ops. (authored by • inho9606, committed by hanchung).

This revision was automatically updated to reflect the committed changes.

hanchung added a commit: rGf58463345415: Added static verification for Linalg Ops..

Revision Contents

Path	Size
mlir/lib/Dialect/Linalg/IR/LinalgInterfaces.cpp	35 lines
mlir/test/Dialect/Linalg/fusion-2-level.mlir	2 lines
mlir/test/Dialect/Linalg/generalize-named-ops.mlir	6 lines
mlir/test/Dialect/Linalg/invalid.mlir	9 lines
mlir/test/Dialect/Linalg/named-ops.mlir	36 lines
mlir/test/Dialect/Linalg/reshape_linearization_fusion.mlir	12 lines
mlir/test/Dialect/Linalg/sparse_nd.mlir	16 lines
mlir/test/Dialect/Linalg/tile-indexed-generic.mlir	8 lines

Diff 331799

mlir/lib/Dialect/Linalg/IR/LinalgInterfaces.cpp

for (auto it : llvm::zip(linalgOp.getShapedOperandTypes(),

…

if (std::get<0>(it).getElementType() != std::get<1>(it).getType())

return op->emitError("expected type of bb argument #")

<< (idx + numBBIvs) << " (" << std::get<1>(it).getType() << ")"

<< " to match element type of corresponding shaped operand ("

<< std::get<0>(it).getElementType() << ")";

++idx;

}

// Check if shapes are valid

Optional<SmallVector<int64_t, 4>> loopRanges = linalgOp.getStaticLoopRanges();

hanchungUnsubmitted

Not Done

Check the return value

hanchungUnsubmitted

Not Done

nit: rename it to hasDynamicRange?

if (!loopRanges)

hanchungUnsubmitted

Not Done

The LLVM style is for (unsigned i = 0, e = ...; i < e; ...)

mehdi_aminiUnsubmitted

Not Done

Alternatively, for (int i : llvm::seq<int>(0, (*loopRanges).size()))

But even better any time you don't need the index:

for (int64_t &range : *loopRanges) range -= 1;

Also good to know the alternative when you need the index but iterate over a range:

for (auto &indexedRange : llvm::enumerate(*loopRanges)) {
    // indexedRange.index() for the index
    // indexedRange.value() for the value
}

inho9606AuthorUnsubmitted

Done

Wow It's very helpful! Thanks for sharing the tips

return linalgOp.emitError("unable to find loop range for operation");

// Verify only static cases since we can't get exact dimension sizes and loop

// ranges for dynamic cases in this stage.

hanchungUnsubmitted

Not Done

This can be getShapedOperandTypes()[i].
I think it is better to have a getShapedOperandType(i), could you add the interface method?

if (llvm::none_of(*loopRanges, [](int64_t &range) {

return range == ShapedType::kDynamicSize;

})) {

hanchungUnsubmitted

Not Done

You should check if there are any symbols before calling compose method. Otherwise, it will hit an assertion in the method.

inho9606AuthorUnsubmitted

Done

I think it was already checked in line 327 when constructing indexingMaps,, isn't it?

for (int64_t &range : *loopRanges)

range -= 1;

hanchungUnsubmitted

Not Done

Why check with >=? Can we check if they are the same, ie with ==? You can know the exact shape if you have loop boundary.

inho9606AuthorUnsubmitted

Done

Could you give me more hint? I thought we should check the upper bound of loops and the size of shaped operands since we have to detect access out of boundary. Or do you mean the size and loop range should be same?

hanchungUnsubmitted

Not Done

Yes, they should be the same. It does not make sense to me if one is larger. Otherwise the behavior is undefined.

Say that both affine_map is (d0) -> (d0). One shape is [10], and another is [20]. It can raise the issue or not raise issue based on how you infer the shape. If you get the upper bound from the first operand, then the check passes. If you get the upper bound from the second operand, then the check fails. Does it make sense?

inho9606AuthorUnsubmitted

Done

Yes, if d0 already matched (10) and d0 then tries to match (20), it should fail.
My understanding is that the elements of indices and the elements of shapedOperand.getDimSize() match 1:1, so indices[j] accesses shapedOperand.getDimSize(j). And usually the valid range of indices is from 0 to size-1. That's why I thought we could pass if indices[j] < shapedOperand.getDimSize(j).
Also, yes, you are right, we still need the 'range -= 1' statement. It is needed to get the exact indices from the compose method, so it is not related to this discussion. I misunderstood.

hanchungUnsubmitted

Not Done

The logic is mixed in this loop. You can use llvm::any_of and it's better to have a comment on why you subtract loop ranges. E.g.,

bool hasDynamicRange = llvm::any_of(...)
for (auto ...) range -= 1;

The second line can even put to the body of if-statement. And in this case, we can write something like:

if (llvm::any_of(...)) {
  for (auto ...) range -= 1;
  ...
}

inho9606AuthorUnsubmitted

Done

Thanks for the useful feature. I think none-of would be more clear to read, so used it instead of any_of. But please let me know if I made something wrong please since it was my first time to use this.

for (auto &en : llvm::enumerate(linalgOp.getShapedOperandTypes())) {

hanchungUnsubmitted

Not Done

Add a comment on why? We actually can verify the below snippet right?

linalg.matmul ins(%A, %B: memref<2x?xf32>, memref<?x4xf32>)
                     outs(%C: memref<2x4xf32>)

Is the reason that this is too expensive to check?

hanchungUnsubmitted

Not Done

I don't think the comment explains my example. You will skip the check when some dims are dynamic and some dims are static. My example can be checked. Is there a reason for not checking it?

hanchungUnsubmitted

Not Done

We don't intend to modify any values right? Let's use const auto &en.

if (!en.value().hasStaticShape())

continue;

auto indices = indexingMaps[en.index()].compose(*loopRanges);

for (unsigned j : llvm::seq<unsigned>(0, en.value().getRank())) {

hanchungUnsubmitted

Not Done

Let's use auto j. LLVM style says that use auto with initializers like cast<Foo>(...) or other places where the type is already obvious from the context. llvm::seq already spells the type.

https://llvm.org/docs/CodingStandards.html#use-auto-type-deduction-to-make-code-more-readable

hanchungUnsubmitted

Not Done

Let's follow LLVM style, use llvm::enumerate(linalgOp.getShapedOperandTypes()).

inho9606AuthorUnsubmitted

Done

I think we still need index i to compare loop ranges and shaped sizes, and to indicate which operand has an error in debug messages. So I just made this line like what you suggested for index j below instead.

hanchung: `llvm::enumerate` will return the index and the object.

for (auto &en : llvm::enumerate(linalgOp.getShapedOperandTypes())) {
   // en.index() for the index
   // en.value() for the value
}

In this case `en.value()` will be `i`.

hanchung: Typo... I meant `en.index()` will be `i`.

inho9606: Oh, I didn't know it has `en.index()`. Thanks!
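The `en.index()` / `en.value()` pattern from this thread can be tried without an LLVM checkout. The sketch below is only a stand-in, not `llvm::enumerate` itself: `Enumerated`, `enumerateSketch`, and `labelOperands` are names invented for illustration, and the helper handles `std::vector` only, whereas the real utility is lazy and accepts any range.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <vector>

// Hypothetical stand-in for the element produced by llvm::enumerate:
// it pairs a position with a reference to the element at that position.
template <typename T> class Enumerated {
public:
  Enumerated(std::size_t idx, const T &val) : idx(idx), val(val) {}
  std::size_t index() const { return idx; }
  const T &value() const { return val; }

private:
  std::size_t idx;
  const T &val;
};

// Hypothetical helper: materializes (index, value) pairs for a vector.
template <typename T>
std::vector<Enumerated<T>> enumerateSketch(const std::vector<T> &range) {
  std::vector<Enumerated<T>> result;
  result.reserve(range.size());
  for (std::size_t i = 0, e = range.size(); i < e; ++i)
    result.emplace_back(i, range[i]);
  return result;
}

// Builds "#<index>:<value>" labels using only en.index() and en.value(),
// so no separate counter `i` is needed in the loop.
std::vector<std::string> labelOperands(const std::vector<std::string> &types) {
  assert(!types.empty() && "expected at least one operand type");
  std::vector<std::string> labels;
  for (const auto &en : enumerateSketch(types))
    labels.push_back("#" + std::to_string(en.index()) + ":" + en.value());
  return labels;
}
```

Calling `labelOperands` on a list of operand type strings yields one label per operand, with the position taken from `en.index()`.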

// Ignore the case that the inffered last index is zero. The index is

hanchung: style nit:

for (auto j : llvm::seq<unsigned>(0, shapedOperand.getRank()))

hanchung: typo: replace `inffered` with `inferred`.

// increasing or decreasing in Linalg, for example, the last index

// should be >0> or >size-1>. We only check the cases that are non-zero

// because most of cases are increasing and it is too expensive to find

hanchung:

  // increasing or decreasing in Linalg, for example, the last index
- // should be >0> or >size-1>. We only check the cases that are non-zero
+ // should be `0` or `size-1`. We only check the cases that are non-zero
  // because most of cases are increasing and it is too expensive to find

inho9606: Sorry, I didn't catch what you meant because my screen reader does not recognize your comment. I think it might be because it starts with `//`. Could you add the comment again?

hanchung: I actually don't know the English name of this symbol. It is probably a grave accent. You can type it with the key above the Tab key, to the left of the 1 key.

inho9606: Oh, grave is correct! Actually the screen reader reads '>' as 'greater than' and '`' as 'grave'. They sound similar, so my ears took them for the same thing. LOL

// the shape of decreasing cases.

hanchung: I feel we should have a more detailed description because this is not well-defined yet. How about:

Ignore the case that the inferred last index is zero. The indexing is either increasing or decreasing in Linalg, i.e., the last index will be `0` or `size-1`. We only check the cases that are non-zero because most cases are increasing and it is too expensive to find the shape of decreasing cases.

if (indices[j] == 0)

continue;

if (indices[j] != en.value().getDimSize(j) - 1) {

return linalgOp.emitOpError("inferred shaped operand #")

<< en.index() + 1 << " has shape's dimension #" << j + 1

<< " to be " << indices[j] + 1 << ", but found "

hanchung:

  if (indices[j] != shapedOperand.getDimSize(j) - 1) {
-   return linalgOp.emitOpError("Detected Invalid shaped operand. "
-                               "Inferred shaped operand #")
+   return linalgOp.emitOpError("inferred shaped operand #")
        << i + 1 << " has shape's dimension #" << j + 1 << " to be "

It is weird to me that the message starts with "Detected Invalid shaped operand.". I don't see other verify methods doing this kind of thing. Can we delete it?

Also, these are supposed to be sentence fragments. Errors reported via `emitError` or `emitOpError` should start with a lower-case letter.

<< en.value().getDimSize(j);

}

hanchung: How about:

inferred XXX shaped operand shape's dimension XXX to be XXX but found XXX

This looks shorter and cleaner to me.

}

hanchung: The walk is not needed here. I think it is a tree-based structure. If the kind of an affine expression is `DimId`, it won't have any children. If it is not, it is "complicated" in this case. There are a couple of ways to do it:

1. Try to `dyn_cast` it to `AffineDimExpr`, e.g.,

if (dyn_cast<AffineDimExpr>(expr)) {
  // not complicated case
} else {
  // complicated case
}

2. Just check whether the kind is `DimId`, i.e. `expr.getKind() == mlir::AffineExprKind::DimId`.

The first way is more common I think.
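To make the "no walk needed" point concrete, here is a minimal sketch on a toy expression tree. It is not `mlir::AffineExpr`: `ExprKind`, `Expr`, the builder functions, and `isPlainDim` are all hypothetical names. The only idea carried over from the comment is that a `DimId` node is a leaf, so looking at the root kind is enough to separate the simple case from the "complicated" one.

```cpp
#include <cassert>
#include <memory>

// Hypothetical miniature of an affine expression tree.
enum class ExprKind { DimId, Constant, Add, Mul };

struct Expr {
  ExprKind kind;
  long value;                // dim position or constant value (leaves only)
  std::shared_ptr<Expr> lhs; // children, set only for Add and Mul
  std::shared_ptr<Expr> rhs;
};
using ExprPtr = std::shared_ptr<Expr>;

ExprPtr dim(long position) {
  return std::make_shared<Expr>(Expr{ExprKind::DimId, position, nullptr, nullptr});
}
ExprPtr constant(long value) {
  return std::make_shared<Expr>(Expr{ExprKind::Constant, value, nullptr, nullptr});
}
ExprPtr add(ExprPtr lhs, ExprPtr rhs) {
  return std::make_shared<Expr>(Expr{ExprKind::Add, 0, lhs, rhs});
}
ExprPtr mul(ExprPtr lhs, ExprPtr rhs) {
  return std::make_shared<Expr>(Expr{ExprKind::Mul, 0, lhs, rhs});
}

// A DimId node has no children, so checking the kind of the root is
// sufficient; the children are never visited.
bool isPlainDim(const ExprPtr &expr) {
  assert(expr && "expected a non-null expression");
  return expr->kind == ExprKind::DimId;
}
```

With this model, `isPlainDim(dim(1))` is true, while an expression shaped like `d1 * 4 + d5 * 2` is classified as complicated by inspecting a single node.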

}

return success();

}

hanchung: The error message is not reasonable, please fix it. Should it be "greater than"?

mlir/test/Dialect/Linalg/fusion-2-level.mlir

Show All 22 Lines	scf.for %arg6 = %c0 to %2 step %c30 {
%9 = memref.dim %5, %c0 : memref<?x?xf32, offset: ?, strides: [?, ?]>		%9 = memref.dim %5, %c0 : memref<?x?xf32, offset: ?, strides: [?, ?]>
%10 = memref.dim %5, %c1 : memref<?x?xf32, offset: ?, strides: [?, ?]>		%10 = memref.dim %5, %c1 : memref<?x?xf32, offset: ?, strides: [?, ?]>
%11 = memref.dim %7, %c1 : memref<?x?xf32, offset: ?, strides: [?, ?]>		%11 = memref.dim %7, %c1 : memref<?x?xf32, offset: ?, strides: [?, ?]>
scf.for %arg8 = %c0 to %9 step %c2 {		scf.for %arg8 = %c0 to %9 step %c2 {
scf.for %arg9 = %c0 to %11 step %c3 {		scf.for %arg9 = %c0 to %11 step %c3 {
scf.for %arg10 = %c0 to %10 step %c4 {		scf.for %arg10 = %c0 to %10 step %c4 {
%14 = memref.subview %5[%arg8, %arg10][%c2, %c4][%c1, %c1] : memref<?x?xf32, offset: ?, strides: [?, ?]> to memref<?x?xf32, offset: ?, strides: [?, ?]>		%14 = memref.subview %5[%arg8, %arg10][%c2, %c4][%c1, %c1] : memref<?x?xf32, offset: ?, strides: [?, ?]> to memref<?x?xf32, offset: ?, strides: [?, ?]>
%16 = memref.subview %7[%arg10, %arg9][%c4, %c3][%c1, %c1]: memref<?x?xf32, offset: ?, strides: [?, ?]> to memref<?x?xf32, offset: ?, strides: [?, ?]>		%16 = memref.subview %7[%arg10, %arg9][%c4, %c3][%c1, %c1]: memref<?x?xf32, offset: ?, strides: [?, ?]> to memref<?x?xf32, offset: ?, strides: [?, ?]>
%17 = memref.subview %8[%arg8, %arg9][%c2, %c4][%c1, %c1] : memref<?x?xf32, offset: ?, strides: [?, ?]> to memref<?x?xf32, offset: ?, strides: [?, ?]>		%17 = memref.subview %8[%arg8, %arg9][%c2, %c3][%c1, %c1] : memref<?x?xf32, offset: ?, strides: [?, ?]> to memref<?x?xf32, offset: ?, strides: [?, ?]>
linalg.matmul ins(%14, %16: memref<?x?xf32, offset: ?, strides: [?, ?]>, memref<?x?xf32, offset: ?, strides: [?, ?]>)		linalg.matmul ins(%14, %16: memref<?x?xf32, offset: ?, strides: [?, ?]>, memref<?x?xf32, offset: ?, strides: [?, ?]>)
outs(%17: memref<?x?xf32, offset: ?, strides: [?, ?]>)		outs(%17: memref<?x?xf32, offset: ?, strides: [?, ?]>)
}		}
}		}
}		}
}		}
}		}
}		}

mlir/test/Dialect/Linalg/generalize-named-ops.mlir

	// RUN: mlir-opt %s -split-input-file -linalg-generalize-named-ops \| FileCheck %s			// RUN: mlir-opt %s -split-input-file -linalg-generalize-named-ops \| FileCheck %s

	func @generalize_conv(%input : memref<1x225x225x3xf32>, %filter: memref<3x3x3x32xf32>, %output: memref<1x112x112x32xf32>) {			func @generalize_conv(%input : memref<1x449x562x3xf32>, %filter: memref<3x3x3x32xf32>, %output: memref<1x112x112x32xf32>) {
	linalg.conv(%filter, %input, %output) {dilations = [2, 3], strides = [4, 5]} : memref<3x3x3x32xf32>, memref<1x225x225x3xf32>, memref<1x112x112x32xf32>			linalg.conv(%filter, %input, %output) {dilations = [2, 3], strides = [4, 5]} : memref<3x3x3x32xf32>, memref<1x449x562x3xf32>, memref<1x112x112x32xf32>
	return			return
	}			}

	// CHECK: #[[FILTER_MAP:.+]] = affine_map<(d0, d1, d2, d3, d4, d5, d6) -> (d5, d6, d4, d3)>			// CHECK: #[[FILTER_MAP:.+]] = affine_map<(d0, d1, d2, d3, d4, d5, d6) -> (d5, d6, d4, d3)>
	// CHECK: #[[INPUT_MAP:.+]] = affine_map<(d0, d1, d2, d3, d4, d5, d6) -> (d0, d1 * 4 + d5 * 2, d2 * 5 + d6 * 3, d4)>			// CHECK: #[[INPUT_MAP:.+]] = affine_map<(d0, d1, d2, d3, d4, d5, d6) -> (d0, d1 * 4 + d5 * 2, d2 * 5 + d6 * 3, d4)>
	// CHECK: #[[OUTPUT_MAP:.+]] = affine_map<(d0, d1, d2, d3, d4, d5, d6) -> (d0, d1, d2, d3)>			// CHECK: #[[OUTPUT_MAP:.+]] = affine_map<(d0, d1, d2, d3, d4, d5, d6) -> (d0, d1, d2, d3)>

	// CHECK: func @generalize_conv			// CHECK: func @generalize_conv
	// CHECK-SAME: %[[INPUT:.+]]: memref<1x225x225x3xf32>			// CHECK-SAME: %[[INPUT:.+]]: memref<1x449x562x3xf32>
	// CHECK-SAME: %[[FILTER:.+]]: memref<3x3x3x32xf32>			// CHECK-SAME: %[[FILTER:.+]]: memref<3x3x3x32xf32>
	// CHECK-SAME: %[[OUTPUT:.+]]: memref<1x112x112x32xf32>			// CHECK-SAME: %[[OUTPUT:.+]]: memref<1x112x112x32xf32>

	// CHECK: linalg.generic			// CHECK: linalg.generic
	// CHECK-SAME: indexing_maps = [#[[FILTER_MAP]], #[[INPUT_MAP]], #[[OUTPUT_MAP]]]			// CHECK-SAME: indexing_maps = [#[[FILTER_MAP]], #[[INPUT_MAP]], #[[OUTPUT_MAP]]]
	// CHECK-SAME: iterator_types = ["parallel", "parallel", "parallel", "parallel", "reduction", "window", "window"]			// CHECK-SAME: iterator_types = ["parallel", "parallel", "parallel", "parallel", "reduction", "window", "window"]
	// CHECK-SAME: ins(%[[FILTER]], %[[INPUT]]			// CHECK-SAME: ins(%[[FILTER]], %[[INPUT]]
	// CHECK-SAME: outs(%[[OUTPUT]]			// CHECK-SAME: outs(%[[OUTPUT]]
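The new input shape `1x449x562x3` in this test is consistent with evaluating `INPUT_MAP` at the last iteration of every loop. The sketch below is a reading of the test change, not code from the patch: it assumes the loop extents come from the output shape (`d0..d3 = 1, 112, 112, 32`) and the filter shape (`d4, d5, d6 = 3, 3, 3`), and `inferConvInputShape` is a hypothetical helper.

```cpp
#include <array>
#include <cassert>
#include <cstdint>

// Loop extents for d0..d6 of the generalized conv (assumed, see above).
using LoopExtents = std::array<int64_t, 7>;
using Shape4 = std::array<int64_t, 4>;

// Hypothetical helper: evaluates
//   (d0, d1 * 4 + d5 * 2, d2 * 5 + d6 * 3, d4)
// at the last iteration of every loop and adds 1 to turn the largest
// accessed index into an extent. All coefficients are positive, so the
// last iteration yields the largest index.
Shape4 inferConvInputShape(const LoopExtents &extents) {
  LoopExtents last = extents;
  for (int64_t &d : last) {
    assert(d > 0 && "loop extents must be positive");
    d -= 1;
  }
  Shape4 shape = {{last[0] + 1, last[1] * 4 + last[5] * 2 + 1,
                   last[2] * 5 + last[6] * 3 + 1, last[4] + 1}};
  return shape;
}
```

For the extents above this gives `111 * 4 + 2 * 2 + 1 = 449` and `111 * 5 + 2 * 3 + 1 = 562` for the two spatial dimensions, which is larger than the old `225x225` input.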

mlir/test/Dialect/Linalg/invalid.mlir


	func @illegal_fill_tensor_with_memref_return			func @illegal_fill_tensor_with_memref_return
	(%arg0 : tensor<?x?xf32>, %arg1 : f32) -> memref<?x?xf32>			(%arg0 : tensor<?x?xf32>, %arg1 : f32) -> memref<?x?xf32>
	{			{
	// expected-error @+1 {{expected type of operand #0 ('tensor<?x?xf32>') to match type of corresponding result ('memref<?x?xf32>')}}			// expected-error @+1 {{expected type of operand #0 ('tensor<?x?xf32>') to match type of corresponding result ('memref<?x?xf32>')}}
	%0 = linalg.fill(%arg0, %arg1) : tensor<?x?xf32>, f32 -> memref<?x?xf32>			%0 = linalg.fill(%arg0, %arg1) : tensor<?x?xf32>, f32 -> memref<?x?xf32>
	return %0 : memref<?x?xf32>			return %0 : memref<?x?xf32>
	}			}

				// -----

				func @invalid_static_linalgOp(%arg0: memref<2x4xf32>, %arg1: memref<3x4xf32>, %arg2: memref<2x4xf32>) {
				// expected-error @+1 {{inferred shaped operand #2 has shape's dimension #1 to be 4, but found 3}}
				linalg.matmul ins(%arg0, %arg1 : memref<2x4xf32>, memref<3x4xf32>)
				outs(%arg2 :memref<2x4xf32>)
				return
				}
hanchung: Add one more invalid test case for another case.

hanchung: This error message is weird: `xxx be less than or equal to 4 but found 3`. Isn't 3 less than 4?

inho9606: Oops, it is definitely my fault. Thanks.
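The expected error in `@invalid_static_linalgOp` can be reproduced with a small model of the check. This is a sketch under stated assumptions, not the code in `LinalgInterfaces.cpp`: the loop extents `(m, n, k)` are passed in directly rather than inferred, the matmul indexing maps are hard-coded as `A -> (m, k)`, `B -> (k, n)`, `C -> (m, n)`, and `checkOperand` and `verifyStaticMatmul` are hypothetical names. Only the skip-on-zero rule, the `size - 1` comparison, and the message format are taken from the diff above.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <string>
#include <vector>

using Dims = std::vector<int64_t>;

// Compares the last accessed index of each dimension with `size - 1`.
// Returns an empty string on success, otherwise the diagnostic text.
std::string checkOperand(std::size_t operandIdx, const Dims &lastIndices,
                         const Dims &shape) {
  assert(lastIndices.size() == shape.size() && "rank mismatch");
  for (std::size_t j = 0; j < shape.size(); ++j) {
    // A last index of zero is skipped, as in the patch.
    if (lastIndices[j] == 0)
      continue;
    if (lastIndices[j] != shape[j] - 1)
      return "inferred shaped operand #" + std::to_string(operandIdx + 1) +
             " has shape's dimension #" + std::to_string(j + 1) + " to be " +
             std::to_string(lastIndices[j] + 1) + ", but found " +
             std::to_string(shape[j]);
  }
  return "";
}

// Loops are (m, n, k); the maps are A -> (m, k), B -> (k, n), C -> (m, n).
std::string verifyStaticMatmul(int64_t m, int64_t n, int64_t k,
                               const std::vector<Dims> &shapes) {
  assert(shapes.size() == 3 && "matmul takes A, B and C");
  const std::vector<Dims> lastIndices = {
      {m - 1, k - 1}, {k - 1, n - 1}, {m - 1, n - 1}};
  for (std::size_t i = 0; i < shapes.size(); ++i) {
    std::string error = checkOperand(i, lastIndices[i], shapes[i]);
    if (!error.empty())
      return error;
  }
  return "";
}
```

With `m = 2`, `n = 4`, `k = 4` and the shapes `2x4`, `3x4`, `2x4` from the test, operand #2 is expected to be `4x4`, so the first dimension is reported.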

mlir/test/Dialect/Linalg/named-ops.mlir

	}			}

	// -----			// -----

	// CHECK-LABEL: func @pooling_nhwc_sum_tensor			// CHECK-LABEL: func @pooling_nhwc_sum_tensor
	// CHECK: %{{.+}} = linalg.pooling_nhwc_sum			// CHECK: %{{.+}} = linalg.pooling_nhwc_sum
	// CHECK-SAME: dilations = dense<1> : tensor<2xi64>			// CHECK-SAME: dilations = dense<1> : tensor<2xi64>
	// CHECK-SAME: strides = dense<1> : tensor<2xi64>			// CHECK-SAME: strides = dense<1> : tensor<2xi64>
	// CHECK-SAME: ins(%{{.+}}, %{{.+}} : tensor<1x6x6x1xf32>, tensor<3x3xf32>)			// CHECK-SAME: ins(%{{.+}}, %{{.+}} : tensor<1x4x4x1xf32>, tensor<3x3xf32>)
	// CHECK-SAME: outs(%{{.+}} : tensor<1x2x2x1xf32>) -> tensor<1x2x2x1xf32>			// CHECK-SAME: outs(%{{.+}} : tensor<1x2x2x1xf32>) -> tensor<1x2x2x1xf32>
	func @pooling_nhwc_sum_tensor(%input: tensor<1x6x6x1xf32>) -> tensor<1x2x2x1xf32> {			func @pooling_nhwc_sum_tensor(%input: tensor<1x4x4x1xf32>) -> tensor<1x2x2x1xf32> {
	%fake = linalg.init_tensor [3, 3] : tensor<3x3xf32>			%fake = linalg.init_tensor [3, 3] : tensor<3x3xf32>
	%init = linalg.init_tensor [1, 2, 2, 1] : tensor<1x2x2x1xf32>			%init = linalg.init_tensor [1, 2, 2, 1] : tensor<1x2x2x1xf32>
	%cst = constant 0.000000e+00 : f32			%cst = constant 0.000000e+00 : f32
	%fill = linalg.fill(%init, %cst) : tensor<1x2x2x1xf32>, f32 -> tensor<1x2x2x1xf32>			%fill = linalg.fill(%init, %cst) : tensor<1x2x2x1xf32>, f32 -> tensor<1x2x2x1xf32>
	%res = linalg.pooling_nhwc_sum {dilations = dense<1> : tensor<2xi64>, strides = dense<1> : tensor<2xi64>}			%res = linalg.pooling_nhwc_sum {dilations = dense<1> : tensor<2xi64>, strides = dense<1> : tensor<2xi64>}
	ins(%input, %fake: tensor<1x6x6x1xf32>, tensor<3x3xf32>)			ins(%input, %fake: tensor<1x4x4x1xf32>, tensor<3x3xf32>)
	outs(%fill: tensor<1x2x2x1xf32>) -> tensor<1x2x2x1xf32>			outs(%fill: tensor<1x2x2x1xf32>) -> tensor<1x2x2x1xf32>
	return %res : tensor<1x2x2x1xf32>			return %res : tensor<1x2x2x1xf32>
	}			}

	// -----			// -----

	// CHECK-LABEL: func @pooling_nhwc_sum			// CHECK-LABEL: func @pooling_nhwc_sum
	// CHECK: linalg.pooling_nhwc_sum			// CHECK: linalg.pooling_nhwc_sum
	// CHECK-SAME: dilations = dense<1> : tensor<2xi64>			// CHECK-SAME: dilations = dense<1> : tensor<2xi64>
	// CHECK-SAME: strides = dense<1> : tensor<2xi64>			// CHECK-SAME: strides = dense<1> : tensor<2xi64>
	// CHECK-SAME: ins(%{{.+}}, %{{.+}} : memref<1x6x6x1xf32>, memref<3x3xf32>)			// CHECK-SAME: ins(%{{.+}}, %{{.+}} : memref<1x4x4x1xf32>, memref<3x3xf32>)
	// CHECK-SAME: outs(%{{.+}} : memref<1x2x2x1xf32>)			// CHECK-SAME: outs(%{{.+}} : memref<1x2x2x1xf32>)
	func @pooling_nhwc_sum(%input: memref<1x6x6x1xf32>, %fake: memref<3x3xf32>, %output: memref<1x2x2x1xf32>) {			func @pooling_nhwc_sum(%input: memref<1x4x4x1xf32>, %fake: memref<3x3xf32>, %output: memref<1x2x2x1xf32>) {
	linalg.pooling_nhwc_sum {dilations = dense<1> : tensor<2xi64>, strides = dense<1> : tensor<2xi64>}			linalg.pooling_nhwc_sum {dilations = dense<1> : tensor<2xi64>, strides = dense<1> : tensor<2xi64>}
	ins(%input, %fake: memref<1x6x6x1xf32>, memref<3x3xf32>)			ins(%input, %fake: memref<1x4x4x1xf32>, memref<3x3xf32>)
	outs(%output: memref<1x2x2x1xf32>)			outs(%output: memref<1x2x2x1xf32>)
	return			return
	}			}

	// -----			// -----

	// CHECK-LABEL: func @pooling_nhwc_max_tensor			// CHECK-LABEL: func @pooling_nhwc_max_tensor
	// CHECK: %{{.+}} = linalg.pooling_nhwc_max			// CHECK: %{{.+}} = linalg.pooling_nhwc_max
	// CHECK-SAME: dilations = dense<1> : tensor<2xi64>			// CHECK-SAME: dilations = dense<1> : tensor<2xi64>
	// CHECK-SAME: strides = dense<1> : tensor<2xi64>			// CHECK-SAME: strides = dense<1> : tensor<2xi64>
	// CHECK-SAME: ins(%{{.+}}, %{{.+}} : tensor<1x6x6x1xf32>, tensor<3x3xf32>)			// CHECK-SAME: ins(%{{.+}}, %{{.+}} : tensor<1x4x4x1xf32>, tensor<3x3xf32>)
	// CHECK-SAME: outs(%{{.+}} : tensor<1x2x2x1xf32>) -> tensor<1x2x2x1xf32>			// CHECK-SAME: outs(%{{.+}} : tensor<1x2x2x1xf32>) -> tensor<1x2x2x1xf32>
	func @pooling_nhwc_max_tensor(%input: tensor<1x6x6x1xf32>) -> tensor<1x2x2x1xf32> {			func @pooling_nhwc_max_tensor(%input: tensor<1x4x4x1xf32>) -> tensor<1x2x2x1xf32> {
	%fake = linalg.init_tensor [3, 3] : tensor<3x3xf32>			%fake = linalg.init_tensor [3, 3] : tensor<3x3xf32>
	%init = linalg.init_tensor [1, 2, 2, 1] : tensor<1x2x2x1xf32>			%init = linalg.init_tensor [1, 2, 2, 1] : tensor<1x2x2x1xf32>
	%cst = constant 0.000000e+00 : f32			%cst = constant 0.000000e+00 : f32
	%fill = linalg.fill(%init, %cst) : tensor<1x2x2x1xf32>, f32 -> tensor<1x2x2x1xf32>			%fill = linalg.fill(%init, %cst) : tensor<1x2x2x1xf32>, f32 -> tensor<1x2x2x1xf32>
	%res = linalg.pooling_nhwc_max {dilations = dense<1> : tensor<2xi64>, strides = dense<1> : tensor<2xi64>}			%res = linalg.pooling_nhwc_max {dilations = dense<1> : tensor<2xi64>, strides = dense<1> : tensor<2xi64>}
	ins(%input, %fake: tensor<1x6x6x1xf32>, tensor<3x3xf32>)			ins(%input, %fake: tensor<1x4x4x1xf32>, tensor<3x3xf32>)
	outs(%fill: tensor<1x2x2x1xf32>) -> tensor<1x2x2x1xf32>			outs(%fill: tensor<1x2x2x1xf32>) -> tensor<1x2x2x1xf32>
	return %res : tensor<1x2x2x1xf32>			return %res : tensor<1x2x2x1xf32>
	}			}

	// -----			// -----

	// CHECK-LABEL: func @pooling_nhwc_max			// CHECK-LABEL: func @pooling_nhwc_max
	// CHECK: linalg.pooling_nhwc_max			// CHECK: linalg.pooling_nhwc_max
	// CHECK-SAME: dilations = dense<1> : tensor<2xi64>			// CHECK-SAME: dilations = dense<1> : tensor<2xi64>
	// CHECK-SAME: strides = dense<1> : tensor<2xi64>			// CHECK-SAME: strides = dense<1> : tensor<2xi64>
	// CHECK-SAME: ins(%{{.+}}, %{{.+}} : memref<1x6x6x1xf32>, memref<3x3xf32>)			// CHECK-SAME: ins(%{{.+}}, %{{.+}} : memref<1x4x4x1xf32>, memref<3x3xf32>)
	// CHECK-SAME: outs(%{{.+}} : memref<1x2x2x1xf32>)			// CHECK-SAME: outs(%{{.+}} : memref<1x2x2x1xf32>)
	func @pooling_nhwc_max(%input: memref<1x6x6x1xf32>, %fake: memref<3x3xf32>, %output: memref<1x2x2x1xf32>) {			func @pooling_nhwc_max(%input: memref<1x4x4x1xf32>, %fake: memref<3x3xf32>, %output: memref<1x2x2x1xf32>) {
	linalg.pooling_nhwc_max {dilations = dense<1> : tensor<2xi64>, strides = dense<1> : tensor<2xi64>}			linalg.pooling_nhwc_max {dilations = dense<1> : tensor<2xi64>, strides = dense<1> : tensor<2xi64>}
	ins(%input, %fake: memref<1x6x6x1xf32>, memref<3x3xf32>)			ins(%input, %fake: memref<1x4x4x1xf32>, memref<3x3xf32>)
	outs(%output: memref<1x2x2x1xf32>)			outs(%output: memref<1x2x2x1xf32>)
	return			return
	}			}

	// -----			// -----

	// CHECK-LABEL: func @pooling_nhwc_min_tensor			// CHECK-LABEL: func @pooling_nhwc_min_tensor
	// CHECK: %{{.+}} = linalg.pooling_nhwc_min			// CHECK: %{{.+}} = linalg.pooling_nhwc_min
	// CHECK-SAME: dilations = dense<1> : tensor<2xi64>			// CHECK-SAME: dilations = dense<1> : tensor<2xi64>
	// CHECK-SAME: strides = dense<1> : tensor<2xi64>			// CHECK-SAME: strides = dense<1> : tensor<2xi64>
	// CHECK-SAME: ins(%{{.+}}, %{{.+}} : tensor<1x6x6x1xf32>, tensor<3x3xf32>)			// CHECK-SAME: ins(%{{.+}}, %{{.+}} : tensor<1x4x4x1xf32>, tensor<3x3xf32>)
	// CHECK-SAME: outs(%{{.+}} : tensor<1x2x2x1xf32>) -> tensor<1x2x2x1xf32>			// CHECK-SAME: outs(%{{.+}} : tensor<1x2x2x1xf32>) -> tensor<1x2x2x1xf32>
	func @pooling_nhwc_min_tensor(%input: tensor<1x6x6x1xf32>) -> tensor<1x2x2x1xf32> {			func @pooling_nhwc_min_tensor(%input: tensor<1x4x4x1xf32>) -> tensor<1x2x2x1xf32> {
	%fake = linalg.init_tensor [3, 3] : tensor<3x3xf32>			%fake = linalg.init_tensor [3, 3] : tensor<3x3xf32>
	%init = linalg.init_tensor [1, 2, 2, 1] : tensor<1x2x2x1xf32>			%init = linalg.init_tensor [1, 2, 2, 1] : tensor<1x2x2x1xf32>
	%cst = constant 0.000000e+00 : f32			%cst = constant 0.000000e+00 : f32
	%fill = linalg.fill(%init, %cst) : tensor<1x2x2x1xf32>, f32 -> tensor<1x2x2x1xf32>			%fill = linalg.fill(%init, %cst) : tensor<1x2x2x1xf32>, f32 -> tensor<1x2x2x1xf32>
	%res = linalg.pooling_nhwc_min {dilations = dense<1> : tensor<2xi64>, strides = dense<1> : tensor<2xi64>}			%res = linalg.pooling_nhwc_min {dilations = dense<1> : tensor<2xi64>, strides = dense<1> : tensor<2xi64>}
	ins(%input, %fake: tensor<1x6x6x1xf32>, tensor<3x3xf32>)			ins(%input, %fake: tensor<1x4x4x1xf32>, tensor<3x3xf32>)
	outs(%fill: tensor<1x2x2x1xf32>) -> tensor<1x2x2x1xf32>			outs(%fill: tensor<1x2x2x1xf32>) -> tensor<1x2x2x1xf32>
	return %res : tensor<1x2x2x1xf32>			return %res : tensor<1x2x2x1xf32>
	}			}

	// -----			// -----

	// CHECK-LABEL: func @pooling_nhwc_min			// CHECK-LABEL: func @pooling_nhwc_min
	// CHECK: linalg.pooling_nhwc_min			// CHECK: linalg.pooling_nhwc_min
	// CHECK-SAME: dilations = dense<1> : tensor<2xi64>			// CHECK-SAME: dilations = dense<1> : tensor<2xi64>
	// CHECK-SAME: strides = dense<1> : tensor<2xi64>			// CHECK-SAME: strides = dense<1> : tensor<2xi64>
	// CHECK-SAME: ins(%{{.+}}, %{{.+}} : memref<1x6x6x1xf32>, memref<3x3xf32>)			// CHECK-SAME: ins(%{{.+}}, %{{.+}} : memref<1x4x4x1xf32>, memref<3x3xf32>)
	// CHECK-SAME: outs(%{{.+}} : memref<1x2x2x1xf32>)			// CHECK-SAME: outs(%{{.+}} : memref<1x2x2x1xf32>)
	func @pooling_nhwc_min(%input: memref<1x6x6x1xf32>, %fake: memref<3x3xf32>, %output: memref<1x2x2x1xf32>) {			func @pooling_nhwc_min(%input: memref<1x4x4x1xf32>, %fake: memref<3x3xf32>, %output: memref<1x2x2x1xf32>) {
	linalg.pooling_nhwc_min {dilations = dense<1> : tensor<2xi64>, strides = dense<1> : tensor<2xi64>}			linalg.pooling_nhwc_min {dilations = dense<1> : tensor<2xi64>, strides = dense<1> : tensor<2xi64>}
	ins(%input, %fake: memref<1x6x6x1xf32>, memref<3x3xf32>)			ins(%input, %fake: memref<1x4x4x1xf32>, memref<3x3xf32>)
	outs(%output: memref<1x2x2x1xf32>)			outs(%output: memref<1x2x2x1xf32>)
	return			return
	}			}

mlir/test/Dialect/Linalg/reshape_linearization_fusion.mlir

	// CHECK: func @generic_op_021_permultation_reshape_producer_fusion			// CHECK: func @generic_op_021_permultation_reshape_producer_fusion
	// CHECK-NOT: linalg.tensor_reshape			// CHECK-NOT: linalg.tensor_reshape
	// CHECK: linalg.generic			// CHECK: linalg.generic
	// CHECK-SAME: indexing_maps = [#[[MAP0]], #[[MAP1]]]			// CHECK-SAME: indexing_maps = [#[[MAP0]], #[[MAP1]]]

	// -----			// -----

	#map0 = affine_map<(d0, d1, d2) -> (d0)>			#map0 = affine_map<(d0, d1, d2) -> (d0)>
	#map1 = affine_map<(d0, d1, d2) -> (d1, d2)>			#map1 = affine_map<(d0, d1, d2) -> (d1, d2)>
	#map2 = affine_map<(d0, d1, d2) -> (d1, d2, d0)>			#map2 = affine_map<(d0, d1, d2) -> (d1, d0, d2)>
	#map3 = affine_map<(d0, d1, d2) -> (d0, d1, d2)>			#map3 = affine_map<(d0, d1, d2) -> (d0, d2, d1)>
	func @generic_op_120_permultation_reshape_producer_fusion(%arg0 : tensor<3x35xf32>) -> tensor<5x7x3xf32> {			func @generic_op_120_permutation_reshape_producer_fusion(%arg0 : tensor<3x35xf32>) -> tensor<5x7x3xf32> {
hanchung: Below is more straightforward to me:

#map2 = affine_map<(d0, d1, d2) -> (d2, d0, d1)>
#map3 = affine_map<(d0, d1, d2) -> (d0, d1, d2)>

Does it work?
inho9606: With your suggestion, the indices are less than the sizes, but they do not match. I mean the 3x35 shape matches a 3x18 loop range, and the 5x7x3 shape matches a 4x2x2 loop range. Referring to the case of reshaping 3x35 to 5x3x7 below, map2 matches the input shape and map3 matches the output. In fact, if I modify map2, it does not work, e.g., 3x35 -> 3x44.
hanchung: I do not understand what the issue is here. d0, d1, d2 will map to three numbers. This is just a matter of which symbol maps to which value. I don't see the difference if you swap d1 and d2. I also don't understand what you mean by a 3x18 loop range and a 4x2x2 loop range. Could you explain more? What is the IR if you go with the suggestion?
inho9606: Okay, I removed the 'range -=1' statement according to your comment above (not committed yet), and printed out each rank's size of the shapes and loop ranges. Here is the IR with the suggestion.

inhoseo@inho:~/llvm-project/build/bin$ ./mlir-opt linalg_test.mlir -linalg-fold-reshape-ops-by-linearization -verify-diagnostics
shaped 3 index 3
shaped 5 index 5
shaped 7 index 7
shaped 5 index 5
shaped 7 index 7
shaped 3 index 3
shaped 3 index 3
shaped 35 index 26
shaped 5 index 5
shaped 7 index 3
shaped 3 index 3
#map0 = affine_map<(d0, d1, d2) -> (d1, d0 + d2 * 7)>
#map1 = affine_map<(d0, d1, d2) -> (d0, d1, d2)>
module {
  func @generic_op_120_permultation_reshape_producer_fusion(%arg0: tensor<3x35xf32>) -> tensor<5x7x3xf32> {
    %0 = linalg.init_tensor [5, 7, 3] : tensor<5x7x3xf32>
    %1 = linalg.generic {indexing_maps = [#map0, #map1], iterator_types = ["parallel", "parallel", "parallel"]} ins(%arg0 : tensor<3x35xf32>) outs(%0 : tensor<5x7x3xf32>) {
    ^bb0(%arg1: f32, %arg2: f32):  // no predecessors
      linalg.yield %arg1 : f32
    } -> tensor<5x7x3xf32>
    return %1 : tensor<5x7x3xf32>
  }
}

-convert-linalg-to-loops does not work with other options here.
hanchung:

> Okay, I removed 'range -=1 statement' according to your comment above (not committed yet), and printed out each rank's size of the shapes and loop ranges. Here is IR with the suggestion.

I don't get the reason to remove the 'range -=1' statement. But this seems unrelated; I will take a look once you upload the patch.

> inhoseo@inho:~/llvm-project/build/bin$ ./mlir-opt linalg_test.mlir -linalg-fold-reshape-ops-by-linearization -verify-diagnostics
> shaped 3 index 3
> shaped 5 index 5
> shaped 7 index 7
> shaped 5 index 5
> shaped 7 index 7
> shaped 3 index 3
> shaped 3 index 3
> shaped 35 index 26
> shaped 5 index 5
> shaped 7 index 3
> shaped 3 index 3
> #map0 = affine_map<(d0, d1, d2) -> (d1, d0 + d2 * 7)>
> #map1 = affine_map<(d0, d1, d2) -> (d0, d1, d2)>
> module {
>   func @generic_op_120_permultation_reshape_producer_fusion(%arg0: tensor<3x35xf32>) -> tensor<5x7x3xf32> {
>     %0 = linalg.init_tensor [5, 7, 3] : tensor<5x7x3xf32>
>     %1 = linalg.generic {indexing_maps = [#map0, #map1], iterator_types = ["parallel", "parallel", "parallel"]} ins(%arg0 : tensor<3x35xf32>) outs(%0 : tensor<5x7x3xf32>) {
>     ^bb0(%arg1: f32, %arg2: f32):  // no predecessors
>       linalg.yield %arg1 : f32
>     } -> tensor<5x7x3xf32>
>     return %1 : tensor<5x7x3xf32>
>   }
> }
> -convert-linalg-to-loops does not work with other options here..

I think there are two issues.

1. The input IR is not valid, if I read correctly.

#map2 = affine_map<(d0, d1, d2) -> (d1, d2, d0)>
#map3 = affine_map<(d0, d1, d2) -> (d0, d1, d2)>
%2 = linalg.generic {indexing_maps = [#map2, #map3], iterator_types = ["parallel", "parallel", "parallel"]} ins(%0 : tensor<3x5x7xf32>) outs(%1 : tensor<5x7x3xf32>) {
^bb0(%arg2: f32, %arg3: f32):  // no predecessors
  linalg.yield %arg2 : f32
} -> tensor<5x7x3xf32>

The indexing maps for the generic op should be

#map2 = affine_map<(d0, d1, d2) -> (d2, d0, d1)>
#map3 = affine_map<(d0, d1, d2) -> (d0, d1, d2)>

The indexing maps that you provide are also correct. This is just a loop-order problem.

2. There is probably a bug in the pass. The indexing maps in the output are

#map0 = affine_map<(d0, d1, d2) -> (d1, d0 + d2 * 7)>
#map1 = affine_map<(d0, d1, d2) -> (d0, d1, d2)>
%1 = linalg.generic {indexing_maps = [#map0, #map1], iterator_types = ["parallel", "parallel", "parallel"]} ins(%arg0 : tensor<3x35xf32>) outs(%0 : tensor<5x7x3xf32>) {
^bb0(%arg1: f32, %arg2: f32):  // no predecessors
  linalg.yield %arg1 : f32
} -> tensor<5x7x3xf32>

while I think they should be

#map0 = affine_map<(d0, d1, d2) -> (d2, d1 + d0 * 7)>
#map1 = affine_map<(d0, d1, d2) -> (d0, d1, d2)>

With your fix, you are hiding the issue in (2). I think we can go with your fix. Then Mahesh or I will fix the bug and add one more test for it.

@mravishankar fyi, the input of the test case was wrong I think.
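The difference between the two candidate input maps in issue (2) can be checked by hand. The sketch below assumes loop extents `(d0, d1, d2) = (5, 7, 3)`, read off the `5x7x3` output through the identity `#map1`; it does not try to reproduce the debug log above. `passMap`, `reviewerMap`, and `inferredExtents` are hypothetical names.

```cpp
#include <array>
#include <cassert>
#include <cstdint>

using LoopPoint = std::array<int64_t, 3>; // (d0, d1, d2)
using Index2 = std::array<int64_t, 2>;

// Map printed by the pass: (d0, d1, d2) -> (d1, d0 + d2 * 7).
Index2 passMap(const LoopPoint &d) {
  Index2 result = {{d[1], d[0] + d[2] * 7}};
  return result;
}

// Map the reviewer expects: (d0, d1, d2) -> (d2, d1 + d0 * 7).
Index2 reviewerMap(const LoopPoint &d) {
  Index2 result = {{d[2], d[1] + d[0] * 7}};
  return result;
}

// Evaluates a map at the last iteration of each loop and adds 1, giving
// the operand extent that the map implies.
Index2 inferredExtents(Index2 (*map)(const LoopPoint &),
                       const LoopPoint &extents) {
  LoopPoint last = extents;
  for (int64_t &d : last) {
    assert(d > 0 && "loop extents must be positive");
    d -= 1;
  }
  const Index2 index = map(last);
  Index2 result = {{index[0] + 1, index[1] + 1}};
  return result;
}
```

Under that assumption the pass's map implies a `7x19` operand, while the reviewer's map implies `3x35`, which is the actual type of `%arg0`.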
hanchung: Please help fix this typo since you touch this section: s/permultation/permutation

inho9606: Could you give me more detail on what you meant by 's/permultation/permutation'?

hanchung: Oops, I meant to replace `permultation` with `permutation`.
	%0 = linalg.tensor_reshape %arg0 [#map0, #map1] : tensor<3x35xf32> into tensor<3x5x7xf32>			%0 = linalg.tensor_reshape %arg0 [#map0, #map1] : tensor<3x35xf32> into tensor<3x5x7xf32>
	%1 = linalg.init_tensor [5, 7, 3] : tensor<5x7x3xf32>			%1 = linalg.init_tensor [5, 7, 3] : tensor<5x7x3xf32>
	%2 = linalg.generic			%2 = linalg.generic
	{indexing_maps = [#map2, #map3],			{indexing_maps = [#map2, #map3],
	iterator_types = ["parallel", "parallel", "parallel"]}			iterator_types = ["parallel", "parallel", "parallel"]}
	ins(%0 : tensor<3x5x7xf32>) outs(%1 : tensor<5x7x3xf32>) {			ins(%0 : tensor<3x5x7xf32>) outs(%1 : tensor<5x7x3xf32>) {
	^bb0(%arg2: f32, %arg3: f32): // no predecessors			^bb0(%arg2: f32, %arg3: f32): // no predecessors
	linalg.yield %arg2 : f32			linalg.yield %arg2 : f32
	} -> tensor<5x7x3xf32>			} -> tensor<5x7x3xf32>
	return %2 : tensor<5x7x3xf32>			return %2 : tensor<5x7x3xf32>
	}			}

	// CHECK-DAG: #[[MAP0:.+]] = affine_map<(d0, d1, d2) -> (d2, d0 * 7 + d1)>			// CHECK-DAG: #[[MAP0:.+]] = affine_map<(d0, d1, d2) -> (d1, d0 * 7 + d2)>
	// CHECK-DAG: #[[MAP1:.+]] = affine_map<(d0, d1, d2) -> (d0, d1, d2)>			// CHECK-DAG: #[[MAP1:.+]] = affine_map<(d0, d1, d2) -> (d0, d2, d1)>
	// CHECK: func @generic_op_120_permultation_reshape_producer_fusion			// CHECK: func @generic_op_120_permutation_reshape_producer_fusion
	// CHECK-NOT: linalg.tensor_reshape			// CHECK-NOT: linalg.tensor_reshape
	// CHECK: linalg.generic			// CHECK: linalg.generic
	// CHECK-SAME: indexing_maps = [#[[MAP0]], #[[MAP1]]]			// CHECK-SAME: indexing_maps = [#[[MAP0]], #[[MAP1]]]

	// -----			// -----

	#map0 = affine_map<(d0, d1, d2) -> (d0)>			#map0 = affine_map<(d0, d1, d2) -> (d0)>
	#map1 = affine_map<(d0, d1, d2) -> (d1, d2)>			#map1 = affine_map<(d0, d1, d2) -> (d1, d2)>

mlir/test/Dialect/Linalg/sparse_nd.mlir

Show All 15 Lines	#trait_mul = {
],		],
iterator_types = ["parallel", "parallel", "parallel", "parallel",		iterator_types = ["parallel", "parallel", "parallel", "parallel",
"parallel", "parallel", "parallel", "parallel"],		"parallel", "parallel", "parallel", "parallel"],
doc = "X(i,j,k,l,m,n,o,p) = A(i,j,k,l,m,n,o,p) * B(p,o,n,m,l,k,j,i)"		doc = "X(i,j,k,l,m,n,o,p) = A(i,j,k,l,m,n,o,p) * B(p,o,n,m,l,k,j,i)"
}		}

// CHECK-LABEL: func @mul(		// CHECK-LABEL: func @mul(
// CHECK-SAME: %[[VAL_0:.*0]]: tensor<10x20x30x40x50x60x70x80xf32>,		// CHECK-SAME: %[[VAL_0:.*0]]: tensor<10x20x30x40x50x60x70x80xf32>,
// CHECK-SAME: %[[VAL_1:.*1]]: tensor<10x20x30x40x50x60x70x80xf32>,		// CHECK-SAME: %[[VAL_1:.*1]]: tensor<80x70x60x50x40x30x20x10xf32>,
// CHECK-SAME: %[[VAL_2:.*2]]: tensor<10x20x30x40x50x60x70x80xf32>) -> tensor<10x20x30x40x50x60x70x80xf32> {		// CHECK-SAME: %[[VAL_2:.*2]]: tensor<10x20x30x40x50x60x70x80xf32>) -> tensor<10x20x30x40x50x60x70x80xf32> {
// CHECK: %[[VAL_3:.*]] = constant 3 : index		// CHECK: %[[VAL_3:.*]] = constant 3 : index
// CHECK: %[[VAL_4:.*]] = constant 4 : index		// CHECK: %[[VAL_4:.*]] = constant 4 : index
// CHECK: %[[VAL_5:.*]] = constant 10 : index		// CHECK: %[[VAL_5:.*]] = constant 10 : index
// CHECK: %[[VAL_6:.*]] = constant 20 : index		// CHECK: %[[VAL_6:.*]] = constant 20 : index
// CHECK: %[[VAL_7:.*]] = constant 30 : index		// CHECK: %[[VAL_7:.*]] = constant 30 : index
// CHECK: %[[VAL_8:.*]] = constant 60 : index		// CHECK: %[[VAL_8:.*]] = constant 60 : index
// CHECK: %[[VAL_9:.*]] = constant 70 : index		// CHECK: %[[VAL_9:.*]] = constant 70 : index
// CHECK: %[[VAL_10:.*]] = constant 80 : index		// CHECK: %[[VAL_10:.*]] = constant 80 : index
// CHECK: %[[VAL_11:.*]] = constant 0 : index		// CHECK: %[[VAL_11:.*]] = constant 0 : index
// CHECK: %[[VAL_12:.*]] = constant 1 : index		// CHECK: %[[VAL_12:.*]] = constant 1 : index
// CHECK: %[[VAL_13:.*]] = memref.buffer_cast %[[VAL_0]] : memref<10x20x30x40x50x60x70x80xf32>		// CHECK: %[[VAL_13:.*]] = memref.buffer_cast %[[VAL_0]] : memref<10x20x30x40x50x60x70x80xf32>
// CHECK: %[[VAL_14:.*]] = linalg.sparse_pointers %[[VAL_1]], %[[VAL_3]] : tensor<10x20x30x40x50x60x70x80xf32> to memref<?xindex>		// CHECK: %[[VAL_14:.*]] = linalg.sparse_pointers %[[VAL_1]], %[[VAL_3]] : tensor<80x70x60x50x40x30x20x10xf32> to memref<?xindex>
// CHECK: %[[VAL_15:.*]] = linalg.sparse_indices %[[VAL_1]], %[[VAL_3]] : tensor<10x20x30x40x50x60x70x80xf32> to memref<?xindex>		// CHECK: %[[VAL_15:.*]] = linalg.sparse_indices %[[VAL_1]], %[[VAL_3]] : tensor<80x70x60x50x40x30x20x10xf32> to memref<?xindex>
// CHECK: %[[VAL_16:.*]] = linalg.sparse_pointers %[[VAL_1]], %[[VAL_4]] : tensor<10x20x30x40x50x60x70x80xf32> to memref<?xindex>		// CHECK: %[[VAL_16:.*]] = linalg.sparse_pointers %[[VAL_1]], %[[VAL_4]] : tensor<80x70x60x50x40x30x20x10xf32> to memref<?xindex>
// CHECK: %[[VAL_17:.*]] = linalg.sparse_indices %[[VAL_1]], %[[VAL_4]] : tensor<10x20x30x40x50x60x70x80xf32> to memref<?xindex>		// CHECK: %[[VAL_17:.*]] = linalg.sparse_indices %[[VAL_1]], %[[VAL_4]] : tensor<80x70x60x50x40x30x20x10xf32> to memref<?xindex>
// CHECK: %[[VAL_18:.*]] = linalg.sparse_values %[[VAL_1]] : tensor<10x20x30x40x50x60x70x80xf32> to memref<?xf32>		// CHECK: %[[VAL_18:.*]] = linalg.sparse_values %[[VAL_1]] : tensor<80x70x60x50x40x30x20x10xf32> to memref<?xf32>
// CHECK: %[[VAL_19:.*]] = memref.buffer_cast %[[VAL_2]] : memref<10x20x30x40x50x60x70x80xf32>		// CHECK: %[[VAL_19:.*]] = memref.buffer_cast %[[VAL_2]] : memref<10x20x30x40x50x60x70x80xf32>
// CHECK: %[[VAL_20:.*]] = memref.alloc() : memref<10x20x30x40x50x60x70x80xf32>		// CHECK: %[[VAL_20:.*]] = memref.alloc() : memref<10x20x30x40x50x60x70x80xf32>
// CHECK: linalg.copy(%[[VAL_19]], %[[VAL_20]]) : memref<10x20x30x40x50x60x70x80xf32>, memref<10x20x30x40x50x60x70x80xf32>		// CHECK: linalg.copy(%[[VAL_19]], %[[VAL_20]]) : memref<10x20x30x40x50x60x70x80xf32>, memref<10x20x30x40x50x60x70x80xf32>
// CHECK: scf.for %[[VAL_21:.*]] = %[[VAL_11]] to %[[VAL_10]] step %[[VAL_12]] {		// CHECK: scf.for %[[VAL_21:.*]] = %[[VAL_11]] to %[[VAL_10]] step %[[VAL_12]] {
// CHECK: scf.for %[[VAL_22:.*]] = %[[VAL_11]] to %[[VAL_9]] step %[[VAL_12]] {		// CHECK: scf.for %[[VAL_22:.*]] = %[[VAL_11]] to %[[VAL_9]] step %[[VAL_12]] {
// CHECK: %[[VAL_23:.*]] = muli %[[VAL_21]], %[[VAL_9]] : index		// CHECK: %[[VAL_23:.*]] = muli %[[VAL_21]], %[[VAL_9]] : index
// CHECK: %[[VAL_24:.*]] = addi %[[VAL_23]], %[[VAL_22]] : index		// CHECK: %[[VAL_24:.*]] = addi %[[VAL_23]], %[[VAL_22]] : index
// CHECK: scf.for %[[VAL_25:.*]] = %[[VAL_11]] to %[[VAL_8]] step %[[VAL_12]] {		// CHECK: scf.for %[[VAL_25:.*]] = %[[VAL_11]] to %[[VAL_8]] step %[[VAL_12]] {
[... 29 lines not shown ...]
// CHECK: }		// CHECK: }
// CHECK: }		// CHECK: }
// CHECK: }		// CHECK: }
// CHECK: }		// CHECK: }
// CHECK: %[[VAL_50:.*]] = memref.tensor_load %[[VAL_20]] : memref<10x20x30x40x50x60x70x80xf32>		// CHECK: %[[VAL_50:.*]] = memref.tensor_load %[[VAL_20]] : memref<10x20x30x40x50x60x70x80xf32>
// CHECK: return %[[VAL_50]] : tensor<10x20x30x40x50x60x70x80xf32>		// CHECK: return %[[VAL_50]] : tensor<10x20x30x40x50x60x70x80xf32>
// CHECK: }		// CHECK: }
func @mul(%arga: tensor<10x20x30x40x50x60x70x80xf32>,		func @mul(%arga: tensor<10x20x30x40x50x60x70x80xf32>,
%argb: tensor<10x20x30x40x50x60x70x80xf32>,		%argb: tensor<80x70x60x50x40x30x20x10xf32>,
%argx: tensor<10x20x30x40x50x60x70x80xf32>)		%argx: tensor<10x20x30x40x50x60x70x80xf32>)
-> tensor<10x20x30x40x50x60x70x80xf32> {		-> tensor<10x20x30x40x50x60x70x80xf32> {
%0 = linalg.generic #trait_mul		%0 = linalg.generic #trait_mul
ins(%arga, %argb: tensor<10x20x30x40x50x60x70x80xf32>,		ins(%arga, %argb: tensor<10x20x30x40x50x60x70x80xf32>,
tensor<10x20x30x40x50x60x70x80xf32>)		tensor<80x70x60x50x40x30x20x10xf32>)
outs(%argx: tensor<10x20x30x40x50x60x70x80xf32>) {		outs(%argx: tensor<10x20x30x40x50x60x70x80xf32>) {
^bb(%a: f32, %b: f32, %x: f32):		^bb(%a: f32, %b: f32, %x: f32):
%0 = mulf %a, %b : f32		%0 = mulf %a, %b : f32
linalg.yield %0 : f32		linalg.yield %0 : f32
} -> tensor<10x20x30x40x50x60x70x80xf32>		} -> tensor<10x20x30x40x50x60x70x80xf32>
return %0 : tensor<10x20x30x40x50x60x70x80xf32>		return %0 : tensor<10x20x30x40x50x60x70x80xf32>
}		}
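
Archive note: the new shape of %argb matches the access pattern in the trait's doc string, B(p,o,n,m,l,k,j,i). The following is a minimal sketch of that arithmetic, not the verifier code from this patch. It assumes the loop ranges come from A's static shape (A is accessed with the identity order) and that B's map is a pure permutation; `permuted_shape` is a hypothetical helper name, not an MLIR API.

```python
def permuted_shape(loop_ranges, permutation):
    """Shape an operand needs when its dimension d is indexed by loop permutation[d]."""
    return [loop_ranges[p] for p in permutation]


# Loop ranges for (i, j, k, l, m, n, o, p), assumed to come from A's static shape.
loop_ranges = [10, 20, 30, 40, 50, 60, 70, 80]

# B is accessed as B(p, o, n, m, l, k, j, i), i.e. the loops in reversed order.
b_map = [7, 6, 5, 4, 3, 2, 1, 0]

required_b = permuted_shape(loop_ranges, b_map)

# Shape of %argb before this revision.
old_b = [10, 20, 30, 40, 50, 60, 70, 80]

# Dimensions of the old shape that are too small for the largest index accessed.
violations = [d for d, (have, need) in enumerate(zip(old_b, required_b)) if have < need]

print(required_b)
print(violations)
```

Under these assumptions the required shape is the reversed one used on the right-hand side of the diff, and the old shape is too small in its first four dimensions.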

mlir/test/Dialect/Linalg/tile-indexed-generic.mlir

[... 48 lines not shown ...]
#combined_indices_trait = {		#combined_indices_trait = {
args_in = 1,		args_in = 1,
args_out = 1,		args_out = 1,
indexing_maps = [		indexing_maps = [
affine_map<(i, j) -> (j, i + j)>,		affine_map<(i, j) -> (j, i + j)>,
affine_map<(i, j) -> (i, j)>		affine_map<(i, j) -> (i, j)>
],		],
iterator_types = ["parallel", "parallel"]		iterator_types = ["parallel", "parallel"]
}		}
func @indexed_generic_matrix(%operand: memref<50x100xf32>, %result: memref<50x100xf32>) {		func @indexed_generic_matrix(%operand: memref<50x99xf32>, %result: memref<50x50xf32>) {
linalg.indexed_generic #combined_indices_trait		linalg.indexed_generic #combined_indices_trait
ins(%operand : memref<50x100xf32>)		ins(%operand : memref<50x99xf32>)
outs(%result : memref<50x100xf32>) {		outs(%result : memref<50x50xf32>) {
^bb0(%i: index, %j: index, %operand_in: f32, %result_in: f32):		^bb0(%i: index, %j: index, %operand_in: f32, %result_in: f32):
%i_int = index_cast %i: index to i32		%i_int = index_cast %i: index to i32
%i_float = sitofp %i_int : i32 to f32		%i_float = sitofp %i_int : i32 to f32
%j_int = index_cast %j: index to i32		%j_int = index_cast %j: index to i32
%j_float = sitofp %j_int : i32 to f32		%j_float = sitofp %j_int : i32 to f32
%out = addf %i_float, %j_float : f32		%out = addf %i_float, %j_float : f32
linalg.yield %out : f32		linalg.yield %out : f32
}		}
[... 9 lines not shown ...]
// TILE-10n25: %[[NEW_I:.*]] = addi %[[I]], %[[K]] : index		// TILE-10n25: %[[NEW_I:.*]] = addi %[[I]], %[[K]] : index
// TILE-10n25: %[[NEW_J:.*]] = addi %[[J]], %[[L]] : index		// TILE-10n25: %[[NEW_J:.*]] = addi %[[J]], %[[L]] : index
// TILE-10n25: %[[NEW_INT_I:.*]] = index_cast %[[NEW_I]] : index to i32		// TILE-10n25: %[[NEW_INT_I:.*]] = index_cast %[[NEW_I]] : index to i32
// TILE-10n25: %[[NEW_FLOAT_I:.*]] = sitofp %[[NEW_INT_I]] : i32 to f32		// TILE-10n25: %[[NEW_FLOAT_I:.*]] = sitofp %[[NEW_INT_I]] : i32 to f32
// TILE-10n25: %[[NEW_INT_J:.*]] = index_cast %[[NEW_J]] : index to i32		// TILE-10n25: %[[NEW_INT_J:.*]] = index_cast %[[NEW_J]] : index to i32
// TILE-10n25: %[[NEW_FLOAT_J:.*]] = sitofp %[[NEW_INT_J]] : i32 to f32		// TILE-10n25: %[[NEW_FLOAT_J:.*]] = sitofp %[[NEW_INT_J]] : i32 to f32
// TILE-10n25: %[[OUT:.*]] = addf %[[NEW_FLOAT_I]], %[[NEW_FLOAT_J]] : f32		// TILE-10n25: %[[OUT:.*]] = addf %[[NEW_FLOAT_I]], %[[NEW_FLOAT_J]] : f32

// TILE-25n0-LABEL: func @indexed_generic_matrix		// TILE-25n0-LABEL: func @indexed_generic_matrix
		hanchung: Accidental change? Please revert it.
		inho9606: Ah, I often use the Tab key to move around the screen, but sometimes it gets typed as a character and I easily miss it. I need to be more careful. Anyway, thanks for letting me know!
// TILE-25n0: %[[C25:.*]] = constant 25 : index		// TILE-25n0: %[[C25:.*]] = constant 25 : index
// TILE-25n0: scf.for %[[L:.]] = {{.}} step %[[C25]]		// TILE-25n0: scf.for %[[L:.]] = {{.}} step %[[C25]]
// TILE-25n0: linalg.indexed_generic		// TILE-25n0: linalg.indexed_generic
// TILE-25n0: ^bb0(%[[I:.]]: index, %[[J:.]]: index, %[[IN:.]]: f32, %[[OUT:.]]: f32):		// TILE-25n0: ^bb0(%[[I:.]]: index, %[[J:.]]: index, %[[IN:.]]: f32, %[[OUT:.]]: f32):
// TILE-25n0: %[[NEW_I:.*]] = addi %[[I]], %[[L]] : index		// TILE-25n0: %[[NEW_I:.*]] = addi %[[I]], %[[L]] : index
// TILE-25n0: %[[NEW_INT_I:.*]] = index_cast %[[NEW_I]] : index to i32		// TILE-25n0: %[[NEW_INT_I:.*]] = index_cast %[[NEW_I]] : index to i32
// TILE-25n0: %[[NEW_FLOAT_I:.*]] = sitofp %[[NEW_INT_I]] : i32 to f32		// TILE-25n0: %[[NEW_FLOAT_I:.*]] = sitofp %[[NEW_INT_I]] : i32 to f32
// TILE-25n0: %[[INT_J:.*]] = index_cast %[[J]] : index to i32		// TILE-25n0: %[[INT_J:.*]] = index_cast %[[J]] : index to i32
[... 14 lines not shown ...]
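
Archive note: the new shapes in @indexed_generic_matrix are consistent with the indexing map (i, j) -> (j, i + j). The sketch below works through that arithmetic under stated assumptions: loops run over [0, range), every map result is a sum of loops with non-negative integer coefficients, and the loop ranges are taken from the result memref. `min_required_shape` is a hypothetical name, and this is not the implementation reviewed in this revision.

```python
def min_required_shape(exprs, loop_ranges):
    """Smallest static shape that keeps every access in bounds.

    Each expression is a list of non-negative integer coefficients, one per
    loop. With loops running over [0, range), the largest index is reached
    when every loop sits at range - 1.
    """
    shape = []
    for coeffs in exprs:
        max_index = sum(c * (r - 1) for c, r in zip(coeffs, loop_ranges))
        shape.append(max_index + 1)
    return shape


operand_map = [[0, 1], [1, 1]]  # (i, j) -> (j, i + j)
result_map = [[1, 0], [0, 1]]   # (i, j) -> (i, j)

# New test: loop ranges assumed from the result memref<50x50xf32>.
new_ranges = [50, 50]
new_operand_shape = min_required_shape(operand_map, new_ranges)
new_result_shape = min_required_shape(result_map, new_ranges)

# Old test: loop ranges assumed from the result memref<50x100xf32>.
old_ranges = [50, 100]
old_operand_shape = min_required_shape(operand_map, old_ranges)

print(new_operand_shape, new_result_shape, old_operand_shape)
```

With ranges of 50 and 50 the operand needs at least 50x99 elements, which is the shape the updated test uses. With the old ranges of 50 and 100 it would need 100x149, so the original memref<50x100xf32> operand was accessed out of bounds.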

Okay, I removed the 'range -= 1' statement as you suggested above (not committed yet) and printed out the size of each dimension of the shapes and the loop ranges. Here is the IR with the suggestion.

}

}

Revision Contents

Diff 331799

mlir/lib/Dialect/Linalg/IR/LinalgInterfaces.cpp

mlir/test/Dialect/Linalg/fusion-2-level.mlir

mlir/test/Dialect/Linalg/generalize-named-ops.mlir

mlir/test/Dialect/Linalg/invalid.mlir

mlir/test/Dialect/Linalg/named-ops.mlir

mlir/test/Dialect/Linalg/reshape_linearization_fusion.mlir

mlir/test/Dialect/Linalg/sparse_nd.mlir

mlir/test/Dialect/Linalg/tile-indexed-generic.mlir
