This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
mlir/
-
docs/Dialects/
-
Dialects/
11/12
Linalg.md
-
include/mlir/
-
mlir/
-
Dialect/
-
Linalg/IR/
-
IR/
-
LinalgOps.h
-
LinalgStructuredOps.td
1/1
LinalgStructuredOpsInterface.td
3/3
LinalgTraits.h
-
Shape/IR/
-
IR/
-
ShapeBase.td
-
IR/
-
OpBase.td
-
integration_test/Dialect/Linalg/CPU/
-
Dialect/
-
Linalg/
-
CPU/
-
test-conv-1d-call.mlir
2/2
test-conv-1d-ncw-call.mlir
-
test-conv-1d-nwc-call.mlir
-
test-conv-2d-call.mlir
-
test-conv-2d-nchw-call.mlir
-
test-conv-2d-nhwc-call.mlir
-
test-conv-3d-call.mlir
-
test-conv-3d-ncdhw-call.mlir
-
test-conv-3d-ndhwc-call.mlir
-
lib/Dialect/Linalg/IR/
-
Dialect/
-
Linalg/
-
IR/
4/4
LinalgOps.cpp
-
LinalgTypes.cpp
-
test/
-
Conversion/LinalgToVector/
-
LinalgToVector/
-
linalg-to-vector.mlir
-
Dialect/Linalg/
-
Linalg/
-
affine.mlir
-
canonicalize.mlir
-
fold-affine-min-scf.mlir
-
fusion-2-level.mlir
-
fusion.mlir
-
invalid.mlir
-
loops.mlir
-
promote.mlir
-
promotion_options.mlir
-
roundtrip.mlir
-
standard.mlir
-
tile-and-distribute.mlir
-
tile.mlir
-
tile_parallel_reduce.mlir
-
transform-patterns-matmul-to-vector.mlir
-
transform-patterns.mlir
-
IR/
-
slice.mlir
-
lib/Dialect/Test/
-
Dialect/
-
Test/
-
TestOps.td
-
mlir-cpu-runner/
-
linalg_integration_test.mlir
-
mlir-linalg-ods-gen/
-
test-linalg-ods-gen.tc
-
tools/mlir-linalg-ods-gen/
-
mlir-linalg-ods-gen/
-
mlir-linalg-ods-gen.cpp

Differential D87767

[mlir][Linalg] Evolve named ops to use assembly form and support linalg on tensors.
AbandonedPublic

Authored by nicolasvasilache on Sep 16 2020, 8:08 AM.

Download Raw Diff

Details

Reviewers

ftynse
pifon2a
mravishankar
stellaraccident
silvas
benvanik
herhut
rriddle
antiagainst
aartbik
jpienaar
burmako

Summary

This revision allows representing a reduction at the level of linalg on tensors for named ops. When a structured op has a reduction and returns tensor(s), new conventions are added and documented.

As an illustration, the syntax for a linalg.matmul writing into a buffer is:

linalg.matmul ins(%a, %b : memref<?x?xf32>, tensor<?x?xf32>)
             outs(%c : memref<?x?xf32>)

, whereas the syntax for a linalg.matmul returning a new tensor is:

%d = linalg.matmul ins(%a, %b : tensor<?x?xf32>, memref<?x?xf32>)
                  init(%c : memref<?x?xf32>)
                    -> tensor<?x?xf32>

Other parts of linalg will be extended accordingly to allow mixed buffer/tensor semantics in the presence of reductions.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

nicolasvasilache created this revision.Sep 16 2020, 8:08 AM

Herald added a reviewer: rriddle. · View Herald TranscriptSep 16 2020, 8:08 AM

Herald added a reviewer: antiagainst. · View Herald Transcript

Herald added a reviewer: aartbik. · View Herald Transcript

Herald added a reviewer: jpienaar. · View Herald Transcript

Herald added a reviewer: aartbik. · View Herald Transcript

Herald added a project: Restricted Project. · View Herald Transcript

Herald added subscribers: tatianashp, msifontes, jurahul and 13 others. · View Herald Transcript

nicolasvasilache requested review of this revision.Sep 16 2020, 8:08 AM

Herald added subscribers: limo1996, stephenneuendorffer. · View Herald TranscriptSep 16 2020, 8:08 AM

Harbormaster completed remote builds in B71881: Diff 292227.Sep 16 2020, 8:09 AM

nicolasvasilache mentioned this in D87776: [mlir][ODS] Add TypeRef directive in Declarative Assembly Format to allow custom UserDirective parser to receive previously parsed types..Sep 16 2020, 10:10 AM

ftynse accepted this revision.Sep 17 2020, 6:07 AM

ftynse added inline comments.

mlir/docs/Dialects/Linalg.md
495	Does this support the change in elemental types? Otherwise it's not only the same shape, but the types must match completely.
533	This should be a tensor. If we expect it to be strictly the same type, we can also omit the type here.
mlir/include/mlir/Dialect/Linalg/IR/LinalgStructuredOpsInterface.td
31	Can this rather be in `extraClassDeclaraiton` or, even better, a static function in the C++ implementation file? It does not look like this can ever have a non-default implementation so why pay the cost of making it "virtual"?
mlir/include/mlir/Dialect/Linalg/IR/LinalgTraits.h
75	`ins`, `outs` and `init` ?
84	Nit: traits seem to be using singular in their class names, i.e. `NamedStructuredOpTrait`
109	Nit: `init_tensors` does not appear in the IR, did you mean `init` ?
mlir/lib/Dialect/Linalg/IR/LinalgOps.cpp
307–324	I don't understand why is this necessary.
1261–1266	Nit: now that you take an `OpBuidler`, I'd advise to use `OpBuilder::createBlock` instead.
1316–1318	Nit: I'd expect Twine or formatv to be more efficient than stitching std strings

Tmp OpBuilder creation.

Harbormaster completed remote builds in B72017: Diff 292504.Sep 17 2020, 7:53 AM

Address review

mlir/docs/Dialects/Linalg.md
533	let's remove later when we have some experience using it if it feels too redundant.
mlir/lib/Dialect/Linalg/IR/LinalgOps.cpp
307–324	the goal is to make everyone use the same verifiers but atm Generic and IndexedGeneric have this `view` property that I need to kill. This will be done in a followup and then all can be unified.

nicolasvasilache added a reviewer: burmako.Sep 17 2020, 8:52 AM

Harbormaster completed remote builds in B72026: Diff 292525.Sep 17 2020, 9:03 AM

Better building of block.

Harbormaster completed remote builds in B72033: Diff 292544.Sep 17 2020, 9:49 AM

Adding some comments. Looking through this right now.

mlir/docs/Dialects/Linalg.md
488	This seems to be more complex because the effort is being made to mix tensor and buffer semantics. Is it possible for the time being to just keep them completely separate (at least by convention).
495	Since we are going this route, why not just add a new region to the named op that describes the computation to generate the `init` tensor. This region has the same semantics as a `linalg.generic/linalg.indexed_generic` op?
525	This seems to deviate from the existing form of ops in MLIR, i.e. linalg.matmul ins(%a : memref<?x?xf32>, %b : tensor<?x?xf32>)...

Fix incorrect OpBuilder state with an InsertionGuard.

mravishankar added inline comments.Sep 17 2020, 11:57 AM

mlir/docs/Dialects/Linalg.md

495

To get a little more specific, we can do

<linalg named-op> (ins ...) (outs ...)
    init (%init : tensor<...f32>) {
    ^bb0(%arg0 : f32) :
         linalg.yield %arg0 : f32
    }

and that could generalize to

<linalg named-op> (ins ...) (outs ...)
    init (%a : tensor<...f32>, %b : tensor<...f32>) {
    ^bb0(%arg0 : f32, %arg1 : f32) :
         %0 = std.addf %arg0, %arg1 : f32
         linalg.yield %0 : f32
    }

where the initialization is done via a computation.

mlir/integration_test/Dialect/Linalg/CPU/test-conv-1d-ncw-call.mlir

super nit: align ins and outs ?

Harbormaster completed remote builds in B72053: Diff 292574.Sep 17 2020, 11:58 AM

nicolasvasilache marked 3 inline comments as done.Sep 17 2020, 12:07 PM

nicolasvasilache added inline comments.

mlir/docs/Dialects/Linalg.md
488	The bigger underlying achievement allowed by this convention is that we have named ops that are automatically generated from the TC spec and work with either tensors and buffers. This was the key blocker to scaling these concepts and was introduced by the args_in / args_out that will be removed entirely in a followup commit. Making them work with mixed tensor / buffer does not brings real additional complexity: if you want buffer only you'd still use `ins/outs`; if you want tensors only you'd still use `ins/result` for pointwise and `ins/init/result` for reductions.
495	That's significantly more work and out of the scope of this CL. For instance, I do not know yet how to make it work with the TC lang. Another tricky point is what would the init region look like and expand in higher-D tensors (e.g. imagine a 3-D 2x2x2 tensor, how does a region encode a 2-D tensor broadcast along some dim)? It would seem the `indexed_generic` would be required. Still it is an orthogonal improvement that can we can table into a separate discussion once the existing dead-end state is improved.
525	Yes, this is unfortunate and seems to be a byproduct of using the declarative assembly format. I do not know how to make it generate interleaved types and uses. If/when it is available we should go to that. As an illustration, note that ReturnOp uses the declarative assembly but FuncOp has a custom handwritten parser.

nicolasvasilache marked 4 inline comments as done.Sep 17 2020, 12:13 PM

nicolasvasilache added inline comments.

mlir/docs/Dialects/Linalg.md
495	Looks nice! I still view it as future improvement though so I'd keep it for a separate PR once the generic ops are up to speed too :) The args_in / args_out has to be deprecated with fire first.
mlir/integration_test/Dialect/Linalg/CPU/test-conv-1d-ncw-call.mlir
34	I aligned the parenthesis consistently throughout the examples. I'm somewhat reluctant to change all tests to align the `i`s and the `o`s now :)

InsertionGuard.

Harbormaster completed remote builds in B72058: Diff 292585.Sep 17 2020, 12:30 PM

Spin a custom parser as declarative assembly extension is blocked for now.

DCE

Drop extra dialect name form printing.

Harbormaster completed remote builds in B72150: Diff 292735.Sep 18 2020, 3:18 AM

Harbormaster completed remote builds in B72151: Diff 292736.Sep 18 2020, 3:34 AM

Harbormaster completed remote builds in B72154: Diff 292745.

Looks good to me for a first draft.

mlir/docs/Dialects/Linalg.md
495	Thanks! Thinking a bit more about this, I think its fine to go with juts an `init` tensor you have. I was only trying to handle the case where the initialization is done by a scalar value, but you could use a `fill` operaiton for this.

burmako accepted this revision.Sep 18 2020, 9:31 AM

nicolasvasilache mentioned this in D87938: [mlir][Linalg] Uniformize linalg.generic with named ops..Sep 21 2020, 8:36 AM

Wasn't this submitted?

Landed in 93fd30bac3345fea4f5beba3241f1ef4f2f5f419.

nicolasvasilache abandoned this revision.Oct 2 2020, 1:29 AM

Revision Contents

Path

Size

mlir/

docs/

Dialects/

Linalg.md

81 lines

include/

mlir/

Dialect/

Linalg/

IR/

LinalgOps.h

6 lines

LinalgStructuredOps.td

20 lines

LinalgStructuredOpsInterface.td

76 lines

LinalgTraits.h

74 lines

Shape/

IR/

ShapeBase.td

2 lines

IR/

OpBase.td

4 lines

integration_test/

Dialect/

Linalg/

CPU/

test-conv-1d-call.mlir

3 lines

test-conv-1d-ncw-call.mlir

3 lines

test-conv-1d-nwc-call.mlir

3 lines

test-conv-2d-call.mlir

3 lines

test-conv-2d-nchw-call.mlir

3 lines

test-conv-2d-nhwc-call.mlir

3 lines

test-conv-3d-call.mlir

3 lines

test-conv-3d-ncdhw-call.mlir

3 lines

test-conv-3d-ndhwc-call.mlir

3 lines

lib/

Dialect/

Linalg/

IR/

LinalgOps.cpp

191 lines

LinalgTypes.cpp

3 lines

test/

Conversion/

LinalgToVector/

linalg-to-vector.mlir

3 lines

Dialect/

Linalg/

affine.mlir

6 lines

canonicalize.mlir

5 lines

fold-affine-min-scf.mlir

3 lines

6 lines

176 lines

59 lines

27 lines

26 lines

promotion_options.mlir

7 lines

roundtrip.mlir

64 lines

standard.mlir

6 lines

tile-and-distribute.mlir

42 lines

tile.mlir

77 lines

tile_parallel_reduce.mlir

10 lines

transform-patterns-matmul-to-vector.mlir

24 lines

transform-patterns.mlir

97 lines

IR/

slice.mlir

6 lines

lib/

Dialect/

Test/

TestOps.td

2 lines

mlir-cpu-runner/

linalg_integration_test.mlir

6 lines

mlir-linalg-ods-gen/

test-linalg-ods-gen.tc

18 lines

tools/

mlir-linalg-ods-gen/

mlir-linalg-ods-gen.cpp

136 lines

Diff 292525

mlir/docs/Dialects/Linalg.md

Show All 34 Lines

## High-Level Description of Linalg Ops<a name="linalg_ops"></a>		## High-Level Description of Linalg Ops<a name="linalg_ops"></a>
Linalg takes at least some inspiration from all previously [listed prior		Linalg takes at least some inspiration from all previously [listed prior
art](#prior_art). The design enables the definition of *CustomOps* with		art](#prior_art). The design enables the definition of *CustomOps* with
generic properties that enable [key transformations](#key_transformations),		generic properties that enable [key transformations](#key_transformations),
including lowering to scalar load/store and other operations or to external		including lowering to scalar load/store and other operations or to external
library calls and intrinsics.		library calls and intrinsics.

These ops can have *either tensor or buffer operands*.		These ops can have *either tensor or buffer operands*, subject to
		[conventions and limitations](#tensors_and_buffers).

### Payload-Carrying Ops<a name="payload_ops"></a>		### Payload-Carrying Ops<a name="payload_ops"></a>
Linalg defines two payload carrying operations that implement the [structured ops](		Linalg defines two payload carrying operations that implement the [structured ops](
https://docs.google.com/presentation/d/1P-j1GrH6Q5gLBjao0afQ-GfvcAeF-QU4GXXeSy0eJ9I/edit#slide=id.p		https://docs.google.com/presentation/d/1P-j1GrH6Q5gLBjao0afQ-GfvcAeF-QU4GXXeSy0eJ9I/edit#slide=id.p
) abstraction on tensors and buffers. This is architected as two generic operations		) abstraction on tensors and buffers. This is architected as two generic operations
`linalg.generic` (resp. `linalg.indexed_generic`) that can express custom		`linalg.generic` (resp. `linalg.indexed_generic`) that can express custom
operations with index-free semantics (resp. indexing semantics).		operations with index-free semantics (resp. indexing semantics).
The properties of these generic ops are the result of applying the		The properties of these generic ops are the result of applying the
▲ Show 20 Lines • Show All 406 Lines • ▼ Show 20 Lines
automatically while still maintaining the [core guiding		automatically while still maintaining the [core guiding
principles](#guiding_principles).		principles](#guiding_principles).

For the time being, we have settled on the combination of these properties		For the time being, we have settled on the combination of these properties
because of empirical evidence building and working on multiple high-level		because of empirical evidence building and working on multiple high-level
compilers. As we lay those down and engage more with the community, we expect		compilers. As we lay those down and engage more with the community, we expect
multiple rounds of discussions and design changes to the original architecture.		multiple rounds of discussions and design changes to the original architecture.

		### Tensors and Buffers: Conventions and Limitations <a name="tensors_and_buffers"></a>

		Tensors are immutable SSA values, buffers are mutable regions of memory subject
		to side-effects and aliasing. As a consequence, output buffers are passed as
		operands whereas output tensors are new SSA values corresponding to op results.
		Inputs can be arbitrary tensors or buffers and are always passed as operands.

		The following convention is currently in-flight and is in the process of
		replacing other existing conventions. The following convention currently applies
		to "named" structured ops which are auto-generated by the linalg-ods tool.

		The convention adopted is as follows:

		1. A first block of `ins` op operands hold read-only inputs of ShapedType.
		2. An optional second block of `outs` op operands hold read-write output
		buffers of MemRefType.
		3. An optional third block of `init` operands hold initialization tensors of
		RankedTensorType. Such tensors can appear when the op performs a reduction
		and returns a tensor.

		Structured ops with fully parallel semantics, have empty `init`. They may either
		write in-place into `outs` buffers or return new tensors.
		mravishankarUnsubmitted Done Reply Inline Actions This seems to be more complex because the effort is being made to mix tensor and buffer semantics. Is it possible for the time being to just keep them completely separate (at least by convention). mravishankar: This seems to be more complex because the effort is being made to mix tensor and buffer…
		nicolasvasilacheAuthorUnsubmitted Done Reply Inline Actions The bigger underlying achievement allowed by this convention is that we have named ops that are automatically generated from the TC spec and work with either tensors and buffers. This was the key blocker to scaling these concepts and was introduced by the args_in / args_out that will be removed entirely in a followup commit. Making them work with mixed tensor / buffer does not brings real additional complexity: if you want buffer only you'd still use `ins/outs`; if you want tensors only you'd still use `ins/result` for pointwise and `ins/init/result` for reductions. nicolasvasilache: The bigger underlying achievement allowed by this convention is that we have named ops that are…

		Structured ops with reduction semantics and output tensor(s) however have
		additional restrictions:

		1. They can only return a single tensor for now.
		2. They cannot have any output buffer operand (i.e. `outs` is empty).
		3. They have exactly one `init` tensor of the same type as the unique output
		ftynseUnsubmitted Done Reply Inline Actions Does this support the change in elemental types? Otherwise it's not only the same shape, but the types must match completely. ftynse: Does this support the change in elemental types? Otherwise it's not only the same shape, but…
		mravishankarUnsubmitted Done Reply Inline Actions Since we are going this route, why not just add a new region to the named op that describes the computation to generate the `init` tensor. This region has the same semantics as a `linalg.generic/linalg.indexed_generic` op? mravishankar: Since we are going this route, why not just add a new region to the named op that describes the…
		mravishankarUnsubmitted Done Reply Inline Actions To get a little more specific, we can do <linalg named-op> (ins ...) (outs ...) init (%init : tensor<...f32>) { ^bb0(%arg0 : f32) : linalg.yield %arg0 : f32 } and that could generalize to <linalg named-op> (ins ...) (outs ...) init (%a : tensor<...f32>, %b : tensor<...f32>) { ^bb0(%arg0 : f32, %arg1 : f32) : %0 = std.addf %arg0, %arg1 : f32 linalg.yield %0 : f32 } where the initialization is done via a computation. mravishankar: To get a little more specific, we can do ``` <linalg named-op> (ins ...) (outs ...) init…
		nicolasvasilacheAuthorUnsubmitted Done Reply Inline Actions Looks nice! I still view it as future improvement though so I'd keep it for a separate PR once the generic ops are up to speed too :) The args_in / args_out has to be deprecated with fire first. nicolasvasilache: Looks nice! I still view it as future improvement though so I'd keep it for a separate PR once…
		nicolasvasilacheAuthorUnsubmitted Done Reply Inline Actions That's significantly more work and out of the scope of this CL. For instance, I do not know yet how to make it work with the TC lang. Another tricky point is what would the init region look like and expand in higher-D tensors (e.g. imagine a 3-D 2x2x2 tensor, how does a region encode a 2-D tensor broadcast along some dim)? It would seem the `indexed_generic` would be required. Still it is an orthogonal improvement that can we can table into a separate discussion once the existing dead-end state is improved. nicolasvasilache: That's significantly more work and out of the scope of this CL. For instance, I do not know…
		mravishankarUnsubmitted Not Done Reply Inline Actions Thanks! Thinking a bit more about this, I think its fine to go with juts an `init` tensor you have. I was only trying to handle the case where the initialization is done by a scalar value, but you could use a `fill` operaiton for this. mravishankar: Thanks! Thinking a bit more about this, I think its fine to go with juts an `init` tensor you…
		tensor. Such an `init` tensor does not have an explicit associate indexing
		map. Instead the map of the result tensor is used to signify that the `init`
		and the `result` are "tied".

		Points 1. and 2. keep complexity of the representation in check by allowing only
		a single result tensor, when reductions are present.

		Point 3. is related to the fact that SSA values cannot represent in-place
		updates. Instead, linalg adopts a similar convention that exists in e.g.
		`vector.outerproduct`: the value that is reduced into is passed as an explicit
		argument and a new result of the same shape is produced.

		It is expected buffer allocation will fold this last input onto the result in a
		single output buffer argument, which is why the same indexing map is required:
		the last input operand is said to be "tied" to the result.

		Alternative, more complex representations, would allow for:

		1. Multiple results and `init` tensors in arbitrary orders, which could be
		captured by an extra ArrayAttr of position pairs.
		2. Relaxing the conditions on the indexing map equalities on the each pair and
		e.g. allow implicit broadcasts of the input.

		These representations are deemed unnecessarily complex for now and are left for
		future discussion.

		As an illustration, the syntax for a `linalg.matmul` writing into a buffer is:

		```
		linalg.matmul ins(%a, %b : memref<?x?xf32>, tensor<?x?xf32>)
		mravishankarUnsubmitted Done Reply Inline Actions This seems to deviate from the existing form of ops in MLIR, i.e. linalg.matmul ins(%a : memref<?x?xf32>, %b : tensor<?x?xf32>)... mravishankar: This seems to deviate from the existing form of ops in MLIR, i.e. ``` linalg.matmul ins(%a…
		nicolasvasilacheAuthorUnsubmitted Done Reply Inline Actions Yes, this is unfortunate and seems to be a byproduct of using the declarative assembly format. I do not know how to make it generate interleaved types and uses. If/when it is available we should go to that. As an illustration, note that ReturnOp uses the declarative assembly but FuncOp has a custom handwritten parser. nicolasvasilache: Yes, this is unfortunate and seems to be a byproduct of using the declarative assembly format.
		outs(%c : memref<?x?xf32>)
		```

		, whereas the syntax for a `linalg.matmul` returning a new tensor is:

		```
		%d = linalg.matmul ins(%a, %b : tensor<?x?xf32>, memref<?x?xf32>)
		init(%c : tensor<?x?xf32>)
		ftynseUnsubmitted Done Reply Inline Actions This should be a tensor. If we expect it to be strictly the same type, we can also omit the type here. ftynse: This should be a tensor. If we expect it to be strictly the same type, we can also omit the…
		nicolasvasilacheAuthorUnsubmitted Done Reply Inline Actions let's remove later when we have some experience using it if it feels too redundant. nicolasvasilache: let's remove later when we have some experience using it if it feels too redundant.
		-> tensor<?x?xf32>
		```

### Data Representation: Views<a name="views"></a>		### Data Representation: Views<a name="views"></a>
The current implementation uses the [Strided MemRef (a.k.a View)](		The current implementation uses the [Strided MemRef (a.k.a View)](
https://groups.google.com/a/tensorflow.org/forum/#!topic/mlir/MaL8m2nXuio)		https://groups.google.com/a/tensorflow.org/forum/#!topic/mlir/MaL8m2nXuio)
abstraction. The name View is used interchangeably in `linalg` to signify		abstraction. The name View is used interchangeably in `linalg` to signify
Strided MemRef.		Strided MemRef.
In the future we expect to use other structured data types and		In the future we expect to use other structured data types and
support ragged, mixed-sparse and other types. We expect to draw on the		support ragged, mixed-sparse and other types. We expect to draw on the
experience from existing LIFT abstractions for		experience from existing LIFT abstractions for
▲ Show 20 Lines • Show All 91 Lines • ▼ Show 20 Lines	def batchmatmul(A: f32(Batch, M, K), B: f32(K, N)) -> (C: f32(Batch, M, N)) {
C(b, m, n) = std_addf<k>(std_mulf(A(b, m, k), B(k, n)));		C(b, m, n) = std_addf<k>(std_mulf(A(b, m, k), B(k, n)));
}		}
```		```

When `mlir-linalg-ods-gen -gen-ods-decl=1` is called, the following ODS is		When `mlir-linalg-ods-gen -gen-ods-decl=1` is called, the following ODS is
produced:		produced:

```		```
def batchmatmulOp : LinalgNamedStructured_Op<"batchmatmul", [		def batchmatmulOp : LinalgNamedStructured_Op<"batchmatmul", [
NInputs<2>,		NInputs<2>,
NOutputs<1>,		NOutputs<1>,
NamedStructuredOpTraits]> { ... }		NamedStructuredOpTrait]> { ... }
```		```

When `mlir-linalg-ods-gen -gen-impl=1` is called, the following C++ is produced:		When `mlir-linalg-ods-gen -gen-impl=1` is called, the following C++ is produced:

```		```
llvm::Optional<SmallVector<StringRef, 8>> batchmatmul::referenceIterators() {		llvm::Optional<SmallVector<StringRef, 8>> batchmatmul::referenceIterators() {
return SmallVector<StringRef, 8>{		return SmallVector<StringRef, 8>{
getParallelIteratorTypeName(),		getParallelIteratorTypeName(),
▲ Show 20 Lines • Show All 45 Lines • Show Last 20 Lines

mlir/include/mlir/Dialect/Linalg/IR/LinalgOps.h

	Show First 20 Lines • Show All 79 Lines • ▼ Show 20 Lines
	/// symbol-less identity map of `rank`.			/// symbol-less identity map of `rank`.
	AffineMap extractOrIdentityMap(Optional<AffineMap> maybeMap, unsigned rank,			AffineMap extractOrIdentityMap(Optional<AffineMap> maybeMap, unsigned rank,
	MLIRContext *context);			MLIRContext *context);

	/// Return the vector that is the concatenation of `a` and `b`.			/// Return the vector that is the concatenation of `a` and `b`.
	SmallVector<AffineExpr, 4> concat(ArrayRef<AffineExpr> a,			SmallVector<AffineExpr, 4> concat(ArrayRef<AffineExpr> a,
	ArrayRef<AffineExpr> b);			ArrayRef<AffineExpr> b);

				/// Return the dims that are `iteratorTypeName` loops in the LinalgOp `op`.
				/// Assumes `op` is a LinalgOp.
				void getDimsOfType(Operation *op, StringRef iteratorTypeName,
				SmallVectorImpl<AffineExpr> &res);

	} // namespace linalg			} // namespace linalg
	} // namespace mlir			} // namespace mlir

	#include "mlir/Dialect/Linalg/IR/LinalgStructuredOpsInterfaces.h.inc"			#include "mlir/Dialect/Linalg/IR/LinalgStructuredOpsInterfaces.h.inc"
				Lint: Pre-merge checks Inline Actions clang-tidy: error: 'mlir/Dialect/Linalg/IR/LinalgStructuredOpsInterfaces.h.inc' file not found [clang-diagnostic-error] not useful clang-tidy: error: 'mlir/Dialect/Linalg/IR/LinalgStructuredOpsInterfaces.h.inc' file not found [clang-diagnostic-error] not useful clang-tidy: error: 'mlir/Dialect/Linalg/IR/LinalgStructuredOpsInterfaces.h.inc' file not found [clang-diagnostic-error] not useful Lint: Pre-merge checks: clang-tidy: error: 'mlir/Dialect/Linalg/IR/LinalgStructuredOpsInterfaces.h.inc' file not found…

	#define GET_OP_CLASSES			#define GET_OP_CLASSES
	#include "mlir/Dialect/Linalg/IR/LinalgOps.h.inc"			#include "mlir/Dialect/Linalg/IR/LinalgOps.h.inc"

	#define GET_OP_CLASSES			#define GET_OP_CLASSES
	#include "mlir/Dialect/Linalg/IR/LinalgStructuredOps.h.inc"			#include "mlir/Dialect/Linalg/IR/LinalgStructuredOps.h.inc"


	#endif // MLIR_DIALECT_LINALG_LINALGOPS_H_			#endif // MLIR_DIALECT_LINALG_LINALGOPS_H_

mlir/include/mlir/Dialect/Linalg/IR/LinalgStructuredOps.td

Show All 26 Lines

// The Linalg `NOutputs` trait provides the API for ops that are known		// The Linalg `NOutputs` trait provides the API for ops that are known
// to have a specified number of outputs, all passed as operands.		// to have a specified number of outputs, all passed as operands.
// See Linalg/LinalgTraits.h for implementation details an usage.		// See Linalg/LinalgTraits.h for implementation details an usage.
class NOutputs<int args_out> :		class NOutputs<int args_out> :
NativeOpTrait<"linalg::NOutputs<" # !cast<string>(args_out) # ">::Impl"> {}		NativeOpTrait<"linalg::NOutputs<" # !cast<string>(args_out) # ">::Impl"> {}

def StructuredOpTraits : NativeOpTrait<"linalg::StructuredOpTraits">;		def StructuredOpTraits : NativeOpTrait<"linalg::StructuredOpTraits">;
		def NamedStructuredOpTrait : NativeOpTrait<"linalg::NamedStructuredOpTrait">;

// Base Tablegen class for Linalg ops.		// Base Tablegen class for Linalg ops.
// Linalg ops that correspond to library calls operate on linalg::View as their		// Linalg ops that correspond to library calls operate on linalg::View as their
// first operands. These may be optionally followed by non-view operands		// first operands. These may be optionally followed by non-view operands
// depending on the specific Linalg op.		// depending on the specific Linalg op.
class LinalgStructuredBase_Op<string mnemonic, list<OpTrait> props>		class LinalgStructuredBase_Op<string mnemonic, list<OpTrait> props>
: Op<Linalg_Dialect, mnemonic,		: Op<Linalg_Dialect, mnemonic,
!listconcat(props, [StructuredOpTraits, LinalgStructuredInterface])> {		!listconcat(props, [StructuredOpTraits, LinalgStructuredInterface])> {
▲ Show 20 Lines • Show All 750 Lines • ▼ Show 20 Lines	def IndexedGenericOp : GenericOpBase<"indexed_generic"> {
let hasFolder = 1;		let hasFolder = 1;
let hasCanonicalizer = 1;		let hasCanonicalizer = 1;
}		}

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// Named Linalg ops, implemented as a declarative configurations of generic ops.		// Named Linalg ops, implemented as a declarative configurations of generic ops.
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

class LinalgNamedStructured_Op<string mnemonic, list<OpTrait> props>		// This file is auto-generated from a TC def specification.
: LinalgStructuredBase_Op<mnemonic, props> {
string spec = ?;
// We cannot use an assemblyFormat atm because we need to hook in a custom-
// built implicit region from a static OpClass method.
// TODO: Revisit in the future if/when appropriate.
// let assemblyFormat = "`(` operands `)` attr-dict `:` "
// "functional-type(operands, results)";

// The parser needs to specialize on the OpType so it has to be auto-generated
// in the linalg-ods tool.
let printer = [{ return ::printNamedStructuredOp(p, *this); }];
let verifier = [{ return ::verifyNamedStructuredOp(*this); }];
let hasFolder = 1;
let hasCanonicalizer = 1;
}

// This file is auto-generated from a tc specification.
include "mlir/Dialect/Linalg/IR/LinalgNamedStructuredOps.td"		include "mlir/Dialect/Linalg/IR/LinalgNamedStructuredOps.td"

#endif // LINALG_STRUCTURED_OPS		#endif // LINALG_STRUCTURED_OPS

mlir/include/mlir/Dialect/Linalg/IR/LinalgStructuredOpsInterface.td

Show All 19 Lines
def LinalgStructuredInterface : OpInterface<"LinalgOp"> {		def LinalgStructuredInterface : OpInterface<"LinalgOp"> {
let cppNamespace = "::mlir::linalg";		let cppNamespace = "::mlir::linalg";
let methods = [		let methods = [
//===------------------------------------------------------------------===//		//===------------------------------------------------------------------===//
// Loop types handling.		// Loop types handling.
//===------------------------------------------------------------------===//		//===------------------------------------------------------------------===//
InterfaceMethod<		InterfaceMethod<
/desc=/[{		/desc=/[{
Return the number of parallel loops within the current operation.		Return the number of parallel loops.
}],		}],
/retTy=/"unsigned",		/retTy=/"unsigned",
/methodName=/"getNumParallelLoops",		/methodName=/"getNumParallelLoops",
		ftynseUnsubmitted Done Reply Inline Actions Can this rather be in `extraClassDeclaraiton` or, even better, a static function in the C++ implementation file? It does not look like this can ever have a non-default implementation so why pay the cost of making it "virtual"? ftynse: Can this rather be in `extraClassDeclaraiton` or, even better, a static function in the C++…
/args=/(ins),		/args=/(ins),
/methodBody=/"",		/methodBody=/"",
/defaultImplementation=/[{		/defaultImplementation=/[{
return getNumIterators(getParallelIteratorTypeName(),		return getNumIterators(getParallelIteratorTypeName(),
$_op.iterator_types());		$_op.iterator_types());
}]		}]
>,		>,
InterfaceMethod<		InterfaceMethod<
/desc=/[{		/desc=/[{
Return the number of reduction loops within the current operation.		Return the dims that are parallel loops.
		}],
		/retTy=/"void",
		/methodName=/"getParallelDims",
		/args=/(ins "SmallVectorImpl<AffineExpr> &":$res),
		/methodBody=/"",
		/defaultImplementation=/[{
		return getDimsOfType($_op, getParallelIteratorTypeName(), res);
		}]
		>,
		InterfaceMethod<
		/desc=/[{
		Return the number of reduction loops.
}],		}],
/retTy=/"unsigned",		/retTy=/"unsigned",
/methodName=/"getNumReductionLoops",		/methodName=/"getNumReductionLoops",
/args=/(ins),		/args=/(ins),
/methodBody=/"",		/methodBody=/"",
/defaultImplementation=/[{		/defaultImplementation=/[{
return getNumIterators(getReductionIteratorTypeName(),		return getNumIterators(getReductionIteratorTypeName(),
$_op.iterator_types());		$_op.iterator_types());
}]		}]
>,		>,
InterfaceMethod<		InterfaceMethod<
/desc=/[{		/desc=/[{
Return the number of window loops within the current operation.		Return the dims that are reduction loops.
		}],
		/retTy=/"void",
		/methodName=/"getReductionDims",
		/args=/(ins "SmallVectorImpl<AffineExpr> &":$res),
		/methodBody=/"",
		/defaultImplementation=/[{
		return getDimsOfType($_op, getReductionIteratorTypeName(), res);
		}]
		>,
		InterfaceMethod<
		/desc=/[{
		Return the number of window loops.
}],		}],
/retTy=/"unsigned",		/retTy=/"unsigned",
/methodName=/"getNumWindowLoops",		/methodName=/"getNumWindowLoops",
/args=/(ins),		/args=/(ins),
/methodBody=/"",		/methodBody=/"",
/defaultImplementation=/[{		/defaultImplementation=/[{
return getNumIterators(getWindowIteratorTypeName(),		return getNumIterators(getWindowIteratorTypeName(),
$_op.iterator_types());		$_op.iterator_types());
}]		}]
>,		>,
InterfaceMethod<		InterfaceMethod<
/desc=/[{		/desc=/[{
		Return the dims that are window loops.
		}],
		/retTy=/"void",
		/methodName=/"getWindowDims",
		/args=/(ins "SmallVectorImpl<AffineExpr> &":$res),
		/methodBody=/"",
		/defaultImplementation=/[{
		return getDimsOfType($_op.getOperation(), getWindowIteratorTypeName(), res);
		}]
		>,
		InterfaceMethod<
		/desc=/[{
Return the total number of loops within the current operation.		Return the total number of loops within the current operation.
}],		}],
/retTy=/"unsigned",		/retTy=/"unsigned",
/methodName=/"getNumLoops",		/methodName=/"getNumLoops",
/args=/(ins),		/args=/(ins),
/methodBody=/"",		/methodBody=/"",
/defaultImplementation=/[{		/defaultImplementation=/[{
return getNumIterators($_op.iterator_types());		return getNumIterators($_op.iterator_types());
Show All 19 Lines	let methods = [
// These special methods must be defined by each op that wants to implement		// These special methods must be defined by each op that wants to implement
// the LinalgStructuredInterface. For now, this is either:		// the LinalgStructuredInterface. For now, this is either:
// - inherited statically by using the NInputs<unsigned> or		// - inherited statically by using the NInputs<unsigned> or
// NOutputs<unsigned> traits.		// NOutputs<unsigned> traits.
// - derived from args_in/args_out attributes (for linalg.generic and		// - derived from args_in/args_out attributes (for linalg.generic and
// linalg.indexed_generic ops).		// linalg.indexed_generic ops).
InterfaceMethod<		InterfaceMethod<
/desc=/[{		/desc=/[{
Return the number of inputs from the current operation.		Return the number of inputs.
}],		}],
/retTy=/"unsigned",		/retTy=/"unsigned",
/methodName=/"getNumInputs"		/methodName=/"getNumInputs"
>,		>,
InterfaceMethod<		InterfaceMethod<
/desc=/[{		/desc=/[{
Return the number of outputs from the current operation.		Return the number of outputs.
}],		}],
/retTy=/"unsigned",		/retTy=/"unsigned",
/methodName=/"getNumOutputs"		/methodName=/"getNumOutputs"
>,		>,
//===------------------------------------------------------------------===//		//===------------------------------------------------------------------===//
// Input arguments handling.		// Input arguments handling.
//===------------------------------------------------------------------===//		//===------------------------------------------------------------------===//
InterfaceMethod<		InterfaceMethod<
Show All 37 Lines	InterfaceMethod<
/args=/(ins "unsigned":$i),		/args=/(ins "unsigned":$i),
/methodBody=/"",		/methodBody=/"",
/defaultImplementation=/[{		/defaultImplementation=/[{
return getInput(i).getType().template cast<ShapedType>();		return getInput(i).getType().template cast<ShapedType>();
}]		}]
>,		>,
InterfaceMethod<		InterfaceMethod<
/desc=/[{		/desc=/[{
Return the input operands from the current operation.		Return the input operands.
}],		}],
/retTy=/"Operation::operand_range",		/retTy=/"Operation::operand_range",
/methodName=/"getInputs",		/methodName=/"getInputs",
/args=/(ins),		/args=/(ins),
/methodBody=/"",		/methodBody=/"",
/defaultImplementation=/[{		/defaultImplementation=/[{
auto range = this->getOperation()->getOperands();		auto range = this->getOperation()->getOperands();
return {range.begin(), range.begin() + $_op.getNumInputs()};		return {range.begin(), range.begin() + $_op.getNumInputs()};
Show All 10 Lines	InterfaceMethod<
/defaultImplementation=/[{		/defaultImplementation=/[{
SmallVector<RankedTensorType, 4> res;		SmallVector<RankedTensorType, 4> res;
for (Type type : getInputs().getTypes())		for (Type type : getInputs().getTypes())
if (auto t = type.template dyn_cast<RankedTensorType>())		if (auto t = type.template dyn_cast<RankedTensorType>())
res.push_back(t);		res.push_back(t);
return res;		return res;
}]		}]
>,		>,

//===------------------------------------------------------------------===//		//===------------------------------------------------------------------===//
// Output arguments handling.		// Output arguments handling.
//===------------------------------------------------------------------===//		//===------------------------------------------------------------------===//
InterfaceMethod<		InterfaceMethod<
/desc=/[{		/desc=/[{
Return the output buffer at the given index, asserts that this is a		Return the output buffer at the given index, asserts that this is a
buffer operand and not a tensor result.		buffer operand and not a tensor result.
The `i^th` output argument is an operand (resp. a return value) iff it		The `i^th` output argument is an operand (resp. a return value) iff it
▲ Show 20 Lines • Show All 63 Lines • ▼ Show 20 Lines	InterfaceMethod<
/defaultImplementation=/[{		/defaultImplementation=/[{
SmallVector<RankedTensorType, 4> res;		SmallVector<RankedTensorType, 4> res;
for (Type type : this->getOperation()->getResults().getTypes())		for (Type type : this->getOperation()->getResults().getTypes())
res.push_back(type.template cast<RankedTensorType>());		res.push_back(type.template cast<RankedTensorType>());
return res;		return res;
}]>,		}]>,
InterfaceMethod<		InterfaceMethod<
/desc=/[{		/desc=/[{
Return the output buffers (operands) from the current operation.		Return the output buffers (operands).
}],		}],
/retTy=/"Operation::operand_range",		/retTy=/"Operation::operand_range",
/methodName=/"getOutputBuffers",		/methodName=/"getOutputBuffers",
/args=/(ins),		/args=/(ins),
/methodBody=/"",		/methodBody=/"",
/defaultImplementation=/[{		/defaultImplementation=/[{
auto range = this->getOperation()->getOperands();		auto range = this->getOperation()->getOperands();
return {range.begin() + $_op.getNumInputs(),		return {range.begin() + $_op.getNumInputs(),
▲ Show 20 Lines • Show All 70 Lines • ▼ Show 20 Lines	InterfaceMethod<
/methodName=/"getShapedType",		/methodName=/"getShapedType",
/args=/(ins "unsigned":$i),		/args=/(ins "unsigned":$i),
/methodBody=/"",		/methodBody=/"",
/defaultImplementation=/[{		/defaultImplementation=/[{
if (i < $_op.getNumInputs())		if (i < $_op.getNumInputs())
return getInputShapedType(i);		return getInputShapedType(i);
if (i < getNumInputsAndOutputBuffers())		if (i < getNumInputsAndOutputBuffers())
return getOutputBufferType(i - $_op.getNumInputs());		return getOutputBufferType(i - $_op.getNumInputs());
return getOutputTensorTypes()[i - getNumInputsAndOutputBuffers()];		return this->getOperation()->getResult(
		i - getNumInputsAndOutputBuffers()).
		getType().template cast<ShapedType>();
}]>,		}]>,
InterfaceMethod<		InterfaceMethod<
/desc=/[{		/desc=/[{
Return the shaped types for all the inputs and outputs		Return the shaped types for all the inputs and outputs
}],		}],
/retTy=/"SmallVector<ShapedType, 4>",		/retTy=/"SmallVector<ShapedType, 4>",
/methodName=/"getInputOutputShapedTypes",		/methodName=/"getInputOutputShapedTypes",
/args=/(ins),		/args=/(ins),
Show All 37 Lines	InterfaceMethod<
/desc=/[{		/desc=/[{
Return the indexing maps within the current operation.		Return the indexing maps within the current operation.
}],		}],
/retTy=/"SmallVector<AffineMap, 4>",		/retTy=/"SmallVector<AffineMap, 4>",
/methodName=/"getIndexingMaps",		/methodName=/"getIndexingMaps",
/args=/(ins),		/args=/(ins),
/methodBody=/"",		/methodBody=/"",
/defaultImplementation=/[{		/defaultImplementation=/[{
return llvm::to_vector<4>(		return llvm::to_vector<4>($_op.indexing_maps().template getAsValueRange<AffineMapAttr>());
llvm::map_range($_op.indexing_maps(),
[](Attribute attr) -> AffineMap {
return attr.cast<AffineMapAttr>().getValue();
}));
}]		}]
>,		>,
InterfaceMethod<		InterfaceMethod<
/desc=/[{		/desc=/[{
Return the input or output indexing map at index `i`.		Return the input or output indexing map at index `i`.
}],		}],
/retTy=/"AffineMap",		/retTy=/"AffineMap",
/methodName=/"getIndexingMap",		/methodName=/"getIndexingMap",
/args=/(ins "unsigned":$i),		/args=/(ins "unsigned":$i),
/methodBody=/"",		/methodBody=/"",
/defaultImplementation=/[{		/defaultImplementation=/[{
assert(i < getNumInputsAndOutputs());		assert(i < getNumInputsAndOutputs());
return $_op.indexing_maps()		return getIndexingMaps()[i];
.getValue()[i]
.template cast<AffineMapAttr>()
.getValue();
}]		}]
>,		>,
InterfaceMethod<		InterfaceMethod<
/desc=/[{		/desc=/[{
Return the input indexing map at index `i`.		Return the input indexing map at index `i`.
}],		}],
/retTy=/"AffineMap",		/retTy=/"AffineMap",
/methodName=/"getInputIndexingMap",		/methodName=/"getInputIndexingMap",
/args=/(ins "unsigned":$i),		/args=/(ins "unsigned":$i),
/methodBody=/"",		/methodBody=/"",
/defaultImplementation=/[{		/defaultImplementation=/[{
assert(i < $_op.getNumInputs());		assert(i < $_op.getNumInputs());
return $_op.indexing_maps()		return getIndexingMaps()[i];
.getValue()[i]
.template cast<AffineMapAttr>()
.getValue();
}]		}]
>,		>,
InterfaceMethod<		InterfaceMethod<
/desc=/[{		/desc=/[{
Return the output indexing map at index `i`.		Return the output indexing map at index `i`.
}],		}],
/retTy=/"AffineMap",		/retTy=/"AffineMap",
/methodName=/"getOutputIndexingMap",		/methodName=/"getOutputIndexingMap",
/args=/(ins "unsigned":$i),		/args=/(ins "unsigned":$i),
/methodBody=/"",		/methodBody=/"",
/defaultImplementation=/[{		/defaultImplementation=/[{
assert(i < $_op.getNumOutputs());		assert(i < $_op.getNumOutputs());
return $_op.indexing_maps()		return getIndexingMaps()[i + $_op.getNumInputs()];
.getValue()[i + $_op.getNumInputs()]
.template cast<AffineMapAttr>()
.getValue();
}]		}]
>,		>,
InterfaceMethod<		InterfaceMethod<
/desc=/[{		/desc=/[{
Return whether the op has only MemRef input and outputs.		Return whether the op has only MemRef input and outputs.
}],		}],
/retTy=/"bool",		/retTy=/"bool",
/methodName=/"hasBufferSemantics",		/methodName=/"hasBufferSemantics",
▲ Show 20 Lines • Show All 64 Lines • Show Last 20 Lines

mlir/include/mlir/Dialect/Linalg/IR/LinalgTraits.h

Show First 20 Lines • Show All 65 Lines • ▼ Show 20 Lines	static LogicalResult verifyTrait(Operation *op) {
if (failed(OpTrait::impl::verifyAtLeastNOperands(op, nOperands)))		if (failed(OpTrait::impl::verifyAtLeastNOperands(op, nOperands)))
return failure();		return failure();
if (op->getNumResults() > concreteOp.getNumOutputs())		if (op->getNumResults() > concreteOp.getNumOutputs())
return op->emitError("unexpected #results > #outputs");		return op->emitError("unexpected #results > #outputs");
return success();		return success();
}		}
};		};

		/// This class provides a verifier for structured ops that are known to operate
		/// on buffers or tensors and that support `ins`, `outs` and `init` arguments.
		ftynseUnsubmitted Done Reply Inline Actions `ins`, `outs` and `init` ? ftynse: `ins`, `outs` and `init` ?
		/// This trait must be used in conjunction with an op definition or a trait that
		/// provides the methods `getNumInputs` and `getNumOutputs`.
		///
		/// Use as a trait as follows:
		///
		/// class MatmulOp : public Op<MatmulOp, OpTrait::NamedStructuredOpTrait> {
		///
		template <typename ConcreteType>
		class NamedStructuredOpTrait
		ftynseUnsubmitted Done Reply Inline Actions Nit: traits seem to be using singular in their class names, i.e. `NamedStructuredOpTrait` ftynse: Nit: traits seem to be using singular in their class names, i.e. `NamedStructuredOpTrait`
		: public OpTrait::TraitBase<ConcreteType, NamedStructuredOpTrait> {
		public:
		unsigned getNumInputs() {
		return cast<ConcreteType>(this->getOperation()).inputs().size();
		}
		unsigned getNumOutputs() {
		ConcreteType concreteOp = cast<ConcreteType>(this->getOperation());
		return concreteOp.output_buffers().size() +
		concreteOp.output_tensors().size();
		}
		static LogicalResult verifyTrait(Operation *op) {
		ConcreteType concreteOp = cast<ConcreteType>(op);
		unsigned nInputAndBufferOperands =
		concreteOp.getNumInputsAndOutputBuffers();
		if (failed(
		OpTrait::impl::verifyAtLeastNOperands(op, nInputAndBufferOperands)))
		return failure();

		SmallVector<AffineExpr, 4> redDims;
		concreteOp.getReductionDims(redDims);
		// If no result and no reduction, only check there is no init tensor and we
		// are done.
		if (redDims.empty() \|\| op->getNumResults() == 0) {
		if (!concreteOp.init_tensors().empty())
		return op->emitError("expected empty `init` when op has no "
		ftynseUnsubmitted Done Reply Inline Actions Nit: `init_tensors` does not appear in the IR, did you mean `init` ? ftynse: Nit: `init_tensors` does not appear in the IR, did you mean `init` ?
		"results or no reduction dims");
		return success();
		}

		// Only a single tensor result supported atm.
		if (op->getNumResults() != 1)
		return op->emitError(
		"expected single tensor result when reduction present");

		if (concreteOp.init_tensors().size() != op->getNumResults())
		return op->emitError(
		"expected #init tensors to match #results when reduction present");

		for (unsigned idx = 0, e = op->getNumResults(); idx < e; ++idx)
		if (concreteOp.init_tensors()[idx].getType() != op->getResultTypes()[idx])
		return op->emitError("expected init tensor #")
		<< idx << " of the same type as result #" << idx;

		// Output tensor indexing map may not depend on reduction index.
		// TODO: this is not yet tested. Add a test when linalg.generic switches to
		// this representation.
		for (unsigned idx = 0, e = concreteOp.getNumOutputs(); idx < e; ++idx) {
		AffineMap outputMap = concreteOp.getOutputIndexingMap(idx);
		for (auto expr : outputMap.getResults()) {
		for (auto dim : redDims) {
		unsigned pos = dim.cast<AffineDimExpr>().getPosition();
		if (expr.isFunctionOfDim(pos))
		return op->emitError(
		"unexpected single tensor output indexing map ")
		<< "is function of reduction dim @" << pos;
		}
		}
		}

		return success();
		}
		};

} // namespace linalg		} // namespace linalg
} // namespace OpTrait		} // namespace OpTrait
} // namespace mlir		} // namespace mlir

#endif // MLIR_DIALECT_LINALG_LINALGTRAITS_H_		#endif // MLIR_DIALECT_LINALG_LINALGTRAITS_H_

mlir/include/mlir/Dialect/Shape/IR/ShapeBase.td

	Show All 9 Lines
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	#ifndef SHAPE_BASE_TD			#ifndef SHAPE_BASE_TD
	#define SHAPE_BASE_TD			#define SHAPE_BASE_TD

	include "mlir/IR/OpBase.td"			include "mlir/IR/OpBase.td"

	def AnyShaped: ShapedContainerType<[AnyType], IsShapedTypePred, "shaped">;

	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//
	// Shape Inference dialect definitions			// Shape Inference dialect definitions
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	def ShapeDialect : Dialect {			def ShapeDialect : Dialect {
	let name = "shape";			let name = "shape";

	let summary = "Types and operations for shape dialect";			let summary = "Types and operations for shape dialect";
	▲ Show 20 Lines • Show All 130 Lines • Show Last 20 Lines

mlir/include/mlir/IR/OpBase.td

Show First 20 Lines • Show All 565 Lines • ▼ Show 20 Lines	class VectorOfLengthAndType<list<int> allowedLengths,
list<Type> allowedTypes> : Type<		list<Type> allowedTypes> : Type<
And<[VectorOf<allowedTypes>.predicate,		And<[VectorOf<allowedTypes>.predicate,
VectorOfLength<allowedLengths>.predicate]>,		VectorOfLength<allowedLengths>.predicate]>,
VectorOf<allowedTypes>.description #		VectorOf<allowedTypes>.description #
VectorOfLength<allowedLengths>.description>;		VectorOfLength<allowedLengths>.description>;

def AnyVector : VectorOf<[AnyType]>;		def AnyVector : VectorOf<[AnyType]>;

		// Shaped types.

		def AnyShaped: ShapedContainerType<[AnyType], IsShapedTypePred, "shaped">;

// Tensor types.		// Tensor types.

// Any tensor type whose element type is from the given `allowedTypes` list		// Any tensor type whose element type is from the given `allowedTypes` list
class TensorOf<list<Type> allowedTypes> :		class TensorOf<list<Type> allowedTypes> :
ShapedContainerType<allowedTypes, IsTensorTypePred, "tensor">;		ShapedContainerType<allowedTypes, IsTensorTypePred, "tensor">;

def AnyTensor : TensorOf<[AnyType]>;		def AnyTensor : TensorOf<[AnyType]>;

▲ Show 20 Lines • Show All 1,769 Lines • Show Last 20 Lines

mlir/integration_test/Dialect/Linalg/CPU/test-conv-1d-call.mlir

	Show All 24 Lines
	// Creates and returns a 1-D buffer of size %s1 filled with the value %f			// Creates and returns a 1-D buffer of size %s1 filled with the value %f
	func @alloc_1d_filled_f32(%s1 : index, %f : f32) -> memref<?xf32> {			func @alloc_1d_filled_f32(%s1 : index, %f : f32) -> memref<?xf32> {
	%buf = alloc(%s1) : memref<?xf32>			%buf = alloc(%s1) : memref<?xf32>
	linalg.fill(%buf, %f) : memref<?xf32>, f32			linalg.fill(%buf, %f) : memref<?xf32>, f32
	return %buf : memref<?xf32>			return %buf : memref<?xf32>
	}			}

	func @conv_1d(%arg0: memref<?xf32>, %arg1: memref<?xf32>, %arg2: memref<?xf32>) {			func @conv_1d(%arg0: memref<?xf32>, %arg1: memref<?xf32>, %arg2: memref<?xf32>) {
	linalg.conv_1d %arg0, %arg1, %arg2 : (memref<?xf32>, memref<?xf32>, memref<?xf32>)			linalg.conv_1d ins (%arg0, %arg1: memref<?xf32>, memref<?xf32>)
				outs (%arg2: memref<?xf32>)
	return			return
	}			}

	func @main() {			func @main() {
	%c3 = constant 3 : index			%c3 = constant 3 : index
	%c6 = constant 6 : index			%c6 = constant 6 : index
	%c8 = constant 8 : index			%c8 = constant 8 : index
	%f10 = constant 10.00000e+00 : f32			%f10 = constant 10.00000e+00 : f32
	Show All 20 Lines

mlir/integration_test/Dialect/Linalg/CPU/test-conv-1d-ncw-call.mlir

	Show All 24 Lines
	// Creates and returns 3-D buffer of size (%s1, %s2, %s3) filled with the value %f			// Creates and returns 3-D buffer of size (%s1, %s2, %s3) filled with the value %f
	func @alloc_3d_filled_f32(%s1 : index, %s2 : index, %s3 : index, %f : f32) -> memref<?x?x?xf32> {			func @alloc_3d_filled_f32(%s1 : index, %s2 : index, %s3 : index, %f : f32) -> memref<?x?x?xf32> {
	%buf = alloc(%s1, %s2, %s3) : memref<?x?x?xf32>			%buf = alloc(%s1, %s2, %s3) : memref<?x?x?xf32>
	linalg.fill(%buf, %f) : memref<?x?x?xf32>, f32			linalg.fill(%buf, %f) : memref<?x?x?xf32>, f32
	return %buf : memref<?x?x?xf32>			return %buf : memref<?x?x?xf32>
	}			}

	func @conv_1d_ncw(%arg0: memref<?x?x?xf32>, %arg1: memref<?x?x?xf32>, %arg2: memref<?x?x?xf32>) {			func @conv_1d_ncw(%arg0: memref<?x?x?xf32>, %arg1: memref<?x?x?xf32>, %arg2: memref<?x?x?xf32>) {
	linalg.conv_1d_ncw %arg0, %arg1, %arg2 : (memref<?x?x?xf32>, memref<?x?x?xf32>, memref<?x?x?xf32>)			linalg.conv_1d_ncw ins (%arg0, %arg1: memref<?x?x?xf32>, memref<?x?x?xf32>)
				outs (%arg2: memref<?x?x?xf32>)
				mravishankarUnsubmitted Done Reply Inline Actions super nit: align `ins` and `outs` ? mravishankar: super nit: align `ins` and `outs` ?
				nicolasvasilacheAuthorUnsubmitted Done Reply Inline Actions I aligned the parenthesis consistently throughout the examples. I'm somewhat reluctant to change all tests to align the `i`s and the `o`s now :) nicolasvasilache: I aligned the parenthesis consistently throughout the examples. I'm somewhat reluctant to…
	return			return
	}			}

	func @main() {			func @main() {
	%c0 = constant 0 : index			%c0 = constant 0 : index
	%c1 = constant 1 : index			%c1 = constant 1 : index
	%c3 = constant 3 : index			%c3 = constant 3 : index
	%c6 = constant 6 : index			%c6 = constant 6 : index
	Show All 26 Lines

mlir/integration_test/Dialect/Linalg/CPU/test-conv-1d-nwc-call.mlir

	Show All 24 Lines
	// Creates and returns 3-D buffer of size (%s1, %s2, %s3) filled with the value %f			// Creates and returns 3-D buffer of size (%s1, %s2, %s3) filled with the value %f
	func @alloc_3d_filled_f32(%s1 : index, %s2 : index, %s3 : index, %f : f32) -> memref<?x?x?xf32> {			func @alloc_3d_filled_f32(%s1 : index, %s2 : index, %s3 : index, %f : f32) -> memref<?x?x?xf32> {
	%buf = alloc(%s1, %s2, %s3) : memref<?x?x?xf32>			%buf = alloc(%s1, %s2, %s3) : memref<?x?x?xf32>
	linalg.fill(%buf, %f) : memref<?x?x?xf32>, f32			linalg.fill(%buf, %f) : memref<?x?x?xf32>, f32
	return %buf : memref<?x?x?xf32>			return %buf : memref<?x?x?xf32>
	}			}

	func @conv_1d_nwc(%arg0: memref<?x?x?xf32>, %arg1: memref<?x?x?xf32>, %arg2: memref<?x?x?xf32>) {			func @conv_1d_nwc(%arg0: memref<?x?x?xf32>, %arg1: memref<?x?x?xf32>, %arg2: memref<?x?x?xf32>) {
	linalg.conv_1d_nwc %arg0, %arg1, %arg2 : (memref<?x?x?xf32>, memref<?x?x?xf32>, memref<?x?x?xf32>)			linalg.conv_1d_nwc ins (%arg0, %arg1: memref<?x?x?xf32>, memref<?x?x?xf32>)
				outs (%arg2: memref<?x?x?xf32>)
	return			return
	}			}

	func @main() {			func @main() {
	%c0 = constant 0 : index			%c0 = constant 0 : index
	%c1 = constant 1 : index			%c1 = constant 1 : index
	%c3 = constant 3 : index			%c3 = constant 3 : index
	%c6 = constant 6 : index			%c6 = constant 6 : index
	Show All 37 Lines

mlir/integration_test/Dialect/Linalg/CPU/test-conv-2d-call.mlir

	Show All 24 Lines
	// Creates and returns a 2-D buffer of size (%s1, %s2) filled with the value %f			// Creates and returns a 2-D buffer of size (%s1, %s2) filled with the value %f
	func @alloc_2d_filled_f32(%s1 : index, %s2 : index, %f : f32) -> memref<?x?xf32> {			func @alloc_2d_filled_f32(%s1 : index, %s2 : index, %f : f32) -> memref<?x?xf32> {
	%buf = alloc(%s1, %s2) : memref<?x?xf32>			%buf = alloc(%s1, %s2) : memref<?x?xf32>
	linalg.fill(%buf, %f) : memref<?x?xf32>, f32			linalg.fill(%buf, %f) : memref<?x?xf32>, f32
	return %buf : memref<?x?xf32>			return %buf : memref<?x?xf32>
	}			}

	func @conv_2d(%arg0: memref<?x?xf32>, %arg1: memref<?x?xf32>, %arg2: memref<?x?xf32>) {			func @conv_2d(%arg0: memref<?x?xf32>, %arg1: memref<?x?xf32>, %arg2: memref<?x?xf32>) {
	linalg.conv_2d %arg0, %arg1, %arg2 : (memref<?x?xf32>, memref<?x?xf32>, memref<?x?xf32>)			linalg.conv_2d ins (%arg0, %arg1: memref<?x?xf32>, memref<?x?xf32>)
				outs (%arg2: memref<?x?xf32>)
	return			return
	}			}

	func @main() {			func @main() {
	%c0 = constant 0 : index			%c0 = constant 0 : index
	%c1 = constant 1 : index			%c1 = constant 1 : index
	%c3 = constant 3 : index			%c3 = constant 3 : index
	%c6 = constant 6 : index			%c6 = constant 6 : index
	Show All 25 Lines

mlir/integration_test/Dialect/Linalg/CPU/test-conv-2d-nchw-call.mlir

	Show All 24 Lines
	// Creates and returns 4-D buffer of size (%s1, %s2, %s3, %s4) filled with the value %f			// Creates and returns 4-D buffer of size (%s1, %s2, %s3, %s4) filled with the value %f
	func @alloc_4d_filled_f32(%s1 : index, %s2 : index, %s3 : index, %s4 : index, %f : f32) -> memref<?x?x?x?xf32> {			func @alloc_4d_filled_f32(%s1 : index, %s2 : index, %s3 : index, %s4 : index, %f : f32) -> memref<?x?x?x?xf32> {
	%buf = alloc(%s1, %s2, %s3, %s4) : memref<?x?x?x?xf32>			%buf = alloc(%s1, %s2, %s3, %s4) : memref<?x?x?x?xf32>
	linalg.fill(%buf, %f) : memref<?x?x?x?xf32>, f32			linalg.fill(%buf, %f) : memref<?x?x?x?xf32>, f32
	return %buf : memref<?x?x?x?xf32>			return %buf : memref<?x?x?x?xf32>
	}			}

	func @conv_2d_nchw(%arg0: memref<?x?x?x?xf32>, %arg1: memref<?x?x?x?xf32>, %arg2: memref<?x?x?x?xf32>) {			func @conv_2d_nchw(%arg0: memref<?x?x?x?xf32>, %arg1: memref<?x?x?x?xf32>, %arg2: memref<?x?x?x?xf32>) {
	linalg.conv_2d_nchw %arg0, %arg1, %arg2 : (memref<?x?x?x?xf32>, memref<?x?x?x?xf32>, memref<?x?x?x?xf32>)			linalg.conv_2d_nchw ins (%arg0, %arg1: memref<?x?x?x?xf32>, memref<?x?x?x?xf32>)
				outs (%arg2: memref<?x?x?x?xf32>)
	return			return
	}			}

	func @main() {			func @main() {
	%c0 = constant 0 : index			%c0 = constant 0 : index
	%c1 = constant 1 : index			%c1 = constant 1 : index
	%c3 = constant 3 : index			%c3 = constant 3 : index
	%c6 = constant 6 : index			%c6 = constant 6 : index
	Show All 39 Lines

mlir/integration_test/Dialect/Linalg/CPU/test-conv-2d-nhwc-call.mlir

	Show All 24 Lines
	// Creates and returns 4-D buffer of size (%s1, %s2, %s3, %s4) filled with the value %f			// Creates and returns 4-D buffer of size (%s1, %s2, %s3, %s4) filled with the value %f
	func @alloc_4d_filled_f32(%s1 : index, %s2 : index, %s3 : index, %s4 : index, %f : f32) -> memref<?x?x?x?xf32> {			func @alloc_4d_filled_f32(%s1 : index, %s2 : index, %s3 : index, %s4 : index, %f : f32) -> memref<?x?x?x?xf32> {
	%buf = alloc(%s1, %s2, %s3, %s4) : memref<?x?x?x?xf32>			%buf = alloc(%s1, %s2, %s3, %s4) : memref<?x?x?x?xf32>
	linalg.fill(%buf, %f) : memref<?x?x?x?xf32>, f32			linalg.fill(%buf, %f) : memref<?x?x?x?xf32>, f32
	return %buf : memref<?x?x?x?xf32>			return %buf : memref<?x?x?x?xf32>
	}			}

	func @conv_2d_nhwc(%arg0: memref<?x?x?x?xf32>, %arg1: memref<?x?x?x?xf32>, %arg2: memref<?x?x?x?xf32>) {			func @conv_2d_nhwc(%arg0: memref<?x?x?x?xf32>, %arg1: memref<?x?x?x?xf32>, %arg2: memref<?x?x?x?xf32>) {
	linalg.conv_2d_nhwc %arg0, %arg1, %arg2 : (memref<?x?x?x?xf32>, memref<?x?x?x?xf32>, memref<?x?x?x?xf32>)			linalg.conv_2d_nhwc ins (%arg0, %arg1: memref<?x?x?x?xf32>, memref<?x?x?x?xf32>)
				outs (%arg2: memref<?x?x?x?xf32>)
	return			return
	}			}

	func @main() {			func @main() {
	%c0 = constant 0 : index			%c0 = constant 0 : index
	%c1 = constant 1 : index			%c1 = constant 1 : index
	%c3 = constant 3 : index			%c3 = constant 3 : index
	%c6 = constant 6 : index			%c6 = constant 6 : index
	▲ Show 20 Lines • Show All 85 Lines • Show Last 20 Lines

mlir/integration_test/Dialect/Linalg/CPU/test-conv-3d-call.mlir

	Show All 24 Lines
	// Creates and returns 3-D buffer of size (%s1, %s2, %s3) filled with the value %f			// Creates and returns 3-D buffer of size (%s1, %s2, %s3) filled with the value %f
	func @alloc_3d_filled_f32(%s1 : index, %s2 : index, %s3 : index, %f : f32) -> memref<?x?x?xf32> {			func @alloc_3d_filled_f32(%s1 : index, %s2 : index, %s3 : index, %f : f32) -> memref<?x?x?xf32> {
	%buf = alloc(%s1, %s2, %s3) : memref<?x?x?xf32>			%buf = alloc(%s1, %s2, %s3) : memref<?x?x?xf32>
	linalg.fill(%buf, %f) : memref<?x?x?xf32>, f32			linalg.fill(%buf, %f) : memref<?x?x?xf32>, f32
	return %buf : memref<?x?x?xf32>			return %buf : memref<?x?x?xf32>
	}			}

	func @conv_3d(%arg0: memref<?x?x?xf32>, %arg1: memref<?x?x?xf32>, %arg2: memref<?x?x?xf32>) {			func @conv_3d(%arg0: memref<?x?x?xf32>, %arg1: memref<?x?x?xf32>, %arg2: memref<?x?x?xf32>) {
	linalg.conv_3d %arg0, %arg1, %arg2 : (memref<?x?x?xf32>, memref<?x?x?xf32>, memref<?x?x?xf32>)			linalg.conv_3d ins (%arg0, %arg1: memref<?x?x?xf32>, memref<?x?x?xf32>)
				outs (%arg2: memref<?x?x?xf32>)
	return			return
	}			}

	func @main() {			func @main() {
	%c0 = constant 0 : index			%c0 = constant 0 : index
	%c1 = constant 1 : index			%c1 = constant 1 : index
	%c3 = constant 3 : index			%c3 = constant 3 : index
	%c6 = constant 6 : index			%c6 = constant 6 : index
	▲ Show 20 Lines • Show All 42 Lines • Show Last 20 Lines

mlir/integration_test/Dialect/Linalg/CPU/test-conv-3d-ncdhw-call.mlir

	Show All 24 Lines
	// Creates and returns 5-D buffer of size (%s1, %s2, %s3, %s4, %s5) filled with the value %f			// Creates and returns 5-D buffer of size (%s1, %s2, %s3, %s4, %s5) filled with the value %f
	func @alloc_5d_filled_f32(%s1 : index, %s2 : index, %s3 : index, %s4 : index, %s5 : index, %f : f32) -> memref<?x?x?x?x?xf32> {			func @alloc_5d_filled_f32(%s1 : index, %s2 : index, %s3 : index, %s4 : index, %s5 : index, %f : f32) -> memref<?x?x?x?x?xf32> {
	%buf = alloc(%s1, %s2, %s3, %s4, %s5) : memref<?x?x?x?x?xf32>			%buf = alloc(%s1, %s2, %s3, %s4, %s5) : memref<?x?x?x?x?xf32>
	linalg.fill(%buf, %f) : memref<?x?x?x?x?xf32>, f32			linalg.fill(%buf, %f) : memref<?x?x?x?x?xf32>, f32
	return %buf : memref<?x?x?x?x?xf32>			return %buf : memref<?x?x?x?x?xf32>
	}			}

	func @conv_3d_ncdhw(%arg0: memref<?x?x?x?x?xf32>, %arg1: memref<?x?x?x?x?xf32>, %arg2: memref<?x?x?x?x?xf32>) {			func @conv_3d_ncdhw(%arg0: memref<?x?x?x?x?xf32>, %arg1: memref<?x?x?x?x?xf32>, %arg2: memref<?x?x?x?x?xf32>) {
	linalg.conv_3d_ncdhw %arg0, %arg1, %arg2 : (memref<?x?x?x?x?xf32>, memref<?x?x?x?x?xf32>, memref<?x?x?x?x?xf32>)			linalg.conv_3d_ncdhw ins (%arg0, %arg1: memref<?x?x?x?x?xf32>, memref<?x?x?x?x?xf32>)
				outs (%arg2: memref<?x?x?x?x?xf32>)
	return			return
	}			}

	func @main() {			func @main() {
	%c0 = constant 0 : index			%c0 = constant 0 : index
	%c1 = constant 1 : index			%c1 = constant 1 : index
	%c3 = constant 3 : index			%c3 = constant 3 : index
	%c6 = constant 6 : index			%c6 = constant 6 : index
	▲ Show 20 Lines • Show All 46 Lines • Show Last 20 Lines

mlir/integration_test/Dialect/Linalg/CPU/test-conv-3d-ndhwc-call.mlir

	Show All 24 Lines
	// Creates and returns 5-D buffer of size (%s1, %s2, %s3, %s4, %s5) filled with the value %f			// Creates and returns 5-D buffer of size (%s1, %s2, %s3, %s4, %s5) filled with the value %f
	func @alloc_5d_filled_f32(%s1 : index, %s2 : index, %s3 : index, %s4 : index, %s5 : index, %f : f32) -> memref<?x?x?x?x?xf32> {			func @alloc_5d_filled_f32(%s1 : index, %s2 : index, %s3 : index, %s4 : index, %s5 : index, %f : f32) -> memref<?x?x?x?x?xf32> {
	%buf = alloc(%s1, %s2, %s3, %s4, %s5) : memref<?x?x?x?x?xf32>			%buf = alloc(%s1, %s2, %s3, %s4, %s5) : memref<?x?x?x?x?xf32>
	linalg.fill(%buf, %f) : memref<?x?x?x?x?xf32>, f32			linalg.fill(%buf, %f) : memref<?x?x?x?x?xf32>, f32
	return %buf : memref<?x?x?x?x?xf32>			return %buf : memref<?x?x?x?x?xf32>
	}			}

	func @conv_3d_ndhwc(%arg0: memref<?x?x?x?x?xf32>, %arg1: memref<?x?x?x?x?xf32>, %arg2: memref<?x?x?x?x?xf32>) {			func @conv_3d_ndhwc(%arg0: memref<?x?x?x?x?xf32>, %arg1: memref<?x?x?x?x?xf32>, %arg2: memref<?x?x?x?x?xf32>) {
	linalg.conv_3d_ndhwc %arg0, %arg1, %arg2 : (memref<?x?x?x?x?xf32>, memref<?x?x?x?x?xf32>, memref<?x?x?x?x?xf32>)			linalg.conv_3d_ndhwc ins (%arg0, %arg1: memref<?x?x?x?x?xf32>, memref<?x?x?x?x?xf32>)
				outs (%arg2: memref<?x?x?x?x?xf32>)
	return			return
	}			}


	func @main() {			func @main() {
	%c0 = constant 0 : index			%c0 = constant 0 : index
	%c1 = constant 1 : index			%c1 = constant 1 : index
	%c3 = constant 3 : index			%c3 = constant 3 : index
	▲ Show 20 Lines • Show All 148 Lines • Show Last 20 Lines

mlir/lib/Dialect/Linalg/IR/LinalgOps.cpp

Show All 20 Lines
#include "mlir/IR/Matchers.h"		#include "mlir/IR/Matchers.h"
#include "mlir/IR/Module.h"		#include "mlir/IR/Module.h"
#include "mlir/IR/OpImplementation.h"		#include "mlir/IR/OpImplementation.h"
#include "mlir/IR/PatternMatch.h"		#include "mlir/IR/PatternMatch.h"
#include "mlir/IR/StandardTypes.h"		#include "mlir/IR/StandardTypes.h"
#include "mlir/Support/LLVM.h"		#include "mlir/Support/LLVM.h"

#include "llvm/ADT/StringSet.h"		#include "llvm/ADT/StringSet.h"
		#include "llvm/Support/FormatVariadic.h"
#include "llvm/Support/MathExtras.h"		#include "llvm/Support/MathExtras.h"
#include "llvm/Support/raw_ostream.h"		#include "llvm/Support/raw_ostream.h"

using namespace mlir;		using namespace mlir;
using namespace mlir::linalg;		using namespace mlir::linalg;

/// Forward declarations.		/// Forward declarations.
template <typename NamedStructuredOpType>		template <typename NamedStructuredOpType>
static void buildNamedStructuredOpRegionAndAttributes(		static ParseResult
Builder &builder, OperationState &result, TypeRange operandTypes,		parseNamedStructuredOpRegion(OpAsmParser &parser, Region &region,
TypeRange tensorResultTypes);		TypeRange inputTypes, TypeRange outputBufferTypes,
template <typename NamedStructuredOpType>		TypeRange initTensorTypes, TypeRange resultTypes);
static void printNamedStructuredOp(OpAsmPrinter &p, NamedStructuredOpType op);
template <typename NamedStructuredOpType>		template <typename NamedStructuredOpType>
static ParseResult parseNamedStructuredOp(OpAsmParser &parser,		static void buildNamedStructuredOpRegionAndAttributes(
OperationState &result);		OpBuilder &opBuilder, OperationState &result, TypeRange inputTypes,
		TypeRange outputBufferTypes, TypeRange initTensorTypes,
		TypeRange resultTypes);
		static ParseResult
		parseNamedStructuredOpResults(OpAsmParser &parser,
		SmallVectorImpl<Type> &resultTypes);
		static void printNamedStructuredOpResults(OpAsmPrinter &p,
		TypeRange resultTypes);
template <typename NamedStructuredOpType>		template <typename NamedStructuredOpType>
static LogicalResult verifyNamedStructuredOp(NamedStructuredOpType op);		static LogicalResult verifyNamedStructuredOp(NamedStructuredOpType op);

/// This is a common class used for patterns of the form		/// This is a common class used for patterns of the form
/// ```		/// ```
/// someop(memrefcast) -> someop		/// someop(memrefcast) -> someop
/// ```		/// ```
/// It folds the source of the memref_cast into the root operation directly.		/// It folds the source of the memref_cast into the root operation directly.
▲ Show 20 Lines • Show All 189 Lines • ▼ Show 20 Lines	LogicalResult BlockArgsVerifier<IndexedGenericOp>::verify(IndexedGenericOp op,
return success();		return success();
}		}
} // namespace		} // namespace

template <typename GenericOpType>		template <typename GenericOpType>
static LogicalResult verifyGenericOp(GenericOpType op) {		static LogicalResult verifyGenericOp(GenericOpType op) {
auto nInputViews = op.getNumInputs();		auto nInputViews = op.getNumInputs();
auto nLoops = op.getNumLoops();		auto nLoops = op.getNumLoops();
auto nInputsAndOutputBuffers = op.getNumInputsAndOutputBuffers();
if (nInputsAndOutputBuffers != llvm::size(op.views()))
return op.emitOpError("expected exactly ")
<< nInputsAndOutputBuffers
<< " inputs (tensor or buffer) and output buffer operands";

auto &region = op.region();		auto &region = op.region();
if (!llvm::hasSingleElement(region))		if (!llvm::hasSingleElement(region))
return op.emitOpError("expected region with 1 block");		return op.emitOpError("expected region with 1 block");
if (failed(BlockArgsVerifier<GenericOpType>::verify(op, region.front())))		if (failed(BlockArgsVerifier<GenericOpType>::verify(op, region.front())))
return failure();		return failure();

auto symbolSourceAttr =		auto symbolSourceAttr =
Show All 33 Lines	static LogicalResult verifyGenericOp(GenericOpType op) {
// TODO: Bound inference for maps with symbols		// TODO: Bound inference for maps with symbols
if (!concatMap.getNumSymbols() && !inversePermutation(concatMap))		if (!concatMap.getNumSymbols() && !inversePermutation(concatMap))
return op.emitOpError("expected the concatenation of maps in indexing_map "		return op.emitOpError("expected the concatenation of maps in indexing_map "
"to be invertible");		"to be invertible");

return success();		return success();
}		}

static LogicalResult verify(GenericOp op) { return verifyGenericOp(op); }		static LogicalResult verify(GenericOp op) {
static LogicalResult verify(IndexedGenericOp op) { return verifyGenericOp(op); }		// Temporarily hoisted here to avoid duplicating more code.
		// TODO: uniformize with named structured ops.
		auto nInputsAndOutputBuffers = op.getNumInputsAndOutputBuffers();
		if (nInputsAndOutputBuffers != llvm::size(op.views()))
		return op.emitOpError("expected exactly ")
		<< nInputsAndOutputBuffers
		<< " inputs (tensor or buffer) and output buffer operands";
		return verifyGenericOp(op);
		}

		static LogicalResult verify(IndexedGenericOp op) {
		// Temporarily hoisted here to avoid duplicating more code.
		// TODO: uniformize with named structured ops.
		auto nInputsAndOutputBuffers = op.getNumInputsAndOutputBuffers();
		if (nInputsAndOutputBuffers != llvm::size(op.views()))
		return op.emitOpError("expected exactly ")
		<< nInputsAndOutputBuffers
		<< " inputs (tensor or buffer) and output buffer operands";
		ftynseUnsubmitted Done Reply Inline Actions I don't understand why is this necessary. ftynse: I don't understand why is this necessary.
		nicolasvasilacheAuthorUnsubmitted Done Reply Inline Actions the goal is to make everyone use the same verifiers but atm Generic and IndexedGeneric have this `view` property that I need to kill. This will be done in a followup and then all can be unified. nicolasvasilache: the goal is to make everyone use the same verifiers but atm Generic and IndexedGeneric have…
		return verifyGenericOp(op);
		}

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// ReshapeOp		// ReshapeOp
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

/// Collapse reassociation maps that are used in pair of reshape ops where one		/// Collapse reassociation maps that are used in pair of reshape ops where one
/// is a producer and other is the consumer. Only valid to use this method when		/// is a producer and other is the consumer. Only valid to use this method when
/// both the producer and consumer are collapsing dimensions or both are		/// both the producer and consumer are collapsing dimensions or both are
▲ Show 20 Lines • Show All 778 Lines • ▼ Show 20 Lines	static LogicalResult verify(PoolingMinOp op) {
return verifySingleInputPoolingOp(op);		return verifySingleInputPoolingOp(op);
}		}
static LogicalResult verify(PoolingSumOp op) {		static LogicalResult verify(PoolingSumOp op) {
return verifySingleInputPoolingOp(op);		return verifySingleInputPoolingOp(op);
}		}

#include "mlir/Dialect/Linalg/IR/LinalgStructuredOpsInterfaces.cpp.inc"		#include "mlir/Dialect/Linalg/IR/LinalgStructuredOpsInterfaces.cpp.inc"

		#include "mlir/Dialect/Linalg/IR/LinalgNamedStructuredOps.cpp.inc"

#define GET_OP_CLASSES		#define GET_OP_CLASSES
#include "mlir/Dialect/Linalg/IR/LinalgOps.cpp.inc"		#include "mlir/Dialect/Linalg/IR/LinalgOps.cpp.inc"

#define GET_OP_CLASSES		#define GET_OP_CLASSES
#include "mlir/Dialect/Linalg/IR/LinalgStructuredOps.cpp.inc"		#include "mlir/Dialect/Linalg/IR/LinalgStructuredOps.cpp.inc"

		/// Return the dims that are `iteratorTypeName` loops in the LinalgOp `op`.
		/// Assumes `op` is a LinalgOp.
		void mlir::linalg::getDimsOfType(Operation *op, StringRef iteratorTypeName,
		SmallVectorImpl<AffineExpr> &res) {
		unsigned dim = 0;
		MLIRContext *ctx = op->getContext();
		for (auto tn :
		cast<LinalgOp>(op).iterator_types().getAsValueRange<StringAttr>()) {
		if (tn == iteratorTypeName)
		res.push_back(getAffineDimExpr(dim, ctx));
		++dim;
		}
		}

AffineMap mlir::linalg::extractOrIdentityMap(Optional<AffineMap> maybeMap,		AffineMap mlir::linalg::extractOrIdentityMap(Optional<AffineMap> maybeMap,
unsigned rank,		unsigned rank,
MLIRContext *context) {		MLIRContext *context) {
if (maybeMap)		if (maybeMap)
return maybeMap.getValue();		return maybeMap.getValue();
if (rank == 0)		if (rank == 0)
return AffineMap::get(context);		return AffineMap::get(context);
return AffineMap::getMultiDimIdentityMap(rank, context);		return AffineMap::getMultiDimIdentityMap(rank, context);
▲ Show 20 Lines • Show All 76 Lines • ▼ Show 20 Lines	std::string mlir::linalg::generateLibraryCallName(Operation *op) {
auto types = op->getOperandTypes();		auto types = op->getOperandTypes();
llvm::interleave(		llvm::interleave(
types.begin(), types.end(), [&](Type t) { appendMangledType(ss, t); },		types.begin(), types.end(), [&](Type t) { appendMangledType(ss, t); },
[&]() { ss << "_"; });		[&]() { ss << "_"; });
return ss.str();		return ss.str();
}		}

// TODO: Consider making all this boilerplate easy to autogenerate		// TODO: Consider making all this boilerplate easy to autogenerate
// with Tablegen. This seems a desirable property in the context of OpInterfaces		// with Tablegen. This seems a desirable property in the context of
// where a Linalg "named" op isa LinalgOp.		// OpInterfaces where a Linalg "named" op isa LinalgOp.
OpFoldResult ReshapeOp::fold(ArrayRef<Attribute> operands) {		OpFoldResult ReshapeOp::fold(ArrayRef<Attribute> operands) {
if (succeeded(foldMemRefCast(*this)))		if (succeeded(foldMemRefCast(*this)))
return getResult();		return getResult();
return foldReshapeOp(*this, operands);		return foldReshapeOp(*this, operands);
}		}
OpFoldResult SliceOp::fold(ArrayRef<Attribute>) {		OpFoldResult SliceOp::fold(ArrayRef<Attribute>) {
if (succeeded(foldMemRefCast(*this)))		if (succeeded(foldMemRefCast(*this)))
return getResult();		return getResult();
return {};		return {};
}		}
OpFoldResult TensorReshapeOp::fold(ArrayRef<Attribute> operands) {		OpFoldResult TensorReshapeOp::fold(ArrayRef<Attribute> operands) {
return foldReshapeOp(*this, operands);		return foldReshapeOp(*this, operands);
}		}
OpFoldResult TransposeOp::fold(ArrayRef<Attribute>) {		OpFoldResult TransposeOp::fold(ArrayRef<Attribute>) {
if (succeeded(foldMemRefCast(*this)))		if (succeeded(foldMemRefCast(*this)))
return getResult();		return getResult();
return {};		return {};
}		}

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// Auto-generated Linalg named ops.		// Auto-generated Linalg named ops.
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

template <typename NamedStructuredOpType>		template <typename NamedStructuredOpType>
void buildNamedStructuredOpRegionAndAttributes(Builder &builder,		static void buildNamedStructuredOpRegionAndAttributesImpl(
OperationState &result,		OpBuilder &opBuilder, Region &region, TypeRange inputTypes,
TypeRange operandTypes,		TypeRange outputBufferTypes, TypeRange initTensorTypes,
TypeRange tensorResultTypes) {		TypeRange resultTypes,
Region &region = *result.addRegion();		std::function<void(unsigned, unsigned)> errorHandler) {
Block *body = new Block();		Block *body = opBuilder.createBlock(&region);
		ftynseUnsubmitted Done Reply Inline Actions Nit: now that you take an `OpBuidler`, I'd advise to use `OpBuilder::createBlock` instead. ftynse: Nit: now that you take an `OpBuidler`, I'd advise to use `OpBuilder::createBlock` instead.
// TODO: atm all operands go through getElementTypeOrSelf,		// TODO: atm all operands go through getElementTypeOrSelf,
// reconsider when we have evidence we need to.		// reconsider when we have evidence we need to.
for (auto t : operandTypes)		for (auto containers : {inputTypes, outputBufferTypes, resultTypes})
		for (auto t : containers)
body->addArgument(getElementTypeOrSelf(t));		body->addArgument(getElementTypeOrSelf(t));
for (auto t : tensorResultTypes)
body->addArgument(getElementTypeOrSelf(t));
region.push_back(body);

OpBuilder opBuilder(builder.getContext());		unsigned actual = body->getNumArguments();
		unsigned expected = NamedStructuredOpType::getNumRegionArgs();
		if (expected != actual)
		return errorHandler(expected, actual);

opBuilder.setInsertionPointToStart(&region.front());		opBuilder.setInsertionPointToStart(&region.front());
mlir::edsc::ScopedContext scope(opBuilder, builder.getUnknownLoc());		mlir::edsc::ScopedContext scope(opBuilder, opBuilder.getUnknownLoc());
NamedStructuredOpType::regionBuilder(*body);		NamedStructuredOpType::regionBuilder(*body);

// indexing_maps is an auto-generated method.		// indexing_maps is an auto-generated method.

// iterator_types is an auto-generated method.		// iterator_types is an auto-generated method.
}		}

template <typename NamedStructuredOpType>		template <typename NamedStructuredOpType>
static void printNamedStructuredOp(OpAsmPrinter &p, NamedStructuredOpType op) {		void buildNamedStructuredOpRegionAndAttributes(OpBuilder &opBuilder,
std::array<StringRef, 2> silentAttrNames{getIndexingMapsAttrName(),		OperationState &result,
getIteratorTypesAttrName()};		TypeRange inputTypes,
p << op.getOperationName() << ' ';		TypeRange outputBufferTypes,
p.printOptionalAttrDict(op.getAttrs(), silentAttrNames);		TypeRange initTensorTypes,
p << ' ' << op.getOperands();		TypeRange resultTypes) {
p << " : (" << op.getOperandTypes() << ")";		// TODO: why does programmatic creation fail if not using a local builder?
auto outputTensorTypes = op.getResultTypes();		OpBuilder localOpBuilder(opBuilder.getContext());
if (!outputTensorTypes.empty())		Region &region = *result.addRegion();
p << " -> (" << outputTensorTypes << ")";		buildNamedStructuredOpRegionAndAttributesImpl<NamedStructuredOpType>(
		localOpBuilder, region, inputTypes, outputBufferTypes, initTensorTypes,
		resultTypes, [&](unsigned expected, unsigned actual) {
		llvm::errs() << "region expects " << expected << " args, got "
		<< actual;
		assert(expected != actual && "incorrect number of arguments");
		});
}		}

template <typename NamedStructuredOpType>		template <typename NamedStructuredOpType>
static ParseResult parseNamedStructuredOp(OpAsmParser &parser,		static ParseResult
OperationState &result) {		parseNamedStructuredOpRegion(OpAsmParser &parser, Region &region,
SmallVector<OpAsmParser::OperandType, 8> operandsInfo;		TypeRange inputTypes, TypeRange outputBufferTypes,
result.getContext()->getOrLoadDialect<StandardOpsDialect>();		TypeRange initTensorTypes, TypeRange resultTypes) {
		ParseResult res = success();
// Optional attributes may be added.		OpBuilder opBuilder(parser.getBuilder().getContext());
if (parser.parseOperandList(operandsInfo) \|\|		buildNamedStructuredOpRegionAndAttributesImpl<NamedStructuredOpType>(
parser.parseOptionalAttrDict(result.attributes))		opBuilder, region, inputTypes, outputBufferTypes, initTensorTypes,
return failure();		resultTypes, [&](unsigned expected, unsigned actual) {
		res = parser.emitError(parser.getCurrentLocation(),
SmallVector<Type, 8> operandTypes;		llvm::formatv("region expects {0} args, got {1}",
if (parser.parseColon() \|\| parser.parseLParen() \|\|		expected, actual));
		ftynseUnsubmitted Done Reply Inline Actions Nit: I'd expect Twine or formatv to be more efficient than stitching std strings ftynse: Nit: I'd expect Twine or formatv to be more efficient than stitching std strings
parser.parseTypeList(operandTypes) \|\| parser.parseRParen())		});
return failure();		return res;
		}

// Generic ops may specify that a subset of its outputs are tensors. Such		static ParseResult
// outputs are specified in the result type.		parseNamedStructuredOpResults(OpAsmParser &parser,
SmallVector<Type, 8> tensorResultTypes;		SmallVectorImpl<Type> &resultTypes) {
if (parser.parseOptionalArrowTypeList(tensorResultTypes))		if (succeeded(parser.parseOptionalArrow()))
		if (parser.parseTypeList(resultTypes))
return failure();		return failure();
		return success();
		}

if (!tensorResultTypes.empty())		static void printNamedStructuredOpResults(OpAsmPrinter &p,
result.addTypes(tensorResultTypes);		TypeRange resultTypes) {
		if (resultTypes.empty())
// The number of parsed arguments must equal		return;
// the number of expected arguments for the current operation.		p << "-> " << resultTypes;
auto parsedArgs = operandsInfo.size();
auto expectedArgs = NamedStructuredOpType::getNumInputs() +
NamedStructuredOpType::getNumOutputs();
if (parsedArgs != expectedArgs)
return parser.emitError(parser.getNameLoc(),
"expects " + std::to_string(expectedArgs) +
" operands, but found " +
std::to_string(parsedArgs));

buildNamedStructuredOpRegionAndAttributes<NamedStructuredOpType>(
parser.getBuilder(), result, operandTypes, tensorResultTypes);

return parser.resolveOperands(operandsInfo, operandTypes,
parser.getCurrentLocation(), result.operands);
}		}

template <typename NamedStructuredOpType>		template <typename NamedStructuredOpType>
static LogicalResult verifyNamedStructuredOp(NamedStructuredOpType op) {		static LogicalResult verifyNamedStructuredOp(NamedStructuredOpType op) {
return verifyGenericOp<NamedStructuredOpType>(op);		return verifyGenericOp<NamedStructuredOpType>(op);
}		}

namespace {		namespace {
Show All 38 Lines
CANONICALIZERS_AND_FOLDERS(PoolingMaxOp)		CANONICALIZERS_AND_FOLDERS(PoolingMaxOp)
CANONICALIZERS_AND_FOLDERS(PoolingMinOp)		CANONICALIZERS_AND_FOLDERS(PoolingMinOp)
CANONICALIZERS_AND_FOLDERS(PoolingSumOp)		CANONICALIZERS_AND_FOLDERS(PoolingSumOp)
CANONICALIZERS_AND_FOLDERS(CopyOp)		CANONICALIZERS_AND_FOLDERS(CopyOp)
CANONICALIZERS_AND_FOLDERS(FillOp)		CANONICALIZERS_AND_FOLDERS(FillOp)
CANONICALIZERS_AND_FOLDERS(GenericOp)		CANONICALIZERS_AND_FOLDERS(GenericOp)
CANONICALIZERS_AND_FOLDERS(IndexedGenericOp)		CANONICALIZERS_AND_FOLDERS(IndexedGenericOp)

#include "mlir/Dialect/Linalg/IR/LinalgNamedStructuredOps.cpp.inc"

// TODO: Determine whether we can generate the folders and verifiers.		// TODO: Determine whether we can generate the folders and verifiers.
CANONICALIZERS_AND_FOLDERS(BatchMatmulOp)		CANONICALIZERS_AND_FOLDERS(BatchMatmulOp)
CANONICALIZERS_AND_FOLDERS(DotOp)		CANONICALIZERS_AND_FOLDERS(DotOp)
CANONICALIZERS_AND_FOLDERS(MatmulOp)		CANONICALIZERS_AND_FOLDERS(MatmulOp)
CANONICALIZERS_AND_FOLDERS(MatvecOp)		CANONICALIZERS_AND_FOLDERS(MatvecOp)
CANONICALIZERS_AND_FOLDERS(VecmatOp)		CANONICALIZERS_AND_FOLDERS(VecmatOp)
CANONICALIZERS_AND_FOLDERS(ConvWOp)		CANONICALIZERS_AND_FOLDERS(ConvWOp)
CANONICALIZERS_AND_FOLDERS(ConvNWCOp)		CANONICALIZERS_AND_FOLDERS(ConvNWCOp)
CANONICALIZERS_AND_FOLDERS(ConvNCWOp)		CANONICALIZERS_AND_FOLDERS(ConvNCWOp)
CANONICALIZERS_AND_FOLDERS(ConvHWOp)		CANONICALIZERS_AND_FOLDERS(ConvHWOp)
CANONICALIZERS_AND_FOLDERS(ConvNHWCOp)		CANONICALIZERS_AND_FOLDERS(ConvNHWCOp)
CANONICALIZERS_AND_FOLDERS(ConvNCHWOp)		CANONICALIZERS_AND_FOLDERS(ConvNCHWOp)
CANONICALIZERS_AND_FOLDERS(ConvDHWOp)		CANONICALIZERS_AND_FOLDERS(ConvDHWOp)
CANONICALIZERS_AND_FOLDERS(ConvNDHWCOp)		CANONICALIZERS_AND_FOLDERS(ConvNDHWCOp)
CANONICALIZERS_AND_FOLDERS(ConvNCDHWOp)		CANONICALIZERS_AND_FOLDERS(ConvNCDHWOp)

mlir/lib/Dialect/Linalg/IR/LinalgTypes.cpp

	Show First 20 Lines • Show All 52 Lines • ▼ Show 20 Lines

	} // end anonymous namespace			} // end anonymous namespace

	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//
	// LinalgDialect			// LinalgDialect
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	void mlir::linalg::LinalgDialect::initialize() {			void mlir::linalg::LinalgDialect::initialize() {
				getContext()->getOrLoadDialect("std");

	addTypes<RangeType>();			addTypes<RangeType>();
	addOperations<			addOperations<
	#define GET_OP_LIST			#define GET_OP_LIST
	#include "mlir/Dialect/Linalg/IR/LinalgOps.cpp.inc"			#include "mlir/Dialect/Linalg/IR/LinalgOps.cpp.inc"
	>();			>();
	addOperations<			addOperations<
	#define GET_OP_LIST			#define GET_OP_LIST
	#include "mlir/Dialect/Linalg/IR/LinalgStructuredOps.cpp.inc"			#include "mlir/Dialect/Linalg/IR/LinalgStructuredOps.cpp.inc"
	>();			>();

	addInterfaces<LinalgInlinerInterface>();			addInterfaces<LinalgInlinerInterface>();
	}			}

	Type mlir::linalg::LinalgDialect::parseType(DialectAsmParser &parser) const {			Type mlir::linalg::LinalgDialect::parseType(DialectAsmParser &parser) const {
	// Parse the main keyword for the type.			// Parse the main keyword for the type.
	StringRef keyword;			StringRef keyword;
	if (parser.parseKeyword(&keyword))			if (parser.parseKeyword(&keyword))
	return Type();			return Type();
	Show All 17 Lines

mlir/test/Conversion/LinalgToVector/linalg-to-vector.mlir

	// RUN: mlir-opt %s -test-conv-vectorization --cse \| FileCheck %s			// RUN: mlir-opt %s -test-conv-vectorization --cse \| FileCheck %s

	// CHECK-DAG: #[[$map0:.*]] = affine_map<(d0)[s0] -> (1, -d0 + s0)>			// CHECK-DAG: #[[$map0:.*]] = affine_map<(d0)[s0] -> (1, -d0 + s0)>
	// CHECK-DAG: #[[$map1:.*]] = affine_map<(d0)[s0] -> (d0 + s0)>			// CHECK-DAG: #[[$map1:.*]] = affine_map<(d0)[s0] -> (d0 + s0)>
	// CHECK-DAG: #[[$map2:.*]] = affine_map<(d0, d1) -> (d0 + d1)>			// CHECK-DAG: #[[$map2:.*]] = affine_map<(d0, d1) -> (d0 + d1)>
	// CHECK-DAG: #[[$map3:.*]] = affine_map<(d0, d1)[s0] -> (3, -d0 - d1 + s0)>			// CHECK-DAG: #[[$map3:.*]] = affine_map<(d0, d1)[s0] -> (3, -d0 - d1 + s0)>
	// CHECK-DAG: #[[$map4:.*]] = affine_map<(d0)[s0] -> (3, -d0 + s0)>			// CHECK-DAG: #[[$map4:.*]] = affine_map<(d0)[s0] -> (3, -d0 + s0)>
	// CHECK-DAG: #[[$map5:.*]] = affine_map<(d0) -> (d0)>			// CHECK-DAG: #[[$map5:.*]] = affine_map<(d0) -> (d0)>

	func @conv_1d(%arg0: memref<?xf32>, %arg1: memref<?xf32>, %arg2: memref<?xf32>) {			func @conv_1d(%arg0: memref<?xf32>, %arg1: memref<?xf32>, %arg2: memref<?xf32>) {
	linalg.conv_1d %arg0, %arg1, %arg2 : (memref<?xf32>, memref<?xf32>, memref<?xf32>)			linalg.conv_1d ins(%arg0, %arg1 : memref<?xf32>, memref<?xf32>)
				outs(%arg2 : memref<?xf32>)
	return			return
	}			}

	// CHECK-LABEL: @conv_1d			// CHECK-LABEL: @conv_1d
	// CHECK-SAME: %[[arg0:[a-zA-Z0-9]+]]: memref<?xf32>			// CHECK-SAME: %[[arg0:[a-zA-Z0-9]+]]: memref<?xf32>
	// CHECK-SAME: %[[arg1:[a-zA-Z0-9]+]]: memref<?xf32>			// CHECK-SAME: %[[arg1:[a-zA-Z0-9]+]]: memref<?xf32>
	// CHECK-SAME: %[[arg2:[a-zA-Z0-9]+]]: memref<?xf32			// CHECK-SAME: %[[arg2:[a-zA-Z0-9]+]]: memref<?xf32
	// CHECK-DAG: %[[c12:.*]] = constant 12 : index			// CHECK-DAG: %[[c12:.*]] = constant 12 : index
	Show All 33 Lines

mlir/test/Dialect/Linalg/affine.mlir

	Show All 9 Lines
	// CHECK-DAG: #[[$clampMinMap:.*]] = affine_map<(d0) -> (d0, 0)>			// CHECK-DAG: #[[$clampMinMap:.*]] = affine_map<(d0) -> (d0, 0)>

	func @matmul(%arg0: memref<?xi8>, %M: index, %N: index, %K: index) {			func @matmul(%arg0: memref<?xi8>, %M: index, %N: index, %K: index) {
	%c0 = constant 0 : index			%c0 = constant 0 : index
	%c1 = constant 1 : index			%c1 = constant 1 : index
	%A = view %arg0[%c0][%M, %K] : memref<?xi8> to memref<?x?xf32>			%A = view %arg0[%c0][%M, %K] : memref<?xi8> to memref<?x?xf32>
	%B = view %arg0[%c0][%K, %N] : memref<?xi8> to memref<?x?xf32>			%B = view %arg0[%c0][%K, %N] : memref<?xi8> to memref<?x?xf32>
	%C = view %arg0[%c0][%M, %N] : memref<?xi8> to memref<?x?xf32>			%C = view %arg0[%c0][%M, %N] : memref<?xi8> to memref<?x?xf32>
	linalg.matmul %A, %B, %C : (memref<?x?xf32>, memref<?x?xf32>, memref<?x?xf32>)			linalg.matmul ins(%A, %B: memref<?x?xf32>, memref<?x?xf32>)
				outs(%C: memref<?x?xf32>)
	return			return
	}			}

	// CHECK-LABEL: func @matmul(%{{.*}}: memref<?xi8>,			// CHECK-LABEL: func @matmul(%{{.*}}: memref<?xi8>,
	// CHECK-SAME: [[M:arg[0-9]+]]: index			// CHECK-SAME: [[M:arg[0-9]+]]: index
	// CHECK-SAME: [[N:arg[0-9]+]]: index			// CHECK-SAME: [[N:arg[0-9]+]]: index
	// CHECK-SAME: [[K:arg[0-9]+]]: index			// CHECK-SAME: [[K:arg[0-9]+]]: index
	// CHECK: %[[A:.]] = std.view %{{.}}[{{.*}}] : memref<?xi8> to memref<?x?xf32>			// CHECK: %[[A:.]] = std.view %{{.}}[{{.*}}] : memref<?xi8> to memref<?x?xf32>
	▲ Show 20 Lines • Show All 70 Lines • ▼ Show 20 Lines
	// CHECK: %{{.}} = affine.load %{{.}}[%{{.}}, %{{.}}, %{{.}}, %{{.}}] : memref<?x?x?x?xf32>			// CHECK: %{{.}} = affine.load %{{.}}[%{{.}}, %{{.}}, %{{.}}, %{{.}}] : memref<?x?x?x?xf32>
	// CHECK: %{{.}} = addf %{{.}}, %{{.*}} : f32			// CHECK: %{{.}} = addf %{{.}}, %{{.*}} : f32
	// CHECK: affine.store %{{.}}, %{{.}}[%{{.}}, %{{.}}, %{{.}}, %{{.}}] : memref<?x?x?x?xf32>			// CHECK: affine.store %{{.}}, %{{.}}[%{{.}}, %{{.}}, %{{.}}, %{{.}}] : memref<?x?x?x?xf32>

	//----------------------------------------------------------------------------//			//----------------------------------------------------------------------------//
	// Named ops to loops.			// Named ops to loops.
	//----------------------------------------------------------------------------//			//----------------------------------------------------------------------------//
	func @named_batch_matmul(%A: memref<?x?x?xf32>, %B: memref<?x?x?xf32>, %C: memref<?x?x?xf32>) {			func @named_batch_matmul(%A: memref<?x?x?xf32>, %B: memref<?x?x?xf32>, %C: memref<?x?x?xf32>) {
	linalg.batch_matmul %A, %B, %C : (memref<?x?x?xf32>, memref<?x?x?xf32>, memref<?x?x?xf32>) -> ()			linalg.batch_matmul ins(%A, %B: memref<?x?x?xf32>, memref<?x?x?xf32>)
				outs(%C : memref<?x?x?xf32>)
	return			return
	}			}
	// CHECK-LABEL: @named_batch_matmul			// CHECK-LABEL: @named_batch_matmul
	// CHECK-SAME: %[[mA:[a-zA-Z0-9]+]]: memref<?x?x?xf32>			// CHECK-SAME: %[[mA:[a-zA-Z0-9]+]]: memref<?x?x?xf32>
	// CHECK-SAME: %[[mB:[a-zA-Z0-9]+]]: memref<?x?x?xf32>			// CHECK-SAME: %[[mB:[a-zA-Z0-9]+]]: memref<?x?x?xf32>
	// CHECK-SAME: %[[mC:[a-zA-Z0-9]+]]: memref<?x?x?xf32>			// CHECK-SAME: %[[mC:[a-zA-Z0-9]+]]: memref<?x?x?xf32>
	// CHECK: %[[B:.*]] = dim %[[mA]], %c0 : memref<?x?x?xf32>			// CHECK: %[[B:.*]] = dim %[[mA]], %c0 : memref<?x?x?xf32>
	// CHECK: %[[M:.*]] = dim %[[mA]], %c1 : memref<?x?x?xf32>			// CHECK: %[[M:.*]] = dim %[[mA]], %c1 : memref<?x?x?xf32>
	Show All 36 Lines

mlir/test/Dialect/Linalg/canonicalize.mlir

	// RUN: mlir-opt %s -canonicalize -split-input-file \| FileCheck %s			// RUN: mlir-opt %s -canonicalize -split-input-file \| FileCheck %s

	// CHECK-LABEL: func @memref_cast(			// CHECK-LABEL: func @memref_cast(
	func @memref_cast(%a: index, %b: index) -> memref<?x?xf32> {			func @memref_cast(%a: index, %b: index) -> memref<?x?xf32> {
	%c0 = constant 0 : index			%c0 = constant 0 : index
	%c1 = constant 1 : index			%c1 = constant 1 : index
	%c8 = constant 8 : index			%c8 = constant 8 : index
	%c16 = constant 16 : index			%c16 = constant 16 : index
	%1 = alloc (%b) : memref<?xi8>			%1 = alloc (%b) : memref<?xi8>
	%2 = view %1[%c0][] : memref<?xi8> to memref<16x16xf32>			%2 = view %1[%c0][] : memref<?xi8> to memref<16x16xf32>
	%3 = memref_cast %2 : memref<16x16xf32> to memref<?x?xf32>			%3 = memref_cast %2 : memref<16x16xf32> to memref<?x?xf32>
	%r0 = linalg.range %c0:%c8:%c1 : !linalg.range			%r0 = linalg.range %c0:%c8:%c1 : !linalg.range

	// CHECK: linalg.slice {{.*}} : memref<16x16xf32>, !linalg.range, !linalg.range, memref<?x?xf32>			// CHECK: linalg.slice {{.*}} : memref<16x16xf32>, !linalg.range, !linalg.range, memref<?x?xf32>
	%4 = linalg.slice %3[%r0, %r0] : memref<?x?xf32>, !linalg.range, !linalg.range, memref<?x?xf32>			%4 = linalg.slice %3[%r0, %r0] : memref<?x?xf32>, !linalg.range, !linalg.range, memref<?x?xf32>

	// CHECK: linalg.matmul{{.*}}: (memref<16x16xf32>, memref<16x16xf32>, memref<16x16xf32>)			// CHECK: linalg.matmul ins({{.}}memref<16x16xf32>, memref<16x16xf32>) outs({{.}}memref<16x16xf32>)
	linalg.matmul %3, %3, %3 : (memref<?x?xf32>, memref<?x?xf32>, memref<?x?xf32>)			linalg.matmul ins(%3, %3: memref<?x?xf32>, memref<?x?xf32>)
				outs(%3: memref<?x?xf32>)
	return %4: memref<?x?xf32>			return %4: memref<?x?xf32>
	}			}

	// -----			// -----

	func @collapsing_tensor_reshapes(%arg0 : tensor<?x?x?x?x?xf32>) -> tensor<?x?xf32>			func @collapsing_tensor_reshapes(%arg0 : tensor<?x?x?x?x?xf32>) -> tensor<?x?xf32>
	{			{
	%0 = linalg.tensor_reshape %arg0			%0 = linalg.tensor_reshape %arg0
	▲ Show 20 Lines • Show All 236 Lines • Show Last 20 Lines

mlir/test/Dialect/Linalg/fold-affine-min-scf.mlir

	// RUN: mlir-opt %s -test-linalg-transform-patterns=test-affine-min-scf-canonicalization-patterns			// RUN: mlir-opt %s -test-linalg-transform-patterns=test-affine-min-scf-canonicalization-patterns \| FileCheck %s
	//\| FileCheck %s

	// CHECK-LABEL: scf_for			// CHECK-LABEL: scf_for
	func @scf_for(%A : memref<i64>, %step : index) {			func @scf_for(%A : memref<i64>, %step : index) {
	%c0 = constant 0 : index			%c0 = constant 0 : index
	%c1 = constant 1 : index			%c1 = constant 1 : index
	%c2 = constant 2 : index			%c2 = constant 2 : index
	%c7 = constant 7 : index			%c7 = constant 7 : index
	%c4 = constant 4 : index			%c4 = constant 4 : index
	▲ Show 20 Lines • Show All 138 Lines • Show Last 20 Lines

mlir/test/Dialect/Linalg/fusion-2-level.mlir

	// RUN: mlir-opt %s -linalg-fusion \| FileCheck %s			// RUN: mlir-opt %s -linalg-fusion \| FileCheck %s

	func @f1(%A: memref<?x?xf32, offset: ?, strides: [?, 1]>, %B: memref<?x?xf32, offset: ?, strides: [?, 1]>, %C: memref<?x?xf32, offset: ?, strides: [?, 1]>, %D: memref<?x?xf32, offset: ?, strides: [?, 1]>, %E: memref<?x?xf32, offset: ?, strides: [?, 1]>) -> memref<?x?xf32, offset: ?, strides: [?, 1]> {			func @f1(%A: memref<?x?xf32, offset: ?, strides: [?, 1]>, %B: memref<?x?xf32, offset: ?, strides: [?, 1]>, %C: memref<?x?xf32, offset: ?, strides: [?, 1]>, %D: memref<?x?xf32, offset: ?, strides: [?, 1]>, %E: memref<?x?xf32, offset: ?, strides: [?, 1]>) -> memref<?x?xf32, offset: ?, strides: [?, 1]> {
	%c1 = constant 1 : index			%c1 = constant 1 : index
	%c0 = constant 0 : index			%c0 = constant 0 : index
	%c4 = constant 4 : index			%c4 = constant 4 : index
	%c3 = constant 3 : index			%c3 = constant 3 : index
	%c2 = constant 2 : index			%c2 = constant 2 : index
	%c40 = constant 40 : index			%c40 = constant 40 : index
	%c30 = constant 30 : index			%c30 = constant 30 : index
	%c20 = constant 20 : index			%c20 = constant 20 : index
	%0 = dim %C, %c0 : memref<?x?xf32, offset: ?, strides: [?, 1]>			%0 = dim %C, %c0 : memref<?x?xf32, offset: ?, strides: [?, 1]>
	%1 = dim %C, %c1 : memref<?x?xf32, offset: ?, strides: [?, 1]>			%1 = dim %C, %c1 : memref<?x?xf32, offset: ?, strides: [?, 1]>
	%2 = dim %D, %c1 : memref<?x?xf32, offset: ?, strides: [?, 1]>			%2 = dim %D, %c1 : memref<?x?xf32, offset: ?, strides: [?, 1]>
	linalg.matmul %A, %B, %C : (memref<?x?xf32, offset: ?, strides: [?, 1]>, memref<?x?xf32, offset: ?, strides: [?, 1]>, memref<?x?xf32, offset: ?, strides: [?, 1]>)			linalg.matmul ins(%A, %B: memref<?x?xf32, offset: ?, strides: [?, 1]>, memref<?x?xf32, offset: ?, strides: [?, 1]>)
				outs(%C: memref<?x?xf32, offset: ?, strides: [?, 1]>)
	scf.for %arg5 = %c0 to %0 step %c20 {			scf.for %arg5 = %c0 to %0 step %c20 {
	scf.for %arg6 = %c0 to %2 step %c30 {			scf.for %arg6 = %c0 to %2 step %c30 {
	scf.for %arg7 = %c0 to %1 step %c40 {			scf.for %arg7 = %c0 to %1 step %c40 {
	%5 = std.subview %C[%arg5, %arg7][%c20, %c40][%c1, %c1] : memref<?x?xf32, offset: ?, strides: [?, 1]> to memref<?x?xf32, offset: ?, strides: [?, ?]>			%5 = std.subview %C[%arg5, %arg7][%c20, %c40][%c1, %c1] : memref<?x?xf32, offset: ?, strides: [?, 1]> to memref<?x?xf32, offset: ?, strides: [?, ?]>
	%7 = std.subview %D[%arg7, %arg6][%c40, %c30][%c1, %c1]: memref<?x?xf32, offset: ?, strides: [?, 1]> to memref<?x?xf32, offset: ?, strides: [?, ?]>			%7 = std.subview %D[%arg7, %arg6][%c40, %c30][%c1, %c1]: memref<?x?xf32, offset: ?, strides: [?, 1]> to memref<?x?xf32, offset: ?, strides: [?, ?]>
	%8 = std.subview %E[%arg5, %arg6][%c20, %c40][%c1, %c1] : memref<?x?xf32, offset: ?, strides: [?, 1]> to memref<?x?xf32, offset: ?, strides: [?, ?]>			%8 = std.subview %E[%arg5, %arg6][%c20, %c40][%c1, %c1] : memref<?x?xf32, offset: ?, strides: [?, 1]> to memref<?x?xf32, offset: ?, strides: [?, ?]>
	%9 = dim %5, %c0 : memref<?x?xf32, offset: ?, strides: [?, ?]>			%9 = dim %5, %c0 : memref<?x?xf32, offset: ?, strides: [?, ?]>
	%10 = dim %5, %c1 : memref<?x?xf32, offset: ?, strides: [?, ?]>			%10 = dim %5, %c1 : memref<?x?xf32, offset: ?, strides: [?, ?]>
	%11 = dim %7, %c1 : memref<?x?xf32, offset: ?, strides: [?, ?]>			%11 = dim %7, %c1 : memref<?x?xf32, offset: ?, strides: [?, ?]>
	scf.for %arg8 = %c0 to %9 step %c2 {			scf.for %arg8 = %c0 to %9 step %c2 {
	scf.for %arg9 = %c0 to %11 step %c3 {			scf.for %arg9 = %c0 to %11 step %c3 {
	scf.for %arg10 = %c0 to %10 step %c4 {			scf.for %arg10 = %c0 to %10 step %c4 {
	%14 = std.subview %5[%arg8, %arg10][%c2, %c4][%c1, %c1] : memref<?x?xf32, offset: ?, strides: [?, ?]> to memref<?x?xf32, offset: ?, strides: [?, ?]>			%14 = std.subview %5[%arg8, %arg10][%c2, %c4][%c1, %c1] : memref<?x?xf32, offset: ?, strides: [?, ?]> to memref<?x?xf32, offset: ?, strides: [?, ?]>
	%16 = std.subview %7[%arg10, %arg9][%c4, %c3][%c1, %c1]: memref<?x?xf32, offset: ?, strides: [?, ?]> to memref<?x?xf32, offset: ?, strides: [?, ?]>			%16 = std.subview %7[%arg10, %arg9][%c4, %c3][%c1, %c1]: memref<?x?xf32, offset: ?, strides: [?, ?]> to memref<?x?xf32, offset: ?, strides: [?, ?]>
	%17 = std.subview %8[%arg8, %arg9][%c2, %c4][%c1, %c1] : memref<?x?xf32, offset: ?, strides: [?, ?]> to memref<?x?xf32, offset: ?, strides: [?, ?]>			%17 = std.subview %8[%arg8, %arg9][%c2, %c4][%c1, %c1] : memref<?x?xf32, offset: ?, strides: [?, ?]> to memref<?x?xf32, offset: ?, strides: [?, ?]>
	linalg.matmul %14, %16, %17 : (memref<?x?xf32, offset: ?, strides: [?, ?]>, memref<?x?xf32, offset: ?, strides: [?, ?]>, memref<?x?xf32, offset: ?, strides: [?, ?]>)			linalg.matmul ins(%14, %16: memref<?x?xf32, offset: ?, strides: [?, ?]>, memref<?x?xf32, offset: ?, strides: [?, ?]>)
				outs(%17: memref<?x?xf32, offset: ?, strides: [?, ?]>)
	}			}
	}			}
	}			}
	}			}
	}			}
	}			}
	return %E : memref<?x?xf32, offset: ?, strides: [?, 1]>			return %E : memref<?x?xf32, offset: ?, strides: [?, 1]>
	}			}
	Show All 10 Lines

mlir/test/Dialect/Linalg/fusion.mlir

// RUN: mlir-opt %s -linalg-fusion -split-input-file \| FileCheck %s		// RUN: mlir-opt %s -linalg-fusion -split-input-file \| FileCheck %s

func @f1(%A: memref<?x?xf32, offset: 0, strides: [?, 1]>,		func @f1(%A: memref<?x?xf32, offset: 0, strides: [?, 1]>,
%B: memref<?x?xf32, offset: 0, strides: [?, 1]>,		%B: memref<?x?xf32, offset: 0, strides: [?, 1]>,
%C: memref<?x?xf32, offset: 0, strides: [?, 1]>,		%C: memref<?x?xf32, offset: 0, strides: [?, 1]>,
%D: memref<?x?xf32, offset: 0, strides: [?, 1]>,		%D: memref<?x?xf32, offset: 0, strides: [?, 1]>,
%E: memref<?x?xf32, offset: 0, strides: [?, 1]>		%E: memref<?x?xf32, offset: 0, strides: [?, 1]>
) -> memref<?x?xf32, offset: 0, strides: [?, 1]> {		) -> memref<?x?xf32, offset: 0, strides: [?, 1]> {
%c0 = constant 0 : index		%c0 = constant 0 : index
%c4 = constant 4 : index		%c4 = constant 4 : index
%c3 = constant 3 : index		%c3 = constant 3 : index
%c2 = constant 2 : index		%c2 = constant 2 : index
%c1 = constant 1 : index		%c1 = constant 1 : index
%0 = dim %A, %c0 : memref<?x?xf32, offset: 0, strides: [?, 1]>		%0 = dim %A, %c0 : memref<?x?xf32, offset: 0, strides: [?, 1]>
%1 = dim %A, %c1 : memref<?x?xf32, offset: 0, strides: [?, 1]>		%1 = dim %A, %c1 : memref<?x?xf32, offset: 0, strides: [?, 1]>
%2 = dim %B, %c1 : memref<?x?xf32, offset: 0, strides: [?, 1]>		%2 = dim %B, %c1 : memref<?x?xf32, offset: 0, strides: [?, 1]>
linalg.matmul %A, %B, %C :		linalg.matmul ins(%A, %B : memref<?x?xf32, offset: 0, strides: [?, 1]>,
(memref<?x?xf32, offset: 0, strides: [?, 1]>,
memref<?x?xf32, offset: 0, strides: [?, 1]>,
memref<?x?xf32, offset: 0, strides: [?, 1]>)		memref<?x?xf32, offset: 0, strides: [?, 1]>)
		outs(%C : memref<?x?xf32, offset: 0, strides: [?, 1]>)
scf.for %arg5 = %c0 to %0 step %c2 {		scf.for %arg5 = %c0 to %0 step %c2 {
scf.for %arg6 = %c0 to %2 step %c3 {		scf.for %arg6 = %c0 to %2 step %c3 {
scf.for %arg7 = %c0 to %1 step %c4 {		scf.for %arg7 = %c0 to %1 step %c4 {
%5 = std.subview %A[%arg5, %arg7][%c2, %c4][%c1, %c1] :		%5 = std.subview %A[%arg5, %arg7][%c2, %c4][%c1, %c1] :
memref<?x?xf32, offset: 0, strides: [?, 1]> to		memref<?x?xf32, offset: 0, strides: [?, 1]> to
memref<?x?xf32, offset: ?, strides: [?, ?]>		memref<?x?xf32, offset: ?, strides: [?, ?]>
%7 = std.subview %B[%arg7, %arg6][%c4, %c3][%c1, %c1] :		%7 = std.subview %B[%arg7, %arg6][%c4, %c3][%c1, %c1] :
memref<?x?xf32, offset: 0, strides: [?, 1]> to		memref<?x?xf32, offset: 0, strides: [?, 1]> to
memref<?x?xf32, offset: ?, strides: [?, ?]>		memref<?x?xf32, offset: ?, strides: [?, ?]>
%8 = std.subview %C[%arg5, %arg6][%c2, %c3][%c1, %c1] :		%8 = std.subview %C[%arg5, %arg6][%c2, %c3][%c1, %c1] :
memref<?x?xf32, offset: 0, strides: [?, 1]> to		memref<?x?xf32, offset: 0, strides: [?, 1]> to
memref<?x?xf32, offset: ?, strides: [?, ?]>		memref<?x?xf32, offset: ?, strides: [?, ?]>
linalg.matmul %5, %7, %8 :		linalg.matmul ins(%5, %7 : memref<?x?xf32, offset: ?, strides: [?, ?]>,
(memref<?x?xf32, offset: ?, strides: [?, ?]>,
memref<?x?xf32, offset: ?, strides: [?, ?]>,
memref<?x?xf32, offset: ?, strides: [?, ?]>)		memref<?x?xf32, offset: ?, strides: [?, ?]>)
		outs(%8: memref<?x?xf32, offset: ?, strides: [?, ?]>)
}		}
}		}
}		}
return %E : memref<?x?xf32, offset: 0, strides: [?, 1]>		return %E : memref<?x?xf32, offset: 0, strides: [?, 1]>
}		}
// CHECK-LABEL: func @f1		// CHECK-LABEL: func @f1
// CHECK: (%[[A:.]]:{{.}}, %[[B:.]]:{{.}}, %[[C:.]]:{{.}}, %[[D:.]]:{{.}}, %[[E:.]]:{{.}})		// CHECK: (%[[A:.]]:{{.}}, %[[B:.]]:{{.}}, %[[C:.]]:{{.}}, %[[D:.]]:{{.}}, %[[E:.]]:{{.}})
// CHECK: scf.for		// CHECK: scf.for
Show All 11 Lines	func @f2(%A: memref<?x?xf32, offset: 0, strides: [?, ?]>,
%D: memref<?x?xf32, offset: 0, strides: [?, ?]>,		%D: memref<?x?xf32, offset: 0, strides: [?, ?]>,
%E: memref<?x?xf32, offset: 0, strides: [?, ?]>		%E: memref<?x?xf32, offset: 0, strides: [?, ?]>
) -> memref<?x?xf32, offset: 0, strides: [?, ?]> {		) -> memref<?x?xf32, offset: 0, strides: [?, ?]> {
%c1 = constant 1 : index		%c1 = constant 1 : index
%c0 = constant 0 : index		%c0 = constant 0 : index
%c4 = constant 4 : index		%c4 = constant 4 : index
%c3 = constant 3 : index		%c3 = constant 3 : index
%c2 = constant 2 : index		%c2 = constant 2 : index
linalg.matmul %A, %B, %C :		linalg.matmul ins(%A, %B : memref<?x?xf32, offset: 0, strides: [?, ?]>,
(memref<?x?xf32, offset: 0, strides: [?, ?]>,
memref<?x?xf32, offset: 0, strides: [?, ?]>,
memref<?x?xf32, offset: 0, strides: [?, ?]>)		memref<?x?xf32, offset: 0, strides: [?, ?]>)
		outs(%C: memref<?x?xf32, offset: 0, strides: [?, ?]>)
%0 = dim %C, %c0 : memref<?x?xf32, offset: 0, strides: [?, ?]>		%0 = dim %C, %c0 : memref<?x?xf32, offset: 0, strides: [?, ?]>
%1 = dim %C, %c1 : memref<?x?xf32, offset: 0, strides: [?, ?]>		%1 = dim %C, %c1 : memref<?x?xf32, offset: 0, strides: [?, ?]>
%2 = dim %D, %c1 : memref<?x?xf32, offset: 0, strides: [?, ?]>		%2 = dim %D, %c1 : memref<?x?xf32, offset: 0, strides: [?, ?]>
scf.for %arg5 = %c0 to %0 step %c2 {		scf.for %arg5 = %c0 to %0 step %c2 {
scf.for %arg6 = %c0 to %2 step %c3 {		scf.for %arg6 = %c0 to %2 step %c3 {
scf.for %arg7 = %c0 to %1 step %c4 {		scf.for %arg7 = %c0 to %1 step %c4 {
%5 = std.subview %C[%arg5, %arg7][%c2, %c4][%c1, %c1] :		%5 = std.subview %C[%arg5, %arg7][%c2, %c4][%c1, %c1] :
memref<?x?xf32, offset: 0, strides: [?, ?]> to		memref<?x?xf32, offset: 0, strides: [?, ?]> to
memref<?x?xf32, offset: ?, strides: [?, ?]>		memref<?x?xf32, offset: ?, strides: [?, ?]>
%7 = std.subview %D[%arg7, %arg6][%c4, %c3][%c1, %c1] :		%7 = std.subview %D[%arg7, %arg6][%c4, %c3][%c1, %c1] :
memref<?x?xf32, offset: 0, strides: [?, ?]> to		memref<?x?xf32, offset: 0, strides: [?, ?]> to
memref<?x?xf32, offset: ?, strides: [?, ?]>		memref<?x?xf32, offset: ?, strides: [?, ?]>
%8 = std.subview %E[%arg5, %arg6][%c2, %c3][%c1, %c1] :		%8 = std.subview %E[%arg5, %arg6][%c2, %c3][%c1, %c1] :
memref<?x?xf32, offset: 0, strides: [?, ?]> to		memref<?x?xf32, offset: 0, strides: [?, ?]> to
memref<?x?xf32, offset: ?, strides: [?, ?]>		memref<?x?xf32, offset: ?, strides: [?, ?]>
linalg.matmul %5, %7, %8 :		linalg.matmul ins(%5, %7 : memref<?x?xf32, offset: ?, strides: [?, ?]>,
(memref<?x?xf32, offset: ?, strides: [?, ?]>,
memref<?x?xf32, offset: ?, strides: [?, ?]>,
memref<?x?xf32, offset: ?, strides: [?, ?]>)		memref<?x?xf32, offset: ?, strides: [?, ?]>)
		outs(%8 : memref<?x?xf32, offset: ?, strides: [?, ?]>)
}		}
}		}
}		}
return %E : memref<?x?xf32, offset: 0, strides: [?, ?]>		return %E : memref<?x?xf32, offset: 0, strides: [?, ?]>
}		}
// CHECK-LABEL: func @f2		// CHECK-LABEL: func @f2
// CHECK: (%[[A:.]]:{{.}}, %[[B:.]]:{{.}}, %[[C:.]]:{{.}}, %[[D:.]]:{{.}}, %[[E:.]]:{{.}})		// CHECK: (%[[A:.]]:{{.}}, %[[B:.]]:{{.}}, %[[C:.]]:{{.}}, %[[D:.]]:{{.}}, %[[E:.]]:{{.}})
// CHECK-DAG: %[[C_0:.]] = dim %[[C]], %c0{{[_0-9]}} : memref<?x?xf32, #[[$strided2D]]>		// CHECK-DAG: %[[C_0:.]] = dim %[[C]], %c0{{[_0-9]}} : memref<?x?xf32, #[[$strided2D]]>
Show All 13 Lines	func @f3(%A: memref<?x?xf32, offset: 0, strides: [?, ?]>,
%D: memref<?x?xf32, offset: 0, strides: [?, ?]>,		%D: memref<?x?xf32, offset: 0, strides: [?, ?]>,
%E: memref<?x?xf32, offset: 0, strides: [?, ?]>		%E: memref<?x?xf32, offset: 0, strides: [?, ?]>
) -> memref<?x?xf32, offset: 0, strides: [?, ?]> {		) -> memref<?x?xf32, offset: 0, strides: [?, ?]> {
%c1 = constant 1 : index		%c1 = constant 1 : index
%c0 = constant 0 : index		%c0 = constant 0 : index
%c4 = constant 4 : index		%c4 = constant 4 : index
%c3 = constant 3 : index		%c3 = constant 3 : index
%c2 = constant 2 : index		%c2 = constant 2 : index
linalg.matmul %A, %B, %C :		linalg.matmul ins(%A, %B : memref<?x?xf32, offset: 0, strides: [?, ?]>,
(memref<?x?xf32, offset: 0, strides: [?, ?]>,
memref<?x?xf32, offset: 0, strides: [?, ?]>,
memref<?x?xf32, offset: 0, strides: [?, ?]>)		memref<?x?xf32, offset: 0, strides: [?, ?]>)
		outs(%C : memref<?x?xf32, offset: 0, strides: [?, ?]>)
%0 = dim %D, %c0 : memref<?x?xf32, offset: 0, strides: [?, ?]>		%0 = dim %D, %c0 : memref<?x?xf32, offset: 0, strides: [?, ?]>
%1 = dim %D, %c1 : memref<?x?xf32, offset: 0, strides: [?, ?]>		%1 = dim %D, %c1 : memref<?x?xf32, offset: 0, strides: [?, ?]>
%2 = dim %C, %c1 : memref<?x?xf32, offset: 0, strides: [?, ?]>		%2 = dim %C, %c1 : memref<?x?xf32, offset: 0, strides: [?, ?]>
scf.for %arg5 = %c0 to %0 step %c2 {		scf.for %arg5 = %c0 to %0 step %c2 {
scf.for %arg6 = %c0 to %2 step %c3 {		scf.for %arg6 = %c0 to %2 step %c3 {
scf.for %arg7 = %c0 to %1 step %c4 {		scf.for %arg7 = %c0 to %1 step %c4 {
%5 = std.subview %D[%arg5, %arg7][%c2, %c4][%c1, %c1] :		%5 = std.subview %D[%arg5, %arg7][%c2, %c4][%c1, %c1] :
memref<?x?xf32, offset: 0, strides: [?, ?]> to		memref<?x?xf32, offset: 0, strides: [?, ?]> to
memref<?x?xf32, offset: ?, strides: [?, ?]>		memref<?x?xf32, offset: ?, strides: [?, ?]>
%7 = std.subview %C[%arg7, %arg6][%c4, %c3][%c1, %c1] :		%7 = std.subview %C[%arg7, %arg6][%c4, %c3][%c1, %c1] :
memref<?x?xf32, offset: 0, strides: [?, ?]> to		memref<?x?xf32, offset: 0, strides: [?, ?]> to
memref<?x?xf32, offset: ?, strides: [?, ?]>		memref<?x?xf32, offset: ?, strides: [?, ?]>
%8 = std.subview %E[%arg5, %arg6][%c2, %c3][%c1, %c1] :		%8 = std.subview %E[%arg5, %arg6][%c2, %c3][%c1, %c1] :
memref<?x?xf32, offset: 0, strides: [?, ?]> to		memref<?x?xf32, offset: 0, strides: [?, ?]> to
memref<?x?xf32, offset: ?, strides: [?, ?]>		memref<?x?xf32, offset: ?, strides: [?, ?]>
linalg.matmul %5, %7, %8 :		linalg.matmul ins(%5, %7 : memref<?x?xf32, offset: ?, strides: [?, ?]>,
(memref<?x?xf32, offset: ?, strides: [?, ?]>,
memref<?x?xf32, offset: ?, strides: [?, ?]>,
memref<?x?xf32, offset: ?, strides: [?, ?]>)		memref<?x?xf32, offset: ?, strides: [?, ?]>)
		outs(%8 : memref<?x?xf32, offset: ?, strides: [?, ?]>)
}		}
}		}
}		}
return %E : memref<?x?xf32, offset: 0, strides: [?, ?]>		return %E : memref<?x?xf32, offset: 0, strides: [?, ?]>
}		}
// CHECK-LABEL: func @f3		// CHECK-LABEL: func @f3
// CHECK: (%[[A:.]]:{{.}}, %[[B:.]]:{{.}}, %[[C:.]]:{{.}}, %[[D:.]]:{{.}}, %[[E:.]]:{{.}})		// CHECK: (%[[A:.]]:{{.}}, %[[B:.]]:{{.}}, %[[C:.]]:{{.}}, %[[D:.]]:{{.}}, %[[E:.]]:{{.}})
// CHECK: %[[D_0:.]] = dim %[[D]], %c0{{_[0-9]}} : memref<?x?xf32, #[[$strided2D]]>		// CHECK: %[[D_0:.]] = dim %[[D]], %c0{{_[0-9]}} : memref<?x?xf32, #[[$strided2D]]>
Show All 13 Lines	func @f4(%A: memref<?x?xf32, offset: 0, strides: [?, ?]>,
%D: memref<?x?xf32, offset: 0, strides: [?, ?]>,		%D: memref<?x?xf32, offset: 0, strides: [?, ?]>,
%E: memref<?x?xf32, offset: 0, strides: [?, ?]>		%E: memref<?x?xf32, offset: 0, strides: [?, ?]>
) -> memref<?x?xf32, offset: 0, strides: [?, ?]> {		) -> memref<?x?xf32, offset: 0, strides: [?, ?]> {
%c1 = constant 1 : index		%c1 = constant 1 : index
%c0 = constant 0 : index		%c0 = constant 0 : index
%c4 = constant 4 : index		%c4 = constant 4 : index
%c3 = constant 3 : index		%c3 = constant 3 : index
%c2 = constant 2 : index		%c2 = constant 2 : index
linalg.matmul %A, %B, %C :		linalg.matmul ins(%A, %B : memref<?x?xf32, offset: 0, strides: [?, ?]>,
(memref<?x?xf32, offset: 0, strides: [?, ?]>,
memref<?x?xf32, offset: 0, strides: [?, ?]>,
memref<?x?xf32, offset: 0, strides: [?, ?]>)		memref<?x?xf32, offset: 0, strides: [?, ?]>)
linalg.matmul %A, %B, %D :		outs(%C : memref<?x?xf32, offset: 0, strides: [?, ?]>)
(memref<?x?xf32, offset: 0, strides: [?, ?]>,		linalg.matmul ins(%A, %B : memref<?x?xf32, offset: 0, strides: [?, ?]>,
memref<?x?xf32, offset: 0, strides: [?, ?]>,
memref<?x?xf32, offset: 0, strides: [?, ?]>)		memref<?x?xf32, offset: 0, strides: [?, ?]>)
		outs(%D : memref<?x?xf32, offset: 0, strides: [?, ?]>)
%0 = dim %C, %c0 : memref<?x?xf32, offset: 0, strides: [?, ?]>		%0 = dim %C, %c0 : memref<?x?xf32, offset: 0, strides: [?, ?]>
%1 = dim %C, %c1 : memref<?x?xf32, offset: 0, strides: [?, ?]>		%1 = dim %C, %c1 : memref<?x?xf32, offset: 0, strides: [?, ?]>
%2 = dim %D, %c1 : memref<?x?xf32, offset: 0, strides: [?, ?]>		%2 = dim %D, %c1 : memref<?x?xf32, offset: 0, strides: [?, ?]>
scf.for %arg5 = %c0 to %0 step %c2 {		scf.for %arg5 = %c0 to %0 step %c2 {
scf.for %arg6 = %c0 to %2 step %c3 {		scf.for %arg6 = %c0 to %2 step %c3 {
scf.for %arg7 = %c0 to %1 step %c4 {		scf.for %arg7 = %c0 to %1 step %c4 {
%5 = std.subview %C[%arg5, %arg7][%c2, %c4][%c1, %c1] :		%5 = std.subview %C[%arg5, %arg7][%c2, %c4][%c1, %c1] :
memref<?x?xf32, offset: 0, strides: [?, ?]> to		memref<?x?xf32, offset: 0, strides: [?, ?]> to
memref<?x?xf32, offset: ?, strides: [?, ?]>		memref<?x?xf32, offset: ?, strides: [?, ?]>
%7 = std.subview %D[%arg7, %arg6][%c4, %c3][%c1, %c1] :		%7 = std.subview %D[%arg7, %arg6][%c4, %c3][%c1, %c1] :
memref<?x?xf32, offset: 0, strides: [?, ?]> to		memref<?x?xf32, offset: 0, strides: [?, ?]> to
memref<?x?xf32, offset: ?, strides: [?, ?]>		memref<?x?xf32, offset: ?, strides: [?, ?]>
%8 = std.subview %E[%arg5, %arg6][%c2, %c3][%c1, %c1] :		%8 = std.subview %E[%arg5, %arg6][%c2, %c3][%c1, %c1] :
memref<?x?xf32, offset: 0, strides: [?, ?]> to		memref<?x?xf32, offset: 0, strides: [?, ?]> to
memref<?x?xf32, offset: ?, strides: [?, ?]>		memref<?x?xf32, offset: ?, strides: [?, ?]>
linalg.matmul %5, %7, %8 :		linalg.matmul ins(%5, %7 : memref<?x?xf32, offset: ?, strides: [?, ?]>,
(memref<?x?xf32, offset: ?, strides: [?, ?]>,
memref<?x?xf32, offset: ?, strides: [?, ?]>,
memref<?x?xf32, offset: ?, strides: [?, ?]>)		memref<?x?xf32, offset: ?, strides: [?, ?]>)
		outs(%8 : memref<?x?xf32, offset: ?, strides: [?, ?]>)
}		}
}		}
}		}
return %E : memref<?x?xf32, offset: 0, strides: [?, ?]>		return %E : memref<?x?xf32, offset: 0, strides: [?, ?]>
}		}
// CHECK-LABEL: func @f4		// CHECK-LABEL: func @f4
// CHECK: (%[[A:.]]:{{.}}, %[[B:.]]:{{.}}, %[[C:.]]:{{.}}, %[[D:.]]:{{.}}, %[[E:.]]:{{.}})		// CHECK: (%[[A:.]]:{{.}}, %[[B:.]]:{{.}}, %[[C:.]]:{{.}}, %[[D:.]]:{{.}}, %[[E:.]]:{{.}})
// CHECK: %[[C_0:.]] = dim %[[C]], %c0{{_[0-9]}} : memref<?x?xf32, #[[$strided2D]]>		// CHECK: %[[C_0:.]] = dim %[[C]], %c0{{_[0-9]}} : memref<?x?xf32, #[[$strided2D]]>
Show All 19 Lines	func @f5(%A: memref<?x?xf32, offset: 0, strides: [?, ?]>,
%c1 = constant 1 : index		%c1 = constant 1 : index
%c0 = constant 0 : index		%c0 = constant 0 : index
%c4 = constant 4 : index		%c4 = constant 4 : index
%c3 = constant 3 : index		%c3 = constant 3 : index
%c2 = constant 2 : index		%c2 = constant 2 : index
%0 = dim %B, %c1 : memref<?x?xf32, offset: 0, strides: [?, ?]>		%0 = dim %B, %c1 : memref<?x?xf32, offset: 0, strides: [?, ?]>
%1 = dim %D, %c0 : memref<?x?xf32, offset: 0, strides: [?, ?]>		%1 = dim %D, %c0 : memref<?x?xf32, offset: 0, strides: [?, ?]>
%2 = dim %D, %c1 : memref<?x?xf32, offset: 0, strides: [?, ?]>		%2 = dim %D, %c1 : memref<?x?xf32, offset: 0, strides: [?, ?]>
linalg.matmul %A, %B, %C :		linalg.matmul ins(%A, %B : memref<?x?xf32, offset: 0, strides: [?, ?]>,
(memref<?x?xf32, offset: 0, strides: [?, ?]>,
memref<?x?xf32, offset: 0, strides: [?, ?]>,
memref<?x?xf32, offset: 0, strides: [?, ?]>)		memref<?x?xf32, offset: 0, strides: [?, ?]>)
linalg.matmul %C, %B, %D :		outs(%C : memref<?x?xf32, offset: 0, strides: [?, ?]>)
(memref<?x?xf32, offset: 0, strides: [?, ?]>,		linalg.matmul ins(%C, %B : memref<?x?xf32, offset: 0, strides: [?, ?]>,
memref<?x?xf32, offset: 0, strides: [?, ?]>,
memref<?x?xf32, offset: 0, strides: [?, ?]>)		memref<?x?xf32, offset: 0, strides: [?, ?]>)
		outs(%D : memref<?x?xf32, offset: 0, strides: [?, ?]>)
scf.for %arg5 = %c0 to %1 step %c2 {		scf.for %arg5 = %c0 to %1 step %c2 {
scf.for %arg6 = %c0 to %0 step %c3 {		scf.for %arg6 = %c0 to %0 step %c3 {
scf.for %arg7 = %c0 to %2 step %c4 {		scf.for %arg7 = %c0 to %2 step %c4 {
%5 = std.subview %D[%arg5, %arg7][%c2, %c4][%c1, %c1] :		%5 = std.subview %D[%arg5, %arg7][%c2, %c4][%c1, %c1] :
memref<?x?xf32, offset: 0, strides: [?, ?]> to		memref<?x?xf32, offset: 0, strides: [?, ?]> to
memref<?x?xf32, offset: ?, strides: [?, ?]>		memref<?x?xf32, offset: ?, strides: [?, ?]>
%7 = std.subview %B[%arg7, %arg6][%c4, %c3][%c1, %c1] :		%7 = std.subview %B[%arg7, %arg6][%c4, %c3][%c1, %c1] :
memref<?x?xf32, offset: 0, strides: [?, ?]> to		memref<?x?xf32, offset: 0, strides: [?, ?]> to
memref<?x?xf32, offset: ?, strides: [?, ?]>		memref<?x?xf32, offset: ?, strides: [?, ?]>
%8 = std.subview %E[%arg5, %arg6][%c2, %c3][%c1, %c1] :		%8 = std.subview %E[%arg5, %arg6][%c2, %c3][%c1, %c1] :
memref<?x?xf32, offset: 0, strides: [?, ?]> to		memref<?x?xf32, offset: 0, strides: [?, ?]> to
memref<?x?xf32, offset: ?, strides: [?, ?]>		memref<?x?xf32, offset: ?, strides: [?, ?]>
linalg.matmul %5, %7, %8 :		linalg.matmul ins(%5, %7 : memref<?x?xf32, offset: ?, strides: [?, ?]>,
(memref<?x?xf32, offset: ?, strides: [?, ?]>,
memref<?x?xf32, offset: ?, strides: [?, ?]>,
memref<?x?xf32, offset: ?, strides: [?, ?]>)		memref<?x?xf32, offset: ?, strides: [?, ?]>)
		outs(%8 : memref<?x?xf32, offset: ?, strides: [?, ?]>)
}		}
}		}
}		}
return %E : memref<?x?xf32, offset: 0, strides: [?, ?]>		return %E : memref<?x?xf32, offset: 0, strides: [?, ?]>
}		}
// CHECK-LABEL: func @f5		// CHECK-LABEL: func @f5
// CHECK: (%[[A:.]]:{{.}}, %[[B:.]]:{{.}}, %[[C:.]]:{{.}}, %[[D:.]]:{{.}}, %[[E:.]]:{{.}})		// CHECK: (%[[A:.]]:{{.}}, %[[B:.]]:{{.}}, %[[C:.]]:{{.}}, %[[D:.]]:{{.}}, %[[E:.]]:{{.}})
// CHECK-DAG: %[[B_1:.]] = dim %[[B]], %c1{{_[0-9]}} : memref<?x?xf32, #[[$strided2D]]>		// CHECK-DAG: %[[B_1:.]] = dim %[[B]], %c1{{_[0-9]}} : memref<?x?xf32, #[[$strided2D]]>
// CHECK-DAG: %[[D_0:.]] = dim %[[D]], %c0{{_[0-9]}} : memref<?x?xf32, #[[$strided2D]]>		// CHECK-DAG: %[[D_0:.]] = dim %[[D]], %c0{{_[0-9]}} : memref<?x?xf32, #[[$strided2D]]>
// CHECK-DAG: %[[D_1:.]] = dim %[[D]], %c1{{_[0-9]}} : memref<?x?xf32, #[[$strided2D]]>		// CHECK-DAG: %[[D_1:.]] = dim %[[D]], %c1{{_[0-9]}} : memref<?x?xf32, #[[$strided2D]]>
// CHECK: scf.for %[[I:.]] = %{{.}} to %[[D_0]] step %{{.*}} {		// CHECK: scf.for %[[I:.]] = %{{.}} to %[[D_0]] step %{{.*}} {
// CHECK: scf.for %[[J:.]] = %{{.}} to %[[B_1]] step %{{.*}} {		// CHECK: scf.for %[[J:.]] = %{{.}} to %[[B_1]] step %{{.*}} {
// CHECK: scf.for %[[K:.]] = %{{.}} to %[[D_1]] step %{{.*}} {		// CHECK: scf.for %[[K:.]] = %{{.}} to %[[D_1]] step %{{.*}} {
// CHECK-DAG: %[[D_IK:.*]] = subview %[[D]][%[[I]], %[[K]]]		// CHECK-DAG: %[[D_IK:.*]] = subview %[[D]][%[[I]], %[[K]]]
// CHECK-DAG: %[[B_KJ:.*]] = subview %[[B]][%[[K]], %[[J]]]		// CHECK-DAG: %[[B_KJ:.*]] = subview %[[B]][%[[K]], %[[J]]]
// CHECK-DAG: %[[E_IJ:.*]] = subview %[[E]][%[[I]], %[[J]]]		// CHECK-DAG: %[[E_IJ:.*]] = subview %[[E]][%[[I]], %[[J]]]
// CHECK: dim		// CHECK: dim
// CHECK-DAG: %[[C_I0:.]] = subview %[[C]][%[[I]], %{{.}}]		// CHECK-DAG: %[[C_I0:.]] = subview %[[C]][%[[I]], %{{.}}]
// CHECK-DAG: %[[B_0K:.]] = subview %[[B]][%{{.}}, %[[K]]]		// CHECK-DAG: %[[B_0K:.]] = subview %[[B]][%{{.}}, %[[K]]]
// CHECK-DAG: %[[D_IK_:.*]] = subview %[[D]][%[[I]], %[[K]]]		// CHECK-DAG: %[[D_IK_:.*]] = subview %[[D]][%[[I]], %[[K]]]
// CHECK: dim		// CHECK: dim
// CHECK-DAG: %[[A_I0:.]] = subview %[[A]][%[[I]], %{{.}}]		// CHECK-DAG: %[[A_I0:.]] = subview %[[A]][%[[I]], %{{.}}]
// CHECK-DAG: %[[B_00:.]] = subview %[[B]][%{{.}}, %{{.*}}]		// CHECK-DAG: %[[B_00:.]] = subview %[[B]][%{{.}}, %{{.*}}]
// CHECK-DAG: %[[C_I0_:.]] = subview %[[C]][%[[I]], %{{.}}]		// CHECK-DAG: %[[C_I0_:.]] = subview %[[C]][%[[I]], %{{.}}]
// CHECK: linalg.matmul %[[A_I0]], %[[B_00]], %[[C_I0_]]		// CHECK: linalg.matmul ins(%[[A_I0]], %[[B_00]]{{.*}} outs(%[[C_I0_]]
// CHECK: linalg.matmul %[[C_I0]], %[[B_0K]], %[[D_IK_]]		// CHECK: linalg.matmul ins(%[[C_I0]], %[[B_0K]]{{.*}} outs(%[[D_IK_]]
// CHECK: linalg.matmul %[[D_IK]], %[[B_KJ]], %[[E_IJ]]		// CHECK: linalg.matmul ins(%[[D_IK]], %[[B_KJ]]{{.*}} outs(%[[E_IJ]]

// -----		// -----

#map0 = affine_map<(d0) -> (d0 + 2)>		#map0 = affine_map<(d0) -> (d0 + 2)>
#map1 = affine_map<(d0) -> (d0 + 4)>		#map1 = affine_map<(d0) -> (d0 + 4)>
#map2 = affine_map<(d0) -> (d0 + 3)>		#map2 = affine_map<(d0) -> (d0 + 3)>

func @f6(%A: memref<?x?xf32, offset: 0, strides: [?, ?]>,		func @f6(%A: memref<?x?xf32, offset: 0, strides: [?, ?]>,
%B: memref<?x?xf32, offset: 0, strides: [?, ?]>,		%B: memref<?x?xf32, offset: 0, strides: [?, ?]>,
%C: memref<?x?xf32, offset: 0, strides: [?, ?]>,		%C: memref<?x?xf32, offset: 0, strides: [?, ?]>,
%D: memref<?x?xf32, offset: 0, strides: [?, ?]>,		%D: memref<?x?xf32, offset: 0, strides: [?, ?]>,
%E: memref<?x?xf32, offset: 0, strides: [?, ?]>		%E: memref<?x?xf32, offset: 0, strides: [?, ?]>
) -> memref<?x?xf32, offset: 0, strides: [?, ?]> {		) -> memref<?x?xf32, offset: 0, strides: [?, ?]> {
%c1 = constant 1 : index		%c1 = constant 1 : index
%c0 = constant 0 : index		%c0 = constant 0 : index
%c4 = constant 4 : index		%c4 = constant 4 : index
%c3 = constant 3 : index		%c3 = constant 3 : index
%c2 = constant 2 : index		%c2 = constant 2 : index
%0 = dim %C, %c1 : memref<?x?xf32, offset: 0, strides: [?, ?]>		%0 = dim %C, %c1 : memref<?x?xf32, offset: 0, strides: [?, ?]>
linalg.matmul %A, %B, %C :		linalg.matmul ins(%A, %B : memref<?x?xf32, offset: 0, strides: [?, ?]>,
(memref<?x?xf32, offset: 0, strides: [?, ?]>,
memref<?x?xf32, offset: 0, strides: [?, ?]>,
memref<?x?xf32, offset: 0, strides: [?, ?]>)		memref<?x?xf32, offset: 0, strides: [?, ?]>)
linalg.matmul %A, %C, %E :		outs(%C : memref<?x?xf32, offset: 0, strides: [?, ?]>)
(memref<?x?xf32, offset: 0, strides: [?, ?]>,		linalg.matmul ins(%A, %C : memref<?x?xf32, offset: 0, strides: [?, ?]>,
memref<?x?xf32, offset: 0, strides: [?, ?]>,
memref<?x?xf32, offset: 0, strides: [?, ?]>)		memref<?x?xf32, offset: 0, strides: [?, ?]>)
		outs(%E : memref<?x?xf32, offset: 0, strides: [?, ?]>)
%1 = dim %C, %c0 : memref<?x?xf32, offset: 0, strides: [?, ?]>		%1 = dim %C, %c0 : memref<?x?xf32, offset: 0, strides: [?, ?]>
%2 = dim %D, %c1 : memref<?x?xf32, offset: 0, strides: [?, ?]>		%2 = dim %D, %c1 : memref<?x?xf32, offset: 0, strides: [?, ?]>
scf.for %arg5 = %c0 to %1 step %c2 {		scf.for %arg5 = %c0 to %1 step %c2 {
scf.for %arg6 = %c0 to %2 step %c3 {		scf.for %arg6 = %c0 to %2 step %c3 {
scf.for %arg7 = %c0 to %0 step %c4 {		scf.for %arg7 = %c0 to %0 step %c4 {
%3 = affine.apply #map0(%arg5)		%3 = affine.apply #map0(%arg5)
%4 = affine.apply #map1(%arg7)		%4 = affine.apply #map1(%arg7)
%5 = std.subview %C[%arg5, %arg7][%c2, %c4][%c1, %c1] :		%5 = std.subview %C[%arg5, %arg7][%c2, %c4][%c1, %c1] :
memref<?x?xf32, offset: 0, strides: [?, ?]> to		memref<?x?xf32, offset: 0, strides: [?, ?]> to
memref<?x?xf32, offset: ?, strides: [?, ?]>		memref<?x?xf32, offset: ?, strides: [?, ?]>
%6 = affine.apply #map2(%arg6)		%6 = affine.apply #map2(%arg6)
%7 = std.subview %D[%arg7, %arg6][%c4, %c3][%c1, %c1] :		%7 = std.subview %D[%arg7, %arg6][%c4, %c3][%c1, %c1] :
memref<?x?xf32, offset: 0, strides: [?, ?]> to		memref<?x?xf32, offset: 0, strides: [?, ?]> to
memref<?x?xf32, offset: ?, strides: [?, ?]>		memref<?x?xf32, offset: ?, strides: [?, ?]>
%8 = std.subview %E[%arg5, %arg6][%c2, %c3][%c1, %c1] :		%8 = std.subview %E[%arg5, %arg6][%c2, %c3][%c1, %c1] :
memref<?x?xf32, offset: 0, strides: [?, ?]> to		memref<?x?xf32, offset: 0, strides: [?, ?]> to
memref<?x?xf32, offset: ?, strides: [?, ?]>		memref<?x?xf32, offset: ?, strides: [?, ?]>
linalg.matmul %5, %7, %8 :		linalg.matmul ins(%5, %7 : memref<?x?xf32, offset: ?, strides: [?, ?]>,
(memref<?x?xf32, offset: ?, strides: [?, ?]>,
memref<?x?xf32, offset: ?, strides: [?, ?]>,
memref<?x?xf32, offset: ?, strides: [?, ?]>)		memref<?x?xf32, offset: ?, strides: [?, ?]>)
		outs(%8 : memref<?x?xf32, offset: ?, strides: [?, ?]>)
}		}
}		}
}		}
return %E : memref<?x?xf32, offset: 0, strides: [?, ?]>		return %E : memref<?x?xf32, offset: 0, strides: [?, ?]>
}		}
// CHECK-LABEL: func @f6		// CHECK-LABEL: func @f6
// CHECK: (%[[A:.]]:{{.}}, %[[B:.]]:{{.}}, %[[C:.]]:{{.}}, %[[D:.]]:{{.}}, %[[E:.]]:{{.}})		// CHECK: (%[[A:.]]:{{.}}, %[[B:.]]:{{.}}, %[[C:.]]:{{.}}, %[[D:.]]:{{.}}, %[[E:.]]:{{.}})
// Fuse the producer of E (WAW) then the producer of C (WAR).		// Fuse the producer of E (WAW) then the producer of C (WAR).
Show All 17 Lines	func @f7(%A: memref<?x?xf32, offset: 0, strides: [?, ?]>,
%c4 = constant 4 : index		%c4 = constant 4 : index
%c3 = constant 3 : index		%c3 = constant 3 : index
%c2 = constant 2 : index		%c2 = constant 2 : index
%0 = dim %A, %c0 : memref<?x?xf32, offset: 0, strides: [?, ?]>		%0 = dim %A, %c0 : memref<?x?xf32, offset: 0, strides: [?, ?]>
%1 = dim %A, %c1 : memref<?x?xf32, offset: 0, strides: [?, ?]>		%1 = dim %A, %c1 : memref<?x?xf32, offset: 0, strides: [?, ?]>
%2 = dim %C, %c1 : memref<?x?xf32, offset: 0, strides: [?, ?]>		%2 = dim %C, %c1 : memref<?x?xf32, offset: 0, strides: [?, ?]>
%3 = dim %C, %c0 : memref<?x?xf32, offset: 0, strides: [?, ?]>		%3 = dim %C, %c0 : memref<?x?xf32, offset: 0, strides: [?, ?]>
%4 = dim %D, %c1 : memref<?x?xf32, offset: 0, strides: [?, ?]>		%4 = dim %D, %c1 : memref<?x?xf32, offset: 0, strides: [?, ?]>
linalg.matmul %A, %C, %E :		linalg.matmul ins(%A, %C : memref<?x?xf32, offset: 0, strides: [?, ?]>,
(memref<?x?xf32, offset: 0, strides: [?, ?]>,
memref<?x?xf32, offset: 0, strides: [?, ?]>,
memref<?x?xf32, offset: 0, strides: [?, ?]>)		memref<?x?xf32, offset: 0, strides: [?, ?]>)
linalg.matmul %A, %B, %C :		outs(%E : memref<?x?xf32, offset: 0, strides: [?, ?]>)
(memref<?x?xf32, offset: 0, strides: [?, ?]>,		linalg.matmul ins(%A, %B : memref<?x?xf32, offset: 0, strides: [?, ?]>,
memref<?x?xf32, offset: 0, strides: [?, ?]>,
memref<?x?xf32, offset: 0, strides: [?, ?]>)		memref<?x?xf32, offset: 0, strides: [?, ?]>)
		outs(%C : memref<?x?xf32, offset: 0, strides: [?, ?]>)
scf.for %arg5 = %c0 to %0 step %c2 {		scf.for %arg5 = %c0 to %0 step %c2 {
scf.for %arg6 = %c0 to %2 step %c3 {		scf.for %arg6 = %c0 to %2 step %c3 {
scf.for %arg7 = %c0 to %1 step %c4 {		scf.for %arg7 = %c0 to %1 step %c4 {
%7 = std.subview %A[%arg5, %arg7][%c2, %c4][%c1, %c1] :		%7 = std.subview %A[%arg5, %arg7][%c2, %c4][%c1, %c1] :
memref<?x?xf32, offset: 0, strides: [?, ?]> to		memref<?x?xf32, offset: 0, strides: [?, ?]> to
memref<?x?xf32, offset: ?, strides: [?, ?]>		memref<?x?xf32, offset: ?, strides: [?, ?]>
%9 = std.subview %C[%arg7, %arg6][%c4, %c3][%c1, %c1] :		%9 = std.subview %C[%arg7, %arg6][%c4, %c3][%c1, %c1] :
memref<?x?xf32, offset: 0, strides: [?, ?]> to		memref<?x?xf32, offset: 0, strides: [?, ?]> to
memref<?x?xf32, offset: ?, strides: [?, ?]>		memref<?x?xf32, offset: ?, strides: [?, ?]>
%10 = std.subview %E[%arg5, %arg6][%c2, %c3][%c1, %c1] :		%10 = std.subview %E[%arg5, %arg6][%c2, %c3][%c1, %c1] :
memref<?x?xf32, offset: 0, strides: [?, ?]> to		memref<?x?xf32, offset: 0, strides: [?, ?]> to
memref<?x?xf32, offset: ?, strides: [?, ?]>		memref<?x?xf32, offset: ?, strides: [?, ?]>
linalg.matmul %7, %9, %10 :		linalg.matmul ins(%7, %9 : memref<?x?xf32, offset: ?, strides: [?, ?]>,
(memref<?x?xf32, offset: ?, strides: [?, ?]>,
memref<?x?xf32, offset: ?, strides: [?, ?]>,
memref<?x?xf32, offset: ?, strides: [?, ?]>)		memref<?x?xf32, offset: ?, strides: [?, ?]>)
		outs(%10 : memref<?x?xf32, offset: ?, strides: [?, ?]>)
}		}
}		}
}		}
scf.for %arg5 = %c0 to %3 step %c2 {		scf.for %arg5 = %c0 to %3 step %c2 {
scf.for %arg6 = %c0 to %4 step %c3 {		scf.for %arg6 = %c0 to %4 step %c3 {
scf.for %arg7 = %c0 to %2 step %c4 {		scf.for %arg7 = %c0 to %2 step %c4 {
%7 = std.subview %C[%arg5, %arg7][%c2, %c4][%c1, %c1] :		%7 = std.subview %C[%arg5, %arg7][%c2, %c4][%c1, %c1] :
memref<?x?xf32, offset: 0, strides: [?, ?]> to		memref<?x?xf32, offset: 0, strides: [?, ?]> to
memref<?x?xf32, offset: ?, strides: [?, ?]>		memref<?x?xf32, offset: ?, strides: [?, ?]>
%9 = std.subview %D[%arg7, %arg6][%c4, %c3][%c1, %c1] :		%9 = std.subview %D[%arg7, %arg6][%c4, %c3][%c1, %c1] :
memref<?x?xf32, offset: 0, strides: [?, ?]> to		memref<?x?xf32, offset: 0, strides: [?, ?]> to
memref<?x?xf32, offset: ?, strides: [?, ?]>		memref<?x?xf32, offset: ?, strides: [?, ?]>
%10 = std.subview %E[%arg5, %arg6][%c2, %c3][%c1, %c1] :		%10 = std.subview %E[%arg5, %arg6][%c2, %c3][%c1, %c1] :
memref<?x?xf32, offset: 0, strides: [?, ?]> to		memref<?x?xf32, offset: 0, strides: [?, ?]> to
memref<?x?xf32, offset: ?, strides: [?, ?]>		memref<?x?xf32, offset: ?, strides: [?, ?]>
linalg.matmul %7, %9, %10 :		linalg.matmul ins(%7, %9 : memref<?x?xf32, offset: ?, strides: [?, ?]>,
(memref<?x?xf32, offset: ?, strides: [?, ?]>,
memref<?x?xf32, offset: ?, strides: [?, ?]>,
memref<?x?xf32, offset: ?, strides: [?, ?]>)		memref<?x?xf32, offset: ?, strides: [?, ?]>)
		outs(%10 : memref<?x?xf32, offset: ?, strides: [?, ?]>)
}		}
}		}
}		}
return %E : memref<?x?xf32, offset: 0, strides: [?, ?]>		return %E : memref<?x?xf32, offset: 0, strides: [?, ?]>
}		}
// CHECK-LABEL: func @f7		// CHECK-LABEL: func @f7
// CHECK: (%[[A:.]]:{{.}}, %[[B:.]]:{{.}}, %[[C:.]]:{{.}}, %[[D:.]]:{{.}}, %[[E:.]]:{{.}})		// CHECK: (%[[A:.]]:{{.}}, %[[B:.]]:{{.}}, %[[C:.]]:{{.}}, %[[D:.]]:{{.}}, %[[E:.]]:{{.}})
// CHECK: %[[A_0:.]] = dim %[[A]], %c0{{_[0-9]}} : memref<?x?xf32, #[[$strided2D]]>		// CHECK: %[[A_0:.]] = dim %[[A]], %c0{{_[0-9]}} : memref<?x?xf32, #[[$strided2D]]>
// CHECK: %[[A_1:.]] = dim %[[A]], %c1{{_[0-9]}} : memref<?x?xf32, #[[$strided2D]]>		// CHECK: %[[A_1:.]] = dim %[[A]], %c1{{_[0-9]}} : memref<?x?xf32, #[[$strided2D]]>
// CHECK: %[[C_1:.]] = dim %[[C]], %c1{{_[0-9]}} : memref<?x?xf32, #[[$strided2D]]>		// CHECK: %[[C_1:.]] = dim %[[C]], %c1{{_[0-9]}} : memref<?x?xf32, #[[$strided2D]]>
// CHECK: %[[C_0:.]] = dim %[[C]], %c0{{_[0-9]}} : memref<?x?xf32, #[[$strided2D]]>		// CHECK: %[[C_0:.]] = dim %[[C]], %c0{{_[0-9]}} : memref<?x?xf32, #[[$strided2D]]>
// CHECK: %[[D_1:.]] = dim %[[D]], %c1{{_[0-9]}} : memref<?x?xf32, #[[$strided2D]]>		// CHECK: %[[D_1:.]] = dim %[[D]], %c1{{_[0-9]}} : memref<?x?xf32, #[[$strided2D]]>
// CHECK: linalg.matmul %[[A]], %[[C]], %[[E]]		// CHECK: linalg.matmul ins(%[[A]], %[[C]]{{.*}} outs(%[[E]]
// CHECK: scf.for %{{.}} = %{{.}} to %[[A_0]] step %{{.*}} {		// CHECK: scf.for %{{.}} = %{{.}} to %[[A_0]] step %{{.*}} {
// CHECK: scf.for %{{.}} = %{{.}} to %[[C_1]] step %{{.*}} {		// CHECK: scf.for %{{.}} = %{{.}} to %[[C_1]] step %{{.*}} {
// CHECK: scf.for %{{.}} = %{{.}} to %[[A_1]] step %{{.*}} {		// CHECK: scf.for %{{.}} = %{{.}} to %[[A_1]] step %{{.*}} {
// CHECK: linalg.matmul		// CHECK: linalg.matmul
// CHECK: linalg.matmul		// CHECK: linalg.matmul
// CHECK: scf.for %{{.}} = %{{.}} to %[[C_0]] step %{{.*}} {		// CHECK: scf.for %{{.}} = %{{.}} to %[[C_0]] step %{{.*}} {
// CHECK: scf.for %{{.}} = %{{.}} to %[[D_1]] step %{{.*}} {		// CHECK: scf.for %{{.}} = %{{.}} to %[[D_1]] step %{{.*}} {
// CHECK: scf.for %{{.}} = %{{.}} to %[[C_1]] step %{{.*}} {		// CHECK: scf.for %{{.}} = %{{.}} to %[[C_1]] step %{{.*}} {
Show All 14 Lines	func @f8(%A: memref<?x?xf32, offset: 0, strides: [?, ?]>,
) -> memref<?x?xf32, offset: 0, strides: [?, ?]> {		) -> memref<?x?xf32, offset: 0, strides: [?, ?]> {
%c1 = constant 1 : index		%c1 = constant 1 : index
%c0 = constant 0 : index		%c0 = constant 0 : index
%c4 = constant 4 : index		%c4 = constant 4 : index
%c3 = constant 3 : index		%c3 = constant 3 : index
%c2 = constant 2 : index		%c2 = constant 2 : index
%0 = dim %A, %c0 : memref<?x?xf32, offset: 0, strides: [?, ?]>		%0 = dim %A, %c0 : memref<?x?xf32, offset: 0, strides: [?, ?]>
%1 = dim %A, %c1 : memref<?x?xf32, offset: 0, strides: [?, ?]>		%1 = dim %A, %c1 : memref<?x?xf32, offset: 0, strides: [?, ?]>
linalg.matmul %A, %C, %D :		linalg.matmul ins(%A, %C : memref<?x?xf32, offset: 0, strides: [?, ?]>,
(memref<?x?xf32, offset: 0, strides: [?, ?]>,
memref<?x?xf32, offset: 0, strides: [?, ?]>,
memref<?x?xf32, offset: 0, strides: [?, ?]>)		memref<?x?xf32, offset: 0, strides: [?, ?]>)
linalg.matmul %A, %B, %C :		outs(%D : memref<?x?xf32, offset: 0, strides: [?, ?]>)
(memref<?x?xf32, offset: 0, strides: [?, ?]>,		linalg.matmul ins(%A, %B : memref<?x?xf32, offset: 0, strides: [?, ?]>,
memref<?x?xf32, offset: 0, strides: [?, ?]>,
memref<?x?xf32, offset: 0, strides: [?, ?]>)		memref<?x?xf32, offset: 0, strides: [?, ?]>)
		outs(%C : memref<?x?xf32, offset: 0, strides: [?, ?]>)
%2 = dim %D, %c1 : memref<?x?xf32, offset: 0, strides: [?, ?]>		%2 = dim %D, %c1 : memref<?x?xf32, offset: 0, strides: [?, ?]>
scf.for %arg5 = %c0 to %0 step %c2 {		scf.for %arg5 = %c0 to %0 step %c2 {
scf.for %arg6 = %c0 to %2 step %c3 {		scf.for %arg6 = %c0 to %2 step %c3 {
scf.for %arg7 = %c0 to %1 step %c4 {		scf.for %arg7 = %c0 to %1 step %c4 {
%3 = affine.apply #map0(%arg5)		%3 = affine.apply #map0(%arg5)
%4 = affine.apply #map1(%arg7)		%4 = affine.apply #map1(%arg7)
%5 = std.subview %A[%arg5, %arg7][%c2, %c4][%c1, %c1] :		%5 = std.subview %A[%arg5, %arg7][%c2, %c4][%c1, %c1] :
memref<?x?xf32, offset: 0, strides: [?, ?]> to		memref<?x?xf32, offset: 0, strides: [?, ?]> to
memref<?x?xf32, offset: ?, strides: [?, ?]>		memref<?x?xf32, offset: ?, strides: [?, ?]>
%6 = affine.apply #map2(%arg6)		%6 = affine.apply #map2(%arg6)
%7 = std.subview %D[%arg7, %arg6][%c4, %c3][%c1, %c1] :		%7 = std.subview %D[%arg7, %arg6][%c4, %c3][%c1, %c1] :
memref<?x?xf32, offset: 0, strides: [?, ?]> to		memref<?x?xf32, offset: 0, strides: [?, ?]> to
memref<?x?xf32, offset: ?, strides: [?, ?]>		memref<?x?xf32, offset: ?, strides: [?, ?]>
%8 = std.subview %E[%arg5, %arg6][%c2, %c3][%c1, %c1] :		%8 = std.subview %E[%arg5, %arg6][%c2, %c3][%c1, %c1] :
memref<?x?xf32, offset: 0, strides: [?, ?]> to		memref<?x?xf32, offset: 0, strides: [?, ?]> to
memref<?x?xf32, offset: ?, strides: [?, ?]>		memref<?x?xf32, offset: ?, strides: [?, ?]>
linalg.matmul %5, %7, %8 :		linalg.matmul ins(%5, %7 : memref<?x?xf32, offset: ?, strides: [?, ?]>,
(memref<?x?xf32, offset: ?, strides: [?, ?]>,
memref<?x?xf32, offset: ?, strides: [?, ?]>,
memref<?x?xf32, offset: ?, strides: [?, ?]>)		memref<?x?xf32, offset: ?, strides: [?, ?]>)
		outs(%8 : memref<?x?xf32, offset: ?, strides: [?, ?]>)
}		}
}		}
}		}
return %E : memref<?x?xf32, offset: 0, strides: [?, ?]>		return %E : memref<?x?xf32, offset: 0, strides: [?, ?]>
}		}
// CHECK-LABEL: func @f8		// CHECK-LABEL: func @f8
// CHECK: (%[[A:.]]: memref{{.}}, %[[B:.]]: memref{{.}}, %[[C:.]]: memref{{.}}, %[[D:.]]: memref{{.}}, %[[E:.]]: memref{{.}})		// CHECK: (%[[A:.]]: memref{{.}}, %[[B:.]]: memref{{.}}, %[[C:.]]: memref{{.}}, %[[D:.]]: memref{{.}}, %[[E:.]]: memref{{.}})
// CHECK: linalg.matmul		// CHECK: linalg.matmul
▲ Show 20 Lines • Show All 253 Lines • ▼ Show 20 Lines	func @accept_different_alloc_ops(%dim: index, %s0 : index, %s1: index) {
%c2 = constant 2 : index		%c2 = constant 2 : index
%c3 = constant 3 : index		%c3 = constant 3 : index
%c4 = constant 4 : index		%c4 = constant 4 : index

%A = alloca(%dim, %dim)[%s0, %s1] : memref<?x?xf32, offset: 0, strides: [?, ?]>		%A = alloca(%dim, %dim)[%s0, %s1] : memref<?x?xf32, offset: 0, strides: [?, ?]>
%B = alloca(%dim, %dim)[%s0, %s1] : memref<?x?xf32, offset: 0, strides: [?, ?]>		%B = alloca(%dim, %dim)[%s0, %s1] : memref<?x?xf32, offset: 0, strides: [?, ?]>
%C = alloc(%dim, %dim)[%s0, %s1] : memref<?x?xf32, offset: 0, strides: [?, ?]>		%C = alloc(%dim, %dim)[%s0, %s1] : memref<?x?xf32, offset: 0, strides: [?, ?]>

linalg.matmul %A, %B, %C :		linalg.matmul ins(%A, %B : memref<?x?xf32, offset: 0, strides: [?, ?]>,
(memref<?x?xf32, offset: 0, strides: [?, ?]>,
memref<?x?xf32, offset: 0, strides: [?, ?]>,
memref<?x?xf32, offset: 0, strides: [?, ?]>)		memref<?x?xf32, offset: 0, strides: [?, ?]>)
		outs(%C : memref<?x?xf32, offset: 0, strides: [?, ?]>)

scf.for %i = %c0 to %dim step %c2 {		scf.for %i = %c0 to %dim step %c2 {
scf.for %j = %c0 to %dim step %c3 {		scf.for %j = %c0 to %dim step %c3 {
scf.for %k = %c0 to %dim step %c4 {		scf.for %k = %c0 to %dim step %c4 {
%0 = std.subview %A[%i, %k][%c2, %c4][%c1, %c1] :		%0 = std.subview %A[%i, %k][%c2, %c4][%c1, %c1] :
memref<?x?xf32, offset: 0, strides: [?, ?]> to		memref<?x?xf32, offset: 0, strides: [?, ?]> to
memref<?x?xf32, offset: ?, strides: [?, ?]>		memref<?x?xf32, offset: ?, strides: [?, ?]>
%1 = std.subview %B[%k, %j][%c4, %c3][%c1, %c1] :		%1 = std.subview %B[%k, %j][%c4, %c3][%c1, %c1] :
memref<?x?xf32, offset: 0, strides: [?, ?]> to		memref<?x?xf32, offset: 0, strides: [?, ?]> to
memref<?x?xf32, offset: ?, strides: [?, ?]>		memref<?x?xf32, offset: ?, strides: [?, ?]>
%2 = std.subview %C[%i, %j][%c2, %c3][%c1, %c1] :		%2 = std.subview %C[%i, %j][%c2, %c3][%c1, %c1] :
memref<?x?xf32, offset: 0, strides: [?, ?]> to		memref<?x?xf32, offset: 0, strides: [?, ?]> to
memref<?x?xf32, offset: ?, strides: [?, ?]>		memref<?x?xf32, offset: ?, strides: [?, ?]>
linalg.matmul %0, %1, %2 :		linalg.matmul ins(%0, %1 : memref<?x?xf32, offset: ?, strides: [?, ?]>,
(memref<?x?xf32, offset: ?, strides: [?, ?]>,
memref<?x?xf32, offset: ?, strides: [?, ?]>,
memref<?x?xf32, offset: ?, strides: [?, ?]>)		memref<?x?xf32, offset: ?, strides: [?, ?]>)
		outs(%2 : memref<?x?xf32, offset: ?, strides: [?, ?]>)
}		}
}		}
}		}
return		return
}		}

// CHECK-LABEL: func @accept_different_alloc_ops		// CHECK-LABEL: func @accept_different_alloc_ops
// CHECK-COUNT-3: scf.for		// CHECK-COUNT-3: scf.for
// CHECK-COUNT-2: linalg.matmul		// CHECK-COUNT-2: linalg.matmul

mlir/test/Dialect/Linalg/invalid.mlir

Show First 20 Lines • Show All 422 Lines • ▼ Show 20 Lines	linalg.generic {
^bb(%0: i4) :		^bb(%0: i4) :
%1 = std.addf %0, %0: i4		%1 = std.addf %0, %0: i4
} : memref<?x?xi4>		} : memref<?x?xi4>
return		return
}		}

// -----		// -----

func @generic_result_0_element_type(%arg0: memref<?xf32>) {
// expected-error @+1 {{'linalg.dot' expects 3 operands, but found 2}}
linalg.dot %arg0, %arg0 : (memref<?xf32>, memref<?xf32>)
}

// -----

func @conv_rank_limit(%arg0: memref<?xf32>, %arg1: memref<?xf32>, %arg2: memref<?xf32>) {		func @conv_rank_limit(%arg0: memref<?xf32>, %arg1: memref<?xf32>, %arg2: memref<?xf32>) {
// expected-error @+1 {{expects memref ranks to be greater than 2}}		// expected-error @+1 {{expects memref ranks to be greater than 2}}
linalg.conv(%arg0, %arg1, %arg2) : memref<?xf32>, memref<?xf32>, memref<?xf32>		linalg.conv(%arg0, %arg1, %arg2) : memref<?xf32>, memref<?xf32>, memref<?xf32>
}		}

// -----		// -----

// expected-error @+1 {{unknown Linalg type}}		// expected-error @+1 {{unknown Linalg type}}
▲ Show 20 Lines • Show All 60 Lines • ▼ Show 20 Lines	linalg.pooling_max(%arg0, %arg1, %arg2) {strides = [2, 1, 2]}:
memref<?x?x?xf32>, memref<2x3xf32>, memref<?x?x?xf32>		memref<?x?x?xf32>, memref<2x3xf32>, memref<?x?x?xf32>
return		return
}		}

// -----		// -----

func @named_ops(%a3: memref<?x?x?xf32>, %b3: memref<?x?xf32>, %c3: memref<?x?x?xf32>) {		func @named_ops(%a3: memref<?x?x?xf32>, %b3: memref<?x?xf32>, %c3: memref<?x?x?xf32>) {
// expected-error @+1 {{op expected indexing_map #1 results to match view rank: 'memref<?x?xf32>'}}		// expected-error @+1 {{op expected indexing_map #1 results to match view rank: 'memref<?x?xf32>'}}
linalg.batch_matmul %a3, %b3, %c3 : (memref<?x?x?xf32>, memref<?x?xf32>, memref<?x?x?xf32>) -> ()		linalg.batch_matmul ins(%a3, %b3: memref<?x?x?xf32>, memref<?x?xf32>)
		outs(%c3 : memref<?x?x?xf32>)
return		return
}		}

// -----		// -----

func @generic(%arg0: tensor<?x?xi4>) {		func @generic(%arg0: tensor<?x?xi4>) {
// expected-error @+1 {{unexpected #results > #outputs}}		// expected-error @+1 {{unexpected #results > #outputs}}
linalg.generic {		linalg.generic {
args_in = 1,		args_in = 1,
args_out = 1,		args_out = 1,
indexing_maps = [ affine_map<(i) -> (i)> ],		indexing_maps = [ affine_map<(i) -> (i)> ],
iterator_types = ["parallel"]		iterator_types = ["parallel"]
} %arg0 {		} %arg0 {
^bb(%0: i4) :		^bb(%0: i4) :
%1 = std.addi %0, %0: i4		%1 = std.addi %0, %0: i4
linalg.yield %1, %1: i4, i4		linalg.yield %1, %1: i4, i4
} : tensor<?x?xi4> -> (tensor<?x?xi4>, tensor<?x?xi4>)		} : tensor<?x?xi4> -> (tensor<?x?xi4>, tensor<?x?xi4>)
return		return
}		}

		// -----

		func @empty_init_expected(%m: memref<?x?xf32>, %t: tensor<?x?xf32>) {
		// expected-error @+1 {{expected empty `init` when op has no results or no reduction dims}}
		linalg.matmul ins(%m, %m: memref<?x?xf32>, memref<?x?xf32>)
		outs(%m : memref<?x?xf32>)
		init(%t : tensor<?x?xf32>)
		return
		}

		// -----

		func @incorrect_region_arg_count(%m: memref<?x?xf32>) {
		// expected-error @+3 {{region expects 3 args, got 4}}
		%res = linalg.matmul ins(%m, %m : memref<?x?xf32>, memref<?x?xf32>)
		-> tensor<?x?xf32>, tensor<?x?xf32>
		return
		}

		// -----

		func @single_tensor_result(%m: memref<?x?xf32>, %t: tensor<?x?xf32>) {
		// expected-error @+1 {{expected single tensor result when reduction present}}
		%res:2 = linalg.matmul ins(%m : memref<?x?xf32>)
		init(%t, %t : tensor<?x?xf32>, tensor<?x?xf32>)
		-> tensor<?x?xf32>, tensor<?x?xf32>
		return
		}

		// -----

		func @matching_inits(%m: memref<?x?xf32>, %t: tensor<?x?xf32>) {
		// expected-error @+1 {{expected #init tensors to match #results when reduction present}}
		%res = linalg.matmul ins(%m, %m : memref<?x?xf32>, memref<?x?xf32>)
		init(%t, %t : tensor<?x?xf32>, tensor<?x?xf32>)
		-> tensor<?x?xf32>
		return
		}

		// -----

		func @matching_inits(%m: memref<?x?xf32>, %t: tensor<?x?xf32>) {
		// expected-error @+1 {{expected init tensor #0 of the same type as result #0}}
		%res = linalg.matmul ins(%m, %m : memref<?x?xf32>, memref<?x?xf32>)
		init(%t : tensor<?x?xf32>)
		-> tensor<?xf32>
		return
		}

mlir/test/Dialect/Linalg/loops.mlir

	Show All 33 Lines


	func @matmul(%arg0: memref<?xi8>, %M: index, %N: index, %K: index) {			func @matmul(%arg0: memref<?xi8>, %M: index, %N: index, %K: index) {
	%c0 = constant 0 : index			%c0 = constant 0 : index
	%c1 = constant 1 : index			%c1 = constant 1 : index
	%A = view %arg0[%c0][%M, %K] : memref<?xi8> to memref<?x?xf32>			%A = view %arg0[%c0][%M, %K] : memref<?xi8> to memref<?x?xf32>
	%B = view %arg0[%c0][%K, %N] : memref<?xi8> to memref<?x?xf32>			%B = view %arg0[%c0][%K, %N] : memref<?xi8> to memref<?x?xf32>
	%C = view %arg0[%c0][%M, %N] : memref<?xi8> to memref<?x?xf32>			%C = view %arg0[%c0][%M, %N] : memref<?xi8> to memref<?x?xf32>
	linalg.matmul %A, %B, %C : (memref<?x?xf32>, memref<?x?xf32>, memref<?x?xf32>)			linalg.matmul ins(%A, %B: memref<?x?xf32>, memref<?x?xf32>)
				outs(%C: memref<?x?xf32>)
	return			return
	}			}
	// CHECKLOOP-LABEL: func @matmul(%{{.*}}: memref<?xi8>,			// CHECKLOOP-LABEL: func @matmul(%{{.*}}: memref<?xi8>,
	// CHECKLOOP-SAME: [[M:arg[0-9]+]]: index			// CHECKLOOP-SAME: [[M:arg[0-9]+]]: index
	// CHECKLOOP-SAME: [[N:arg[0-9]+]]: index			// CHECKLOOP-SAME: [[N:arg[0-9]+]]: index
	// CHECKLOOP-SAME: [[K:arg[0-9]+]]: index			// CHECKLOOP-SAME: [[K:arg[0-9]+]]: index
	// CHECKLOOP: %[[A:.]] = std.view %{{.}}[{{.*}}] : memref<?xi8> to memref<?x?xf32>			// CHECKLOOP: %[[A:.]] = std.view %{{.}}[{{.*}}] : memref<?xi8> to memref<?x?xf32>
	// CHECKLOOP: %[[B:.]] = std.view %{{.}}[{{.*}}] : memref<?xi8> to memref<?x?xf32>			// CHECKLOOP: %[[B:.]] = std.view %{{.}}[{{.*}}] : memref<?xi8> to memref<?x?xf32>
	Show All 27 Lines


	func @matvec(%arg0: memref<?xi8>, %M: index, %N: index) {			func @matvec(%arg0: memref<?xi8>, %M: index, %N: index) {
	%c0 = constant 0 : index			%c0 = constant 0 : index
	%c1 = constant 1 : index			%c1 = constant 1 : index
	%2 = view %arg0[%c0][%M, %N] : memref<?xi8> to memref<?x?xf32>			%2 = view %arg0[%c0][%M, %N] : memref<?xi8> to memref<?x?xf32>
	%3 = view %arg0[%c0][%M] : memref<?xi8> to memref<?xf32>			%3 = view %arg0[%c0][%M] : memref<?xi8> to memref<?xf32>
	%4 = view %arg0[%c0][%N] : memref<?xi8> to memref<?xf32>			%4 = view %arg0[%c0][%N] : memref<?xi8> to memref<?xf32>
	linalg.matvec %2, %3, %4 : (memref<?x?xf32>, memref<?xf32>, memref<?xf32>)			linalg.matvec ins(%2, %3: memref<?x?xf32>, memref<?xf32>)
				outs(%4 : memref<?xf32>)
	return			return
	}			}
	// CHECKLOOP-LABEL: func @matvec(%{{.*}}: memref<?xi8>,			// CHECKLOOP-LABEL: func @matvec(%{{.*}}: memref<?xi8>,
	// CHECKLOOP-SAME: [[M:arg[0-9]+]]: index			// CHECKLOOP-SAME: [[M:arg[0-9]+]]: index
	// CHECKLOOP-SAME: [[K:arg[0-9]+]]: index			// CHECKLOOP-SAME: [[K:arg[0-9]+]]: index
	// CHECKLOOP: %[[A:.]] = std.view %{{.}}[{{.*}}] : memref<?xi8> to memref<?x?xf32>			// CHECKLOOP: %[[A:.]] = std.view %{{.}}[{{.*}}] : memref<?xi8> to memref<?x?xf32>
	// CHECKLOOP: %[[B:.]] = std.view %{{.}}[{{.*}}] : memref<?xi8> to memref<?xf32>			// CHECKLOOP: %[[B:.]] = std.view %{{.}}[{{.*}}] : memref<?xi8> to memref<?xf32>
	// CHECKLOOP: %[[C:.]] = std.view %{{.}}[{{.*}}] : memref<?xi8> to memref<?xf32>			// CHECKLOOP: %[[C:.]] = std.view %{{.}}[{{.*}}] : memref<?xi8> to memref<?xf32>
	Show All 23 Lines


	func @dot(%arg0: memref<?xi8>, %M: index) {			func @dot(%arg0: memref<?xi8>, %M: index) {
	%c0 = constant 0 : index			%c0 = constant 0 : index
	%c1 = constant 1 : index			%c1 = constant 1 : index
	%1 = view %arg0[%c0][%M] : memref<?xi8> to memref<?xf32>			%1 = view %arg0[%c0][%M] : memref<?xi8> to memref<?xf32>
	%2 = view %arg0[%c0][%M] : memref<?xi8> to memref<?xf32>			%2 = view %arg0[%c0][%M] : memref<?xi8> to memref<?xf32>
	%3 = view %arg0[%c0][] : memref<?xi8> to memref<f32>			%3 = view %arg0[%c0][] : memref<?xi8> to memref<f32>
	linalg.dot %1, %2, %3 : (memref<?xf32>, memref<?xf32>, memref<f32>)			linalg.dot ins(%1, %2 : memref<?xf32>, memref<?xf32>)
				outs(%3 : memref<f32>)
	return			return
	}			}
	// CHECKLOOP-LABEL: func @dot(%{{.*}}: memref<?xi8>,			// CHECKLOOP-LABEL: func @dot(%{{.*}}: memref<?xi8>,
	// CHECKLOOP-SAME: [[K:arg[0-9]+]]: index			// CHECKLOOP-SAME: [[K:arg[0-9]+]]: index
	// CHECKLOOP: %[[A:.]] = std.view %{{.}}[{{.}}][{{.}}] : memref<?xi8> to memref<?xf32>			// CHECKLOOP: %[[A:.]] = std.view %{{.}}[{{.}}][{{.}}] : memref<?xi8> to memref<?xf32>
	// CHECKLOOP: %[[B:.]] = std.view %{{.}}[{{.}}][{{.}}] : memref<?xi8> to memref<?xf32>			// CHECKLOOP: %[[B:.]] = std.view %{{.}}[{{.}}][{{.}}] : memref<?xi8> to memref<?xf32>
	// CHECKLOOP: %[[C:.]] = std.view %{{.}}[{{.*}}][] : memref<?xi8> to memref<f32>			// CHECKLOOP: %[[C:.]] = std.view %{{.}}[{{.*}}][] : memref<?xi8> to memref<f32>
	// CHECKLOOP: scf.for %{{.}} = %{{.}} to %[[K]] step %{{.*}} {			// CHECKLOOP: scf.for %{{.}} = %{{.}} to %[[K]] step %{{.*}} {
	Show All 14 Lines
	// CHECKPARALLEL-DAG: %[[b:.]] = load %[[B]][%{{.}}] : memref<?xf32>			// CHECKPARALLEL-DAG: %[[b:.]] = load %[[B]][%{{.}}] : memref<?xf32>
	// CHECKPARALLEL-DAG: %[[inc:.*]] = mulf %[[a]], %[[b]] : f32			// CHECKPARALLEL-DAG: %[[inc:.*]] = mulf %[[a]], %[[b]] : f32
	// CHECKPARALLEL-DAG: %[[c:.*]] = load %[[C]][] : memref<f32>			// CHECKPARALLEL-DAG: %[[c:.*]] = load %[[C]][] : memref<f32>
	// CHECKPARALLEL-DAG: %[[res:.*]] = addf %[[c]], %[[inc]] : f32			// CHECKPARALLEL-DAG: %[[res:.*]] = addf %[[c]], %[[inc]] : f32
	// CHECKPARALLEL: store %[[res]], %[[C]][] : memref<f32>			// CHECKPARALLEL: store %[[res]], %[[C]][] : memref<f32>


	func @dot_view(%arg0: memref<?xf32, offset: ?, strides: [1]>, %arg1: memref<?xf32, offset: ?, strides: [1]>, %arg2: memref<f32>) {			func @dot_view(%arg0: memref<?xf32, offset: ?, strides: [1]>, %arg1: memref<?xf32, offset: ?, strides: [1]>, %arg2: memref<f32>) {
	linalg.dot %arg0, %arg1, %arg2 : (memref<?xf32, offset: ?, strides: [1]>,			linalg.dot ins(%arg0, %arg1 : memref<?xf32, offset: ?, strides: [1]>,
	memref<?xf32, offset: ?, strides: [1]>,			memref<?xf32, offset: ?, strides: [1]>)
	memref<f32>)			outs(%arg2: memref<f32>)
	return			return
	}			}
	// CHECKLOOP-LABEL: func @dot_view(			// CHECKLOOP-LABEL: func @dot_view(
	// CHECKLOOP: %{{.}}: memref<?xf32, #[[$strided1D]]>, %{{.}}: memref<?xf32, #[[$strided1D]]>, %{{.*}}: memref<f32>) {			// CHECKLOOP: %{{.}}: memref<?xf32, #[[$strided1D]]>, %{{.}}: memref<?xf32, #[[$strided1D]]>, %{{.*}}: memref<f32>) {
	// CHECKLOOP: %[[K:.*]] = dim %arg0, %c0 : memref<?xf32, #[[$strided1D]]>			// CHECKLOOP: %[[K:.*]] = dim %arg0, %c0 : memref<?xf32, #[[$strided1D]]>
	// CHECKLOOP: scf.for %{{.}} = %{{.}} to %[[K]] step %{{.*}} {			// CHECKLOOP: scf.for %{{.}} = %{{.}} to %[[K]] step %{{.*}} {
	// CHECKLOOP-DAG: %[[a:.]] = load %arg0[%{{.}}] : memref<?xf32, #[[$strided1D]]>			// CHECKLOOP-DAG: %[[a:.]] = load %arg0[%{{.}}] : memref<?xf32, #[[$strided1D]]>
	// CHECKLOOP-DAG: %[[b:.]] = load %{{.}}[%{{.*}}] : memref<?xf32, #[[$strided1D]]>			// CHECKLOOP-DAG: %[[b:.]] = load %{{.}}[%{{.*}}] : memref<?xf32, #[[$strided1D]]>
	▲ Show 20 Lines • Show All 707 Lines • ▼ Show 20 Lines
	// CHECKPARALLEL: load %[[ARG1]][]			// CHECKPARALLEL: load %[[ARG1]][]
	// CHECKPARALLEL: addf			// CHECKPARALLEL: addf
	// CHECKPARALLEL: store %{{.*}}, %[[ARG2]][]			// CHECKPARALLEL: store %{{.*}}, %[[ARG2]][]

	//----------------------------------------------------------------------------//			//----------------------------------------------------------------------------//
	// Named ops to loops.			// Named ops to loops.
	//----------------------------------------------------------------------------//			//----------------------------------------------------------------------------//
	func @named_batch_matmul(%A: memref<?x?x?xf32>, %B: memref<?x?x?xf32>, %C: memref<?x?x?xf32>) {			func @named_batch_matmul(%A: memref<?x?x?xf32>, %B: memref<?x?x?xf32>, %C: memref<?x?x?xf32>) {
	linalg.batch_matmul %A, %B, %C : (memref<?x?x?xf32>, memref<?x?x?xf32>, memref<?x?x?xf32>) -> ()			linalg.batch_matmul ins(%A, %B : memref<?x?x?xf32>, memref<?x?x?xf32>)
				outs(%C : memref<?x?x?xf32>)
	return			return
	}			}
	// CHECKLOOP-LABEL: @named_batch_matmul			// CHECKLOOP-LABEL: @named_batch_matmul
	// CHECKLOOP-SAME: %[[mA:[a-zA-Z0-9]+]]: memref<?x?x?xf32>			// CHECKLOOP-SAME: %[[mA:[a-zA-Z0-9]+]]: memref<?x?x?xf32>
	// CHECKLOOP-SAME: %[[mB:[a-zA-Z0-9]+]]: memref<?x?x?xf32>			// CHECKLOOP-SAME: %[[mB:[a-zA-Z0-9]+]]: memref<?x?x?xf32>
	// CHECKLOOP-SAME: %[[mC:[a-zA-Z0-9]+]]: memref<?x?x?xf32>			// CHECKLOOP-SAME: %[[mC:[a-zA-Z0-9]+]]: memref<?x?x?xf32>
	// CHECKLOOP: %[[B:.*]] = dim %[[mA]], %c0 : memref<?x?x?xf32>			// CHECKLOOP: %[[B:.*]] = dim %[[mA]], %c0 : memref<?x?x?xf32>
	// CHECKLOOP: %[[M:.*]] = dim %[[mA]], %c1 : memref<?x?x?xf32>			// CHECKLOOP: %[[M:.*]] = dim %[[mA]], %c1 : memref<?x?x?xf32>
	▲ Show 20 Lines • Show All 391 Lines • ▼ Show 20 Lines
	// CHECKPARALLEL: %[[va:.*]] = load %[[arg0]][%[[aff1]], %[[aff2]], %[[aff3]], %[[aff4]]] : memref<?x?x?x?xf32>			// CHECKPARALLEL: %[[va:.*]] = load %[[arg0]][%[[aff1]], %[[aff2]], %[[aff3]], %[[aff4]]] : memref<?x?x?x?xf32>
	// CHECKPARALLEL: %[[vb:.*]] = load %[[arg1]][%[[i4]], %[[i5]], %[[i6]], %[[i7]]] : memref<?x?x?x?xf32>			// CHECKPARALLEL: %[[vb:.*]] = load %[[arg1]][%[[i4]], %[[i5]], %[[i6]], %[[i7]]] : memref<?x?x?x?xf32>
	// CHECKPARALLEL: %[[vc:.*]] = load %[[arg2]][%[[i0]], %[[i1]], %[[i2]], %[[i3]]] : memref<?x?x?x?xf32>			// CHECKPARALLEL: %[[vc:.*]] = load %[[arg2]][%[[i0]], %[[i1]], %[[i2]], %[[i3]]] : memref<?x?x?x?xf32>
	// CHECKPARALLEL: %[[inc:.*]] = mulf %[[va]], %[[vb]] : f32			// CHECKPARALLEL: %[[inc:.*]] = mulf %[[va]], %[[vb]] : f32
	// CHECKPARALLEL: %[[res:.*]] = addf %[[vc]], %[[inc]] : f32			// CHECKPARALLEL: %[[res:.*]] = addf %[[vc]], %[[inc]] : f32
	// CHECKPARALLEL: store %[[res]], %[[arg2]][%[[i0]], %[[i1]], %[[i2]], %[[i3]]] : memref<?x?x?x?xf32>			// CHECKPARALLEL: store %[[res]], %[[arg2]][%[[i0]], %[[i1]], %[[i2]], %[[i3]]] : memref<?x?x?x?xf32>

	func @conv1d_no_symbols(%in : memref<?xf32>, %filter : memref<?xf32>, %out : memref<?xf32>) -> () {			func @conv1d_no_symbols(%in : memref<?xf32>, %filter : memref<?xf32>, %out : memref<?xf32>) -> () {
	linalg.conv_1d %in, %filter, %out : (memref<?xf32>, memref<?xf32>, memref<?xf32>)			linalg.conv_1d ins(%in, %filter : memref<?xf32>, memref<?xf32>)
				outs(%out : memref<?xf32>)
	return			return
	}			}

	// CHECKLOOP-LABEL: @conv1d_no_symbols			// CHECKLOOP-LABEL: @conv1d_no_symbols
	// CHECKLOOP-SAME: %[[arg0:[a-zA-Z0-9]+]]: memref<?xf32>			// CHECKLOOP-SAME: %[[arg0:[a-zA-Z0-9]+]]: memref<?xf32>
	// CHECKLOOP-SAME: %[[arg1:[a-zA-Z0-9]+]]: memref<?xf32>			// CHECKLOOP-SAME: %[[arg1:[a-zA-Z0-9]+]]: memref<?xf32>
	// CHECKLOOP-SAME: %[[arg2:[a-zA-Z0-9]+]]: memref<?xf32>			// CHECKLOOP-SAME: %[[arg2:[a-zA-Z0-9]+]]: memref<?xf32>
	// CHECKLOOP: %[[c0:.*]] = constant 0 : index			// CHECKLOOP: %[[c0:.*]] = constant 0 : index
	Show All 25 Lines
	// CHECKPARALLEL: %[[va:.*]] = load %[[arg1]][%[[m]]] : memref<?xf32>			// CHECKPARALLEL: %[[va:.*]] = load %[[arg1]][%[[m]]] : memref<?xf32>
	// CHECKPARALLEL: %[[vc:.*]] = load %[[arg2]][%[[b]]] : memref<?xf32>			// CHECKPARALLEL: %[[vc:.*]] = load %[[arg2]][%[[b]]] : memref<?xf32>
	// CHECKPARALLEL: %[[inc:.*]] = mulf %[[vb]], %[[va]] : f32			// CHECKPARALLEL: %[[inc:.*]] = mulf %[[vb]], %[[va]] : f32
	// CHECKPARALLEL: %[[res:.*]] = addf %[[vc]], %[[inc]] : f32			// CHECKPARALLEL: %[[res:.*]] = addf %[[vc]], %[[inc]] : f32
	// CHECKPARALLEL: store %[[res]], %[[arg2]][%[[b]]] : memref<?xf32>			// CHECKPARALLEL: store %[[res]], %[[arg2]][%[[b]]] : memref<?xf32>


	func @conv2d_no_symbols(%in : memref<?x?xf32>, %filter : memref<?x?xf32>, %out : memref<?x?xf32>) -> () {			func @conv2d_no_symbols(%in : memref<?x?xf32>, %filter : memref<?x?xf32>, %out : memref<?x?xf32>) -> () {
	linalg.conv_2d %in, %filter, %out : (memref<?x?xf32>, memref<?x?xf32>, memref<?x?xf32>)			linalg.conv_2d ins(%in, %filter : memref<?x?xf32>, memref<?x?xf32>)
				outs(%out: memref<?x?xf32>)
	return			return
	}			}
	// CHECKLOOP-LABEL: @conv2d_no_symbols			// CHECKLOOP-LABEL: @conv2d_no_symbols
	// CHECKLOOP-SAME: %[[arg0:[a-zA-Z0-9]+]]: memref<?x?xf32>			// CHECKLOOP-SAME: %[[arg0:[a-zA-Z0-9]+]]: memref<?x?xf32>
	// CHECKLOOP-SAME: %[[arg1:[a-zA-Z0-9]+]]: memref<?x?xf32>			// CHECKLOOP-SAME: %[[arg1:[a-zA-Z0-9]+]]: memref<?x?xf32>
	// CHECKLOOP-SAME: %[[arg2:[a-zA-Z0-9]+]]: memref<?x?xf32>			// CHECKLOOP-SAME: %[[arg2:[a-zA-Z0-9]+]]: memref<?x?xf32>
	// CHECKLOOP: %[[c0:.*]] = constant 0 : index			// CHECKLOOP: %[[c0:.*]] = constant 0 : index
	// CHECKLOOP: %[[c1:.*]] = constant 1 : index			// CHECKLOOP: %[[c1:.*]] = constant 1 : index
	Show All 35 Lines
	// CHECKPARALLEL: %[[va:.*]] = load %[[arg1]][%[[arg5]], %[[arg6]]] : memref<?x?xf32>			// CHECKPARALLEL: %[[va:.*]] = load %[[arg1]][%[[arg5]], %[[arg6]]] : memref<?x?xf32>
	// CHECKPARALLEL: %[[vc:.*]] = load %[[arg2]][%[[arg3]], %[[arg4]]] : memref<?x?xf32>			// CHECKPARALLEL: %[[vc:.*]] = load %[[arg2]][%[[arg3]], %[[arg4]]] : memref<?x?xf32>
	// CHECKPARALLEL: %[[inc:.*]] = mulf %[[vb]], %[[va]] : f32			// CHECKPARALLEL: %[[inc:.*]] = mulf %[[vb]], %[[va]] : f32
	// CHECKPARALLEL: %[[res:.*]] = addf %[[vc]], %[[inc]] : f32			// CHECKPARALLEL: %[[res:.*]] = addf %[[vc]], %[[inc]] : f32
	// CHECKPARALLEL: store %[[res]], %[[arg2]][%[[arg3]], %[[arg4]]] : memref<?x?xf32>			// CHECKPARALLEL: store %[[res]], %[[arg2]][%[[arg3]], %[[arg4]]] : memref<?x?xf32>


	func @conv3d_no_symbols(%in : memref<?x?x?xf32>, %filter : memref<?x?x?xf32>, %out : memref<?x?x?xf32>) -> () {			func @conv3d_no_symbols(%in : memref<?x?x?xf32>, %filter : memref<?x?x?xf32>, %out : memref<?x?x?xf32>) -> () {
	linalg.conv_3d %in, %filter, %out : (memref<?x?x?xf32>, memref<?x?x?xf32>, memref<?x?x?xf32>)			linalg.conv_3d ins(%in, %filter : memref<?x?x?xf32>, memref<?x?x?xf32>)
				outs(%out : memref<?x?x?xf32>)
	return			return
	}			}

	// CHECKLOOP-LABEL: @conv3d_no_symbols			// CHECKLOOP-LABEL: @conv3d_no_symbols
	// CHECKLOOP-SAME: %[[arg0:[a-zA-Z0-9]+]]: memref<?x?x?xf32>			// CHECKLOOP-SAME: %[[arg0:[a-zA-Z0-9]+]]: memref<?x?x?xf32>
	// CHECKLOOP-SAME: %[[arg1:[a-zA-Z0-9]+]]: memref<?x?x?xf32>			// CHECKLOOP-SAME: %[[arg1:[a-zA-Z0-9]+]]: memref<?x?x?xf32>
	// CHECKLOOP-SAME: %[[arg2:[a-zA-Z0-9]+]]: memref<?x?x?xf32>			// CHECKLOOP-SAME: %[[arg2:[a-zA-Z0-9]+]]: memref<?x?x?xf32>
	// CHECKLOOP: %[[c2:.*]] = constant 2 : index			// CHECKLOOP: %[[c2:.*]] = constant 2 : index
	▲ Show 20 Lines • Show All 52 Lines • Show Last 20 Lines

mlir/test/Dialect/Linalg/promote.mlir

Show All 21 Lines	func @matmul_f32(%A: memref<?xi8>, %M: index, %N: index, %K: index) {
%7 = dim %3, %c1 : memref<?x?xf32>		%7 = dim %3, %c1 : memref<?x?xf32>
%8 = dim %4, %c1 : memref<?x?xf32>		%8 = dim %4, %c1 : memref<?x?xf32>
scf.for %arg4 = %c0 to %6 step %c2 {		scf.for %arg4 = %c0 to %6 step %c2 {
scf.for %arg5 = %c0 to %8 step %c3 {		scf.for %arg5 = %c0 to %8 step %c3 {
scf.for %arg6 = %c0 to %7 step %c4 {		scf.for %arg6 = %c0 to %7 step %c4 {
%11 = std.subview %3[%arg4, %arg6][%c2, %c4][1, 1] : memref<?x?xf32> to memref<?x?xf32, offset: ?, strides: [?, 1]>		%11 = std.subview %3[%arg4, %arg6][%c2, %c4][1, 1] : memref<?x?xf32> to memref<?x?xf32, offset: ?, strides: [?, 1]>
%14 = std.subview %4[%arg6, %arg5][%c4, %c3][1, 1] : memref<?x?xf32> to memref<?x?xf32, offset: ?, strides: [?, 1]>		%14 = std.subview %4[%arg6, %arg5][%c4, %c3][1, 1] : memref<?x?xf32> to memref<?x?xf32, offset: ?, strides: [?, 1]>
%17 = std.subview %5[%arg4, %arg5][%c2, %c3][1, 1] : memref<?x?xf32> to memref<?x?xf32, offset: ?, strides: [?, 1]>		%17 = std.subview %5[%arg4, %arg5][%c2, %c3][1, 1] : memref<?x?xf32> to memref<?x?xf32, offset: ?, strides: [?, 1]>
linalg.matmul %11, %14, %17 :		linalg.matmul
(memref<?x?xf32, offset: ?, strides: [?, 1]>,		ins(%11, %14: memref<?x?xf32, offset: ?, strides: [?, 1]>,
memref<?x?xf32, offset: ?, strides: [?, 1]>,
memref<?x?xf32, offset: ?, strides: [?, 1]>)		memref<?x?xf32, offset: ?, strides: [?, 1]>)
		outs(%17: memref<?x?xf32, offset: ?, strides: [?, 1]>)
}		}
}		}
}		}
return		return
}		}

// CHECK-LABEL: func @matmul_f32(%{{.}}: memref<?xi8>, %{{.}}: index, %{{.}}: index, %{{.}}: index) {		// CHECK-LABEL: func @matmul_f32(%{{.}}: memref<?xi8>, %{{.}}: index, %{{.}}: index, %{{.}}: index) {
// CHECK: scf.for %{{.}} = %{{.}} to %{{.}} step %{{.}} {		// CHECK: scf.for %{{.}} = %{{.}} to %{{.}} step %{{.}} {
Show All 20 Lines
// CHECK: %[[fullC:.]] = std.view %[[tmpC]][{{.}}][{{.*}}] : memref<24xi8> to memref<?x?xf32>		// CHECK: %[[fullC:.]] = std.view %[[tmpC]][{{.}}][{{.*}}] : memref<24xi8> to memref<?x?xf32>
// DYNAMIC: std.view %{{.}}[{{.}}][{{.*}}] : memref<?xi8> to memref<?x?xf32>		// DYNAMIC: std.view %{{.}}[{{.}}][{{.*}}] : memref<?xi8> to memref<?x?xf32>
// CHECK: %[[partialC:.]] = subview %[[fullC]]{{.}} : memref<?x?xf32> to memref<?x?xf32, #[[$strided2D_dynamic]]>		// CHECK: %[[partialC:.]] = subview %[[fullC]]{{.}} : memref<?x?xf32> to memref<?x?xf32, #[[$strided2D_dynamic]]>

// CHECK: linalg.copy(%[[vA]], %[[partialA]]) : memref<?x?xf32, #[[$strided2D]]>, memref<?x?xf32, #[[$strided2D_dynamic]]>		// CHECK: linalg.copy(%[[vA]], %[[partialA]]) : memref<?x?xf32, #[[$strided2D]]>, memref<?x?xf32, #[[$strided2D_dynamic]]>
// CHECK: linalg.copy(%[[vB]], %[[partialB]]) : memref<?x?xf32, #[[$strided2D]]>, memref<?x?xf32, #[[$strided2D_dynamic]]>		// CHECK: linalg.copy(%[[vB]], %[[partialB]]) : memref<?x?xf32, #[[$strided2D]]>, memref<?x?xf32, #[[$strided2D_dynamic]]>
// CHECK: linalg.copy(%[[vC]], %[[partialC]]) : memref<?x?xf32, #[[$strided2D]]>, memref<?x?xf32, #[[$strided2D_dynamic]]>		// CHECK: linalg.copy(%[[vC]], %[[partialC]]) : memref<?x?xf32, #[[$strided2D]]>, memref<?x?xf32, #[[$strided2D_dynamic]]>
//		//
// CHECK: linalg.matmul %[[partialA]], %[[partialB]], %[[partialC]] :		// CHECK: linalg.matmul ins(%[[partialA]], %[[partialB]]{{.*}} outs(%[[partialC]]
// CHECK: memref<?x?xf32, #[[$strided2D_dynamic]]>,
// CHECK: memref<?x?xf32, #[[$strided2D_dynamic]]>,
// CHECK: memref<?x?xf32, #[[$strided2D_dynamic]]>
//		//
// CHECK: linalg.copy(%[[partialC]], %[[vC]]) :		// CHECK: linalg.copy(%[[partialC]], %[[vC]]) :
// CHECK: memref<?x?xf32, #[[$strided2D_dynamic]]>,		// CHECK: memref<?x?xf32, #[[$strided2D_dynamic]]>,
// CHECK: memref<?x?xf32, #[[$strided2D]]>		// CHECK: memref<?x?xf32, #[[$strided2D]]>
//		//
// CHECK: dealloc %[[tmpA]] : memref<32xi8>		// CHECK: dealloc %[[tmpA]] : memref<32xi8>
// CHECK: dealloc %[[tmpB]] : memref<48xi8>		// CHECK: dealloc %[[tmpB]] : memref<48xi8>
// CHECK: dealloc %[[tmpC]] : memref<24xi8>		// CHECK: dealloc %[[tmpC]] : memref<24xi8>
Show All 16 Lines	func @matmul_f64(%A: memref<?xi8>, %M: index, %N: index, %K: index) {
%7 = dim %3, %c1 : memref<?x?xf64>		%7 = dim %3, %c1 : memref<?x?xf64>
%8 = dim %4, %c1 : memref<?x?xf64>		%8 = dim %4, %c1 : memref<?x?xf64>
scf.for %arg4 = %c0 to %6 step %c2 {		scf.for %arg4 = %c0 to %6 step %c2 {
scf.for %arg5 = %c0 to %8 step %c3 {		scf.for %arg5 = %c0 to %8 step %c3 {
scf.for %arg6 = %c0 to %7 step %c4 {		scf.for %arg6 = %c0 to %7 step %c4 {
%11 = std.subview %3[%arg4, %arg6][%c2, %c4][1, 1] : memref<?x?xf64> to memref<?x?xf64, offset: ?, strides: [?, 1]>		%11 = std.subview %3[%arg4, %arg6][%c2, %c4][1, 1] : memref<?x?xf64> to memref<?x?xf64, offset: ?, strides: [?, 1]>
%14 = std.subview %4[%arg6, %arg5][%c4, %c3][1, 1] : memref<?x?xf64> to memref<?x?xf64, offset: ?, strides: [?, 1]>		%14 = std.subview %4[%arg6, %arg5][%c4, %c3][1, 1] : memref<?x?xf64> to memref<?x?xf64, offset: ?, strides: [?, 1]>
%17 = std.subview %5[%arg4, %arg5][%c2, %c3][1, 1] : memref<?x?xf64> to memref<?x?xf64, offset: ?, strides: [?, 1]>		%17 = std.subview %5[%arg4, %arg5][%c2, %c3][1, 1] : memref<?x?xf64> to memref<?x?xf64, offset: ?, strides: [?, 1]>
linalg.matmul %11, %14, %17 :		linalg.matmul
(memref<?x?xf64, offset: ?, strides: [?, 1]>,		ins(%11, %14: memref<?x?xf64, offset: ?, strides: [?, 1]>,
memref<?x?xf64, offset: ?, strides: [?, 1]>,
memref<?x?xf64, offset: ?, strides: [?, 1]>)		memref<?x?xf64, offset: ?, strides: [?, 1]>)
		outs(%17: memref<?x?xf64, offset: ?, strides: [?, 1]>)
}		}
}		}
}		}
return		return
}		}

// CHECK-LABEL: func @matmul_f64(%{{.}}: memref<?xi8>, %{{.}}: index, %{{.}}: index, %{{.}}: index) {		// CHECK-LABEL: func @matmul_f64(%{{.}}: memref<?xi8>, %{{.}}: index, %{{.}}: index, %{{.}}: index) {
// CHECK: scf.for %{{.}} = %{{.}} to %{{.}} step %{{.}} {		// CHECK: scf.for %{{.}} = %{{.}} to %{{.}} step %{{.}} {
Show All 17 Lines
// CHECK: %[[fullC_f64:.]] = std.view %[[tmpC_f64]][{{.}}][{{.*}}] : memref<48xi8> to memref<?x?xf64>		// CHECK: %[[fullC_f64:.]] = std.view %[[tmpC_f64]][{{.}}][{{.*}}] : memref<48xi8> to memref<?x?xf64>
// DYNAMIC: std.view %{{.}}[{{.}}][{{.*}}] : memref<?xi8> to memref<?x?xf64>		// DYNAMIC: std.view %{{.}}[{{.}}][{{.*}}] : memref<?xi8> to memref<?x?xf64>
// CHECK: %[[partialC_f64:.]] = subview %[[fullC_f64]][%{{.}}, %{{.*}}] : memref<?x?xf64> to memref<?x?xf64, #[[$strided2D_dynamic]]>		// CHECK: %[[partialC_f64:.]] = subview %[[fullC_f64]][%{{.}}, %{{.*}}] : memref<?x?xf64> to memref<?x?xf64, #[[$strided2D_dynamic]]>

// CHECK: linalg.copy(%[[vA_f64]], %[[partialA_f64]]) : memref<?x?xf64, #[[$strided2D]]>, memref<?x?xf64, #[[$strided2D_dynamic]]>		// CHECK: linalg.copy(%[[vA_f64]], %[[partialA_f64]]) : memref<?x?xf64, #[[$strided2D]]>, memref<?x?xf64, #[[$strided2D_dynamic]]>
// CHECK: linalg.copy(%[[vB_f64]], %[[partialB_f64]]) : memref<?x?xf64, #[[$strided2D]]>, memref<?x?xf64, #[[$strided2D_dynamic]]>		// CHECK: linalg.copy(%[[vB_f64]], %[[partialB_f64]]) : memref<?x?xf64, #[[$strided2D]]>, memref<?x?xf64, #[[$strided2D_dynamic]]>
// CHECK: linalg.copy(%[[vC_f64]], %[[partialC_f64]]) : memref<?x?xf64, #[[$strided2D]]>, memref<?x?xf64, #[[$strided2D_dynamic]]>		// CHECK: linalg.copy(%[[vC_f64]], %[[partialC_f64]]) : memref<?x?xf64, #[[$strided2D]]>, memref<?x?xf64, #[[$strided2D_dynamic]]>
//		//
// CHECK: linalg.matmul %[[partialA_f64]], %[[partialB_f64]], %[[partialC_f64]] :		// CHECK: linalg.matmul ins(%[[partialA_f64]], %[[partialB_f64]]{{.*}} outs(%[[partialC_f64]]
// CHECK: memref<?x?xf64, #[[$strided2D_dynamic]]>,
// CHECK: memref<?x?xf64, #[[$strided2D_dynamic]]>,
// CHECK: memref<?x?xf64, #[[$strided2D_dynamic]]>
//		//
// CHECK: linalg.copy(%[[partialC_f64]], %[[vC_f64]]) :		// CHECK: linalg.copy(%[[partialC_f64]], %[[vC_f64]]) :
// CHECK: memref<?x?xf64, #[[$strided2D_dynamic]]>,		// CHECK: memref<?x?xf64, #[[$strided2D_dynamic]]>,
// CHECK: memref<?x?xf64, #[[$strided2D]]>		// CHECK: memref<?x?xf64, #[[$strided2D]]>
//		//
// CHECK: dealloc %[[tmpA_f64]] : memref<64xi8>		// CHECK: dealloc %[[tmpA_f64]] : memref<64xi8>
// CHECK: dealloc %[[tmpB_f64]] : memref<96xi8>		// CHECK: dealloc %[[tmpB_f64]] : memref<96xi8>
// CHECK: dealloc %[[tmpC_f64]] : memref<48xi8>		// CHECK: dealloc %[[tmpC_f64]] : memref<48xi8>

mlir/test/Dialect/Linalg/promotion_options.mlir

	// RUN: mlir-opt %s -test-linalg-transform-patterns=test-linalg-promotion-options -split-input-file \| FileCheck %s			// RUN: mlir-opt %s -test-linalg-transform-patterns=test-linalg-promotion-options -split-input-file \| FileCheck %s

	func @gemm(%a : memref<?x?xf32>, %b : memref<?x?xf32>, %c : memref<?x?xf32>)			func @gemm(%a : memref<?x?xf32>, %b : memref<?x?xf32>, %c : memref<?x?xf32>)
	{			{
	linalg.matmul %a, %b, %c {__internal_linalg_transform__ = "START"}			linalg.matmul {__internal_linalg_transform__ = "START"}
	: (memref<?x?xf32>, memref<?x?xf32>, memref<?x?xf32>)			ins(%a, %b: memref<?x?xf32>, memref<?x?xf32>)
				outs(%c: memref<?x?xf32>)
	return			return
	}			}

	// CHECK: func @gemm			// CHECK: func @gemm
	// CHECK-SAME: %[[ARG0:[a-zA-Z0-9_]+]]: memref<?x?xf32>			// CHECK-SAME: %[[ARG0:[a-zA-Z0-9_]+]]: memref<?x?xf32>
	// CHECK-SAME: %[[ARG1:[a-zA-Z0-9_]+]]: memref<?x?xf32>			// CHECK-SAME: %[[ARG1:[a-zA-Z0-9_]+]]: memref<?x?xf32>
	// CHECK-SAME: %[[ARG2:[a-zA-Z0-9_]+]]: memref<?x?xf32>			// CHECK-SAME: %[[ARG2:[a-zA-Z0-9_]+]]: memref<?x?xf32>
	// CHECK-DAG: %[[C42:.+]] = constant 4.200000e+01 : f32			// CHECK-DAG: %[[C42:.+]] = constant 4.200000e+01 : f32
	// CHECK: scf.for			// CHECK: scf.for
	// CHECK: scf.for			// CHECK: scf.for
	// CHECK: scf.for			// CHECK: scf.for
	// CHECK: %[[T7:.+]] = subview %[[ARG0]]			// CHECK: %[[T7:.+]] = subview %[[ARG0]]
	// CHECK: %[[T12:.+]] = subview %[[ARG1]]			// CHECK: %[[T12:.+]] = subview %[[ARG1]]
	// CHECK: %[[T17:.+]] = subview %[[ARG2]]			// CHECK: %[[T17:.+]] = subview %[[ARG2]]
	// CHECK: %[[T18:.+]] = alloc(%{{.}}, %{{.}}) : memref<?x?xf32, 3>			// CHECK: %[[T18:.+]] = alloc(%{{.}}, %{{.}}) : memref<?x?xf32, 3>
	// CHECK: %[[T19:.+]] = subview %[[T18]]			// CHECK: %[[T19:.+]] = subview %[[T18]]
	// CHECK: %[[T20:.+]] = alloc(%{{.}}, %{{.}}) : memref<?x?xf32, 3>			// CHECK: %[[T20:.+]] = alloc(%{{.}}, %{{.}}) : memref<?x?xf32, 3>
	// CHECK: %[[T21:.+]] = subview %[[T20]]			// CHECK: %[[T21:.+]] = subview %[[T20]]
	// CHECK: linalg.fill(%[[T19]], %[[C42]])			// CHECK: linalg.fill(%[[T19]], %[[C42]])
	// CHECK: linalg.copy(%[[T7]], %[[T19]])			// CHECK: linalg.copy(%[[T7]], %[[T19]])
	// CHECK: linalg.fill(%[[T21]], %[[C42]])			// CHECK: linalg.fill(%[[T21]], %[[C42]])
	// CHECK: linalg.copy(%[[T17]], %[[T21]])			// CHECK: linalg.copy(%[[T17]], %[[T21]])
	// CHECK: linalg.matmul %[[T19]], %[[T12]], %[[T21]]			// CHECK: linalg.matmul ins(%[[T19]], %[[T12]]{{.*}} outs(%[[T21]]
	// CHECK-NOT: linalg.fill			// CHECK-NOT: linalg.fill
	// CHECK: linalg.copy(%[[T21]], %[[T17]])			// CHECK: linalg.copy(%[[T21]], %[[T17]])
	// CHECK: dealloc %[[T18]]			// CHECK: dealloc %[[T18]]
	// CHECK: dealloc %[[T20]]			// CHECK: dealloc %[[T20]]

mlir/test/Dialect/Linalg/roundtrip.mlir

	Show First 20 Lines • Show All 77 Lines • ▼ Show 20 Lines

	// CHECK-DAG: #[[$strided1D:.*]] = affine_map<(d0)[s0] -> (d0 + s0)>			// CHECK-DAG: #[[$strided1D:.*]] = affine_map<(d0)[s0] -> (d0 + s0)>
	// CHECK-DAG: #[[$strided2D:.]] = affine_map<(d0, d1)[s0, s1] -> (d0 s1 + s0 + d1)>			// CHECK-DAG: #[[$strided2D:.]] = affine_map<(d0, d1)[s0, s1] -> (d0 s1 + s0 + d1)>

	func @ops(%arg0: memref<?x?xf32, offset: ?, strides: [?, 1]>,			func @ops(%arg0: memref<?x?xf32, offset: ?, strides: [?, 1]>,
	%arg1: memref<?xf32, offset: ?, strides: [1]>,			%arg1: memref<?xf32, offset: ?, strides: [1]>,
	%arg2: memref<?xf32, offset: ?, strides: [1]>,			%arg2: memref<?xf32, offset: ?, strides: [1]>,
	%arg3: memref<f32>) {			%arg3: memref<f32>) {
	linalg.matmul %arg0, %arg0, %arg0 : (memref<?x?xf32, offset: ?, strides: [?, 1]>,			linalg.matmul ins(%arg0, %arg0 : memref<?x?xf32, offset: ?, strides: [?, 1]>,
	memref<?x?xf32, offset: ?, strides: [?, 1]>,
	memref<?x?xf32, offset: ?, strides: [?, 1]>)			memref<?x?xf32, offset: ?, strides: [?, 1]>)
	linalg.matvec %arg0, %arg1, %arg2 : (memref<?x?xf32, offset: ?, strides: [?, 1]>,			outs(%arg0 : memref<?x?xf32, offset: ?, strides: [?, 1]>)
	memref<?xf32, offset: ?, strides: [1]>,			linalg.matvec ins(%arg0, %arg1: memref<?x?xf32, offset: ?, strides: [?, 1]>,
	memref<?xf32, offset: ?, strides: [1]>)			memref<?xf32, offset: ?, strides: [1]>)
	linalg.dot %arg1, %arg2, %arg3 : (memref<?xf32, offset: ?, strides: [1]>,			outs(%arg2: memref<?xf32, offset: ?, strides: [1]>)
	memref<?xf32, offset: ?, strides: [1]>,			linalg.dot ins(%arg1, %arg2: memref<?xf32, offset: ?, strides: [1]>,
	memref<f32>)			memref<?xf32, offset: ?, strides: [1]>)
				outs(%arg3: memref<f32>)
	return			return
	}			}
	// CHECK-LABEL: func @ops(%			// CHECK-LABEL: func @ops(%
	// CHECK-NEXT: linalg.matmul %{{.}}, %{{.}}, %{{.*}} :			// CHECK: linalg.matmul
	// CHECK-SAME: (memref<?x?xf32, #[[$strided2D]]>,			// CHECK-SAME: ins(%{{.}}, %{{.}} : memref<?x?xf32, #[[$strided2D]]>,
	// CHECK-SAME: memref<?x?xf32, #[[$strided2D]]>,
	// CHECK-SAME: memref<?x?xf32, #[[$strided2D]]>)			// CHECK-SAME: memref<?x?xf32, #[[$strided2D]]>)
	// CHECK-NEXT: linalg.matvec %{{.}}, %{{.}}, %{{.*}} :			// CHECK-SAME: outs(%{{.*}} : memref<?x?xf32, #[[$strided2D]]>)
	// CHECK-SAME: (memref<?x?xf32, #[[$strided2D]]>,			// CHECK: linalg.matvec
	// CHECK-SAME: memref<?xf32, #[[$strided1D]]>,			// CHECK-SAME: ins(%{{.}}, %{{.}}: memref<?x?xf32, #[[$strided2D]]>,
				// CHECK-SAME: memref<?xf32, #[[$strided1D]]>)
				// CHECK-SAME: outs(%{{.*}}: memref<?xf32, #[[$strided1D]]>)
				// CHECK: linalg.dot
				// CHECK-SAME: ins(%{{.}}, %{{.}}: memref<?xf32, #[[$strided1D]]>,
	// CHECK-SAME: memref<?xf32, #[[$strided1D]]>)			// CHECK-SAME: memref<?xf32, #[[$strided1D]]>)
	// CHECK-NEXT: linalg.dot %{{.}}, %{{.}}, %{{.*}} :			// CHECK-SAME: outs(%{{.*}}: memref<f32>)
	// CHECK-SAME: (memref<?xf32, #[[$strided1D]]>,
	// CHECK-SAME: memref<?xf32, #[[$strided1D]]>,
	// CHECK-SAME: memref<f32>)

	// -----			// -----

	// CHECK-DAG: #[[$strided1D:.*]] = affine_map<(d0)[s0] -> (d0 + s0)>			// CHECK-DAG: #[[$strided1D:.*]] = affine_map<(d0)[s0] -> (d0 + s0)>

	func @fill_view(%arg0: memref<?xf32, offset: ?, strides: [1]>, %arg1: f32) {			func @fill_view(%arg0: memref<?xf32, offset: ?, strides: [1]>, %arg1: f32) {
	linalg.fill(%arg0, %arg1) : memref<?xf32, offset: ?, strides: [1]>, f32			linalg.fill(%arg0, %arg1) : memref<?xf32, offset: ?, strides: [1]>, f32
	return			return
	▲ Show 20 Lines • Show All 496 Lines • ▼ Show 20 Lines
	// CHECK-SAME: memref<?x?x?xf32, #[[$strided3DOFF0]]> into memref<?x?xf32, #[[$strided2DOFF0]]>			// CHECK-SAME: memref<?x?x?xf32, #[[$strided3DOFF0]]> into memref<?x?xf32, #[[$strided2DOFF0]]>
	// CHECK: linalg.reshape {{.*}} [#[[$reshapeD01]], #[[$reshapeD2]]]			// CHECK: linalg.reshape {{.*}} [#[[$reshapeD01]], #[[$reshapeD2]]]
	// CHECK-SAME: memref<?x?xf32, #[[$strided2DOFF0]]> into memref<?x?x?xf32, #[[$strided3DOFF0]]>			// CHECK-SAME: memref<?x?xf32, #[[$strided2DOFF0]]> into memref<?x?x?xf32, #[[$strided3DOFF0]]>
	// CHECK: linalg.reshape {{.*}} [#[[$reshapeD01]], #[[$reshapeD2]]]			// CHECK: linalg.reshape {{.*}} [#[[$reshapeD01]], #[[$reshapeD2]]]
	// CHECK-SAME: memref<?x?x?xf32, #[[$strided3D]]> into memref<?x?xf32, #[[$strided2D]]>			// CHECK-SAME: memref<?x?x?xf32, #[[$strided3D]]> into memref<?x?xf32, #[[$strided2D]]>
	// CHECK: linalg.reshape {{.*}} [#[[$reshapeD01]], #[[$reshapeD2]]]			// CHECK: linalg.reshape {{.*}} [#[[$reshapeD01]], #[[$reshapeD2]]]
	// CHECK-SAME: memref<?x?xf32, #[[$strided2D]]> into memref<?x?x?xf32, #[[$strided3D]]>			// CHECK-SAME: memref<?x?xf32, #[[$strided2D]]> into memref<?x?x?xf32, #[[$strided3D]]>


	// TODO: Return tensors need a semantics convention update.
	func @named_ops(%a3: memref<?x?x?xf32>, %b3: memref<?x?x?xf32>, %c3: memref<?x?x?xf32>,			func @named_ops(%a3: memref<?x?x?xf32>, %b3: memref<?x?x?xf32>, %c3: memref<?x?x?xf32>,
	%ta3: tensor<?x?x?xf32>, %tb3: tensor<?x?x?xf32>, %tc3: tensor<?x?x?xf32>) {			%ta3: tensor<?x?x?xf32>, %tb3: tensor<?x?x?xf32>, %tc3: tensor<?x?x?xf32>)
	linalg.batch_matmul %a3, %b3, %c3 : (memref<?x?x?xf32>, memref<?x?x?xf32>, memref<?x?x?xf32>) -> ()			-> (tensor<?x?x?xf32>, tensor<?x?x?xf32>)
	linalg.batch_matmul %ta3, %tb3, %c3 : (tensor<?x?x?xf32>, tensor<?x?x?xf32>, memref<?x?x?xf32>) -> ()			{
	return			linalg.batch_matmul ins(%a3, %b3: memref<?x?x?xf32>, memref<?x?x?xf32>)
				outs(%c3: memref<?x?x?xf32>)
				linalg.batch_matmul ins(%ta3, %tb3: tensor<?x?x?xf32>, tensor<?x?x?xf32>)
				outs(%c3: memref<?x?x?xf32>)
				%res1 = linalg.batch_matmul ins(%ta3, %tb3: tensor<?x?x?xf32>, tensor<?x?x?xf32>)
				init(%tc3: tensor<?x?x?xf32>)
				-> tensor<?x?x?xf32>
				%res2 = linalg.batch_matmul ins(%ta3, %b3: tensor<?x?x?xf32>, memref<?x?x?xf32>)
				init(%tc3: tensor<?x?x?xf32>)
				-> tensor<?x?x?xf32>
				return %res1, %res2 : tensor<?x?x?xf32>, tensor<?x?x?xf32>
	}			}
	// CHECK-LABEL: func @named_ops			// CHECK-LABEL: func @named_ops
	// CHECK: linalg.batch_matmul			// CHECK: linalg.batch_matmul
	// CHECK: linalg.batch_matmul			// CHECK: linalg.batch_matmul
				// CHECK: linalg.batch_matmul
				// CHECK: linalg.batch_matmul

	// -----			// -----

	func @tensor_reshape_zero_dim(%arg0 : tensor<1x1xf32>, %arg1 : tensor<f32>) -> (tensor<f32>, tensor<1x1xf32>)			func @tensor_reshape_zero_dim(%arg0 : tensor<1x1xf32>, %arg1 : tensor<f32>) -> (tensor<f32>, tensor<1x1xf32>)
	{			{
	%0 = linalg.tensor_reshape %arg0 [] : tensor<1x1xf32> into tensor<f32>			%0 = linalg.tensor_reshape %arg0 [] : tensor<1x1xf32> into tensor<f32>
	%1 = linalg.tensor_reshape %0 [] : tensor<f32> into tensor<1x1xf32>			%1 = linalg.tensor_reshape %0 [] : tensor<f32> into tensor<1x1xf32>
	return %0, %1 : tensor<f32>, tensor<1x1xf32>			return %0, %1 : tensor<f32>, tensor<1x1xf32>
	Show All 16 Lines

mlir/test/Dialect/Linalg/standard.mlir

	// RUN: mlir-opt %s -convert-linalg-to-std \| FileCheck %s			// RUN: mlir-opt %s -convert-linalg-to-std \| FileCheck %s

	// CHECK-DAG: #[[$map0:.*]] = affine_map<(d0)[s0] -> (d0 + s0)>			// CHECK-DAG: #[[$map0:.*]] = affine_map<(d0)[s0] -> (d0 + s0)>
	// CHECK-DAG: #[[$map1:.]] = affine_map<(d0, d1, d2)[s0, s1, s2] -> (d0 s1 + s0 + d1 * s2 + d2)>			// CHECK-DAG: #[[$map1:.]] = affine_map<(d0, d1, d2)[s0, s1, s2] -> (d0 s1 + s0 + d1 * s2 + d2)>
	// CHECK-DAG: #[[$map2:.]] = affine_map<(d0, d1, d2)[s0, s1, s2] -> (d0 s1 + s0 + d2 * s2 + d1)>			// CHECK-DAG: #[[$map2:.]] = affine_map<(d0, d1, d2)[s0, s1, s2] -> (d0 s1 + s0 + d2 * s2 + d1)>
	// CHECK-DAG: #[[$map3:.*]] = affine_map<(d0, d1, d2) -> (d0, d2, d1)>			// CHECK-DAG: #[[$map3:.*]] = affine_map<(d0, d1, d2) -> (d0, d2, d1)>
	// CHECK-DAG: #[[$map4:.]] = affine_map<(d0, d1, d2)[s0, s1, s2] -> (d2 s1 + s0 + d1 * s2 + d0)>			// CHECK-DAG: #[[$map4:.]] = affine_map<(d0, d1, d2)[s0, s1, s2] -> (d2 s1 + s0 + d1 * s2 + d0)>
	// CHECK-DAG: #[[$map5:.*]] = affine_map<(d0, d1, d2) -> (d2, d1, d0)>			// CHECK-DAG: #[[$map5:.*]] = affine_map<(d0, d1, d2) -> (d2, d1, d0)>
	// CHECK-DAG: #[[$map6:.]] = affine_map<(d0)[s0, s1] -> (d0 s1 + s0)>			// CHECK-DAG: #[[$map6:.]] = affine_map<(d0)[s0, s1] -> (d0 s1 + s0)>
	// CHECK-DAG: #[[$map7:.*]] = affine_map<()[s0] -> (s0)>			// CHECK-DAG: #[[$map7:.*]] = affine_map<()[s0] -> (s0)>
	// CHECK-DAG: #[[$map8:.]] = affine_map<(d0, d1, d2)[s0, s1, s2, s3] -> (d0 s1 + s0 + d1 * s2 + d2 * s3)>			// CHECK-DAG: #[[$map8:.]] = affine_map<(d0, d1, d2)[s0, s1, s2, s3] -> (d0 s1 + s0 + d1 * s2 + d2 * s3)>

	func @dot(%arg0: memref<?xf32, offset: ?, strides: [1]>,			func @dot(%arg0: memref<?xf32, offset: ?, strides: [1]>,
	%arg1: memref<?xf32, offset: ?, strides: [1]>,			%arg1: memref<?xf32, offset: ?, strides: [1]>,
	%arg2: memref<f32>) {			%arg2: memref<f32>) {
	linalg.dot %arg0, %arg1, %arg2 : (memref<?xf32, offset: ?, strides: [1]>,			linalg.dot ins(%arg0, %arg1: memref<?xf32, offset: ?, strides: [1]>,
	memref<?xf32, offset: ?, strides: [1]>,			memref<?xf32, offset: ?, strides: [1]>)
	memref<f32>)			outs(%arg2: memref<f32>)
	return			return
	}			}
	// CHECK-LABEL: func @dot(			// CHECK-LABEL: func @dot(
	// CHECK-SAME: %[[arg0:[a-zA-z0-9]*]]: memref<?xf32, #[[$map0]]>,			// CHECK-SAME: %[[arg0:[a-zA-z0-9]*]]: memref<?xf32, #[[$map0]]>,
	// CHECK-SAME: %[[arg1:[a-zA-z0-9]*]]: memref<?xf32, #[[$map0]]>,			// CHECK-SAME: %[[arg1:[a-zA-z0-9]*]]: memref<?xf32, #[[$map0]]>,
	// CHECK-SAME: %[[arg2:[a-zA-z0-9]*]]: memref<f32>) {			// CHECK-SAME: %[[arg2:[a-zA-z0-9]*]]: memref<f32>) {
	// CHECK: %[[o0:.*]] = memref_cast %[[arg0]] :			// CHECK: %[[o0:.*]] = memref_cast %[[arg0]] :
	// CHECK-SAME: memref<?xf32, #[[$map0]]> to memref<?xf32, #[[$map6]]>			// CHECK-SAME: memref<?xf32, #[[$map0]]> to memref<?xf32, #[[$map6]]>
	▲ Show 20 Lines • Show All 96 Lines • Show Last 20 Lines

mlir/test/Dialect/Linalg/tile-and-distribute.mlir

	// RUN: mlir-opt %s -test-linalg-transform-patterns=test-tile-and-distribute-options -split-input-file \| FileCheck %s			// RUN: mlir-opt %s -test-linalg-transform-patterns=test-tile-and-distribute-options -split-input-file \| FileCheck %s

	func @gemm1(%a : memref<?x?xf32>, %b : memref<?x?xf32>, %c : memref<?x?xf32>)			func @gemm1(%a : memref<?x?xf32>, %b : memref<?x?xf32>, %c : memref<?x?xf32>)
	{			{
	linalg.matmul %a, %b, %c {__internal_linalg_transform__ = "distribute1"}			linalg.matmul {__internal_linalg_transform__ = "distribute1"}
	: (memref<?x?xf32>, memref<?x?xf32>, memref<?x?xf32>)			ins(%a, %b: memref<?x?xf32>, memref<?x?xf32>)
				outs(%c: memref<?x?xf32>)
	return			return
	}			}
	// CHECK-DAG: #[[MAP0:.]] = affine_map<()[s0] -> (s0 8)>			// CHECK-DAG: #[[MAP0:.]] = affine_map<()[s0] -> (s0 8)>
	// CHECK: func @gemm1(			// CHECK: func @gemm1(
	// CHECK-SAME: %[[ARG0:[a-zA-Z0-9_]*]]: memref<?x?xf32>			// CHECK-SAME: %[[ARG0:[a-zA-Z0-9_]*]]: memref<?x?xf32>
	// CHECK-SAME: %[[ARG1:[a-zA-Z0-9_]*]]: memref<?x?xf32>			// CHECK-SAME: %[[ARG1:[a-zA-Z0-9_]*]]: memref<?x?xf32>
	// CHECK-SAME: %[[ARG2:[a-zA-Z0-9_]*]]: memref<?x?xf32>			// CHECK-SAME: %[[ARG2:[a-zA-Z0-9_]*]]: memref<?x?xf32>
	// CHECK: %[[BIDY:.*]] = "gpu.block_id"() {dimension = "y"}			// CHECK: %[[BIDY:.*]] = "gpu.block_id"() {dimension = "y"}
	// CHECK: %[[BIDX:.*]] = "gpu.block_id"() {dimension = "x"}			// CHECK: %[[BIDX:.*]] = "gpu.block_id"() {dimension = "x"}
	// CHECK: scf.for %[[ARG3:.*]] =			// CHECK: scf.for %[[ARG3:.*]] =
	// CHECK: %[[OFFSETY:.*]] = affine.apply #[[MAP0]]()[%[[BIDY]]]			// CHECK: %[[OFFSETY:.*]] = affine.apply #[[MAP0]]()[%[[BIDY]]]
	// CHECK: %[[SV1:.*]] = subview %[[ARG0]][%[[OFFSETY]], %[[ARG3]]]			// CHECK: %[[SV1:.*]] = subview %[[ARG0]][%[[OFFSETY]], %[[ARG3]]]
	// CHECK: %[[OFFSETX:.*]] = affine.apply #[[MAP0]]()[%[[BIDX]]]			// CHECK: %[[OFFSETX:.*]] = affine.apply #[[MAP0]]()[%[[BIDX]]]
	// CHECK: %[[SV2:.*]] = subview %[[ARG1]][%[[ARG3]], %[[OFFSETX]]]			// CHECK: %[[SV2:.*]] = subview %[[ARG1]][%[[ARG3]], %[[OFFSETX]]]
	// CHECK: %[[OFFSETY_2:.*]] = affine.apply #[[MAP0]]()[%[[BIDY]]]			// CHECK: %[[OFFSETY_2:.*]] = affine.apply #[[MAP0]]()[%[[BIDY]]]
	// CHECK: %[[OFFSETX:.*]] = affine.apply #[[MAP0]]()[%[[BIDX]]]			// CHECK: %[[OFFSETX:.*]] = affine.apply #[[MAP0]]()[%[[BIDX]]]
	// CHECK: %[[SV3:.*]] = subview %[[ARG2]][%[[OFFSETY_2]], %[[OFFSETX]]]			// CHECK: %[[SV3:.*]] = subview %[[ARG2]][%[[OFFSETY_2]], %[[OFFSETX]]]
	// CHECK: linalg.matmul %[[SV1]], %[[SV2]], %[[SV3]]			// CHECK: linalg.matmul ins(%[[SV1]], %[[SV2]]{{.*}} outs(%[[SV3]]

	// -----			// -----

	func @gemm2(%a : memref<?x?xf32>, %b : memref<?x?xf32>, %c : memref<?x?xf32>)			func @gemm2(%a : memref<?x?xf32>, %b : memref<?x?xf32>, %c : memref<?x?xf32>)
	{			{
	linalg.matmul %a, %b, %c {__internal_linalg_transform__ = "distribute2"}			linalg.matmul {__internal_linalg_transform__ = "distribute2"}
	: (memref<?x?xf32>, memref<?x?xf32>, memref<?x?xf32>)			ins(%a, %b: memref<?x?xf32>, memref<?x?xf32>)
				outs(%c:memref<?x?xf32>)
	return			return
	}			}
	// CHECK-DAG: #[[MAP0:.]] = affine_map<()[s0] -> (s0 8)>			// CHECK-DAG: #[[MAP0:.]] = affine_map<()[s0] -> (s0 8)>
	// CHECK: func @gemm2(			// CHECK: func @gemm2(
	// CHECK-SAME: %[[ARG0:[a-zA-Z0-9_]*]]: memref<?x?xf32>			// CHECK-SAME: %[[ARG0:[a-zA-Z0-9_]*]]: memref<?x?xf32>
	// CHECK-SAME: %[[ARG1:[a-zA-Z0-9_]*]]: memref<?x?xf32>			// CHECK-SAME: %[[ARG1:[a-zA-Z0-9_]*]]: memref<?x?xf32>
	// CHECK-SAME: %[[ARG2:[a-zA-Z0-9_]*]]: memref<?x?xf32>			// CHECK-SAME: %[[ARG2:[a-zA-Z0-9_]*]]: memref<?x?xf32>
	// CHECK-DAG: %[[BIDY:.*]] = "gpu.block_id"() {dimension = "y"}			// CHECK-DAG: %[[BIDY:.*]] = "gpu.block_id"() {dimension = "y"}
	// CHECK-DAG: %[[BIDX:.*]] = "gpu.block_id"() {dimension = "x"}			// CHECK-DAG: %[[BIDX:.*]] = "gpu.block_id"() {dimension = "x"}
	// CHECK: %[[ITERY:.*]] = affine.apply #[[MAP0]]()[%[[BIDY]]]			// CHECK: %[[ITERY:.*]] = affine.apply #[[MAP0]]()[%[[BIDY]]]
	// CHECK: %[[ITERX:.*]] = affine.apply #[[MAP0]]()[%[[BIDX]]]			// CHECK: %[[ITERX:.*]] = affine.apply #[[MAP0]]()[%[[BIDX]]]
	// CHECK: %[[INBOUNDSY:.]] = cmpi "slt", %[[ITERY]], %{{.}}			// CHECK: %[[INBOUNDSY:.]] = cmpi "slt", %[[ITERY]], %{{.}}
	// CHECK: %[[INBOUNDSX:.]] = cmpi "slt", %[[ITERX]], %{{.}}			// CHECK: %[[INBOUNDSX:.]] = cmpi "slt", %[[ITERX]], %{{.}}
	// CHECK: %[[INBOUNDS:.*]] = and %[[INBOUNDSY]], %[[INBOUNDSX]]			// CHECK: %[[INBOUNDS:.*]] = and %[[INBOUNDSY]], %[[INBOUNDSX]]
	// CHECK: scf.if %[[INBOUNDS]]			// CHECK: scf.if %[[INBOUNDS]]
	// CHECK: scf.for %[[ARG3:.*]] =			// CHECK: scf.for %[[ARG3:.*]] =
	// CHECK: %[[OFFSETY:.*]] = affine.apply #[[MAP0]]()[%[[BIDY]]]			// CHECK: %[[OFFSETY:.*]] = affine.apply #[[MAP0]]()[%[[BIDY]]]
	// CHECK: %[[SV1:.*]] = subview %[[ARG0]][%[[OFFSETY]], %[[ARG3]]]			// CHECK: %[[SV1:.*]] = subview %[[ARG0]][%[[OFFSETY]], %[[ARG3]]]
	// CHECK: %[[OFFSETX:.*]] = affine.apply #[[MAP0]]()[%[[BIDX]]]			// CHECK: %[[OFFSETX:.*]] = affine.apply #[[MAP0]]()[%[[BIDX]]]
	// CHECK: %[[SV2:.*]] = subview %[[ARG1]][%[[ARG3]], %[[OFFSETX]]]			// CHECK: %[[SV2:.*]] = subview %[[ARG1]][%[[ARG3]], %[[OFFSETX]]]
	// CHECK: %[[OFFSETY_2:.*]] = affine.apply #[[MAP0]]()[%[[BIDY]]]			// CHECK: %[[OFFSETY_2:.*]] = affine.apply #[[MAP0]]()[%[[BIDY]]]
	// CHECK: %[[OFFSETX_2:.*]] = affine.apply #[[MAP0]]()[%[[BIDX]]]			// CHECK: %[[OFFSETX_2:.*]] = affine.apply #[[MAP0]]()[%[[BIDX]]]
	// CHECK: %[[SV3:.*]] = subview %[[ARG2]][%[[OFFSETY_2]], %[[OFFSETX_2]]]			// CHECK: %[[SV3:.*]] = subview %[[ARG2]][%[[OFFSETY_2]], %[[OFFSETX_2]]]
	// CHECK: linalg.matmul %[[SV1]], %[[SV2]], %[[SV3]]			// CHECK: linalg.matmul ins(%[[SV1]], %[[SV2]]{{.*}} outs(%[[SV3]]

	// -----			// -----

	func @gemm3(%a : memref<?x?xf32>, %b : memref<?x?xf32>, %c : memref<?x?xf32>)			func @gemm3(%a : memref<?x?xf32>, %b : memref<?x?xf32>, %c : memref<?x?xf32>)
	{			{
	linalg.matmul %a, %b, %c {__internal_linalg_transform__ = "distribute3"}			linalg.matmul {__internal_linalg_transform__ = "distribute3"}
	: (memref<?x?xf32>, memref<?x?xf32>, memref<?x?xf32>)			ins(%a, %b: memref<?x?xf32>, memref<?x?xf32>)
				outs(%c: memref<?x?xf32>)
	return			return
	}			}
	// CHECK-DAG: #[[MAP0:.]] = affine_map<()[s0] -> (s0 8)>			// CHECK-DAG: #[[MAP0:.]] = affine_map<()[s0] -> (s0 8)>
	// CHECK: func @gemm3(			// CHECK: func @gemm3(
	// CHECK-SAME: %[[ARG0:[a-zA-Z0-9_]*]]: memref<?x?xf32>			// CHECK-SAME: %[[ARG0:[a-zA-Z0-9_]*]]: memref<?x?xf32>
	// CHECK-SAME: %[[ARG1:[a-zA-Z0-9_]*]]: memref<?x?xf32>			// CHECK-SAME: %[[ARG1:[a-zA-Z0-9_]*]]: memref<?x?xf32>
	// CHECK-SAME: %[[ARG2:[a-zA-Z0-9_]*]]: memref<?x?xf32>			// CHECK-SAME: %[[ARG2:[a-zA-Z0-9_]*]]: memref<?x?xf32>
	// CHECK: %[[BIDY:.*]] = "gpu.block_id"() {dimension = "y"}			// CHECK: %[[BIDY:.*]] = "gpu.block_id"() {dimension = "y"}
	// CHECK: %[[NBLOCKSY:.*]] = "gpu.grid_dim"() {dimension = "y"}			// CHECK: %[[NBLOCKSY:.*]] = "gpu.grid_dim"() {dimension = "y"}
	// CHECK: %[[BIDX:.*]] = "gpu.block_id"() {dimension = "x"}			// CHECK: %[[BIDX:.*]] = "gpu.block_id"() {dimension = "x"}
	// CHECK: %[[NBLOCKSX:.*]] = "gpu.grid_dim"() {dimension = "x"}			// CHECK: %[[NBLOCKSX:.*]] = "gpu.grid_dim"() {dimension = "x"}
	// CHECK: %[[LBY:.*]] = affine.apply #[[MAP0]]()[%[[BIDY]]]			// CHECK: %[[LBY:.*]] = affine.apply #[[MAP0]]()[%[[BIDY]]]
	// CHECK: %[[STEPY:.*]] = affine.apply #[[MAP0]]()[%[[NBLOCKSY]]]			// CHECK: %[[STEPY:.*]] = affine.apply #[[MAP0]]()[%[[NBLOCKSY]]]
	// CHECK: %[[LBX:.*]] = affine.apply #[[MAP0]]()[%[[BIDX]]]			// CHECK: %[[LBX:.*]] = affine.apply #[[MAP0]]()[%[[BIDX]]]
	// CHECK: %[[STEPX:.*]] = affine.apply #[[MAP0]]()[%[[NBLOCKSX]]]			// CHECK: %[[STEPX:.*]] = affine.apply #[[MAP0]]()[%[[NBLOCKSX]]]
	// CHECK: scf.parallel (%[[ARG3:.]], %[[ARG4:.]]) = (%[[LBY]], %[[LBX]]) to (%{{.}}, %{{.}}) step (%[[STEPY]], %[[STEPX]])			// CHECK: scf.parallel (%[[ARG3:.]], %[[ARG4:.]]) = (%[[LBY]], %[[LBX]]) to (%{{.}}, %{{.}}) step (%[[STEPY]], %[[STEPX]])
	// CHECK: scf.for %[[ARG5:.*]] =			// CHECK: scf.for %[[ARG5:.*]] =
	// CHECK: %[[SV1:.*]] = subview %[[ARG0]][%[[ARG3]], %[[ARG5]]]			// CHECK: %[[SV1:.*]] = subview %[[ARG0]][%[[ARG3]], %[[ARG5]]]
	// CHECK: %[[SV2:.*]] = subview %[[ARG1]][%[[ARG5]], %[[ARG4]]]			// CHECK: %[[SV2:.*]] = subview %[[ARG1]][%[[ARG5]], %[[ARG4]]]
	// CHECK: %[[SV3:.*]] = subview %[[ARG2]][%[[ARG3]], %[[ARG4]]]			// CHECK: %[[SV3:.*]] = subview %[[ARG2]][%[[ARG3]], %[[ARG4]]]
	// CHECK: linalg.matmul %[[SV1]], %[[SV2]], %[[SV3]]			// CHECK: linalg.matmul ins(%[[SV1]], %[[SV2]]{{.*}} outs(%[[SV3]]

	// -----			// -----

	func @gemm4(%a : memref<?x?xf32>, %b : memref<?x?xf32>, %c : memref<?x?xf32>)			func @gemm4(%a : memref<?x?xf32>, %b : memref<?x?xf32>, %c : memref<?x?xf32>)
	{			{
	linalg.matmul %a, %b, %c {__internal_linalg_transform__ = "distribute4"}			linalg.matmul {__internal_linalg_transform__ = "distribute4"}
	: (memref<?x?xf32>, memref<?x?xf32>, memref<?x?xf32>)			ins(%a, %b: memref<?x?xf32>, memref<?x?xf32>)
				outs(%c: memref<?x?xf32>)
	return			return
	}			}
	// CHECK-DAG: #[[MAP0:.]] = affine_map<()[s0] -> (s0 8)>			// CHECK-DAG: #[[MAP0:.]] = affine_map<()[s0] -> (s0 8)>
	// CHECK: func @gemm4(			// CHECK: func @gemm4(
	// CHECK-SAME: %[[ARG0:[a-zA-Z0-9_]*]]: memref<?x?xf32>			// CHECK-SAME: %[[ARG0:[a-zA-Z0-9_]*]]: memref<?x?xf32>
	// CHECK-SAME: %[[ARG1:[a-zA-Z0-9_]*]]: memref<?x?xf32>			// CHECK-SAME: %[[ARG1:[a-zA-Z0-9_]*]]: memref<?x?xf32>
	// CHECK-SAME: %[[ARG2:[a-zA-Z0-9_]*]]: memref<?x?xf32>			// CHECK-SAME: %[[ARG2:[a-zA-Z0-9_]*]]: memref<?x?xf32>
	// CHECK: %[[BIDY:.*]] = "gpu.block_id"() {dimension = "y"}			// CHECK: %[[BIDY:.*]] = "gpu.block_id"() {dimension = "y"}
	// CHECK: %[[BIDX:.*]] = "gpu.block_id"() {dimension = "x"}			// CHECK: %[[BIDX:.*]] = "gpu.block_id"() {dimension = "x"}
	// CHECK: %[[LBX:.*]] = affine.apply #[[MAP0]]()[%[[BIDX]]]			// CHECK: %[[LBX:.*]] = affine.apply #[[MAP0]]()[%[[BIDX]]]
	// CHECK: %[[INBOUNDS:.]] = cmpi "slt", %[[LBX]], %{{.}}			// CHECK: %[[INBOUNDS:.]] = cmpi "slt", %[[LBX]], %{{.}}
	// CHECK: scf.if %[[INBOUNDS]]			// CHECK: scf.if %[[INBOUNDS]]
	// CHECK: scf.for %[[ARG3:.*]] =			// CHECK: scf.for %[[ARG3:.*]] =
	// CHECK: %[[OFFSETY:.*]] = affine.apply #[[MAP0]]()[%[[BIDY]]]			// CHECK: %[[OFFSETY:.*]] = affine.apply #[[MAP0]]()[%[[BIDY]]]
	// CHECK: %[[SV1:.*]] = subview %[[ARG0]][%[[OFFSETY]], %[[ARG3]]]			// CHECK: %[[SV1:.*]] = subview %[[ARG0]][%[[OFFSETY]], %[[ARG3]]]
	// CHECK: %[[OFFSETX:.*]] = affine.apply #[[MAP0]]()[%[[BIDX]]]			// CHECK: %[[OFFSETX:.*]] = affine.apply #[[MAP0]]()[%[[BIDX]]]
	// CHECK: %[[SV2:.*]] = subview %[[ARG1]][%[[ARG3]], %[[OFFSETX]]]			// CHECK: %[[SV2:.*]] = subview %[[ARG1]][%[[ARG3]], %[[OFFSETX]]]
	// CHECK: %[[OFFSETY_2:.*]] = affine.apply #[[MAP0]]()[%[[BIDY]]]			// CHECK: %[[OFFSETY_2:.*]] = affine.apply #[[MAP0]]()[%[[BIDY]]]
	// CHECK: %[[OFFSETX_2:.*]] = affine.apply #[[MAP0]]()[%[[BIDX]]]			// CHECK: %[[OFFSETX_2:.*]] = affine.apply #[[MAP0]]()[%[[BIDX]]]
	// CHECK: %[[SV3:.*]] = subview %[[ARG2]][%[[OFFSETY_2]], %[[OFFSETX_2]]]			// CHECK: %[[SV3:.*]] = subview %[[ARG2]][%[[OFFSETY_2]], %[[OFFSETX_2]]]
	// CHECK: linalg.matmul %[[SV1]], %[[SV2]], %[[SV3]]			// CHECK: linalg.matmul ins(%[[SV1]], %[[SV2]]{{.*}} outs(%[[SV3]]

	// -----			// -----

	func @gemm5(%a : memref<?x?xf32>, %b : memref<?x?xf32>, %c : memref<?x?xf32>)			func @gemm5(%a : memref<?x?xf32>, %b : memref<?x?xf32>, %c : memref<?x?xf32>)
	{			{
	linalg.matmul %a, %b, %c {__internal_linalg_transform__ = "distribute5"}			linalg.matmul {__internal_linalg_transform__ = "distribute5"}
	: (memref<?x?xf32>, memref<?x?xf32>, memref<?x?xf32>)			ins(%a, %b: memref<?x?xf32>, memref<?x?xf32>)
				outs(%c: memref<?x?xf32>)
	return			return
	}			}
	// CHECK-DAG: #[[MAP0:.]] = affine_map<()[s0] -> (s0 8)>			// CHECK-DAG: #[[MAP0:.]] = affine_map<()[s0] -> (s0 8)>
	// CHECK: func @gemm5(			// CHECK: func @gemm5(
	// CHECK-SAME: %[[ARG0:[a-zA-Z0-9_]*]]: memref<?x?xf32>			// CHECK-SAME: %[[ARG0:[a-zA-Z0-9_]*]]: memref<?x?xf32>
	// CHECK-SAME: %[[ARG1:[a-zA-Z0-9_]*]]: memref<?x?xf32>			// CHECK-SAME: %[[ARG1:[a-zA-Z0-9_]*]]: memref<?x?xf32>
	// CHECK-SAME: %[[ARG2:[a-zA-Z0-9_]*]]: memref<?x?xf32>			// CHECK-SAME: %[[ARG2:[a-zA-Z0-9_]*]]: memref<?x?xf32>
	// CHECK: %[[BIDY:.*]] = "gpu.block_id"() {dimension = "y"}			// CHECK: %[[BIDY:.*]] = "gpu.block_id"() {dimension = "y"}
	// CHECK: %[[BIDX:.*]] = "gpu.block_id"() {dimension = "x"}			// CHECK: %[[BIDX:.*]] = "gpu.block_id"() {dimension = "x"}
	// CHECK: %[[NBLOCKSX:.*]] = "gpu.grid_dim"() {dimension = "x"}			// CHECK: %[[NBLOCKSX:.*]] = "gpu.grid_dim"() {dimension = "x"}
	// CHECK: %[[LBY:.*]] = affine.apply #[[MAP0]]()[%[[BIDY]]]			// CHECK: %[[LBY:.*]] = affine.apply #[[MAP0]]()[%[[BIDY]]]
	// CHECK: %[[LBX:.*]] = affine.apply #[[MAP0]]()[%[[BIDX]]]			// CHECK: %[[LBX:.*]] = affine.apply #[[MAP0]]()[%[[BIDX]]]
	// CHECK: %[[STEPX:.*]] = affine.apply #[[MAP0]]()[%[[NBLOCKSX]]]			// CHECK: %[[STEPX:.*]] = affine.apply #[[MAP0]]()[%[[NBLOCKSX]]]
	// CHECK: %[[INBOUNDS:.]] = cmpi "slt", %[[LBY]], %{{.}}			// CHECK: %[[INBOUNDS:.]] = cmpi "slt", %[[LBY]], %{{.}}
	// CHECK: scf.if %[[INBOUNDS]]			// CHECK: scf.if %[[INBOUNDS]]
	// CHECK: scf.parallel (%[[ARG3.]]) = (%[[LBX]]) to (%{{.}}) step (%[[STEPX]])			// CHECK: scf.parallel (%[[ARG3.]]) = (%[[LBX]]) to (%{{.}}) step (%[[STEPX]])
	// CHECK: scf.for %[[ARG4:.*]] =			// CHECK: scf.for %[[ARG4:.*]] =
	// CHECK: %[[OFFSETY:.*]] = affine.apply #[[MAP0]]()[%[[BIDY]]]			// CHECK: %[[OFFSETY:.*]] = affine.apply #[[MAP0]]()[%[[BIDY]]]
	// CHECK: %[[SV1:.*]] = subview %[[ARG0]][%[[OFFSETY]], %[[ARG4]]]			// CHECK: %[[SV1:.*]] = subview %[[ARG0]][%[[OFFSETY]], %[[ARG4]]]
	// CHECK: %[[SV2:.*]] = subview %[[ARG1]][%[[ARG4]], %[[ARG3]]]			// CHECK: %[[SV2:.*]] = subview %[[ARG1]][%[[ARG4]], %[[ARG3]]]
	// CHECK: %[[OFFSETY_2:.*]] = affine.apply #[[MAP0]]()[%[[BIDY]]]			// CHECK: %[[OFFSETY_2:.*]] = affine.apply #[[MAP0]]()[%[[BIDY]]]
	// CHECK: %[[SV3:.*]] = subview %[[ARG2]][%[[OFFSETY_2]], %[[ARG3]]]			// CHECK: %[[SV3:.*]] = subview %[[ARG2]][%[[OFFSETY_2]], %[[ARG3]]]
	// CHECK: linalg.matmul %[[SV1]], %[[SV2]], %[[SV3]]			// CHECK: linalg.matmul ins(%[[SV1]], %[[SV2]]{{.*}} outs(%[[SV3]]

	// -----			// -----

	func @gemm6(%a : memref<?x?xf32>, %b : memref<?x?xf32>, %c : memref<?x?xf32>)			func @gemm6(%a : memref<?x?xf32>, %b : memref<?x?xf32>, %c : memref<?x?xf32>)
	{			{
	linalg.matmul %a, %b, %c {__internal_linalg_transform__ = "distribute6"}			linalg.matmul {__internal_linalg_transform__ = "distribute6"}
	: (memref<?x?xf32>, memref<?x?xf32>, memref<?x?xf32>)			ins(%a, %b: memref<?x?xf32>, memref<?x?xf32>)
				outs(%c: memref<?x?xf32>)
	return			return
	}			}
	// CHECK-DAG: #[[MAP0:.]] = affine_map<()[s0] -> (s0 8)>			// CHECK-DAG: #[[MAP0:.]] = affine_map<()[s0] -> (s0 8)>
	// CHECK: func @gemm6(			// CHECK: func @gemm6(
	// CHECK-SAME: %[[ARG0:[a-zA-Z0-9_]*]]: memref<?x?xf32>			// CHECK-SAME: %[[ARG0:[a-zA-Z0-9_]*]]: memref<?x?xf32>
	// CHECK-SAME: %[[ARG1:[a-zA-Z0-9_]*]]: memref<?x?xf32>			// CHECK-SAME: %[[ARG1:[a-zA-Z0-9_]*]]: memref<?x?xf32>
	// CHECK-SAME: %[[ARG2:[a-zA-Z0-9_]*]]: memref<?x?xf32>			// CHECK-SAME: %[[ARG2:[a-zA-Z0-9_]*]]: memref<?x?xf32>
	// CHECK: %[[BIDY:.*]] = "gpu.block_id"() {dimension = "y"}			// CHECK: %[[BIDY:.*]] = "gpu.block_id"() {dimension = "y"}
	// CHECK: %[[NBLOCKSY:.*]] = "gpu.grid_dim"() {dimension = "y"}			// CHECK: %[[NBLOCKSY:.*]] = "gpu.grid_dim"() {dimension = "y"}
	// CHECK: %[[BIDX:.*]] = "gpu.block_id"() {dimension = "x"}			// CHECK: %[[BIDX:.*]] = "gpu.block_id"() {dimension = "x"}
	// CHECK: %[[LBY:.*]] = affine.apply #[[MAP0]]()[%[[BIDY]]]			// CHECK: %[[LBY:.*]] = affine.apply #[[MAP0]]()[%[[BIDY]]]
	// CHECK: %[[STEPY:.*]] = affine.apply #[[MAP0]]()[%[[NBLOCKSY]]]			// CHECK: %[[STEPY:.*]] = affine.apply #[[MAP0]]()[%[[NBLOCKSY]]]
	// CHECK: scf.parallel (%[[ARG3.]]) = (%[[LBY]]) to (%{{.}}) step (%[[STEPY]])			// CHECK: scf.parallel (%[[ARG3.]]) = (%[[LBY]]) to (%{{.}}) step (%[[STEPY]])
	// CHECK: scf.for %[[ARG4:.*]] =			// CHECK: scf.for %[[ARG4:.*]] =
	// CHECK: %[[SV1:.*]] = subview %[[ARG0]][%[[ARG3]], %[[ARG4]]]			// CHECK: %[[SV1:.*]] = subview %[[ARG0]][%[[ARG3]], %[[ARG4]]]
	// CHECK: %[[OFFSETX:.*]] = affine.apply #[[MAP0]]()[%[[BIDX]]]			// CHECK: %[[OFFSETX:.*]] = affine.apply #[[MAP0]]()[%[[BIDX]]]
	// CHECK: %[[SV2:.*]] = subview %[[ARG1]][%[[ARG4]], %[[OFFSETX]]]			// CHECK: %[[SV2:.*]] = subview %[[ARG1]][%[[ARG4]], %[[OFFSETX]]]
	// CHECK: %[[OFFSETX_2:.*]] = affine.apply #[[MAP0]]()[%[[BIDX]]]			// CHECK: %[[OFFSETX_2:.*]] = affine.apply #[[MAP0]]()[%[[BIDX]]]
	// CHECK: %[[SV3:.*]] = subview %[[ARG2]][%[[ARG3]], %[[OFFSETX_2]]]			// CHECK: %[[SV3:.*]] = subview %[[ARG2]][%[[ARG3]], %[[OFFSETX_2]]]
	// CHECK: linalg.matmul %[[SV1]], %[[SV2]], %[[SV3]]			// CHECK: linalg.matmul ins(%[[SV1]], %[[SV2]]{{.*}} outs(%[[SV3]]

mlir/test/Dialect/Linalg/tile.mlir

	Show All 25 Lines

	// TILE-2-DAG: #[[$stride_99_1_layout_map:.]] = affine_map<(d0, d1)[s0] -> (d0 99 + s0 + d1)>			// TILE-2-DAG: #[[$stride_99_1_layout_map:.]] = affine_map<(d0, d1)[s0] -> (d0 99 + s0 + d1)>
	// TILE-02-DAG: #[[$stride_99_1_layout_map:.]] = affine_map<(d0, d1)[s0] -> (d0 99 + s0 + d1)>			// TILE-02-DAG: #[[$stride_99_1_layout_map:.]] = affine_map<(d0, d1)[s0] -> (d0 99 + s0 + d1)>
	// TILE-234-DAG: #[[$stride_99_1_layout_map:.]] = affine_map<(d0, d1)[s0] -> (d0 99 + s0 + d1)>			// TILE-234-DAG: #[[$stride_99_1_layout_map:.]] = affine_map<(d0, d1)[s0] -> (d0 99 + s0 + d1)>

	func @matmul(%arg0: memref<?x?xf32, offset: ?, strides: [?, 1]>,			func @matmul(%arg0: memref<?x?xf32, offset: ?, strides: [?, 1]>,
	%arg1: memref<?x?xf32, offset: ?, strides: [?, 1]>,			%arg1: memref<?x?xf32, offset: ?, strides: [?, 1]>,
	%arg2: memref<?x?xf32, offset: ?, strides: [?, 1]>) {			%arg2: memref<?x?xf32, offset: ?, strides: [?, 1]>) {
	linalg.matmul %arg0, %arg1, %arg2 :			linalg.matmul
	(memref<?x?xf32, offset: ?, strides: [?, 1]>,			ins(%arg0, %arg1: memref<?x?xf32, offset: ?, strides: [?, 1]>,
	memref<?x?xf32, offset: ?, strides: [?, 1]>,
	memref<?x?xf32, offset: ?, strides: [?, 1]>)			memref<?x?xf32, offset: ?, strides: [?, 1]>)
				outs(%arg2: memref<?x?xf32, offset: ?, strides: [?, 1]>)
	return			return
	}			}
	// TILE-2-LABEL: func @matmul(			// TILE-2-LABEL: func @matmul(
	// TILE-2-DAG: %[[C0:.*]] = constant 0 : index			// TILE-2-DAG: %[[C0:.*]] = constant 0 : index
	// TILE-2-DAG: %[[C2:.*]] = constant 2 : index			// TILE-2-DAG: %[[C2:.*]] = constant 2 : index
	// TILE-2: %[[M:.]] = dim %{{.}}, %c0 : memref<?x?xf32, #[[$strided2D]]>			// TILE-2: %[[M:.]] = dim %{{.}}, %c0 : memref<?x?xf32, #[[$strided2D]]>
	// TILE-2: scf.for %[[I:.]] = %{{.}}{{.}} to %[[M]] step %{{.}} {			// TILE-2: scf.for %[[I:.]] = %{{.}}{{.}} to %[[M]] step %{{.}} {
	// TILE-2: %[[localM:.]] = dim %{{.}}, %c0			// TILE-2: %[[localM:.]] = dim %{{.}}, %c0
	// TILE-2: %[[szM:.*]] = affine.min #[[$bound_map]](%[[I]])[%[[localM]]]			// TILE-2: %[[szM:.*]] = affine.min #[[$bound_map]](%[[I]])[%[[localM]]]
	// TILE-2: %[[K:.]] = dim %{{.}}, %c1 : memref<?x?xf32, #[[$strided2D]]>			// TILE-2: %[[K:.]] = dim %{{.}}, %c1 : memref<?x?xf32, #[[$strided2D]]>
	// TILE-2: %[[sAi:.]] = subview %{{.}}[%[[I]], 0] [%[[szM]], %[[K]]] [1, 1] : memref<?x?xf32, #[[$strided2D]]> to memref<?x?xf32, #[[$strided2D]]>			// TILE-2: %[[sAi:.]] = subview %{{.}}[%[[I]], 0] [%[[szM]], %[[K]]] [1, 1] : memref<?x?xf32, #[[$strided2D]]> to memref<?x?xf32, #[[$strided2D]]>
	// TILE-2: %[[localK:.]] = dim %{{.}}, %c0			// TILE-2: %[[localK:.]] = dim %{{.}}, %c0
	// TILE-2: %[[szK:.*]] = affine.min #[[$bound_map]](%[[I]])[%[[localK]]]			// TILE-2: %[[szK:.*]] = affine.min #[[$bound_map]](%[[I]])[%[[localK]]]
	// TILE-2: %[[N:.]] = dim %{{.}}, %c1 : memref<?x?xf32, #[[$strided2D]]>			// TILE-2: %[[N:.]] = dim %{{.}}, %c1 : memref<?x?xf32, #[[$strided2D]]>
	// TILE-2: %[[sCi:.]] = subview %{{.}}[%[[I]], 0] [%[[szK]], %[[N]]] [1, 1] : memref<?x?xf32, #[[$strided2D]]> to memref<?x?xf32, #[[$strided2D]]>			// TILE-2: %[[sCi:.]] = subview %{{.}}[%[[I]], 0] [%[[szK]], %[[N]]] [1, 1] : memref<?x?xf32, #[[$strided2D]]> to memref<?x?xf32, #[[$strided2D]]>
	// TILE-2: linalg.matmul %[[sAi]], %{{.*}}, %[[sCi]] :			// TILE-2: linalg.matmul ins(%[[sAi]]{{.*}} outs(%[[sCi]]
	// TILE-2: (memref<?x?xf32, #[[$strided2D]]>,
	// TILE-2: memref<?x?xf32, #[[$strided2D]]>,
	// TILE-2: memref<?x?xf32, #[[$strided2D]]>)

	// TILE-02-LABEL: func @matmul(			// TILE-02-LABEL: func @matmul(
	// TILE-02-DAG: %[[C0:.*]] = constant 0 : index			// TILE-02-DAG: %[[C0:.*]] = constant 0 : index
	// TILE-02-DAG: %[[C2:.*]] = constant 2 : index			// TILE-02-DAG: %[[C2:.*]] = constant 2 : index
	// TILE-02: %[[N:.*]] = dim %arg1, %c1 : memref<?x?xf32, #[[$strided2D]]>			// TILE-02: %[[N:.*]] = dim %arg1, %c1 : memref<?x?xf32, #[[$strided2D]]>
	// TILE-02: scf.for %[[J:.]] = %{{.}} to %[[N]] step %{{.*}} {			// TILE-02: scf.for %[[J:.]] = %{{.}} to %[[N]] step %{{.*}} {
	// TILE-02: %[[K:.]] = dim %{{.}}, %c0 : memref<?x?xf32, #[[$strided2D]]>			// TILE-02: %[[K:.]] = dim %{{.}}, %c0 : memref<?x?xf32, #[[$strided2D]]>
	// TILE-02: %[[localN:.]] = dim %{{.}}, %c1			// TILE-02: %[[localN:.]] = dim %{{.}}, %c1
	// TILE-02: %[[szN:.*]] = affine.min #[[$bound_map]](%[[J]])[%[[localN]]]			// TILE-02: %[[szN:.*]] = affine.min #[[$bound_map]](%[[J]])[%[[localN]]]
	// TILE-02: %[[sBj:.]] = subview %{{.}}[0, %[[J]]] [%[[K]], %[[szN]]] [1, 1] : memref<?x?xf32, #[[$strided2D]]> to memref<?x?xf32, #[[$strided2D]]>			// TILE-02: %[[sBj:.]] = subview %{{.}}[0, %[[J]]] [%[[K]], %[[szN]]] [1, 1] : memref<?x?xf32, #[[$strided2D]]> to memref<?x?xf32, #[[$strided2D]]>
	// TILE-02: %[[M:.]] = dim %{{.}}, %c0 : memref<?x?xf32, #[[$strided2D]]>			// TILE-02: %[[M:.]] = dim %{{.}}, %c0 : memref<?x?xf32, #[[$strided2D]]>
	// TILE-02: %[[localK:.]] = dim %{{.}}, %c1			// TILE-02: %[[localK:.]] = dim %{{.}}, %c1
	// TILE-02: %[[szK:.*]] = affine.min #[[$bound_map]](%[[J]])[%[[localK]]]			// TILE-02: %[[szK:.*]] = affine.min #[[$bound_map]](%[[J]])[%[[localK]]]
	// TILE-02: %[[sCj:.]] = subview %{{.}}[0, %[[J]]] [%[[M]], %[[szK]]] [1, 1] : memref<?x?xf32, #[[$strided2D]]> to memref<?x?xf32, #[[$strided2D]]>			// TILE-02: %[[sCj:.]] = subview %{{.}}[0, %[[J]]] [%[[M]], %[[szK]]] [1, 1] : memref<?x?xf32, #[[$strided2D]]> to memref<?x?xf32, #[[$strided2D]]>
	// TILE-02: linalg.matmul %{{.*}}, %[[sBj]], %[[sCj]] :			// TILE-02: linalg.matmul ins(%{{.}}, %[[sBj]]{{.}} outs(%[[sCj]]
	// TILE-02: (memref<?x?xf32, #[[$strided2D]]>,
	// TILE-02: memref<?x?xf32, #[[$strided2D]]>,
	// TILE-02: memref<?x?xf32, #[[$strided2D]]>)

	// TILE-002-LABEL: func @matmul(			// TILE-002-LABEL: func @matmul(
	// TILE-002-DAG: %[[C0:.*]] = constant 0 : index			// TILE-002-DAG: %[[C0:.*]] = constant 0 : index
	// TILE-002-DAG: %[[C2:.*]] = constant 2 : index			// TILE-002-DAG: %[[C2:.*]] = constant 2 : index
	// TILE-002: %[[ubK:.]] = dim %{{.}}, %c1 : memref<?x?xf32, #[[$strided2D]]>			// TILE-002: %[[ubK:.]] = dim %{{.}}, %c1 : memref<?x?xf32, #[[$strided2D]]>
	// TILE-002: scf.for %[[K:.]] = %{{.}}{{.}} to %[[ubK]] step %{{.}} {			// TILE-002: scf.for %[[K:.]] = %{{.}}{{.}} to %[[ubK]] step %{{.}} {
	// TILE-002: %[[M:.]] = dim %{{.}}, %c0 : memref<?x?xf32, #[[$strided2D]]>			// TILE-002: %[[M:.]] = dim %{{.}}, %c0 : memref<?x?xf32, #[[$strided2D]]>
	// TILE-002: %[[localK:.]] = dim %{{.}}, %c1			// TILE-002: %[[localK:.]] = dim %{{.}}, %c1
	// TILE-002: %[[szK:.*]] = affine.min #[[$bound_map]](%[[K]])[%[[localK]]]			// TILE-002: %[[szK:.*]] = affine.min #[[$bound_map]](%[[K]])[%[[localK]]]
	// TILE-002: %[[sAj:.]] = subview %{{.}}[0, %[[K]]] [%[[M]], %[[szK]]] [1, 1] : memref<?x?xf32, #[[$strided2D]]> to memref<?x?xf32, #[[$strided2D]]>			// TILE-002: %[[sAj:.]] = subview %{{.}}[0, %[[K]]] [%[[M]], %[[szK]]] [1, 1] : memref<?x?xf32, #[[$strided2D]]> to memref<?x?xf32, #[[$strided2D]]>
	// TILE-002: %[[localK:.]] = dim %{{.}}, %c0			// TILE-002: %[[localK:.]] = dim %{{.}}, %c0
	// TILE-002: %[[szK:.*]] = affine.min #[[$bound_map]](%[[K]])[%[[localK]]]			// TILE-002: %[[szK:.*]] = affine.min #[[$bound_map]](%[[K]])[%[[localK]]]
	// TILE-002: %[[N:.]] = dim %{{.}}, %c1 : memref<?x?xf32, #[[$strided2D]]>			// TILE-002: %[[N:.]] = dim %{{.}}, %c1 : memref<?x?xf32, #[[$strided2D]]>
	// TILE-002: %[[sBj:.]] = subview %{{.}}[%[[K]], 0] [%[[szK]], %[[N]]] [1, 1] : memref<?x?xf32, #[[$strided2D]]> to memref<?x?xf32, #[[$strided2D]]>			// TILE-002: %[[sBj:.]] = subview %{{.}}[%[[K]], 0] [%[[szK]], %[[N]]] [1, 1] : memref<?x?xf32, #[[$strided2D]]> to memref<?x?xf32, #[[$strided2D]]>
	// TILE-002: linalg.matmul %[[sAj]], %[[sBj]], %{{.*}} :			// TILE-002: linalg.matmul ins(%[[sAj]], %[[sBj]]{{.}} outs(%{{.}}
	// TILE-002: (memref<?x?xf32, #[[$strided2D]]>,
	// TILE-002: memref<?x?xf32, #[[$strided2D]]>,
	// TILE-002: memref<?x?xf32, #[[$strided2D]]>)

	// TILE-234-LABEL: func @matmul(			// TILE-234-LABEL: func @matmul(
	// TILE-234-DAG: %[[C0:.*]] = constant 0 : index			// TILE-234-DAG: %[[C0:.*]] = constant 0 : index
	// TILE-234-DAG: %[[C2:.*]] = constant 2 : index			// TILE-234-DAG: %[[C2:.*]] = constant 2 : index
	// TILE-234-DAG: %[[C3:.*]] = constant 3 : index			// TILE-234-DAG: %[[C3:.*]] = constant 3 : index
	// TILE-234-DAG: %[[C4:.*]] = constant 4 : index			// TILE-234-DAG: %[[C4:.*]] = constant 4 : index
	// TILE-234: %[[ubM:.]] = dim %{{.}}, %c0 : memref<?x?xf32, #[[$strided2D]]>			// TILE-234: %[[ubM:.]] = dim %{{.}}, %c0 : memref<?x?xf32, #[[$strided2D]]>
	// TILE-234: %[[ubK:.]] = dim %{{.}}, %c1 : memref<?x?xf32, #[[$strided2D]]>			// TILE-234: %[[ubK:.]] = dim %{{.}}, %c1 : memref<?x?xf32, #[[$strided2D]]>
	Show All 12 Lines
	// TILE-234: %[[szN:.*]] = affine.min #[[$bound_map_3]](%[[J]])[%[[localN]]]			// TILE-234: %[[szN:.*]] = affine.min #[[$bound_map_3]](%[[J]])[%[[localN]]]
	// TILE-234: %[[sBkj:.]] = subview %{{.}}[%[[K]], %[[J]]] [%[[szK]], %[[szN]]] [1, 1] : memref<?x?xf32, #[[$strided2D]]> to memref<?x?xf32, #[[$strided2D]]>			// TILE-234: %[[sBkj:.]] = subview %{{.}}[%[[K]], %[[J]]] [%[[szK]], %[[szN]]] [1, 1] : memref<?x?xf32, #[[$strided2D]]> to memref<?x?xf32, #[[$strided2D]]>
	// TILE-234: %[[localM:.]] = dim %{{.}}, %c0			// TILE-234: %[[localM:.]] = dim %{{.}}, %c0
	// TILE-234: %[[szM:.*]] = affine.min #[[$bound_map_2]](%[[I]])[%[[localM]]]			// TILE-234: %[[szM:.*]] = affine.min #[[$bound_map_2]](%[[I]])[%[[localM]]]
	// TILE-234: %[[localN:.]] = dim %{{.}}, %c1			// TILE-234: %[[localN:.]] = dim %{{.}}, %c1
	// TILE-234: %[[szN:.*]] = affine.min #[[$bound_map_3]](%[[J]])[%[[localN]]]			// TILE-234: %[[szN:.*]] = affine.min #[[$bound_map_3]](%[[J]])[%[[localN]]]
	// TILE-234: %[[sCij:.]] = subview %{{.}}[%[[I]], %[[J]]] [%[[szM]], %[[szN]]] [1, 1] : memref<?x?xf32, #[[$strided2D]]> to memref<?x?xf32, #[[$strided2D]]>			// TILE-234: %[[sCij:.]] = subview %{{.}}[%[[I]], %[[J]]] [%[[szM]], %[[szN]]] [1, 1] : memref<?x?xf32, #[[$strided2D]]> to memref<?x?xf32, #[[$strided2D]]>
	//			//
	// TILE-234: linalg.matmul %[[sAik]], %[[sBkj]], %[[sCij]] :			// TILE-234: linalg.matmul ins(%[[sAik]], %[[sBkj]]{{.*}} outs(%[[sCij]]
	// TILE-234: (memref<?x?xf32, #[[$strided2D]]>,
	// TILE-234: memref<?x?xf32, #[[$strided2D]]>,
	// TILE-234: memref<?x?xf32, #[[$strided2D]]>)

	// When the buffer shapes are known at compile time, it is possible to avoid			// When the buffer shapes are known at compile time, it is possible to avoid
	// the "min" in subview size computation. This test uses buffer sizes divisible			// the "min" in subview size computation. This test uses buffer sizes divisible
	// by respective tile sizes (M=10 divisble by 2, N=12 divisible by 2 and 3,			// by respective tile sizes (M=10 divisble by 2, N=12 divisible by 2 and 3,
	// K=16 divisble by 2 and 4).			// K=16 divisble by 2 and 4).
	func @matmul_static(%arg0: memref<10x16xf32, offset: ?, strides: [?, 1]>,			func @matmul_static(%arg0: memref<10x16xf32, offset: ?, strides: [?, 1]>,
	%arg1: memref<16x12xf32, offset: ?, strides: [?, 1]>,			%arg1: memref<16x12xf32, offset: ?, strides: [?, 1]>,
	%arg2: memref<10x12xf32, offset: ?, strides: [?, 1]>) {			%arg2: memref<10x12xf32, offset: ?, strides: [?, 1]>) {
	linalg.matmul %arg0, %arg1, %arg2 :			linalg.matmul
	(memref<10x16xf32, offset: ?, strides: [?, 1]>,			ins(%arg0, %arg1: memref<10x16xf32, offset: ?, strides: [?, 1]>,
	memref<16x12xf32, offset: ?, strides: [?, 1]>,			memref<16x12xf32, offset: ?, strides: [?, 1]>)
	memref<10x12xf32, offset: ?, strides: [?, 1]>)			outs(%arg2: memref<10x12xf32, offset: ?, strides: [?, 1]>)
	return			return
	}			}
	// TILE-2-LABEL: func @matmul_static(			// TILE-2-LABEL: func @matmul_static(
	// TILE-2-SAME: %[[ARG0:[0-9a-zA-Z]*]]: memref			// TILE-2-SAME: %[[ARG0:[0-9a-zA-Z]*]]: memref
	// TILE-2-SAME: %[[ARG1:[0-9a-zA-Z]*]]: memref			// TILE-2-SAME: %[[ARG1:[0-9a-zA-Z]*]]: memref
	// TILE-2-SAME: %[[ARG2:[0-9a-zA-Z]*]]: memref			// TILE-2-SAME: %[[ARG2:[0-9a-zA-Z]*]]: memref
	// TILE-2-DAG: %[[C0:.*]] = constant 0 : index			// TILE-2-DAG: %[[C0:.*]] = constant 0 : index
	// TILE-2-DAG: %[[C2:.*]] = constant 2 : index			// TILE-2-DAG: %[[C2:.*]] = constant 2 : index
	// TILE-2-DAG: %[[M:.*]] = constant 10 : index			// TILE-2-DAG: %[[M:.*]] = constant 10 : index
	// TILE-2: scf.for %[[I:.]] = %{{.}} to %[[M]] step %{{.*}} {			// TILE-2: scf.for %[[I:.]] = %{{.}} to %[[M]] step %{{.*}} {
	// TILE-2: %[[MIN2:.*]] = affine.min #[[$bound_map_static]](%[[I]])			// TILE-2: %[[MIN2:.*]] = affine.min #[[$bound_map_static]](%[[I]])
	// TILE-2: %[[sAi:.]] = subview %{{.}}[%[[I]], 0] [%[[MIN2]], 16] [1, 1] : memref<10x16xf32, #[[$strided2D]]> to memref<?x16xf32, #[[$strided2D]]>			// TILE-2: %[[sAi:.]] = subview %{{.}}[%[[I]], 0] [%[[MIN2]], 16] [1, 1] : memref<10x16xf32, #[[$strided2D]]> to memref<?x16xf32, #[[$strided2D]]>
	// TILE-2: %[[MIN22:.*]] = affine.min #[[$bound_map_static]](%[[I]])			// TILE-2: %[[MIN22:.*]] = affine.min #[[$bound_map_static]](%[[I]])
	// TILE-2: %[[sCi:.]] = subview %{{.}}[%[[I]], 0] [%[[MIN22]], 12] [1, 1] : memref<10x12xf32, #[[$strided2D]]> to memref<?x12xf32, #[[$strided2D]]>			// TILE-2: %[[sCi:.]] = subview %{{.}}[%[[I]], 0] [%[[MIN22]], 12] [1, 1] : memref<10x12xf32, #[[$strided2D]]> to memref<?x12xf32, #[[$strided2D]]>
	// TILE-2: linalg.matmul %[[sAi]], %{{.*}}, %[[sCi]]			// TILE-2: linalg.matmul ins(%[[sAi]], %{{.}}{{.}} outs(%[[sCi]]

	// TILE-02-LABEL: func @matmul_static(			// TILE-02-LABEL: func @matmul_static(
	// TILE-02-DAG: %[[C0:.*]] = constant 0 : index			// TILE-02-DAG: %[[C0:.*]] = constant 0 : index
	// TILE-02-DAG: %[[C2:.*]] = constant 2 : index			// TILE-02-DAG: %[[C2:.*]] = constant 2 : index
	// TILE-02-DAG: %[[N:.*]] = constant 12 : index			// TILE-02-DAG: %[[N:.*]] = constant 12 : index
	// TILE-02: scf.for %[[J:.]] = %{{.}} to %[[N]] step %{{.*}} {			// TILE-02: scf.for %[[J:.]] = %{{.}} to %[[N]] step %{{.*}} {
	// TILE-02: %[[MIN2:.*]] = affine.min #[[$bound_map_static]](%[[J]])			// TILE-02: %[[MIN2:.*]] = affine.min #[[$bound_map_static]](%[[J]])
	// TILE-02: %[[sBj:.]] = subview %{{.}}[0, %[[J]]] [16, %[[MIN2]]] [1, 1] : memref<16x12xf32, #[[$strided2D]]> to memref<16x?xf32, #[[$strided2D]]>			// TILE-02: %[[sBj:.]] = subview %{{.}}[0, %[[J]]] [16, %[[MIN2]]] [1, 1] : memref<16x12xf32, #[[$strided2D]]> to memref<16x?xf32, #[[$strided2D]]>
	// TILE-02: %[[MIN22:.*]] = affine.min #[[$bound_map_static]](%[[J]])			// TILE-02: %[[MIN22:.*]] = affine.min #[[$bound_map_static]](%[[J]])
	// TILE-02: %[[sCj:.]] = subview %{{.}}[0, %[[J]]] [10, %[[MIN22]]] [1, 1] : memref<10x12xf32, #[[$strided2D]]> to memref<10x?xf32, #[[$strided2D]]>			// TILE-02: %[[sCj:.]] = subview %{{.}}[0, %[[J]]] [10, %[[MIN22]]] [1, 1] : memref<10x12xf32, #[[$strided2D]]> to memref<10x?xf32, #[[$strided2D]]>
	// TILE-02: linalg.matmul %{{.*}}, %[[sBj]], %[[sCj]] :			// TILE-02: linalg.matmul ins(%{{.}}, %[[sBj]]{{.}} outs(%[[sCj]]
	// TILE-02: (memref<10x16xf32, #[[$strided2D]]>,
	// TILE-02: memref<16x?xf32, #[[$strided2D]]>,
	// TILE-02: memref<10x?xf32, #[[$strided2D]]>)

	// TILE-002-LABEL: func @matmul_static(			// TILE-002-LABEL: func @matmul_static(
	// TILE-002-DAG: %[[C0:.*]] = constant 0 : index			// TILE-002-DAG: %[[C0:.*]] = constant 0 : index
	// TILE-002-DAG: %[[C2:.*]] = constant 2 : index			// TILE-002-DAG: %[[C2:.*]] = constant 2 : index
	// TILE-002-DAG: %[[C16:.*]] = constant 16 : index			// TILE-002-DAG: %[[C16:.*]] = constant 16 : index
	// TILE-002: scf.for %[[K:.]] = %{{.}}{{.}} to %[[C16]] step %{{.}} {			// TILE-002: scf.for %[[K:.]] = %{{.}}{{.}} to %[[C16]] step %{{.}} {
	// TILE-002: %[[MIN2:.*]] = affine.min #[[$bound_map_static]](%[[K]])			// TILE-002: %[[MIN2:.*]] = affine.min #[[$bound_map_static]](%[[K]])
	// TILE-002: %[[sAj:.]] = subview %{{.}}[0, %[[K]]] [10, %[[MIN2]]] [1, 1] : memref<10x16xf32, #[[$strided2D]]> to memref<10x?xf32, #[[$strided2D]]>			// TILE-002: %[[sAj:.]] = subview %{{.}}[0, %[[K]]] [10, %[[MIN2]]] [1, 1] : memref<10x16xf32, #[[$strided2D]]> to memref<10x?xf32, #[[$strided2D]]>
	// TILE-002: %[[MIN22:.*]] = affine.min #[[$bound_map_static]](%[[K]])			// TILE-002: %[[MIN22:.*]] = affine.min #[[$bound_map_static]](%[[K]])
	// TILE-002: %[[sBj:.]] = subview %{{.}}[%[[K]], 0] [%[[MIN22]], 12] [1, 1] : memref<16x12xf32, #[[$strided2D]]> to memref<?x12xf32, #[[$strided2D]]>			// TILE-002: %[[sBj:.]] = subview %{{.}}[%[[K]], 0] [%[[MIN22]], 12] [1, 1] : memref<16x12xf32, #[[$strided2D]]> to memref<?x12xf32, #[[$strided2D]]>
	// TILE-002: linalg.matmul %[[sAj]], %[[sBj]], %{{.*}} :			// TILE-002: linalg.matmul ins(%[[sAj]], %[[sBj]]{{.}} outs(%{{.}}
	// TILE-002: (memref<10x?xf32, #[[$strided2D]]>,
	// TILE-002: memref<?x12xf32, #[[$strided2D]]>,
	// TILE-002: memref<10x12xf32, #[[$strided2D]]>)

	// TILE-234-LABEL: func @matmul_static(			// TILE-234-LABEL: func @matmul_static(
	// TILE-234-DAG: %[[C0:.*]] = constant 0 : index			// TILE-234-DAG: %[[C0:.*]] = constant 0 : index
	// TILE-234-DAG: %[[C2:.*]] = constant 2 : index			// TILE-234-DAG: %[[C2:.*]] = constant 2 : index
	// TILE-234-DAG: %[[C3:.*]] = constant 3 : index			// TILE-234-DAG: %[[C3:.*]] = constant 3 : index
	// TILE-234-DAG: %[[C4:.*]] = constant 4 : index			// TILE-234-DAG: %[[C4:.*]] = constant 4 : index
	// TILE-234-DAG: %[[C10:.*]] = constant 10 : index			// TILE-234-DAG: %[[C10:.*]] = constant 10 : index
	// TILE-234-DAG: %[[C16:.*]] = constant 16 : index			// TILE-234-DAG: %[[C16:.*]] = constant 16 : index
	// TILE-234-DAG: %[[C12:.*]] = constant 12 : index			// TILE-234-DAG: %[[C12:.*]] = constant 12 : index
	// TILE-234: scf.for %[[I:.]] = %{{.}}{{.}} to %[[C10]] step %{{.}} {			// TILE-234: scf.for %[[I:.]] = %{{.}}{{.}} to %[[C10]] step %{{.}} {
	// TILE-234: scf.for %[[J:.]] = %{{.}}{{.}} to %[[C12]] step %{{.}} {			// TILE-234: scf.for %[[J:.]] = %{{.}}{{.}} to %[[C12]] step %{{.}} {
	// TILE-234: scf.for %[[K:.]] = %{{.}}{{.}} to %[[C16]] step %{{.}} {			// TILE-234: scf.for %[[K:.]] = %{{.}}{{.}} to %[[C16]] step %{{.}} {
	// TILE-234: %[[sAik:.]] = subview %{{.}}[%[[I]], %[[K]]] [%{{.}}, %{{.}}] [1, 1] : memref<10x16xf32, #[[$strided2D]]> to memref<?x?xf32, #[[$strided2D]]>			// TILE-234: %[[sAik:.]] = subview %{{.}}[%[[I]], %[[K]]] [%{{.}}, %{{.}}] [1, 1] : memref<10x16xf32, #[[$strided2D]]> to memref<?x?xf32, #[[$strided2D]]>
	// TILE-234: %[[sBkj:.]] = subview %{{.}}[%[[K]], %[[J]]] [%{{.}}, %{{.}}] [1, 1] : memref<16x12xf32, #[[$strided2D]]> to memref<?x?xf32, #[[$strided2D]]>			// TILE-234: %[[sBkj:.]] = subview %{{.}}[%[[K]], %[[J]]] [%{{.}}, %{{.}}] [1, 1] : memref<16x12xf32, #[[$strided2D]]> to memref<?x?xf32, #[[$strided2D]]>
	// TILE-234: %[[sCij:.]] = subview %{{.}}[%[[I]], %[[J]]] [%{{.}}, %{{.}}] [1, 1] : memref<10x12xf32, #[[$strided2D]]> to memref<?x?xf32, #[[$strided2D]]>			// TILE-234: %[[sCij:.]] = subview %{{.}}[%[[I]], %[[J]]] [%{{.}}, %{{.}}] [1, 1] : memref<10x12xf32, #[[$strided2D]]> to memref<?x?xf32, #[[$strided2D]]>
	//			//
	// TILE-234: linalg.matmul %[[sAik]], %[[sBkj]], %[[sCij]] :			// TILE-234: linalg.matmul ins(%[[sAik]], %[[sBkj]]{{.*}} outs(%[[sCij]]
	// TILE-234: (memref<?x?xf32, #[[$strided2D]]>,
	// TILE-234: memref<?x?xf32, #[[$strided2D]]>,
	// TILE-234: memref<?x?xf32, #[[$strided2D]]>)

	func @matvec(%arg0: memref<?x?xf32, offset: ?, strides: [?, 1]>, %arg1: memref<?xf32, offset: ?, strides: [1]>, %arg2: memref<?xf32, offset: ?, strides: [1]>) {			func @matvec(%arg0: memref<?x?xf32, offset: ?, strides: [?, 1]>, %arg1: memref<?xf32, offset: ?, strides: [1]>, %arg2: memref<?xf32, offset: ?, strides: [1]>) {
	linalg.matvec %arg0, %arg1, %arg2 : (			linalg.matvec
	memref<?x?xf32, offset: ?, strides: [?, 1]>,			ins(%arg0, %arg1: memref<?x?xf32, offset: ?, strides: [?, 1]>,
	memref<?xf32, offset: ?, strides: [1]>,
	memref<?xf32, offset: ?, strides: [1]>)			memref<?xf32, offset: ?, strides: [1]>)
				outs(%arg2: memref<?xf32, offset: ?, strides: [1]>)
	return			return
	}			}
	// TILE-2-LABEL: func @matvec(			// TILE-2-LABEL: func @matvec(
	// TILE-2-SAME: %[[ARG0:[0-9a-zA-Z]*]]: memref			// TILE-2-SAME: %[[ARG0:[0-9a-zA-Z]*]]: memref
	// TILE-2-SAME: %[[ARG1:[0-9a-zA-Z]*]]: memref			// TILE-2-SAME: %[[ARG1:[0-9a-zA-Z]*]]: memref
	// TILE-2-SAME: %[[ARG2:[0-9a-zA-Z]*]]: memref			// TILE-2-SAME: %[[ARG2:[0-9a-zA-Z]*]]: memref
	// TILE-2-DAG: %[[C0:.*]] = constant 0 : index			// TILE-2-DAG: %[[C0:.*]] = constant 0 : index
	// TILE-2-DAG: %[[C2:.*]] = constant 2 : index			// TILE-2-DAG: %[[C2:.*]] = constant 2 : index
	// TILE-2: %[[M:.]] = dim %{{.}}, %c0 : memref<?x?xf32, #[[$strided2D]]>			// TILE-2: %[[M:.]] = dim %{{.}}, %c0 : memref<?x?xf32, #[[$strided2D]]>
	// TILE-2: scf.for %[[I:.]] = %{{.}}{{.}} to %[[M]] step %{{.}} {			// TILE-2: scf.for %[[I:.]] = %{{.}}{{.}} to %[[M]] step %{{.}} {
	// TILE-2: %[[localM:.*]] = dim %[[ARG0]], %c0			// TILE-2: %[[localM:.*]] = dim %[[ARG0]], %c0
	// TILE-2: %[[szM:.*]] = affine.min #[[$bound_map]](%[[I]])[%[[localM]]]			// TILE-2: %[[szM:.*]] = affine.min #[[$bound_map]](%[[I]])[%[[localM]]]
	// TILE-2: %[[N:.]] = dim %{{.}}, %c1 : memref<?x?xf32, #[[$strided2D]]>			// TILE-2: %[[N:.]] = dim %{{.}}, %c1 : memref<?x?xf32, #[[$strided2D]]>
	// TILE-2: %[[sAi:.]] = subview %{{.}}[%[[I]], 0] [%[[szM]], %[[N]]] [1, 1] : memref<?x?xf32, #[[$strided2D]]> to memref<?x?xf32, #[[$strided2D]]>			// TILE-2: %[[sAi:.]] = subview %{{.}}[%[[I]], 0] [%[[szM]], %[[N]]] [1, 1] : memref<?x?xf32, #[[$strided2D]]> to memref<?x?xf32, #[[$strided2D]]>
	// TILE-2: %[[localN:.]] = dim %{{.}}, %c0			// TILE-2: %[[localN:.]] = dim %{{.}}, %c0
	// TILE-2: %[[szN:.*]] = affine.min #[[$bound_map]](%[[I]])[%[[localN]]]			// TILE-2: %[[szN:.*]] = affine.min #[[$bound_map]](%[[I]])[%[[localN]]]
	// TILE-2: %[[sCi:.]] = subview %{{.}}[%[[I]]] [%[[szN]]] [1] : memref<?xf32, #[[$strided1D]]> to memref<?xf32, #[[$strided1D]]>			// TILE-2: %[[sCi:.]] = subview %{{.}}[%[[I]]] [%[[szN]]] [1] : memref<?xf32, #[[$strided1D]]> to memref<?xf32, #[[$strided1D]]>
	// TILE-2: linalg.matvec %[[sAi]], %{{.*}}, %[[sCi]] : (memref<?x?xf32, #[[$strided2D]]>, memref<?xf32, #[[$strided1D]]>, memref<?xf32, #[[$strided1D]]>)			// TILE-2: linalg.matvec ins(%[[sAi]], %{{.*}} outs(%[[sCi]]

	// TILE-02-LABEL: func @matvec(			// TILE-02-LABEL: func @matvec(
	// TILE-02-SAME: %[[ARG0:[0-9a-zA-Z]*]]: memref			// TILE-02-SAME: %[[ARG0:[0-9a-zA-Z]*]]: memref
	// TILE-02-SAME: %[[ARG1:[0-9a-zA-Z]*]]: memref			// TILE-02-SAME: %[[ARG1:[0-9a-zA-Z]*]]: memref
	// TILE-02-SAME: %[[ARG2:[0-9a-zA-Z]*]]: memref			// TILE-02-SAME: %[[ARG2:[0-9a-zA-Z]*]]: memref
	// TILE-02-DAG: %[[C0:.*]] = constant 0 : index			// TILE-02-DAG: %[[C0:.*]] = constant 0 : index
	// TILE-02-DAG: %[[C2:.*]] = constant 2 : index			// TILE-02-DAG: %[[C2:.*]] = constant 2 : index
	// TILE-02: %[[K:.]] = dim %{{.}}, %c1 : memref<?x?xf32, #[[$strided2D]]>			// TILE-02: %[[K:.]] = dim %{{.}}, %c1 : memref<?x?xf32, #[[$strided2D]]>
	// TILE-02: scf.for %[[J:.]] = %{{.}}{{.}} to %[[K]] step %{{.}} {			// TILE-02: scf.for %[[J:.]] = %{{.}}{{.}} to %[[K]] step %{{.}} {
	// TILE-02: %[[M:.]] = dim %{{.}}, %c0 : memref<?x?xf32, #[[$strided2D]]>			// TILE-02: %[[M:.]] = dim %{{.}}, %c0 : memref<?x?xf32, #[[$strided2D]]>
	// TILE-02: %[[localN:.]] = dim %{{.}}, %c1			// TILE-02: %[[localN:.]] = dim %{{.}}, %c1
	// TILE-02: %[[szN:.*]] = affine.min #[[$bound_map]](%[[J]])[%[[localN]]]			// TILE-02: %[[szN:.*]] = affine.min #[[$bound_map]](%[[J]])[%[[localN]]]
	// TILE-02: %[[sAj:.]] = subview %{{.}}[0, %[[J]]] [%[[M]], %[[szN]]] [1, 1] : memref<?x?xf32, #[[$strided2D]]> to memref<?x?xf32, #[[$strided2D]]>			// TILE-02: %[[sAj:.]] = subview %{{.}}[0, %[[J]]] [%[[M]], %[[szN]]] [1, 1] : memref<?x?xf32, #[[$strided2D]]> to memref<?x?xf32, #[[$strided2D]]>
	// TILE-02: %[[localN:.]] = dim %{{.}}, %c0			// TILE-02: %[[localN:.]] = dim %{{.}}, %c0
	// TILE-02: %[[szN:.*]] = affine.min #[[$bound_map]](%[[J]])[%[[localN]]]			// TILE-02: %[[szN:.*]] = affine.min #[[$bound_map]](%[[J]])[%[[localN]]]
	// TILE-02: %[[sBj:.]] = subview %{{.}}[%[[J]]] [%[[szN]]] [1] : memref<?xf32, #[[$strided1D]]> to memref<?xf32, #[[$strided1D]]>			// TILE-02: %[[sBj:.]] = subview %{{.}}[%[[J]]] [%[[szN]]] [1] : memref<?xf32, #[[$strided1D]]> to memref<?xf32, #[[$strided1D]]>
	// TILE-02: linalg.matvec %[[sAj]], %[[sBj]], %{{.*}} : (memref<?x?xf32, #[[$strided2D]]>, memref<?xf32, #[[$strided1D]]>, memref<?xf32, #[[$strided1D]]>)			// TILE-02: linalg.matvec ins(%[[sAj]], %[[sBj]]{{.}} outs(%{{.}}

	// TILE-002-LABEL: func @matvec(			// TILE-002-LABEL: func @matvec(
	// TILE-002-SAME: %[[ARG0:[0-9a-zA-Z]*]]: memref			// TILE-002-SAME: %[[ARG0:[0-9a-zA-Z]*]]: memref
	// TILE-002-SAME: %[[ARG1:[0-9a-zA-Z]*]]: memref			// TILE-002-SAME: %[[ARG1:[0-9a-zA-Z]*]]: memref
	// TILE-002-SAME: %[[ARG2:[0-9a-zA-Z]*]]: memref			// TILE-002-SAME: %[[ARG2:[0-9a-zA-Z]*]]: memref
	// TILE-002-NOT: scf.for			// TILE-002-NOT: scf.for

	// TILE-234-LABEL: func @matvec(			// TILE-234-LABEL: func @matvec(
	Show All 14 Lines
	// TILE-234: %[[sAij:.]] = subview %{{.}}[%[[I]], %[[J]]] [%[[szM]], %[[szN]]] [1, 1] : memref<?x?xf32, #[[$strided2D]]> to memref<?x?xf32, #[[$strided2D]]>			// TILE-234: %[[sAij:.]] = subview %{{.}}[%[[I]], %[[J]]] [%[[szM]], %[[szN]]] [1, 1] : memref<?x?xf32, #[[$strided2D]]> to memref<?x?xf32, #[[$strided2D]]>
	// TILE-234: %[[localN:.]] = dim %{{.}}, %c0			// TILE-234: %[[localN:.]] = dim %{{.}}, %c0
	// TILE-234: %[[szN:.*]] = affine.min #[[$bound_map_3]](%[[J]])[%[[localN]]]			// TILE-234: %[[szN:.*]] = affine.min #[[$bound_map_3]](%[[J]])[%[[localN]]]
	// TILE-234: %[[sBj:.]] = subview %{{.}}[%[[J]]] [%[[szN]]] [1] : memref<?xf32, #[[$strided1D]]> to memref<?xf32, #[[$strided1D]]>			// TILE-234: %[[sBj:.]] = subview %{{.}}[%[[J]]] [%[[szN]]] [1] : memref<?xf32, #[[$strided1D]]> to memref<?xf32, #[[$strided1D]]>
	// TILE-234: %[[localM:.]] = dim %{{.}}, %c0			// TILE-234: %[[localM:.]] = dim %{{.}}, %c0
	// TILE-234: %[[szM:.*]] = affine.min #[[$bound_map_2]](%[[I]])[%[[localM]]]			// TILE-234: %[[szM:.*]] = affine.min #[[$bound_map_2]](%[[I]])[%[[localM]]]
	// TILE-234: %[[sCi:.]] = subview %{{.}}[%[[I]]] [%[[szM]]] [1] : memref<?xf32, #[[$strided1D]]> to memref<?xf32, #[[$strided1D]]>			// TILE-234: %[[sCi:.]] = subview %{{.}}[%[[I]]] [%[[szM]]] [1] : memref<?xf32, #[[$strided1D]]> to memref<?xf32, #[[$strided1D]]>
	//			//
	// TILE-234: linalg.matvec %[[sAij]], %[[sBj]], %[[sCi]] : (memref<?x?xf32, #[[$strided2D]]>, memref<?xf32, #[[$strided1D]]>, memref<?xf32, #[[$strided1D]]>)			// TILE-234: linalg.matvec ins(%[[sAij]], %[[sBj]]{{.*}} outs(%[[sCi]]

	func @dot(%arg0: memref<?xf32, offset: ?, strides: [1]>, %arg1: memref<?xf32, offset: ?, strides: [1]>, %arg2: memref<f32>) {			func @dot(%arg0: memref<?xf32, offset: ?, strides: [1]>, %arg1: memref<?xf32, offset: ?, strides: [1]>, %arg2: memref<f32>) {
	linalg.dot %arg0, %arg1, %arg2 : (memref<?xf32, offset: ?, strides: [1]>,			linalg.dot
	memref<?xf32, offset: ?, strides: [1]>,			ins(%arg0, %arg1: memref<?xf32, offset: ?, strides: [1]>, memref<?xf32, offset: ?, strides: [1]>)
	memref<f32>)			outs(%arg2: memref<f32>)
	return			return
	}			}
	// TILE-2-LABEL: func @dot(			// TILE-2-LABEL: func @dot(
	// TILE-2-DAG: %[[C0:.*]] = constant 0 : index			// TILE-2-DAG: %[[C0:.*]] = constant 0 : index
	// TILE-2-DAG: %[[C2:.*]] = constant 2 : index			// TILE-2-DAG: %[[C2:.*]] = constant 2 : index
	// TILE-2: %[[M:.]] = dim %{{.}}, %c0 : memref<?xf32, #[[$strided1D]]>			// TILE-2: %[[M:.]] = dim %{{.}}, %c0 : memref<?xf32, #[[$strided1D]]>
	// TILE-2: scf.for %[[I:.]] = %{{.}}{{.}} to %[[M]] step %{{.}} {			// TILE-2: scf.for %[[I:.]] = %{{.}}{{.}} to %[[M]] step %{{.}} {
	// TILE-2: %[[localM:.]] = dim %{{.}}, %c0			// TILE-2: %[[localM:.]] = dim %{{.}}, %c0
	// TILE-2: %[[szM:.*]] = affine.min #[[$bound_map]](%[[I]])[%[[localM]]]			// TILE-2: %[[szM:.*]] = affine.min #[[$bound_map]](%[[I]])[%[[localM]]]
	// TILE-2: %[[sAi:.]] = subview %{{.}}[%[[I]]] [%[[szM]]] [1] : memref<?xf32, #[[$strided1D]]> to memref<?xf32, #[[$strided1D]]>			// TILE-2: %[[sAi:.]] = subview %{{.}}[%[[I]]] [%[[szM]]] [1] : memref<?xf32, #[[$strided1D]]> to memref<?xf32, #[[$strided1D]]>
	// TILE-2: %[[localM:.]] = dim %{{.}}, %c0			// TILE-2: %[[localM:.]] = dim %{{.}}, %c0
	// TILE-2: %[[szM:.*]] = affine.min #[[$bound_map]](%[[I]])[%[[localM]]]			// TILE-2: %[[szM:.*]] = affine.min #[[$bound_map]](%[[I]])[%[[localM]]]
	// TILE-2: %[[sBi:.]] = subview %{{.}}[%[[I]]] [%[[szM]]] [1] : memref<?xf32, #[[$strided1D]]> to memref<?xf32, #[[$strided1D]]>			// TILE-2: %[[sBi:.]] = subview %{{.}}[%[[I]]] [%[[szM]]] [1] : memref<?xf32, #[[$strided1D]]> to memref<?xf32, #[[$strided1D]]>
	// TILE-2: linalg.dot %[[sAi]], %[[sBi]], {{.*}} : (memref<?xf32, #[[$strided1D]]>, memref<?xf32, #[[$strided1D]]>, memref<f32>)			// TILE-2: linalg.dot ins(%[[sAi]], %[[sBi]]{{.*}} outs(

	// TILE-02-LABEL: func @dot(			// TILE-02-LABEL: func @dot(
	// TILE-02-NOT: scf.for			// TILE-02-NOT: scf.for

	// TILE-002-LABEL: func @dot(			// TILE-002-LABEL: func @dot(
	// TILE-002-NOT: scf.for			// TILE-002-NOT: scf.for

	// TILE-234-LABEL: func @dot(			// TILE-234-LABEL: func @dot(
	// TILE-234-DAG: %[[C0:.*]] = constant 0 : index			// TILE-234-DAG: %[[C0:.*]] = constant 0 : index
	// TILE-234-DAG: %[[C2:.*]] = constant 2 : index			// TILE-234-DAG: %[[C2:.*]] = constant 2 : index
	// TILE-234: %[[ubK:.]] = dim %{{.}}, %c0 : memref<?xf32, #[[$strided1D]]>			// TILE-234: %[[ubK:.]] = dim %{{.}}, %c0 : memref<?xf32, #[[$strided1D]]>
	// TILE-234: scf.for %[[I:.]] = %{{.}} to %[[ubK]] step %{{.*}} {			// TILE-234: scf.for %[[I:.]] = %{{.}} to %[[ubK]] step %{{.*}} {
	// TILE-234: %[[localM:.]] = dim %{{.}}, %c0			// TILE-234: %[[localM:.]] = dim %{{.}}, %c0
	// TILE-234: %[[szM:.*]] = affine.min #[[$bound_map_2]](%[[I]])[%[[localM]]]			// TILE-234: %[[szM:.*]] = affine.min #[[$bound_map_2]](%[[I]])[%[[localM]]]
	// TILE-234: %[[sAi:.]] = subview %{{.}}[%[[I]]] [%[[szM]]] [1] : memref<?xf32, #[[$strided1D]]> to memref<?xf32, #[[$strided1D]]>			// TILE-234: %[[sAi:.]] = subview %{{.}}[%[[I]]] [%[[szM]]] [1] : memref<?xf32, #[[$strided1D]]> to memref<?xf32, #[[$strided1D]]>
	// TILE-234: %[[localM:.]] = dim %{{.}}, %c0			// TILE-234: %[[localM:.]] = dim %{{.}}, %c0
	// TILE-234: %[[szM:.*]] = affine.min #[[$bound_map_2]](%[[I]])[%[[localM]]]			// TILE-234: %[[szM:.*]] = affine.min #[[$bound_map_2]](%[[I]])[%[[localM]]]
	// TILE-234: %[[sBi:.]] = subview %{{.}}[%[[I]]] [%[[szM]]] [1] : memref<?xf32, #[[$strided1D]]> to memref<?xf32, #[[$strided1D]]>			// TILE-234: %[[sBi:.]] = subview %{{.}}[%[[I]]] [%[[szM]]] [1] : memref<?xf32, #[[$strided1D]]> to memref<?xf32, #[[$strided1D]]>
	// TILE-234: linalg.dot %[[sAi]], %[[sBi]], %{{.*}} : (memref<?xf32, #[[$strided1D]]>, memref<?xf32, #[[$strided1D]]>, memref<f32>)			// TILE-234: linalg.dot ins(%[[sAi]], %[[sBi]]{{.*}} outs(

	func @fill_static(%arg0: memref<127x99xf32>, %arg1: f32) {			func @fill_static(%arg0: memref<127x99xf32>, %arg1: f32) {
	linalg.fill(%arg0, %arg1) : memref<127x99xf32>, f32			linalg.fill(%arg0, %arg1) : memref<127x99xf32>, f32
	return			return
	}			}
	// TILE-2-LABEL: func @fill_static			// TILE-2-LABEL: func @fill_static
	// TILE-2: for			// TILE-2: for
	// TILE-2-NOT: for			// TILE-2-NOT: for
	▲ Show 20 Lines • Show All 81 Lines • Show Last 20 Lines

mlir/test/Dialect/Linalg/tile_parallel_reduce.mlir

	// RUN: mlir-opt %s -linalg-tile-to-parallel-loops="linalg-tile-sizes=2,4,8" -split-input-file \| FileCheck %s			// RUN: mlir-opt %s -linalg-tile-to-parallel-loops="linalg-tile-sizes=2,4,8" -split-input-file \| FileCheck %s
	// RUN: mlir-opt %s -linalg-tile-to-parallel-loops="linalg-tile-sizes=2" -split-input-file \| FileCheck %s -check-prefix=TILE1			// RUN: mlir-opt %s -linalg-tile-to-parallel-loops="linalg-tile-sizes=2" -split-input-file \| FileCheck %s -check-prefix=TILE1
	// RUN: mlir-opt %s -linalg-tile-to-parallel-loops="linalg-tile-sizes=2,4" -split-input-file \| FileCheck %s -check-prefix=TILE2			// RUN: mlir-opt %s -linalg-tile-to-parallel-loops="linalg-tile-sizes=2,4" -split-input-file \| FileCheck %s -check-prefix=TILE2

	func @gemm(%arg0 : memref<?x?xf32>,			func @gemm(%arg0 : memref<?x?xf32>,
	%arg1 : memref<?x?xf32>,			%arg1 : memref<?x?xf32>,
	%arg2 : memref<?x?xf32>)			%arg2 : memref<?x?xf32>)
	{			{
	linalg.matmul %arg0, %arg1, %arg2			linalg.matmul ins(%arg0, %arg1: memref<?x?xf32>, memref<?x?xf32>)
	: (memref<?x?xf32>, memref<?x?xf32>, memref<?x?xf32>)			outs(%arg2: memref<?x?xf32>)
	return			return
	}			}
	// CHECK-LABEL: func @gemm			// CHECK-LABEL: func @gemm
	// CHECK-DAG: %[[C2:.*]] = constant 2 : index			// CHECK-DAG: %[[C2:.*]] = constant 2 : index
	// CHECK-DAG: %[[C4:.*]] = constant 4 : index			// CHECK-DAG: %[[C4:.*]] = constant 4 : index
	// CHECK-DAG: %[[C8:.*]] = constant 8 : index			// CHECK-DAG: %[[C8:.*]] = constant 8 : index
	// CHECK: scf.parallel (%[[ARG3:.]], %[[ARG4:.]]) =			// CHECK: scf.parallel (%[[ARG3:.]], %[[ARG4:.]]) =
	// CHECK-SAME: step (%[[C2]], %[[C4]])			// CHECK-SAME: step (%[[C2]], %[[C4]])
	// CHECK: scf.for %[[ARG5:.*]] =			// CHECK: scf.for %[[ARG5:.*]] =
	// CHECK-SAME: step %[[C8]]			// CHECK-SAME: step %[[C8]]
	// CHECK: %[[SV1:.]] = subview %{{.}}[%[[ARG3]], %[[ARG5]]]			// CHECK: %[[SV1:.]] = subview %{{.}}[%[[ARG3]], %[[ARG5]]]
	// CHECK: %[[SV2:.]] = subview %{{.}}[%[[ARG5]], %[[ARG4]]]			// CHECK: %[[SV2:.]] = subview %{{.}}[%[[ARG5]], %[[ARG4]]]
	// CHECK: %[[SV3:.]] = subview %{{.}}[%[[ARG3]], %[[ARG4]]]			// CHECK: %[[SV3:.]] = subview %{{.}}[%[[ARG3]], %[[ARG4]]]
	// CHECK: linalg.matmul %[[SV1]], %[[SV2]], %[[SV3]]			// CHECK: linalg.matmul ins(%[[SV1]], %[[SV2]]{{.*}} outs(%[[SV3]]

	// TILE1-LABEL: func @gemm			// TILE1-LABEL: func @gemm
	// TILE1-DAG: %[[C2:.*]] = constant 2 : index			// TILE1-DAG: %[[C2:.*]] = constant 2 : index
	// TILE1: scf.parallel (%[[ARG3:.*]]) =			// TILE1: scf.parallel (%[[ARG3:.*]]) =
	// TILE1-SAME: step (%[[C2]])			// TILE1-SAME: step (%[[C2]])
	// TILE1: %[[SV1:.]] = subview %{{.}}[%[[ARG3]], 0]			// TILE1: %[[SV1:.]] = subview %{{.}}[%[[ARG3]], 0]
	// TILE1: %[[SV3:.]] = subview %{{.}}[%[[ARG3]], 0]			// TILE1: %[[SV3:.]] = subview %{{.}}[%[[ARG3]], 0]
	// TILE1-NOT: subview			// TILE1-NOT: subview
	// TILE1: linalg.matmul %[[SV1]], %{{.*}}, %[[SV3]]			// TILE1: linalg.matmul ins(%[[SV1]], %{{.*}} outs(%[[SV3]]

	// TILE2-LABEL: func @gemm			// TILE2-LABEL: func @gemm
	// TILE2-DAG: %[[C2:.*]] = constant 2 : index			// TILE2-DAG: %[[C2:.*]] = constant 2 : index
	// TILE2-DAG: %[[C4:.*]] = constant 4 : index			// TILE2-DAG: %[[C4:.*]] = constant 4 : index
	// TILE2: scf.parallel (%[[ARG3:.]], %[[ARG4:.]]) =			// TILE2: scf.parallel (%[[ARG3:.]], %[[ARG4:.]]) =
	// TILE2-SAME: step (%[[C2]], %[[C4]])			// TILE2-SAME: step (%[[C2]], %[[C4]])
	// TILE2: %[[SV1:.]] = subview %{{.}}[%[[ARG3]], 0]			// TILE2: %[[SV1:.]] = subview %{{.}}[%[[ARG3]], 0]
	// TILE2: %[[SV2:.]] = subview %{{.}}[0, %[[ARG4]]]			// TILE2: %[[SV2:.]] = subview %{{.}}[0, %[[ARG4]]]
	// TILE2: %[[SV3:.]] = subview %{{.}}[%[[ARG3]], %[[ARG4]]]			// TILE2: %[[SV3:.]] = subview %{{.}}[%[[ARG3]], %[[ARG4]]]
	// TILE2: linalg.matmul %[[SV1]], %[[SV2]], %[[SV3]]			// TILE2: linalg.matmul ins(%[[SV1]], %[[SV2]]{{.*}} outs(%[[SV3]]

	// -----			// -----

	#map0 = affine_map<(d0, d1, d2) -> (d0, d1, d2)>			#map0 = affine_map<(d0, d1, d2) -> (d0, d1, d2)>
	#map1 = affine_map<(d0, d1, d2) -> (d0, d2)>			#map1 = affine_map<(d0, d1, d2) -> (d0, d2)>
	#map2 = affine_map<(d0, d1, d2) -> (d1)>			#map2 = affine_map<(d0, d1, d2) -> (d1)>
	#accesses = [#map0, #map1, #map2]			#accesses = [#map0, #map1, #map2]
	#trait = {			#trait = {
	▲ Show 20 Lines • Show All 57 Lines • Show Last 20 Lines

mlir/test/Dialect/Linalg/transform-patterns-matmul-to-vector.mlir

	// RUN: mlir-opt %s -test-linalg-transform-patterns=test-matmul-to-vector-patterns-tile-1d \| FileCheck %s			// RUN: mlir-opt %s -test-linalg-transform-patterns=test-matmul-to-vector-patterns-tile-1d \| FileCheck %s
	// RUN: mlir-opt %s -test-linalg-transform-patterns=test-matmul-to-vector-patterns-tile-2d \| FileCheck %s			// RUN: mlir-opt %s -test-linalg-transform-patterns=test-matmul-to-vector-patterns-tile-2d \| FileCheck %s
	// RUN: mlir-opt %s -test-linalg-transform-patterns=test-contraction-to-vector-patterns \| FileCheck %s --check-prefix=VECTOR-CONTRACTION			// RUN: mlir-opt %s -test-linalg-transform-patterns=test-contraction-to-vector-patterns \| FileCheck %s --check-prefix=VECTOR-CONTRACTION

	func @matmul(%A: memref<1584x1584xf32, offset: 0, strides: [1584, 1]>,			func @matmul(%A: memref<1584x1584xf32, offset: 0, strides: [1584, 1]>,
	%B: memref<1584x1584xf32, offset: 0, strides: [1584, 1]>,			%B: memref<1584x1584xf32, offset: 0, strides: [1584, 1]>,
	%C: memref<1584x1584xf32, offset: 0, strides: [1584, 1]>) {			%C: memref<1584x1584xf32, offset: 0, strides: [1584, 1]>) {
	linalg.matmul %A, %B, %C {__internal_linalg_transform__ = "START"} :			linalg.matmul {__internal_linalg_transform__ = "START"}
	(memref<1584x1584xf32, offset: 0, strides: [1584, 1]>,			ins(%A, %B: memref<1584x1584xf32, offset: 0, strides: [1584, 1]>,
	memref<1584x1584xf32, offset: 0, strides: [1584, 1]>,
	memref<1584x1584xf32, offset: 0, strides: [1584, 1]>)			memref<1584x1584xf32, offset: 0, strides: [1584, 1]>)
				outs(%C: memref<1584x1584xf32, offset: 0, strides: [1584, 1]>)
	return			return
	}			}

	// CHECK-LABEL:func @matmul			// CHECK-LABEL:func @matmul
	// CHECK: vector.broadcast {{.*}} : f32 to vector<8x16xf32>			// CHECK: vector.broadcast {{.*}} : f32 to vector<8x16xf32>
	// CHECK: store {{.*}}[] : memref<vector<8x16xf32>>			// CHECK: store {{.*}}[] : memref<vector<8x16xf32>>
	//			//
	// CHECK: vector.broadcast {{.*}} : f32 to vector<16x12xf32>			// CHECK: vector.broadcast {{.*}} : f32 to vector<16x12xf32>
	Show All 11 Lines
	// CHECK-SAME: : vector<8x16xf32>, vector<16x12xf32> into vector<8x12xf32>			// CHECK-SAME: : vector<8x16xf32>, vector<16x12xf32> into vector<8x12xf32>
	//			//
	// CHECK: linalg.copy			// CHECK: linalg.copy

	// VECTOR-CONTRACTION-LABEL: contraction_dot			// VECTOR-CONTRACTION-LABEL: contraction_dot
	func @contraction_dot(%A: memref<1584xf32>, %B: memref<1584xf32>, %C: memref<f32>) {			func @contraction_dot(%A: memref<1584xf32>, %B: memref<1584xf32>, %C: memref<f32>) {
	// VECTOR-CONTRACTION: vector.contract			// VECTOR-CONTRACTION: vector.contract
	// VECTOR-CONTRACTION-SAME: vector<1584xf32>, vector<1584xf32> into f32			// VECTOR-CONTRACTION-SAME: vector<1584xf32>, vector<1584xf32> into f32
	linalg.dot %A, %B, %C : (memref<1584xf32>, memref<1584xf32>, memref<f32>)			linalg.dot ins(%A, %B: memref<1584xf32>, memref<1584xf32>)
				outs(%C: memref<f32>)
	return			return
	}			}

	// VECTOR-CONTRACTION-LABEL: contraction_matvec			// VECTOR-CONTRACTION-LABEL: contraction_matvec
	func @contraction_matvec(%A: memref<1584x1584xf32>, %B: memref<1584xf32>, %C: memref<1584xf32>) {			func @contraction_matvec(%A: memref<1584x1584xf32>, %B: memref<1584xf32>, %C: memref<1584xf32>) {
	// VECTOR-CONTRACTION: vector.contract			// VECTOR-CONTRACTION: vector.contract
	// VECTOR-CONTRACTION-SAME: vector<1584x1584xf32>, vector<1584xf32> into vector<1584xf32>			// VECTOR-CONTRACTION-SAME: vector<1584x1584xf32>, vector<1584xf32> into vector<1584xf32>
	linalg.matvec %A, %B, %C :			linalg.matvec ins(%A, %B: memref<1584x1584xf32>, memref<1584xf32>)
	(memref<1584x1584xf32>, memref<1584xf32>, memref<1584xf32>)			outs(%C: memref<1584xf32>)
	return			return
	}			}

	// VECTOR-CONTRACTION-LABEL: contraction_matmul			// VECTOR-CONTRACTION-LABEL: contraction_matmul
	func @contraction_matmul(%A: memref<1584x1584xf32>, %B: memref<1584x1584xf32>, %C: memref<1584x1584xf32>) {			func @contraction_matmul(%A: memref<1584x1584xf32>, %B: memref<1584x1584xf32>, %C: memref<1584x1584xf32>) {
	// VECTOR-CONTRACTION: vector.contract			// VECTOR-CONTRACTION: vector.contract
	// VECTOR-CONTRACTION-SAME: vector<1584x1584xf32>, vector<1584x1584xf32> into vector<1584x1584xf32>			// VECTOR-CONTRACTION-SAME: vector<1584x1584xf32>, vector<1584x1584xf32> into vector<1584x1584xf32>
	linalg.matmul %A, %B, %C :			linalg.matmul ins(%A, %B: memref<1584x1584xf32>, memref<1584x1584xf32>)
	(memref<1584x1584xf32>, memref<1584x1584xf32>, memref<1584x1584xf32>)			outs(%C: memref<1584x1584xf32>)
	return			return
	}			}

	// VECTOR-CONTRACTION-LABEL: contraction_batch_matmul			// VECTOR-CONTRACTION-LABEL: contraction_batch_matmul
	func @contraction_batch_matmul(%A: memref<1584x1584x1584xf32>, %B: memref<1584x1584x1584xf32>, %C: memref<1584x1584x1584xf32>) {			func @contraction_batch_matmul(%A: memref<1584x1584x1584xf32>, %B: memref<1584x1584x1584xf32>, %C: memref<1584x1584x1584xf32>) {
	// VECTOR-CONTRACTION: vector.contract			// VECTOR-CONTRACTION: vector.contract
	// VECTOR-CONTRACTION-SAME: vector<1584x1584x1584xf32>, vector<1584x1584x1584xf32> into vector<1584x1584x1584xf32>			// VECTOR-CONTRACTION-SAME: vector<1584x1584x1584xf32>, vector<1584x1584x1584xf32> into vector<1584x1584x1584xf32>
	linalg.batch_matmul %A, %B, %C :			linalg.batch_matmul
	(memref<1584x1584x1584xf32>, memref<1584x1584x1584xf32>, memref<1584x1584x1584xf32>)			ins(%A, %B: memref<1584x1584x1584xf32>, memref<1584x1584x1584xf32>)
				outs(%C: memref<1584x1584x1584xf32>)
	return			return
	}			}

mlir/test/Dialect/Linalg/transform-patterns.mlir

	// RUN: mlir-opt %s -test-linalg-transform-patterns=test-patterns \| FileCheck %s			// RUN: mlir-opt %s -test-linalg-transform-patterns=test-patterns \| FileCheck %s

	// CHECK-DAG: #[[$STRIDED_1D:.]] = affine_map<(d0)[s0, s1] -> (d0 s1 + s0)>			// CHECK-DAG: #[[$STRIDED_1D:.]] = affine_map<(d0)[s0, s1] -> (d0 s1 + s0)>
	// Map corresponding to a 2D memory access where the stride along the last dim is known to be 1.			// Map corresponding to a 2D memory access where the stride along the last dim is known to be 1.
	// CHECK-DAG: #[[$STRIDED_2D_u_1:.]] = affine_map<(d0, d1)[s0, s1] -> (d0 s1 + s0 + d1)>			// CHECK-DAG: #[[$STRIDED_2D_u_1:.]] = affine_map<(d0, d1)[s0, s1] -> (d0 s1 + s0 + d1)>
	// Map corresponding to a 2D memory access where the stride along all dims are unknown.			// Map corresponding to a 2D memory access where the stride along all dims are unknown.
	// CHECK-DAG: #[[$STRIDED_2D:.]] = affine_map<(d0, d1)[s0, s1, s2] -> (d0 s1 + s0 + d1 * s2)>			// CHECK-DAG: #[[$STRIDED_2D:.]] = affine_map<(d0, d1)[s0, s1, s2] -> (d0 s1 + s0 + d1 * s2)>
	// CHECK-DAG: #[[$mk:.*]] = affine_map<(d0, d1, d2) -> (d0, d2)>			// CHECK-DAG: #[[$mk:.*]] = affine_map<(d0, d1, d2) -> (d0, d2)>
	// CHECK-DAG: #[[$kn:.*]] = affine_map<(d0, d1, d2) -> (d2, d1)>			// CHECK-DAG: #[[$kn:.*]] = affine_map<(d0, d1, d2) -> (d2, d1)>
	// CHECK-DAG: #[[$mn:.*]] = affine_map<(d0, d1, d2) -> (d0, d1)>			// CHECK-DAG: #[[$mn:.*]] = affine_map<(d0, d1, d2) -> (d0, d1)>
	// CHECK-DAG: #[[$nm:.*]] = affine_map<(d0, d1, d2) -> (d1, d0)>			// CHECK-DAG: #[[$nm:.*]] = affine_map<(d0, d1, d2) -> (d1, d0)>
	// CHECK-DAG: #[[$km:.*]] = affine_map<(d0, d1, d2) -> (d2, d0)>			// CHECK-DAG: #[[$km:.*]] = affine_map<(d0, d1, d2) -> (d2, d0)>

	func @dot(%x: memref<?xf32, offset: ?, strides: [1]>,			func @dot(%x: memref<?xf32, offset: ?, strides: [1]>,
	%y: memref<?xf32, offset: ?, strides: [1]>,			%y: memref<?xf32, offset: ?, strides: [1]>,
	%v: memref<f32>) {			%v: memref<f32>) {
	linalg.dot %x, %y, %v { __internal_linalg_transform__ = "MEM" } :			linalg.dot { __internal_linalg_transform__ = "MEM" }
	(memref<?xf32, offset: ?, strides: [1]>,			ins(%x, %y: memref<?xf32, offset: ?, strides: [1]>,
	memref<?xf32, offset: ?, strides: [1]>,			memref<?xf32, offset: ?, strides: [1]>)
	memref<f32>)			outs(%v: memref<f32>)

	return			return
	}			}
	// CHECK-LABEL: func @dot			// CHECK-LABEL: func @dot
	// CHECK-DAG: %[[c0:.*]] = constant 0 : index			// CHECK-DAG: %[[c0:.*]] = constant 0 : index
	// CHECK-DAG: %[[c1:.*]] = constant 1 : index			// CHECK-DAG: %[[c1:.*]] = constant 1 : index
	// CHECK-DAG: %[[c8000:.*]] = constant 8000 : index			// CHECK-DAG: %[[c8000:.*]] = constant 8000 : index
	// CHECK: scf.for {{.}} = %[[c0]] to {{.}} step %[[c8000]] {			// CHECK: scf.for {{.}} = %[[c0]] to {{.}} step %[[c8000]] {
	// CHECK: scf.for {{.}} = %[[c0]] to {{.}} step %[[c1]] {			// CHECK: scf.for {{.}} = %[[c0]] to {{.}} step %[[c1]] {
	// CHECK: load			// CHECK: load
	// CHECK: load			// CHECK: load
	// CHECK: load			// CHECK: load
	// CHECK: mulf			// CHECK: mulf
	// CHECK: addf			// CHECK: addf
	// CHECK: store			// CHECK: store

	func @matvec(%A: memref<?x?xf32, offset: ?, strides: [?, 1]>,			func @matvec(%A: memref<?x?xf32, offset: ?, strides: [?, 1]>,
	%x: memref<?xf32, offset: ?, strides: [1]>,			%x: memref<?xf32, offset: ?, strides: [1]>,
	%y: memref<?xf32, offset: ?, strides: [1]>) {			%y: memref<?xf32, offset: ?, strides: [1]>) {
	linalg.matvec %A, %x, %y :			linalg.matvec
	(memref<?x?xf32, offset: ?, strides: [?, 1]>,			ins(%A, %x: memref<?x?xf32, offset: ?, strides: [?, 1]>,
	memref<?xf32, offset: ?, strides: [1]>,
	memref<?xf32, offset: ?, strides: [1]>)			memref<?xf32, offset: ?, strides: [1]>)
				outs(%y: memref<?xf32, offset: ?, strides: [1]>)
	return			return
	}			}
	// CHECK-LABEL: func @matvec			// CHECK-LABEL: func @matvec
	// CHECK-DAG: %[[c0:.*]] = constant 0 : index			// CHECK-DAG: %[[c0:.*]] = constant 0 : index
	// CHECK-DAG: %[[c5:.*]] = constant 5 : index			// CHECK-DAG: %[[c5:.*]] = constant 5 : index
	// CHECK-DAG: %[[c6:.*]] = constant 6 : index			// CHECK-DAG: %[[c6:.*]] = constant 6 : index
	// CHECK: scf.parallel {{.*}} step (%[[c5]])			// CHECK: scf.parallel {{.*}} step (%[[c5]])
	// CHECK: scf.for {{.*}} step %[[c6]]			// CHECK: scf.for {{.*}} step %[[c6]]
	// CHECK: linalg.matvec {{.}}, {{.}}, {{.*}} : (memref<?x?xf32, #[[$STRIDED_2D]]>, memref<?xf32, #[[$STRIDED_1D]]>, memref<?xf32, #[[$STRIDED_1D]]>)			// CHECK: linalg.matvec
				// CHECK: ins({{.}}, {{.}}: memref<?x?xf32, #[[$STRIDED_2D]]>, memref<?xf32, #[[$STRIDED_1D]]>)
				// CHECK: outs({{.*}}: memref<?xf32, #[[$STRIDED_1D]]>)

	func @matmul(%A: memref<?x?xf32, offset: ?, strides: [?, 1]>,			func @matmul(%A: memref<?x?xf32, offset: ?, strides: [?, 1]>,
	%B: memref<?x?xf32, offset: ?, strides: [?, 1]>,			%B: memref<?x?xf32, offset: ?, strides: [?, 1]>,
	%C: memref<?x?xf32, offset: ?, strides: [?, 1]>) {			%C: memref<?x?xf32, offset: ?, strides: [?, 1]>) {
	linalg.matmul %A, %B, %C { __internal_linalg_transform__ = "MEM" } :			linalg.matmul { __internal_linalg_transform__ = "MEM" }
	(memref<?x?xf32, offset: ?, strides: [?, 1]>,			ins(%A, %B: memref<?x?xf32, offset: ?, strides: [?, 1]>,
	memref<?x?xf32, offset: ?, strides: [?, 1]>,
	memref<?x?xf32, offset: ?, strides: [?, 1]>)			memref<?x?xf32, offset: ?, strides: [?, 1]>)
				outs(%C: memref<?x?xf32, offset: ?, strides: [?, 1]>)
	return			return
	}			}
	// CHECK-LABEL: func @matmul			// CHECK-LABEL: func @matmul
	// CHECK-DAG: %[[c0:.*]] = constant 0 : index			// CHECK-DAG: %[[c0:.*]] = constant 0 : index
	// CHECK-DAG: %[[c2:.*]] = constant 2 : index			// CHECK-DAG: %[[c2:.*]] = constant 2 : index
	// CHECK-DAG: %[[c3:.*]] = constant 3 : index			// CHECK-DAG: %[[c3:.*]] = constant 3 : index
	// CHECK-DAG: %[[c4:.*]] = constant 4 : index			// CHECK-DAG: %[[c4:.*]] = constant 4 : index
	// CHECK-DAG: %[[c20:.*]] = constant 20 : index			// CHECK-DAG: %[[c20:.*]] = constant 20 : index
	Show All 12 Lines
	// CHECK: scf.for {{.}} = %[[c0]] to {{.}} step %[[c300]] {			// CHECK: scf.for {{.}} = %[[c0]] to {{.}} step %[[c300]] {
	// CHECK: scf.for {{.}} = %[[c0]] to {{.}} step %[[c400]] {			// CHECK: scf.for {{.}} = %[[c0]] to {{.}} step %[[c400]] {
	// CHECK: scf.for {{.}} = %[[c0]] to {{.}} step %[[c20]] {			// CHECK: scf.for {{.}} = %[[c0]] to {{.}} step %[[c20]] {
	// CHECK: scf.for {{.}} = %[[c0]] to {{.}} step %[[c30]] {			// CHECK: scf.for {{.}} = %[[c0]] to {{.}} step %[[c30]] {
	// CHECK: scf.for {{.}} = %[[c0]] to {{.}} step %[[c40]] {			// CHECK: scf.for {{.}} = %[[c0]] to {{.}} step %[[c40]] {
	// CHECK: scf.for {{.}} = %[[c0]] to {{.}} step %[[c2]] {			// CHECK: scf.for {{.}} = %[[c0]] to {{.}} step %[[c2]] {
	// CHECK: scf.for {{.}} = %[[c0]] to {{.}} step %[[c3]] {			// CHECK: scf.for {{.}} = %[[c0]] to {{.}} step %[[c3]] {
	// CHECK: scf.for {{.}} = %[[c0]] to {{.}} step %[[c4]] {			// CHECK: scf.for {{.}} = %[[c0]] to {{.}} step %[[c4]] {
	// CHECK: linalg.matmul {{.}}, {{.}}, {{.*}} : (			// CHECK: linalg.matmul
	// CHECK: memref<?x?xf32, #[[$STRIDED_2D]]>,			// CHECK: ins({{.}}, {{.}}: memref<?x?xf32, #[[$STRIDED_2D]]>, memref<?x?xf32, #[[$STRIDED_2D]]>)
	// CHECK: memref<?x?xf32, #[[$STRIDED_2D]]>,			// CHECK: outs({{.*}}: memref<?x?xf32, #[[$STRIDED_2D]]>)
	// CHECK: memref<?x?xf32, #[[$STRIDED_2D]]>)

	#matmul_trait = {			#matmul_trait = {
	args_in = 2,			args_in = 2,
	args_out = 1,			args_out = 1,
	indexing_maps = [			indexing_maps = [
	affine_map<(m, n, k) -> (m, k)>,			affine_map<(m, n, k) -> (m, k)>,
	affine_map<(m, n, k) -> (k, n)>,			affine_map<(m, n, k) -> (k, n)>,
	affine_map<(m, n, k) -> (m, n)>			affine_map<(m, n, k) -> (m, n)>
	Show All 32 Lines
	// CHECK: vector.transfer_read %{{.*}} : memref<8x16xi32>, vector<8x16xi32>			// CHECK: vector.transfer_read %{{.*}} : memref<8x16xi32>, vector<8x16xi32>
	// CHECK: vector.transfer_read %{{.*}} : memref<16x32xi32>, vector<16x32xi32>			// CHECK: vector.transfer_read %{{.*}} : memref<16x32xi32>, vector<16x32xi32>
	// CHECK: vector.transfer_read %{{.*}} : memref<8x32xi32>, vector<8x32xi32>			// CHECK: vector.transfer_read %{{.*}} : memref<8x32xi32>, vector<8x32xi32>
	// CHECK: vector.contract {indexing_maps = [#[[$mk]], #[[$kn]], #[[$mn]]], iterator_types = ["parallel", "parallel", "reduction"]} %{{.}}, %{{.}}, %{{.*}} : vector<8x16xi32>, vector<16x32xi32> into vector<8x32xi32>			// CHECK: vector.contract {indexing_maps = [#[[$mk]], #[[$kn]], #[[$mn]]], iterator_types = ["parallel", "parallel", "reduction"]} %{{.}}, %{{.}}, %{{.*}} : vector<8x16xi32>, vector<16x32xi32> into vector<8x32xi32>
	// CHECK: vector.transfer_write %{{.}}, %{{.}} : vector<8x32xi32>, memref<8x32xi32>			// CHECK: vector.transfer_write %{{.}}, %{{.}} : vector<8x32xi32>, memref<8x32xi32>

	func @vectorization_test_2(%A: memref<8x16xf32>, %B: memref<16x32xf32>,			func @vectorization_test_2(%A: memref<8x16xf32>, %B: memref<16x32xf32>,
	%C: memref<8x32xf32>) {			%C: memref<8x32xf32>) {
	linalg.matmul %A, %B, %C { __internal_linalg_transform__ = "VECTORIZE"} :			linalg.matmul { __internal_linalg_transform__ = "VECTORIZE"}
	(memref<8x16xf32>, memref<16x32xf32>, memref<8x32xf32>)			ins(%A, %B: memref<8x16xf32>, memref<16x32xf32>)
				outs(%C: memref<8x32xf32>)
	return			return
	}			}
	// CHECK-LABEL: func @vectorization_test_2			// CHECK-LABEL: func @vectorization_test_2
	// CHECK: vector.contract {{.*}} :			// CHECK: vector.contract {{.*}} :
	// vector<8x16xf32>, vector<16x32xf32> into vector<8x32xf32>			// vector<8x16xf32>, vector<16x32xf32> into vector<8x32xf32>

	func @test_vectorize_fill(%A : memref<8x16xf32>, %arg0 : f32) {			func @test_vectorize_fill(%A : memref<8x16xf32>, %arg0 : f32) {
	linalg.fill(%A, %arg0) { __internal_linalg_transform__ = "VECTORIZE"} : memref<8x16xf32>, f32			linalg.fill(%A, %arg0) { __internal_linalg_transform__ = "VECTORIZE"} : memref<8x16xf32>, f32
	▲ Show 20 Lines • Show All 81 Lines • ▼ Show 20 Lines
	// CHECK-SAME: library_call = "linalg_matmul_indexed"} %{{.}}, %{{.}}, %{{.*}}			// CHECK-SAME: library_call = "linalg_matmul_indexed"} %{{.}}, %{{.}}, %{{.*}}
	// CHECK: memref<?x?xf32, #[[$STRIDED_2D_u_1]]>,			// CHECK: memref<?x?xf32, #[[$STRIDED_2D_u_1]]>,
	// CHECK-SAME: memref<?x?xf32, #[[$STRIDED_2D_u_1]]>,			// CHECK-SAME: memref<?x?xf32, #[[$STRIDED_2D_u_1]]>,
	// CHECK-SAME: memref<?x?xf32, #[[$STRIDED_2D_u_1]]>			// CHECK-SAME: memref<?x?xf32, #[[$STRIDED_2D_u_1]]>

	func @matvec_perm(%A: memref<?x?xf32, offset: ?, strides: [?, 1]>,			func @matvec_perm(%A: memref<?x?xf32, offset: ?, strides: [?, 1]>,
	%x: memref<?xf32, offset: ?, strides: [1]>,			%x: memref<?xf32, offset: ?, strides: [1]>,
	%y: memref<?xf32, offset: ?, strides: [1]>) {			%y: memref<?xf32, offset: ?, strides: [1]>) {
	linalg.matvec %A, %x, %y {__internal_linalg_transform__ = "__with_perm__"} :			linalg.matvec {__internal_linalg_transform__ = "__with_perm__"}
	(memref<?x?xf32, offset: ?, strides: [?, 1]>,			ins(%A, %x: memref<?x?xf32, offset: ?, strides: [?, 1]>,
	memref<?xf32, offset: ?, strides: [1]>,
	memref<?xf32, offset: ?, strides: [1]>)			memref<?xf32, offset: ?, strides: [1]>)
				outs(%y: memref<?xf32, offset: ?, strides: [1]>)
	return			return
	}			}
	// CHECK-LABEL: func @matvec_perm			// CHECK-LABEL: func @matvec_perm
	// CHECK-DAG: %[[c0:.*]] = constant 0 : index			// CHECK-DAG: %[[c0:.*]] = constant 0 : index
	// CHECK-DAG: %[[c5:.*]] = constant 5 : index			// CHECK-DAG: %[[c5:.*]] = constant 5 : index
	// CHECK-DAG: %[[c6:.*]] = constant 6 : index			// CHECK-DAG: %[[c6:.*]] = constant 6 : index
	// CHECK: scf.for {{.}} = %[[c0]] to {{.}} step %[[c6]]			// CHECK: scf.for {{.}} = %[[c0]] to {{.}} step %[[c6]]
	// CHECK: scf.for {{.}} = %[[c0]] to {{.}} step %[[c5]]			// CHECK: scf.for {{.}} = %[[c0]] to {{.}} step %[[c5]]
	// CHECK: linalg.matvec {{.}}, {{.}}, {{.*}} : (memref<?x?xf32, #[[$STRIDED_2D]]>, memref<?xf32, #[[$STRIDED_1D]]>, memref<?xf32, #[[$STRIDED_1D]]>)			// CHECK: linalg.matvec
				// CHECK: ins({{.}}, {{.}}: memref<?x?xf32, #[[$STRIDED_2D]]>, memref<?xf32, #[[$STRIDED_1D]]>)
				// CHECK: outs({{.*}}: memref<?xf32, #[[$STRIDED_1D]]>)

	func @matmul_perm(%A: memref<?x?xf32, offset: ?, strides: [?, 1]>,			func @matmul_perm(%A: memref<?x?xf32, offset: ?, strides: [?, 1]>,
	%B: memref<?x?xf32, offset: ?, strides: [?, 1]>,			%B: memref<?x?xf32, offset: ?, strides: [?, 1]>,
	%C: memref<?x?xf32, offset: ?, strides: [?, 1]>) {			%C: memref<?x?xf32, offset: ?, strides: [?, 1]>) {
	linalg.matmul %A, %B, %C {__internal_linalg_transform__ = "__with_perm__"} :			linalg.matmul {__internal_linalg_transform__ = "__with_perm__"}
	(memref<?x?xf32, offset: ?, strides: [?, 1]>,			ins(%A, %B: memref<?x?xf32, offset: ?, strides: [?, 1]>,
	memref<?x?xf32, offset: ?, strides: [?, 1]>,
	memref<?x?xf32, offset: ?, strides: [?, 1]>)			memref<?x?xf32, offset: ?, strides: [?, 1]>)
				outs(%C : memref<?x?xf32, offset: ?, strides: [?, 1]>)
	return			return
	}			}
	// CHECK-LABEL: func @matmul_perm			// CHECK-LABEL: func @matmul_perm
	// CHECK-DAG: %[[c0:.*]] = constant 0 : index			// CHECK-DAG: %[[c0:.*]] = constant 0 : index
	// CHECK-DAG: %[[c20:.*]] = constant 20 : index			// CHECK-DAG: %[[c20:.*]] = constant 20 : index
	// CHECK-DAG: %[[c30:.*]] = constant 30 : index			// CHECK-DAG: %[[c30:.*]] = constant 30 : index
	// CHECK-DAG: %[[c40:.*]] = constant 40 : index			// CHECK-DAG: %[[c40:.*]] = constant 40 : index
	// CHECK-DAG: %[[c200:.*]] = constant 200 : index			// CHECK-DAG: %[[c200:.*]] = constant 200 : index
	// CHECK-DAG: %[[c300:.*]] = constant 300 : index			// CHECK-DAG: %[[c300:.*]] = constant 300 : index
	// CHECK-DAG: %[[c400:.*]] = constant 400 : index			// CHECK-DAG: %[[c400:.*]] = constant 400 : index
	// CHECK-DAG: %[[c2000:.*]] = constant 2000 : index			// CHECK-DAG: %[[c2000:.*]] = constant 2000 : index
	// CHECK-DAG: %[[c3000:.*]] = constant 3000 : index			// CHECK-DAG: %[[c3000:.*]] = constant 3000 : index
	// CHECK-DAG: %[[c4000:.*]] = constant 4000 : index			// CHECK-DAG: %[[c4000:.*]] = constant 4000 : index
	// CHECK: scf.for {{.}} = %[[c0]] to {{.}} step %[[c3000]] {			// CHECK: scf.for {{.}} = %[[c0]] to {{.}} step %[[c3000]] {
	// CHECK: scf.for {{.}} = %[[c0]] to {{.}} step %[[c4000]] {			// CHECK: scf.for {{.}} = %[[c0]] to {{.}} step %[[c4000]] {
	// CHECK: scf.for {{.}} = %[[c0]] to {{.}} step %[[c2000]] {			// CHECK: scf.for {{.}} = %[[c0]] to {{.}} step %[[c2000]] {
	// CHECK: scf.for {{.}} = %[[c0]] to {{.}} step %[[c300]] {			// CHECK: scf.for {{.}} = %[[c0]] to {{.}} step %[[c300]] {
	// CHECK: scf.for {{.}} = %[[c0]] to {{.}} step %[[c200]] {			// CHECK: scf.for {{.}} = %[[c0]] to {{.}} step %[[c200]] {
	// CHECK: scf.for {{.}} = %[[c0]] to {{.}} step %[[c400]] {			// CHECK: scf.for {{.}} = %[[c0]] to {{.}} step %[[c400]] {
	// CHECK: scf.for {{.}} = %[[c0]] to {{.}} step %[[c20]] {			// CHECK: scf.for {{.}} = %[[c0]] to {{.}} step %[[c20]] {
	// CHECK: scf.for {{.}} = %[[c0]] to {{.}} step %[[c30]] {			// CHECK: scf.for {{.}} = %[[c0]] to {{.}} step %[[c30]] {
	// CHECK: scf.for {{.}} = %[[c0]] to {{.}} step %[[c40]] {			// CHECK: scf.for {{.}} = %[[c0]] to {{.}} step %[[c40]] {
	// CHECK: linalg.matmul {{.}}, {{.}}, {{.*}} : (			// CHECK: linalg.matmul
	// CHECK: memref<?x?xf32, #[[$STRIDED_2D]]>,			// CHECK: ins({{.}}, {{.}}: memref<?x?xf32, #[[$STRIDED_2D]]>, memref<?x?xf32, #[[$STRIDED_2D]]>)
	// CHECK: memref<?x?xf32, #[[$STRIDED_2D]]>,			// CHECK: outs({{.*}}: memref<?x?xf32, #[[$STRIDED_2D]]>)
	// CHECK: memref<?x?xf32, #[[$STRIDED_2D]]>)

	func @promote_subview_matmul(%arg0: memref<?x?xf32, offset: ?, strides: [?, 1]>,			func @promote_subview_matmul(%arg0: memref<?x?xf32, offset: ?, strides: [?, 1]>,
	%arg1: memref<?x?xf32, offset: ?, strides: [?, 1]>,			%arg1: memref<?x?xf32, offset: ?, strides: [?, 1]>,
	%arg2: memref<?x?xf32, offset: ?, strides: [?, 1]>) {			%arg2: memref<?x?xf32, offset: ?, strides: [?, 1]>) {
	%c2000 = constant 2000 : index			%c2000 = constant 2000 : index
	%c3000 = constant 3000 : index			%c3000 = constant 3000 : index
	%c4000 = constant 4000 : index			%c4000 = constant 4000 : index
	%c0 = constant 0 : index			%c0 = constant 0 : index
	%c1 = constant 1 : index			%c1 = constant 1 : index
	%0 = dim %arg0, %c0 : memref<?x?xf32, offset: ?, strides: [?, 1]>			%0 = dim %arg0, %c0 : memref<?x?xf32, offset: ?, strides: [?, 1]>
	%1 = dim %arg0, %c1 : memref<?x?xf32, offset: ?, strides: [?, 1]>			%1 = dim %arg0, %c1 : memref<?x?xf32, offset: ?, strides: [?, 1]>
	%2 = dim %arg1, %c1 : memref<?x?xf32, offset: ?, strides: [?, 1]>			%2 = dim %arg1, %c1 : memref<?x?xf32, offset: ?, strides: [?, 1]>
	scf.for %arg3 = %c0 to %0 step %c2000 {			scf.for %arg3 = %c0 to %0 step %c2000 {
	scf.for %arg4 = %c0 to %2 step %c3000 {			scf.for %arg4 = %c0 to %2 step %c3000 {
	scf.for %arg5 = %c0 to %1 step %c4000 {			scf.for %arg5 = %c0 to %1 step %c4000 {
	%3 = subview %arg0[%arg3, %arg5][%c2000, %c4000][%c1, %c1] :			%3 = subview %arg0[%arg3, %arg5][%c2000, %c4000][%c1, %c1] :
	memref<?x?xf32, offset: ?, strides: [?, 1]> to memref<?x?xf32, offset: ?, strides: [?, ?]>			memref<?x?xf32, offset: ?, strides: [?, 1]> to memref<?x?xf32, offset: ?, strides: [?, ?]>
	%4 = subview %arg1[%arg5, %arg4][%c4000, %c3000][%c1, %c1] :			%4 = subview %arg1[%arg5, %arg4][%c4000, %c3000][%c1, %c1] :
	memref<?x?xf32, offset: ?, strides: [?, 1]> to memref<?x?xf32, offset: ?, strides: [?, ?]>			memref<?x?xf32, offset: ?, strides: [?, 1]> to memref<?x?xf32, offset: ?, strides: [?, ?]>
	%5 = subview %arg2[%arg3, %arg4][%c2000, %c3000][%c1, %c1] :			%5 = subview %arg2[%arg3, %arg4][%c2000, %c3000][%c1, %c1] :
	memref<?x?xf32, offset: ?, strides: [?, 1]> to memref<?x?xf32, offset: ?, strides: [?, ?]>			memref<?x?xf32, offset: ?, strides: [?, 1]> to memref<?x?xf32, offset: ?, strides: [?, ?]>
	linalg.matmul %3, %4, %5 {__internal_linalg_transform__ = "_promote_views_"} :			linalg.matmul {__internal_linalg_transform__ = "_promote_views_"}
	(memref<?x?xf32, offset: ?, strides: [?, ?]>,			ins(%3, %4: memref<?x?xf32, offset: ?, strides: [?, ?]>,
	memref<?x?xf32, offset: ?, strides: [?, ?]>,
	memref<?x?xf32, offset: ?, strides: [?, ?]>)			memref<?x?xf32, offset: ?, strides: [?, ?]>)
				outs(%5: memref<?x?xf32, offset: ?, strides: [?, ?]>)
	}			}
	}			}
	}			}
	return			return
	}			}
	// CHECK-LABEL: func @promote_subview_matmul			// CHECK-LABEL: func @promote_subview_matmul
	// CHECK-DAG: %[[c0:.*]] = constant 0 : index			// CHECK-DAG: %[[c0:.*]] = constant 0 : index
	// CHECK-DAG: %[[c2000:.*]] = constant 2000 : index			// CHECK-DAG: %[[c2000:.*]] = constant 2000 : index
	Show All 12 Lines
	// CHECK: %[[v1:.]] = std.view %[[a1]][{{.}}][{{%.}}, {{%.}}] : memref<?xi8> to memref<?x?xf32>			// CHECK: %[[v1:.]] = std.view %[[a1]][{{.}}][{{%.}}, {{%.}}] : memref<?xi8> to memref<?x?xf32>
	// CHECK: %[[l1:.]] = subview %[[v1]][{{%.}}, {{%.}}] [{{%.}}, {{%.*}}] : memref<?x?xf32> to memref<?x?xf32, #[[$STRIDED_2D]]>			// CHECK: %[[l1:.]] = subview %[[v1]][{{%.}}, {{%.}}] [{{%.}}, {{%.*}}] : memref<?x?xf32> to memref<?x?xf32, #[[$STRIDED_2D]]>
	// CHECK: %[[a2:.]] = alloc({{%.}}) : memref<?xi8>			// CHECK: %[[a2:.]] = alloc({{%.}}) : memref<?xi8>
	// CHECK: %[[v2:.]] = std.view %[[a2]][{{.}}][{{%.}}, {{%.}}] : memref<?xi8> to memref<?x?xf32>			// CHECK: %[[v2:.]] = std.view %[[a2]][{{.}}][{{%.}}, {{%.}}] : memref<?xi8> to memref<?x?xf32>
	// CHECK: %[[l2:.]] = subview %[[v2]][{{%.}}, {{%.}}] [{{%.}}, {{%.*}}] : memref<?x?xf32> to memref<?x?xf32, #[[$STRIDED_2D]]>			// CHECK: %[[l2:.]] = subview %[[v2]][{{%.}}, {{%.}}] [{{%.}}, {{%.*}}] : memref<?x?xf32> to memref<?x?xf32, #[[$STRIDED_2D]]>
	// CHECK: linalg.copy(%[[s0]], %[[l0]]) : memref<?x?xf32, #map{{.}}>, memref<?x?xf32, #map{{.}}>			// CHECK: linalg.copy(%[[s0]], %[[l0]]) : memref<?x?xf32, #map{{.}}>, memref<?x?xf32, #map{{.}}>
	// CHECK: linalg.copy(%[[s1]], %[[l1]]) : memref<?x?xf32, #map{{.}}>, memref<?x?xf32, #map{{.}}>			// CHECK: linalg.copy(%[[s1]], %[[l1]]) : memref<?x?xf32, #map{{.}}>, memref<?x?xf32, #map{{.}}>
	// CHECK: linalg.copy(%[[s2]], %[[l2]]) : memref<?x?xf32, #map{{.}}>, memref<?x?xf32, #map{{.}}>			// CHECK: linalg.copy(%[[s2]], %[[l2]]) : memref<?x?xf32, #map{{.}}>, memref<?x?xf32, #map{{.}}>
	// CHECK: linalg.matmul %[[v0]], %[[v1]], %[[v2]] :			// CHECK: linalg.matmul
	// CHECK: (memref<?x?xf32>, memref<?x?xf32>, memref<?x?xf32>)			// CHECK-SAME: ins(%[[v0]], %[[v1]] : memref<?x?xf32>, memref<?x?xf32>)
				// CHECK-SAME: outs(%[[v2]] : memref<?x?xf32>)

	func @promote_first_subview_matmul(%arg0: memref<?x?xf32, offset: ?, strides: [?, 1]>,			func @promote_first_subview_matmul(%arg0: memref<?x?xf32, offset: ?, strides: [?, 1]>,
	%arg1: memref<?x?xf32, offset: ?, strides: [?, 1]>,			%arg1: memref<?x?xf32, offset: ?, strides: [?, 1]>,
	%arg2: memref<?x?xf32, offset: ?, strides: [?, 1]>) {			%arg2: memref<?x?xf32, offset: ?, strides: [?, 1]>) {
	%c2000 = constant 2000 : index			%c2000 = constant 2000 : index
	%c3000 = constant 3000 : index			%c3000 = constant 3000 : index
	%c4000 = constant 4000 : index			%c4000 = constant 4000 : index
	%c0 = constant 0 : index			%c0 = constant 0 : index
	%c1 = constant 1 : index			%c1 = constant 1 : index
	%0 = dim %arg0, %c0 : memref<?x?xf32, offset: ?, strides: [?, 1]>			%0 = dim %arg0, %c0 : memref<?x?xf32, offset: ?, strides: [?, 1]>
	%1 = dim %arg0, %c1 : memref<?x?xf32, offset: ?, strides: [?, 1]>			%1 = dim %arg0, %c1 : memref<?x?xf32, offset: ?, strides: [?, 1]>
	%2 = dim %arg1, %c1 : memref<?x?xf32, offset: ?, strides: [?, 1]>			%2 = dim %arg1, %c1 : memref<?x?xf32, offset: ?, strides: [?, 1]>
	scf.for %arg3 = %c0 to %0 step %c2000 {			scf.for %arg3 = %c0 to %0 step %c2000 {
	scf.for %arg4 = %c0 to %2 step %c3000 {			scf.for %arg4 = %c0 to %2 step %c3000 {
	scf.for %arg5 = %c0 to %1 step %c4000 {			scf.for %arg5 = %c0 to %1 step %c4000 {
	%3 = std.subview %arg0[%arg3, %arg5][%c2000, %c4000][%c1, %c1] :			%3 = std.subview %arg0[%arg3, %arg5][%c2000, %c4000][%c1, %c1] :
	memref<?x?xf32, offset: ?, strides: [?, 1]> to memref<?x?xf32, offset: ?, strides: [?, ?]>			memref<?x?xf32, offset: ?, strides: [?, 1]> to memref<?x?xf32, offset: ?, strides: [?, ?]>
	%4 = std.subview %arg1[%arg5, %arg4][%c4000, %c3000][%c1, %c1] :			%4 = std.subview %arg1[%arg5, %arg4][%c4000, %c3000][%c1, %c1] :
	memref<?x?xf32, offset: ?, strides: [?, 1]> to memref<?x?xf32, offset: ?, strides: [?, ?]>			memref<?x?xf32, offset: ?, strides: [?, 1]> to memref<?x?xf32, offset: ?, strides: [?, ?]>
	%5 = std.subview %arg2[%arg3, %arg4][%c2000, %c3000][%c1, %c1] :			%5 = std.subview %arg2[%arg3, %arg4][%c2000, %c3000][%c1, %c1] :
	memref<?x?xf32, offset: ?, strides: [?, 1]> to memref<?x?xf32, offset: ?, strides: [?, ?]>			memref<?x?xf32, offset: ?, strides: [?, 1]> to memref<?x?xf32, offset: ?, strides: [?, ?]>
	linalg.matmul %3, %4, %5 {__internal_linalg_transform__ = "_promote_first_view_"} :			linalg.matmul {__internal_linalg_transform__ = "_promote_first_view_"}
	(memref<?x?xf32, offset: ?, strides: [?, ?]>,			ins(%3, %4: memref<?x?xf32, offset: ?, strides: [?, ?]>,
	memref<?x?xf32, offset: ?, strides: [?, ?]>,
	memref<?x?xf32, offset: ?, strides: [?, ?]>)			memref<?x?xf32, offset: ?, strides: [?, ?]>)
				outs(%5: memref<?x?xf32, offset: ?, strides: [?, ?]>)
	}			}
	}			}
	}			}
	return			return
	}			}
	// CHECK-LABEL: func @promote_first_subview_matmul			// CHECK-LABEL: func @promote_first_subview_matmul
	// CHECK-DAG: %[[c0:.*]] = constant 0 : index			// CHECK-DAG: %[[c0:.*]] = constant 0 : index
	// CHECK-DAG: %[[c2000:.*]] = constant 2000 : index			// CHECK-DAG: %[[c2000:.*]] = constant 2000 : index
	Show All 12 Lines
	// CHECK-NOT: %[[v1:.]] = std.view %[[a1]][{{.}}][{{%.}}, {{%.}}] : memref<?xi8> to memref<?x?xf32>			// CHECK-NOT: %[[v1:.]] = std.view %[[a1]][{{.}}][{{%.}}, {{%.}}] : memref<?xi8> to memref<?x?xf32>
	// CHECK-NOT: %[[l0:.]] = subview %[[v1]][{{%.}}, {{%.}}] [{{%.}}, {{%.}}] [{{%.}}, {{%.*}}] : memref<?x?xf32> to memref<?x?xf32, #[[$STRIDED_2D]]>			// CHECK-NOT: %[[l0:.]] = subview %[[v1]][{{%.}}, {{%.}}] [{{%.}}, {{%.}}] [{{%.}}, {{%.*}}] : memref<?x?xf32> to memref<?x?xf32, #[[$STRIDED_2D]]>
	// CHECK-NOT: %[[a2:.]] = alloc({{%.}}) : memref<?xi8>			// CHECK-NOT: %[[a2:.]] = alloc({{%.}}) : memref<?xi8>
	// CHECK-NOT: %[[v2:.]] = std.view %[[a2]][{{.}}][{{%.}}, {{%.}}] : memref<?xi8> to memref<?x?xf32>			// CHECK-NOT: %[[v2:.]] = std.view %[[a2]][{{.}}][{{%.}}, {{%.}}] : memref<?xi8> to memref<?x?xf32>
	// CHECK-NOT: %[[l0:.]] = subview %[[v2]][{{%.}}, {{%.}}] [{{%.}}, {{%.}}] [{{%.}}, {{%.*}}] : memref<?x?xf32> to memref<?x?xf32, #[[$STRIDED_2D]]>			// CHECK-NOT: %[[l0:.]] = subview %[[v2]][{{%.}}, {{%.}}] [{{%.}}, {{%.}}] [{{%.}}, {{%.*}}] : memref<?x?xf32> to memref<?x?xf32, #[[$STRIDED_2D]]>
	// CHECK: linalg.copy(%[[s0]], %[[l0]]) : memref<?x?xf32, #map{{.}}>, memref<?x?xf32, #map{{.}}>			// CHECK: linalg.copy(%[[s0]], %[[l0]]) : memref<?x?xf32, #map{{.}}>, memref<?x?xf32, #map{{.}}>
	// CHECK-NOT: linalg.copy(%[[s1]], %[[l1]]) : memref<?x?xf32, #map{{.}}>, memref<?x?xf32, #map{{.}}>			// CHECK-NOT: linalg.copy(%[[s1]], %[[l1]]) : memref<?x?xf32, #map{{.}}>, memref<?x?xf32, #map{{.}}>
	// CHECK-NOT: linalg.copy(%[[s2]], %[[l2]]) : memref<?x?xf32, #map{{.}}>, memref<?x?xf32, #map{{.}}>^			// CHECK-NOT: linalg.copy(%[[s2]], %[[l2]]) : memref<?x?xf32, #map{{.}}>, memref<?x?xf32, #map{{.}}>^
	// CHECK: linalg.matmul %[[v0]], %[[s1]], %[[s2]] :			// CHECK: linalg.matmul
	// CHECK: (memref<?x?xf32>,			// CHECK-SAME: ins(%[[v0]], %[[s1]] : memref<?x?xf32>, memref<?x?xf32, #[[$STRIDED_2D]]>)
	// CHECK: memref<?x?xf32, #[[$STRIDED_2D]]>,			// CHECK-SAME: outs(%[[s2]] : memref<?x?xf32, #[[$STRIDED_2D]]>)
	// CHECK: memref<?x?xf32, #[[$STRIDED_2D]]>)

	func @aligned_promote_fill(%arg0: memref<?x?xf32, offset: ?, strides: [?, 1]>) {			func @aligned_promote_fill(%arg0: memref<?x?xf32, offset: ?, strides: [?, 1]>) {
	%c2000 = constant 2000 : index			%c2000 = constant 2000 : index
	%c4000 = constant 4000 : index			%c4000 = constant 4000 : index
	%c0 = constant 0 : index			%c0 = constant 0 : index
	%c1 = constant 1 : index			%c1 = constant 1 : index
	%cf = constant 1.0 : f32			%cf = constant 1.0 : f32
	%3 = std.subview %arg0[%c0, %c0][%c2000, %c4000][%c1, %c1] :			%3 = std.subview %arg0[%c0, %c0][%c2000, %c4000][%c1, %c1] :
	Show All 10 Lines
	// CHECK: %[[l0:.]] = subview %[[v0]][{{%.}}, {{%.}}] [{{%.}}, {{%.*}}] : memref<?x?xf32> to memref<?x?xf32, #[[$STRIDED_2D]]>			// CHECK: %[[l0:.]] = subview %[[v0]][{{%.}}, {{%.}}] [{{%.}}, {{%.*}}] : memref<?x?xf32> to memref<?x?xf32, #[[$STRIDED_2D]]>
	// CHECK: linalg.fill(%[[v0]], {{%.*}}) : memref<?x?xf32>, f32			// CHECK: linalg.fill(%[[v0]], {{%.*}}) : memref<?x?xf32>, f32
	// CHECK: linalg.copy(%[[s0]], %[[l0]]) : memref<?x?xf32, #map{{.}}>, memref<?x?xf32, #map{{.}}>			// CHECK: linalg.copy(%[[s0]], %[[l0]]) : memref<?x?xf32, #map{{.}}>, memref<?x?xf32, #map{{.}}>
	// CHECK: linalg.fill(%[[v0]], %[[cf]]) : memref<?x?xf32>, f32			// CHECK: linalg.fill(%[[v0]], %[[cf]]) : memref<?x?xf32>, f32

	func @tile_permute_parallel_loop(%arg0: memref<?x?xf32>,			func @tile_permute_parallel_loop(%arg0: memref<?x?xf32>,
	%arg1: memref<?x?xf32>,			%arg1: memref<?x?xf32>,
	%arg2: memref<?x?xf32>) {			%arg2: memref<?x?xf32>) {
	linalg.matmul %arg0, %arg1, %arg2 {__internal_linalg_transform__ = "par__with_perm__"}			linalg.matmul {__internal_linalg_transform__ = "par__with_perm__"}
	: (memref<?x?xf32>, memref<?x?xf32>, memref<?x?xf32>)			ins(%arg0, %arg1: memref<?x?xf32>, memref<?x?xf32>)
				outs(%arg2: memref<?x?xf32>)
	return			return
	}			}
	// CHECK-LABEL: func @tile_permute_parallel_loop			// CHECK-LABEL: func @tile_permute_parallel_loop
	// CHECK-SAME: %[[ARG0:[a-zA-Z0-9_]+]]: memref<?x?xf32>			// CHECK-SAME: %[[ARG0:[a-zA-Z0-9_]+]]: memref<?x?xf32>
	// CHECK-SAME: %[[ARG1:[a-zA-Z0-9_]+]]: memref<?x?xf32>			// CHECK-SAME: %[[ARG1:[a-zA-Z0-9_]+]]: memref<?x?xf32>
	// CHECK-SAME: %[[ARG2:[a-zA-Z0-9_]+]]: memref<?x?xf32>			// CHECK-SAME: %[[ARG2:[a-zA-Z0-9_]+]]: memref<?x?xf32>
	// CHECK-DAG: %[[C16:.*]] = constant 16 : index			// CHECK-DAG: %[[C16:.*]] = constant 16 : index
	// CHECK-DAG: %[[C8:.*]] = constant 8 : index			// CHECK-DAG: %[[C8:.*]] = constant 8 : index
	// CHECK-DAG: %[[C4:.*]] = constant 4 : index			// CHECK-DAG: %[[C4:.*]] = constant 4 : index
	// CHECK-DAG: %[[C0:.*]] = constant 0 : index			// CHECK-DAG: %[[C0:.*]] = constant 0 : index
	// CHECK-DAG: %[[D0:.*]] = dim %[[ARG0]], %c0			// CHECK-DAG: %[[D0:.*]] = dim %[[ARG0]], %c0
	// CHECK-DAG: %[[D1:.*]] = dim %[[ARG0]], %c1			// CHECK-DAG: %[[D1:.*]] = dim %[[ARG0]], %c1
	// CHECK-DAG: %[[D2:.*]] = dim %[[ARG1]], %c1			// CHECK-DAG: %[[D2:.*]] = dim %[[ARG1]], %c1
	// CHECK: scf.parallel (%{{.*}}) = (%[[C0]]) to (%[[D2]]) step (%[[C8]])			// CHECK: scf.parallel (%{{.*}}) = (%[[C0]]) to (%[[D2]]) step (%[[C8]])
	// CHECK: scf.for %{{.*}} = %[[C0]] to %[[D1]] step %[[C4]]			// CHECK: scf.for %{{.*}} = %[[C0]] to %[[D1]] step %[[C4]]
	// CHECK: scf.parallel (%{{.*}}) = (%[[C0]]) to (%[[D0]]) step (%[[C16]])			// CHECK: scf.parallel (%{{.*}}) = (%[[C0]]) to (%[[D0]]) step (%[[C16]])

mlir/test/IR/slice.mlir

	// RUN: mlir-opt -slice-analysis-test %s \| FileCheck %s			// RUN: mlir-opt -slice-analysis-test %s \| FileCheck %s

	func @slicing_linalg_op(%arg0 : index, %arg1 : index, %arg2 : index) {			func @slicing_linalg_op(%arg0 : index, %arg1 : index, %arg2 : index) {
	%a = alloc(%arg0, %arg2) : memref<?x?xf32>			%a = alloc(%arg0, %arg2) : memref<?x?xf32>
	%b = alloc(%arg2, %arg1) : memref<?x?xf32>			%b = alloc(%arg2, %arg1) : memref<?x?xf32>
	%c = alloc(%arg0, %arg1) : memref<?x?xf32>			%c = alloc(%arg0, %arg1) : memref<?x?xf32>
	%d = alloc(%arg0, %arg1) : memref<?x?xf32>			%d = alloc(%arg0, %arg1) : memref<?x?xf32>
	linalg.matmul %a, %b, %c : (memref<?x?xf32>, memref<?x?xf32>, memref<?x?xf32>)			linalg.matmul ins(%a, %b : memref<?x?xf32>, memref<?x?xf32>)
	linalg.matmul %a, %b, %d : (memref<?x?xf32>, memref<?x?xf32>, memref<?x?xf32>)			outs(%c : memref<?x?xf32>)
				linalg.matmul ins(%a, %b : memref<?x?xf32>, memref<?x?xf32>)
				outs(%d : memref<?x?xf32>)
	dealloc %c : memref<?x?xf32>			dealloc %c : memref<?x?xf32>
	dealloc %b : memref<?x?xf32>			dealloc %b : memref<?x?xf32>
	dealloc %a : memref<?x?xf32>			dealloc %a : memref<?x?xf32>
	dealloc %d : memref<?x?xf32>			dealloc %d : memref<?x?xf32>
	return			return
	}			}

	// CHECK-LABEL: func @slicing_linalg_op__backward_slice__0			// CHECK-LABEL: func @slicing_linalg_op__backward_slice__0
	Show All 16 Lines

mlir/test/lib/Dialect/Test/TestOps.td

	Show First 20 Lines • Show All 47 Lines • ▼ Show 20 Lines
	def ComplexOp : TEST_Op<"complex_f64"> {			def ComplexOp : TEST_Op<"complex_f64"> {
	let results = (outs ComplexF64);			let results = (outs ComplexF64);
	}			}

	def ComplexTensorOp : TEST_Op<"complex_f64_tensor"> {			def ComplexTensorOp : TEST_Op<"complex_f64_tensor"> {
	let results = (outs TensorOf<[ComplexF64]>);			let results = (outs TensorOf<[ComplexF64]>);
	}			}

	def AnyShaped: ShapedContainerType<[AnyType], IsShapedTypePred, "shaped">;

	def TupleOp : TEST_Op<"tuple_32_bit"> {			def TupleOp : TEST_Op<"tuple_32_bit"> {
	let results = (outs TupleOf<[I32, F32]>);			let results = (outs TupleOf<[I32, F32]>);
	}			}

	def NestedTupleOp : TEST_Op<"nested_tuple_32_bit"> {			def NestedTupleOp : TEST_Op<"nested_tuple_32_bit"> {
	let results = (outs NestedTupleOf<[I32, F32]>);			let results = (outs NestedTupleOf<[I32, F32]>);
	}			}

	▲ Show 20 Lines • Show All 1,660 Lines • Show Last 20 Lines

mlir/test/mlir-cpu-runner/linalg_integration_test.mlir

Show First 20 Lines • Show All 45 Lines • ▼ Show 20 Lines	func @dot() -> f32 {
%bA = call @alloc_filled_f32(%c16, %f2) : (index, f32) -> (memref<?xi8>)		%bA = call @alloc_filled_f32(%c16, %f2) : (index, f32) -> (memref<?xi8>)
%bB = call @alloc_filled_f32(%c16, %f1) : (index, f32) -> (memref<?xi8>)		%bB = call @alloc_filled_f32(%c16, %f1) : (index, f32) -> (memref<?xi8>)
%bC = call @alloc_filled_f32(%c1, %f10) : (index, f32) -> (memref<?xi8>)		%bC = call @alloc_filled_f32(%c1, %f10) : (index, f32) -> (memref<?xi8>)

%A = view %bA[%c0][%c16] : memref<?xi8> to memref<?xf32>		%A = view %bA[%c0][%c16] : memref<?xi8> to memref<?xf32>
%B = view %bB[%c0][%c16] : memref<?xi8> to memref<?xf32>		%B = view %bB[%c0][%c16] : memref<?xi8> to memref<?xf32>
%C = view %bC[%c0][] : memref<?xi8> to memref<f32>		%C = view %bC[%c0][] : memref<?xi8> to memref<f32>

linalg.dot %A, %B, %C : (memref<?xf32>, memref<?xf32>, memref<f32>)		linalg.dot ins(%A, %B : memref<?xf32>, memref<?xf32>)
		outs(%C : memref<f32>)
%res = load %C[] : memref<f32>		%res = load %C[] : memref<f32>

dealloc %bC : memref<?xi8>		dealloc %bC : memref<?xi8>
dealloc %bB : memref<?xi8>		dealloc %bB : memref<?xi8>
dealloc %bA : memref<?xi8>		dealloc %bA : memref<?xi8>

return %res : f32		return %res : f32
}		}
Show All 15 Lines	func @matmul() -> f32 {
%bA = call @alloc_filled_f32(%c32, %f2) : (index, f32) -> (memref<?xi8>)		%bA = call @alloc_filled_f32(%c32, %f2) : (index, f32) -> (memref<?xi8>)
%bB = call @alloc_filled_f32(%c32, %f1) : (index, f32) -> (memref<?xi8>)		%bB = call @alloc_filled_f32(%c32, %f1) : (index, f32) -> (memref<?xi8>)
%bC = call @alloc_filled_f32(%c4, %f10) : (index, f32) -> (memref<?xi8>)		%bC = call @alloc_filled_f32(%c4, %f10) : (index, f32) -> (memref<?xi8>)

%A = view %bA[%c0][%c2, %c16] : memref<?xi8> to memref<?x?xf32>		%A = view %bA[%c0][%c2, %c16] : memref<?xi8> to memref<?x?xf32>
%B = view %bB[%c0][%c16, %c2] : memref<?xi8> to memref<?x?xf32>		%B = view %bB[%c0][%c16, %c2] : memref<?xi8> to memref<?x?xf32>
%C = view %bC[%c0][%c2, %c2] : memref<?xi8> to memref<?x?xf32>		%C = view %bC[%c0][%c2, %c2] : memref<?xi8> to memref<?x?xf32>

linalg.matmul %A, %B, %C : (memref<?x?xf32>, memref<?x?xf32>, memref<?x?xf32>)		linalg.matmul ins(%A, %B : memref<?x?xf32>, memref<?x?xf32>)
		outs(%C : memref<?x?xf32>)
%res = load %C[%c0, %c1] : memref<?x?xf32>		%res = load %C[%c0, %c1] : memref<?x?xf32>

dealloc %bC : memref<?xi8>		dealloc %bC : memref<?xi8>
dealloc %bB : memref<?xi8>		dealloc %bB : memref<?xi8>
dealloc %bA : memref<?xi8>		dealloc %bA : memref<?xi8>

return %res : f32		return %res : f32
}		}

// All tests return this value		// All tests return this value
// CHECK: 4.2{{0+}}e+01		// CHECK: 4.2{{0+}}e+01

mlir/test/mlir-linalg-ods-gen/test-linalg-ods-gen.tc

	// RUN: mlir-linalg-ods-gen %s -gen-ods-decl=1 \| FileCheck %s --check-prefix=ODS			// RUN: mlir-linalg-ods-gen %s -gen-ods-decl=1 \| FileCheck %s --check-prefix=ODS
	// RUN: mlir-linalg-ods-gen %s -gen-impl=1 \| FileCheck %s --check-prefix=IMPL			// RUN: mlir-linalg-ods-gen %s -gen-impl=1 \| FileCheck %s --check-prefix=IMPL

	// ODS-LABEL: def Test1Op : LinalgNamedStructured_Op<"test1", [			// ODS-LABEL: def Test1Op : LinalgStructuredBase_Op<"test1", [
	// ODS-NEXT: NInputs<2>			// ODS-NEXT: NamedStructuredOpTrait
	// ODS-NEXT: NOutputs<1>			// ODS-NEXT: AttrSizedOperandSegments
	// ODS-NEXT: SingleBlockImplicitTerminator<"YieldOp">			// ODS-NEXT: SingleBlockImplicitTerminator<"YieldOp">
	//			//
	// IMPL-LABEL: ArrayAttr Test1Op::iterator_types() {			// IMPL-LABEL: ArrayAttr Test1Op::iterator_types() {
	// IMPL: { {{.}}Parallel{{.}}, {{.}}Reduction{{.}} }			// IMPL: { {{.}}Parallel{{.}}, {{.}}Reduction{{.}} }
	//			//
	// IMPL: ArrayAttr Test1Op::indexing_maps() {			// IMPL: ArrayAttr Test1Op::indexing_maps() {
	// IMPL: AffineMap::get(2, 0, {d0, d1}, context),			// IMPL: AffineMap::get(2, 0, {d0, d1}, context),
	// IMPL-NEXT: AffineMap::get(2, 0, {d1}, context),			// IMPL-NEXT: AffineMap::get(2, 0, {d1}, context),
	// IMPL-NEXT: AffineMap::get(2, 0, {d0}, context) });			// IMPL-NEXT: AffineMap::get(2, 0, {d0}, context) });
	//			//
	// IMPL: void Test1Op::regionBuilder(Block &block) {			// IMPL: void Test1Op::regionBuilder(Block &block) {
	// IMPL: Value [[a:.]](args[0]), [[b:.]](args[1]), [[c:.*]](args[2]);			// IMPL: Value [[a:.]](args[0]), [[b:.]](args[1]), [[c:.*]](args[2]);
	// IMPL: Value [[d:.*]] = std_mulf([[a]], [[b]]);			// IMPL: Value [[d:.*]] = std_mulf([[a]], [[b]]);
	// IMPL: Value [[e:.*]] = std_addf([[c]], [[d]]);			// IMPL: Value [[e:.*]] = std_addf([[c]], [[d]]);
	// IMPL: (linalg_yield(ValueRange{ [[e]] }));			// IMPL: (linalg_yield(ValueRange{ [[e]] }));
	//			//
	ods_def<Test1Op> :			ods_def<Test1Op> :
	def test1(A: f32(M, K), B: f32(K)) -> (C: f32(M)) {			def test1(A: f32(M, K), B: f32(K)) -> (C: f32(M)) {
	C(m) = std_addf<k>(std_mulf(A(m, k), B(k)));			C(m) = std_addf<k>(std_mulf(A(m, k), B(k)));
	}			}

	// ODS-LABEL: def Test2Op : LinalgNamedStructured_Op<"test2", [			// ODS-LABEL: def Test2Op : LinalgStructuredBase_Op<"test2", [
	// ODS-NEXT: NInputs<2>			// ODS-NEXT: NamedStructuredOpTrait
	// ODS-NEXT: NOutputs<1>			// ODS-NEXT: AttrSizedOperandSegments
	// ODS-NEXT: SingleBlockImplicitTerminator<"YieldOp">			// ODS-NEXT: SingleBlockImplicitTerminator<"YieldOp">
	//			//
	// IMPL-LABEL: ArrayAttr Test2Op::iterator_types() {			// IMPL-LABEL: ArrayAttr Test2Op::iterator_types() {
	// IMPL: { {{.}}Parallel{{.}}, {{.}}Parallel{{.}}, {{.}}Reduction{{.}} }			// IMPL: { {{.}}Parallel{{.}}, {{.}}Parallel{{.}}, {{.}}Reduction{{.}} }
	//			//
	// IMPL: ArrayAttr Test2Op::indexing_maps() {			// IMPL: ArrayAttr Test2Op::indexing_maps() {
	// IMPL: AffineMap::get(3, 0, {d0, d2}, context),			// IMPL: AffineMap::get(3, 0, {d0, d2}, context),
	// IMPL-NEXT: AffineMap::get(3, 0, {d2, d1}, context),			// IMPL-NEXT: AffineMap::get(3, 0, {d2, d1}, context),
	// IMPL-NEXT: AffineMap::get(3, 0, {d0, d1}, context) });			// IMPL-NEXT: AffineMap::get(3, 0, {d0, d1}, context) });
	//			//
	// IMPL: Test2Op::regionBuilder(Block &block) {			// IMPL: Test2Op::regionBuilder(Block &block) {
	// IMPL: Value [[a:.]](args[0]), [[b:.]](args[1]), [[c:.*]](args[2]);			// IMPL: Value [[a:.]](args[0]), [[b:.]](args[1]), [[c:.*]](args[2]);
	// IMPL: Value [[d:.*]] = std_mulf([[a]], [[b]]);			// IMPL: Value [[d:.*]] = std_mulf([[a]], [[b]]);
	// IMPL: Value [[e:.*]] = std_addf([[c]], [[d]]);			// IMPL: Value [[e:.*]] = std_addf([[c]], [[d]]);
	// IMPL: (linalg_yield(ValueRange{ [[e]] }));			// IMPL: (linalg_yield(ValueRange{ [[e]] }));
	//			//
	ods_def<Test2Op> :			ods_def<Test2Op> :
	def test2(A: f32(M, K), B: f32(K, N)) -> (C: f32(M, N)) {			def test2(A: f32(M, K), B: f32(K, N)) -> (C: f32(M, N)) {
	C(m, n) = std_addf<k>(std_mulf(A(m, k), B(k, n)));			C(m, n) = std_addf<k>(std_mulf(A(m, k), B(k, n)));
	}			}

	// ODS-LABEL: def Test3Op : LinalgNamedStructured_Op<"test3", [			// ODS-LABEL: def Test3Op : LinalgStructuredBase_Op<"test3", [
	// ODS-NEXT: NInputs<2>			// ODS-NEXT: NamedStructuredOpTrait
	// ODS-NEXT: NOutputs<1>			// ODS-NEXT: AttrSizedOperandSegments
	// ODS-NEXT: SingleBlockImplicitTerminator<"YieldOp">			// ODS-NEXT: SingleBlockImplicitTerminator<"YieldOp">
	//			//
	// IMPL-LABEL: ArrayAttr Test3Op::iterator_types() {			// IMPL-LABEL: ArrayAttr Test3Op::iterator_types() {
	// IMPL: { {{.}}Parallel{{.}}, {{.}}Parallel{{.}}, {{.}}Reduction{{.}} }			// IMPL: { {{.}}Parallel{{.}}, {{.}}Parallel{{.}}, {{.}}Reduction{{.}} }
	//			//
	// IMPL: ArrayAttr Test3Op::indexing_maps() {			// IMPL: ArrayAttr Test3Op::indexing_maps() {
	// IMPL: AffineMap::get(4, 0, {d0, d1, d3}, context),			// IMPL: AffineMap::get(4, 0, {d0, d1, d3}, context),
	// IMPL-NEXT: AffineMap::get(4, 0, {d3, d2}, context),			// IMPL-NEXT: AffineMap::get(4, 0, {d3, d2}, context),
	Show All 12 Lines

mlir/tools/mlir-linalg-ods-gen/mlir-linalg-ods-gen.cpp

Show First 20 Lines • Show All 974 Lines • ▼ Show 20 Lines	public:
/// When `gen-ods-decl` is used, this prints the ODS declaration for the TC.		/// When `gen-ods-decl` is used, this prints the ODS declaration for the TC.
/// When `gen-impl` is used, this prints the C++ implementation for the extra		/// When `gen-impl` is used, this prints the C++ implementation for the extra
/// methods defined in ODS (`iterator_types`, `indexing_maps` and		/// methods defined in ODS (`iterator_types`, `indexing_maps` and
/// `regionBuilder`).		/// `regionBuilder`).
LogicalResult parseAndEmitODSDef(llvm::raw_ostream &os);		LogicalResult parseAndEmitODSDef(llvm::raw_ostream &os);

/// Print the ODS class that defines a new `cppOpName` for a `linalgOpName`.		/// Print the ODS class that defines a new `cppOpName` for a `linalgOpName`.
void printODS(llvm::raw_ostream &os, StringRef cppOpName,		void printODS(llvm::raw_ostream &os, StringRef cppOpName,
StringRef linalgOpName);		StringRef linalgOpName, ComprehensionParsingState &state);

		/// Print the C++ parser and printer for `cppOpName`.
		void printParserAndPrinter(llvm::raw_ostream &os, StringRef cppOpName);

/// Print the C++ StructuredOpsInterface impl of `iterator_types`.		/// Print the C++ StructuredOpsInterface impl of `iterator_types`.
void printReferenceIterators(llvm::raw_ostream &os, StringRef cppOpName,		void printReferenceIterators(llvm::raw_ostream &os, StringRef cppOpName,
ComprehensionParsingState &state);		ComprehensionParsingState &state);

/// Print the C++ StructuredOpsInterface impl of `indexing_maps`.		/// Print the C++ StructuredOpsInterface impl of `indexing_maps`.
void printReferenceIndexingMaps(llvm::raw_ostream &os, StringRef cppOpName,		void printReferenceIndexingMaps(llvm::raw_ostream &os, StringRef cppOpName,
ComprehensionParsingState &state);		ComprehensionParsingState &state);
▲ Show 20 Lines • Show All 422 Lines • ▼ Show 20 Lines	LogicalResult TCParser::parseAndEmitODSDef(llvm::raw_ostream &os) {
// Print.		// Print.
auto nComprehensions = perComprehensionStates.size();		auto nComprehensions = perComprehensionStates.size();
if (nComprehensions != 1) {		if (nComprehensions != 1) {
parser.emitError("only 1 comprehension supported for now, got: " +		parser.emitError("only 1 comprehension supported for now, got: " +
llvm::Twine(nComprehensions));		llvm::Twine(nComprehensions));
return failure();		return failure();
}		}
if (genODSDecl) {		if (genODSDecl) {
printODS(os, cppOpName, tcName);		auto &state = perComprehensionStates.back();
		printODS(os, cppOpName, tcName, state);
os << "\n";		os << "\n";
}		}
if (genODSImpl) {		if (genODSImpl) {
auto &state = perComprehensionStates.back();		auto &state = perComprehensionStates.back();
std::string extraMethods;		std::string extraMethods;
llvm::raw_string_ostream ss(extraMethods);		llvm::raw_string_ostream ss(extraMethods);
		printParserAndPrinter(ss, cppOpName);
printReferenceIterators(ss, cppOpName, state);		printReferenceIterators(ss, cppOpName, state);
printReferenceIndexingMaps(ss, cppOpName, state);		printReferenceIndexingMaps(ss, cppOpName, state);
printRegionBuilder(ss, cppOpName, state);		printRegionBuilder(ss, cppOpName, state);
ss.flush();		ss.flush();
os << extraMethods << "\n";		os << extraMethods << "\n";
}		}

return success();		return success();
}		}

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// Printing functions		// Printing functions
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

/// Print the ODS class that defines a new `cppOpName` for a `linalgOpName`.		/// Print the ODS class that defines a new `cppOpName` for a `linalgOpName`.
void TCParser::printODS(llvm::raw_ostream &os, StringRef cppOpName,		void TCParser::printODS(llvm::raw_ostream &os, StringRef cppOpName,
StringRef linalgOpName) {		StringRef linalgOpName,
const char *header = R"FMT( def {0} : LinalgNamedStructured_Op<"{1}", [		ComprehensionParsingState &state) {
NInputs<{2}>,		const char *header = R"FMT( def {0} : LinalgStructuredBase_Op<"{1}", [
NOutputs<{3}>,		NamedStructuredOpTrait,
		AttrSizedOperandSegments,
SingleBlockImplicitTerminator<"YieldOp">]> {		SingleBlockImplicitTerminator<"YieldOp">]> {
let arguments = (ins Variadic<LinalgOperand>:$views);		let arguments = (ins Variadic<AnyShaped>:$inputs,
		Variadic<AnyMemRef>:$output_buffers,
		Variadic<AnyRankedTensor>:$init_tensors);
let results = (outs Variadic<AnyRankedTensor>:$output_tensors);		let results = (outs Variadic<AnyRankedTensor>:$output_tensors);
let regions = (region SizedRegion<1>:$region);		let regions = (region AnyRegion:$region);

		// Format uses a custom return to parse an optional `:` type-list.
		// Format uses a custom region to elide the programmatically constructed
		// region.
		let assemblyFormat = [{
		attr-dict
		`ins` `(` $inputs `:` type($inputs) `)`
		(`outs` `(` $output_buffers^ `:` type($output_buffers) `)`)?
		(`init` `(` $init_tensors^ `:` type($init_tensors) `)`)?
		custom<NamedStructuredOpResults>(
		type($output_tensors))
		custom<{0}NamedStructuredOpRegion>(
		$region,
		type_ref($inputs),
		type_ref($output_buffers),
		type_ref($init_tensors),
		type_ref($output_tensors))
		}];

let builders = [OpBuilder<		let builders = [ OpBuilder<
"OpBuilder &b, OperationState &result, TypeRange outputTypes, "		"OpBuilder &b, OperationState &result,"
# "ValueRange views",		"ValueRange inputs, ValueRange outputBuffers",
[{{		[{{
result.addOperands(views);		result.addOperands(inputs);
result.addTypes(outputTypes);		result.addOperands(outputBuffers);
		result.addAttribute(
		"operand_segment_sizes",
		b.getI32VectorAttr({{static_cast<int32_t>(inputs.size()),
		static_cast<int32_t>(outputBuffers.size()),
		static_cast<int32_t>(0)}));
buildNamedStructuredOpRegionAndAttributes<{0}>(		buildNamedStructuredOpRegionAndAttributes<{0}>(
b, result, TypeRange(views), outputTypes);		b,
		result,
		TypeRange(inputs),
		TypeRange(outputBuffers),
		TypeRange(),
		TypeRange());
		}]>, OpBuilder<
		"OpBuilder &b, OperationState &result, TypeRange resultTensorTypes,"
		"ValueRange inputs, ValueRange outputBuffers, ValueRange initTensors",
		[{{
		result.addOperands(inputs);
		result.addOperands(outputBuffers);
		result.addOperands(initTensors);
		result.addTypes(resultTensorTypes);
		result.addAttribute(
		"operand_segment_sizes",
		b.getI32VectorAttr({{static_cast<int32_t>(inputs.size()),
		static_cast<int32_t>(outputBuffers.size()),
		static_cast<int32_t>(initTensors.size())}));
		buildNamedStructuredOpRegionAndAttributes<{0}>(
		b,
		result,
		TypeRange(inputs),
		TypeRange(outputBuffers),
		TypeRange(initTensors),
		resultTensorTypes);
}]>		}]>
];		];
let parser = [{
return ::parseNamedStructuredOp<{0}>(parser, result);		let verifier = [{{ return ::verifyNamedStructuredOp(*this); }];
}];		let hasFolder = 1;
		let hasCanonicalizer = 1;

let extraClassDeclaration = [{{		let extraClassDeclaration = [{{
		// Auto-generated.
ArrayAttr iterator_types();		ArrayAttr iterator_types();
ArrayAttr indexing_maps();		ArrayAttr indexing_maps();
static void regionBuilder(Block &block);		static void regionBuilder(Block &block);

		// Generic methods.
		static unsigned getNumRegionArgs() {{ return {4}; }
std::string getLibraryCallName() {{		std::string getLibraryCallName() {{
return generateLibraryCallName(getOperation());		return generateLibraryCallName(getOperation());
}		}
}];		}];
})FMT";		})FMT";

unsigned nInputs = 0, nOutputs = 0;		unsigned nInputs = 0, nOutputs = 0;
for (auto &t : registeredTensors) {		for (auto &t : registeredTensors) {
if (t.getValue().isOutput)		if (t.getValue().isOutput)
nOutputs++;		nOutputs++;
else		else
nInputs++;		nInputs++;
}		}

os << llvm::formatv(header, cppOpName, linalgOpName, nInputs, nOutputs);		os << llvm::formatv(header, cppOpName, linalgOpName, nInputs, nOutputs,
		state.orderedTensorArgs.size());
		}

		/// Print the C++ parser and printer for `cppOpName`.
		void TCParser::printParserAndPrinter(llvm::raw_ostream &os,
		StringRef cppOpName) {
		const char *parserAndPrinterFmt =
		R"FMT(
		static ParseResult parse{0}NamedStructuredOpRegion(
		OpAsmParser &parser,
		Region &region,
		TypeRange inputOperands,
		TypeRange outputBufferOperands,
		TypeRange initTensorOperands,
		TypeRange results) {{
		return parseNamedStructuredOpRegion<{0}>(
		parser,
		region,
		inputOperands,
		outputBufferOperands,
		initTensorOperands,
		results);
		}
		static void print{0}NamedStructuredOpRegion(
		OpAsmPrinter &printer,
		Region &region,
		TypeRange inputOperands,
		TypeRange outputBufferOperands,
		TypeRange initTensorOperands,
		TypeRange results) {{
		// noop
		}
		)FMT";

		os << llvm::formatv(parserAndPrinterFmt, cppOpName);
}		}

/// Print the C++ StructuredOpsInterface impl of `iterator_types`.		/// Print the C++ StructuredOpsInterface impl of `iterator_types`.
void TCParser::printReferenceIterators(llvm::raw_ostream &os,		void TCParser::printReferenceIterators(llvm::raw_ostream &os,
StringRef cppOpName,		StringRef cppOpName,
ComprehensionParsingState &state) {		ComprehensionParsingState &state) {
const char *referenceReferenceIteratorsFmt =		const char *referenceReferenceIteratorsFmt =
R"FMT(		R"FMT(
▲ Show 20 Lines • Show All 182 Lines • ▼ Show 20 Lines	int main(int argc, char **argv) {
std::unique_ptr<llvm::ToolOutputFile> output =		std::unique_ptr<llvm::ToolOutputFile> output =
openOutputFile(outputFilename, &errorMessage);		openOutputFile(outputFilename, &errorMessage);
if (!output) {		if (!output) {
llvm::errs() << errorMessage << "\n";		llvm::errs() << errorMessage << "\n";
exit(1);		exit(1);
}		}

// Include the proper Linalg header for end-to-end tblgen testing without		// Include the proper Linalg header for end-to-end tblgen testing without
// resorting to non-portable shgell manipulations.		// resorting to non-portable shell manipulations.
if (testEmitIncludeTdHeader)		if (testEmitIncludeTdHeader)
output->os() << "include \"mlir/Dialect/Linalg/IR/LinalgStructuredOps.td\"";		output->os() << "include \"mlir/Dialect/Linalg/IR/LinalgStructuredOps.td\"";

MLIRContext context(/loadAllDialects=/false);		MLIRContext context(/loadAllDialects=/false);
llvm::SourceMgr mgr;		llvm::SourceMgr mgr;
mgr.AddNewSourceBuffer(std::move(file), llvm::SMLoc());		mgr.AddNewSourceBuffer(std::move(file), llvm::SMLoc());
Parser parser(mgr, &context);		Parser parser(mgr, &context);
parseAndEmitAllTensorComprehensions(output->os(), parser);		parseAndEmitAllTensorComprehensions(output->os(), parser);
output->keep();		output->keep();

return 0;		return 0;
}		}

This is an archive of the discontinued LLVM Phabricator instance.

[mlir][Linalg] Evolve named ops to use assembly form and support linalg on tensors.AbandonedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 292525

mlir/docs/Dialects/Linalg.md

mlir/include/mlir/Dialect/Linalg/IR/LinalgOps.h

mlir/include/mlir/Dialect/Linalg/IR/LinalgStructuredOps.td

mlir/include/mlir/Dialect/Linalg/IR/LinalgStructuredOpsInterface.td

mlir/include/mlir/Dialect/Linalg/IR/LinalgTraits.h

mlir/include/mlir/Dialect/Shape/IR/ShapeBase.td

mlir/include/mlir/IR/OpBase.td

mlir/integration_test/Dialect/Linalg/CPU/test-conv-1d-call.mlir

mlir/integration_test/Dialect/Linalg/CPU/test-conv-1d-ncw-call.mlir

mlir/integration_test/Dialect/Linalg/CPU/test-conv-1d-nwc-call.mlir

mlir/integration_test/Dialect/Linalg/CPU/test-conv-2d-call.mlir

mlir/integration_test/Dialect/Linalg/CPU/test-conv-2d-nchw-call.mlir

mlir/integration_test/Dialect/Linalg/CPU/test-conv-2d-nhwc-call.mlir

mlir/integration_test/Dialect/Linalg/CPU/test-conv-3d-call.mlir

mlir/integration_test/Dialect/Linalg/CPU/test-conv-3d-ncdhw-call.mlir

mlir/integration_test/Dialect/Linalg/CPU/test-conv-3d-ndhwc-call.mlir

mlir/lib/Dialect/Linalg/IR/LinalgOps.cpp

mlir/lib/Dialect/Linalg/IR/LinalgTypes.cpp

mlir/test/Conversion/LinalgToVector/linalg-to-vector.mlir

mlir/test/Dialect/Linalg/affine.mlir

mlir/test/Dialect/Linalg/canonicalize.mlir

mlir/test/Dialect/Linalg/fold-affine-min-scf.mlir

mlir/test/Dialect/Linalg/fusion-2-level.mlir

mlir/test/Dialect/Linalg/fusion.mlir

mlir/test/Dialect/Linalg/invalid.mlir

mlir/test/Dialect/Linalg/loops.mlir

mlir/test/Dialect/Linalg/promote.mlir

mlir/test/Dialect/Linalg/promotion_options.mlir

mlir/test/Dialect/Linalg/roundtrip.mlir

mlir/test/Dialect/Linalg/standard.mlir

mlir/test/Dialect/Linalg/tile-and-distribute.mlir

mlir/test/Dialect/Linalg/tile.mlir

mlir/test/Dialect/Linalg/tile_parallel_reduce.mlir

mlir/test/Dialect/Linalg/transform-patterns-matmul-to-vector.mlir

mlir/test/Dialect/Linalg/transform-patterns.mlir

mlir/test/IR/slice.mlir

mlir/test/lib/Dialect/Test/TestOps.td

mlir/test/mlir-cpu-runner/linalg_integration_test.mlir

mlir/test/mlir-linalg-ods-gen/test-linalg-ods-gen.tc

mlir/tools/mlir-linalg-ods-gen/mlir-linalg-ods-gen.cpp

[mlir][Linalg] Evolve named ops to use assembly form and support linalg on tensors.
AbandonedPublic