
[mlir][Linalg] Create a named batch_matmul op and pipe it through.
ClosedPublic

Authored by nicolasvasilache on Apr 16 2020, 1:37 PM.

Details

Summary

This revision is the first in a set of improvements that aim at allowing
more generalized named Linalg op generation from a mathematical
specification.

This revision allows creating a new op and checks that the parser,
printer and verifier are hooked up properly.

This opened up a few design points that will be addressed in the future:

  1. A named Linalg op has a static region builder instead of an explicitly parsed region. This is not currently compatible with assemblyFormat, so a custom parser and printer are needed.
  2. The convention for structured ops and tensor return values needs to evolve to allow tensor-land and buffer-land specifications to agree.
  3. ReferenceIndexingMaps and referenceIterators will need to become static to allow building attributes at parse time.
  4. Error messages will be improved once we have 3. and pretty-print in the custom form.

Diff Detail

Event Timeline

Herald added a project: Restricted Project. Apr 16 2020, 1:37 PM
nicolasvasilache edited the summary of this revision. Apr 16 2020, 2:59 PM

Building a region from an OperationState at parse time is not straightforward because the region is not yet attached to an op, which is required to fill it with a builder. A temporary fake op is created for this purpose and the resulting region is taken from it.

Is this comment relevant to this revision? What region are you trying to "build" at parse time here? Why can't the typical addRegion() followed by parseRegionBody be used?

mlir/include/mlir/Dialect/Linalg/IR/LinalgTraits.h
123–124

of -> or
buffer -> memref

mlir/lib/Dialect/Linalg/IR/LinalgOps.cpp
273

You don't need getBlocks() - region.front() will work.

mlir/test/Dialect/Linalg/roundtrip.mlir
628

For the custom format, better to drop the parentheses
linalg.batchmatmul %a3, %b3, %c3 :

FWIW, it'd be better with an underscore batch_matmul.

Building a region from an OperationState at parse time is not straightforward because the region is not yet attached to an op, which is required to fill it with a builder. A temporary fake op is created for this purpose and the resulting region is taken from it.

I now see the relevant method. But to use the builder, you only need a block or a region, and not an op(?). You could do an addRegion(), followed by pushing a new block into it, and then create the builder to insert. Am I missing something?

nicolasvasilache marked 5 inline comments as done. Apr 17 2020, 7:43 AM

I now see the relevant method. But to use the builder, you only need a block or a region, and not an op(?). You could do an addRegion(), followed by pushing a new block into it, and then create the builder to insert. Am I missing something?

This is what I started with, but along the way I realized that region.getContext() requires the region to be attached to an Operation*.
See: https://github.com/llvm/llvm-project/blob/master/mlir/include/mlir/IR/Region.h#L31

However, this was a red herring, due only to the fact that I was constructing the OpBuilder from the region.
I.e., this fails:

Region *region = result.addRegion();
OpBuilder opBuilder(region);

This works fine:

Region *region = result.addRegion();
OpBuilder opBuilder(context);
opBuilder.setInsertionPoint(&region->front(), region->front().begin());

Thanks for noting!

mlir/include/mlir/Dialect/Linalg/IR/LinalgTraits.h
123–124

First typo is corrected.
For the second, Linalg uses the terminology tensors and buffers consistently.
Memref is a particular implementation of buffers, the only one supported for now.

mlir/test/Dialect/Linalg/roundtrip.mlir
628

Fair enough.

nicolasvasilache edited the summary of this revision. Apr 17 2020, 7:45 AM

I now see the relevant method. But to use the builder, you only need a block or a region, and not an op(?). You could do an addRegion(), followed by pushing a new block into it, and then create the builder to insert. Am I missing something?

This is what I started with, but along the way I realized that region.getContext() requires the region to be attached to an Operation*.
See: https://github.com/llvm/llvm-project/blob/master/mlir/include/mlir/IR/Region.h#L31

However, this was a red herring, due only to the fact that I was constructing the OpBuilder from the region.
I.e., this fails:

Region *region = result.addRegion();
OpBuilder opBuilder(region);

This works fine:

Region *region = result.addRegion();
OpBuilder opBuilder(context);
opBuilder.setInsertionPoint(&region->front(), region->front().begin());

Thanks for noting!

That's great. In fact, the following is less verbose and should work:

auto opBuilder = OpBuilder::atBlockBegin(&region->front());
nicolasvasilache marked 2 inline comments as done. Apr 17 2020, 11:35 AM

That's great. In fact, the following is less verbose and should work:

auto opBuilder = OpBuilder::atBlockBegin(&region->front());

It doesn't, unfortunately, because of exactly the same problem: there is no context in the block or the region.
The OpBuilder must be constructed with a context and then step into the region.

Address comments.

That's great. In fact, the following is less verbose and should work:

auto opBuilder = OpBuilder::atBlockBegin(&region->front());

It doesn't, unfortunately, because of exactly the same problem: there is no context in the block or the region.
The OpBuilder must be constructed with a context and then step into the region.

I see - it won't have the context with the block-based ctors either. Nit: you could then just use setInsertionPointToStart(block) after the context-arg ctor.

bondhugula requested changes to this revision. Apr 17 2020, 12:17 PM

Mostly superficial comments...

mlir/include/mlir/Dialect/Linalg/IR/LinalgStructuredOpsInterface.td
94

of -> or

mlir/lib/Dialect/Linalg/IR/LinalgOps.cpp
763–764

Doc comments please.

791–794

All of this looks problematic. Do you need this on YieldOp's verifier instead of LinalgOp's verifier? (Check for its terminator there and verify?)

1086

Nit: bodyRegion is weird. Just region is good or opRegion.

1097

setInsertionPointToStart(&bodyRegion.front())

1104

Micronit: " " -> ' '; likewise below.

mlir/lib/Dialect/Linalg/Transforms/LinalgToLoops.cpp
124

Doc comment please.

This revision now requires changes to proceed. Apr 17 2020, 12:17 PM
mravishankar added inline comments. Apr 17 2020, 12:54 PM
mlir/include/mlir/Dialect/Linalg/IR/LinalgNamedStructuredOpsSpec.tc
3

What is the reference for this specification? ONNX/TF both seem to have a batch dimension for B as well. Without that, this is effectively broadcasting B.

silvas added inline comments. Apr 17 2020, 1:40 PM
mlir/include/mlir/Dialect/Linalg/IR/LinalgNamedStructuredOpsSpec.tc
3

This isn't enough to legalize e.g. tf.BatchMatMul or torch.matmul, which allow leading batch dimensions on both sides.

https://www.tensorflow.org/api_docs/cc/class/tensorflow/ops/batch-mat-mul
https://pytorch.org/docs/stable/torch.html#torch.matmul

In IREE we have a batch matmul op that handles batch on both sides:
https://github.com/google/iree/blob/f80f39c7e96c2af15741e9c774eb8b54bf38df28/iree/compiler/Dialect/VMLA/IR/VMLAOps.td#L323

I expect that in a typical lowering flow, we will legalize tf.BatchMatMul or torch.matmul by reshaping all the batch dimensions into a single dimension on both sides (possibly a dummy "1" dimension in case of no batch on one side). Then we can expand this op into generic form and fuse/clean up those reshapes, which will eliminate batch dimensions on either side.
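The flow described above can be checked numerically. Below is a plain-Python sketch (nested lists standing in for tensors; the helper names are illustrative, not from the patch or any framework API): flatten the leading batch dimensions of both operands into a single batch dimension, run a one-batch-dim batch matmul, then expand the result back.

```python
def matmul(a, b):
    # 2-D matmul on nested lists: a is (M, K), b is (K, N).
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]

# Both operands carry batch shape (2, 2) of 2x2 matrices.
A = [[[[1, 2], [3, 4]], [[5, 6], [7, 8]]],
     [[[9, 1], [2, 3]], [[4, 5], [6, 7]]]]
B = [[[[1, 0], [0, 1]], [[2, 1], [1, 2]]],
     [[[0, 1], [1, 0]], [[3, 0], [0, 3]]]]

# Legalize: flatten the batch dims (2, 2) -> (4,) on both sides ...
flatA = [m for row in A for m in row]
flatB = [m for row in B for m in row]
# ... run a single-batch-dim batch matmul ...
flatC = [matmul(a, b) for a, b in zip(flatA, flatB)]
# ... and expand the single batch dim back to (2, 2).
C = [flatC[0:2], flatC[2:4]]

# Compare against computing each batch entry in place.
direct = [[matmul(A[i][j], B[i][j]) for j in range(2)] for i in range(2)]
assert C == direct
```

The reshape/expand pair here is exactly the shape bookkeeping the fused reshapes in the lowering would carry.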

I don't see a situation where we would create this op.

My intuition is that batch matmul with a batch dimension only on one side is not that interesting, because fundamentally it is the same as a regular matmul: you just fold the batch dimension into the free dimension of the respective operand (e.g. in the case you have here, you can reshape the two dimensions Batch, M in the LHS into a single dimension of extent Batch*M). Batch matmul is only interesting from a lowering perspective when you have a batch dimension on both sides, which introduces distinct data-reuse behavior compared to a normal matmul.
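The folding argument above can be verified with a small plain-Python example (illustrative code, not from the patch): with a batch dimension only on the LHS, batch matmul gives the same result as a plain matmul after reshaping (Batch, M, K) into (Batch*M, K).

```python
def matmul(a, b):
    # 2-D matmul on nested lists: a is (M, K), b is (K, N).
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]

Batch, M, K, N = 2, 3, 4, 5
A = [[[i + j + k for k in range(K)] for j in range(M)] for i in range(Batch)]
B = [[k * n + 1 for n in range(N)] for k in range(K)]

# One-sided batch matmul: C[b] = A[b] @ B.
batched = [matmul(Ab, B) for Ab in A]

# Fold Batch into the free dimension M: (Batch, M, K) -> (Batch*M, K).
flat = matmul([row for Ab in A for row in Ab], B)

# Unfold (Batch*M, N) back to (Batch, M, N) and compare.
unfolded = [flat[b * M:(b + 1) * M] for b in range(Batch)]
assert unfolded == batched
```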

So in terms of defining a set of "primitives" or lowering to library calls (e.g. https://devblogs.nvidia.com/cublas-strided-batched-matrix-multiply/), having a batch on both sides seems to be the only relevant case. So I would recommend defining this as:

def batch_matmul(A: f32(Batch, M, K), B: f32(Batch, K, N)) -> (C: f32(Batch, M, N)) {
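For reference, the semantics this signature implies, C(b, m, n) accumulating A(b, m, k) * B(b, k, n) over k, can be written out as a plain-Python sketch (illustrative only; the actual TC body is not shown in this thread):

```python
def batch_matmul(A, B):
    # C[b][m][n] = sum over k of A[b][m][k] * B[b][k][n],
    # with a batch dimension on both operands.
    Batch, M, K = len(A), len(A[0]), len(A[0][0])
    N = len(B[0][0])
    return [[[sum(A[b][m][k] * B[b][k][n] for k in range(K))
              for n in range(N)] for m in range(M)] for b in range(Batch)]

A = [[[1, 2], [3, 4]], [[5, 6], [7, 8]]]   # (Batch=2, M=2, K=2)
B = [[[1, 0], [0, 1]], [[2, 0], [0, 2]]]   # (Batch=2, K=2, N=2)
C = batch_matmul(A, B)
assert C[0] == [[1, 2], [3, 4]]            # batch 0: multiply by identity
assert C[1] == [[10, 12], [14, 16]]        # batch 1: multiply by 2*identity
```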
nicolasvasilache marked 11 inline comments as done.

Address review.
Hacking tablegen.

mlir/include/mlir/Dialect/Linalg/IR/LinalgNamedStructuredOpsSpec.tc
3

It's just something to get started; the semantics are variadic and will require extensions.

Once these are implemented, it will be easy to update.
If you have a strong preference for another op, let me know what you'd prefer (an op that also exercises reduction).
It can't be dot/matvec/matmul for now because those names are already taken and more work is needed to replace them.

mlir/lib/Dialect/Linalg/IR/LinalgOps.cpp
791–794

This is consistent with LoopOps::YieldOp and ReturnOp; what would justify diverging from that in Linalg specifically?
If relevant, this seems like it would warrant a global change.
Note that LinalgOp is an interface, though, not an Op per se.
I'll have a follow-up NFC to rename globally.

Still, I was missing the SingleBlockImplicitTerminator<"YieldOp"> trait, so I added it where relevant and updated some tests.

nicolasvasilache marked an inline comment as done. Apr 17 2020, 9:06 PM

The tablegen story is awful atm, suggestions most welcome.

mlir/include/mlir/Dialect/Linalg/IR/LinalgNamedStructuredOpsSpec.tc
3

I went with @silvas' suggestion; we can iterate on the semantics later once we have variadic support.

Another tablegen hack at a distance.

Using @mehdi_amini's version for CMake dependencies.

Ok, the cmake story now seems as good as it is going to get in the near future.

Add missing comment.

Minor thing: please update 'batch_matmul' in revision title.

mravishankar added inline comments. Apr 18 2020, 10:16 PM
mlir/include/mlir/Dialect/Linalg/IR/LinalgNamedStructuredOpsSpec.tc
3

Thanks for the update. I am not sure I follow what "the semantics are variadic" implies, i.e. I don't see anything variadic about the op as defined here, but I might be misreading the terms.
My concern was merely whether the named ops are supposed to have implicit broadcast semantics (in theory they can, but that seems to lead to complications with things like dynamic broadcasting, etc., based on discussion on Discourse). As it was defined previously, I read B as having broadcast semantics. Anyway, it's OK now, so thanks for taking care of it.

mravishankar added inline comments. Apr 18 2020, 11:28 PM
mlir/include/mlir/Dialect/Linalg/IR/LinalgNamedStructuredOpsSpec.tc
3

Actually, strike the last comment. The spec has nothing to do with broadcasting. But the current spec is indeed preferable.

ftynse accepted this revision. Apr 20 2020, 7:11 AM
ftynse added inline comments.
mlir/include/mlir/Dialect/Linalg/IR/LinalgTraits.h
351

Nit: why plural "Traits"?

356

Nit: consider adding a short doc and/or vertical whitespace around declarations

mlir/include/mlir/Dialect/Linalg/Transforms/CMakeLists.txt
6

Nit typo: depend

nicolasvasilache retitled this revision from "[mlir][Linalg] Create a named batchmatmul op and pipe it through." to "[mlir][Linalg] Create a named batch_matmul op and pipe it through.". Apr 20 2020, 7:17 AM
bondhugula marked an inline comment as done. Apr 20 2020, 1:16 PM
bondhugula added inline comments.
mlir/lib/Dialect/Linalg/IR/LinalgOps.cpp
791–794

I actually meant to highlight the lines above as well (788-789). This change will go against unifying the loop dialect and Linalg dialect yield ops (including any others) into a single std.yield - but we don't have to worry about that now. It can easily be adjusted when/if an std.yield is added later.

This revision was not accepted when it landed; it landed in state "Needs Review". Apr 21 2020, 9:42 AM
This revision was automatically updated to reflect the committed changes.