This is an archive of the discontinued LLVM Phabricator instance.

Differential D87509

[MLIR][Linalg] Add minimal support for linalg on tensors with one reduction and one result.
Abandoned · Public

Authored by nicolasvasilache on Sep 11 2020, 6:02 AM.

Details

Reviewers

ftynse
pifon2a
mravishankar
stellaraccident
silvas
benvanik

Summary

This revision allows representing a minimal reduction at the level of linalg on tensors.
When a structured op has a reduction and returns tensor(s), the conventions are:

1. it can only return a single tensor
2. it cannot have any output buffer operand
3. as a consequence of points 1. + 2., it must have exactly one output
4. its last input argument must be a tensor of the same shape and with the same indexing map as its output.

Points 1-3 keep the complexity of the representation in check by allowing only one result tensor when reductions are present.

Point 4 is related to the fact that SSA values cannot represent in-place updates.
Instead, linalg adopts a convention similar to the one used in e.g. vector.outerproduct: the value that is reduced into is passed as an explicit argument and a new result of the same shape is produced.

It is expected that buffer allocation will fold this last input onto the result in a single output buffer, which is why linalg requires the same indexing map: the last input operand is "tied" to the result.

An alternative, more complex representation would allow for multiple results and arbitrary tied input/result pairs, as well as relaxing the conditions on the indexing map equalities on the pairs. This is deemed unnecessarily complex for now and is left for a future discussion.
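
The conventions listed in this summary can be pictured with a small standalone checker. This is only a sketch under simplifying assumptions: `OpSketch`, `MapSketch` and `checkTiedReduction` are hypothetical names that are not part of this revision or of MLIR, an indexing map is reduced to the list of loop positions it reads, and tensor shapes are not modelled at all.

```cpp
#include <algorithm>
#include <cassert>
#include <string>
#include <vector>

// Hypothetical, simplified model of a structured op (not an MLIR class).
// An indexing map is modelled as the list of loop positions it reads.
using MapSketch = std::vector<unsigned>;

struct OpSketch {
  std::vector<std::string> iteratorTypes; // "parallel" or "reduction" per loop
  std::vector<MapSketch> inputMaps;       // one map per input operand
  std::vector<MapSketch> outputMaps;      // output buffers first, then results
  unsigned numOutputBuffers = 0;
  unsigned numResults = 0;
};

// Returns an empty string when the conventions hold, otherwise a short reason.
std::string checkTiedReduction(const OpSketch &op) {
  assert(op.outputMaps.size() == op.numOutputBuffers + op.numResults &&
         "one indexing map per output expected");
  bool hasReduction =
      std::find(op.iteratorTypes.begin(), op.iteratorTypes.end(),
                "reduction") != op.iteratorTypes.end();
  // The conventions only apply to tensor-returning ops with a reduction.
  if (op.numResults == 0 || !hasReduction)
    return "";
  // Point 1: a single tensor result.
  if (op.numResults != 1)
    return "expected a single tensor result";
  // Point 2 (and hence point 3): no output buffer operand.
  if (op.numOutputBuffers != 0)
    return "expected no output buffer operand";
  // Point 4: the last input is tied to the result through an equal map.
  if (op.inputMaps.empty())
    return "expected a last input tied to the result";
  if (op.inputMaps.back() != op.outputMaps.front())
    return "expected the last input map to match the result map";
  return "";
}
```

With three loops typed parallel, parallel, reduction, input maps `{1, 0, 2}` and `{0, 1}`, and a single result map `{0, 1}`, the checker returns an empty string; changing the last input map to `{0}` makes it report the mismatch.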

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

nicolasvasilache created this revision. Sep 11 2020, 6:02 AM

Herald added a project: Restricted Project. Sep 11 2020, 6:02 AM

Herald added subscribers: msifontes, jurahul, Kayjukh and 12 others.

nicolasvasilache requested review of this revision. Sep 11 2020, 6:02 AM

Herald added subscribers: limo1996, stephenneuendorffer. Sep 11 2020, 6:02 AM

Harbormaster completed remote builds in B71358: Diff 291192. Sep 11 2020, 6:28 AM

Fix bug, add helpers and a roundtrip test.

Harbormaster completed remote builds in B71377: Diff 291236. Sep 11 2020, 9:00 AM

Fix test

Harbormaster completed remote builds in B71385: Diff 291245. Sep 11 2020, 9:29 AM

ftynse accepted this revision. Sep 11 2020, 9:46 AM

ftynse added inline comments.

mlir/docs/Dialects/Linalg.md
467	Does the tag actually work?
477	It's confusing that `args_out` comprises the number of both actual outputs and operands-used-as-outputs. `args` is implicitly associated with "arguments". Consider clarifying this point in the text.
479	Can't other ShapedType operands appear after as additional operands?
502	expected that
mlir/include/mlir/Dialect/Linalg/IR/LinalgStructuredOpsInterface.td
27	Documentation does not correspond to what the function does.
66	I wonder if there is ever a non-default implementation? If not, I'd rather put them into `extraClassDeclarations` that are much easier to read.
mlir/include/mlir/Dialect/Linalg/IR/LinalgTraits.h
96	Are there other intended uses of `getReductionDims`? Otherwise, it looks like it is constructing AffineDimExpr only to extract the position back from it, and it would have been simpler to just return positions directly.
110	Spurious comment?
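
The alternative raised in the comment on LinalgTraits.h line 96, returning loop positions rather than `AffineDimExpr` values, might look roughly like the sketch below. It is an illustration only: `getDimPositionsOfType` is a hypothetical name that is not part of this revision, and iterator types are modelled as plain strings instead of the op's `iterator_types` attribute.

```cpp
#include <string>
#include <vector>

// Hypothetical helper, not part of the revision: collect the positions of
// the loops whose iterator type equals `iteratorTypeName`.
std::vector<unsigned>
getDimPositionsOfType(const std::vector<std::string> &iteratorTypes,
                      const std::string &iteratorTypeName) {
  std::vector<unsigned> positions;
  for (unsigned dim = 0; dim < iteratorTypes.size(); ++dim)
    if (iteratorTypes[dim] == iteratorTypeName)
      positions.push_back(dim);
  return positions;
}
```

A caller such as the trait verifier could then test each returned position against the output map directly, without building an expression and reading its position back.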

This revision is now accepted and ready to land. Sep 11 2020, 9:46 AM

mehdi_amini added inline comments. Sep 12 2020, 7:44 PM

mlir/docs/Dialects/Linalg.md
467	It shouldn't be necessary: the HTML generator is adding a tag with the title name for each section.
479	I'm not following what are the `args_out` after the `n` results?

Herald added a subscriber: tatianashp. Sep 12 2020, 7:44 PM

Thanks for your reviews, I have something better planned so I am abandoning this one.

Revision Contents

Path	Size
mlir/docs/Dialects/Linalg.md	52 lines
mlir/include/mlir/Dialect/Linalg/IR/LinalgStructuredOpsInterface.td	118 lines
mlir/include/mlir/Dialect/Linalg/IR/LinalgTraits.h	51 lines
mlir/test/Dialect/Linalg/invalid.mlir	85 lines
mlir/test/Dialect/Linalg/roundtrip.mlir	37 lines

Diff 291245

mlir/docs/Dialects/Linalg.md

	[34 lines not shown]

	## High-Level Description of Linalg Ops<a name="linalg_ops"></a>			## High-Level Description of Linalg Ops<a name="linalg_ops"></a>
	Linalg takes at least some inspiration from all previously [listed prior			Linalg takes at least some inspiration from all previously [listed prior
	art](#prior_art). The design enables the definition of *CustomOps* with			art](#prior_art). The design enables the definition of *CustomOps* with
	generic properties that enable [key transformations](#key_transformations),			generic properties that enable [key transformations](#key_transformations),
	including lowering to scalar load/store and other operations or to external			including lowering to scalar load/store and other operations or to external
	library calls and intrinsics.			library calls and intrinsics.

	These ops can have *either tensor or buffer operands*.			These ops can have *either tensor or buffer operands*, subject to
				[conventions and limitations](#tensors_and_buffers).

	### Payload-Carrying Ops<a name="payload_ops"></a>			### Payload-Carrying Ops<a name="payload_ops"></a>
	Linalg defines two payload carrying operations that implement the [structured ops](			Linalg defines two payload carrying operations that implement the [structured ops](
	https://docs.google.com/presentation/d/1P-j1GrH6Q5gLBjao0afQ-GfvcAeF-QU4GXXeSy0eJ9I/edit#slide=id.p			https://docs.google.com/presentation/d/1P-j1GrH6Q5gLBjao0afQ-GfvcAeF-QU4GXXeSy0eJ9I/edit#slide=id.p
	) abstraction on tensors and buffers. This is architected as two generic operations			) abstraction on tensors and buffers. This is architected as two generic operations
	`linalg.generic` (resp. `linalg.indexed_generic`) that can express custom			`linalg.generic` (resp. `linalg.indexed_generic`) that can express custom
	operations with index-free semantics (resp. indexing semantics).			operations with index-free semantics (resp. indexing semantics).
	The properties of these generic ops are the result of applying the			The properties of these generic ops are the result of applying the
	[406 lines not shown]
	automatically while still maintaining the [core guiding			automatically while still maintaining the [core guiding
	principles](#guiding_principles).			principles](#guiding_principles).

	For the time being, we have settled on the combination of these properties			For the time being, we have settled on the combination of these properties
	because of empirical evidence building and working on multiple high-level			because of empirical evidence building and working on multiple high-level
	compilers. As we lay those down and engage more with the community, we expect			compilers. As we lay those down and engage more with the community, we expect
	multiple rounds of discussions and design changes to the original architecture.			multiple rounds of discussions and design changes to the original architecture.

				### Tensors and Buffers: Conventions and Limitations <a name="tensors_and_buffers"></a>
				ftynse (inline comment): Does the tag actually work?
				mehdi_amini (inline comment): It shouldn't be necessary: the HTML generator is adding a tag with the title name for each section.

				Tensors are immutable SSA values, buffers are mutable regions of memory subject
				to side-effects and aliasing. As a consequence, output buffers are passed as
				operands whereas output tensors are new SSA values corresponding to op results.
				Inputs can be arbitrary tensors or buffers and are always passed as operands.
				The convention adopted is as follows:

				1. The first `[0 .. args_in)` op operands are read-only input ShapedType.
				2. The `n` results are write-only output RankedTensorType. Note that `n <=
				args_out`.
				ftynse (inline comment): It's confusing that `args_out` comprises the number of both actual outputs and operands-used-as-outputs. `args` is implicitly associated with "arguments". Consider clarifying this point in the text.
				3. The operands `[args_in .. args_in + args_out - n)` are MemRefType buffers.
				4. Other non-ShapedType operands may appear as operands `[args_in + args_out -
				n .. getNumOperands())`
				ftynse (inline comment): Can't other ShapedType operands appear after as additional operands?
				mehdi_amini (inline comment): I'm not following what are the `args_out` after the `n` results?

				In the case of structured ops with fully parallel semantics, inputs and outputs
				can be tensors or buffers without requiring additional constraints.

				Structured ops with reduction semantics and output tensor(s) however have
				additional restrictions:

				1. they can only return a single tensor
				2. they cannot have any output buffer operand
				3. as a consequence of points 1. + 2., they must have exactly one output
				4. their last input argument must be a tensor of the same shape and with the
				same indexing map as their unique output tensor.

				Points 1. - 3. keep complexity of the representation in check by allowing only 1
				result tensor, when reductions are present.

				Point 4 is related to the fact that SSA values cannot represent in-place
				updates. Instead, linalg adopts a similar convention that exists in e.g.
				`vector.outerproduct`: the value that is reduced into is passed as an explicit
				argument and a new result of the same shape is produced.

				It is expected buffer allocation will fold this last input onto the result in a
				single output buffer argument, which is why the same indexing map is required:
				the last input operand is said to be "tied" to the result.
				ftynse (inline comment, on "It is expected buffer allocation"): expected that

				Alternative, more complex representations, would allow for:

				1. Multiple tensor results and tied inputs in arbitrary orders, that could be
				captured by an ArrayAttr of position pairs.
				2. Relaxing the conditions on the indexing map equalities on the each pair and
				e.g. allow implicit broadcasts of the input.

				These representations are deemed unnecessarily complex for now and are left for
				future discussion.

	### Data Representation: Views<a name="views"></a>			### Data Representation: Views<a name="views"></a>
	The current implementation uses the [Strided MemRef (a.k.a View)](			The current implementation uses the [Strided MemRef (a.k.a View)](
	https://groups.google.com/a/tensorflow.org/forum/#!topic/mlir/MaL8m2nXuio)			https://groups.google.com/a/tensorflow.org/forum/#!topic/mlir/MaL8m2nXuio)
	abstraction. The name View is used interchangeably in `linalg` to signify			abstraction. The name View is used interchangeably in `linalg` to signify
	Strided MemRef.			Strided MemRef.
	In the future we expect to use other structured data types and			In the future we expect to use other structured data types and
	support ragged, mixed-sparse and other types. We expect to draw on the			support ragged, mixed-sparse and other types. We expect to draw on the
	experience from existing LIFT abstractions for			experience from existing LIFT abstractions for
	[156 lines not shown]

mlir/include/mlir/Dialect/Linalg/IR/LinalgStructuredOpsInterface.td

[18 lines not shown]
// interface.		// interface.
def LinalgStructuredInterface : OpInterface<"LinalgOp"> {		def LinalgStructuredInterface : OpInterface<"LinalgOp"> {
let methods = [		let methods = [
//===------------------------------------------------------------------===//		//===------------------------------------------------------------------===//
// Loop types handling.		// Loop types handling.
//===------------------------------------------------------------------===//		//===------------------------------------------------------------------===//
InterfaceMethod<		InterfaceMethod<
/*desc=*/[{		/*desc=*/[{
		Return the dims that are reduction loops within the current operation.
		ftynse (inline comment): Documentation does not correspond to what the function does.
		}],
		/*retTy=*/"void",
		/*methodName=*/"getDimsOfType",
		/*args=*/(ins "StringRef":$iteratorTypeName,
		"SmallVectorImpl<AffineExpr> &":$res),
		/*methodBody=*/"",
		/*defaultImplementation=*/[{
		unsigned dim = 0;
		MLIRContext *ctx = this->getOperation()->getContext();
		for (auto tn : $_op.iterator_types().
		template getAsValueRange<StringAttr>()) {
		if (tn == iteratorTypeName)
		res.push_back(getAffineDimExpr(dim, ctx));
		++dim;
		}
		}]
		>,
		InterfaceMethod<
		/*desc=*/[{
Return the number of parallel loops within the current operation.		Return the number of parallel loops within the current operation.
}],		}],
/*retTy=*/"unsigned",		/*retTy=*/"unsigned",
/*methodName=*/"getNumParallelLoops",		/*methodName=*/"getNumParallelLoops",
/*args=*/(ins),		/*args=*/(ins),
/*methodBody=*/"",		/*methodBody=*/"",
/*defaultImplementation=*/[{		/*defaultImplementation=*/[{
return getNumIterators(getParallelIteratorTypeName(),		return getNumIterators(getParallelIteratorTypeName(),
$_op.iterator_types());		$_op.iterator_types());
}]		}]
>,		>,
InterfaceMethod<		InterfaceMethod<
/*desc=*/[{		/*desc=*/[{
		Return the dims that are parallel loops within the current operation.
		}],
		/*retTy=*/"void",
		/*methodName=*/"getParallelDims",
		/*args=*/(ins "SmallVectorImpl<AffineExpr> &":$res),
		/*methodBody=*/"",
		/*defaultImplementation=*/[{
		ftynse (inline comment): I wonder if there is ever a non-default implementation? If not, I'd rather put them into `extraClassDeclarations` that are much easier to read.
		return getDimsOfType(getParallelIteratorTypeName(), res);
		}]
		>,
		InterfaceMethod<
		/*desc=*/[{
Return the number of reduction loops within the current operation.		Return the number of reduction loops within the current operation.
}],		}],
/*retTy=*/"unsigned",		/*retTy=*/"unsigned",
/*methodName=*/"getNumReductionLoops",		/*methodName=*/"getNumReductionLoops",
/*args=*/(ins),		/*args=*/(ins),
/*methodBody=*/"",		/*methodBody=*/"",
/*defaultImplementation=*/[{		/*defaultImplementation=*/[{
return getNumIterators(getReductionIteratorTypeName(),		return getNumIterators(getReductionIteratorTypeName(),
$_op.iterator_types());		$_op.iterator_types());
}]		}]
>,		>,
InterfaceMethod<		InterfaceMethod<
/*desc=*/[{		/*desc=*/[{
		Return the dims that are reduction loops within the current operation.
		}],
		/*retTy=*/"void",
		/*methodName=*/"getReductionDims",
		/*args=*/(ins "SmallVectorImpl<AffineExpr> &":$res),
		/*methodBody=*/"",
		/*defaultImplementation=*/[{
		return getDimsOfType(getReductionIteratorTypeName(), res);
		}]
		>,
		InterfaceMethod<
		/*desc=*/[{
Return the number of window loops within the current operation.		Return the number of window loops within the current operation.
}],		}],
/*retTy=*/"unsigned",		/*retTy=*/"unsigned",
/*methodName=*/"getNumWindowLoops",		/*methodName=*/"getNumWindowLoops",
/*args=*/(ins),		/*args=*/(ins),
/*methodBody=*/"",		/*methodBody=*/"",
/*defaultImplementation=*/[{		/*defaultImplementation=*/[{
return getNumIterators(getWindowIteratorTypeName(),		return getNumIterators(getWindowIteratorTypeName(),
$_op.iterator_types());		$_op.iterator_types());
}]		}]
>,		>,
InterfaceMethod<		InterfaceMethod<
/*desc=*/[{		/*desc=*/[{
		Return the dims that are window loops within the current operation.
		}],
		/*retTy=*/"void",
		/*methodName=*/"getWindowDims",
		/*args=*/(ins "SmallVectorImpl<AffineExpr> &":$res),
		/*methodBody=*/"",
		/*defaultImplementation=*/[{
		return getDimsOfType(getWindowIteratorTypeName(), res);
		}]
		>,
		InterfaceMethod<
		/*desc=*/[{
Return the total number of loops within the current operation.		Return the total number of loops within the current operation.
}],		}],
/*retTy=*/"unsigned",		/*retTy=*/"unsigned",
/*methodName=*/"getNumLoops",		/*methodName=*/"getNumLoops",
/*args=*/(ins),		/*args=*/(ins),
/*methodBody=*/"",		/*methodBody=*/"",
/*defaultImplementation=*/[{		/*defaultImplementation=*/[{
return getNumIterators($_op.iterator_types());		return getNumIterators($_op.iterator_types());
[107 lines not shown]	InterfaceMethod<
/*defaultImplementation=*/[{		/*defaultImplementation=*/[{
SmallVector<RankedTensorType, 4> res;		SmallVector<RankedTensorType, 4> res;
for (Type type : getInputs().getTypes())		for (Type type : getInputs().getTypes())
if (auto t = type.template dyn_cast<RankedTensorType>())		if (auto t = type.template dyn_cast<RankedTensorType>())
res.push_back(t);		res.push_back(t);
return res;		return res;
}]		}]
>,		>,
		InterfaceMethod<
		/*desc=*/[{
		Return `true` if there exists a tied input ShapedType / output
		RankedTensorType pair. This is the case when the op has return values (
		which are RankedTensorTypes by construction) and a reduction.
		}],
		/*retTy=*/"bool",
		/*methodName=*/"hasTiedResultTensor",
		/*args=*/(ins),
		/*methodBody=*/"",
		/*defaultImplementation=*/[{
		if (this->getOperation()->getNumResults() == 0)
		return false;
		SmallVector<AffineExpr, 8> redDims;
		$_op.getReductionDims(redDims);
		return !redDims.empty();
		}]
		>,
		InterfaceMethod<
		/*desc=*/[{
		Return the index for the input operand that is tied to the result index
		`resultIdx`.
		}],
		/*retTy=*/"bool",
		/*methodName=*/"getTiedInputOperandIndex",
		/*args=*/(ins "unsigned":$resultIdx),
		/*methodBody=*/"",
		/*defaultImplementation=*/[{
		assert(resultIdx < this->getOperation()->getNumResults() &&
		"result index overflow");
		assert(resultIdx == 0 && "only single result tensor supported for now");
		unsigned nInputs = $_op.getNumInputs();
		unsigned nInAndOutBuffers = $_op.getNumInputsAndOutputBuffers();
		assert(nInputs == nInAndOutBuffers && "cannot have output buffer");
		return nInputs - 1;
		}]
		>,

//===------------------------------------------------------------------===//		//===------------------------------------------------------------------===//
// Output arguments handling.		// Output arguments handling.
//===------------------------------------------------------------------===//		//===------------------------------------------------------------------===//
InterfaceMethod<		InterfaceMethod<
/*desc=*/[{		/*desc=*/[{
Return the output buffer at the given index, asserts that this is a		Return the output buffer at the given index, asserts that this is a
buffer operand and not a tensor result.		buffer operand and not a tensor result.
[151 lines not shown]	InterfaceMethod<
/*methodName=*/"getShapedType",		/*methodName=*/"getShapedType",
/*args=*/(ins "unsigned":$i),		/*args=*/(ins "unsigned":$i),
/*methodBody=*/"",		/*methodBody=*/"",
/*defaultImplementation=*/[{		/*defaultImplementation=*/[{
if (i < $_op.getNumInputs())		if (i < $_op.getNumInputs())
return getInputShapedType(i);		return getInputShapedType(i);
if (i < getNumInputsAndOutputBuffers())		if (i < getNumInputsAndOutputBuffers())
return getOutputBufferType(i - $_op.getNumInputs());		return getOutputBufferType(i - $_op.getNumInputs());
return getOutputTensorTypes()[i - getNumInputsAndOutputBuffers()];		return this->getOperation()->getResult(
		i - getNumInputsAndOutputBuffers()).
		getType().template cast<ShapedType>();
}]>,		}]>,
InterfaceMethod<		InterfaceMethod<
/*desc=*/[{		/*desc=*/[{
Return the shaped types for all the inputs and outputs		Return the shaped types for all the inputs and outputs
}],		}],
/*retTy=*/"SmallVector<ShapedType, 4>",		/*retTy=*/"SmallVector<ShapedType, 4>",
/*methodName=*/"getInputOutputShapedTypes",		/*methodName=*/"getInputOutputShapedTypes",
/*args=*/(ins),		/*args=*/(ins),
[37 lines not shown]	InterfaceMethod<
/*desc=*/[{		/*desc=*/[{
Return the indexing maps within the current operation.		Return the indexing maps within the current operation.
}],		}],
/*retTy=*/"SmallVector<AffineMap, 4>",		/*retTy=*/"SmallVector<AffineMap, 4>",
/*methodName=*/"getIndexingMaps",		/*methodName=*/"getIndexingMaps",
/*args=*/(ins),		/*args=*/(ins),
/*methodBody=*/"",		/*methodBody=*/"",
/*defaultImplementation=*/[{		/*defaultImplementation=*/[{
return llvm::to_vector<4>(		return llvm::to_vector<4>($_op.indexing_maps().template getAsValueRange<AffineMapAttr>());
llvm::map_range($_op.indexing_maps(),
[](Attribute attr) -> AffineMap {
return attr.cast<AffineMapAttr>().getValue();
}));
}]		}]
>,		>,
InterfaceMethod<		InterfaceMethod<
/*desc=*/[{		/*desc=*/[{
Return the input or output indexing map at index `i`.		Return the input or output indexing map at index `i`.
}],		}],
/*retTy=*/"AffineMap",		/*retTy=*/"AffineMap",
/*methodName=*/"getIndexingMap",		/*methodName=*/"getIndexingMap",
/*args=*/(ins "unsigned":$i),		/*args=*/(ins "unsigned":$i),
/*methodBody=*/"",		/*methodBody=*/"",
/*defaultImplementation=*/[{		/*defaultImplementation=*/[{
assert(i < getNumInputsAndOutputs());		assert(i < getNumInputsAndOutputs());
return $_op.indexing_maps()		return getIndexingMaps()[i];
.getValue()[i]
.template cast<AffineMapAttr>()
.getValue();
}]		}]
>,		>,
InterfaceMethod<		InterfaceMethod<
/*desc=*/[{		/*desc=*/[{
Return the input indexing map at index `i`.		Return the input indexing map at index `i`.
}],		}],
/*retTy=*/"AffineMap",		/*retTy=*/"AffineMap",
/*methodName=*/"getInputIndexingMap",		/*methodName=*/"getInputIndexingMap",
/*args=*/(ins "unsigned":$i),		/*args=*/(ins "unsigned":$i),
/*methodBody=*/"",		/*methodBody=*/"",
/*defaultImplementation=*/[{		/*defaultImplementation=*/[{
assert(i < $_op.getNumInputs());		assert(i < $_op.getNumInputs());
return $_op.indexing_maps()		return getIndexingMaps()[i];
.getValue()[i]
.template cast<AffineMapAttr>()
.getValue();
}]		}]
>,		>,
InterfaceMethod<		InterfaceMethod<
/*desc=*/[{		/*desc=*/[{
Return the output indexing map at index `i`.		Return the output indexing map at index `i`.
}],		}],
/*retTy=*/"AffineMap",		/*retTy=*/"AffineMap",
/*methodName=*/"getOutputIndexingMap",		/*methodName=*/"getOutputIndexingMap",
/*args=*/(ins "unsigned":$i),		/*args=*/(ins "unsigned":$i),
/*methodBody=*/"",		/*methodBody=*/"",
/*defaultImplementation=*/[{		/*defaultImplementation=*/[{
assert(i < $_op.getNumOutputs());		assert(i < $_op.getNumOutputs());
return $_op.indexing_maps()		return getIndexingMaps()[i + $_op.getNumInputs()];
.getValue()[i + $_op.getNumInputs()]
.template cast<AffineMapAttr>()
.getValue();
}]		}]
>,		>,
InterfaceMethod<		InterfaceMethod<
/*desc=*/[{		/*desc=*/[{
Return whether the op has only MemRef input and outputs.		Return whether the op has only MemRef input and outputs.
}],		}],
/*retTy=*/"bool",		/*retTy=*/"bool",
/*methodName=*/"hasBufferSemantics",		/*methodName=*/"hasBufferSemantics",
[64 lines not shown]

mlir/include/mlir/Dialect/Linalg/IR/LinalgTraits.h

	[11 lines not shown]
	#include "mlir/Dialect/Linalg/IR/LinalgTypes.h"			#include "mlir/Dialect/Linalg/IR/LinalgTypes.h"
	#include "mlir/Dialect/Utils/StructuredOpsUtils.h"			#include "mlir/Dialect/Utils/StructuredOpsUtils.h"
	#include "mlir/IR/AffineMap.h"			#include "mlir/IR/AffineMap.h"
	#include "mlir/IR/Function.h"			#include "mlir/IR/Function.h"
	#include "mlir/IR/OpDefinition.h"			#include "mlir/IR/OpDefinition.h"
	#include "mlir/IR/StandardTypes.h"			#include "mlir/IR/StandardTypes.h"
	#include "mlir/Support/LLVM.h"			#include "mlir/Support/LLVM.h"

				#include "llvm/ADT/SmallVector.h"

	namespace mlir {			namespace mlir {
	namespace OpTrait {			namespace OpTrait {
	namespace linalg {			namespace linalg {

	/// This class provides the API for ops that are known to have a specified			/// This class provides the API for ops that are known to have a specified
	/// number of inputs, all passed as operands. Use as a trait as follows:			/// number of inputs, all passed as operands. Use as a trait as follows:
	///			///
	/// class DotOp : public Op<DotOp, OpTrait::NInputs<2>::Impl> {			/// class DotOp : public Op<DotOp, OpTrait::NInputs<2>::Impl> {
	[29 lines not shown]
	/// class DotOp : public Op<DotOp, OpTrait::StructuredOpTraits> {			/// class DotOp : public Op<DotOp, OpTrait::StructuredOpTraits> {
	///			///
	template <typename ConcreteType>			template <typename ConcreteType>
	class StructuredOpTraits			class StructuredOpTraits
	: public OpTrait::TraitBase<ConcreteType, StructuredOpTraits> {			: public OpTrait::TraitBase<ConcreteType, StructuredOpTraits> {
	public:			public:
	static LogicalResult verifyTrait(Operation *op) {			static LogicalResult verifyTrait(Operation *op) {
	ConcreteType concreteOp = cast<ConcreteType>(op);			ConcreteType concreteOp = cast<ConcreteType>(op);
	auto nOperands = cast<ConcreteType>(op).getNumInputsAndOutputBuffers();			unsigned nInputAndBufferOperands =
	if (failed(OpTrait::impl::verifyAtLeastNOperands(op, nOperands)))			concreteOp.getNumInputsAndOutputBuffers();
				if (failed(
				OpTrait::impl::verifyAtLeastNOperands(op, nInputAndBufferOperands)))
	return failure();			return failure();

	if (op->getNumResults() > concreteOp.getNumOutputs())			if (op->getNumResults() > concreteOp.getNumOutputs())
	return op->emitError("unexpected #results > #outputs");			return op->emitError("unexpected #results > #outputs");

				if (!concreteOp.hasTiedResultTensor())
				return success();

				// Only a single tensor result supported atm.
				if (op->getNumResults() != 1)
				return op->emitError(
				"expected single tensor result when reduction present");

				if (concreteOp.getNumOutputs() != 1)
				return op->emitError(
				"expected single tensor output when result and reduction present");

				// Result-returning op with at least a reduction.
				SmallVector<AffineExpr, 8> redDims;
				concreteOp.getReductionDims(redDims);

				// Output tensor indexing map may not depend on reduction index.
				AffineMap outputMap = concreteOp.getOutputIndexingMap(0);
				for (auto expr : outputMap.getResults()) {
				for (auto dim : redDims) {
				unsigned pos = dim.cast<AffineDimExpr>().getPosition();
				ftynse (inline comment): Are there other intended uses of `getReductionDims`? Otherwise, it looks like it is constructing AffineDimExpr only to extract the position back from it, and it would have been simpler to just return positions directly.
				if (expr.isFunctionOfDim(pos))
				return op->emitError("unexpected single tensor output indexing map ")
				<< "is function of reduction dim @" << pos;
				}
				}

				unsigned nInputs = concreteOp.getNumInputs();
				if (nInputs < op->getNumResults() + 1)
				return op->emitError("expected at least one more input than results to "
				"accomodate reduction and tied results");

				// There must be a matching last input buffer or tensor operand for the
				// tensor result.
				// Tied input
				ftynse (inline comment): Spurious comment?
				AffineMap lastInputMap =
				concreteOp.getInputIndexingMap(concreteOp.getTiedInputOperandIndex(0));
				if (outputMap != lastInputMap)
				return op->emitError("expected last input operand with indexing map "
				"matching the tensor result's map");

	return success();			return success();
	}			}
	};			};

	} // namespace linalg			} // namespace linalg
	} // namespace OpTrait			} // namespace OpTrait
	} // namespace mlir			} // namespace mlir

	#endif // MLIR_DIALECT_LINALG_LINALGTRAITS_H_			#endif // MLIR_DIALECT_LINALG_LINALGTRAITS_H_

mlir/test/Dialect/Linalg/invalid.mlir

[518 lines not shown]	linalg.generic {
iterator_types = ["parallel"]		iterator_types = ["parallel"]
} %arg0 {		} %arg0 {
^bb(%0: i4) :		^bb(%0: i4) :
%1 = std.addi %0, %0: i4		%1 = std.addi %0, %0: i4
linalg.yield %1, %1: i4, i4		linalg.yield %1, %1: i4, i4
} : tensor<?x?xi4> -> (tensor<?x?xi4>, tensor<?x?xi4>)		} : tensor<?x?xi4> -> (tensor<?x?xi4>, tensor<?x?xi4>)
return		return
}		}

		// -----

		func @single_tensor_result(%arg0: memref<?xf32>) {
		// expected-error @+1 {{expected single tensor result when reduction present}}
		linalg.generic {
		args_in = 1,
		args_out = 2,
		indexing_maps = [ affine_map<(i) -> (i)>, affine_map<(i) -> (i)>, affine_map<(i) -> (i)> ],
		iterator_types = [ "reduction" ]
		} %arg0 {
		^bb(%0: f32):
		%f0 = constant 0.0 : f32
		linalg.yield %f0, %f0: f32, f32
		} : memref<?xf32> -> (tensor<?xf32>, tensor<?xf32>)
		return
		}

		// -----

		func @single_tensor_result(%arg0: memref<?xf32>) {
		// expected-error @+1 {{expected single tensor output when result and reduction present}}
		linalg.generic {
		args_in = 1,
		args_out = 2,
		indexing_maps = [ affine_map<(i) -> (i)>, affine_map<(i) -> (i)>, affine_map<(i) -> (i)> ],
		iterator_types = [ "reduction" ]
		} %arg0, %arg0 {
		^bb(%0: f32):
		%f0 = constant 0.0 : f32
		linalg.yield %f0: f32
		} : memref<?xf32>, memref<?xf32> -> tensor<?xf32>
		return
		}

		// -----

		func @single_tensor_result_not_function_of_reduction(%arg0: memref<?xf32>) {
		// expected-error @+1 {{unexpected single tensor output indexing map is function of reduction dim @0}}
		linalg.generic {
		args_in = 1,
		args_out = 1,
		indexing_maps = [ affine_map<(i) -> (i)>, affine_map<(i) -> (i)>, affine_map<(i) -> (i)> ],
		iterator_types = [ "reduction" ]
		} %arg0 {
		^bb(%0: f32):
		%f0 = constant 0.0 : f32
		linalg.yield %f0: f32
		} : memref<?xf32> -> tensor<?xf32>
		return
		}

		// -----

		func @single_tensor_result_last_input_not_matching(%arg0: memref<?xf32>) {
		// expected-error @+1 {{expected at least one more input than results to accomodate reduction and tied results}}
		linalg.generic {
		args_in = 1,
		args_out = 1,
		indexing_maps = [ affine_map<(i) -> (i)>, affine_map<(i) -> (0)> ],
		iterator_types = [ "reduction" ]
		} %arg0 {
		^bb(%0: f32):
		%f0 = constant 0.0 : f32
		linalg.yield %f0: f32
		} : memref<?xf32> -> tensor<f32>
		return
		}

		// -----

		func @single_tensor_result_last_input_not_matching(%arg0: memref<?xf32>) {
		// expected-error @+1 {{expected last input operand with indexing map matching the tensor result's map}}
		linalg.generic {
		args_in = 2,
		args_out = 1,
		indexing_maps = [ affine_map<(i) -> (i)>, affine_map<(i) -> (i)>, affine_map<(i) -> (0)> ],
		iterator_types = [ "reduction" ]
		} %arg0, %arg0 {
		^bb(%0: f32, %1: f32):
		%f0 = constant 0.0 : f32
		linalg.yield %f0: f32
		} : memref<?xf32>, memref<?xf32> -> tensor<f32>
		return
		}

mlir/test/Dialect/Linalg/roundtrip.mlir

[82 lines not shown]	func @ops(%arg0: memref<?x?xf32, offset: ?, strides: [?, 1]>,
%arg1: memref<?xf32, offset: ?, strides: [1]>,		%arg1: memref<?xf32, offset: ?, strides: [1]>,
%arg2: memref<?xf32, offset: ?, strides: [1]>,		%arg2: memref<?xf32, offset: ?, strides: [1]>,
%arg3: memref<f32>) {		%arg3: memref<f32>) {
linalg.matmul %arg0, %arg0, %arg0 : (memref<?x?xf32, offset: ?, strides: [?, 1]>,		linalg.matmul %arg0, %arg0, %arg0 : (memref<?x?xf32, offset: ?, strides: [?, 1]>,
memref<?x?xf32, offset: ?, strides: [?, 1]>,		memref<?x?xf32, offset: ?, strides: [?, 1]>,
memref<?x?xf32, offset: ?, strides: [?, 1]>)		memref<?x?xf32, offset: ?, strides: [?, 1]>)
linalg.matvec %arg0, %arg1, %arg2 : (memref<?x?xf32, offset: ?, strides: [?, 1]>,		linalg.matvec %arg0, %arg1, %arg2 : (memref<?x?xf32, offset: ?, strides: [?, 1]>,
memref<?xf32, offset: ?, strides: [1]>,		memref<?xf32, offset: ?, strides: [1]>,
memref<?xf32, offset: ?, strides: [1]>)		memref<?xf32, offset: ?, strides: [1]>)
linalg.dot %arg1, %arg2, %arg3 : (memref<?xf32, offset: ?, strides: [1]>,		linalg.dot %arg1, %arg2, %arg3 : (memref<?xf32, offset: ?, strides: [1]>,
memref<?xf32, offset: ?, strides: [1]>,		memref<?xf32, offset: ?, strides: [1]>,
memref<f32>)		memref<f32>)
return		return
}		}
// CHECK-LABEL: func @ops(%		// CHECK-LABEL: func @ops(%
// CHECK-NEXT: linalg.matmul %{{.*}}, %{{.*}}, %{{.*}} :		// CHECK-NEXT: linalg.matmul %{{.*}}, %{{.*}}, %{{.*}} :
// CHECK-SAME: (memref<?x?xf32, #[[$strided2D]]>,		// CHECK-SAME: (memref<?x?xf32, #[[$strided2D]]>,
[548 lines not shown]
{		{
%0 = linalg.reshape %arg0 [] : memref<1x1xf32> into memref<f32>		%0 = linalg.reshape %arg0 [] : memref<1x1xf32> into memref<f32>
%1 = linalg.reshape %0 [] : memref<f32> into memref<1x1xf32>		%1 = linalg.reshape %0 [] : memref<f32> into memref<1x1xf32>
return %0, %1 : memref<f32>, memref<1x1xf32>		return %0, %1 : memref<f32>, memref<1x1xf32>
}		}
// CHECK-LABEL: func @memref_reshape_zero_dim		// CHECK-LABEL: func @memref_reshape_zero_dim
// CHECK: linalg.reshape %{{.*}} [] : memref<1x1xf32> into memref<f32>		// CHECK: linalg.reshape %{{.*}} [] : memref<1x1xf32> into memref<f32>
// CHECK: linalg.reshape %{{.*}} [] : memref<f32> into memref<1x1xf32>		// CHECK: linalg.reshape %{{.*}} [] : memref<f32> into memref<1x1xf32>


		// -----

		#accesses = [
		affine_map<(i, j, k) -> (j, i, k)>,
		affine_map<(i, j, k) -> (i, j)>,
		affine_map<(i, j, k) -> (i, j)>
		]

		#trait = {
		args_in = 2,
		args_out = 1,
		indexing_maps = #accesses,
		iterator_types = ["parallel", "parallel", "reduction"],
		library_call = "some_external_function_name_1"
		}

		func @generic_with_tied_result_tensor(
		%arg0: tensor<?x?x?xvector<3x4xi4>>, %arg1: tensor<?x?xf32>)
		-> (tensor<?x?xf32>) {
		%0 = linalg.indexed_generic #trait %arg0, %arg1 {
		^bb(%i: index, %j: index, %k: index, %v0: vector<3x4xi4>, %v1: f32) :
		%f0 = constant 0.0 : f32
		linalg.yield %f0 : f32
		} : tensor<?x?x?xvector<3x4xi4>>, tensor<?x?xf32> -> tensor<?x?xf32>
		return %0 : tensor<?x?xf32>
		}
		// CHECK-LABEL: func @generic_with_tied_result_tensor
		// CHECK: linalg.indexed_generic {args_in = 2 : i64, args_out = 1 : i64,
		// CHECK-SAME: indexing_maps = [#{{.*}}, #{{.*}}], iterator_types = ["parallel", "parallel", "reduction"],
		// CHECK-SAME: library_call = "some_external_function_name_1"}
		// CHECK-SAME: %{{.*}}, %{{.*}}
		// CHECK: tensor<?x?x?xvector<3x4xi4>>, tensor<?x?xf32> -> tensor<?x?xf32>
		// CHECK: return {{.*}} : tensor<?x?xf32>