This is an archive of the discontinued LLVM Phabricator instance.

[mlir][Linalg] Add a padding option to Linalg tiling
ClosedPublic

Authored by nicolasvasilache on Jan 21 2021, 9:53 AM.

Details

Summary

This revision allows the base Linalg tiling pattern to optionally require padding to
a constant bounding shape.
When requested, a simple analysis is performed, similar to buffer promotion.
A temporary linalg.simple_pad op is added to model padding for the purpose of
connecting the dots. This will be replaced by a more fleshed out linalg.pad_tensor
op when it is available.
In the meantime, this temporary op serves to exhibit the properties a more fleshed-out
pad op will need in order to compose properly with transformations.
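
A rough sketch of how a client might opt into this behaviour, using the existing LinalgTilingOptions API; the padding setter named below is hypothetical and only illustrates the intent of the revision, not its actual option name:

  // Hedged sketch: configure Linalg tiling and opt into padding.
  // `setTileSizes` is existing API; `setPaddingToBoundingShape` is a
  // hypothetical name standing in for the option added by this revision.
  #include "mlir/Dialect/Linalg/Transforms/Transforms.h"

  using namespace mlir;
  using namespace mlir::linalg;

  static LinalgTilingOptions makeTileAndPadOptions() {
    LinalgTilingOptions options;
    options.setTileSizes({8, 16, 32}); // Boundary tiles may be smaller...
    // ...unless padding to the constant bounding shape is requested:
    // options.setPaddingToBoundingShape(true); // hypothetical setter
    return options;
  }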

Diff Detail

Event Timeline

nicolasvasilache requested review of this revision. Jan 21 2021, 9:53 AM
rriddle added inline comments. Jan 21 2021, 10:02 AM
mlir/include/mlir/Interfaces/ViewLikeInterface.h
25

This typedef pollutes the entire mlir namespace.

nicolasvasilache marked an inline comment as done. Jan 22 2021, 12:51 AM
nicolasvasilache added inline comments.
mlir/include/mlir/Interfaces/ViewLikeInterface.h
25

I can just use OpFoldResult.
OTOH, I'd welcome renaming OpFoldResult to ValueOrAttr which is much more descriptive IMO.
Thoughts?

nicolasvasilache marked an inline comment as done.

Drop ValueOrAttr.

Attempt to appease win.

ftynse added inline comments. Jan 22 2021, 4:15 AM
mlir/include/mlir/Dialect/Linalg/IR/LinalgOps.td
479–483

It is still helpful to document the requirements and conventions of this op, instead of it being just a placeholder with unknown semantics.

488–490

Just drop the verifier field.

mlir/include/mlir/Interfaces/ViewLikeInterface.h
17–22

Leftover unnecessary changes.

mlir/lib/Dialect/Linalg/Transforms/Transforms.cpp
116

We usually pass builder by-reference. Do you need a copy for some reason?
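
For illustration, the convention being referenced (the function names are made up for the example):

  #include "mlir/IR/Builders.h"
  using namespace mlir;

  // Preferred: take the builder by reference.
  static Value makePaddedTile(OpBuilder &b, Location loc, Value tile);
  // Flagged form: passing OpBuilder by value copies the builder.
  static Value makePaddedTileByValue(OpBuilder b, Location loc, Value tile);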

126

Nit: SmallVector can now infer the default number of stack elements, so if there is no reason to require 4 specifically, it can be just omitted.
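
A minimal illustration of the suggestion:

  #include "llvm/ADT/SmallVector.h"

  // The explicit inline-element count is only needed when a specific stack
  // capacity is wanted; otherwise LLVM now infers a sensible default.
  llvm::SmallVector<int, 4> withExplicitInlineCapacity;
  llvm::SmallVector<int> withDefaultInlineCapacity;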

133

size.get<Attribute> (there's no need to dyn_cast after a check).
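
A small sketch of the suggested pattern, assuming `size` is an OpFoldResult (MLIR's attribute-or-value union); the header path is approximate for this era:

  #include "mlir/IR/OpDefinition.h" // OpFoldResult lived here around this revision.

  using namespace mlir;

  // After an explicit is<Attribute>() check, get<Attribute>() suffices;
  // dyn_cast would redo the check.
  static Attribute getStaticSize(OpFoldResult size) {
    if (size.is<Attribute>())
      return size.get<Attribute>();
    return Attribute();
  }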

138–139

I'm not a fan of emitting warnings _inside_ transformations. IMO, errors and warnings are there for the validity state of the op in general, not for its suitability to a particular transformation. Remarks may be fine (the user will have to ask for them explicitly to see them). rewriter.notifyMatchFailure() is probably preferred.
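
A hedged sketch of the suggested reporting style inside a pattern; the guard condition is illustrative, not the one in the patch, and header paths are approximate for this era:

  #include "mlir/Dialect/Linalg/IR/LinalgOps.h"
  #include "mlir/IR/PatternMatch.h"

  using namespace mlir;

  static LogicalResult rewriteSketch(linalg::LinalgOp op,
                                     PatternRewriter &rewriter) {
    // Instead of emitting a warning, record why the pattern did not apply;
    // the message only surfaces when the user asks for match diagnostics.
    if (!op.hasTensorSemantics())
      return rewriter.notifyMatchFailure(op.getOperation(),
                                         "expected op with tensor semantics");
    // ... actual rewriting would go here ...
    return success();
  }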

150

Write this comment? :)

hanchung added inline comments. Jan 22 2021, 8:55 AM
mlir/lib/Dialect/Linalg/Transforms/Transforms.cpp
122

Use auto, since getDefiningOp already states the type.
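
For illustration (the value and op type here are arbitrary; the header path is approximate for this era):

  #include "mlir/Dialect/StandardOps/IR/Ops.h" // ConstantIndexOp.
  #include "mlir/IR/Value.h"

  using namespace mlir;

  static void exampleAutoWithGetDefiningOp(Value size) {
    // The templated getDefiningOp<OpTy>() already names the result type, so
    // spelling it again on the left is redundant.
    ConstantIndexOp cst = size.getDefiningOp<ConstantIndexOp>(); // type stated twice
    auto cstAuto = size.getDefiningOp<ConstantIndexOp>();        // suggested form
    (void)cst;
    (void)cstAuto;
  }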

151–153

Please add documentation because this is not a trivial function, e.g., that the result will be stored to res on success.

This feels like it is promotion on tensors, with linalg.simple_pad (for now, and linalg.pad later) used instead of the alloc(...) ; copy(...) method. So the tensor "promotion" and buffer promotion are living in entirely different places. I'll go deeper into this code, but my read of the tests informs this comment.

nicolasvasilache marked 10 inline comments as done. Jan 22 2021, 2:09 PM

Address comments.

This feels like it is promotion on tensors, with linalg.simple_pad (for now, and linalg.pad later) used instead of the alloc(...) ; copy(...) method. So the tensor "promotion" and buffer promotion are living in entirely different places. I'll go deeper into this code, but my read of the tests informs this comment.

There are similarities and differences.
Promotion is guaranteed to alloc and copy but is not guaranteed to create static shapes.

This in contrast has no guarantees of alloc/copy and is guaranteed to create static shapes.
This is meant to be used mostly as an enabler for vectorization as well as packing.
Packing is more closely related to promotion with the big difference that we keep SSA use-def chains and hoisting is significantly simpler to implement.

In any case, I expect we'll need to experiment before we can come to a conclusion; this gives us the ability to experiment.

Promotion is guaranteed to alloc and copy but is not guaranteed to create static shapes.
This in contrast has no guarantees of alloc/copy and is guaranteed to create static shapes.

That seems to be a small difference. I agree that promotion itself doesn't guarantee static shapes, but it does make a best effort (and actually in IREE it always ends up being static shaped if the tile size is static). It should be easy to extend promotion to abort if it doesn't figure out the static sizes when "promotion" is applied to ops with tensor semantics.

nicolasvasilache added a comment. Edited Jan 24 2021, 4:46 AM

It should be easy to extend promotion to abort if it doesn't figure out the static sizes when "promotion" is applied to ops with tensor semantics.

The fact that promotion tries to create static buffers is very much due to history and needs to go away.

Here is what happens concretely today to inject static sizes in the memory pipeline:

  1. Promotion creates alloc + fill + copy after some best effort local analysis to try and create a static buffer.
  2. Fingers are crossed so that vectorization succeeds.
  3. After vectorization, LinalgCopyVTRForwardingPattern and LinalgCopyVTWForwardingPattern are applied with a best effort dependence analysis to undo 1. and replace by vector.transfer.
  4. Later masked/unmasked vector.transfer splitting is performed to isolate parts.

In contrast, thanks to the progress on transformations on tensors we can:

  1. Tile on tensors with padding at the level(s) we are interested in.
  2. Hoist, as we wish to create and amortize packing.
  3. Decide which packed tensors materialize in memory, which ones turn into simple subview + control flow and which ones turn into vector transfers.

The implications of padding at the tiling-on-tensors level are far-reaching: as you have already seen in IREE, phases are being reordered and Promotion is a bystander casualty.
See the followup https://reviews.llvm.org/D95243 to get a better idea of how things look after padding + packing.
I would not be surprised if Promotion just gets deleted / becomes part of some bufferization in the not-too-distant future: the ease with which we can perform transformations thanks to SSA use-def chains is just a no-brainer.

Attempt to appease win, again.

Attempt to appease win, 3rd take..

I am not disputing that this is useful, but at least within IREE, I am not sure we can actually set a static tile size at the time of tiling. In IREE, with the Linalg-on-tensors path, the tile size has to be dynamic during tile-and-distribute since that is done at the Flow dialect level, where no backend-specific information is available. The static tile size picked while tiling might not be the best one for the backend being targeted (information that is available only at the HAL level). So making this padding part of the tiling transformation might not work out for IREE. My understanding then is that if we separate out the padding parts from the tiling transformation itself, it has the same constraints/restrictions as promotion, but with the advantage of SSA use-def chains.

In the particular context of IREE, I see 2 things we can do easily:

  1. Pad at the tile-and-fuse-and-vectorize-on-tensors level, at the HAL level, to enable vectorization (i.e. at the k-th level of padding).
  2. If we have a need for it, this is not hard to generalize to "pad to the next multiple of X" where X can either be a constant or an SSA value.
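
A small arithmetic sketch of the "next multiple of X" computation mentioned in point 2 (plain C++ here; in IR this would be built from affine/std ops):

  #include <cstdint>

  // Round a (possibly dynamic) size up to the next multiple of x.
  static int64_t roundUpToMultipleOf(int64_t size, int64_t x) {
    return ((size + x - 1) / x) * x;
  }
  // e.g. roundUpToMultipleOf(13, 8) == 16 and roundUpToMultipleOf(16, 8) == 16.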

The fact that the option exists to pad does not mean you have to do it at the same time as "tile-and-distribute."

I wouldn't pattern match this first implementation to how general the approach is: we also want to use it in the context of sparse, so I am really not scared of some dynamic values here and there.
After we connect this to linalg.pad_tensor and have gained some experience, we can decide where we want to push this, as we will also need better types to support more advanced behaviors.

Still, I don't see promotion evolving to support tensors, which is the suggestion I seemed to get from your previous comment?

Fix use after move.

Undo bad fix and fix use-after-move properly.

ftynse accepted this revision. Jan 25 2021, 1:08 AM
This revision is now accepted and ready to land. Jan 25 2021, 1:08 AM

Thanks Nicolas for the explanation. Code looks fine to me. I still think that this is effectively adding promotion during tiling, and doing this only on tensors is strange. Just trying to see if there is more commonality between buffers and tensors so as to reuse things instead of building completely new paths. Let's land and experiment. I understand how this would subsume promotion though.

mlir/include/mlir/Dialect/Linalg/IR/LinalgOps.td
478

Do we still need this? I think the linalg.pad operation landed already.

mlir/lib/Dialect/Linalg/Transforms/Transforms.cpp
183

Can't we just use shapedOutputOperands for this? Maybe just add a method for getting the shaped output types?