This is an archive of the discontinued LLVM Phabricator instance.

Differential D79502

[mlir] Support partial folding of affine.min/max
Closed · Public

Authored by ftynse on May 6 2020, 10:06 AM.


Details

Reviewers

nicolasvasilache
poechsel
andydavis1
rriddle

Commits

rGa87db48e6fda: [mlir] Support partial folding of affine.min/max

Summary

Originally, these operations were folded only if all expressions in their
affine maps could be folded to constant expressions, which were then subject
to numeric min/max computation. This change introduces a more advanced version
that partially folds the affine map by lifting individual constant expressions
in it even if some of the expressions remain variable. The folding can update
the operation in place to use a simpler map. Note that this is not as powerful
as canonicalization; in particular, it does not remove dimensions or symbols
that became useless. This allows for better composition of Linalg tiling and
promotion transformations, where the latter can handle some canonical forms of
affine.min that the folding can now produce.
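
To make the three possible outcomes of the new folding concrete, here is a minimal, self-contained C++ sketch. It does not use MLIR: `LinearExpr`, `ToyMap`, `Operand`, `foldExpr`, `FoldKind`, `FoldOutcome` and `foldMin` are hypothetical stand-ins, and each map result is assumed to be a plain linear expression over the operands. Only the control flow is meant to mirror the change: fold every result that becomes constant, rewrite the map in place if some results stay variable, and compute the min only when all results are constant.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <utility>
#include <vector>

// Hypothetical stand-in for one affine map result:
// constant + sum(coeffs[i] * operand[i]).
struct LinearExpr {
  int64_t constant;
  std::vector<int64_t> coeffs; // one coefficient per map operand
};

bool operator==(const LinearExpr &a, const LinearExpr &b) {
  return a.constant == b.constant && a.coeffs == b.coeffs;
}

// An operand is either a known constant or unknown (a "null" attribute).
struct Operand {
  bool isConstant;
  int64_t value;
};

using ToyMap = std::vector<LinearExpr>;

// Returns true and sets `value` if every operand used by `expr` is constant.
bool foldExpr(const LinearExpr &expr, const std::vector<Operand> &operands,
              int64_t &value) {
  assert(expr.coeffs.size() == operands.size());
  int64_t sum = expr.constant;
  for (std::size_t i = 0; i < operands.size(); ++i) {
    if (expr.coeffs[i] == 0)
      continue;
    if (!operands[i].isConstant)
      return false;
    sum += expr.coeffs[i] * operands[i].value;
  }
  value = sum;
  return true;
}

// Replaces each foldable result with a constant and keeps the others as-is.
// The number of operands never changes. `constants` ends up non-empty only
// if all results folded.
ToyMap partialConstantFold(const ToyMap &map,
                           const std::vector<Operand> &operands,
                           std::vector<int64_t> &constants) {
  ToyMap folded;
  bool allConstant = true;
  constants.clear();
  for (const LinearExpr &expr : map) {
    int64_t value = 0;
    if (foldExpr(expr, operands, value)) {
      folded.push_back(
          LinearExpr{value, std::vector<int64_t>(operands.size(), 0)});
      constants.push_back(value);
    } else {
      folded.push_back(expr);
      allConstant = false;
    }
  }
  if (!allConstant)
    constants.clear();
  return folded;
}

enum class FoldKind { None, UpdatedInPlace, Constant };

struct FoldOutcome {
  FoldKind kind;
  int64_t value; // meaningful only when kind == FoldKind::Constant
};

// Toy counterpart of folding a min op: may rewrite `map` in place.
FoldOutcome foldMin(ToyMap &map, const std::vector<Operand> &operands) {
  std::vector<int64_t> constants;
  ToyMap folded = partialConstantFold(map, operands, constants);
  if (constants.empty()) {
    if (folded == map)
      return {FoldKind::None, 0};
    map = std::move(folded);
    return {FoldKind::UpdatedInPlace, 0};
  }
  return {FoldKind::Constant,
          *std::min_element(constants.begin(), constants.end())};
}
```

With the map (d0, d1 - d2) and only d0 known to be 2, `foldMin` rewrites the map to (2, d1 - d2) and reports an in-place update. A second call with the same operands reports that nothing changed, and the operand count stays the same throughout.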

Depends On D79497

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

ftynse created this revision. May 6 2020, 10:06 AM

Herald added a reviewer: rriddle. May 6 2020, 10:06 AM

Herald added a project: Restricted Project.

Herald added subscribers: llvm-commits, Kayjukh, frgossen and 12 others.

Harbormaster failed remote builds in B55947: Diff 262407! May 6 2020, 10:46 AM

partially folds the affine map by lifting individual constant expressions in it

Thanks for the detailed summary, but I'm sorry, this part isn't clear. canonicalizeMapAndOperands does propagate constants from the operands into the maps. It isn't clear whether you are doing the same or whether you are taking min/max over the subset of the results that are constant: e.g., affine.min affine_map<(d0, d1, d2) -> (d0, d1, d2)> (2, 3, %N) becomes affine.min affine_map<(d0) -> (2, d0)> (%N). I don't see any test cases of the latter form, so I didn't look at the code carefully.

mlir/include/mlir/IR/AffineMap.h
147	By 'Folds', did you mean propagates?

rriddle resigned from this revision. May 6 2020, 11:02 AM

ftynse added a reviewer: andydavis1. May 6 2020, 11:40 AM

Thanks Alex!

mlir/lib/Dialect/Affine/IR/AffineOps.cpp
2131–2132	Can you remove this TODO now?
2132	"returns failure"
2150	Seems like this block of code could be shared with AffineMin::Fold
mlir/lib/IR/AffineMap.cpp
263	Could this function also return if all results were constants? Would this allow you to avoid calling extractIntegerResults in affine.min/max.fold above?

This revision is now accepted and ready to land. May 6 2020, 12:49 PM

nicolasvasilache accepted this revision. May 6 2020, 2:01 PM

nicolasvasilache added inline comments.

mlir/lib/Dialect/Affine/IR/AffineOps.cpp
2137	How about a 2 steps impl?
	if (llvm::any_of(...))
	  return failure();
	auto range = llvm::map_range();
	results.assign(range.begin(), range.end());
	I'd say the readability improvements are worth it, you choose.
2150	Or have the whole impl factored out and just take either `std::min/max_element` as a parameter?
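
For readers unfamiliar with the two-step shape suggested in the comment on line 2137, here is a hedged sketch using standard-library equivalents (`std::any_of` and `std::transform`) instead of the LLVM helpers named in the comment. `FoldedValue` and `extractIntegers` are hypothetical names, and a plain `bool` stands in for the success/failure result.

```cpp
#include <algorithm>
#include <cassert>
#include <cstdint>
#include <iterator>
#include <vector>

// Hypothetical stand-in for a folded result: `isConstant` is false when the
// expression could not be folded to a constant.
struct FoldedValue {
  bool isConstant;
  int64_t value;
};

// Step 1: fail early if any result is not a constant.
// Step 2: map all results to integers in one pass.
bool extractIntegers(const std::vector<FoldedValue> &folded,
                     std::vector<int64_t> &results) {
  if (std::any_of(folded.begin(), folded.end(),
                  [](const FoldedValue &v) { return !v.isConstant; }))
    return false;

  results.clear();
  results.reserve(folded.size());
  std::transform(folded.begin(), folded.end(), std::back_inserter(results),
                 [](const FoldedValue &v) { return v.value; });
  return true;
}
```

Both steps need a lambda, so the two-step form is not necessarily shorter than a single loop; the trade-off is readability against verbosity.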

In reply to D79502#2023207 by @bondhugula:

Canonicalization does more than just propagate the constants: it moves dimensions and symbols around, drops them, or does more aggressive simplifications of affine maps. The folding only checks whether it can replace one AffineExpr (however complex) with an AffineConstantExpr by substituting the dimensions and symbols with constant operands. It will do the replacement if possible.

Taking a subset of results looks interesting, but this change doesn't do it either (I had a specific use case, TBH, where having any constant in a map suffices to overapproximate the result, so I did just that). I think there are more simplifications that we can do as foldings rather than canonicalizations, including operand/result removal, but leaving this for future work.
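
The "subset of results" idea discussed above, which this change does not implement, can be sketched as follows. This is a toy model only: `ResultExpr` and `mergeConstantResults` are hypothetical names, non-constant results are represented by a symbol name, and the unused operands are simply dropped from the list rather than renumbered as a real affine map would require.

```cpp
#include <algorithm>
#include <cassert>
#include <cstdint>
#include <string>
#include <vector>

// Hypothetical stand-in for one result of an affine.min map: either a known
// constant or an opaque non-constant expression identified by name.
struct ResultExpr {
  bool isConstant;
  int64_t value;      // used only when isConstant is true
  std::string symbol; // used only when isConstant is false
};

// Keeps a single constant (the smallest one) plus all non-constant results,
// so that min(2, 3, N) becomes min(2, N).
std::vector<ResultExpr>
mergeConstantResults(const std::vector<ResultExpr> &results) {
  bool seenConstant = false;
  int64_t smallest = 0;
  for (const ResultExpr &r : results) {
    if (!r.isConstant)
      continue;
    smallest = seenConstant ? std::min(smallest, r.value) : r.value;
    seenConstant = true;
  }

  std::vector<ResultExpr> merged;
  if (seenConstant)
    merged.push_back(ResultExpr{true, smallest, ""});
  for (const ResultExpr &r : results)
    if (!r.isConstant)
      merged.push_back(r);
  return merged;
}
```

For a max operation the same structure would apply with `std::max` in place of `std::min`.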

mlir/lib/Dialect/Affine/IR/AffineOps.cpp
2131–2132	My change doesn't do the folding given as example in the todo.

nicolasvasilache added inline comments. May 6 2020, 2:34 PM

mlir/lib/IR/AffineMap.cpp
242	Same potential 2-step readability gains as above.

In reply to D79502#2023690 by @ftynse:

For all the in-place folding for AffineForOp / AffineIfOp, we just make a call to canonicalizeMap/Set/AndOperands and then do the bound folding (the order has to be fixed though - a TODO). On the other hand, these min/max ops have results unlike AffineFor/IfOp and so you are just using the constantFold on the map itself. But SimplifyAffineOp is already registered on min/max ops, and the composeAffineMapAndOperands there is already making a call to canonicalizeMapAndOperands and thus already doing the constant folding for the result expressions - so isn't -canonicalize already accomplishing what you are doing here as part of fold? (albeit via the pattern). You just need the simple min of constants in its folder?

Address reviews

ftynse added inline comments. May 7 2020, 3:31 AM

mlir/lib/Dialect/Affine/IR/AffineOps.cpp
2137	I tried, and it's actually more verbose because of lambdas.
2150	taking an instantiation of an STL algorithm as a parameter looks very ugly, so I just went for an if(std::is_same)
mlir/lib/IR/AffineMap.cpp
242	It's even better with the refactoring suggested by Andy below.
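
The `std::is_same` dispatch mentioned in the reply on line 2150 can be illustrated outside MLIR. In this sketch `MinOpTag`, `MaxOpTag` and `reduceConstants` are hypothetical names standing in for the op types and the shared helper; the only point is that one template selects `std::min_element` or `std::max_element` at compile time without taking the algorithm as a parameter.

```cpp
#include <algorithm>
#include <cassert>
#include <cstdint>
#include <type_traits>
#include <vector>

// Hypothetical tag types standing in for the min and max op classes.
struct MinOpTag {};
struct MaxOpTag {};

// Reduces already-folded constants to a single value, picking the algorithm
// from the tag type. Both branches have the same iterator type, so the
// conditional expression is well-formed.
template <typename OpTag>
int64_t reduceConstants(const std::vector<int64_t> &values) {
  static_assert(std::is_same<OpTag, MinOpTag>::value ||
                    std::is_same<OpTag, MaxOpTag>::value,
                "expected min or max tag");
  assert(!values.empty() && "nothing to reduce");
  auto it = std::is_same<OpTag, MinOpTag>::value
                ? std::min_element(values.begin(), values.end())
                : std::max_element(values.begin(), values.end());
  return *it;
}
```

Calling `reduceConstants<MinOpTag>` on the values 2, 3, 4 yields 2, and `reduceConstants<MaxOpTag>` on the same values yields 4.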

Closed by commit rGa87db48e6fda: [mlir] Support partial folding of affine.min/max (authored by ftynse). May 7 2020, 3:55 AM

This revision was automatically updated to reflect the committed changes.

Harbormaster failed remote builds in B56040: Diff 262587! May 7 2020, 3:56 AM

In reply to D79502#2024115 by @bondhugula:

Looks like you might have missed this comment @ftynse Just curious to know what you think here in response.

In reply to D79502#2026333 by @bondhugula:

Indeed, I missed that one somehow. I need this transformation to happen during _folding_, not canonicalization. Folding is independent of the canonicalization pass and, unlike canonicalization, may happen if you use OperationFolder when building operations. Arguably, most of the transformations currently performed by affine canonicalizers can actually be done in folding.

Revision Contents

Path	Size
mlir/include/mlir/IR/AffineMap.h	10 lines
mlir/lib/Dialect/Affine/IR/AffineOps.cpp	74 lines
mlir/lib/IR/AffineMap.cpp	47 lines
mlir/test/Dialect/Linalg/tile.mlir	34 lines
mlir/test/Dialect/Linalg/tile_conv.mlir	6 lines
mlir/test/Dialect/Linalg/tile_conv_padding.mlir	2 lines

Diff 262599

mlir/include/mlir/IR/AffineMap.h

@@ 138 lines not shown @@ AffineMap replaceDimsAndSymbols(ArrayRef<AffineExpr> dimReplacements,
                                   unsigned numResultDims,
                                   unsigned numResultSyms);

   /// Folds the results of the application of an affine map on the provided
   /// operands to a constant if possible.
   LogicalResult constantFold(ArrayRef<Attribute> operandConstants,
                              SmallVectorImpl<Attribute> &results) const;

+  /// Propagates the constant operands into this affine map. Operands are
[inline] bondhugula: By 'Folds', did you mean propagates?
+  /// allowed to be null, at which point they are treated as non-constant. This
+  /// does not change the number of symbols and dimensions. Returns a new map,
+  /// which may be equal to the old map if no folding happened. If `results` is
+  /// provided and if all expressions in the map were folded to constants,
+  /// `results` will contain the values of these constants.
+  AffineMap
+  partialConstantFold(ArrayRef<Attribute> operandConstants,
+                      SmallVectorImpl<int64_t> *results = nullptr) const;
+
   /// Returns the AffineMap resulting from composing `this` with `map`.
   /// The resulting AffineMap has as many AffineDimExpr as `map` and as many
   /// AffineSymbolExpr as the concatenation of `this` and `map` (in which case
   /// the symbols of `this` map come first).
   ///
   /// Prerequisites:
   /// The maps are composable, i.e. that the number of AffineDimExpr of `this`
   /// matches the number of results of `map`.
@@ 168 lines not shown @@

mlir/lib/Dialect/Affine/IR/AffineOps.cpp

@@ 2,083 lines not shown @@ return failure(
       parser.parseOperandList(sym_infos,
                               OpAsmParser::Delimiter::OptionalSquare) ||
       parser.parseOptionalAttrDict(result.attributes) ||
       parser.resolveOperands(dim_infos, indexType, result.operands) ||
       parser.resolveOperands(sym_infos, indexType, result.operands) ||
       parser.addTypeToList(indexType, result.types));
 }

+/// Fold an affine min or max operation with the given operands. The operand
+/// list may contain nulls, which are interpreted as the operand not being a
+/// constant.
+template <typename T>
+OpFoldResult foldMinMaxOp(T op, ArrayRef<Attribute> operands) {
+  static_assert(llvm::is_one_of<T, AffineMinOp, AffineMaxOp>::value,
+                "expected affine min or max op");
+
+  // Fold the affine map.
+  // TODO(andydavis, ntv) Fold more cases:
+  // min(some_affine, some_affine + constant, ...), etc.
+  SmallVector<int64_t, 2> results;
+  auto foldedMap = op.map().partialConstantFold(operands, &results);
+
+  // If some of the map results are not constant, try changing the map in-place.
+  if (results.empty()) {
+    // If the map is the same, report that folding did not happen.
+    if (foldedMap == op.map())
+      return {};
+    op.setAttr("map", AffineMapAttr::get(foldedMap));
+    return op.getResult();
+  }
+
+  // Otherwise, completely fold the op into a constant.
+  auto resultIt = std::is_same<T, AffineMinOp>::value
+                      ? std::min_element(results.begin(), results.end())
+                      : std::max_element(results.begin(), results.end());
+  if (resultIt == results.end())
+    return {};
+  return IntegerAttr::get(IndexType::get(op.getContext()), *resultIt);
+}
+
 //===----------------------------------------------------------------------===//
 // AffineMinOp
 //===----------------------------------------------------------------------===//
 //
 // %0 = affine.min (d0) -> (1000, d0 + 512) (%i0)
 //

 OpFoldResult AffineMinOp::fold(ArrayRef<Attribute> operands) {
-  // Fold the affine map.
-  // TODO(andydavis, ntv) Fold more cases: partial static information,
-  // min(some_affine, some_affine + constant, ...).
-  SmallVector<Attribute, 2> results;
-  if (failed(map().constantFold(operands, results)))
-    return {};
-
-  // Compute and return min of folded map results.
-  int64_t min = std::numeric_limits<int64_t>::max();
-  int minIndex = -1;
-  for (unsigned i = 0, e = results.size(); i < e; ++i) {
-    auto intAttr = results[i].cast<IntegerAttr>();
-    if (intAttr.getInt() < min) {
-      min = intAttr.getInt();
-      minIndex = i;
-    }
-  }
-  if (minIndex < 0)
-    return {};
-  return results[minIndex];
+  return foldMinMaxOp(*this, operands);
[inline] andydavis1: "returns failure"
[inline] andydavis1: Can you remove this TODO now?
[inline] ftynse: My change doesn't do the folding given as example in the todo.
 }

 void AffineMinOp::getCanonicalizationPatterns(
     OwningRewritePatternList &patterns, MLIRContext *context) {
   patterns.insert<SimplifyAffineOp<AffineMinOp>>(context);
[inline] nicolasvasilache: How about a 2 steps impl? if (llvm::any_of(...)) return failure(); auto range = llvm::map_range(); results.assign(range.begin(), range.end()); I'd say the readability improvements are worth it, you choose.
[inline] ftynse: I tried, and it's actually more verbose because of lambdas.
 }

 //===----------------------------------------------------------------------===//
 // AffineMaxOp
 //===----------------------------------------------------------------------===//
 //
 // %0 = affine.max (d0) -> (1000, d0 + 512) (%i0)
 //

 OpFoldResult AffineMaxOp::fold(ArrayRef<Attribute> operands) {
-  // Fold the affine map.
-  // TODO(andydavis, ntv, ouhang) Fold more cases: partial static information,
-  // max(some_affine, some_affine + constant, ...).
-  SmallVector<Attribute, 2> results;
-  if (failed(map().constantFold(operands, results)))
-    return {};
-
-  // Compute and return max of folded map results.
-  int64_t max = std::numeric_limits<int64_t>::min();
-  int maxIndex = -1;
-  for (unsigned i = 0, e = results.size(); i < e; ++i) {
-    auto intAttr = results[i].cast<IntegerAttr>();
-    if (intAttr.getInt() > max) {
-      max = intAttr.getInt();
-      maxIndex = i;
-    }
-  }
-  if (maxIndex < 0)
-    return {};
-  return results[maxIndex];
+  return foldMinMaxOp(*this, operands);
 }

[inline] andydavis1: Seems like this block of code could be shared with AffineMin::Fold
[inline] nicolasvasilache: Or have the whole impl factored out and just take either `std::min/max_element` as a parameter?
[inline] ftynse: taking an instantiation of an STL algorithm as a parameter looks very ugly, so I just went for an if(std::is_same)
 void AffineMaxOp::getCanonicalizationPatterns(
     OwningRewritePatternList &patterns, MLIRContext *context) {
   patterns.insert<SimplifyAffineOp<AffineMaxOp>>(context);
 }

 //===----------------------------------------------------------------------===//
 // AffinePrefetchOp
 //===----------------------------------------------------------------------===//
@@ 334 lines not shown @@

mlir/lib/IR/AffineMap.cpp

@@ 228 lines not shown @@
 }

 /// Folds the results of the application of an affine map on the provided
 /// operands to a constant if possible. Returns false if the folding happens,
 /// true otherwise.
 LogicalResult
 AffineMap::constantFold(ArrayRef<Attribute> operandConstants,
                         SmallVectorImpl<Attribute> &results) const {
+  // Attempt partial folding.
+  SmallVector<int64_t, 2> integers;
+  partialConstantFold(operandConstants, &integers);
+
+  // If all expressions folded to a constant, populate results with attributes
+  // containing those constants.
[inline] nicolasvasilache: Same potential 2-step readability gains as above.
[inline] ftynse: It's even better with the refactoring suggested by Andy below.
+  if (integers.empty())
+    return failure();
+
+  auto range = llvm::map_range(integers, [this](int64_t i) {
+    return IntegerAttr::get(IndexType::get(getContext()), i);
+  });
+  results.append(range.begin(), range.end());
+  return success();
+}
+
+AffineMap
+AffineMap::partialConstantFold(ArrayRef<Attribute> operandConstants,
+                               SmallVectorImpl<int64_t> *results) const {
   assert(getNumInputs() == operandConstants.size());

   // Fold each of the result expressions.
   AffineExprConstantFolder exprFolder(getNumDims(), operandConstants);
-  // Constant fold each AffineExpr in AffineMap and add to 'results'.
+  SmallVector<AffineExpr, 4> exprs;
+  exprs.reserve(getNumResults());
+
   for (auto expr : getResults()) {
[inline] andydavis1: Could this function also return if all results were constants? Would this allow you to avoid calling extractIntegerResults in affine.min/max.fold above?
     auto folded = exprFolder.constantFold(expr);
-    // If we didn't fold to a constant, then folding fails.
-    if (!folded)
-      return failure();
-
-    results.push_back(folded);
+    // If did not fold to a constant, keep the original expression, and clear
+    // the integer results vector.
+    if (folded) {
+      exprs.push_back(
+          getAffineConstantExpr(folded.getInt(), folded.getContext()));
+      if (results)
+        results->push_back(folded.getInt());
+    } else {
+      exprs.push_back(expr);
+      if (results) {
+        results->clear();
+        results = nullptr;
+      }
+    }
   }
-  assert(results.size() == getNumResults() &&
-         "constant folding produced the wrong number of results");
-  return success();
+
+  return get(getNumDims(), getNumSymbols(), exprs, getContext());
 }

 /// Walk all of the AffineExpr's in this mapping. Each node in an expression
 /// tree is visited in postorder.
 void AffineMap::walkExprs(std::function<void(AffineExpr)> callback) const {
   for (auto expr : getResults())
     expr.walk(callback);
 }
@@ 175 lines not shown @@

mlir/test/Dialect/Linalg/tile.mlir

	// RUN: mlir-opt %s -linalg-tile="linalg-tile-sizes=2" \| FileCheck %s -check-prefix=TILE-2			// RUN: mlir-opt %s -linalg-tile="linalg-tile-sizes=2" \| FileCheck %s -check-prefix=TILE-2
	// RUN: mlir-opt %s -linalg-tile="linalg-tile-sizes=0,2" \| FileCheck %s -check-prefix=TILE-02			// RUN: mlir-opt %s -linalg-tile="linalg-tile-sizes=0,2" \| FileCheck %s -check-prefix=TILE-02
	// RUN: mlir-opt %s -linalg-tile="linalg-tile-sizes=0,0,2" \| FileCheck %s -check-prefix=TILE-002			// RUN: mlir-opt %s -linalg-tile="linalg-tile-sizes=0,0,2" \| FileCheck %s -check-prefix=TILE-002
	// RUN: mlir-opt %s -linalg-tile="linalg-tile-sizes=2,3,4" \| FileCheck %s -check-prefix=TILE-234			// RUN: mlir-opt %s -linalg-tile="linalg-tile-sizes=2,3,4" \| FileCheck %s -check-prefix=TILE-234

	// TILE-2-DAG: #[[strided1D:.*]] = affine_map<(d0)[s0] -> (d0 + s0)>			// TILE-2-DAG: #[[strided1D:.*]] = affine_map<(d0)[s0] -> (d0 + s0)>
	// TILE-02-DAG: #[[strided1D:.*]] = affine_map<(d0)[s0] -> (d0 + s0)>			// TILE-02-DAG: #[[strided1D:.*]] = affine_map<(d0)[s0] -> (d0 + s0)>
	// TILE-002-DAG: #[[strided1D:.*]] = affine_map<(d0)[s0] -> (d0 + s0)>			// TILE-002-DAG: #[[strided1D:.*]] = affine_map<(d0)[s0] -> (d0 + s0)>
	// TILE-234-DAG: #[[strided1D:.*]] = affine_map<(d0)[s0] -> (d0 + s0)>			// TILE-234-DAG: #[[strided1D:.*]] = affine_map<(d0)[s0] -> (d0 + s0)>

	// TILE-2-DAG: #[[strided2D:.]] = affine_map<(d0, d1)[s0, s1] -> (d0 s1 + s0 + d1)>			// TILE-2-DAG: #[[strided2D:.]] = affine_map<(d0, d1)[s0, s1] -> (d0 s1 + s0 + d1)>
	// TILE-02-DAG: #[[strided2D:.]] = affine_map<(d0, d1)[s0, s1] -> (d0 s1 + s0 + d1)>			// TILE-02-DAG: #[[strided2D:.]] = affine_map<(d0, d1)[s0, s1] -> (d0 s1 + s0 + d1)>
	// TILE-002-DAG: #[[strided2D:.]] = affine_map<(d0, d1)[s0, s1] -> (d0 s1 + s0 + d1)>			// TILE-002-DAG: #[[strided2D:.]] = affine_map<(d0, d1)[s0, s1] -> (d0 s1 + s0 + d1)>
	// TILE-234-DAG: #[[strided2D:.]] = affine_map<(d0, d1)[s0, s1] -> (d0 s1 + s0 + d1)>			// TILE-234-DAG: #[[strided2D:.]] = affine_map<(d0, d1)[s0, s1] -> (d0 s1 + s0 + d1)>

	// TILE-2-DAG: #[[bound_map:.*]] = affine_map<(d0, d1, d2) -> (d0, d1 - d2)>			// TILE-2-DAG: #[[bound_map:.*]] = affine_map<(d0, d1, d2) -> (2, d1 - d2)>
	// TILE-02-DAG: #[[bound_map:.*]] = affine_map<(d0, d1, d2) -> (d0, d1 - d2)>			// TILE-02-DAG: #[[bound_map:.*]] = affine_map<(d0, d1, d2) -> (2, d1 - d2)>
	// TILE-002-DAG: #[[bound_map:.*]] = affine_map<(d0, d1, d2) -> (d0, d1 - d2)>			// TILE-002-DAG: #[[bound_map:.*]] = affine_map<(d0, d1, d2) -> (2, d1 - d2)>
	// TILE-234-DAG: #[[bound_map:.*]] = affine_map<(d0, d1, d2) -> (d0, d1 - d2)>			// TILE-234-DAG: #[[bound_map_2:.*]] = affine_map<(d0, d1, d2) -> (2, d1 - d2)>
				// TILE-234-DAG: #[[bound_map_3:.*]] = affine_map<(d0, d1, d2) -> (3, d1 - d2)>
				// TILE-234-DAG: #[[bound_map_4:.*]] = affine_map<(d0, d1, d2) -> (4, d1 - d2)>

	// TILE-2-DAG: #[[strided1D_dynamic:.]] = affine_map<(d0)[s0, s1] -> (d0 s1 + s0)>			// TILE-2-DAG: #[[strided1D_dynamic:.]] = affine_map<(d0)[s0, s1] -> (d0 s1 + s0)>
	// TILE-02-DAG: #[[strided1D_dynamic:.]] = affine_map<(d0)[s0, s1] -> (d0 s1 + s0)>			// TILE-02-DAG: #[[strided1D_dynamic:.]] = affine_map<(d0)[s0, s1] -> (d0 s1 + s0)>
	// T_ILE-002-DAG: #[[strided1D_dynamic:.]] = affine_map<(d0)[s0, s1] -> (d0 s1 + s0)>			// T_ILE-002-DAG: #[[strided1D_dynamic:.]] = affine_map<(d0)[s0, s1] -> (d0 s1 + s0)>
	// TILE-234-DAG: #[[strided1D_dynamic:.]] = affine_map<(d0)[s0, s1] -> (d0 s1 + s0)>			// TILE-234-DAG: #[[strided1D_dynamic:.]] = affine_map<(d0)[s0, s1] -> (d0 s1 + s0)>

	// TILE-2-DAG: #[[strided2D_dynamic:.]] = affine_map<(d0, d1)[s0, s1, s2] -> (d0 s1 + s0 + d1 * s2)>			// TILE-2-DAG: #[[strided2D_dynamic:.]] = affine_map<(d0, d1)[s0, s1, s2] -> (d0 s1 + s0 + d1 * s2)>
	// TILE-02-DAG: #[[strided2D_dynamic:.]] = affine_map<(d0, d1)[s0, s1, s2] -> (d0 s1 + s0 + d1 * s2)>			// TILE-02-DAG: #[[strided2D_dynamic:.]] = affine_map<(d0, d1)[s0, s1, s2] -> (d0 s1 + s0 + d1 * s2)>
	▲ Show 20 Lines • Show All 64 Lines • ▼ Show 20 Lines
	// TILE-234-DAG: %[[C4:.*]] = constant 4 : index			// TILE-234-DAG: %[[C4:.*]] = constant 4 : index
	// TILE-234: %[[ubM:.]] = dim %{{.}}, 0 : memref<?x?xf32, #[[strided2D]]>			// TILE-234: %[[ubM:.]] = dim %{{.}}, 0 : memref<?x?xf32, #[[strided2D]]>
	// TILE-234: %[[ubK:.]] = dim %{{.}}, 1 : memref<?x?xf32, #[[strided2D]]>			// TILE-234: %[[ubK:.]] = dim %{{.}}, 1 : memref<?x?xf32, #[[strided2D]]>
	// TILE-234: %[[ubN:.]] = dim %{{.}}, 1 : memref<?x?xf32, #[[strided2D]]>			// TILE-234: %[[ubN:.]] = dim %{{.}}, 1 : memref<?x?xf32, #[[strided2D]]>
	// TILE-234: loop.for %[[I:.]] = %{{.}}{{.}} to %[[ubM]] step %{{.}} {			// TILE-234: loop.for %[[I:.]] = %{{.}}{{.}} to %[[ubM]] step %{{.}} {
	// TILE-234: loop.for %[[J:.*]] = %{{.*}}{{.*}} to %[[ubN]] step %{{.*}} {			// TILE-234: loop.for %[[J:.*]] = %{{.*}}{{.*}} to %[[ubN]] step %{{.*}} {
	// TILE-234: loop.for %[[K:.*]] = %{{.*}}{{.*}} to %[[ubK]] step %{{.*}} {			// TILE-234: loop.for %[[K:.*]] = %{{.*}}{{.*}} to %[[ubK]] step %{{.*}} {
	// TILE-234: %[[localM:.*]] = dim %{{.*}}, 0			// TILE-234: %[[localM:.*]] = dim %{{.*}}, 0
	// TILE-234: %[[szM:.*]] = affine.min #[[bound_map]](%[[C2]], %[[localM]], %[[I]])			// TILE-234: %[[szM:.*]] = affine.min #[[bound_map_2]](%[[C2]], %[[localM]], %[[I]])
	// TILE-234: %[[localK:.*]] = dim %{{.*}}, 1			// TILE-234: %[[localK:.*]] = dim %{{.*}}, 1
	// TILE-234: %[[szK:.*]] = affine.min #[[bound_map]](%[[C4]], %[[localK]], %[[K]])			// TILE-234: %[[szK:.*]] = affine.min #[[bound_map_4]](%[[C4]], %[[localK]], %[[K]])
	// TILE-234: %[[sAik:.*]] = subview %{{.*}}[%[[I]], %[[K]]] [%[[szM]], %[[szK]]] [%[[C1]], %[[C1]]] : memref<?x?xf32, #[[strided2D]]> to memref<?x?xf32, #[[strided2D_dynamic]]>			// TILE-234: %[[sAik:.*]] = subview %{{.*}}[%[[I]], %[[K]]] [%[[szM]], %[[szK]]] [%[[C1]], %[[C1]]] : memref<?x?xf32, #[[strided2D]]> to memref<?x?xf32, #[[strided2D_dynamic]]>
	// TILE-234: %[[localK:.*]] = dim %{{.*}}, 0			// TILE-234: %[[localK:.*]] = dim %{{.*}}, 0
	// TILE-234: %[[szK:.*]] = affine.min #[[bound_map]](%[[C4]], %[[localK]], %[[K]])			// TILE-234: %[[szK:.*]] = affine.min #[[bound_map_4]](%[[C4]], %[[localK]], %[[K]])
	// TILE-234: %[[localN:.*]] = dim %{{.*}}, 1			// TILE-234: %[[localN:.*]] = dim %{{.*}}, 1
	// TILE-234: %[[szN:.*]] = affine.min #[[bound_map]](%[[C3]], %[[localN]], %[[J]])			// TILE-234: %[[szN:.*]] = affine.min #[[bound_map_3]](%[[C3]], %[[localN]], %[[J]])
	// TILE-234: %[[sBkj:.*]] = subview %{{.*}}[%[[K]], %[[J]]] [%[[szK]], %[[szN]]] [%[[C1]], %[[C1]]] : memref<?x?xf32, #[[strided2D]]> to memref<?x?xf32, #[[strided2D_dynamic]]>			// TILE-234: %[[sBkj:.*]] = subview %{{.*}}[%[[K]], %[[J]]] [%[[szK]], %[[szN]]] [%[[C1]], %[[C1]]] : memref<?x?xf32, #[[strided2D]]> to memref<?x?xf32, #[[strided2D_dynamic]]>
	// TILE-234: %[[localM:.*]] = dim %{{.*}}, 0			// TILE-234: %[[localM:.*]] = dim %{{.*}}, 0
	// TILE-234: %[[szM:.*]] = affine.min #[[bound_map]](%[[C2]], %[[localM]], %[[I]])			// TILE-234: %[[szM:.*]] = affine.min #[[bound_map_2]](%[[C2]], %[[localM]], %[[I]])
	// TILE-234: %[[localN:.*]] = dim %{{.*}}, 1			// TILE-234: %[[localN:.*]] = dim %{{.*}}, 1
	// TILE-234: %[[szN:.*]] = affine.min #[[bound_map]](%[[C3]], %[[localN]], %[[J]])			// TILE-234: %[[szN:.*]] = affine.min #[[bound_map_3]](%[[C3]], %[[localN]], %[[J]])
	// TILE-234: %[[sCij:.*]] = subview %{{.*}}[%[[I]], %[[J]]] [%[[szM]], %[[szN]]] [%[[C1]], %[[C1]]] : memref<?x?xf32, #[[strided2D]]> to memref<?x?xf32, #[[strided2D_dynamic]]>			// TILE-234: %[[sCij:.*]] = subview %{{.*}}[%[[I]], %[[J]]] [%[[szM]], %[[szN]]] [%[[C1]], %[[C1]]] : memref<?x?xf32, #[[strided2D]]> to memref<?x?xf32, #[[strided2D_dynamic]]>
	//			//
	// TILE-234: linalg.matmul(%[[sAik]], %[[sBkj]], %[[sCij]]) : memref<?x?xf32, #[[strided2D_dynamic]]>, memref<?x?xf32, #[[strided2D_dynamic]]>, memref<?x?xf32, #[[strided2D_dynamic]]>			// TILE-234: linalg.matmul(%[[sAik]], %[[sBkj]], %[[sCij]]) : memref<?x?xf32, #[[strided2D_dynamic]]>, memref<?x?xf32, #[[strided2D_dynamic]]>, memref<?x?xf32, #[[strided2D_dynamic]]>

	// When the buffer shapes are known at compile time, it is possible to avoid			// When the buffer shapes are known at compile time, it is possible to avoid
	// the "min" in subview size computation. This test uses buffer sizes divisible			// the "min" in subview size computation. This test uses buffer sizes divisible
	// by respective tile sizes (M=10 divisible by 2, N=12 divisible by 2 and 3,			// by respective tile sizes (M=10 divisible by 2, N=12 divisible by 2 and 3,
	// K=16 divisible by 2 and 4).			// K=16 divisible by 2 and 4).
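A minimal sketch, not part of the patch, of why the "min" mentioned in the comment above becomes unnecessary when the extent is divisible by the tile size. The helper name `tile_sizes` is hypothetical; it evaluates min(tile, extent - iv) for each loop induction value, which is what the `(d0, d1, d2) -> (d0, d1 - d2)` bound map in these tests computes.

```python
def tile_sizes(extent, tile):
    """Subview sizes when tiling `extent` by `tile` (hypothetical helper).

    Each size is min(tile, extent - iv), where iv is the loop induction value.
    """
    return [min(tile, extent - iv) for iv in range(0, extent, tile)]


# Extents and tile sizes named in the test comment: M=10, N=12, K=16.
assert set(tile_sizes(10, 2)) == {2}
assert set(tile_sizes(12, 2)) == {2}
assert set(tile_sizes(12, 3)) == {3}
assert set(tile_sizes(16, 2)) == {2}
assert set(tile_sizes(16, 4)) == {4}

# A non-divisible extent still needs the min for its last, partial tile.
print(tile_sizes(10, 4))  # [4, 4, 2]
```

When every tile is full, the min always selects the tile size, so a constant size can be used directly.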
	// TILE-234-DAG: %[[C1:.*]] = constant 1 : index			// TILE-234-DAG: %[[C1:.*]] = constant 1 : index
	// TILE-234-DAG: %[[C2:.*]] = constant 2 : index			// TILE-234-DAG: %[[C2:.*]] = constant 2 : index
	// TILE-234-DAG: %[[C3:.*]] = constant 3 : index			// TILE-234-DAG: %[[C3:.*]] = constant 3 : index
	// TILE-234: %[[M:.*]] = dim %{{.*}}, 0 : memref<?x?xf32, #[[strided2D]]>			// TILE-234: %[[M:.*]] = dim %{{.*}}, 0 : memref<?x?xf32, #[[strided2D]]>
	// TILE-234: %[[K:.*]] = dim %{{.*}}, 1 : memref<?x?xf32, #[[strided2D]]>			// TILE-234: %[[K:.*]] = dim %{{.*}}, 1 : memref<?x?xf32, #[[strided2D]]>
	// TILE-234: loop.for %[[I:.*]] = %{{.*}}{{.*}} to %[[M]] step %{{.*}} {			// TILE-234: loop.for %[[I:.*]] = %{{.*}}{{.*}} to %[[M]] step %{{.*}} {
	// TILE-234: loop.for %[[J:.*]] = %{{.*}}{{.*}} to %[[K]] step %{{.*}} {			// TILE-234: loop.for %[[J:.*]] = %{{.*}}{{.*}} to %[[K]] step %{{.*}} {
	// TILE-234: %[[localM:.*]] = dim %{{.*}}, 0			// TILE-234: %[[localM:.*]] = dim %{{.*}}, 0
	// TILE-234: %[[szM:.*]] = affine.min #[[bound_map]](%[[C2]], %[[localM]], %[[I]])			// TILE-234: %[[szM:.*]] = affine.min #[[bound_map_2]](%[[C2]], %[[localM]], %[[I]])
	// TILE-234: %[[localN:.*]] = dim %{{.*}}, 1			// TILE-234: %[[localN:.*]] = dim %{{.*}}, 1
	// TILE-234: %[[szN:.*]] = affine.min #[[bound_map]](%[[C3]], %[[localN]], %[[J]])			// TILE-234: %[[szN:.*]] = affine.min #[[bound_map_3]](%[[C3]], %[[localN]], %[[J]])
	// TILE-234: %[[sAij:.*]] = subview %{{.*}}[%[[I]], %[[J]]] [%[[szM]], %[[szN]]] [%[[C1]], %[[C1]]] : memref<?x?xf32, #[[strided2D]]> to memref<?x?xf32, #[[strided2D_dynamic]]>			// TILE-234: %[[sAij:.*]] = subview %{{.*}}[%[[I]], %[[J]]] [%[[szM]], %[[szN]]] [%[[C1]], %[[C1]]] : memref<?x?xf32, #[[strided2D]]> to memref<?x?xf32, #[[strided2D_dynamic]]>
	// TILE-234: %[[localN:.*]] = dim %{{.*}}, 0			// TILE-234: %[[localN:.*]] = dim %{{.*}}, 0
	// TILE-234: %[[szN:.*]] = affine.min #[[bound_map]](%[[C3]], %[[localN]], %[[J]])			// TILE-234: %[[szN:.*]] = affine.min #[[bound_map_3]](%[[C3]], %[[localN]], %[[J]])
	// TILE-234: %[[sBj:.*]] = subview %{{.*}}[%[[J]]] [%[[szN]]] [%[[C1]]] : memref<?xf32, #[[strided1D]]> to memref<?xf32, #[[strided1D_dynamic]]>			// TILE-234: %[[sBj:.*]] = subview %{{.*}}[%[[J]]] [%[[szN]]] [%[[C1]]] : memref<?xf32, #[[strided1D]]> to memref<?xf32, #[[strided1D_dynamic]]>
	// TILE-234: %[[localM:.*]] = dim %{{.*}}, 0			// TILE-234: %[[localM:.*]] = dim %{{.*}}, 0
	// TILE-234: %[[szM:.*]] = affine.min #[[bound_map]](%[[C2]], %[[localM]], %[[I]])			// TILE-234: %[[szM:.*]] = affine.min #[[bound_map_2]](%[[C2]], %[[localM]], %[[I]])
	// TILE-234: %[[sCi:.*]] = subview %{{.*}}[%[[I]]] [%[[szM]]] [%[[C1]]] : memref<?xf32, #[[strided1D]]> to memref<?xf32, #[[strided1D_dynamic]]>			// TILE-234: %[[sCi:.*]] = subview %{{.*}}[%[[I]]] [%[[szM]]] [%[[C1]]] : memref<?xf32, #[[strided1D]]> to memref<?xf32, #[[strided1D_dynamic]]>
	//			//
	// TILE-234: linalg.matvec(%[[sAij]], %[[sBj]], %[[sCi]]) : memref<?x?xf32, #[[strided2D_dynamic]]>, memref<?xf32, #[[strided1D_dynamic]]>, memref<?xf32, #[[strided1D_dynamic]]>			// TILE-234: linalg.matvec(%[[sAij]], %[[sBj]], %[[sCi]]) : memref<?x?xf32, #[[strided2D_dynamic]]>, memref<?xf32, #[[strided1D_dynamic]]>, memref<?xf32, #[[strided1D_dynamic]]>

	func @dot(%arg0: memref<?xf32, offset: ?, strides: [1]>, %arg1: memref<?xf32, offset: ?, strides: [1]>, %arg2: memref<f32>) {			func @dot(%arg0: memref<?xf32, offset: ?, strides: [1]>, %arg1: memref<?xf32, offset: ?, strides: [1]>, %arg2: memref<f32>) {
	linalg.dot(%arg0, %arg1, %arg2) : memref<?xf32, offset: ?, strides: [1]>, memref<?xf32, offset: ?, strides: [1]>, memref<f32>			linalg.dot(%arg0, %arg1, %arg2) : memref<?xf32, offset: ?, strides: [1]>, memref<?xf32, offset: ?, strides: [1]>, memref<f32>
	return			return
	}			}

	// TILE-234-LABEL: func @dot(			// TILE-234-LABEL: func @dot(
	// TILE-234-DAG: %[[C0:.*]] = constant 0 : index			// TILE-234-DAG: %[[C0:.*]] = constant 0 : index
	// TILE-234-DAG: %[[C1:.*]] = constant 1 : index			// TILE-234-DAG: %[[C1:.*]] = constant 1 : index
	// TILE-234-DAG: %[[C2:.*]] = constant 2 : index			// TILE-234-DAG: %[[C2:.*]] = constant 2 : index
	// TILE-234: %[[ubK:.*]] = dim %{{.*}}, 0 : memref<?xf32, #[[strided1D]]>			// TILE-234: %[[ubK:.*]] = dim %{{.*}}, 0 : memref<?xf32, #[[strided1D]]>
	// TILE-234: loop.for %[[I:.*]] = %{{.*}} to %[[ubK]] step %{{.*}} {			// TILE-234: loop.for %[[I:.*]] = %{{.*}} to %[[ubK]] step %{{.*}} {
	// TILE-234: %[[localM:.*]] = dim %{{.*}}, 0			// TILE-234: %[[localM:.*]] = dim %{{.*}}, 0
	// TILE-234: %[[szM:.*]] = affine.min #[[bound_map]](%[[C2]], %[[localM]], %[[I]])			// TILE-234: %[[szM:.*]] = affine.min #[[bound_map_2]](%[[C2]], %[[localM]], %[[I]])
	// TILE-234: %[[sAi:.*]] = subview %{{.*}}[%[[I]]] [%[[szM]]] [%[[C1]]] : memref<?xf32, #[[strided1D]]> to memref<?xf32, #[[strided1D_dynamic]]>			// TILE-234: %[[sAi:.*]] = subview %{{.*}}[%[[I]]] [%[[szM]]] [%[[C1]]] : memref<?xf32, #[[strided1D]]> to memref<?xf32, #[[strided1D_dynamic]]>
	// TILE-234: %[[localM:.*]] = dim %{{.*}}, 0			// TILE-234: %[[localM:.*]] = dim %{{.*}}, 0
	// TILE-234: %[[szM:.*]] = affine.min #[[bound_map]](%[[C2]], %[[localM]], %[[I]])			// TILE-234: %[[szM:.*]] = affine.min #[[bound_map_2]](%[[C2]], %[[localM]], %[[I]])
	// TILE-234: %[[sBi:.*]] = subview %{{.*}}[%[[I]]] [%[[szM]]] [%[[C1]]] : memref<?xf32, #[[strided1D]]> to memref<?xf32, #[[strided1D_dynamic]]>			// TILE-234: %[[sBi:.*]] = subview %{{.*}}[%[[I]]] [%[[szM]]] [%[[C1]]] : memref<?xf32, #[[strided1D]]> to memref<?xf32, #[[strided1D_dynamic]]>
	// TILE-234: linalg.dot(%[[sAi]], %[[sBi]], %{{.*}}) : memref<?xf32, #[[strided1D_dynamic]]>, memref<?xf32, #[[strided1D_dynamic]]>, memref<f32>			// TILE-234: linalg.dot(%[[sAi]], %[[sBi]], %{{.*}}) : memref<?xf32, #[[strided1D_dynamic]]>, memref<?xf32, #[[strided1D_dynamic]]>, memref<f32>

	func @fill_static(%arg0: memref<127x99xf32>, %arg1: f32) {			func @fill_static(%arg0: memref<127x99xf32>, %arg1: f32) {
	linalg.fill(%arg0, %arg1) : memref<127x99xf32>, f32			linalg.fill(%arg0, %arg1) : memref<127x99xf32>, f32
	return			return
	}			}
	// TILE-2-LABEL: func @fill_static			// TILE-2-LABEL: func @fill_static

mlir/test/Dialect/Linalg/tile_conv.mlir

	// RUN: mlir-opt %s -linalg-tile="linalg-tile-sizes=2,3,0,0,4" | FileCheck %s -check-prefix=TILE-23004			// RUN: mlir-opt %s -linalg-tile="linalg-tile-sizes=2,3,0,0,4" | FileCheck %s -check-prefix=TILE-23004

	// TILE-23004-DAG: #[[D0x30pS0x10:.*]] = affine_map<(d0) -> (d0 * 30)>			// TILE-23004-DAG: #[[D0x30pS0x10:.*]] = affine_map<(d0) -> (d0 * 30)>
	// TILE-23004-DAG: #[[S0x10p90:.*]] = affine_map<()[s0] -> (s0 * 10 + 90)>			// TILE-23004-DAG: #[[S0x10p90:.*]] = affine_map<()[s0] -> (s0 * 10 + 90)>
	// TILE-23004-DAG: #[[strided4D:.*]] = affine_map<(d0, d1, d2, d3)[s0, s1, s2, s3] -> (d0 * s1 + s0 + d1 * s2 + d2 * s3 + d3)>			// TILE-23004-DAG: #[[strided4D:.*]] = affine_map<(d0, d1, d2, d3)[s0, s1, s2, s3] -> (d0 * s1 + s0 + d1 * s2 + d2 * s3 + d3)>
	// TILE-23004-DAG: #[[strided4D_dynamic:.*]] = affine_map<(d0, d1, d2, d3)[s0, s1, s2, s3, s4] -> (d0 * s1 + s0 + d1 * s2 + d2 * s3 + d3 * s4)>			// TILE-23004-DAG: #[[strided4D_dynamic:.*]] = affine_map<(d0, d1, d2, d3)[s0, s1, s2, s3, s4] -> (d0 * s1 + s0 + d1 * s2 + d2 * s3 + d3 * s4)>
	// TILE-23004-DAG: #[[bound_map:.*]] = affine_map<(d0, d1, d2) -> (d0, d1 - d2)>			// TILE-23004-DAG: #[[bound_map_4:.*]] = affine_map<(d0, d1, d2) -> (4, d1 - d2)>

	func @conv(%arg0: memref<?x?x?x?xf32, offset: ?, strides: [?, ?, ?, 1]>, %arg1: memref<?x?x?x?xf32, offset: ?, strides: [?, ?, ?, 1]>, %arg2: memref<?x?x?x?xf32, offset: ?, strides: [?, ?, ?, 1]>) {			func @conv(%arg0: memref<?x?x?x?xf32, offset: ?, strides: [?, ?, ?, 1]>, %arg1: memref<?x?x?x?xf32, offset: ?, strides: [?, ?, ?, 1]>, %arg2: memref<?x?x?x?xf32, offset: ?, strides: [?, ?, ?, 1]>) {
	linalg.conv(%arg0, %arg1, %arg2) {dilations = [10, 20], strides = [30, 40]} : memref<?x?x?x?xf32, offset: ?, strides: [?, ?, ?, 1]>, memref<?x?x?x?xf32, offset: ?, strides: [?, ?, ?, 1]>, memref<?x?x?x?xf32, offset: ?, strides: [?, ?, ?, 1]>			linalg.conv(%arg0, %arg1, %arg2) {dilations = [10, 20], strides = [30, 40]} : memref<?x?x?x?xf32, offset: ?, strides: [?, ?, ?, 1]>, memref<?x?x?x?xf32, offset: ?, strides: [?, ?, ?, 1]>, memref<?x?x?x?xf32, offset: ?, strides: [?, ?, ?, 1]>
	return			return
	}			}
	// TILE-23004-LABEL: func @conv(			// TILE-23004-LABEL: func @conv(
	// TILE-23004: %{{.*}}: memref<?x?x?x?xf32, #[[strided4D]]>, %{{.*}}: memref<?x?x?x?xf32, #[[strided4D]]>, %{{.*}}: memref<?x?x?x?xf32, #[[strided4D]]>) {			// TILE-23004: %{{.*}}: memref<?x?x?x?xf32, #[[strided4D]]>, %{{.*}}: memref<?x?x?x?xf32, #[[strided4D]]>, %{{.*}}: memref<?x?x?x?xf32, #[[strided4D]]>) {
	// TILE-23004-DAG: %[[C0:.*]] = constant 0 : index			// TILE-23004-DAG: %[[C0:.*]] = constant 0 : index
	// TILE-23004-DAG: %[[C1:.*]] = constant 1 : index			// TILE-23004-DAG: %[[C1:.*]] = constant 1 : index
	// TILE-23004-DAG: %[[C2:.*]] = constant 2 : index			// TILE-23004-DAG: %[[C2:.*]] = constant 2 : index
	// TILE-23004-DAG: %[[C3:.*]] = constant 3 : index			// TILE-23004-DAG: %[[C3:.*]] = constant 3 : index
	// TILE-23004-DAG: %[[C4:.*]] = constant 4 : index			// TILE-23004-DAG: %[[C4:.*]] = constant 4 : index
	// TILE-23004: %[[Q:.*]] = dim %{{.*}}, 2 : memref<?x?x?x?xf32, #[[strided4D]]>			// TILE-23004: %[[Q:.*]] = dim %{{.*}}, 2 : memref<?x?x?x?xf32, #[[strided4D]]>
	// TILE-23004: %[[B:.*]] = dim %{{.*}}, 0 : memref<?x?x?x?xf32, #[[strided4D]]>			// TILE-23004: %[[B:.*]] = dim %{{.*}}, 0 : memref<?x?x?x?xf32, #[[strided4D]]>
	// TILE-23004: %[[PaddedInput0:.*]] = dim %{{.*}}, 1 : memref<?x?x?x?xf32, #[[strided4D]]>			// TILE-23004: %[[PaddedInput0:.*]] = dim %{{.*}}, 1 : memref<?x?x?x?xf32, #[[strided4D]]>
	// TILE-23004: %[[X0:.*]] = dim %{{.*}}, 1 : memref<?x?x?x?xf32, #[[strided4D]]>			// TILE-23004: %[[X0:.*]] = dim %{{.*}}, 1 : memref<?x?x?x?xf32, #[[strided4D]]>
	// TILE-23004: loop.for %[[ivI:.*]] = %{{.*}} to %[[B]] step %{{.*}} {			// TILE-23004: loop.for %[[ivI:.*]] = %{{.*}} to %[[B]] step %{{.*}} {
	// TILE-23004: loop.for %[[ivJ:.*]] = %{{.*}} to %[[X0]] step %{{.*}} {			// TILE-23004: loop.for %[[ivJ:.*]] = %{{.*}} to %[[X0]] step %{{.*}} {
	// TILE-23004: loop.for %[[ivK:.*]] = %{{.*}} to %[[Q]] step %{{.*}} {			// TILE-23004: loop.for %[[ivK:.*]] = %{{.*}} to %[[Q]] step %{{.*}} {
	// TILE-23004: %[[Z0:.*]] = dim %{{.*}}, 0 : memref<?x?x?x?xf32, #[[strided4D]]>			// TILE-23004: %[[Z0:.*]] = dim %{{.*}}, 0 : memref<?x?x?x?xf32, #[[strided4D]]>
	// TILE-23004: %[[Z1:.*]] = dim %{{.*}}, 1 : memref<?x?x?x?xf32, #[[strided4D]]>			// TILE-23004: %[[Z1:.*]] = dim %{{.*}}, 1 : memref<?x?x?x?xf32, #[[strided4D]]>
	// TILE-23004: %[[Z2:.*]] = dim %{{.*}}, 2 : memref<?x?x?x?xf32, #[[strided4D]]>			// TILE-23004: %[[Z2:.*]] = dim %{{.*}}, 2 : memref<?x?x?x?xf32, #[[strided4D]]>
	// TILE-23004: %[[szK:.*]] = affine.min #[[bound_map]](%[[C4]], %[[Z2]], %[[ivK]])			// TILE-23004: %[[szK:.*]] = affine.min #[[bound_map_4]](%[[C4]], %[[Z2]], %[[ivK]])
	// TILE-23004: %[[K:.*]] = dim %{{.*}}, 3 : memref<?x?x?x?xf32, #[[strided4D]]>			// TILE-23004: %[[K:.*]] = dim %{{.*}}, 3 : memref<?x?x?x?xf32, #[[strided4D]]>
	// TILE-23004: %[[FilterView:.*]] = subview %{{.*}}[%[[C0]], %[[C0]], %[[ivK]], %[[C0]]] [%[[Z0]], %[[Z1]], %[[szK]], %[[K]]] [%[[C1]], %[[C1]], %[[C1]], %[[C1]]] : memref<?x?x?x?xf32, #[[strided4D]]> to memref<?x?x?x?xf32, #[[strided4D_dynamic]]>			// TILE-23004: %[[FilterView:.*]] = subview %{{.*}}[%[[C0]], %[[C0]], %[[ivK]], %[[C0]]] [%[[Z0]], %[[Z1]], %[[szK]], %[[K]]] [%[[C1]], %[[C1]], %[[C1]], %[[C1]]] : memref<?x?x?x?xf32, #[[strided4D]]> to memref<?x?x?x?xf32, #[[strided4D_dynamic]]>
	//			//
	// TILE-23004: %[[J1:.*]] = affine.apply #[[D0x30pS0x10]](%[[ivJ]])			// TILE-23004: %[[J1:.*]] = affine.apply #[[D0x30pS0x10]](%[[ivJ]])
	// T__ILE-23004: %[[I1pStep:.*]] = affine.apply #[[S0x10p90]]()[%[[I1]]]			// T__ILE-23004: %[[I1pStep:.*]] = affine.apply #[[S0x10p90]]()[%[[I1]]]
	// TILE-23004: %[[SZ2:.*]] = dim %{{.*}}, 2 : memref<?x?x?x?xf32, #[[strided4D]]>			// TILE-23004: %[[SZ2:.*]] = dim %{{.*}}, 2 : memref<?x?x?x?xf32, #[[strided4D]]>
	// TILE-23004: %[[dim3:.*]] = dim %{{.*}}, 3			// TILE-23004: %[[dim3:.*]] = dim %{{.*}}, 3
	// TILE-23004: %[[sz3:.*]] = affine.min #[[bound_map]](%[[C4]], %[[dim3]], %[[ivK]]			// TILE-23004: %[[sz3:.*]] = affine.min #[[bound_map_4]](%[[C4]], %[[dim3]], %[[ivK]]
	// TILE-23004: %[[InputView:.*]] = subview %{{.*}}[%[[ivI]], %[[J1]], %[[C0]], %[[ivK]]] [%{{.*}}, %{{.*}}, %[[SZ2]], %[[sz3]]] [%[[C1]], %[[C1]], %[[C1]], %[[C1]]] : memref<?x?x?x?xf32, #[[strided4D]]> to memref<?x?x?x?xf32, #[[strided4D_dynamic]]>			// TILE-23004: %[[InputView:.*]] = subview %{{.*}}[%[[ivI]], %[[J1]], %[[C0]], %[[ivK]]] [%{{.*}}, %{{.*}}, %[[SZ2]], %[[sz3]]] [%[[C1]], %[[C1]], %[[C1]], %[[C1]]] : memref<?x?x?x?xf32, #[[strided4D]]> to memref<?x?x?x?xf32, #[[strided4D_dynamic]]>
	//			//
	// TILE-23004: %[[X0:.*]] = dim %{{.*}}, 2 : memref<?x?x?x?xf32, #[[strided4D]]>			// TILE-23004: %[[X0:.*]] = dim %{{.*}}, 2 : memref<?x?x?x?xf32, #[[strided4D]]>
	// TILE-23004: %[[X1:.*]] = dim %{{.*}}, 3 : memref<?x?x?x?xf32, #[[strided4D]]>			// TILE-23004: %[[X1:.*]] = dim %{{.*}}, 3 : memref<?x?x?x?xf32, #[[strided4D]]>
	// TILE-23004: %[[OutputView:.*]] = subview %{{.*}}[%[[ivI]], %[[ivJ]], %[[C0]], %[[C0]]] [%{{.*}}, %{{.*}}, %[[X0]], %[[X1]]] [%[[C1]], %[[C1]], %[[C1]], %[[C1]]] : memref<?x?x?x?xf32, #[[strided4D]]> to memref<?x?x?x?xf32, #[[strided4D_dynamic]]>			// TILE-23004: %[[OutputView:.*]] = subview %{{.*}}[%[[ivI]], %[[ivJ]], %[[C0]], %[[C0]]] [%{{.*}}, %{{.*}}, %[[X0]], %[[X1]]] [%[[C1]], %[[C1]], %[[C1]], %[[C1]]] : memref<?x?x?x?xf32, #[[strided4D]]> to memref<?x?x?x?xf32, #[[strided4D_dynamic]]>
	//			//
	// TILE-23004: linalg.conv(%[[FilterView]], %[[InputView]], %[[OutputView]]) {dilations = [10, 20], strides = [30, 40]} : memref<?x?x?x?xf32, #[[strided4D_dynamic]]>, memref<?x?x?x?xf32, #[[strided4D_dynamic]]>, memref<?x?x?x?xf32, #[[strided4D_dynamic]]>			// TILE-23004: linalg.conv(%[[FilterView]], %[[InputView]], %[[OutputView]]) {dilations = [10, 20], strides = [30, 40]} : memref<?x?x?x?xf32, #[[strided4D_dynamic]]>, memref<?x?x?x?xf32, #[[strided4D_dynamic]]>, memref<?x?x?x?xf32, #[[strided4D_dynamic]]>
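The checks above change the bound map from `(d0, d1, d2) -> (d0, d1 - d2)` to `(d0, d1, d2) -> (4, d1 - d2)`: the result fed by the constant tile size is folded, while `d1 - d2` stays variable and the dimension list is left unchanged. The following is a rough Python sketch of that idea only; it is not the C++ implementation in AffineOps.cpp, and the expression encoding and the names `fold` and `partially_fold_min` are assumptions made for illustration.

```python
def fold(expr, consts):
    """Fold one affine expression given the known constant dims.

    expr is an int, a dim name such as "d0", or a tuple (op, lhs, rhs)
    with op in {"+", "-", "*"}. This encoding is hypothetical.
    """
    if isinstance(expr, int):
        return expr
    if isinstance(expr, str):
        return consts.get(expr, expr)
    op, lhs, rhs = expr
    lhs, rhs = fold(lhs, consts), fold(rhs, consts)
    if isinstance(lhs, int) and isinstance(rhs, int):
        return {"+": lhs + rhs, "-": lhs - rhs, "*": lhs * rhs}[op]
    return (op, lhs, rhs)


def partially_fold_min(results, consts):
    """Fold every result; return an int only if all results became constant."""
    folded = [fold(e, consts) for e in results]
    if all(isinstance(e, int) for e in folded):
        return min(folded)
    return folded


bound_map = ["d0", ("-", "d1", "d2")]
# Only the tile size (d0 = 4) is constant: the map is simplified, not removed.
print(partially_fold_min(bound_map, {"d0": 4}))  # [4, ('-', 'd1', 'd2')]
# All operands constant: the whole min folds to a number.
print(partially_fold_min(bound_map, {"d0": 4, "d1": 10, "d2": 8}))  # 2
```

As in the summary of this revision, the sketch does not drop dimensions that became unused; that is left to canonicalization.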

mlir/test/Dialect/Linalg/tile_conv_padding.mlir

	// RUN: mlir-opt %s -linalg-tile="linalg-tile-sizes=2,3,0,0,4" | FileCheck %s -check-prefix=TILE-23004			// RUN: mlir-opt %s -linalg-tile="linalg-tile-sizes=2,3,0,0,4" | FileCheck %s -check-prefix=TILE-23004
	// RUN: mlir-opt %s -linalg-tile="linalg-tile-sizes=2" | FileCheck %s -check-prefix=TILE-20000			// RUN: mlir-opt %s -linalg-tile="linalg-tile-sizes=2" | FileCheck %s -check-prefix=TILE-20000

	// TILE-23004-DAG: #[[strided4D:.*]] = affine_map<(d0, d1, d2, d3)[s0, s1, s2, s3] -> (d0 * s1 + s0 + d1 * s2 + d2 * s3 + d3)>			// TILE-23004-DAG: #[[strided4D:.*]] = affine_map<(d0, d1, d2, d3)[s0, s1, s2, s3] -> (d0 * s1 + s0 + d1 * s2 + d2 * s3 + d3)>
	// TILE-20000-DAG: #[[strided4D:.*]] = affine_map<(d0, d1, d2, d3)[s0, s1, s2, s3] -> (d0 * s1 + s0 + d1 * s2 + d2 * s3 + d3)>			// TILE-20000-DAG: #[[strided4D:.*]] = affine_map<(d0, d1, d2, d3)[s0, s1, s2, s3] -> (d0 * s1 + s0 + d1 * s2 + d2 * s3 + d3)>
	// TILE-20000-DAG: #[[minmap:.*]] = affine_map<(d0, d1, d2) -> (d0, d1 - d2)>			// TILE-20000-DAG: #[[minmap:.*]] = affine_map<(d0, d1, d2) -> (2, d1 - d2)>
	// TILE-20000-DAG: #[[subviewstride:.*]] = affine_map<(d0, d1, d2, d3)[s0, s1, s2, s3, s4] -> (d0 * s1 + s0 + d1 * s2 + d2 * s3 + d3 * s4)>			// TILE-20000-DAG: #[[subviewstride:.*]] = affine_map<(d0, d1, d2, d3)[s0, s1, s2, s3, s4] -> (d0 * s1 + s0 + d1 * s2 + d2 * s3 + d3 * s4)>

	func @conv_padding(%arg0: memref<?x?x?x?xf32, offset: ?, strides: [?, ?, ?, 1]>, %arg1: memref<?x?x?x?xf32, offset: ?, strides: [?, ?, ?, 1]>, %arg2: memref<?x?x?x?xf32, offset: ?, strides: [?, ?, ?, 1]>) {			func @conv_padding(%arg0: memref<?x?x?x?xf32, offset: ?, strides: [?, ?, ?, 1]>, %arg1: memref<?x?x?x?xf32, offset: ?, strides: [?, ?, ?, 1]>, %arg2: memref<?x?x?x?xf32, offset: ?, strides: [?, ?, ?, 1]>) {
	linalg.conv(%arg0, %arg1, %arg2) {dilations = [10, 20], padding = dense<[[1, 1], [0, 1]]> : tensor<2x2xi64>, strides = [30, 40]} : memref<?x?x?x?xf32, offset: ?, strides: [?, ?, ?, 1]>, memref<?x?x?x?xf32, offset: ?, strides: [?, ?, ?, 1]>, memref<?x?x?x?xf32, offset: ?, strides: [?, ?, ?, 1]>			linalg.conv(%arg0, %arg1, %arg2) {dilations = [10, 20], padding = dense<[[1, 1], [0, 1]]> : tensor<2x2xi64>, strides = [30, 40]} : memref<?x?x?x?xf32, offset: ?, strides: [?, ?, ?, 1]>, memref<?x?x?x?xf32, offset: ?, strides: [?, ?, ?, 1]>, memref<?x?x?x?xf32, offset: ?, strides: [?, ?, ?, 1]>
	return			return
	}			}
	// TILE-23004-LABEL: func @conv_padding(			// TILE-23004-LABEL: func @conv_padding(
	// TILE-23004-SAME: %[[ARG0:[a-zA-Z0-9_]*]]: memref<?x?x?x?xf32, #[[strided4D]]>			// TILE-23004-SAME: %[[ARG0:[a-zA-Z0-9_]*]]: memref<?x?x?x?xf32, #[[strided4D]]>