This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
mlir/
-
include/mlir/Dialect/Linalg/Transforms/
-
mlir/
-
Dialect/
-
Linalg/
-
Transforms/
-
Transforms.h
-
lib/Dialect/Linalg/Transforms/
-
Dialect/
-
Linalg/
-
Transforms/
-
CMakeLists.txt
2/7
MatmulToMMT4d.cpp
-
test/
-
Dialect/Linalg/
-
Linalg/
-
matmul_to_mmt4d.mlir
-
lib/Dialect/Linalg/
-
Dialect/
-
Linalg/
-
CMakeLists.txt
-
TestLinalgMatmulToMMT4d.cpp
-
tools/mlir-opt/
-
mlir-opt/
-
mlir-opt.cpp

Differential D106006

Add a pattern to convert static linalg.matmul -> linalg.mmt4d
AbandonedPublic

Authored by asaadaldien on Jul 14 2021, 12:33 PM.

Download Raw Diff

Details

Reviewers

mravishankar
nicolasvasilache
Benoit

Summary

This pattern converts statically shaped linalg.matmul when operands sizes (m, n, k) are an integer multiple of
mmt4d's operands inner most dims (m0, n0, k0).

This pattern does the following transofmrations to operands and mmt4d results:

operands:

lhs: (m, k) -(reshape)-> (m1, m0, k1, k0) -(transpose)-> (m1, k1, m0, k0)
rhs: (k, n) -(reshape)-> (k1, k0, n1, n0) -(transpose)-> (n1, k1, n0, k0)

result:

mmt4d: (m1, n1, m0, n0) -(transpose)-> (m1, m0, n1, n0) -(reshape)-> (m, n)

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

asaadaldien created this revision.Jul 14 2021, 12:33 PM

Herald added a reviewer: mravishankar. · View Herald TranscriptJul 14 2021, 12:33 PM

Herald added subscribers: ormris, dcaballe, cota and 21 others. · View Herald Transcript

asaadaldien requested review of this revision.Jul 14 2021, 12:33 PM

Herald added a reviewer: nicolasvasilache. · View Herald TranscriptJul 14 2021, 12:33 PM

Herald added a project: Restricted Project. · View Herald Transcript

Herald added subscribers: limo1996, stephenneuendorffer, nicolasvasilache. · View Herald Transcript

asaadaldien added a reviewer: Benoit.Jul 14 2021, 12:34 PM

asaadaldien edited the summary of this revision. (Show Details)

Benoit added inline comments.Jul 14 2021, 12:45 PM

mlir/lib/Dialect/Linalg/Transforms/MatmulToMMT4d.cpp
53–54	Add a comment that it's temporary that we are bailing out here, and that we will eventually want to accept any sizes by doing padding?
76	I'm thinking that it might help readability of this code if this function were named 'packMatrix' : "transpose" is an implementation detail, and the concept of "transposition" is a bit overloaded in the present context since we are also dealing with transposing the RHS matrix (which gives the letter T in the name, "MMT4D"). 'Matrix' vs 'Operand' because I'm thinking, this function in itself only cares that it gets a matrix to work on; that matrix only needs to be called an 'operand' in the context of the mmt4d op.
124	If the above function were renamed to packMatrix, then it might make sense to rename this one to unpackMatrix.

Benoit added inline comments.Jul 14 2021, 12:48 PM

mlir/lib/Dialect/Linalg/Transforms/MatmulToMMT4d.cpp
76	Ah sorry I had missed that the 2D->4D conversion was done above separately in expandTo4D. Do you think that it would be reasonable to call expandTo4D inside packMatrix (and unpackMatrix below) i.e. my proposed new names for transposeOperand, collapseOperand, so that the internals of packing/unpacking are abstracted behind these functions and the higher-level code here can thus look simpler?

Harbormaster completed remote builds in B114058: Diff 358699.Jul 14 2021, 1:35 PM

asaadaldien added inline comments.Jul 14 2021, 1:41 PM

mlir/lib/Dialect/Linalg/Transforms/MatmulToMMT4d.cpp
53–54	We have to check divisibility here as a precondition, padding is something that is handled outside this pattern (e.g we do pad operands in IREE to an next integer multiple prior to apply this).

Benoit added inline comments.Jul 14 2021, 1:45 PM

mlir/lib/Dialect/Linalg/Transforms/MatmulToMMT4d.cpp
53–54	I see; I understand how in the current context where K0,M0,N0 are compile-time constants from the get go (ie already here where we are lowering matmul to mmt4d) , it's possible to split the padding part into a separate pattern. But how is this going to generalize to when K0,M0,N0 are '?' dynamic shape dimensions at the time when we need to convert matmul to mmt4d?

asaadaldien added inline comments.Jul 14 2021, 2:00 PM

mlir/lib/Dialect/Linalg/Transforms/MatmulToMMT4d.cpp
76	There will be another pattern that doesn't have `M0, N0, K0` as constants and doesn't have to run on statically shaped operands as it will do `<?x?xf32> -> <?x?x?x?xf32>` which `linalg.tensor_expand_shape` doesn't support atm. (In addition to compiler support to specialize M0, N0, K0 for runtime versioning in IREE...etc).

We are adding this pattern to IREE for now..

Herald added a subscriber: Chia-hungDuan. · View Herald TranscriptAug 2 2021, 1:29 PM

Revision Contents

Path

Size

mlir/

include/

mlir/

Dialect/

Linalg/

Transforms/

Transforms.h

8 lines

lib/

Dialect/

Linalg/

Transforms/

CMakeLists.txt

1 line

MatmulToMMT4d.cpp

153 lines

test/

Dialect/

Linalg/

matmul_to_mmt4d.mlir

52 lines

lib/

Dialect/

Linalg/

CMakeLists.txt

1 line

TestLinalgMatmulToMMT4d.cpp

66 lines

tools/

mlir-opt/

mlir-opt.cpp

2 lines

Diff 358699

mlir/include/mlir/Dialect/Linalg/Transforms/Transforms.h

	Show First 20 Lines • Show All 81 Lines • ▼ Show 20 Lines
	void populateFoldUnitExtentDimsPatterns(RewritePatternSet &patterns);			void populateFoldUnitExtentDimsPatterns(RewritePatternSet &patterns);

	/// Patterns that are used to inline constant operands into linalg generic ops.			/// Patterns that are used to inline constant operands into linalg generic ops.
	void populateInlineConstantOperandsPatterns(RewritePatternSet &patterns);			void populateInlineConstantOperandsPatterns(RewritePatternSet &patterns);

	/// Pattern to convert TiledLoopOp to SCF loops.			/// Pattern to convert TiledLoopOp to SCF loops.
	void populateTiledLoopToSCFPattern(RewritePatternSet &patterns);			void populateTiledLoopToSCFPattern(RewritePatternSet &patterns);

				/// Pattern to convert linalg.matmul to linalg.mmt4d.
				void populateMatmulToMMT4DPatterns(RewritePatternSet &patterns, int M0, int N0,
				Lint: Pre-merge checks Inline Actions clang-tidy: warning: invalid case style for parameter 'M0' [readability-identifier-naming] not useful clang-tidy: warning: invalid case style for parameter 'N0' [readability-identifier-naming] not useful Lint: Pre-merge checks: clang-tidy: warning: invalid case style for parameter 'M0' [readability-identifier-naming]…
				int K0);
				Lint: Pre-merge checks Inline Actions clang-tidy: warning: invalid case style for parameter 'K0' [readability-identifier-naming] not useful Lint: Pre-merge checks: clang-tidy: warning: invalid case style for parameter 'K0' [readability-identifier-naming]…

	/// Options that control fusion of elementwise operations.			/// Options that control fusion of elementwise operations.
	struct LinalgElementwiseFusionOptions {			struct LinalgElementwiseFusionOptions {
	/// Enable fusion of reshapes into the shape with elementwise operations. By			/// Enable fusion of reshapes into the shape with elementwise operations. By
	/// default it is disabled for unit dimensions reshape.			/// default it is disabled for unit dimensions reshape.
	ControlElementwiseOpsFusionFn controlFoldingReshapesFn = skipUnitDimReshape;			ControlElementwiseOpsFusionFn controlFoldingReshapesFn = skipUnitDimReshape;

	LinalgElementwiseFusionOptions &			LinalgElementwiseFusionOptions &
	setControlFoldingReshapes(ControlElementwiseOpsFusionFn fun) {			setControlFoldingReshapes(ControlElementwiseOpsFusionFn fun) {
	▲ Show 20 Lines • Show All 808 Lines • ▼ Show 20 Lines
	};			};

	/// Populates `patterns` with patterns that vectorize linalg.pad_tensor.			/// Populates `patterns` with patterns that vectorize linalg.pad_tensor.
	/// These patterns are meant to apply in a complementary fashion. Benefits			/// These patterns are meant to apply in a complementary fashion. Benefits
	/// are used to encode a certain ordering of pattern application. To avoid			/// are used to encode a certain ordering of pattern application. To avoid
	/// scattering magic constants throughout the code base, the patterns must be			/// scattering magic constants throughout the code base, the patterns must be
	/// added with this function. `baseBenefit` can be used to offset the benefit			/// added with this function. `baseBenefit` can be used to offset the benefit
	/// of all PadTensorOp vectorization patterns by a certain value.			/// of all PadTensorOp vectorization patterns by a certain value.
	void populatePadTensorOpVectorizationPatterns(			void populatePadTensorOpVectorizationPatterns(RewritePatternSet &patterns,
	RewritePatternSet &patterns, PatternBenefit baseBenefit = 1);			PatternBenefit baseBenefit = 1);

	/// Match and rewrite for the pattern:			/// Match and rewrite for the pattern:
	/// ```			/// ```
	/// %alloc = ...			/// %alloc = ...
	/// [optional] %view = memref.view %alloc ...			/// [optional] %view = memref.view %alloc ...
	/// %subView = subview %allocOrView ...			/// %subView = subview %allocOrView ...
	/// [optional] linalg.fill(%allocOrView, %cst) ...			/// [optional] linalg.fill(%allocOrView, %cst) ...
	/// ...			/// ...
	▲ Show 20 Lines • Show All 193 Lines • Show Last 20 Lines

mlir/lib/Dialect/Linalg/Transforms/CMakeLists.txt

	add_mlir_dialect_library(MLIRLinalgTransforms			add_mlir_dialect_library(MLIRLinalgTransforms
	Bufferize.cpp			Bufferize.cpp
	CodegenStrategy.cpp			CodegenStrategy.cpp
	ComprehensiveBufferize.cpp			ComprehensiveBufferize.cpp
	Detensorize.cpp			Detensorize.cpp
	Distribution.cpp			Distribution.cpp
	DropUnitDims.cpp			DropUnitDims.cpp
	ElementwiseToLinalg.cpp			ElementwiseToLinalg.cpp
	Fusion.cpp			Fusion.cpp
	FusionOnTensors.cpp			FusionOnTensors.cpp
	Generalization.cpp			Generalization.cpp
	Hoisting.cpp			Hoisting.cpp
	InlineScalarOperands.cpp			InlineScalarOperands.cpp
	Interchange.cpp			Interchange.cpp
	Loops.cpp			Loops.cpp
				MatmulToMMT4d.cpp
	Promotion.cpp			Promotion.cpp
	Tiling.cpp			Tiling.cpp
	Transforms.cpp			Transforms.cpp
	Vectorization.cpp			Vectorization.cpp

	ADDITIONAL_HEADER_DIRS			ADDITIONAL_HEADER_DIRS
	${MLIR_MAIN_INCLUDE_DIR}/mlir/Dialect/Linalg			${MLIR_MAIN_INCLUDE_DIR}/mlir/Dialect/Linalg

	Show All 25 Lines

mlir/lib/Dialect/Linalg/Transforms/MatmulToMMT4d.cpp

This file was added.

				//===- MatmulToMMT4d.cpp - Pass to inline scalar operands =============//
				//
				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
				//
				//===----------------------------------------------------------------------===//
				//
				// This file implements patterns/pass to convert linalg.matmul into linalg.mmt4d
				//
				//===----------------------------------------------------------------------===//

				#include "mlir/Dialect/Linalg/Transforms/Transforms.h"

				#include "mlir/Dialect/Linalg/IR/LinalgOps.h"
				#include "mlir/IR/PatternMatch.h"
				#include "mlir/Pass/Pass.h"
				#include "mlir/Transforms/GreedyPatternRewriteDriver.h"

				using namespace mlir;
				using namespace mlir::linalg;

				namespace {
				class LinalgStaticMatmulOpToLinalgMMT4dOpPattern
				: public OpRewritePattern<MatmulOp> {
				public:
				LinalgStaticMatmulOpToLinalgMMT4dOpPattern(MLIRContext *context, int M0,
				Lint: Pre-merge checks Inline Actions clang-tidy: warning: invalid case style for parameter 'M0' [readability-identifier-naming] not useful Lint: Pre-merge checks: clang-tidy: warning: invalid case style for parameter 'M0' [readability-identifier-naming]…
				int N0, int K0,
				Lint: Pre-merge checks Inline Actions clang-tidy: warning: invalid case style for parameter 'N0' [readability-identifier-naming] not useful clang-tidy: warning: invalid case style for parameter 'K0' [readability-identifier-naming] not useful Lint: Pre-merge checks: clang-tidy: warning: invalid case style for parameter 'N0' [readability-identifier-naming]…
				PatternBenefit benefit = 1)
				: OpRewritePattern<MatmulOp>(context, benefit), M0Size(M0), N0Size(N0),
				K0Size(K0) {}

				LogicalResult matchAndRewrite(MatmulOp matmulOp,
				PatternRewriter &rewriter) const override {
				auto loc = matmulOp.getLoc();

				Value lhs = matmulOp.getInputOperand(0)->get();
				Value rhs = matmulOp.getInputOperand(1)->get();
				Value dst = matmulOp.getOutputOperand(0)->get();

				RankedTensorType lhsType = lhs.getType().dyn_cast<RankedTensorType>();
				RankedTensorType rhsType = rhs.getType().dyn_cast<RankedTensorType>();

				if (!lhsType \|\| !rhsType \|\| !lhsType.hasStaticShape() \|\|
				!rhsType.hasStaticShape()) {
				return failure();
				}

				int m = lhsType.getShape()[0];
				int n = rhsType.getShape()[1];
				int k = rhsType.getShape()[0];

				if (m % M0Size != 0 \|\| n % N0Size != 0 \|\| k % K0Size != 0)
				return failure();
				BenoitUnsubmitted Not Done Reply Inline Actions Add a comment that it's temporary that we are bailing out here, and that we will eventually want to accept any sizes by doing padding? Benoit: Add a comment that it's temporary that we are bailing out here, and that we will eventually…
				asaadaldienAuthorUnsubmitted Done Reply Inline Actions We have to check divisibility here as a precondition, padding is something that is handled outside this pattern (e.g we do pad operands in IREE to an next integer multiple prior to apply this). asaadaldien: We have to check divisibility here as a precondition, padding is something that is handled…
				BenoitUnsubmitted Not Done Reply Inline Actions I see; I understand how in the current context where K0,M0,N0 are compile-time constants from the get go (ie already here where we are lowering matmul to mmt4d) , it's possible to split the padding part into a separate pattern. But how is this going to generalize to when K0,M0,N0 are '?' dynamic shape dimensions at the time when we need to convert matmul to mmt4d? Benoit: I see; I understand how in the current context where K0,M0,N0 are compile-time constants from…

				int m1 = m / M0Size;
				int n1 = n / N0Size;
				int k1 = k / K0Size;

				// Expands a 2d tensor operand to 4d given its target shape.
				auto expandTo4D = [&](Value operand,
				ArrayRef<int64_t> targetShape) -> Value {
				auto operandType = operand.getType().cast<RankedTensorType>();
				auto targetType =
				RankedTensorType::get(targetShape, operandType.getElementType());
				SmallVector<ReassociationIndices> expandIndices = {{0, 1}, {2, 3}};
				Value reshapedOperand = rewriter.create<TensorExpandShapeOp>(
				loc, targetType, operand, expandIndices);
				return reshapedOperand;
				};

				auto lhs4D = expandTo4D(lhs, {m1, M0Size, k1, K0Size});
				auto rhs4D = expandTo4D(rhs, {k1, K0Size, n1, N0Size});
				auto dst4D = expandTo4D(dst, {m1, M0Size, n1, N0Size});

				auto transposeOperand = [&](Value operand,
				BenoitUnsubmitted Not Done Reply Inline Actions I'm thinking that it might help readability of this code if this function were named 'packMatrix' : "transpose" is an implementation detail, and the concept of "transposition" is a bit overloaded in the present context since we are also dealing with transposing the RHS matrix (which gives the letter T in the name, "MMT4D"). 'Matrix' vs 'Operand' because I'm thinking, this function in itself only cares that it gets a matrix to work on; that matrix only needs to be called an 'operand' in the context of the mmt4d op. Benoit: I'm thinking that it might help readability of this code if this function were named…
				BenoitUnsubmitted Not Done Reply Inline Actions Ah sorry I had missed that the 2D->4D conversion was done above separately in expandTo4D. Do you think that it would be reasonable to call expandTo4D inside packMatrix (and unpackMatrix below) i.e. my proposed new names for transposeOperand, collapseOperand, so that the internals of packing/unpacking are abstracted behind these functions and the higher-level code here can thus look simpler? Benoit: Ah sorry I had missed that the 2D->4D conversion was done above separately in expandTo4D. Do…
				asaadaldienAuthorUnsubmitted Done Reply Inline Actions There will be another pattern that doesn't have `M0, N0, K0` as constants and doesn't have to run on statically shaped operands as it will do `<?x?xf32> -> <?x?x?x?xf32>` which `linalg.tensor_expand_shape` doesn't support atm. (In addition to compiler support to specialize M0, N0, K0 for runtime versioning in IREE...etc). asaadaldien: There will be another pattern that doesn't have `M0, N0, K0` as constants and doesn't have to…
				ArrayRef<int64_t> indices) -> Value {
				RankedTensorType operandTensorType =
				operand.getType().cast<RankedTensorType>();
				auto nloops = indices.size();
				auto inputShape = operandTensorType.getShape();

				SmallVector<AffineExpr, 4> exprs = llvm::to_vector<4>(
				llvm::map_range(indices, [&](int64_t index) -> AffineExpr {
				return rewriter.getAffineDimExpr(index);
				}));

				SmallVector<int64_t> targetShape = llvm::to_vector<4>(
				llvm::map_range(indices, [&](int64_t index) -> int64_t {
				return inputShape[index];
				}));

				Value outputTensor = rewriter.create<InitTensorOp>(
				loc, targetShape, operandTensorType.getElementType());

				SmallVector<StringRef> loopAttributeTypes(nloops, "parallel");

				SmallVector<AffineMap> indexingMaps = {
				inversePermutation(
				AffineMap::get(nloops, 0, exprs, rewriter.getContext())),
				AffineMap::getMultiDimIdentityMap(nloops, rewriter.getContext())};

				auto transposedOp = rewriter.create<GenericOp>(
				loc, outputTensor.getType(),
				/inputs=/operand, /outputs=/outputTensor, indexingMaps,
				loopAttributeTypes,
				[&](OpBuilder &nestedBuilder, Location nestedLoc, ValueRange args) {
				nestedBuilder.create<YieldOp>(nestedLoc, args[0]);
				});

				return transposedOp.getResult(0);
				};

				auto lhs4DT = transposeOperand(lhs4D, {0, 2, 1, 3});
				auto rhs4DT = transposeOperand(rhs4D, {2, 0, 3, 1});
				auto dst4DT = transposeOperand(dst4D, {0, 2, 1, 3});

				auto mmt4DResult = rewriter.create<Mmt4DOp>(
				loc, dst4DT.getType(), ValueRange{lhs4DT, rhs4DT}, ValueRange{dst4DT});

				auto mmt4dResultTransposed =
				transposeOperand(mmt4DResult.getResult(0), {0, 2, 1, 3});

				auto collapseTo2D = [&](Value operand,
				BenoitUnsubmitted Not Done Reply Inline Actions If the above function were renamed to packMatrix, then it might make sense to rename this one to unpackMatrix. Benoit: If the above function were renamed to packMatrix, then it might make sense to rename this one…
				ArrayRef<int64_t> targetShape) -> Value {
				auto operandType = operand.getType().cast<RankedTensorType>();
				auto targetType =
				RankedTensorType::get(targetShape, operandType.getElementType());
				SmallVector<ReassociationIndices> collapseIndices = {{0, 1}, {2, 3}};
				Value reshapedOperand = rewriter.create<TensorCollapseShapeOp>(
				loc, targetType, operand, collapseIndices);
				return reshapedOperand;
				};

				Value result = collapseTo2D(mmt4dResultTransposed, {m, n});

				rewriter.replaceOp(matmulOp, ArrayRef<Value>{result});

				return success();
				}

				private:
				int M0Size;
				Lint: Pre-merge checks Inline Actions clang-tidy: warning: invalid case style for member 'M0Size' [readability-identifier-naming] not useful Lint: Pre-merge checks: clang-tidy: warning: invalid case style for member 'M0Size' [readability-identifier-naming]…
				int N0Size;
				Lint: Pre-merge checks Inline Actions clang-tidy: warning: invalid case style for member 'N0Size' [readability-identifier-naming] not useful Lint: Pre-merge checks: clang-tidy: warning: invalid case style for member 'N0Size' [readability-identifier-naming]…
				int K0Size;
				Lint: Pre-merge checks Inline Actions clang-tidy: warning: invalid case style for member 'K0Size' [readability-identifier-naming] not useful Lint: Pre-merge checks: clang-tidy: warning: invalid case style for member 'K0Size' [readability-identifier-naming]…
				};
				} // namespace

				void mlir::linalg::populateMatmulToMMT4DPatterns(RewritePatternSet &patterns,
				int M0, int N0, int K0) {
				Lint: Pre-merge checks Inline Actions clang-tidy: warning: invalid case style for parameter 'M0' [readability-identifier-naming] not useful clang-tidy: warning: invalid case style for parameter 'N0' [readability-identifier-naming] not useful clang-tidy: warning: invalid case style for parameter 'K0' [readability-identifier-naming] not useful Lint: Pre-merge checks: clang-tidy: warning: invalid case style for parameter 'M0' [readability-identifier-naming]…
				auto *context = patterns.getContext();
				patterns.add<LinalgStaticMatmulOpToLinalgMMT4dOpPattern>(context, M0, N0, K0);
				}

mlir/test/Dialect/Linalg/matmul_to_mmt4d.mlir

This file was added.

				// RUN: mlir-opt -split-input-file --test-linalg-matmul-to-mmt4d %s \| FileCheck --check-prefix=CHECK %s

				func @check_mmt4d(%arg0: tensor<24x8xf32>, %arg1: tensor<8x32xf32>, %arg2: tensor<24x32xf32>) -> tensor<24x32xf32> {
				%0 = linalg.matmul ins(%arg0, %arg1 : tensor<24x8xf32>, tensor<8x32xf32>) outs(%arg2 : tensor<24x32xf32>) -> tensor<24x32xf32>
				return %0 : tensor<24x32xf32>
				}
				// CHECK-DAG:#[[MAP0:.+]] = affine_map<(d0, d1, d2, d3) -> (d0, d2, d1, d3)>
				// CHECK-DAG:#[[MAP1:.+]] = affine_map<(d0, d1, d2, d3) -> (d0, d1, d2, d3)>
				// CHECK-DAG:#[[MAP2:.+]] = affine_map<(d0, d1, d2, d3) -> (d1, d3, d0, d2)>
				// CHECK: @check_mmt4d(%[[LHS:.+]]: tensor<24x8xf32>, %[[RHS:.+]]: tensor<8x32xf32>, %[[DST:.+]]: tensor<24x32xf32>)
				// CHECK: %[[LHS4D:.+]] = linalg.tensor_expand_shape %[[LHS]]
				// CHECK-SAME: tensor<24x8xf32> into tensor<6x4x2x4xf32>
				// CHECK: %[[RHS4D:.+]] = linalg.tensor_expand_shape %[[RHS]]
				// CHECK-SAME: tensor<8x32xf32> into tensor<2x4x8x4xf32>
				// CHECK: %[[DST4D:.+]] = linalg.tensor_expand_shape %[[DST]]
				// CHECK-SAME: tensor<24x32xf32> into tensor<6x4x8x4xf32>
				// CHECK: %[[LHS4DT_INIT:.+]] = linalg.init_tensor [6, 2, 4, 4] : tensor<6x2x4x4xf32>
				// CHECK: %[[LHS4DT:.+]] = linalg.generic
				// CHECK-SAME: indexing_maps = [#[[MAP0]], #[[MAP1]]]
				// CHECK-SAME: iterator_types = ["parallel", "parallel", "parallel", "parallel"]
				// CHECK-SAME: ins(%[[LHS4D]] : tensor<6x4x2x4xf32>) outs(%[[LHS4DT_INIT]] : tensor<6x2x4x4xf32>) {
				// CHECK-NEXT: ^bb0(%{{.}}: f32, %{{.}}: f32):
				// CHECK-NEXT: linalg.yield
				// CHECK-NEXT: } -> tensor<6x2x4x4xf32>
				// CHECK: %[[RHS4DT_INIT:.+]] = linalg.init_tensor [8, 2, 4, 4] : tensor<8x2x4x4xf32>
				// CHECK: %[[RHS4DT:.+]] = linalg.generic
				// CHECK-SAME: indexing_maps = [#[[MAP2]], #[[MAP1]]],
				// CHECK-SAME: iterator_types = ["parallel", "parallel", "parallel", "parallel"]
				// CHECK-SAME: ins(%[[RHS4D]] : tensor<2x4x8x4xf32>) outs(%[[RHS4DT_INIT]] : tensor<8x2x4x4xf32>) {
				// CHECK-NEXT: ^bb0(%{{.}}: f32, %{{.}}: f32):
				// CHECK-NEXT: linalg.yield %arg3 : f32
				// CHECK-NEXT: } -> tensor<8x2x4x4xf32>
				// CHECK-NEXT: %[[DST4DT_INIT:.+]] = linalg.init_tensor [6, 8, 4, 4] : tensor<6x8x4x4xf32>
				// CHECK: %[[DST4DT:.+]] = linalg.generic
				// CHECK-SAME: indexing_maps = [#[[MAP0]], #[[MAP1]]]
				// CHECK-SAME: iterator_types = ["parallel", "parallel", "parallel", "parallel"]}
				// CHECK-SAME: ins(%[[DST4D]] : tensor<6x4x8x4xf32>) outs(%[[DST4DT_INIT]] : tensor<6x8x4x4xf32>) {
				// CHECK-NEXT: ^bb0(%{{.}}: f32, %{{.}}: f32):
				// CHECK-NEXT: linalg.yield %arg3 : f32
				// CHECK-NEXT: } -> tensor<6x8x4x4xf32>
				// CHECK: %[[MMT4D:.+]] = linalg.mmt4d ins(%[[LHS4DT]], %[[RHS4DT]] : tensor<6x2x4x4xf32>, tensor<8x2x4x4xf32>) outs(%[[DST4DT]] : tensor<6x8x4x4xf32>) -> tensor<6x8x4x4xf32>
				// CHECK: %[[MMT4DT_INIT:.+]] = linalg.init_tensor [6, 4, 8, 4] : tensor<6x4x8x4xf32>
				// CHECK: %[[MMT4DT:.+]] = linalg.generic
				// CHECK-SAME: indexing_maps = [#[[MAP0]], #[[MAP1]]]
				// CHECK-SAME: iterator_types = ["parallel", "parallel", "parallel", "parallel"]}
				// CHECK-SAME: ins(%[[MMT4D]] : tensor<6x8x4x4xf32>) outs(%[[MMT4DT_INIT]] : tensor<6x4x8x4xf32>) {
				// CHECK-NEXT: ^bb0(%{{.}}: f32, %{{.}}: f32):
				// CHECK-NEXT: linalg.yield %arg3 : f32
				// CHECK-NEXT: } -> tensor<6x4x8x4xf32>
				// CHECK: %[[RESULT:.+]] = linalg.tensor_collapse_shape %[[MMT4DT]]
				// CHECK-SAME: tensor<6x4x8x4xf32> into tensor<24x32xf32>
				// CHECK: return %[[RESULT]] : tensor<24x32xf32>

mlir/test/lib/Dialect/Linalg/CMakeLists.txt

	# Exclude tests from libMLIR.so			# Exclude tests from libMLIR.so
	add_mlir_library(MLIRLinalgTestPasses			add_mlir_library(MLIRLinalgTestPasses
	TestConvVectorization.cpp			TestConvVectorization.cpp
	TestLinalgCodegenStrategy.cpp			TestLinalgCodegenStrategy.cpp
	TestLinalgDistribution.cpp			TestLinalgDistribution.cpp
	TestLinalgElementwiseFusion.cpp			TestLinalgElementwiseFusion.cpp
	TestLinalgFusionTransforms.cpp			TestLinalgFusionTransforms.cpp
	TestLinalgHoisting.cpp			TestLinalgHoisting.cpp
				TestLinalgMatmulToMMT4d.cpp
	TestLinalgTransforms.cpp			TestLinalgTransforms.cpp

	EXCLUDE_FROM_LIBMLIR			EXCLUDE_FROM_LIBMLIR

	LINK_LIBS PUBLIC			LINK_LIBS PUBLIC
	MLIRAffine			MLIRAffine
	MLIRGPUTransforms			MLIRGPUTransforms
	MLIRLinalg			MLIRLinalg
	MLIRLinalgTransforms			MLIRLinalgTransforms
	MLIRLLVMToLLVMIRTranslation			MLIRLLVMToLLVMIRTranslation
	MLIRPass			MLIRPass
	MLIRStandard			MLIRStandard
	MLIRTransformUtils			MLIRTransformUtils
	MLIRVector			MLIRVector
	MLIRVectorToSCF			MLIRVectorToSCF
	)			)

mlir/test/lib/Dialect/Linalg/TestLinalgMatmulToMMT4d.cpp

This file was added.

				//===- TestLinalgHoisting.cpp - Test Linalg hoisting functions ------------===//
				//
				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
				//
				//===----------------------------------------------------------------------===//
				//
				// This file implements logic for testing linalg.matmul to linalg.mmt4d
				// conversion.
				//
				//===----------------------------------------------------------------------===//

				#include "mlir/Dialect/Linalg/Transforms/Transforms.h"
				#include "mlir/Pass/Pass.h"
				#include "mlir/Transforms/GreedyPatternRewriteDriver.h"

				using namespace mlir;
				using namespace mlir::linalg;

				namespace {
				struct TestLinalgMatmulToMMT4D
				: PassWrapper<TestLinalgMatmulToMMT4D, FunctionPass> {
				TestLinalgMatmulToMMT4D() = default;
				TestLinalgMatmulToMMT4D(const TestLinalgMatmulToMMT4D &pass) {}

				StringRef getArgument() const final { return "test-linalg-matmul-to-mmt4d"; }
				StringRef getDescription() const final {
				return "Test Linalg matmul -> mmt4d functions.";
				}
				void runOnFunction() override;

				Option<int> testWithInnerDimM0{
				*this, "test-with-inner-dim-m0",
				llvm::cl::desc("Test hoisting transfer_read/transfer_write pairs"),
				llvm::cl::init(4)};

				Option<int> testWithInnerDimN0{
				*this, "test-with-inner-dim-n0",
				llvm::cl::desc("Test hoisting transfer_read/transfer_write pairs"),
				llvm::cl::init(4)};

				Option<int> testWithInnerDimK0{
				*this, "test-with-inner-dim-k0",
				llvm::cl::desc("Test hoisting transfer_read/transfer_write pairs"),
				llvm::cl::init(4)};
				};

				void TestLinalgMatmulToMMT4D::runOnFunction() {
				MLIRContext *context = &this->getContext();
				FuncOp funcOp = this->getFunction();
				RewritePatternSet patterns(context);
				populateMatmulToMMT4DPatterns(patterns, testWithInnerDimM0,
				testWithInnerDimN0, testWithInnerDimK0);
				(void)applyPatternsAndFoldGreedily(funcOp, std::move(patterns));
				}

				} // namespace

				namespace mlir {
				namespace test {
				void registerTestLinalgMatmulToMMT4D() {
				PassRegistration<TestLinalgMatmulToMMT4D>();
				}
				} // namespace test
				} // namespace mlir

mlir/tools/mlir-opt/mlir-opt.cpp

Show First 20 Lines • Show All 80 Lines • ▼ Show 20 Lines
void registerTestLinalgDistribution();		void registerTestLinalgDistribution();
void registerTestLinalgElementwiseFusion();		void registerTestLinalgElementwiseFusion();
void registerTestPushExpandingReshape();		void registerTestPushExpandingReshape();
void registerTestLinalgFusionTransforms();		void registerTestLinalgFusionTransforms();
void registerTestLinalgTensorFusionTransforms();		void registerTestLinalgTensorFusionTransforms();
void registerTestLinalgTiledLoopFusionTransforms();		void registerTestLinalgTiledLoopFusionTransforms();
void registerTestLinalgGreedyFusion();		void registerTestLinalgGreedyFusion();
void registerTestLinalgHoisting();		void registerTestLinalgHoisting();
		void registerTestLinalgMatmulToMMT4D();
void registerTestLinalgTileAndFuseSequencePass();		void registerTestLinalgTileAndFuseSequencePass();
void registerTestLinalgTransforms();		void registerTestLinalgTransforms();
void registerTestLivenessPass();		void registerTestLivenessPass();
void registerTestLoopFusion();		void registerTestLoopFusion();
void registerTestLoopMappingPass();		void registerTestLoopMappingPass();
void registerTestLoopUnrollingPass();		void registerTestLoopUnrollingPass();
void registerTestMathPolynomialApproximationPass();		void registerTestMathPolynomialApproximationPass();
void registerTestMemRefDependenceCheck();		void registerTestMemRefDependenceCheck();
▲ Show 20 Lines • Show All 65 Lines • ▼ Show 20 Lines	#endif
test::registerTestLinalgDistribution();		test::registerTestLinalgDistribution();
test::registerTestLinalgElementwiseFusion();		test::registerTestLinalgElementwiseFusion();
test::registerTestPushExpandingReshape();		test::registerTestPushExpandingReshape();
test::registerTestLinalgFusionTransforms();		test::registerTestLinalgFusionTransforms();
test::registerTestLinalgTensorFusionTransforms();		test::registerTestLinalgTensorFusionTransforms();
test::registerTestLinalgTiledLoopFusionTransforms();		test::registerTestLinalgTiledLoopFusionTransforms();
test::registerTestLinalgGreedyFusion();		test::registerTestLinalgGreedyFusion();
test::registerTestLinalgHoisting();		test::registerTestLinalgHoisting();
		test::registerTestLinalgMatmulToMMT4D();
test::registerTestLinalgTileAndFuseSequencePass();		test::registerTestLinalgTileAndFuseSequencePass();
test::registerTestLinalgTransforms();		test::registerTestLinalgTransforms();
test::registerTestLivenessPass();		test::registerTestLivenessPass();
test::registerTestLoopFusion();		test::registerTestLoopFusion();
test::registerTestLoopMappingPass();		test::registerTestLoopMappingPass();
test::registerTestLoopUnrollingPass();		test::registerTestLoopUnrollingPass();
test::registerTestMathPolynomialApproximationPass();		test::registerTestMathPolynomialApproximationPass();
test::registerTestMemRefDependenceCheck();		test::registerTestMemRefDependenceCheck();
Show All 25 Lines

This is an archive of the discontinued LLVM Phabricator instance.

Add a pattern to convert static linalg.matmul -> linalg.mmt4dAbandonedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 358699

mlir/include/mlir/Dialect/Linalg/Transforms/Transforms.h

mlir/lib/Dialect/Linalg/Transforms/CMakeLists.txt

mlir/lib/Dialect/Linalg/Transforms/MatmulToMMT4d.cpp

mlir/test/Dialect/Linalg/matmul_to_mmt4d.mlir

mlir/test/lib/Dialect/Linalg/CMakeLists.txt

mlir/test/lib/Dialect/Linalg/TestLinalgMatmulToMMT4d.cpp

mlir/tools/mlir-opt/mlir-opt.cpp

Add a pattern to convert static linalg.matmul -> linalg.mmt4d
AbandonedPublic