This is an archive of the discontinued LLVM Phabricator instance.

Differential D116418

[mlir][Linalg] Pattern to fuse pad operation with elementwise operations.
ClosedPublic

Authored by mravishankar on Dec 30 2021, 10:07 AM.

Details

Reviewers

nicolasvasilache
antiagainst
stellaraccident

Commits

rGe7cb716ef955: [mlir][Linalg] Pattern to fuse pad operation with elementwise operations.

Summary

Most convolution operations need explicit padding of the input to
ensure all accesses are in bounds. In such cases, a standalone pad
operation can add significant overhead. One way to reduce that
overhead is to fuse the pad operation with the producer of its
source.

A sequence

linalg.generic -> linalg.pad_tensor

can be replaced with

linalg.fill -> tensor.extract_slice -> linalg.generic ->
tensor.insert_slice.

if the linalg.generic has all parallel iterator types.
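
As a rough illustration of why the two sequences agree for an all-parallel producer, the following standalone C++ sketch models only the value semantics on 2-D arrays. It does not use the MLIR API. `Tensor2D`, `payload`, `genericThenPad`, and `fillThenGeneric` are hypothetical names introduced for this sketch, and the input is assumed to be rectangular.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

using Tensor2D = std::vector<std::vector<float>>;

// Hypothetical elementwise payload standing in for the body of an
// all-parallel linalg.generic (the tests in this revision use x * x).
inline float payload(float x) { return x * x; }

// Original form: run the elementwise op, then pad its result.
Tensor2D genericThenPad(const Tensor2D &in, std::size_t lowRows,
                        std::size_t lowCols, std::size_t highRows,
                        std::size_t highCols, float padValue) {
  const std::size_t rows = in.size();
  const std::size_t cols = rows ? in[0].size() : 0;
  // Models linalg.generic: materialize the unpadded intermediate result.
  Tensor2D tmp(rows, std::vector<float>(cols));
  for (std::size_t i = 0; i < rows; ++i)
    for (std::size_t j = 0; j < cols; ++j)
      tmp[i][j] = payload(in[i][j]);
  // Models linalg.pad_tensor: build the padded result, copy the interior.
  Tensor2D out(lowRows + rows + highRows,
               std::vector<float>(lowCols + cols + highCols, padValue));
  for (std::size_t i = 0; i < rows; ++i)
    for (std::size_t j = 0; j < cols; ++j)
      out[lowRows + i][lowCols + j] = tmp[i][j];
  return out;
}

// Rewritten form: fill the padded tensor first, then let the elementwise op
// write straight into the interior slice (offsets = low padding, sizes =
// source sizes).
Tensor2D fillThenGeneric(const Tensor2D &in, std::size_t lowRows,
                         std::size_t lowCols, std::size_t highRows,
                         std::size_t highCols, float padValue) {
  const std::size_t rows = in.size();
  const std::size_t cols = rows ? in[0].size() : 0;
  // Models linalg.fill on a tensor of the padded shape.
  Tensor2D out(lowRows + rows + highRows,
               std::vector<float>(lowCols + cols + highCols, padValue));
  // Models extract_slice -> generic -> insert_slice: every interior element
  // is overwritten exactly once, so no intermediate tensor is needed.
  for (std::size_t i = 0; i < rows; ++i)
    for (std::size_t j = 0; j < cols; ++j)
      out[lowRows + i][lowCols + j] = payload(in[i][j]);
  return out;
}
```

Under these assumptions both functions return the same padded tensor; the second one simply avoids the separate unpadded intermediate.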

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

mravishankar created this revision. Dec 30 2021, 10:07 AM

Herald added subscribers: sdasgup3, wenzhicui, wrengr and 21 others. Dec 30 2021, 10:07 AM

mravishankar requested review of this revision. Dec 30 2021, 10:07 AM

Herald added a reviewer: nicolasvasilache. Dec 30 2021, 10:07 AM

Herald added a project: Restricted Project.

Herald added subscribers: limo1996, stephenneuendorffer, nicolasvasilache.

mravishankar added reviewers: antiagainst, stellaraccident. Dec 30 2021, 10:08 AM

Harbormaster completed remote builds in B141035: Diff 396691. Dec 30 2021, 10:23 AM

Nice, thanks Mahesh! Generally LGTM. Only blocking issue is the new pass, which seems unnecessary to me.

mlir/include/mlir/Dialect/Linalg/Transforms/Transforms.h
93	..WithProducerGeneric.. to be clear?
mlir/lib/Dialect/Linalg/Transforms/PadOpInterchange.cpp
97	Nit: just use `else` here.
112	Nit: replaceOpWithNewOp?
mlir/test/Dialect/Linalg/pad_fusion.mlir
94	Add a negative case for non-parallel generic ops?
mlir/test/lib/Dialect/Linalg/TestPadFusion.cpp
2	Do we really need a new pass for this? Can we add another case in https://github.com/llvm/llvm-project/blob/main/mlir/test/lib/Dialect/Linalg/TestLinalgTransforms.cpp? That seems to be the "uber" pass for testing linalg transforms.
10	with its producer ..

This revision now requires changes to proceed. Jan 10 2022, 11:08 AM

nicolasvasilache added inline comments. Jan 11 2022, 4:32 AM

mlir/lib/Dialect/Linalg/Transforms/PadOpInterchange.cpp
10	nit: patterns
49	nit: "not constant padding" and save 2 lines
55	this seems unnecessary, I'd just drop the constraint
82	add a TODO that you could only want to fill the boundary region?
87	This seems generally useful as a helper on the op directly: insert/extractInto/FromPaddedSubRegion
mlir/test/lib/Dialect/Linalg/TestPadFusion.cpp
2	We want to graduate linalg.pad_tensor to tensor.pad, having a separate pass for this will reduce future churn.

nicolasvasilache accepted this revision. Jan 11 2022, 4:32 AM

antiagainst accepted this revision. Jan 11 2022, 5:08 AM

antiagainst added inline comments.

mlir/test/lib/Dialect/Linalg/TestPadFusion.cpp
2	Okay that makes sense then.

This revision is now accepted and ready to land. Jan 11 2022, 5:08 AM

Rebase and address comments.

Fix typo.

mlir/lib/Dialect/Linalg/Transforms/PadOpInterchange.cpp
55	Fair enough. I guess then I do need to rename the method.
87	Hmm, not sure how this would work. The pad values are being used to extract the "interior" of a value that is not the source of the pad tensor. So the method would be "use the padding information from the op and extract an equivalent subregion from a different value (which is not the source)". I don't see where that would be useful. In any case, I'll just add a TODO for this here?
mlir/test/Dialect/Linalg/pad_fusion.mlir
94	I don't think that is really needed for now.

This revision was landed with ongoing or failed builds. Jan 11 2022, 1:37 PM

Closed by commit rGe7cb716ef955: [mlir][Linalg] Pattern to fuse pad operation with elementwise operations. (authored by mravishankar).

This revision was automatically updated to reflect the committed changes.

mravishankar added a commit: rGe7cb716ef955: [mlir][Linalg] Pattern to fuse pad operation with elementwise operations.

Harbormaster completed remote builds in B142736: Diff 399063. Jan 11 2022, 2:13 PM

Revision Contents

Path                                                        Size
mlir/include/mlir/Dialect/Linalg/Transforms/Transforms.h    6 lines
mlir/lib/Dialect/Linalg/Transforms/CMakeLists.txt           1 line
mlir/lib/Dialect/Linalg/Transforms/PadOpInterchange.cpp     122 lines
mlir/test/Dialect/Linalg/pad_fusion.mlir                    93 lines
mlir/test/lib/Dialect/Linalg/CMakeLists.txt                 1 line
mlir/test/lib/Dialect/Linalg/TestPadFusion.cpp              48 lines
mlir/tools/mlir-opt/mlir-opt.cpp                            2 lines

Diff 399063

mlir/include/mlir/Dialect/Linalg/Transforms/Transforms.h

[81 lines not shown]
 /// Patterns to fold a collapsing (expanding) tensor_reshape operation with its
 /// producer (consumer) generic operation by linearizing the indexing map used
 /// to access the source (target) of the reshape operation in the generic
 /// operation. The patterns are applied only when the tensor reshape involved is
 /// collapsing (introducing) unit-extent dimensions.
 void populateFoldUnitDimsReshapeOpsByLinearizationPatterns(
     RewritePatternSet &patterns);

+/// Pattern to fuse a `linalg.pad_tensor` operation with the producer of its
+/// source, if the producer is a `linalg` operation with all parallel iterator
+/// types.
+void populateFusePadTensorWithProducerLinalgOpPatterns(
+    RewritePatternSet &patterns);
+
 /// Patterns to convert from one named op to another. These can be seen as
 /// canonicalizations of named ops into another named op.
 void populateLinalgNamedOpConversionPatterns(RewritePatternSet &patterns);

 /// Populate the given list with patterns to bufferize linalg ops.
 void populateLinalgBufferizePatterns(
     bufferization::BufferizeTypeConverter &converter,
     RewritePatternSet &patterns);
[1,290 lines not shown]

mlir/lib/Dialect/Linalg/Transforms/CMakeLists.txt

 add_mlir_dialect_library(MLIRLinalgTransforms
[11 lines not shown]
   Generalization.cpp
   Hoisting.cpp
   HoistPadding.cpp
   InlineScalarOperands.cpp
   Interchange.cpp
   Loops.cpp
   LinalgStrategyPasses.cpp
   NamedOpConversions.cpp
+  PadOpInterchange.cpp
   Promotion.cpp
   Tiling.cpp
   Transforms.cpp
   Vectorization.cpp

   ADDITIONAL_HEADER_DIRS
   ${MLIR_MAIN_INCLUDE_DIR}/mlir/Dialect/Linalg

[36 lines not shown]

mlir/lib/Dialect/Linalg/Transforms/PadOpInterchange.cpp

This file was added.

//===- PadOpInterchange.cpp - Interchange pad operation with Generic ops --===//
//
// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
// See https://llvm.org/LICENSE.txt for license information.
// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
//
//===----------------------------------------------------------------------===//
//
// This file implements patterns that interchange a generic op -> pad_tensor
// pattern into extract_slice -> generic_op.
//
//===----------------------------------------------------------------------===//

#include "mlir/Dialect/Linalg/Transforms/Transforms.h"

#include "mlir/Dialect/Linalg/IR/Linalg.h"
#include "mlir/Transforms/GreedyPatternRewriteDriver.h"

using namespace mlir;
using namespace mlir::linalg;

namespace {

/// A sequence of operations
///
/// ```mlir
/// %0 = linalg. ...
/// %1 = linalg.pad_tensor %0 ...
/// ```
///
/// can be replaced with
///
/// ```mlir
/// %0 = linalg.fill
/// %1 = tensor.extract_slice %0 ...
/// %2 = linalg. .... outs(..., %1, ....) ....
/// %3 = tensor.insert_slice %2 into %0 ...
/// ```
///
/// if the `linalg.generic` has all parallel iterator types.
struct FusePadTensorOp : OpRewritePattern<PadTensorOp> {
  using OpRewritePattern<PadTensorOp>::OpRewritePattern;
  LogicalResult matchAndRewrite(PadTensorOp padOp,
                                PatternRewriter &rewriter) const override {
    // Only works on padding op that sets the padded value to a constant.
    Value padValue = padOp.getConstantPaddingValue();
    if (!padValue)
      return rewriter.notifyMatchFailure(padOp, "non constant padding");

    // This pattern could work for any Linalg op. For now restrict it to generic
    // ops.
    Value source = padOp.source();
    auto linalgOp = source.getDefiningOp<GenericOp>();
    if (!linalgOp) {
      return rewriter.notifyMatchFailure(
          padOp, "expected source to be linalg.generic op");
    }
    // All iterator types need to be parallel.
    if (linalgOp.getNumLoops() != linalgOp.getNumParallelLoops()) {
      return rewriter.notifyMatchFailure(
          padOp, "only supported for ops with all parallel iterator types");
    }
    ReifiedRankedShapedTypeDims resultShape;
    if (failed(padOp.reifyResultShapes(rewriter, resultShape)) ||
        resultShape.size() != 1) {
      return rewriter.notifyMatchFailure(
          padOp, "failed to get shape of pad op result");
    }

    Location loc = padOp.getLoc();

    // Create the tensor of same size as output of the pad op.
    RankedTensorType padResultType = padOp.getResultType();
    auto resultSizes = getAsOpFoldResult(resultShape[0]);
    auto initTensor = rewriter.create<InitTensorOp>(
        loc, resultSizes, padResultType.getElementType());

    // Fill the tensor with the pad value.
    // TODO: There is an option to fill only the boundaries. For now just
    // filling the whole tensor.
    auto fillTensor =
        rewriter.create<FillOp>(loc, padValue, initTensor.getResult());

    // Construct a slice of the fill result that is to be replaced with the
    // result of the generic op. The low pad values are the offsets, the size of
    // the source is the size of the slice.
    // TODO: This insert/extract could be potentially made a utility method.
    unsigned resultNumber = source.cast<OpResult>().getResultNumber();
    SmallVector<OpFoldResult> offsets = padOp.getMixedLowPad();
    SmallVector<OpFoldResult> sizes;
    sizes.reserve(offsets.size());
    for (auto shape : llvm::enumerate(
             source.getType().cast<RankedTensorType>().getShape())) {
      if (ShapedType::isDynamic(shape.value())) {
        sizes.push_back(
            rewriter.create<tensor::DimOp>(loc, source, shape.index())
                .getResult());
      } else {
        sizes.push_back(rewriter.getIndexAttr(shape.value()));
      }
    }
    SmallVector<OpFoldResult> strides(offsets.size(), rewriter.getIndexAttr(1));
    auto slice = rewriter.create<tensor::ExtractSliceOp>(
        loc, fillTensor.getResult(0), offsets, sizes, strides);

    // Clone the generic op.
    auto clonedOp = cast<GenericOp>(rewriter.clone(*linalgOp.getOperation()));
    clonedOp.setOutputOperand(resultNumber, slice.getResult());

    // Insert it back into the result of the fill.
    rewriter.replaceOpWithNewOp<tensor::InsertSliceOp>(
        padOp, clonedOp.getResult(resultNumber), fillTensor.getResult(0),
        offsets, sizes, strides);
    return success();
  }
};
} // namespace

void mlir::linalg::populateFusePadTensorWithProducerLinalgOpPatterns(
    RewritePatternSet &patterns) {
  patterns.add<FusePadTensorOp>(patterns.getContext());
}
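
The pattern bails out unless every iterator type is parallel. One way to see why, sketched here on plain C++ vectors rather than the MLIR API: a reduction reads its output operand as the initial accumulator value, so swapping that operand for a slice of the filled tensor would seed the accumulation with the pad value. `rowSumThenPad` and `rowSumIntoFilledSlice` are hypothetical names for this illustration, and the row-sum reduction is an assumed example that does not appear in the revision.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

using Row = std::vector<std::int64_t>;
using Matrix = std::vector<Row>;

// Reference behaviour: reduce each row into a zero-initialized accumulator,
// then pad the 1-D result with `padValue`.
Row rowSumThenPad(const Matrix &in, std::size_t low, std::size_t high,
                  std::int64_t padValue) {
  Row sums(in.size(), 0); // output operand of the reduction, initialized to 0
  for (std::size_t i = 0; i < in.size(); ++i)
    for (std::int64_t x : in[i])
      sums[i] += x;
  Row out(low + in.size() + high, padValue);
  for (std::size_t i = 0; i < in.size(); ++i)
    out[low + i] = sums[i];
  return out;
}

// What the rewrite would compute if applied to the reduction without the
// all-parallel check: the accumulator is now the interior of the filled
// tensor, so each sum starts from `padValue` instead of 0.
Row rowSumIntoFilledSlice(const Matrix &in, std::size_t low, std::size_t high,
                          std::int64_t padValue) {
  Row out(low + in.size() + high, padValue); // models linalg.fill
  for (std::size_t i = 0; i < in.size(); ++i)
    for (std::int64_t x : in[i])
      out[low + i] += x; // accumulates on top of the pad value
  return out;
}
```

In this model the two functions agree only when the pad value happens to be the reduction's identity (0 for a sum), which is consistent with restricting the rewrite to all-parallel producers.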

mlir/test/Dialect/Linalg/pad_fusion.mlir

This file was added.

// RUN: mlir-opt -test-linalg-pad-fusion -split-input-file %s | FileCheck %s

func @dynamic_pad_fusion(%arg0 : tensor<?x?xf32>, %arg1 : index, %arg2 : index,
    %arg3 : index, %arg4 : index, %arg5 : f32) -> tensor<?x?xf32> {
  %c0 = arith.constant 0 : index
  %c1 = arith.constant 1 : index
  %d0 = tensor.dim %arg0, %c0 : tensor<?x?xf32>
  %d1 = tensor.dim %arg0, %c1 : tensor<?x?xf32>
  %init = linalg.init_tensor [%d0, %d1] : tensor<?x?xf32>
  %0 = linalg.generic {
      indexing_maps = [affine_map<(d0, d1) -> (d0, d1)>, affine_map<(d0, d1) -> (d0, d1)>],
      iterator_types = ["parallel", "parallel"]}
      ins(%arg0 : tensor<?x?xf32>) outs(%init : tensor<?x?xf32>) {
    ^bb0(%arg6 : f32, %arg7 : f32):
      %1 = arith.mulf %arg6, %arg6 : f32
      linalg.yield %1 : f32
    } -> tensor<?x?xf32>
  %1 = linalg.pad_tensor %0 low [%arg1, %arg2] high [%arg3, %arg4] {
    ^bb0(%arg6: index, %arg7 : index):
      linalg.yield %arg5 : f32
  } : tensor<?x?xf32> to tensor<?x?xf32>
  return %1 : tensor<?x?xf32>
}

// CHECK-DAG: #[[MAP:.+]] = affine_map<()[s0, s1, s2] -> (s2 + s0 + s1)>
// CHECK: func @dynamic_pad_fusion
// CHECK-SAME: %[[ARG0:.+]]: tensor<?x?xf32>
// CHECK-SAME: %[[ARG1:[a-zA-Z0-9]+]]: index
// CHECK-SAME: %[[ARG2:[a-zA-Z0-9]+]]: index
// CHECK-SAME: %[[ARG3:[a-zA-Z0-9]+]]: index
// CHECK-SAME: %[[ARG4:[a-zA-Z0-9]+]]: index
// CHECK-SAME: %[[ARG5:[a-zA-Z0-9]+]]: f32
// CHECK-DAG: %[[C0:.+]] = arith.constant 0 : index
// CHECK-DAG: %[[C1:.+]] = arith.constant 1 : index
// CHECK-DAG: %[[SOURCE:.+]] = linalg.generic
// CHECK-DAG: %[[SOURCE_D0:.+]] = tensor.dim %[[SOURCE]], %[[C0]]
// CHECK-DAG: %[[TARGET_D0:.+]] = affine.apply #[[MAP]]()[%[[ARG1]], %[[ARG3]], %[[SOURCE_D0]]]
// CHECK-DAG: %[[SOURCE_D1:.+]] = tensor.dim %[[SOURCE]], %[[C1]]
// CHECK-DAG: %[[TARGET_D1:.+]] = affine.apply #[[MAP]]()[%[[ARG2]], %[[ARG4]], %[[SOURCE_D1]]]
// CHECK: %[[INIT:.+]] = linalg.init_tensor [%[[TARGET_D0]], %[[TARGET_D1]]]
// CHECK: %[[FILL:.+]] = linalg.fill(%[[ARG5]], %[[INIT]])
// CHECK-DAG: %[[SIZE_D0:.+]] = tensor.dim %[[SOURCE]], %[[C0]]
// CHECK-DAG: %[[SIZE_D1:.+]] = tensor.dim %[[SOURCE]], %[[C1]]
// CHECK: %[[SLICE:.+]] = tensor.extract_slice %[[FILL]]
// CHECK-SAME: [%[[ARG1]], %[[ARG2]]] [%[[SIZE_D0]], %[[SIZE_D1]]] [1, 1]
// CHECK: %[[SOURCE:.+]] = linalg.generic
// CHECK-SAME: outs(%[[SLICE]] : tensor<?x?xf32>)
// CHECK: %[[RESULT:.+]] = tensor.insert_slice %[[SOURCE]] into %[[FILL]]
// CHECK-SAME: [%[[ARG1]], %[[ARG2]]] [%[[SIZE_D0]], %[[SIZE_D1]]] [1, 1]
// CHECK: return %[[RESULT]]

// -----

func @mixed_pad_fusion(%arg0 : tensor<?x42xf32>, %arg1 : index, %arg2 : index,
    %arg3 : f32) -> tensor<49x?xf32> {
  %c0 = arith.constant 0 : index
  %d0 = tensor.dim %arg0, %c0 : tensor<?x42xf32>
  %init = linalg.init_tensor [42, %d0] : tensor<42x?xf32>
  %0 = linalg.generic {
      indexing_maps = [affine_map<(d0, d1) -> (d0, d1)>, affine_map<(d0, d1) -> (d1, d0)>],
      iterator_types = ["parallel", "parallel"]}
      ins(%arg0 : tensor<?x42xf32>) outs(%init : tensor<42x?xf32>) {
    ^bb0(%arg4 : f32, %arg5 : f32):
      %1 = arith.mulf %arg4, %arg4 : f32
      linalg.yield %1 : f32
    } -> tensor<42x?xf32>
  %1 = linalg.pad_tensor %0 low [3, %arg1] high [4, %arg2] {
    ^bb0(%arg4: index, %arg5 : index):
      linalg.yield %arg3 : f32
  } : tensor<42x?xf32> to tensor<49x?xf32>
  return %1 : tensor<49x?xf32>
}
// CHECK-DAG: #[[MAP:.+]] = affine_map<()[s0, s1, s2] -> (s2 + s0 + s1)>
// CHECK: func @mixed_pad_fusion
// CHECK-SAME: %[[ARG0:.+]]: tensor<?x42xf32>
// CHECK-SAME: %[[ARG1:[a-zA-Z0-9]+]]: index
// CHECK-SAME: %[[ARG2:[a-zA-Z0-9]+]]: index
// CHECK-SAME: %[[ARG3:[a-zA-Z0-9]+]]: f32
// CHECK-DAG: %[[C0:.+]] = arith.constant 0 : index
// CHECK-DAG: %[[C1:.+]] = arith.constant 1 : index
// CHECK-DAG: %[[SOURCE:.+]] = linalg.generic
// CHECK-DAG: %[[SOURCE_D1:.+]] = tensor.dim %[[SOURCE]], %[[C1]]
// CHECK-DAG: %[[TARGET_D1:.+]] = affine.apply #[[MAP]]()[%[[ARG1]], %[[ARG2]], %[[SOURCE_D1]]]
// CHECK: %[[INIT:.+]] = linalg.init_tensor [49, %[[TARGET_D1]]]
// CHECK: %[[FILL:.+]] = linalg.fill(%[[ARG3]], %[[INIT]])
// CHECK-DAG: %[[SIZE_D1:.+]] = tensor.dim %[[SOURCE]], %[[C1]]
// CHECK: %[[SLICE:.+]] = tensor.extract_slice %[[FILL]]
// CHECK-SAME: [3, %[[ARG1]]] [42, %[[SIZE_D1]]] [1, 1]
// CHECK: %[[SOURCE:.+]] = linalg.generic
// CHECK-SAME: outs(%[[SLICE]] : tensor<42x?xf32>)
// CHECK: %[[RESULT:.+]] = tensor.insert_slice %[[SOURCE]] into %[[FILL]]
// CHECK-SAME: [3, %[[ARG1]]] [42, %[[SIZE_D1]]] [1, 1]
// CHECK: return %[[RESULT]]
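
The CHECK lines above encode simple per-dimension arithmetic: the padded extent is `s2 + s0 + s1` (source size plus low and high padding), and the slice uses the low padding as offsets, the source sizes as sizes, and unit strides. The sketch below restates that arithmetic in standalone C++ for static shapes only. `SliceParams`, `paddedShape`, and `interiorSlice` are hypothetical helpers for illustration, not functions from this revision or from MLIR.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

using Shape = std::vector<std::int64_t>;

// Offsets, sizes, and strides in the order tensor.extract_slice and
// tensor.insert_slice print them.
struct SliceParams {
  Shape offsets;
  Shape sizes;
  Shape strides;
};

// Padded extent per dimension: source + low + high, mirroring the affine map
// ()[s0, s1, s2] -> (s2 + s0 + s1) checked in the test.
Shape paddedShape(const Shape &low, const Shape &high, const Shape &source) {
  assert(low.size() == source.size() && high.size() == source.size());
  Shape result(source.size());
  for (std::size_t d = 0; d < source.size(); ++d)
    result[d] = source[d] + low[d] + high[d];
  return result;
}

// Interior region of the padded tensor that the producer writes into:
// offsets are the low padding, sizes are the source sizes, strides are 1.
SliceParams interiorSlice(const Shape &low, const Shape &source) {
  assert(low.size() == source.size());
  return SliceParams{low, source, Shape(source.size(), 1)};
}
```

For the static dimension of `@mixed_pad_fusion`, low 3, high 4, and source size 42 give 3 + 42 + 4 = 49, matching the `tensor<49x?xf32>` result type and the `[3, ...] [42, ...] [1, 1]` slice in the CHECK lines.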

mlir/test/lib/Dialect/Linalg/CMakeLists.txt

 # Exclude tests from libMLIR.so
 add_mlir_library(MLIRLinalgTestPasses
   TestComprehensiveBufferize.cpp
   TestConvVectorization.cpp
   TestLinalgCodegenStrategy.cpp
   TestLinalgDistribution.cpp
   TestLinalgElementwiseFusion.cpp
   TestLinalgFusionTransforms.cpp
   TestLinalgHoisting.cpp
   TestLinalgTransforms.cpp
+  TestPadFusion.cpp

   EXCLUDE_FROM_LIBMLIR

   LINK_LIBS PUBLIC
   MLIRAffine
   MLIRAffineBufferizableOpInterfaceImpl
   MLIRArithBufferizableOpInterfaceImpl
   MLIRArithmetic
[19 lines not shown]

mlir/test/lib/Dialect/Linalg/TestPadFusion.cpp

This file was added.

//===- TestPadFusion.cpp - Test fusion of pad op with Linalg ops ---------===//
//
// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
// See https://llvm.org/LICENSE.txt for license information.
// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
//
//===----------------------------------------------------------------------===//
//
// This file implements a pass for testing fusion of a pad op with its producer
// Linalg op.
//
//===----------------------------------------------------------------------===//

#include "mlir/Dialect/Linalg/Transforms/Transforms.h"
#include "mlir/Pass/Pass.h"
#include "mlir/Pass/PassManager.h"
#include "mlir/Transforms/GreedyPatternRewriteDriver.h"

namespace mlir {

namespace {
struct TestPadFusionPass : public PassWrapper<TestPadFusionPass, FunctionPass> {

  void getDependentDialects(DialectRegistry &registry) const override {
    registry
        .insert<AffineDialect, linalg::LinalgDialect, tensor::TensorDialect>();
  }

  StringRef getArgument() const final { return "test-linalg-pad-fusion"; }
  StringRef getDescription() const final { return "Test PadOp fusion"; }

  void runOnFunction() override {
    MLIRContext *context = &getContext();
    FuncOp funcOp = getFunction();
    RewritePatternSet patterns(context);
    linalg::populateFusePadTensorWithProducerLinalgOpPatterns(patterns);
    if (failed(applyPatternsAndFoldGreedily(funcOp.getBody(),
                                            std::move(patterns))))
      return signalPassFailure();
  }
};
} // namespace

namespace test {
void registerTestPadFusion() { PassRegistration<TestPadFusionPass>(); }
} // namespace test

} // namespace mlir

mlir/tools/mlir-opt/mlir-opt.cpp

[97 lines not shown]
 void registerTestMatchReductionPass();
 void registerTestMathAlgebraicSimplificationPass();
 void registerTestMathPolynomialApproximationPass();
 void registerTestMemRefDependenceCheck();
 void registerTestMemRefStrideCalculation();
 void registerTestNumberOfBlockExecutionsPass();
 void registerTestNumberOfOperationExecutionsPass();
 void registerTestOpaqueLoc();
+void registerTestPadFusion();
 void registerTestPDLByteCodePass();
 void registerTestPreparationPassWithAllowedMemrefResults();
 void registerTestRecursiveTypesPass();
 void registerTestSCFUtilsPass();
 void registerTestSliceAnalysisPass();
 void registerTestVectorLowerings();
 } // namespace test
 } // namespace mlir
[76 lines not shown]
 #endif
   mlir::test::registerTestMatchReductionPass();
   mlir::test::registerTestMathAlgebraicSimplificationPass();
   mlir::test::registerTestMathPolynomialApproximationPass();
   mlir::test::registerTestMemRefDependenceCheck();
   mlir::test::registerTestMemRefStrideCalculation();
   mlir::test::registerTestNumberOfBlockExecutionsPass();
   mlir::test::registerTestNumberOfOperationExecutionsPass();
   mlir::test::registerTestOpaqueLoc();
+  mlir::test::registerTestPadFusion();
   mlir::test::registerTestPDLByteCodePass();
   mlir::test::registerTestRecursiveTypesPass();
   mlir::test::registerTestSCFUtilsPass();
   mlir::test::registerTestSliceAnalysisPass();
   mlir::test::registerTestVectorLowerings();
 }
 #endif

[14 lines not shown]
