This is an archive of the discontinued LLVM Phabricator instance.


Differential D112928

[mlir][linalg] Rewrite `linalg.conv_2d_nhwc_hwcf` into 1-D
ClosedPublic

Authored by antiagainst on Nov 1 2021, 7:18 AM.


Details

Reviewers

mravishankar
nicolasvasilache

Commits

rG7b615a87dc55: [mlir][linalg] Rewrite `linalg.conv_2d_nhwc_hwcf` into 1-D

Summary

We'd like to take a progressive approach towards convolution op
CodeGen, by 1) tiling it to fit compute hierarchy first, and then
2) tiling along window dimensions with size 1 to reduce the problem
to be matmul-like. After that, we can 3) downscale high-D convolution
ops to low-D by removing the size-1 window dimensions. The final
step would be 4) vectorizing the low-D convolution op directly.

We have patterns for 1), 2), and 4). This commit adds a pattern for
3) for linalg.conv_2d_nhwc_hwcf ops as a starter. Supporting other
high-D convolution ops should be similar and mechanical.
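
As a rough illustration of step 3), the shape bookkeeping can be sketched in plain C++ without any MLIR dependency. Everything named here (`Shape4`, `Shape3`, `Conv1DPlan`, `dropDim`, `downscaleConv2D`) is hypothetical and exists only for this sketch; it handles static shapes only. Note one deliberate assumption: the sketch drops H only when both the filter H and output H sizes are 1, which is a little stricter than the `removeH` expression in the diff.

```cpp
#include <array>
#include <cassert>
#include <cstdint>

// Hypothetical plain-data stand-ins for this sketch; none of these names
// exist in MLIR.
using Shape4 = std::array<int64_t, 4>;
using Shape3 = std::array<int64_t, 3>;

struct Conv1DPlan {
  Shape3 input;     // N, W, C
  Shape3 filter;    // W, C, F
  Shape3 output;    // N, W, F
  int64_t stride;   // stride along the surviving window dimension
  int64_t dilation; // dilation along the surviving window dimension
};

// Returns `shape` with the dimension at position `index` removed.
inline Shape3 dropDim(const Shape4 &shape, int index) {
  assert(index >= 0 && index < 4);
  Shape3 result{};
  for (int i = 0, j = 0; i < 4; ++i)
    if (i != index)
      result[j++] = shape[i];
  return result;
}

// Fills `plan` and returns true if one window dimension (filter and output)
// has size 1; returns false otherwise, leaving `plan` untouched.
// Layouts: input N,H,W,C; filter H,W,C,F; output N,H,W,F.
inline bool downscaleConv2D(const Shape4 &input, const Shape4 &filter,
                            const Shape4 &output,
                            const std::array<int64_t, 2> &strides,
                            const std::array<int64_t, 2> &dilations,
                            Conv1DPlan &plan) {
  const bool removeH = filter[0] == 1 && output[1] == 1;
  const bool removeW = filter[1] == 1 && output[2] == 1;
  if (!removeH && !removeW)
    return false;
  // When both qualify, H is dropped. The surviving dimension supplies the
  // single stride and dilation of the 1-D convolution.
  const int kept = removeH ? 1 : 0;
  plan.input = dropDim(input, removeH ? 1 : 2);
  plan.filter = dropDim(filter, removeH ? 0 : 1);
  plan.output = dropDim(output, removeH ? 1 : 2);
  plan.stride = strides[kept];
  plan.dilation = dilations[kept];
  return true;
}
```

Fed the shapes of the first test case in this revision (input 4x1x6x3, filter 1x2x3x8, output 4x1x2x8, strides [3, 2], dilations [2, 3]), the sketch yields 4x6x3, 2x3x8, and 4x2x8 with stride 2 and dilation 3, which agrees with the CHECK lines of that test.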

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

antiagainst created this revision. Nov 1 2021, 7:18 AM

Herald added a reviewer: mravishankar. Nov 1 2021, 7:18 AM

Herald added subscribers: wenzhicui, wrengr, Chia-hungDuan and 19 others.

antiagainst requested review of this revision. Nov 1 2021, 7:18 AM

Herald added a reviewer: nicolasvasilache. Nov 1 2021, 7:18 AM

Herald added a project: Restricted Project.

Herald added subscribers: limo1996, stephenneuendorffer, nicolasvasilache.

Harbormaster completed remote builds in B131744: Diff 383800. Nov 1 2021, 7:28 AM

antiagainst mentioned this in D112470: [mlir][linalg] Vectorize 2-D NHWC-HWCF convolution ops. Nov 1 2021, 7:29 AM

LGTM

Next step would be to make this available to the CodegenStrategy and the sandbox, probably via this pass https://sourcegraph.com/github.com/llvm/llvm-project/-/blob/mlir/lib/Dialect/Linalg/Transforms/LinalgStrategyPasses.cpp?L242.
e2e experiments will tell us if this needs to be exposed in a better way.

mlir/lib/Dialect/Linalg/Transforms/Transforms.cpp
Line 908: We can iterate from this to get started, but note that there are implications on bufferization. Alternatively, we could use rank-reducing InsertSliceOp / ExtractSliceOp; this may be needed for proper in-place bufferization, but let's punt for now until we see the whole end-to-end story.

This revision is now accepted and ready to land. Nov 1 2021, 11:42 AM

antiagainst added inline comments. Nov 2 2021, 6:56 AM

mlir/lib/Dialect/Linalg/Transforms/Transforms.cpp
Line 908: Yeah. I was a bit worried about its implications for vectorization too. I think insert/extract slice ops might be better supported there, but I think these reshape ops should be supported too; we need to fix that if it's missing. Good point about bufferization; I wasn't thinking much about it previously. We can change this if it turns out to be an issue.

Closed by commit rG7b615a87dc55: [mlir][linalg] Rewrite `linalg.conv_2d_nhwc_hwcf` into 1-D (authored by antiagainst). Nov 2 2021, 7:00 AM

This revision was automatically updated to reflect the committed changes.

antiagainst added a commit: rG7b615a87dc55: [mlir][linalg] Rewrite `linalg.conv_2d_nhwc_hwcf` into 1-D.

devajith-huawei mentioned this in D145162: [mlir][linalg] Downscale 2D convolution with unit dimensions to 1D convolution. Mar 3 2023, 1:24 PM

hanchung mentioned this in rG991945f4410a: [mlir][linalg] Downscale 2D convolution with unit dimensions to 1D convolution. Mar 8 2023, 2:32 PM

Revision Contents

Path                                                       Size
mlir/include/mlir/Dialect/Linalg/Transforms/Transforms.h   10 lines
mlir/lib/Dialect/Linalg/Transforms/Transforms.cpp          95 lines
mlir/test/Dialect/Linalg/decompose-convolution.mlir        67 lines
mlir/test/lib/Dialect/Linalg/TestLinalgTransforms.cpp      13 lines

Diff 384082

mlir/include/mlir/Dialect/Linalg/Transforms/Transforms.h

  [... 40 lines hidden ...]
  //===----------------------------------------------------------------------===//
  using LinalgLoops = SmallVector<Operation *, 4>;

  /// [DEPRECATED] Populates patterns for vectorization of all ConvN-D ops.
  void populateConvVectorizationPatterns(
      MLIRContext *context, SmallVectorImpl<RewritePatternSet> &patterns,
      ArrayRef<int64_t> tileSizes);

- /// Populates patterns for vectorizing convolution ops.
+ /// Populates patterns to decompose high-D convolution ops into low-D ones. This
+ /// is a step in progressive lowering for convolution ops, afterwards we can
+ /// vectorize the low-D convolution ops.
+ void populateDecomposeConvolutionPatterns(RewritePatternSet &patterns,
+                                           PatternBenefit benefit = 1);
+
+ /// Populates patterns for vectorizing low-D convolution ops. This is a step in
+ /// progressive lowering for convolution ops, it assume high-D convolution ops
+ /// were decomposed previously.
  void populateConvolutionVectorizationPatterns(RewritePatternSet &patterns,
                                                PatternBenefit benefit = 1);

  /// Populate patterns that convert `ElementwiseMappable` ops to linalg
  /// parallel loops.
  void populateElementwiseToLinalgConversionPatterns(RewritePatternSet &patterns);

  /// Function type which is used to control when to stop fusion. It is expected
  [... 1,304 lines hidden ...]

mlir/lib/Dialect/Linalg/Transforms/Transforms.cpp

  [... 834 lines hidden; context: LogicalResult ExtractSliceOfPadTensorSwapPattern::matchAndRewrite( ...]
    Operation *tiledPadOp = padOp.getTiledImplementation(
        rewriter, /*dest=*/ValueRange{}, sliceOp.getMixedOffsets(),
        sliceOp.getMixedSizes());
    // All shapes are static and the data source is actually used. Rewrite into
    // pad_tensor(subtensor(x)).
    rewriter.replaceOp(sliceOp, tiledPadOp->getResults());
    return success();
  }

+ namespace {
+ // The following are patterns for downscaling convolution ops with size-1
+ // window dimensions.
+ //
+ // Note that we'd eventually want to write such transformations in a generic
+ // way, e.g., converting to linalg.generic, removing the size-1 dimensions,
+ // and then turning back to named ops. But for now it's fine to have a few
+ // patterns matching special ops to get started.
+
+ /// Rewrites 2-D convolution ops with size-1 window dimensions into 1-D
+ /// convolution ops.
+ struct DownscaleSizeOneWindowed2DConvolution final
+     : public OpRewritePattern<Conv2DNhwcHwcfOp> {
+   using OpRewritePattern::OpRewritePattern;
+
+   LogicalResult matchAndRewrite(linalg::Conv2DNhwcHwcfOp convOp,
+                                 PatternRewriter &rewriter) const override {
+     auto linalgOp = cast<linalg::LinalgOp>(*convOp);
+     if (linalgOp.hasBufferSemantics())
+       return failure(); // To be implemented
+
+     Value input = convOp.inputs().front();
+     Value filter = convOp.inputs().back();
+     Value output = convOp.outputs().front();
+
+     auto inputType = input.getType().dyn_cast<RankedTensorType>();
+     auto filterType = filter.getType().dyn_cast<RankedTensorType>();
+     auto outputType = output.getType().dyn_cast<RankedTensorType>();
+
+     auto inputShape = inputType.getShape();
+     auto filterShape = filterType.getShape();
+     auto outputShape = outputType.getShape();
+
+     // Only handle the case where at least one of the window dimensions is
+     // of size 1. Other cases can rely on tiling to reduce to such cases.
+     int64_t fhSize = filterShape[0], fwSize = filterShape[1];
+     int64_t ohSize = outputShape[1], owSize = outputShape[2];
+     if (!(fhSize == 1 && ohSize == 1) && !(fwSize == 1 && owSize == 1))
+       return failure();
+     bool removeH = ohSize == 1;
+
+     // Get new shapes and types for all operands by removing the size-1
+     // dimension.
+
+     SmallVector<int64_t, 3> newInputShape{
+         inputShape[0], inputShape[removeH ? 2 : 1], inputShape[3]};
+     auto newInputType = RankedTensorType::get(
+         newInputShape, inputType.getElementType(), inputType.getEncoding());
+
+     SmallVector<int64_t, 3> newFilterShape{filterShape[removeH ? 1 : 0],
+                                            filterShape[2], filterShape[3]};
+     auto newFilterType = RankedTensorType::get(
+         newFilterShape, filterType.getElementType(), filterType.getEncoding());
+
+     SmallVector<int64_t, 3> newOutputShape{
+         outputShape[0], outputShape[removeH ? 2 : 1], outputShape[3]};
+     auto newOutputType = RankedTensorType::get(
+         newOutputShape, outputType.getElementType(), outputType.getEncoding());
+
+     SmallVector<ReassociationIndices, 3> ioReshapeIndices = {{0}, {1, 2}, {3}};
+     SmallVector<ReassociationIndices, 3> fReshapeIndices = {{0, 1}, {2}, {3}};
+
+     // Reshape all operands for 1-D convolution.
+     Location loc = convOp.getLoc();
+     Value newInput = rewriter.create<linalg::TensorCollapseShapeOp>(
+         loc, newInputType, input, ioReshapeIndices);
+     Value newFilter = rewriter.create<linalg::TensorCollapseShapeOp>(
+         loc, newFilterType, filter, fReshapeIndices);
+     Value newOutput = rewriter.create<linalg::TensorCollapseShapeOp>(
+         loc, newOutputType, output, ioReshapeIndices);
+
+     // We need to shrink the strides and dilations too.
+     auto stride = convOp.strides().getFlatValue<int64_t>(removeH ? 1 : 0);
+     auto stridesAttr = rewriter.getI64VectorAttr(stride);
+     auto dilation = convOp.dilations().getFlatValue<int64_t>(removeH ? 1 : 0);
+     auto dilationsAttr = rewriter.getI64VectorAttr(dilation);
+
+     auto conv1DOp = rewriter.create<linalg::Conv1DNwcWcfOp>(
+         loc, newOutputType, ValueRange{newInput, newFilter},
+         ValueRange{newOutput}, stridesAttr, dilationsAttr);
+
+     rewriter.replaceOpWithNewOp<linalg::TensorExpandShapeOp>(
+         convOp, outputType, conv1DOp.getResult(0), ioReshapeIndices);
+     return success();
+   };
+ };
+
+ } // namespace
+
+ void linalg::populateDecomposeConvolutionPatterns(RewritePatternSet &patterns,
+                                                   PatternBenefit benefit) {
+   patterns.add<DownscaleSizeOneWindowed2DConvolution>(patterns.getContext(),
+                                                       benefit);
+ }
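
To see why dropping a size-1 window dimension preserves the result, a small numeric check may help. The following is a minimal sketch in plain C++, not MLIR code: `Buffer`, `conv2dNhwcHwcf`, `conv1dNwcWcf`, and `decomposedConvMatches` are hypothetical names, the loops assume no padding, and the shapes and strides are borrowed from the first test case of this revision. It relies on the fact that collapsing a unit dimension of a row-major buffer leaves the flat data unchanged, so the same buffers can feed both convolutions.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

using Buffer = std::vector<float>; // row-major storage, hypothetical alias

// out[n,oh,ow,f] += in[n, oh*sh + kh*dh, ow*sw + kw*dw, c] * filter[kh,kw,c,f]
inline void conv2dNhwcHwcf(const Buffer &in, const Buffer &filter, Buffer &out,
                           int64_t N, int64_t IH, int64_t IW, int64_t C,
                           int64_t KH, int64_t KW, int64_t F, int64_t OH,
                           int64_t OW, int64_t sh, int64_t sw, int64_t dh,
                           int64_t dw) {
  for (int64_t n = 0; n < N; ++n)
    for (int64_t oh = 0; oh < OH; ++oh)
      for (int64_t ow = 0; ow < OW; ++ow)
        for (int64_t f = 0; f < F; ++f)
          for (int64_t kh = 0; kh < KH; ++kh)
            for (int64_t kw = 0; kw < KW; ++kw)
              for (int64_t c = 0; c < C; ++c) {
                const int64_t ih = oh * sh + kh * dh;
                const int64_t iw = ow * sw + kw * dw;
                out[((n * OH + oh) * OW + ow) * F + f] +=
                    in[((n * IH + ih) * IW + iw) * C + c] *
                    filter[((kh * KW + kw) * C + c) * F + f];
              }
}

// out[n,ow,f] += in[n, ow*sw + kw*dw, c] * filter[kw,c,f]
inline void conv1dNwcWcf(const Buffer &in, const Buffer &filter, Buffer &out,
                         int64_t N, int64_t IW, int64_t C, int64_t KW,
                         int64_t F, int64_t OW, int64_t sw, int64_t dw) {
  for (int64_t n = 0; n < N; ++n)
    for (int64_t ow = 0; ow < OW; ++ow)
      for (int64_t f = 0; f < F; ++f)
        for (int64_t kw = 0; kw < KW; ++kw)
          for (int64_t c = 0; c < C; ++c)
            out[(n * OW + ow) * F + f] +=
                in[(n * IW + ow * sw + kw * dw) * C + c] *
                filter[(kw * C + c) * F + f];
}

// Runs the 2-D convolution (H window of size 1) and the 1-D convolution on
// the same flat buffers and reports whether the outputs agree exactly.
inline bool decomposedConvMatches() {
  const int64_t N = 4, IH = 1, IW = 6, C = 3;
  const int64_t KH = 1, KW = 2, F = 8, OH = 1, OW = 2;
  Buffer in(N * IH * IW * C), filter(KH * KW * C * F);
  // Small integer-valued data keeps float accumulation exact.
  for (std::size_t i = 0; i < in.size(); ++i)
    in[i] = static_cast<float>(i % 7) - 3.0f;
  for (std::size_t i = 0; i < filter.size(); ++i)
    filter[i] = static_cast<float>(i % 5) - 2.0f;
  Buffer out2d(N * OH * OW * F, 0.0f), out1d(N * OW * F, 0.0f);
  conv2dNhwcHwcf(in, filter, out2d, N, IH, IW, C, KH, KW, F, OH, OW,
                 /*sh=*/3, /*sw=*/2, /*dh=*/2, /*dw=*/3);
  conv1dNwcWcf(in, filter, out1d, N, IW, C, KW, F, OW, /*sw=*/2, /*dw=*/3);
  return out2d == out1d;
}
```

With `KH == 1` and `OH == 1` the `kh` and `oh` loops run once with index 0, so the H stride and dilation never contribute to an address. That is the reason the rewrite can keep only the W entries of `strides` and `dilations`.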

mlir/test/Dialect/Linalg/decompose-convolution.mlir

This file was added.

				// RUN: mlir-opt -split-input-file -test-linalg-transform-patterns=test-decompose-convolution-patterns %s | FileCheck %s

				// CHECK-LABEL: func @conv2d_nhwc_4x1x2x8_tensor
				// CHECK-SAME: (%[[INPUT:.+]]: tensor<4x1x6x3xf32>, %[[FILTER:.+]]: tensor<1x2x3x8xf32>, %[[INIT:.+]]: tensor<4x1x2x8xf32>)
				func @conv2d_nhwc_4x1x2x8_tensor(%input: tensor<4x1x6x3xf32>, %filter: tensor<1x2x3x8xf32>, %init: tensor<4x1x2x8xf32>) -> tensor<4x1x2x8xf32> {
				%0 = linalg.conv_2d_nhwc_hwcf
				{dilations = dense<[2, 3]> : tensor<2xi64>, strides = dense<[3, 2]> : tensor<2xi64>}
				ins(%input, %filter : tensor<4x1x6x3xf32>, tensor<1x2x3x8xf32>)
				outs(%init : tensor<4x1x2x8xf32>) -> tensor<4x1x2x8xf32>
				return %0 : tensor<4x1x2x8xf32>
				}

				// CHECK: %[[INPUT_1D:.+]] = linalg.tensor_collapse_shape %[[INPUT]]
				// CHECK-SAME{LITERAL}: [[0], [1, 2], [3]] : tensor<4x1x6x3xf32> into tensor<4x6x3xf32>
				// CHECK: %[[FILTER_1D:.+]] = linalg.tensor_collapse_shape %[[FILTER]]
				// CHECK-SAME{LITERAL}: [[0, 1], [2], [3]] : tensor<1x2x3x8xf32> into tensor<2x3x8xf32>
				// CHECK: %[[INIT_1D:.+]] = linalg.tensor_collapse_shape %[[INIT]]
				// CHECK-SAME{LITERAL}: [[0], [1, 2], [3]] : tensor<4x1x2x8xf32> into tensor<4x2x8xf32>
				// CHECK: %[[CONV_1D:.+]] = linalg.conv_1d_nwc_wcf
				// CHECK-SAME: dilations = dense<3> : vector<1xi64>
				// CHECK-SAME: strides = dense<2> : vector<1xi64>
				// CHECK-SAME: ins(%[[INPUT_1D]], %[[FILTER_1D]] : tensor<4x6x3xf32>, tensor<2x3x8xf32>)
				// CHECK-SAME: outs(%[[INIT_1D]] : tensor<4x2x8xf32>)
				// CHECK: %[[CONV_2D:.+]] = linalg.tensor_expand_shape %[[CONV_1D]]
				// CHECK-SAME{LITERAL}: [[0], [1, 2], [3]] : tensor<4x2x8xf32> into tensor<4x1x2x8xf32>
				// CHECK: return %[[CONV_2D]]

				// -----

				// CHECK-LABEL: func @conv2d_nhwc_qxqx1xq_tensor
				// CHECK-SAME: (%[[INPUT:.+]]: tensor<?x?x1x?xf32>, %[[FILTER:.+]]: tensor<?x1x?x?xf32>, %[[INIT:.+]]: tensor<?x?x1x?xf32>)
				func @conv2d_nhwc_qxqx1xq_tensor(%input: tensor<?x?x1x?xf32>, %filter: tensor<?x1x?x?xf32>, %init: tensor<?x?x1x?xf32>) -> tensor<?x?x1x?xf32> {
				%0 = linalg.conv_2d_nhwc_hwcf
				{dilations = dense<[2, 3]> : tensor<2xi64>, strides = dense<[3, 2]> : tensor<2xi64>}
				ins(%input, %filter : tensor<?x?x1x?xf32>, tensor<?x1x?x?xf32>)
				outs(%init : tensor<?x?x1x?xf32>) -> tensor<?x?x1x?xf32>
				return %0 : tensor<?x?x1x?xf32>
				}

				// CHECK: %[[INPUT_1D:.+]] = linalg.tensor_collapse_shape %[[INPUT]]
				// CHECK-SAME{LITERAL}: [[0], [1, 2], [3]] : tensor<?x?x1x?xf32> into tensor<?x?x?xf32>
				// CHECK: %[[FILTER_1D:.+]] = linalg.tensor_collapse_shape %[[FILTER]]
				// CHECK-SAME{LITERAL}: [[0, 1], [2], [3]] : tensor<?x1x?x?xf32> into tensor<?x?x?xf32>
				// CHECK: %[[INIT_1D:.+]] = linalg.tensor_collapse_shape %[[INIT]]
				// CHECK-SAME{LITERAL}: [[0], [1, 2], [3]] : tensor<?x?x1x?xf32> into tensor<?x?x?xf32>
				// CHECK: %[[CONV_1D:.+]] = linalg.conv_1d_nwc_wcf
				// CHECK-SAME: dilations = dense<2> : vector<1xi64>
				// CHECK-SAME: strides = dense<3> : vector<1xi64>
				// CHECK-SAME: ins(%[[INPUT_1D]], %[[FILTER_1D]] : tensor<?x?x?xf32>, tensor<?x?x?xf32>)
				// CHECK-SAME: outs(%[[INIT_1D]] : tensor<?x?x?xf32>)
				// CHECK: %[[CONV_2D:.+]] = linalg.tensor_expand_shape %[[CONV_1D]]
				// CHECK-SAME{LITERAL}: [[0], [1, 2], [3]] : tensor<?x?x?xf32> into tensor<?x?x1x?xf32>
				// CHECK: return %[[CONV_2D]]

				// -----

				// Do not convert convolution ops whose window dimensions are not ones.

				// CHECK-LABEL: func @conv2d_nhwc_4x1x2x8_tensor
				func @conv2d_nhwc_4x1x2x8_tensor(%input: tensor<4x3x5x3xf32>, %filter: tensor<2x2x3x8xf32>, %init: tensor<4x1x2x8xf32>) -> tensor<4x1x2x8xf32> {
				// CHECK: linalg.conv_2d_nhwc_hwcf
				%0 = linalg.conv_2d_nhwc_hwcf
				{dilations = dense<[2, 3]> : tensor<2xi64>, strides = dense<1> : tensor<2xi64>}
				ins(%input, %filter : tensor<4x3x5x3xf32>, tensor<2x2x3x8xf32>)
				outs(%init : tensor<4x1x2x8xf32>) -> tensor<4x1x2x8xf32>
				return %0 : tensor<4x1x2x8xf32>
				}
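
The static shapes used in these test functions appear to follow the usual output-size relation for an unpadded, strided, dilated convolution. As a hedged aid for writing further cases, here is a small C++ sketch of that relation; `convOutputSize` is a hypothetical helper and is not part of MLIR.

```cpp
#include <cassert>
#include <cstdint>

// Output extent along one axis of an unpadded convolution:
//   out = floor((in - dilation * (kernel - 1) - 1) / stride) + 1
// Returns 0 when the dilated kernel does not fit into the input.
inline int64_t convOutputSize(int64_t inputSize, int64_t kernelSize,
                              int64_t stride, int64_t dilation) {
  assert(kernelSize > 0 && stride > 0 && dilation > 0);
  const int64_t effectiveKernel = dilation * (kernelSize - 1) + 1;
  if (inputSize < effectiveKernel)
    return 0;
  return (inputSize - effectiveKernel) / stride + 1;
}
```

For the first function, the W axis has input 6, kernel 2, stride 2, and dilation 3, giving an output of 2; the H axis has input 1 and kernel 1, giving 1. For the last function (input 4x3x5x3, filter 2x2x3x8, unit strides, dilations [2, 3]) the same relation gives 1 along H and 2 along W, matching the declared 4x1x2x8 result, while the 2x2 window is what keeps the pattern from applying.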

mlir/test/lib/Dialect/Linalg/TestLinalgTransforms.cpp

  [... 146 lines hidden; context: Option<bool> skipPartial{ ...]
        *this, "skip-partial",
        llvm::cl::desc("Skip loops inside partial iterations during peeling"),
        llvm::cl::init(false)};
    Option<std::string> loopType{
        *this, "loop-type",
        llvm::cl::desc("Specify the type of loops to generate: for, parallel or "
                       "tiled_loop"),
        llvm::cl::init("for")};
+   Option<bool> testDecomposeConvolutionPattern{
+       *this, "test-decompose-convolution-patterns",
+       llvm::cl::desc("Test a set of patterns to rewrite high-D convolution ops "
+                      "into low-D ones"),
+       llvm::cl::init(false)};
  };
  } // end anonymous namespace

  static void applyPatterns(FuncOp funcOp) {
    MLIRContext *ctx = funcOp.getContext();
    RewritePatternSet patterns(ctx);

    //===--------------------------------------------------------------------===//
  [... 408 lines hidden; context: patterns.add<LinalgVectorizationPattern>( ...]
        funcOp.getContext(),
        LinalgTransformationFilter()
            .addOpFilter<ContractionOpInterface, FillOp, CopyOp, GenericOp>());
    populatePadTensorOpVectorizationPatterns(patterns);
    populateConvolutionVectorizationPatterns(patterns);
    (void)applyPatternsAndFoldGreedily(funcOp, std::move(patterns));
  }

+ static void applyDecomposeConvolutionPatterns(FuncOp funcOp) {
+   RewritePatternSet patterns(funcOp.getContext());
+   populateDecomposeConvolutionPatterns(patterns);
+   (void)applyPatternsAndFoldGreedily(funcOp, std::move(patterns));
+ }
+
  static void applyPadTensorToGenericPatterns(FuncOp funcOp) {
    RewritePatternSet patterns(funcOp.getContext());
    patterns.add<PadTensorOpTransformationPattern>(funcOp.getContext());
    (void)applyPatternsAndFoldGreedily(funcOp, std::move(patterns));
  }

  static void applyGeneralizePadTensorPatterns(FuncOp funcOp) {
    RewritePatternSet patterns(funcOp.getContext());
  [... 227 lines hidden; context: getFunction().walk([&](linalg::PadTensorOp padTensorOp) { ...]
          padTensorOp->erase();
        }
      });
    }
    if (testPadPattern)
      return applyPadPattern(getFunction(), packPaddings, hoistPaddings);
    if (testInterchangePattern.hasValue())
      return applyInterchangePattern(getFunction(), testInterchangePattern);
+   if (testDecomposeConvolutionPattern)
+     return applyDecomposeConvolutionPatterns(getFunction());
  }

  namespace mlir {
  namespace test {
  void registerTestLinalgTransforms() {
    PassRegistration<TestLinalgTransforms>();
  }
  } // namespace test
  } // namespace mlir
