This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
mlir/
-
lib/Dialect/Linalg/Transforms/
-
Dialect/
-
Linalg/
-
Transforms/
2/2
Tiling.cpp
-
test/Dialect/Linalg/
-
Dialect/
-
Linalg/
-
tile_conv.mlir
1/1
tile_simple_conv.mlir

Differential D86638

[mlir][Linalg] Wrong tile size for convolutions fixed
ClosedPublic

Authored by limo1996 on Aug 26 2020, 9:35 AM.

Download Raw Diff

Details

Reviewers

ftynse
nicolasvasilache
mravishankar

Commits

rG8d35080ebbea: [mlir][Linalg] Wrong tile size for convolutions fixed

Summary

Sizes of tiles (subviews) are bigger by 1 than they should. Let's consider
1D convolution without batches or channels. Furthermore let m iterate over
the output and n over the kernel then input is accessed with m + n. In tiling
subview sizes for convolutions are computed by applying requested tile size
together with kernel size to the above mentioned expression thus let's say
for tile size of 2 the subview size is 2 + size(n), which is bigger by one
than it should since we move kernel only once. The problem behind it is that
range is not turned into closed interval before the composition. This commit
fixes the problem by turning ranges first into closed intervals by substracting
1 and after the composition back to half open by adding 1.

PHAB_REVIEW=D86638

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

limo1996 created this revision.Aug 26 2020, 9:35 AM

Herald added a project: Restricted Project. · View Herald TranscriptAug 26 2020, 9:35 AM

Herald added subscribers: msifontes, jurahul, Kayjukh and 12 others. · View Herald Transcript

limo1996 requested review of this revision.Aug 26 2020, 9:35 AM

Herald added a subscriber: stephenneuendorffer. · View Herald TranscriptAug 26 2020, 9:35 AM

Harbormaster completed remote builds in B69623: Diff 288012.Aug 26 2020, 9:55 AM

@limo1996 This is really nice find! I have been struggling with figuring out why convolutions are giving me errors in some cases, and I suspected something like, but wasnt able to nail this down.

There might be a better fix though. I'll take a look at this as well. For now I am going to just "Request changes" but I dont really have any changes to request. Just a placeholder for me to take a look again.

This revision now requires changes to proceed.Aug 26 2020, 10:14 AM

In D86638#2239561, @mravishankar wrote:

@limo1996 This is really nice find! I have been struggling with figuring out why convolutions are giving me errors in some cases, and I suspected something like, but wasnt able to nail this down.

There might be a better fix though. I'll take a look at this as well. For now I am going to just "Request changes" but I dont really have any changes to request. Just a placeholder for me to take a look again.

Thank you @mravishankar! I know for sure there is a better fix but my goal here was to initiate a discussion.

You can try new Convolutions for different layouts and ranks that were recently added. I also saw that traditional linalg.conv is quite buggy but the plan is that it will be replaced so I did not invest time into fixing it.

Could we start a discourse post on this. Its more easy to document issues and discuss solutions there rather than on the patch.

Also, I am curious what other bugs you found in the convolution operation. Please add those to the discourse post if possible.

In D86638#2242434, @mravishankar wrote:

Could we start a discourse post on this. Its more easy to document issues and discuss solutions there rather than on the patch.

Also, I am curious what other bugs you found in the convolution operation. Please add those to the discourse post if possible.

Here is the discourse post: https://llvm.discourse.group/t/wrong-tile-size-for-convolution-ops/1699

Implementation changed as discussed here:
https://llvm.discourse.group/t/wrong-tile-size-for-convolution-ops/1699

limo1996 edited the summary of this revision. (Show Details)Sep 2 2020, 5:20 AM

limo1996 retitled this revision from [mlir][WIP] Tiling for convolutions to [mlir][Linalg] Wrong tile size for convolutions fixed.

limo1996 edited the summary of this revision. (Show Details)

Harbormaster completed remote builds in B70367: Diff 289402.Sep 2 2020, 5:34 AM

nicolasvasilache requested changes to this revision.Sep 2 2020, 5:48 AM

nicolasvasilache added inline comments.

mlir/lib/Dialect/Linalg/Transforms/Tiling.cpp
248	Please use `mlir::edsc::op::operator+` and write it as `size + std_constant_index(1)` to get the proper affine_apply operation generated. Without that things won't canonicalize nicely. See e.g. mlir/test/EDSC/builder-api-test.cpp line 82 for some C++ and the generated IR.
289	same thing here re `operator+`
mlir/test/Dialect/Linalg/tile_simple_conv.mlir
38	these shouls not appear as subi / addi but instead use affine_apply with the properly canonicalized/simplified expressions.

This revision now requires changes to proceed.Sep 2 2020, 5:48 AM

Comments of nicolas resolved

limo1996 marked 3 inline comments as done.Sep 2 2020, 7:03 AM

Harbormaster completed remote builds in B70396: Diff 289440.Sep 2 2020, 7:35 AM

Great, thank you @limo1996 !

Thanks!

This revision is now accepted and ready to land.Sep 2 2020, 9:16 AM

Closed by commit rG8d35080ebbea: [mlir][Linalg] Wrong tile size for convolutions fixed (authored by limo1996). · Explain WhySep 2 2020, 11:02 PM

This revision was automatically updated to reflect the committed changes.

limo1996 added a commit: rG8d35080ebbea: [mlir][Linalg] Wrong tile size for convolutions fixed.

Revision Contents

Path

Size

mlir/

lib/

Dialect/

Linalg/

Transforms/

Tiling.cpp

8 lines

test/

Dialect/

Linalg/

tile_conv.mlir

2 lines

tile_simple_conv.mlir

6 lines

Diff 289643

mlir/lib/Dialect/Linalg/Transforms/Tiling.cpp

Show First 20 Lines • Show All 237 Lines • ▼ Show 20 Lines	static SmallVector<Value, 4> makeTiledViews(OpBuilder &b, Location loc,

auto viewSizes = applyMapToValues(b, loc, map, allViewSizes);		auto viewSizes = applyMapToValues(b, loc, map, allViewSizes);
// Construct (potentially temporary) mins and maxes on which to apply maps		// Construct (potentially temporary) mins and maxes on which to apply maps
// that define tile subviews.		// that define tile subviews.
SmallVector<Value, 8> lbs, subViewSizes;		SmallVector<Value, 8> lbs, subViewSizes;
for (unsigned idx = 0, idxIvs = 0, e = tileSizes.size(); idx < e; ++idx) {		for (unsigned idx = 0, idxIvs = 0, e = tileSizes.size(); idx < e; ++idx) {
bool isTiled = !isZero(tileSizes[idx]);		bool isTiled = !isZero(tileSizes[idx]);
lbs.push_back(isTiled ? ivs[idxIvs++] : (Value)std_constant_index(0));		lbs.push_back(isTiled ? ivs[idxIvs++] : (Value)std_constant_index(0));
subViewSizes.push_back(isTiled ? tileSizes[idx] : viewSizes[idx]);		// Before composing, we need to make range a closed interval.
		Value size = isTiled ? tileSizes[idx] : viewSizes[idx];
		subViewSizes.push_back(size - std_constant_index(1));
		nicolasvasilacheUnsubmitted Done Reply Inline Actions Please use `mlir::edsc::op::operator+` and write it as `size + std_constant_index(1)` to get the proper affine_apply operation generated. Without that things won't canonicalize nicely. See e.g. mlir/test/EDSC/builder-api-test.cpp line 82 for some C++ and the generated IR. nicolasvasilache: Please use `mlir::edsc::op::operator+` and write it as `size + std_constant_index(1)` to get…
}		}

auto *op = linalgOp.getOperation();		auto *op = linalgOp.getOperation();

SmallVector<Value, 4> res;		SmallVector<Value, 4> res;
res.reserve(op->getNumOperands());		res.reserve(op->getNumOperands());
auto viewIteratorBegin = linalgOp.getInputsAndOutputBuffers().begin();		auto viewIteratorBegin = linalgOp.getInputsAndOutputBuffers().begin();
for (unsigned viewIndex = 0; viewIndex < linalgOp.getNumInputsAndOutputs();		for (unsigned viewIndex = 0; viewIndex < linalgOp.getNumInputsAndOutputs();
Show All 22 Lines	for (unsigned r = 0; r < rank; ++r) {
continue;		continue;
}		}

// Tiling creates a new slice at the proper index, the slice step is 1		// Tiling creates a new slice at the proper index, the slice step is 1
// (i.e. the slice view does not subsample, stepping occurs in the loop).		// (i.e. the slice view does not subsample, stepping occurs in the loop).
auto m = map.getSubMap({r});		auto m = map.getSubMap({r});
auto offset = applyMapToValues(b, loc, m, lbs).front();		auto offset = applyMapToValues(b, loc, m, lbs).front();
offsets.push_back(offset);		offsets.push_back(offset);
auto size = applyMapToValues(b, loc, m, subViewSizes).front();		auto closedIntSize = applyMapToValues(b, loc, m, subViewSizes).front();
		// Resulting size needs to be made half open interval again.
		auto size = closedIntSize + std_constant_index(1);
		nicolasvasilacheUnsubmitted Done Reply Inline Actions same thing here re `operator+` nicolasvasilache: same thing here re `operator+`

// The size of the subview should be trimmed to avoid out-of-bounds		// The size of the subview should be trimmed to avoid out-of-bounds
// accesses, unless we statically know the subview size divides the view		// accesses, unless we statically know the subview size divides the view
// size evenly.		// size evenly.
int64_t viewSize = viewType.getDimSize(r);		int64_t viewSize = viewType.getDimSize(r);
auto sizeCst = size.getDefiningOp<ConstantIndexOp>();		auto sizeCst = size.getDefiningOp<ConstantIndexOp>();
if (ShapedType::isDynamic(viewSize) \|\| !sizeCst \|\|		if (ShapedType::isDynamic(viewSize) \|\| !sizeCst \|\|
(viewSize % sizeCst.getValue()) != 0) {		(viewSize % sizeCst.getValue()) != 0) {
▲ Show 20 Lines • Show All 268 Lines • Show Last 20 Lines

mlir/test/Dialect/Linalg/tile_conv.mlir

	// RUN: mlir-opt %s -linalg-tile="linalg-tile-sizes=2,3,0,0,4" \| FileCheck %s -check-prefix=TILE-23004			// RUN: mlir-opt %s -linalg-tile="linalg-tile-sizes=2,3,0,0,4" \| FileCheck %s -check-prefix=TILE-23004

	// TILE-23004-DAG: #[[$D0x30pS0x10:.]] = affine_map<(d0) -> (d0 30)>			// TILE-23004-DAG: #[[$D0x30pS0x10:.]] = affine_map<(d0) -> (d0 30)>
	// TILE-23004-DAG: #[[$S0x10p90D0x30pS1:.]] = affine_map<(d0)[s0, s1] -> (s0 10 + 90, d0 * -30 + s1)>			// TILE-23004-DAG: #[[$S0x10p90D0x30pS1:.]] = affine_map<(d0)[s0, s1] -> (s0 10 + 51, d0 * -30 + s1)>
	// TILE-23004-DAG: #[[$strided4D:.]] = affine_map<(d0, d1, d2, d3)[s0, s1, s2, s3] -> (d0 s1 + s0 + d1 * s2 + d2 * s3 + d3)>			// TILE-23004-DAG: #[[$strided4D:.]] = affine_map<(d0, d1, d2, d3)[s0, s1, s2, s3] -> (d0 s1 + s0 + d1 * s2 + d2 * s3 + d3)>
	// TILE-23004-DAG: #[[$bound_map_4:.*]] = affine_map<(d0)[s0] -> (4, -d0 + s0)>			// TILE-23004-DAG: #[[$bound_map_4:.*]] = affine_map<(d0)[s0] -> (4, -d0 + s0)>

	func @conv(%arg0: memref<?x?x?x?xf32, offset: ?, strides: [?, ?, ?, 1]>, %arg1: memref<?x?x?x?xf32, offset: ?, strides: [?, ?, ?, 1]>, %arg2: memref<?x?x?x?xf32, offset: ?, strides: [?, ?, ?, 1]>) {			func @conv(%arg0: memref<?x?x?x?xf32, offset: ?, strides: [?, ?, ?, 1]>, %arg1: memref<?x?x?x?xf32, offset: ?, strides: [?, ?, ?, 1]>, %arg2: memref<?x?x?x?xf32, offset: ?, strides: [?, ?, ?, 1]>) {
	linalg.conv(%arg0, %arg1, %arg2) {dilations = [10, 20], strides = [30, 40]} : memref<?x?x?x?xf32, offset: ?, strides: [?, ?, ?, 1]>, memref<?x?x?x?xf32, offset: ?, strides: [?, ?, ?, 1]>, memref<?x?x?x?xf32, offset: ?, strides: [?, ?, ?, 1]>			linalg.conv(%arg0, %arg1, %arg2) {dilations = [10, 20], strides = [30, 40]} : memref<?x?x?x?xf32, offset: ?, strides: [?, ?, ?, 1]>, memref<?x?x?x?xf32, offset: ?, strides: [?, ?, ?, 1]>, memref<?x?x?x?xf32, offset: ?, strides: [?, ?, ?, 1]>
	return			return
	}			}
	// TILE-23004: func @conv(			// TILE-23004: func @conv(
	Show All 34 Lines

mlir/test/Dialect/Linalg/tile_simple_conv.mlir

	// RUN: mlir-opt %s -linalg-tile="linalg-tile-sizes=2,3,4" \| FileCheck %s			// RUN: mlir-opt %s -linalg-tile="linalg-tile-sizes=2,3,4" \| FileCheck %s

	// CHECK-DAG: #[[MAP0:.*]] = affine_map<(d0)[s0] -> (2, -d0 + s0)>			// CHECK-DAG: #[[MAP0:.*]] = affine_map<(d0)[s0] -> (2, -d0 + s0)>
	// CHECK-DAG: #[[MAP1:.*]] = affine_map<(d0)[s0, s1] -> (s0 + 3, -d0 + s1)>			// CHECK-DAG: #[[MAP1:.*]] = affine_map<(d0)[s0, s1] -> (s0 + 2, -d0 + s1)>
	// CHECK-DAG: #[[MAP2:.*]] = affine_map<(d0)[s0, s1] -> (s0 + 4, -d0 + s1)>			// CHECK-DAG: #[[MAP2:.*]] = affine_map<(d0)[s0, s1] -> (s0 + 3, -d0 + s1)>
	// CHECK-DAG: #[[MAP4:.*]] = affine_map<(d0)[s0] -> (3, -d0 + s0)>			// CHECK-DAG: #[[MAP4:.*]] = affine_map<(d0)[s0] -> (3, -d0 + s0)>
	// CHECK-DAG: #[[MAP5:.*]] = affine_map<(d0)[s0] -> (4, -d0 + s0)>			// CHECK-DAG: #[[MAP5:.*]] = affine_map<(d0)[s0] -> (4, -d0 + s0)>

	func @conv(%arg0 : memref<?x?x?x?xf32>, %arg1 : memref<?x?x?x?xf32>, %arg2 : memref<?x?x?x?xf32>) {			func @conv(%arg0 : memref<?x?x?x?xf32>, %arg1 : memref<?x?x?x?xf32>, %arg2 : memref<?x?x?x?xf32>) {
	linalg.conv(%arg0, %arg1, %arg2) : memref<?x?x?x?xf32>, memref<?x?x?x?xf32>, memref<?x?x?x?xf32>			linalg.conv(%arg0, %arg1, %arg2) : memref<?x?x?x?xf32>, memref<?x?x?x?xf32>, memref<?x?x?x?xf32>
	return			return
	}			}

	Show All 16 Lines
	// CHECK: scf.for %[[ARG5:.*]] = %[[C0]] to %[[T4]] step %[[C4]]			// CHECK: scf.for %[[ARG5:.*]] = %[[C0]] to %[[T4]] step %[[C4]]
	// CHECK: %[[T5:.*]] = dim %[[ARG1]], %[[C0]]			// CHECK: %[[T5:.*]] = dim %[[ARG1]], %[[C0]]
	// CHECK: %[[T6:.*]] = affine.min #[[MAP0]](%[[ARG3]])[%[[T5]]]			// CHECK: %[[T6:.*]] = affine.min #[[MAP0]](%[[ARG3]])[%[[T5]]]
	// CHECK: %[[T7:.*]] = dim %[[ARG1]], %[[C1]]			// CHECK: %[[T7:.*]] = dim %[[ARG1]], %[[C1]]
	// CHECK: %[[T8:.*]] = affine.min #[[MAP1]](%[[ARG4]])[%[[T0]], %[[T7]]]			// CHECK: %[[T8:.*]] = affine.min #[[MAP1]](%[[ARG4]])[%[[T0]], %[[T7]]]
	// CHECK: %[[T9:.*]] = dim %[[ARG1]], %[[C2]]			// CHECK: %[[T9:.*]] = dim %[[ARG1]], %[[C2]]
	// CHECK: %[[T10:.*]] = affine.min #[[MAP2]](%[[ARG5]])[%[[T1]], %[[T9]]]			// CHECK: %[[T10:.*]] = affine.min #[[MAP2]](%[[ARG5]])[%[[T1]], %[[T9]]]
	// CHECK: %[[T11:.*]] = dim %[[ARG1]], %[[C3]]			// CHECK: %[[T11:.*]] = dim %[[ARG1]], %[[C3]]
	// CHECK: %[[SV1:.*]] = subview %[[ARG1]][%[[ARG3]], %[[ARG4]], %[[ARG5]], 0]			// CHECK: %[[SV1:.*]] = subview %[[ARG1]][%[[ARG3]], %[[ARG4]], %[[ARG5]], 0]
				nicolasvasilacheUnsubmitted Done Reply Inline Actions these shouls not appear as subi / addi but instead use affine_apply with the properly canonicalized/simplified expressions. nicolasvasilache: these shouls not appear as subi / addi but instead use affine_apply with the properly…
	// CHECK-SAME: [%[[T6]], %[[T8]], %[[T10]], %[[T11]]]			// CHECK-SAME: [%[[T6]], %[[T8]], %[[T10]], %[[T11]]]
	// CHECK: %[[T13:.*]] = dim %[[ARG2]], %[[C0]]			// CHECK: %[[T13:.*]] = dim %[[ARG2]], %[[C0]]
	// CHECK: %[[T14:.*]] = affine.min #[[MAP0]](%[[ARG3]])[%[[T13]]]			// CHECK: %[[T14:.*]] = affine.min #[[MAP0]](%[[ARG3]])[%[[T13]]]
	// CHECK: %[[T15:.*]] = dim %[[ARG2]], %[[C1]]			// CHECK: %[[T15:.*]] = dim %[[ARG2]], %[[C1]]
	// CHECK: %[[T16:.*]] = affine.min #[[MAP4]](%[[ARG4]])[%[[T15]]]			// CHECK: %[[T16:.*]] = affine.min #[[MAP4]](%[[ARG4]])[%[[T15]]]
	// CHECK: %[[T17:.*]] = dim %[[ARG2]], %[[C2]]			// CHECK: %[[T17:.*]] = dim %[[ARG2]], %[[C2]]
	// CHECK: %[[T18:.*]] = affine.min #[[MAP5]](%[[ARG5]])[%[[T17]]]			// CHECK: %[[T18:.*]] = affine.min #[[MAP5]](%[[ARG5]])[%[[T17]]]
	// CHECK: %[[T19:.*]] = dim %[[ARG2]], %[[C3]]			// CHECK: %[[T19:.*]] = dim %[[ARG2]], %[[C3]]
	// CHECK: %[[SV2:.*]] = subview %[[ARG2]][%[[ARG3]], %[[ARG4]], %[[ARG5]], 0]			// CHECK: %[[SV2:.*]] = subview %[[ARG2]][%[[ARG3]], %[[ARG4]], %[[ARG5]], 0]
	// CHECK-SAME: [%[[T14]], %[[T16]], %[[T18]], %[[T19]]]			// CHECK-SAME: [%[[T14]], %[[T16]], %[[T18]], %[[T19]]]
	// CHECK: linalg.conv(%[[ARG0]], %[[SV1]], %[[SV2]])			// CHECK: linalg.conv(%[[ARG0]], %[[SV1]], %[[SV2]])
	No newline at end of file