This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
mlir/
-
include/mlir/IR/
-
mlir/
-
IR/
2/2
BuiltinTypes.h
-
lib/Dialect/
-
Dialect/
-
Linalg/Transforms/
-
Transforms/
-
Transforms.cpp
-
Vector/
-
VectorTransforms.cpp

Differential D113933

[mlir] Fix unintentional mutation by VectorType/RankedTensorType::Builder dropDim
ClosedPublic

Authored by nicolasvasilache on Nov 15 2021, 12:33 PM.

Download Raw Diff

Details

Reviewers

mehdi_amini
rriddle
aartbik
gysit

Commits

rG789c88e80e87: [mlir] Fix unintentional mutation by VectorType/RankedTensorType::Builder…

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

nicolasvasilache created this revision.Nov 15 2021, 12:33 PM

Herald added subscribers: sdasgup3, wenzhicui, wrengr and 19 others. · View Herald TranscriptNov 15 2021, 12:33 PM

nicolasvasilache requested review of this revision.Nov 15 2021, 12:33 PM

Herald added a project: Restricted Project. · View Herald TranscriptNov 15 2021, 12:33 PM

Herald added a subscriber: stephenneuendorffer. · View Herald Transcript

jpienaar added a subscriber: jpienaar.Nov 15 2021, 12:42 PM

jpienaar added inline comments.

mlir/include/mlir/IR/BuiltinTypes.h
286–287	Why not have this return *this still? dropDim is different than all the others here.

nicolasvasilache marked an inline comment as done.Nov 15 2021, 12:46 PM

nicolasvasilache added inline comments.

mlir/include/mlir/IR/BuiltinTypes.h
286–287	Because dropping a dim would break ArrayRef contiguity requirements.

Harbormaster completed remote builds in B134329: Diff 387358.Nov 15 2021, 12:52 PM

This LG, but this does not entirely address the quirks of this builder: we have a "mostly flex" API but not for the "dropDim" which is somehow always "terminal" in the chain? It also means you can only drop a single dimension really.

In D113933#3133765, @mehdi_amini wrote:

This LG, but this does not entirely address the quirks of this builder: we have a "mostly flex" API but not for the "dropDim" which is somehow always "terminal" in the chain? It also means you can only drop a single dimension really.

Since it is non-owning, it has to be terminal unless we change the underlying design of MemRefType::Builder, VectorType::Builder and TensorType::Builder consistently to all take an owning SmallVector<int64_t> shape which I don't think is a good idea. I wouldn't go as far as creating a new Builder class just for this purpose, this would be more confusing. Maybe a better naming scheme would help?

Yes, atm this only drops a single dimension, I have not had needs for more than this so far. As usual with MLIR we can extend when the need arises.
Generally cleaning up all the verbose multiline type creations in MLIR is a cleanup step that I'd like to turn into intro tasks.

I don't know the history of all these Builders actually, it just gives me a feel of a fairly "ad-hoc" API that makes it a bit specialized for a subset of the users, whereas I'd have thought of APIs exposed in mlir/include/mlir/IR/BuiltinTypes.h to be a bit more generic.
I wonder how @rriddle would see all this?

MemRefType::Builder was introduced here: https://reviews.llvm.org/D73296

In D113933#3133770, @nicolasvasilache wrote:

In D113933#3133765, @mehdi_amini wrote:

This LG, but this does not entirely address the quirks of this builder: we have a "mostly flex" API but not for the "dropDim" which is somehow always "terminal" in the chain? It also means you can only drop a single dimension really.

Since it is non-owning, it has to be terminal unless we change the underlying design of MemRefType::Builder, VectorType::Builder and TensorType::Builder consistently to all take an owning SmallVector<int64_t> shape which I don't think is a good idea. I wouldn't go as far as creating a new Builder class just for this purpose, this would be more confusing.

I don't see a problem of having an owning vector there, especially the SmallVector that keeps ~6 elements on stack (IIRC, it's 64 bytes total storage, and 16 are used for size and capacity). If we were using the regular type construction API, we would have used SmallVector anyway.

Maybe a better naming scheme would help?

We can also go Java-style and add an explicit .build() method that constructs the type, and have the .dropDimensionsAndBuild() counterpart.

In D113933#3133792, @mehdi_amini wrote:

I don't know the history of all these Builders actually, it just gives me a feel of a fairly "ad-hoc" API that makes it a bit specialized for a subset of the users, whereas I'd have thought of APIs exposed in mlir/include/mlir/IR/BuiltinTypes.h to be a bit more generic.
I wonder how @rriddle would see all this?

The original MemRefBuilder was added to remove the boilerplate of creating memref type that is exactly the same as another memref except for one property (address space or layout), abundant in conversions. It looks much cleaner to write MemRefType::Builder(original).setMemorySpace(42) than to write MemRefType::get(original.getContext(), original.getShape(), original.getLayout(), /*memorySpace=*/42). Both ways are equally generic and equally BuiltinTypes.h.

Both ways are equally generic

I agree, but only in absence of terminal APIs (ones that don't return *this) in these fluent style Builder classes, because they are non-composable. These APIs makes it look to me like they are motivated by specific call sites / uses, and hence are not really generic anymore.
Having something like MemRefType::Builder(original).setMemorySpace(42) which returns a builder and rely on the conversion operator to finalize (or with an extra build() or finalize() method as you suggested) would seems fine to me on the other hand.

There is some cost as Nicolas mentioned: you always have to copy the shape into a SmallVector when creating the builder instead of copying only if the user is using dropDim. I don't know if this is something to optimize for here though (memcpy of a few int64 to the stack), we may be able to do a copy-on-write as well, but that may not be trivial / worth it.

In D113933#3136339, @mehdi_amini wrote:

Both ways are equally generic

I agree, but only in absence of terminal APIs (ones that don't return *this) in these fluent style Builder classes, because they are non-composable. These APIs makes it look to me like they are motivated by specific call sites / uses, and hence are not really generic anymore.
Having something like MemRefType::Builder(original).setMemorySpace(42) which returns a builder and rely on the conversion operator to finalize (or with an extra build() or finalize() method as you suggested) would seems fine to me on the other hand.

There is some cost as Nicolas mentioned: you always have to copy the shape into a SmallVector when creating the builder instead of copying only if the user is using dropDim. I don't know if this is something to optimize for here though (memcpy of a few int64 to the stack), we may be able to do a copy-on-write as well, but that may not be trivial / worth it.

Can't we do copy-on-write for the shape? I don't see why we need to copy the shape if we haven't modified it. These builders are short lived on the stack anyways, so I don't see a big deal about making the builder slightly bigger.

How would copy-on-write be implemented?
More specifically who has the ownership of the copied vector and guarantees that it is live until the builder dies?

Atm the vector is local to dropDim and I don't have a good way to let it escape.
Is there maybe a specific CopyOnWriteArrayRef<int64_t> shape; in LLVM I could use in place of the ArrayRef<int64_t> shape member in the Builder ?

In D113933#3137055, @nicolasvasilache wrote:

How would copy-on-write be implemented?

With the builder having a SmallVector member *and* an ArrayRef.

SmallVector<int64_t> storage;
ArrayRef<int64_t> shape;

Type dropDim(unsigned pos) const {
  if (storage.empty()) storage = shape;
  assert(pos < shape.size() && "overflow");
  storage.erase(storage.begin() + pos);
  shape = {storage.data(), storage.size()};
  return *this;
}

Implement CoW + rebase.

Herald added a reviewer: aartbik. · View Herald TranscriptNov 22 2021, 2:08 AM

Harbormaster completed remote builds in B135360: Diff 388824.Nov 22 2021, 2:27 AM

gysit accepted this revision.Nov 22 2021, 2:39 AM

This revision is now accepted and ready to land.Nov 22 2021, 2:39 AM

Fix bug: assign -> append

Herald added a subscriber: mravishankar. · View Herald TranscriptNov 22 2021, 2:51 AM

This revision was landed with ongoing or failed builds.Nov 22 2021, 2:55 AM

Closed by commit rG789c88e80e87: [mlir] Fix unintentional mutation by VectorType/RankedTensorType::Builder… (authored by nicolasvasilache). · Explain Why

This revision was automatically updated to reflect the committed changes.

nicolasvasilache added a commit: rG789c88e80e87: [mlir] Fix unintentional mutation by VectorType/RankedTensorType::Builder….

Harbormaster completed remote builds in B135369: Diff 388838.Nov 22 2021, 3:07 AM

Revision Contents

Path

Size

mlir/

include/

mlir/

IR/

BuiltinTypes.h

38 lines

lib/

Dialect/

Linalg/

Transforms/

Transforms.cpp

18 lines

Vector/

VectorTransforms.cpp

11 lines

Diff 388841

mlir/include/mlir/IR/BuiltinTypes.h

Show First 20 Lines • Show All 277 Lines • ▼ Show 20 Lines	Builder &setElementType(Type newElementType) {
return *this;		return *this;
}		}

Builder &setEncoding(Attribute newEncoding) {		Builder &setEncoding(Attribute newEncoding) {
encoding = newEncoding;		encoding = newEncoding;
return *this;		return *this;
}		}

/// Create a new RankedTensor by erasing a dim from shape @pos.		/// Erase a dim from shape @pos.
RankedTensorType dropDim(unsigned pos) {		Builder &dropDim(unsigned pos) {
		jpienaarUnsubmitted Done Reply Inline Actions Why not have this return this still? dropDim is different than all the others here. jpienaar:* Why not have this return *this still? dropDim is different than all the others here.
		nicolasvasilacheAuthorUnsubmitted Done Reply Inline Actions Because dropping a dim would break ArrayRef contiguity requirements. nicolasvasilache: Because dropping a dim would break ArrayRef contiguity requirements.
assert(pos < shape.size() && "overflow");		assert(pos < shape.size() && "overflow");
SmallVector<int64_t, 4> newShape(shape.begin(), shape.end());		if (storage.empty())
newShape.erase(newShape.begin() + pos);		storage.append(shape.begin(), shape.end());
return setShape(newShape);		storage.erase(storage.begin() + pos);
		shape = {storage.data(), storage.size()};
		return *this;
}		}

operator RankedTensorType() {		operator RankedTensorType() {
return RankedTensorType::get(shape, elementType, encoding);		return RankedTensorType::get(shape, elementType, encoding);
}		}

private:		private:
ArrayRef<int64_t> shape;		ArrayRef<int64_t> shape;
		// Owning shape data for copy-on-write operations.
		SmallVector<int64_t> storage;
Type elementType;		Type elementType;
Attribute encoding;		Attribute encoding;
};		};

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// VectorType		// VectorType
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

Show All 14 Lines	Builder &setShape(ArrayRef<int64_t> newShape) {
return *this;		return *this;
}		}

Builder &setElementType(Type newElementType) {		Builder &setElementType(Type newElementType) {
elementType = newElementType;		elementType = newElementType;
return *this;		return *this;
}		}

/// Create a new VectorType by erasing a dim from shape @pos.		/// Erase a dim from shape @pos.
		Builder &dropDim(unsigned pos) {
		assert(pos < shape.size() && "overflow");
		if (storage.empty())
		storage.append(shape.begin(), shape.end());
		storage.erase(storage.begin() + pos);
		shape = {storage.data(), storage.size()};
		return *this;
		}

/// In the particular case where the vector has a single dimension that we		/// In the particular case where the vector has a single dimension that we
/// drop, return the scalar element type.		/// drop, return the scalar element type.
// TODO: unify once we have a VectorType that supports 0-D.		// TODO: unify once we have a VectorType that supports 0-D.
Type dropDim(unsigned pos) {		operator Type() {
assert(pos < shape.size() && "overflow");		if (shape.empty())
if (shape.size() == 1)
return elementType;		return elementType;
SmallVector<int64_t, 4> newShape(shape.begin(), shape.end());		return VectorType::get(shape, elementType);
newShape.erase(newShape.begin() + pos);
return setShape(newShape);
}		}

operator VectorType() { return VectorType::get(shape, elementType); }

private:		private:
ArrayRef<int64_t> shape;		ArrayRef<int64_t> shape;
		// Owning shape data for copy-on-write operations.
		SmallVector<int64_t> storage;
Type elementType;		Type elementType;
};		};

/// Given an `originalShape` and a `reducedShape` assumed to be a subset of		/// Given an `originalShape` and a `reducedShape` assumed to be a subset of
/// `originalShape` with some `1` entries erased, return the set of indices		/// `originalShape` with some `1` entries erased, return the set of indices
/// that specifies which of the entries of `originalShape` are dropped to obtain		/// that specifies which of the entries of `originalShape` are dropped to obtain
/// `reducedShape`. The returned mask can be applied as a projection to		/// `reducedShape`. The returned mask can be applied as a projection to
/// `originalShape` to obtain the `reducedShape`. This mask is useful to track		/// `originalShape` to obtain the `reducedShape`. This mask is useful to track
▲ Show 20 Lines • Show All 153 Lines • Show Last 20 Lines

mlir/lib/Dialect/Linalg/Transforms/Transforms.cpp

Show First 20 Lines • Show All 870 Lines • ▼ Show 20 Lines	LogicalResult matchAndRewrite(linalg::Conv2DNhwcHwcfOp convOp,
bool removeH = (fhSize == 1 && ohSize == 1);		bool removeH = (fhSize == 1 && ohSize == 1);
bool removeW = (fwSize == 1 && owSize == 1);		bool removeW = (fwSize == 1 && owSize == 1);
if (!removeH && !removeW)		if (!removeH && !removeW)
return failure();		return failure();

// Get new shapes and types for all operands by removing the size-1		// Get new shapes and types for all operands by removing the size-1
// dimension.		// dimension.
using RTTBuilder = RankedTensorType::Builder;		using RTTBuilder = RankedTensorType::Builder;
auto newInputType = RTTBuilder(inputType).dropDim((removeH ? 1 : 2));		RankedTensorType newInputType =
auto newFilterType = RTTBuilder(filterType).dropDim((removeH ? 0 : 1));		RTTBuilder(inputType).dropDim((removeH ? 1 : 2));
auto newOutputType = RTTBuilder(outputType).dropDim(removeH ? 1 : 2);		RankedTensorType newFilterType =
		RTTBuilder(filterType).dropDim((removeH ? 0 : 1));
		RankedTensorType newOutputType =
		RTTBuilder(outputType).dropDim(removeH ? 1 : 2);

// Rank-reduce operands.		// Rank-reduce operands.
Location loc = convOp.getLoc();		Location loc = convOp.getLoc();
Value newInput = tensor::createCanonicalRankReducingExtractSliceOp(		Value newInput = tensor::createCanonicalRankReducingExtractSliceOp(
rewriter, loc, input, newInputType);		rewriter, loc, input, newInputType);
Value newFilter = tensor::createCanonicalRankReducingExtractSliceOp(		Value newFilter = tensor::createCanonicalRankReducingExtractSliceOp(
rewriter, loc, filter, newFilterType);		rewriter, loc, filter, newFilterType);
Value newOutput = tensor::createCanonicalRankReducingExtractSliceOp(		Value newOutput = tensor::createCanonicalRankReducingExtractSliceOp(
▲ Show 20 Lines • Show All 53 Lines • ▼ Show 20 Lines	LogicalResult matchAndRewrite(DepthwiseConv2DNhwcHwcOp convOp,
bool removeH = (khSize == 1 && ohSize == 1);		bool removeH = (khSize == 1 && ohSize == 1);
bool removeW = (kwSize == 1 && owSize == 1);		bool removeW = (kwSize == 1 && owSize == 1);
if (!removeH && !removeW)		if (!removeH && !removeW)
return failure();		return failure();

// Get new shapes and types for all operands by removing the size-1		// Get new shapes and types for all operands by removing the size-1
// dimension.		// dimension.
using RTTBuilder = RankedTensorType::Builder;		using RTTBuilder = RankedTensorType::Builder;
auto newInputType = RTTBuilder(inputType).dropDim((removeH ? 1 : 2));		RankedTensorType newInputType =
auto newKernelType = RTTBuilder(kernelType).dropDim((removeH ? 0 : 1));		RTTBuilder(inputType).dropDim((removeH ? 1 : 2));
auto newOutputType = RTTBuilder(outputType).dropDim(removeH ? 1 : 2);		RankedTensorType newKernelType =
		RTTBuilder(kernelType).dropDim((removeH ? 0 : 1));
		RankedTensorType newOutputType =
		RTTBuilder(outputType).dropDim(removeH ? 1 : 2);

// Rank-reduce operands.		// Rank-reduce operands.
Location loc = convOp.getLoc();		Location loc = convOp.getLoc();
Value newInput = tensor::createCanonicalRankReducingExtractSliceOp(		Value newInput = tensor::createCanonicalRankReducingExtractSliceOp(
rewriter, loc, input, newInputType);		rewriter, loc, input, newInputType);
Value newKernel = tensor::createCanonicalRankReducingExtractSliceOp(		Value newKernel = tensor::createCanonicalRankReducingExtractSliceOp(
rewriter, loc, kernel, newKernelType);		rewriter, loc, kernel, newKernelType);
Value newOutput = tensor::createCanonicalRankReducingExtractSliceOp(		Value newOutput = tensor::createCanonicalRankReducingExtractSliceOp(
Show All 34 Lines

mlir/lib/Dialect/Vector/VectorTransforms.cpp

Show First 20 Lines • Show All 88 Lines • ▼ Show 20 Lines	static Value reshapeLoad(Location loc, Value val, VectorType type,
Type lowType = VectorType::Builder(type).dropDim(0);		Type lowType = VectorType::Builder(type).dropDim(0);
// At extraction dimension?		// At extraction dimension?
if (index == 0) {		if (index == 0) {
auto posAttr = rewriter.getI64ArrayAttr(pos);		auto posAttr = rewriter.getI64ArrayAttr(pos);
return rewriter.create<vector::ExtractOp>(loc, lowType, val, posAttr);		return rewriter.create<vector::ExtractOp>(loc, lowType, val, posAttr);
}		}
// Unroll leading dimensions.		// Unroll leading dimensions.
VectorType vType = lowType.cast<VectorType>();		VectorType vType = lowType.cast<VectorType>();
auto resType = VectorType::Builder(type).dropDim(index).cast<VectorType>();		Type resType = VectorType::Builder(type).dropDim(index);
		auto resVectorType = resType.cast<VectorType>();
Value result = rewriter.create<arith::ConstantOp>(		Value result = rewriter.create<arith::ConstantOp>(
loc, resType, rewriter.getZeroAttr(resType));		loc, resVectorType, rewriter.getZeroAttr(resVectorType));
for (int64_t d = 0, e = resType.getDimSize(0); d < e; d++) {		for (int64_t d = 0, e = resVectorType.getDimSize(0); d < e; d++) {
auto posAttr = rewriter.getI64ArrayAttr(d);		auto posAttr = rewriter.getI64ArrayAttr(d);
Value ext = rewriter.create<vector::ExtractOp>(loc, vType, val, posAttr);		Value ext = rewriter.create<vector::ExtractOp>(loc, vType, val, posAttr);
Value load = reshapeLoad(loc, ext, vType, index - 1, pos, rewriter);		Value load = reshapeLoad(loc, ext, vType, index - 1, pos, rewriter);
result =		result = rewriter.create<vector::InsertOp>(loc, resVectorType, load, result,
rewriter.create<vector::InsertOp>(loc, resType, load, result, posAttr);		posAttr);
}		}
return result;		return result;
}		}

// Helper method to possibly drop a dimension in a store.		// Helper method to possibly drop a dimension in a store.
// TODO		// TODO
static Value reshapeStore(Location loc, Value val, Value result,		static Value reshapeStore(Location loc, Value val, Value result,
VectorType type, int64_t index, int64_t pos,		VectorType type, int64_t index, int64_t pos,
▲ Show 20 Lines • Show All 3,366 Lines • Show Last 20 Lines