Download Raw Diff

Details

Reviewers

nicolasvasilache
andydavis1

Commits

rG479ee1106153: [mlir] [VectorOps] Introduce vector.transpose

Summary

Introduced in order to introduce specialized lowering passes that implement transposition operations efficiently.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

aartbik created this revision.Mar 19 2020, 4:01 PM

Herald added a project: Restricted Project. · View Herald TranscriptMar 19 2020, 4:01 PM

Herald added subscribers: llvm-commits, Joonsoo, liufengdb and 10 others. · View Herald Transcript

aartbik added reviewers: nicolasvasilache, andydavis1.Mar 19 2020, 4:04 PM

fixed accidentally removed line

Thanks Aart. How hard would this be to generalize this op to make it N-D transpose op, instead of limit to 2D cases? For example, I may want to transpose the inner two dimensions of a N-D vector

Harbormaster completed remote builds in B49809: Diff 251494.Mar 19 2020, 6:07 PM

In D76460#1932449, @andydavis1 wrote:

Thanks Aart. How hard would this be to generalize this op to make it N-D transpose op, instead of limit to 2D cases? For example, I may want to transpose the inner two dimensions of a N-D vector

Not very hard for the reference implementation (so the direct lowering into a bunch of extra/insert operations), but a lot harder for efficient implementations.
Would you rather that I lift the restriction on 2-D right now already and provide a reference lowering for all ranks and sizes?
(happy to do that)

Harbormaster completed remote builds in B49811: Diff 251496.Mar 19 2020, 6:40 PM

Thanks Aart. Yes, if its not too much work, I'd love to see this op take a permutation of dimensions of an N-D vector.

However, if you are concerned about having to support lowerings for all permutations, then how about removing the restriction on the operand being 2-D vectors, but limit the transpose op to picking a single pair of dimensions to transpose.

For example:

func @transpose(%arg0: vector<3x4x11x7xi32>) -> vector<3x4x7x11xi32> {

%0 = vector.transpose %arg0, 2, 3 : vector<3x4x11x7xi32>
return %0 : vector<3x4x7x11xi32>

}

Thanks!

Thanks Aart!

@andydavis1 if you don't mind, I'd like to go DFS on the 2-d transpose and connect it all the way to the LLVM intrinsics from this state.
We have short-term use/benchmarking cases for it.

I think it is easy to extend the op semantics now and have the matcher fail on the non-2-D case.
How do we want to represent permutation? AffineMapAttr or I32ArrayAttr?

limit the transpose op to picking a single pair of dimensions to transpose

This will still be tricky because we'll have a bunch of slices etc.
I'd just go with fail if not a 2-D op so that the lowering remains very simple towards LLVM matrix intrinsics (for now).

How do we want to represent permutation? AffineMapAttr or I32ArrayAttr?

If we go the route of general semantics and dimensions, I think arrayattr would the the right syntax

%1 = vector.tranpose %0 [0,1,3,2] : vector<2x3x4x5xf32>

<----->
  rank-length, all members 0 <= i < rank, proper permutation

It will not be much more work to introduce this syntax, and still verify for proper 2-D transpose (or only lower those for the time being).

WDYT? Right syntax, shall I do it?

nicolasvasilache added inline comments.Mar 20 2020, 11:23 AM

mlir/lib/Dialect/Vector/VectorOps.cpp
1534	Drop all this and use assemblyFormat instead please. See: https://reviews.llvm.org/D75987 for the usage of `TypesMatchWith` for dependent types.

WDYT? Right syntax, shall I do it?

Sure go for it, it is intuitive enough IMO.

aartbik marked an inline comment as done.Mar 20 2020, 11:46 AM

aartbik added inline comments.

mlir/lib/Dialect/Vector/VectorOps.cpp
1534	Yes, the sole purpose of having this as C++ was to compute the type dependence. But from this feedback, I am guessing we can express this in *.td now also. Cool! Let me look into that and make the change.

generalized to n-D vectors, more invalid tests

aartbik retitled this revision from [mlir] [VectorOps] Introduce vector.transpose on 2-D vectors to [mlir] [VectorOps] Introduce vector.transpose.Mar 20 2020, 2:15 PM

aartbik edited the summary of this revision. (Show Details)

PTAL

Thanks Aart!

This revision is now accepted and ready to land.Mar 20 2020, 3:19 PM

rriddle added inline comments.Mar 20 2020, 4:08 PM

mlir/include/mlir/Dialect/Vector/VectorOps.td
1284	Please make sure that you use mlir code blocks for descriptions.
mlir/lib/Dialect/Vector/VectorOps.cpp
1538	nit: Prefer using llvm::SmallBitVector or llvm::BitVector.

Closed by commit rG479ee1106153: [mlir] [VectorOps] Introduce vector.transpose (authored by aartbik). · Explain WhyMar 20 2020, 4:18 PM

This revision was automatically updated to reflect the committed changes.

Harbormaster completed remote builds in B49962: Diff 251764.Mar 20 2020, 4:51 PM

aartbik marked 2 inline comments as done.Mar 20 2020, 5:26 PM

aartbik added inline comments.

mlir/include/mlir/Dialect/Vector/VectorOps.td
1284	You mean block right? If so, we seem to have been a bit inconsistent with that anyway. Follow up coming....
mlir/lib/Dialect/Vector/VectorOps.cpp
1538	noted, will follow up

Harbormaster completed remote builds in B49958: Diff 251759.Mar 20 2020, 5:55 PM

Diff 251797

mlir/include/mlir/Dialect/Vector/VectorOps.td

Show First 20 Lines • Show All 1,263 Lines • ▼ Show 20 Lines	def Vector_TupleOp :

let extraClassDeclaration = [{		let extraClassDeclaration = [{
TupleType getResultTupleType() {		TupleType getResultTupleType() {
return getResult().getType().cast<TupleType>();		return getResult().getType().cast<TupleType>();
}		}
}];		}];
}		}

		def Vector_TransposeOp :
		Vector_Op<"transpose", [NoSideEffect,
		PredOpTrait<"operand and result have same element type",
		TCresVTEtIsSameAsOpBase<0, 0>>]>,
		Arguments<(ins AnyVector:$vector, I64ArrayAttr:$transp)>,
		Results<(outs AnyVector:$result)> {
		let summary = "vector transpose operation";
		let description = [{
		Takes a n-D vector and returns the transposed n-D vector defined by
		the permutation of ranks in the n-sized integer array attribute.
		In the operation

		%1 = vector.tranpose %0, [i_1, .., i_n]
		rriddleUnsubmitted Not Done Reply Inline Actions Please make sure that you use mlir code blocks for descriptions. rriddle: Please make sure that you use mlir code blocks for descriptions.
		aartbikAuthorUnsubmitted Done Reply Inline Actions You mean block right? If so, we seem to have been a bit inconsistent with that anyway. Follow up coming.... aartbik: You mean ``` block ``` right? If so, we seem to have been a bit inconsistent with that anyway.
		: vector<d_1 x .. x d_n x f32>
		to vector<d_trans[0] x .. x d_trans[n-1] x f32>

		the transp array [i_1, .., i_n] must be a permutation of [0, .., n-1].

		Example:

		%1 = vector.tranpose %0, [1, 0] : vector<2x3xf32> to vector<3x2xf32>

		[ [a, b, c], [ [a, d],
		[d, e, f] ] -> [b, e],
		[c, f] ]
		}];
		let extraClassDeclaration = [{
		VectorType getVectorType() {
		return vector().getType().cast<VectorType>();
		}
		VectorType getResultType() {
		return result().getType().cast<VectorType>();
		}
		}];
		let assemblyFormat = [{
		$vector `,` $transp attr-dict `:` type($vector) `to` type($result)
		}];
		}

def Vector_TupleGetOp :		def Vector_TupleGetOp :
Vector_Op<"tuple_get", [NoSideEffect]>,		Vector_Op<"tuple_get", [NoSideEffect]>,
Arguments<(ins TupleOf<[AnyVector]>:$vectors, APIntAttr:$index)>,		Arguments<(ins TupleOf<[AnyVector]>:$vectors, APIntAttr:$index)>,
Results<(outs AnyVector)> {		Results<(outs AnyVector)> {
let summary = "vector tuple get operation";		let summary = "vector tuple get operation";
let description = [{		let description = [{
Returns the tuple element of 'vectors' at 'index'.		Returns the tuple element of 'vectors' at 'index'.

▲ Show 20 Lines • Show All 122 Lines • Show Last 20 Lines

mlir/lib/Dialect/Vector/VectorOps.cpp

Show First 20 Lines • Show All 1,515 Lines • ▼ Show 20 Lines	static void print(OpAsmPrinter &p, TupleOp op) {
p.printOptionalAttrDict(op.getAttrs());		p.printOptionalAttrDict(op.getAttrs());
p << " : ";		p << " : ";
interleaveComma(op.getOperation()->getOperandTypes(), p);		interleaveComma(op.getOperation()->getOperandTypes(), p);
}		}

static LogicalResult verify(TupleOp op) { return success(); }		static LogicalResult verify(TupleOp op) { return success(); }

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
		// TransposeOp
		//===----------------------------------------------------------------------===//

		static LogicalResult verify(TransposeOp op) {
		VectorType vectorType = op.getVectorType();
		VectorType resultType = op.getResultType();
		int64_t rank = resultType.getRank();
		if (vectorType.getRank() != rank)
		return op.emitOpError("vector result rank mismatch: ") << rank;
		// Verify transposition array.
		auto transpAttr = op.transp().getValue();
		nicolasvasilacheUnsubmitted Not Done Reply Inline Actions Drop all this and use assemblyFormat instead please. See: https://reviews.llvm.org/D75987 for the usage of `TypesMatchWith` for dependent types. nicolasvasilache: Drop all this and use assemblyFormat instead please. See: https://reviews.llvm.org/D75987 for…
		aartbikAuthorUnsubmitted Done Reply Inline Actions Yes, the sole purpose of having this as C++ was to compute the type dependence. But from this feedback, I am guessing we can express this in .td now also. Cool! Let me look into that and make the change. aartbik:* Yes, the sole purpose of having this as C++ was to compute the type dependence. But from this…
		int64_t size = transpAttr.size();
		if (rank != size)
		return op.emitOpError("transposition length mismatch: ") << size;
		SmallVector<bool, 8> seen(rank, false);
		rriddleUnsubmitted Not Done Reply Inline Actions nit: Prefer using llvm::SmallBitVector or llvm::BitVector. rriddle: nit: Prefer using llvm::SmallBitVector or llvm::BitVector.
		aartbikAuthorUnsubmitted Done Reply Inline Actions noted, will follow up aartbik: noted, will follow up
		for (auto ta : llvm::enumerate(transpAttr)) {
		int64_t i = ta.value().cast<IntegerAttr>().getInt();
		if (i < 0 \|\| i >= rank)
		return op.emitOpError("transposition index out of range: ") << i;
		if (seen[i])
		return op.emitOpError("duplicate position index: ") << i;
		seen[i] = true;
		if (resultType.getDimSize(ta.index()) != vectorType.getDimSize(i))
		return op.emitOpError("dimension size mismatch at: ") << i;
		}
		return success();
		}

		//===----------------------------------------------------------------------===//
// TupleGetOp		// TupleGetOp
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

static ParseResult parseTupleGetOp(OpAsmParser &parser,		static ParseResult parseTupleGetOp(OpAsmParser &parser,
OperationState &result) {		OperationState &result) {
OpAsmParser::OperandType operandInfo;		OpAsmParser::OperandType operandInfo;
IntegerAttr indexAttr;		IntegerAttr indexAttr;
StringRef indexAttrName = TupleGetOp::getIndexAttrName();		StringRef indexAttrName = TupleGetOp::getIndexAttrName();
▲ Show 20 Lines • Show All 136 Lines • Show Last 20 Lines

mlir/test/Dialect/Vector/invalid.mlir

	Show First 20 Lines • Show All 1,043 Lines • ▼ Show 20 Lines

	func @reduce_unsupported_rank(%arg0: vector<4x16xf32>) -> f32 {			func @reduce_unsupported_rank(%arg0: vector<4x16xf32>) -> f32 {
	// expected-error@+1 {{'vector.reduction' op unsupported reduction rank: 2}}			// expected-error@+1 {{'vector.reduction' op unsupported reduction rank: 2}}
	%0 = vector.reduction "add", %arg0 : vector<4x16xf32> into f32			%0 = vector.reduction "add", %arg0 : vector<4x16xf32> into f32
	}			}

	// -----			// -----

				func @transpose_rank_mismatch(%arg0: vector<4x16x11xf32>) {
				// expected-error@+1 {{'vector.transpose' op vector result rank mismatch: 1}}
				%0 = vector.transpose %arg0, [2, 1, 0] : vector<4x16x11xf32> to vector<100xf32>
				}

				// -----

				func @transpose_length_mismatch(%arg0: vector<4x4xf32>) {
				// expected-error@+1 {{'vector.transpose' op transposition length mismatch: 3}}
				%0 = vector.transpose %arg0, [2, 0, 1] : vector<4x4xf32> to vector<4x4xf32>
				}

				// -----

				func @transpose_index_oob(%arg0: vector<4x4xf32>) {
				// expected-error@+1 {{'vector.transpose' op transposition index out of range: 2}}
				%0 = vector.transpose %arg0, [2, 0] : vector<4x4xf32> to vector<4x4xf32>
				}

				// -----

				func @transpose_index_dup(%arg0: vector<4x4xf32>) {
				// expected-error@+1 {{'vector.transpose' op duplicate position index: 0}}
				%0 = vector.transpose %arg0, [0, 0] : vector<4x4xf32> to vector<4x4xf32>
				}

				// -----

				func @transpose_dim_size_mismatch(%arg0: vector<11x7x3x2xi32>) {
				// expected-error@+1 {{'vector.transpose' op dimension size mismatch at: 0}}
				%0 = vector.transpose %arg0, [3, 0, 1, 2] : vector<11x7x3x2xi32> to vector<2x3x7x11xi32>
				}

				// -----

	func @type_cast_layout(%arg0: memref<4x3xf32, affine_map<(d0, d1)[s0, s1, s2] -> (d0 * s0 + d1 * s1 + s2)>>) {			func @type_cast_layout(%arg0: memref<4x3xf32, affine_map<(d0, d1)[s0, s1, s2] -> (d0 * s0 + d1 * s1 + s2)>>) {
	// expected-error@+1 {{expects operand to be a memref with no layout}}			// expected-error@+1 {{expects operand to be a memref with no layout}}
	%0 = vector.type_cast %arg0: memref<4x3xf32, affine_map<(d0, d1)[s0, s1, s2] -> (d0 * s0 + d1 * s1 + s2)>> to memref<vector<4x3xf32>>			%0 = vector.type_cast %arg0: memref<4x3xf32, affine_map<(d0, d1)[s0, s1, s2] -> (d0 * s0 + d1 * s1 + s2)>> to memref<vector<4x3xf32>>
	}			}

mlir/test/Dialect/Vector/ops.mlir

Show First 20 Lines • Show All 309 Lines • ▼ Show 20 Lines	func @reduce_int(%arg0: vector<16xi32>) -> i32 {
vector.reduction "and", %arg0 : vector<16xi32> into i32		vector.reduction "and", %arg0 : vector<16xi32> into i32
// CHECK: vector.reduction "or", %{{.*}} : vector<16xi32> into i32		// CHECK: vector.reduction "or", %{{.*}} : vector<16xi32> into i32
vector.reduction "or", %arg0 : vector<16xi32> into i32		vector.reduction "or", %arg0 : vector<16xi32> into i32
// CHECK: %[[X:.]] = vector.reduction "xor", %{{.}} : vector<16xi32> into i32		// CHECK: %[[X:.]] = vector.reduction "xor", %{{.}} : vector<16xi32> into i32
%0 = vector.reduction "xor", %arg0 : vector<16xi32> into i32		%0 = vector.reduction "xor", %arg0 : vector<16xi32> into i32
// CHECK: return %[[X]] : i32		// CHECK: return %[[X]] : i32
return %0 : i32		return %0 : i32
}		}

		// CHECK-LABEL: transpose_fp
		func @transpose_fp(%arg0: vector<3x7xf32>) -> vector<7x3xf32> {
		%0 = vector.transpose %arg0, [1, 0] : vector<3x7xf32> to vector<7x3xf32>
		return %0 : vector<7x3xf32>
		}

		// CHECK-LABEL: transpose_int
		func @transpose_int(%arg0: vector<11x7x3x2xi32>) -> vector<2x11x7x3xi32> {
		%0 = vector.transpose %arg0, [3, 0, 1, 2] : vector<11x7x3x2xi32> to vector<2x11x7x3xi32>
		return %0 : vector<2x11x7x3xi32>
		}

This is an archive of the discontinued LLVM Phabricator instance.

[mlir] [VectorOps] Introduce vector.transpose
ClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 251797

mlir/include/mlir/Dialect/Vector/VectorOps.td

mlir/lib/Dialect/Vector/VectorOps.cpp

mlir/test/Dialect/Vector/invalid.mlir

mlir/test/Dialect/Vector/ops.mlir

This is an archive of the discontinued LLVM Phabricator instance.

[mlir] [VectorOps] Introduce vector.transposeClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 251797

mlir/include/mlir/Dialect/Vector/VectorOps.td

mlir/lib/Dialect/Vector/VectorOps.cpp

mlir/test/Dialect/Vector/invalid.mlir

mlir/test/Dialect/Vector/ops.mlir

[mlir] [VectorOps] Introduce vector.transpose
ClosedPublic