This is an archive of the discontinued LLVM Phabricator instance.

[mlir][sparse] Add new concatenate operator to sparse tensor
ClosedPublic

Authored by Peiming on Aug 3 2022, 1:21 PM.

Diff Detail

Event Timeline

Peiming created this revision.Aug 3 2022, 1:21 PM
Herald added a project: Restricted Project.
Peiming requested review of this revision.Aug 3 2022, 1:21 PM

Much easier to review a smaller revision! Thanks.

mlir/include/mlir/Dialect/SparseTensor/IR/SparseTensorOps.td
171

Concatenates a list of tensors into a single tensor.

173

Please mention that all operands have the same rank, and that 0 <= dimension < rank.

mlir/lib/Dialect/SparseTensor/IR/SparseTensorDialect.cpp
359

check

0 <= concatDim < rank

?

364

You could refine the error by emitting the argument number and mismatched rank
(optional, but it is usually better to have precise error messages)

368

Can you add a comment here on what you allow for dynamic sizes? The logic for static sizes is clear (all the same for != concatDim, and the sum of all if == concatDim), but it is not so clear which cases you allow when some are dynamic. Would it make sense to simply require them all to be static for the first version? Did you observe a need to support dynamic sizes?
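The static-size rule discussed here can be sketched in plain C++ (this is an illustration of the shape rule only, not the actual MLIR API; `inferConcatShape` is a hypothetical helper):

```cpp
#include <cstddef>
#include <cstdint>
#include <vector>

// Sketch of the static-size rule: all inputs must agree on every dimension
// except concatDim, and the output size along concatDim is the sum of the
// input sizes along it. Returns the inferred output shape, or an empty
// vector if the shapes are incompatible.
std::vector<int64_t>
inferConcatShape(const std::vector<std::vector<int64_t>> &inputs,
                 size_t concatDim) {
  if (inputs.empty())
    return {};
  std::vector<int64_t> result = inputs[0];
  if (concatDim >= result.size())
    return {}; // concatDim must satisfy concatDim < rank
  for (size_t i = 1; i < inputs.size(); ++i) {
    const std::vector<int64_t> &shape = inputs[i];
    if (shape.size() != result.size())
      return {}; // rank mismatch
    for (size_t d = 0; d < shape.size(); ++d) {
      if (d == concatDim)
        result[d] += shape[d]; // sizes along concatDim add up
      else if (shape[d] != result[d])
        return {}; // all other dimensions must match exactly
    }
  }
  return result;
}
```

For example, concatenating shapes 2x4 and 3x4 along dimension 0 yields 5x4, while a mismatch in a non-concat dimension is rejected.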

Peiming added inline comments.Aug 3 2022, 4:36 PM
mlir/lib/Dialect/SparseTensor/IR/SparseTensorDialect.cpp
359

The concatDim is of type index, which should guarantee that 0 <= concatDim.

I will add a check that concatDim < rank.

368

I will require them all to be static then

Peiming updated this revision to Diff 450020.Aug 4 2022, 9:36 AM

addressed comments

Peiming marked 5 inline comments as done.Aug 4 2022, 9:38 AM
Peiming added inline comments.
mlir/lib/Dialect/SparseTensor/IR/SparseTensorDialect.cpp
364

See L375

368

See SparseTensorOps.td@L179

aartbik added inline comments.Aug 4 2022, 3:02 PM
mlir/include/mlir/Dialect/SparseTensor/IR/SparseTensorOps.td
174

typo: negaive -> negative

also, although you are right that (unsigned) indices are always non-negative, I would, at least here in the doc, simply say

0 <= dimension < rank

for readability

mlir/lib/Dialect/SparseTensor/IR/SparseTensorDialect.cpp
361

no { } needed in single stmt if

also, we can decide to allow size 1 and simply fold it into itself
(this has some nice properties when progressively lowering stuff into lower-ranked parts)

367

no { }

373

same, no { } needed

382

and here, no { } , also below

Peiming updated this revision to Diff 450165.Aug 4 2022, 4:06 PM
Peiming marked 2 inline comments as done.

Addressed comments

Peiming marked 4 inline comments as done.Aug 4 2022, 4:08 PM
Peiming added inline comments.
mlir/lib/Dialect/SparseTensor/IR/SparseTensorDialect.cpp
361

Should I do it now or in the future (when it is needed)?

Also, we probably need to lower it to sparse_tensor.convert in the case that the input and output tensors have different encodings.

aartbik accepted this revision.Aug 8 2022, 8:59 AM

Two last nits, but good to go

mlir/lib/Dialect/SparseTensor/IR/SparseTensorDialect.cpp
361

We can do that later. I was just thinking "out loud" here.

404

it works, but I would find code that assigns the first dimension size here and then checks whether they are all equal, without the "prev" trick, a bit cleaner (and you have just verified there are sufficient inputs for this)
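The suggested style can be sketched as follows (plain C++, not the actual verifier code; `allNonConcatDimsMatch` is a hypothetical helper that assumes the caller has already verified there is at least one input):

```cpp
#include <cstddef>
#include <cstdint>
#include <vector>

// Sketch of the reviewer's suggestion: take the first operand's shape as
// the reference, then compare every operand against it, instead of
// threading a "prev" value through the loop.
bool allNonConcatDimsMatch(const std::vector<std::vector<int64_t>> &shapes,
                           size_t concatDim) {
  // Precondition (verified earlier): shapes is non-empty and all ranks match.
  const std::vector<int64_t> &first = shapes.front();
  for (const std::vector<int64_t> &shape : shapes)
    for (size_t d = 0; d < first.size(); ++d)
      if (d != concatDim && shape[d] != first[d])
        return false; // a non-concat dimension disagrees with the first operand
  return true;
}
```

Comparing everything against the first operand also makes error messages simpler, since the reference shape is fixed.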

This revision is now accepted and ready to land.Aug 8 2022, 8:59 AM
This revision was automatically updated to reflect the committed changes.
Peiming marked an inline comment as done.

Some post-landing comments since I was out on vacation

mlir/include/mlir/Dialect/SparseTensor/IR/SparseTensorOps.td
173

Note that there's an AllRanksMatch trait to ensure that ranks match between operands (and/or results iiuc). I'll post a CL to add that. Don't know that there's a premade trait for the other constraint, but it should be easy enough to write up

179–180

As I mentioned in D130287, I think it'd be good to have this op not introduce any new dynamic sizes but rather force there to be an explicit cast doing so (like memref::CollapseShapeOp).

mlir/lib/Dialect/SparseTensor/IR/SparseTensorDialect.cpp
368

@aartbik fwiw, we had a big long discussion over at D130287. IMO the only valid thing would be to allow dynamic sizes for the dimension being concatenated along (and if any input has such, then the corresponding output dimension would also be dynamic), since anything else would require introducing runtime assertions (or a sufficiently powerful analysis to statically determine that those assertions must always succeed).
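The dynamic-size rule proposed here can be sketched in plain C++ (again an illustration, not the MLIR API; dynamic sizes are encoded as -1 here purely for the sketch, and `computeConcatDimSize` is a hypothetical helper):

```cpp
#include <cstddef>
#include <cstdint>
#include <vector>

constexpr int64_t kDynamic = -1; // stand-in encoding for a dynamic size

// Sketch of the proposed rule: dynamic sizes are permitted only along the
// concatenation dimension; if any input is dynamic there, the output is
// dynamic there too. Returns false if a non-concat dimension is dynamic.
bool computeConcatDimSize(const std::vector<std::vector<int64_t>> &inputs,
                          size_t concatDim, int64_t &outSize) {
  outSize = 0;
  bool anyDynamic = false;
  for (const std::vector<int64_t> &shape : inputs) {
    for (size_t d = 0; d < shape.size(); ++d)
      if (shape[d] == kDynamic && d != concatDim)
        return false; // dynamic sizes allowed only along concatDim
    if (shape[concatDim] == kDynamic)
      anyDynamic = true; // dynamic input makes the output dynamic
    else
      outSize += shape[concatDim]; // static sizes along concatDim add up
  }
  if (anyDynamic)
    outSize = kDynamic;
  return true;
}
```

Under this rule no runtime assertion is ever needed: static non-concat dimensions are checked at verification time, and the concat dimension either sums statically or propagates as dynamic.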

Peiming marked an inline comment as done.Aug 16 2022, 11:43 AM
Peiming added inline comments.
mlir/include/mlir/Dialect/SparseTensor/IR/SparseTensorOps.td
173

Good to know! Thank you!

179–180

Either way is okay for me.