Diff 543646

mlir/include/mlir/Dialect/Tensor/IR/TensorOps.td

Show First 20 Lines • Show All 254 Lines • ▼ Show 20 Lines	def Tensor_ExtractOp : Tensor_Op<"extract", [
Pure,		Pure,
TypesMatchWith<"result type matches element type of tensor",		TypesMatchWith<"result type matches element type of tensor",
"tensor", "result",		"tensor", "result",
"::llvm::cast<TensorType>($_self).getElementType()">]> {		"::llvm::cast<TensorType>($_self).getElementType()">]> {
let summary = "element extraction operation";		let summary = "element extraction operation";
let description = [{		let description = [{
The `tensor.extract` op reads a ranked tensor and returns one element as		The `tensor.extract` op reads a ranked tensor and returns one element as
specified by the given indices. The result of the op is a value with the		specified by the given indices. The result of the op is a value with the
same type as the elements of the tensor. The arity of indices must match		same type as the elements of the tensor.
the rank of the accessed value. All indices should all be of `index` type.
		The arity of indices must match the rank of the accessed value. All indices
		should all be of `index` type. They must be non-negative and within the
		kuharUnsubmitted Not Done Reply Inline Actions Did you want to highlight some difference by using 'should' and 'must'? If there is none, I'd prefer to use 'must' everywhere kuhar: Did you want to highlight some difference by using 'should' and 'must'? If there is none, I'd…
		bounds of the tensor.

Example:		Example:

```mlir		```mlir
%4 = tensor.extract %t[%1, %2] : tensor<4x4xi32>		%4 = tensor.extract %t[%1, %2] : tensor<4x4xi32>
%5 = tensor.extract %rt[%1, %2] : tensor<?x?xi32>		%5 = tensor.extract %rt[%1, %2] : tensor<?x?xi32>
```		```
}];		}];
Show All 23 Lines	def Tensor_ExtractSliceOp : Tensor_OpWithOffsetSizesAndStrides<"extract_slice", [
let description = [{		let description = [{
The "extract_slice" operation extract a tensor from another tensor as		The "extract_slice" operation extract a tensor from another tensor as
specified by the operation's offsets, sizes and strides arguments.		specified by the operation's offsets, sizes and strides arguments.

The extract_slice operation supports the following arguments:		The extract_slice operation supports the following arguments:

* source: the "base" tensor from which to extract a slice.		* source: the "base" tensor from which to extract a slice.
* offsets: tensor-rank number of offsets into the "base" tensor from which		* offsets: tensor-rank number of offsets into the "base" tensor from which
to extract the slice.		to extract the slice. Offsets must be non-negative and within the
		bounds of the source tensor.
* sizes: tensor-rank number of sizes which specify the sizes of the result		* sizes: tensor-rank number of sizes which specify the sizes of the result
tensor type.		tensor type. "offset + size" must not exceed the respective
		source dimension size.
* strides: tensor-rank number of strides specifying subsampling in each		* strides: tensor-rank number of strides specifying subsampling in each
dimension.		dimension.

The representation based on offsets, sizes and strides support a		The representation based on offsets, sizes and strides support a
partially-static specification via attributes specified through the		partially-static specification via attributes specified through the
`static_offsets`, `static_sizes` and `static_strides` arguments. A special		`static_offsets`, `static_sizes` and `static_strides` arguments. A special
sentinel value ShapedType::kDynamic encodes that the corresponding entry has		sentinel value ShapedType::kDynamic encodes that the corresponding entry has
a dynamic value.		a dynamic value.
▲ Show 20 Lines • Show All 411 Lines • ▼ Show 20 Lines	def Tensor_InsertOp : Tensor_Op<"insert", [
let description = [{		let description = [{
The `tensor.insert` op inserts a scalar into a ranked tensor `dest` as		The `tensor.insert` op inserts a scalar into a ranked tensor `dest` as
specified by the operation's indices.		specified by the operation's indices.

It returns a copy of `dest` with the indexed position updated to the value		It returns a copy of `dest` with the indexed position updated to the value
of `scalar`.		of `scalar`.

The arity of `indices `must match the rank of the tensor `dest`. All		The arity of `indices `must match the rank of the tensor `dest`. All
indices should be of `index` type.		indices should be of `index` type. They must be non-negative and within the
		kuharUnsubmitted Not Done Reply Inline Actions same here kuhar: same here
		bounds of the tensor.

Example:		Example:

```mlir		```mlir
%4 = tensor.insert %t into %dest[%1, %2] : tensor<4x4xi32>		%4 = tensor.insert %t into %dest[%1, %2] : tensor<4x4xi32>
%5 = tensor.insert %rt into %dest[%1, %2] : tensor<?x?xi32>		%5 = tensor.insert %rt into %dest[%1, %2] : tensor<?x?xi32>
```		```
}];		}];
Show All 39 Lines	let description = [{
It returns a copy of `dest` with the proper slice updated with the value		It returns a copy of `dest` with the proper slice updated with the value
of `source`.		of `source`.

The insert_slice operation supports the following arguments:		The insert_slice operation supports the following arguments:

* source: the tensor that is inserted.		* source: the tensor that is inserted.
* dest: the tensor into which the source tensor is inserted.		* dest: the tensor into which the source tensor is inserted.
* offsets: tensor-rank number of offsets into the `dest` tensor into which		* offsets: tensor-rank number of offsets into the `dest` tensor into which
the slice is inserted.		the slice is inserted. Offsets must be non-negative.
* sizes: tensor-rank number of sizes which specify the sizes of the source		* sizes: tensor-rank number of sizes which specify the sizes of the source
tensor type.		tensor type. "offset + size" must not exceed the respective
		destination dimension size.
* strides: tensor-rank number of strides that specify subsampling in each		* strides: tensor-rank number of strides that specify subsampling in each
dimension.		dimension.

The representation based on offsets, sizes and strides support a		The representation based on offsets, sizes and strides support a
partially-static specification via attributes specified through the		partially-static specification via attributes specified through the
`static_offsets`, `static_sizes` and `static_strides` arguments. A special		`static_offsets`, `static_sizes` and `static_strides` arguments. A special
sentinel value ShapedType::kDynamic encodes that the corresponding entry has		sentinel value ShapedType::kDynamic encodes that the corresponding entry has
a dynamic value.		a dynamic value.
▲ Show 20 Lines • Show All 591 Lines • ▼ Show 20 Lines	let description = [{
though it has no side effects, because it will get DCEd during		though it has no side effects, because it will get DCEd during
canonicalization.		canonicalization.

The parallel_insert_slice operation supports the following arguments:		The parallel_insert_slice operation supports the following arguments:

* source: the tensor that is inserted.		* source: the tensor that is inserted.
* dest: the tensor into which the source tensor is inserted.		* dest: the tensor into which the source tensor is inserted.
* offsets: tensor-rank number of offsets into the `dest` tensor into which		* offsets: tensor-rank number of offsets into the `dest` tensor into which
the slice is inserted.		the slice is inserted. Offsets must be non-negative.
* sizes: tensor-rank number of sizes which specify the sizes of the source		* sizes: tensor-rank number of sizes which specify the sizes of the source
tensor type.		tensor type. "offset + size" must not exceed the respective
		destination dimension size.
* strides: tensor-rank number of strides that specify subsampling in each		* strides: tensor-rank number of strides that specify subsampling in each
dimension.		dimension.

The representation based on offsets, sizes and strides support a		The representation based on offsets, sizes and strides support a
partially-static specification via attributes specified through the		partially-static specification via attributes specified through the
`static_offsets`, `static_sizes` and `static_strides` arguments. A special		`static_offsets`, `static_sizes` and `static_strides` arguments. A special
sentinel value ShapedType::kDynamic encodes that the corresponding entry has		sentinel value ShapedType::kDynamic encodes that the corresponding entry has
a dynamic value.		a dynamic value.
▲ Show 20 Lines • Show All 556 Lines • Show Last 20 Lines

mlir/lib/Dialect/Tensor/IR/TensorOps.cpp

Show First 20 Lines • Show All 160 Lines • ▼ Show 20 Lines

for (const auto &size : enumerate(mixedSizes)) {

droppedDims.set(size.index());

}

assert(shapePos == static_cast<int64_t>(reducedShape.size()) &&

"dimension mismatch");

return droppedDims;

}

LogicalResult verifySliceOffsetsAndSizes(Operation *op, RankedTensorType type,

ArrayRef<int64_t> mixedOffsets,

ArrayRef<int64_t> mixedSizes) {

// No out-of-bounds accesses.

for (int64_t i = 0, e = type.getRank(); i < e; ++i) {

int64_t offset = mixedOffsets[i];

int64_t size = mixedSizes[i];

if (!type.isDynamicDim(i) && !ShapedType::isDynamic(size))

if (!ShapedType::isDynamic(offset) && offset + size > type.getDimSize(i))

return op->emitOpError("dimension #") << i << " runs out of bounds";

}

// No negative offsets.

for (const auto &it : llvm::enumerate(mixedOffsets))

if (!ShapedType::isDynamic(it.value()))

if (it.value() < 0)

return op->emitOpError("offset #") << it.index() << " is negative";

kuharUnsubmitted

Not Done

// No negative offsets.

- for (const auto &it : llvm::enumerate(mixedOffsets))

- if (!ShapedType::isDynamic(it.value()))

- if (it.value() < 0)

- return op->emitOpError("offset #") << it.index() << " is negative";

+ for (auto [i, offset] : llvm::enumerate(mixedOffsets))

+ if (!ShapedType::isDynamic(offset))

+ if (offset < 0)

+ return op->emitOpError("offset #") << i << " is negative";

return success();

kuhar:

return success();

}

//===----------------------------------------------------------------------===//

// BitcastOp

//===----------------------------------------------------------------------===//

bool BitcastOp::areCastCompatible(TypeRange inputs, TypeRange outputs) {

if (inputs.size() != 1 || outputs.size() != 1)

return false;

Type a = inputs.front(), b = outputs.front();

▲ Show 20 Lines • Show All 1,657 Lines • ▼ Show 20 Lines

}

/// Verifier for ExtractSliceOp.

LogicalResult ExtractSliceOp::verify() {

// Verify result type against inferred type.

RankedTensorType expectedType = ExtractSliceOp::inferResultType(

getSourceType(), getMixedOffsets(), getMixedSizes(), getMixedStrides());

SliceVerificationResult result = isRankReducedType(expectedType, getType());

return produceSliceErrorMsg(result, *this, expectedType);

if (failed(produceSliceErrorMsg(result, *this, expectedType)))

return failure();

return verifySliceOffsetsAndSizes(*this, getSourceType(), getStaticOffsets(),

getStaticSizes());

}

llvm::SmallBitVector ExtractSliceOp::getDroppedDims() {

return ::getDroppedDims(getType().getShape(), getMixedSizes());

}

FailureOr<Value>

ExtractSliceOp::rankReduceIfNeeded(OpBuilder &b, Location loc, Value value,

▲ Show 20 Lines • Show All 365 Lines • ▼ Show 20 Lines

}

/// Verifier for InsertSliceOp.

LogicalResult InsertSliceOp::verify() {

RankedTensorType expectedType;

SliceVerificationResult result =

verifyInsertSliceOp(getSourceType(), getType(), getStaticOffsets(),

getStaticSizes(), getStaticStrides(), &expectedType);

return produceSliceErrorMsg(result, *this, expectedType);

if (failed(produceSliceErrorMsg(result, *this, expectedType)))

return failure();

return verifySliceOffsetsAndSizes(*this, getDestType(), getStaticOffsets(),

getStaticSizes());

}

/// If we have two consecutive InsertSliceOp writing to the same slice, we

/// can mutate the second InsertSliceOp's destination to the first one's.

///

/// Example:

///

/// ```mlir

▲ Show 20 Lines • Show All 888 Lines • ▼ Show 20 Lines

LogicalResult ParallelInsertSliceOp::verify() {

if (!isa<ParallelCombiningOpInterface>(getOperation()->getParentOp()))

return this->emitError("expected ParallelCombiningOpInterface parent, got:")

<< *(getOperation()->getParentOp());

RankedTensorType expectedType;

SliceVerificationResult result =

verifyInsertSliceOp(getSourceType(), getDestType(), getStaticOffsets(),

getStaticSizes(), getStaticStrides(), &expectedType);

return produceSliceErrorMsg(result, *this, expectedType);

if (failed(produceSliceErrorMsg(result, *this, expectedType)))

return failure();

return verifySliceOffsetsAndSizes(*this, getDestType(), getStaticOffsets(),

getStaticSizes());

}

void ParallelInsertSliceOp::getCanonicalizationPatterns(

RewritePatternSet &results, MLIRContext *context) {

results.add<InsertSliceOpConstantArgumentFolder<ParallelInsertSliceOp>,

InsertSliceOpCastFolder<ParallelInsertSliceOp>,

InsertSliceOpSourceCastInserter<ParallelInsertSliceOp>>(context);

}

▲ Show 20 Lines • Show All 866 Lines • Show Last 20 Lines

mlir/test/Dialect/Linalg/vectorize-tensor-extract-masked.mlir

	Show First 20 Lines • Show All 83 Lines • ▼ Show 20 Lines
	^bb1(%arg1: !transform.any_op):			^bb1(%arg1: !transform.any_op):
	%0 = transform.structured.match ops{["linalg.generic"]} in %arg1 : (!transform.any_op) -> !transform.any_op			%0 = transform.structured.match ops{["linalg.generic"]} in %arg1 : (!transform.any_op) -> !transform.any_op
	transform.structured.masked_vectorize %0 vector_sizes [1, 4] { vectorize_nd_extract } : !transform.any_op			transform.structured.masked_vectorize %0 vector_sizes [1, 4] { vectorize_nd_extract } : !transform.any_op
	}			}

	// -----			// -----

	func.func @masked_vectorize_nd_tensor_extract_with_affine_apply_gather(%6: tensor<80x16xf32>, %arg0: index, %extracted_slice : tensor<1x3xf32>) -> tensor<1x3xf32> {			func.func @masked_vectorize_nd_tensor_extract_with_affine_apply_gather(%6: tensor<80x16xf32>, %arg0: index, %extracted_slice : tensor<1x3xf32>) -> tensor<1x3xf32> {
	%c16 = arith.constant 16 : index			%c15 = arith.constant 15 : index
	%1 = linalg.generic {			%1 = linalg.generic {
	indexing_maps = [affine_map<(d0, d1) -> (d0, d1)>],			indexing_maps = [affine_map<(d0, d1) -> (d0, d1)>],
	iterator_types = ["parallel", "parallel"]			iterator_types = ["parallel", "parallel"]
	} outs(%extracted_slice : tensor<1x3xf32>) {			} outs(%extracted_slice : tensor<1x3xf32>) {
	^bb0(%out: f32):			^bb0(%out: f32):
	%2 = linalg.index 1 : index			%2 = linalg.index 1 : index
	%3 = affine.apply affine_map<(d0, d1) -> (d0 + d1)>(%2, %arg0)			%3 = affine.apply affine_map<(d0, d1) -> (d0 + d1)>(%2, %arg0)
	%extracted = tensor.extract %6[%3, %c16] : tensor<80x16xf32>			%extracted = tensor.extract %6[%3, %c15] : tensor<80x16xf32>
	linalg.yield %extracted : f32			linalg.yield %extracted : f32
	} -> tensor<1x3xf32>			} -> tensor<1x3xf32>
	return %1 : tensor<1x3xf32>			return %1 : tensor<1x3xf32>
	}			}

	// CHECK-LABEL: func.func @masked_vectorize_nd_tensor_extract_with_affine_apply_gather			// CHECK-LABEL: func.func @masked_vectorize_nd_tensor_extract_with_affine_apply_gather
	// CHECK-DAG: %[[VAL_4:.*]] = arith.constant 1 : index			// CHECK-DAG: %[[VAL_4:.*]] = arith.constant 1 : index
	// CHECK-DAG: %[[VAL_5:.*]] = arith.constant 3 : index			// CHECK-DAG: %[[VAL_5:.*]] = arith.constant 3 : index
	▲ Show 20 Lines • Show All 165 Lines • Show Last 20 Lines

mlir/test/Dialect/Linalg/vectorize-tensor-extract.mlir

Show First 20 Lines • Show All 335 Lines • ▼ Show 20 Lines	^bb1(%arg1: !transform.any_op):
%1 = get_parent_op %0 {isolated_from_above} : (!transform.any_op) -> !transform.any_op		%1 = get_parent_op %0 {isolated_from_above} : (!transform.any_op) -> !transform.any_op
%2 = transform.structured.vectorize %1 { vectorize_nd_extract } : (!transform.any_op) -> !transform.any_op		%2 = transform.structured.vectorize %1 { vectorize_nd_extract } : (!transform.any_op) -> !transform.any_op
}		}

// -----		// -----

// The vectorizer converts `affine.apply` so that the subsequent Ops can be vectorised based on the converted ops. Gather load.		// The vectorizer converts `affine.apply` so that the subsequent Ops can be vectorised based on the converted ops. Gather load.
func.func @vectorize_nd_tensor_extract_with_affine_apply_gather(%6: tensor<80x16xf32>, %arg0: index, %extracted_slice : tensor<1x4xf32>) -> tensor<1x4xf32> {		func.func @vectorize_nd_tensor_extract_with_affine_apply_gather(%6: tensor<80x16xf32>, %arg0: index, %extracted_slice : tensor<1x4xf32>) -> tensor<1x4xf32> {
%c16 = arith.constant 16 : index		%c15 = arith.constant 15 : index
%1 = linalg.generic {		%1 = linalg.generic {
indexing_maps = [affine_map<(d0, d1) -> (d0, d1)>],		indexing_maps = [affine_map<(d0, d1) -> (d0, d1)>],
iterator_types = ["parallel", "parallel"]		iterator_types = ["parallel", "parallel"]
} outs(%extracted_slice : tensor<1x4xf32>) {		} outs(%extracted_slice : tensor<1x4xf32>) {
^bb0(%out: f32):		^bb0(%out: f32):
%2 = linalg.index 1 : index		%2 = linalg.index 1 : index
%3 = affine.apply affine_map<(d0, d1) -> (d0 + d1)>(%2, %arg0)		%3 = affine.apply affine_map<(d0, d1) -> (d0 + d1)>(%2, %arg0)
%extracted = tensor.extract %6[%3, %c16] : tensor<80x16xf32>		%extracted = tensor.extract %6[%3, %c15] : tensor<80x16xf32>
linalg.yield %extracted : f32		linalg.yield %extracted : f32
} -> tensor<1x4xf32>		} -> tensor<1x4xf32>
return %1 : tensor<1x4xf32>		return %1 : tensor<1x4xf32>
}		}

// CHECK-LABEL: func.func @vectorize_nd_tensor_extract_with_affine_apply_gather(		// CHECK-LABEL: func.func @vectorize_nd_tensor_extract_with_affine_apply_gather(
// CHECK-SAME: %[[VAL_0:.*]]: tensor<80x16xf32>,		// CHECK-SAME: %[[VAL_0:.*]]: tensor<80x16xf32>,
// CHECK-SAME: %[[VAL_1:.*]]: index,		// CHECK-SAME: %[[VAL_1:.*]]: index,
// CHECK-SAME: %[[VAL_2:.*]]: tensor<1x4xf32>) -> tensor<1x4xf32> {		// CHECK-SAME: %[[VAL_2:.*]]: tensor<1x4xf32>) -> tensor<1x4xf32> {
// CHECK: %[[VAL_3:.*]] = arith.constant dense<[0, 1, 2, 3]> : vector<4xindex>		// CHECK: %[[VAL_3:.*]] = arith.constant dense<[0, 1, 2, 3]> : vector<4xindex>
// CHECK: %[[VAL_4:.*]] = arith.constant dense<true> : vector<1x4xi1>		// CHECK: %[[VAL_4:.*]] = arith.constant dense<true> : vector<1x4xi1>
// CHECK: %[[VAL_5:.*]] = arith.constant dense<0.000000e+00> : vector<1x4xf32>		// CHECK: %[[VAL_5:.*]] = arith.constant dense<0.000000e+00> : vector<1x4xf32>
// CHECK: %[[VAL_6:.*]] = arith.constant 0 : index		// CHECK: %[[VAL_6:.*]] = arith.constant 0 : index
// CHECK: %[[VAL_7:.*]] = arith.constant dense<16> : vector<1x4xindex>		// CHECK: %[[VAL_7a:.*]] = arith.constant dense<16> : vector<1x4xindex>
		// CHECK: %[[VAL_7b:.*]] = arith.constant dense<15> : vector<1x4xindex>
// CHECK: %[[VAL_8:.*]] = vector.broadcast %[[VAL_1]] : index to vector<4xindex>		// CHECK: %[[VAL_8:.*]] = vector.broadcast %[[VAL_1]] : index to vector<4xindex>
// CHECK: %[[VAL_9:.*]] = arith.addi %[[VAL_8]], %[[VAL_3]] : vector<4xindex>		// CHECK: %[[VAL_9:.*]] = arith.addi %[[VAL_8]], %[[VAL_3]] : vector<4xindex>
// CHECK: %[[VAL_10:.*]] = vector.broadcast %[[VAL_9]] : vector<4xindex> to vector<1x4xindex>		// CHECK: %[[VAL_10:.*]] = vector.broadcast %[[VAL_9]] : vector<4xindex> to vector<1x4xindex>
// CHECK: %[[VAL_11:.*]] = arith.muli %[[VAL_10]], %[[VAL_7]] : vector<1x4xindex>		// CHECK: %[[VAL_11:.*]] = arith.muli %[[VAL_10]], %[[VAL_7a]] : vector<1x4xindex>
// CHECK: %[[VAL_12:.*]] = arith.addi %[[VAL_11]], %[[VAL_7]] : vector<1x4xindex>		// CHECK: %[[VAL_12:.*]] = arith.addi %[[VAL_11]], %[[VAL_7b]] : vector<1x4xindex>
// CHECK: %[[VAL_13:.*]] = vector.gather %[[VAL_0]]{{\[}}%[[VAL_6]], %[[VAL_6]]] {{\[}}%[[VAL_12]]], %[[VAL_4]], %[[VAL_5]] : tensor<80x16xf32>, vector<1x4xindex>, vector<1x4xi1>, vector<1x4xf32> into vector<1x4xf32>		// CHECK: %[[VAL_13:.*]] = vector.gather %[[VAL_0]]{{\[}}%[[VAL_6]], %[[VAL_6]]] {{\[}}%[[VAL_12]]], %[[VAL_4]], %[[VAL_5]] : tensor<80x16xf32>, vector<1x4xindex>, vector<1x4xi1>, vector<1x4xf32> into vector<1x4xf32>
// CHECK: %[[VAL_14:.*]] = vector.transfer_write %[[VAL_13]], %[[VAL_2]]{{\[}}%[[VAL_6]], %[[VAL_6]]] {in_bounds = [true, true]} : vector<1x4xf32>, tensor<1x4xf32>		// CHECK: %[[VAL_14:.*]] = vector.transfer_write %[[VAL_13]], %[[VAL_2]]{{\[}}%[[VAL_6]], %[[VAL_6]]] {in_bounds = [true, true]} : vector<1x4xf32>, tensor<1x4xf32>
// CHECK: return %[[VAL_14]] : tensor<1x4xf32>		// CHECK: return %[[VAL_14]] : tensor<1x4xf32>
// CHECK: }		// CHECK: }

transform.sequence failures(propagate) {		transform.sequence failures(propagate) {
^bb1(%arg1: !transform.any_op):		^bb1(%arg1: !transform.any_op):
%0 = transform.structured.match ops{["linalg.generic"]} in %arg1 : (!transform.any_op) -> !transform.any_op		%0 = transform.structured.match ops{["linalg.generic"]} in %arg1 : (!transform.any_op) -> !transform.any_op
▲ Show 20 Lines • Show All 149 Lines • Show Last 20 Lines

mlir/test/Dialect/Tensor/fold-tensor-subset-ops.mlir

Show First 20 Lines • Show All 257 Lines • ▼ Show 20 Lines	func.func @insert_slice_of_transfer_write_rank_extending(%t1 : tensor<?x?x12xf32>, %v : vector<5x6xf32>, %s : index, %t2 : tensor<5x6xf32>) -> tensor<?x?x12xf32> {
%c0 = arith.constant 0 : index		%c0 = arith.constant 0 : index
%0 = vector.transfer_write %v, %t2[%c0, %c0] {in_bounds = [true, true]} : vector<5x6xf32>, tensor<5x6xf32>		%0 = vector.transfer_write %v, %t2[%c0, %c0] {in_bounds = [true, true]} : vector<5x6xf32>, tensor<5x6xf32>
%1 = tensor.insert_slice %0 into %t1[4, 3, %s] [1, 5, 6] [1, 1, 1] : tensor<5x6xf32> into tensor<?x?x12xf32>		%1 = tensor.insert_slice %0 into %t1[4, 3, %s] [1, 5, 6] [1, 1, 1] : tensor<5x6xf32> into tensor<?x?x12xf32>
return %1 : tensor<?x?x12xf32>		return %1 : tensor<?x?x12xf32>
}		}

// -----		// -----

// CHECK: #[[$map:.*]] = affine_map<()[s0] -> (s0 + 2)>
// CHECK-LABEL: func @insert_slice_of_insert_slice(		// CHECK-LABEL: func @insert_slice_of_insert_slice(
// CHECK-SAME: %[[t:[0-9a-z]*]]: tensor<f32>		// CHECK-SAME: %[[t:[0-9a-z]*]]: tensor<f32>
// CHECK-SAME: %[[r1:[0-9a-z]*]]: tensor<1x14xf32>		// CHECK-SAME: %[[r1:[0-9a-z]*]]: tensor<1x14xf32>
// CHECK-SAME: %[[pos:[0-9a-z]*]]: index		// CHECK-SAME: %[[pos:[0-9a-z]*]]: index
// CHECK: %[[add:.*]] = affine.apply #[[$map]]()[%[[pos]]]		// CHECK: tensor.insert_slice %[[t]] into %[[r1]][0, %[[pos]]] [1, 1] [1, 1] : tensor<f32> into tensor<1x14xf32>
// CHECK: tensor.insert_slice %[[t]] into %[[r1]][4, %[[add]]] [1, 1] [1, 1] : tensor<f32> into tensor<1x14xf32>
func.func @insert_slice_of_insert_slice(%t: tensor<f32>, %r0: tensor<1x1xf32>, %r1: tensor<1x14xf32>, %pos: index)		func.func @insert_slice_of_insert_slice(%t: tensor<f32>, %r0: tensor<1x1xf32>, %r1: tensor<1x14xf32>, %pos: index)
-> tensor<1x14xf32>		-> tensor<1x14xf32>
{		{
%0 = tensor.insert_slice %t into %r0[1, 2] [1, 1] [1, 1]		%0 = tensor.insert_slice %t into %r0[0, 0] [1, 1] [1, 1]
: tensor<f32> into tensor<1x1xf32>		: tensor<f32> into tensor<1x1xf32>
%1 = tensor.insert_slice %0 into %r1[3, %pos] [1, 1] [1, 1]		%1 = tensor.insert_slice %0 into %r1[0, %pos] [1, 1] [1, 1]
: tensor<1x1xf32> into tensor<1x14xf32>		: tensor<1x1xf32> into tensor<1x14xf32>
return %1 : tensor<1x14xf32>		return %1 : tensor<1x14xf32>
}		}

// -----		// -----

// CHECK-LABEL: func @insert_slice_of_insert_slice(		// CHECK-LABEL: func @insert_slice_of_insert_slice(
// CHECK-SAME: %[[t:[0-9a-z]*]]: tensor<f32>		// CHECK-SAME: %[[t:[0-9a-z]*]]: tensor<f32>
// CHECK-SAME: %[[r1:[0-9a-z]*]]: tensor<1x14xf32>		// CHECK-SAME: %[[r1:[0-9a-z]*]]: tensor<1x14xf32>
// CHECK-SAME: %[[pos:[0-9a-z]*]]: index		// CHECK-SAME: %[[pos:[0-9a-z]*]]: index
// CHECK: tensor.insert_slice %[[t]] into %[[r1]][5, %[[pos]]] [1, 1] [1, 1] : tensor<f32> into tensor<1x14xf32>		// CHECK: tensor.insert_slice %[[t]] into %[[r1]][0, %[[pos]]] [1, 1] [1, 1] : tensor<f32> into tensor<1x14xf32>
func.func @insert_slice_of_insert_slice(%t: tensor<f32>, %r0: tensor<1xf32>, %r1: tensor<1x14xf32>, %pos: index)		func.func @insert_slice_of_insert_slice(%t: tensor<f32>, %r0: tensor<1xf32>, %r1: tensor<1x14xf32>, %pos: index)
-> tensor<1x14xf32>		-> tensor<1x14xf32>
{		{
%0 = tensor.insert_slice %t into %r0[2] [1] [1]		%0 = tensor.insert_slice %t into %r0[0] [1] [1]
: tensor<f32> into tensor<1xf32>		: tensor<f32> into tensor<1xf32>
%1 = tensor.insert_slice %0 into %r1[3, %pos] [1, 1] [1, 1]		%1 = tensor.insert_slice %0 into %r1[0, %pos] [1, 1] [1, 1]
: tensor<1xf32> into tensor<1x14xf32>		: tensor<1xf32> into tensor<1x14xf32>
return %1 : tensor<1x14xf32>		return %1 : tensor<1x14xf32>
}		}

// -----		// -----

// This test fails to fold because the size `4` and `%pos` do not match:		// This test fails to fold because the size `4` and `%pos` do not match:
// this requires a copy		// this requires a copy
▲ Show 20 Lines • Show All 89 Lines • Show Last 20 Lines

mlir/test/Dialect/Tensor/invalid.mlir

Show First 20 Lines • Show All 194 Lines • ▼ Show 20 Lines	func.func @extract_slice_wrong_dynamic_type(%t: tensor<8x16x4xf32>, %idx : index) {
%0 = tensor.extract_slice %t[0, 2, 0][4, 4, 4][1, 1, 1]		%0 = tensor.extract_slice %t[0, 2, 0][4, 4, 4][1, 1, 1]
: tensor<8x16x4xf32> to tensor<?x4x4xf32>		: tensor<8x16x4xf32> to tensor<?x4x4xf32>

return		return
}		}

// -----		// -----

		func.func @extract_slice_out_of_bounds(%t: tensor<5xf32>) {
		// expected-error @+1 {{dimension #0 runs out of bounds}}
		%0 = tensor.extract_slice %t[5][4][1] : tensor<5xf32> to tensor<4xf32>

		return
		}

		// -----

		func.func @extract_slice_out_of_bounds_2(%t: tensor<5xf32>) {
		// expected-error @+1 {{dimension #0 runs out of bounds}}
		%0 = tensor.extract_slice %t[3][4][1] : tensor<5xf32> to tensor<4xf32>

		return
		}

		// -----

		func.func @extract_slice_negative(%t: tensor<5xf32>) {
		// expected-error @+1 {{offset #0 is negative}}
		%0 = tensor.extract_slice %t[-1][4][1] : tensor<5xf32> to tensor<4xf32>

		return
		}

		// -----

func.func @insert_slice_wrong_result_rank(%t1: tensor<?xf32>, %t2: tensor<?x?xf32>, %idx : index) {		func.func @insert_slice_wrong_result_rank(%t1: tensor<?xf32>, %t2: tensor<?x?xf32>, %idx : index) {
// expected-error @+1 {{expected rank to be smaller or equal to the other rank.}}		// expected-error @+1 {{expected rank to be smaller or equal to the other rank.}}
%0 = tensor.insert_slice %t2 into %t1[0][4][1] : tensor<?x?xf32> into tensor<?xf32>		%0 = tensor.insert_slice %t2 into %t1[0][4][1] : tensor<?x?xf32> into tensor<?xf32>

return		return
}		}

// -----		// -----
Show All 22 Lines	func.func @insert_slice_wrong_dynamic_type(%t1: tensor<?x4x4xf32>, %t2: tensor<8x16x4xf32>, %idx : index) {
%0 = tensor.insert_slice %t1 into %t2[0, 2, 0][4, 4, 4][1, 1, 1]		%0 = tensor.insert_slice %t1 into %t2[0, 2, 0][4, 4, 4][1, 1, 1]
: tensor<?x4x4xf32> into tensor<8x16x4xf32>		: tensor<?x4x4xf32> into tensor<8x16x4xf32>

return		return
}		}

// -----		// -----

		func.func @insert_slice_out_of_bounds(%t1: tensor<4x4xf32>, %t2: tensor<8x16x2xf32>) {
		// expected-error @+1 {{dimension #2 runs out of bounds}}
		%0 = tensor.insert_slice %t1 into %t2[0, 0, 3][4, 4, 1][1, 1, 1]
		: tensor<4x4xf32> into tensor<8x16x2xf32>

		return
		}

		// -----

		func.func @insert_slice_out_of_bounds_2(%t1: tensor<4x4xf32>, %t2: tensor<8x16x2xf32>) {
		// expected-error @+1 {{dimension #1 runs out of bounds}}
		%0 = tensor.insert_slice %t1 into %t2[0, 15, 0][4, 4, 1][1, 1, 1]
		: tensor<4x4xf32> into tensor<8x16x2xf32>

		return
		}

		// -----

		func.func @insert_slice_negative(%t1: tensor<4x4xf32>, %t2: tensor<8x16x2xf32>) {
		// expected-error @+1 {{offset #2 is negative}}
		%0 = tensor.insert_slice %t1 into %t2[0, 0, -1][4, 4, 1][1, 1, 1]
		: tensor<4x4xf32> into tensor<8x16x2xf32>

		return
		}

		// -----

func.func @illegal_expanding_reshape_dynamic_tensor		func.func @illegal_expanding_reshape_dynamic_tensor
(%arg0: tensor<?x?x?xf32>) -> tensor<?x?x?x4x?xf32> {		(%arg0: tensor<?x?x?xf32>) -> tensor<?x?x?x4x?xf32> {
// expected-error @+1 {{invalid to have a single dimension (2) expanded into multiple dynamic dims (2,4)}}		// expected-error @+1 {{invalid to have a single dimension (2) expanded into multiple dynamic dims (2,4)}}
%0 = tensor.expand_shape %arg0 [[0], [1], [2, 3, 4]]		%0 = tensor.expand_shape %arg0 [[0], [1], [2, 3, 4]]
: tensor<?x?x?xf32> into tensor<?x?x?x4x?xf32>		: tensor<?x?x?xf32> into tensor<?x?x?x4x?xf32>
return %0 : tensor<?x?x?x4x?xf32>		return %0 : tensor<?x?x?x4x?xf32>
}		}

▲ Show 20 Lines • Show All 409 Lines • Show Last 20 Lines

This is an archive of the discontinued LLVM Phabricator instance.

[mlir][tensor] Improve verifiers: detect out-of-bounds accesses
Changes PlannedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 543646

mlir/include/mlir/Dialect/Tensor/IR/TensorOps.td

mlir/lib/Dialect/Tensor/IR/TensorOps.cpp

mlir/test/Dialect/Linalg/vectorize-tensor-extract-masked.mlir

mlir/test/Dialect/Linalg/vectorize-tensor-extract.mlir

mlir/test/Dialect/Tensor/fold-tensor-subset-ops.mlir

mlir/test/Dialect/Tensor/invalid.mlir

This is an archive of the discontinued LLVM Phabricator instance.

[mlir][tensor] Improve verifiers: detect out-of-bounds accessesChanges PlannedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 543646

mlir/include/mlir/Dialect/Tensor/IR/TensorOps.td

mlir/lib/Dialect/Tensor/IR/TensorOps.cpp

mlir/test/Dialect/Linalg/vectorize-tensor-extract-masked.mlir

mlir/test/Dialect/Linalg/vectorize-tensor-extract.mlir

mlir/test/Dialect/Tensor/fold-tensor-subset-ops.mlir

mlir/test/Dialect/Tensor/invalid.mlir

[mlir][tensor] Improve verifiers: detect out-of-bounds accesses
Changes PlannedPublic