This is an archive of the discontinued LLVM Phabricator instance.

I'm currently working on extending ExtractOp/InsertOp to support Value indices. I should be able to have something this week. Is this change blocking you? If not, it would help if we can revert it and wait for my change so that I don't have to deal with all the conflicts, as I'm touching the exact same lines and more. Do you think that would be feasible?

Can you explain a bit how you are planning to change these ops? I think it was on purpose that these ops only support static indices. How does this lower to LLVM? There are comments such as this, which sounds like it may not be possible to lower such ops to LLVM without bigger changes:

In VectorToSCF.h
[...]
/// This is consistent with the lack of an LLVM instruction to dynamically
/// index into an aggregate (see the Vector dialect lowering to LLVM deep dive).

(Also see https://mlir.llvm.org/docs/Dialects/Vector/#deeperdive.)

Also, how are vectors different from tensors after this change? At the moment, one main difference is that tensors can be indexed dynamically (vector cannot).

If we actually need Value indices, can we make these mixed static/dynamic? Like tensor.extract_slice, tensor.insert_slice, memref.subview, etc. These accept a mix of SSA values and int64_t. This composes better with the remaining OpFoldResult/getMixedSizes/OpBuilder::createOrFold/affine::makeComposedFoldedAffineApply/... APIs that we have throughout MLIR. If we have only Value indices, we have to create many arith.constant ops. E.g., this is happening at moment when unrolling n-D vector transfers with VectorToSCF because many vector ops do not yet support "mixed" indices.

We can add a new interface called MixedIndicesOpInterface to ViewLikeInterfaces.td that is like OffsetSizeAndStrideOpInterface but has only one offsets/indices, but no sizes or strides. This change here would actually be the first step towards that direction, because the static offsets should be a DenseI64ArrayAttr.

In D156684#4547806, @springerm wrote:

Can you explain a bit how you are planning to change these ops?

The following links should give you enough context about the motivation:

https://discourse.llvm.org/t/rfc-psa-remove-vector-extractelement-and-vector-insertelement-ops-in-favor-of-vector-extract-and-vector-insert-ops/71116/5
https://reviews.llvm.org/D155034

I think it was on purpose that these ops only support static indices. How does this lower to LLVM? There are comments such as this, which sounds like it may not be possible to lower such ops to LLVM without bigger changes:

LLVM shouldn't be a limitation for what we can model at MLIR level, esp. now that we have "real" 2D vector operations in MLIR, such as SME.

Also, how are vectors different from tensors after this change? At the moment, one main difference is that tensors can be indexed dynamically (vector cannot).

We can insert/extract elements from dynamic positions even in LLVM, right? https://llvm.org/docs/LangRef.html#extractvalue-instruction

If we actually need Value indices, can we make these mixed static/dynamic? Like tensor.extract_slice, tensor.insert_slice, memref.subview, etc. These accept a mix of SSA values and int64_t. This composes better with the remaining OpFoldResult/getMixedSizes/OpBuilder::createOrFold/affine::makeComposedFoldedAffineApply/... APIs that we have throughout MLIR. If we have only Value indices, we have to create many arith.constant ops. E.g., this is happening at moment when unrolling n-D vector transfers with VectorToSCF because many vector ops do not yet support "mixed" indices.

We can add a new interface called MixedIndicesOpInterface to ViewLikeInterfaces.td that is like OffsetSizeAndStrideOpInterface but has only one offsets/indices, but no sizes or strides. This change here would actually be the first step towards that direction, because the static offsets should be a DenseI64ArrayAttr.

Yeah, using mixed static/dynamic indices was pointed out in the review and I have all of that implemented locally. I don't have the interface refactoring, though, which is something we can introduce after the fact. However, if you think you have cycles to help with the interface refactoring I would be more than happy to refactor what I have on top of that :).

Hopefully it makes more sense now!

We can insert/extract elements from dynamic positions even in LLVM, right? https://llvm.org/docs/LangRef.html#extractvalue-instruction

This instruction won't work for LLVM vectors:

The ‘extractvalue’ instruction extracts the value of a member field from an aggregate value.
[...]
Aggregate Types are a subset of derived types that can contain multiple member types. Arrays and structs are aggregate types. Vectors are not considered to be aggregate types.

Also, the indices must be static:

The other operands are constant indices to specify which value to extract in a similar manner as indices in a ‘getelementptr’ instruction.

In D155034 (mlir/lib/Conversion/VectorToLLVM/ConvertVectorToLLVM.cpp), you are currently using convertConstantsToInts, so there is an implicit assumption that all indices are static when lowering to LLVM.

Yeah, using mixed static/dynamic indices was pointed out in the review and I have all of that implemented locally. I don't have the interface refactoring, though, which is something we can introduce after the fact. However, if you think you have cycles to help with the interface refactoring I would be more than happy to refactor what I have on top of that :).

I can add that tomorrow.

Sorry, I sent you the wrong link. This is it: https://llvm.org/docs/LangRef.html#extractelement-instruction
From which I wanted to point to: The second operand is an index indicating the position from which to extract the element. The index may be a variable of any integer type, and will be treated as an unsigned integer.

D155034 is not up-to-date. I have a version locally that works with variable indices.

The LLVM lowering expects constant indices, which is fine. We will reject those ops without constant indices when it comes to lowering to LLVM but we will have a single op to represent vector extraction regardless of the number of vector dimensions. We currently have duplicated patterns in MLIR, for vector.extract and for vector.extractelement, which is a pain.

Thanks for helping with this!

Revision Contents

Path

Size

mlir/

include/

mlir/

Dialect/

Vector/

IR/

VectorOps.td

7 lines

lib/

Conversion/

VectorToGPU/

VectorToGPU.cpp

5 lines

VectorToLLVM/

ConvertVectorToLLVM.cpp

45 lines

VectorToSCF/

VectorToSCF.cpp

14 lines

VectorToSPIRV/

VectorToSPIRV.cpp

4 lines

Dialect/

Vector/

IR/

VectorOps.cpp

97 lines

Transforms/

LowerVectorContract.cpp

41 lines

LowerVectorMask.cpp

10 lines

VectorDistribute.cpp

9 lines

VectorDropLeadUnitDim.cpp

12 lines

VectorTransferOpTransforms.cpp

2 lines

VectorTransforms.cpp

11 lines

test/

Conversion/

VectorToSPIRV/

vector-to-spirv.mlir

4 lines

Dialect/

Vector/

invalid.mlir

2 lines

Diff 545629

mlir/include/mlir/Dialect/Vector/IR/VectorOps.td

Show First 20 Lines • Show All 567 Lines • ▼ Show 20 Lines	def Vector_ExtractElementOp :
let hasFolder = 1;		let hasFolder = 1;
}		}

def Vector_ExtractOp :		def Vector_ExtractOp :
Vector_Op<"extract", [Pure,		Vector_Op<"extract", [Pure,
PredOpTrait<"operand and result have same element type",		PredOpTrait<"operand and result have same element type",
TCresVTEtIsSameAsOpBase<0, 0>>,		TCresVTEtIsSameAsOpBase<0, 0>>,
InferTypeOpAdaptorWithIsCompatible]>,		InferTypeOpAdaptorWithIsCompatible]>,
Arguments<(ins AnyVectorOfAnyRank:$vector, I64ArrayAttr:$position)>,		Arguments<(ins AnyVectorOfAnyRank:$vector, DenseI64ArrayAttr:$position)>,
Results<(outs AnyType)> {		Results<(outs AnyType)> {
let summary = "extract operation";		let summary = "extract operation";
let description = [{		let description = [{
Takes an n-D vector and a k-D position and extracts the (n-k)-D vector at		Takes an n-D vector and a k-D position and extracts the (n-k)-D vector at
the proper position. Degenerates to an element type if n-k is zero.		the proper position. Degenerates to an element type if n-k is zero.

Example:		Example:

```mlir		```mlir
%1 = vector.extract %0[3]: vector<4x8x16xf32>		%1 = vector.extract %0[3]: vector<4x8x16xf32>
%2 = vector.extract %0[3, 3, 3]: vector<4x8x16xf32>		%2 = vector.extract %0[3, 3, 3]: vector<4x8x16xf32>
%3 = vector.extract %1[]: vector<f32>		%3 = vector.extract %1[]: vector<f32>
```		```
}];		}];
let builders = [		let builders = [
OpBuilder<(ins "Value":$source, "ArrayRef<int64_t>":$position)>,
// Convenience builder which assumes the values in `position` are defined by		// Convenience builder which assumes the values in `position` are defined by
// ConstantIndexOp.		// ConstantIndexOp.
OpBuilder<(ins "Value":$source, "ValueRange":$position)>		OpBuilder<(ins "Value":$source, "ValueRange":$position)>
];		];
let extraClassDeclaration = [{		let extraClassDeclaration = [{
VectorType getSourceVectorType() {		VectorType getSourceVectorType() {
return ::llvm::cast<VectorType>(getVector().getType());		return ::llvm::cast<VectorType>(getVector().getType());
}		}
▲ Show 20 Lines • Show All 83 Lines • ▼ Show 20 Lines	def Vector_InsertElementOp :
let hasFolder = 1;		let hasFolder = 1;
}		}

def Vector_InsertOp :		def Vector_InsertOp :
Vector_Op<"insert", [Pure,		Vector_Op<"insert", [Pure,
PredOpTrait<"source operand and result have same element type",		PredOpTrait<"source operand and result have same element type",
TCresVTEtIsSameAsOpBase<0, 0>>,		TCresVTEtIsSameAsOpBase<0, 0>>,
AllTypesMatch<["dest", "res"]>]>,		AllTypesMatch<["dest", "res"]>]>,
Arguments<(ins AnyType:$source, AnyVectorOfAnyRank:$dest, I64ArrayAttr:$position)>,		Arguments<(ins AnyType:$source, AnyVectorOfAnyRank:$dest, DenseI64ArrayAttr:$position)>,
Results<(outs AnyVectorOfAnyRank:$res)> {		Results<(outs AnyVectorOfAnyRank:$res)> {
let summary = "insert operation";		let summary = "insert operation";
let description = [{		let description = [{
Takes an n-D source vector, an (n+k)-D destination vector and a k-D position		Takes an n-D source vector, an (n+k)-D destination vector and a k-D position
and inserts the n-D source into the (n+k)-D destination at the proper		and inserts the n-D source into the (n+k)-D destination at the proper
position. Degenerates to a scalar or a 0-d vector source type when n = 0.		position. Degenerates to a scalar or a 0-d vector source type when n = 0.

Example:		Example:

```mlir		```mlir
%2 = vector.insert %0, %1[3] : vector<8x16xf32> into vector<4x8x16xf32>		%2 = vector.insert %0, %1[3] : vector<8x16xf32> into vector<4x8x16xf32>
%5 = vector.insert %3, %4[3, 3, 3] : f32 into vector<4x8x16xf32>		%5 = vector.insert %3, %4[3, 3, 3] : f32 into vector<4x8x16xf32>
%8 = vector.insert %6, %7[] : f32 into vector<f32>		%8 = vector.insert %6, %7[] : f32 into vector<f32>
%11 = vector.insert %9, %10[3, 3, 3] : vector<f32> into vector<4x8x16xf32>		%11 = vector.insert %9, %10[3, 3, 3] : vector<f32> into vector<4x8x16xf32>
```		```
}];		}];
let assemblyFormat = [{		let assemblyFormat = [{
$source `,` $dest $position attr-dict `:` type($source) `into` type($dest)		$source `,` $dest $position attr-dict `:` type($source) `into` type($dest)
}];		}];

let builders = [		let builders = [
OpBuilder<(ins "Value":$source, "Value":$dest,
"ArrayRef<int64_t>":$position)>,
// Convenience builder which assumes all values are constant indices.		// Convenience builder which assumes all values are constant indices.
OpBuilder<(ins "Value":$source, "Value":$dest, "ValueRange":$position)>		OpBuilder<(ins "Value":$source, "Value":$dest, "ValueRange":$position)>
];		];
let extraClassDeclaration = [{		let extraClassDeclaration = [{
Type getSourceType() { return getSource().getType(); }		Type getSourceType() { return getSource().getType(); }
VectorType getDestVectorType() {		VectorType getDestVectorType() {
return ::llvm::cast<VectorType>(getDest().getType());		return ::llvm::cast<VectorType>(getDest().getType());
}		}
▲ Show 20 Lines • Show All 2,176 Lines • Show Last 20 Lines

mlir/lib/Conversion/VectorToGPU/VectorToGPU.cpp

Show First 20 Lines • Show All 801 Lines • ▼ Show 20 Lines	for (int i = 0; i < vectorType.getShape()[0]; i++) {
loc, rewriter.getIndexType(),		loc, rewriter.getIndexType(),
rewriter.getIndexAttr(i * regInfo->elementsPerRegister));		rewriter.getIndexAttr(i * regInfo->elementsPerRegister));
SmallVector<Value, 4> newIndices;		SmallVector<Value, 4> newIndices;
getXferIndices<vector::TransferReadOp>(		getXferIndices<vector::TransferReadOp>(
rewriter, op, *coords, {laneId, logicalValueId}, newIndices);		rewriter, op, *coords, {laneId, logicalValueId}, newIndices);

Value el = rewriter.create<vector::LoadOp>(loc, loadedElType,		Value el = rewriter.create<vector::LoadOp>(loc, loadedElType,
op.getSource(), newIndices);		op.getSource(), newIndices);
result = rewriter.create<vector::InsertOp>(loc, el, result,		result = rewriter.create<vector::InsertOp>(loc, el, result, i);
rewriter.getI64ArrayAttr(i));
}		}
} else {		} else {
if (auto vecType = dyn_cast<VectorType>(loadedElType)) {		if (auto vecType = dyn_cast<VectorType>(loadedElType)) {
loadedElType = vecType.getElementType();		loadedElType = vecType.getElementType();
}		}
for (int i = 0; i < vectorType.getShape()[0]; i++) {		for (int i = 0; i < vectorType.getShape()[0]; i++) {
for (unsigned innerIdx = 0; innerIdx < vectorType.getShape()[1];		for (unsigned innerIdx = 0; innerIdx < vectorType.getShape()[1];
innerIdx++) {		innerIdx++) {

Value logicalValueId = rewriter.create<arith::ConstantOp>(		Value logicalValueId = rewriter.create<arith::ConstantOp>(
loc, rewriter.getIndexType(),		loc, rewriter.getIndexType(),
rewriter.getIndexAttr(i * regInfo->elementsPerRegister + innerIdx));		rewriter.getIndexAttr(i * regInfo->elementsPerRegister + innerIdx));
FailureOr<AffineMap> coords = nvgpu::getLaneIdAndValueIdToOperandCoord(		FailureOr<AffineMap> coords = nvgpu::getLaneIdAndValueIdToOperandCoord(
rewriter, op.getLoc(), *warpMatrixInfo);		rewriter, op.getLoc(), *warpMatrixInfo);
if (failed(coords))		if (failed(coords))
return rewriter.notifyMatchFailure(op, "no coords");		return rewriter.notifyMatchFailure(op, "no coords");

SmallVector<Value, 4> newIndices;		SmallVector<Value, 4> newIndices;
getXferIndices<vector::TransferReadOp>(		getXferIndices<vector::TransferReadOp>(
rewriter, op, *coords, {laneId, logicalValueId}, newIndices);		rewriter, op, *coords, {laneId, logicalValueId}, newIndices);
Value el = rewriter.create<memref::LoadOp>(op.getLoc(), loadedElType,		Value el = rewriter.create<memref::LoadOp>(op.getLoc(), loadedElType,
op.getSource(), newIndices);		op.getSource(), newIndices);
result = rewriter.create<vector::InsertOp>(		result = rewriter.create<vector::InsertOp>(
op.getLoc(), el, result, rewriter.getI64ArrayAttr({i, innerIdx}));		op.getLoc(), el, result, ArrayRef<int64_t>{i, innerIdx});
}		}
}		}
}		}

valueMapping[op.getResult()] = result;		valueMapping[op.getResult()] = result;
return success();		return success();
}		}

▲ Show 20 Lines • Show All 485 Lines • Show Last 20 Lines

mlir/lib/Conversion/VectorToLLVM/ConvertVectorToLLVM.cpp

Show First 20 Lines • Show All 1,019 Lines • ▼ Show 20 Lines	public:
using ConvertOpToLLVMPattern<vector::ExtractOp>::ConvertOpToLLVMPattern;		using ConvertOpToLLVMPattern<vector::ExtractOp>::ConvertOpToLLVMPattern;

LogicalResult		LogicalResult
matchAndRewrite(vector::ExtractOp extractOp, OpAdaptor adaptor,		matchAndRewrite(vector::ExtractOp extractOp, OpAdaptor adaptor,
ConversionPatternRewriter &rewriter) const override {		ConversionPatternRewriter &rewriter) const override {
auto loc = extractOp->getLoc();		auto loc = extractOp->getLoc();
auto resultType = extractOp.getResult().getType();		auto resultType = extractOp.getResult().getType();
auto llvmResultType = typeConverter->convertType(resultType);		auto llvmResultType = typeConverter->convertType(resultType);
auto positionArrayAttr = extractOp.getPosition();		ArrayRef<int64_t> positionArray = extractOp.getPosition();

// Bail if result type cannot be lowered.		// Bail if result type cannot be lowered.
if (!llvmResultType)		if (!llvmResultType)
return failure();		return failure();

// Extract entire vector. Should be handled by folder, but just to be safe.		// Extract entire vector. Should be handled by folder, but just to be safe.
if (positionArrayAttr.empty()) {		if (positionArray.empty()) {
rewriter.replaceOp(extractOp, adaptor.getVector());		rewriter.replaceOp(extractOp, adaptor.getVector());
return success();		return success();
}		}

// One-shot extraction of vector from array (only requires extractvalue).		// One-shot extraction of vector from array (only requires extractvalue).
if (isa<VectorType>(resultType)) {		if (isa<VectorType>(resultType)) {
SmallVector<int64_t> indices;
for (auto idx : positionArrayAttr.getAsRange<IntegerAttr>())
indices.push_back(idx.getInt());
Value extracted = rewriter.create<LLVM::ExtractValueOp>(		Value extracted = rewriter.create<LLVM::ExtractValueOp>(
loc, adaptor.getVector(), indices);		loc, adaptor.getVector(), positionArray);
rewriter.replaceOp(extractOp, extracted);		rewriter.replaceOp(extractOp, extracted);
return success();		return success();
}		}

// Potential extraction of 1-D vector from array.		// Potential extraction of 1-D vector from array.
Value extracted = adaptor.getVector();		Value extracted = adaptor.getVector();
auto positionAttrs = positionArrayAttr.getValue();		if (positionArray.size() > 1) {
if (positionAttrs.size() > 1) {		extracted = rewriter.create<LLVM::ExtractValueOp>(
SmallVector<int64_t> nMinusOnePosition;		loc, extracted, positionArray.drop_back());
for (auto idx : positionAttrs.drop_back())
nMinusOnePosition.push_back(cast<IntegerAttr>(idx).getInt());
extracted = rewriter.create<LLVM::ExtractValueOp>(loc, extracted,
nMinusOnePosition);
}		}

// Remaining extraction of element from 1-D LLVM vector		// Remaining extraction of element from 1-D LLVM vector
auto position = cast<IntegerAttr>(positionAttrs.back());
auto i64Type = IntegerType::get(rewriter.getContext(), 64);		auto i64Type = IntegerType::get(rewriter.getContext(), 64);
auto constant = rewriter.create<LLVM::ConstantOp>(loc, i64Type, position);		auto constant =
		rewriter.create<LLVM::ConstantOp>(loc, i64Type, positionArray.back());
extracted =		extracted =
rewriter.create<LLVM::ExtractElementOp>(loc, extracted, constant);		rewriter.create<LLVM::ExtractElementOp>(loc, extracted, constant);
rewriter.replaceOp(extractOp, extracted);		rewriter.replaceOp(extractOp, extracted);

return success();		return success();
}		}
};		};

▲ Show 20 Lines • Show All 68 Lines • ▼ Show 20 Lines	public:

LogicalResult		LogicalResult
matchAndRewrite(vector::InsertOp insertOp, OpAdaptor adaptor,		matchAndRewrite(vector::InsertOp insertOp, OpAdaptor adaptor,
ConversionPatternRewriter &rewriter) const override {		ConversionPatternRewriter &rewriter) const override {
auto loc = insertOp->getLoc();		auto loc = insertOp->getLoc();
auto sourceType = insertOp.getSourceType();		auto sourceType = insertOp.getSourceType();
auto destVectorType = insertOp.getDestVectorType();		auto destVectorType = insertOp.getDestVectorType();
auto llvmResultType = typeConverter->convertType(destVectorType);		auto llvmResultType = typeConverter->convertType(destVectorType);
auto positionArrayAttr = insertOp.getPosition();		ArrayRef<int64_t> positionArray = insertOp.getPosition();

// Bail if result type cannot be lowered.		// Bail if result type cannot be lowered.
if (!llvmResultType)		if (!llvmResultType)
return failure();		return failure();

// Overwrite entire vector with value. Should be handled by folder, but		// Overwrite entire vector with value. Should be handled by folder, but
// just to be safe.		// just to be safe.
if (positionArrayAttr.empty()) {		if (positionArray.empty()) {
rewriter.replaceOp(insertOp, adaptor.getSource());		rewriter.replaceOp(insertOp, adaptor.getSource());
return success();		return success();
}		}

// One-shot insertion of a vector into an array (only requires insertvalue).		// One-shot insertion of a vector into an array (only requires insertvalue).
if (isa<VectorType>(sourceType)) {		if (isa<VectorType>(sourceType)) {
Value inserted = rewriter.create<LLVM::InsertValueOp>(		Value inserted = rewriter.create<LLVM::InsertValueOp>(
loc, adaptor.getDest(), adaptor.getSource(),		loc, adaptor.getDest(), adaptor.getSource(), positionArray);
LLVM::convertArrayToIndices(positionArrayAttr));
rewriter.replaceOp(insertOp, inserted);		rewriter.replaceOp(insertOp, inserted);
return success();		return success();
}		}

// Potential extraction of 1-D vector from array.		// Potential extraction of 1-D vector from array.
Value extracted = adaptor.getDest();		Value extracted = adaptor.getDest();
auto positionAttrs = positionArrayAttr.getValue();
auto position = cast<IntegerAttr>(positionAttrs.back());
auto oneDVectorType = destVectorType;		auto oneDVectorType = destVectorType;
if (positionAttrs.size() > 1) {		if (positionArray.size() > 1) {
oneDVectorType = reducedVectorTypeBack(destVectorType);		oneDVectorType = reducedVectorTypeBack(destVectorType);
extracted = rewriter.create<LLVM::ExtractValueOp>(		extracted = rewriter.create<LLVM::ExtractValueOp>(
loc, extracted,		loc, extracted, positionArray.drop_back());
LLVM::convertArrayToIndices(positionAttrs.drop_back()));
}		}

// Insertion of an element into a 1-D LLVM vector.		// Insertion of an element into a 1-D LLVM vector.
auto i64Type = IntegerType::get(rewriter.getContext(), 64);		auto i64Type = IntegerType::get(rewriter.getContext(), 64);
auto constant = rewriter.create<LLVM::ConstantOp>(loc, i64Type, position);		auto constant =
		rewriter.create<LLVM::ConstantOp>(loc, i64Type, positionArray.back());
Value inserted = rewriter.create<LLVM::InsertElementOp>(		Value inserted = rewriter.create<LLVM::InsertElementOp>(
loc, typeConverter->convertType(oneDVectorType), extracted,		loc, typeConverter->convertType(oneDVectorType), extracted,
adaptor.getSource(), constant);		adaptor.getSource(), constant);

// Potential insertion of resulting 1-D vector into array.		// Potential insertion of resulting 1-D vector into array.
if (positionAttrs.size() > 1) {		if (positionArray.size() > 1) {
inserted = rewriter.create<LLVM::InsertValueOp>(		inserted = rewriter.create<LLVM::InsertValueOp>(
loc, adaptor.getDest(), inserted,		loc, adaptor.getDest(), inserted, positionArray.drop_back());
LLVM::convertArrayToIndices(positionAttrs.drop_back()));
}		}

rewriter.replaceOp(insertOp, inserted);		rewriter.replaceOp(insertOp, inserted);
return success();		return success();
}		}
};		};

/// Lower vector.scalable.insert ops to LLVM vector.insert		/// Lower vector.scalable.insert ops to LLVM vector.insert
▲ Show 20 Lines • Show All 520 Lines • Show Last 20 Lines

mlir/lib/Conversion/VectorToSCF/VectorToSCF.cpp

Show First 20 Lines • Show All 880 Lines • ▼ Show 20 Lines	vector::InsertOp getInsertOp(TransferReadOp xferOp) const {

return vector::InsertOp();		return vector::InsertOp();
}		}

/// If the result of the TransferReadOp has exactly one user, which is a		/// If the result of the TransferReadOp has exactly one user, which is a
/// vector::InsertOp, return that operation's indices.		/// vector::InsertOp, return that operation's indices.
void getInsertionIndices(TransferReadOp xferOp,		void getInsertionIndices(TransferReadOp xferOp,
SmallVector<int64_t, 8> &indices) const {		SmallVector<int64_t, 8> &indices) const {
if (auto insertOp = getInsertOp(xferOp)) {		if (auto insertOp = getInsertOp(xferOp))
for (Attribute attr : insertOp.getPosition())		indices.assign(insertOp.getPosition().begin(),
indices.push_back(dyn_cast<IntegerAttr>(attr).getInt());		insertOp.getPosition().end());
}
}		}

/// Rewrite the op: Unpack one dimension. Can handle masks, out-of-bounds		/// Rewrite the op: Unpack one dimension. Can handle masks, out-of-bounds
/// accesses, and broadcasts and transposes in permutation maps.		/// accesses, and broadcasts and transposes in permutation maps.
LogicalResult matchAndRewrite(TransferReadOp xferOp,		LogicalResult matchAndRewrite(TransferReadOp xferOp,
PatternRewriter &rewriter) const override {		PatternRewriter &rewriter) const override {
if (xferOp.getVectorType().getRank() <= options.targetRank)		if (xferOp.getVectorType().getRank() <= options.targetRank)
return failure();		return failure();
▲ Show 20 Lines • Show All 107 Lines • ▼ Show 20 Lines	if (auto *op = xferOp.getVector().getDefiningOp())
return dyn_cast<vector::ExtractOp>(op);		return dyn_cast<vector::ExtractOp>(op);
return vector::ExtractOp();		return vector::ExtractOp();
}		}

/// If the input of the given TransferWriteOp is an ExtractOp, return its		/// If the input of the given TransferWriteOp is an ExtractOp, return its
/// indices.		/// indices.
void getExtractionIndices(TransferWriteOp xferOp,		void getExtractionIndices(TransferWriteOp xferOp,
SmallVector<int64_t, 8> &indices) const {		SmallVector<int64_t, 8> &indices) const {
if (auto extractOp = getExtractOp(xferOp)) {		if (auto extractOp = getExtractOp(xferOp))
for (Attribute attr : extractOp.getPosition())		indices.assign(extractOp.getPosition().begin(),
indices.push_back(dyn_cast<IntegerAttr>(attr).getInt());		extractOp.getPosition().end());
}
}		}

/// Rewrite the op: Unpack one dimension. Can handle masks, out-of-bounds		/// Rewrite the op: Unpack one dimension. Can handle masks, out-of-bounds
/// accesses, and broadcasts and transposes in permutation maps.		/// accesses, and broadcasts and transposes in permutation maps.
LogicalResult matchAndRewrite(TransferWriteOp xferOp,		LogicalResult matchAndRewrite(TransferWriteOp xferOp,
PatternRewriter &rewriter) const override {		PatternRewriter &rewriter) const override {
if (xferOp.getVectorType().getRank() <= options.targetRank)		if (xferOp.getVectorType().getRank() <= options.targetRank)
return failure();		return failure();
▲ Show 20 Lines • Show All 295 Lines • Show Last 20 Lines

mlir/lib/Conversion/VectorToSPIRV/VectorToSPIRV.cpp

Show First 20 Lines • Show All 146 Lines • ▼ Show 20 Lines	matchAndRewrite(vector::ExtractOp extractOp, OpAdaptor adaptor,
if (!dstType)		if (!dstType)
return failure();		return failure();

if (isa<spirv::ScalarType>(adaptor.getVector().getType())) {		if (isa<spirv::ScalarType>(adaptor.getVector().getType())) {
rewriter.replaceOp(extractOp, adaptor.getVector());		rewriter.replaceOp(extractOp, adaptor.getVector());
return success();		return success();
}		}

int32_t id = getFirstIntValue(extractOp.getPosition());		int32_t id = extractOp.getPosition()[0];
rewriter.replaceOpWithNewOp<spirv::CompositeExtractOp>(		rewriter.replaceOpWithNewOp<spirv::CompositeExtractOp>(
extractOp, adaptor.getVector(), id);		extractOp, adaptor.getVector(), id);
return success();		return success();
}		}
};		};

struct VectorExtractStridedSliceOpConvert final		struct VectorExtractStridedSliceOpConvert final
: public OpConversionPattern<vector::ExtractStridedSliceOp> {		: public OpConversionPattern<vector::ExtractStridedSliceOp> {
▲ Show 20 Lines • Show All 63 Lines • ▼ Show 20 Lines	matchAndRewrite(vector::InsertOp insertOp, OpAdaptor adaptor,

// Special case for inserting scalar values into size-1 vectors.		// Special case for inserting scalar values into size-1 vectors.
if (insertOp.getSourceType().isIntOrFloat() &&		if (insertOp.getSourceType().isIntOrFloat() &&
insertOp.getDestVectorType().getNumElements() == 1) {		insertOp.getDestVectorType().getNumElements() == 1) {
rewriter.replaceOp(insertOp, adaptor.getSource());		rewriter.replaceOp(insertOp, adaptor.getSource());
return success();		return success();
}		}

int32_t id = getFirstIntValue(insertOp.getPosition());		int32_t id = insertOp.getPosition()[0];
rewriter.replaceOpWithNewOp<spirv::CompositeInsertOp>(		rewriter.replaceOpWithNewOp<spirv::CompositeInsertOp>(
insertOp, adaptor.getSource(), adaptor.getDest(), id);		insertOp, adaptor.getSource(), adaptor.getDest(), id);
return success();		return success();
}		}
};		};

struct VectorExtractElementOpConvert final		struct VectorExtractElementOpConvert final
: public OpConversionPattern<vector::ExtractElementOp> {		: public OpConversionPattern<vector::ExtractElementOp> {
▲ Show 20 Lines • Show All 348 Lines • Show Last 20 Lines

mlir/lib/Dialect/Vector/IR/VectorOps.cpp

Show First 20 Lines • Show All 379 Lines • ▼ Show 20 Lines	if (auto dstVecType = dyn_cast<VectorType>(reductionOp.getDestType())) {
VectorType::get(dstVecType.getShape(), rewriter.getI1Type());		VectorType::get(dstVecType.getShape(), rewriter.getI1Type());
mask = rewriter.create<vector::ShapeCastOp>(loc, newMaskType, mask);		mask = rewriter.create<vector::ShapeCastOp>(loc, newMaskType, mask);
}		}
cast = rewriter.create<vector::ShapeCastOp>(		cast = rewriter.create<vector::ShapeCastOp>(
loc, reductionOp.getDestType(), reductionOp.getSource());		loc, reductionOp.getDestType(), reductionOp.getSource());
} else {		} else {
// This means we are reducing all the dimensions, and all reduction		// This means we are reducing all the dimensions, and all reduction
// dimensions are of size 1. So a simple extraction would do.		// dimensions are of size 1. So a simple extraction would do.
auto zeroAttr =		SmallVector<int64_t> zeroAttr(shape.size(), 0);
rewriter.getI64ArrayAttr(SmallVector<int64_t>(shape.size(), 0));
if (mask)		if (mask)
mask = rewriter.create<vector::ExtractOp>(loc, rewriter.getI1Type(),		mask = rewriter.create<vector::ExtractOp>(loc, rewriter.getI1Type(),
mask, zeroAttr);		mask, zeroAttr);
cast = rewriter.create<vector::ExtractOp>(		cast = rewriter.create<vector::ExtractOp>(
loc, reductionOp.getDestType(), reductionOp.getSource(), zeroAttr);		loc, reductionOp.getDestType(), reductionOp.getSource(), zeroAttr);
}		}

Value result = vector::makeArithReduction(		Value result = vector::makeArithReduction(
▲ Show 20 Lines • Show All 157 Lines • ▼ Show 20 Lines	LogicalResult matchAndRewrite(ReductionOp reductionOp,
Location loc = reductionOp.getLoc();		Location loc = reductionOp.getLoc();
Value result;		Value result;
if (vectorType.getRank() == 0) {		if (vectorType.getRank() == 0) {
if (mask)		if (mask)
mask = rewriter.create<ExtractElementOp>(loc, mask);		mask = rewriter.create<ExtractElementOp>(loc, mask);
result = rewriter.create<ExtractElementOp>(loc, reductionOp.getVector());		result = rewriter.create<ExtractElementOp>(loc, reductionOp.getVector());
} else {		} else {
if (mask) {		if (mask) {
mask = rewriter.create<ExtractOp>(loc, rewriter.getI1Type(), mask,		mask = rewriter.create<ExtractOp>(loc, rewriter.getI1Type(), mask, 0);
rewriter.getI64ArrayAttr(0));
}		}
result = rewriter.create<ExtractOp>(loc, reductionOp.getType(),		result = rewriter.create<ExtractOp>(loc, reductionOp.getType(),
reductionOp.getVector(),		reductionOp.getVector(), 0);
rewriter.getI64ArrayAttr(0));
}		}

if (Value acc = reductionOp.getAcc())		if (Value acc = reductionOp.getAcc())
result = vector::makeArithReduction(rewriter, loc, reductionOp.getKind(),		result = vector::makeArithReduction(rewriter, loc, reductionOp.getKind(),
result, acc, mask);		result, acc, mask);

rewriter.replaceOp(rootOp, result);		rewriter.replaceOp(rootOp, result);
return success();		return success();
▲ Show 20 Lines • Show All 547 Lines • ▼ Show 20 Lines	OpFoldResult vector::ExtractElementOp::fold(FoldAdaptor adaptor) {

return srcElements[posIdx];		return srcElements[posIdx];
}		}

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// ExtractOp		// ExtractOp
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

void vector::ExtractOp::build(OpBuilder &builder, OperationState &result,
Value source, ArrayRef<int64_t> position) {
build(builder, result, source, getVectorSubscriptAttr(builder, position));
}

// Convenience builder which assumes the values are constant indices.		// Convenience builder which assumes the values are constant indices.
void vector::ExtractOp::build(OpBuilder &builder, OperationState &result,		void vector::ExtractOp::build(OpBuilder &builder, OperationState &result,
Value source, ValueRange position) {		Value source, ValueRange position) {
SmallVector<int64_t, 4> positionConstants =		SmallVector<int64_t> positionConstants = llvm::to_vector(llvm::map_range(
llvm::to_vector<4>(llvm::map_range(position, [](Value pos) {		position, [](Value pos) { return getConstantIntValue(pos).value(); }));
return getConstantIntValue(pos).value();
}));
build(builder, result, source, positionConstants);		build(builder, result, source, positionConstants);
}		}

LogicalResult		LogicalResult
ExtractOp::inferReturnTypes(MLIRContext *, std::optional<Location>,		ExtractOp::inferReturnTypes(MLIRContext *, std::optional<Location>,
ExtractOp::Adaptor adaptor,		ExtractOp::Adaptor adaptor,
SmallVectorImpl<Type> &inferredReturnTypes) {		SmallVectorImpl<Type> &inferredReturnTypes) {
auto vectorType = llvm::cast<VectorType>(adaptor.getVector().getType());		auto vectorType = llvm::cast<VectorType>(adaptor.getVector().getType());
Show All 18 Lines	bool ExtractOp::isCompatibleReturnTypes(TypeRange l, TypeRange r) {
};		};
if (l.size() == 1 && r.size() == 1 &&		if (l.size() == 1 && r.size() == 1 &&
(isCompatible(l, r) \|\| isCompatible(r, l)))		(isCompatible(l, r) \|\| isCompatible(r, l)))
return true;		return true;
return l == r;		return l == r;
}		}

LogicalResult vector::ExtractOp::verify() {		LogicalResult vector::ExtractOp::verify() {
auto positionAttr = getPosition().getValue();		ArrayRef<int64_t> position = getPosition();
if (positionAttr.size() >		if (position.size() > static_cast<unsigned>(getSourceVectorType().getRank()))
static_cast<unsigned>(getSourceVectorType().getRank()))
return emitOpError(		return emitOpError(
"expected position attribute of rank no greater than vector rank");		"expected position attribute of rank no greater than vector rank");
for (const auto &en : llvm::enumerate(positionAttr)) {		for (const auto &en : llvm::enumerate(position)) {
auto attr = llvm::dyn_cast<IntegerAttr>(en.value());		if (en.value() < 0 \|\|
if (!attr \|\| attr.getInt() < 0 \|\|		en.value() >= getSourceVectorType().getDimSize(en.index()))
attr.getInt() >= getSourceVectorType().getDimSize(en.index()))
return emitOpError("expected position attribute #")		return emitOpError("expected position attribute #")
<< (en.index() + 1)		<< (en.index() + 1)
<< " to be a non-negative integer smaller than the corresponding "		<< " to be a non-negative integer smaller than the corresponding "
"vector dimension";		"vector dimension";
}		}
return success();		return success();
}		}

template <typename IntType>		template <typename IntType>
static SmallVector<IntType> extractVector(ArrayAttr arrayAttr) {		static SmallVector<IntType> extractVector(ArrayAttr arrayAttr) {
return llvm::to_vector<4>(llvm::map_range(		return llvm::to_vector<4>(llvm::map_range(
arrayAttr.getAsRange<IntegerAttr>(),		arrayAttr.getAsRange<IntegerAttr>(),
[](IntegerAttr attr) { return static_cast<IntType>(attr.getInt()); }));		[](IntegerAttr attr) { return static_cast<IntType>(attr.getInt()); }));
}		}

/// Fold the result of chains of ExtractOp in place by simply concatenating the		/// Fold the result of chains of ExtractOp in place by simply concatenating the
/// positions.		/// positions.
static LogicalResult foldExtractOpFromExtractChain(ExtractOp extractOp) {		static LogicalResult foldExtractOpFromExtractChain(ExtractOp extractOp) {
if (!extractOp.getVector().getDefiningOp<ExtractOp>())		if (!extractOp.getVector().getDefiningOp<ExtractOp>())
return failure();		return failure();

SmallVector<int64_t, 4> globalPosition;		SmallVector<int64_t, 4> globalPosition;
ExtractOp currentOp = extractOp;		ExtractOp currentOp = extractOp;
auto extrPos = extractVector<int64_t>(currentOp.getPosition());		ArrayRef<int64_t> extrPos = currentOp.getPosition();
globalPosition.append(extrPos.rbegin(), extrPos.rend());		globalPosition.append(extrPos.rbegin(), extrPos.rend());
while (ExtractOp nextOp = currentOp.getVector().getDefiningOp<ExtractOp>()) {		while (ExtractOp nextOp = currentOp.getVector().getDefiningOp<ExtractOp>()) {
currentOp = nextOp;		currentOp = nextOp;
auto extrPos = extractVector<int64_t>(currentOp.getPosition());		ArrayRef<int64_t> extrPos = currentOp.getPosition();
globalPosition.append(extrPos.rbegin(), extrPos.rend());		globalPosition.append(extrPos.rbegin(), extrPos.rend());
}		}
extractOp.setOperand(currentOp.getVector());		extractOp.setOperand(currentOp.getVector());
// OpBuilder is only used as a helper to build an I64ArrayAttr.		// OpBuilder is only used as a helper to build an I64ArrayAttr.
OpBuilder b(extractOp.getContext());		OpBuilder b(extractOp.getContext());
std::reverse(globalPosition.begin(), globalPosition.end());		std::reverse(globalPosition.begin(), globalPosition.end());
extractOp.setPositionAttr(b.getI64ArrayAttr(globalPosition));		extractOp.setPosition(globalPosition);
return success();		return success();
}		}

namespace {		namespace {
/// Fold an ExtractOp that is fed by a chain of InsertOps and TransposeOps.		/// Fold an ExtractOp that is fed by a chain of InsertOps and TransposeOps.
/// Walk back a chain of InsertOp/TransposeOp until we hit a match.		/// Walk back a chain of InsertOp/TransposeOp until we hit a match.
/// Compose TransposeOp permutations as we walk back.		/// Compose TransposeOp permutations as we walk back.
/// This helper class keeps an updated extraction position `extractPosition`		/// This helper class keeps an updated extraction position `extractPosition`
▲ Show 20 Lines • Show All 94 Lines • ▼ Show 20 Lines
ExtractFromInsertTransposeChainState::ExtractFromInsertTransposeChainState(		ExtractFromInsertTransposeChainState::ExtractFromInsertTransposeChainState(
ExtractOp e)		ExtractOp e)
: extractOp(e), vectorRank(extractOp.getSourceVectorType().getRank()),		: extractOp(e), vectorRank(extractOp.getSourceVectorType().getRank()),
extractedRank(extractOp.getPosition().size()) {		extractedRank(extractOp.getPosition().size()) {
assert(vectorRank >= extractedRank && "extracted pos overflow");		assert(vectorRank >= extractedRank && "extracted pos overflow");
sentinels.reserve(vectorRank - extractedRank);		sentinels.reserve(vectorRank - extractedRank);
for (int64_t i = 0, e = vectorRank - extractedRank; i < e; ++i)		for (int64_t i = 0, e = vectorRank - extractedRank; i < e; ++i)
sentinels.push_back(-(i + 1));		sentinels.push_back(-(i + 1));
extractPosition = extractVector<int64_t>(extractOp.getPosition());		extractPosition.assign(extractOp.getPosition().begin(),
		extractOp.getPosition().end());
llvm::append_range(extractPosition, sentinels);		llvm::append_range(extractPosition, sentinels);
}		}

// Case 1. If we hit a transpose, just compose the map and iterate.		// Case 1. If we hit a transpose, just compose the map and iterate.
// Invariant: insert + transpose do not change rank, we can always compose.		// Invariant: insert + transpose do not change rank, we can always compose.
LogicalResult ExtractFromInsertTransposeChainState::handleTransposeOp() {		LogicalResult ExtractFromInsertTransposeChainState::handleTransposeOp() {
if (!nextTransposeOp)		if (!nextTransposeOp)
return failure();		return failure();
auto permutation = extractVector<unsigned>(nextTransposeOp.getTransp());		auto permutation = extractVector<unsigned>(nextTransposeOp.getTransp());
AffineMap m = inversePermutation(		AffineMap m = inversePermutation(
AffineMap::getPermutationMap(permutation, extractOp.getContext()));		AffineMap::getPermutationMap(permutation, extractOp.getContext()));
extractPosition = applyPermutationMap(m, ArrayRef(extractPosition));		extractPosition = applyPermutationMap(m, ArrayRef(extractPosition));
return success();		return success();
}		}

// Case 2: the insert position matches extractPosition exactly, early return.		// Case 2: the insert position matches extractPosition exactly, early return.
LogicalResult		LogicalResult
ExtractFromInsertTransposeChainState::handleInsertOpWithMatchingPos(		ExtractFromInsertTransposeChainState::handleInsertOpWithMatchingPos(
Value &res) {		Value &res) {
auto insertedPos = extractVector<int64_t>(nextInsertOp.getPosition());		ArrayRef<int64_t> insertedPos = nextInsertOp.getPosition();
if (ArrayRef(insertedPos) !=		if (insertedPos != llvm::ArrayRef(extractPosition).take_front(extractedRank))
llvm::ArrayRef(extractPosition).take_front(extractedRank))
return failure();		return failure();
// Case 2.a. early-exit fold.		// Case 2.a. early-exit fold.
res = nextInsertOp.getSource();		res = nextInsertOp.getSource();
// Case 2.b. if internal transposition is present, canFold will be false.		// Case 2.b. if internal transposition is present, canFold will be false.
return success(canFold());		return success(canFold());
}		}

/// Case 3: if inserted position is a prefix of extractPosition,		/// Case 3: if inserted position is a prefix of extractPosition,
/// extract a portion of the source of the insertion.		/// extract a portion of the source of the insertion.
/// This method updates the internal state.		/// This method updates the internal state.
LogicalResult		LogicalResult
ExtractFromInsertTransposeChainState::handleInsertOpWithPrefixPos(Value &res) {		ExtractFromInsertTransposeChainState::handleInsertOpWithPrefixPos(Value &res) {
auto insertedPos = extractVector<int64_t>(nextInsertOp.getPosition());		ArrayRef<int64_t> insertedPos = nextInsertOp.getPosition();
if (!isContainedWithin(insertedPos, extractPosition))		if (!isContainedWithin(insertedPos, extractPosition))
return failure();		return failure();
// Set leading dims to zero.		// Set leading dims to zero.
std::fill_n(extractPosition.begin(), insertedPos.size(), 0);		std::fill_n(extractPosition.begin(), insertedPos.size(), 0);
// Drop extra leading dims.		// Drop extra leading dims.
extractPosition.erase(extractPosition.begin(),		extractPosition.erase(extractPosition.begin(),
extractPosition.begin() + insertedPos.size());		extractPosition.begin() + insertedPos.size());
extractedRank = extractPosition.size() - sentinels.size();		extractedRank = extractPosition.size() - sentinels.size();
Show All 9 Lines
Value ExtractFromInsertTransposeChainState::tryToFoldExtractOpInPlace(		Value ExtractFromInsertTransposeChainState::tryToFoldExtractOpInPlace(
Value source) {		Value source) {
// If we can't fold (either internal transposition, or nothing to fold), bail.		// If we can't fold (either internal transposition, or nothing to fold), bail.
bool nothingToFold = (source == extractOp.getVector());		bool nothingToFold = (source == extractOp.getVector());
if (nothingToFold \|\| !canFold())		if (nothingToFold \|\| !canFold())
return Value();		return Value();
// Otherwise, fold by updating the op inplace and return its result.		// Otherwise, fold by updating the op inplace and return its result.
OpBuilder b(extractOp.getContext());		OpBuilder b(extractOp.getContext());
extractOp->setAttr(		extractOp.setPosition(ArrayRef(extractPosition).take_front(extractedRank));
extractOp.getPositionAttrName(),
b.getI64ArrayAttr(ArrayRef(extractPosition).take_front(extractedRank)));
extractOp.getVectorMutable().assign(source);		extractOp.getVectorMutable().assign(source);
return extractOp.getResult();		return extractOp.getResult();
}		}

/// Iterate over producing insert and transpose ops until we find a fold.		/// Iterate over producing insert and transpose ops until we find a fold.
Value ExtractFromInsertTransposeChainState::fold() {		Value ExtractFromInsertTransposeChainState::fold() {
Value valueToExtractFrom = extractOp.getVector();		Value valueToExtractFrom = extractOp.getVector();
updateStateForNextIteration(valueToExtractFrom);		updateStateForNextIteration(valueToExtractFrom);
Show All 13 Lines	while (nextInsertOp \|\| nextTransposeOp) {

// Case 3: if the inserted position is a prefix of extractPosition, we can		// Case 3: if the inserted position is a prefix of extractPosition, we can
// just extract a portion of the source of the insert.		// just extract a portion of the source of the insert.
if (succeeded(handleInsertOpWithPrefixPos(result)))		if (succeeded(handleInsertOpWithPrefixPos(result)))
return tryToFoldExtractOpInPlace(result);		return tryToFoldExtractOpInPlace(result);

// Case 4: extractPositionRef intersects insertedPosRef on non-sentinel		// Case 4: extractPositionRef intersects insertedPosRef on non-sentinel
// values. This is a more difficult case and we bail.		// values. This is a more difficult case and we bail.
auto insertedPos = extractVector<int64_t>(nextInsertOp.getPosition());		ArrayRef<int64_t> insertedPos = nextInsertOp.getPosition();
if (isContainedWithin(extractPosition, insertedPos) \|\|		if (isContainedWithin(extractPosition, insertedPos) \|\|
intersectsWhereNonNegative(extractPosition, insertedPos))		intersectsWhereNonNegative(extractPosition, insertedPos))
return Value();		return Value();

// Case 5: No intersection, we forward the extract to insertOp.dest().		// Case 5: No intersection, we forward the extract to insertOp.dest().
valueToExtractFrom = nextInsertOp.getDest();		valueToExtractFrom = nextInsertOp.getDest();
updateStateForNextIteration(valueToExtractFrom);		updateStateForNextIteration(valueToExtractFrom);
}		}
▲ Show 20 Lines • Show All 48 Lines • ▼ Show 20 Lines	static Value foldExtractFromBroadcast(ExtractOp extractOp) {

auto broadcastOp = cast<vector::BroadcastOp>(defOp);		auto broadcastOp = cast<vector::BroadcastOp>(defOp);
int64_t rankDiff = broadcastSrcRank - extractResultRank;		int64_t rankDiff = broadcastSrcRank - extractResultRank;
// Detect all the positions that come from "dim-1" broadcasting.		// Detect all the positions that come from "dim-1" broadcasting.
// These dimensions correspond to "dim-1" broadcasted dims; set the mathching		// These dimensions correspond to "dim-1" broadcasted dims; set the mathching
// extract position to `0` when extracting from the source operand.		// extract position to `0` when extracting from the source operand.
llvm::SetVector<int64_t> broadcastedUnitDims =		llvm::SetVector<int64_t> broadcastedUnitDims =
broadcastOp.computeBroadcastedUnitDims();		broadcastOp.computeBroadcastedUnitDims();
auto extractPos = extractVector<int64_t>(extractOp.getPosition());		SmallVector<int64_t> extractPos(extractOp.getPosition());
for (int64_t i = rankDiff, e = extractPos.size(); i < e; ++i)		for (int64_t i = rankDiff, e = extractPos.size(); i < e; ++i)
if (broadcastedUnitDims.contains(i))		if (broadcastedUnitDims.contains(i))
extractPos[i] = 0;		extractPos[i] = 0;
// `rankDiff` leading dimensions correspond to new broadcasted dims, drop the		// `rankDiff` leading dimensions correspond to new broadcasted dims, drop the
// matching extract position when extracting from the source operand.		// matching extract position when extracting from the source operand.
extractPos.erase(extractPos.begin(),		extractPos.erase(extractPos.begin(),
std::next(extractPos.begin(), extractPos.size() - rankDiff));		std::next(extractPos.begin(), extractPos.size() - rankDiff));
// OpBuilder is only used as a helper to build an I64ArrayAttr.		// OpBuilder is only used as a helper to build an I64ArrayAttr.
OpBuilder b(extractOp.getContext());		OpBuilder b(extractOp.getContext());
extractOp.setOperand(source);		extractOp.setOperand(source);
extractOp.setPositionAttr(b.getI64ArrayAttr(extractPos));		extractOp.setPosition(extractPos);
return extractOp.getResult();		return extractOp.getResult();
}		}

// Fold extractOp with source coming from ShapeCast op.		// Fold extractOp with source coming from ShapeCast op.
static Value foldExtractFromShapeCast(ExtractOp extractOp) {		static Value foldExtractFromShapeCast(ExtractOp extractOp) {
auto shapeCastOp = extractOp.getVector().getDefiningOp<vector::ShapeCastOp>();		auto shapeCastOp = extractOp.getVector().getDefiningOp<vector::ShapeCastOp>();
if (!shapeCastOp)		if (!shapeCastOp)
return Value();		return Value();
Show All 22 Lines	for (int64_t i = 0; i < destinationRank; i++) {
// TODO: This case could be support in a canonicalization pattern.		// TODO: This case could be support in a canonicalization pattern.
if (getDimReverse(shapeCastOp.getSourceVectorType(), i) !=		if (getDimReverse(shapeCastOp.getSourceVectorType(), i) !=
getDimReverse(destinationType, i))		getDimReverse(destinationType, i))
return Value();		return Value();
}		}
}		}
// Extract the strides associated with the extract op vector source. Then use		// Extract the strides associated with the extract op vector source. Then use
// this to calculate a linearized position for the extract.		// this to calculate a linearized position for the extract.
auto extractedPos = extractVector<int64_t>(extractOp.getPosition());		SmallVector<int64_t> extractedPos(extractOp.getPosition());
std::reverse(extractedPos.begin(), extractedPos.end());		std::reverse(extractedPos.begin(), extractedPos.end());
SmallVector<int64_t, 4> strides;		SmallVector<int64_t, 4> strides;
int64_t stride = 1;		int64_t stride = 1;
for (int64_t i = 0, e = extractedPos.size(); i < e; i++) {		for (int64_t i = 0, e = extractedPos.size(); i < e; i++) {
strides.push_back(stride);		strides.push_back(stride);
stride *=		stride *=
getDimReverse(extractOp.getSourceVectorType(), i + destinationRank);		getDimReverse(extractOp.getSourceVectorType(), i + destinationRank);
}		}
Show All 9 Lines	for (int64_t i = 0; i < numDimension; i++) {
newStrides.push_back(stride);		newStrides.push_back(stride);
stride *=		stride *=
getDimReverse(shapeCastOp.getSourceVectorType(), i + destinationRank);		getDimReverse(shapeCastOp.getSourceVectorType(), i + destinationRank);
}		}
std::reverse(newStrides.begin(), newStrides.end());		std::reverse(newStrides.begin(), newStrides.end());
SmallVector<int64_t, 4> newPosition = delinearize(position, newStrides);		SmallVector<int64_t, 4> newPosition = delinearize(position, newStrides);
// OpBuilder is only used as a helper to build an I64ArrayAttr.		// OpBuilder is only used as a helper to build an I64ArrayAttr.
OpBuilder b(extractOp.getContext());		OpBuilder b(extractOp.getContext());
extractOp.setPositionAttr(b.getI64ArrayAttr(newPosition));		extractOp.setPosition(newPosition);
extractOp.setOperand(shapeCastOp.getSource());		extractOp.setOperand(shapeCastOp.getSource());
return extractOp.getResult();		return extractOp.getResult();
}		}

/// Fold an ExtractOp from ExtractStridedSliceOp.		/// Fold an ExtractOp from ExtractStridedSliceOp.
static Value foldExtractFromExtractStrided(ExtractOp extractOp) {		static Value foldExtractFromExtractStrided(ExtractOp extractOp) {
auto extractStridedSliceOp =		auto extractStridedSliceOp =
extractOp.getVector().getDefiningOp<vector::ExtractStridedSliceOp>();		extractOp.getVector().getDefiningOp<vector::ExtractStridedSliceOp>();
Show All 23 Lines	static Value foldExtractFromExtractStrided(ExtractOp extractOp) {
unsigned destinationRank = 0;		unsigned destinationRank = 0;
if (auto vecType = llvm::dyn_cast<VectorType>(extractOp.getType()))		if (auto vecType = llvm::dyn_cast<VectorType>(extractOp.getType()))
destinationRank = vecType.getRank();		destinationRank = vecType.getRank();
// The dimensions of the result need to be untouched by the		// The dimensions of the result need to be untouched by the
// extractStridedSlice op.		// extractStridedSlice op.
if (destinationRank > extractStridedSliceOp.getSourceVectorType().getRank() -		if (destinationRank > extractStridedSliceOp.getSourceVectorType().getRank() -
sliceOffsets.size())		sliceOffsets.size())
return Value();		return Value();
auto extractedPos = extractVector<int64_t>(extractOp.getPosition());		SmallVector<int64_t> extractedPos(extractOp.getPosition());
assert(extractedPos.size() >= sliceOffsets.size());		assert(extractedPos.size() >= sliceOffsets.size());
for (size_t i = 0, e = sliceOffsets.size(); i < e; i++)		for (size_t i = 0, e = sliceOffsets.size(); i < e; i++)
extractedPos[i] = extractedPos[i] + sliceOffsets[i];		extractedPos[i] = extractedPos[i] + sliceOffsets[i];
extractOp.getVectorMutable().assign(extractStridedSliceOp.getVector());		extractOp.getVectorMutable().assign(extractStridedSliceOp.getVector());
// OpBuilder is only used as a helper to build an I64ArrayAttr.		// OpBuilder is only used as a helper to build an I64ArrayAttr.
OpBuilder b(extractOp.getContext());		OpBuilder b(extractOp.getContext());
extractOp.setPositionAttr(b.getI64ArrayAttr(extractedPos));		extractOp.setPosition(extractedPos);
return extractOp.getResult();		return extractOp.getResult();
}		}

/// Fold extract_op fed from a chain of insertStridedSlice ops.		/// Fold extract_op fed from a chain of insertStridedSlice ops.
static Value foldExtractStridedOpFromInsertChain(ExtractOp extractOp) {		static Value foldExtractStridedOpFromInsertChain(ExtractOp extractOp) {
int64_t destinationRank =		int64_t destinationRank =
llvm::isa<VectorType>(extractOp.getType())		llvm::isa<VectorType>(extractOp.getType())
? llvm::cast<VectorType>(extractOp.getType()).getRank()		? llvm::cast<VectorType>(extractOp.getType()).getRank()
: 0;		: 0;
auto insertOp = extractOp.getVector().getDefiningOp<InsertStridedSliceOp>();		auto insertOp = extractOp.getVector().getDefiningOp<InsertStridedSliceOp>();
if (!insertOp)		if (!insertOp)
return Value();		return Value();

// 0-D vectors not supported.		// 0-D vectors not supported.
assert(!hasZeroDimVectors(extractOp) && "0-D vectors not supported");		assert(!hasZeroDimVectors(extractOp) && "0-D vectors not supported");
if (hasZeroDimVectors(insertOp))		if (hasZeroDimVectors(insertOp))
return Value();		return Value();

while (insertOp) {		while (insertOp) {
int64_t insertRankDiff = insertOp.getDestVectorType().getRank() -		int64_t insertRankDiff = insertOp.getDestVectorType().getRank() -
insertOp.getSourceVectorType().getRank();		insertOp.getSourceVectorType().getRank();
if (destinationRank > insertOp.getSourceVectorType().getRank())		if (destinationRank > insertOp.getSourceVectorType().getRank())
return Value();		return Value();
auto insertOffsets = extractVector<int64_t>(insertOp.getOffsets());		auto insertOffsets = extractVector<int64_t>(insertOp.getOffsets());
auto extractOffsets = extractVector<int64_t>(extractOp.getPosition());		ArrayRef<int64_t> extractOffsets = extractOp.getPosition();

if (llvm::any_of(insertOp.getStrides(), [](Attribute attr) {		if (llvm::any_of(insertOp.getStrides(), [](Attribute attr) {
return llvm::cast<IntegerAttr>(attr).getInt() != 1;		return llvm::cast<IntegerAttr>(attr).getInt() != 1;
}))		}))
return Value();		return Value();
bool disjoint = false;		bool disjoint = false;
SmallVector<int64_t, 4> offsetDiffs;		SmallVector<int64_t, 4> offsetDiffs;
for (unsigned dim = 0, e = extractOffsets.size(); dim < e; ++dim) {		for (unsigned dim = 0, e = extractOffsets.size(); dim < e; ++dim) {
Show All 23 Lines	if (!disjoint) {
if (insertOp.getSourceVectorType().getDimSize(i + srcRankDiff) !=		if (insertOp.getSourceVectorType().getDimSize(i + srcRankDiff) !=
insertOp.getDestVectorType().getDimSize(i + srcRankDiff +		insertOp.getDestVectorType().getDimSize(i + srcRankDiff +
insertRankDiff))		insertRankDiff))
return Value();		return Value();
}		}
extractOp.getVectorMutable().assign(insertOp.getSource());		extractOp.getVectorMutable().assign(insertOp.getSource());
// OpBuilder is only used as a helper to build an I64ArrayAttr.		// OpBuilder is only used as a helper to build an I64ArrayAttr.
OpBuilder b(extractOp.getContext());		OpBuilder b(extractOp.getContext());
extractOp.setPositionAttr(b.getI64ArrayAttr(offsetDiffs));		extractOp.setPosition(offsetDiffs);
return extractOp.getResult();		return extractOp.getResult();
}		}
// If the chunk extracted is disjoint from the chunk inserted, keep		// If the chunk extracted is disjoint from the chunk inserted, keep
// looking in the insert chain.		// looking in the insert chain.
insertOp = insertOp.getDest().getDefiningOp<InsertStridedSliceOp>();		insertOp = insertOp.getDest().getDefiningOp<InsertStridedSliceOp>();
}		}
return Value();		return Value();
}		}
▲ Show 20 Lines • Show All 103 Lines • ▼ Show 20 Lines	LogicalResult matchAndRewrite(ExtractOp extractOp,
// The splat case is handled by `ExtractOpSplatConstantFolder`.		// The splat case is handled by `ExtractOpSplatConstantFolder`.
auto dense = llvm::dyn_cast<DenseElementsAttr>(vectorCst);		auto dense = llvm::dyn_cast<DenseElementsAttr>(vectorCst);
if (!dense \|\| dense.isSplat())		if (!dense \|\| dense.isSplat())
return failure();		return failure();

// Calculate the linearized position of the continuous chunk of elements to		// Calculate the linearized position of the continuous chunk of elements to
// extract.		// extract.
llvm::SmallVector<int64_t> completePositions(vecTy.getRank(), 0);		llvm::SmallVector<int64_t> completePositions(vecTy.getRank(), 0);
copy(getI64SubArray(extractOp.getPosition()), completePositions.begin());		copy(extractOp.getPosition(), completePositions.begin());
int64_t elemBeginPosition =		int64_t elemBeginPosition =
linearize(completePositions, computeStrides(vecTy.getShape()));		linearize(completePositions, computeStrides(vecTy.getShape()));
auto denseValuesBegin = dense.value_begin<TypedAttr>() + elemBeginPosition;		auto denseValuesBegin = dense.value_begin<TypedAttr>() + elemBeginPosition;

TypedAttr newAttr;		TypedAttr newAttr;
if (auto resVecTy = llvm::dyn_cast<VectorType>(extractOp.getType())) {		if (auto resVecTy = llvm::dyn_cast<VectorType>(extractOp.getType())) {
SmallVector<Attribute> elementValues(		SmallVector<Attribute> elementValues(
denseValuesBegin, denseValuesBegin + resVecTy.getNumElements());		denseValuesBegin, denseValuesBegin + resVecTy.getNumElements());
▲ Show 20 Lines • Show All 476 Lines • ▼ Show 20 Lines	OpFoldResult vector::InsertElementOp::fold(FoldAdaptor adaptor) {

return DenseElementsAttr::get(getDestVectorType(), results);		return DenseElementsAttr::get(getDestVectorType(), results);
}		}

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// InsertOp		// InsertOp
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

void InsertOp::build(OpBuilder &builder, OperationState &result, Value source,
Value dest, ArrayRef<int64_t> position) {
result.addOperands({source, dest});
auto positionAttr = getVectorSubscriptAttr(builder, position);
result.addTypes(dest.getType());
result.addAttribute(InsertOp::getPositionAttrName(result.name), positionAttr);
}

// Convenience builder which assumes the values are constant indices.		// Convenience builder which assumes the values are constant indices.
void InsertOp::build(OpBuilder &builder, OperationState &result, Value source,		void InsertOp::build(OpBuilder &builder, OperationState &result, Value source,
Value dest, ValueRange position) {		Value dest, ValueRange position) {
SmallVector<int64_t, 4> positionConstants =		SmallVector<int64_t, 4> positionConstants =
llvm::to_vector<4>(llvm::map_range(position, [](Value pos) {		llvm::to_vector<4>(llvm::map_range(position, [](Value pos) {
return getConstantIntValue(pos).value();		return getConstantIntValue(pos).value();
}));		}));
build(builder, result, source, dest, positionConstants);		build(builder, result, source, dest, positionConstants);
}		}

LogicalResult InsertOp::verify() {		LogicalResult InsertOp::verify() {
auto positionAttr = getPosition().getValue();		ArrayRef<int64_t> position = getPosition();
auto destVectorType = getDestVectorType();		auto destVectorType = getDestVectorType();
if (positionAttr.size() > static_cast<unsigned>(destVectorType.getRank()))		if (position.size() > static_cast<unsigned>(destVectorType.getRank()))
return emitOpError(		return emitOpError(
"expected position attribute of rank no greater than dest vector rank");		"expected position attribute of rank no greater than dest vector rank");
auto srcVectorType = llvm::dyn_cast<VectorType>(getSourceType());		auto srcVectorType = llvm::dyn_cast<VectorType>(getSourceType());
if (srcVectorType &&		if (srcVectorType &&
(static_cast<unsigned>(srcVectorType.getRank()) + positionAttr.size() !=		(static_cast<unsigned>(srcVectorType.getRank()) + position.size() !=
static_cast<unsigned>(destVectorType.getRank())))		static_cast<unsigned>(destVectorType.getRank())))
return emitOpError("expected position attribute rank + source rank to "		return emitOpError("expected position attribute rank + source rank to "
"match dest vector rank");		"match dest vector rank");
if (!srcVectorType &&		if (!srcVectorType &&
(positionAttr.size() != static_cast<unsigned>(destVectorType.getRank())))		(position.size() != static_cast<unsigned>(destVectorType.getRank())))
return emitOpError(		return emitOpError(
"expected position attribute rank to match the dest vector rank");		"expected position attribute rank to match the dest vector rank");
for (const auto &en : llvm::enumerate(positionAttr)) {		for (const auto &en : llvm::enumerate(position)) {
auto attr = llvm::dyn_cast<IntegerAttr>(en.value());		int64_t attr = en.value();
if (!attr \|\| attr.getInt() < 0 \|\|		if (attr < 0 \|\| attr >= destVectorType.getDimSize(en.index()))
attr.getInt() >= destVectorType.getDimSize(en.index()))
return emitOpError("expected position attribute #")		return emitOpError("expected position attribute #")
<< (en.index() + 1)		<< (en.index() + 1)
<< " to be a non-negative integer smaller than the corresponding "		<< " to be a non-negative integer smaller than the corresponding "
"dest vector dimension";		"dest vector dimension";
}		}
return success();		return success();
}		}

▲ Show 20 Lines • Show All 70 Lines • ▼ Show 20 Lines	LogicalResult matchAndRewrite(InsertOp op,
Value sourceValue = op.getSource();		Value sourceValue = op.getSource();
Attribute sourceCst;		Attribute sourceCst;
if (!matchPattern(sourceValue, m_Constant(&sourceCst)))		if (!matchPattern(sourceValue, m_Constant(&sourceCst)))
return failure();		return failure();

// Calculate the linearized position of the continuous chunk of elements to		// Calculate the linearized position of the continuous chunk of elements to
// insert.		// insert.
llvm::SmallVector<int64_t> completePositions(destTy.getRank(), 0);		llvm::SmallVector<int64_t> completePositions(destTy.getRank(), 0);
copy(getI64SubArray(op.getPosition()), completePositions.begin());		copy(op.getPosition(), completePositions.begin());
int64_t insertBeginPosition =		int64_t insertBeginPosition =
linearize(completePositions, computeStrides(destTy.getShape()));		linearize(completePositions, computeStrides(destTy.getShape()));

SmallVector<Attribute> insertedValues;		SmallVector<Attribute> insertedValues;
if (auto denseSource = llvm::dyn_cast<DenseElementsAttr>(sourceCst))		if (auto denseSource = llvm::dyn_cast<DenseElementsAttr>(sourceCst))
llvm::append_range(insertedValues, denseSource.getValues<Attribute>());		llvm::append_range(insertedValues, denseSource.getValues<Attribute>());
else		else
insertedValues.push_back(sourceCst);		insertedValues.push_back(sourceCst);
▲ Show 20 Lines • Show All 3,490 Lines • Show Last 20 Lines

mlir/lib/Dialect/Vector/Transforms/LowerVectorContract.cpp

Show First 20 Lines • Show All 85 Lines • ▼ Show 20 Lines
// TODO		// TODO
static Value reshapeLoad(Location loc, Value val, VectorType type,		static Value reshapeLoad(Location loc, Value val, VectorType type,
int64_t index, int64_t pos,		int64_t index, int64_t pos,
PatternRewriter &rewriter) {		PatternRewriter &rewriter) {
if (index == -1)		if (index == -1)
return val;		return val;
Type lowType = VectorType::Builder(type).dropDim(0);		Type lowType = VectorType::Builder(type).dropDim(0);
// At extraction dimension?		// At extraction dimension?
if (index == 0) {		if (index == 0)
auto posAttr = rewriter.getI64ArrayAttr(pos);		return rewriter.create<vector::ExtractOp>(loc, lowType, val, pos);
return rewriter.create<vector::ExtractOp>(loc, lowType, val, posAttr);
}
// Unroll leading dimensions.		// Unroll leading dimensions.
VectorType vType = cast<VectorType>(lowType);		VectorType vType = cast<VectorType>(lowType);
Type resType = VectorType::Builder(type).dropDim(index);		Type resType = VectorType::Builder(type).dropDim(index);
auto resVectorType = cast<VectorType>(resType);		auto resVectorType = cast<VectorType>(resType);
Value result = rewriter.create<arith::ConstantOp>(		Value result = rewriter.create<arith::ConstantOp>(
loc, resVectorType, rewriter.getZeroAttr(resVectorType));		loc, resVectorType, rewriter.getZeroAttr(resVectorType));
for (int64_t d = 0, e = resVectorType.getDimSize(0); d < e; d++) {		for (int64_t d = 0, e = resVectorType.getDimSize(0); d < e; d++) {
auto posAttr = rewriter.getI64ArrayAttr(d);		Value ext = rewriter.create<vector::ExtractOp>(loc, vType, val, d);
Value ext = rewriter.create<vector::ExtractOp>(loc, vType, val, posAttr);
Value load = reshapeLoad(loc, ext, vType, index - 1, pos, rewriter);		Value load = reshapeLoad(loc, ext, vType, index - 1, pos, rewriter);
result = rewriter.create<vector::InsertOp>(loc, resVectorType, load, result,		result =
posAttr);		rewriter.create<vector::InsertOp>(loc, resVectorType, load, result, d);
}		}
return result;		return result;
}		}

// Helper method to possibly drop a dimension in a store.		// Helper method to possibly drop a dimension in a store.
// TODO		// TODO
static Value reshapeStore(Location loc, Value val, Value result,		static Value reshapeStore(Location loc, Value val, Value result,
VectorType type, int64_t index, int64_t pos,		VectorType type, int64_t index, int64_t pos,
PatternRewriter &rewriter) {		PatternRewriter &rewriter) {
// Unmodified?		// Unmodified?
if (index == -1)		if (index == -1)
return val;		return val;
// At insertion dimension?		// At insertion dimension?
if (index == 0) {		if (index == 0)
auto posAttr = rewriter.getI64ArrayAttr(pos);		return rewriter.create<vector::InsertOp>(loc, type, val, result, pos);
return rewriter.create<vector::InsertOp>(loc, type, val, result, posAttr);
}
// Unroll leading dimensions.		// Unroll leading dimensions.
Type lowType = VectorType::Builder(type).dropDim(0);		Type lowType = VectorType::Builder(type).dropDim(0);
VectorType vType = cast<VectorType>(lowType);		VectorType vType = cast<VectorType>(lowType);
Type insType = VectorType::Builder(vType).dropDim(0);		Type insType = VectorType::Builder(vType).dropDim(0);
for (int64_t d = 0, e = type.getDimSize(0); d < e; d++) {		for (int64_t d = 0, e = type.getDimSize(0); d < e; d++) {
auto posAttr = rewriter.getI64ArrayAttr(d);		Value ext = rewriter.create<vector::ExtractOp>(loc, vType, result, d);
Value ext = rewriter.create<vector::ExtractOp>(loc, vType, result, posAttr);		Value ins = rewriter.create<vector::ExtractOp>(loc, insType, val, d);
Value ins = rewriter.create<vector::ExtractOp>(loc, insType, val, posAttr);
Value sto = reshapeStore(loc, ins, ext, vType, index - 1, pos, rewriter);		Value sto = reshapeStore(loc, ins, ext, vType, index - 1, pos, rewriter);
result = rewriter.create<vector::InsertOp>(loc, type, sto, result, posAttr);		result = rewriter.create<vector::InsertOp>(loc, type, sto, result, d);
}		}
return result;		return result;
}		}

/// Helper to create arithmetic operation associated with a kind of contraction.		/// Helper to create arithmetic operation associated with a kind of contraction.
static std::optional<Value>		static std::optional<Value>
createContractArithOp(Location loc, Value x, Value y, Value acc,		createContractArithOp(Location loc, Value x, Value y, Value acc,
vector::CombiningKind kind, PatternRewriter &rewriter,		vector::CombiningKind kind, PatternRewriter &rewriter,
▲ Show 20 Lines • Show All 673 Lines • ▼ Show 20 Lines	if (!rhsDims.empty()) {
VectorType::get(rhsDims, contractOp.getRhsType().getElementType());		VectorType::get(rhsDims, contractOp.getRhsType().getElementType());
newRhs = rewriter.create<vector::BroadcastOp>(loc, expandedType, newRhs);		newRhs = rewriter.create<vector::BroadcastOp>(loc, expandedType, newRhs);
}		}
bool isInt = contractOp.getLhsType().getElementType().isIntOrIndex();		bool isInt = contractOp.getLhsType().getElementType().isIntOrIndex();
newLhs = rewriter.create<vector::TransposeOp>(loc, newLhs, lhsTranspose);		newLhs = rewriter.create<vector::TransposeOp>(loc, newLhs, lhsTranspose);
newRhs = rewriter.create<vector::TransposeOp>(loc, newRhs, rhsTranspose);		newRhs = rewriter.create<vector::TransposeOp>(loc, newRhs, rhsTranspose);
SmallVector<int64_t> lhsOffsets(lhsReductionDims.size(), 0);		SmallVector<int64_t> lhsOffsets(lhsReductionDims.size(), 0);
SmallVector<int64_t> rhsOffsets(rhsReductionDims.size(), 0);		SmallVector<int64_t> rhsOffsets(rhsReductionDims.size(), 0);
newLhs = rewriter.create<vector::ExtractOp>(		newLhs = rewriter.create<vector::ExtractOp>(loc, newLhs, lhsOffsets);
loc, newLhs, rewriter.getI64ArrayAttr(lhsOffsets));		newRhs = rewriter.create<vector::ExtractOp>(loc, newRhs, rhsOffsets);
newRhs = rewriter.create<vector::ExtractOp>(
loc, newRhs, rewriter.getI64ArrayAttr(rhsOffsets));
std::optional<Value> result =		std::optional<Value> result =
createContractArithOp(loc, newLhs, newRhs, contractOp.getAcc(),		createContractArithOp(loc, newLhs, newRhs, contractOp.getAcc(),
contractOp.getKind(), rewriter, isInt);		contractOp.getKind(), rewriter, isInt);
rewriter.replaceOp(contractOp, {*result});		rewriter.replaceOp(contractOp, {*result});
return success();		return success();
}		}

private:		private:
▲ Show 20 Lines • Show All 324 Lines • ▼ Show 20 Lines	if (!rhsType) {
return failure();		return failure();
rewriter.replaceOp(rootOp, *mult);		rewriter.replaceOp(rootOp, *mult);
return success();		return success();
}		}

Value result = rewriter.create<arith::ConstantOp>(		Value result = rewriter.create<arith::ConstantOp>(
loc, resType, rewriter.getZeroAttr(resType));		loc, resType, rewriter.getZeroAttr(resType));
for (int64_t d = 0, e = resType.getDimSize(0); d < e; ++d) {		for (int64_t d = 0, e = resType.getDimSize(0); d < e; ++d) {
auto pos = rewriter.getI64ArrayAttr(d);		Value x = rewriter.create<vector::ExtractOp>(loc, op.getLhs(), d);
Value x = rewriter.create<vector::ExtractOp>(loc, op.getLhs(), pos);
Value a = rewriter.create<vector::BroadcastOp>(loc, rhsType, x);		Value a = rewriter.create<vector::BroadcastOp>(loc, rhsType, x);
Value r = nullptr;		Value r = nullptr;
if (acc)		if (acc)
r = rewriter.create<vector::ExtractOp>(loc, acc, pos);		r = rewriter.create<vector::ExtractOp>(loc, acc, d);
Value extrMask;		Value extrMask;
if (mask)		if (mask)
extrMask = rewriter.create<vector::ExtractOp>(loc, mask, pos);		extrMask = rewriter.create<vector::ExtractOp>(loc, mask, d);

std::optional<Value> m = createContractArithOp(		std::optional<Value> m = createContractArithOp(
loc, a, op.getRhs(), r, kind, rewriter, isInt, extrMask);		loc, a, op.getRhs(), r, kind, rewriter, isInt, extrMask);
if (!m.has_value())		if (!m.has_value())
return failure();		return failure();
result = rewriter.create<vector::InsertOp>(loc, resType, *m, result, pos);		result = rewriter.create<vector::InsertOp>(loc, resType, *m, result, d);
}		}

rewriter.replaceOp(rootOp, result);		rewriter.replaceOp(rootOp, result);
return success();		return success();
}		}
};		};

/// Progressively lower a `vector.contract %a, %b, %c` with row-major matmul		/// Progressively lower a `vector.contract %a, %b, %c` with row-major matmul
▲ Show 20 Lines • Show All 123 Lines • Show Last 20 Lines

mlir/lib/Dialect/Vector/Transforms/LowerVectorMask.cpp

Show First 20 Lines • Show All 71 Lines • ▼ Show 20 Lines	LogicalResult matchAndRewrite(vector::CreateMaskOp op,
Value result = rewriter.create<arith::ConstantOp>(		Value result = rewriter.create<arith::ConstantOp>(
loc, dstType, rewriter.getZeroAttr(dstType));		loc, dstType, rewriter.getZeroAttr(dstType));
for (int64_t d = 0; d < dim; d++) {		for (int64_t d = 0; d < dim; d++) {
Value bnd =		Value bnd =
rewriter.create<arith::ConstantOp>(loc, rewriter.getIndexAttr(d));		rewriter.create<arith::ConstantOp>(loc, rewriter.getIndexAttr(d));
Value val = rewriter.create<arith::CmpIOp>(loc, arith::CmpIPredicate::slt,		Value val = rewriter.create<arith::CmpIOp>(loc, arith::CmpIPredicate::slt,
bnd, idx);		bnd, idx);
Value sel = rewriter.create<arith::SelectOp>(loc, val, trueVal, falseVal);		Value sel = rewriter.create<arith::SelectOp>(loc, val, trueVal, falseVal);
auto pos = rewriter.getI64ArrayAttr(d);		result = rewriter.create<vector::InsertOp>(loc, dstType, sel, result, d);
result =
rewriter.create<vector::InsertOp>(loc, dstType, sel, result, pos);
}		}
rewriter.replaceOp(op, result);		rewriter.replaceOp(op, result);
return success();		return success();
}		}
};		};

/// Progressive lowering of ConstantMaskOp.		/// Progressive lowering of ConstantMaskOp.
/// One:		/// One:
▲ Show 20 Lines • Show All 55 Lines • ▼ Show 20 Lines	VectorType lowType =
VectorType::get(dstType.getShape().drop_front(), eltType);		VectorType::get(dstType.getShape().drop_front(), eltType);
SmallVector<int64_t> newDimSizes;		SmallVector<int64_t> newDimSizes;
for (int64_t r = 1; r < rank; r++)		for (int64_t r = 1; r < rank; r++)
newDimSizes.push_back(cast<IntegerAttr>(dimSizes[r]).getInt());		newDimSizes.push_back(cast<IntegerAttr>(dimSizes[r]).getInt());
Value trueVal = rewriter.create<vector::ConstantMaskOp>(		Value trueVal = rewriter.create<vector::ConstantMaskOp>(
loc, lowType, rewriter.getI64ArrayAttr(newDimSizes));		loc, lowType, rewriter.getI64ArrayAttr(newDimSizes));
Value result = rewriter.create<arith::ConstantOp>(		Value result = rewriter.create<arith::ConstantOp>(
loc, dstType, rewriter.getZeroAttr(dstType));		loc, dstType, rewriter.getZeroAttr(dstType));
for (int64_t d = 0; d < trueDim; d++) {		for (int64_t d = 0; d < trueDim; d++)
auto pos = rewriter.getI64ArrayAttr(d);
result =		result =
rewriter.create<vector::InsertOp>(loc, dstType, trueVal, result, pos);		rewriter.create<vector::InsertOp>(loc, dstType, trueVal, result, d);
}
rewriter.replaceOp(op, result);		rewriter.replaceOp(op, result);
return success();		return success();
}		}
};		};
} // namespace		} // namespace

void mlir::vector::populateVectorMaskOpLoweringPatterns(		void mlir::vector::populateVectorMaskOpLoweringPatterns(
RewritePatternSet &patterns, PatternBenefit benefit) {		RewritePatternSet &patterns, PatternBenefit benefit) {
▲ Show 20 Lines • Show All 145 Lines • Show Last 20 Lines

mlir/lib/Dialect/Vector/Transforms/VectorDistribute.cpp

Show First 20 Lines • Show All 938 Lines • ▼ Show 20 Lines	LogicalResult matchAndRewrite(WarpExecuteOnLane0Op warpOp,

// "vector.extract %v[] : vector<...xf32>" can be canonicalized to %v.		// "vector.extract %v[] : vector<...xf32>" can be canonicalized to %v.
if (extractOp.getPosition().empty())		if (extractOp.getPosition().empty())
return failure();		return failure();

// Rewrite vector.extract with 1d source to vector.extractelement.		// Rewrite vector.extract with 1d source to vector.extractelement.
if (extractSrcType.getRank() == 1) {		if (extractSrcType.getRank() == 1) {
assert(extractOp.getPosition().size() == 1 && "expected 1 index");		assert(extractOp.getPosition().size() == 1 && "expected 1 index");
int64_t pos = cast<IntegerAttr>(extractOp.getPosition()[0]).getInt();		int64_t pos = extractOp.getPosition()[0];
rewriter.setInsertionPoint(extractOp);		rewriter.setInsertionPoint(extractOp);
rewriter.replaceOpWithNewOp<vector::ExtractElementOp>(		rewriter.replaceOpWithNewOp<vector::ExtractElementOp>(
extractOp, extractOp.getVector(),		extractOp, extractOp.getVector(),
rewriter.create<arith::ConstantIndexOp>(loc, pos));		rewriter.create<arith::ConstantIndexOp>(loc, pos));
return success();		return success();
}		}

// All following cases are 2d or higher dimensional source vectors.		// All following cases are 2d or higher dimensional source vectors.
▲ Show 20 Lines • Show All 240 Lines • ▼ Show 20 Lines	LogicalResult matchAndRewrite(WarpExecuteOnLane0Op warpOp,

// "vector.insert %v, %v[] : ..." can be canonicalized to %v.		// "vector.insert %v, %v[] : ..." can be canonicalized to %v.
if (insertOp.getPosition().empty())		if (insertOp.getPosition().empty())
return failure();		return failure();

// Rewrite vector.insert with 1d dest to vector.insertelement.		// Rewrite vector.insert with 1d dest to vector.insertelement.
if (insertOp.getDestVectorType().getRank() == 1) {		if (insertOp.getDestVectorType().getRank() == 1) {
assert(insertOp.getPosition().size() == 1 && "expected 1 index");		assert(insertOp.getPosition().size() == 1 && "expected 1 index");
int64_t pos = cast<IntegerAttr>(insertOp.getPosition()[0]).getInt();		int64_t pos = insertOp.getPosition()[0];
rewriter.setInsertionPoint(insertOp);		rewriter.setInsertionPoint(insertOp);
rewriter.replaceOpWithNewOp<vector::InsertElementOp>(		rewriter.replaceOpWithNewOp<vector::InsertElementOp>(
insertOp, insertOp.getSource(), insertOp.getDest(),		insertOp, insertOp.getSource(), insertOp.getDest(),
rewriter.create<arith::ConstantIndexOp>(loc, pos));		rewriter.create<arith::ConstantIndexOp>(loc, pos));
return success();		return success();
}		}

if (warpOp.getResult(operandNumber).getType() == operand->get().getType()) {		if (warpOp.getResult(operandNumber).getType() == operand->get().getType()) {
▲ Show 20 Lines • Show All 58 Lines • ▼ Show 20 Lines	LogicalResult matchAndRewrite(WarpExecuteOnLane0Op warpOp,
Value newResult;		Value newResult;
if (distrSrcDim >= 0) {		if (distrSrcDim >= 0) {
// Every lane inserts a small piece.		// Every lane inserts a small piece.
newResult = rewriter.create<vector::InsertOp>(		newResult = rewriter.create<vector::InsertOp>(
loc, distributedSrc, distributedDest, insertOp.getPosition());		loc, distributedSrc, distributedDest, insertOp.getPosition());
} else {		} else {
// One lane inserts the entire source vector.		// One lane inserts the entire source vector.
int64_t elementsPerLane = distrDestType.getDimSize(distrDestDim);		int64_t elementsPerLane = distrDestType.getDimSize(distrDestDim);
SmallVector<int64_t> newPos = llvm::to_vector(		SmallVector<int64_t> newPos(insertOp.getPosition());
llvm::map_range(insertOp.getPosition(), [](Attribute attr) {
return cast<IntegerAttr>(attr).getInt();
}));
// tid of inserting lane: pos / elementsPerLane		// tid of inserting lane: pos / elementsPerLane
Value insertingLane = rewriter.create<arith::ConstantIndexOp>(		Value insertingLane = rewriter.create<arith::ConstantIndexOp>(
loc, newPos[distrDestDim] / elementsPerLane);		loc, newPos[distrDestDim] / elementsPerLane);
Value isInsertingLane = rewriter.create<arith::CmpIOp>(		Value isInsertingLane = rewriter.create<arith::CmpIOp>(
loc, arith::CmpIPredicate::eq, newWarpOp.getLaneid(), insertingLane);		loc, arith::CmpIPredicate::eq, newWarpOp.getLaneid(), insertingLane);
// Insert position: pos % elementsPerLane		// Insert position: pos % elementsPerLane
newPos[distrDestDim] %= elementsPerLane;		newPos[distrDestDim] %= elementsPerLane;
auto insertingBuilder = [&](OpBuilder &builder, Location loc) {		auto insertingBuilder = [&](OpBuilder &builder, Location loc) {
▲ Show 20 Lines • Show All 316 Lines • Show Last 20 Lines

mlir/lib/Dialect/Vector/Transforms/VectorDropLeadUnitDim.cpp

Show First 20 Lines • Show All 159 Lines • ▼ Show 20 Lines	LogicalResult matchAndRewrite(vector::InsertOp insertOp,
}		}
Value newDstVector = rewriter.create<vector::ExtractOp>(		Value newDstVector = rewriter.create<vector::ExtractOp>(
loc, insertOp.getDest(), splatZero(dstDropCount));		loc, insertOp.getDest(), splatZero(dstDropCount));

// New position rank needs to be computed in two steps: (1) if destination		// New position rank needs to be computed in two steps: (1) if destination
// type has leading unit dims, we also trim the position array accordingly,		// type has leading unit dims, we also trim the position array accordingly,
// then (2) if source type also has leading unit dims, we need to append		// then (2) if source type also has leading unit dims, we need to append
// zeroes to the position array accordingly.		// zeroes to the position array accordingly.
unsigned oldPosRank = insertOp.getPosition().getValue().size();		unsigned oldPosRank = insertOp.getPosition().size();
unsigned newPosRank = std::max<int64_t>(0, oldPosRank - dstDropCount);		unsigned newPosRank = std::max<int64_t>(0, oldPosRank - dstDropCount);
SmallVector<Attribute> newPositions = llvm::to_vector(		SmallVector<int64_t> newPositions =
insertOp.getPosition().getValue().take_back(newPosRank));		llvm::to_vector(insertOp.getPosition().take_back(newPosRank));
newPositions.resize(newDstType.getRank() - newSrcRank,		newPositions.resize(newDstType.getRank() - newSrcRank, 0);
rewriter.getI64IntegerAttr(0));

auto newInsertOp = rewriter.create<vector::InsertOp>(		auto newInsertOp = rewriter.create<vector::InsertOp>(
loc, newDstType, newSrcVector, newDstVector,		loc, newDstType, newSrcVector, newDstVector, newPositions);
rewriter.getArrayAttr(newPositions));

rewriter.replaceOpWithNewOp<vector::BroadcastOp>(insertOp, oldDstType,		rewriter.replaceOpWithNewOp<vector::BroadcastOp>(insertOp, oldDstType,
newInsertOp);		newInsertOp);

return success();		return success();
}		}
};		};

▲ Show 20 Lines • Show All 272 Lines • Show Last 20 Lines

mlir/lib/Dialect/Vector/Transforms/VectorTransferOpTransforms.cpp

Show First 20 Lines • Show All 698 Lines • ▼ Show 20 Lines	class RewriteScalarExtractOfTransferRead

void rewrite(vector::ExtractOp extractOp,		void rewrite(vector::ExtractOp extractOp,
PatternRewriter &rewriter) const override {		PatternRewriter &rewriter) const override {
// Construct scalar load.		// Construct scalar load.
auto xferOp = extractOp.getVector().getDefiningOp<vector::TransferReadOp>();		auto xferOp = extractOp.getVector().getDefiningOp<vector::TransferReadOp>();
SmallVector<Value> newIndices(xferOp.getIndices().begin(),		SmallVector<Value> newIndices(xferOp.getIndices().begin(),
xferOp.getIndices().end());		xferOp.getIndices().end());
for (const auto &it : llvm::enumerate(extractOp.getPosition())) {		for (const auto &it : llvm::enumerate(extractOp.getPosition())) {
int64_t offset = cast<IntegerAttr>(it.value()).getInt();		int64_t offset = it.value();
int64_t idx =		int64_t idx =
newIndices.size() - extractOp.getPosition().size() + it.index();		newIndices.size() - extractOp.getPosition().size() + it.index();
OpFoldResult ofr = affine::makeComposedFoldedAffineApply(		OpFoldResult ofr = affine::makeComposedFoldedAffineApply(
rewriter, extractOp.getLoc(),		rewriter, extractOp.getLoc(),
rewriter.getAffineSymbolExpr(0) + offset, {newIndices[idx]});		rewriter.getAffineSymbolExpr(0) + offset, {newIndices[idx]});
if (ofr.is<Value>()) {		if (ofr.is<Value>()) {
newIndices[idx] = ofr.get<Value>();		newIndices[idx] = ofr.get<Value>();
} else {		} else {
▲ Show 20 Lines • Show All 98 Lines • Show Last 20 Lines

mlir/lib/Dialect/Vector/Transforms/VectorTransforms.cpp

Show First 20 Lines • Show All 592 Lines • ▼ Show 20 Lines	LogicalResult matchAndRewrite(vector::ExtractOp extractOp,
// Only support casting to a larger number of elements or now.		// Only support casting to a larger number of elements or now.
// E.g., vector<4xf32> -> vector<8xf16>.		// E.g., vector<4xf32> -> vector<8xf16>.
if (castSrcType.getNumElements() > castDstType.getNumElements())		if (castSrcType.getNumElements() > castDstType.getNumElements())
return failure();		return failure();

unsigned expandRatio =		unsigned expandRatio =
castDstType.getNumElements() / castSrcType.getNumElements();		castDstType.getNumElements() / castSrcType.getNumElements();

auto getFirstIntValue = [](ArrayAttr attr) -> uint64_t {		uint64_t index = extractOp.getPosition()[0];
return (*attr.getAsValueRange<IntegerAttr>().begin()).getZExtValue();
};

uint64_t index = getFirstIntValue(extractOp.getPosition());

// Get the single scalar (as a vector) in the source value that packs the		// Get the single scalar (as a vector) in the source value that packs the
// desired scalar. E.g. extract vector<1xf32> from vector<4xf32>		// desired scalar. E.g. extract vector<1xf32> from vector<4xf32>
VectorType oneScalarType =		VectorType oneScalarType =
VectorType::get({1}, castSrcType.getElementType());		VectorType::get({1}, castSrcType.getElementType());
Value packedValue = rewriter.create<vector::ExtractOp>(		Value packedValue = rewriter.create<vector::ExtractOp>(
extractOp.getLoc(), oneScalarType, castOp.getSource(),		extractOp.getLoc(), oneScalarType, castOp.getSource(),
rewriter.getI64ArrayAttr(index / expandRatio));		index / expandRatio);

// Cast it to a vector with the desired scalar's type.		// Cast it to a vector with the desired scalar's type.
// E.g. f32 -> vector<2xf16>		// E.g. f32 -> vector<2xf16>
VectorType packedType =		VectorType packedType =
VectorType::get({expandRatio}, castDstType.getElementType());		VectorType::get({expandRatio}, castDstType.getElementType());
Value castedValue = rewriter.create<vector::BitCastOp>(		Value castedValue = rewriter.create<vector::BitCastOp>(
extractOp.getLoc(), packedType, packedValue);		extractOp.getLoc(), packedType, packedValue);

// Finally extract the desired scalar.		// Finally extract the desired scalar.
rewriter.replaceOpWithNewOp<vector::ExtractOp>(		rewriter.replaceOpWithNewOp<vector::ExtractOp>(
extractOp, extractOp.getType(), castedValue,		extractOp, extractOp.getType(), castedValue, index % expandRatio);
rewriter.getI64ArrayAttr(index % expandRatio));

return success();		return success();
}		}
};		};

// Shuffles vector.bitcast op after vector.extract_strided_slice op.		// Shuffles vector.bitcast op after vector.extract_strided_slice op.
//		//
// This transforms IR like:		// This transforms IR like:
▲ Show 20 Lines • Show All 751 Lines • Show Last 20 Lines

mlir/test/Conversion/VectorToSPIRV/vector-to-spirv.mlir

	Show First 20 Lines • Show All 149 Lines • ▼ Show 20 Lines

	// -----			// -----

	// CHECK-LABEL: @extract			// CHECK-LABEL: @extract
	// CHECK-SAME: %[[ARG:.+]]: vector<2xf32>			// CHECK-SAME: %[[ARG:.+]]: vector<2xf32>
	// CHECK: spirv.CompositeExtract %[[ARG]][0 : i32] : vector<2xf32>			// CHECK: spirv.CompositeExtract %[[ARG]][0 : i32] : vector<2xf32>
	// CHECK: spirv.CompositeExtract %[[ARG]][1 : i32] : vector<2xf32>			// CHECK: spirv.CompositeExtract %[[ARG]][1 : i32] : vector<2xf32>
	func.func @extract(%arg0 : vector<2xf32>) -> (vector<1xf32>, f32) {			func.func @extract(%arg0 : vector<2xf32>) -> (vector<1xf32>, f32) {
	%0 = "vector.extract"(%arg0) {position = [0]} : (vector<2xf32>) -> vector<1xf32>			%0 = "vector.extract"(%arg0) <{position = array<i64: 0>}> : (vector<2xf32>) -> vector<1xf32>
	%1 = "vector.extract"(%arg0) {position = [1]} : (vector<2xf32>) -> f32			%1 = "vector.extract"(%arg0) <{position = array<i64: 1>}> : (vector<2xf32>) -> f32
	return %0, %1: vector<1xf32>, f32			return %0, %1: vector<1xf32>, f32
	}			}

	// -----			// -----

	// CHECK-LABEL: @extract_size1_vector			// CHECK-LABEL: @extract_size1_vector
	// CHECK-SAME: %[[ARG0:.+]]: vector<1xf32>			// CHECK-SAME: %[[ARG0:.+]]: vector<1xf32>
	// CHECK: %[[R:.+]] = builtin.unrealized_conversion_cast %[[ARG0]]			// CHECK: %[[R:.+]] = builtin.unrealized_conversion_cast %[[ARG0]]
	▲ Show 20 Lines • Show All 425 Lines • Show Last 20 Lines

mlir/test/Dialect/Vector/invalid.mlir

Show First 20 Lines • Show All 127 Lines • ▼ Show 20 Lines	func.func @extract_position_rank_overflow(%arg0: vector<4x8x16xf32>) {
// expected-error@+1 {{expected position attribute of rank no greater than vector rank}}		// expected-error@+1 {{expected position attribute of rank no greater than vector rank}}
%1 = vector.extract %arg0[0, 0, 0, 0] : vector<4x8x16xf32>		%1 = vector.extract %arg0[0, 0, 0, 0] : vector<4x8x16xf32>
}		}

// -----		// -----

func.func @extract_position_rank_overflow_generic(%arg0: vector<4x8x16xf32>) {		func.func @extract_position_rank_overflow_generic(%arg0: vector<4x8x16xf32>) {
// expected-error@+1 {{expected position attribute of rank no greater than vector rank}}		// expected-error@+1 {{expected position attribute of rank no greater than vector rank}}
%1 = "vector.extract" (%arg0) { position = [0, 0, 0, 0] } : (vector<4x8x16xf32>) -> (vector<16xf32>)		%1 = "vector.extract" (%arg0) <{position = array<i64: 0, 0, 0, 0>}> : (vector<4x8x16xf32>) -> (vector<16xf32>)
}		}

// -----		// -----

func.func @extract_position_overflow(%arg0: vector<4x8x16xf32>) {		func.func @extract_position_overflow(%arg0: vector<4x8x16xf32>) {
// expected-error@+1 {{expected position attribute #2 to be a non-negative integer smaller than the corresponding vector dimension}}		// expected-error@+1 {{expected position attribute #2 to be a non-negative integer smaller than the corresponding vector dimension}}
%1 = vector.extract %arg0[0, 43, 0] : vector<4x8x16xf32>		%1 = vector.extract %arg0[0, 43, 0] : vector<4x8x16xf32>
}		}
▲ Show 20 Lines • Show All 1,543 Lines • Show Last 20 Lines

This is an archive of the discontinued LLVM Phabricator instance.

[mlir][vector] Use DenseI64ArrayAttr for ExtractOp/InsertOp positionsClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 545629

mlir/include/mlir/Dialect/Vector/IR/VectorOps.td

mlir/lib/Conversion/VectorToGPU/VectorToGPU.cpp

mlir/lib/Conversion/VectorToLLVM/ConvertVectorToLLVM.cpp

mlir/lib/Conversion/VectorToSCF/VectorToSCF.cpp

mlir/lib/Conversion/VectorToSPIRV/VectorToSPIRV.cpp

mlir/lib/Dialect/Vector/IR/VectorOps.cpp

mlir/lib/Dialect/Vector/Transforms/LowerVectorContract.cpp

mlir/lib/Dialect/Vector/Transforms/LowerVectorMask.cpp

mlir/lib/Dialect/Vector/Transforms/VectorDistribute.cpp

mlir/lib/Dialect/Vector/Transforms/VectorDropLeadUnitDim.cpp

mlir/lib/Dialect/Vector/Transforms/VectorTransferOpTransforms.cpp

mlir/lib/Dialect/Vector/Transforms/VectorTransforms.cpp

mlir/test/Conversion/VectorToSPIRV/vector-to-spirv.mlir

mlir/test/Dialect/Vector/invalid.mlir

[mlir][vector] Use DenseI64ArrayAttr for ExtractOp/InsertOp positions
ClosedPublic