This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
mlir/
-
include/mlir/Dialect/Vector/
-
mlir/
-
Dialect/
-
Vector/
5/8
VectorOps.td
-
lib/
-
Conversion/
-
AffineToStandard/
-
AffineToStandard.cpp
-
VectorToLLVM/
-
ConvertVectorToLLVM.cpp
-
Dialect/Vector/
-
Vector/
1/2
VectorOps.cpp
-
test/
-
Conversion/
-
AffineToStandard/
-
lower-affine-to-vector.mlir
-
VectorToLLVM/
-
vector-to-llvm.mlir
-
Dialect/Vector/
-
Vector/
-
invalid.mlir
-
ops.mlir

Differential D96185

[mlir][Vector] Introduce 'vector.load' and 'vector.store' ops
ClosedPublic

Authored by dcaballe on Feb 5 2021, 3:13 PM.

Download Raw Diff

Details

Reviewers

ftynse
nicolasvasilache
mehdi_amini
aartbik
sgrechanik
bondhugula

Commits

rGee66e43a96e1: [mlir][Vector] Introduce 'vector.load' and 'vector.store' ops

Summary

This patch adds the 'vector.load' and 'vector.store' ops to the Vector
dialect [1]. These operations model *contiguous* vector loads and stores
from/to memory. Their semantics are similar to the 'affine.vector_load' and
'affine.vector_store' counterparts but without the affine constraints. The
most relevant feature is that these new vector operations may perform a vector
load/store on memrefs with a non-vector element type, unlike 'std.load' and
'std.store' ops. This opens the representation to model more generic vector
load/store scenarios: unaligned vector loads/stores, perform scalar and vector
memory access on the same memref, decouple memory allocation constraints from
memory accesses, etc [1]. These operations will also facilitate the progressive
lowering of both Affine vector loads/stores and Vector transfer reads/writes
for those that read/write contiguous slices from/to memory.

In particular, this patch adds the 'vector.load' and 'vector.store' ops to the
Vector dialect, implements their lowering to the LLVM dialect, and changes the
lowering of 'affine.vector_load' and 'affine.vector_store' ops to the new vector
ops. The lowering of Vector transfer reads/writes will be implemented in the
future, probably as an independent pass. The API of 'vector.maskedload' and
'vector.maskedstore' has also been changed slightly to align it with the
transfer read/write ops and the vector new ops. This will improve reusability
among all these operations. For example, the lowering of 'vector.load',
'vector.store', 'vector.maskedload' and 'vector.maskedstore' to the LLVM dialect
is implemented with a single template conversion pattern.

[1] https://llvm.discourse.group/t/memref-type-and-data-layout/

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

dcaballe created this revision.Feb 5 2021, 3:13 PM

Herald added subscribers: teijeong, rdzhabarov, tatianashp and 12 others. · View Herald TranscriptFeb 5 2021, 3:13 PM

dcaballe requested review of this revision.Feb 5 2021, 3:13 PM

Herald added a project: Restricted Project. · View Herald TranscriptFeb 5 2021, 3:13 PM

Herald added a subscriber: stephenneuendorffer. · View Herald Transcript

Harbormaster completed remote builds in B88146: Diff 321887.Feb 5 2021, 3:51 PM

dcaballe added a reviewer: bondhugula.Feb 5 2021, 4:03 PM

bondhugula added inline comments.Feb 5 2021, 9:12 PM

mlir/include/mlir/Dialect/Vector/VectorOps.td
1322	`contiguous n-D slice of memory` is a bit inherently contradictory to me.

One nitpicky request already on the order in the td file.

We thought for a while to complete the memory operations with these unmasked versions, so thanks for doing this.
I am okay with this, but you may want to also check with Nicolas who had big future plans for the transfer operations
(but I feel these new versions are always good for progressive lowering).

I am a bit more on the fence of some of the name changes you did in the existing ones. Can you please clarify a bit what your reasoning was for these?

mlir/include/mlir/Dialect/Vector/VectorOps.td
1318	A bit nitpicky request, but I insist ;-) The order in the ops file for memory operations is strictly in pairs Vector_TransferReadOp / Vector_TransferWriteOp Vector_MaskedLoadOp / Vector_MaskedStoreOp Vector_GatherOp / Vector_ScatterOp Vector_ExpandLoadOp / Vector_CompressStoreOp you break that convention by placing the unmasked versions before the load and store respectively. I would prefer to have the new ones in between the Transfer and Masked versions, but paired together and in the order load / store as you have

Thanks for the feedback!

We thought for a while to complete the memory operations with these unmasked versions, so thanks for doing this.
I am okay with this, but you may want to also check with Nicolas who had big future plans for the transfer operations
(but I feel these new versions are always good for progressive lowering).

Yeah, this is what we discussed with @nicolasvasilache and @ftynse to lower vector transfer ops. Hopefully the plan hasn't changed.

I am a bit more on the fence of some of the name changes you did in the existing ones. Can you please clarify a bit what your reasoning was for these?

The intention was to align the APIs so that we can write generic code for all the vector load/store flavors. For example, vector.maskedload had getResultVectorType() and vector.maskedstore had getValueVectorType() to return the vector type used in the op. These two methods were changed to getVectorType() to be able to write a single conversion pattern to LLVM for both of them, and also for vector.load and vector.store. As you can see in ConvertVectorToLLVM.cpp, VectorLoadStoreConversion pattern is used for the four ops thanks to the unified API, instead of having four independent patterns. For example, see the following code in VectorLoadStoreConversion:

// Resolve address.
auto vtype = this->typeConverter->convertType(*loadOrStoreOp.getVectorType()*)

I think this will expose more reusability opportunities beyond the current LLVM conversion pattern since these four operations are pretty similar. The same approach is followed by vector transfer ops and affine vector loads/stores. They also have getVectorType(), getMemRefType(), valueToStore, etc. I thought it would be a good idea to align vector.load/store and vector.maskedload/maskedstore with that existing API to facilitate reusability. We are somehow paving the way for a VectorMemoryOp interface. However, no strong opinion about the API names. We could also change the API of transfer and affine ops but that would require far more changes all over the place.

mlir/include/mlir/Dialect/Vector/VectorOps.td
1318	Sure, no problem at all! I thought that having the `vector.load` next to the `vector.maskedload` would help quickly see the different variants that we have to perform a vector load. I'll change that. Thanks!
1322	Ok, I can remove `contiguous` from this line. It should be enough with the line below: `This slice is contiguous along the respective dimensions of the shape`. This is what we have in the description of the affine counterparts.

In D96185#2549128, @dcaballe wrote:

The intention was to align the APIs so that we can write generic code for all the vector load/store flavors. For example, vector.maskedload had getResultVectorType() and vector.maskedstore had getValueVectorType() to return the vector type used in the op. These two methods were changed to getVectorType() to be able to write a single conversion pattern to LLVM for both of them, and also for vector.load and vector.store. As you can see in ConvertVectorToLLVM.cpp, VectorLoadStoreConversion pattern is used for the four ops thanks to the unified API, instead of having four independent patterns. For example, see the following code in VectorLoadStoreConversion:

Yes, aligning the API for that purpose makes sense, but you changed the API of masked-load/store, but not of gather/scatter and expand/compress, and all these mem ops more or less go together.
Perhaps, if you don't mind, you can send out another CL that does the API name change for all these, just so that we have "progressive" lowering for this change as well (and consistent over all mem ops)?

Thanks for working on this Diego.
Let's please connect vector.transfer_read/write too otherwise it is unclear the abstraction is the right progressive lowering funnel.

mlir/include/mlir/Dialect/Vector/VectorOps.td
1359	assumptions
1448	Same remark re. contiguous here, this makes me think that `vector<2x4xf32>` stores 32 contiguous bytes which is not the case. I'd actually rephrase this part and describe it as something carrying the intent of "strided by the memref strides" but not contiguous.

This revision now requires changes to proceed.Feb 8 2021, 11:58 PM

Address feedback. Thanks!

Perhaps, if you don't mind, you can send out another CL that does the API name change for all these, just so that we have "progressive" lowering for this change as well (and consistent over all mem ops)?

Sure! I will follow up with a patch for those ops.

Let's please connect vector.transfer_read/write too otherwise it is unclear the abstraction is the right progressive lowering funnel.

Would you mind if we do that in a separate patch? It would be a new feature and lowering vector transfers require much more work since we have to analyze the stride, the permutation map, the mask, the padding, etc. and then generate the corresponding lower level vector ops (vector.load/store, vector.broadcast, vector.shuffle, vector.maskedload/maskedstore, vector.gather/scatter, etc.). I was thinking about creating a LowerVectorTransferOps pass in a follow-up patch so that we all can contribute lowering cases incrementally. Does it sound reasonable?

mlir/include/mlir/Dialect/Vector/VectorOps.td
1322	I removed `contiguous` and changed the last part of this paragraph. Let me know if it's better now.
1448	I removed `contiguous` and changed the last part of this paragraph. Let me know if it's better now.

Harbormaster completed remote builds in B88548: Diff 322549.Feb 9 2021, 6:19 PM

bondhugula requested changes to this revision.Feb 9 2021, 6:31 PM

bondhugula added inline comments.

mlir/lib/Dialect/Vector/VectorOps.cpp
2368	`affineMaps.empty()` would be the right check here as opposed to `affineMaps.size() != 1`.

This revision now requires changes to proceed.Feb 9 2021, 6:31 PM

dcaballe added inline comments.Feb 9 2021, 9:02 PM

mlir/lib/Dialect/Vector/VectorOps.cpp
2368	Oh, if a single identity map is provided, it's not kept around. Got it. Thanks!

Addressed Uday's comment and reorganized the code as suggested by Aart.

Harbormaster completed remote builds in B88573: Diff 322582.Feb 9 2021, 9:34 PM

In D96185#2552901, @dcaballe wrote:

Perhaps, if you don't mind, you can send out another CL that does the API name change for all these, just so that we have "progressive" lowering for this change as well (and consistent over all mem ops)?

Sure! I will follow up with a patch for those ops.

Let's please connect vector.transfer_read/write too otherwise it is unclear the abstraction is the right progressive lowering funnel.

Would you mind if we do that in a separate patch? It would be a new feature and lowering vector transfers require much more work since we have to analyze the stride, the permutation map, the mask, the padding, etc. and then generate the corresponding lower level vector ops (vector.load/store, vector.broadcast, vector.shuffle, vector.maskedload/maskedstore, vector.gather/scatter, etc.). I was thinking about creating a LowerVectorTransferOps pass in a follow-up patch so that we all can contribute lowering cases incrementally. Does it sound reasonable?

If this is a commitment that you are working on starting to connect the pieces next then fair enough :)

If this is a commitment that you are working on starting to connect the pieces next then fair enough :)

Yeah, we have that in our schedule. We'll start with the basic 1-D contiguous cases.

Any other comments?

Thanks!

This revision was not accepted when it landed; it landed in state Needs Review.Feb 12 2021, 10:53 AM

Closed by commit rGee66e43a96e1: [mlir][Vector] Introduce 'vector.load' and 'vector.store' ops (authored by dcaballe). · Explain Why

This revision was automatically updated to reflect the committed changes.

dcaballe added a commit: rGee66e43a96e1: [mlir][Vector] Introduce 'vector.load' and 'vector.store' ops.

Revision Contents

Path

Size

mlir/

include/

mlir/

Dialect/

Vector/

VectorOps.td

159 lines

lib/

Conversion/

AffineToStandard/

AffineToStandard.cpp

23 lines

VectorToLLVM/

ConvertVectorToLLVM.cpp

102 lines

Dialect/

Vector/

VectorOps.cpp

73 lines

test/

Conversion/

AffineToStandard/

lower-affine-to-vector.mlir

47 lines

VectorToLLVM/

vector-to-llvm.mlir

28 lines

Dialect/

Vector/

invalid.mlir

36 lines

ops.mlir

50 lines

Diff 321887

mlir/include/mlir/Dialect/Vector/VectorOps.td

Show First 20 Lines • Show All 1,309 Lines • ▼ Show 20 Lines	let builders = [
OpBuilderDAG<(ins "Value":$vector, "Value":$source, "ValueRange":$indices,		OpBuilderDAG<(ins "Value":$vector, "Value":$source, "ValueRange":$indices,
"AffineMapAttr":$permutationMap, "ArrayAttr":$masked)>,		"AffineMapAttr":$permutationMap, "ArrayAttr":$masked)>,
OpBuilderDAG<(ins "Value":$vector, "Value":$source, "ValueRange":$indices,		OpBuilderDAG<(ins "Value":$vector, "Value":$source, "ValueRange":$indices,
"AffineMap":$permutationMap, "ArrayAttr":$masked)>,		"AffineMap":$permutationMap, "ArrayAttr":$masked)>,
];		];

let hasFolder = 1;		let hasFolder = 1;
}		}

		aartbikUnsubmitted Not Done Reply Inline Actions A bit nitpicky request, but I insist ;-) The order in the ops file for memory operations is strictly in pairs Vector_TransferReadOp / Vector_TransferWriteOp Vector_MaskedLoadOp / Vector_MaskedStoreOp Vector_GatherOp / Vector_ScatterOp Vector_ExpandLoadOp / Vector_CompressStoreOp you break that convention by placing the unmasked versions before the load and store respectively. I would prefer to have the new ones in between the Transfer and Masked versions, but paired together and in the order load / store as you have aartbik: A bit nitpicky request, but I insist ;-) The order in the ops file for memory operations is…
		dcaballeAuthorUnsubmitted Done Reply Inline Actions Sure, no problem at all! I thought that having the `vector.load` next to the `vector.maskedload` would help quickly see the different variants that we have to perform a vector load. I'll change that. Thanks! dcaballe: Sure, no problem at all! I thought that having the `vector.load` next to the `vector.
		def Vector_LoadOp : Vector_Op<"load"> {
		let summary = "reads contiguous elements from memory into an 1-D vector";
		let description = [{
		The 'vector.load' operation reads a contiguous n-D slice of memory into
		bondhugulaUnsubmitted Not Done Reply Inline Actions `contiguous n-D slice of memory` is a bit inherently contradictory to me. bondhugula: `contiguous n-D slice of memory` is a bit inherently contradictory to me.
		dcaballeAuthorUnsubmitted Done Reply Inline Actions Ok, I can remove `contiguous` from this line. It should be enough with the line below: `This slice is contiguous along the respective dimensions of the shape`. This is what we have in the description of the affine counterparts. dcaballe: Ok, I can remove `contiguous` from this line. It should be enough with the line below: `This…
		dcaballeAuthorUnsubmitted Done Reply Inline Actions I removed `contiguous` and changed the last part of this paragraph. Let me know if it's better now. dcaballe: I removed `contiguous` and changed the last part of this paragraph. Let me know if it's better…
		an n-D vector. It takes a 'base' memref, an index for each memref
		dimension and a result vector type as arguments. It returns a value of the
		result vector type. The 'base' memref and indices determine the start memory
		address from which to read. Each index provides an offset for each memref
		dimension based on the element type of the memref. The shape of the result
		vector type determines the shape of the slice read from the start memory
		address. This slice is contiguous along the respective dimensions of the
		shape.

		The memref element type can be a scalar or a vector type. If the memref
		element type is a scalar, it should match the element type of the result
		vector. If the memref element type is vector, it should match the result
		vector type.

		Example 1: 1-D vector load on a scalar memref.
		```mlir
		%result = vector.load %base[%i, %j] : memref<100x100xf32>, vector<8xf32>
		```

		Example 2: 1-D vector load on a vector memref.
		```mlir
		%result = vector.load %memref[%i, %j] : memref<200x100xvector<8xf32>>, vector<8xf32>
		```

		Example 3: 2-D vector load on a scalar memref.
		```mlir
		%result = vector.load %memref[%i, %j] : memref<200x100xf32>, vector<4x8xf32>
		```

		Example 4: 2-D vector load on a vector memref.
		```mlir
		%result = vector.load %memref[%i, %j] : memref<200x100xvector<4x8xf32>>, vector<4x8xf32>
		```

		Representation-wise, the 'vector.load' operation permits out-of-bounds
		reads. Support and implementation of out-of-bounds vector loads is
		target-specific. No assumtions should be made on the value of elements
		nicolasvasilacheUnsubmitted Done Reply Inline Actions assumptions nicolasvasilache: assumptions
		loaded out of bounds. Not all targets may support out-of-bounds vector
		loads.

		Example 5: Potential out-of-bound vector load.
		```mlir
		%result = vector.load %memref[%index] : memref<?xf32>, vector<8xf32>
		```

		Example 6: Explicit out-of-bound vector load.
		```mlir
		%result = vector.load %memref[%c0] : memref<7xf32>, vector<8xf32>
		```
		}];

		let arguments = (ins Arg<AnyMemRef, "the reference to load from",
		[MemRead]>:$base,
		Variadic<Index>:$indices);
		let results = (outs AnyVector:$result);

		let extraClassDeclaration = [{
		MemRefType getMemRefType() {
		return base().getType().cast<MemRefType>();
		}

		VectorType getVectorType() {
		return result().getType().cast<VectorType>();
		}
		}];

		let assemblyFormat =
		"$base `[` $indices `]` attr-dict `:` type($base) `,` type($result)";
		}

def Vector_MaskedLoadOp :		def Vector_MaskedLoadOp :
Vector_Op<"maskedload">,		Vector_Op<"maskedload">,
Arguments<(ins Arg<AnyMemRef, "", [MemRead]>:$base,		Arguments<(ins Arg<AnyMemRef, "", [MemRead]>:$base,
Variadic<Index>:$indices,		Variadic<Index>:$indices,
VectorOfRankAndType<[1], [I1]>:$mask,		VectorOfRankAndType<[1], [I1]>:$mask,
VectorOfRank<[1]>:$pass_thru)>,		VectorOfRank<[1]>:$pass_thru)>,
Results<(outs VectorOfRank<[1]>:$result)> {		Results<(outs VectorOfRank<[1]>:$result)> {

Show All 27 Lines	MemRefType getMemRefType() {
return base().getType().cast<MemRefType>();		return base().getType().cast<MemRefType>();
}		}
VectorType getMaskVectorType() {		VectorType getMaskVectorType() {
return mask().getType().cast<VectorType>();		return mask().getType().cast<VectorType>();
}		}
VectorType getPassThruVectorType() {		VectorType getPassThruVectorType() {
return pass_thru().getType().cast<VectorType>();		return pass_thru().getType().cast<VectorType>();
}		}
VectorType getResultVectorType() {		VectorType getVectorType() {
return result().getType().cast<VectorType>();		return result().getType().cast<VectorType>();
}		}
}];		}];
let assemblyFormat = "$base `[` $indices `]` `,` $mask `,` $pass_thru attr-dict `:` "		let assemblyFormat = "$base `[` $indices `]` `,` $mask `,` $pass_thru attr-dict `:` "
"type($base) `,` type($mask) `,` type($pass_thru) `into` type($result)";		"type($base) `,` type($mask) `,` type($pass_thru) `into` type($result)";
let hasCanonicalizer = 1;		let hasCanonicalizer = 1;
}		}

		def Vector_StoreOp : Vector_Op<"store"> {
		let summary = "writes an n-D vector to contiguous elements in memory";
		let description = [{
		The 'vector.store' operation writes an n-D vector to a contiguous n-D
		nicolasvasilacheUnsubmitted Not Done Reply Inline Actions Same remark re. contiguous here, this makes me think that `vector<2x4xf32>` stores 32 contiguous bytes which is not the case. I'd actually rephrase this part and describe it as something carrying the intent of "strided by the memref strides" but not contiguous. nicolasvasilache: Same remark re. contiguous here, this makes me think that `vector<2x4xf32>` stores 32…
		dcaballeAuthorUnsubmitted Done Reply Inline Actions I removed `contiguous` and changed the last part of this paragraph. Let me know if it's better now. dcaballe: I removed `contiguous` and changed the last part of this paragraph. Let me know if it's better…
		slice of memory. It takes the vector value to be stored, a 'base' memref
		and an index for each memref dimension. The 'base' memref and indices
		determine the start memory address from which to write. Each index
		provides an offset for each memref dimension based on the element type
		of the memref. The shape of the vector value to store determines the
		shape of the slice written from the start memory address. This slice is
		contiguous along the respective dimensions of the shape.

		The memref element type can be a scalar or a vector type. If the memref
		element type is a scalar, it should match the element type of the value
		to store. If the memref element type is vector, it should match the type
		of the value to store.

		Example 1: 1-D vector store on a scalar memref.
		```mlir
		vector.store %valueToStore, %memref[%i, %j] : memref<200x100xf32>, vector<8xf32>
		```

		Example 2: 1-D vector store on a vector memref.
		```mlir
		vector.store %valueToStore, %memref[%i, %j] : memref<200x100xvector<8xf32>>, vector<8xf32>
		```

		Example 3: 2-D vector store on a scalar memref.
		```mlir
		vector.store %valueToStore, %memref[%i, %j] : memref<200x100xf32>, vector<4x8xf32>
		```

		Example 4: 2-D vector store on a vector memref.
		```mlir
		vector.store %valueToStore, %memref[%i, %j] : memref<200x100xvector<4x8xf32>>, vector<4x8xf32>
		```

		Representation-wise, the 'vector.store' operation permits out-of-bounds
		writes. Support and implementation of out-of-bounds vector stores are
		target-specific. No assumtions should be made on the memory written out of
		bounds. Not all targets may support out-of-bounds vector stores.

		Example 5: Potential out-of-bounds vector store.
		```mlir
		vector.store %valueToStore, %memref[%index] : memref<?xf32>, vector<8xf32>
		```

		Example 6: Explicit out-of-bounds vector store.
		```mlir
		vector.store %valueToStore, %memref[%c0] : memref<7xf32>, vector<8xf32>
		```
		}];

		let arguments = (ins AnyVector:$valueToStore,
		Arg<AnyMemRef, "the reference to store to",
		[MemWrite]>:$base,
		Variadic<Index>:$indices);

		let extraClassDeclaration = [{
		MemRefType getMemRefType() {
		return base().getType().cast<MemRefType>();
		}

		VectorType getVectorType() {
		return valueToStore().getType().cast<VectorType>();
		}
		}];

		let assemblyFormat = "$valueToStore `,` $base `[` $indices `]` attr-dict "
		"`:` type($base) `,` type($valueToStore)";
		}

def Vector_MaskedStoreOp :		def Vector_MaskedStoreOp :
Vector_Op<"maskedstore">,		Vector_Op<"maskedstore">,
Arguments<(ins Arg<AnyMemRef, "", [MemWrite]>:$base,		Arguments<(ins Arg<AnyMemRef, "", [MemWrite]>:$base,
Variadic<Index>:$indices,		Variadic<Index>:$indices,
VectorOfRankAndType<[1], [I1]>:$mask,		VectorOfRankAndType<[1], [I1]>:$mask,
VectorOfRank<[1]>:$value)> {		VectorOfRank<[1]>:$valueToStore)> {

let summary = "stores elements from a vector into memory as defined by a mask vector";		let summary = "stores elements from a vector into memory as defined by a mask vector";

let description = [{		let description = [{
The masked store operation writes elements from a 1-D vector into memory		The masked store operation writes elements from a 1-D vector into memory
as defined by a base with indices and a 1-D mask vector. When the mask is		as defined by a base with indices and a 1-D mask vector. When the mask is
set, the corresponding element from the vector is written to memory. Otherwise,		set, the corresponding element from the vector is written to memory. Otherwise,
no action is taken for the element. Informally the semantics are:		no action is taken for the element. Informally the semantics are:
Show All 17 Lines	def Vector_MaskedStoreOp :
}];		}];
let extraClassDeclaration = [{		let extraClassDeclaration = [{
MemRefType getMemRefType() {		MemRefType getMemRefType() {
return base().getType().cast<MemRefType>();		return base().getType().cast<MemRefType>();
}		}
VectorType getMaskVectorType() {		VectorType getMaskVectorType() {
return mask().getType().cast<VectorType>();		return mask().getType().cast<VectorType>();
}		}
VectorType getValueVectorType() {		VectorType getVectorType() {
return value().getType().cast<VectorType>();		return valueToStore().getType().cast<VectorType>();
}		}
}];		}];
let assemblyFormat = "$base `[` $indices `]` `,` $mask `,` $value attr-dict `:` "		let assemblyFormat =
"type($base) `,` type($mask) `,` type($value)";		"$base `[` $indices `]` `,` $mask `,` $valueToStore "
		"attr-dict `:` type($base) `,` type($mask) `,` type($valueToStore)";
let hasCanonicalizer = 1;		let hasCanonicalizer = 1;
}		}

def Vector_GatherOp :		def Vector_GatherOp :
Vector_Op<"gather">,		Vector_Op<"gather">,
Arguments<(ins Arg<AnyMemRef, "", [MemRead]>:$base,		Arguments<(ins Arg<AnyMemRef, "", [MemRead]>:$base,
VectorOfRankAndType<[1], [AnyInteger]>:$indices,		VectorOfRankAndType<[1], [AnyInteger]>:$indices,
VectorOfRankAndType<[1], [I1]>:$mask,		VectorOfRankAndType<[1], [I1]>:$mask,
▲ Show 20 Lines • Show All 698 Lines • Show Last 20 Lines

mlir/lib/Conversion/AffineToStandard/AffineToStandard.cpp

Show First 20 Lines • Show All 570 Lines • ▼ Show 20 Lines	LogicalResult matchAndRewrite(AffineLoadOp op,
PatternRewriter &rewriter) const override {		PatternRewriter &rewriter) const override {
// Expand affine map from 'affineLoadOp'.		// Expand affine map from 'affineLoadOp'.
SmallVector<Value, 8> indices(op.getMapOperands());		SmallVector<Value, 8> indices(op.getMapOperands());
auto resultOperands =		auto resultOperands =
expandAffineMap(rewriter, op.getLoc(), op.getAffineMap(), indices);		expandAffineMap(rewriter, op.getLoc(), op.getAffineMap(), indices);
if (!resultOperands)		if (!resultOperands)
return failure();		return failure();

// Build std.load memref[expandedMap.results].		// Build vector.load memref[expandedMap.results].
rewriter.replaceOpWithNewOp<LoadOp>(op, op.getMemRef(), *resultOperands);		rewriter.replaceOpWithNewOp<mlir::LoadOp>(op, op.getMemRef(),
		*resultOperands);
return success();		return success();
}		}
};		};

/// Apply the affine map from an 'affine.prefetch' operation to its operands,		/// Apply the affine map from an 'affine.prefetch' operation to its operands,
/// and feed the results to a newly created 'std.prefetch' operation (which		/// and feed the results to a newly created 'std.prefetch' operation (which
/// replaces the original 'affine.prefetch').		/// replaces the original 'affine.prefetch').
class AffinePrefetchLowering : public OpRewritePattern<AffinePrefetchOp> {		class AffinePrefetchLowering : public OpRewritePattern<AffinePrefetchOp> {
Show All 29 Lines	LogicalResult matchAndRewrite(AffineStoreOp op,
// Expand affine map from 'affineStoreOp'.		// Expand affine map from 'affineStoreOp'.
SmallVector<Value, 8> indices(op.getMapOperands());		SmallVector<Value, 8> indices(op.getMapOperands());
auto maybeExpandedMap =		auto maybeExpandedMap =
expandAffineMap(rewriter, op.getLoc(), op.getAffineMap(), indices);		expandAffineMap(rewriter, op.getLoc(), op.getAffineMap(), indices);
if (!maybeExpandedMap)		if (!maybeExpandedMap)
return failure();		return failure();

// Build std.store valueToStore, memref[expandedMap.results].		// Build std.store valueToStore, memref[expandedMap.results].
rewriter.replaceOpWithNewOp<StoreOp>(op, op.getValueToStore(),		rewriter.replaceOpWithNewOp<mlir::StoreOp>(
op.getMemRef(), *maybeExpandedMap);		op, op.getValueToStore(), op.getMemRef(), *maybeExpandedMap);
return success();		return success();
}		}
};		};

/// Apply the affine maps from an 'affine.dma_start' operation to each of their		/// Apply the affine maps from an 'affine.dma_start' operation to each of their
/// respective map operands, and feed the results to a newly created		/// respective map operands, and feed the results to a newly created
/// 'std.dma_start' operation (which replaces the original 'affine.dma_start').		/// 'std.dma_start' operation (which replaces the original 'affine.dma_start').
class AffineDmaStartLowering : public OpRewritePattern<AffineDmaStartOp> {		class AffineDmaStartLowering : public OpRewritePattern<AffineDmaStartOp> {
▲ Show 20 Lines • Show All 52 Lines • ▼ Show 20 Lines	LogicalResult matchAndRewrite(AffineDmaWaitOp op,
// Build std.dma_wait operation with affine map results.		// Build std.dma_wait operation with affine map results.
rewriter.replaceOpWithNewOp<DmaWaitOp>(		rewriter.replaceOpWithNewOp<DmaWaitOp>(
op, op.getTagMemRef(), *maybeExpandedTagMap, op.getNumElements());		op, op.getTagMemRef(), *maybeExpandedTagMap, op.getNumElements());
return success();		return success();
}		}
};		};

/// Apply the affine map from an 'affine.vector_load' operation to its operands,		/// Apply the affine map from an 'affine.vector_load' operation to its operands,
/// and feed the results to a newly created 'vector.transfer_read' operation		/// and feed the results to a newly created 'vector.load' operation (which
/// (which replaces the original 'affine.vector_load').		/// replaces the original 'affine.vector_load').
class AffineVectorLoadLowering : public OpRewritePattern<AffineVectorLoadOp> {		class AffineVectorLoadLowering : public OpRewritePattern<AffineVectorLoadOp> {
public:		public:
using OpRewritePattern<AffineVectorLoadOp>::OpRewritePattern;		using OpRewritePattern<AffineVectorLoadOp>::OpRewritePattern;

LogicalResult matchAndRewrite(AffineVectorLoadOp op,		LogicalResult matchAndRewrite(AffineVectorLoadOp op,
PatternRewriter &rewriter) const override {		PatternRewriter &rewriter) const override {
// Expand affine map from 'affineVectorLoadOp'.		// Expand affine map from 'affineVectorLoadOp'.
SmallVector<Value, 8> indices(op.getMapOperands());		SmallVector<Value, 8> indices(op.getMapOperands());
auto resultOperands =		auto resultOperands =
expandAffineMap(rewriter, op.getLoc(), op.getAffineMap(), indices);		expandAffineMap(rewriter, op.getLoc(), op.getAffineMap(), indices);
if (!resultOperands)		if (!resultOperands)
return failure();		return failure();

// Build vector.transfer_read memref[expandedMap.results].		// Build vector.load memref[expandedMap.results].
rewriter.replaceOpWithNewOp<TransferReadOp>(		rewriter.replaceOpWithNewOp<vector::LoadOp>(
op, op.getVectorType(), op.getMemRef(), *resultOperands);		op, op.getVectorType(), op.getMemRef(), *resultOperands);
return success();		return success();
}		}
};		};

/// Apply the affine map from an 'affine.vector_store' operation to its		/// Apply the affine map from an 'affine.vector_store' operation to its
/// operands, and feed the results to a newly created 'vector.transfer_write'		/// operands, and feed the results to a newly created 'vector.store' operation
/// operation (which replaces the original 'affine.vector_store').		/// (which replaces the original 'affine.vector_store').
class AffineVectorStoreLowering : public OpRewritePattern<AffineVectorStoreOp> {		class AffineVectorStoreLowering : public OpRewritePattern<AffineVectorStoreOp> {
public:		public:
using OpRewritePattern<AffineVectorStoreOp>::OpRewritePattern;		using OpRewritePattern<AffineVectorStoreOp>::OpRewritePattern;

LogicalResult matchAndRewrite(AffineVectorStoreOp op,		LogicalResult matchAndRewrite(AffineVectorStoreOp op,
PatternRewriter &rewriter) const override {		PatternRewriter &rewriter) const override {
// Expand affine map from 'affineVectorStoreOp'.		// Expand affine map from 'affineVectorStoreOp'.
SmallVector<Value, 8> indices(op.getMapOperands());		SmallVector<Value, 8> indices(op.getMapOperands());
auto maybeExpandedMap =		auto maybeExpandedMap =
expandAffineMap(rewriter, op.getLoc(), op.getAffineMap(), indices);		expandAffineMap(rewriter, op.getLoc(), op.getAffineMap(), indices);
if (!maybeExpandedMap)		if (!maybeExpandedMap)
return failure();		return failure();

rewriter.replaceOpWithNewOp<TransferWriteOp>(		rewriter.replaceOpWithNewOp<vector::StoreOp>(
op, op.getValueToStore(), op.getMemRef(), *maybeExpandedMap);		op, op.getValueToStore(), op.getMemRef(), *maybeExpandedMap);
return success();		return success();
}		}
};		};

} // end namespace		} // end namespace

void mlir::populateAffineToStdConversionPatterns(		void mlir::populateAffineToStdConversionPatterns(
▲ Show 20 Lines • Show All 48 Lines • Show Last 20 Lines

mlir/lib/Conversion/VectorToLLVM/ConvertVectorToLLVM.cpp

Show First 20 Lines • Show All 311 Lines • ▼ Show 20 Lines	matchAndRewrite(vector::FlatTransposeOp transOp, ArrayRef<Value> operands,
auto adaptor = vector::FlatTransposeOpAdaptor(operands);		auto adaptor = vector::FlatTransposeOpAdaptor(operands);
rewriter.replaceOpWithNewOp<LLVM::MatrixTransposeOp>(		rewriter.replaceOpWithNewOp<LLVM::MatrixTransposeOp>(
transOp, typeConverter->convertType(transOp.res().getType()),		transOp, typeConverter->convertType(transOp.res().getType()),
adaptor.matrix(), transOp.rows(), transOp.columns());		adaptor.matrix(), transOp.rows(), transOp.columns());
return success();		return success();
}		}
};		};

/// Conversion pattern for a vector.maskedload.		/// Overloaded utility that replaces a vector.load, vector.store,
class VectorMaskedLoadOpConversion		/// vector.maskedload and vector.maskedstore with their respective LLVM
: public ConvertOpToLLVMPattern<vector::MaskedLoadOp> {		/// couterparts.
public:		static void replaceLoadOrStoreOp(vector::LoadOp loadOp,
using ConvertOpToLLVMPattern<vector::MaskedLoadOp>::ConvertOpToLLVMPattern;		vector::LoadOpAdaptor adaptor,
		VectorType vectorTy, Value ptr, unsigned align,
LogicalResult		ConversionPatternRewriter &rewriter) {
matchAndRewrite(vector::MaskedLoadOp load, ArrayRef<Value> operands,		rewriter.replaceOpWithNewOp<LLVM::LoadOp>(loadOp, ptr, align);
ConversionPatternRewriter &rewriter) const override {		}
auto loc = load->getLoc();
auto adaptor = vector::MaskedLoadOpAdaptor(operands);
MemRefType memRefType = load.getMemRefType();

// Resolve alignment.		static void replaceLoadOrStoreOp(vector::MaskedLoadOp loadOp,
unsigned align;		vector::MaskedLoadOpAdaptor adaptor,
if (failed(getMemRefAlignment(*getTypeConverter(), memRefType, align)))		VectorType vectorTy, Value ptr, unsigned align,
return failure();		ConversionPatternRewriter &rewriter) {
		rewriter.replaceOpWithNewOp<LLVM::MaskedLoadOp>(
		loadOp, vectorTy, ptr, adaptor.mask(), adaptor.pass_thru(), align);
		}

// Resolve address.		static void replaceLoadOrStoreOp(vector::StoreOp storeOp,
auto vtype = typeConverter->convertType(load.getResultVectorType());		vector::StoreOpAdaptor adaptor,
Value dataPtr = this->getStridedElementPtr(loc, memRefType, adaptor.base(),		VectorType vectorTy, Value ptr, unsigned align,
adaptor.indices(), rewriter);		ConversionPatternRewriter &rewriter) {
Value ptr = castDataPtr(rewriter, loc, dataPtr, memRefType, vtype);		rewriter.replaceOpWithNewOp<LLVM::StoreOp>(storeOp, adaptor.valueToStore(),
		ptr, align);
		}

rewriter.replaceOpWithNewOp<LLVM::MaskedLoadOp>(		static void replaceLoadOrStoreOp(vector::MaskedStoreOp storeOp,
load, vtype, ptr, adaptor.mask(), adaptor.pass_thru(),		vector::MaskedStoreOpAdaptor adaptor,
rewriter.getI32IntegerAttr(align));		VectorType vectorTy, Value ptr, unsigned align,
return success();		ConversionPatternRewriter &rewriter) {
		rewriter.replaceOpWithNewOp<LLVM::MaskedStoreOp>(
		storeOp, adaptor.valueToStore(), ptr, adaptor.mask(), align);
}		}
};

/// Conversion pattern for a vector.maskedstore.		/// Conversion pattern for a vector.load, vector.store, vector.maskedload, and
class VectorMaskedStoreOpConversion		/// vector.maskedstore.
: public ConvertOpToLLVMPattern<vector::MaskedStoreOp> {		template <class LoadOrStoreOp, class LoadOrStoreOpAdaptor>
		class VectorLoadStoreConversion : public ConvertOpToLLVMPattern<LoadOrStoreOp> {
public:		public:
using ConvertOpToLLVMPattern<vector::MaskedStoreOp>::ConvertOpToLLVMPattern;		using ConvertOpToLLVMPattern<LoadOrStoreOp>::ConvertOpToLLVMPattern;

LogicalResult		LogicalResult
matchAndRewrite(vector::MaskedStoreOp store, ArrayRef<Value> operands,		matchAndRewrite(LoadOrStoreOp loadOrStoreOp, ArrayRef<Value> operands,
ConversionPatternRewriter &rewriter) const override {		ConversionPatternRewriter &rewriter) const override {
auto loc = store->getLoc();		// Only 1-D vectors can be lowered to LLVM.
auto adaptor = vector::MaskedStoreOpAdaptor(operands);		VectorType vectorTy = loadOrStoreOp.getVectorType();
MemRefType memRefType = store.getMemRefType();		if (vectorTy.getRank() > 1)
		return failure();

		auto loc = loadOrStoreOp->getLoc();
		auto adaptor = LoadOrStoreOpAdaptor(operands);
		MemRefType memRefTy = loadOrStoreOp.getMemRefType();

// Resolve alignment.		// Resolve alignment.
unsigned align;		unsigned align;
if (failed(getMemRefAlignment(*getTypeConverter(), memRefType, align)))		if (failed(getMemRefAlignment(*this->getTypeConverter(), memRefTy, align)))
return failure();		return failure();

// Resolve address.		// Resolve address.
auto vtype = typeConverter->convertType(store.getValueVectorType());		auto vtype = this->typeConverter->convertType(loadOrStoreOp.getVectorType())
Value dataPtr = this->getStridedElementPtr(loc, memRefType, adaptor.base(),		.template cast<VectorType>();
		Value dataPtr = this->getStridedElementPtr(loc, memRefTy, adaptor.base(),
adaptor.indices(), rewriter);		adaptor.indices(), rewriter);
Value ptr = castDataPtr(rewriter, loc, dataPtr, memRefType, vtype);		Value ptr = castDataPtr(rewriter, loc, dataPtr, memRefTy, vtype);

rewriter.replaceOpWithNewOp<LLVM::MaskedStoreOp>(		replaceLoadOrStoreOp(loadOrStoreOp, adaptor, vtype, ptr, align, rewriter);
store, adaptor.value(), ptr, adaptor.mask(),
rewriter.getI32IntegerAttr(align));
return success();		return success();
}		}
};		};

/// Conversion pattern for a vector.gather.		/// Conversion pattern for a vector.gather.
class VectorGatherOpConversion		class VectorGatherOpConversion
: public ConvertOpToLLVMPattern<vector::GatherOp> {		: public ConvertOpToLLVMPattern<vector::GatherOp> {
public:		public:
▲ Show 20 Lines • Show All 1,109 Lines • ▼ Show 20 Lines	patterns
.insert<VectorShuffleOpConversion,		.insert<VectorShuffleOpConversion,
VectorExtractElementOpConversion,		VectorExtractElementOpConversion,
VectorExtractOpConversion,		VectorExtractOpConversion,
VectorFMAOp1DConversion,		VectorFMAOp1DConversion,
VectorInsertElementOpConversion,		VectorInsertElementOpConversion,
VectorInsertOpConversion,		VectorInsertOpConversion,
VectorPrintOpConversion,		VectorPrintOpConversion,
VectorTypeCastOpConversion,		VectorTypeCastOpConversion,
VectorMaskedLoadOpConversion,		VectorLoadStoreConversion<vector::LoadOp,
VectorMaskedStoreOpConversion,		vector::LoadOpAdaptor>,
		VectorLoadStoreConversion<vector::MaskedLoadOp,
		vector::MaskedLoadOpAdaptor>,
		VectorLoadStoreConversion<vector::StoreOp,
		vector::StoreOpAdaptor>,
		VectorLoadStoreConversion<vector::MaskedStoreOp,
		vector::MaskedStoreOpAdaptor>,
VectorGatherOpConversion,		VectorGatherOpConversion,
VectorScatterOpConversion,		VectorScatterOpConversion,
VectorExpandLoadOpConversion,		VectorExpandLoadOpConversion,
VectorCompressStoreOpConversion>(converter);		VectorCompressStoreOpConversion>(converter);
// clang-format on		// clang-format on
}		}

void mlir::populateVectorToLLVMMatrixConversionPatterns(		void mlir::populateVectorToLLVMMatrixConversionPatterns(
LLVMTypeConverter &converter, OwningRewritePatternList &patterns) {		LLVMTypeConverter &converter, OwningRewritePatternList &patterns) {
patterns.insert<VectorMatmulOpConversion>(converter);		patterns.insert<VectorMatmulOpConversion>(converter);
patterns.insert<VectorFlatTransposeOpConversion>(converter);		patterns.insert<VectorFlatTransposeOpConversion>(converter);
}		}

mlir/lib/Dialect/Vector/VectorOps.cpp

Show First 20 Lines • Show All 2,352 Lines • ▼ Show 20 Lines	void TransferWriteOp::getEffects(
SmallVectorImpl<SideEffects::EffectInstance<MemoryEffects::Effect>>		SmallVectorImpl<SideEffects::EffectInstance<MemoryEffects::Effect>>
&effects) {		&effects) {
if (getShapedType().isa<MemRefType>())		if (getShapedType().isa<MemRefType>())
effects.emplace_back(MemoryEffects::Write::get(), source(),		effects.emplace_back(MemoryEffects::Write::get(), source(),
SideEffects::DefaultResource::get());		SideEffects::DefaultResource::get());
}		}

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
		// LoadOp
		//===----------------------------------------------------------------------===//

		static LogicalResult verifyLoadStoreMemRefLayout(Operation *op,
		MemRefType memRefTy) {
		auto affineMaps = memRefTy.getAffineMaps();
		if (!affineMaps.empty() &&
		(affineMaps.size() != 1 \|\| !affineMaps[0].isIdentity()))
		bondhugulaUnsubmitted Not Done Reply Inline Actions `affineMaps.empty()` would be the right check here as opposed to `affineMaps.size() != 1`. bondhugula: `affineMaps.empty()` would be the right check here as opposed to `affineMaps.size() != 1`.
		dcaballeAuthorUnsubmitted Done Reply Inline Actions Oh, if a single identity map is provided, it's not kept around. Got it. Thanks! dcaballe: Oh, if a single identity map is provided, it's not kept around. Got it. Thanks!
		return op->emitOpError(
		"base memref should have a default or identity layout");
		return success();
		}

		static LogicalResult verify(vector::LoadOp op) {
		VectorType resVecTy = op.getVectorType();
		MemRefType memRefTy = op.getMemRefType();

		if (failed(verifyLoadStoreMemRefLayout(op, memRefTy)))
		return failure();

		// Checks for vector memrefs.
		Type memElemTy = memRefTy.getElementType();
		if (auto memVecTy = memElemTy.dyn_cast<VectorType>()) {
		if (memVecTy != resVecTy)
		return op.emitOpError("base memref and result vector types should match");
		memElemTy = memVecTy.getElementType();
		}

		if (resVecTy.getElementType() != memElemTy)
		return op.emitOpError("base and result element types should match");
		if (llvm::size(op.indices()) != memRefTy.getRank())
		return op.emitOpError("requires ") << memRefTy.getRank() << " indices";
		return success();
		}

		//===----------------------------------------------------------------------===//
// MaskedLoadOp		// MaskedLoadOp
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

static LogicalResult verify(MaskedLoadOp op) {		static LogicalResult verify(MaskedLoadOp op) {
VectorType maskVType = op.getMaskVectorType();		VectorType maskVType = op.getMaskVectorType();
VectorType passVType = op.getPassThruVectorType();		VectorType passVType = op.getPassThruVectorType();
VectorType resVType = op.getResultVectorType();		VectorType resVType = op.getVectorType();
MemRefType memType = op.getMemRefType();		MemRefType memType = op.getMemRefType();

if (resVType.getElementType() != memType.getElementType())		if (resVType.getElementType() != memType.getElementType())
return op.emitOpError("base and result element type should match");		return op.emitOpError("base and result element type should match");
if (llvm::size(op.indices()) != memType.getRank())		if (llvm::size(op.indices()) != memType.getRank())
return op.emitOpError("requires ") << memType.getRank() << " indices";		return op.emitOpError("requires ") << memType.getRank() << " indices";
if (resVType.getDimSize(0) != maskVType.getDimSize(0))		if (resVType.getDimSize(0) != maskVType.getDimSize(0))
return op.emitOpError("expected result dim to match mask dim");		return op.emitOpError("expected result dim to match mask dim");
Show All 25 Lines
} // namespace		} // namespace

void MaskedLoadOp::getCanonicalizationPatterns(		void MaskedLoadOp::getCanonicalizationPatterns(
OwningRewritePatternList &results, MLIRContext *context) {		OwningRewritePatternList &results, MLIRContext *context) {
results.insert<MaskedLoadFolder>(context);		results.insert<MaskedLoadFolder>(context);
}		}

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
		// StoreOp
		//===----------------------------------------------------------------------===//

		static LogicalResult verify(vector::StoreOp op) {
		VectorType valueVecTy = op.getVectorType();
		MemRefType memRefTy = op.getMemRefType();

		if (failed(verifyLoadStoreMemRefLayout(op, memRefTy)))
		return failure();

		// Checks for vector memrefs.
		Type memElemTy = memRefTy.getElementType();
		if (auto memVecTy = memElemTy.dyn_cast<VectorType>()) {
		if (memVecTy != valueVecTy)
		return op.emitOpError(
		"base memref and valueToStore vector types should match");
		memElemTy = memVecTy.getElementType();
		}

		if (valueVecTy.getElementType() != memElemTy)
		return op.emitOpError("base and valueToStore element type should match");
		if (llvm::size(op.indices()) != memRefTy.getRank())
		return op.emitOpError("requires ") << memRefTy.getRank() << " indices";
		return success();
		}

		//===----------------------------------------------------------------------===//
// MaskedStoreOp		// MaskedStoreOp
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

static LogicalResult verify(MaskedStoreOp op) {		static LogicalResult verify(MaskedStoreOp op) {
VectorType maskVType = op.getMaskVectorType();		VectorType maskVType = op.getMaskVectorType();
VectorType valueVType = op.getValueVectorType();		VectorType valueVType = op.getVectorType();
MemRefType memType = op.getMemRefType();		MemRefType memType = op.getMemRefType();

if (valueVType.getElementType() != memType.getElementType())		if (valueVType.getElementType() != memType.getElementType())
return op.emitOpError("base and value element type should match");		return op.emitOpError("base and valueToStore element type should match");
if (llvm::size(op.indices()) != memType.getRank())		if (llvm::size(op.indices()) != memType.getRank())
return op.emitOpError("requires ") << memType.getRank() << " indices";		return op.emitOpError("requires ") << memType.getRank() << " indices";
if (valueVType.getDimSize(0) != maskVType.getDimSize(0))		if (valueVType.getDimSize(0) != maskVType.getDimSize(0))
return op.emitOpError("expected value dim to match mask dim");		return op.emitOpError("expected valueToStore dim to match mask dim");
return success();		return success();
}		}

namespace {		namespace {
class MaskedStoreFolder final : public OpRewritePattern<MaskedStoreOp> {		class MaskedStoreFolder final : public OpRewritePattern<MaskedStoreOp> {
public:		public:
using OpRewritePattern<MaskedStoreOp>::OpRewritePattern;		using OpRewritePattern<MaskedStoreOp>::OpRewritePattern;
LogicalResult matchAndRewrite(MaskedStoreOp store,		LogicalResult matchAndRewrite(MaskedStoreOp store,
PatternRewriter &rewriter) const override {		PatternRewriter &rewriter) const override {
switch (get1DMaskFormat(store.mask())) {		switch (get1DMaskFormat(store.mask())) {
case MaskFormat::AllTrue:		case MaskFormat::AllTrue:
rewriter.replaceOpWithNewOp<vector::TransferWriteOp>(		rewriter.replaceOpWithNewOp<vector::TransferWriteOp>(
store, store.value(), store.base(), store.indices(), false);		store, store.valueToStore(), store.base(), store.indices(), false);
return success();		return success();
case MaskFormat::AllFalse:		case MaskFormat::AllFalse:
rewriter.eraseOp(store);		rewriter.eraseOp(store);
return success();		return success();
case MaskFormat::Unknown:		case MaskFormat::Unknown:
return failure();		return failure();
}		}
llvm_unreachable("Unexpected 1DMaskFormat on MaskedStore");		llvm_unreachable("Unexpected 1DMaskFormat on MaskedStore");
▲ Show 20 Lines • Show All 696 Lines • Show Last 20 Lines

mlir/test/Conversion/AffineToStandard/lower-affine-to-vector.mlir

	// RUN: mlir-opt -lower-affine --split-input-file %s \| FileCheck %s			// RUN: mlir-opt -lower-affine --split-input-file %s \| FileCheck %s

	// CHECK-LABEL: func @affine_vector_load
	func @affine_vector_load(%arg0 : index) {
	%0 = alloc() : memref<100xf32>
	affine.for %i0 = 0 to 16 {
	%1 = affine.vector_load %0[%i0 + symbol(%arg0) + 7] : memref<100xf32>, vector<8xf32>
	}
	// CHECK: %[[buf:.*]] = alloc
	// CHECK: %[[a:.]] = addi %{{.}}, %{{.*}} : index
	// CHECK-NEXT: %[[c7:.*]] = constant 7 : index
	// CHECK-NEXT: %[[b:.*]] = addi %[[a]], %[[c7]] : index
	// CHECK-NEXT: %[[pad:.*]] = constant 0.0
	// CHECK-NEXT: vector.transfer_read %[[buf]][%[[b]]], %[[pad]] : memref<100xf32>, vector<8xf32>
	return
	}

	// -----

	// CHECK-LABEL: func @affine_vector_store
	func @affine_vector_store(%arg0 : index) {
	%0 = alloc() : memref<100xf32>
	%1 = constant dense<11.0> : vector<4xf32>
	affine.for %i0 = 0 to 16 {
	affine.vector_store %1, %0[%i0 - symbol(%arg0) + 7] : memref<100xf32>, vector<4xf32>
	}
	// CHECK: %[[buf:.*]] = alloc
	// CHECK: %[[val:.*]] = constant dense
	// CHECK: %[[c_1:.*]] = constant -1 : index
	// CHECK-NEXT: %[[a:.*]] = muli %arg0, %[[c_1]] : index
	// CHECK-NEXT: %[[b:.]] = addi %{{.}}, %[[a]] : index
	// CHECK-NEXT: %[[c7:.*]] = constant 7 : index
	// CHECK-NEXT: %[[c:.*]] = addi %[[b]], %[[c7]] : index
	// CHECK-NEXT: vector.transfer_write %[[val]], %[[buf]][%[[c]]] : vector<4xf32>, memref<100xf32>
	return
	}

	// -----

	// CHECK-LABEL: func @affine_vector_load			// CHECK-LABEL: func @affine_vector_load
	func @affine_vector_load(%arg0 : index) {			func @affine_vector_load(%arg0 : index) {
	%0 = alloc() : memref<100xf32>			%0 = alloc() : memref<100xf32>
	affine.for %i0 = 0 to 16 {			affine.for %i0 = 0 to 16 {
	%1 = affine.vector_load %0[%i0 + symbol(%arg0) + 7] : memref<100xf32>, vector<8xf32>			%1 = affine.vector_load %0[%i0 + symbol(%arg0) + 7] : memref<100xf32>, vector<8xf32>
	}			}
	// CHECK: %[[buf:.*]] = alloc			// CHECK: %[[buf:.*]] = alloc
	// CHECK: %[[a:.]] = addi %{{.}}, %{{.*}} : index			// CHECK: %[[a:.]] = addi %{{.}}, %{{.*}} : index
	// CHECK-NEXT: %[[c7:.*]] = constant 7 : index			// CHECK-NEXT: %[[c7:.*]] = constant 7 : index
	// CHECK-NEXT: %[[b:.*]] = addi %[[a]], %[[c7]] : index			// CHECK-NEXT: %[[b:.*]] = addi %[[a]], %[[c7]] : index
	// CHECK-NEXT: %[[pad:.*]] = constant 0.0			// CHECK-NEXT: vector.load %[[buf]][%[[b]]] : memref<100xf32>, vector<8xf32>
	// CHECK-NEXT: vector.transfer_read %[[buf]][%[[b]]], %[[pad]] : memref<100xf32>, vector<8xf32>
	return			return
	}			}

	// -----			// -----

	// CHECK-LABEL: func @affine_vector_store			// CHECK-LABEL: func @affine_vector_store
	func @affine_vector_store(%arg0 : index) {			func @affine_vector_store(%arg0 : index) {
	%0 = alloc() : memref<100xf32>			%0 = alloc() : memref<100xf32>
	%1 = constant dense<11.0> : vector<4xf32>			%1 = constant dense<11.0> : vector<4xf32>
	affine.for %i0 = 0 to 16 {			affine.for %i0 = 0 to 16 {
	affine.vector_store %1, %0[%i0 - symbol(%arg0) + 7] : memref<100xf32>, vector<4xf32>			affine.vector_store %1, %0[%i0 - symbol(%arg0) + 7] : memref<100xf32>, vector<4xf32>
	}			}
	// CHECK: %[[buf:.*]] = alloc			// CHECK: %[[buf:.*]] = alloc
	// CHECK: %[[val:.*]] = constant dense			// CHECK: %[[val:.*]] = constant dense
	// CHECK: %[[c_1:.*]] = constant -1 : index			// CHECK: %[[c_1:.*]] = constant -1 : index
	// CHECK-NEXT: %[[a:.*]] = muli %arg0, %[[c_1]] : index			// CHECK-NEXT: %[[a:.*]] = muli %arg0, %[[c_1]] : index
	// CHECK-NEXT: %[[b:.]] = addi %{{.}}, %[[a]] : index			// CHECK-NEXT: %[[b:.]] = addi %{{.}}, %[[a]] : index
	// CHECK-NEXT: %[[c7:.*]] = constant 7 : index			// CHECK-NEXT: %[[c7:.*]] = constant 7 : index
	// CHECK-NEXT: %[[c:.*]] = addi %[[b]], %[[c7]] : index			// CHECK-NEXT: %[[c:.*]] = addi %[[b]], %[[c7]] : index
	// CHECK-NEXT: vector.transfer_write %[[val]], %[[buf]][%[[c]]] : vector<4xf32>, memref<100xf32>			// CHECK-NEXT: vector.store %[[val]], %[[buf]][%[[c]]] : memref<100xf32>, vector<4xf32>
	return			return
	}			}

	// -----			// -----

	// CHECK-LABEL: func @vector_load_2d			// CHECK-LABEL: func @vector_load_2d
	func @vector_load_2d() {			func @vector_load_2d() {
	%0 = alloc() : memref<100x100xf32>			%0 = alloc() : memref<100x100xf32>
	affine.for %i0 = 0 to 16 step 2{			affine.for %i0 = 0 to 16 step 2{
	affine.for %i1 = 0 to 16 step 8 {			affine.for %i1 = 0 to 16 step 8 {
	%1 = affine.vector_load %0[%i0, %i1] : memref<100x100xf32>, vector<2x8xf32>			%1 = affine.vector_load %0[%i0, %i1] : memref<100x100xf32>, vector<2x8xf32>
	// CHECK: %[[buf:.*]] = alloc			// CHECK: %[[buf:.*]] = alloc
	// CHECK: scf.for %[[i0:.*]] =			// CHECK: scf.for %[[i0:.*]] =
	// CHECK: scf.for %[[i1:.*]] =			// CHECK: scf.for %[[i1:.*]] =
	// CHECK-NEXT: %[[pad:.*]] = constant 0.0			// CHECK-NEXT: vector.load %[[buf]][%[[i0]], %[[i1]]] : memref<100x100xf32>, vector<2x8xf32>
	// CHECK-NEXT: vector.transfer_read %[[buf]][%[[i0]], %[[i1]]], %[[pad]] : memref<100x100xf32>, vector<2x8xf32>
	}			}
	}			}
	return			return
	}			}

	// -----			// -----

	// CHECK-LABEL: func @vector_store_2d			// CHECK-LABEL: func @vector_store_2d
	func @vector_store_2d() {			func @vector_store_2d() {
	%0 = alloc() : memref<100x100xf32>			%0 = alloc() : memref<100x100xf32>
	%1 = constant dense<11.0> : vector<2x8xf32>			%1 = constant dense<11.0> : vector<2x8xf32>
	affine.for %i0 = 0 to 16 step 2{			affine.for %i0 = 0 to 16 step 2{
	affine.for %i1 = 0 to 16 step 8 {			affine.for %i1 = 0 to 16 step 8 {
	affine.vector_store %1, %0[%i0, %i1] : memref<100x100xf32>, vector<2x8xf32>			affine.vector_store %1, %0[%i0, %i1] : memref<100x100xf32>, vector<2x8xf32>
	// CHECK: %[[buf:.*]] = alloc			// CHECK: %[[buf:.*]] = alloc
	// CHECK: %[[val:.*]] = constant dense			// CHECK: %[[val:.*]] = constant dense
	// CHECK: scf.for %[[i0:.*]] =			// CHECK: scf.for %[[i0:.*]] =
	// CHECK: scf.for %[[i1:.*]] =			// CHECK: scf.for %[[i1:.*]] =
	// CHECK-NEXT: vector.transfer_write %[[val]], %[[buf]][%[[i0]], %[[i1]]] : vector<2x8xf32>, memref<100x100xf32>			// CHECK-NEXT: vector.store %[[val]], %[[buf]][%[[i0]], %[[i1]]] : memref<100x100xf32>, vector<2x8xf32>
	}			}
	}			}
	return			return
	}			}

mlir/test/Conversion/VectorToLLVM/vector-to-llvm.mlir

	// RUN: mlir-opt %s -convert-vector-to-llvm \| FileCheck %s			// RUN: mlir-opt %s -convert-vector-to-llvm \| FileCheck %s


	func @broadcast_vec1d_from_scalar(%arg0: f32) -> vector<2xf32> {			func @broadcast_vec1d_from_scalar(%arg0: f32) -> vector<2xf32> {
	%0 = vector.broadcast %arg0 : f32 to vector<2xf32>			%0 = vector.broadcast %arg0 : f32 to vector<2xf32>
	return %0 : vector<2xf32>			return %0 : vector<2xf32>
	}			}
	// CHECK-LABEL: llvm.func @broadcast_vec1d_from_scalar(			// CHECK-LABEL: llvm.func @broadcast_vec1d_from_scalar(
	// CHECK-SAME: %[[A:.*]]: f32)			// CHECK-SAME: %[[A:.*]]: f32)
	// CHECK: %[[T0:.*]] = llvm.mlir.undef : vector<2xf32>			// CHECK: %[[T0:.*]] = llvm.mlir.undef : vector<2xf32>
	// CHECK: %[[T1:.*]] = llvm.mlir.constant(0 : i32) : i32			// CHECK: %[[T1:.*]] = llvm.mlir.constant(0 : i32) : i32
	▲ Show 20 Lines • Show All 1,053 Lines • ▼ Show 20 Lines

	// CHECK-LABEL: func @flat_transpose			// CHECK-LABEL: func @flat_transpose
	// CHECK-SAME: %[[A:.*]]: vector<16xf32>			// CHECK-SAME: %[[A:.*]]: vector<16xf32>
	// CHECK: %[[T:.*]] = llvm.intr.matrix.transpose %[[A]]			// CHECK: %[[T:.*]] = llvm.intr.matrix.transpose %[[A]]
	// CHECK-SAME: {columns = 4 : i32, rows = 4 : i32} :			// CHECK-SAME: {columns = 4 : i32, rows = 4 : i32} :
	// CHECK-SAME: vector<16xf32> into vector<16xf32>			// CHECK-SAME: vector<16xf32> into vector<16xf32>
	// CHECK: llvm.return %[[T]] : vector<16xf32>			// CHECK: llvm.return %[[T]] : vector<16xf32>

				func @vector_load_op(%memref : memref<200x100xf32>, %i : index, %j : index) -> vector<8xf32> {
				%0 = vector.load %memref[%i, %j] : memref<200x100xf32>, vector<8xf32>
				return %0 : vector<8xf32>
				}

				// CHECK-LABEL: func @vector_load_op
				// CHECK: %[[c100:.*]] = llvm.mlir.constant(100 : index) : i64
				// CHECK: %[[mul:.]] = llvm.mul %{{.}}, %[[c100]] : i64
				// CHECK: %[[add:.]] = llvm.add %[[mul]], %{{.}} : i64
				// CHECK: %[[gep:.]] = llvm.getelementptr %{{.}}[%[[add]]] : (!llvm.ptr<f32>, i64) -> !llvm.ptr<f32>
				// CHECK: %[[bcast:.*]] = llvm.bitcast %[[gep]] : !llvm.ptr<f32> to !llvm.ptr<vector<8xf32>>
				// CHECK: llvm.load %[[bcast]] {alignment = 4 : i64} : !llvm.ptr<vector<8xf32>>

	func @masked_load_op(%arg0: memref<?xf32>, %arg1: vector<16xi1>, %arg2: vector<16xf32>) -> vector<16xf32> {			func @masked_load_op(%arg0: memref<?xf32>, %arg1: vector<16xi1>, %arg2: vector<16xf32>) -> vector<16xf32> {
	%c0 = constant 0: index			%c0 = constant 0: index
	%0 = vector.maskedload %arg0[%c0], %arg1, %arg2 : memref<?xf32>, vector<16xi1>, vector<16xf32> into vector<16xf32>			%0 = vector.maskedload %arg0[%c0], %arg1, %arg2 : memref<?xf32>, vector<16xi1>, vector<16xf32> into vector<16xf32>
	return %0 : vector<16xf32>			return %0 : vector<16xf32>
	}			}

	// CHECK-LABEL: func @masked_load_op			// CHECK-LABEL: func @masked_load_op
	// CHECK: %[[C:.*]] = llvm.mlir.constant(0 : index) : i64			// CHECK: %[[C:.*]] = llvm.mlir.constant(0 : index) : i64
	// CHECK: %[[P:.]] = llvm.getelementptr %{{.}}[%[[C]]] : (!llvm.ptr<f32>, i64) -> !llvm.ptr<f32>			// CHECK: %[[P:.]] = llvm.getelementptr %{{.}}[%[[C]]] : (!llvm.ptr<f32>, i64) -> !llvm.ptr<f32>
	// CHECK: %[[B:.*]] = llvm.bitcast %[[P]] : !llvm.ptr<f32> to !llvm.ptr<vector<16xf32>>			// CHECK: %[[B:.*]] = llvm.bitcast %[[P]] : !llvm.ptr<f32> to !llvm.ptr<vector<16xf32>>
	// CHECK: %[[L:.]] = llvm.intr.masked.load %[[B]], %{{.}}, %{{.*}} {alignment = 4 : i32} : (!llvm.ptr<vector<16xf32>>, vector<16xi1>, vector<16xf32>) -> vector<16xf32>			// CHECK: %[[L:.]] = llvm.intr.masked.load %[[B]], %{{.}}, %{{.*}} {alignment = 4 : i32} : (!llvm.ptr<vector<16xf32>>, vector<16xi1>, vector<16xf32>) -> vector<16xf32>
	// CHECK: llvm.return %[[L]] : vector<16xf32>			// CHECK: llvm.return %[[L]] : vector<16xf32>

				func @vector_store_op(%memref : memref<200x100xf32>, %i : index, %j : index) {
				%val = constant dense<11.0> : vector<4xf32>
				vector.store %val, %memref[%i, %j] : memref<200x100xf32>, vector<4xf32>
				return
				}

				// CHECK-LABEL: func @vector_store_op
				// CHECK: %[[c100:.*]] = llvm.mlir.constant(100 : index) : i64
				// CHECK: %[[mul:.]] = llvm.mul %{{.}}, %[[c100]] : i64
				// CHECK: %[[add:.]] = llvm.add %[[mul]], %{{.}} : i64
				// CHECK: %[[gep:.]] = llvm.getelementptr %{{.}}[%[[add]]] : (!llvm.ptr<f32>, i64) -> !llvm.ptr<f32>
				// CHECK: %[[bcast:.*]] = llvm.bitcast %[[gep]] : !llvm.ptr<f32> to !llvm.ptr<vector<4xf32>>
				// CHECK: llvm.store %{{.*}}, %[[bcast]] {alignment = 4 : i64} : !llvm.ptr<vector<4xf32>>

	func @masked_store_op(%arg0: memref<?xf32>, %arg1: vector<16xi1>, %arg2: vector<16xf32>) {			func @masked_store_op(%arg0: memref<?xf32>, %arg1: vector<16xi1>, %arg2: vector<16xf32>) {
	%c0 = constant 0: index			%c0 = constant 0: index
	vector.maskedstore %arg0[%c0], %arg1, %arg2 : memref<?xf32>, vector<16xi1>, vector<16xf32>			vector.maskedstore %arg0[%c0], %arg1, %arg2 : memref<?xf32>, vector<16xi1>, vector<16xf32>
	return			return
	}			}

	// CHECK-LABEL: func @masked_store_op			// CHECK-LABEL: func @masked_store_op
	// CHECK: %[[C:.*]] = llvm.mlir.constant(0 : index) : i64			// CHECK: %[[C:.*]] = llvm.mlir.constant(0 : index) : i64
	▲ Show 20 Lines • Show All 48 Lines • Show Last 20 Lines

mlir/test/Dialect/Vector/invalid.mlir

Show First 20 Lines • Show All 1,192 Lines • ▼ Show 20 Lines

func @type_cast_layout(%arg0: memref<4x3xf32, affine_map<(d0, d1)[s0, s1, s2] -> (d0 * s0 + d1 * s1 + s2)>>) {		func @type_cast_layout(%arg0: memref<4x3xf32, affine_map<(d0, d1)[s0, s1, s2] -> (d0 * s0 + d1 * s1 + s2)>>) {
// expected-error@+1 {{expects operand to be a memref with no layout}}		// expected-error@+1 {{expects operand to be a memref with no layout}}
%0 = vector.type_cast %arg0: memref<4x3xf32, affine_map<(d0, d1)[s0, s1, s2] -> (d0 * s0 + d1 * s1 + s2)>> to memref<vector<4x3xf32>>		%0 = vector.type_cast %arg0: memref<4x3xf32, affine_map<(d0, d1)[s0, s1, s2] -> (d0 * s0 + d1 * s1 + s2)>> to memref<vector<4x3xf32>>
}		}

// -----		// -----

		func @store_unsupported_layout(%memref : memref<200x100xf32, affine_map<(d0, d1) -> (d1, d0)>>,
		%i : index, %j : index, %value : vector<8xf32>) {
		// expected-error@+1 {{'vector.store' op base memref should have a default or identity layout}}
		vector.store %value, %memref[%i, %j] : memref<200x100xf32, affine_map<(d0, d1) -> (d1, d0)>>,
		vector<8xf32>
		}

		// -----

		func @vector_memref_mismatch(%memref : memref<200x100xvector<4xf32>>, %i : index,
		%j : index, %value : vector<8xf32>) {
		// expected-error@+1 {{'vector.store' op base memref and valueToStore vector types should match}}
		vector.store %value, %memref[%i, %j] : memref<200x100xvector<4xf32>>, vector<8xf32>
		}

		// -----

		func @store_base_type_mismatch(%base : memref<?xf64>, %value : vector<16xf32>) {
		%c0 = constant 0 : index
		// expected-error@+1 {{'vector.store' op base and valueToStore element type should match}}
		vector.store %value, %base[%c0] : memref<?xf64>, vector<16xf32>
		}

		// -----

		func @store_memref_index_mismatch(%base : memref<?xf32>, %value : vector<16xf32>) {
		// expected-error@+1 {{'vector.store' op requires 1 indices}}
		vector.store %value, %base[] : memref<?xf32>, vector<16xf32>
		}

		// -----

func @maskedload_base_type_mismatch(%base: memref<?xf64>, %mask: vector<16xi1>, %pass: vector<16xf32>) {		func @maskedload_base_type_mismatch(%base: memref<?xf64>, %mask: vector<16xi1>, %pass: vector<16xf32>) {
%c0 = constant 0 : index		%c0 = constant 0 : index
// expected-error@+1 {{'vector.maskedload' op base and result element type should match}}		// expected-error@+1 {{'vector.maskedload' op base and result element type should match}}
%0 = vector.maskedload %base[%c0], %mask, %pass : memref<?xf64>, vector<16xi1>, vector<16xf32> into vector<16xf32>		%0 = vector.maskedload %base[%c0], %mask, %pass : memref<?xf64>, vector<16xi1>, vector<16xf32> into vector<16xf32>
}		}

// -----		// -----

Show All 17 Lines	func @maskedload_memref_mismatch(%base: memref<?xf32>, %mask: vector<16xi1>, %pass: vector<16xf32>) {
// expected-error@+1 {{'vector.maskedload' op requires 1 indices}}		// expected-error@+1 {{'vector.maskedload' op requires 1 indices}}
%0 = vector.maskedload %base[], %mask, %pass : memref<?xf32>, vector<16xi1>, vector<16xf32> into vector<16xf32>		%0 = vector.maskedload %base[], %mask, %pass : memref<?xf32>, vector<16xi1>, vector<16xf32> into vector<16xf32>
}		}

// -----		// -----

func @maskedstore_base_type_mismatch(%base: memref<?xf64>, %mask: vector<16xi1>, %value: vector<16xf32>) {		func @maskedstore_base_type_mismatch(%base: memref<?xf64>, %mask: vector<16xi1>, %value: vector<16xf32>) {
%c0 = constant 0 : index		%c0 = constant 0 : index
// expected-error@+1 {{'vector.maskedstore' op base and value element type should match}}		// expected-error@+1 {{'vector.maskedstore' op base and valueToStore element type should match}}
vector.maskedstore %base[%c0], %mask, %value : memref<?xf64>, vector<16xi1>, vector<16xf32>		vector.maskedstore %base[%c0], %mask, %value : memref<?xf64>, vector<16xi1>, vector<16xf32>
}		}

// -----		// -----

func @maskedstore_dim_mask_mismatch(%base: memref<?xf32>, %mask: vector<15xi1>, %value: vector<16xf32>) {		func @maskedstore_dim_mask_mismatch(%base: memref<?xf32>, %mask: vector<15xi1>, %value: vector<16xf32>) {
%c0 = constant 0 : index		%c0 = constant 0 : index
// expected-error@+1 {{'vector.maskedstore' op expected value dim to match mask dim}}		// expected-error@+1 {{'vector.maskedstore' op expected valueToStore dim to match mask dim}}
vector.maskedstore %base[%c0], %mask, %value : memref<?xf32>, vector<15xi1>, vector<16xf32>		vector.maskedstore %base[%c0], %mask, %value : memref<?xf32>, vector<15xi1>, vector<16xf32>
}		}

// -----		// -----

func @maskedstore_memref_mismatch(%base: memref<?xf32>, %mask: vector<16xi1>, %value: vector<16xf32>) {		func @maskedstore_memref_mismatch(%base: memref<?xf32>, %mask: vector<16xi1>, %value: vector<16xf32>) {
%c0 = constant 0 : index		%c0 = constant 0 : index
// expected-error@+1 {{'vector.maskedstore' op requires 1 indices}}		// expected-error@+1 {{'vector.maskedstore' op requires 1 indices}}
▲ Show 20 Lines • Show All 181 Lines • Show Last 20 Lines

mlir/test/Dialect/Vector/ops.mlir

	Show First 20 Lines • Show All 444 Lines • ▼ Show 20 Lines
	// CHECK-LABEL: @flat_transpose_int			// CHECK-LABEL: @flat_transpose_int
	func @flat_transpose_int(%arg0: vector<16xi32>) -> vector<16xi32> {			func @flat_transpose_int(%arg0: vector<16xi32>) -> vector<16xi32> {
	// CHECK: %[[X:.]] = vector.flat_transpose %{{.}} {columns = 8 : i32, rows = 2 : i32} : vector<16xi32> -> vector<16xi32>			// CHECK: %[[X:.]] = vector.flat_transpose %{{.}} {columns = 8 : i32, rows = 2 : i32} : vector<16xi32> -> vector<16xi32>
	%0 = vector.flat_transpose %arg0 { rows = 2: i32, columns = 8: i32 } : vector<16xi32> -> vector<16xi32>			%0 = vector.flat_transpose %arg0 { rows = 2: i32, columns = 8: i32 } : vector<16xi32> -> vector<16xi32>
	// CHECK: return %[[X]] : vector<16xi32>			// CHECK: return %[[X]] : vector<16xi32>
	return %0 : vector<16xi32>			return %0 : vector<16xi32>
	}			}

				// CHECK-LABEL: @vector_load_and_store_1d_scalar_memref
				func @vector_load_and_store_1d_scalar_memref(%memref : memref<200x100xf32>,
				%i : index, %j : index) {
				// CHECK: %[[ld:.]] = vector.load %{{.}}[%{{.*}}] : memref<200x100xf32>, vector<8xf32>
				%0 = vector.load %memref[%i, %j] : memref<200x100xf32>, vector<8xf32>
				// CHECK: vector.store %[[ld]], %{{.}}[%{{.}}] : memref<200x100xf32>, vector<8xf32>
				vector.store %0, %memref[%i, %j] : memref<200x100xf32>, vector<8xf32>
				return
				}

				// CHECK-LABEL: @vector_load_and_store_1d_vector_memref
				func @vector_load_and_store_1d_vector_memref(%memref : memref<200x100xvector<8xf32>>,
				%i : index, %j : index) {
				// CHECK: %[[ld:.]] = vector.load %{{.}}[%{{.*}}] : memref<200x100xvector<8xf32>>, vector<8xf32>
				%0 = vector.load %memref[%i, %j] : memref<200x100xvector<8xf32>>, vector<8xf32>
				// CHECK: vector.store %[[ld]], %{{.}}[%{{.}}] : memref<200x100xvector<8xf32>>, vector<8xf32>
				vector.store %0, %memref[%i, %j] : memref<200x100xvector<8xf32>>, vector<8xf32>
				return
				}

				// CHECK-LABEL: @vector_load_and_store_out_of_bounds
				func @vector_load_and_store_out_of_bounds(%memref : memref<7xf32>) {
				%c0 = constant 0 : index
				// CHECK: %[[ld:.]] = vector.load %{{.}}[%{{.*}}] : memref<7xf32>, vector<8xf32>
				%0 = vector.load %memref[%c0] : memref<7xf32>, vector<8xf32>
				// CHECK: vector.store %[[ld]], %{{.}}[%{{.}}] : memref<7xf32>, vector<8xf32>
				vector.store %0, %memref[%c0] : memref<7xf32>, vector<8xf32>
				return
				}

				// CHECK-LABEL: @vector_load_and_store_2d_scalar_memref
				func @vector_load_and_store_2d_scalar_memref(%memref : memref<200x100xf32>,
				%i : index, %j : index) {
				// CHECK: %[[ld:.]] = vector.load %{{.}}[%{{.*}}] : memref<200x100xf32>, vector<4x8xf32>
				%0 = vector.load %memref[%i, %j] : memref<200x100xf32>, vector<4x8xf32>
				// CHECK: vector.store %[[ld]], %{{.}}[%{{.}}] : memref<200x100xf32>, vector<4x8xf32>
				vector.store %0, %memref[%i, %j] : memref<200x100xf32>, vector<4x8xf32>
				return
				}

				// CHECK-LABEL: @vector_load_and_store_2d_vector_memref
				func @vector_load_and_store_2d_vector_memref(%memref : memref<200x100xvector<4x8xf32>>,
				%i : index, %j : index) {
				// CHECK: %[[ld:.]] = vector.load %{{.}}[%{{.*}}] : memref<200x100xvector<4x8xf32>>, vector<4x8xf32>
				%0 = vector.load %memref[%i, %j] : memref<200x100xvector<4x8xf32>>, vector<4x8xf32>
				// CHECK: vector.store %[[ld]], %{{.}}[%{{.}}] : memref<200x100xvector<4x8xf32>>, vector<4x8xf32>
				vector.store %0, %memref[%i, %j] : memref<200x100xvector<4x8xf32>>, vector<4x8xf32>
				return
				}

	// CHECK-LABEL: @masked_load_and_store			// CHECK-LABEL: @masked_load_and_store
	func @masked_load_and_store(%base: memref<?xf32>, %mask: vector<16xi1>, %passthru: vector<16xf32>) {			func @masked_load_and_store(%base: memref<?xf32>, %mask: vector<16xi1>, %passthru: vector<16xf32>) {
	%c0 = constant 0 : index			%c0 = constant 0 : index
	// CHECK: %[[X:.]] = vector.maskedload %{{.}}[%{{.}}], %{{.}}, %{{.*}} : memref<?xf32>, vector<16xi1>, vector<16xf32> into vector<16xf32>			// CHECK: %[[X:.]] = vector.maskedload %{{.}}[%{{.}}], %{{.}}, %{{.*}} : memref<?xf32>, vector<16xi1>, vector<16xf32> into vector<16xf32>
	%0 = vector.maskedload %base[%c0], %mask, %passthru : memref<?xf32>, vector<16xi1>, vector<16xf32> into vector<16xf32>			%0 = vector.maskedload %base[%c0], %mask, %passthru : memref<?xf32>, vector<16xi1>, vector<16xf32> into vector<16xf32>
	// CHECK: vector.maskedstore %{{.}}[%{{.}}], %{{.*}}, %[[X]] : memref<?xf32>, vector<16xi1>, vector<16xf32>			// CHECK: vector.maskedstore %{{.}}[%{{.}}], %{{.*}}, %[[X]] : memref<?xf32>, vector<16xi1>, vector<16xf32>
	vector.maskedstore %base[%c0], %mask, %0 : memref<?xf32>, vector<16xi1>, vector<16xf32>			vector.maskedstore %base[%c0], %mask, %0 : memref<?xf32>, vector<16xi1>, vector<16xf32>
	return			return
	Show All 36 Lines

This is an archive of the discontinued LLVM Phabricator instance.

[mlir][Vector] Introduce 'vector.load' and 'vector.store' opsClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 321887

mlir/include/mlir/Dialect/Vector/VectorOps.td

mlir/lib/Conversion/AffineToStandard/AffineToStandard.cpp

mlir/lib/Conversion/VectorToLLVM/ConvertVectorToLLVM.cpp

mlir/lib/Dialect/Vector/VectorOps.cpp

mlir/test/Conversion/AffineToStandard/lower-affine-to-vector.mlir

mlir/test/Conversion/VectorToLLVM/vector-to-llvm.mlir

mlir/test/Dialect/Vector/invalid.mlir

mlir/test/Dialect/Vector/ops.mlir

[mlir][Vector] Introduce 'vector.load' and 'vector.store' ops
ClosedPublic