This is an archive of the discontinued LLVM Phabricator instance.

[mlir][Vector] Introduce 'vector.load' and 'vector.store' ops
ClosedPublic

Authored by dcaballe on Feb 5 2021, 3:13 PM.

Details

Summary

This patch adds the 'vector.load' and 'vector.store' ops to the Vector
dialect [1]. These operations model *contiguous* vector loads and stores
from/to memory. Their semantics are similar to the 'affine.vector_load' and
'affine.vector_store' counterparts but without the affine constraints. The
most relevant feature is that these new vector operations may perform a vector
load/store on memrefs with a non-vector element type, unlike the 'std.load' and
'std.store' ops, which access a single element at a time. This opens the
representation to model more generic vector load/store scenarios: unaligned
vector loads/stores, mixing scalar and vector memory accesses on the same
memref, decoupling memory allocation constraints from memory accesses, etc. [1].
These operations will also facilitate the progressive lowering of both Affine
vector loads/stores and Vector transfer reads/writes that access contiguous
slices of memory.
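As a quick sketch (the memref shape and SSA names below are made up for illustration, not taken from the patch), loading and storing a vector from a memref with a scalar element type would look roughly like:

// Vector load/store on a memref whose element type is a plain f32.
%0 = vector.load %base[%i, %j] : memref<100x100xf32>, vector<8xf32>
vector.store %0, %base[%i, %j] : memref<100x100xf32>, vector<8xf32>
// In contrast, 'std.load' on the same memref can only read a single f32:
%1 = load %base[%i, %j] : memref<100x100xf32>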

In particular, this patch adds the 'vector.load' and 'vector.store' ops to the
Vector dialect, implements their lowering to the LLVM dialect, and changes the
lowering of 'affine.vector_load' and 'affine.vector_store' ops to the new vector
ops. The lowering of Vector transfer reads/writes will be implemented in the
future, probably as an independent pass. The API of 'vector.maskedload' and
'vector.maskedstore' has also been changed slightly to align it with the
transfer read/write ops and the new vector ops. This will improve reusability
among all these operations. For example, the lowering of 'vector.load',
'vector.store', 'vector.maskedload' and 'vector.maskedstore' to the LLVM dialect
is implemented with a single template conversion pattern.

[1] https://llvm.discourse.group/t/memref-type-and-data-layout/

Diff Detail

Event Timeline

dcaballe created this revision.Feb 5 2021, 3:13 PM
dcaballe requested review of this revision.Feb 5 2021, 3:13 PM
Herald added a project: Restricted Project. Feb 5 2021, 3:13 PM
bondhugula added inline comments.Feb 5 2021, 9:12 PM
mlir/include/mlir/Dialect/Vector/VectorOps.td
1326

"contiguous n-D slice of memory" sounds a bit inherently contradictory to me.

One nitpicky request already on the order in the td file.

We had been thinking for a while about completing the memory operations with these unmasked versions, so thanks for doing this.
I am okay with this, but you may want to also check with Nicolas, who has big future plans for the transfer operations
(but I feel these new versions are always good for progressive lowering).

I am a bit more on the fence about some of the name changes you made to the existing ops. Can you please clarify a bit what your reasoning was for these?

mlir/include/mlir/Dialect/Vector/VectorOps.td
1322–1472

A bit of a nitpicky request, but I insist ;-)

The order of memory operations in the ops file is strictly in pairs:

Vector_TransferReadOp / Vector_TransferWriteOp
Vector_MaskedLoadOp / Vector_MaskedStoreOp
Vector_GatherOp / Vector_ScatterOp
Vector_ExpandLoadOp / Vector_CompressStoreOp

You break that convention by placing the unmasked versions right before the masked load and store, respectively.
I would prefer to have the new ones in between the Transfer and Masked versions, but paired together and in the load/store order you already have.

Thanks for the feedback!

We had been thinking for a while about completing the memory operations with these unmasked versions, so thanks for doing this.
I am okay with this, but you may want to also check with Nicolas, who has big future plans for the transfer operations
(but I feel these new versions are always good for progressive lowering).

Yeah, this is what we discussed with @nicolasvasilache and @ftynse to lower vector transfer ops. Hopefully the plan hasn't changed.

I am a bit more on the fence about some of the name changes you made to the existing ops. Can you please clarify a bit what your reasoning was for these?

The intention was to align the APIs so that we can write generic code for all the vector load/store flavors. For example, vector.maskedload had getResultVectorType() and vector.maskedstore had getValueVectorType() to return the vector type used in the op. These two methods were changed to getVectorType() to be able to write a single conversion pattern to LLVM for both of them, and also for vector.load and vector.store. As you can see in ConvertVectorToLLVM.cpp, VectorLoadStoreConversion pattern is used for the four ops thanks to the unified API, instead of having four independent patterns. For example, see the following code in VectorLoadStoreConversion:

// Resolve address.
auto vtype = this->typeConverter->convertType(loadOrStoreOp.getVectorType());

I think this will expose more reusability opportunities beyond the current LLVM conversion pattern since these four operations are pretty similar. The same approach is followed by vector transfer ops and affine vector loads/stores. They also have getVectorType(), getMemRefType(), valueToStore, etc. I thought it would be a good idea to align vector.load/store and vector.maskedload/maskedstore with that existing API to facilitate reusability. We are somehow paving the way for a VectorMemoryOp interface. However, no strong opinion about the API names. We could also change the API of transfer and affine ops but that would require far more changes all over the place.

mlir/include/mlir/Dialect/Vector/VectorOps.td
1322–1472

Sure, no problem at all! I thought that having the vector.load next to the vector.maskedload would help quickly see the different variants that we have to perform a vector load. I'll change that. Thanks!

1326

Ok, I can remove "contiguous" from this line. The line below should be enough: "This slice is contiguous along the respective dimensions of the shape." That is what we have in the description of the affine counterparts.

The intention was to align the APIs so that we can write generic code for all the vector load/store flavors. For example, vector.maskedload had getResultVectorType() and vector.maskedstore had getValueVectorType() to return the vector type used in the op. These two methods were changed to getVectorType() to be able to write a single conversion pattern to LLVM for both of them, and also for vector.load and vector.store. As you can see in ConvertVectorToLLVM.cpp, VectorLoadStoreConversion pattern is used for the four ops thanks to the unified API, instead of having four independent patterns. For example, see the following code in VectorLoadStoreConversion:

Yes, aligning the API for that purpose makes sense, but you changed the API of masked load/store and not of gather/scatter or expand/compress, and all these mem ops more or less go together.
Perhaps, if you don't mind, you can send out another CL that does the API name change for all of these, just so that we have "progressive" lowering for this change as well (and consistency over all mem ops)?

nicolasvasilache requested changes to this revision.Feb 8 2021, 11:58 PM

Thanks for working on this Diego.
Let's please connect vector.transfer_read/write too, otherwise it is unclear whether the abstraction is the right progressive lowering funnel.

mlir/include/mlir/Dialect/Vector/VectorOps.td
1363

assumptions

1528

Same remark re. "contiguous" here: this makes me think that vector<2x4xf32> stores 32 contiguous bytes, which is not the case.
I'd actually rephrase this part and describe it as something carrying the intent of "strided by the memref strides" but not contiguous.
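For illustration (a hedged sketch with made-up shapes, describing the "strided by the memref strides" intent stated above rather than the op's final wording):

// A 2-D store such as
vector.store %v, %buf[%i, %j] : memref<200x100xf32>, vector<2x4xf32>
// would write %buf[%i, %j..%j+3] and %buf[%i+1, %j..%j+3]: each row of four
// f32 values is contiguous, but the two rows are separated by the memref's
// row stride, so the 8 values are generally not 32 contiguous bytes.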

This revision now requires changes to proceed.Feb 8 2021, 11:58 PM
dcaballe updated this revision to Diff 322549.Feb 9 2021, 5:15 PM
dcaballe marked an inline comment as done.

Address feedback. Thanks!

Perhaps, if you don't mind, you can send out another CL that does the API name change for all of these, just so that we have "progressive" lowering for this change as well (and consistency over all mem ops)?

Sure! I will follow up with a patch for those ops.

Let's please connect vector.transfer_read/write too, otherwise it is unclear whether the abstraction is the right progressive lowering funnel.

Would you mind if we did that in a separate patch? It would be a new feature, and lowering vector transfers requires much more work since we have to analyze the strides, the permutation map, the mask, the padding, etc., and then generate the corresponding lower-level vector ops (vector.load/store, vector.broadcast, vector.shuffle, vector.maskedload/maskedstore, vector.gather/scatter, etc.). I was thinking about creating a LowerVectorTransferOps pass in a follow-up patch so that we can all contribute lowering cases incrementally. Does that sound reasonable?
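As a rough sketch of the simplest case such a pass could handle (this rewrite is only illustrative of the plan above, not part of this patch):

// A 1-D, in-bounds, contiguous transfer such as
%v = vector.transfer_read %A[%i], %pad : memref<?xf32>, vector<16xf32>
// could be rewritten to a plain contiguous load,
%v = vector.load %A[%i] : memref<?xf32>, vector<16xf32>
// while masked/padded/permuted cases would need the other ops listed above.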

mlir/include/mlir/Dialect/Vector/VectorOps.td
1326

I removed contiguous and changed the last part of this paragraph. Let me know if it's better now.

1528

I removed contiguous and changed the last part of this paragraph. Let me know if it's better now.

bondhugula requested changes to this revision.Feb 9 2021, 6:31 PM
bondhugula added inline comments.
mlir/lib/Dialect/Vector/VectorOps.cpp
2384

affineMaps.empty() would be the right check here as opposed to affineMaps.size() != 1.

This revision now requires changes to proceed.Feb 9 2021, 6:31 PM
dcaballe added inline comments.Feb 9 2021, 9:02 PM
mlir/lib/Dialect/Vector/VectorOps.cpp
2384

Oh, if a single identity map is provided, it's not kept around. Got it. Thanks!

dcaballe updated this revision to Diff 322582.Feb 9 2021, 9:02 PM

Addressed Uday's comment and reorganized the code as suggested by Aart.


If this is a commitment that you are working on starting to connect the pieces next then fair enough :)

If this is a commitment that you are working on starting to connect the pieces next then fair enough :)

Yeah, we have that in our schedule. We'll start with the basic 1-D contiguous cases.

Any other comments?

This revision was not accepted when it landed; it landed in state Needs Review.Feb 12 2021, 10:53 AM
This revision was automatically updated to reflect the committed changes.