This is an archive of the discontinued LLVM Phabricator instance.

Differential D154941

[mlir][ArmSME] Add custom get_tile_id and cast ops
ClosedPublic

Authored by c-rhodes on Jul 11 2023, 3:31 AM.

Details

Reviewers

awarzynski
WanderAway
rriddle
antiagainst
ftynse
dcaballe
nicolasvasilache

Commits

rG6ff9761a69df: [mlir][ArmSME] Add custom get_tile_id and cast ops

Summary

This patch adds three new custom ops to the ArmSME dialect:

arm_sme.get_tile_id - returns a scalar integer representing an SME "virtual tile" that is not in use.
arm_sme.cast_tile_to_vector - casts from a tile id to a 2-d scalable vector type, which represents an SME "virtual tile".
arm_sme.cast_vector_to_tile - casts from a 2-d scalable vector type, which represents an SME "virtual tile", to a tile id.

The 'arm_sme.get_tile_id' op currently only supports tile 0; a follow-up
patch will implement proper tile allocation. A further follow-up patch
will demonstrate load/store to/from ZA using these ops.

See the op descriptions for further details and examples.

Thanks to @paulwalker-arm and @awarzynski for helping drive this.

Event Timeline

c-rhodes created this revision. Jul 11 2023, 3:31 AM

Herald added a reviewer: rriddle. Jul 11 2023, 3:31 AM

Herald added a reviewer: antiagainst.

Herald added a reviewer: ftynse.

Herald added a reviewer: dcaballe.

Herald added a project: Restricted Project.

Herald added subscribers: gysit, Dinistro, bviyer and 24 others.

c-rhodes requested review of this revision. Jul 11 2023, 3:31 AM

Herald added a reviewer: nicolasvasilache. Jul 11 2023, 3:31 AM

Herald added subscribers: alextsao1999, stephenneuendorffer, nicolasvasilache.

c-rhodes edited the summary of this revision. Jul 11 2023, 4:20 AM

c-rhodes added a subscriber: paulwalker-arm.

Herald added a subscriber: wangpc. Jul 11 2023, 4:20 AM

Harbormaster completed remote builds in B244403: Diff 539000. Jul 11 2023, 4:35 AM

c-rhodes added a child revision: D154955: [mlir][ArmSME] Implement tile allocation. Jul 11 2023, 5:49 AM

LGTM % a few minor details. Great work!

mlir/include/mlir/Dialect/ArmSME/IR/ArmSME.td
54	What about `f32` and `f64`?
188	It may be useful to also indicate what is the intended live range of the allocated tile?

Add FP types.

In D154941#4489283, @WanderAway wrote:

LGTM % a few minor details. Great work!

thanks for taking a look!

mlir/include/mlir/Dialect/ArmSME/IR/ArmSME.td
54	> What about `f32` and `f64`?
	Thanks for pointing that out, I've also added F16/BF16.
188	> It may be useful to also indicate what is the intended live range of the allocated tile?
	Sorry, could you clarify? Not sure exactly what you mean.

It may be useful to also indicate what is the intended live range of the allocated tile?

Sorry could clarify? Not sure exactly what you mean

I wasn't entirely sure when the tile allocated by this will be "deallocated" since I misread the description of D154955 earlier, but upon a closer look I realized the tile allocated is valid in the scope of the function. I thought it'd be a good idea to make this clear in the description (and also how one probably wouldn't be able to pass this as an argument to another function call)

awarzynski mentioned this in D154867: [mlir][ArmSME] Introduce custom ops for SME. Jul 11 2023, 8:14 AM

awarzynski added a child revision: D154867: [mlir][ArmSME] Introduce custom ops for SME. Jul 11 2023, 8:15 AM

In D154941#4489650, @WanderAway wrote:

It may be useful to also indicate what is the intended live range of the allocated tile?

Sorry could clarify? Not sure exactly what you mean

I wasn't entirely sure when the tile allocated by this will be "deallocated" since I misread the description of D154955 earlier, but upon a closer look I realized the tile allocated is valid in the scope of the function. I thought it'd be a good idea to make this clear in the description (and also how one probably wouldn't be able to pass this as an argument to another function call)

Yes, that's right: the scope is a function. In the future we could add a deallocation op to free up tiles within functions, and also support spilling/filling to memory instead of throwing a "ran out of tiles" error. I'll fix the description to capture what you raised. Cheers.

Harbormaster completed remote builds in B244474: Diff 539100. Jul 11 2023, 10:16 AM

Great work @c-rhodes , thank you! I've actually immediately rebased https://reviews.llvm.org/D154867 on top of this change and that immediately solved the "data flow" issue 🙏🏻

Overall this looks solid to me. I've left a few minor suggestions - mostly to clarify the documentation.

mlir/include/mlir/Dialect/ArmSME/IR/ArmSME.td
46–47	[nit] Perhaps extract the preconditions to dedicated definition? Also, when would this be triggered: CPred<"::llvm::cast<::mlir::VectorType>($_self).getShape() == ArrayRef<int64_t>({" # !interleave(dims, ", ") # "})" It feels like a "VectorType verifier" that could be safely skipped (i.e. nothing SME specific).
75–76	Perhaps: "This is used in conjunction with `cast_vector_to_tile" --> "This would normally be used in conjunction with "virtual tile load" operations to model the output of such Ops. This is required to preserve data-flow as SME intrinsics do not return values." Basically, this Op and `CastVectorToTile` complement each other, right? And I guess that's what we want to say here? But IMHO, this description should focus on `CastTileToVector`.
81–96	This example is a bit busy. I would focus on the Op that's defined here (i.e. `CastTileToVector`), so that this description is self-contained (try to avoid references to `CastVectorToTile`). My suggestion:
EXAMPLE:
Input:
```lang=cpp
vector.store %tile, %mem2[%c0] : memref<?xi32>, vector<[4]x[4]xi32>
vector.store %tile, %mem2[%c0] : memref<?xi32>, vector<[4]x[4]xi32>
```
After lowering `vector.load`:
```lang=cpp
%tile_id = arith.constant 0 : i32
scf.for %vnum = %c0 to %num_vectors step %c1 {
  // ...
  "arm_sme.intr.ld1w.horiz"(%pg, %ptr, %tile_id, %vnum) : (vector<[4]xi1>, !llvm.ptr, i32, i32) -> ()
}
%tile = arm_sme.cast_tile_to_vector %tile_id : i32 to vector<[4]x[4]xi32>
vector.store %tile, %mem2[%c0] : memref<?xi32>, vector<[4]x[4]xi32>
```
Another question - are `vector.load` and `vector.store` the right Ops here? We don't really lower from these ATM.
105–108	This comment refers to `CastVectorToTile`
110–113	This comment refers to "these casts", but this is just one cast ;-)
130–161	This example is a bit busy. I would focus on the Op that's defined here (i.e. `CastVectorToTile`), so that this description is self-contained (try to avoid references to `CastTileToVector`). My suggestion:
EXAMPLE:
Input:
```mlir
%tile = vector.load %mem1[%c0] : memref<?xi32>, vector<[4]x[4]xi32>
vector.store %tile, %mem2[%c0] : memref<?xi32>, vector<[4]x[4]xi32>
```
Output after lowering `vector.store`:
```mlir
%tile = vector.load %mem1[%c0] : memref<?xi32>, vector<[4]x[4]xi32>
%tile_id = arm_sme.cast_vector_to_tile %tile : : (vector<[4]x[4]xi32>) -> i32
scf.for %vnum = %c0 to %num_vectors step %c1 {
  // ...
  "arm_sme.intr.st1w.horiz"(%pg, %ptr, %tile_id, %vnum) : (vector<[4]xi1>, !llvm.ptr, i32, i32) -> ()
}
```
Additionally, canonicalization will look through `cast_vector_to_tile` Ops and fold the cast ops away if they come from `cast_tile_to_vector`.
178	[nit] Is there any "tile allocation" really taking place? Perhaps "Allocate and return a "virtual tile" ID"?
mlir/test/Dialect/ArmSME/canonicalize.mlir
10–11	What about "the other way round"?
	%tile_id = arm_sme.cast_vector_to_tile %tile_1 : vector<[16]x[16]xi8> to i8
	%tile_2 = arm_sme.cast_tile_to_vector %tile_id : i8 to vector<[16]x[16]xi8>
mlir/test/Dialect/ArmSME/invalid.mlir
5	How about:
	func.func @arm_sme_cast_tile_to_vector__bad_vector_type(%tile_id : i8) -> vector<[16]x16xi8>
	and other combinations? For example:
	`vector<[16]x[16]xi4>`
	`vector<16x[16]xi8>`
mlir/test/Dialect/ArmSME/roundtrip.mlir
8	Could you add one more other element type? For example, `vector<[1]x[1]xi128>` (i.e. the other extreme).
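
The cast folding discussed for canonicalize.mlir above can be modelled outside MLIR. What follows is a minimal C++ sketch under stated assumptions, not the fold from the patch: `Node`, `value`, `cast`, and `fold` are hypothetical names, operand and result types are ignored entirely (the real ops carry a tile id type and a vector type), and both cast orders are assumed to be foldable, as the "other way round" comment suggests.

```cpp
#include <cassert>
#include <memory>
#include <string>
#include <utility>

// Hypothetical IR model: a named value, or one of the two casts wrapping an operand.
enum class Kind { Value, CastTileToVector, CastVectorToTile };

struct Node {
  Kind kind;
  std::string name;              // used only for Kind::Value
  std::shared_ptr<Node> operand; // used only for the two casts
};
using NodePtr = std::shared_ptr<Node>;

NodePtr value(std::string name) {
  return std::make_shared<Node>(Node{Kind::Value, std::move(name), nullptr});
}

NodePtr cast(Kind kind, NodePtr operand) {
  assert(kind != Kind::Value && operand);
  return std::make_shared<Node>(Node{kind, "", std::move(operand)});
}

// Fold a cast whose operand is the inverse cast: cast(inverse(x)) -> x.
// Assumption: types are not modelled, so no type-equality check is made here.
NodePtr fold(const NodePtr &n) {
  if (n->kind == Kind::Value)
    return n;
  NodePtr inner = fold(n->operand);
  const bool inversePair =
      (n->kind == Kind::CastVectorToTile && inner->kind == Kind::CastTileToVector) ||
      (n->kind == Kind::CastTileToVector && inner->kind == Kind::CastVectorToTile);
  if (inversePair)
    return inner->operand;
  return cast(n->kind, inner);
}
```

A lone cast with no inverse beneath it is left in place, which corresponds to the "incomplete lowering" case where a cast may persist.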

Address comments

c-rhodes marked 10 inline comments as done and an inline comment as not done. Jul 12 2023, 8:49 AM

c-rhodes added inline comments.

mlir/include/mlir/Dialect/ArmSME/IR/ArmSME.td
46–47	> Also, when would this be triggered:
	> CPred<"::llvm::cast<::mlir::VectorType>($_self).getShape() == ArrayRef<int64_t>({" # !interleave(dims, ", ") # "})"
	> It feels like a "VectorType verifier" that could be safely skipped (i.e. nothing SME specific).
	Please could you clarify, not sure what you mean? This verifies the shape, i.e. `vector<[16]x[16]xi8>` is (16, 16).
75–76	> Perhaps: "This is used in conjunction with `cast_vector_to_tile" --> "This would normally be used in conjunction with "virtual tile load" operations to model the output of such Ops. This is required to preserve data-flow as SME intrinsics do not return values."
	> Basically, this Op and `CastVectorToTile` complement each other, right? And I guess that's what we want to say here? But IMHO, this description should focus on `CastTileToVector`.
	Thanks for the suggestion this has cleaned it up nicely
178	> [nit] Is there any "tile allocation" really taking place? Perhaps "Allocate and return a "virtual tile" ID"?
	There isn't from the perspective of the op I suppose, it's the pass that does that. Updated the comment.
mlir/test/Dialect/ArmSME/canonicalize.mlir
10–11	> What about "the other way round"?
	> %tile_id = arm_sme.cast_vector_to_tile %tile_1 : vector<[16]x[16]xi8> to i8
	> %tile_2 = arm_sme.cast_tile_to_vector %tile_id : i8 to vector<[16]x[16]xi8>
	Good spot!
mlir/test/Dialect/ArmSME/roundtrip.mlir
8	> Could you add one more other element type? For example, `vector<[1]x[1]xi128>` (i.e. the other extreme).
	I've added tests for all element types

Harbormaster completed remote builds in B244815: Diff 539587. Jul 12 2023, 12:23 PM

LGTM! I like the abstraction! Awesome to see this moving forward!

mlir/include/mlir/Dialect/ArmSME/IR/ArmSME.td
44	Is there a construct to make this a VectorType instead of a ShapedType? I guess the subsequent predicates constrain the shaped type a bit more, but it would be great if this could be a vector type directly.
118	This abstraction sounds really great!
131	I guess you also considered introducing a single cast op that could cast both ways depending on the order of the operand/types. I think having two makes sense since this cast is kind of crossing two domains...
166	nit: do the quotes imply anything on `virtual tiles`? I think I don't get what it is :)

This revision is now accepted and ready to land. Jul 12 2023, 11:05 PM

c-rhodes marked 4 inline comments as done. Jul 13 2023, 1:41 AM

c-rhodes added inline comments.

mlir/include/mlir/Dialect/ArmSME/IR/ArmSME.td
44	> Is there a construct to make this a VectorType instead of a ShapedType? I guess the subsequent predicates constrain the shaped type a bit more, but it would be great if this could be a vector type directly.
	I don't believe there is an existing one. The vector ones in `mlir/include/mlir/IR/OpBase.td` that I used for reference here also use this, but perhaps `ShapedContainerType` could be copied for a `VectorType`.
131	> I guess you also considered introducing a single cast op that could cast both ways depending on the order of the operand/types. I think having two makes sense since this cast is kind of crossing two domains...
	That didn't cross my mind actually, it's a good point. I think of these casts as being similar to `builtin.unrealized_conversion_cast`, which does something similar but with a single cast op like you say, so perhaps this could be a single cast as well.
166	> nit: do the quotes imply anything on `virtual tiles`? I think I don't get what it is :)
	To be honest I would prefer we just use `tile`, but the rationale is that these are not real tiles but merely “views” into ZA.

LGTM, thanks for addressing my comments :)

mlir/include/mlir/Dialect/ArmSME/IR/ArmSME.td
44	How about ScalableVectorOf? Could that be applicable here?
46–47	I am just thinking that every vector that you create like this will satisfy this condition and to me this check feels redundant. But I am probably just failing to understand the underlying rationale. No harm in keeping this.
166	I've been suggesting "virtual tiles" as different people mean different things when referring to tiles. "SME virtual tiles" is just a way to highlight that:
	We mean the tiles in the context of the Arm SME extension (as opposed to e.g. tiles when tiling a matmul).
	These tiles are actually "views" into ZA rather than "tiles". A "tile" to me suggests that it's something "square" and so "ZA tile" could, incorrectly, imply "a square section of ZA".
	It's a name that one of our architects at Arm has been using and I feel that's very fitting. Naming is hard!

c-rhodes added inline comments. Jul 13 2023, 6:35 AM

mlir/include/mlir/Dialect/ArmSME/IR/ArmSME.td
44	> How about ScalableVectorOf? Could that be applicable here?
	All of the existing scalable vector predicates only check that the vector is scalable (i.e. any dim is scalable), not that all dims are scalable.
46–47	> I am just thinking that every vector that you create like this will satisfy this condition and to me this check feels redundant.
	And this verifies that :)
166	In the context of SME I think it's clear what a tile is, but I have no strong feelings either way.
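
To make the "any dim" versus "all dims" distinction from the line 44 reply concrete, here is a small standalone C++ sketch. It is an illustration only: `VecShape`, `anyDimScalable`, and `allDimsScalableModel` are hypothetical stand-ins and do not call any MLIR API.

```cpp
#include <algorithm>
#include <cassert>
#include <cstdint>
#include <vector>

// Hypothetical stand-in for a vector type: one size and one scalable flag per dimension.
struct VecShape {
  std::vector<std::int64_t> dims;
  std::vector<bool> scalable;
};

// True if at least one dimension is scalable (the "is scalable" sense in the reply).
bool anyDimScalable(const VecShape &v) {
  assert(v.dims.size() == v.scalable.size());
  return std::any_of(v.scalable.begin(), v.scalable.end(),
                     [](bool s) { return s; });
}

// True only if every dimension is scalable. Assumption of this sketch: a
// rank-0 shape is treated as not satisfying the check.
bool allDimsScalableModel(const VecShape &v) {
  assert(v.dims.size() == v.scalable.size());
  return !v.scalable.empty() &&
         std::all_of(v.scalable.begin(), v.scalable.end(),
                     [](bool s) { return s; });
}
```

Under this model a shape like `vector<[16]x16xi8>` passes the any-dim check but fails the all-dims check, which is why an any-dim predicate alone would not be strict enough for an SME tile type.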

c-rhodes added inline comments. Jul 13 2023, 6:47 AM

mlir/include/mlir/Dialect/ArmSME/IR/ArmSME.td
46–47	> > I am just thinking that every vector that you create like this will satisfy this condition and to me this check feels redundant.
	> And this verifies that :)
	To clarify, without this check:
	%tile = arm_sme.cast_tile_to_vector %tile_id : i32 to vector<[4]x[8]xi32>
	would be valid

benmxwl-arm added a subscriber: benmxwl-arm. Jul 14 2023, 2:26 AM

Minor update to add scalar int type to name of functions for get_tile_id tests

Harbormaster completed remote builds in B245404: Diff 540436. Jul 14 2023, 9:40 AM

Matt added a subscriber: Matt. Jul 14 2023, 2:45 PM

awarzynski added inline comments. Jul 17 2023, 12:16 AM

mlir/include/mlir/Dialect/ArmSME/IR/ArmSME.td
46–47	> To clarify, without this check:
	> %tile = arm_sme.cast_tile_to_vector %tile_id : i32 to vector<[4]x[8]xi32>
	> would be valid
	Could you double-check? This works fine:
	module {
	  func.func @arm_sme_cast_tile_to_vector_i8(%arg0: i8) -> vector<[4]x[16]xi8> {
	    %0 = arm_sme.cast_tile_to_vector %arg0 : i8 to vector<[4]x[16]xi8>
	    return %0 : vector<[4]x[16]xi8>
	  }
	}
	You will need to replace `SMETile` with `AnyVectorOfAnyRank` in the definition of `CastTileToVector`, but `IsVectorOfShape` should trigger in both cases, right?

c-rhodes added inline comments. Jul 17 2023, 1:30 AM

mlir/include/mlir/Dialect/ArmSME/IR/ArmSME.td

46–47

To clarify, without this check:
%tile = arm_sme.cast_tile_to_vector %tile_id : i32 to vector<[4]x[8]xi32>
would be valid
Could you double-check? This works fine:

module {
  func.func @arm_sme_cast_tile_to_vector_i8(%arg0: i8) -> vector<[4]x[16]xi8> {
    %0 = arm_sme.cast_tile_to_vector %arg0 : i8 to vector<[4]x[16]xi8>
    return %0 : vector<[4]x[16]xi8>
  }
}

this fails for me (as expected):

build/bin/mlir-opt foo.mlir
foo.mlir:4:8: error: 'arm_sme.cast_tile_to_vector' op result #0 must be vector<[16]x[16]xi8> of 8-bit signless integer values or vector<[8]x[8]xi16> of 16-bit signless integer values or vector<[4]x[4]xi32> of 32-bit signless integer values or vector<[2]x[2]xi64> of 64-bit signless integer values or vector<[1]x[1]xi128> of 128-bit signless integer values or vector<[8]x[8]xf16> of 16-bit float values or vector<[8]x[8]xbf16> of bfloat16 type values or vector<[4]x[4]xf32> of 32-bit float values or vector<[2]x[2]xf64> of 64-bit float values, but got 'vector<[4]x[16]xi8>'
  %0 = arm_sme.cast_tile_to_vector %tile_id : i8 to vector<[4]x[16]xi8>
       ^
foo.mlir:4:8: note: see current operation: %0 = "arm_sme.cast_tile_to_vector"(%arg0) : (i8) -> vector<[4]x[16]xi8>

and doesn't if I remove IsVectorOfShape<dims> check.

You will need to replace SMETile with AnyVectorOfAnyRank in the definition of CastTileToVector, but IsVectorOfShape should trigger in both cases, right?

I'm not sure I follow, please could you clarify?

awarzynski added inline comments. Jul 17 2023, 2:36 AM

mlir/include/mlir/Dialect/ArmSME/IR/ArmSME.td
46–47	You are right and I am wrong, sorry. I've just checked the generated CPP code and it's this:
	((::llvm::cast<::mlir::VectorType>(type).getShape() == ArrayRef<int64_t>({16, 16}))) // other similar checks
	So RHS is taken from:
	def SMETile : AnyTypeOf<[nxnxv16i8, nxnxv8i16, nxnxv4i32, nxnxv2i64, nxnxv1i128, nxnxv8f16, nxnxv8bf16, nxnxv4f32, nxnxv2f64]>;
	I thought that for this example (`vector<[4]x[8]xi32>`) it would check the following instead:
	((::llvm::cast<::mlir::VectorType>(type).getShape() == ArrayRef<int64_t>({4, 8})))
	i.e. take the RHS from the input (`vector<[4]x[8]xi32>`). Hence the confusion.
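
The point settled in this exchange is that the expected shape is baked into each tile type constraint and compared against the candidate type, rather than being read back from the candidate. A minimal C++ sketch of that behaviour follows. It is not the generated verifier: `TileConstraint`, `CandidateVector`, `satisfies`, and `isSMETileModel` are hypothetical names, and element types are modelled by bit width only, so integer and float tiles of the same width are merged.

```cpp
#include <cassert>
#include <cstdint>
#include <vector>

// Hypothetical model of one tile type constraint: the expected shape is fixed
// when the constraint is defined, e.g. {16, 16} for the 8-bit tile.
struct TileConstraint {
  std::vector<std::int64_t> expectedShape;
  unsigned elementBits;
};

// Hypothetical model of the type being verified.
struct CandidateVector {
  std::vector<std::int64_t> shape;
  std::vector<bool> scalable;
  unsigned elementBits;
};

// Rank 2, all dims scalable, matching element width, and shape equal to the
// constraint's own expected shape (not to the candidate's shape).
bool satisfies(const TileConstraint &c, const CandidateVector &v) {
  assert(v.shape.size() == v.scalable.size());
  const bool rank2 = v.shape.size() == 2;
  const bool allScalable = rank2 && v.scalable[0] && v.scalable[1];
  return allScalable && v.elementBits == c.elementBits &&
         v.shape == c.expectedShape;
}

// Mirrors the "any of the listed tile types" structure of the constraint list.
bool isSMETileModel(const CandidateVector &v) {
  static const std::vector<TileConstraint> tiles = {
      {{16, 16}, 8}, {{8, 8}, 16}, {{4, 4}, 32}, {{2, 2}, 64}, {{1, 1}, 128}};
  for (const TileConstraint &c : tiles)
    if (satisfies(c, v))
      return true;
  return false;
}
```

In this model `vector<[4]x[16]xi8>` and `vector<[4]x[8]xi32>` are both rejected because neither shape equals the shape stored in any constraint, which matches the error reported by `mlir-opt` earlier in the thread.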

c-rhodes added inline comments. Jul 17 2023, 2:56 AM

mlir/include/mlir/Dialect/ArmSME/IR/ArmSME.td
46–47	no worries, thanks for checking

I'll land this tomorrow unless there's any further comments by then.

This revision was landed with ongoing or failed builds. Jul 18 2023, 12:42 AM

Closed by commit rG6ff9761a69df: [mlir][ArmSME] Add custom get_tile_id and cast ops (authored by c-rhodes).

This revision was automatically updated to reflect the committed changes.

c-rhodes added a commit: rG6ff9761a69df: [mlir][ArmSME] Add custom get_tile_id and cast ops.

Revision Contents

Path	Size
mlir/include/mlir/Dialect/ArmSME/IR/ArmSME.td	161 lines
mlir/include/mlir/IR/OpBase.td	1 line
mlir/lib/Dialect/ArmSME/IR/ArmSME.cpp	11 lines
mlir/lib/Dialect/ArmSME/Transforms/LegalizeForLLVMExport.cpp	17 lines
mlir/test/Dialect/ArmSME/canonicalize.mlir	14 lines
mlir/test/Dialect/ArmSME/invalid.mlir	25 lines
mlir/test/Dialect/ArmSME/roundtrip.mlir	25 lines

Diff 539100

mlir/include/mlir/Dialect/ArmSME/IR/ArmSME.td

let description = [{
Sources:
https://developer.arm.com/documentation/ddi0616
https://developer.arm.com/documentation/ddi0602/2023-03/SME-Instructions
}];
let dependentDialects = ["scf::SCFDialect"];
}

//===----------------------------------------------------------------------===//
		// ArmSME type definitions
		//===----------------------------------------------------------------------===//

		class SMETileType<Type datatype, list<int> dims, string description>
		: ShapedContainerType<[datatype],
		And<[IsVectorOfRankPred<[2]>,
		CPred<[{::llvm::cast<::mlir::VectorType>($_self).allDimsScalable()}]>,
		CPred<"::llvm::cast<::mlir::VectorType>($_self).getShape() == ArrayRef<int64_t>({" # !interleave(dims, ", ") # "})">]>,
		description>;

		def nxnxv16i8 : SMETileType<I8, [16, 16], "vector<[16]x[16]xi8>">;
		def nxnxv8i16 : SMETileType<I16, [8, 8 ], "vector<[8]x[8]xi16>">;
		def nxnxv4i32 : SMETileType<I32, [4, 4 ], "vector<[4]x[4]xi32>">;
		def nxnxv2i64 : SMETileType<I64, [2, 2 ], "vector<[2]x[2]xi64>">;
		def nxnxv1i128 : SMETileType<I128, [1, 1 ], "vector<[1]x[1]xi128>">;

		def nxnxv8f16 : SMETileType<F16, [8, 8 ], "vector<[8]x[8]xf16>">;
		def nxnxv8bf16 : SMETileType<BF16, [8, 8 ], "vector<[8]x[8]xbf16>">;
		def nxnxv4f32 : SMETileType<F32, [4, 4 ], "vector<[4]x[4]xf32>">;
		def nxnxv2f64 : SMETileType<F64, [2, 2 ], "vector<[2]x[2]xf64>">;

		def SMETile : AnyTypeOf<[nxnxv16i8, nxnxv8i16, nxnxv4i32, nxnxv2i64, nxnxv1i128,
		nxnxv8f16, nxnxv8bf16, nxnxv4f32, nxnxv2f64]>;

		//===----------------------------------------------------------------------===//
		// ArmSME op definitions
		//===----------------------------------------------------------------------===//

		class ArmSME_Op<string mnemonic, list<Trait> traits = []> :
		Op<ArmSME_Dialect, mnemonic, traits> {}

		def CastTileToVector : ArmSME_Op<"cast_tile_to_vector", [Pure]> {
		let summary = "Cast from tile id to 2-d scalable vector type";
		let description = [{
		A `cast_tile_to_vector` operation does a cast from a tile id to a 2-d
		scalable vector type, which represents an SME "virtual tile". This is used
		in conjunction with `cast_vector_to_tile` to preserve dataflow and type
		legality when lowering vector ops that have both inputs and outputs, to SME
		intrinsics that have only inputs.

		Example:
		```mlir

		// input
		%tile = vector.load %mem1[%c0] : memref<?xi32>, vector<[4]x[4]xi32>
		vector.store %tile, %mem2[%c0] : memref<?xi32>, vector<[4]x[4]xi32>

		// lower vector.load -> SME intrinsics

		%tile_id = arm_sme.get_tile_id : i32
		scf.for %vnum = %c0 to %num_vectors step %c1 {
		// ...
		"arm_sme.intr.ld1w.horiz"(%pg, %ptr, %tile_id, %vnum) : (vector<[4]xi1>, !llvm.ptr, i32, i32) -> ()
		}
		%tile = arm_sme.cast_tile_to_vector %tile_id : i32 to vector<[4]x[4]xi32>
		vector.store %tile, %mem2[%c0] : memref<?xi32>, vector<[4]x[4]xi32>
		```

		In the example above, the `vector.load` can't be replaced with an SME
		intrinsic that has no outputs since it is used by the `vector.store`.
		However, by inserting a `cast_tile_to_vector` op after the load intrinsics
		the `vector.load` can be replaced. This enables "local" rewrites on
		individual vector ops, rather than "global" rewrites that would have to
		look at the vector op uses and also lower them.

		The opposite is true for the `vector.store`, when lowered to intrinsics
		they would be preceded by a `cast_vector_to_tile` op. Once the lowering is
		complete the canonicalizer will fold the casts away. The
		`cast_vector_to_tile` op example shows the other half of the lowering.

		These casts are expected to be folded, but may persist if there's an
		incomplete lowering where a vector op has been lowered to SME but the uses
		haven't, much like if `-reconcile-unrealized-casts` fails. Currently these
		cast ops cannot be lowered to LLVM, but may be in the future.
		awarzynski (Done): This comment refers to "these casts", but this is just one cast ;-)
		}];
		let arguments = (ins AnyTypeOf<[I8, I16, I32, I64, I128]>:$tile_id);
		let results = (outs SMETile:$vector);
		let assemblyFormat =
		"$tile_id attr-dict `:` type($tile_id) `to` type($vector)";
		dcaballe (Not Done): This abstraction sounds really great!
		}

		def CastVectorToTile : ArmSME_Op<"cast_vector_to_tile", [Pure]> {
		let summary = "Cast from 2-d scalable vector type to tile id";
		let description = [{
		A `cast_vector_to_tile` operation does a cast from a 2-d scalable vector
		type, which represents an SME "virtual tile", to a tile id. This is used in
		conjunction with `cast_tile_to_vector` to preserve dataflow and type
		legality when lowering vector ops that have both inputs and outputs, to SME
		intrinsics that have only inputs.

		Example:
		```mlir
		dcaballe (Not Done): I guess you also considered introducing a single cast op that could cast both ways depending on the order of the operand/types. I think having two makes sense since this cast is kind of crossing two domains...
		c-rhodes (Author, Not Done):
		> I guess you also considered introducing a single cast op that could cast both ways depending on the order of the operand/types. I think having two makes sense since this cast is kind of crossing two domains...
		That didn't cross my mind actually, it's a good point. I think of these casts as being similar to `builtin.unrealized_conversion_cast`, which does something similar but with a single cast op like you say, so perhaps this could be a single cast as well.

		// input
		%tile = vector.load %mem1[%c0] : memref<?xi32>, vector<[4]x[4]xi32>
		vector.store %tile, %mem2[%c0] : memref<?xi32>, vector<[4]x[4]xi32>

		// lower vector.load -> SME intrinsics
		%tile_id = arm_sme.get_tile_id : i32
		scf.for %vnum = %c0 to %num_vectors step %c1 {
		// ...
		"arm_sme.intr.ld1w.horiz"(%pg, %ptr, %tile_id, %vnum) : (vector<[4]xi1>, !llvm.ptr, i32, i32) -> ()
		}
		%tile = arm_sme.cast_tile_to_vector %tile_id : i32 to vector<[4]x[4]xi32>
		vector.store %tile, %mem2[%c0] : memref<?xi32>, vector<[4]x[4]xi32>

		// lower vector.store -> SME intrinsics
		%tile_id_0 = arm_sme.get_tile_id : i32
		scf.for %vnum = %c0 to %num_vectors step %c1 {
		// ...
		"arm_sme.intr.ld1w.horiz"(%pg, %ptr, %tile_id_0, %vnum) : (vector<[4]xi1>, !llvm.ptr, i32, i32) -> ()
		}
		%tile = arm_sme.cast_tile_to_vector %tile_id_0 : i32 to vector<[4]x[4]xi32>
		scf.for %vnum = %c0 to %num_vectors step %c1 {
		// ...
		%tile_id_1 = arm_sme.cast_vector_to_tile %tile : vector<[4]x[4]xi32> to i32
		"arm_sme.intr.st1w.horiz"(%pg, %ptr, %tile_id_1, %vnum) : (vector<[4]xi1>, !llvm.ptr, i32, i32) -> ()
		}

		// canonicalization will look through `cast_vector_to_tile` and fold the
		// cast ops away.
		```
		awarzynski (Done): This example is a bit busy. I would focus on the Op that's defined here (i.e. `CastVectorToTile`), so that this description is self-contained (try to avoid references to `CastTileToVector`). My suggestion:

		EXAMPLE:

		Input:
		```mlir
		%tile = vector.load %mem1[%c0] : memref<?xi32>, vector<[4]x[4]xi32>
		vector.store %tile, %mem2[%c0] : memref<?xi32>, vector<[4]x[4]xi32>
		```

		Output after lowering `vector.store`:
		```mlir
		%tile = vector.load %mem1[%c0] : memref<?xi32>, vector<[4]x[4]xi32>
		%tile_id = arm_sme.cast_vector_to_tile %tile : vector<[4]x[4]xi32> to i32
		scf.for %vnum = %c0 to %num_vectors step %c1 {
		// ...
		"arm_sme.intr.st1w.horiz"(%pg, %ptr, %tile_id, %vnum) : (vector<[4]xi1>, !llvm.ptr, i32, i32) -> ()
		}
		```

		Additionally, canonicalization will look through `cast_vector_to_tile` Ops and fold the cast ops away if they come from `cast_tile_to_vector`.
		}];
		let arguments = (ins SMETile:$vector);
		let results = (outs AnyTypeOf<[I8, I16, I32, I64, I128]>:$tile_id);
		let assemblyFormat =
		"$vector attr-dict `:` type($vector) `to` type($tile_id)";
		dcaballe (Not Done): nit: do the quotes imply anything on `virtual tiles`? I think I don't get what it is :)
		c-rhodes (Author, Done):
		> nit: do the quotes imply anything on `virtual tiles`? I think I don't get what it is :)
		To be honest I would prefer we just use `tile`, but the rationale is that these are not real tiles but merely “views” into ZA.
		awarzynski (Not Done): I've been suggesting "virtual tiles" as different people mean different things when referring to tiles. "SME virtual tiles" is just a way to highlight that:
		- We mean the tiles in the context of the Arm SME extension (as opposed to e.g. tiles when tiling a matmul).
		- These tiles are actually "views" into ZA rather than "tiles". A "tile" to me suggests that it's something "square" and so "ZA tile" could, incorrectly, imply "a square section of ZA".
		It's a name that one of our architects at Arm has been using and I feel that's very fitting. Naming is hard!
		c-rhodes (Author, Done): In the context of SME I think it's clear what a tile is, but I have no strong feelings either way.
		let hasCanonicalizeMethod = 1;
		}

		def GetTileID : ArmSME_Op<"get_tile_id", [Pure]> {
		let summary = "Returns an SME \"virtual tile\" id that is not in use";
		let description = [{
		A `get_tile_id` operation returns a scalar integer of given type
		representing a tile id of an SME "virtual tile" that is not in use.

		Example:
		```mlir
		// Allocate an 8-bit element ZA tile
		awarzynski (Done): [nit] Is there any "tile allocation" really taking place? Perhaps "Allocate and return a "virtual tile" ID"?
		c-rhodes (Author, Done):
		> [nit] Is there any "tile allocation" really taking place? Perhaps "Allocate and return a "virtual tile" ID"?
		There isn't from the perspective of the op I suppose, it's the pass that does that. Updated the comment.
		%za0_b = arm_sme.get_tile_id : i8
		```

		Example:
		```
		// Allocate two 16-bit element ZA tiles
		%za0_h = arm_sme.get_tile_id : i16
		%za1_h = arm_sme.get_tile_id : i16
		```

		WanderAway (Done): It may be useful to also indicate what is the intended live range of the allocated tile?
		c-rhodes (Author, Not Done):
		> It may be useful to also indicate what is the intended live range of the allocated tile?
		Sorry, could you clarify? Not sure exactly what you mean.
		Example:
		```
		// Allocate a 128-bit element ZA tile
		%za0_q = arm_sme.get_tile_id : i128
		```
		}];

		let results = (outs AnyTypeOf<[I8, I16, I32, I64, I128]>:$tile_id);
		let assemblyFormat = "attr-dict `:` type($tile_id)";
		}

//===----------------------------------------------------------------------===//
// ArmSME Intrinsic op definitions
//===----------------------------------------------------------------------===//

def MOPPredicate : ScalableVectorOfLengthAndType<[16, 8, 4, 2], [I1]>;
def MOPVector : ScalableVectorOfLengthAndType<[16, 8, 4, 2],
[I8, I16, BF16, F16, F32, F64]>;
def LDSTPredicate : ScalableVectorOfLengthAndType<[16, 8, 4, 2, 1], [I1]>;

[84 lines not shown]

mlir/include/mlir/IR/OpBase.td

[482 lines not shown]
class SignlessIntOfWidths<list<int> widths> :
AnyTypeOf<!foreach(w, widths, I<w>),
!interleave(widths, "/") # "-bit signless integer">;

def I1 : I<1>;
def I8 : I<8>;
def I16 : I<16>;
def I32 : I<32>;
def I64 : I<64>;
		def I128 : I<128>;

// Any signed integer type irrespective of its width.
def AnySignedInteger : Type<
CPred<"$_self.isSignedInteger()">, "signed integer">;

// Signed integer type of a specific width.
class SI<int width>
: Type<CPred<"$_self.isSignedInteger(" # width # ")">,
[2,243 lines not shown]

mlir/lib/Dialect/ArmSME/IR/ArmSME.cpp

	[28 lines not shown]
	#include "mlir/Dialect/ArmSME/IR/ArmSMETypes.cpp.inc"

	void ArmSMEDialect::initialize() {
	addOperations<
	#define GET_OP_LIST
	#include "mlir/Dialect/ArmSME/IR/ArmSME.cpp.inc"
	>();
	}

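				// cast_vector_to_tile(cast_tile_to_vector(tile_id)) -> tile_id
				LogicalResult CastVectorToTile::canonicalize(CastVectorToTile op,
				PatternRewriter &rewriter) {
				// getDefiningOp<T>() returns null for block arguments and for any other
				// producer, so no null Operation* is ever passed to dyn_cast.
				if (auto vectorOp = op.getVector().getDefiningOp<CastTileToVector>()) {
				// Replace through the rewriter so the pattern driver sees the change.
				rewriter.replaceOp(op, vectorOp.getTileId());
				return success();
				}
				return failure();
				}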
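
The fold implemented by the canonicalizer above can be illustrated outside MLIR. The following is a minimal sketch that uses a toy IR; the `Value`, `Op`, `Kind`, `foldCastVectorToTile` and `runFoldExample` names are hypothetical stand-ins and are not MLIR classes or part of this patch. It shows the one rewrite `cast_vector_to_tile(cast_tile_to_vector(tile_id)) -> tile_id`, and that a vector with no defining op (a block argument) is left alone.

```cpp
#include <cassert>

// Toy stand-ins for MLIR concepts (assumptions for illustration only).
enum class Kind { CastTileToVector, CastVectorToTile };

struct Op;

struct Value {
  Op *definingOp = nullptr; // nullptr models a block argument
};

struct Op {
  Kind kind;
  Value *operand; // both casts take exactly one operand
  Value result;   // and produce exactly one result
};

// Returns the value that should replace the result of `cast`, or nullptr
// when the fold does not apply.
Value *foldCastVectorToTile(const Op &cast) {
  if (cast.kind != Kind::CastVectorToTile)
    return nullptr;
  Op *producer = cast.operand->definingOp;
  if (!producer) // operand is a block argument: nothing to look through
    return nullptr;
  if (producer->kind != Kind::CastTileToVector)
    return nullptr;
  return producer->operand; // the original tile id
}

// Builds tile_id -> cast_tile_to_vector -> cast_vector_to_tile and checks
// that the second cast folds back to the original tile id.
bool runFoldExample() {
  Value tileId; // models a function argument
  Op toVector{Kind::CastTileToVector, &tileId, {}};
  toVector.result.definingOp = &toVector;
  Op toTile{Kind::CastVectorToTile, &toVector.result, {}};
  toTile.result.definingOp = &toTile;
  return foldCastVectorToTile(toTile) == &tileId;
}
```

The sketch only looks through one direction of the cast pair, which matches the canonicalizer in this diff; the reverse direction raised in review would need a separate check on `cast_tile_to_vector`.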

mlir/lib/Dialect/ArmSME/Transforms/LegalizeForLLVMExport.cpp

//===- LegalizeForLLVMExport.cpp - Prepare ArmSME for LLVM translation ----===//
//
// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
// See https://llvm.org/LICENSE.txt for license information.
// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
//
//===----------------------------------------------------------------------===//

#include "mlir/Conversion/LLVMCommon/ConversionTarget.h"
		#include "mlir/Conversion/LLVMCommon/Pattern.h"
#include "mlir/Dialect/ArmSME/IR/ArmSME.h"
#include "mlir/Dialect/ArmSME/Transforms/Transforms.h"
#include "mlir/Dialect/Func/IR/FuncOps.h"
		#include "mlir/Dialect/LLVMIR/LLVMDialect.h"
#include "mlir/Dialect/SCF/IR/SCF.h"

using namespace mlir;
using namespace mlir::arm_sme;

namespace {
/// Insert 'llvm.aarch64.sme.za.enable' intrinsic at the start of 'func.func'
/// ops to enable the ZA storage array.
[17 lines not shown]
LogicalResult matchAndRewrite(func::ReturnOp op,
PatternRewriter &rewriter) const final {
OpBuilder::InsertionGuard g(rewriter);
rewriter.setInsertionPoint(op);
rewriter.create<arm_sme::aarch64_sme_za_disable>(op->getLoc());
rewriter.updateRootInPlace(op, [] {});
return success();
}
};

		struct GetTileIDConversion : public ConvertOpToLLVMPattern<GetTileID> {
		using ConvertOpToLLVMPattern<GetTileID>::ConvertOpToLLVMPattern;
		LogicalResult
		matchAndRewrite(GetTileID op, OpAdaptor adaptor,
		ConversionPatternRewriter &rewriter) const override {
		// TODO: implement tile allocation, currently only tile 0 is supported.
		rewriter.replaceOpWithNewOp<LLVM::ConstantOp>(op, rewriter.getI32Type(), 0);
		return success();
		}
		};
} // namespace

void mlir::populateArmSMELegalizeForLLVMExportPatterns(
LLVMTypeConverter &converter, RewritePatternSet &patterns) {
patterns.add<EnableZAPattern, DisableZAPattern>(patterns.getContext());
}

void mlir::configureArmSMELegalizeForExportTarget(
LLVMConversionTarget &target) {
target.addLegalOp<scf::ForOp, scf::YieldOp, arm_sme::aarch64_sme_zero,		target.addLegalOp<scf::ForOp, scf::YieldOp, arm_sme::CastTileToVector,
		arm_sme::CastVectorToTile, arm_sme::aarch64_sme_zero,
arm_sme::aarch64_sme_str, arm_sme::aarch64_sme_za_enable,
arm_sme::aarch64_sme_za_disable>();
		target.addLegalOp<GetTileID>();

// Mark 'func.func' ops as legal if either:
// 1. no 'arm_za' function attribute is present.
// 2. the 'arm_za' function attribute is present and the first op in the
// function is an 'arm_sme::aarch64_sme_za_enable' intrinsic.
target.addDynamicallyLegalOp<func::FuncOp>([&](func::FuncOp funcOp) {
if (funcOp.isDeclaration())
return true;
[17 lines not shown]

mlir/test/Dialect/ArmSME/canonicalize.mlir

This file was added.

				// RUN: mlir-opt -canonicalize -split-input-file -verify-diagnostics %s | mlir-opt | FileCheck %s

				// -----

				// CHECK-LABEL: @canonicalize_casts
				// CHECK-SAME: %[[TILE_ID:.*]]: i8
				func.func @canonicalize_casts(%tile_id_0 : i8) -> i8 {
				// CHECK-NOT: arm_sme.cast_tile_to_vector
				// CHECK-NOT: arm_sme.cast_vector_to_tile
				%tile = arm_sme.cast_tile_to_vector %tile_id_0 : i8 to vector<[16]x[16]xi8>
				%tile_id_1 = arm_sme.cast_vector_to_tile %tile : vector<[16]x[16]xi8> to i8
				awarzynski (Done): What about "the other way round"?
				    %tile_id = arm_sme.cast_vector_to_tile %tile_1 : vector<[16]x[16]xi8> to i8
				    %tile_2 = arm_sme.cast_tile_to_vector %tile_id : i8 to vector<[16]x[16]xi8>
				c-rhodes (Author, Done):
				> What about "the other way round"?
				Good spot!
				// CHECK-NEXT: return %[[TILE_ID]] : i8
				return %tile_id_1 : i8
				}

mlir/test/Dialect/ArmSME/invalid.mlir

This file was added.

				// RUN: mlir-opt %s -split-input-file -verify-diagnostics

				// -----

				func.func @arm_sme_cast_tile_to_vector__bad_vector_type(%tile_id : i8) -> vector<[16]xi8> {
				awarzynski (Done): How about:
				    func.func @arm_sme_cast_tile_to_vector__bad_vector_type(%tile_id : i8) -> vector<[16]x16xi8>
				and other combinations? For example: `vector<[16]x[16]xi4>`, `vector<16x[16]xi8>`.
				// expected-error@+1 {{op result #0 must be vector<[16]x[16]xi8> of 8-bit signless integer values or vector<[8]x[8]xi16> of 16-bit signless integer values or vector<[4]x[4]xi32> of 32-bit signless integer values or vector<[2]x[2]xi64> of 64-bit signless integer values or vector<[1]x[1]xi128> of 128-bit signless integer values or vector<[8]x[8]xf16> of 16-bit float values or vector<[8]x[8]xbf16> of bfloat16 type values or vector<[4]x[4]xf32> of 32-bit float values or vector<[2]x[2]xf64> of 64-bit float values, but got 'vector<[16]xi8>'}}
				%0 = arm_sme.cast_tile_to_vector %tile_id : i8 to vector<[16]xi8>
				return %0 : vector<[16]xi8>
				}

				// -----

				func.func @arm_sme_cast_vector_to_tile__bad_rank_1d(%vector : vector<[16]xi8>) -> i8 {
				// expected-error@+1 {{op operand #0 must be vector<[16]x[16]xi8> of 8-bit signless integer values or vector<[8]x[8]xi16> of 16-bit signless integer values or vector<[4]x[4]xi32> of 32-bit signless integer values or vector<[2]x[2]xi64> of 64-bit signless integer values or vector<[1]x[1]xi128> of 128-bit signless integer values or vector<[8]x[8]xf16> of 16-bit float values or vector<[8]x[8]xbf16> of bfloat16 type values or vector<[4]x[4]xf32> of 32-bit float values or vector<[2]x[2]xf64> of 64-bit float values, but got 'vector<[16]xi8>'}}
				%0 = arm_sme.cast_vector_to_tile %vector : vector<[16]xi8> to i8
				return %0 : i8
				}

				// -----

				func.func @arm_sme_get_tile_id__bad_type() -> i1 {
				// expected-error@+1 {{op result #0 must be 8-bit signless integer or 16-bit signless integer or 32-bit signless integer or 64-bit signless integer or 128-bit signless integer}}
				%0 = arm_sme.get_tile_id : i1
				return %0 : i1
				}
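
The legal tile types listed in the diagnostics above follow a simple rule: for an element width of W bits, the tile type is `vector<[N]x[N]xT>` with N = 128 / W. The sketch below encodes that rule. `TileInfo` and `getTileInfo` are hypothetical names used only for illustration, not part of the dialect. The `numTiles` field is an addition based on the Arm SME architecture (one 8-bit tile, two 16-bit tiles, and so on up to sixteen 128-bit tiles) rather than on anything in this patch, which currently only hands out tile 0.

```cpp
#include <cassert>
#include <optional>

// Hypothetical helper type: geometry of an SME "virtual tile" for a given
// element width.
struct TileInfo {
  unsigned lanesPerDim; // N in vector<[N]x[N]xT>
  unsigned numTiles;    // number of ZA tiles with this element width
};

// Returns the tile geometry for the element widths accepted by the ops in
// this patch (8, 16, 32, 64, 128 bits), or std::nullopt for anything else.
std::optional<TileInfo> getTileInfo(unsigned elementBits) {
  switch (elementBits) {
  case 8:
  case 16:
  case 32:
  case 64:
  case 128:
    // 128-bit minimum vector granule: lanes shrink and tile count grows
    // as the element gets wider.
    return TileInfo{128 / elementBits, elementBits / 8};
  default:
    return std::nullopt;
  }
}
```

For example, `getTileInfo(32)` describes `vector<[4]x[4]xi32>` (or `f32`), and `getTileInfo(1)` is empty, which corresponds to the `i1` rejection tested in `@arm_sme_get_tile_id__bad_type`.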

mlir/test/Dialect/ArmSME/roundtrip.mlir

This file was added.

				// RUN: mlir-opt -split-input-file -verify-diagnostics %s | mlir-opt | FileCheck %s

				// -----

				func.func @arm_sme_cast_tile_to_vector(%tile_id : i8) -> vector<[16]x[16]xi8> {
				// CHECK: arm_sme.cast_tile_to_vector {{.*}} : i8 to vector<[16]x[16]xi8>
				%0 = arm_sme.cast_tile_to_vector %tile_id : i8 to vector<[16]x[16]xi8>
				return %0 : vector<[16]x[16]xi8>
				awarzynski (Not Done): Could you add one more other element type? For example, `vector<[1]x[1]xi128>` (i.e. the other extreme).
				c-rhodes (Author, Done):
				> Could you add one more other element type? For example, `vector<[1]x[1]xi128>` (i.e. the other extreme).
				I've added tests for all element types.
				}

				// -----

				func.func @arm_sme_cast_vector_to_tile(%vector : vector<[16]x[16]xi8>) -> i8 {
				// CHECK: arm_sme.cast_vector_to_tile {{.*}} : vector<[16]x[16]xi8> to i8
				%0 = arm_sme.cast_vector_to_tile %vector : vector<[16]x[16]xi8> to i8
				return %0 : i8
				}

				// -----

				func.func @arm_sme_get_tile_id() -> i32 {
				// CHECK: arm_sme.get_tile_id : i32
				%0 = arm_sme.get_tile_id : i32
				return %0 : i32
				}

Revision Contents

Diff 539100