This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
mlir/
-
include/mlir/
-
mlir/
-
Dialect/
-
ArmSVE/
-
ArmSVE.td
-
ArmSVEDialect.h
-
ArmSVEOpBase.td
-
LLVMIR/
-
LLVMOps.td
3/3
LLVMTypes.h
-
Vector/
9/9
VectorOps.td
-
IR/
2/2
BuiltinTypes.h
11/11
BuiltinTypes.td
1/1
OpBase.td
-
lib/
-
Conversion/
-
LLVMCommon/
-
TypeConverter.cpp
-
VectorToLLVM/
2/2
ConvertVectorToLLVM.cpp
-
Dialect/
-
Arithmetic/IR/
-
IR/
-
ArithmeticOps.cpp
-
ArmSVE/
-
IR/
1/1
ArmSVEDialect.cpp
-
Transforms/
-
LegalizeForLLVMExport.cpp
-
LLVMIR/IR/
-
IR/
1/1
LLVMDialect.cpp
4/4
LLVMTypes.cpp
-
StandardOps/IR/
-
IR/
-
Ops.cpp
-
IR/
2/2
AsmPrinter.cpp
-
BuiltinAttributes.cpp
6/6
BuiltinTypes.cpp
-
Parser/
-
Parser.h
3/3
TypeParser.cpp
-
Target/LLVMIR/
-
LLVMIR/
-
ModuleTranslation.cpp
1/1
TypeToLLVM.cpp
-
test/
-
Dialect/
-
Arithmetic/
-
ops.mlir
-
ArmSVE/
2/2
legalize-for-llvm.mlir
-
memcpy.mlir
-
roundtrip.mlir
-
Builtin/
-
invalid.mlir
-
ops.mlir
-
Vector/
-
ops.mlir
-
vector-scalable-memcpy.mlir
-
Target/LLVMIR/
-
LLVMIR/
-
arm-sve.mlir

Differential D111819

[mlir][RFC] Add scalable dimensions to VectorType
ClosedPublic

Authored by jsetoain on Oct 14 2021, 9:51 AM.

Download Raw Diff

Details

Reviewers

rriddle
antiagainst
aartbik
ftynse
nicolasvasilache
ThomasRaoux
dcaballe
springerm

Commits

rGa4830d14edbb: [mlir][RFC] Add scalable dimensions to VectorType

Summary

With VectorType supporting scalable dimensions, we don't need many of
the operations currently present in ArmSVE, like mask generation and
basic arithmetic instructions. Therefore, this patch also gets
rid of those.

Having built-in scalable vector support also simplifies the lowering of
scalable vector dialects down to LLVMIR.

Scalable dimensions are indicated with the scalable dimensions
between square brackets:

        vector<[4]xf32>

Is a scalable vector of 4 single precission floating point elements.

More generally, a VectorType can have a set of fixed-length dimensions
followed by a set of scalable dimensions:

        vector<2x[4x4]xf32>

Is a vector with 2 scalable 4x4 vectors of single precission floating
point elements.

The scale of the scalable dimensions can be obtained with the Vector
operation:

        %vs = vector.vscale

This change is being discussed in the discourse RFC:

https://llvm.discourse.group/t/rfc-add-built-in-support-for-scalable-vector-types/4484

Differential Revision: https://reviews.llvm.org/D111819

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

jsetoain created this revision.Oct 14 2021, 9:51 AM

Herald added a reviewer: rriddle. · View Herald TranscriptOct 14 2021, 9:51 AM

Herald added a reviewer: antiagainst. · View Herald Transcript

Herald added a reviewer: aartbik. · View Herald Transcript

Herald added a reviewer: ftynse. · View Herald Transcript

Herald added subscribers: wenzhicui, wrengr, Chia-hungDuan and 21 others. · View Herald Transcript

Harbormaster completed remote builds in B128890: Diff 379753.Oct 14 2021, 10:08 AM

jsetoain published this revision for review.Oct 14 2021, 10:44 AM

jsetoain retitled this revision from [mlir] Make scalable vector type a built-in type to [mlir][RFC] Make scalable vector type a built-in type.

jsetoain edited the summary of this revision. (Show Details)

Herald added a project: Restricted Project. · View Herald TranscriptOct 14 2021, 10:44 AM

Herald added subscribers: stephenneuendorffer, nicolasvasilache. · View Herald Transcript

Matt added a subscriber: Matt.Oct 15 2021, 3:14 PM

meshtag added a subscriber: meshtag.Oct 19 2021, 4:28 AM

jsetoain added a reviewer: nicolasvasilache.Oct 21 2021, 8:12 AM

This direction makes a lot of sense to me, if we want to avoid code dup between the upcoming vector specific dialects (SVE and RISC-V at the moment).
Since this touches "core", however, I hope others chime in too.

mlir/include/mlir/Dialect/LLVMIR/LLVMTypes.h
486	period at end
487	would IsScalableVectorType be a bit more consistent with naming? (in sentence you would put scalable at the end, but since we use "ScalableVectorType" as typename this seems a bit better)
mlir/include/mlir/Dialect/StandardOps/IR/Ops.td
1123 ↗	(On Diff #379753)	compile-time
1125 ↗	(On Diff #379753)	comma, no : Also, is this not better an e.g.? Or just: For example, ....
mlir/include/mlir/IR/BuiltinTypes.td
905	Perhaps you can add some text calling out that < > is fixed length and << >> is scalable? Just because it is a new syntax that we have to get used to ;-)
921	period at end
924	period at end
mlir/include/mlir/IR/OpBase.td
663	I wanted to say period at end, but I see that is not really the style in this file

rriddle requested changes to this revision.Oct 21 2021, 1:02 PM

rriddle added inline comments.

mlir/include/mlir/Dialect/StandardOps/IR/Ops.td
1118–1120 ↗	(On Diff #379753)	Why would this be here and not in say, the vector dialect?
mlir/lib/Dialect/LLVMIR/IR/LLVMTypes.cpp
781–782	Drop else after return.
795–797
814–815	Drop else after return.
mlir/lib/IR/BuiltinTypes.cpp
298	Use `cast` if you aren't checking the result, `dyn_cast>` can return null.
321	Same here.
344	And here, and others.
mlir/lib/Target/LLVMIR/TypeToLLVM.cpp
147–148	Drop else after return.

This revision now requires changes to proceed.Oct 21 2021, 1:02 PM

jsetoain added inline comments.Oct 21 2021, 1:42 PM

mlir/include/mlir/Dialect/StandardOps/IR/Ops.td
1118–1120 ↗	(On Diff #379753)	Indeed, my first instinct was to put it in Vector, but since this value is tightly coupled with the type itself, which is _not_ part of Vector, it feels a bit out of place there as well. I believe that the Vector type not being part of the Vector dialect is what creates the situation. As things are, if there's a place to put runtime constants, that's where this should go. Alternatively, if there's a way to express runtime properties of a type, say: %0 = vector<<>>.scale : index That also looks somewhat right (if a bit ugly). In any case, Standard or Vector, neither place looks better than the other to me, if you see clearly that this makes more sense in Vector, I can move it there and everything else works the same. This is something I actually wanted feedback about, thanks :-)

rriddle added inline comments.Oct 21 2021, 1:43 PM

mlir/include/mlir/Dialect/StandardOps/IR/Ops.td
1118–1120 ↗	(On Diff #379753)	I don't think it fits well in standard though, given that the standard dialect is going away. It seems like a better home should be found for this.

jsetoain added inline comments.Oct 21 2021, 2:44 PM

mlir/include/mlir/Dialect/StandardOps/IR/Ops.td
1118–1120 ↗	(On Diff #379753)	Agree. I'll move it to Vector as a non-entirely-terrible option and, should a better place become apparent, we can reconsider in the future. It's not intertwined with anything else, it'd be a quite innocuous change. Thanks!

This direction makes a lot of sense to me, if we want to avoid code dup between the upcoming vector specific dialects (SVE and RISC-V at the moment).
Since this touches "core", however, I hope others chime in too.

My main concern is that the discussion on Discourse does not seem to have a conclusion right now.

In D111819#3081297, @mehdi_amini wrote:

This direction makes a lot of sense to me, if we want to avoid code dup between the upcoming vector specific dialects (SVE and RISC-V at the moment).
Since this touches "core", however, I hope others chime in too.

My main concern is that the discussion on Discourse does not seem to have a conclusion right now.

My apologies. We did have a conversation this week but I didn't update the RFC (I have now). Although some details are still in the air, the need for a built-in type is not in question. This is just the first, most basic implementation possible.

Address reviewers' comments

I've addressed all the comments.

jsetoain edited the summary of this revision. (Show Details)Nov 2 2021, 4:41 AM

Harbormaster completed remote builds in B131940: Diff 384043.Nov 2 2021, 5:05 AM

zhanghb97 added a subscriber: zhanghb97.Nov 2 2021, 7:29 PM

Fixed formatting

Harbormaster completed remote builds in B132257: Diff 384491.Nov 3 2021, 10:08 AM

Rebase on main

Harbormaster completed remote builds in B133450: Diff 386113.Nov 10 2021, 3:57 AM

Rebase on main

Herald added a subscriber: sdasgup3. · View Herald TranscriptNov 23 2021, 3:35 AM

Harbormaster completed remote builds in B135594: Diff 389149.Nov 23 2021, 7:13 AM

Rebase on main

Harbormaster completed remote builds in B136257: Diff 390086.Nov 26 2021, 9:51 AM

rriddle added inline comments.Nov 30 2021, 4:12 PM

mlir/include/mlir/Dialect/LLVMIR/LLVMTypes.h
487	The `get` here is a bit weird, why not something like `isScalableVectorType`?
mlir/include/mlir/Dialect/Vector/VectorOps.td
2404–2405	You could also drop the trailing type if you want, it can be inferred. (i.e. `= "attr-dict"`)
mlir/include/mlir/IR/BuiltinTypes.td
941–942	Is this wrapped at 80 characters?
mlir/lib/Dialect/ArmSVE/IR/ArmSVEDialect.cpp
51
mlir/lib/Dialect/LLVMIR/IR/LLVMDialect.cpp
161	Why the extra spaces?
mlir/lib/Dialect/LLVMIR/IR/LLVMTypes.cpp
778–784
mlir/lib/IR/BuiltinTypes.cpp
296–299
319–320	Same here.
342–343	and here.
mlir/lib/Parser/TypeParser.cpp
456–458

Address review comments

jsetoain edited the summary of this revision. (Show Details)Dec 1 2021, 2:45 AM

What's the next step on this? Seems like the RFC discussion got to a resolution right?

@nicolasvasilache @ftynse : can you chime in on the change in mlir/include/mlir/IR/BuiltinTypes.td ?

Harbormaster completed remote builds in B136880: Diff 390963.Dec 1 2021, 3:02 AM

My apologies for the long delay, my biggest problem atm is I do not have a good mental model of how RVV and Arm SVE operate in detail.
I had started to read specs but it invariably gets pushed back on the stack as it is not high on my priority list ..

From a pure cleanup perspective, I generally like it.

From a composability perspective, I think I would prefer to have it spelled out as vector<<4>xf32> or vector<4*xf32> or vector<(4s)xf32>.
The rationale is that I think we still want to have n-D scalable vector types in MLIR to allow expressing a statically known number of 1-D scalable vectors that serves as an "unroll-and-jammed vector pack" vector<8x4*xf32>.
This would be more consistent with the design of the rest of the vector dialect.

One thing that is higher priority to me personally these days is that we are also exploring using the vector dialect as a programming model for GPUs.
In this context, vector<4x8x16*x32*xf32> would also make sense for us.

Bottom line, if we avoided anchoring on the current LLVM / HW implementation that only support 1-D scalable vectors and we made it future-proof in that direction, I am fine with proceeding.

Generalize the concept of scalable dimensions to support use cases unrelated to scalable vectors

Herald added a subscriber: jdoerfert. · View Herald TranscriptDec 3 2021, 3:38 PM

jsetoain retitled this revision from [mlir][RFC] Make scalable vector type a built-in type to [mlir][RFC] Add scalable dimensions to VectorType.Dec 3 2021, 3:39 PM

jsetoain edited the summary of this revision. (Show Details)

mlir/include/mlir/Dialect/Vector/VectorOps.td
2399	I've seen this, I'll take care of it together with any other necessary fix.
2404–2405	It feels a bit "naked", but it might be because I'm used to see it with the return type attached. We can give it a go and see what people think, if people don't care, going "concise" is my preferred option. Is there a "good practices" manual for dialect syntax? I can't find one.
mlir/include/mlir/IR/BuiltinTypes.td
907	And this...
941–942	Not sure what happened there. Good catch, thanks!
mlir/lib/Parser/TypeParser.cpp
456–458	Arg! That was embarrassing... Sorry about that!

Harbormaster completed remote builds in B137455: Diff 391762.Dec 3 2021, 3:57 PM

Mostly LGTM, I added 3 areas of improvements.
Once these are addressed I'll happily accept.
Thanks for your hard work and patience!

mlir/include/mlir/Dialect/Vector/VectorOps.td
2390	Should we call this `vector.vscale` ?
2394	I would emphasize that this is for 1-D scalable vectors and that there is currently no way to extract the scale for a >1-D scalable vectors. This instruction may be extended in the future to take a position but I am unclear whether this is what we want atm. I think the global vs local property of vscale should also be discussed here. I'd maybe even go as far as spelling it `vector.scale.global` in the future? Edit: as I read deeper through the PR, I am now unclear whether `vector<[2x8]xf32>` is the same as `vector<[2]x[8]xf32>` ? I think `vector<[2x8]xf32>` would make sense for SVE in MLIR (and would then get flattened to 1-D going through LLVM). In the future we may also want `vector<[2]x[8]xf32>` for GPUs but this is not the same representation? Is this what you have in mind ? In any case, please propose a few wording changes to integrate the relevant parts of my comments and disregard/add a TODO for the others :)
mlir/include/mlir/IR/BuiltinTypes.h
323	I fear this will prove annoying to use in practice .. Could we go with `unsigned numScalableDims`? Then you can just use APIS such as ArrayRef's `shape.take_back(numScalableDims);` and friends.
342	this would get nicer with `numScalableDims`.
mlir/include/mlir/IR/BuiltinTypes.td
928	Now that I read this, I am unclear whether `vector<[2x8]xf32>` is the same as `vector<[2]x[8]xf32>`, I would think not and the latter form could be a future extension (if so, add a TODO)? This really depends on whether you think you can make use of `vector<[2x8]xf32>` in MLIR instead of having to represent as `vector<[16]xf32>`; I claim you would have a bunch of nice use cases for this (coupled with the shape_cast op once properly extended).
mlir/test/Dialect/ArmSVE/legalize-for-llvm.mlir
1	I would expect to see a test file (somwhere in the builtin stuff) where you have both: negative tests for various failure modes of misuses of scalable vectors (with appropriate error messages) positive tests with multi-dim multi-scale vector (atm everything I see is 0-dim 1-scale only). In a followup PR, I'd love to see a 1-dim, 2-scale version of the neon 2d dot (or something equivalent) and see it lower to unrolled LLVM.

Also signal boosting for @ThomasRaoux @dcaballe @springerm ; no need to review but be aware that this is coming.

rriddle added inline comments.Dec 6 2021, 12:35 AM

mlir/include/mlir/IR/BuiltinTypes.td
942	Can we use Optional here instead? -1 is a bit magic.
960–961	Why not just isScalable? The naming here is a bit weird.
mlir/lib/IR/AsmPrinter.cpp
1966	Please cache the end iterator to avoid recomputing it every iteration.
mlir/lib/Parser/TypeParser.cpp
522	Looks like this is missing test coverage.

jsetoain added inline comments.Dec 6 2021, 4:11 AM

mlir/include/mlir/IR/BuiltinTypes.td
928	It is, indeed, very much not the same. I find it useful to think about something like [2x8] as a series of 2x8 blocks, one after another. Therefore, even though they would have the same memory requirements, [2x8], [8x2], [4x4], and [16] can represent different data arrangements when you're loading your data from memory. From that point of view, [2]x[8] can't be the same as [2x8] even if the scale for both dimensions is the same. In fact, I don't think something like [2]x[8] makes sense in the context of scalable vectors. For GPU thread blocks, the situation is different. I'm not involved with that work so I can't come up with anything on the spot, but I intuit it could have potentially useful cases. As this work progresses, I suspect we will need to come back to it.

Addressed latest round of reviews

I've also moved a bunch of tests around. Scalable vector tests that are not SVE-specific have been moved either to Arithmetic or Vector, depending on their nature.

mlir/include/mlir/Dialect/Vector/VectorOps.td
2394	That's not exactly right. You can have a 2D scalable vector, and vscale represents its multiplicity, but you can't have a 2D scalable vector with two different scales (which we might want to have for GPUs). As it is, we can't represent those yet, so I don't think we need to clarify that in the description. I've added the multi-dimensional multi-scale vector and local/global scale to a TODO for future reference.
mlir/include/mlir/IR/BuiltinTypes.td
942	Not sure how to use Optional for Types, this is the only way I found to provide a default value in a type builder. In any case, I've changed it to "numScalableDims" as suggested by Nicolas. It makes code a bit less awkward and conveniently replaces a arguably ugly "first dimension = -1" to a more semantically sensible "number of dimensions = 0". If you still find this unacceptable, I can look into adding an "Optional" equivalent for types.
mlir/test/Dialect/ArmSVE/legalize-for-llvm.mlir
1	RE follow-up PR, that was in my low priority TODO list, I'll move it to the main TODO, it should be a quick and easy change.

jsetoain edited the summary of this revision. (Show Details)Dec 10 2021, 10:45 AM

Harbormaster completed remote builds in B138702: Diff 393535.Dec 10 2021, 11:10 AM

LG from my point-of-view, but also get an LGTM from Nicolas as well.

mlir/include/mlir/Dialect/Vector/VectorOps.td
2390	Can you move this into the documentation of the op? This seems useful to expose in the user facing docs.
2404–2405	I don't think we have a "good practices" manual, though that sounds useful.
2415	If a verifier isn't necessary, you can just ignore it.
mlir/lib/Conversion/VectorToLLVM/ConvertVectorToLLVM.cpp
31	nit: Prefer pre-increment unless you need post increment behavior.
41	Same here.
mlir/lib/IR/AsmPrinter.cpp
1966	Unresolved.

This revision is now accepted and ready to land.Dec 10 2021, 7:27 PM

Address reviewer comments

Addressed comments.

Thanks for getting these through @jsetoain !

Harbormaster completed remote builds in B139020: Diff 393961.Dec 13 2021, 11:28 AM

Rebase on main

Harbormaster completed remote builds in B139194: Diff 394198.Dec 14 2021, 3:41 AM

Closed by commit rGa4830d14edbb: [mlir][RFC] Add scalable dimensions to VectorType (authored by jsetoain). · Explain WhyDec 15 2021, 1:37 AM

This revision was automatically updated to reflect the committed changes.

jsetoain added a commit: rGa4830d14edbb: [mlir][RFC] Add scalable dimensions to VectorType.

Revision Contents

Path

Size

mlir/

include/

mlir/

Dialect/

ArmSVE/

336 lines

3 lines

LLVMIR/

4 lines

12 lines

Vector/

VectorOps.td

32 lines

IR/

BuiltinTypes.h

20 lines

BuiltinTypes.td

46 lines

OpBase.td

72 lines

lib/

Conversion/

LLVMCommon/

TypeConverter.cpp

3 lines

VectorToLLVM/

ConvertVectorToLLVM.cpp

18 lines

Dialect/

Arithmetic/

IR/

ArithmeticOps.cpp

3 lines

ArmSVE/

IR/

ArmSVEDialect.cpp

59 lines

Transforms/

LegalizeForLLVMExport.cpp

189 lines

LLVMIR/

IR/

LLVMDialect.cpp

12 lines

LLVMTypes.cpp

44 lines

StandardOps/

IR/

Ops.cpp

3 lines

IR/

AsmPrinter.cpp

15 lines

BuiltinAttributes.cpp

5 lines

BuiltinTypes.cpp

19 lines

Parser/

Parser.h

3 lines

TypeParser.cpp

108 lines

Target/

LLVMIR/

ModuleTranslation.cpp

10 lines

TypeToLLVM.cpp

3 lines

test/

Dialect/

Arithmetic/

ops.mlir

220 lines

ArmSVE/

legalize-for-llvm.mlir

234 lines

memcpy.mlir

roundtrip.mlir

183 lines

Builtin/

invalid.mlir

8 lines

ops.mlir

16 lines

Vector/

ops.mlir

26 lines

vector-scalable-memcpy.mlir

27 lines

Target/

LLVMIR/

arm-sve.mlir

266 lines

Diff 394488

mlir/include/mlir/Dialect/ArmSVE/ArmSVE.td

Show All 10 Lines
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#ifndef ARMSVE_OPS		#ifndef ARMSVE_OPS
#define ARMSVE_OPS		#define ARMSVE_OPS

include "mlir/Interfaces/SideEffectInterfaces.td"		include "mlir/Interfaces/SideEffectInterfaces.td"
include "mlir/Dialect/LLVMIR/LLVMOpBase.td"		include "mlir/Dialect/LLVMIR/LLVMOpBase.td"
include "mlir/Dialect/Arithmetic/IR/ArithmeticBase.td"		include "mlir/Dialect/Arithmetic/IR/ArithmeticBase.td"
include "mlir/Dialect/ArmSVE/ArmSVEOpBase.td"

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// ArmSVE dialect definition		// ArmSVE dialect definition
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

def ArmSVE_Dialect : Dialect {		def ArmSVE_Dialect : Dialect {
let name = "arm_sve";		let name = "arm_sve";
let cppNamespace = "::mlir::arm_sve";		let cppNamespace = "::mlir::arm_sve";
let summary = "Basic dialect to target Arm SVE architectures";		let summary = "Basic dialect to target Arm SVE architectures";
let description = [{		let description = [{
This dialect contains the definitions necessary to target Arm SVE scalable		This dialect contains the definitions necessary to target specific Arm SVE
vector operations, including a scalable vector type and intrinsics for		scalable vector operations.
some Arm SVE instructions.
}];		}];
let useDefaultTypePrinterParser = 1;
}		}

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// ArmSVE type definitions
//===----------------------------------------------------------------------===//

def ArmSVE_ScalableVectorType : DialectType<ArmSVE_Dialect,
CPred<"$_self.isa<ScalableVectorType>()">,
"scalable vector type">,
BuildableType<"$_builder.getType<ScalableVectorType>()"> {
let description = [{
`arm_sve.vector` represents vectors that will be processed by a scalable
vector architecture.
}];
}

class ArmSVE_Type<string name> : TypeDef<ArmSVE_Dialect, name> { }

def ScalableVectorType : ArmSVE_Type<"ScalableVector"> {
let mnemonic = "vector";

let summary = "Scalable vector type";

let description = [{
A type representing scalable length SIMD vectors. Unlike fixed-length SIMD
vectors, whose size is constant and known at compile time, scalable
vectors' length is constant but determined by the specific hardware at
run time.
}];

let parameters = (ins
ArrayRefParameter<"int64_t", "Vector shape">:$shape,
"Type":$elementType
);

let extraClassDeclaration = [{
bool hasStaticShape() const {
return llvm::none_of(getShape(), ShapedType::isDynamic);
}
int64_t getNumElements() const {
assert(hasStaticShape() &&
"cannot get element count of dynamic shaped type");
ArrayRef<int64_t> shape = getShape();
int64_t num = 1;
for (auto dim : shape)
num *= dim;
return num;
}
}];
}

//===----------------------------------------------------------------------===//
// Additional LLVM type constraints
//===----------------------------------------------------------------------===//
def LLVMScalableVectorType :
Type<CPred<"$_self.isa<::mlir::LLVM::LLVMScalableVectorType>()">,
"LLVM dialect scalable vector type">;

//===----------------------------------------------------------------------===//
// ArmSVE op definitions		// ArmSVE op definitions
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

class ArmSVE_Op<string mnemonic, list<OpTrait> traits = []> :		class ArmSVE_Op<string mnemonic, list<OpTrait> traits = []> :
Op<ArmSVE_Dialect, mnemonic, traits> {}		Op<ArmSVE_Dialect, mnemonic, traits> {}

class ArmSVE_NonSVEIntrUnaryOverloadedOp<string mnemonic,
list<OpTrait> traits =[]> :
LLVM_IntrOpBase</Dialect dialect=/ArmSVE_Dialect,
/string opName=/mnemonic,
/string enumName=/mnemonic,
/list<int> overloadedResults=/[0],
/list<int> overloadedOperands=/[], // defined by result overload
/list<OpTrait> traits=/traits,
/int numResults=/1>;

class ArmSVE_IntrBinaryOverloadedOp<string mnemonic,		class ArmSVE_IntrBinaryOverloadedOp<string mnemonic,
list<OpTrait> traits = []> :		list<OpTrait> traits = []> :
LLVM_IntrOpBase</Dialect dialect=/ArmSVE_Dialect,		LLVM_IntrOpBase</Dialect dialect=/ArmSVE_Dialect,
/string opName=/"intr." # mnemonic,		/string opName=/"intr." # mnemonic,
/string enumName=/"aarch64_sve_" # !subst(".", "_", mnemonic),		/string enumName=/"aarch64_sve_" # !subst(".", "_", mnemonic),
/list<int> overloadedResults=/[0],		/list<int> overloadedResults=/[0],
/list<int> overloadedOperands=/[], // defined by result overload		/list<int> overloadedOperands=/[], // defined by result overload
/list<OpTrait> traits=/traits,		/list<OpTrait> traits=/traits,
/int numResults=/1>;		/int numResults=/1>;

class ScalableFOp<string mnemonic, string op_description,
list<OpTrait> traits = []> :
ArmSVE_Op<mnemonic, !listconcat(traits,
[AllTypesMatch<["src1", "src2", "dst"]>])> {
let summary = op_description # " for scalable vectors of floats";
let description = [{
The `arm_sve.}] # mnemonic # [{` operations takes two scalable vectors and
returns one scalable vector with the result of the }] # op_description # [{.
}];
let arguments = (ins
ScalableVectorOf<[AnyFloat]>:$src1,
ScalableVectorOf<[AnyFloat]>:$src2
);
let results = (outs ScalableVectorOf<[AnyFloat]>:$dst);
let assemblyFormat =
"$src1 `,` $src2 attr-dict `:` type($src1)";
}

class ScalableIOp<string mnemonic, string op_description,
list<OpTrait> traits = []> :
ArmSVE_Op<mnemonic, !listconcat(traits,
[AllTypesMatch<["src1", "src2", "dst"]>])> {
let summary = op_description # " for scalable vectors of integers";
let description = [{
The `arm_sve.}] # mnemonic # [{` operation takes two scalable vectors and
returns one scalable vector with the result of the }] # op_description # [{.
}];
let arguments = (ins
ScalableVectorOf<[I8, I16, I32, I64]>:$src1,
ScalableVectorOf<[I8, I16, I32, I64]>:$src2
);
let results = (outs ScalableVectorOf<[I8, I16, I32, I64]>:$dst);
let assemblyFormat =
"$src1 `,` $src2 attr-dict `:` type($src1)";
}

class ScalableMaskedFOp<string mnemonic, string op_description,		class ScalableMaskedFOp<string mnemonic, string op_description,
list<OpTrait> traits = []> :		list<OpTrait> traits = []> :
ArmSVE_Op<mnemonic, !listconcat(traits,		ArmSVE_Op<mnemonic, !listconcat(traits,
[AllTypesMatch<["src1", "src2", "res"]>,		[AllTypesMatch<["src1", "src2", "res"]>,
TypesMatchWith<		TypesMatchWith<
"mask has i1 element type and same shape as operands",		"mask has i1 element type and same shape as operands",
"src1", "mask", "getI1SameShape($_self)">])> {		"src1", "mask", "getI1SameShape($_self)">])> {
let summary = "masked " # op_description # " for scalable vectors of floats";		let summary = "masked " # op_description # " for scalable vectors of floats";
▲ Show 20 Lines • Show All 156 Lines • ▼ Show 20 Lines	let arguments = (ins
ScalableVectorOfLengthAndType<[16], [I8]>:$src1,		ScalableVectorOfLengthAndType<[16], [I8]>:$src1,
ScalableVectorOfLengthAndType<[16], [I8]>:$src2		ScalableVectorOfLengthAndType<[16], [I8]>:$src2
);		);
let results = (outs ScalableVectorOfLengthAndType<[4], [I32]>:$dst);		let results = (outs ScalableVectorOfLengthAndType<[4], [I32]>:$dst);
let assemblyFormat =		let assemblyFormat =
"$acc `,` $src1 `,` $src2 attr-dict `:` type($src1) `to` type($dst)";		"$acc `,` $src1 `,` $src2 attr-dict `:` type($src1) `to` type($dst)";
}		}

def VectorScaleOp : ArmSVE_Op<"vector_scale",
[NoSideEffect]> {
let summary = "Load vector scale size";
let description = [{
The vector_scale op returns the scale of the scalable vectors, a positive
integer value that is constant at runtime but unknown at compile time.
The scale of the vector indicates the multiplicity of the vectors and
vector operations. I.e.: an !arm_sve.vector<4xi32> is equivalent to
vector_scale consecutive vector<4xi32>; and an operation on an
!arm_sve.vector<4xi32> is equivalent to performing that operation vector_scale
times, once on each <4xi32> segment of the scalable vector. The vector_scale
op can be used to calculate the step in vector-length agnostic (VLA) loops.
}];
let results = (outs Index:$res);
let assemblyFormat =
"attr-dict `:` type($res)";
}

def ScalableLoadOp : ArmSVE_Op<"load">,
Arguments<(ins Arg<AnyMemRef, "", [MemRead]>:$base, Index:$index)>,
Results<(outs ScalableVectorOf<[AnyType]>:$result)> {
let summary = "Load scalable vector from memory";
let description = [{
Load a slice of memory into a scalable vector.
}];
let extraClassDeclaration = [{
MemRefType getMemRefType() {
return base().getType().cast<MemRefType>();
}
}];
let assemblyFormat = "$base `[` $index `]` attr-dict `:` "
"type($result) `from` type($base)";
}

def ScalableStoreOp : ArmSVE_Op<"store">,
Arguments<(ins Arg<AnyMemRef, "", [MemWrite]>:$base, Index:$index,
ScalableVectorOf<[AnyType]>:$value)> {
let summary = "Store scalable vector into memory";
let description = [{
Store a scalable vector on a slice of memory.
}];
let extraClassDeclaration = [{
MemRefType getMemRefType() {
return base().getType().cast<MemRefType>();
}
}];
let assemblyFormat = "$value `,` $base `[` $index `]` attr-dict `:` "
"type($value) `to` type($base)";
}

def ScalableAddIOp : ScalableIOp<"addi", "addition", [Commutative]>;

def ScalableAddFOp : ScalableFOp<"addf", "addition", [Commutative]>;

def ScalableSubIOp : ScalableIOp<"subi", "subtraction">;

def ScalableSubFOp : ScalableFOp<"subf", "subtraction">;

def ScalableMulIOp : ScalableIOp<"muli", "multiplication", [Commutative]>;

def ScalableMulFOp : ScalableFOp<"mulf", "multiplication", [Commutative]>;

def ScalableSDivIOp : ScalableIOp<"divi_signed", "signed division">;

def ScalableUDivIOp : ScalableIOp<"divi_unsigned", "unsigned division">;

def ScalableDivFOp : ScalableFOp<"divf", "division">;

def ScalableMaskedAddIOp : ScalableMaskedIOp<"masked.addi", "addition",		def ScalableMaskedAddIOp : ScalableMaskedIOp<"masked.addi", "addition",
[Commutative]>;		[Commutative]>;

def ScalableMaskedAddFOp : ScalableMaskedFOp<"masked.addf", "addition",		def ScalableMaskedAddFOp : ScalableMaskedFOp<"masked.addf", "addition",
[Commutative]>;		[Commutative]>;

def ScalableMaskedSubIOp : ScalableMaskedIOp<"masked.subi", "subtraction">;		def ScalableMaskedSubIOp : ScalableMaskedIOp<"masked.subi", "subtraction">;

def ScalableMaskedSubFOp : ScalableMaskedFOp<"masked.subf", "subtraction">;		def ScalableMaskedSubFOp : ScalableMaskedFOp<"masked.subf", "subtraction">;

def ScalableMaskedMulIOp : ScalableMaskedIOp<"masked.muli", "multiplication",		def ScalableMaskedMulIOp : ScalableMaskedIOp<"masked.muli", "multiplication",
[Commutative]>;		[Commutative]>;

def ScalableMaskedMulFOp : ScalableMaskedFOp<"masked.mulf", "multiplication",		def ScalableMaskedMulFOp : ScalableMaskedFOp<"masked.mulf", "multiplication",
[Commutative]>;		[Commutative]>;

def ScalableMaskedSDivIOp : ScalableMaskedIOp<"masked.divi_signed",		def ScalableMaskedSDivIOp : ScalableMaskedIOp<"masked.divi_signed",
"signed division">;		"signed division">;

def ScalableMaskedUDivIOp : ScalableMaskedIOp<"masked.divi_unsigned",		def ScalableMaskedUDivIOp : ScalableMaskedIOp<"masked.divi_unsigned",
"unsigned division">;		"unsigned division">;

def ScalableMaskedDivFOp : ScalableMaskedFOp<"masked.divf", "division">;		def ScalableMaskedDivFOp : ScalableMaskedFOp<"masked.divf", "division">;

//===----------------------------------------------------------------------===//
// ScalableCmpFOp
//===----------------------------------------------------------------------===//

def ScalableCmpFOp : ArmSVE_Op<"cmpf", [NoSideEffect, SameTypeOperands,
TypesMatchWith<"result type has i1 element type and same shape as operands",
"lhs", "result", "getI1SameShape($_self)">]> {
let summary = "floating-point comparison operation for scalable vectors";
let description = [{
The `arm_sve.cmpf` operation compares two scalable vectors of floating point
elements according to the float comparison rules and the predicate specified
by the respective attribute. The predicate defines the type of comparison:
(un)orderedness, (in)equality and signed less/greater than (or equal to) as
well as predicates that are always true or false. The result is a scalable
vector of i1 elements. Unlike `arm_sve.cmpi`, the operands are always
treated as signed. The u prefix indicates unordered comparison, not
unsigned comparison, so "une" means unordered not equal. For the sake of
readability by humans, custom assembly form for the operation uses a
string-typed attribute for the predicate. The value of this attribute
corresponds to lower-cased name of the predicate constant, e.g., "one" means
"ordered not equal". The string representation of the attribute is merely a
syntactic sugar and is converted to an integer attribute by the parser.

Example:

```mlir
%r = arm_sve.cmpf oeq, %0, %1 : !arm_sve.vector<4xf32>
```
}];
let arguments = (ins
Arith_CmpFPredicateAttr:$predicate,
ScalableVectorOf<[AnyFloat]>:$lhs,
ScalableVectorOf<[AnyFloat]>:$rhs // TODO: This should support a simple scalar
);
let results = (outs ScalableVectorOf<[I1]>:$result);

let builders = [
OpBuilder<(ins "arith::CmpFPredicate":$predicate, "Value":$lhs,
"Value":$rhs), [{
buildScalableCmpFOp($_builder, $_state, predicate, lhs, rhs);
}]>];

let extraClassDeclaration = [{
static StringRef getPredicateAttrName() { return "predicate"; }
static arith::CmpFPredicate getPredicateByName(StringRef name);

arith::CmpFPredicate getPredicate() {
return (arith::CmpFPredicate) (*this)->getAttrOfType<IntegerAttr>(
getPredicateAttrName()).getInt();
}
}];

let verifier = [{ return success(); }];

let assemblyFormat = "$predicate `,` $lhs `,` $rhs attr-dict `:` type($lhs)";
}

//===----------------------------------------------------------------------===//
// ScalableCmpIOp
//===----------------------------------------------------------------------===//

def ScalableCmpIOp : ArmSVE_Op<"cmpi", [NoSideEffect, SameTypeOperands,
TypesMatchWith<"result type has i1 element type and same shape as operands",
"lhs", "result", "getI1SameShape($_self)">]> {
let summary = "integer comparison operation for scalable vectors";
let description = [{
The `arm_sve.cmpi` operation compares two scalable vectors of integer
elements according to the predicate specified by the respective attribute.

The predicate defines the type of comparison:

- equal (mnemonic: `"eq"`; integer value: `0`)
- not equal (mnemonic: `"ne"`; integer value: `1`)
- signed less than (mnemonic: `"slt"`; integer value: `2`)
- signed less than or equal (mnemonic: `"sle"`; integer value: `3`)
- signed greater than (mnemonic: `"sgt"`; integer value: `4`)
- signed greater than or equal (mnemonic: `"sge"`; integer value: `5`)
- unsigned less than (mnemonic: `"ult"`; integer value: `6`)
- unsigned less than or equal (mnemonic: `"ule"`; integer value: `7`)
- unsigned greater than (mnemonic: `"ugt"`; integer value: `8`)
- unsigned greater than or equal (mnemonic: `"uge"`; integer value: `9`)

Example:

```mlir
%r = arm_sve.cmpi uge, %0, %1 : !arm_sve.vector<4xi32>
```
}];

let arguments = (ins
Arith_CmpIPredicateAttr:$predicate,
ScalableVectorOf<[I8, I16, I32, I64]>:$lhs,
ScalableVectorOf<[I8, I16, I32, I64]>:$rhs
);
let results = (outs ScalableVectorOf<[I1]>:$result);

let builders = [
OpBuilder<(ins "arith::CmpIPredicate":$predicate, "Value":$lhs,
"Value":$rhs), [{
buildScalableCmpIOp($_builder, $_state, predicate, lhs, rhs);
}]>];

let extraClassDeclaration = [{
static StringRef getPredicateAttrName() { return "predicate"; }
static arith::CmpIPredicate getPredicateByName(StringRef name);

arith::CmpIPredicate getPredicate() {
return (arith::CmpIPredicate) (*this)->getAttrOfType<IntegerAttr>(
getPredicateAttrName()).getInt();
}
}];

let verifier = [{ return success(); }];

let assemblyFormat = "$predicate `,` $lhs `,` $rhs attr-dict `:` type($lhs)";
}

def UmmlaIntrOp :		def UmmlaIntrOp :
ArmSVE_IntrBinaryOverloadedOp<"ummla">,		ArmSVE_IntrBinaryOverloadedOp<"ummla">,
Arguments<(ins LLVMScalableVectorType, LLVMScalableVectorType,		Arguments<(ins AnyScalableVector, AnyScalableVector, AnyScalableVector)>;
LLVMScalableVectorType)>;

def SmmlaIntrOp :		def SmmlaIntrOp :
ArmSVE_IntrBinaryOverloadedOp<"smmla">,		ArmSVE_IntrBinaryOverloadedOp<"smmla">,
Arguments<(ins LLVMScalableVectorType, LLVMScalableVectorType,		Arguments<(ins AnyScalableVector, AnyScalableVector, AnyScalableVector)>;
LLVMScalableVectorType)>;

def SdotIntrOp :		def SdotIntrOp :
ArmSVE_IntrBinaryOverloadedOp<"sdot">,		ArmSVE_IntrBinaryOverloadedOp<"sdot">,
Arguments<(ins LLVMScalableVectorType, LLVMScalableVectorType,		Arguments<(ins AnyScalableVector, AnyScalableVector, AnyScalableVector)>;
LLVMScalableVectorType)>;

def UdotIntrOp :		def UdotIntrOp :
ArmSVE_IntrBinaryOverloadedOp<"udot">,		ArmSVE_IntrBinaryOverloadedOp<"udot">,
Arguments<(ins LLVMScalableVectorType, LLVMScalableVectorType,		Arguments<(ins AnyScalableVector, AnyScalableVector, AnyScalableVector)>;
LLVMScalableVectorType)>;

def ScalableMaskedAddIIntrOp :		def ScalableMaskedAddIIntrOp :
ArmSVE_IntrBinaryOverloadedOp<"add">,		ArmSVE_IntrBinaryOverloadedOp<"add">,
Arguments<(ins LLVMScalableVectorType, LLVMScalableVectorType,		Arguments<(ins AnyScalableVector, AnyScalableVector, AnyScalableVector)>;
LLVMScalableVectorType)>;

def ScalableMaskedAddFIntrOp :		def ScalableMaskedAddFIntrOp :
ArmSVE_IntrBinaryOverloadedOp<"fadd">,		ArmSVE_IntrBinaryOverloadedOp<"fadd">,
Arguments<(ins LLVMScalableVectorType, LLVMScalableVectorType,		Arguments<(ins AnyScalableVector, AnyScalableVector, AnyScalableVector)>;
LLVMScalableVectorType)>;

def ScalableMaskedMulIIntrOp :		def ScalableMaskedMulIIntrOp :
ArmSVE_IntrBinaryOverloadedOp<"mul">,		ArmSVE_IntrBinaryOverloadedOp<"mul">,
Arguments<(ins LLVMScalableVectorType, LLVMScalableVectorType,		Arguments<(ins AnyScalableVector, AnyScalableVector, AnyScalableVector)>;
LLVMScalableVectorType)>;

def ScalableMaskedMulFIntrOp :		def ScalableMaskedMulFIntrOp :
ArmSVE_IntrBinaryOverloadedOp<"fmul">,		ArmSVE_IntrBinaryOverloadedOp<"fmul">,
Arguments<(ins LLVMScalableVectorType, LLVMScalableVectorType,		Arguments<(ins AnyScalableVector, AnyScalableVector, AnyScalableVector)>;
LLVMScalableVectorType)>;

def ScalableMaskedSubIIntrOp :		def ScalableMaskedSubIIntrOp :
ArmSVE_IntrBinaryOverloadedOp<"sub">,		ArmSVE_IntrBinaryOverloadedOp<"sub">,
Arguments<(ins LLVMScalableVectorType, LLVMScalableVectorType,		Arguments<(ins AnyScalableVector, AnyScalableVector, AnyScalableVector)>;
LLVMScalableVectorType)>;

def ScalableMaskedSubFIntrOp :		def ScalableMaskedSubFIntrOp :
ArmSVE_IntrBinaryOverloadedOp<"fsub">,		ArmSVE_IntrBinaryOverloadedOp<"fsub">,
Arguments<(ins LLVMScalableVectorType, LLVMScalableVectorType,		Arguments<(ins AnyScalableVector, AnyScalableVector, AnyScalableVector)>;
LLVMScalableVectorType)>;

def ScalableMaskedSDivIIntrOp :		def ScalableMaskedSDivIIntrOp :
ArmSVE_IntrBinaryOverloadedOp<"sdiv">,		ArmSVE_IntrBinaryOverloadedOp<"sdiv">,
Arguments<(ins LLVMScalableVectorType, LLVMScalableVectorType,		Arguments<(ins AnyScalableVector, AnyScalableVector, AnyScalableVector)>;
LLVMScalableVectorType)>;

def ScalableMaskedUDivIIntrOp :		def ScalableMaskedUDivIIntrOp :
ArmSVE_IntrBinaryOverloadedOp<"udiv">,		ArmSVE_IntrBinaryOverloadedOp<"udiv">,
Arguments<(ins LLVMScalableVectorType, LLVMScalableVectorType,		Arguments<(ins AnyScalableVector, AnyScalableVector, AnyScalableVector)>;
LLVMScalableVectorType)>;

def ScalableMaskedDivFIntrOp :		def ScalableMaskedDivFIntrOp :
ArmSVE_IntrBinaryOverloadedOp<"fdiv">,		ArmSVE_IntrBinaryOverloadedOp<"fdiv">,
Arguments<(ins LLVMScalableVectorType, LLVMScalableVectorType,		Arguments<(ins AnyScalableVector, AnyScalableVector, AnyScalableVector)>;
LLVMScalableVectorType)>;

def VectorScaleIntrOp:
ArmSVE_NonSVEIntrUnaryOverloadedOp<"vscale">;

#endif // ARMSVE_OPS		#endif // ARMSVE_OPS

mlir/include/mlir/Dialect/ArmSVE/ArmSVEDialect.h

	Show All 15 Lines
	#include "mlir/IR/BuiltinTypes.h"			#include "mlir/IR/BuiltinTypes.h"
	#include "mlir/IR/Dialect.h"			#include "mlir/IR/Dialect.h"
	#include "mlir/IR/OpDefinition.h"			#include "mlir/IR/OpDefinition.h"
	#include "mlir/Interfaces/SideEffectInterfaces.h"			#include "mlir/Interfaces/SideEffectInterfaces.h"

	#include "mlir/Dialect/ArmSVE/ArmSVEDialect.h.inc"			#include "mlir/Dialect/ArmSVE/ArmSVEDialect.h.inc"
	#include "mlir/Dialect/StandardOps/IR/Ops.h"			#include "mlir/Dialect/StandardOps/IR/Ops.h"

	#define GET_TYPEDEF_CLASSES
	#include "mlir/Dialect/ArmSVE/ArmSVETypes.h.inc"

	#define GET_OP_CLASSES			#define GET_OP_CLASSES
	#include "mlir/Dialect/ArmSVE/ArmSVE.h.inc"			#include "mlir/Dialect/ArmSVE/ArmSVE.h.inc"

	#endif // MLIR_DIALECT_ARMSVE_ARMSVEDIALECT_H			#endif // MLIR_DIALECT_ARMSVE_ARMSVEDIALECT_H

mlir/include/mlir/Dialect/ArmSVE/ArmSVEOpBase.td

This file was deleted.

	//===-- ArmSVEOpBase.td - Base op definitions for ArmSVE ---- tablegen --===//
	//
	// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
	// See https://llvm.org/LICENSE.txt for license information.
	// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
	//
	//===----------------------------------------------------------------------===//
	//
	// This is the base operation definition file for ArmSVE scalable vector types.
	//
	//===----------------------------------------------------------------------===//

	#ifndef ARMSVE_OP_BASE
	#define ARMSVE_OP_BASE

	//===----------------------------------------------------------------------===//
	// ArmSVE scalable vector type constraints
	//===----------------------------------------------------------------------===//

	def IsScalableVectorTypePred :
	CPred<"$_self.isa<::mlir::arm_sve::ScalableVectorType>()">;

	class ScalableVectorOf<list<Type> allowedTypes> :
	ContainerType<AnyTypeOf<allowedTypes>, IsScalableVectorTypePred,
	"$_self.cast<::mlir::arm_sve::ScalableVectorType>().getElementType()",
	"scalable vector">;

	// Whether the number of elements of a scalable vector is from the given
	// `allowedLengths` list
	class IsScalableVectorOfLengthPred<list<int> allowedLengths> :
	And<[IsScalableVectorTypePred,
	Or<!foreach(allowedlength, allowedLengths, CPred<
	[{$_self.cast<::mlir::arm_sve::ScalableVectorType>().getNumElements() == }]
	# allowedlength>)>]>;

	// Any scalable vector where the number of elements is from the given
	// `allowedLengths` list
	class ScalableVectorOfLength<list<int> allowedLengths> : Type<
	IsScalableVectorOfLengthPred<allowedLengths>,
	" of length " # !interleave(allowedLengths, "/"),
	"::mlir::arm_sve::ScalableVectorType">;

	// Any scalable vector where the number of elements is from the given
	// `allowedLengths` list and the type is from the given `allowedTypes` list
	class ScalableVectorOfLengthAndType<list<int> allowedLengths,
	list<Type> allowedTypes> : Type<
	And<[ScalableVectorOf<allowedTypes>.predicate,
	ScalableVectorOfLength<allowedLengths>.predicate]>,
	ScalableVectorOf<allowedTypes>.summary #
	ScalableVectorOfLength<allowedLengths>.summary,
	"::mlir::arm_sve::ScalableVectorType">;

	#endif // ARMSVE_OP_BASE

mlir/include/mlir/Dialect/LLVMIR/LLVMOps.td

	Show First 20 Lines • Show All 1,728 Lines • ▼ Show 20 Lines
	}			}

	/// Create a call to Masked Compress Store intrinsic.			/// Create a call to Masked Compress Store intrinsic.
	def LLVM_masked_compressstore			def LLVM_masked_compressstore
	: LLVM_IntrOp<"masked.compressstore", [], [0], [], 0> {			: LLVM_IntrOp<"masked.compressstore", [], [0], [], 0> {
	let arguments = (ins LLVM_Type, LLVM_Type, LLVM_Type);			let arguments = (ins LLVM_Type, LLVM_Type, LLVM_Type);
	}			}

	//			/// Create a call to vscale intrinsic.
				def LLVM_vscale : LLVM_IntrOp<"vscale", [0], [], [], 1>;

	// Atomic operations.			// Atomic operations.
	//			//

	def AtomicBinOpXchg : I64EnumAttrCase<"xchg", 0>;			def AtomicBinOpXchg : I64EnumAttrCase<"xchg", 0>;
	def AtomicBinOpAdd : I64EnumAttrCase<"add", 1>;			def AtomicBinOpAdd : I64EnumAttrCase<"add", 1>;
	def AtomicBinOpSub : I64EnumAttrCase<"sub", 2>;			def AtomicBinOpSub : I64EnumAttrCase<"sub", 2>;
	def AtomicBinOpAnd : I64EnumAttrCase<"_and", 3>;			def AtomicBinOpAnd : I64EnumAttrCase<"_and", 3>;
	def AtomicBinOpNand : I64EnumAttrCase<"nand", 4>;			def AtomicBinOpNand : I64EnumAttrCase<"nand", 4>;
	▲ Show 20 Lines • Show All 149 Lines • Show Last 20 Lines

mlir/include/mlir/Dialect/LLVMIR/LLVMTypes.h

	Show First 20 Lines • Show All 477 Lines • ▼ Show 20 Lines

	/// Returns the element type of any vector type compatible with the LLVM			/// Returns the element type of any vector type compatible with the LLVM
	/// dialect.			/// dialect.
	Type getVectorElementType(Type type);			Type getVectorElementType(Type type);

	/// Returns the element count of any LLVM-compatible vector type.			/// Returns the element count of any LLVM-compatible vector type.
	llvm::ElementCount getVectorNumElements(Type type);			llvm::ElementCount getVectorNumElements(Type type);

				/// Returns whether a vector type is scalable or not.
				aartbikUnsubmitted Done Reply Inline Actions period at end aartbik: period at end
				bool isScalableVectorType(Type vectorType);
				aartbikUnsubmitted Done Reply Inline Actions would IsScalableVectorType be a bit more consistent with naming? (in sentence you would put scalable at the end, but since we use "ScalableVectorType" as typename this seems a bit better) aartbik: would IsScalableVectorType be a bit more consistent with naming? (in sentence you would put…
				rriddleUnsubmitted Done Reply Inline Actions The `get` here is a bit weird, why not something like `isScalableVectorType`? rriddle: The `get` here is a bit weird, why not something like `isScalableVectorType`?

				/// Creates an LLVM dialect-compatible vector type with the given element type
				/// and length.
				Type getVectorType(Type elementType, unsigned numElements,
				bool isScalable = false);

	/// Creates an LLVM dialect-compatible type with the given element type and			/// Creates an LLVM dialect-compatible type with the given element type and
	/// length.			/// length.
	Type getFixedVectorType(Type elementType, unsigned numElements);			Type getFixedVectorType(Type elementType, unsigned numElements);

				/// Creates an LLVM dialect-compatible type with the given element type and
				/// length.
				Type getScalableVectorType(Type elementType, unsigned numElements);

	/// Returns the size of the given primitive LLVM dialect-compatible type			/// Returns the size of the given primitive LLVM dialect-compatible type
	/// (including vectors) in bits, for example, the size of i16 is 16 and			/// (including vectors) in bits, for example, the size of i16 is 16 and
	/// the size of vector<4xi16> is 64. Returns 0 for non-primitive			/// the size of vector<4xi16> is 64. Returns 0 for non-primitive
	/// (aggregates such as struct) or types that don't have a size (such as void).			/// (aggregates such as struct) or types that don't have a size (such as void).
	llvm::TypeSize getPrimitiveTypeSizeInBits(Type type);			llvm::TypeSize getPrimitiveTypeSizeInBits(Type type);

	} // namespace LLVM			} // namespace LLVM
	} // namespace mlir			} // namespace mlir

	#endif // MLIR_DIALECT_LLVMIR_LLVMTYPES_H_			#endif // MLIR_DIALECT_LLVMIR_LLVMTYPES_H_

mlir/include/mlir/Dialect/Vector/VectorOps.td

Show First 20 Lines • Show All 2,377 Lines • ▼ Show 20 Lines let description = [{

%1 = vector.flat_transpose %0 { rows = 4: i32, columns = 4: i32 } %1 = vector.flat_transpose %0 { rows = 4: i32, columns = 4: i32 }

: (vector<16xf32>) -> vector<16xf32> : (vector<16xf32>) -> vector<16xf32>

``` ```

}]; }];

let verifier = ?; let verifier = ?;

let assemblyFormat = "$matrix attr-dict `:` type($matrix) `->` type($res)"; let assemblyFormat = "$matrix attr-dict `:` type($matrix) `->` type($res)";

} }

//===----------------------------------------------------------------------===//

// VectorScaleOp

//===----------------------------------------------------------------------===//

// TODO: In the future, we might want to have scalable vectors with different

nicolasvasilacheUnsubmitted

Done

Should we call this vector.vscale ?

nicolasvasilache: Should we call this `vector.vscale` ?

rriddleUnsubmitted

Done

Can you move this into the documentation of the op? This seems useful to expose in the user facing docs.

rriddle: Can you move this into the documentation of the op? This seems useful to expose in the user…

// scales for different dimensions. E.g.: vector<[16]x[16]xf32>, in

// which case we might need to add an index to 'vscale' to select one

// of them. In order to support GPUs, we might also want to differentiate

// between a 'global' scale, a scale that's fixed throughout the

nicolasvasilacheUnsubmitted

Done

I would emphasize that this is for 1-D scalable vectors and that there is currently no way to extract the scale
for a >1-D scalable vectors.
This instruction may be extended in the future to take a position but I am unclear whether this is what we want atm.

I think the global vs local property of vscale should also be discussed here.

I'd maybe even go as far as spelling it vector.scale.global in the future?

Edit: as I read deeper through the PR, I am now unclear whether vector<[2x8]xf32> is the same as vector<[2]x[8]xf32> ?

I think vector<[2x8]xf32> would make sense for SVE in MLIR (and would then get flattened to 1-D going through LLVM).
In the future we may also want vector<[2]x[8]xf32> for GPUs but this is not the same representation?
Is this what you have in mind ?

In any case, please propose a few wording changes to integrate the relevant parts of my comments and disregard/add a TODO for the others :)

nicolasvasilache: I would emphasize that this is for 1-D scalable vectors and that there is currently no way to…

jsetoainAuthorUnsubmitted

Done

That's not exactly right. You can have a 2D scalable vector, and vscale represents its multiplicity, but you can't have a 2D scalable vector with two different scales (which we might want to have for GPUs). As it is, we can't represent those yet, so I don't think we need to clarify that in the description. I've added the multi-dimensional multi-scale vector and local/global scale to a TODO for future reference.

jsetoain: That's not exactly right. You can have a 2D scalable vector, and vscale represents its…

// execution, and a 'local' scale that is fixed but might vary with each

// call to the function. For that, it might be useful to have a

// 'vector.scale.global' and a 'vector.scale.local' operation.

def VectorScaleOp : Vector_Op<"vscale",

[NoSideEffect]> {

jsetoainAuthorUnsubmitted

Done

vector_scale consecutive vector<4xi32>; and an operation on a

- vector<<4xi32>> is equivalent to performing that operation vector_scale

+ vector<[4]xi32> is equivalent to performing that operation vector_scale

times, once on each <4xi32> segment of the scalable vector. The vector_scale

I've seen this, I'll take care of it together with any other necessary fix.

jsetoain: I've seen this, I'll take care of it together with any other necessary fix.

let summary = "Load vector scale size";

let description = [{

The `vscale` op returns the scale of the scalable vectors, a positive

integer value that is constant at runtime but unknown at compile-time.

The scale of the vector indicates the multiplicity of the vectors and

vector operations. For example, a `vector<[4]xi32>` is equivalent to

rriddleUnsubmitted

Done

let results = (outs Index:$res);

- let assemblyFormat =

- "attr-dict `:` type($res)";

+ let assemblyFormat = "attr-dict `:` type($res)";

let verifier = [{ return success(); }];

You could also drop the trailing type if you want, it can be inferred. (i.e. = "attr-dict")

rriddle: You could also drop the trailing type if you want, it can be inferred. (i.e. `= "attr-dict"`)

jsetoainAuthorUnsubmitted

Done

It feels a bit "naked", but it might be because I'm used to see it with the return type attached. We can give it a go and see what people think, if people don't care, going "concise" is my preferred option. Is there a "good practices" manual for dialect syntax? I can't find one.

jsetoain: It feels a bit "naked", but it might be because I'm used to see it with the return type…

rriddleUnsubmitted

Done

I don't think we have a "good practices" manual, though that sounds useful.

rriddle: I don't think we have a "good practices" manual, though that sounds useful.

`vscale` consecutive `vector<4xi32>`; and an operation on a

`vector<[4]xi32>` is equivalent to performing that operation `vscale`

times, once on each `<4xi32>` segment of the scalable vector. The `vscale`

op can be used to calculate the step in vector-length agnostic (VLA) loops.

Right now we only support one contiguous set of scalable dimensions, all of

them grouped and scaled with the value returned by 'vscale'.

}];

let results = (outs Index:$res);

let assemblyFormat = "attr-dict";

let verifier = ?;

rriddleUnsubmitted

Done

let assemblyFormat = "attr-dict";

- let verifier = [{ return success(); }];

+ let verifier = ?;

}

#endif // VECTOR_OPS

If a verifier isn't necessary, you can just ignore it.

rriddle: If a verifier isn't necessary, you can just ignore it.

}

#endif // VECTOR_OPS #endif // VECTOR_OPS

mlir/include/mlir/IR/BuiltinTypes.h

	Show First 20 Lines • Show All 309 Lines • ▼ Show 20 Lines
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	/// This is a builder type that keeps local references to arguments. Arguments			/// This is a builder type that keeps local references to arguments. Arguments
	/// that are passed into the builder must outlive the builder.			/// that are passed into the builder must outlive the builder.
	class VectorType::Builder {			class VectorType::Builder {
	public:			public:
	/// Build from another VectorType.			/// Build from another VectorType.
	explicit Builder(VectorType other)			explicit Builder(VectorType other)
	: shape(other.getShape()), elementType(other.getElementType()) {}			: shape(other.getShape()), elementType(other.getElementType()),
				numScalableDims(other.getNumScalableDims()) {}

	/// Build from scratch.			/// Build from scratch.
	Builder(ArrayRef<int64_t> shape, Type elementType)			Builder(ArrayRef<int64_t> shape, Type elementType,
	: shape(shape), elementType(elementType) {}			unsigned numScalableDims = 0)
				nicolasvasilacheUnsubmitted Done Reply Inline Actions I fear this will prove annoying to use in practice .. Could we go with `unsigned numScalableDims`? Then you can just use APIS such as ArrayRef's `shape.take_back(numScalableDims);` and friends. nicolasvasilache: I fear this will prove annoying to use in practice .. Could we go with `unsigned…
				: shape(shape), elementType(elementType),
	Builder &setShape(ArrayRef<int64_t> newShape) {			numScalableDims(numScalableDims) {}

				Builder &setShape(ArrayRef<int64_t> newShape,
				unsigned newNumScalableDims = 0) {
				numScalableDims = newNumScalableDims;
	shape = newShape;			shape = newShape;
	return *this;			return *this;
	}			}

	Builder &setElementType(Type newElementType) {			Builder &setElementType(Type newElementType) {
	elementType = newElementType;			elementType = newElementType;
	return *this;			return *this;
	}			}

	/// Erase a dim from shape @pos.			/// Erase a dim from shape @pos.
	Builder &dropDim(unsigned pos) {			Builder &dropDim(unsigned pos) {
	assert(pos < shape.size() && "overflow");			assert(pos < shape.size() && "overflow");
				if (pos >= shape.size() - numScalableDims)
				nicolasvasilacheUnsubmitted Done Reply Inline Actions this would get nicer with `numScalableDims`. nicolasvasilache: this would get nicer with `numScalableDims`.
				numScalableDims--;
	if (storage.empty())			if (storage.empty())
	storage.append(shape.begin(), shape.end());			storage.append(shape.begin(), shape.end());
	storage.erase(storage.begin() + pos);			storage.erase(storage.begin() + pos);
	shape = {storage.data(), storage.size()};			shape = {storage.data(), storage.size()};
	return *this;			return *this;
	}			}

	/// In the particular case where the vector has a single dimension that we			/// In the particular case where the vector has a single dimension that we
	/// drop, return the scalar element type.			/// drop, return the scalar element type.
	// TODO: unify once we have a VectorType that supports 0-D.			// TODO: unify once we have a VectorType that supports 0-D.
	operator Type() {			operator Type() {
	if (shape.empty())			if (shape.empty())
	return elementType;			return elementType;
	return VectorType::get(shape, elementType);			return VectorType::get(shape, elementType, numScalableDims);
	}			}

	private:			private:
	ArrayRef<int64_t> shape;			ArrayRef<int64_t> shape;
	// Owning shape data for copy-on-write operations.			// Owning shape data for copy-on-write operations.
	SmallVector<int64_t> storage;			SmallVector<int64_t> storage;
	Type elementType;			Type elementType;
				unsigned numScalableDims;
	};			};

	/// Given an `originalShape` and a `reducedShape` assumed to be a subset of			/// Given an `originalShape` and a `reducedShape` assumed to be a subset of
	/// `originalShape` with some `1` entries erased, return the set of indices			/// `originalShape` with some `1` entries erased, return the set of indices
	/// that specifies which of the entries of `originalShape` are dropped to obtain			/// that specifies which of the entries of `originalShape` are dropped to obtain
	/// `reducedShape`. The returned mask can be applied as a projection to			/// `reducedShape`. The returned mask can be applied as a projection to
	/// `originalShape` to obtain the `reducedShape`. This mask is useful to track			/// `originalShape` to obtain the `reducedShape`. This mask is useful to track
	/// which dimensions must be kept when e.g. compute MemRef strides under			/// which dimensions must be kept when e.g. compute MemRef strides under
	▲ Show 20 Lines • Show All 176 Lines • Show Last 20 Lines

mlir/include/mlir/IR/BuiltinTypes.td

Show First 20 Lines • Show All 886 Lines • ▼ Show 20 Lines

def Builtin_Vector : Builtin_Type<"Vector", [ def Builtin_Vector : Builtin_Type<"Vector", [

DeclareTypeInterfaceMethods<SubElementTypeInterface> DeclareTypeInterfaceMethods<SubElementTypeInterface>

], "ShapedType"> { ], "ShapedType"> {

let summary = "Multi-dimensional SIMD vector type"; let summary = "Multi-dimensional SIMD vector type";

let description = [{ let description = [{

Syntax: Syntax:

``` ```

vector-type ::= `vector` `<` static-dimension-list vector-element-type `>` vector-type ::= `vector` `<` vector-dim-list vector-element-type `>`

vector-element-type ::= float-type | integer-type | index-type vector-element-type ::= float-type | integer-type | index-type

vector-dim-list := (static-dim-list `x`)? (`[` static-dim-list `]` `x`)?

static-dimension-list ::= (decimal-literal `x`)* static-dim-list ::= decimal-literal (`x` decimal-literal)*

``` ```

The vector type represents a SIMD style vector, used by target-specific The vector type represents a SIMD style vector used by target-specific

operation sets like AVX. While the most common use is for 1D vectors (e.g. operation sets like AVX or SVE. While the most common use is for 1D

vector<16 x f32>) we also support multidimensional registers on targets that vectors (e.g. vector<16 x f32>) we also support multidimensional registers

support them (like TPUs). on targets that support them (like TPUs). The dimensions of a vector type

can be fixed-length, scalable, or a combination of the two. The scalable

aartbikUnsubmitted

Done

Perhaps you can add some text calling out that < > is fixed length and << >> is scalable?
Just because it is a new syntax that we have to get used to ;-)

aartbik: Perhaps you can add some text calling out that < > is fixed length and << >> is scalable? Just…

dimensions in a vector are indicated between square brackets ([ ]), and

all fixed-length dimensions, if present, must precede the set of scalable

jsetoainAuthorUnsubmitted

Done

dimensions in a vector are indicated between square brackets ([ ]), and

- all fixed-length dimensions, if present, must preceed the set of scalable

+ all fixed-length dimensions, if present, must precede the set of scalable

dimensions. That is, a `vector<2x[4]xf32>` is valid, but `vector<[4]x2xf32>`

And this...

jsetoain: And this...

dimensions. That is, a `vector<2x[4]xf32>` is valid, but `vector<[4]x2xf32>`

is not.

Vector shapes must be positive decimal integers. 0D vectors are allowed by Vector shapes must be positive decimal integers. 0D vectors are allowed by

omitting the dimension: `vector<f32>`. omitting the dimension: `vector<f32>`.

Note: hexadecimal integer literals are not allowed in vector type Note: hexadecimal integer literals are not allowed in vector type

declarations, `vector<0x42xi32>` is invalid because it is interpreted as a declarations, `vector<0x42xi32>` is invalid because it is interpreted as a

2D vector with shape `(0, 42)` and zero shapes are not allowed. 2D vector with shape `(0, 42)` and zero shapes are not allowed.

Examples: Examples:

```mlir ```mlir

// A 2D fixed-length vector of 3x42 i32 elements.

aartbikUnsubmitted

Done

period at end

aartbik: period at end

vector<3x42xi32> vector<3x42xi32>

// A 1D scalable-length vector that contains a multiple of 4 f32 elements.

aartbikUnsubmitted

Done

period at end

aartbik: period at end

vector<[4]xf32>

// A 2D scalable-length vector that contains a multiple of 2x8 i8 elements.

vector<[2x8]xf32>

nicolasvasilacheUnsubmitted

Done

Now that I read this, I am unclear whether vector<[2x8]xf32> is the same as vector<[2]x[8]xf32>, I would think not and the latter form could be a future extension (if so, add a TODO)?
This really depends on whether you think you can make use of vector<[2x8]xf32> in MLIR instead of having to represent as vector<[16]xf32>; I claim you would have a bunch of nice use cases for this (coupled with the shape_cast op once properly extended).

nicolasvasilache: Now that I read this, I am unclear whether `vector<[2x8]xf32>` is the same as `vector<[2]x…

jsetoainAuthorUnsubmitted

Done

It is, indeed, very much not the same. I find it useful to think about something like [2x8] as a series of 2x8 blocks, one after another. Therefore, even though they would have the same memory requirements, [2x8], [8x2], [4x4], and [16] can represent different data arrangements when you're loading your data from memory. From that point of view, [2]x[8] can't be the same as [2x8] even if the scale for both dimensions is the same. In fact, I don't think something like [2]x[8] makes sense in the context of scalable vectors. For GPU thread blocks, the situation is different. I'm not involved with that work so I can't come up with anything on the spot, but I intuit it could have potentially useful cases. As this work progresses, I suspect we will need to come back to it.

jsetoain: It is, indeed, very much not the same. I find it useful to think about something like [2x8] as…

// A 2D mixed fixed/scalable vector that contains 4 scalable vectors of 4 f32 elements.

vector<4x[4]xf32>

``` ```

}]; }];

let parameters = (ins let parameters = (ins

ArrayRefParameter<"int64_t">:$shape, ArrayRefParameter<"int64_t">:$shape,

"Type":$elementType "Type":$elementType,

"unsigned":$numScalableDims

); );

let builders = [ let builders = [

TypeBuilderWithInferredContext<(ins TypeBuilderWithInferredContext<(ins

"ArrayRef<int64_t>":$shape, "Type":$elementType "ArrayRef<int64_t>":$shape, "Type":$elementType,

CArg<"unsigned", "0">:$numScalableDims

rriddleUnsubmitted

Done

Is this wrapped at 80 characters?

rriddle: Is this wrapped at 80 characters?

jsetoainAuthorUnsubmitted

Done

Not sure what happened there. Good catch, thanks!

jsetoain: Not sure what happened there. Good catch, thanks!

rriddleUnsubmitted

Done

Can we use Optional here instead? -1 is a bit magic.

rriddle: Can we use Optional here instead? -1 is a bit magic.

jsetoainAuthorUnsubmitted

Done

Not sure how to use Optional for Types, this is the only way I found to provide a default value in a type builder. In any case, I've changed it to "numScalableDims" as suggested by Nicolas. It makes code a bit less awkward and conveniently replaces a arguably ugly "first dimension = -1" to a more semantically sensible "number of dimensions = 0". If you still find this unacceptable, I can look into adding an "Optional" equivalent for types.

jsetoain: Not sure how to use Optional for Types, this is the only way I found to provide a default value…

), [{ ), [{

return $_get(elementType.getContext(), shape, elementType); return $_get(elementType.getContext(), shape, elementType,

numScalableDims);

}]> }]>

]; ];

let extraClassDeclaration = [{ let extraClassDeclaration = [{

/// This is a builder type that keeps local references to arguments. /// This is a builder type that keeps local references to arguments.

/// Arguments that are passed into the builder must outlive the builder. /// Arguments that are passed into the builder must outlive the builder.

class Builder; class Builder;

/// Returns true of the given type can be used as an element of a vector /// Returns true if the given type can be used as an element of a vector

/// type. In particular, vectors can consist of integer, index, or float /// type. In particular, vectors can consist of integer, index, or float

/// primitives. /// primitives.

static bool isValidElementType(Type t) { static bool isValidElementType(Type t) {

return t.isa<IntegerType, IndexType, FloatType>(); return t.isa<IntegerType, IndexType, FloatType>();

} }

/// Returns true if the vector contains scalable dimensions.

bool isScalable() const {

rriddleUnsubmitted

Done

Why not just isScalable? The naming here is a bit weird.

rriddle: Why not just isScalable? The naming here is a bit weird.

return getNumScalableDims() > 0;

}

/// Get or create a new VectorType with the same shape as `this` and an /// Get or create a new VectorType with the same shape as `this` and an

/// element type of bitwidth scaled by `scale`. /// element type of bitwidth scaled by `scale`.

/// Return null if the scaled element type cannot be represented. /// Return null if the scaled element type cannot be represented.

VectorType scaleElementBitwidth(unsigned scale); VectorType scaleElementBitwidth(unsigned scale);

}]; }];

let skipDefaultBuilders = 1; let skipDefaultBuilders = 1;

let genVerifyDecl = 1; let genVerifyDecl = 1;

} }

#endif // BUILTIN_TYPES #endif // BUILTIN_TYPES

mlir/include/mlir/IR/OpBase.td

Show First 20 Lines • Show All 210 Lines • ▼ Show 20 Lines
// Explicitly disallow 0-D vectors for now until we have good enough coverage.		// Explicitly disallow 0-D vectors for now until we have good enough coverage.
def IsVectorTypePred : And<[CPred<"$_self.isa<::mlir::VectorType>()">,		def IsVectorTypePred : And<[CPred<"$_self.isa<::mlir::VectorType>()">,
CPred<"$_self.cast<::mlir::VectorType>().getRank() > 0">]>;		CPred<"$_self.cast<::mlir::VectorType>().getRank() > 0">]>;

// Temporary vector type clone that allows gradual transition to 0-D vectors.		// Temporary vector type clone that allows gradual transition to 0-D vectors.
// TODO: Remove this when all ops support 0-D vectors.		// TODO: Remove this when all ops support 0-D vectors.
def IsVectorOfAnyRankTypePred : CPred<"$_self.isa<::mlir::VectorType>()">;		def IsVectorOfAnyRankTypePred : CPred<"$_self.isa<::mlir::VectorType>()">;

		// Whether a type is a fixed-length VectorType.
		def IsFixedVectorTypePred : CPred<[{$_self.isa<::mlir::VectorType>() &&
		!$_self.cast<VectorType>().isScalable()}]>;

		// Whether a type is a scalable VectorType.
		def IsScalableVectorTypePred : CPred<[{$_self.isa<::mlir::VectorType>() &&
		$_self.cast<VectorType>().isScalable()}]>;

// Whether a type is a TensorType.		// Whether a type is a TensorType.
def IsTensorTypePred : CPred<"$_self.isa<::mlir::TensorType>()">;		def IsTensorTypePred : CPred<"$_self.isa<::mlir::TensorType>()">;

// Whether a type is a MemRefType.		// Whether a type is a MemRefType.
def IsMemRefTypePred : CPred<"$_self.isa<::mlir::MemRefType>()">;		def IsMemRefTypePred : CPred<"$_self.isa<::mlir::MemRefType>()">;

// Whether a type is an UnrankedMemRefType		// Whether a type is an UnrankedMemRefType
def IsUnrankedMemRefTypePred		def IsUnrankedMemRefTypePred
▲ Show 20 Lines • Show All 379 Lines • ▼ Show 20 Lines	ShapedContainerType<allowedTypes, IsVectorTypePred, "vector",
"::mlir::VectorType">;		"::mlir::VectorType">;

// Temporary vector type clone that allows gradual transition to 0-D vectors.		// Temporary vector type clone that allows gradual transition to 0-D vectors.
// TODO: Remove this when all ops support 0-D vectors.		// TODO: Remove this when all ops support 0-D vectors.
class VectorOfAnyRankOf<list<Type> allowedTypes> :		class VectorOfAnyRankOf<list<Type> allowedTypes> :
ShapedContainerType<allowedTypes, IsVectorOfAnyRankTypePred, "vector",		ShapedContainerType<allowedTypes, IsVectorOfAnyRankTypePred, "vector",
"::mlir::VectorType">;		"::mlir::VectorType">;

		class FixedVectorOf<list<Type> allowedTypes> :
		ShapedContainerType<allowedTypes, IsFixedVectorTypePred,
		"fixed-length vector", "::mlir::VectorType">;

		class ScalableVectorOf<list<Type> allowedTypes> :
		ShapedContainerType<allowedTypes, IsScalableVectorTypePred,
		"scalable vector", "::mlir::VectorType">;

// Whether the number of elements of a vector is from the given		// Whether the number of elements of a vector is from the given
// `allowedRanks` list		// `allowedRanks` list
class IsVectorOfRankPred<list<int> allowedRanks> :		class IsVectorOfRankPred<list<int> allowedRanks> :
And<[IsVectorTypePred,		And<[IsVectorTypePred,
Or<!foreach(allowedlength, allowedRanks,		Or<!foreach(allowedlength, allowedRanks,
CPred<[{$_self.cast<::mlir::VectorType>().getRank()		CPred<[{$_self.cast<::mlir::VectorType>().getRank()
== }]		== }]
# allowedlength>)>]>;		# allowedlength>)>]>;
Show All 16 Lines
// `allowedLengths` list		// `allowedLengths` list
class IsVectorOfLengthPred<list<int> allowedLengths> :		class IsVectorOfLengthPred<list<int> allowedLengths> :
And<[IsVectorTypePred,		And<[IsVectorTypePred,
Or<!foreach(allowedlength, allowedLengths,		Or<!foreach(allowedlength, allowedLengths,
CPred<[{$_self.cast<::mlir::VectorType>().getNumElements()		CPred<[{$_self.cast<::mlir::VectorType>().getNumElements()
== }]		== }]
# allowedlength>)>]>;		# allowedlength>)>]>;

		// Whether the number of elements of a fixed-length vector is from the given
		// `allowedLengths` list
		aartbikUnsubmitted Done Reply Inline Actions I wanted to say period at end, but I see that is not really the style in this file aartbik: I wanted to say period at end, but I see that is not really the style in this file
		class IsFixedVectorOfLengthPred<list<int> allowedLengths> :
		And<[IsFixedVectorTypePred,
		Or<!foreach(allowedlength, allowedLengths,
		CPred<[{$_self.cast<::mlir::VectorType>().getNumElements()
		== }]
		# allowedlength>)>]>;

		// Whether the number of elements of a scalable vector is from the given
		// `allowedLengths` list
		class IsScalableVectorOfLengthPred<list<int> allowedLengths> :
		And<[IsScalableVectorTypePred,
		Or<!foreach(allowedlength, allowedLengths,
		CPred<[{$_self.cast<::mlir::VectorType>().getNumElements()
		== }]
		# allowedlength>)>]>;

// Any vector where the number of elements is from the given		// Any vector where the number of elements is from the given
// `allowedLengths` list		// `allowedLengths` list
class VectorOfLength<list<int> allowedLengths> : Type<		class VectorOfLength<list<int> allowedLengths> : Type<
IsVectorOfLengthPred<allowedLengths>,		IsVectorOfLengthPred<allowedLengths>,
" of length " # !interleave(allowedLengths, "/"),		" of length " # !interleave(allowedLengths, "/"),
"::mlir::VectorType">;		"::mlir::VectorType">;

		// Any fixed-length vector where the number of elements is from the given
		// `allowedLengths` list
		class FixedVectorOfLength<list<int> allowedLengths> : Type<
		IsFixedVectorOfLengthPred<allowedLengths>,
		" of length " # !interleave(allowedLengths, "/"),
		"::mlir::VectorType">;

		// Any scalable vector where the number of elements is from the given
		// `allowedLengths` list
		class ScalableVectorOfLength<list<int> allowedLengths> : Type<
		IsScalableVectorOfLengthPred<allowedLengths>,
		" of length " # !interleave(allowedLengths, "/"),
		"::mlir::VectorType">;

// Any vector where the number of elements is from the given		// Any vector where the number of elements is from the given
// `allowedLengths` list and the type is from the given `allowedTypes`		// `allowedLengths` list and the type is from the given `allowedTypes`
// list		// list
class VectorOfLengthAndType<list<int> allowedLengths,		class VectorOfLengthAndType<list<int> allowedLengths,
list<Type> allowedTypes> : Type<		list<Type> allowedTypes> : Type<
And<[VectorOf<allowedTypes>.predicate,		And<[VectorOf<allowedTypes>.predicate,
VectorOfLength<allowedLengths>.predicate]>,		VectorOfLength<allowedLengths>.predicate]>,
VectorOf<allowedTypes>.summary # VectorOfLength<allowedLengths>.summary,		VectorOf<allowedTypes>.summary # VectorOfLength<allowedLengths>.summary,
"::mlir::VectorType">;		"::mlir::VectorType">;

		// Any fixed-length vector where the number of elements is from the given
		// `allowedLengths` list and the type is from the given `allowedTypes` list
		class FixedVectorOfLengthAndType<list<int> allowedLengths,
		list<Type> allowedTypes> : Type<
		And<[FixedVectorOf<allowedTypes>.predicate,
		FixedVectorOfLength<allowedLengths>.predicate]>,
		FixedVectorOf<allowedTypes>.summary #
		FixedVectorOfLength<allowedLengths>.summary,
		"::mlir::VectorType">;

		// Any scalable vector where the number of elements is from the given
		// `allowedLengths` list and the type is from the given `allowedTypes` list
		class ScalableVectorOfLengthAndType<list<int> allowedLengths,
		list<Type> allowedTypes> : Type<
		And<[ScalableVectorOf<allowedTypes>.predicate,
		ScalableVectorOfLength<allowedLengths>.predicate]>,
		ScalableVectorOf<allowedTypes>.summary #
		ScalableVectorOfLength<allowedLengths>.summary,
		"::mlir::VectorType">;

def AnyVector : VectorOf<[AnyType]>;		def AnyVector : VectorOf<[AnyType]>;
// Temporary vector type clone that allows gradual transition to 0-D vectors.		// Temporary vector type clone that allows gradual transition to 0-D vectors.
def AnyVectorOfAnyRank : VectorOfAnyRankOf<[AnyType]>;		def AnyVectorOfAnyRank : VectorOfAnyRankOf<[AnyType]>;

		def AnyFixedVector : FixedVectorOf<[AnyType]>;

		def AnyScalableVector : ScalableVectorOf<[AnyType]>;

// Shaped types.		// Shaped types.

def AnyShaped: ShapedContainerType<[AnyType], IsShapedTypePred, "shaped",		def AnyShaped: ShapedContainerType<[AnyType], IsShapedTypePred, "shaped",
"::mlir::ShapedType">;		"::mlir::ShapedType">;

// Tensor types.		// Tensor types.

// Any tensor type whose element type is from the given `allowedTypes` list		// Any tensor type whose element type is from the given `allowedTypes` list
▲ Show 20 Lines • Show All 2,425 Lines • Show Last 20 Lines

mlir/lib/Conversion/LLVMCommon/TypeConverter.cpp

	Show First 20 Lines • Show All 405 Lines • ▼ Show 20 Lines
	/// * n>1 `vector<ax...xkxT>` convert via an (n-1)-D array type to			/// * n>1 `vector<ax...xkxT>` convert via an (n-1)-D array type to
	/// `!llvm.array<ax...array<jxvector<kxT>>>`.			/// `!llvm.array<ax...array<jxvector<kxT>>>`.
	Type LLVMTypeConverter::convertVectorType(VectorType type) {			Type LLVMTypeConverter::convertVectorType(VectorType type) {
	auto elementType = convertType(type.getElementType());			auto elementType = convertType(type.getElementType());
	if (!elementType)			if (!elementType)
	return {};			return {};
	if (type.getShape().empty())			if (type.getShape().empty())
	return VectorType::get({1}, elementType);			return VectorType::get({1}, elementType);
	Type vectorType = VectorType::get(type.getShape().back(), elementType);			Type vectorType = VectorType::get(type.getShape().back(), elementType,
				type.getNumScalableDims());
	assert(LLVM::isCompatibleVectorType(vectorType) &&			assert(LLVM::isCompatibleVectorType(vectorType) &&
	"expected vector type compatible with the LLVM dialect");			"expected vector type compatible with the LLVM dialect");
	auto shape = type.getShape();			auto shape = type.getShape();
	for (int i = shape.size() - 2; i >= 0; --i)			for (int i = shape.size() - 2; i >= 0; --i)
	vectorType = LLVM::LLVMArrayType::get(vectorType, shape[i]);			vectorType = LLVM::LLVMArrayType::get(vectorType, shape[i]);
	return vectorType;			return vectorType;
	}			}

	▲ Show 20 Lines • Show All 146 Lines • Show Last 20 Lines

mlir/lib/Conversion/VectorToLLVM/ConvertVectorToLLVM.cpp

Show All 20 Lines

#include "mlir/Transforms/DialectConversion.h"

using namespace mlir;

using namespace mlir::vector;

// Helper to reduce vector type by one rank at front.

static VectorType reducedVectorTypeFront(VectorType tp) {

assert((tp.getRank() > 1) && "unlowerable vector type");

return VectorType::get(tp.getShape().drop_front(), tp.getElementType());

unsigned numScalableDims = tp.getNumScalableDims();

if (tp.getShape().size() == numScalableDims)

--numScalableDims;

rriddleUnsubmitted

Done

if (tp.getShape().size() == numScalableDims)

- numScalableDims--;

+ --numScalableDims;

return VectorType::get(tp.getShape().drop_front(), tp.getElementType(),

nit: Prefer pre-increment unless you need post increment behavior.

rriddle: nit: Prefer pre-increment unless you need post increment behavior.

return VectorType::get(tp.getShape().drop_front(), tp.getElementType(),

numScalableDims);

}

// Helper to reduce vector type by *all* but one rank at back.

static VectorType reducedVectorTypeBack(VectorType tp) {

assert((tp.getRank() > 1) && "unlowerable vector type");

return VectorType::get(tp.getShape().take_back(), tp.getElementType());

unsigned numScalableDims = tp.getNumScalableDims();

if (numScalableDims > 0)

--numScalableDims;

rriddleUnsubmitted

Done

Same here.

rriddle: Same here.

return VectorType::get(tp.getShape().take_back(), tp.getElementType(),

numScalableDims);

}

// Helper that picks the proper sequence for inserting.

static Value insertOne(ConversionPatternRewriter &rewriter,

LLVMTypeConverter &typeConverter, Location loc,

Value val1, Value val2, Type llvmType, int64_t rank,

int64_t pos) {

assert(rank > 0 && "0-D vector corner case should have been handled already");

▲ Show 20 Lines • Show All 63 Lines • ▼ Show 20 Lines

static Value castDataPtr(ConversionPatternRewriter &rewriter, Location loc,

Value ptr, MemRefType memRefType, Type vt) {

auto pType = LLVM::LLVMPointerType::get(vt, memRefType.getMemorySpaceAsInt());

return rewriter.create<LLVM::BitcastOp>(loc, pType, ptr);

}

namespace {

/// Trivial Vector to LLVM conversions

using VectorScaleOpConversion =

OneToOneConvertToLLVMPattern<vector::VectorScaleOp, LLVM::vscale>;

/// Conversion pattern for a vector.bitcast.

class VectorBitCastOpConversion

: public ConvertOpToLLVMPattern<vector::BitCastOp> {

public:

using ConvertOpToLLVMPattern<vector::BitCastOp>::ConvertOpToLLVMPattern;

LogicalResult

matchAndRewrite(vector::BitCastOp bitCastOp, OpAdaptor adaptor,

▲ Show 20 Lines • Show All 936 Lines • ▼ Show 20 Lines

void mlir::populateVectorToLLVMConversionPatterns(

patterns.add<VectorFMAOpNDRewritePattern>(ctx);

populateVectorInsertExtractStridedSliceTransforms(patterns);

patterns.add<VectorReductionOpConversion>(converter, reassociateFPReductions);

patterns

.add<VectorBitCastOpConversion, VectorShuffleOpConversion,

VectorExtractElementOpConversion, VectorExtractOpConversion,

VectorFMAOp1DConversion, VectorInsertElementOpConversion,

VectorInsertOpConversion, VectorPrintOpConversion,

VectorTypeCastOpConversion,

VectorTypeCastOpConversion, VectorScaleOpConversion,

VectorLoadStoreConversion<vector::LoadOp, vector::LoadOpAdaptor>,

VectorLoadStoreConversion<vector::MaskedLoadOp,

vector::MaskedLoadOpAdaptor>,

VectorLoadStoreConversion<vector::StoreOp, vector::StoreOpAdaptor>,

VectorLoadStoreConversion<vector::MaskedStoreOp,

vector::MaskedStoreOpAdaptor>,

VectorGatherOpConversion, VectorScatterOpConversion,

VectorExpandLoadOpConversion, VectorCompressStoreOpConversion>(

Show All 10 Lines

mlir/lib/Dialect/Arithmetic/IR/ArithmeticOps.cpp

	Show First 20 Lines • Show All 993 Lines • ▼ Show 20 Lines
	/// Return the type of the same shape (scalar, vector or tensor) containing i1.			/// Return the type of the same shape (scalar, vector or tensor) containing i1.
	static Type getI1SameShape(Type type) {			static Type getI1SameShape(Type type) {
	auto i1Type = IntegerType::get(type.getContext(), 1);			auto i1Type = IntegerType::get(type.getContext(), 1);
	if (auto tensorType = type.dyn_cast<RankedTensorType>())			if (auto tensorType = type.dyn_cast<RankedTensorType>())
	return RankedTensorType::get(tensorType.getShape(), i1Type);			return RankedTensorType::get(tensorType.getShape(), i1Type);
	if (type.isa<UnrankedTensorType>())			if (type.isa<UnrankedTensorType>())
	return UnrankedTensorType::get(i1Type);			return UnrankedTensorType::get(i1Type);
	if (auto vectorType = type.dyn_cast<VectorType>())			if (auto vectorType = type.dyn_cast<VectorType>())
	return VectorType::get(vectorType.getShape(), i1Type);			return VectorType::get(vectorType.getShape(), i1Type,
				vectorType.getNumScalableDims());
	return i1Type;			return i1Type;
	}			}

	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//
	// CmpIOp			// CmpIOp
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	/// Compute `lhs` `pred` `rhs`, where `pred` is one of the known integer			/// Compute `lhs` `pred` `rhs`, where `pred` is one of the known integer
	▲ Show 20 Lines • Show All 142 Lines • Show Last 20 Lines

mlir/lib/Dialect/ArmSVE/IR/ArmSVEDialect.cpp

Show All 19 Lines

#include "llvm/ADT/TypeSwitch.h"

using namespace mlir;

using namespace arm_sve;

#include "mlir/Dialect/ArmSVE/ArmSVEDialect.cpp.inc"

static Type getI1SameShape(Type type);

static void buildScalableCmpIOp(OpBuilder &build, OperationState &result,

arith::CmpIPredicate predicate, Value lhs,

Value rhs);

static void buildScalableCmpFOp(OpBuilder &build, OperationState &result,

arith::CmpFPredicate predicate, Value lhs,

Value rhs);

#define GET_OP_CLASSES

#include "mlir/Dialect/ArmSVE/ArmSVE.cpp.inc"

#define GET_TYPEDEF_CLASSES

#include "mlir/Dialect/ArmSVE/ArmSVETypes.cpp.inc"

void ArmSVEDialect::initialize() {

addOperations<

#define GET_OP_LIST

#include "mlir/Dialect/ArmSVE/ArmSVE.cpp.inc"

>();

addTypes<

#define GET_TYPEDEF_LIST

#include "mlir/Dialect/ArmSVE/ArmSVETypes.cpp.inc"

>();

}

//===----------------------------------------------------------------------===//

// ScalableVectorType

//===----------------------------------------------------------------------===//

void ScalableVectorType::print(AsmPrinter &printer) const {

printer << "<";

for (int64_t dim : getShape())

printer << dim << 'x';

printer << getElementType() << '>';

}

Type ScalableVectorType::parse(AsmParser &parser) {

SmallVector<int64_t> dims;

Type eltType;

if (parser.parseLess() ||

parser.parseDimensionList(dims, /*allowDynamic=*/false) ||

parser.parseType(eltType) || parser.parseGreater())

return {};

return ScalableVectorType::get(eltType.getContext(), dims, eltType);

}

//===----------------------------------------------------------------------===//

// ScalableVector versions of general helpers for comparison ops

//===----------------------------------------------------------------------===//

// Return the scalable vector of the same shape and containing i1.

static Type getI1SameShape(Type type) {

auto i1Type = IntegerType::get(type.getContext(), 1);

if (auto sVectorType = type.dyn_cast<ScalableVectorType>())

if (auto sVectorType = type.dyn_cast<VectorType>())

return ScalableVectorType::get(type.getContext(), sVectorType.getShape(),

return VectorType::get(sVectorType.getShape(), i1Type,

i1Type);

sVectorType.getNumScalableDims());

rriddleUnsubmitted

Done

return VectorType::get(sVectorType.getShape(), i1Type,

- /* isScalable = */ true);

+ /*isScalable=*/true);

return nullptr;

rriddle:

return nullptr;

}

//===----------------------------------------------------------------------===//

// CmpFOp

//===----------------------------------------------------------------------===//

static void buildScalableCmpFOp(OpBuilder &build, OperationState &result,

arith::CmpFPredicate predicate, Value lhs,

Value rhs) {

result.addOperands({lhs, rhs});

result.types.push_back(getI1SameShape(lhs.getType()));

result.addAttribute(ScalableCmpFOp::getPredicateAttrName(),

build.getI64IntegerAttr(static_cast<int64_t>(predicate)));

}

static void buildScalableCmpIOp(OpBuilder &build, OperationState &result,

arith::CmpIPredicate predicate, Value lhs,

Value rhs) {

result.addOperands({lhs, rhs});

result.types.push_back(getI1SameShape(lhs.getType()));

result.addAttribute(ScalableCmpIOp::getPredicateAttrName(),

build.getI64IntegerAttr(static_cast<int64_t>(predicate)));

}

mlir/lib/Dialect/ArmSVE/Transforms/LegalizeForLLVMExport.cpp

Show All 12 Lines
#include "mlir/Dialect/LLVMIR/LLVMDialect.h"		#include "mlir/Dialect/LLVMIR/LLVMDialect.h"
#include "mlir/Dialect/StandardOps/IR/Ops.h"		#include "mlir/Dialect/StandardOps/IR/Ops.h"
#include "mlir/IR/BuiltinOps.h"		#include "mlir/IR/BuiltinOps.h"
#include "mlir/IR/PatternMatch.h"		#include "mlir/IR/PatternMatch.h"

using namespace mlir;		using namespace mlir;
using namespace mlir::arm_sve;		using namespace mlir::arm_sve;

// Extract an LLVM IR type from the LLVM IR dialect type.
static Type unwrap(Type type) {
if (!type)
return nullptr;
auto *mlirContext = type.getContext();
if (!LLVM::isCompatibleType(type))
emitError(UnknownLoc::get(mlirContext),
"conversion resulted in a non-LLVM type");
return type;
}

static Optional<Type>
convertScalableVectorTypeToLLVM(ScalableVectorType svType,
LLVMTypeConverter &converter) {
auto elementType = unwrap(converter.convertType(svType.getElementType()));
if (!elementType)
return {};

auto sVectorType =
LLVM::LLVMScalableVectorType::get(elementType, svType.getShape().back());
return sVectorType;
}

template <typename OpTy>		template <typename OpTy>
class ForwardOperands : public OpConversionPattern<OpTy> {		class ForwardOperands : public OpConversionPattern<OpTy> {
using OpConversionPattern<OpTy>::OpConversionPattern;		using OpConversionPattern<OpTy>::OpConversionPattern;

LogicalResult		LogicalResult
matchAndRewrite(OpTy op, typename OpTy::Adaptor adaptor,		matchAndRewrite(OpTy op, typename OpTy::Adaptor adaptor,
ConversionPatternRewriter &rewriter) const final {		ConversionPatternRewriter &rewriter) const final {
if (adaptor.getOperands().getTypes() == op->getOperands().getTypes())		if (adaptor.getOperands().getTypes() == op->getOperands().getTypes())
Show All 13 Lines	public:
matchAndRewrite(ReturnOp op, OpAdaptor adaptor,		matchAndRewrite(ReturnOp op, OpAdaptor adaptor,
ConversionPatternRewriter &rewriter) const final {		ConversionPatternRewriter &rewriter) const final {
rewriter.updateRootInPlace(		rewriter.updateRootInPlace(
op, [&]() { op->setOperands(adaptor.getOperands()); });		op, [&]() { op->setOperands(adaptor.getOperands()); });
return success();		return success();
}		}
};		};

static Optional<Value> addUnrealizedCast(OpBuilder &builder,
ScalableVectorType svType,
ValueRange inputs, Location loc) {
if (inputs.size() != 1 \|\|
!inputs[0].getType().isa<LLVM::LLVMScalableVectorType>())
return Value();
return builder.create<UnrealizedConversionCastOp>(loc, svType, inputs)
.getResult(0);
}

using SdotOpLowering = OneToOneConvertToLLVMPattern<SdotOp, SdotIntrOp>;		using SdotOpLowering = OneToOneConvertToLLVMPattern<SdotOp, SdotIntrOp>;
using SmmlaOpLowering = OneToOneConvertToLLVMPattern<SmmlaOp, SmmlaIntrOp>;		using SmmlaOpLowering = OneToOneConvertToLLVMPattern<SmmlaOp, SmmlaIntrOp>;
using UdotOpLowering = OneToOneConvertToLLVMPattern<UdotOp, UdotIntrOp>;		using UdotOpLowering = OneToOneConvertToLLVMPattern<UdotOp, UdotIntrOp>;
using UmmlaOpLowering = OneToOneConvertToLLVMPattern<UmmlaOp, UmmlaIntrOp>;		using UmmlaOpLowering = OneToOneConvertToLLVMPattern<UmmlaOp, UmmlaIntrOp>;
using VectorScaleOpLowering =
OneToOneConvertToLLVMPattern<VectorScaleOp, VectorScaleIntrOp>;
using ScalableMaskedAddIOpLowering =		using ScalableMaskedAddIOpLowering =
OneToOneConvertToLLVMPattern<ScalableMaskedAddIOp,		OneToOneConvertToLLVMPattern<ScalableMaskedAddIOp,
ScalableMaskedAddIIntrOp>;		ScalableMaskedAddIIntrOp>;
using ScalableMaskedAddFOpLowering =		using ScalableMaskedAddFOpLowering =
OneToOneConvertToLLVMPattern<ScalableMaskedAddFOp,		OneToOneConvertToLLVMPattern<ScalableMaskedAddFOp,
ScalableMaskedAddFIntrOp>;		ScalableMaskedAddFIntrOp>;
using ScalableMaskedSubIOpLowering =		using ScalableMaskedSubIOpLowering =
OneToOneConvertToLLVMPattern<ScalableMaskedSubIOp,		OneToOneConvertToLLVMPattern<ScalableMaskedSubIOp,
Show All 12 Lines	OneToOneConvertToLLVMPattern<ScalableMaskedSDivIOp,
ScalableMaskedSDivIIntrOp>;		ScalableMaskedSDivIIntrOp>;
using ScalableMaskedUDivIOpLowering =		using ScalableMaskedUDivIOpLowering =
OneToOneConvertToLLVMPattern<ScalableMaskedUDivIOp,		OneToOneConvertToLLVMPattern<ScalableMaskedUDivIOp,
ScalableMaskedUDivIIntrOp>;		ScalableMaskedUDivIIntrOp>;
using ScalableMaskedDivFOpLowering =		using ScalableMaskedDivFOpLowering =
OneToOneConvertToLLVMPattern<ScalableMaskedDivFOp,		OneToOneConvertToLLVMPattern<ScalableMaskedDivFOp,
ScalableMaskedDivFIntrOp>;		ScalableMaskedDivFIntrOp>;

// Load operation is lowered to code that obtains a pointer to the indexed
// element and loads from it.
struct ScalableLoadOpLowering : public ConvertOpToLLVMPattern<ScalableLoadOp> {
using ConvertOpToLLVMPattern<ScalableLoadOp>::ConvertOpToLLVMPattern;

LogicalResult
matchAndRewrite(ScalableLoadOp loadOp, OpAdaptor adaptor,
ConversionPatternRewriter &rewriter) const override {
auto type = loadOp.getMemRefType();
if (!isConvertibleAndHasIdentityMaps(type))
return failure();

LLVMTypeConverter converter(loadOp.getContext());

auto resultType = loadOp.result().getType();
LLVM::LLVMPointerType llvmDataTypePtr;
if (resultType.isa<VectorType>()) {
llvmDataTypePtr =
LLVM::LLVMPointerType::get(resultType.cast<VectorType>());
} else if (resultType.isa<ScalableVectorType>()) {
llvmDataTypePtr = LLVM::LLVMPointerType::get(
convertScalableVectorTypeToLLVM(resultType.cast<ScalableVectorType>(),
converter)
.getValue());
}
Value dataPtr = getStridedElementPtr(loadOp.getLoc(), type, adaptor.base(),
adaptor.index(), rewriter);
Value bitCastedPtr = rewriter.create<LLVM::BitcastOp>(
loadOp.getLoc(), llvmDataTypePtr, dataPtr);
rewriter.replaceOpWithNewOp<LLVM::LoadOp>(loadOp, bitCastedPtr);
return success();
}
};

// Store operation is lowered to code that obtains a pointer to the indexed
// element, and stores the given value to it.
struct ScalableStoreOpLowering
: public ConvertOpToLLVMPattern<ScalableStoreOp> {
using ConvertOpToLLVMPattern<ScalableStoreOp>::ConvertOpToLLVMPattern;

LogicalResult
matchAndRewrite(ScalableStoreOp storeOp, OpAdaptor adaptor,
ConversionPatternRewriter &rewriter) const override {
auto type = storeOp.getMemRefType();
if (!isConvertibleAndHasIdentityMaps(type))
return failure();

LLVMTypeConverter converter(storeOp.getContext());

auto resultType = storeOp.value().getType();
LLVM::LLVMPointerType llvmDataTypePtr;
if (resultType.isa<VectorType>()) {
llvmDataTypePtr =
LLVM::LLVMPointerType::get(resultType.cast<VectorType>());
} else if (resultType.isa<ScalableVectorType>()) {
llvmDataTypePtr = LLVM::LLVMPointerType::get(
convertScalableVectorTypeToLLVM(resultType.cast<ScalableVectorType>(),
converter)
.getValue());
}
Value dataPtr = getStridedElementPtr(storeOp.getLoc(), type, adaptor.base(),
adaptor.index(), rewriter);
Value bitCastedPtr = rewriter.create<LLVM::BitcastOp>(
storeOp.getLoc(), llvmDataTypePtr, dataPtr);
rewriter.replaceOpWithNewOp<LLVM::StoreOp>(storeOp, adaptor.value(),
bitCastedPtr);
return success();
}
};

static void
populateBasicSVEArithmeticExportPatterns(LLVMTypeConverter &converter,
OwningRewritePatternList &patterns) {
// clang-format off
patterns.add<OneToOneConvertToLLVMPattern<ScalableAddIOp, LLVM::AddOp>,
OneToOneConvertToLLVMPattern<ScalableAddFOp, LLVM::FAddOp>,
OneToOneConvertToLLVMPattern<ScalableSubIOp, LLVM::SubOp>,
OneToOneConvertToLLVMPattern<ScalableSubFOp, LLVM::FSubOp>,
OneToOneConvertToLLVMPattern<ScalableMulIOp, LLVM::MulOp>,
OneToOneConvertToLLVMPattern<ScalableMulFOp, LLVM::FMulOp>,
OneToOneConvertToLLVMPattern<ScalableSDivIOp, LLVM::SDivOp>,
OneToOneConvertToLLVMPattern<ScalableUDivIOp, LLVM::UDivOp>,
OneToOneConvertToLLVMPattern<ScalableDivFOp, LLVM::FDivOp>
>(converter);
// clang-format on
}

static void
configureBasicSVEArithmeticLegalizations(LLVMConversionTarget &target) {
// clang-format off
target.addIllegalOp<ScalableAddIOp,
ScalableAddFOp,
ScalableSubIOp,
ScalableSubFOp,
ScalableMulIOp,
ScalableMulFOp,
ScalableSDivIOp,
ScalableUDivIOp,
ScalableDivFOp>();
// clang-format on
}

static void
populateSVEMaskGenerationExportPatterns(LLVMTypeConverter &converter,
OwningRewritePatternList &patterns) {
// clang-format off
patterns.add<OneToOneConvertToLLVMPattern<ScalableCmpFOp, LLVM::FCmpOp>,
OneToOneConvertToLLVMPattern<ScalableCmpIOp, LLVM::ICmpOp>
>(converter);
// clang-format on
}

static void
configureSVEMaskGenerationLegalizations(LLVMConversionTarget &target) {
// clang-format off
target.addIllegalOp<ScalableCmpFOp,
ScalableCmpIOp>();
// clang-format on
}

/// Populate the given list with patterns that convert from ArmSVE to LLVM.		/// Populate the given list with patterns that convert from ArmSVE to LLVM.
void mlir::populateArmSVELegalizeForLLVMExportPatterns(		void mlir::populateArmSVELegalizeForLLVMExportPatterns(
LLVMTypeConverter &converter, OwningRewritePatternList &patterns) {		LLVMTypeConverter &converter, OwningRewritePatternList &patterns) {
// Populate conversion patterns		// Populate conversion patterns
// Remove any ArmSVE-specific types from function signatures and results.
populateFuncOpTypeConversionPattern(patterns, converter);
converter.addConversion([&converter](ScalableVectorType svType) {
return convertScalableVectorTypeToLLVM(svType, converter);
});
converter.addSourceMaterialization(addUnrealizedCast);

// clang-format off		// clang-format off
patterns.add<ForwardOperands<CallOp>,		patterns.add<ForwardOperands<CallOp>,
ForwardOperands<CallIndirectOp>,		ForwardOperands<CallIndirectOp>,
ForwardOperands<ReturnOp>>(converter,		ForwardOperands<ReturnOp>>(converter,
&converter.getContext());		&converter.getContext());
patterns.add<SdotOpLowering,		patterns.add<SdotOpLowering,
SmmlaOpLowering,		SmmlaOpLowering,
UdotOpLowering,		UdotOpLowering,
UmmlaOpLowering,		UmmlaOpLowering,
VectorScaleOpLowering,
ScalableMaskedAddIOpLowering,		ScalableMaskedAddIOpLowering,
ScalableMaskedAddFOpLowering,		ScalableMaskedAddFOpLowering,
ScalableMaskedSubIOpLowering,		ScalableMaskedSubIOpLowering,
ScalableMaskedSubFOpLowering,		ScalableMaskedSubFOpLowering,
ScalableMaskedMulIOpLowering,		ScalableMaskedMulIOpLowering,
ScalableMaskedMulFOpLowering,		ScalableMaskedMulFOpLowering,
ScalableMaskedSDivIOpLowering,		ScalableMaskedSDivIOpLowering,
ScalableMaskedUDivIOpLowering,		ScalableMaskedUDivIOpLowering,
ScalableMaskedDivFOpLowering>(converter);		ScalableMaskedDivFOpLowering>(converter);
patterns.add<ScalableLoadOpLowering,
ScalableStoreOpLowering>(converter);
// clang-format on		// clang-format on
populateBasicSVEArithmeticExportPatterns(converter, patterns);
populateSVEMaskGenerationExportPatterns(converter, patterns);
}		}

void mlir::configureArmSVELegalizeForExportTarget(		void mlir::configureArmSVELegalizeForExportTarget(
LLVMConversionTarget &target) {		LLVMConversionTarget &target) {
// clang-format off		// clang-format off
target.addLegalOp<SdotIntrOp,		target.addLegalOp<SdotIntrOp,
SmmlaIntrOp,		SmmlaIntrOp,
UdotIntrOp,		UdotIntrOp,
UmmlaIntrOp,		UmmlaIntrOp,
VectorScaleIntrOp,
ScalableMaskedAddIIntrOp,		ScalableMaskedAddIIntrOp,
ScalableMaskedAddFIntrOp,		ScalableMaskedAddFIntrOp,
ScalableMaskedSubIIntrOp,		ScalableMaskedSubIIntrOp,
ScalableMaskedSubFIntrOp,		ScalableMaskedSubFIntrOp,
ScalableMaskedMulIIntrOp,		ScalableMaskedMulIIntrOp,
ScalableMaskedMulFIntrOp,		ScalableMaskedMulFIntrOp,
ScalableMaskedSDivIIntrOp,		ScalableMaskedSDivIIntrOp,
ScalableMaskedUDivIIntrOp,		ScalableMaskedUDivIIntrOp,
ScalableMaskedDivFIntrOp>();		ScalableMaskedDivFIntrOp>();
target.addIllegalOp<SdotOp,		target.addIllegalOp<SdotOp,
SmmlaOp,		SmmlaOp,
UdotOp,		UdotOp,
UmmlaOp,		UmmlaOp,
VectorScaleOp,
ScalableMaskedAddIOp,		ScalableMaskedAddIOp,
ScalableMaskedAddFOp,		ScalableMaskedAddFOp,
ScalableMaskedSubIOp,		ScalableMaskedSubIOp,
ScalableMaskedSubFOp,		ScalableMaskedSubFOp,
ScalableMaskedMulIOp,		ScalableMaskedMulIOp,
ScalableMaskedMulFOp,		ScalableMaskedMulFOp,
ScalableMaskedSDivIOp,		ScalableMaskedSDivIOp,
ScalableMaskedUDivIOp,		ScalableMaskedUDivIOp,
ScalableMaskedDivFOp,		ScalableMaskedDivFOp>();
ScalableLoadOp,
ScalableStoreOp>();
// clang-format on		// clang-format on
auto hasScalableVectorType = [](TypeRange types) {
for (Type type : types)
if (type.isa<arm_sve::ScalableVectorType>())
return true;
return false;
};
target.addDynamicallyLegalOp<FuncOp>([hasScalableVectorType](FuncOp op) {
return !hasScalableVectorType(op.getType().getInputs()) &&
!hasScalableVectorType(op.getType().getResults());
});
target.addDynamicallyLegalOp<CallOp, CallIndirectOp, ReturnOp>(
[hasScalableVectorType](Operation *op) {
return !hasScalableVectorType(op->getOperandTypes()) &&
!hasScalableVectorType(op->getResultTypes());
});
configureBasicSVEArithmeticLegalizations(target);
configureSVEMaskGenerationLegalizations(target);
}		}

mlir/lib/Dialect/LLVMIR/IR/LLVMDialect.cpp

Show First 20 Lines • Show All 149 Lines • ▼ Show 20 Lines	static ParseResult parseCmpOp(OpAsmParser &parser, OperationState &result) {

// The result type is either i1 or a vector type <? x i1> if the inputs are		// The result type is either i1 or a vector type <? x i1> if the inputs are
// vectors.		// vectors.
Type resultType = IntegerType::get(builder.getContext(), 1);		Type resultType = IntegerType::get(builder.getContext(), 1);
if (!isCompatibleType(type))		if (!isCompatibleType(type))
return parser.emitError(trailingTypeLoc,		return parser.emitError(trailingTypeLoc,
"expected LLVM dialect-compatible type");		"expected LLVM dialect-compatible type");
if (LLVM::isCompatibleVectorType(type)) {		if (LLVM::isCompatibleVectorType(type)) {
if (type.isa<LLVM::LLVMScalableVectorType>()) {		if (LLVM::isScalableVectorType(type)) {
resultType = LLVM::LLVMScalableVectorType::get(		resultType = LLVM::getVectorType(
resultType, LLVM::getVectorNumElements(type).getKnownMinValue());		resultType, LLVM::getVectorNumElements(type).getKnownMinValue(),
		/isScalable=/true);
		rriddleUnsubmitted Done Reply Inline Actions Why the extra spaces? rriddle: Why the extra spaces?
} else {		} else {
resultType = LLVM::getFixedVectorType(		resultType = LLVM::getVectorType(
resultType, LLVM::getVectorNumElements(type).getFixedValue());		resultType, LLVM::getVectorNumElements(type).getFixedValue(),
		/isScalable=/false);
}		}
}		}

result.addTypes({resultType});		result.addTypes({resultType});
return success();		return success();
}		}

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
▲ Show 20 Lines • Show All 2,529 Lines • Show Last 20 Lines

mlir/lib/Dialect/LLVMIR/IR/LLVMTypes.cpp

Show First 20 Lines • Show All 769 Lines • ▼ Show 20 Lines

return llvm::TypeSwitch<Type, Type>(type)

[](auto ty) { return ty.getElementType(); })

.Default([](Type) -> Type {

llvm_unreachable("incompatible with LLVM vector type");

});

}

llvm::ElementCount mlir::LLVM::getVectorNumElements(Type type) {

return llvm::TypeSwitch<Type, llvm::ElementCount>(type)

.Case<LLVMFixedVectorType, VectorType>([](auto ty) {

.Case([](VectorType ty) {

if (ty.isScalable())

return llvm::ElementCount::getScalable(ty.getNumElements());

return llvm::ElementCount::getFixed(ty.getNumElements());

})

rriddleUnsubmitted

Done

Drop else after return.

rriddle: Drop else after return.

.Case([](LLVMFixedVectorType ty) {

return llvm::ElementCount::getFixed(ty.getNumElements());

rriddleUnsubmitted

Done

return llvm::TypeSwitch<Type, llvm::ElementCount>(type)

- .Case<VectorType>([](auto ty) {

+ .Case([](VectorType ty) {

if (ty.getIsScalable())

return llvm::ElementCount::getScalable(ty.getNumElements());

return llvm::ElementCount::getFixed(ty.getNumElements());

})

- .Case<LLVMFixedVectorType>([](auto ty) {

+ .Case([](LLVMFixedVectorType ty) {

return llvm::ElementCount::getFixed(ty.getNumElements());

rriddle:

})

.Case([](LLVMScalableVectorType ty) {

return llvm::ElementCount::getScalable(ty.getMinNumElements());

})

.Default([](Type) -> llvm::ElementCount {

llvm_unreachable("incompatible with LLVM vector type");

});

}

bool mlir::LLVM::isScalableVectorType(Type vectorType) {

assert(

(vectorType

.isa<LLVMFixedVectorType, LLVMScalableVectorType, VectorType>()) &&

rriddleUnsubmitted

Done

bool mlir::LLVM::getIsVectorTypeScalable(Type vectorType) {

- assert((vectorType.isa<LLVMFixedVectorType>() ||

- vectorType.isa<LLVMScalableVectorType>() ||

- vectorType.isa<VectorType>()) &&

+ assert(vectorType.isa<LLVMFixedVectorType,

+ LLVMScalableVectorType, VectorType>() &&

"expected LLVM-compatible vector type");

rriddle:

"expected LLVM-compatible vector type");

return !vectorType.isa<LLVMFixedVectorType>() &&

(vectorType.isa<LLVMScalableVectorType>() ||

vectorType.cast<VectorType>().isScalable());

}

Type mlir::LLVM::getVectorType(Type elementType, unsigned numElements,

bool isScalable) {

bool useLLVM = LLVMFixedVectorType::isValidElementType(elementType);

bool useBuiltIn = VectorType::isValidElementType(elementType);

(void)useBuiltIn;

assert((useLLVM ^ useBuiltIn) && "expected LLVM-compatible fixed-vector type "

"to be either builtin or LLVM dialect type");

if (useLLVM) {

if (isScalable)

return LLVMScalableVectorType::get(elementType, numElements);

return LLVMFixedVectorType::get(elementType, numElements);

}

rriddleUnsubmitted

Done

Drop else after return.

rriddle: Drop else after return.

return VectorType::get(numElements, elementType, (unsigned)isScalable);

}

Type mlir::LLVM::getFixedVectorType(Type elementType, unsigned numElements) {

bool useLLVM = LLVMFixedVectorType::isValidElementType(elementType);

bool useBuiltIn = VectorType::isValidElementType(elementType);

(void)useBuiltIn;

assert((useLLVM ^ useBuiltIn) && "expected LLVM-compatible fixed-vector type "

"to be either builtin or LLVM dialect type");

if (useLLVM)

return LLVMFixedVectorType::get(elementType, numElements);

return VectorType::get(numElements, elementType);

}

Type mlir::LLVM::getScalableVectorType(Type elementType, unsigned numElements) {

bool useLLVM = LLVMScalableVectorType::isValidElementType(elementType);

bool useBuiltIn = VectorType::isValidElementType(elementType);

(void)useBuiltIn;

assert((useLLVM ^ useBuiltIn) && "expected LLVM-compatible scalable-vector "

"type to be either builtin or LLVM dialect "

"type");

if (useLLVM)

return LLVMScalableVectorType::get(elementType, numElements);

return VectorType::get(numElements, elementType, /*numScalableDims=*/1);

}

llvm::TypeSize mlir::LLVM::getPrimitiveTypeSizeInBits(Type type) {

assert(isCompatibleType(type) &&

"expected a type compatible with the LLVM dialect");

return llvm::TypeSwitch<Type, llvm::TypeSize>(type)

.Case<BFloat16Type, Float16Type>(

[](Type) { return llvm::TypeSize::Fixed(16); })

.Case<Float32Type>([](Type) { return llvm::TypeSize::Fixed(32); })

Show All 32 Lines

mlir/lib/Dialect/StandardOps/IR/Ops.cpp

	Show First 20 Lines • Show All 509 Lines • ▼ Show 20 Lines
	// Return the type of the same shape (scalar, vector or tensor) containing i1.			// Return the type of the same shape (scalar, vector or tensor) containing i1.
	static Type getI1SameShape(Type type) {			static Type getI1SameShape(Type type) {
	auto i1Type = IntegerType::get(type.getContext(), 1);			auto i1Type = IntegerType::get(type.getContext(), 1);
	if (auto tensorType = type.dyn_cast<RankedTensorType>())			if (auto tensorType = type.dyn_cast<RankedTensorType>())
	return RankedTensorType::get(tensorType.getShape(), i1Type);			return RankedTensorType::get(tensorType.getShape(), i1Type);
	if (type.isa<UnrankedTensorType>())			if (type.isa<UnrankedTensorType>())
	return UnrankedTensorType::get(i1Type);			return UnrankedTensorType::get(i1Type);
	if (auto vectorType = type.dyn_cast<VectorType>())			if (auto vectorType = type.dyn_cast<VectorType>())
	return VectorType::get(vectorType.getShape(), i1Type);			return VectorType::get(vectorType.getShape(), i1Type,
				vectorType.getNumScalableDims());
	return i1Type;			return i1Type;
	}			}

	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//
	// CondBranchOp			// CondBranchOp
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	namespace {			namespace {
	▲ Show 20 Lines • Show All 993 Lines • Show Last 20 Lines

mlir/lib/IR/AsmPrinter.cpp

Show First 20 Lines • Show All 1,948 Lines • ▼ Show 20 Lines	TypeSwitch<Type>(type)
} else {		} else {
os << '(';		os << '(';
interleaveComma(results, [&](Type ty) { printType(ty); });		interleaveComma(results, [&](Type ty) { printType(ty); });
os << ')';		os << ')';
}		}
})		})
.Case<VectorType>([&](VectorType vectorTy) {		.Case<VectorType>([&](VectorType vectorTy) {
os << "vector<";		os << "vector<";
for (int64_t dim : vectorTy.getShape())		auto vShape = vectorTy.getShape();
os << dim << 'x';		unsigned lastDim = vShape.size();
		unsigned lastFixedDim = lastDim - vectorTy.getNumScalableDims();
		unsigned dimIdx = 0;
		for (dimIdx = 0; dimIdx < lastFixedDim; dimIdx++)
		os << vShape[dimIdx] << 'x';
		if (vectorTy.isScalable()) {
		os << '[';
		unsigned secondToLastDim = lastDim - 1;
		for (; dimIdx < secondToLastDim; dimIdx++)
		rriddleUnsubmitted Done Reply Inline Actions Please cache the end iterator to avoid recomputing it every iteration. rriddle: Please cache the end iterator to avoid recomputing it every iteration.
		rriddleUnsubmitted Done Reply Inline Actions Unresolved. rriddle: Unresolved.
		os << vShape[dimIdx] << 'x';
		os << vShape[dimIdx] << "]x";
		}
printType(vectorTy.getElementType());		printType(vectorTy.getElementType());
os << '>';		os << '>';
})		})
.Case<RankedTensorType>([&](RankedTensorType tensorTy) {		.Case<RankedTensorType>([&](RankedTensorType tensorTy) {
os << "tensor<";		os << "tensor<";
for (int64_t dim : tensorTy.getShape()) {		for (int64_t dim : tensorTy.getShape()) {
if (ShapedType::isDynamic(dim))		if (ShapedType::isDynamic(dim))
os << '?';		os << '?';
▲ Show 20 Lines • Show All 957 Lines • Show Last 20 Lines

mlir/lib/IR/BuiltinAttributes.cpp

Show First 20 Lines • Show All 1,159 Lines • ▼ Show 20 Lines	static ShapedType mappingHelper(Fn mapping, Attr &attr, ShapedType inType,
size_t bitWidth = getDenseElementBitWidth(newElementType);		size_t bitWidth = getDenseElementBitWidth(newElementType);
size_t storageBitWidth = getDenseElementStorageWidth(bitWidth);		size_t storageBitWidth = getDenseElementStorageWidth(bitWidth);

ShapedType newArrayType;		ShapedType newArrayType;
if (inType.isa<RankedTensorType>())		if (inType.isa<RankedTensorType>())
newArrayType = RankedTensorType::get(inType.getShape(), newElementType);		newArrayType = RankedTensorType::get(inType.getShape(), newElementType);
else if (inType.isa<UnrankedTensorType>())		else if (inType.isa<UnrankedTensorType>())
newArrayType = RankedTensorType::get(inType.getShape(), newElementType);		newArrayType = RankedTensorType::get(inType.getShape(), newElementType);
else if (inType.isa<VectorType>())		else if (auto vType = inType.dyn_cast<VectorType>())
newArrayType = VectorType::get(inType.getShape(), newElementType);		newArrayType = VectorType::get(vType.getShape(), newElementType,
		vType.getNumScalableDims());
else		else
assert(newArrayType && "Unhandled tensor type");		assert(newArrayType && "Unhandled tensor type");

size_t numRawElements = attr.isSplat() ? 1 : newArrayType.getNumElements();		size_t numRawElements = attr.isSplat() ? 1 : newArrayType.getNumElements();
data.resize(llvm::divideCeil(storageBitWidth, CHAR_BIT) * numRawElements);		data.resize(llvm::divideCeil(storageBitWidth, CHAR_BIT) * numRawElements);

// Functor used to process a single element value of the attribute.		// Functor used to process a single element value of the attribute.
auto processElt = [&](decltype(*attr.begin()) value, size_t index) {		auto processElt = [&](decltype(*attr.begin()) value, size_t index) {
▲ Show 20 Lines • Show All 200 Lines • Show Last 20 Lines

mlir/lib/IR/BuiltinTypes.cpp

Show First 20 Lines • Show All 287 Lines • ▼ Show 20 Lines ShapedType ShapedType::clone(ArrayRef<int64_t> shape, Type elementType) {

if (auto other = dyn_cast<UnrankedMemRefType>()) { if (auto other = dyn_cast<UnrankedMemRefType>()) {

MemRefType::Builder b(shape, elementType); MemRefType::Builder b(shape, elementType);

b.setMemorySpace(other.getMemorySpace()); b.setMemorySpace(other.getMemorySpace());

return b; return b;

} }

if (isa<TensorType>()) if (isa<TensorType>())

return RankedTensorType::get(shape, elementType); return RankedTensorType::get(shape, elementType);

if (isa<VectorType>()) if (auto vecTy = dyn_cast<VectorType>())

return VectorType::get(shape, elementType); return VectorType::get(shape, elementType, vecTy.getNumScalableDims());

rriddleUnsubmitted

Done

Use cast if you aren't checking the result, dyn_cast> can return null.

rriddle: Use `cast` if you aren't checking the result, `dyn_cast>` can return null.

rriddleUnsubmitted

Done

return RankedTensorType::get(shape, elementType);

- if (isa<VectorType>())

+ if (auto vecTy = dyn_cast<VectorType>())

return VectorType::get(shape, elementType,

- cast<VectorType>().getIsScalable());

+ vecTy.getIsScalable());

llvm_unreachable("Unhandled ShapedType clone case");

rriddle:

llvm_unreachable("Unhandled ShapedType clone case"); llvm_unreachable("Unhandled ShapedType clone case");

} }

ShapedType ShapedType::clone(ArrayRef<int64_t> shape) { ShapedType ShapedType::clone(ArrayRef<int64_t> shape) {

if (auto other = dyn_cast<MemRefType>()) { if (auto other = dyn_cast<MemRefType>()) {

MemRefType::Builder b(other); MemRefType::Builder b(other);

b.setShape(shape); b.setShape(shape);

return b; return b;

} }

if (auto other = dyn_cast<UnrankedMemRefType>()) { if (auto other = dyn_cast<UnrankedMemRefType>()) {

MemRefType::Builder b(shape, other.getElementType()); MemRefType::Builder b(shape, other.getElementType());

b.setShape(shape); b.setShape(shape);

b.setMemorySpace(other.getMemorySpace()); b.setMemorySpace(other.getMemorySpace());

return b; return b;

} }

if (isa<TensorType>()) if (isa<TensorType>())

return RankedTensorType::get(shape, getElementType()); return RankedTensorType::get(shape, getElementType());

if (isa<VectorType>()) if (auto vecTy = dyn_cast<VectorType>())

rriddleUnsubmitted

Done

Same here.

rriddle: Same here.

return VectorType::get(shape, getElementType()); return VectorType::get(shape, getElementType(), vecTy.getNumScalableDims());

rriddleUnsubmitted

Done

Same here.

rriddle: Same here.

llvm_unreachable("Unhandled ShapedType clone case"); llvm_unreachable("Unhandled ShapedType clone case");

} }

ShapedType ShapedType::clone(Type elementType) { ShapedType ShapedType::clone(Type elementType) {

if (auto other = dyn_cast<MemRefType>()) { if (auto other = dyn_cast<MemRefType>()) {

MemRefType::Builder b(other); MemRefType::Builder b(other);

b.setElementType(elementType); b.setElementType(elementType);

return b; return b;

} }

if (auto other = dyn_cast<UnrankedMemRefType>()) { if (auto other = dyn_cast<UnrankedMemRefType>()) {

return UnrankedMemRefType::get(elementType, other.getMemorySpace()); return UnrankedMemRefType::get(elementType, other.getMemorySpace());

} }

if (isa<TensorType>()) { if (isa<TensorType>()) {

if (hasRank()) if (hasRank())

return RankedTensorType::get(getShape(), elementType); return RankedTensorType::get(getShape(), elementType);

return UnrankedTensorType::get(elementType); return UnrankedTensorType::get(elementType);

} }

if (isa<VectorType>()) if (auto vecTy = dyn_cast<VectorType>())

rriddleUnsubmitted

Done

and here.

rriddle: and here.

return VectorType::get(getShape(), elementType); return VectorType::get(getShape(), elementType, vecTy.getNumScalableDims());

rriddleUnsubmitted

Done

And here, and others.

rriddle: And here, and others.

llvm_unreachable("Unhandled ShapedType clone hit"); llvm_unreachable("Unhandled ShapedType clone hit");

} }

Type ShapedType::getElementType() const { Type ShapedType::getElementType() const {

return TypeSwitch<Type, Type>(*this) return TypeSwitch<Type, Type>(*this)

.Case<VectorType, RankedTensorType, UnrankedTensorType, MemRefType, .Case<VectorType, RankedTensorType, UnrankedTensorType, MemRefType,

UnrankedMemRefType>([](auto ty) { return ty.getElementType(); }); UnrankedMemRefType>([](auto ty) { return ty.getElementType(); });

▲ Show 20 Lines • Show All 83 Lines • ▼ Show 20 Lines bool ShapedType::hasStaticShape(ArrayRef<int64_t> shape) const {

return hasStaticShape() && getShape() == shape; return hasStaticShape() && getShape() == shape;

} }

//===----------------------------------------------------------------------===// //===----------------------------------------------------------------------===//

// VectorType // VectorType

//===----------------------------------------------------------------------===// //===----------------------------------------------------------------------===//

LogicalResult VectorType::verify(function_ref<InFlightDiagnostic()> emitError, LogicalResult VectorType::verify(function_ref<InFlightDiagnostic()> emitError,

ArrayRef<int64_t> shape, Type elementType) { ArrayRef<int64_t> shape, Type elementType,

unsigned numScalableDims) {

if (!isValidElementType(elementType)) if (!isValidElementType(elementType))

return emitError() return emitError()

<< "vector elements must be int/index/float type but got " << "vector elements must be int/index/float type but got "

<< elementType; << elementType;

if (any_of(shape, [](int64_t i) { return i <= 0; })) if (any_of(shape, [](int64_t i) { return i <= 0; }))

return emitError() return emitError()

<< "vector types must have positive constant sizes but got " << "vector types must have positive constant sizes but got "

<< shape; << shape;

return success(); return success();

} }

VectorType VectorType::scaleElementBitwidth(unsigned scale) { VectorType VectorType::scaleElementBitwidth(unsigned scale) {

if (!scale) if (!scale)

return VectorType(); return VectorType();

if (auto et = getElementType().dyn_cast<IntegerType>()) if (auto et = getElementType().dyn_cast<IntegerType>())

if (auto scaledEt = et.scaleElementBitwidth(scale)) if (auto scaledEt = et.scaleElementBitwidth(scale))

return VectorType::get(getShape(), scaledEt); return VectorType::get(getShape(), scaledEt, getNumScalableDims());

if (auto et = getElementType().dyn_cast<FloatType>()) if (auto et = getElementType().dyn_cast<FloatType>())

if (auto scaledEt = et.scaleElementBitwidth(scale)) if (auto scaledEt = et.scaleElementBitwidth(scale))

return VectorType::get(getShape(), scaledEt); return VectorType::get(getShape(), scaledEt, getNumScalableDims());

return VectorType(); return VectorType();

} }

void VectorType::walkImmediateSubElements( void VectorType::walkImmediateSubElements(

function_ref<void(Attribute)> walkAttrsFn, function_ref<void(Attribute)> walkAttrsFn,

function_ref<void(Type)> walkTypesFn) const { function_ref<void(Type)> walkTypesFn) const {

walkTypesFn(getElementType()); walkTypesFn(getElementType());

} }

▲ Show 20 Lines • Show All 733 Lines • Show Last 20 Lines

mlir/lib/Parser/Parser.h

Show First 20 Lines • Show All 190 Lines • ▼ Show 20 Lines	public:
/// Parse a tensor type.		/// Parse a tensor type.
Type parseTensorType();		Type parseTensorType();

/// Parse a tuple type.		/// Parse a tuple type.
Type parseTupleType();		Type parseTupleType();

/// Parse a vector type.		/// Parse a vector type.
VectorType parseVectorType();		VectorType parseVectorType();
		ParseResult parseVectorDimensionList(SmallVectorImpl<int64_t> &dimensions,
		unsigned &numScalableDims);
ParseResult parseDimensionListRanked(SmallVectorImpl<int64_t> &dimensions,		ParseResult parseDimensionListRanked(SmallVectorImpl<int64_t> &dimensions,
bool allowDynamic = true);		bool allowDynamic = true);
		ParseResult parseIntegerInDimensionList(int64_t &value);
ParseResult parseXInDimensionList();		ParseResult parseXInDimensionList();

/// Parse strided layout specification.		/// Parse strided layout specification.
ParseResult parseStridedLayout(int64_t &offset,		ParseResult parseStridedLayout(int64_t &offset,
SmallVectorImpl<int64_t> &strides);		SmallVectorImpl<int64_t> &strides);

// Parse a brace-delimiter list of comma-separated integers with `?` as an		// Parse a brace-delimiter list of comma-separated integers with `?` as an
// unknown marker.		// unknown marker.
▲ Show 20 Lines • Show All 99 Lines • Show Last 20 Lines

mlir/lib/Parser/TypeParser.cpp

//===- TypeParser.cpp - MLIR Type Parser Implementation -------------------===//

// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.

// See https://llvm.org/LICENSE.txt for license information.

// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception

//===----------------------------------------------------------------------===//

// This file implements the parser for the MLIR Types.

//===----------------------------------------------------------------------===//

#include "Parser.h"

#include "mlir/IR/AffineMap.h"

#include "mlir/IR/BuiltinTypes.h"

#include "mlir/IR/OpDefinition.h"

#include "mlir/IR/TensorEncoding.h"

using namespace mlir;

using namespace mlir::detail;

/// Optionally parse a type.

OptionalParseResult Parser::parseOptionalType(Type &type) {

// There are many different starting tokens for a type, check them here.

▲ Show 20 Lines • Show All 413 Lines • ▼ Show 20 Lines

if (parseTypeListNoParens(types) ||

parseToken(Token::greater, "expected '>' in tuple type"))

return nullptr;

return TupleType::get(getContext(), types);

}

/// Parse a vector type.

///

/// vector-type ::= `vector` `<` static-dimension-list type `>`

/// vector-type ::= `vector` `<` vector-dim-list vector-element-type `>`

/// static-dimension-list ::= (decimal-literal `x`)*

/// vector-dim-list := (static-dim-list `x`)? (`[` static-dim-list `]` `x`)?

/// static-dim-list ::= decimal-literal (`x` decimal-literal)*

///

VectorType Parser::parseVectorType() {

consumeToken(Token::kw_vector);

if (parseToken(Token::less, "expected '<' in vector type"))

return nullptr;

SmallVector<int64_t, 4> dimensions;

if (parseDimensionListRanked(dimensions, /*allowDynamic=*/false))

unsigned numScalableDims;

if (parseVectorDimensionList(dimensions, numScalableDims))

rriddleUnsubmitted

Done

return nullptr;

- bool isScalable = false;

- if (consumeIf(Token::less))

- isScalable = true;

+ bool isScalable = consumeIf(Token::less);

SmallVector<int64_t, 4> dimensions;

rriddle:

jsetoainAuthorUnsubmitted

Done

Arg! That was embarrassing... Sorry about that!

jsetoain: Arg! That was embarrassing... Sorry about that!

return nullptr;

if (any_of(dimensions, [](int64_t i) { return i <= 0; }))

return emitError(getToken().getLoc(),

"vector types must have positive constant sizes"),

nullptr;

// Parse the element type.

auto typeLoc = getToken().getLoc();

auto elementType = parseType();

if (!elementType || parseToken(Token::greater, "expected '>' in vector type"))

return nullptr;

if (!VectorType::isValidElementType(elementType))

return emitError(typeLoc, "vector elements must be int/index/float type"),

nullptr;

return VectorType::get(dimensions, elementType);

return VectorType::get(dimensions, elementType, numScalableDims);

}

/// Parse a dimension list in a vector type. This populates the dimension list,

/// and returns the number of scalable dimensions in `numScalableDims`.

///

/// vector-dim-list := (static-dim-list `x`)? (`[` static-dim-list `]` `x`)?

/// static-dim-list ::= decimal-literal (`x` decimal-literal)*

///

ParseResult

Parser::parseVectorDimensionList(SmallVectorImpl<int64_t> &dimensions,

unsigned &numScalableDims) {

numScalableDims = 0;

// If there is a set of fixed-length dimensions, consume it

while (getToken().is(Token::integer)) {

int64_t value;

if (parseIntegerInDimensionList(value))

return failure();

dimensions.push_back(value);

// Make sure we have an 'x' or something like 'xbf32'.

if (parseXInDimensionList())

return failure();

}

// If there is a set of scalable dimensions, consume it

if (consumeIf(Token::l_square)) {

while (getToken().is(Token::integer)) {

int64_t value;

if (parseIntegerInDimensionList(value))

return failure();

dimensions.push_back(value);

numScalableDims++;

// Check if we have reached the end of the scalable dimension list

if (consumeIf(Token::r_square)) {

// Make sure we have something like 'xbf32'.

if (parseXInDimensionList())

return failure();

return success();

}

// Make sure we have an 'x'

if (parseXInDimensionList())

return failure();

}

// If we make it here, we've finished parsing the dimension list

// without finding ']' closing the set of scalable dimensions

return emitError("missing ']' closing set of scalable dimensions");

}

return success();

rriddleUnsubmitted

Done

Looks like this is missing test coverage.

rriddle: Looks like this is missing test coverage.

}

/// Parse a dimension list of a tensor or memref type. This populates the

/// dimension list, using -1 for the `?` dimensions if `allowDynamic` is set and

/// errors out on `?` otherwise.

///

/// dimension-list-ranked ::= (dimension `x`)*

/// dimension ::= `?` | decimal-literal

///

/// When `allowDynamic` is not set, this is used to parse:

///

/// static-dimension-list ::= (decimal-literal `x`)*

ParseResult

Parser::parseDimensionListRanked(SmallVectorImpl<int64_t> &dimensions,

bool allowDynamic) {

while (getToken().isAny(Token::integer, Token::question)) {

if (consumeIf(Token::question)) {

if (!allowDynamic)

return emitError("expected static shape");

dimensions.push_back(-1);

} else {

int64_t value;

if (parseIntegerInDimensionList(value))

return failure();

dimensions.push_back(value);

}

// Make sure we have an 'x' or something like 'xbf32'.

if (parseXInDimensionList())

return failure();

}

return success();

}

ParseResult Parser::parseIntegerInDimensionList(int64_t &value) {

// Hexadecimal integer literals (starting with `0x`) are not allowed in

// aggregate type declarations. Therefore, `0xf32` should be processed as

// a sequence of separate elements `0`, `x`, `f32`.

if (getTokenSpelling().size() > 1 && getTokenSpelling()[1] == 'x') {

// We can get here only if the token is an integer literal. Hexadecimal

// integer literals can only start with `0x` (`1x` wouldn't lex as a

// literal, just `1` would, at which point we don't get into this

// branch).

assert(getTokenSpelling()[0] == '0' && "invalid integer literal");

dimensions.push_back(0);

value = 0;

state.lex.resetPointer(getTokenSpelling().data() + 1);

consumeToken();

} else {

// Make sure this integer value is in bound and valid.

Optional<uint64_t> dimension = getToken().getUInt64IntegerValue();

if (!dimension || *dimension > std::numeric_limits<int64_t>::max())

return emitError("invalid dimension");

dimensions.push_back((int64_t)dimension.getValue());

value = (int64_t)dimension.getValue();

consumeToken(Token::integer);

}

// Make sure we have an 'x' or something like 'xbf32'.

if (parseXInDimensionList())

return failure();

}

return success();

}

/// Parse an 'x' token in a dimension list, handling the case where the x is

/// juxtaposed with an element type, as in "xf32", leaving the "f32" as the next

/// token.

ParseResult Parser::parseXInDimensionList() {

if (getToken().isNot(Token::bare_identifier) || getTokenSpelling()[0] != 'x')

▲ Show 20 Lines • Show All 42 Lines • Show Last 20 Lines

mlir/lib/Target/LLVMIR/ModuleTranslation.cpp

Show First 20 Lines • Show All 236 Lines • ▼ Show 20 Lines	if (auto funcAttr = attr.dyn_cast<FlatSymbolRefAttr>())
return llvm::ConstantExpr::getBitCast(		return llvm::ConstantExpr::getBitCast(
moduleTranslation.lookupFunction(funcAttr.getValue()), llvmType);		moduleTranslation.lookupFunction(funcAttr.getValue()), llvmType);
if (auto splatAttr = attr.dyn_cast<SplatElementsAttr>()) {		if (auto splatAttr = attr.dyn_cast<SplatElementsAttr>()) {
llvm::Type *elementType;		llvm::Type *elementType;
uint64_t numElements;		uint64_t numElements;
if (auto *arrayTy = dyn_cast<llvm::ArrayType>(llvmType)) {		if (auto *arrayTy = dyn_cast<llvm::ArrayType>(llvmType)) {
elementType = arrayTy->getElementType();		elementType = arrayTy->getElementType();
numElements = arrayTy->getNumElements();		numElements = arrayTy->getNumElements();
		} else if (auto fVectorTy = dyn_cast<llvm::FixedVectorType>(llvmType)) {
		elementType = fVectorTy->getElementType();
		numElements = fVectorTy->getNumElements();
		} else if (auto sVectorTy = dyn_cast<llvm::ScalableVectorType>(llvmType)) {
		elementType = sVectorTy->getElementType();
		numElements = sVectorTy->getMinNumElements();
} else {		} else {
auto *vectorTy = cast<llvm::FixedVectorType>(llvmType);		llvm_unreachable("unrecognized constant vector type");
elementType = vectorTy->getElementType();
numElements = vectorTy->getNumElements();
}		}
// Splat value is a scalar. Extract it only if the element type is not		// Splat value is a scalar. Extract it only if the element type is not
// another sequence type. The recursion terminates because each step removes		// another sequence type. The recursion terminates because each step removes
// one outer sequential type.		// one outer sequential type.
bool elementTypeSequential =		bool elementTypeSequential =
isa<llvm::ArrayType, llvm::VectorType>(elementType);		isa<llvm::ArrayType, llvm::VectorType>(elementType);
llvm::Constant *child = getLLVMConstant(		llvm::Constant *child = getLLVMConstant(
elementType,		elementType,
▲ Show 20 Lines • Show All 786 Lines • Show Last 20 Lines

mlir/lib/Target/LLVMIR/TypeToLLVM.cpp

Show First 20 Lines • Show All 131 Lines • ▼ Show 20 Lines	llvm::Type *translate(LLVM::LLVMStructType type) {
structType->setBody(subtypes, type.isPacked());		structType->setBody(subtypes, type.isPacked());
return structType;		return structType;
}		}

/// Translates the given built-in vector type compatible with LLVM.		/// Translates the given built-in vector type compatible with LLVM.
llvm::Type *translate(VectorType type) {		llvm::Type *translate(VectorType type) {
assert(LLVM::isCompatibleVectorType(type) &&		assert(LLVM::isCompatibleVectorType(type) &&
"expected compatible with LLVM vector type");		"expected compatible with LLVM vector type");
		if (type.isScalable())
		return llvm::ScalableVectorType::get(translateType(type.getElementType()),
		type.getNumElements());
return llvm::FixedVectorType::get(translateType(type.getElementType()),		return llvm::FixedVectorType::get(translateType(type.getElementType()),
type.getNumElements());		type.getNumElements());
}		}

/// Translates the given fixed-vector type.		/// Translates the given fixed-vector type.
llvm::Type *translate(LLVM::LLVMFixedVectorType type) {		llvm::Type *translate(LLVM::LLVMFixedVectorType type) {
		rriddleUnsubmitted Done Reply Inline Actions Drop else after return. rriddle: Drop else after return.
return llvm::FixedVectorType::get(translateType(type.getElementType()),		return llvm::FixedVectorType::get(translateType(type.getElementType()),
type.getNumElements());		type.getNumElements());
}		}

/// Translates the given scalable-vector type.		/// Translates the given scalable-vector type.
llvm::Type *translate(LLVM::LLVMScalableVectorType type) {		llvm::Type *translate(LLVM::LLVMScalableVectorType type) {
return llvm::ScalableVectorType::get(translateType(type.getElementType()),		return llvm::ScalableVectorType::get(translateType(type.getElementType()),
type.getMinNumElements());		type.getMinNumElements());
Show All 36 Lines

mlir/test/Dialect/Arithmetic/ops.mlir

Show All 13 Lines
}		}

// CHECK-LABEL: test_addi_vector		// CHECK-LABEL: test_addi_vector
func @test_addi_vector(%arg0 : vector<8xi64>, %arg1 : vector<8xi64>) -> vector<8xi64> {		func @test_addi_vector(%arg0 : vector<8xi64>, %arg1 : vector<8xi64>) -> vector<8xi64> {
%0 = arith.addi %arg0, %arg1 : vector<8xi64>		%0 = arith.addi %arg0, %arg1 : vector<8xi64>
return %0 : vector<8xi64>		return %0 : vector<8xi64>
}		}

		// CHECK-LABEL: test_addi_scalable_vector
		func @test_addi_scalable_vector(%arg0 : vector<[8]xi64>, %arg1 : vector<[8]xi64>) -> vector<[8]xi64> {
		%0 = arith.addi %arg0, %arg1 : vector<[8]xi64>
		return %0 : vector<[8]xi64>
		}

// CHECK-LABEL: test_subi		// CHECK-LABEL: test_subi
func @test_subi(%arg0 : i64, %arg1 : i64) -> i64 {		func @test_subi(%arg0 : i64, %arg1 : i64) -> i64 {
%0 = arith.subi %arg0, %arg1 : i64		%0 = arith.subi %arg0, %arg1 : i64
return %0 : i64		return %0 : i64
}		}

// CHECK-LABEL: test_subi_tensor		// CHECK-LABEL: test_subi_tensor
func @test_subi_tensor(%arg0 : tensor<8x8xi64>, %arg1 : tensor<8x8xi64>) -> tensor<8x8xi64> {		func @test_subi_tensor(%arg0 : tensor<8x8xi64>, %arg1 : tensor<8x8xi64>) -> tensor<8x8xi64> {
%0 = arith.subi %arg0, %arg1 : tensor<8x8xi64>		%0 = arith.subi %arg0, %arg1 : tensor<8x8xi64>
return %0 : tensor<8x8xi64>		return %0 : tensor<8x8xi64>
}		}

// CHECK-LABEL: test_subi_vector		// CHECK-LABEL: test_subi_vector
func @test_subi_vector(%arg0 : vector<8xi64>, %arg1 : vector<8xi64>) -> vector<8xi64> {		func @test_subi_vector(%arg0 : vector<8xi64>, %arg1 : vector<8xi64>) -> vector<8xi64> {
%0 = arith.subi %arg0, %arg1 : vector<8xi64>		%0 = arith.subi %arg0, %arg1 : vector<8xi64>
return %0 : vector<8xi64>		return %0 : vector<8xi64>
}		}

		// CHECK-LABEL: test_subi_scalable_vector
		func @test_subi_scalable_vector(%arg0 : vector<[8]xi64>, %arg1 : vector<[8]xi64>) -> vector<[8]xi64> {
		%0 = arith.subi %arg0, %arg1 : vector<[8]xi64>
		return %0 : vector<[8]xi64>
		}

// CHECK-LABEL: test_muli		// CHECK-LABEL: test_muli
func @test_muli(%arg0 : i64, %arg1 : i64) -> i64 {		func @test_muli(%arg0 : i64, %arg1 : i64) -> i64 {
%0 = arith.muli %arg0, %arg1 : i64		%0 = arith.muli %arg0, %arg1 : i64
return %0 : i64		return %0 : i64
}		}

// CHECK-LABEL: test_muli_tensor		// CHECK-LABEL: test_muli_tensor
func @test_muli_tensor(%arg0 : tensor<8x8xi64>, %arg1 : tensor<8x8xi64>) -> tensor<8x8xi64> {		func @test_muli_tensor(%arg0 : tensor<8x8xi64>, %arg1 : tensor<8x8xi64>) -> tensor<8x8xi64> {
%0 = arith.muli %arg0, %arg1 : tensor<8x8xi64>		%0 = arith.muli %arg0, %arg1 : tensor<8x8xi64>
return %0 : tensor<8x8xi64>		return %0 : tensor<8x8xi64>
}		}

// CHECK-LABEL: test_muli_vector		// CHECK-LABEL: test_muli_vector
func @test_muli_vector(%arg0 : vector<8xi64>, %arg1 : vector<8xi64>) -> vector<8xi64> {		func @test_muli_vector(%arg0 : vector<8xi64>, %arg1 : vector<8xi64>) -> vector<8xi64> {
%0 = arith.muli %arg0, %arg1 : vector<8xi64>		%0 = arith.muli %arg0, %arg1 : vector<8xi64>
return %0 : vector<8xi64>		return %0 : vector<8xi64>
}		}

		// CHECK-LABEL: test_muli_scalable_vector
		func @test_muli_scalable_vector(%arg0 : vector<[8]xi64>, %arg1 : vector<[8]xi64>) -> vector<[8]xi64> {
		%0 = arith.muli %arg0, %arg1 : vector<[8]xi64>
		return %0 : vector<[8]xi64>
		}

// CHECK-LABEL: test_divui		// CHECK-LABEL: test_divui
func @test_divui(%arg0 : i64, %arg1 : i64) -> i64 {		func @test_divui(%arg0 : i64, %arg1 : i64) -> i64 {
%0 = arith.divui %arg0, %arg1 : i64		%0 = arith.divui %arg0, %arg1 : i64
return %0 : i64		return %0 : i64
}		}

// CHECK-LABEL: test_divui_tensor		// CHECK-LABEL: test_divui_tensor
func @test_divui_tensor(%arg0 : tensor<8x8xi64>, %arg1 : tensor<8x8xi64>) -> tensor<8x8xi64> {		func @test_divui_tensor(%arg0 : tensor<8x8xi64>, %arg1 : tensor<8x8xi64>) -> tensor<8x8xi64> {
%0 = arith.divui %arg0, %arg1 : tensor<8x8xi64>		%0 = arith.divui %arg0, %arg1 : tensor<8x8xi64>
return %0 : tensor<8x8xi64>		return %0 : tensor<8x8xi64>
}		}

// CHECK-LABEL: test_divui_vector		// CHECK-LABEL: test_divui_vector
func @test_divui_vector(%arg0 : vector<8xi64>, %arg1 : vector<8xi64>) -> vector<8xi64> {		func @test_divui_vector(%arg0 : vector<8xi64>, %arg1 : vector<8xi64>) -> vector<8xi64> {
%0 = arith.divui %arg0, %arg1 : vector<8xi64>		%0 = arith.divui %arg0, %arg1 : vector<8xi64>
return %0 : vector<8xi64>		return %0 : vector<8xi64>
}		}

		// CHECK-LABEL: test_divui_scalable_vector
		func @test_divui_scalable_vector(%arg0 : vector<[8]xi64>, %arg1 : vector<[8]xi64>) -> vector<[8]xi64> {
		%0 = arith.divui %arg0, %arg1 : vector<[8]xi64>
		return %0 : vector<[8]xi64>
		}

// CHECK-LABEL: test_divsi		// CHECK-LABEL: test_divsi
func @test_divsi(%arg0 : i64, %arg1 : i64) -> i64 {		func @test_divsi(%arg0 : i64, %arg1 : i64) -> i64 {
%0 = arith.divsi %arg0, %arg1 : i64		%0 = arith.divsi %arg0, %arg1 : i64
return %0 : i64		return %0 : i64
}		}

// CHECK-LABEL: test_divsi_tensor		// CHECK-LABEL: test_divsi_tensor
func @test_divsi_tensor(%arg0 : tensor<8x8xi64>, %arg1 : tensor<8x8xi64>) -> tensor<8x8xi64> {		func @test_divsi_tensor(%arg0 : tensor<8x8xi64>, %arg1 : tensor<8x8xi64>) -> tensor<8x8xi64> {
%0 = arith.divsi %arg0, %arg1 : tensor<8x8xi64>		%0 = arith.divsi %arg0, %arg1 : tensor<8x8xi64>
return %0 : tensor<8x8xi64>		return %0 : tensor<8x8xi64>
}		}

// CHECK-LABEL: test_divsi_vector		// CHECK-LABEL: test_divsi_vector
func @test_divsi_vector(%arg0 : vector<8xi64>, %arg1 : vector<8xi64>) -> vector<8xi64> {		func @test_divsi_vector(%arg0 : vector<8xi64>, %arg1 : vector<8xi64>) -> vector<8xi64> {
%0 = arith.divsi %arg0, %arg1 : vector<8xi64>		%0 = arith.divsi %arg0, %arg1 : vector<8xi64>
return %0 : vector<8xi64>		return %0 : vector<8xi64>
}		}

		// CHECK-LABEL: test_divsi_scalable_vector
		func @test_divsi_scalable_vector(%arg0 : vector<[8]xi64>, %arg1 : vector<[8]xi64>) -> vector<[8]xi64> {
		%0 = arith.divsi %arg0, %arg1 : vector<[8]xi64>
		return %0 : vector<[8]xi64>
		}

// CHECK-LABEL: test_remui		// CHECK-LABEL: test_remui
func @test_remui(%arg0 : i64, %arg1 : i64) -> i64 {		func @test_remui(%arg0 : i64, %arg1 : i64) -> i64 {
%0 = arith.remui %arg0, %arg1 : i64		%0 = arith.remui %arg0, %arg1 : i64
return %0 : i64		return %0 : i64
}		}

// CHECK-LABEL: test_remui_tensor		// CHECK-LABEL: test_remui_tensor
func @test_remui_tensor(%arg0 : tensor<8x8xi64>, %arg1 : tensor<8x8xi64>) -> tensor<8x8xi64> {		func @test_remui_tensor(%arg0 : tensor<8x8xi64>, %arg1 : tensor<8x8xi64>) -> tensor<8x8xi64> {
%0 = arith.remui %arg0, %arg1 : tensor<8x8xi64>		%0 = arith.remui %arg0, %arg1 : tensor<8x8xi64>
return %0 : tensor<8x8xi64>		return %0 : tensor<8x8xi64>
}		}

// CHECK-LABEL: test_remui_vector		// CHECK-LABEL: test_remui_vector
func @test_remui_vector(%arg0 : vector<8xi64>, %arg1 : vector<8xi64>) -> vector<8xi64> {		func @test_remui_vector(%arg0 : vector<8xi64>, %arg1 : vector<8xi64>) -> vector<8xi64> {
%0 = arith.remui %arg0, %arg1 : vector<8xi64>		%0 = arith.remui %arg0, %arg1 : vector<8xi64>
return %0 : vector<8xi64>		return %0 : vector<8xi64>
}		}

		// CHECK-LABEL: test_remui_scalable_vector
		func @test_remui_scalable_vector(%arg0 : vector<[8]xi64>, %arg1 : vector<[8]xi64>) -> vector<[8]xi64> {
		%0 = arith.remui %arg0, %arg1 : vector<[8]xi64>
		return %0 : vector<[8]xi64>
		}

// CHECK-LABEL: test_remsi		// CHECK-LABEL: test_remsi
func @test_remsi(%arg0 : i64, %arg1 : i64) -> i64 {		func @test_remsi(%arg0 : i64, %arg1 : i64) -> i64 {
%0 = arith.remsi %arg0, %arg1 : i64		%0 = arith.remsi %arg0, %arg1 : i64
return %0 : i64		return %0 : i64
}		}

// CHECK-LABEL: test_remsi_tensor		// CHECK-LABEL: test_remsi_tensor
func @test_remsi_tensor(%arg0 : tensor<8x8xi64>, %arg1 : tensor<8x8xi64>) -> tensor<8x8xi64> {		func @test_remsi_tensor(%arg0 : tensor<8x8xi64>, %arg1 : tensor<8x8xi64>) -> tensor<8x8xi64> {
%0 = arith.remsi %arg0, %arg1 : tensor<8x8xi64>		%0 = arith.remsi %arg0, %arg1 : tensor<8x8xi64>
return %0 : tensor<8x8xi64>		return %0 : tensor<8x8xi64>
}		}

// CHECK-LABEL: test_remsi_vector		// CHECK-LABEL: test_remsi_vector
func @test_remsi_vector(%arg0 : vector<8xi64>, %arg1 : vector<8xi64>) -> vector<8xi64> {		func @test_remsi_vector(%arg0 : vector<8xi64>, %arg1 : vector<8xi64>) -> vector<8xi64> {
%0 = arith.remsi %arg0, %arg1 : vector<8xi64>		%0 = arith.remsi %arg0, %arg1 : vector<8xi64>
return %0 : vector<8xi64>		return %0 : vector<8xi64>
}		}

		// CHECK-LABEL: test_remsi_scalable_vector
		func @test_remsi_scalable_vector(%arg0 : vector<[8]xi64>, %arg1 : vector<[8]xi64>) -> vector<[8]xi64> {
		%0 = arith.remsi %arg0, %arg1 : vector<[8]xi64>
		return %0 : vector<[8]xi64>
		}

// CHECK-LABEL: test_andi		// CHECK-LABEL: test_andi
func @test_andi(%arg0 : i64, %arg1 : i64) -> i64 {		func @test_andi(%arg0 : i64, %arg1 : i64) -> i64 {
%0 = arith.andi %arg0, %arg1 : i64		%0 = arith.andi %arg0, %arg1 : i64
return %0 : i64		return %0 : i64
}		}

// CHECK-LABEL: test_andi_tensor		// CHECK-LABEL: test_andi_tensor
func @test_andi_tensor(%arg0 : tensor<8x8xi64>, %arg1 : tensor<8x8xi64>) -> tensor<8x8xi64> {		func @test_andi_tensor(%arg0 : tensor<8x8xi64>, %arg1 : tensor<8x8xi64>) -> tensor<8x8xi64> {
%0 = arith.andi %arg0, %arg1 : tensor<8x8xi64>		%0 = arith.andi %arg0, %arg1 : tensor<8x8xi64>
return %0 : tensor<8x8xi64>		return %0 : tensor<8x8xi64>
}		}

// CHECK-LABEL: test_andi_vector		// CHECK-LABEL: test_andi_vector
func @test_andi_vector(%arg0 : vector<8xi64>, %arg1 : vector<8xi64>) -> vector<8xi64> {		func @test_andi_vector(%arg0 : vector<8xi64>, %arg1 : vector<8xi64>) -> vector<8xi64> {
%0 = arith.andi %arg0, %arg1 : vector<8xi64>		%0 = arith.andi %arg0, %arg1 : vector<8xi64>
return %0 : vector<8xi64>		return %0 : vector<8xi64>
}		}

		// CHECK-LABEL: test_andi_scalable_vector
		func @test_andi_scalable_vector(%arg0 : vector<[8]xi64>, %arg1 : vector<[8]xi64>) -> vector<[8]xi64> {
		%0 = arith.andi %arg0, %arg1 : vector<[8]xi64>
		return %0 : vector<[8]xi64>
		}

// CHECK-LABEL: test_ori		// CHECK-LABEL: test_ori
func @test_ori(%arg0 : i64, %arg1 : i64) -> i64 {		func @test_ori(%arg0 : i64, %arg1 : i64) -> i64 {
%0 = arith.ori %arg0, %arg1 : i64		%0 = arith.ori %arg0, %arg1 : i64
return %0 : i64		return %0 : i64
}		}

// CHECK-LABEL: test_ori_tensor		// CHECK-LABEL: test_ori_tensor
func @test_ori_tensor(%arg0 : tensor<8x8xi64>, %arg1 : tensor<8x8xi64>) -> tensor<8x8xi64> {		func @test_ori_tensor(%arg0 : tensor<8x8xi64>, %arg1 : tensor<8x8xi64>) -> tensor<8x8xi64> {
%0 = arith.ori %arg0, %arg1 : tensor<8x8xi64>		%0 = arith.ori %arg0, %arg1 : tensor<8x8xi64>
return %0 : tensor<8x8xi64>		return %0 : tensor<8x8xi64>
}		}

// CHECK-LABEL: test_ori_vector		// CHECK-LABEL: test_ori_vector
func @test_ori_vector(%arg0 : vector<8xi64>, %arg1 : vector<8xi64>) -> vector<8xi64> {		func @test_ori_vector(%arg0 : vector<8xi64>, %arg1 : vector<8xi64>) -> vector<8xi64> {
%0 = arith.ori %arg0, %arg1 : vector<8xi64>		%0 = arith.ori %arg0, %arg1 : vector<8xi64>
return %0 : vector<8xi64>		return %0 : vector<8xi64>
}		}

		// CHECK-LABEL: test_ori_scalable_vector
		func @test_ori_scalable_vector(%arg0 : vector<[8]xi64>, %arg1 : vector<[8]xi64>) -> vector<[8]xi64> {
		%0 = arith.ori %arg0, %arg1 : vector<[8]xi64>
		return %0 : vector<[8]xi64>
		}

// CHECK-LABEL: test_xori		// CHECK-LABEL: test_xori
func @test_xori(%arg0 : i64, %arg1 : i64) -> i64 {		func @test_xori(%arg0 : i64, %arg1 : i64) -> i64 {
%0 = arith.xori %arg0, %arg1 : i64		%0 = arith.xori %arg0, %arg1 : i64
return %0 : i64		return %0 : i64
}		}

// CHECK-LABEL: test_xori_tensor		// CHECK-LABEL: test_xori_tensor
func @test_xori_tensor(%arg0 : tensor<8x8xi64>, %arg1 : tensor<8x8xi64>) -> tensor<8x8xi64> {		func @test_xori_tensor(%arg0 : tensor<8x8xi64>, %arg1 : tensor<8x8xi64>) -> tensor<8x8xi64> {
%0 = arith.xori %arg0, %arg1 : tensor<8x8xi64>		%0 = arith.xori %arg0, %arg1 : tensor<8x8xi64>
return %0 : tensor<8x8xi64>		return %0 : tensor<8x8xi64>
}		}

// CHECK-LABEL: test_xori_vector		// CHECK-LABEL: test_xori_vector
func @test_xori_vector(%arg0 : vector<8xi64>, %arg1 : vector<8xi64>) -> vector<8xi64> {		func @test_xori_vector(%arg0 : vector<8xi64>, %arg1 : vector<8xi64>) -> vector<8xi64> {
%0 = arith.xori %arg0, %arg1 : vector<8xi64>		%0 = arith.xori %arg0, %arg1 : vector<8xi64>
return %0 : vector<8xi64>		return %0 : vector<8xi64>
}		}

		// CHECK-LABEL: test_xori_scalable_vector
		func @test_xori_scalable_vector(%arg0 : vector<[8]xi64>, %arg1 : vector<[8]xi64>) -> vector<[8]xi64> {
		%0 = arith.xori %arg0, %arg1 : vector<[8]xi64>
		return %0 : vector<[8]xi64>
		}

// CHECK-LABEL: test_ceildivsi		// CHECK-LABEL: test_ceildivsi
func @test_ceildivsi(%arg0 : i64, %arg1 : i64) -> i64 {		func @test_ceildivsi(%arg0 : i64, %arg1 : i64) -> i64 {
%0 = arith.ceildivsi %arg0, %arg1 : i64		%0 = arith.ceildivsi %arg0, %arg1 : i64
return %0 : i64		return %0 : i64
}		}

// CHECK-LABEL: test_ceildivsi_tensor		// CHECK-LABEL: test_ceildivsi_tensor
func @test_ceildivsi_tensor(%arg0 : tensor<8x8xi64>, %arg1 : tensor<8x8xi64>) -> tensor<8x8xi64> {		func @test_ceildivsi_tensor(%arg0 : tensor<8x8xi64>, %arg1 : tensor<8x8xi64>) -> tensor<8x8xi64> {
%0 = arith.ceildivsi %arg0, %arg1 : tensor<8x8xi64>		%0 = arith.ceildivsi %arg0, %arg1 : tensor<8x8xi64>
return %0 : tensor<8x8xi64>		return %0 : tensor<8x8xi64>
}		}

// CHECK-LABEL: test_ceildivsi_vector		// CHECK-LABEL: test_ceildivsi_vector
func @test_ceildivsi_vector(%arg0 : vector<8xi64>, %arg1 : vector<8xi64>) -> vector<8xi64> {		func @test_ceildivsi_vector(%arg0 : vector<8xi64>, %arg1 : vector<8xi64>) -> vector<8xi64> {
%0 = arith.ceildivsi %arg0, %arg1 : vector<8xi64>		%0 = arith.ceildivsi %arg0, %arg1 : vector<8xi64>
return %0 : vector<8xi64>		return %0 : vector<8xi64>
}		}

		// CHECK-LABEL: test_ceildivsi_scalable_vector
		func @test_ceildivsi_scalable_vector(%arg0 : vector<[8]xi64>, %arg1 : vector<[8]xi64>) -> vector<[8]xi64> {
		%0 = arith.ceildivsi %arg0, %arg1 : vector<[8]xi64>
		return %0 : vector<[8]xi64>
		}

// CHECK-LABEL: test_floordivsi		// CHECK-LABEL: test_floordivsi
func @test_floordivsi(%arg0 : i64, %arg1 : i64) -> i64 {		func @test_floordivsi(%arg0 : i64, %arg1 : i64) -> i64 {
%0 = arith.floordivsi %arg0, %arg1 : i64		%0 = arith.floordivsi %arg0, %arg1 : i64
return %0 : i64		return %0 : i64
}		}

// CHECK-LABEL: test_floordivsi_tensor		// CHECK-LABEL: test_floordivsi_tensor
func @test_floordivsi_tensor(%arg0 : tensor<8x8xi64>, %arg1 : tensor<8x8xi64>) -> tensor<8x8xi64> {		func @test_floordivsi_tensor(%arg0 : tensor<8x8xi64>, %arg1 : tensor<8x8xi64>) -> tensor<8x8xi64> {
%0 = arith.floordivsi %arg0, %arg1 : tensor<8x8xi64>		%0 = arith.floordivsi %arg0, %arg1 : tensor<8x8xi64>
return %0 : tensor<8x8xi64>		return %0 : tensor<8x8xi64>
}		}

// CHECK-LABEL: test_floordivsi_vector		// CHECK-LABEL: test_floordivsi_vector
func @test_floordivsi_vector(%arg0 : vector<8xi64>, %arg1 : vector<8xi64>) -> vector<8xi64> {		func @test_floordivsi_vector(%arg0 : vector<8xi64>, %arg1 : vector<8xi64>) -> vector<8xi64> {
%0 = arith.floordivsi %arg0, %arg1 : vector<8xi64>		%0 = arith.floordivsi %arg0, %arg1 : vector<8xi64>
return %0 : vector<8xi64>		return %0 : vector<8xi64>
}		}

		// CHECK-LABEL: test_floordivsi_scalable_vector
		func @test_floordivsi_scalable_vector(%arg0 : vector<[8]xi64>, %arg1 : vector<[8]xi64>) -> vector<[8]xi64> {
		%0 = arith.floordivsi %arg0, %arg1 : vector<[8]xi64>
		return %0 : vector<[8]xi64>
		}

// CHECK-LABEL: test_shli		// CHECK-LABEL: test_shli
func @test_shli(%arg0 : i64, %arg1 : i64) -> i64 {		func @test_shli(%arg0 : i64, %arg1 : i64) -> i64 {
%0 = arith.shli %arg0, %arg1 : i64		%0 = arith.shli %arg0, %arg1 : i64
return %0 : i64		return %0 : i64
}		}

// CHECK-LABEL: test_shli_tensor		// CHECK-LABEL: test_shli_tensor
func @test_shli_tensor(%arg0 : tensor<8x8xi64>, %arg1 : tensor<8x8xi64>) -> tensor<8x8xi64> {		func @test_shli_tensor(%arg0 : tensor<8x8xi64>, %arg1 : tensor<8x8xi64>) -> tensor<8x8xi64> {
%0 = arith.shli %arg0, %arg1 : tensor<8x8xi64>		%0 = arith.shli %arg0, %arg1 : tensor<8x8xi64>
return %0 : tensor<8x8xi64>		return %0 : tensor<8x8xi64>
}		}

// CHECK-LABEL: test_shli_vector		// CHECK-LABEL: test_shli_vector
func @test_shli_vector(%arg0 : vector<8xi64>, %arg1 : vector<8xi64>) -> vector<8xi64> {		func @test_shli_vector(%arg0 : vector<8xi64>, %arg1 : vector<8xi64>) -> vector<8xi64> {
%0 = arith.shli %arg0, %arg1 : vector<8xi64>		%0 = arith.shli %arg0, %arg1 : vector<8xi64>
return %0 : vector<8xi64>		return %0 : vector<8xi64>
}		}

		// CHECK-LABEL: test_shli_scalable_vector
		func @test_shli_scalable_vector(%arg0 : vector<[8]xi64>, %arg1 : vector<[8]xi64>) -> vector<[8]xi64> {
		%0 = arith.shli %arg0, %arg1 : vector<[8]xi64>
		return %0 : vector<[8]xi64>
		}

// CHECK-LABEL: test_shrui		// CHECK-LABEL: test_shrui
func @test_shrui(%arg0 : i64, %arg1 : i64) -> i64 {		func @test_shrui(%arg0 : i64, %arg1 : i64) -> i64 {
%0 = arith.shrui %arg0, %arg1 : i64		%0 = arith.shrui %arg0, %arg1 : i64
return %0 : i64		return %0 : i64
}		}

// CHECK-LABEL: test_shrui_tensor		// CHECK-LABEL: test_shrui_tensor
func @test_shrui_tensor(%arg0 : tensor<8x8xi64>, %arg1 : tensor<8x8xi64>) -> tensor<8x8xi64> {		func @test_shrui_tensor(%arg0 : tensor<8x8xi64>, %arg1 : tensor<8x8xi64>) -> tensor<8x8xi64> {
%0 = arith.shrui %arg0, %arg1 : tensor<8x8xi64>		%0 = arith.shrui %arg0, %arg1 : tensor<8x8xi64>
return %0 : tensor<8x8xi64>		return %0 : tensor<8x8xi64>
}		}

// CHECK-LABEL: test_shrui_vector		// CHECK-LABEL: test_shrui_vector
func @test_shrui_vector(%arg0 : vector<8xi64>, %arg1 : vector<8xi64>) -> vector<8xi64> {		func @test_shrui_vector(%arg0 : vector<8xi64>, %arg1 : vector<8xi64>) -> vector<8xi64> {
%0 = arith.shrui %arg0, %arg1 : vector<8xi64>		%0 = arith.shrui %arg0, %arg1 : vector<8xi64>
return %0 : vector<8xi64>		return %0 : vector<8xi64>
}		}

		// CHECK-LABEL: test_shrui_scalable_vector
		func @test_shrui_scalable_vector(%arg0 : vector<[8]xi64>, %arg1 : vector<[8]xi64>) -> vector<[8]xi64> {
		%0 = arith.shrui %arg0, %arg1 : vector<[8]xi64>
		return %0 : vector<[8]xi64>
		}

// CHECK-LABEL: test_shrsi		// CHECK-LABEL: test_shrsi
func @test_shrsi(%arg0 : i64, %arg1 : i64) -> i64 {		func @test_shrsi(%arg0 : i64, %arg1 : i64) -> i64 {
%0 = arith.shrsi %arg0, %arg1 : i64		%0 = arith.shrsi %arg0, %arg1 : i64
return %0 : i64		return %0 : i64
}		}

// CHECK-LABEL: test_shrsi_tensor		// CHECK-LABEL: test_shrsi_tensor
func @test_shrsi_tensor(%arg0 : tensor<8x8xi64>, %arg1 : tensor<8x8xi64>) -> tensor<8x8xi64> {		func @test_shrsi_tensor(%arg0 : tensor<8x8xi64>, %arg1 : tensor<8x8xi64>) -> tensor<8x8xi64> {
%0 = arith.shrsi %arg0, %arg1 : tensor<8x8xi64>		%0 = arith.shrsi %arg0, %arg1 : tensor<8x8xi64>
return %0 : tensor<8x8xi64>		return %0 : tensor<8x8xi64>
}		}

// CHECK-LABEL: test_shrsi_vector		// CHECK-LABEL: test_shrsi_vector
func @test_shrsi_vector(%arg0 : vector<8xi64>, %arg1 : vector<8xi64>) -> vector<8xi64> {		func @test_shrsi_vector(%arg0 : vector<8xi64>, %arg1 : vector<8xi64>) -> vector<8xi64> {
%0 = arith.shrsi %arg0, %arg1 : vector<8xi64>		%0 = arith.shrsi %arg0, %arg1 : vector<8xi64>
return %0 : vector<8xi64>		return %0 : vector<8xi64>
}		}

		// CHECK-LABEL: test_shrsi_scalable_vector
		func @test_shrsi_scalable_vector(%arg0 : vector<[8]xi64>, %arg1 : vector<[8]xi64>) -> vector<[8]xi64> {
		%0 = arith.shrsi %arg0, %arg1 : vector<[8]xi64>
		return %0 : vector<[8]xi64>
		}

// CHECK-LABEL: test_negf		// CHECK-LABEL: test_negf
func @test_negf(%arg0 : f64) -> f64 {		func @test_negf(%arg0 : f64) -> f64 {
%0 = arith.negf %arg0 : f64		%0 = arith.negf %arg0 : f64
return %0 : f64		return %0 : f64
}		}

// CHECK-LABEL: test_negf_tensor		// CHECK-LABEL: test_negf_tensor
func @test_negf_tensor(%arg0 : tensor<8x8xf64>) -> tensor<8x8xf64> {		func @test_negf_tensor(%arg0 : tensor<8x8xf64>) -> tensor<8x8xf64> {
%0 = arith.negf %arg0 : tensor<8x8xf64>		%0 = arith.negf %arg0 : tensor<8x8xf64>
return %0 : tensor<8x8xf64>		return %0 : tensor<8x8xf64>
}		}

// CHECK-LABEL: test_negf_vector		// CHECK-LABEL: test_negf_vector
func @test_negf_vector(%arg0 : vector<8xf64>) -> vector<8xf64> {		func @test_negf_vector(%arg0 : vector<8xf64>) -> vector<8xf64> {
%0 = arith.negf %arg0 : vector<8xf64>		%0 = arith.negf %arg0 : vector<8xf64>
return %0 : vector<8xf64>		return %0 : vector<8xf64>
}		}

		// CHECK-LABEL: test_negf_scalable_vector
		func @test_negf_scalable_vector(%arg0 : vector<[8]xf64>) -> vector<[8]xf64> {
		%0 = arith.negf %arg0 : vector<[8]xf64>
		return %0 : vector<[8]xf64>
		}

// CHECK-LABEL: test_addf		// CHECK-LABEL: test_addf
func @test_addf(%arg0 : f64, %arg1 : f64) -> f64 {		func @test_addf(%arg0 : f64, %arg1 : f64) -> f64 {
%0 = arith.addf %arg0, %arg1 : f64		%0 = arith.addf %arg0, %arg1 : f64
return %0 : f64		return %0 : f64
}		}

// CHECK-LABEL: test_addf_tensor		// CHECK-LABEL: test_addf_tensor
func @test_addf_tensor(%arg0 : tensor<8x8xf64>, %arg1 : tensor<8x8xf64>) -> tensor<8x8xf64> {		func @test_addf_tensor(%arg0 : tensor<8x8xf64>, %arg1 : tensor<8x8xf64>) -> tensor<8x8xf64> {
%0 = arith.addf %arg0, %arg1 : tensor<8x8xf64>		%0 = arith.addf %arg0, %arg1 : tensor<8x8xf64>
return %0 : tensor<8x8xf64>		return %0 : tensor<8x8xf64>
}		}

// CHECK-LABEL: test_addf_vector		// CHECK-LABEL: test_addf_vector
func @test_addf_vector(%arg0 : vector<8xf64>, %arg1 : vector<8xf64>) -> vector<8xf64> {		func @test_addf_vector(%arg0 : vector<8xf64>, %arg1 : vector<8xf64>) -> vector<8xf64> {
%0 = arith.addf %arg0, %arg1 : vector<8xf64>		%0 = arith.addf %arg0, %arg1 : vector<8xf64>
return %0 : vector<8xf64>		return %0 : vector<8xf64>
}		}

		// CHECK-LABEL: test_addf_scalable_vector
		func @test_addf_scalable_vector(%arg0 : vector<[8]xf64>, %arg1 : vector<[8]xf64>) -> vector<[8]xf64> {
		%0 = arith.addf %arg0, %arg1 : vector<[8]xf64>
		return %0 : vector<[8]xf64>
		}

// CHECK-LABEL: test_subf		// CHECK-LABEL: test_subf
func @test_subf(%arg0 : f64, %arg1 : f64) -> f64 {		func @test_subf(%arg0 : f64, %arg1 : f64) -> f64 {
%0 = arith.subf %arg0, %arg1 : f64		%0 = arith.subf %arg0, %arg1 : f64
return %0 : f64		return %0 : f64
}		}

// CHECK-LABEL: test_subf_tensor		// CHECK-LABEL: test_subf_tensor
func @test_subf_tensor(%arg0 : tensor<8x8xf64>, %arg1 : tensor<8x8xf64>) -> tensor<8x8xf64> {		func @test_subf_tensor(%arg0 : tensor<8x8xf64>, %arg1 : tensor<8x8xf64>) -> tensor<8x8xf64> {
%0 = arith.subf %arg0, %arg1 : tensor<8x8xf64>		%0 = arith.subf %arg0, %arg1 : tensor<8x8xf64>
return %0 : tensor<8x8xf64>		return %0 : tensor<8x8xf64>
}		}

// CHECK-LABEL: test_subf_vector		// CHECK-LABEL: test_subf_vector
func @test_subf_vector(%arg0 : vector<8xf64>, %arg1 : vector<8xf64>) -> vector<8xf64> {		func @test_subf_vector(%arg0 : vector<8xf64>, %arg1 : vector<8xf64>) -> vector<8xf64> {
%0 = arith.subf %arg0, %arg1 : vector<8xf64>		%0 = arith.subf %arg0, %arg1 : vector<8xf64>
return %0 : vector<8xf64>		return %0 : vector<8xf64>
}		}

		// CHECK-LABEL: test_subf_scalable_vector
		func @test_subf_scalable_vector(%arg0 : vector<[8]xf64>, %arg1 : vector<[8]xf64>) -> vector<[8]xf64> {
		%0 = arith.subf %arg0, %arg1 : vector<[8]xf64>
		return %0 : vector<[8]xf64>
		}

// CHECK-LABEL: test_mulf		// CHECK-LABEL: test_mulf
func @test_mulf(%arg0 : f64, %arg1 : f64) -> f64 {		func @test_mulf(%arg0 : f64, %arg1 : f64) -> f64 {
%0 = arith.mulf %arg0, %arg1 : f64		%0 = arith.mulf %arg0, %arg1 : f64
return %0 : f64		return %0 : f64
}		}

// CHECK-LABEL: test_mulf_tensor		// CHECK-LABEL: test_mulf_tensor
func @test_mulf_tensor(%arg0 : tensor<8x8xf64>, %arg1 : tensor<8x8xf64>) -> tensor<8x8xf64> {		func @test_mulf_tensor(%arg0 : tensor<8x8xf64>, %arg1 : tensor<8x8xf64>) -> tensor<8x8xf64> {
%0 = arith.mulf %arg0, %arg1 : tensor<8x8xf64>		%0 = arith.mulf %arg0, %arg1 : tensor<8x8xf64>
return %0 : tensor<8x8xf64>		return %0 : tensor<8x8xf64>
}		}

// CHECK-LABEL: test_mulf_vector		// CHECK-LABEL: test_mulf_vector
func @test_mulf_vector(%arg0 : vector<8xf64>, %arg1 : vector<8xf64>) -> vector<8xf64> {		func @test_mulf_vector(%arg0 : vector<8xf64>, %arg1 : vector<8xf64>) -> vector<8xf64> {
%0 = arith.mulf %arg0, %arg1 : vector<8xf64>		%0 = arith.mulf %arg0, %arg1 : vector<8xf64>
return %0 : vector<8xf64>		return %0 : vector<8xf64>
}		}

		// CHECK-LABEL: test_mulf_scalable_vector
		func @test_mulf_scalable_vector(%arg0 : vector<[8]xf64>, %arg1 : vector<[8]xf64>) -> vector<[8]xf64> {
		%0 = arith.mulf %arg0, %arg1 : vector<[8]xf64>
		return %0 : vector<[8]xf64>
		}

// CHECK-LABEL: test_divf		// CHECK-LABEL: test_divf
func @test_divf(%arg0 : f64, %arg1 : f64) -> f64 {		func @test_divf(%arg0 : f64, %arg1 : f64) -> f64 {
%0 = arith.divf %arg0, %arg1 : f64		%0 = arith.divf %arg0, %arg1 : f64
return %0 : f64		return %0 : f64
}		}

// CHECK-LABEL: test_divf_tensor		// CHECK-LABEL: test_divf_tensor
func @test_divf_tensor(%arg0 : tensor<8x8xf64>, %arg1 : tensor<8x8xf64>) -> tensor<8x8xf64> {		func @test_divf_tensor(%arg0 : tensor<8x8xf64>, %arg1 : tensor<8x8xf64>) -> tensor<8x8xf64> {
%0 = arith.divf %arg0, %arg1 : tensor<8x8xf64>		%0 = arith.divf %arg0, %arg1 : tensor<8x8xf64>
return %0 : tensor<8x8xf64>		return %0 : tensor<8x8xf64>
}		}

// CHECK-LABEL: test_divf_vector		// CHECK-LABEL: test_divf_vector
func @test_divf_vector(%arg0 : vector<8xf64>, %arg1 : vector<8xf64>) -> vector<8xf64> {		func @test_divf_vector(%arg0 : vector<8xf64>, %arg1 : vector<8xf64>) -> vector<8xf64> {
%0 = arith.divf %arg0, %arg1 : vector<8xf64>		%0 = arith.divf %arg0, %arg1 : vector<8xf64>
return %0 : vector<8xf64>		return %0 : vector<8xf64>
}		}

		// CHECK-LABEL: test_divf_scalable_vector
		func @test_divf_scalable_vector(%arg0 : vector<[8]xf64>, %arg1 : vector<[8]xf64>) -> vector<[8]xf64> {
		%0 = arith.divf %arg0, %arg1 : vector<[8]xf64>
		return %0 : vector<[8]xf64>
		}

// CHECK-LABEL: test_remf		// CHECK-LABEL: test_remf
func @test_remf(%arg0 : f64, %arg1 : f64) -> f64 {		func @test_remf(%arg0 : f64, %arg1 : f64) -> f64 {
%0 = arith.remf %arg0, %arg1 : f64		%0 = arith.remf %arg0, %arg1 : f64
return %0 : f64		return %0 : f64
}		}

// CHECK-LABEL: test_remf_tensor		// CHECK-LABEL: test_remf_tensor
func @test_remf_tensor(%arg0 : tensor<8x8xf64>, %arg1 : tensor<8x8xf64>) -> tensor<8x8xf64> {		func @test_remf_tensor(%arg0 : tensor<8x8xf64>, %arg1 : tensor<8x8xf64>) -> tensor<8x8xf64> {
%0 = arith.remf %arg0, %arg1 : tensor<8x8xf64>		%0 = arith.remf %arg0, %arg1 : tensor<8x8xf64>
return %0 : tensor<8x8xf64>		return %0 : tensor<8x8xf64>
}		}

// CHECK-LABEL: test_remf_vector		// CHECK-LABEL: test_remf_vector
func @test_remf_vector(%arg0 : vector<8xf64>, %arg1 : vector<8xf64>) -> vector<8xf64> {		func @test_remf_vector(%arg0 : vector<8xf64>, %arg1 : vector<8xf64>) -> vector<8xf64> {
%0 = arith.remf %arg0, %arg1 : vector<8xf64>		%0 = arith.remf %arg0, %arg1 : vector<8xf64>
return %0 : vector<8xf64>		return %0 : vector<8xf64>
}		}

		// CHECK-LABEL: test_remf_scalable_vector
		func @test_remf_scalable_vector(%arg0 : vector<[8]xf64>, %arg1 : vector<[8]xf64>) -> vector<[8]xf64> {
		%0 = arith.remf %arg0, %arg1 : vector<[8]xf64>
		return %0 : vector<[8]xf64>
		}

// CHECK-LABEL: test_extui		// CHECK-LABEL: test_extui
func @test_extui(%arg0 : i32) -> i64 {		func @test_extui(%arg0 : i32) -> i64 {
%0 = arith.extui %arg0 : i32 to i64		%0 = arith.extui %arg0 : i32 to i64
return %0 : i64		return %0 : i64
}		}

// CHECK-LABEL: test_extui_tensor		// CHECK-LABEL: test_extui_tensor
func @test_extui_tensor(%arg0 : tensor<8x8xi32>) -> tensor<8x8xi64> {		func @test_extui_tensor(%arg0 : tensor<8x8xi32>) -> tensor<8x8xi64> {
%0 = arith.extui %arg0 : tensor<8x8xi32> to tensor<8x8xi64>		%0 = arith.extui %arg0 : tensor<8x8xi32> to tensor<8x8xi64>
return %0 : tensor<8x8xi64>		return %0 : tensor<8x8xi64>
}		}

// CHECK-LABEL: test_extui_vector		// CHECK-LABEL: test_extui_vector
func @test_extui_vector(%arg0 : vector<8xi32>) -> vector<8xi64> {		func @test_extui_vector(%arg0 : vector<8xi32>) -> vector<8xi64> {
%0 = arith.extui %arg0 : vector<8xi32> to vector<8xi64>		%0 = arith.extui %arg0 : vector<8xi32> to vector<8xi64>
return %0 : vector<8xi64>		return %0 : vector<8xi64>
}		}

		// CHECK-LABEL: test_extui_scalable_vector
		func @test_extui_scalable_vector(%arg0 : vector<[8]xi32>) -> vector<[8]xi64> {
		%0 = arith.extui %arg0 : vector<[8]xi32> to vector<[8]xi64>
		return %0 : vector<[8]xi64>
		}

// CHECK-LABEL: test_extsi		// CHECK-LABEL: test_extsi
func @test_extsi(%arg0 : i32) -> i64 {		func @test_extsi(%arg0 : i32) -> i64 {
%0 = arith.extsi %arg0 : i32 to i64		%0 = arith.extsi %arg0 : i32 to i64
return %0 : i64		return %0 : i64
}		}

// CHECK-LABEL: test_extsi_tensor		// CHECK-LABEL: test_extsi_tensor
func @test_extsi_tensor(%arg0 : tensor<8x8xi32>) -> tensor<8x8xi64> {		func @test_extsi_tensor(%arg0 : tensor<8x8xi32>) -> tensor<8x8xi64> {
%0 = arith.extsi %arg0 : tensor<8x8xi32> to tensor<8x8xi64>		%0 = arith.extsi %arg0 : tensor<8x8xi32> to tensor<8x8xi64>
return %0 : tensor<8x8xi64>		return %0 : tensor<8x8xi64>
}		}

// CHECK-LABEL: test_extsi_vector		// CHECK-LABEL: test_extsi_vector
func @test_extsi_vector(%arg0 : vector<8xi32>) -> vector<8xi64> {		func @test_extsi_vector(%arg0 : vector<8xi32>) -> vector<8xi64> {
%0 = arith.extsi %arg0 : vector<8xi32> to vector<8xi64>		%0 = arith.extsi %arg0 : vector<8xi32> to vector<8xi64>
return %0 : vector<8xi64>		return %0 : vector<8xi64>
}		}

		// CHECK-LABEL: test_extsi_scalable_vector
		func @test_extsi_scalable_vector(%arg0 : vector<[8]xi32>) -> vector<[8]xi64> {
		%0 = arith.extsi %arg0 : vector<[8]xi32> to vector<[8]xi64>
		return %0 : vector<[8]xi64>
		}

// CHECK-LABEL: test_extf		// CHECK-LABEL: test_extf
func @test_extf(%arg0 : f32) -> f64 {		func @test_extf(%arg0 : f32) -> f64 {
%0 = arith.extf %arg0 : f32 to f64		%0 = arith.extf %arg0 : f32 to f64
return %0 : f64		return %0 : f64
}		}

// CHECK-LABEL: test_extf_tensor		// CHECK-LABEL: test_extf_tensor
func @test_extf_tensor(%arg0 : tensor<8x8xf32>) -> tensor<8x8xf64> {		func @test_extf_tensor(%arg0 : tensor<8x8xf32>) -> tensor<8x8xf64> {
%0 = arith.extf %arg0 : tensor<8x8xf32> to tensor<8x8xf64>		%0 = arith.extf %arg0 : tensor<8x8xf32> to tensor<8x8xf64>
return %0 : tensor<8x8xf64>		return %0 : tensor<8x8xf64>
}		}

// CHECK-LABEL: test_extf_vector		// CHECK-LABEL: test_extf_vector
func @test_extf_vector(%arg0 : vector<8xf32>) -> vector<8xf64> {		func @test_extf_vector(%arg0 : vector<8xf32>) -> vector<8xf64> {
%0 = arith.extf %arg0 : vector<8xf32> to vector<8xf64>		%0 = arith.extf %arg0 : vector<8xf32> to vector<8xf64>
return %0 : vector<8xf64>		return %0 : vector<8xf64>
}		}

		// CHECK-LABEL: test_extf_scalable_vector
		func @test_extf_scalable_vector(%arg0 : vector<[8]xf32>) -> vector<[8]xf64> {
		%0 = arith.extf %arg0 : vector<[8]xf32> to vector<[8]xf64>
		return %0 : vector<[8]xf64>
		}

// CHECK-LABEL: test_trunci		// CHECK-LABEL: test_trunci
func @test_trunci(%arg0 : i32) -> i16 {		func @test_trunci(%arg0 : i32) -> i16 {
%0 = arith.trunci %arg0 : i32 to i16		%0 = arith.trunci %arg0 : i32 to i16
return %0 : i16		return %0 : i16
}		}

// CHECK-LABEL: test_trunci_tensor		// CHECK-LABEL: test_trunci_tensor
func @test_trunci_tensor(%arg0 : tensor<8x8xi32>) -> tensor<8x8xi16> {		func @test_trunci_tensor(%arg0 : tensor<8x8xi32>) -> tensor<8x8xi16> {
%0 = arith.trunci %arg0 : tensor<8x8xi32> to tensor<8x8xi16>		%0 = arith.trunci %arg0 : tensor<8x8xi32> to tensor<8x8xi16>
return %0 : tensor<8x8xi16>		return %0 : tensor<8x8xi16>
}		}

// CHECK-LABEL: test_trunci_vector		// CHECK-LABEL: test_trunci_vector
func @test_trunci_vector(%arg0 : vector<8xi32>) -> vector<8xi16> {		func @test_trunci_vector(%arg0 : vector<8xi32>) -> vector<8xi16> {
%0 = arith.trunci %arg0 : vector<8xi32> to vector<8xi16>		%0 = arith.trunci %arg0 : vector<8xi32> to vector<8xi16>
return %0 : vector<8xi16>		return %0 : vector<8xi16>
}		}

		// CHECK-LABEL: test_trunci_scalable_vector
		func @test_trunci_scalable_vector(%arg0 : vector<[8]xi32>) -> vector<[8]xi16> {
		%0 = arith.trunci %arg0 : vector<[8]xi32> to vector<[8]xi16>
		return %0 : vector<[8]xi16>
		}

// CHECK-LABEL: test_truncf		// CHECK-LABEL: test_truncf
func @test_truncf(%arg0 : f32) -> bf16 {		func @test_truncf(%arg0 : f32) -> bf16 {
%0 = arith.truncf %arg0 : f32 to bf16		%0 = arith.truncf %arg0 : f32 to bf16
return %0 : bf16		return %0 : bf16
}		}

// CHECK-LABEL: test_truncf_tensor		// CHECK-LABEL: test_truncf_tensor
func @test_truncf_tensor(%arg0 : tensor<8x8xf32>) -> tensor<8x8xbf16> {		func @test_truncf_tensor(%arg0 : tensor<8x8xf32>) -> tensor<8x8xbf16> {
%0 = arith.truncf %arg0 : tensor<8x8xf32> to tensor<8x8xbf16>		%0 = arith.truncf %arg0 : tensor<8x8xf32> to tensor<8x8xbf16>
return %0 : tensor<8x8xbf16>		return %0 : tensor<8x8xbf16>
}		}

// CHECK-LABEL: test_truncf_vector		// CHECK-LABEL: test_truncf_vector
func @test_truncf_vector(%arg0 : vector<8xf32>) -> vector<8xbf16> {		func @test_truncf_vector(%arg0 : vector<8xf32>) -> vector<8xbf16> {
%0 = arith.truncf %arg0 : vector<8xf32> to vector<8xbf16>		%0 = arith.truncf %arg0 : vector<8xf32> to vector<8xbf16>
return %0 : vector<8xbf16>		return %0 : vector<8xbf16>
}		}

		// CHECK-LABEL: test_truncf_scalable_vector
		func @test_truncf_scalable_vector(%arg0 : vector<[8]xf32>) -> vector<[8]xbf16> {
		%0 = arith.truncf %arg0 : vector<[8]xf32> to vector<[8]xbf16>
		return %0 : vector<[8]xbf16>
		}

// CHECK-LABEL: test_uitofp		// CHECK-LABEL: test_uitofp
func @test_uitofp(%arg0 : i32) -> f32 {		func @test_uitofp(%arg0 : i32) -> f32 {
%0 = arith.uitofp %arg0 : i32 to f32		%0 = arith.uitofp %arg0 : i32 to f32
return %0 : f32		return %0 : f32
}		}

// CHECK-LABEL: test_uitofp_tensor		// CHECK-LABEL: test_uitofp_tensor
func @test_uitofp_tensor(%arg0 : tensor<8x8xi32>) -> tensor<8x8xf32> {		func @test_uitofp_tensor(%arg0 : tensor<8x8xi32>) -> tensor<8x8xf32> {
%0 = arith.uitofp %arg0 : tensor<8x8xi32> to tensor<8x8xf32>		%0 = arith.uitofp %arg0 : tensor<8x8xi32> to tensor<8x8xf32>
return %0 : tensor<8x8xf32>		return %0 : tensor<8x8xf32>
}		}

// CHECK-LABEL: test_uitofp_vector		// CHECK-LABEL: test_uitofp_vector
func @test_uitofp_vector(%arg0 : vector<8xi32>) -> vector<8xf32> {		func @test_uitofp_vector(%arg0 : vector<8xi32>) -> vector<8xf32> {
%0 = arith.uitofp %arg0 : vector<8xi32> to vector<8xf32>		%0 = arith.uitofp %arg0 : vector<8xi32> to vector<8xf32>
return %0 : vector<8xf32>		return %0 : vector<8xf32>
}		}

		// CHECK-LABEL: test_uitofp_scalable_vector
		func @test_uitofp_scalable_vector(%arg0 : vector<[8]xi32>) -> vector<[8]xf32> {
		%0 = arith.uitofp %arg0 : vector<[8]xi32> to vector<[8]xf32>
		return %0 : vector<[8]xf32>
		}

// CHECK-LABEL: test_sitofp		// CHECK-LABEL: test_sitofp
func @test_sitofp(%arg0 : i16) -> f64 {		func @test_sitofp(%arg0 : i16) -> f64 {
%0 = arith.sitofp %arg0 : i16 to f64		%0 = arith.sitofp %arg0 : i16 to f64
return %0 : f64		return %0 : f64
}		}

// CHECK-LABEL: test_sitofp_tensor		// CHECK-LABEL: test_sitofp_tensor
func @test_sitofp_tensor(%arg0 : tensor<8x8xi16>) -> tensor<8x8xf64> {		func @test_sitofp_tensor(%arg0 : tensor<8x8xi16>) -> tensor<8x8xf64> {
%0 = arith.sitofp %arg0 : tensor<8x8xi16> to tensor<8x8xf64>		%0 = arith.sitofp %arg0 : tensor<8x8xi16> to tensor<8x8xf64>
return %0 : tensor<8x8xf64>		return %0 : tensor<8x8xf64>
}		}

// CHECK-LABEL: test_sitofp_vector		// CHECK-LABEL: test_sitofp_vector
func @test_sitofp_vector(%arg0 : vector<8xi16>) -> vector<8xf64> {		func @test_sitofp_vector(%arg0 : vector<8xi16>) -> vector<8xf64> {
%0 = arith.sitofp %arg0 : vector<8xi16> to vector<8xf64>		%0 = arith.sitofp %arg0 : vector<8xi16> to vector<8xf64>
return %0 : vector<8xf64>		return %0 : vector<8xf64>
}		}

		// CHECK-LABEL: test_sitofp_scalable_vector
		func @test_sitofp_scalable_vector(%arg0 : vector<[8]xi16>) -> vector<[8]xf64> {
		%0 = arith.sitofp %arg0 : vector<[8]xi16> to vector<[8]xf64>
		return %0 : vector<[8]xf64>
		}

// CHECK-LABEL: test_fptoui		// CHECK-LABEL: test_fptoui
func @test_fptoui(%arg0 : bf16) -> i8 {		func @test_fptoui(%arg0 : bf16) -> i8 {
%0 = arith.fptoui %arg0 : bf16 to i8		%0 = arith.fptoui %arg0 : bf16 to i8
return %0 : i8		return %0 : i8
}		}

// CHECK-LABEL: test_fptoui_tensor		// CHECK-LABEL: test_fptoui_tensor
func @test_fptoui_tensor(%arg0 : tensor<8x8xbf16>) -> tensor<8x8xi8> {		func @test_fptoui_tensor(%arg0 : tensor<8x8xbf16>) -> tensor<8x8xi8> {
%0 = arith.fptoui %arg0 : tensor<8x8xbf16> to tensor<8x8xi8>		%0 = arith.fptoui %arg0 : tensor<8x8xbf16> to tensor<8x8xi8>
return %0 : tensor<8x8xi8>		return %0 : tensor<8x8xi8>
}		}

// CHECK-LABEL: test_fptoui_vector		// CHECK-LABEL: test_fptoui_vector
func @test_fptoui_vector(%arg0 : vector<8xbf16>) -> vector<8xi8> {		func @test_fptoui_vector(%arg0 : vector<8xbf16>) -> vector<8xi8> {
%0 = arith.fptoui %arg0 : vector<8xbf16> to vector<8xi8>		%0 = arith.fptoui %arg0 : vector<8xbf16> to vector<8xi8>
return %0 : vector<8xi8>		return %0 : vector<8xi8>
}		}

		// CHECK-LABEL: test_fptoui_scalable_vector
		func @test_fptoui_scalable_vector(%arg0 : vector<[8]xbf16>) -> vector<[8]xi8> {
		%0 = arith.fptoui %arg0 : vector<[8]xbf16> to vector<[8]xi8>
		return %0 : vector<[8]xi8>
		}

// CHECK-LABEL: test_fptosi		// CHECK-LABEL: test_fptosi
func @test_fptosi(%arg0 : f64) -> i64 {		func @test_fptosi(%arg0 : f64) -> i64 {
%0 = arith.fptosi %arg0 : f64 to i64		%0 = arith.fptosi %arg0 : f64 to i64
return %0 : i64		return %0 : i64
}		}

// CHECK-LABEL: test_fptosi_tensor		// CHECK-LABEL: test_fptosi_tensor
func @test_fptosi_tensor(%arg0 : tensor<8x8xf64>) -> tensor<8x8xi64> {		func @test_fptosi_tensor(%arg0 : tensor<8x8xf64>) -> tensor<8x8xi64> {
%0 = arith.fptosi %arg0 : tensor<8x8xf64> to tensor<8x8xi64>		%0 = arith.fptosi %arg0 : tensor<8x8xf64> to tensor<8x8xi64>
return %0 : tensor<8x8xi64>		return %0 : tensor<8x8xi64>
}		}

// CHECK-LABEL: test_fptosi_vector		// CHECK-LABEL: test_fptosi_vector
func @test_fptosi_vector(%arg0 : vector<8xf64>) -> vector<8xi64> {		func @test_fptosi_vector(%arg0 : vector<8xf64>) -> vector<8xi64> {
%0 = arith.fptosi %arg0 : vector<8xf64> to vector<8xi64>		%0 = arith.fptosi %arg0 : vector<8xf64> to vector<8xi64>
return %0 : vector<8xi64>		return %0 : vector<8xi64>
}		}

		// CHECK-LABEL: test_fptosi_scalable_vector
		func @test_fptosi_scalable_vector(%arg0 : vector<[8]xf64>) -> vector<[8]xi64> {
		%0 = arith.fptosi %arg0 : vector<[8]xf64> to vector<[8]xi64>
		return %0 : vector<[8]xi64>
		}

// CHECK-LABEL: test_index_cast0		// CHECK-LABEL: test_index_cast0
func @test_index_cast0(%arg0 : i32) -> index {		func @test_index_cast0(%arg0 : i32) -> index {
%0 = arith.index_cast %arg0 : i32 to index		%0 = arith.index_cast %arg0 : i32 to index
return %0 : index		return %0 : index
}		}

// CHECK-LABEL: test_index_cast_tensor0		// CHECK-LABEL: test_index_cast_tensor0
func @test_index_cast_tensor0(%arg0 : tensor<8x8xi32>) -> tensor<8x8xindex> {		func @test_index_cast_tensor0(%arg0 : tensor<8x8xi32>) -> tensor<8x8xindex> {
%0 = arith.index_cast %arg0 : tensor<8x8xi32> to tensor<8x8xindex>		%0 = arith.index_cast %arg0 : tensor<8x8xi32> to tensor<8x8xindex>
return %0 : tensor<8x8xindex>		return %0 : tensor<8x8xindex>
}		}

// CHECK-LABEL: test_index_cast_vector0		// CHECK-LABEL: test_index_cast_vector0
func @test_index_cast_vector0(%arg0 : vector<8xi32>) -> vector<8xindex> {		func @test_index_cast_vector0(%arg0 : vector<8xi32>) -> vector<8xindex> {
%0 = arith.index_cast %arg0 : vector<8xi32> to vector<8xindex>		%0 = arith.index_cast %arg0 : vector<8xi32> to vector<8xindex>
return %0 : vector<8xindex>		return %0 : vector<8xindex>
}		}

		// CHECK-LABEL: test_index_cast_scalable_vector0
		func @test_index_cast_scalable_vector0(%arg0 : vector<[8]xi32>) -> vector<[8]xindex> {
		%0 = arith.index_cast %arg0 : vector<[8]xi32> to vector<[8]xindex>
		return %0 : vector<[8]xindex>
		}

// CHECK-LABEL: test_index_cast1		// CHECK-LABEL: test_index_cast1
func @test_index_cast1(%arg0 : index) -> i64 {		func @test_index_cast1(%arg0 : index) -> i64 {
%0 = arith.index_cast %arg0 : index to i64		%0 = arith.index_cast %arg0 : index to i64
return %0 : i64		return %0 : i64
}		}

// CHECK-LABEL: test_index_cast_tensor1		// CHECK-LABEL: test_index_cast_tensor1
func @test_index_cast_tensor1(%arg0 : tensor<8x8xindex>) -> tensor<8x8xi64> {		func @test_index_cast_tensor1(%arg0 : tensor<8x8xindex>) -> tensor<8x8xi64> {
%0 = arith.index_cast %arg0 : tensor<8x8xindex> to tensor<8x8xi64>		%0 = arith.index_cast %arg0 : tensor<8x8xindex> to tensor<8x8xi64>
return %0 : tensor<8x8xi64>		return %0 : tensor<8x8xi64>
}		}

// CHECK-LABEL: test_index_cast_vector1		// CHECK-LABEL: test_index_cast_vector1
func @test_index_cast_vector1(%arg0 : vector<8xindex>) -> vector<8xi64> {		func @test_index_cast_vector1(%arg0 : vector<8xindex>) -> vector<8xi64> {
%0 = arith.index_cast %arg0 : vector<8xindex> to vector<8xi64>		%0 = arith.index_cast %arg0 : vector<8xindex> to vector<8xi64>
return %0 : vector<8xi64>		return %0 : vector<8xi64>
}		}

		// CHECK-LABEL: test_index_cast_scalable_vector1
		func @test_index_cast_scalable_vector1(%arg0 : vector<[8]xindex>) -> vector<[8]xi64> {
		%0 = arith.index_cast %arg0 : vector<[8]xindex> to vector<[8]xi64>
		return %0 : vector<[8]xi64>
		}

// CHECK-LABEL: test_bitcast0		// CHECK-LABEL: test_bitcast0
func @test_bitcast0(%arg0 : i64) -> f64 {		func @test_bitcast0(%arg0 : i64) -> f64 {
%0 = arith.bitcast %arg0 : i64 to f64		%0 = arith.bitcast %arg0 : i64 to f64
return %0 : f64		return %0 : f64
}		}

// CHECK-LABEL: test_bitcast_tensor0		// CHECK-LABEL: test_bitcast_tensor0
func @test_bitcast_tensor0(%arg0 : tensor<8x8xi64>) -> tensor<8x8xf64> {		func @test_bitcast_tensor0(%arg0 : tensor<8x8xi64>) -> tensor<8x8xf64> {
%0 = arith.bitcast %arg0 : tensor<8x8xi64> to tensor<8x8xf64>		%0 = arith.bitcast %arg0 : tensor<8x8xi64> to tensor<8x8xf64>
return %0 : tensor<8x8xf64>		return %0 : tensor<8x8xf64>
}		}

// CHECK-LABEL: test_bitcast_vector0		// CHECK-LABEL: test_bitcast_vector0
func @test_bitcast_vector0(%arg0 : vector<8xi64>) -> vector<8xf64> {		func @test_bitcast_vector0(%arg0 : vector<8xi64>) -> vector<8xf64> {
%0 = arith.bitcast %arg0 : vector<8xi64> to vector<8xf64>		%0 = arith.bitcast %arg0 : vector<8xi64> to vector<8xf64>
return %0 : vector<8xf64>		return %0 : vector<8xf64>
}		}

		// CHECK-LABEL: test_bitcast_scalable_vector0
		func @test_bitcast_scalable_vector0(%arg0 : vector<[8]xi64>) -> vector<[8]xf64> {
		%0 = arith.bitcast %arg0 : vector<[8]xi64> to vector<[8]xf64>
		return %0 : vector<[8]xf64>
		}

// CHECK-LABEL: test_bitcast1		// CHECK-LABEL: test_bitcast1
func @test_bitcast1(%arg0 : f32) -> i32 {		func @test_bitcast1(%arg0 : f32) -> i32 {
%0 = arith.bitcast %arg0 : f32 to i32		%0 = arith.bitcast %arg0 : f32 to i32
return %0 : i32		return %0 : i32
}		}

// CHECK-LABEL: test_bitcast_tensor1		// CHECK-LABEL: test_bitcast_tensor1
func @test_bitcast_tensor1(%arg0 : tensor<8x8xf32>) -> tensor<8x8xi32> {		func @test_bitcast_tensor1(%arg0 : tensor<8x8xf32>) -> tensor<8x8xi32> {
%0 = arith.bitcast %arg0 : tensor<8x8xf32> to tensor<8x8xi32>		%0 = arith.bitcast %arg0 : tensor<8x8xf32> to tensor<8x8xi32>
return %0 : tensor<8x8xi32>		return %0 : tensor<8x8xi32>
}		}

// CHECK-LABEL: test_bitcast_vector1		// CHECK-LABEL: test_bitcast_vector1
func @test_bitcast_vector1(%arg0 : vector<8xf32>) -> vector<8xi32> {		func @test_bitcast_vector1(%arg0 : vector<8xf32>) -> vector<8xi32> {
%0 = arith.bitcast %arg0 : vector<8xf32> to vector<8xi32>		%0 = arith.bitcast %arg0 : vector<8xf32> to vector<8xi32>
return %0 : vector<8xi32>		return %0 : vector<8xi32>
}		}

		// CHECK-LABEL: test_bitcast_scalable_vector1
		func @test_bitcast_scalable_vector1(%arg0 : vector<[8]xf32>) -> vector<[8]xi32> {
		%0 = arith.bitcast %arg0 : vector<[8]xf32> to vector<[8]xi32>
		return %0 : vector<[8]xi32>
		}

// CHECK-LABEL: test_cmpi		// CHECK-LABEL: test_cmpi
func @test_cmpi(%arg0 : i64, %arg1 : i64) -> i1 {		func @test_cmpi(%arg0 : i64, %arg1 : i64) -> i1 {
%0 = arith.cmpi ne, %arg0, %arg1 : i64		%0 = arith.cmpi ne, %arg0, %arg1 : i64
return %0 : i1		return %0 : i1
}		}

// CHECK-LABEL: test_cmpi_tensor		// CHECK-LABEL: test_cmpi_tensor
func @test_cmpi_tensor(%arg0 : tensor<8x8xi64>, %arg1 : tensor<8x8xi64>) -> tensor<8x8xi1> {		func @test_cmpi_tensor(%arg0 : tensor<8x8xi64>, %arg1 : tensor<8x8xi64>) -> tensor<8x8xi1> {
%0 = arith.cmpi slt, %arg0, %arg1 : tensor<8x8xi64>		%0 = arith.cmpi slt, %arg0, %arg1 : tensor<8x8xi64>
return %0 : tensor<8x8xi1>		return %0 : tensor<8x8xi1>
}		}

// CHECK-LABEL: test_cmpi_vector		// CHECK-LABEL: test_cmpi_vector
func @test_cmpi_vector(%arg0 : vector<8xi64>, %arg1 : vector<8xi64>) -> vector<8xi1> {		func @test_cmpi_vector(%arg0 : vector<8xi64>, %arg1 : vector<8xi64>) -> vector<8xi1> {
%0 = arith.cmpi ult, %arg0, %arg1 : vector<8xi64>		%0 = arith.cmpi ult, %arg0, %arg1 : vector<8xi64>
return %0 : vector<8xi1>		return %0 : vector<8xi1>
}		}

		// CHECK-LABEL: test_cmpi_scalable_vector
		func @test_cmpi_scalable_vector(%arg0 : vector<[8]xi64>, %arg1 : vector<[8]xi64>) -> vector<[8]xi1> {
		%0 = arith.cmpi ult, %arg0, %arg1 : vector<[8]xi64>
		return %0 : vector<[8]xi1>
		}

// CHECK-LABEL: test_cmpi_vector_0d		// CHECK-LABEL: test_cmpi_vector_0d
func @test_cmpi_vector_0d(%arg0 : vector<i64>, %arg1 : vector<i64>) -> vector<i1> {		func @test_cmpi_vector_0d(%arg0 : vector<i64>, %arg1 : vector<i64>) -> vector<i1> {
%0 = arith.cmpi ult, %arg0, %arg1 : vector<i64>		%0 = arith.cmpi ult, %arg0, %arg1 : vector<i64>
return %0 : vector<i1>		return %0 : vector<i1>
}		}

// CHECK-LABEL: test_cmpf		// CHECK-LABEL: test_cmpf
func @test_cmpf(%arg0 : f64, %arg1 : f64) -> i1 {		func @test_cmpf(%arg0 : f64, %arg1 : f64) -> i1 {
%0 = arith.cmpf oeq, %arg0, %arg1 : f64		%0 = arith.cmpf oeq, %arg0, %arg1 : f64
return %0 : i1		return %0 : i1
}		}

// CHECK-LABEL: test_cmpf_tensor		// CHECK-LABEL: test_cmpf_tensor
func @test_cmpf_tensor(%arg0 : tensor<8x8xf64>, %arg1 : tensor<8x8xf64>) -> tensor<8x8xi1> {		func @test_cmpf_tensor(%arg0 : tensor<8x8xf64>, %arg1 : tensor<8x8xf64>) -> tensor<8x8xi1> {
%0 = arith.cmpf olt, %arg0, %arg1 : tensor<8x8xf64>		%0 = arith.cmpf olt, %arg0, %arg1 : tensor<8x8xf64>
return %0 : tensor<8x8xi1>		return %0 : tensor<8x8xi1>
}		}

// CHECK-LABEL: test_cmpf_vector		// CHECK-LABEL: test_cmpf_vector
func @test_cmpf_vector(%arg0 : vector<8xf64>, %arg1 : vector<8xf64>) -> vector<8xi1> {		func @test_cmpf_vector(%arg0 : vector<8xf64>, %arg1 : vector<8xf64>) -> vector<8xi1> {
%0 = arith.cmpf ult, %arg0, %arg1 : vector<8xf64>		%0 = arith.cmpf ult, %arg0, %arg1 : vector<8xf64>
return %0 : vector<8xi1>		return %0 : vector<8xi1>
}		}

		// CHECK-LABEL: test_cmpf_scalable_vector
		func @test_cmpf_scalable_vector(%arg0 : vector<[8]xf64>, %arg1 : vector<[8]xf64>) -> vector<[8]xi1> {
		%0 = arith.cmpf ult, %arg0, %arg1 : vector<[8]xf64>
		return %0 : vector<[8]xi1>
		}

// CHECK-LABEL: test_index_cast		// CHECK-LABEL: test_index_cast
func @test_index_cast(%arg0 : index) -> i64 {		func @test_index_cast(%arg0 : index) -> i64 {
%0 = arith.index_cast %arg0 : index to i64		%0 = arith.index_cast %arg0 : index to i64
return %0 : i64		return %0 : i64
}		}

// CHECK-LABEL: test_index_cast_tensor		// CHECK-LABEL: test_index_cast_tensor
func @test_index_cast_tensor(%arg0 : tensor<index>) -> tensor<i64> {		func @test_index_cast_tensor(%arg0 : tensor<index>) -> tensor<i64> {
▲ Show 20 Lines • Show All 42 Lines • ▼ Show 20 Lines	func @test_constant() -> () {
// CHECK: %false = arith.constant false		// CHECK: %false = arith.constant false
%8 = arith.constant false		%8 = arith.constant false

return		return
}		}

// CHECK-LABEL: func @maximum		// CHECK-LABEL: func @maximum
func @maximum(%v1: vector<4xf32>, %v2: vector<4xf32>,		func @maximum(%v1: vector<4xf32>, %v2: vector<4xf32>,
		%sv1: vector<[4]xf32>, %sv2: vector<[4]xf32>,
%f1: f32, %f2: f32,		%f1: f32, %f2: f32,
%i1: i32, %i2: i32) {		%i1: i32, %i2: i32) {
%max_vector = arith.maxf %v1, %v2 : vector<4xf32>		%max_vector = arith.maxf %v1, %v2 : vector<4xf32>
		%max_scalable_vector = arith.maxf %sv1, %sv2 : vector<[4]xf32>
%max_float = arith.maxf %f1, %f2 : f32		%max_float = arith.maxf %f1, %f2 : f32
%max_signed = arith.maxsi %i1, %i2 : i32		%max_signed = arith.maxsi %i1, %i2 : i32
%max_unsigned = arith.maxui %i1, %i2 : i32		%max_unsigned = arith.maxui %i1, %i2 : i32
return		return
}		}

// CHECK-LABEL: func @minimum		// CHECK-LABEL: func @minimum
func @minimum(%v1: vector<4xf32>, %v2: vector<4xf32>,		func @minimum(%v1: vector<4xf32>, %v2: vector<4xf32>,
		%sv1: vector<[4]xf32>, %sv2: vector<[4]xf32>,
%f1: f32, %f2: f32,		%f1: f32, %f2: f32,
%i1: i32, %i2: i32) {		%i1: i32, %i2: i32) {
%min_vector = arith.minf %v1, %v2 : vector<4xf32>		%min_vector = arith.minf %v1, %v2 : vector<4xf32>
		%min_scalable_vector = arith.minf %sv1, %sv2 : vector<[4]xf32>
%min_float = arith.minf %f1, %f2 : f32		%min_float = arith.minf %f1, %f2 : f32
%min_signed = arith.minsi %i1, %i2 : i32		%min_signed = arith.minsi %i1, %i2 : i32
%min_unsigned = arith.minui %i1, %i2 : i32		%min_unsigned = arith.minui %i1, %i2 : i32
return		return
}		}

mlir/test/Dialect/ArmSVE/legalize-for-llvm.mlir

	// RUN: mlir-opt %s -convert-vector-to-llvm="enable-arm-sve" -convert-std-to-llvm \| mlir-opt \| FileCheck %s			// RUN: mlir-opt %s -convert-vector-to-llvm="enable-arm-sve" -convert-std-to-llvm -reconcile-unrealized-casts \| mlir-opt \| FileCheck %s
				nicolasvasilacheUnsubmitted Done Reply Inline Actions I would expect to see a test file (somwhere in the builtin stuff) where you have both: negative tests for various failure modes of misuses of scalable vectors (with appropriate error messages) positive tests with multi-dim multi-scale vector (atm everything I see is 0-dim 1-scale only). In a followup PR, I'd love to see a 1-dim, 2-scale version of the neon 2d dot (or something equivalent) and see it lower to unrolled LLVM. nicolasvasilache: I would expect to see a test file (somwhere in the builtin stuff) where you have both…
				jsetoainAuthorUnsubmitted Done Reply Inline Actions RE follow-up PR, that was in my low priority TODO list, I'll move it to the main TODO, it should be a quick and easy change. jsetoain: RE follow-up PR, that was in my low priority TODO list, I'll move it to the main TODO, it…

	func @arm_sve_sdot(%a: !arm_sve.vector<16xi8>,			func @arm_sve_sdot(%a: vector<[16]xi8>,
	%b: !arm_sve.vector<16xi8>,			%b: vector<[16]xi8>,
	%c: !arm_sve.vector<4xi32>)			%c: vector<[4]xi32>)
	-> !arm_sve.vector<4xi32> {			-> vector<[4]xi32> {
	// CHECK: arm_sve.intr.sdot			// CHECK: arm_sve.intr.sdot
	%0 = arm_sve.sdot %c, %a, %b :			%0 = arm_sve.sdot %c, %a, %b :
	!arm_sve.vector<16xi8> to !arm_sve.vector<4xi32>			vector<[16]xi8> to vector<[4]xi32>
	return %0 : !arm_sve.vector<4xi32>			return %0 : vector<[4]xi32>
	}			}

	func @arm_sve_smmla(%a: !arm_sve.vector<16xi8>,			func @arm_sve_smmla(%a: vector<[16]xi8>,
	%b: !arm_sve.vector<16xi8>,			%b: vector<[16]xi8>,
	%c: !arm_sve.vector<4xi32>)			%c: vector<[4]xi32>)
	-> !arm_sve.vector<4xi32> {			-> vector<[4]xi32> {
	// CHECK: arm_sve.intr.smmla			// CHECK: arm_sve.intr.smmla
	%0 = arm_sve.smmla %c, %a, %b :			%0 = arm_sve.smmla %c, %a, %b :
	!arm_sve.vector<16xi8> to !arm_sve.vector<4xi32>			vector<[16]xi8> to vector<[4]xi32>
	return %0 : !arm_sve.vector<4xi32>			return %0 : vector<[4]xi32>
	}			}

	func @arm_sve_udot(%a: !arm_sve.vector<16xi8>,			func @arm_sve_udot(%a: vector<[16]xi8>,
	%b: !arm_sve.vector<16xi8>,			%b: vector<[16]xi8>,
	%c: !arm_sve.vector<4xi32>)			%c: vector<[4]xi32>)
	-> !arm_sve.vector<4xi32> {			-> vector<[4]xi32> {
	// CHECK: arm_sve.intr.udot			// CHECK: arm_sve.intr.udot
	%0 = arm_sve.udot %c, %a, %b :			%0 = arm_sve.udot %c, %a, %b :
	!arm_sve.vector<16xi8> to !arm_sve.vector<4xi32>			vector<[16]xi8> to vector<[4]xi32>
	return %0 : !arm_sve.vector<4xi32>			return %0 : vector<[4]xi32>
	}			}

	func @arm_sve_ummla(%a: !arm_sve.vector<16xi8>,			func @arm_sve_ummla(%a: vector<[16]xi8>,
	%b: !arm_sve.vector<16xi8>,			%b: vector<[16]xi8>,
	%c: !arm_sve.vector<4xi32>)			%c: vector<[4]xi32>)
	-> !arm_sve.vector<4xi32> {			-> vector<[4]xi32> {
	// CHECK: arm_sve.intr.ummla			// CHECK: arm_sve.intr.ummla
	%0 = arm_sve.ummla %c, %a, %b :			%0 = arm_sve.ummla %c, %a, %b :
	!arm_sve.vector<16xi8> to !arm_sve.vector<4xi32>			vector<[16]xi8> to vector<[4]xi32>
	return %0 : !arm_sve.vector<4xi32>			return %0 : vector<[4]xi32>
	}			}

	func @arm_sve_arithi(%a: !arm_sve.vector<4xi32>,			func @arm_sve_arithi_masked(%a: vector<[4]xi32>,
	%b: !arm_sve.vector<4xi32>,			%b: vector<[4]xi32>,
	%c: !arm_sve.vector<4xi32>,			%c: vector<[4]xi32>,
	%d: !arm_sve.vector<4xi32>,			%d: vector<[4]xi32>,
	%e: !arm_sve.vector<4xi32>) -> !arm_sve.vector<4xi32> {			%e: vector<[4]xi32>,
	// CHECK: llvm.mul {{.*}}: !llvm.vec<? x 4 x i32>			%mask: vector<[4]xi1>
	%0 = arm_sve.muli %a, %b : !arm_sve.vector<4xi32>			) -> vector<[4]xi32> {
	// CHECK: llvm.add {{.*}}: !llvm.vec<? x 4 x i32>			// CHECK: arm_sve.intr.add{{.*}}: (vector<[4]xi1>, vector<[4]xi32>, vector<[4]xi32>) -> vector<[4]xi32>
	%1 = arm_sve.addi %0, %c : !arm_sve.vector<4xi32>			%0 = arm_sve.masked.addi %mask, %a, %b : vector<[4]xi1>,
	// CHECK: llvm.sub {{.*}}: !llvm.vec<? x 4 x i32>			vector<[4]xi32>
	%2 = arm_sve.subi %1, %d : !arm_sve.vector<4xi32>			// CHECK: arm_sve.intr.sub{{.*}}: (vector<[4]xi1>, vector<[4]xi32>, vector<[4]xi32>) -> vector<[4]xi32>
	// CHECK: llvm.sdiv {{.*}}: !llvm.vec<? x 4 x i32>			%1 = arm_sve.masked.subi %mask, %0, %c : vector<[4]xi1>,
	%3 = arm_sve.divi_signed %2, %e : !arm_sve.vector<4xi32>			vector<[4]xi32>
	// CHECK: llvm.udiv {{.*}}: !llvm.vec<? x 4 x i32>			// CHECK: arm_sve.intr.mul{{.*}}: (vector<[4]xi1>, vector<[4]xi32>, vector<[4]xi32>) -> vector<[4]xi32>
	%4 = arm_sve.divi_unsigned %2, %e : !arm_sve.vector<4xi32>			%2 = arm_sve.masked.muli %mask, %1, %d : vector<[4]xi1>,
	return %4 : !arm_sve.vector<4xi32>			vector<[4]xi32>
	}			// CHECK: arm_sve.intr.sdiv{{.*}}: (vector<[4]xi1>, vector<[4]xi32>, vector<[4]xi32>) -> vector<[4]xi32>
				%3 = arm_sve.masked.divi_signed %mask, %2, %e : vector<[4]xi1>,
	func @arm_sve_arithf(%a: !arm_sve.vector<4xf32>,			vector<[4]xi32>
	%b: !arm_sve.vector<4xf32>,			// CHECK: arm_sve.intr.udiv{{.*}}: (vector<[4]xi1>, vector<[4]xi32>, vector<[4]xi32>) -> vector<[4]xi32>
	%c: !arm_sve.vector<4xf32>,			%4 = arm_sve.masked.divi_unsigned %mask, %3, %e : vector<[4]xi1>,
	%d: !arm_sve.vector<4xf32>,			vector<[4]xi32>
	%e: !arm_sve.vector<4xf32>) -> !arm_sve.vector<4xf32> {			return %4 : vector<[4]xi32>
	// CHECK: llvm.fmul {{.*}}: !llvm.vec<? x 4 x f32>			}
	%0 = arm_sve.mulf %a, %b : !arm_sve.vector<4xf32>
	// CHECK: llvm.fadd {{.*}}: !llvm.vec<? x 4 x f32>			func @arm_sve_arithf_masked(%a: vector<[4]xf32>,
	%1 = arm_sve.addf %0, %c : !arm_sve.vector<4xf32>			%b: vector<[4]xf32>,
	// CHECK: llvm.fsub {{.*}}: !llvm.vec<? x 4 x f32>			%c: vector<[4]xf32>,
	%2 = arm_sve.subf %1, %d : !arm_sve.vector<4xf32>			%d: vector<[4]xf32>,
	// CHECK: llvm.fdiv {{.*}}: !llvm.vec<? x 4 x f32>			%e: vector<[4]xf32>,
	%3 = arm_sve.divf %2, %e : !arm_sve.vector<4xf32>			%mask: vector<[4]xi1>
	return %3 : !arm_sve.vector<4xf32>			) -> vector<[4]xf32> {
	}			// CHECK: arm_sve.intr.fadd{{.*}}: (vector<[4]xi1>, vector<[4]xf32>, vector<[4]xf32>) -> vector<[4]xf32>
				%0 = arm_sve.masked.addf %mask, %a, %b : vector<[4]xi1>,
	func @arm_sve_arithi_masked(%a: !arm_sve.vector<4xi32>,			vector<[4]xf32>
	%b: !arm_sve.vector<4xi32>,			// CHECK: arm_sve.intr.fsub{{.*}}: (vector<[4]xi1>, vector<[4]xf32>, vector<[4]xf32>) -> vector<[4]xf32>
	%c: !arm_sve.vector<4xi32>,			%1 = arm_sve.masked.subf %mask, %0, %c : vector<[4]xi1>,
	%d: !arm_sve.vector<4xi32>,			vector<[4]xf32>
	%e: !arm_sve.vector<4xi32>,			// CHECK: arm_sve.intr.fmul{{.*}}: (vector<[4]xi1>, vector<[4]xf32>, vector<[4]xf32>) -> vector<[4]xf32>
	%mask: !arm_sve.vector<4xi1>			%2 = arm_sve.masked.mulf %mask, %1, %d : vector<[4]xi1>,
	) -> !arm_sve.vector<4xi32> {			vector<[4]xf32>
	// CHECK: arm_sve.intr.add{{.*}}: (!llvm.vec<? x 4 x i1>, !llvm.vec<? x 4 x i32>, !llvm.vec<? x 4 x i32>) -> !llvm.vec<? x 4 x i32>			// CHECK: arm_sve.intr.fdiv{{.*}}: (vector<[4]xi1>, vector<[4]xf32>, vector<[4]xf32>) -> vector<[4]xf32>
	%0 = arm_sve.masked.addi %mask, %a, %b : !arm_sve.vector<4xi1>,			%3 = arm_sve.masked.divf %mask, %2, %e : vector<[4]xi1>,
	!arm_sve.vector<4xi32>			vector<[4]xf32>
	// CHECK: arm_sve.intr.sub{{.*}}: (!llvm.vec<? x 4 x i1>, !llvm.vec<? x 4 x i32>, !llvm.vec<? x 4 x i32>) -> !llvm.vec<? x 4 x i32>			return %3 : vector<[4]xf32>
	%1 = arm_sve.masked.subi %mask, %0, %c : !arm_sve.vector<4xi1>,			}
	!arm_sve.vector<4xi32>
	// CHECK: arm_sve.intr.mul{{.*}}: (!llvm.vec<? x 4 x i1>, !llvm.vec<? x 4 x i32>, !llvm.vec<? x 4 x i32>) -> !llvm.vec<? x 4 x i32>			func @arm_sve_abs_diff(%a: vector<[4]xi32>,
	%2 = arm_sve.masked.muli %mask, %1, %d : !arm_sve.vector<4xi1>,			%b: vector<[4]xi32>)
	!arm_sve.vector<4xi32>			-> vector<[4]xi32> {
	// CHECK: arm_sve.intr.sdiv{{.*}}: (!llvm.vec<? x 4 x i1>, !llvm.vec<? x 4 x i32>, !llvm.vec<? x 4 x i32>) -> !llvm.vec<? x 4 x i32>			// CHECK: llvm.mlir.constant(dense<0> : vector<[4]xi32>) : vector<[4]xi32>
	%3 = arm_sve.masked.divi_signed %mask, %2, %e : !arm_sve.vector<4xi1>,			%z = arith.subi %a, %a : vector<[4]xi32>
	!arm_sve.vector<4xi32>			// CHECK: llvm.icmp "sge" {{.*}}: vector<[4]xi32>
	// CHECK: arm_sve.intr.udiv{{.*}}: (!llvm.vec<? x 4 x i1>, !llvm.vec<? x 4 x i32>, !llvm.vec<? x 4 x i32>) -> !llvm.vec<? x 4 x i32>			%agb = arith.cmpi sge, %a, %b : vector<[4]xi32>
	%4 = arm_sve.masked.divi_unsigned %mask, %3, %e : !arm_sve.vector<4xi1>,			// CHECK: llvm.icmp "slt" {{.*}}: vector<[4]xi32>
	!arm_sve.vector<4xi32>			%bga = arith.cmpi slt, %a, %b : vector<[4]xi32>
	return %4 : !arm_sve.vector<4xi32>			// CHECK: "arm_sve.intr.sub"{{.*}}: (vector<[4]xi1>, vector<[4]xi32>, vector<[4]xi32>) -> vector<[4]xi32>
	}			%0 = arm_sve.masked.subi %agb, %a, %b : vector<[4]xi1>,
				vector<[4]xi32>
	func @arm_sve_arithf_masked(%a: !arm_sve.vector<4xf32>,			// CHECK: "arm_sve.intr.sub"{{.*}}: (vector<[4]xi1>, vector<[4]xi32>, vector<[4]xi32>) -> vector<[4]xi32>
	%b: !arm_sve.vector<4xf32>,			%1 = arm_sve.masked.subi %bga, %b, %a : vector<[4]xi1>,
	%c: !arm_sve.vector<4xf32>,			vector<[4]xi32>
	%d: !arm_sve.vector<4xf32>,			// CHECK: "arm_sve.intr.add"{{.*}}: (vector<[4]xi1>, vector<[4]xi32>, vector<[4]xi32>) -> vector<[4]xi32>
	%e: !arm_sve.vector<4xf32>,			%2 = arm_sve.masked.addi %agb, %z, %0 : vector<[4]xi1>,
	%mask: !arm_sve.vector<4xi1>			vector<[4]xi32>
	) -> !arm_sve.vector<4xf32> {			// CHECK: "arm_sve.intr.add"{{.*}}: (vector<[4]xi1>, vector<[4]xi32>, vector<[4]xi32>) -> vector<[4]xi32>
	// CHECK: arm_sve.intr.fadd{{.*}}: (!llvm.vec<? x 4 x i1>, !llvm.vec<? x 4 x f32>, !llvm.vec<? x 4 x f32>) -> !llvm.vec<? x 4 x f32>			%3 = arm_sve.masked.addi %bga, %2, %1 : vector<[4]xi1>,
	%0 = arm_sve.masked.addf %mask, %a, %b : !arm_sve.vector<4xi1>,			vector<[4]xi32>
	!arm_sve.vector<4xf32>			return %3 : vector<[4]xi32>
	// CHECK: arm_sve.intr.fsub{{.*}}: (!llvm.vec<? x 4 x i1>, !llvm.vec<? x 4 x f32>, !llvm.vec<? x 4 x f32>) -> !llvm.vec<? x 4 x f32>
	%1 = arm_sve.masked.subf %mask, %0, %c : !arm_sve.vector<4xi1>,
	!arm_sve.vector<4xf32>
	// CHECK: arm_sve.intr.fmul{{.*}}: (!llvm.vec<? x 4 x i1>, !llvm.vec<? x 4 x f32>, !llvm.vec<? x 4 x f32>) -> !llvm.vec<? x 4 x f32>
	%2 = arm_sve.masked.mulf %mask, %1, %d : !arm_sve.vector<4xi1>,
	!arm_sve.vector<4xf32>
	// CHECK: arm_sve.intr.fdiv{{.*}}: (!llvm.vec<? x 4 x i1>, !llvm.vec<? x 4 x f32>, !llvm.vec<? x 4 x f32>) -> !llvm.vec<? x 4 x f32>
	%3 = arm_sve.masked.divf %mask, %2, %e : !arm_sve.vector<4xi1>,
	!arm_sve.vector<4xf32>
	return %3 : !arm_sve.vector<4xf32>
	}

	func @arm_sve_mask_genf(%a: !arm_sve.vector<4xf32>,
	%b: !arm_sve.vector<4xf32>)
	-> !arm_sve.vector<4xi1> {
	// CHECK: llvm.fcmp "oeq" {{.*}}: !llvm.vec<? x 4 x f32>
	%0 = arm_sve.cmpf oeq, %a, %b : !arm_sve.vector<4xf32>
	return %0 : !arm_sve.vector<4xi1>
	}

	func @arm_sve_mask_geni(%a: !arm_sve.vector<4xi32>,
	%b: !arm_sve.vector<4xi32>)
	-> !arm_sve.vector<4xi1> {
	// CHECK: llvm.icmp "uge" {{.*}}: !llvm.vec<? x 4 x i32>
	%0 = arm_sve.cmpi uge, %a, %b : !arm_sve.vector<4xi32>
	return %0 : !arm_sve.vector<4xi1>
	}

	func @arm_sve_abs_diff(%a: !arm_sve.vector<4xi32>,
	%b: !arm_sve.vector<4xi32>)
	-> !arm_sve.vector<4xi32> {
	// CHECK: llvm.sub {{.*}}: !llvm.vec<? x 4 x i32>
	%z = arm_sve.subi %a, %a : !arm_sve.vector<4xi32>
	// CHECK: llvm.icmp "sge" {{.*}}: !llvm.vec<? x 4 x i32>
	%agb = arm_sve.cmpi sge, %a, %b : !arm_sve.vector<4xi32>
	// CHECK: llvm.icmp "slt" {{.*}}: !llvm.vec<? x 4 x i32>
	%bga = arm_sve.cmpi slt, %a, %b : !arm_sve.vector<4xi32>
	// CHECK: "arm_sve.intr.sub"{{.*}}: (!llvm.vec<? x 4 x i1>, !llvm.vec<? x 4 x i32>, !llvm.vec<? x 4 x i32>) -> !llvm.vec<? x 4 x i32>
	%0 = arm_sve.masked.subi %agb, %a, %b : !arm_sve.vector<4xi1>,
	!arm_sve.vector<4xi32>
	// CHECK: "arm_sve.intr.sub"{{.*}}: (!llvm.vec<? x 4 x i1>, !llvm.vec<? x 4 x i32>, !llvm.vec<? x 4 x i32>) -> !llvm.vec<? x 4 x i32>
	%1 = arm_sve.masked.subi %bga, %b, %a : !arm_sve.vector<4xi1>,
	!arm_sve.vector<4xi32>
	// CHECK: "arm_sve.intr.add"{{.*}}: (!llvm.vec<? x 4 x i1>, !llvm.vec<? x 4 x i32>, !llvm.vec<? x 4 x i32>) -> !llvm.vec<? x 4 x i32>
	%2 = arm_sve.masked.addi %agb, %z, %0 : !arm_sve.vector<4xi1>,
	!arm_sve.vector<4xi32>
	// CHECK: "arm_sve.intr.add"{{.*}}: (!llvm.vec<? x 4 x i1>, !llvm.vec<? x 4 x i32>, !llvm.vec<? x 4 x i32>) -> !llvm.vec<? x 4 x i32>
	%3 = arm_sve.masked.addi %bga, %2, %1 : !arm_sve.vector<4xi1>,
	!arm_sve.vector<4xi32>
	return %3 : !arm_sve.vector<4xi32>
	}			}

	func @get_vector_scale() -> index {			func @get_vector_scale() -> index {
	// CHECK: arm_sve.vscale			// CHECK: llvm.intr.vscale
	%0 = arm_sve.vector_scale : index			%0 = vector.vscale
	return %0 : index			return %0 : index
	}			}

mlir/test/Dialect/ArmSVE/memcpy.mlir

This file was deleted.

	// RUN: mlir-opt %s -convert-vector-to-llvm="enable-arm-sve" \| mlir-opt \| FileCheck %s

	// CHECK: memcopy([[SRC:%arg[0-9]+]]: memref<?xf32>, [[DST:%arg[0-9]+]]
	func @memcopy(%src : memref<?xf32>, %dst : memref<?xf32>, %size : index) {
	%c0 = arith.constant 0 : index
	%c4 = arith.constant 4 : index
	%vs = arm_sve.vector_scale : index
	%step = arith.muli %c4, %vs : index

	// CHECK: [[SRCMRS:%[0-9]+]] = builtin.unrealized_conversion_cast [[SRC]] : memref<?xf32> to !llvm.struct<(ptr<f32>
	// CHECK: [[DSTMRS:%[0-9]+]] = builtin.unrealized_conversion_cast [[DST]] : memref<?xf32> to !llvm.struct<(ptr<f32>
	// CHECK: scf.for [[LOOPIDX:%arg[0-9]+]] = {{.*}}
	scf.for %i0 = %c0 to %size step %step {
	// CHECK: [[SRCIDX:%[0-9]+]] = builtin.unrealized_conversion_cast [[LOOPIDX]] : index to i64
	// CHECK: [[SRCMEM:%[0-9]+]] = llvm.extractvalue [[SRCMRS]][1] : !llvm.struct<(ptr<f32>
	// CHECK-NEXT: [[SRCPTR:%[0-9]+]] = llvm.getelementptr [[SRCMEM]]{{.}}[[SRCIDX]]{{.}} : (!llvm.ptr<f32>, i64) -> !llvm.ptr<f32>
	// CHECK-NEXT: [[SRCVPTR:%[0-9]+]] = llvm.bitcast [[SRCPTR]] : !llvm.ptr<f32> to !llvm.ptr<vec<? x 4 x f32>>
	// CHECK-NEXT: [[LDVAL:%[0-9]+]] = llvm.load [[SRCVPTR]] : !llvm.ptr<vec<? x 4 x f32>>
	%0 = arm_sve.load %src[%i0] : !arm_sve.vector<4xf32> from memref<?xf32>
	// CHECK: [[DSTMEM:%[0-9]+]] = llvm.extractvalue [[DSTMRS]][1] : !llvm.struct<(ptr<f32>
	// CHECK-NEXT: [[DSTPTR:%[0-9]+]] = llvm.getelementptr [[DSTMEM]]{{.}}[[SRCIDX]]{{.}} : (!llvm.ptr<f32>, i64) -> !llvm.ptr<f32>
	// CHECK-NEXT: [[DSTVPTR:%[0-9]+]] = llvm.bitcast [[DSTPTR]] : !llvm.ptr<f32> to !llvm.ptr<vec<? x 4 x f32>>
	// CHECK-NEXT: llvm.store [[LDVAL]], [[DSTVPTR]] : !llvm.ptr<vec<? x 4 x f32>>
	arm_sve.store %0, %dst[%i0] : !arm_sve.vector<4xf32> to memref<?xf32>
	}

	return
	}

mlir/test/Dialect/ArmSVE/roundtrip.mlir

	// RUN: mlir-opt -verify-diagnostics %s \| mlir-opt \| FileCheck %s			// RUN: mlir-opt -verify-diagnostics %s \| mlir-opt \| FileCheck %s

	func @arm_sve_sdot(%a: !arm_sve.vector<16xi8>,			func @arm_sve_sdot(%a: vector<[16]xi8>,
	%b: !arm_sve.vector<16xi8>,			%b: vector<[16]xi8>,
	%c: !arm_sve.vector<4xi32>) -> !arm_sve.vector<4xi32> {			%c: vector<[4]xi32>) -> vector<[4]xi32> {
	// CHECK: arm_sve.sdot {{.*}}: <16xi8> to <4xi32			// CHECK: arm_sve.sdot {{.*}}: vector<[16]xi8> to vector<[4]xi32
	%0 = arm_sve.sdot %c, %a, %b :			%0 = arm_sve.sdot %c, %a, %b :
	!arm_sve.vector<16xi8> to !arm_sve.vector<4xi32>			vector<[16]xi8> to vector<[4]xi32>
	return %0 : !arm_sve.vector<4xi32>			return %0 : vector<[4]xi32>
	}			}

	func @arm_sve_smmla(%a: !arm_sve.vector<16xi8>,			func @arm_sve_smmla(%a: vector<[16]xi8>,
	%b: !arm_sve.vector<16xi8>,			%b: vector<[16]xi8>,
	%c: !arm_sve.vector<4xi32>) -> !arm_sve.vector<4xi32> {			%c: vector<[4]xi32>) -> vector<[4]xi32> {
	// CHECK: arm_sve.smmla {{.*}}: <16xi8> to <4xi3			// CHECK: arm_sve.smmla {{.*}}: vector<[16]xi8> to vector<[4]xi3
	%0 = arm_sve.smmla %c, %a, %b :			%0 = arm_sve.smmla %c, %a, %b :
	!arm_sve.vector<16xi8> to !arm_sve.vector<4xi32>			vector<[16]xi8> to vector<[4]xi32>
	return %0 : !arm_sve.vector<4xi32>			return %0 : vector<[4]xi32>
	}			}

	func @arm_sve_udot(%a: !arm_sve.vector<16xi8>,			func @arm_sve_udot(%a: vector<[16]xi8>,
	%b: !arm_sve.vector<16xi8>,			%b: vector<[16]xi8>,
	%c: !arm_sve.vector<4xi32>) -> !arm_sve.vector<4xi32> {			%c: vector<[4]xi32>) -> vector<[4]xi32> {
	// CHECK: arm_sve.udot {{.*}}: <16xi8> to <4xi32			// CHECK: arm_sve.udot {{.*}}: vector<[16]xi8> to vector<[4]xi32
	%0 = arm_sve.udot %c, %a, %b :			%0 = arm_sve.udot %c, %a, %b :
	!arm_sve.vector<16xi8> to !arm_sve.vector<4xi32>			vector<[16]xi8> to vector<[4]xi32>
	return %0 : !arm_sve.vector<4xi32>			return %0 : vector<[4]xi32>
	}			}

	func @arm_sve_ummla(%a: !arm_sve.vector<16xi8>,			func @arm_sve_ummla(%a: vector<[16]xi8>,
	%b: !arm_sve.vector<16xi8>,			%b: vector<[16]xi8>,
	%c: !arm_sve.vector<4xi32>) -> !arm_sve.vector<4xi32> {			%c: vector<[4]xi32>) -> vector<[4]xi32> {
	// CHECK: arm_sve.ummla {{.*}}: <16xi8> to <4xi3			// CHECK: arm_sve.ummla {{.*}}: vector<[16]xi8> to vector<[4]xi3
	%0 = arm_sve.ummla %c, %a, %b :			%0 = arm_sve.ummla %c, %a, %b :
	!arm_sve.vector<16xi8> to !arm_sve.vector<4xi32>			vector<[16]xi8> to vector<[4]xi32>
	return %0 : !arm_sve.vector<4xi32>			return %0 : vector<[4]xi32>
	}			}

	func @arm_sve_arithi(%a: !arm_sve.vector<4xi32>,			func @arm_sve_masked_arithi(%a: vector<[4]xi32>,
	%b: !arm_sve.vector<4xi32>,			%b: vector<[4]xi32>,
	%c: !arm_sve.vector<4xi32>) -> !arm_sve.vector<4xi32> {			%c: vector<[4]xi32>,
	// CHECK: arm_sve.muli {{.*}}: !arm_sve.vector<4xi32>			%d: vector<[4]xi32>,
	%0 = arm_sve.muli %a, %b : !arm_sve.vector<4xi32>			%e: vector<[4]xi32>,
	// CHECK: arm_sve.addi {{.*}}: !arm_sve.vector<4xi32>			%mask: vector<[4]xi1>)
	%1 = arm_sve.addi %0, %c : !arm_sve.vector<4xi32>			-> vector<[4]xi32> {
	return %1 : !arm_sve.vector<4xi32>			// CHECK: arm_sve.masked.muli {{.*}}: vector<[4]xi1>, vector<
	}			%0 = arm_sve.masked.muli %mask, %a, %b : vector<[4]xi1>,
				vector<[4]xi32>
	func @arm_sve_arithf(%a: !arm_sve.vector<4xf32>,			// CHECK: arm_sve.masked.addi {{.*}}: vector<[4]xi1>, vector<
	%b: !arm_sve.vector<4xf32>,			%1 = arm_sve.masked.addi %mask, %0, %c : vector<[4]xi1>,
	%c: !arm_sve.vector<4xf32>) -> !arm_sve.vector<4xf32> {			vector<[4]xi32>
	// CHECK: arm_sve.mulf {{.*}}: !arm_sve.vector<4xf32>			// CHECK: arm_sve.masked.subi {{.*}}: vector<[4]xi1>, vector<
	%0 = arm_sve.mulf %a, %b : !arm_sve.vector<4xf32>			%2 = arm_sve.masked.subi %mask, %1, %d : vector<[4]xi1>,
	// CHECK: arm_sve.addf {{.*}}: !arm_sve.vector<4xf32>			vector<[4]xi32>
	%1 = arm_sve.addf %0, %c : !arm_sve.vector<4xf32>
	return %1 : !arm_sve.vector<4xf32>
	}

	func @arm_sve_masked_arithi(%a: !arm_sve.vector<4xi32>,
	%b: !arm_sve.vector<4xi32>,
	%c: !arm_sve.vector<4xi32>,
	%d: !arm_sve.vector<4xi32>,
	%e: !arm_sve.vector<4xi32>,
	%mask: !arm_sve.vector<4xi1>)
	-> !arm_sve.vector<4xi32> {
	// CHECK: arm_sve.masked.muli {{.*}}: !arm_sve.vector<4xi1>, !arm_sve.vector
	%0 = arm_sve.masked.muli %mask, %a, %b : !arm_sve.vector<4xi1>,
	!arm_sve.vector<4xi32>
	// CHECK: arm_sve.masked.addi {{.*}}: !arm_sve.vector<4xi1>, !arm_sve.vector
	%1 = arm_sve.masked.addi %mask, %0, %c : !arm_sve.vector<4xi1>,
	!arm_sve.vector<4xi32>
	// CHECK: arm_sve.masked.subi {{.*}}: !arm_sve.vector<4xi1>, !arm_sve.vector
	%2 = arm_sve.masked.subi %mask, %1, %d : !arm_sve.vector<4xi1>,
	!arm_sve.vector<4xi32>
	// CHECK: arm_sve.masked.divi_signed			// CHECK: arm_sve.masked.divi_signed
	%3 = arm_sve.masked.divi_signed %mask, %2, %e : !arm_sve.vector<4xi1>,			%3 = arm_sve.masked.divi_signed %mask, %2, %e : vector<[4]xi1>,
	!arm_sve.vector<4xi32>			vector<[4]xi32>
	// CHECK: arm_sve.masked.divi_unsigned			// CHECK: arm_sve.masked.divi_unsigned
	%4 = arm_sve.masked.divi_unsigned %mask, %3, %e : !arm_sve.vector<4xi1>,			%4 = arm_sve.masked.divi_unsigned %mask, %3, %e : vector<[4]xi1>,
	!arm_sve.vector<4xi32>			vector<[4]xi32>
	return %2 : !arm_sve.vector<4xi32>			return %2 : vector<[4]xi32>
	}			}

	func @arm_sve_masked_arithf(%a: !arm_sve.vector<4xf32>,			func @arm_sve_masked_arithf(%a: vector<[4]xf32>,
	%b: !arm_sve.vector<4xf32>,			%b: vector<[4]xf32>,
	%c: !arm_sve.vector<4xf32>,			%c: vector<[4]xf32>,
	%d: !arm_sve.vector<4xf32>,			%d: vector<[4]xf32>,
	%e: !arm_sve.vector<4xf32>,			%e: vector<[4]xf32>,
	%mask: !arm_sve.vector<4xi1>)			%mask: vector<[4]xi1>)
	-> !arm_sve.vector<4xf32> {			-> vector<[4]xf32> {
	// CHECK: arm_sve.masked.mulf {{.*}}: !arm_sve.vector<4xi1>, !arm_sve.vector			// CHECK: arm_sve.masked.mulf {{.*}}: vector<[4]xi1>, vector<
	%0 = arm_sve.masked.mulf %mask, %a, %b : !arm_sve.vector<4xi1>,			%0 = arm_sve.masked.mulf %mask, %a, %b : vector<[4]xi1>,
	!arm_sve.vector<4xf32>			vector<[4]xf32>
	// CHECK: arm_sve.masked.addf {{.*}}: !arm_sve.vector<4xi1>, !arm_sve.vector			// CHECK: arm_sve.masked.addf {{.*}}: vector<[4]xi1>, vector<
	%1 = arm_sve.masked.addf %mask, %0, %c : !arm_sve.vector<4xi1>,			%1 = arm_sve.masked.addf %mask, %0, %c : vector<[4]xi1>,
	!arm_sve.vector<4xf32>			vector<[4]xf32>
	// CHECK: arm_sve.masked.subf {{.*}}: !arm_sve.vector<4xi1>, !arm_sve.vector			// CHECK: arm_sve.masked.subf {{.*}}: vector<[4]xi1>, vector<
	%2 = arm_sve.masked.subf %mask, %1, %d : !arm_sve.vector<4xi1>,			%2 = arm_sve.masked.subf %mask, %1, %d : vector<[4]xi1>,
	!arm_sve.vector<4xf32>			vector<[4]xf32>
	// CHECK: arm_sve.masked.divf {{.*}}: !arm_sve.vector<4xi1>, !arm_sve.vector			// CHECK: arm_sve.masked.divf {{.*}}: vector<[4]xi1>, vector<
	%3 = arm_sve.masked.divf %mask, %2, %e : !arm_sve.vector<4xi1>,			%3 = arm_sve.masked.divf %mask, %2, %e : vector<[4]xi1>,
	!arm_sve.vector<4xf32>			vector<[4]xf32>
	return %3 : !arm_sve.vector<4xf32>			return %3 : vector<[4]xf32>
	}

	func @arm_sve_mask_genf(%a: !arm_sve.vector<4xf32>,
	%b: !arm_sve.vector<4xf32>)
	-> !arm_sve.vector<4xi1> {
	// CHECK: arm_sve.cmpf oeq, {{.*}}: !arm_sve.vector<4xf32>
	%0 = arm_sve.cmpf oeq, %a, %b : !arm_sve.vector<4xf32>
	return %0 : !arm_sve.vector<4xi1>
	}

	func @arm_sve_mask_geni(%a: !arm_sve.vector<4xi32>,
	%b: !arm_sve.vector<4xi32>)
	-> !arm_sve.vector<4xi1> {
	// CHECK: arm_sve.cmpi uge, {{.*}}: !arm_sve.vector<4xi32>
	%0 = arm_sve.cmpi uge, %a, %b : !arm_sve.vector<4xi32>
	return %0 : !arm_sve.vector<4xi1>
	}

	func @arm_sve_memory(%v: !arm_sve.vector<4xi32>,
	%m: memref<?xi32>)
	-> !arm_sve.vector<4xi32> {
	%c0 = arith.constant 0 : index
	// CHECK: arm_sve.load {{.*}}: !arm_sve.vector<4xi32> from memref<?xi32>
	%0 = arm_sve.load %m[%c0] : !arm_sve.vector<4xi32> from memref<?xi32>
	// CHECK: arm_sve.store {{.*}}: !arm_sve.vector<4xi32> to memref<?xi32>
	arm_sve.store %v, %m[%c0] : !arm_sve.vector<4xi32> to memref<?xi32>
	return %0 : !arm_sve.vector<4xi32>
	}

	func @get_vector_scale() -> index {
	// CHECK: arm_sve.vector_scale : index
	%0 = arm_sve.vector_scale : index
	return %0 : index
	}			}

mlir/test/Dialect/Builtin/invalid.mlir

	// RUN: mlir-opt %s -split-input-file -verify-diagnostics			// RUN: mlir-opt %s -split-input-file -verify-diagnostics

	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//
	// UnrealizedConversionCastOp			// UnrealizedConversionCastOp
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	// expected-error@+1 {{expected at least one result for cast operation}}			// expected-error@+1 {{expected at least one result for cast operation}}
	"builtin.unrealized_conversion_cast"() : () -> ()			"builtin.unrealized_conversion_cast"() : () -> ()

	// -----			// -----

				//===----------------------------------------------------------------------===//
				// VectorType
				//===----------------------------------------------------------------------===//

				// expected-error@+1 {{missing ']' closing set of scalable dimensions}}
				func @scalable_vector_arg(%arg0: vector<[4xf32>) { }

				// -----

mlir/test/Dialect/Builtin/ops.mlir

	Show All 12 Lines
	// An unrealized 1-1 conversion.			// An unrealized 1-1 conversion.
	%result1 = unrealized_conversion_cast %operand : !foo.type to !bar.lowered_type			%result1 = unrealized_conversion_cast %operand : !foo.type to !bar.lowered_type

	// An unrealized 1-N conversion.			// An unrealized 1-N conversion.
	%results2:2 = unrealized_conversion_cast %tuple_operand : !foo.tuple_type<!foo.type, !foo.type> to !foo.type, !foo.type			%results2:2 = unrealized_conversion_cast %tuple_operand : !foo.tuple_type<!foo.type, !foo.type> to !foo.type, !foo.type

	// An unrealized N-1 conversion.			// An unrealized N-1 conversion.
	%result3 = unrealized_conversion_cast %operand, %operand : !foo.type, !foo.type to !bar.tuple_type<!foo.type, !foo.type>			%result3 = unrealized_conversion_cast %operand, %operand : !foo.type, !foo.type to !bar.tuple_type<!foo.type, !foo.type>

				//===----------------------------------------------------------------------===//
				// VectorType
				//===----------------------------------------------------------------------===//

				// A basic 1D scalable vector
				%scalable_vector_1d = "foo.op"() : () -> vector<[4]xi32>

				// A 2D scalable vector
				%scalable_vector_2d = "foo.op"() : () -> vector<[2x2]xf64>

				// A 2D scalable vector with fixed-length dimensions
				%scalable_vector_2d_mixed = "foo.op"() : () -> vector<2x[4]xbf16>

				// A multi-dimensional vector with mixed scalable and fixed-length dimensions
				%scalable_vector_multi_mixed = "foo.op"() : () -> vector<2x2x[4x4]xi8>

mlir/test/Dialect/Vector/ops.mlir

Show First 20 Lines • Show All 572 Lines • ▼ Show 20 Lines	func @vector_load_and_store_1d_vector_memref(%memref : memref<200x100xvector<8xf32>>,
%i : index, %j : index) {		%i : index, %j : index) {
// CHECK: %[[ld:.]] = vector.load %{{.}}[%{{.*}}] : memref<200x100xvector<8xf32>>, vector<8xf32>		// CHECK: %[[ld:.]] = vector.load %{{.}}[%{{.*}}] : memref<200x100xvector<8xf32>>, vector<8xf32>
%0 = vector.load %memref[%i, %j] : memref<200x100xvector<8xf32>>, vector<8xf32>		%0 = vector.load %memref[%i, %j] : memref<200x100xvector<8xf32>>, vector<8xf32>
// CHECK: vector.store %[[ld]], %{{.}}[%{{.}}] : memref<200x100xvector<8xf32>>, vector<8xf32>		// CHECK: vector.store %[[ld]], %{{.}}[%{{.}}] : memref<200x100xvector<8xf32>>, vector<8xf32>
vector.store %0, %memref[%i, %j] : memref<200x100xvector<8xf32>>, vector<8xf32>		vector.store %0, %memref[%i, %j] : memref<200x100xvector<8xf32>>, vector<8xf32>
return		return
}		}

		// CHECK-LABEL: @vector_load_and_store_scalable_vector_memref
		func @vector_load_and_store_scalable_vector_memref(%v: vector<[4]xi32>, %m: memref<?xi32>) -> vector<[4]xi32> {
		%c0 = arith.constant 0 : index
		// CHECK: vector.load {{.*}}: memref<?xi32>, vector<[4]xi32>
		%0 = vector.load %m[%c0] : memref<?xi32>, vector<[4]xi32>
		// CHECK: vector.store {{.*}}: memref<?xi32>, vector<[4]xi32>
		vector.store %v, %m[%c0] : memref<?xi32>, vector<[4]xi32>
		return %0 : vector<[4]xi32>
		}

		func @vector_load_and_store_1d_scalable_vector_memref(%memref : memref<200x100xvector<8xf32>>,
		%i : index, %j : index) {
		// CHECK: %[[ld:.]] = vector.load %{{.}}[%{{.*}}] : memref<200x100xvector<8xf32>>, vector<8xf32>
		%0 = vector.load %memref[%i, %j] : memref<200x100xvector<8xf32>>, vector<8xf32>
		// CHECK: vector.store %[[ld]], %{{.}}[%{{.}}] : memref<200x100xvector<8xf32>>, vector<8xf32>
		vector.store %0, %memref[%i, %j] : memref<200x100xvector<8xf32>>, vector<8xf32>
		return
		}

// CHECK-LABEL: @vector_load_and_store_out_of_bounds		// CHECK-LABEL: @vector_load_and_store_out_of_bounds
func @vector_load_and_store_out_of_bounds(%memref : memref<7xf32>) {		func @vector_load_and_store_out_of_bounds(%memref : memref<7xf32>) {
%c0 = arith.constant 0 : index		%c0 = arith.constant 0 : index
// CHECK: %[[ld:.]] = vector.load %{{.}}[%{{.*}}] : memref<7xf32>, vector<8xf32>		// CHECK: %[[ld:.]] = vector.load %{{.}}[%{{.*}}] : memref<7xf32>, vector<8xf32>
%0 = vector.load %memref[%c0] : memref<7xf32>, vector<8xf32>		%0 = vector.load %memref[%c0] : memref<7xf32>, vector<8xf32>
// CHECK: vector.store %[[ld]], %{{.}}[%{{.}}] : memref<7xf32>, vector<8xf32>		// CHECK: vector.store %[[ld]], %{{.}}[%{{.}}] : memref<7xf32>, vector<8xf32>
vector.store %0, %memref[%c0] : memref<7xf32>, vector<8xf32>		vector.store %0, %memref[%c0] : memref<7xf32>, vector<8xf32>
return		return
▲ Show 20 Lines • Show All 97 Lines • ▼ Show 20 Lines
// CHECK-LABEL: @multi_reduction		// CHECK-LABEL: @multi_reduction
func @multi_reduction(%0: vector<4x8x16x32xf32>) -> f32 {		func @multi_reduction(%0: vector<4x8x16x32xf32>) -> f32 {
%1 = vector.multi_reduction <add>, %0 [1, 3] :		%1 = vector.multi_reduction <add>, %0 [1, 3] :
vector<4x8x16x32xf32> to vector<4x16xf32>		vector<4x8x16x32xf32> to vector<4x16xf32>
%2 = vector.multi_reduction <add>, %1 [0, 1] :		%2 = vector.multi_reduction <add>, %1 [0, 1] :
vector<4x16xf32> to f32		vector<4x16xf32> to f32
return %2 : f32		return %2 : f32
}		}

		// CHECK-LABEL: @get_vector_scale
		func @get_vector_scale() -> index {
		// CHECK: vector.vscale
		%0 = vector.vscale
		return %0 : index
		}

mlir/test/Dialect/Vector/vector-scalable-memcpy.mlir

This file was added.

				// RUN: mlir-opt %s -convert-vector-to-llvm \| mlir-opt \| FileCheck %s

				// CHECK: vector_scalable_memcopy([[SRC:%arg[0-9]+]]: memref<?xf32>, [[DST:%arg[0-9]+]]
				func @vector_scalable_memcopy(%src : memref<?xf32>, %dst : memref<?xf32>, %size : index) {
				%c0 = arith.constant 0 : index
				%c4 = arith.constant 4 : index
				%vs = vector.vscale
				%step = arith.muli %c4, %vs : index
				// CHECK: [[SRCMRS:%[0-9]+]] = builtin.unrealized_conversion_cast [[SRC]] : memref<?xf32> to !llvm.struct<(ptr<f32>
				// CHECK: [[DSTMRS:%[0-9]+]] = builtin.unrealized_conversion_cast [[DST]] : memref<?xf32> to !llvm.struct<(ptr<f32>
				// CHECK: scf.for [[LOOPIDX:%arg[0-9]+]] = {{.*}}
				scf.for %i0 = %c0 to %size step %step {
				// CHECK: [[DATAIDX:%[0-9]+]] = builtin.unrealized_conversion_cast [[LOOPIDX]] : index to i64
				// CHECK: [[SRCMEM:%[0-9]+]] = llvm.extractvalue [[SRCMRS]][1] : !llvm.struct<(ptr<f32>
				// CHECK-NEXT: [[SRCPTR:%[0-9]+]] = llvm.getelementptr [[SRCMEM]]{{.}}[[DATAIDX]]{{.}} : (!llvm.ptr<f32>, i64) -> !llvm.ptr<f32>
				// CHECK-NEXT: [[SRCVPTR:%[0-9]+]] = llvm.bitcast [[SRCPTR]] : !llvm.ptr<f32> to !llvm.ptr<vector<[4]xf32>>
				// CHECK-NEXT: [[LDVAL:%[0-9]+]] = llvm.load [[SRCVPTR]]{{.*}}: !llvm.ptr<vector<[4]xf32>>
				%0 = vector.load %src[%i0] : memref<?xf32>, vector<[4]xf32>
				// CHECK: [[DSTMEM:%[0-9]+]] = llvm.extractvalue [[DSTMRS]][1] : !llvm.struct<(ptr<f32>
				// CHECK-NEXT: [[DSTPTR:%[0-9]+]] = llvm.getelementptr [[DSTMEM]]{{.}}[[DATAIDX]]{{.}} : (!llvm.ptr<f32>, i64) -> !llvm.ptr<f32>
				// CHECK-NEXT: [[DSTVPTR:%[0-9]+]] = llvm.bitcast [[DSTPTR]] : !llvm.ptr<f32> to !llvm.ptr<vector<[4]xf32>>
				// CHECK-NEXT: llvm.store [[LDVAL]], [[DSTVPTR]]{{.*}}: !llvm.ptr<vector<[4]xf32>>
				vector.store %0, %dst[%i0] : memref<?xf32>, vector<[4]xf32>
				}

				return
				}

mlir/test/Target/LLVMIR/arm-sve.mlir

// RUN: mlir-translate --mlir-to-llvmir %s \| FileCheck %s		// RUN: mlir-translate --mlir-to-llvmir %s \| FileCheck %s

// CHECK-LABEL: define <vscale x 4 x i32> @arm_sve_sdot		// CHECK-LABEL: define <vscale x 4 x i32> @arm_sve_sdot
llvm.func @arm_sve_sdot(%arg0: !llvm.vec<?x16 x i8>,		llvm.func @arm_sve_sdot(%arg0: vector<[16]xi8>,
%arg1: !llvm.vec<?x16 x i8>,		%arg1: vector<[16]xi8>,
%arg2: !llvm.vec<?x4 x i32>)		%arg2: vector<[4]xi32>)
-> !llvm.vec<?x4 x i32> {		-> vector<[4]xi32> {
// CHECK: call <vscale x 4 x i32> @llvm.aarch64.sve.sdot.nxv4i32(<vscale x 4		// CHECK: call <vscale x 4 x i32> @llvm.aarch64.sve.sdot.nxv4i32(<vscale x 4
%0 = "arm_sve.intr.sdot"(%arg2, %arg0, %arg1) :		%0 = "arm_sve.intr.sdot"(%arg2, %arg0, %arg1) :
(!llvm.vec<?x4 x i32>, !llvm.vec<?x16 x i8>, !llvm.vec<?x16 x i8>)		(vector<[4]xi32>, vector<[16]xi8>, vector<[16]xi8>)
-> !llvm.vec<?x4 x i32>		-> vector<[4]xi32>
llvm.return %0 : !llvm.vec<?x4 x i32>		llvm.return %0 : vector<[4]xi32>
}		}

// CHECK-LABEL: define <vscale x 4 x i32> @arm_sve_smmla		// CHECK-LABEL: define <vscale x 4 x i32> @arm_sve_smmla
llvm.func @arm_sve_smmla(%arg0: !llvm.vec<?x16 x i8>,		llvm.func @arm_sve_smmla(%arg0: vector<[16]xi8>,
%arg1: !llvm.vec<?x16 x i8>,		%arg1: vector<[16]xi8>,
%arg2: !llvm.vec<?x4 x i32>)		%arg2: vector<[4]xi32>)
-> !llvm.vec<?x4 x i32> {		-> vector<[4]xi32> {
// CHECK: call <vscale x 4 x i32> @llvm.aarch64.sve.smmla.nxv4i32(<vscale x 4		// CHECK: call <vscale x 4 x i32> @llvm.aarch64.sve.smmla.nxv4i32(<vscale x 4
%0 = "arm_sve.intr.smmla"(%arg2, %arg0, %arg1) :		%0 = "arm_sve.intr.smmla"(%arg2, %arg0, %arg1) :
(!llvm.vec<?x4 x i32>, !llvm.vec<?x16 x i8>, !llvm.vec<?x16 x i8>)		(vector<[4]xi32>, vector<[16]xi8>, vector<[16]xi8>)
-> !llvm.vec<?x4 x i32>		-> vector<[4]xi32>
llvm.return %0 : !llvm.vec<?x4 x i32>		llvm.return %0 : vector<[4]xi32>
}		}

// CHECK-LABEL: define <vscale x 4 x i32> @arm_sve_udot		// CHECK-LABEL: define <vscale x 4 x i32> @arm_sve_udot
llvm.func @arm_sve_udot(%arg0: !llvm.vec<?x16 x i8>,		llvm.func @arm_sve_udot(%arg0: vector<[16]xi8>,
%arg1: !llvm.vec<?x16 x i8>,		%arg1: vector<[16]xi8>,
%arg2: !llvm.vec<?x4 x i32>)		%arg2: vector<[4]xi32>)
-> !llvm.vec<?x4 x i32> {		-> vector<[4]xi32> {
// CHECK: call <vscale x 4 x i32> @llvm.aarch64.sve.udot.nxv4i32(<vscale x 4		// CHECK: call <vscale x 4 x i32> @llvm.aarch64.sve.udot.nxv4i32(<vscale x 4
%0 = "arm_sve.intr.udot"(%arg2, %arg0, %arg1) :		%0 = "arm_sve.intr.udot"(%arg2, %arg0, %arg1) :
(!llvm.vec<?x4 x i32>, !llvm.vec<?x16 x i8>, !llvm.vec<?x16 x i8>)		(vector<[4]xi32>, vector<[16]xi8>, vector<[16]xi8>)
-> !llvm.vec<?x4 x i32>		-> vector<[4]xi32>
llvm.return %0 : !llvm.vec<?x4 x i32>		llvm.return %0 : vector<[4]xi32>
}		}

// CHECK-LABEL: define <vscale x 4 x i32> @arm_sve_ummla		// CHECK-LABEL: define <vscale x 4 x i32> @arm_sve_ummla
llvm.func @arm_sve_ummla(%arg0: !llvm.vec<?x16 x i8>,		llvm.func @arm_sve_ummla(%arg0: vector<[16]xi8>,
%arg1: !llvm.vec<?x16 x i8>,		%arg1: vector<[16]xi8>,
%arg2: !llvm.vec<?x4 x i32>)		%arg2: vector<[4]xi32>)
-> !llvm.vec<?x4 x i32> {		-> vector<[4]xi32> {
// CHECK: call <vscale x 4 x i32> @llvm.aarch64.sve.ummla.nxv4i32(<vscale x 4		// CHECK: call <vscale x 4 x i32> @llvm.aarch64.sve.ummla.nxv4i32(<vscale x 4
%0 = "arm_sve.intr.ummla"(%arg2, %arg0, %arg1) :		%0 = "arm_sve.intr.ummla"(%arg2, %arg0, %arg1) :
(!llvm.vec<?x4 x i32>, !llvm.vec<?x16 x i8>, !llvm.vec<?x16 x i8>)		(vector<[4]xi32>, vector<[16]xi8>, vector<[16]xi8>)
-> !llvm.vec<?x4 x i32>		-> vector<[4]xi32>
llvm.return %0 : !llvm.vec<?x4 x i32>		llvm.return %0 : vector<[4]xi32>
}		}

// CHECK-LABEL: define <vscale x 4 x i32> @arm_sve_arithi		// CHECK-LABEL: define <vscale x 4 x i32> @arm_sve_arithi
llvm.func @arm_sve_arithi(%arg0: !llvm.vec<? x 4 x i32>,		llvm.func @arm_sve_arithi(%arg0: vector<[4]xi32>,
%arg1: !llvm.vec<? x 4 x i32>,		%arg1: vector<[4]xi32>,
%arg2: !llvm.vec<? x 4 x i32>)		%arg2: vector<[4]xi32>)
-> !llvm.vec<? x 4 x i32> {		-> vector<[4]xi32> {
// CHECK: mul <vscale x 4 x i32>		// CHECK: mul <vscale x 4 x i32>
%0 = llvm.mul %arg0, %arg1 : !llvm.vec<? x 4 x i32>		%0 = llvm.mul %arg0, %arg1 : vector<[4]xi32>
// CHECK: add <vscale x 4 x i32>		// CHECK: add <vscale x 4 x i32>
%1 = llvm.add %0, %arg2 : !llvm.vec<? x 4 x i32>		%1 = llvm.add %0, %arg2 : vector<[4]xi32>
llvm.return %1 : !llvm.vec<? x 4 x i32>		llvm.return %1 : vector<[4]xi32>
}		}

// CHECK-LABEL: define <vscale x 4 x float> @arm_sve_arithf		// CHECK-LABEL: define <vscale x 4 x float> @arm_sve_arithf
llvm.func @arm_sve_arithf(%arg0: !llvm.vec<? x 4 x f32>,		llvm.func @arm_sve_arithf(%arg0: vector<[4]xf32>,
%arg1: !llvm.vec<? x 4 x f32>,		%arg1: vector<[4]xf32>,
%arg2: !llvm.vec<? x 4 x f32>)		%arg2: vector<[4]xf32>)
-> !llvm.vec<? x 4 x f32> {		-> vector<[4]xf32> {
// CHECK: fmul <vscale x 4 x float>		// CHECK: fmul <vscale x 4 x float>
%0 = llvm.fmul %arg0, %arg1 : !llvm.vec<? x 4 x f32>		%0 = llvm.fmul %arg0, %arg1 : vector<[4]xf32>
// CHECK: fadd <vscale x 4 x float>		// CHECK: fadd <vscale x 4 x float>
%1 = llvm.fadd %0, %arg2 : !llvm.vec<? x 4 x f32>		%1 = llvm.fadd %0, %arg2 : vector<[4]xf32>
llvm.return %1 : !llvm.vec<? x 4 x f32>		llvm.return %1 : vector<[4]xf32>
}		}

// CHECK-LABEL: define <vscale x 4 x i32> @arm_sve_arithi_masked		// CHECK-LABEL: define <vscale x 4 x i32> @arm_sve_arithi_masked
llvm.func @arm_sve_arithi_masked(%arg0: !llvm.vec<? x 4 x i32>,		llvm.func @arm_sve_arithi_masked(%arg0: vector<[4]xi32>,
%arg1: !llvm.vec<? x 4 x i32>,		%arg1: vector<[4]xi32>,
%arg2: !llvm.vec<? x 4 x i32>,		%arg2: vector<[4]xi32>,
%arg3: !llvm.vec<? x 4 x i32>,		%arg3: vector<[4]xi32>,
%arg4: !llvm.vec<? x 4 x i32>,		%arg4: vector<[4]xi32>,
%arg5: !llvm.vec<? x 4 x i1>)		%arg5: vector<[4]xi1>)
-> !llvm.vec<? x 4 x i32> {		-> vector<[4]xi32> {
// CHECK: call <vscale x 4 x i32> @llvm.aarch64.sve.add.nxv4i32		// CHECK: call <vscale x 4 x i32> @llvm.aarch64.sve.add.nxv4i32
%0 = "arm_sve.intr.add"(%arg5, %arg0, %arg1) : (!llvm.vec<? x 4 x i1>,		%0 = "arm_sve.intr.add"(%arg5, %arg0, %arg1) : (vector<[4]xi1>,
!llvm.vec<? x 4 x i32>,		vector<[4]xi32>,
!llvm.vec<? x 4 x i32>)		vector<[4]xi32>)
-> !llvm.vec<? x 4 x i32>		-> vector<[4]xi32>
// CHECK: call <vscale x 4 x i32> @llvm.aarch64.sve.sub.nxv4i32		// CHECK: call <vscale x 4 x i32> @llvm.aarch64.sve.sub.nxv4i32
%1 = "arm_sve.intr.sub"(%arg5, %0, %arg1) : (!llvm.vec<? x 4 x i1>,		%1 = "arm_sve.intr.sub"(%arg5, %0, %arg1) : (vector<[4]xi1>,
!llvm.vec<? x 4 x i32>,		vector<[4]xi32>,
!llvm.vec<? x 4 x i32>)		vector<[4]xi32>)
-> !llvm.vec<? x 4 x i32>		-> vector<[4]xi32>
// CHECK: call <vscale x 4 x i32> @llvm.aarch64.sve.mul.nxv4i32		// CHECK: call <vscale x 4 x i32> @llvm.aarch64.sve.mul.nxv4i32
%2 = "arm_sve.intr.mul"(%arg5, %1, %arg3) : (!llvm.vec<? x 4 x i1>,		%2 = "arm_sve.intr.mul"(%arg5, %1, %arg3) : (vector<[4]xi1>,
!llvm.vec<? x 4 x i32>,		vector<[4]xi32>,
!llvm.vec<? x 4 x i32>)		vector<[4]xi32>)
-> !llvm.vec<? x 4 x i32>		-> vector<[4]xi32>
// CHECK: call <vscale x 4 x i32> @llvm.aarch64.sve.sdiv.nxv4i32		// CHECK: call <vscale x 4 x i32> @llvm.aarch64.sve.sdiv.nxv4i32
%3 = "arm_sve.intr.sdiv"(%arg5, %2, %arg4) : (!llvm.vec<? x 4 x i1>,		%3 = "arm_sve.intr.sdiv"(%arg5, %2, %arg4) : (vector<[4]xi1>,
!llvm.vec<? x 4 x i32>,		vector<[4]xi32>,
!llvm.vec<? x 4 x i32>)		vector<[4]xi32>)
-> !llvm.vec<? x 4 x i32>		-> vector<[4]xi32>
// CHECK: call <vscale x 4 x i32> @llvm.aarch64.sve.udiv.nxv4i32		// CHECK: call <vscale x 4 x i32> @llvm.aarch64.sve.udiv.nxv4i32
%4 = "arm_sve.intr.udiv"(%arg5, %3, %arg4) : (!llvm.vec<? x 4 x i1>,		%4 = "arm_sve.intr.udiv"(%arg5, %3, %arg4) : (vector<[4]xi1>,
!llvm.vec<? x 4 x i32>,		vector<[4]xi32>,
!llvm.vec<? x 4 x i32>)		vector<[4]xi32>)
-> !llvm.vec<? x 4 x i32>		-> vector<[4]xi32>
llvm.return %4 : !llvm.vec<? x 4 x i32>		llvm.return %4 : vector<[4]xi32>
}		}

// CHECK-LABEL: define <vscale x 4 x float> @arm_sve_arithf_masked		// CHECK-LABEL: define <vscale x 4 x float> @arm_sve_arithf_masked
llvm.func @arm_sve_arithf_masked(%arg0: !llvm.vec<? x 4 x f32>,		llvm.func @arm_sve_arithf_masked(%arg0: vector<[4]xf32>,
%arg1: !llvm.vec<? x 4 x f32>,		%arg1: vector<[4]xf32>,
%arg2: !llvm.vec<? x 4 x f32>,		%arg2: vector<[4]xf32>,
%arg3: !llvm.vec<? x 4 x f32>,		%arg3: vector<[4]xf32>,
%arg4: !llvm.vec<? x 4 x f32>,		%arg4: vector<[4]xf32>,
%arg5: !llvm.vec<? x 4 x i1>)		%arg5: vector<[4]xi1>)
-> !llvm.vec<? x 4 x f32> {		-> vector<[4]xf32> {
// CHECK: call <vscale x 4 x float> @llvm.aarch64.sve.fadd.nxv4f32		// CHECK: call <vscale x 4 x float> @llvm.aarch64.sve.fadd.nxv4f32
%0 = "arm_sve.intr.fadd"(%arg5, %arg0, %arg1) : (!llvm.vec<? x 4 x i1>,		%0 = "arm_sve.intr.fadd"(%arg5, %arg0, %arg1) : (vector<[4]xi1>,
!llvm.vec<? x 4 x f32>,		vector<[4]xf32>,
!llvm.vec<? x 4 x f32>)		vector<[4]xf32>)
-> !llvm.vec<? x 4 x f32>		-> vector<[4]xf32>
// CHECK: call <vscale x 4 x float> @llvm.aarch64.sve.fsub.nxv4f32		// CHECK: call <vscale x 4 x float> @llvm.aarch64.sve.fsub.nxv4f32
%1 = "arm_sve.intr.fsub"(%arg5, %0, %arg2) : (!llvm.vec<? x 4 x i1>,		%1 = "arm_sve.intr.fsub"(%arg5, %0, %arg2) : (vector<[4]xi1>,
!llvm.vec<? x 4 x f32>,		vector<[4]xf32>,
!llvm.vec<? x 4 x f32>)		vector<[4]xf32>)
-> !llvm.vec<? x 4 x f32>		-> vector<[4]xf32>
// CHECK: call <vscale x 4 x float> @llvm.aarch64.sve.fmul.nxv4f32		// CHECK: call <vscale x 4 x float> @llvm.aarch64.sve.fmul.nxv4f32
%2 = "arm_sve.intr.fmul"(%arg5, %1, %arg3) : (!llvm.vec<? x 4 x i1>,		%2 = "arm_sve.intr.fmul"(%arg5, %1, %arg3) : (vector<[4]xi1>,
!llvm.vec<? x 4 x f32>,		vector<[4]xf32>,
!llvm.vec<? x 4 x f32>)		vector<[4]xf32>)
-> !llvm.vec<? x 4 x f32>		-> vector<[4]xf32>
// CHECK: call <vscale x 4 x float> @llvm.aarch64.sve.fdiv.nxv4f32		// CHECK: call <vscale x 4 x float> @llvm.aarch64.sve.fdiv.nxv4f32
%3 = "arm_sve.intr.fdiv"(%arg5, %2, %arg4) : (!llvm.vec<? x 4 x i1>,		%3 = "arm_sve.intr.fdiv"(%arg5, %2, %arg4) : (vector<[4]xi1>,
!llvm.vec<? x 4 x f32>,		vector<[4]xf32>,
!llvm.vec<? x 4 x f32>)		vector<[4]xf32>)
-> !llvm.vec<? x 4 x f32>		-> vector<[4]xf32>
llvm.return %3 : !llvm.vec<? x 4 x f32>		llvm.return %3 : vector<[4]xf32>
}		}

// CHECK-LABEL: define <vscale x 4 x i1> @arm_sve_mask_genf		// CHECK-LABEL: define <vscale x 4 x i1> @arm_sve_mask_genf
llvm.func @arm_sve_mask_genf(%arg0: !llvm.vec<? x 4 x f32>,		llvm.func @arm_sve_mask_genf(%arg0: vector<[4]xf32>,
%arg1: !llvm.vec<? x 4 x f32>)		%arg1: vector<[4]xf32>)
-> !llvm.vec<? x 4 x i1> {		-> vector<[4]xi1> {
// CHECK: fcmp oeq <vscale x 4 x float>		// CHECK: fcmp oeq <vscale x 4 x float>
%0 = llvm.fcmp "oeq" %arg0, %arg1 : !llvm.vec<? x 4 x f32>		%0 = llvm.fcmp "oeq" %arg0, %arg1 : vector<[4]xf32>
llvm.return %0 : !llvm.vec<? x 4 x i1>		llvm.return %0 : vector<[4]xi1>
}		}

// CHECK-LABEL: define <vscale x 4 x i1> @arm_sve_mask_geni		// CHECK-LABEL: define <vscale x 4 x i1> @arm_sve_mask_geni
llvm.func @arm_sve_mask_geni(%arg0: !llvm.vec<? x 4 x i32>,		llvm.func @arm_sve_mask_geni(%arg0: vector<[4]xi32>,
%arg1: !llvm.vec<? x 4 x i32>)		%arg1: vector<[4]xi32>)
-> !llvm.vec<? x 4 x i1> {		-> vector<[4]xi1> {
// CHECK: icmp uge <vscale x 4 x i32>		// CHECK: icmp uge <vscale x 4 x i32>
%0 = llvm.icmp "uge" %arg0, %arg1 : !llvm.vec<? x 4 x i32>		%0 = llvm.icmp "uge" %arg0, %arg1 : vector<[4]xi32>
llvm.return %0 : !llvm.vec<? x 4 x i1>		llvm.return %0 : vector<[4]xi1>
}		}

// CHECK-LABEL: define <vscale x 4 x i32> @arm_sve_abs_diff		// CHECK-LABEL: define <vscale x 4 x i32> @arm_sve_abs_diff
llvm.func @arm_sve_abs_diff(%arg0: !llvm.vec<? x 4 x i32>,		llvm.func @arm_sve_abs_diff(%arg0: vector<[4]xi32>,
%arg1: !llvm.vec<? x 4 x i32>)		%arg1: vector<[4]xi32>)
-> !llvm.vec<? x 4 x i32> {		-> vector<[4]xi32> {
// CHECK: sub <vscale x 4 x i32>		// CHECK: sub <vscale x 4 x i32>
%0 = llvm.sub %arg0, %arg0 : !llvm.vec<? x 4 x i32>		%0 = llvm.sub %arg0, %arg0 : vector<[4]xi32>
// CHECK: icmp sge <vscale x 4 x i32>		// CHECK: icmp sge <vscale x 4 x i32>
%1 = llvm.icmp "sge" %arg0, %arg1 : !llvm.vec<? x 4 x i32>		%1 = llvm.icmp "sge" %arg0, %arg1 : vector<[4]xi32>
// CHECK: icmp slt <vscale x 4 x i32>		// CHECK: icmp slt <vscale x 4 x i32>
%2 = llvm.icmp "slt" %arg0, %arg1 : !llvm.vec<? x 4 x i32>		%2 = llvm.icmp "slt" %arg0, %arg1 : vector<[4]xi32>
// CHECK: call <vscale x 4 x i32> @llvm.aarch64.sve.sub.nxv4i32		// CHECK: call <vscale x 4 x i32> @llvm.aarch64.sve.sub.nxv4i32
%3 = "arm_sve.intr.sub"(%1, %arg0, %arg1) : (!llvm.vec<? x 4 x i1>,		%3 = "arm_sve.intr.sub"(%1, %arg0, %arg1) : (vector<[4]xi1>,
!llvm.vec<? x 4 x i32>,		vector<[4]xi32>,
!llvm.vec<? x 4 x i32>)		vector<[4]xi32>)
-> !llvm.vec<? x 4 x i32>		-> vector<[4]xi32>
// CHECK: call <vscale x 4 x i32> @llvm.aarch64.sve.sub.nxv4i32		// CHECK: call <vscale x 4 x i32> @llvm.aarch64.sve.sub.nxv4i32
%4 = "arm_sve.intr.sub"(%2, %arg1, %arg0) : (!llvm.vec<? x 4 x i1>,		%4 = "arm_sve.intr.sub"(%2, %arg1, %arg0) : (vector<[4]xi1>,
!llvm.vec<? x 4 x i32>,		vector<[4]xi32>,
!llvm.vec<? x 4 x i32>)		vector<[4]xi32>)
-> !llvm.vec<? x 4 x i32>		-> vector<[4]xi32>
// CHECK: call <vscale x 4 x i32> @llvm.aarch64.sve.add.nxv4i32		// CHECK: call <vscale x 4 x i32> @llvm.aarch64.sve.add.nxv4i32
%5 = "arm_sve.intr.add"(%1, %0, %3) : (!llvm.vec<? x 4 x i1>,		%5 = "arm_sve.intr.add"(%1, %0, %3) : (vector<[4]xi1>,
!llvm.vec<? x 4 x i32>,		vector<[4]xi32>,
!llvm.vec<? x 4 x i32>)		vector<[4]xi32>)
-> !llvm.vec<? x 4 x i32>		-> vector<[4]xi32>
// CHECK: call <vscale x 4 x i32> @llvm.aarch64.sve.add.nxv4i32		// CHECK: call <vscale x 4 x i32> @llvm.aarch64.sve.add.nxv4i32
%6 = "arm_sve.intr.add"(%2, %5, %4) : (!llvm.vec<? x 4 x i1>,		%6 = "arm_sve.intr.add"(%2, %5, %4) : (vector<[4]xi1>,
!llvm.vec<? x 4 x i32>,		vector<[4]xi32>,
!llvm.vec<? x 4 x i32>)		vector<[4]xi32>)
-> !llvm.vec<? x 4 x i32>		-> vector<[4]xi32>
llvm.return %6 : !llvm.vec<? x 4 x i32>		llvm.return %6 : vector<[4]xi32>
}		}

// CHECK-LABEL: define void @memcopy		// CHECK-LABEL: define void @memcopy
llvm.func @memcopy(%arg0: !llvm.ptr<f32>, %arg1: !llvm.ptr<f32>,		llvm.func @memcopy(%arg0: !llvm.ptr<f32>, %arg1: !llvm.ptr<f32>,
%arg2: i64, %arg3: i64, %arg4: i64,		%arg2: i64, %arg3: i64, %arg4: i64,
%arg5: !llvm.ptr<f32>, %arg6: !llvm.ptr<f32>,		%arg5: !llvm.ptr<f32>, %arg6: !llvm.ptr<f32>,
%arg7: i64, %arg8: i64, %arg9: i64,		%arg7: i64, %arg8: i64, %arg9: i64,
%arg10: i64) {		%arg10: i64) {
Show All 30 Lines	%10 = llvm.insertvalue %arg8, %9[3, 0] : !llvm.struct<(ptr<f32>, ptr<f32>, i64,
array<1 x i64>,		array<1 x i64>,
array<1 x i64>)>		array<1 x i64>)>
%11 = llvm.insertvalue %arg9, %10[4, 0] : !llvm.struct<(ptr<f32>, ptr<f32>, i64,		%11 = llvm.insertvalue %arg9, %10[4, 0] : !llvm.struct<(ptr<f32>, ptr<f32>, i64,
array<1 x i64>,		array<1 x i64>,
array<1 x i64>)>		array<1 x i64>)>
%12 = llvm.mlir.constant(0 : index) : i64		%12 = llvm.mlir.constant(0 : index) : i64
%13 = llvm.mlir.constant(4 : index) : i64		%13 = llvm.mlir.constant(4 : index) : i64
// CHECK: [[VL:%[0-9]+]] = call i64 @llvm.vscale.i64()		// CHECK: [[VL:%[0-9]+]] = call i64 @llvm.vscale.i64()
%14 = "arm_sve.vscale"() : () -> i64		%14 = "llvm.intr.vscale"() : () -> i64
// CHECK: mul i64 [[VL]], 4		// CHECK: mul i64 [[VL]], 4
%15 = llvm.mul %14, %13 : i64		%15 = llvm.mul %14, %13 : i64
llvm.br ^bb1(%12 : i64)		llvm.br ^bb1(%12 : i64)
^bb1(%16: i64):		^bb1(%16: i64):
%17 = llvm.icmp "slt" %16, %arg10 : i64		%17 = llvm.icmp "slt" %16, %arg10 : i64
llvm.cond_br %17, ^bb2, ^bb3		llvm.cond_br %17, ^bb2, ^bb3
^bb2:		^bb2:
// CHECK: extractvalue { float, float, i64, [1 x i64], [1 x i64] }		// CHECK: extractvalue { float, float, i64, [1 x i64], [1 x i64] }
%18 = llvm.extractvalue %5[1] : !llvm.struct<(ptr<f32>, ptr<f32>, i64,		%18 = llvm.extractvalue %5[1] : !llvm.struct<(ptr<f32>, ptr<f32>, i64,
array<1 x i64>,		array<1 x i64>,
array<1 x i64>)>		array<1 x i64>)>
// CHECK: etelementptr float, float*		// CHECK: etelementptr float, float*
%19 = llvm.getelementptr %18[%16] : (!llvm.ptr<f32>, i64) -> !llvm.ptr<f32>		%19 = llvm.getelementptr %18[%16] : (!llvm.ptr<f32>, i64) -> !llvm.ptr<f32>
// CHECK: bitcast float* %{{[0-9]+}} to <vscale x 4 x float>*		// CHECK: bitcast float* %{{[0-9]+}} to <vscale x 4 x float>*
%20 = llvm.bitcast %19 : !llvm.ptr<f32> to !llvm.ptr<vec<? x 4 x f32>>		%20 = llvm.bitcast %19 : !llvm.ptr<f32> to !llvm.ptr<vector<[4]xf32>>
// CHECK: load <vscale x 4 x float>, <vscale x 4 x float>*		// CHECK: load <vscale x 4 x float>, <vscale x 4 x float>*
%21 = llvm.load %20 : !llvm.ptr<vec<? x 4 x f32>>		%21 = llvm.load %20 : !llvm.ptr<vector<[4]xf32>>
// CHECK: extractvalue { float, float, i64, [1 x i64], [1 x i64] }		// CHECK: extractvalue { float, float, i64, [1 x i64], [1 x i64] }
%22 = llvm.extractvalue %11[1] : !llvm.struct<(ptr<f32>, ptr<f32>, i64,		%22 = llvm.extractvalue %11[1] : !llvm.struct<(ptr<f32>, ptr<f32>, i64,
array<1 x i64>,		array<1 x i64>,
array<1 x i64>)>		array<1 x i64>)>
// CHECK: getelementptr float, float* %32		// CHECK: getelementptr float, float* %32
%23 = llvm.getelementptr %22[%16] : (!llvm.ptr<f32>, i64) -> !llvm.ptr<f32>		%23 = llvm.getelementptr %22[%16] : (!llvm.ptr<f32>, i64) -> !llvm.ptr<f32>
// CHECK: bitcast float* %33 to <vscale x 4 x float>*		// CHECK: bitcast float* %33 to <vscale x 4 x float>*
%24 = llvm.bitcast %23 : !llvm.ptr<f32> to !llvm.ptr<vec<? x 4 x f32>>		%24 = llvm.bitcast %23 : !llvm.ptr<f32> to !llvm.ptr<vector<[4]xf32>>
// CHECK: store <vscale x 4 x float> %{{[0-9]+}}, <vscale x 4 x float>* %{{[0-9]+}}		// CHECK: store <vscale x 4 x float> %{{[0-9]+}}, <vscale x 4 x float>* %{{[0-9]+}}
llvm.store %21, %24 : !llvm.ptr<vec<? x 4 x f32>>		llvm.store %21, %24 : !llvm.ptr<vector<[4]xf32>>
%25 = llvm.add %16, %15 : i64		%25 = llvm.add %16, %15 : i64
llvm.br ^bb1(%25 : i64)		llvm.br ^bb1(%25 : i64)
^bb3:		^bb3:
llvm.return		llvm.return
}		}

// CHECK-LABEL: define i64 @get_vector_scale()		// CHECK-LABEL: define i64 @get_vector_scale()
llvm.func @get_vector_scale() -> i64 {		llvm.func @get_vector_scale() -> i64 {
// CHECK: call i64 @llvm.vscale.i64()		// CHECK: call i64 @llvm.vscale.i64()
%0 = "arm_sve.vscale"() : () -> i64		%0 = "llvm.intr.vscale"() : () -> i64
llvm.return %0 : i64		llvm.return %0 : i64
}		}

This is an archive of the discontinued LLVM Phabricator instance.

[mlir][RFC] Add scalable dimensions to VectorTypeClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 394488

mlir/include/mlir/Dialect/ArmSVE/ArmSVE.td

mlir/include/mlir/Dialect/ArmSVE/ArmSVEDialect.h

mlir/include/mlir/Dialect/ArmSVE/ArmSVEOpBase.td

mlir/include/mlir/Dialect/LLVMIR/LLVMOps.td

mlir/include/mlir/Dialect/LLVMIR/LLVMTypes.h

mlir/include/mlir/Dialect/Vector/VectorOps.td

mlir/include/mlir/IR/BuiltinTypes.h

mlir/include/mlir/IR/BuiltinTypes.td

mlir/include/mlir/IR/OpBase.td

mlir/lib/Conversion/LLVMCommon/TypeConverter.cpp

mlir/lib/Conversion/VectorToLLVM/ConvertVectorToLLVM.cpp

mlir/lib/Dialect/Arithmetic/IR/ArithmeticOps.cpp

mlir/lib/Dialect/ArmSVE/IR/ArmSVEDialect.cpp

mlir/lib/Dialect/ArmSVE/Transforms/LegalizeForLLVMExport.cpp

mlir/lib/Dialect/LLVMIR/IR/LLVMDialect.cpp

mlir/lib/Dialect/LLVMIR/IR/LLVMTypes.cpp

mlir/lib/Dialect/StandardOps/IR/Ops.cpp

mlir/lib/IR/AsmPrinter.cpp

mlir/lib/IR/BuiltinAttributes.cpp

mlir/lib/IR/BuiltinTypes.cpp

mlir/lib/Parser/Parser.h

mlir/lib/Parser/TypeParser.cpp

mlir/lib/Target/LLVMIR/ModuleTranslation.cpp

mlir/lib/Target/LLVMIR/TypeToLLVM.cpp

mlir/test/Dialect/Arithmetic/ops.mlir

mlir/test/Dialect/ArmSVE/legalize-for-llvm.mlir

mlir/test/Dialect/ArmSVE/memcpy.mlir

mlir/test/Dialect/ArmSVE/roundtrip.mlir

mlir/test/Dialect/Builtin/invalid.mlir

mlir/test/Dialect/Builtin/ops.mlir

mlir/test/Dialect/Vector/ops.mlir

mlir/test/Dialect/Vector/vector-scalable-memcpy.mlir

mlir/test/Target/LLVMIR/arm-sve.mlir

[mlir][RFC] Add scalable dimensions to VectorType
ClosedPublic