This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
mlir/
-
include/mlir/
-
mlir/
-
Dialect/
-
SPIRV/IR/
-
IR/
-
SPIRVStructureOps.td
-
Shape/IR/
-
IR/
-
ShapeOps.td
-
StandardOps/IR/
-
IR/
-
Ops.td
-
IR/
-
BuiltinOps.h
2/2
BuiltinOps.td
10/11
OpAsmInterface.td
-
OpDefinition.h
-
OperationSupport.h
-
lib/
-
IR/
21/22
AsmPrinter.cpp
1/1
Operation.cpp
-
Parser/
4/5
Parser.cpp
1/1
ParserState.h
-
test/
-
Analysis/
-
test-shape-fn-report.mlir
-
Conversion/
-
GPUToNVVM/
-
gpu-to-nvvm.mlir
-
wmma-ops-to-nvvm.mlir
-
GPUToROCDL/
-
gpu-to-rocdl.mlir
-
VectorToROCDL/
-
vector-to-rocdl.mlir
-
Dialect/
-
Builtin/
-
canonicalize.mlir
-
Linalg/
-
drop-unit-extent-dims.mlir
-
reshape_control_fusion.mlir
-
Shape/
-
invalid.mlir
-
SparseTensor/
-
sparse_perm.mlir
-
sparse_perm_lower.mlir
-
IR/
-
invalid-func-op.mlir
-
invalid-module-op.mlir
-
invalid-ops.mlir
-
invalid.mlir
-
parser.mlir
-
traits.mlir
-
Transforms/
-
canonicalize-dce.mlir
-
canonicalize.mlir
-
constant-fold.mlir
-
cse.mlir
-
test-legalizer-full.mlir
-
lib/Dialect/Test/
-
Dialect/
-
Test/
2/2
TestOps.td

Differential D107236

Add a new interface allowing to set a default dialect to be used for printing/parsing regions
ClosedPublic

Authored by mehdi_amini on Jul 31 2021, 11:49 PM.

Download Raw Diff

Details

Reviewers

rriddle
antiagainst
aartbik
ftynse
jsetoain
jpienaar
nicolasvasilache
sjarus
herhut

Commits

rG387f95541bdc: Add a new interface allowing to set a default dialect to be used for…

Summary

Currently the builtin dialect is the default namespace used for parsing
and printing. As such module and func don't need to be prefixed.
In the case of some dialects that defines new regions for their own
purpose (like SpirV modules for example), it can be beneficial to
change the default dialect in order to improve readability.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

mehdi_amini created this revision.Jul 31 2021, 11:49 PM

Herald added a reviewer: rriddle. · View Herald TranscriptJul 31 2021, 11:49 PM

Herald added subscribers: Chia-hungDuan, dcaballe, cota and 16 others. · View Herald Transcript

mehdi_amini requested review of this revision.Jul 31 2021, 11:49 PM

Herald added a project: Restricted Project. · View Herald TranscriptJul 31 2021, 11:49 PM

Herald added subscribers: stephenneuendorffer, nicolasvasilache. · View Herald Transcript

Harbormaster completed remote builds in B117352: Diff 363336.Aug 1 2021, 12:35 AM

Fix bufferization of the stream and add doc

Harbormaster completed remote builds in B117363: Diff 363352.Aug 1 2021, 2:34 PM

Don't use the default dialect to strip prefix for generic operations
(also update tests)

Herald added a reviewer: antiagainst. · View Herald TranscriptAug 2 2021, 12:12 AM

Herald added a reviewer: aartbik. · View Herald Transcript

Herald added a reviewer: ftynse. · View Herald Transcript

Herald added a reviewer: jsetoain. · View Herald Transcript

Herald added subscribers: ThomasRaoux, AlexeySotkin, mravishankar. · View Herald Transcript

Harbormaster completed remote builds in B117388: Diff 363386.Aug 2 2021, 12:48 AM

Won't this also drop the occurrences of the string that looks like a prefix from other places than operation name? Everything goes through the same stream AFAICS, so something like my.op() attributes { my.attribute = "this.may.use.the.word.builtin.because", my.builtin.enabled = 1} will get affected where it shouldn't.

mlir/include/mlir/IR/OpImplementation.h
969–971 ↗	(On Diff #363386)
mlir/lib/IR/AsmPrinter.cpp
2322–2326
2343–2346	I can't match this comment with the code below.

In D107236#2919597, @ftynse wrote:

Won't this also drop the occurrences of the string that looks like a prefix from other places than operation name? Everything goes through the same stream AFAICS, so something like my.op() attributes { my.attribute = "this.may.use.the.word.builtin.because", my.builtin.enabled = 1} will get affected where it shouldn't.

I don't think so because we only filter the beginning of the stream and nothing else. The filter is immediately clear whether it matches or not the beginning of the stream. The filter is also set immediately before printing an operation. But I'll extend the test to show this!

Address comments

mlir/lib/IR/AsmPrinter.cpp
2343–2346	Ah, this is some explanation on the correctness of the code, I'll try to reformulate.

Harbormaster completed remote builds in B117505: Diff 363552.Aug 2 2021, 1:36 PM

Fix getDefaultDialectFromEnclosingOps

Herald added a subscriber: wrengr. · View Herald TranscriptAug 2 2021, 2:26 PM

mehdi_amini added inline comments.Aug 2 2021, 2:27 PM

mlir/include/mlir/IR/OpImplementation.h
969–971 ↗	(On Diff #363386)	Actually this edit was incorrect: if would skip the check on the current op

(some more minor fixes)

Harbormaster completed remote builds in B117530: Diff 363584.Aug 2 2021, 3:04 PM

Nice!

mlir/include/mlir/IR/OpAsmInterface.td
68	Can we instead just move this in the other interface? I don't think we need a special interface for this, especially given that OpAsmOpInterface is described as being a general parser/printer hook.
mlir/include/mlir/IR/OpImplementation.h
968 ↗	(On Diff #363584)	Can we move this to AsmPrinter.cpp? I don't see any uses outside of there, or is there some reason to have this publicly exposable?
971 ↗	(On Diff #363584)	Do we want it to be inherited? I would almost prefer that the default only apply to the immediate regions. It seems less surprising that way, and more contextualized. For example assuming we inherit default names, if I split out a nested region op from a reduced test case, I might need to re-add operation namespaces that were filtered out from the parent (as the parent may now be different).
mlir/lib/IR/AsmPrinter.cpp
2310	We discussed a little offline, but I think it would be better to just remove the constraint that custom op printers print the operation name at the beginning. I don't see a real reason for them to do that, and it would also match the behavior of the parser (which doesn't parse the operation name, for likely obvious reasons).
2433	Do we need this to be std::string? I would assume that we could enforce StringRef here (getDefaultDialect also returns StringRef).
mlir/lib/Parser/ParserState.h
91	Same comment here, can we use StringRef instead of std::string?

rriddle added inline comments.Aug 2 2021, 10:34 PM

mlir/include/mlir/IR/OpAsmInterface.td
83	Do we want to allow multiple default namespaces? Some projects might have several that all kind of co-mingle with each other.
mlir/include/mlir/IR/OpImplementation.h
973 ↗	(On Diff #363584)	Do we need to default to builtin? Can we instead default to nothing?
mlir/lib/IR/AsmPrinter.cpp
2433	I think I missed it, but how is this parser stack different from the one in the ParserState?

rriddle requested changes to this revision.Aug 2 2021, 10:34 PM

This revision now requires changes to proceed.Aug 2 2021, 10:34 PM

Address River's comment

Herald added a reviewer: jpienaar. · View Herald TranscriptAug 3 2021, 12:43 AM

mehdi_amini added inline comments.Aug 3 2021, 12:47 AM

mlir/include/mlir/IR/OpAsmInterface.td
83	I can't imagine what it'd look like right now: we'd have to guarantee these dialects don't have any pair of op with the same name to avoid conflicts. The main extension I could think of would be to take an integer as an input and return a different one per region attached to an op
mlir/include/mlir/IR/OpImplementation.h
971 ↗	(On Diff #363584)	Yeah I hesitated on this, we can start with a shallow model conservatively
973 ↗	(On Diff #363584)	That mean that we wouldn't be able to parse a top-level module right now though, do you want to change this now? (I'd be fine with this but I would land this in two revision though)
mlir/lib/IR/AsmPrinter.cpp
2310	Yeah I looked into it, one wrinkle though is that something like `Dialect::printOperation` can fail and we fallback to the generic operation printer. I'm not sure how to handle this?
2433	This is a printer stack, and the other is the parser one? They are equivalent otherwise. Maybe I missed what you were asking?

Harbormaster completed remote builds in B117571: Diff 363641.Aug 3 2021, 12:54 AM

rriddle added inline comments.Aug 3 2021, 1:00 AM

mlir/include/mlir/IR/OpImplementation.h
973 ↗	(On Diff #363584)	I think it's reasonable/fine to require the "builtin" namespace on the top-level module. I'm also okay with it being separate from this commit.
mlir/lib/IR/AsmPrinter.cpp
2310	Hmmm. Can we change Dialect::printOperation to act more like getParseOperationHook? That way we would know if it would succeed before calling it.
2433	Right right, missed that one was printer/one was parser.
mlir/lib/Parser/Parser.cpp
1846	Would it be better to keep StringRef here, but have another variable that is OperationName? That would remove the need to build a std::string, and the `OperationName` var could be passed in later when building the OperationState.
1869–1875	Shouldn't this one be `.str()`?

bondhugula added a subscriber: bondhugula.Aug 3 2021, 8:54 PM

bondhugula added inline comments.

mlir/include/mlir/IR/OpAsmInterface.td
56	under a given operation -> under this operation (cf. doc of `getAsmResultNames` above to align.)
56	dialect -> dialect prefix
57	Nit: Punctuation broken here.
58–59	if it was implementing this interface to return ... -> if this method returned `spv`. ^^ "it" is dangling here.
mlir/lib/IR/AsmPrinter.cpp
2316–2317	Nit: reflow to use width.
2361	Camel case?
2429	get -> gets / is

purpose (like SpirV modules for example), it can be beneficial to
change the default dialect in order to improve readability.

Does this really improve readability? Doesn't it create ambiguity for a reader as to which dialect's operation is being referring to (when you have the same names)? I might have misunderstood it but that's what the commit summary conveys to me.

Does this really improve readability? Doesn't it create ambiguity for a reader as to which dialect's operation is being referring to (when you have the same names)? I might have misunderstood it but that's what the commit summary conveys to me.

Yes you’re right. I think this is about how it gets used though.
I’d claim that it is less intrusive than the custom assembly that we allow (we actually had internal teams complaining about it: affine, linalg, scf, … all have different custom conventions).

If you remember the original affine functions, I don’t think operations were prefixed: it was later during generalization that we added the prefix. The prefix wasn’t for readability but for necessary disambiguating in the parser.

The idea here is that you can see it like an extension of custom assembly: instead of just controlling the immediate operation you can impact the immediately nested regions in a limited way.

My main motivation is the TensorFlow graph dialect right now: when I write tfg.graph { I’d like to avoid prefixing every single operation with tfg. inside the graph itself.
This isn’t a region that allows anything else anyway, but even if it did, the few outlier operation would be prefixed.

Address comments

mehdi_amini added inline comments.Aug 6 2021, 3:53 AM

mlir/include/mlir/IR/OpAsmInterface.td
57	I can't spot what you refer to here?
mlir/lib/IR/AsmPrinter.cpp
2310	(I'll look into this later)
2361	I am overriding the parent class from LLVM, can't do?
mlir/lib/Parser/Parser.cpp
1846	I tried but it's convoluted: OperationName does not have a default constructor.

Harbormaster completed remote builds in B118345: Diff 364746.Aug 6 2021, 4:31 AM

In D107236#2924729, @mehdi_amini wrote:

Does this really improve readability? Doesn't it create ambiguity for a reader as to which dialect's operation is being referring to (when you have the same names)? I might have misunderstood it but that's what the commit summary conveys to me.

If you remember the original affine functions, I don’t think operations were prefixed: it was later during generalization that we added the prefix. The prefix wasn’t for readability but for necessary disambiguating in the parser.

It did also disambiguate for the user though I'd say. Less of an issue with smaller number of dialects and all kept in head (we've also never had two dialects with i32 type for example, but if one saw i32 type one would not consider if from dialect or not)

The idea here is that you can see it like an extension of custom assembly: instead of just controlling the immediate operation you can impact the immediately nested regions in a limited way.

I think that is the question. This is like having a using namespace statement implicitly and attached to a region (well I think all regions of dialect specified ops?). So when looking at a dump a reader has to know which disambiguating rules to follow where implicitly (there is no visual cue). I'm assuming when an error is emitted we don't strip prefix (so that error is unambiguous without additional context).

And then Rivers question also come in to play: what if I have 3 dialects that I use equally in a given op's "first layer of ops" (as even TFG I believe allows nesting and ops inside the nested region may not be TFG), now do I have to decide based on preferred one.

I do agree that folks have complained about custom syntax being inconsistent, but that doesn't seem like a good argument for why this is good.

I do agree that folks have complained about custom syntax being inconsistent, but that doesn't seem like a good argument for why this is good.

Absolutely: the argument I made is that it is consistent with how we approached it in MLIR so far ; when the generic printer isn’t used you get an output that is readable optimally for folks familiar with a given dialect. And also that we provide this power to dialect authors, and trust them to use it responsibly.

(as even TFG I believe allows nesting and ops inside the nested region may not be TFG),

Nesting further under TFG shouldn’t be an issue, it would be a different region which may or may not use the same default (or none).

now do I have to decide based on preferred one.

Yes, or none: if three dialects are equally used in a region, why strip any of them? The fact that we can strip does not mean we should!
Other than Spirv that already restricts that only spirv ops are present, I don’t see any other place in-tree where we would use this feature right now.

In D107236#2924729, @mehdi_amini wrote:

Does this really improve readability? Doesn't it create ambiguity for a reader as to which dialect's operation is being referring to (when you have the same names)? I might have misunderstood it but that's what the commit summary conveys to me.

Yes you’re right. I think this is about how it gets used though.
I’d claim that it is less intrusive than the custom assembly that we allow (we actually had internal teams complaining about it: affine, linalg, scf, … all have different custom conventions).

If you remember the original affine functions, I don’t think operations were prefixed: it was later during generalization that we added the prefix. The prefix wasn’t for readability but for necessary disambiguating in the parser.

The idea here is that you can see it like an extension of custom assembly: instead of just controlling the immediate operation you can impact the immediately nested regions in a limited way.

My main motivation is the TensorFlow graph dialect right now: when I write tfg.graph { I’d like to avoid prefixing every single operation with tfg. inside the graph itself.
This isn’t a region that allows anything else anyway, but even if it did, the few outlier operation would be prefixed.

Then, the ambiguity that this adds due to the prefix being dropped based on the context appears really undesirable to me at first sight. It's more readable with the prefix for me. Of course, you are providing an option for a dialect to elide things, but this whole context sensitive printing (esp. for large regions) in the face of collisions can be quite painful. This also adds a layer of non-uniformity and surprises for people working across dialects and when folks encounter a new dialect. MLIR being an IR and not a user-facing programming language, I'm not sure you gain much by dropping a prefix like tfg to your ops.

Could you discuss this change on discourse? (Sorry if you already did and I missed it.) I feel we should apply brakes in this case on context sensitive elision and I'm a -1 here.

In D107236#2932063, @bondhugula wrote:

In D107236#2924729, @mehdi_amini wrote:

Does this really improve readability? Doesn't it create ambiguity for a reader as to which dialect's operation is being referring to (when you have the same names)? I might have misunderstood it but that's what the commit summary conveys to me.

Yes you’re right. I think this is about how it gets used though.
I’d claim that it is less intrusive than the custom assembly that we allow (we actually had internal teams complaining about it: affine, linalg, scf, … all have different custom conventions).

If you remember the original affine functions, I don’t think operations were prefixed: it was later during generalization that we added the prefix. The prefix wasn’t for readability but for necessary disambiguating in the parser.

The idea here is that you can see it like an extension of custom assembly: instead of just controlling the immediate operation you can impact the immediately nested regions in a limited way.

My main motivation is the TensorFlow graph dialect right now: when I write tfg.graph { I’d like to avoid prefixing every single operation with tfg. inside the graph itself.
This isn’t a region that allows anything else anyway, but even if it did, the few outlier operation would be prefixed.

Then, the ambiguity that this adds due to the prefix being dropped based on the context appears really undesirable to me at first sight. It's more readable with the prefix for me.

I think you'll have to elaborate with examples, because I don't see it really.

This also adds a layer of non-uniformity and surprises for people working across dialects and when folks encounter a new dialect. MLIR being an IR and not a user-facing programming language, I'm not sure you gain much by dropping a prefix like tfg to your ops.

The exact same argument applies to the custom assembly used by most dialect I believe.

Could you discuss this change on discourse? (Sorry if you already did and I missed it.)

Will do!

Will do!

https://llvm.discourse.group/t/rfc-allowing-to-set-a-default-dialect-to-be-used-for-printing-parsing-regions/4039

GMNGeoffrey added a subscriber: GMNGeoffrey.Aug 6 2021, 5:07 PM

GMNGeoffrey added inline comments.

mlir/test/lib/Dialect/Test/TestOps.td
637–638	Comment is copy-pasta :-)

lattner added a subscriber: lattner.Aug 25 2021, 2:47 PM

lattner added inline comments.

mlir/lib/IR/AsmPrinter.cpp
2310	We discussed a little offline, but I think it would be better to just remove the constraint that custom op printers print the operation name at the beginning. I don't see a real reason for them to do that, and it would also match the behavior of the parser (which doesn't parse the operation name, for likely obvious reasons). I agree completely, the stream approach here is too-clever. It is also a pain (and pointless) to print the op name anyway, it would be far better to have the asmprinter do this by default.

Rebase on top of D108804 to remove the "filtering stream" now that the framework has direct control on printing the op name

Herald added a reviewer: nicolasvasilache. · View Herald TranscriptAug 27 2021, 7:58 PM

Herald added a reviewer: sjarus. · View Herald Transcript

Herald added subscribers: armkevincheng, jsmolens, eric-k256. · View Herald Transcript

Harbormaster completed remote builds in B121577: Diff 369223.Aug 27 2021, 7:59 PM

Address Geoff's comment

Harbormaster completed remote builds in B121578: Diff 369224.Aug 27 2021, 8:05 PM

mehdi_amini marked an inline comment as done.Aug 27 2021, 8:05 PM

rriddle added inline comments.Aug 28 2021, 12:42 PM

mlir/include/mlir/IR/OpAsmInterface.td
56
58
59–61	I don't think last statement is up-to-date. An empty string means no elision right? not that it is inherited.
mlir/lib/IR/AsmPrinter.cpp
2428
2671–2678	You could also use llvm::make_scope_exit instead.
2682	Shouldn't we push an empty string here instead?
mlir/lib/IR/Operation.cpp
651	Can we add a TODO/FIXME to kill this line?
mlir/lib/Parser/Parser.cpp
1892	Same comment here: why do we use builtin as the fallback here? I would expect an empty default dialect to mean "no ellision".
1903–1910	You can use llvm::make_scope_exit here instead.

Address River's comment

Herald added a reviewer: herhut. · View Herald TranscriptAug 30 2021, 4:51 PM

Herald added subscribers: csigg, jholewinski. · View Herald Transcript

mehdi_amini added inline comments.Aug 30 2021, 4:51 PM

mlir/lib/IR/AsmPrinter.cpp
2671–2678	Grahh: believe me I searched for this utility and couldn't find it and concluded that my memory of its existence was bogus...

Harbormaster completed remote builds in B121845: Diff 369580.Aug 30 2021, 4:51 PM

Looks great, thanks!

mlir/include/mlir/IR/BuiltinOps.td
163–167
216–220
mlir/lib/IR/AsmPrinter.cpp
375	Parameter comments should have a =
2522–2523	?
mlir/test/lib/Dialect/Test/TestOps.td
637

Address River's comment

Harbormaster completed remote builds in B121954: Diff 369730.Aug 31 2021, 10:38 AM

This revision was not accepted when it landed; it landed in state Needs Review.Aug 31 2021, 10:53 AM

This revision was landed with ongoing or failed builds.

Closed by commit rG387f95541bdc: Add a new interface allowing to set a default dialect to be used for… (authored by mehdi_amini). · Explain Why

This revision was automatically updated to reflect the committed changes.

mehdi_amini added a commit: rG387f95541bdc: Add a new interface allowing to set a default dialect to be used for….

silvas added a subscriber: silvas.Sep 3 2021, 1:33 PM

silvas added inline comments.

mlir/include/mlir/IR/OpImplementation.h
973 ↗	(On Diff #363584)	FYI, for future reference, this change was disruptive for us -- we had already updated tests to use builtin.func (in CHECK lines, etc.) or written new tests that were natively checking for builtin.func. After this change, we had to change it back.

silvas added inline comments.Sep 3 2021, 1:44 PM

mlir/include/mlir/IR/OpImplementation.h
973 ↗	(On Diff #363584)	(not sure if it was exactly this change, but something caused `builtin.func` to print as just `func` since our last integrate.)

mehdi_amini added inline comments.Sep 3 2021, 1:49 PM

mlir/include/mlir/IR/OpImplementation.h
973 ↗	(On Diff #363584)	This patch is the culprit indeed. Something I'm not sure, is why was so many test changed to CHECK on builtin.func a few weeks ago when they could have continued to check for `func` alone? (I had to update many tests inside Google as well). Before: you could parse without the builtin prefix but it would get printed, while now it'll be omitted if it can be parsed basically.

silvas added inline comments.Sep 3 2021, 2:22 PM

mlir/include/mlir/IR/OpImplementation.h
973 ↗	(On Diff #363584)	I think most tests are written by generate-test-checks.py which isn't smart enough to know that ;)

Revision Contents

Path

Size

mlir/

include/

mlir/

Dialect/

SPIRV/

IR/

SPIRVStructureOps.td

7 lines

Shape/

IR/

ShapeOps.td

2 lines

StandardOps/

IR/

Ops.td

3 lines

IR/

1 line

14 lines

14 lines

10 lines

7 lines

lib/

IR/

AsmPrinter.cpp

20 lines

Operation.cpp

8 lines

Parser/

Parser.cpp

36 lines

ParserState.h

7 lines

test/

Analysis/

test-shape-fn-report.mlir

2 lines

Conversion/

GPUToNVVM/

gpu-to-nvvm.mlir

46 lines

wmma-ops-to-nvvm.mlir

10 lines

GPUToROCDL/

gpu-to-rocdl.mlir

42 lines

VectorToROCDL/

vector-to-rocdl.mlir

12 lines

Dialect/

Builtin/

canonicalize.mlir

8 lines

Linalg/

drop-unit-extent-dims.mlir

2 lines

reshape_control_fusion.mlir

4 lines

Shape/

invalid.mlir

6 lines

SparseTensor/

sparse_perm.mlir

4 lines

sparse_perm_lower.mlir

4 lines

IR/

invalid-func-op.mlir

10 lines

invalid-module-op.mlir

4 lines

2 lines

2 lines

22 lines

4 lines

Transforms/

canonicalize-dce.mlir

2 lines

canonicalize.mlir

2 lines

constant-fold.mlir

2 lines

cse.mlir

2 lines

test-legalizer-full.mlir

4 lines

lib/

Dialect/

Test/

TestOps.td

16 lines

Diff 369730

mlir/include/mlir/Dialect/SPIRV/IR/SPIRVStructureOps.td

Show All 18 Lines
include "mlir/IR/OpAsmInterface.td"		include "mlir/IR/OpAsmInterface.td"
include "mlir/IR/SymbolInterfaces.td"		include "mlir/IR/SymbolInterfaces.td"
include "mlir/Interfaces/CallInterfaces.td"		include "mlir/Interfaces/CallInterfaces.td"
include "mlir/Interfaces/SideEffectInterfaces.td"		include "mlir/Interfaces/SideEffectInterfaces.td"

// -----		// -----

def SPV_AddressOfOp : SPV_Op<"mlir.addressof",		def SPV_AddressOfOp : SPV_Op<"mlir.addressof",
[DeclareOpInterfaceMethods<OpAsmOpInterface>, InFunctionScope, NoSideEffect]> {		[DeclareOpInterfaceMethods<OpAsmOpInterface, ["getAsmResultNames"]>,
		InFunctionScope, NoSideEffect]> {
let summary = "Get the address of a global variable.";		let summary = "Get the address of a global variable.";

let description = [{		let description = [{
Variables in module scope are defined using symbol names. This op generates		Variables in module scope are defined using symbol names. This op generates
an SSA value that can be used to refer to the symbol within function scope		an SSA value that can be used to refer to the symbol within function scope
for use in ops that expect an SSA value. This operation has no corresponding		for use in ops that expect an SSA value. This operation has no corresponding
SPIR-V instruction; it's merely used for modelling purpose in the SPIR-V		SPIR-V instruction; it's merely used for modelling purpose in the SPIR-V
dialect. Since variables in module scope in SPIR-V dialect are of pointer		dialect. Since variables in module scope in SPIR-V dialect are of pointer
Show All 29 Lines	def SPV_AddressOfOp : SPV_Op<"mlir.addressof",
let builders = [OpBuilder<(ins "spirv::GlobalVariableOp":$var)>];		let builders = [OpBuilder<(ins "spirv::GlobalVariableOp":$var)>];

let assemblyFormat = "$variable attr-dict `:` type($pointer)";		let assemblyFormat = "$variable attr-dict `:` type($pointer)";
}		}

// -----		// -----

def SPV_ConstantOp : SPV_Op<"Constant",		def SPV_ConstantOp : SPV_Op<"Constant",
[ConstantLike, DeclareOpInterfaceMethods<OpAsmOpInterface>, NoSideEffect]> {		[ConstantLike,
		DeclareOpInterfaceMethods<OpAsmOpInterface, ["getAsmResultNames"]>,
		NoSideEffect]> {
let summary = "The op that declares a SPIR-V normal constant";		let summary = "The op that declares a SPIR-V normal constant";

let description = [{		let description = [{
This op declares a SPIR-V normal constant. SPIR-V has multiple constant		This op declares a SPIR-V normal constant. SPIR-V has multiple constant
instructions covering different constant types:		instructions covering different constant types:

* `OpConstantTrue` and `OpConstantFalse` for boolean constants		* `OpConstantTrue` and `OpConstantFalse` for boolean constants
* `OpConstant` for scalar constants		* `OpConstant` for scalar constants
▲ Show 20 Lines • Show All 687 Lines • Show Last 20 Lines

mlir/include/mlir/Dialect/Shape/IR/ShapeOps.td

Show First 20 Lines • Show All 129 Lines • ▼ Show 20 Lines	let extraClassDeclaration = [{
// InferTypeOpInterface:		// InferTypeOpInterface:
static bool isCompatibleReturnTypes(TypeRange l, TypeRange r);		static bool isCompatibleReturnTypes(TypeRange l, TypeRange r);
}];		}];
}		}

def Shape_ConstSizeOp : Shape_Op<"const_size", [		def Shape_ConstSizeOp : Shape_Op<"const_size", [
ConstantLike,		ConstantLike,
NoSideEffect,		NoSideEffect,
DeclareOpInterfaceMethods<OpAsmOpInterface>		DeclareOpInterfaceMethods<OpAsmOpInterface, ["getAsmResultNames"]>
]> {		]> {
let summary = "Creates a constant of type `shape.size`";		let summary = "Creates a constant of type `shape.size`";
let description = [{		let description = [{
Creates a `shape.size` type representing the constant size given by `value`.		Creates a `shape.size` type representing the constant size given by `value`.

```mlir		```mlir
%x = shape.const_size 10		%x = shape.const_size 10
```		```
▲ Show 20 Lines • Show All 918 Lines • Show Last 20 Lines

mlir/include/mlir/Dialect/StandardOps/IR/Ops.td

Show First 20 Lines • Show All 1,031 Lines • ▼ Show 20 Lines	def CondBranchOp : Std_Op<"cond_br",
}];		}];
}		}

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// ConstantOp		// ConstantOp
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

def ConstantOp : Std_Op<"constant",		def ConstantOp : Std_Op<"constant",
[ConstantLike, NoSideEffect, DeclareOpInterfaceMethods<OpAsmOpInterface>]> {		[ConstantLike, NoSideEffect,
		DeclareOpInterfaceMethods<OpAsmOpInterface, ["getAsmResultNames"]>]> {
let summary = "constant";		let summary = "constant";
let description = [{		let description = [{
Syntax:		Syntax:

```		```
operation ::= ssa-id `=` `std.constant` attribute-value `:` type		operation ::= ssa-id `=` `std.constant` attribute-value `:` type
```		```

▲ Show 20 Lines • Show All 1,054 Lines • Show Last 20 Lines

mlir/include/mlir/IR/BuiltinOps.h

	//===- BuiltinOps.h - MLIR Builtin Operations -------------------- C++ --===//			//===- BuiltinOps.h - MLIR Builtin Operations -------------------- C++ --===//
	//			//
	// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.			// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
	// See https://llvm.org/LICENSE.txt for license information.			// See https://llvm.org/LICENSE.txt for license information.
	// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception			// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//
	//			//
	// This file contains the Builtin dialect's operations.			// This file contains the Builtin dialect's operations.
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	#ifndef MLIR_IR_BUILTINOPS_H_			#ifndef MLIR_IR_BUILTINOPS_H_
	#define MLIR_IR_BUILTINOPS_H_			#define MLIR_IR_BUILTINOPS_H_

	#include "mlir/IR/FunctionSupport.h"			#include "mlir/IR/FunctionSupport.h"
				#include "mlir/IR/OpImplementation.h"
	#include "mlir/IR/OwningOpRef.h"			#include "mlir/IR/OwningOpRef.h"
	#include "mlir/IR/RegionKindInterface.h"			#include "mlir/IR/RegionKindInterface.h"
	#include "mlir/IR/SymbolTable.h"			#include "mlir/IR/SymbolTable.h"
	#include "mlir/Interfaces/CallInterfaces.h"			#include "mlir/Interfaces/CallInterfaces.h"
	#include "mlir/Interfaces/CastInterfaces.h"			#include "mlir/Interfaces/CastInterfaces.h"
	#include "mlir/Interfaces/DataLayoutInterfaces.h"			#include "mlir/Interfaces/DataLayoutInterfaces.h"
	#include "mlir/Interfaces/SideEffectInterfaces.h"			#include "mlir/Interfaces/SideEffectInterfaces.h"
	#include "llvm/Support/PointerLikeTypeTraits.h"			#include "llvm/Support/PointerLikeTypeTraits.h"
	▲ Show 20 Lines • Show All 70 Lines • Show Last 20 Lines

mlir/include/mlir/IR/BuiltinOps.td

Show All 9 Lines

// necessary for the validity of and defining the IR.

//===----------------------------------------------------------------------===//

#ifndef BUILTIN_OPS

#define BUILTIN_OPS

include "mlir/IR/BuiltinDialect.td"

include "mlir/IR/OpAsmInterface.td"

include "mlir/IR/RegionKindInterface.td"

include "mlir/IR/SymbolInterfaces.td"

include "mlir/Interfaces/CallInterfaces.td"

include "mlir/Interfaces/CastInterfaces.td"

include "mlir/Interfaces/DataLayoutInterfaces.td"

include "mlir/Interfaces/SideEffectInterfaces.td"

// Base class for Builtin dialect ops.

▲ Show 20 Lines • Show All 128 Lines • ▼ Show 20 Lines

]> {

let printer = [{ return ::print(*this, p); }];

let verifier = [{ return ::verify(*this); }];

}

//===----------------------------------------------------------------------===//

// ModuleOp

//===----------------------------------------------------------------------===//

def ModuleOp : Builtin_Op<"module", [

AffineScope, IsolatedFromAbove, NoRegionArguments, SymbolTable, Symbol]

AffineScope, IsolatedFromAbove, NoRegionArguments, SymbolTable, Symbol,

# GraphRegionNoTerminator.traits> {

OpAsmOpInterface

] # GraphRegionNoTerminator.traits> {

let summary = "A top level container operation";

rriddleUnsubmitted

Done

//===----------------------------------------------------------------------===//

def ModuleOp : Builtin_Op<"module", [

- AffineScope, IsolatedFromAbove, NoRegionArguments, SymbolTable, Symbol,

- OpAsmOpInterface] # GraphRegionNoTerminator.traits> {

+ AffineScope, IsolatedFromAbove, NoRegionArguments, SymbolTable, Symbol,

+ OpAsmOpInterface

+ ] # GraphRegionNoTerminator.traits> {

let summary = "A top level container operation";

let description = [{

rriddle:

let description = [{

A `module` represents a top-level container operation. It contains a single

[graph region](../LangRef.md#control-flow-and-ssacfg-regions) containing a single block

which can contain any operations and does not have a terminator. Operations

within this region cannot implicitly capture values defined outside the module,

i.e. Modules are [IsolatedFromAbove](../Traits.md#isolatedfromabove). Modules have

an optional [symbol name](../SymbolsAndSymbolTables.md) which can be used to refer

to them in operations.

Show All 27 Lines

let extraClassDeclaration = [{

/// A ModuleOp may optionally define a symbol.

bool isOptionalSymbol() { return true; }

//===------------------------------------------------------------------===//

// DataLayoutOpInterface Methods

//===------------------------------------------------------------------===//

DataLayoutSpecInterface getDataLayoutSpec();

//===------------------------------------------------------------------===//

// OpAsmOpInterface Methods

//===------------------------------------------------------------------===//

static ::llvm::StringRef getDefaultDialect() {

return "builtin";

}

}];

let verifier = [{ return ::verify(*this); }];

rriddleUnsubmitted

Done

//===------------------------------------------------------------------===//

static ::llvm::StringRef getDefaultDialect() {

return "builtin";

}

}];

let verifier = [{ return ::verify(*this); }];

rriddle:

// We need to ensure the block inside the region is properly terminated;

// the auto-generated builders do not guarantee that.

let skipDefaultBuilders = 1;

}

//===----------------------------------------------------------------------===//

// UnrealizedConversionCastOp

▲ Show 20 Lines • Show All 48 Lines • Show Last 20 Lines

mlir/include/mlir/IR/OpAsmInterface.td

Show First 20 Lines • Show All 41 Lines • ▼ Show 20 Lines InterfaceMethod<[{

``` ```

This would print the operation as follows: This would print the operation as follows:

```mlir ```mlir

%first_result, %middle_results:2, %0 = "my.op" ... %first_result, %middle_results:2, %0 = "my.op" ...

``` ```

}], }],

"void", "getAsmResultNames", (ins "::mlir::OpAsmSetValueNameFn":$setNameFn) "void", "getAsmResultNames",

(ins "::mlir::OpAsmSetValueNameFn":$setNameFn),

"", ";"

StaticInterfaceMethod<[{

Return the default dialect used when printing/parsing operations in

regions nested under this operation. This allows for eliding the dialect

bondhugulaUnsubmitted

Done

under a given operation -> under this operation

(cf. doc of getAsmResultNames above to align.)

bondhugula: under a given operation -> under this operation (cf. doc of `getAsmResultNames` above to align.

bondhugulaUnsubmitted

Done

dialect -> dialect prefix

bondhugula: dialect -> dialect prefix

rriddleUnsubmitted

Done

Return the default dialect used when printing/parsing operations in

- regions nested under this operation. This allows to elide the dialect

+ regions nested under this operation. This allows for eliding the dialect

prefix from the operation name, for example it would be possible to omit

rriddle:

prefix from the operation name, for example it would be possible to omit

bondhugulaUnsubmitted

Not Done

Nit: Punctuation broken here.

bondhugula: Nit: Punctuation broken here.

mehdi_aminiAuthorUnsubmitted

Done

I can't spot what you refer to here?

mehdi_amini: I can't spot what you refer to here?

the `spv.` prefix from all operations within a SpirV module if this method

rriddleUnsubmitted

Done

prefix from the operation name, for example it would be possible to omit

- the `spv.` prefix from all operations in a SpirV module if this method

+ the `spv.` prefix from all operations within a SpirV module if this method

returned `spv`. The default implementation

rriddle:

returned `spv`. The default implementation returns an empty string which

bondhugulaUnsubmitted

Done

if it was implementing this interface to return ... -> if this method returned spv.

^^

"it" is dangling here.

bondhugula: if it was implementing this interface to return ... -> if this method returned `spv`. ^^…

is ignored.

}],

rriddleUnsubmitted

Done

I don't think last statement is up-to-date. An empty string means no elision right? not that it is inherited.

rriddle: I don't think last statement is up-to-date. An empty string means no elision right? not that it…

"StringRef", "getDefaultDialect", (ins), "", "return \"\";"

>, >,

]; ];

} }

#endif // MLIR_OPASMINTERFACE #endif // MLIR_OPASMINTERFACE

rriddleUnsubmitted

Done

Can we instead just move this in the other interface? I don't think we need a special interface for this, especially given that OpAsmOpInterface is described as being a general parser/printer hook.

rriddle: Can we instead just move this in the other interface? I don't think we need a special interface…

rriddleUnsubmitted

Done

Do we want to allow multiple default namespaces? Some projects might have several that all kind of co-mingle with each other.

rriddle: Do we want to allow multiple default namespaces? Some projects might have several that all kind…

mehdi_aminiAuthorUnsubmitted

Done

I can't imagine what it'd look like right now: we'd have to guarantee these dialects don't have any pair of op with the same name to avoid conflicts.
The main extension I could think of would be to take an integer as an input and return a different one per region attached to an op

mehdi_amini: I can't imagine what it'd look like right now: we'd have to guarantee these dialects don't have…

mlir/include/mlir/IR/OpDefinition.h

Show First 20 Lines • Show All 174 Lines • ▼ Show 20 Lines	protected:

/// Unless overridden, the custom assembly form of an op is always rejected.		/// Unless overridden, the custom assembly form of an op is always rejected.
/// Op implementations should implement this to return failure.		/// Op implementations should implement this to return failure.
/// On success, they should fill in result with the fields to use.		/// On success, they should fill in result with the fields to use.
static ParseResult parse(OpAsmParser &parser, OperationState &result);		static ParseResult parse(OpAsmParser &parser, OperationState &result);

// The fallback for the printer is to print it the generic assembly form.		// The fallback for the printer is to print it the generic assembly form.
static void print(Operation *op, OpAsmPrinter &p);		static void print(Operation *op, OpAsmPrinter &p);
static void printOpName(Operation *op, OpAsmPrinter &p);		static void printOpName(Operation *op, OpAsmPrinter &p,
		StringRef defaultDialect);

/// Mutability management is handled by the OpWrapper/OpConstWrapper classes,		/// Mutability management is handled by the OpWrapper/OpConstWrapper classes,
/// so we can cast it away here.		/// so we can cast it away here.
explicit OpState(Operation *state) : state(state) {}		explicit OpState(Operation *state) : state(state) {}

private:		private:
Operation *state;		Operation *state;

▲ Show 20 Lines • Show All 1,580 Lines • ▼ Show 20 Lines	static AbstractOperation::PrintAssemblyFn getPrintAssemblyFn() {
return getPrintAssemblyFnImpl<ConcreteType>();		return getPrintAssemblyFnImpl<ConcreteType>();
}		}
/// The internal implementation of `getPrintAssemblyFn` that is invoked when		/// The internal implementation of `getPrintAssemblyFn` that is invoked when
/// the concrete operation does not define a `print` method.		/// the concrete operation does not define a `print` method.
template <typename ConcreteOpT>		template <typename ConcreteOpT>
static std::enable_if_t<!detect_has_print<ConcreteOpT>::value,		static std::enable_if_t<!detect_has_print<ConcreteOpT>::value,
AbstractOperation::PrintAssemblyFn>		AbstractOperation::PrintAssemblyFn>
getPrintAssemblyFnImpl() {		getPrintAssemblyFnImpl() {
return [](Operation *op, OpAsmPrinter &printer) {		return [](Operation *op, OpAsmPrinter &printer, StringRef defaultDialect) {
return OpState::print(op, printer);		return OpState::print(op, printer);
};		};
}		}
/// The internal implementation of `getPrintAssemblyFn` that is invoked when		/// The internal implementation of `getPrintAssemblyFn` that is invoked when
/// the concrete operation defines a `print` method.		/// the concrete operation defines a `print` method.
template <typename ConcreteOpT>		template <typename ConcreteOpT>
static std::enable_if_t<detect_has_print<ConcreteOpT>::value,		static std::enable_if_t<detect_has_print<ConcreteOpT>::value,
AbstractOperation::PrintAssemblyFn>		AbstractOperation::PrintAssemblyFn>
getPrintAssemblyFnImpl() {		getPrintAssemblyFnImpl() {
return &printAssembly;		return &printAssembly;
}		}
static void printAssembly(Operation *op, OpAsmPrinter &p) {		static void printAssembly(Operation *op, OpAsmPrinter &p,
OpState::printOpName(op, p);		StringRef defaultDialect) {
		OpState::printOpName(op, p, defaultDialect);
return cast<ConcreteType>(op).print(p);		return cast<ConcreteType>(op).print(p);
}		}
/// Implementation of `VerifyInvariantsFn` AbstractOperation hook.		/// Implementation of `VerifyInvariantsFn` AbstractOperation hook.
static AbstractOperation::VerifyInvariantsFn getVerifyInvariantsFn() {		static AbstractOperation::VerifyInvariantsFn getVerifyInvariantsFn() {
return &verifyInvariants;		return &verifyInvariants;
}		}

static constexpr bool hasNoDataMembers() {		static constexpr bool hasNoDataMembers() {
▲ Show 20 Lines • Show All 106 Lines • Show Last 20 Lines

mlir/include/mlir/IR/OperationSupport.h

Show First 20 Lines • Show All 71 Lines • ▼ Show 20 Lines	public:
using GetCanonicalizationPatternsFn =		using GetCanonicalizationPatternsFn =
llvm::unique_function<void(RewritePatternSet &, MLIRContext *) const>;		llvm::unique_function<void(RewritePatternSet &, MLIRContext *) const>;
using FoldHookFn = llvm::unique_function<LogicalResult(		using FoldHookFn = llvm::unique_function<LogicalResult(
Operation *, ArrayRef<Attribute>, SmallVectorImpl<OpFoldResult> &) const>;		Operation *, ArrayRef<Attribute>, SmallVectorImpl<OpFoldResult> &) const>;
using HasTraitFn = llvm::unique_function<bool(TypeID) const>;		using HasTraitFn = llvm::unique_function<bool(TypeID) const>;
using ParseAssemblyFn =		using ParseAssemblyFn =
llvm::unique_function<ParseResult(OpAsmParser &, OperationState &) const>;		llvm::unique_function<ParseResult(OpAsmParser &, OperationState &) const>;
using PrintAssemblyFn =		using PrintAssemblyFn =
llvm::unique_function<void(Operation *, OpAsmPrinter &) const>;		llvm::unique_function<void(Operation *, OpAsmPrinter &, StringRef) const>;
using VerifyInvariantsFn =		using VerifyInvariantsFn =
llvm::unique_function<LogicalResult(Operation *) const>;		llvm::unique_function<LogicalResult(Operation *) const>;

/// This is the name of the operation.		/// This is the name of the operation.
const Identifier name;		const Identifier name;

/// This is the dialect that this operation belongs to.		/// This is the dialect that this operation belongs to.
Dialect &dialect;		Dialect &dialect;

/// The unique identifier of the derived Op class.		/// The unique identifier of the derived Op class.
TypeID typeID;		TypeID typeID;

/// Use the specified object to parse this ops custom assembly format.		/// Use the specified object to parse this ops custom assembly format.
ParseResult parseAssembly(OpAsmParser &parser, OperationState &result) const;		ParseResult parseAssembly(OpAsmParser &parser, OperationState &result) const;

/// Return the static hook for parsing this operation assembly.		/// Return the static hook for parsing this operation assembly.
const ParseAssemblyFn &getParseAssemblyFn() const { return parseAssemblyFn; }		const ParseAssemblyFn &getParseAssemblyFn() const { return parseAssemblyFn; }

/// This hook implements the AsmPrinter for this operation.		/// This hook implements the AsmPrinter for this operation.
void printAssembly(Operation *op, OpAsmPrinter &p) const {		void printAssembly(Operation *op, OpAsmPrinter &p,
return printAssemblyFn(op, p);		StringRef defaultDialect) const {
		return printAssemblyFn(op, p, defaultDialect);
}		}

/// This hook implements the verifier for this operation. It should emits an		/// This hook implements the verifier for this operation. It should emits an
/// error message and returns failure if a problem is detected, or returns		/// error message and returns failure if a problem is detected, or returns
/// success if everything is ok.		/// success if everything is ok.
LogicalResult verifyInvariants(Operation *op) const {		LogicalResult verifyInvariants(Operation *op) const {
return verifyInvariantsFn(op);		return verifyInvariantsFn(op);
}		}
▲ Show 20 Lines • Show All 1,038 Lines • Show Last 20 Lines

mlir/lib/IR/AsmPrinter.cpp

Show All 21 Lines

#include "mlir/IR/MLIRContext.h" #include "mlir/IR/MLIRContext.h"

#include "mlir/IR/OpImplementation.h" #include "mlir/IR/OpImplementation.h"

#include "mlir/IR/Operation.h" #include "mlir/IR/Operation.h"

#include "mlir/IR/SubElementInterfaces.h" #include "mlir/IR/SubElementInterfaces.h"

#include "llvm/ADT/APFloat.h" #include "llvm/ADT/APFloat.h"

#include "llvm/ADT/DenseMap.h" #include "llvm/ADT/DenseMap.h"

#include "llvm/ADT/MapVector.h" #include "llvm/ADT/MapVector.h"

#include "llvm/ADT/STLExtras.h" #include "llvm/ADT/STLExtras.h"

#include "llvm/ADT/ScopeExit.h"

#include "llvm/ADT/ScopedHashTable.h" #include "llvm/ADT/ScopedHashTable.h"

#include "llvm/ADT/SetVector.h" #include "llvm/ADT/SetVector.h"

#include "llvm/ADT/SmallString.h" #include "llvm/ADT/SmallString.h"

#include "llvm/ADT/StringExtras.h" #include "llvm/ADT/StringExtras.h"

#include "llvm/ADT/StringSet.h" #include "llvm/ADT/StringSet.h"

#include "llvm/ADT/TypeSwitch.h" #include "llvm/ADT/TypeSwitch.h"

#include "llvm/Support/CommandLine.h" #include "llvm/Support/CommandLine.h"

#include "llvm/Support/Endian.h" #include "llvm/Support/Endian.h"

▲ Show 20 Lines • Show All 328 Lines • ▼ Show 20 Lines void print(Operation *op) {

if (printerFlags.shouldPrintDebugInfo()) if (printerFlags.shouldPrintDebugInfo())

initializer.visit(op->getLoc(), /*canBeDeferred=*/true); initializer.visit(op->getLoc(), /*canBeDeferred=*/true);

// If requested, always print the generic form. // If requested, always print the generic form.

if (!printerFlags.shouldPrintGenericOpForm()) { if (!printerFlags.shouldPrintGenericOpForm()) {

// Check to see if this is a known operation. If so, use the registered // Check to see if this is a known operation. If so, use the registered

// custom printer hook. // custom printer hook.

if (auto *opInfo = op->getAbstractOperation()) { if (auto *opInfo = op->getAbstractOperation()) {

opInfo->printAssembly(op, *this); opInfo->printAssembly(op, *this, /*defaultDialect=*/"");

rriddleUnsubmitted

Done

Parameter comments should have a =

rriddle: Parameter comments should have a =

return; return;

} }

// Otherwise print with the generic assembly form. // Otherwise print with the generic assembly form.

printGenericOp(op); printGenericOp(op);

} }

▲ Show 20 Lines • Show All 1,918 Lines • ▼ Show 20 Lines

//===----------------------------------------------------------------------===// //===----------------------------------------------------------------------===//

// OperationPrinter // OperationPrinter

//===----------------------------------------------------------------------===// //===----------------------------------------------------------------------===//

namespace { namespace {

/// This class contains the logic for printing operations, regions, and blocks. /// This class contains the logic for printing operations, regions, and blocks.

class OperationPrinter : public ModulePrinter, private OpAsmPrinter { class OperationPrinter : public ModulePrinter, private OpAsmPrinter {

public: public:

explicit OperationPrinter(raw_ostream &os, OpPrintingFlags flags, explicit OperationPrinter(raw_ostream &os, OpPrintingFlags flags,

rriddleUnsubmitted

Done

We discussed a little offline, but I think it would be better to just remove the constraint that custom op printers print the operation name at the beginning. I don't see a real reason for them to do that, and it would also match the behavior of the parser (which doesn't parse the operation name, for likely obvious reasons).

rriddle: We discussed a little offline, but I think it would be better to just remove the constraint…

mehdi_aminiAuthorUnsubmitted

Done

Yeah I looked into it, one wrinkle though is that something like Dialect::printOperation can fail and we fallback to the generic operation printer.
I'm not sure how to handle this?

mehdi_amini: Yeah I looked into it, one wrinkle though is that something like `Dialect::printOperation` can…

rriddleUnsubmitted

Done

Hmmm. Can we change Dialect::printOperation to act more like getParseOperationHook? That way we would know if it would succeed before calling it.

rriddle: Hmmm. Can we change Dialect::printOperation to act more like getParseOperationHook? That way we…

mehdi_aminiAuthorUnsubmitted

Done

(I'll look into this later)

mehdi_amini: (I'll look into this later)

lattnerUnsubmitted

Done

We discussed a little offline, but I think it would be better to just remove the constraint that custom op printers print the operation name at the beginning. I don't see a real reason for them to do that, and it would also match the behavior of the parser (which doesn't parse the operation name, for likely obvious reasons).

I agree completely, the stream approach here is too-clever. It is also a pain (and pointless) to print the op name anyway, it would be far better to have the asmprinter do this by default.

lattner: > We discussed a little offline, but I think it would be better to just remove the constraint…

AsmStateImpl &state) AsmStateImpl &state)

: ModulePrinter(os, flags, &state) {} : ModulePrinter(os, flags, &state) {}

/// Print the given top-level operation. /// Print the given top-level operation.

void printTopLevelOperation(Operation *op); void printTopLevelOperation(Operation *op);

/// Print the given operation with its indent and location. /// Print the given operation with its indent and location.

bondhugulaUnsubmitted

Done

Nit: reflow to use width.

bondhugula: Nit: reflow to use width.

void print(Operation *op); void print(Operation *op);

/// Print the bare location, not including indentation/location/etc. /// Print the bare location, not including indentation/location/etc.

void printOperation(Operation *op); void printOperation(Operation *op);

/// Print the given operation in the generic form. /// Print the given operation in the generic form.

void printGenericOp(Operation *op, bool printOpName) override; void printGenericOp(Operation *op, bool printOpName) override;

/// Print the name of the given block. /// Print the name of the given block.

void printBlockName(Block *block); void printBlockName(Block *block);

ftynseUnsubmitted

Done

void setDialectFilter(StringRef dialect) {

- // Setup a buffer that is the size of the filter: this guarantee that the

- // call to write_impl() is at least as large as the filter and allow to

- // decide if it matches or not. An previously existing buffer is flushed

+ // Setup a buffer that is the size of the filter: this guarantees that the

+ // call to write_impl() is at least as large as the filter and allows to

+ // decide if it matches or not. A previously existing buffer is flushed

// implicitly and processed here, as we intend to match only whatever is

// written to the stream *after* the filter is set.

flush();

ftynse:

/// Print the given block. If 'printBlockArgs' is false, the arguments of the /// Print the given block. If 'printBlockArgs' is false, the arguments of the

/// block are not printed. If 'printBlockTerminator' is false, the terminator /// block are not printed. If 'printBlockTerminator' is false, the terminator

/// operation of the block is not printed. /// operation of the block is not printed.

void print(Block *block, bool printBlockArgs = true, void print(Block *block, bool printBlockArgs = true,

bool printBlockTerminator = true); bool printBlockTerminator = true);

/// Print the ID of the given value, optionally with its result number. /// Print the ID of the given value, optionally with its result number.

void printValueID(Value value, bool printResultNo = true, void printValueID(Value value, bool printResultNo = true,

raw_ostream *streamOverride = nullptr) const; raw_ostream *streamOverride = nullptr) const;

//===--------------------------------------------------------------------===// //===--------------------------------------------------------------------===//

// OpAsmPrinter methods // OpAsmPrinter methods

//===--------------------------------------------------------------------===// //===--------------------------------------------------------------------===//

/// Return the current stream of the printer. /// Return the current stream of the printer.

raw_ostream &getStream() const override { return os; } raw_ostream &getStream() const override { return os; }

/// Print a newline and indent the printer to the start of the current /// Print a newline and indent the printer to the start of the current

/// operation. /// operation.

void printNewline() override { void printNewline() override {

ftynseUnsubmitted

Done

I can't match this comment with the code below.

ftynse: I can't match this comment with the code below.

mehdi_aminiAuthorUnsubmitted

Done

Ah, this is some explanation on the correctness of the code, I'll try to reformulate.

mehdi_amini: Ah, this is some explanation on the correctness of the code, I'll try to reformulate.

os << newLine; os << newLine;

os.indent(currentIndent); os.indent(currentIndent);

} }

/// Print the given type. /// Print the given type.

void printType(Type type) override { ModulePrinter::printType(type); } void printType(Type type) override { ModulePrinter::printType(type); }

/// Print the given attribute. /// Print the given attribute.

void printAttribute(Attribute attr) override { void printAttribute(Attribute attr) override {

ModulePrinter::printAttribute(attr); ModulePrinter::printAttribute(attr);

} }

/// Print the given attribute without its type. The corresponding parser must /// Print the given attribute without its type. The corresponding parser must

/// provide a valid type for the attribute. /// provide a valid type for the attribute.

void printAttributeWithoutType(Attribute attr) override { void printAttributeWithoutType(Attribute attr) override {

bondhugulaUnsubmitted

Not Done

Camel case?

bondhugula: Camel case?

mehdi_aminiAuthorUnsubmitted

Done

I am overriding the parent class from LLVM, can't do?

mehdi_amini: I am overriding the parent class from LLVM, can't do?

ModulePrinter::printAttribute(attr, AttrTypeElision::Must); ModulePrinter::printAttribute(attr, AttrTypeElision::Must);

} }

/// Print a block argument in the usual format of: /// Print a block argument in the usual format of:

/// %ssaName : type {attr1=42} loc("here") /// %ssaName : type {attr1=42} loc("here")

/// where location printing is controlled by the standard internal option. /// where location printing is controlled by the standard internal option.

/// You may pass omitType=true to not print a type, and pass an empty /// You may pass omitType=true to not print a type, and pass an empty

/// attribute list if you don't care for attributes. /// attribute list if you don't care for attributes.

▲ Show 20 Lines • Show All 50 Lines • ▼ Show 20 Lines void printAffineExprOfSSAIds(AffineExpr expr, ValueRange dimOperands,

ValueRange symOperands) override; ValueRange symOperands) override;

/// Print the given string as a symbol reference. /// Print the given string as a symbol reference.

void printSymbolName(StringRef symbolRef) override { void printSymbolName(StringRef symbolRef) override {

::printSymbolReference(symbolRef, os); ::printSymbolReference(symbolRef, os);

} }

private: private:

// Contains the stack of default dialects to use when printing regions.

rriddleUnsubmitted

Done

private:

- // Contains the stack of default dialect to use when printing regions.

+ // Contains the stack of default dialects to use when printing regions.

// A new dialect is pushed to the stack before parsing regions nested under an

rriddle:

// A new dialect is pushed to the stack before parsing regions nested under an

bondhugulaUnsubmitted

Done

get -> gets / is

bondhugula: get -> gets / is

// operation implementing `OpAsmOpInterface`, and popped when done. At the

// top-level we start with "builtin" as the default, so that the top-level

// `module` operation prints as-is.

SmallVector<StringRef> defaultDialectStack{"builtin"};

rriddleUnsubmitted

Done

Do we need this to be std::string? I would assume that we could enforce StringRef here (getDefaultDialect also returns StringRef).

rriddle: Do we need this to be std::string? I would assume that we could enforce StringRef here…

rriddleUnsubmitted

Done

I think I missed it, but how is this parser stack different from the one in the ParserState?

rriddle: I think I missed it, but how is this parser stack different from the one in the ParserState?

mehdi_aminiAuthorUnsubmitted

Done

This is a printer stack, and the other is the parser one? They are equivalent otherwise.
Maybe I missed what you were asking?

mehdi_amini: This is a printer stack, and the other is the parser one? They are equivalent otherwise. Maybe…

rriddleUnsubmitted

Done

Right right, missed that one was printer/one was parser.

rriddle: Right right, missed that one was printer/one was parser.

/// The number of spaces used for indenting nested operations. /// The number of spaces used for indenting nested operations.

const static unsigned indentWidth = 2; const static unsigned indentWidth = 2;

// This is the current indentation level for nested structures. // This is the current indentation level for nested structures.

unsigned currentIndent = 0; unsigned currentIndent = 0;

}; };

} // end anonymous namespace } // end anonymous namespace

▲ Show 20 Lines • Show All 63 Lines • ▼ Show 20 Lines if (size_t numResults = op->getNumResults()) {

os << " = "; os << " = ";

} }

// If requested, always print the generic form. // If requested, always print the generic form.

if (!printerFlags.shouldPrintGenericOpForm()) { if (!printerFlags.shouldPrintGenericOpForm()) {

// Check to see if this is a known operation. If so, use the registered // Check to see if this is a known operation. If so, use the registered

// custom printer hook. // custom printer hook.

if (auto *opInfo = op->getAbstractOperation()) { if (auto *opInfo = op->getAbstractOperation()) {

opInfo->printAssembly(op, *this); opInfo->printAssembly(op, *this, defaultDialectStack.back());

return; return;

} }

// Otherwise try to dispatch to the dialect, if available. // Otherwise try to dispatch to the dialect, if available.

if (Dialect *dialect = op->getDialect()) { if (Dialect *dialect = op->getDialect()) {

if (auto opPrinter = dialect->getOperationPrinter(op)) { if (auto opPrinter = dialect->getOperationPrinter(op)) {

// Print the op name first. // Print the op name first.

StringRef name = op->getName().getStringRef(); StringRef name = op->getName().getStringRef();

name.consume_front((defaultDialectStack.back() + ".").str());

printEscapedString(name, os); printEscapedString(name, os);

rriddleUnsubmitted

Done

StringRef name = op->getName().getStringRef();

- if (name.startswith((defaultDialectStack.back() + ".").str()))

- name = name.drop_front(defaultDialectStack.back().size() + 1);

+ name.consume_front(defaultDialectStack.back() + ".").str());

printEscapedString(name, os);

rriddle: ?

// Print the rest of the op now. // Print the rest of the op now.

opPrinter(op, *this); opPrinter(op, *this);

return; return;

} }

// Otherwise print with the generic assembly form. // Otherwise print with the generic assembly form.

▲ Show 20 Lines • Show All 129 Lines • ▼ Show 20 Lines void OperationPrinter::printSuccessorAndUseList(Block *successor,

os << ')'; os << ')';

} }

void OperationPrinter::printRegion(Region &region, bool printEntryBlockArgs, void OperationPrinter::printRegion(Region &region, bool printEntryBlockArgs,

bool printBlockTerminators, bool printBlockTerminators,

bool printEmptyBlock) { bool printEmptyBlock) {

os << " {" << newLine; os << " {" << newLine;

if (!region.empty()) { if (!region.empty()) {

auto restoreDefaultDialect =

llvm::make_scope_exit([&]() { defaultDialectStack.pop_back(); });

if (auto iface = dyn_cast<OpAsmOpInterface>(region.getParentOp()))

defaultDialectStack.push_back(iface.getDefaultDialect());

else

defaultDialectStack.push_back("");

auto *entryBlock = &region.front(); auto *entryBlock = &region.front();

// Force printing the block header if printEmptyBlock is set and the block // Force printing the block header if printEmptyBlock is set and the block

// is empty or if printEntryBlockArgs is set and there are arguments to // is empty or if printEntryBlockArgs is set and there are arguments to

rriddleUnsubmitted

Done

You could also use llvm::make_scope_exit instead.

rriddle: You could also use llvm::make_scope_exit instead.

mehdi_aminiAuthorUnsubmitted

Done

Grahh: believe me I searched for this utility and couldn't find it and concluded that my memory of its existence was bogus...

mehdi_amini: Grahh: believe me I searched for this utility and couldn't find it and concluded that my memory…

// print. // print.

bool shouldAlwaysPrintBlockHeader = bool shouldAlwaysPrintBlockHeader =

(printEmptyBlock && entryBlock->empty()) || (printEmptyBlock && entryBlock->empty()) ||

(printEntryBlockArgs && entryBlock->getNumArguments() != 0); (printEntryBlockArgs && entryBlock->getNumArguments() != 0);

rriddleUnsubmitted

Done

Shouldn't we push an empty string here instead?

rriddle: Shouldn't we push an empty string here instead?

print(entryBlock, shouldAlwaysPrintBlockHeader, printBlockTerminators); print(entryBlock, shouldAlwaysPrintBlockHeader, printBlockTerminators);

for (auto &b : llvm::drop_begin(region.getBlocks(), 1)) for (auto &b : llvm::drop_begin(region.getBlocks(), 1))

print(&b); print(&b);

} }

os.indent(currentIndent) << "}"; os.indent(currentIndent) << "}";

} }

void OperationPrinter::printAffineMapOfSSAIds(AffineMapAttr mapAttr, void OperationPrinter::printAffineMapOfSSAIds(AffineMapAttr mapAttr,

▲ Show 20 Lines • Show All 190 Lines • Show Last 20 Lines

mlir/lib/IR/Operation.cpp

	Show First 20 Lines • Show All 637 Lines • ▼ Show 20 Lines
	// The fallback for the parser is to reject the custom assembly form.			// The fallback for the parser is to reject the custom assembly form.
	ParseResult OpState::parse(OpAsmParser &parser, OperationState &result) {			ParseResult OpState::parse(OpAsmParser &parser, OperationState &result) {
	return parser.emitError(parser.getNameLoc(), "has no custom assembly form");			return parser.emitError(parser.getNameLoc(), "has no custom assembly form");
	}			}

	// The fallback for the printer is to print in the generic assembly form.			// The fallback for the printer is to print in the generic assembly form.
	void OpState::print(Operation *op, OpAsmPrinter &p) { p.printGenericOp(op); }			void OpState::print(Operation *op, OpAsmPrinter &p) { p.printGenericOp(op); }
	// The fallback for the printer is to print in the generic assembly form.			// The fallback for the printer is to print in the generic assembly form.
	void OpState::printOpName(Operation *op, OpAsmPrinter &p) {			void OpState::printOpName(Operation *op, OpAsmPrinter &p,
				StringRef defaultDialect) {
	StringRef name = op->getName().getStringRef();			StringRef name = op->getName().getStringRef();
	if (name.startswith("std."))			if (name.startswith((defaultDialect + ".").str()))
				name = name.drop_front(defaultDialect.size() + 1);
				// TODO: remove this special case.
				rriddleUnsubmitted Done Reply Inline Actions Can we add a TODO/FIXME to kill this line? rriddle: Can we add a TODO/FIXME to kill this line?
				else if (name.startswith("std."))
	name = name.drop_front(4);			name = name.drop_front(4);
	p.getStream() << name;			p.getStream() << name;
	}			}

	/// Emit an error about fatal conditions with this operation, reporting up to			/// Emit an error about fatal conditions with this operation, reporting up to
	/// any diagnostic handlers that may be listening.			/// any diagnostic handlers that may be listening.
	InFlightDiagnostic OpState::emitError(const Twine &message) {			InFlightDiagnostic OpState::emitError(const Twine &message) {
	return getOperation()->emitError(message);			return getOperation()->emitError(message);
	▲ Show 20 Lines • Show All 686 Lines • Show Last 20 Lines

mlir/lib/Parser/Parser.cpp

Show All 12 Lines

#include "Parser.h" #include "Parser.h"

#include "mlir/IR/AffineMap.h" #include "mlir/IR/AffineMap.h"

#include "mlir/IR/BuiltinOps.h" #include "mlir/IR/BuiltinOps.h"

#include "mlir/IR/Dialect.h" #include "mlir/IR/Dialect.h"

#include "mlir/IR/Verifier.h" #include "mlir/IR/Verifier.h"

#include "mlir/Parser.h" #include "mlir/Parser.h"

#include "mlir/Parser/AsmParserState.h" #include "mlir/Parser/AsmParserState.h"

#include "llvm/ADT/DenseMap.h" #include "llvm/ADT/DenseMap.h"

#include "llvm/ADT/ScopeExit.h"

#include "llvm/ADT/StringSet.h" #include "llvm/ADT/StringSet.h"

#include "llvm/ADT/bit.h" #include "llvm/ADT/bit.h"

#include "llvm/Support/PrettyStackTrace.h" #include "llvm/Support/PrettyStackTrace.h"

#include "llvm/Support/SourceMgr.h" #include "llvm/Support/SourceMgr.h"

#include <algorithm> #include <algorithm>

using namespace mlir; using namespace mlir;

using namespace mlir::detail; using namespace mlir::detail;

▲ Show 20 Lines • Show All 1,808 Lines • ▼ Show 20 Lines private:

/// A flag that indicates if any errors were emitted during parsing. /// A flag that indicates if any errors were emitted during parsing.

bool emittedError = false; bool emittedError = false;

}; };

} // end anonymous namespace. } // end anonymous namespace.

Operation * Operation *

OperationParser::parseCustomOperation(ArrayRef<ResultRecord> resultIDs) { OperationParser::parseCustomOperation(ArrayRef<ResultRecord> resultIDs) {

llvm::SMLoc opLoc = getToken().getLoc(); llvm::SMLoc opLoc = getToken().getLoc();

StringRef opName = getTokenSpelling(); std::string opName = getTokenSpelling().str();

rriddleUnsubmitted

Not Done

Would it be better to keep StringRef here, but have another variable that is OperationName? That would remove the need to build a std::string, and the OperationName var could be passed in later when building the OperationState.

rriddle: Would it be better to keep StringRef here, but have another variable that is OperationName?

mehdi_aminiAuthorUnsubmitted

Done

I tried but it's convoluted: OperationName does not have a default constructor.

mehdi_amini: I tried but it's convoluted: OperationName does not have a default constructor.

auto *opDefinition = AbstractOperation::lookup(opName, getContext()); auto *opDefinition = AbstractOperation::lookup(opName, getContext());

StringRef defaultDialect = getState().defaultDialectStack.back();

Dialect *dialect = nullptr; Dialect *dialect = nullptr;

if (opDefinition) { if (opDefinition) {

dialect = &opDefinition->dialect; dialect = &opDefinition->dialect;

} else { } else {

if (opName.contains('.')) { if (StringRef(opName).contains('.')) {

// This op has a dialect, we try to check if we can register it in the // This op has a dialect, we try to check if we can register it in the

// context on the fly. // context on the fly.

StringRef dialectName = opName.split('.').first; StringRef dialectName = StringRef(opName).split('.').first;

dialect = getContext()->getLoadedDialect(dialectName); dialect = getContext()->getLoadedDialect(dialectName);

if (!dialect && (dialect = getContext()->getOrLoadDialect(dialectName))) if (!dialect && (dialect = getContext()->getOrLoadDialect(dialectName)))

opDefinition = AbstractOperation::lookup(opName, getContext()); opDefinition = AbstractOperation::lookup(opName, getContext());

} else { } else {

// If the operation name has no namespace prefix we treat it as a builtin // If the operation name has no namespace prefix we lookup the current

// or standard operation and prefix it with "builtin" or "std". // default dialect (set through OpAsmOpInterface).

// TODO: Remove the special casing here. opDefinition = AbstractOperation::lookup(

opDefinition = AbstractOperation::lookup(Twine("builtin." + opName).str(), Twine(defaultDialect + "." + opName).str(), getContext());

getContext());

if (!opDefinition && getContext()->getOrLoadDialect("std")) { if (!opDefinition && getContext()->getOrLoadDialect("std")) {

opDefinition = AbstractOperation::lookup(Twine("std." + opName).str(), opDefinition = AbstractOperation::lookup(Twine("std." + opName).str(),

getContext()); getContext());

} }

if (opDefinition) if (opDefinition) {

opName = opDefinition->name.strref(); dialect = &opDefinition->dialect;

opName = opDefinition->name.str();

} else if (!defaultDialect.empty()) {

dialect = getContext()->getOrLoadDialect(defaultDialect);

opName = (defaultDialect + "." + opName).str();

}

rriddleUnsubmitted

Done

dialect = &opDefinition->dialect;

- opName = opDefinition->name.strref();

+ opName = opDefinition->name.str();

} else {

Shouldn't this one be .str()?

rriddle: Shouldn't this one be `.str()`?

} }

// This is the actual hook for the custom op parsing, usually implemented by // This is the actual hook for the custom op parsing, usually implemented by

// the op itself (`Op::parse()`). We retrieve it either from the // the op itself (`Op::parse()`). We retrieve it either from the

// AbstractOperation or from the Dialect. // AbstractOperation or from the Dialect.

function_ref<ParseResult(OpAsmParser &, OperationState &)> parseAssemblyFn; function_ref<ParseResult(OpAsmParser &, OperationState &)> parseAssemblyFn;

bool isIsolatedFromAbove = false; bool isIsolatedFromAbove = false;

defaultDialect = "";

if (opDefinition) { if (opDefinition) {

parseAssemblyFn = opDefinition->getParseAssemblyFn(); parseAssemblyFn = opDefinition->getParseAssemblyFn();

isIsolatedFromAbove = isIsolatedFromAbove =

opDefinition->hasTrait<OpTrait::IsIsolatedFromAbove>(); opDefinition->hasTrait<OpTrait::IsIsolatedFromAbove>();

auto *iface = opDefinition->getInterface<OpAsmOpInterface>();

if (iface && !iface->getDefaultDialect().empty())

defaultDialect = iface->getDefaultDialect();

rriddleUnsubmitted

Done

Same comment here: why do we use builtin as the fallback here? I would expect an empty default dialect to mean "no ellision".

rriddle: Same comment here: why do we use builtin as the fallback here? I would expect an empty default…

} else { } else {

Optional<Dialect::ParseOpHook> dialectHook; Optional<Dialect::ParseOpHook> dialectHook;

if (dialect) if (dialect)

dialectHook = dialect->getParseOperationHook(opName); dialectHook = dialect->getParseOperationHook(opName);

if (!dialectHook.hasValue()) { if (!dialectHook.hasValue()) {

emitError(opLoc) << "custom op '" << opName << "' is unknown"; emitError(opLoc) << "custom op '" << opName << "' is unknown";

return nullptr; return nullptr;

} }

parseAssemblyFn = *dialectHook; parseAssemblyFn = *dialectHook;

} }

getState().defaultDialectStack.push_back(defaultDialect);

auto restoreDefaultDialect = llvm::make_scope_exit(

[&]() { getState().defaultDialectStack.pop_back(); });

consumeToken(); consumeToken();

// If the custom op parser crashes, produce some indication to help // If the custom op parser crashes, produce some indication to help

// debugging. // debugging.

rriddleUnsubmitted

Done

You can use llvm::make_scope_exit here instead.

rriddle: You can use llvm::make_scope_exit here instead.

std::string opNameStr = opName.str();

llvm::PrettyStackTraceFormat fmt("MLIR Parser: custom op parser '%s'", llvm::PrettyStackTraceFormat fmt("MLIR Parser: custom op parser '%s'",

opNameStr.c_str()); opName.c_str());

// Get location information for the operation. // Get location information for the operation.

auto srcLocation = getEncodedSourceLocation(opLoc); auto srcLocation = getEncodedSourceLocation(opLoc);

OperationState opState(srcLocation, opName); OperationState opState(srcLocation, opName);

// If we are populating the parser state, start a new operation definition. // If we are populating the parser state, start a new operation definition.

if (state.asmState) if (state.asmState)

state.asmState->startOperationDefinition(opState.name); state.asmState->startOperationDefinition(opState.name);

▲ Show 20 Lines • Show All 513 Lines • Show Last 20 Lines

mlir/lib/Parser/ParserState.h

Show First 20 Lines • Show All 76 Lines • ▼ Show 20 Lines	struct ParserState {
SymbolState &symbols;		SymbolState &symbols;

/// The depth of this parser in the nested parsing stack.		/// The depth of this parser in the nested parsing stack.
size_t parserDepth;		size_t parserDepth;

/// An optional pointer to a struct containing high level parser state to be		/// An optional pointer to a struct containing high level parser state to be
/// populated during parsing.		/// populated during parsing.
AsmParserState *asmState;		AsmParserState *asmState;

		// Contains the stack of default dialect to use when parsing regions.
		// A new dialect get pushed to the stack before parsing regions nested
		// under an operation implementing `OpAsmOpInterface`, and
		// popped when done. At the top-level we start with "builtin" as the
		// default, so that the top-level `module` operation parses as-is.
		SmallVector<StringRef> defaultDialectStack{"builtin"};
		rriddleUnsubmitted Done Reply Inline Actions Same comment here, can we use StringRef instead of std::string? rriddle: Same comment here, can we use StringRef instead of std::string?
};		};

} // end namespace detail		} // end namespace detail
} // end namespace mlir		} // end namespace mlir

#endif // MLIR_LIB_PARSER_PARSERSTATE_H		#endif // MLIR_LIB_PARSER_PARSERSTATE_H

mlir/test/Analysis/test-shape-fn-report.mlir

Show All 9 Lines	func @tanh(%arg: tensor<10x20xf32>) -> tensor<10x20xf32>
// expected-remark@+1 {{associated shape function: same_result_shape}}		// expected-remark@+1 {{associated shape function: same_result_shape}}
%1 = "test.same_operand_result_type"(%0) : (tensor<10x20xf32>) -> tensor<10x20xf32>		%1 = "test.same_operand_result_type"(%0) : (tensor<10x20xf32>) -> tensor<10x20xf32>
return %1 : tensor<10x20xf32>		return %1 : tensor<10x20xf32>
}		}

// The shape function library with some local functions.		// The shape function library with some local functions.
shape.function_library @shape_lib {		shape.function_library @shape_lib {
// Test shape function that returns the shape of input arg as result shape.		// Test shape function that returns the shape of input arg as result shape.
func @same_result_shape(%arg: !shape.value_shape) -> !shape.shape {		builtin.func @same_result_shape(%arg: !shape.value_shape) -> !shape.shape {
%0 = shape.shape_of %arg : !shape.value_shape -> !shape.shape		%0 = shape.shape_of %arg : !shape.value_shape -> !shape.shape
return %0 : !shape.shape		return %0 : !shape.shape
}		}
} mapping {		} mapping {
test.same_operand_result_type = @same_result_shape		test.same_operand_result_type = @same_result_shape
}		}

}		}

mlir/test/Conversion/GPUToNVVM/gpu-to-nvvm.mlir

// RUN: mlir-opt %s -convert-gpu-to-nvvm -split-input-file \| FileCheck %s		// RUN: mlir-opt %s -convert-gpu-to-nvvm -split-input-file \| FileCheck %s
// RUN: mlir-opt %s -convert-gpu-to-nvvm='index-bitwidth=32' -split-input-file \| FileCheck --check-prefix=CHECK32 %s		// RUN: mlir-opt %s -convert-gpu-to-nvvm='index-bitwidth=32' -split-input-file \| FileCheck --check-prefix=CHECK32 %s

gpu.module @test_module {		gpu.module @test_module {
// CHECK-LABEL: func @gpu_index_ops()		// CHECK-LABEL: func @gpu_index_ops()
// CHECK32-LABEL: func @gpu_index_ops()		// CHECK32-LABEL: func @gpu_index_ops()
func @gpu_index_ops()		builtin.func @gpu_index_ops()
-> (index, index, index, index, index, index,		-> (index, index, index, index, index, index,
index, index, index, index, index, index) {		index, index, index, index, index, index) {
// CHECK32-NOT: = llvm.sext %{{.*}} : i32 to i64		// CHECK32-NOT: = llvm.sext %{{.*}} : i32 to i64

// CHECK: = nvvm.read.ptx.sreg.tid.x : i32		// CHECK: = nvvm.read.ptx.sreg.tid.x : i32
// CHECK: = llvm.sext %{{.*}} : i32 to i64		// CHECK: = llvm.sext %{{.*}} : i32 to i64
%tIdX = "gpu.thread_id"() {dimension = "x"} : () -> (index)		%tIdX = "gpu.thread_id"() {dimension = "x"} : () -> (index)
// CHECK: = nvvm.read.ptx.sreg.tid.y : i32		// CHECK: = nvvm.read.ptx.sreg.tid.y : i32
Show All 40 Lines	gpu.module @test_module {
}		}
}		}

// -----		// -----

gpu.module @test_module {		gpu.module @test_module {
// CHECK-LABEL: func @gpu_index_comp		// CHECK-LABEL: func @gpu_index_comp
// CHECK32-LABEL: func @gpu_index_comp		// CHECK32-LABEL: func @gpu_index_comp
func @gpu_index_comp(%idx : index) -> index {		builtin.func @gpu_index_comp(%idx : index) -> index {
// CHECK: = llvm.add %{{.}}, %{{.}} : i64		// CHECK: = llvm.add %{{.}}, %{{.}} : i64
// CHECK32: = llvm.add %{{.}}, %{{.}} : i32		// CHECK32: = llvm.add %{{.}}, %{{.}} : i32
%0 = addi %idx, %idx : index		%0 = addi %idx, %idx : index
// CHECK: llvm.return %{{.*}} : i64		// CHECK: llvm.return %{{.*}} : i64
// CHECK32: llvm.return %{{.*}} : i32		// CHECK32: llvm.return %{{.*}} : i32
std.return %0 : index		std.return %0 : index
}		}
}		}
Show All 31 Lines	gpu.func @gpu_all_reduce_region() {
gpu.return		gpu.return
}		}
}		}

// -----		// -----

gpu.module @test_module {		gpu.module @test_module {
// CHECK-LABEL: func @gpu_shuffle()		// CHECK-LABEL: func @gpu_shuffle()
func @gpu_shuffle() -> (f32) {		builtin.func @gpu_shuffle() -> (f32) {
// CHECK: %[[#VALUE:]] = llvm.mlir.constant(1.000000e+00 : f32) : f32		// CHECK: %[[#VALUE:]] = llvm.mlir.constant(1.000000e+00 : f32) : f32
%arg0 = constant 1.0 : f32		%arg0 = constant 1.0 : f32
// CHECK: %[[#OFFSET:]] = llvm.mlir.constant(4 : i32) : i32		// CHECK: %[[#OFFSET:]] = llvm.mlir.constant(4 : i32) : i32
%arg1 = constant 4 : i32		%arg1 = constant 4 : i32
// CHECK: %[[#WIDTH:]] = llvm.mlir.constant(23 : i32) : i32		// CHECK: %[[#WIDTH:]] = llvm.mlir.constant(23 : i32) : i32
%arg2 = constant 23 : i32		%arg2 = constant 23 : i32
// CHECK: %[[#ONE:]] = llvm.mlir.constant(1 : i32) : i32		// CHECK: %[[#ONE:]] = llvm.mlir.constant(1 : i32) : i32
// CHECK: %[[#SHL:]] = llvm.shl %[[#ONE]], %[[#WIDTH]] : i32		// CHECK: %[[#SHL:]] = llvm.shl %[[#ONE]], %[[#WIDTH]] : i32
// CHECK: %[[#MASK:]] = llvm.sub %[[#SHL]], %[[#ONE]] : i32		// CHECK: %[[#MASK:]] = llvm.sub %[[#SHL]], %[[#ONE]] : i32
// CHECK: %[[#CLAMP:]] = llvm.sub %[[#WIDTH]], %[[#ONE]] : i32		// CHECK: %[[#CLAMP:]] = llvm.sub %[[#WIDTH]], %[[#ONE]] : i32
// CHECK: %[[#SHFL:]] = nvvm.shfl.sync.bfly %[[#MASK]], %[[#VALUE]], %[[#OFFSET]], %[[#CLAMP]] : !llvm.struct<(f32, i1)>		// CHECK: %[[#SHFL:]] = nvvm.shfl.sync.bfly %[[#MASK]], %[[#VALUE]], %[[#OFFSET]], %[[#CLAMP]] : !llvm.struct<(f32, i1)>
// CHECK: llvm.extractvalue %[[#SHFL]][0 : index] : !llvm.struct<(f32, i1)>		// CHECK: llvm.extractvalue %[[#SHFL]][0 : index] : !llvm.struct<(f32, i1)>
// CHECK: llvm.extractvalue %[[#SHFL]][1 : index] : !llvm.struct<(f32, i1)>		// CHECK: llvm.extractvalue %[[#SHFL]][1 : index] : !llvm.struct<(f32, i1)>
%shfl, %pred = "gpu.shuffle"(%arg0, %arg1, %arg2) { mode = "xor" } : (f32, i32, i32) -> (f32, i1)		%shfl, %pred = "gpu.shuffle"(%arg0, %arg1, %arg2) { mode = "xor" } : (f32, i32, i32) -> (f32, i1)

std.return %shfl : f32		std.return %shfl : f32
}		}
}		}

// -----		// -----

gpu.module @test_module {		gpu.module @test_module {
// CHECK-LABEL: func @gpu_sync()		// CHECK-LABEL: func @gpu_sync()
func @gpu_sync() {		builtin.func @gpu_sync() {
// CHECK: nvvm.barrier0		// CHECK: nvvm.barrier0
gpu.barrier		gpu.barrier
std.return		std.return
}		}
}		}

// -----		// -----

gpu.module @test_module {		gpu.module @test_module {
// CHECK: llvm.func @__nv_fabsf(f32) -> f32		// CHECK: llvm.func @__nv_fabsf(f32) -> f32
// CHECK: llvm.func @__nv_fabs(f64) -> f64		// CHECK: llvm.func @__nv_fabs(f64) -> f64
// CHECK-LABEL: func @gpu_fabs		// CHECK-LABEL: func @gpu_fabs
func @gpu_fabs(%arg_f32 : f32, %arg_f64 : f64) -> (f32, f64) {		builtin.func @gpu_fabs(%arg_f32 : f32, %arg_f64 : f64) -> (f32, f64) {
%result32 = std.absf %arg_f32 : f32		%result32 = std.absf %arg_f32 : f32
// CHECK: llvm.call @__nv_fabsf(%{{.*}}) : (f32) -> f32		// CHECK: llvm.call @__nv_fabsf(%{{.*}}) : (f32) -> f32
%result64 = std.absf %arg_f64 : f64		%result64 = std.absf %arg_f64 : f64
// CHECK: llvm.call @__nv_fabs(%{{.*}}) : (f64) -> f64		// CHECK: llvm.call @__nv_fabs(%{{.*}}) : (f64) -> f64
std.return %result32, %result64 : f32, f64		std.return %result32, %result64 : f32, f64
}		}
}		}

// -----		// -----

gpu.module @test_module {		gpu.module @test_module {
// CHECK: llvm.func @__nv_ceilf(f32) -> f32		// CHECK: llvm.func @__nv_ceilf(f32) -> f32
// CHECK: llvm.func @__nv_ceil(f64) -> f64		// CHECK: llvm.func @__nv_ceil(f64) -> f64
// CHECK-LABEL: func @gpu_ceil		// CHECK-LABEL: func @gpu_ceil
func @gpu_ceil(%arg_f32 : f32, %arg_f64 : f64) -> (f32, f64) {		builtin.func @gpu_ceil(%arg_f32 : f32, %arg_f64 : f64) -> (f32, f64) {
%result32 = std.ceilf %arg_f32 : f32		%result32 = std.ceilf %arg_f32 : f32
// CHECK: llvm.call @__nv_ceilf(%{{.*}}) : (f32) -> f32		// CHECK: llvm.call @__nv_ceilf(%{{.*}}) : (f32) -> f32
%result64 = std.ceilf %arg_f64 : f64		%result64 = std.ceilf %arg_f64 : f64
// CHECK: llvm.call @__nv_ceil(%{{.*}}) : (f64) -> f64		// CHECK: llvm.call @__nv_ceil(%{{.*}}) : (f64) -> f64
std.return %result32, %result64 : f32, f64		std.return %result32, %result64 : f32, f64
}		}
}		}

// -----		// -----

gpu.module @test_module {		gpu.module @test_module {
// CHECK: llvm.func @__nv_floorf(f32) -> f32		// CHECK: llvm.func @__nv_floorf(f32) -> f32
// CHECK: llvm.func @__nv_floor(f64) -> f64		// CHECK: llvm.func @__nv_floor(f64) -> f64
// CHECK-LABEL: func @gpu_floor		// CHECK-LABEL: func @gpu_floor
func @gpu_floor(%arg_f32 : f32, %arg_f64 : f64) -> (f32, f64) {		builtin.func @gpu_floor(%arg_f32 : f32, %arg_f64 : f64) -> (f32, f64) {
%result32 = std.floorf %arg_f32 : f32		%result32 = std.floorf %arg_f32 : f32
// CHECK: llvm.call @__nv_floorf(%{{.*}}) : (f32) -> f32		// CHECK: llvm.call @__nv_floorf(%{{.*}}) : (f32) -> f32
%result64 = std.floorf %arg_f64 : f64		%result64 = std.floorf %arg_f64 : f64
// CHECK: llvm.call @__nv_floor(%{{.*}}) : (f64) -> f64		// CHECK: llvm.call @__nv_floor(%{{.*}}) : (f64) -> f64
std.return %result32, %result64 : f32, f64		std.return %result32, %result64 : f32, f64
}		}
}		}

// -----		// -----

gpu.module @test_module {		gpu.module @test_module {
// CHECK: llvm.func @__nv_cosf(f32) -> f32		// CHECK: llvm.func @__nv_cosf(f32) -> f32
// CHECK: llvm.func @__nv_cos(f64) -> f64		// CHECK: llvm.func @__nv_cos(f64) -> f64
// CHECK-LABEL: func @gpu_cos		// CHECK-LABEL: func @gpu_cos
func @gpu_cos(%arg_f32 : f32, %arg_f64 : f64) -> (f32, f64) {		builtin.func @gpu_cos(%arg_f32 : f32, %arg_f64 : f64) -> (f32, f64) {
%result32 = math.cos %arg_f32 : f32		%result32 = math.cos %arg_f32 : f32
// CHECK: llvm.call @__nv_cosf(%{{.*}}) : (f32) -> f32		// CHECK: llvm.call @__nv_cosf(%{{.*}}) : (f32) -> f32
%result64 = math.cos %arg_f64 : f64		%result64 = math.cos %arg_f64 : f64
// CHECK: llvm.call @__nv_cos(%{{.*}}) : (f64) -> f64		// CHECK: llvm.call @__nv_cos(%{{.*}}) : (f64) -> f64
std.return %result32, %result64 : f32, f64		std.return %result32, %result64 : f32, f64
}		}
}		}

// -----		// -----
gpu.module @test_module {		gpu.module @test_module {
// CHECK: llvm.func @__nv_expf(f32) -> f32		// CHECK: llvm.func @__nv_expf(f32) -> f32
// CHECK: llvm.func @__nv_exp(f64) -> f64		// CHECK: llvm.func @__nv_exp(f64) -> f64
// CHECK-LABEL: func @gpu_exp		// CHECK-LABEL: func @gpu_exp
func @gpu_exp(%arg_f32 : f32, %arg_f64 : f64) -> (f32, f64) {		builtin.func @gpu_exp(%arg_f32 : f32, %arg_f64 : f64) -> (f32, f64) {
%result32 = math.exp %arg_f32 : f32		%result32 = math.exp %arg_f32 : f32
// CHECK: llvm.call @__nv_expf(%{{.*}}) : (f32) -> f32		// CHECK: llvm.call @__nv_expf(%{{.*}}) : (f32) -> f32
%result64 = math.exp %arg_f64 : f64		%result64 = math.exp %arg_f64 : f64
// CHECK: llvm.call @__nv_exp(%{{.*}}) : (f64) -> f64		// CHECK: llvm.call @__nv_exp(%{{.*}}) : (f64) -> f64
std.return %result32, %result64 : f32, f64		std.return %result32, %result64 : f32, f64
}		}
}		}

// -----		// -----
gpu.module @test_module {		gpu.module @test_module {
// CHECK: llvm.func @__nv_exp2f(f32) -> f32		// CHECK: llvm.func @__nv_exp2f(f32) -> f32
// CHECK: llvm.func @__nv_exp2(f64) -> f64		// CHECK: llvm.func @__nv_exp2(f64) -> f64
// CHECK-LABEL: func @gpu_exp2		// CHECK-LABEL: func @gpu_exp2
func @gpu_exp2(%arg_f32 : f32, %arg_f64 : f64) -> (f32, f64) {		builtin.func @gpu_exp2(%arg_f32 : f32, %arg_f64 : f64) -> (f32, f64) {
%result32 = math.exp2 %arg_f32 : f32		%result32 = math.exp2 %arg_f32 : f32
// CHECK: llvm.call @__nv_exp2f(%{{.*}}) : (f32) -> f32		// CHECK: llvm.call @__nv_exp2f(%{{.*}}) : (f32) -> f32
%result64 = math.exp2 %arg_f64 : f64		%result64 = math.exp2 %arg_f64 : f64
// CHECK: llvm.call @__nv_exp2(%{{.*}}) : (f64) -> f64		// CHECK: llvm.call @__nv_exp2(%{{.*}}) : (f64) -> f64
std.return %result32, %result64 : f32, f64		std.return %result32, %result64 : f32, f64
}		}
}		}

// -----		// -----

gpu.module @test_module {		gpu.module @test_module {
// CHECK: llvm.func @__nv_logf(f32) -> f32		// CHECK: llvm.func @__nv_logf(f32) -> f32
// CHECK: llvm.func @__nv_log(f64) -> f64		// CHECK: llvm.func @__nv_log(f64) -> f64
// CHECK-LABEL: func @gpu_log		// CHECK-LABEL: func @gpu_log
func @gpu_log(%arg_f32 : f32, %arg_f64 : f64) -> (f32, f64) {		builtin.func @gpu_log(%arg_f32 : f32, %arg_f64 : f64) -> (f32, f64) {
%result32 = math.log %arg_f32 : f32		%result32 = math.log %arg_f32 : f32
// CHECK: llvm.call @__nv_logf(%{{.*}}) : (f32) -> f32		// CHECK: llvm.call @__nv_logf(%{{.*}}) : (f32) -> f32
%result64 = math.log %arg_f64 : f64		%result64 = math.log %arg_f64 : f64
// CHECK: llvm.call @__nv_log(%{{.*}}) : (f64) -> f64		// CHECK: llvm.call @__nv_log(%{{.*}}) : (f64) -> f64
std.return %result32, %result64 : f32, f64		std.return %result32, %result64 : f32, f64
}		}
}		}

// -----		// -----

gpu.module @test_module {		gpu.module @test_module {
// CHECK: llvm.func @__nv_log10f(f32) -> f32		// CHECK: llvm.func @__nv_log10f(f32) -> f32
// CHECK: llvm.func @__nv_log10(f64) -> f64		// CHECK: llvm.func @__nv_log10(f64) -> f64
// CHECK-LABEL: func @gpu_log10		// CHECK-LABEL: func @gpu_log10
func @gpu_log10(%arg_f32 : f32, %arg_f64 : f64) -> (f32, f64) {		builtin.func @gpu_log10(%arg_f32 : f32, %arg_f64 : f64) -> (f32, f64) {
%result32 = math.log10 %arg_f32 : f32		%result32 = math.log10 %arg_f32 : f32
// CHECK: llvm.call @__nv_log10f(%{{.*}}) : (f32) -> f32		// CHECK: llvm.call @__nv_log10f(%{{.*}}) : (f32) -> f32
%result64 = math.log10 %arg_f64 : f64		%result64 = math.log10 %arg_f64 : f64
// CHECK: llvm.call @__nv_log10(%{{.*}}) : (f64) -> f64		// CHECK: llvm.call @__nv_log10(%{{.*}}) : (f64) -> f64
std.return %result32, %result64 : f32, f64		std.return %result32, %result64 : f32, f64
}		}
}		}

// -----		// -----

gpu.module @test_module {		gpu.module @test_module {
// CHECK: llvm.func @__nv_log1pf(f32) -> f32		// CHECK: llvm.func @__nv_log1pf(f32) -> f32
// CHECK: llvm.func @__nv_log1p(f64) -> f64		// CHECK: llvm.func @__nv_log1p(f64) -> f64
// CHECK-LABEL: func @gpu_log1p		// CHECK-LABEL: func @gpu_log1p
func @gpu_log1p(%arg_f32 : f32, %arg_f64 : f64) -> (f32, f64) {		builtin.func @gpu_log1p(%arg_f32 : f32, %arg_f64 : f64) -> (f32, f64) {
%result32 = math.log1p %arg_f32 : f32		%result32 = math.log1p %arg_f32 : f32
// CHECK: llvm.call @__nv_log1pf(%{{.*}}) : (f32) -> f32		// CHECK: llvm.call @__nv_log1pf(%{{.*}}) : (f32) -> f32
%result64 = math.log1p %arg_f64 : f64		%result64 = math.log1p %arg_f64 : f64
// CHECK: llvm.call @__nv_log1p(%{{.*}}) : (f64) -> f64		// CHECK: llvm.call @__nv_log1p(%{{.*}}) : (f64) -> f64
std.return %result32, %result64 : f32, f64		std.return %result32, %result64 : f32, f64
}		}
}		}

// -----		// -----

gpu.module @test_module {		gpu.module @test_module {
// CHECK: llvm.func @__nv_log2f(f32) -> f32		// CHECK: llvm.func @__nv_log2f(f32) -> f32
// CHECK: llvm.func @__nv_log2(f64) -> f64		// CHECK: llvm.func @__nv_log2(f64) -> f64
// CHECK-LABEL: func @gpu_log2		// CHECK-LABEL: func @gpu_log2
func @gpu_log2(%arg_f32 : f32, %arg_f64 : f64) -> (f32, f64) {		builtin.func @gpu_log2(%arg_f32 : f32, %arg_f64 : f64) -> (f32, f64) {
%result32 = math.log2 %arg_f32 : f32		%result32 = math.log2 %arg_f32 : f32
// CHECK: llvm.call @__nv_log2f(%{{.*}}) : (f32) -> f32		// CHECK: llvm.call @__nv_log2f(%{{.*}}) : (f32) -> f32
%result64 = math.log2 %arg_f64 : f64		%result64 = math.log2 %arg_f64 : f64
// CHECK: llvm.call @__nv_log2(%{{.*}}) : (f64) -> f64		// CHECK: llvm.call @__nv_log2(%{{.*}}) : (f64) -> f64
std.return %result32, %result64 : f32, f64		std.return %result32, %result64 : f32, f64
}		}
}		}

// -----		// -----

gpu.module @test_module {		gpu.module @test_module {
// CHECK: llvm.func @__nv_sinf(f32) -> f32		// CHECK: llvm.func @__nv_sinf(f32) -> f32
// CHECK: llvm.func @__nv_sin(f64) -> f64		// CHECK: llvm.func @__nv_sin(f64) -> f64
// CHECK-LABEL: func @gpu_sin		// CHECK-LABEL: func @gpu_sin
func @gpu_sin(%arg_f32 : f32, %arg_f64 : f64) -> (f32, f64) {		builtin.func @gpu_sin(%arg_f32 : f32, %arg_f64 : f64) -> (f32, f64) {
%result32 = math.sin %arg_f32 : f32		%result32 = math.sin %arg_f32 : f32
// CHECK: llvm.call @__nv_sinf(%{{.*}}) : (f32) -> f32		// CHECK: llvm.call @__nv_sinf(%{{.*}}) : (f32) -> f32
%result64 = math.sin %arg_f64 : f64		%result64 = math.sin %arg_f64 : f64
// CHECK: llvm.call @__nv_sin(%{{.*}}) : (f64) -> f64		// CHECK: llvm.call @__nv_sin(%{{.*}}) : (f64) -> f64
std.return %result32, %result64 : f32, f64		std.return %result32, %result64 : f32, f64
}		}
}		}

// -----		// -----

gpu.module @test_module {		gpu.module @test_module {
// CHECK: llvm.func @__nv_tanhf(f32) -> f32		// CHECK: llvm.func @__nv_tanhf(f32) -> f32
// CHECK: llvm.func @__nv_tanh(f64) -> f64		// CHECK: llvm.func @__nv_tanh(f64) -> f64
// CHECK-LABEL: func @gpu_tanh		// CHECK-LABEL: func @gpu_tanh
func @gpu_tanh(%arg_f16 : f16, %arg_f32 : f32, %arg_f64 : f64) -> (f16, f32, f64) {		builtin.func @gpu_tanh(%arg_f16 : f16, %arg_f32 : f32, %arg_f64 : f64) -> (f16, f32, f64) {
%result16 = math.tanh %arg_f16 : f16		%result16 = math.tanh %arg_f16 : f16
// CHECK: llvm.fpext %{{.*}} : f16 to f32		// CHECK: llvm.fpext %{{.*}} : f16 to f32
// CHECK-NEXT: llvm.call @__nv_tanhf(%{{.*}}) : (f32) -> f32		// CHECK-NEXT: llvm.call @__nv_tanhf(%{{.*}}) : (f32) -> f32
// CHECK-NEXT: llvm.fptrunc %{{.*}} : f32 to f16		// CHECK-NEXT: llvm.fptrunc %{{.*}} : f32 to f16
%result32 = math.tanh %arg_f32 : f32		%result32 = math.tanh %arg_f32 : f32
// CHECK: llvm.call @__nv_tanhf(%{{.*}}) : (f32) -> f32		// CHECK: llvm.call @__nv_tanhf(%{{.*}}) : (f32) -> f32
%result64 = math.tanh %arg_f64 : f64		%result64 = math.tanh %arg_f64 : f64
// CHECK: llvm.call @__nv_tanh(%{{.*}}) : (f64) -> f64		// CHECK: llvm.call @__nv_tanh(%{{.*}}) : (f64) -> f64
std.return %result16, %result32, %result64 : f16, f32, f64		std.return %result16, %result32, %result64 : f16, f32, f64
}		}
}		}

// -----		// -----

gpu.module @test_module {		gpu.module @test_module {
// CHECK: llvm.func @__nv_rsqrtf(f32) -> f32		// CHECK: llvm.func @__nv_rsqrtf(f32) -> f32
// CHECK: llvm.func @__nv_rsqrt(f64) -> f64		// CHECK: llvm.func @__nv_rsqrt(f64) -> f64
// CHECK-LABEL: func @gpu_rsqrt		// CHECK-LABEL: func @gpu_rsqrt
func @gpu_rsqrt(%arg_f16 : f16, %arg_f32 : f32, %arg_f64 : f64)		builtin.func @gpu_rsqrt(%arg_f16 : f16, %arg_f32 : f32, %arg_f64 : f64)
-> (f16, f32, f64) {		-> (f16, f32, f64) {
%result16 = math.rsqrt %arg_f16 : f16		%result16 = math.rsqrt %arg_f16 : f16
// CHECK: llvm.fpext %{{.*}} : f16 to f32		// CHECK: llvm.fpext %{{.*}} : f16 to f32
// CHECK-NEXT: llvm.call @__nv_rsqrtf(%{{.*}}) : (f32) -> f32		// CHECK-NEXT: llvm.call @__nv_rsqrtf(%{{.*}}) : (f32) -> f32
// CHECK-NEXT: llvm.fptrunc %{{.*}} : f32 to f16		// CHECK-NEXT: llvm.fptrunc %{{.*}} : f32 to f16
%result32 = math.rsqrt %arg_f32 : f32		%result32 = math.rsqrt %arg_f32 : f32
// CHECK: llvm.call @__nv_rsqrtf(%{{.*}}) : (f32) -> f32		// CHECK: llvm.call @__nv_rsqrtf(%{{.*}}) : (f32) -> f32
%result64 = math.rsqrt %arg_f64 : f64		%result64 = math.rsqrt %arg_f64 : f64
// CHECK: llvm.call @__nv_rsqrt(%{{.*}}) : (f64) -> f64		// CHECK: llvm.call @__nv_rsqrt(%{{.*}}) : (f64) -> f64
std.return %result16, %result32, %result64 : f16, f32, f64		std.return %result16, %result32, %result64 : f16, f32, f64
}		}
}		}

// -----		// -----

gpu.module @test_module {		gpu.module @test_module {
// CHECK: llvm.func @__nv_sqrtf(f32) -> f32		// CHECK: llvm.func @__nv_sqrtf(f32) -> f32
// CHECK: llvm.func @__nv_sqrt(f64) -> f64		// CHECK: llvm.func @__nv_sqrt(f64) -> f64
// CHECK-LABEL: func @gpu_sqrt		// CHECK-LABEL: func @gpu_sqrt
func @gpu_sqrt(%arg_f16 : f16, %arg_f32 : f32, %arg_f64 : f64)		builtin.func @gpu_sqrt(%arg_f16 : f16, %arg_f32 : f32, %arg_f64 : f64)
-> (f16, f32, f64) {		-> (f16, f32, f64) {
%result16 = math.sqrt %arg_f16 : f16		%result16 = math.sqrt %arg_f16 : f16
// CHECK: llvm.fpext %{{.*}} : f16 to f32		// CHECK: llvm.fpext %{{.*}} : f16 to f32
// CHECK-NEXT: llvm.call @__nv_sqrtf(%{{.*}}) : (f32) -> f32		// CHECK-NEXT: llvm.call @__nv_sqrtf(%{{.*}}) : (f32) -> f32
// CHECK-NEXT: llvm.fptrunc %{{.*}} : f32 to f16		// CHECK-NEXT: llvm.fptrunc %{{.*}} : f32 to f16
%result32 = math.sqrt %arg_f32 : f32		%result32 = math.sqrt %arg_f32 : f32
// CHECK: llvm.call @__nv_sqrtf(%{{.*}}) : (f32) -> f32		// CHECK: llvm.call @__nv_sqrtf(%{{.*}}) : (f32) -> f32
%result64 = math.sqrt %arg_f64 : f64		%result64 = math.sqrt %arg_f64 : f64
// CHECK: llvm.call @__nv_sqrt(%{{.*}}) : (f64) -> f64		// CHECK: llvm.call @__nv_sqrt(%{{.*}}) : (f64) -> f64
std.return %result16, %result32, %result64 : f16, f32, f64		std.return %result16, %result32, %result64 : f16, f32, f64
}		}
}		}

// -----		// -----

gpu.module @test_module {		gpu.module @test_module {
// CHECK: llvm.func @__nv_atanf(f32) -> f32		// CHECK: llvm.func @__nv_atanf(f32) -> f32
// CHECK: llvm.func @__nv_atan(f64) -> f64		// CHECK: llvm.func @__nv_atan(f64) -> f64
// CHECK-LABEL: func @gpu_atan		// CHECK-LABEL: func @gpu_atan
func @gpu_atan(%arg_f16 : f16, %arg_f32 : f32, %arg_f64 : f64)		builtin.func @gpu_atan(%arg_f16 : f16, %arg_f32 : f32, %arg_f64 : f64)
-> (f16, f32, f64) {		-> (f16, f32, f64) {
%result16 = math.atan %arg_f16 : f16		%result16 = math.atan %arg_f16 : f16
// CHECK: llvm.fpext %{{.*}} : f16 to f32		// CHECK: llvm.fpext %{{.*}} : f16 to f32
// CHECK-NEXT: llvm.call @__nv_atanf(%{{.*}}) : (f32) -> f32		// CHECK-NEXT: llvm.call @__nv_atanf(%{{.*}}) : (f32) -> f32
// CHECK-NEXT: llvm.fptrunc %{{.*}} : f32 to f16		// CHECK-NEXT: llvm.fptrunc %{{.*}} : f32 to f16
%result32 = math.atan %arg_f32 : f32		%result32 = math.atan %arg_f32 : f32
// CHECK: llvm.call @__nv_atanf(%{{.*}}) : (f32) -> f32		// CHECK: llvm.call @__nv_atanf(%{{.*}}) : (f32) -> f32
%result64 = math.atan %arg_f64 : f64		%result64 = math.atan %arg_f64 : f64
// CHECK: llvm.call @__nv_atan(%{{.*}}) : (f64) -> f64		// CHECK: llvm.call @__nv_atan(%{{.*}}) : (f64) -> f64
std.return %result16, %result32, %result64 : f16, f32, f64		std.return %result16, %result32, %result64 : f16, f32, f64
}		}
}		}

// -----		// -----

gpu.module @test_module {		gpu.module @test_module {
// CHECK: llvm.func @__nv_atan2f(f32, f32) -> f32		// CHECK: llvm.func @__nv_atan2f(f32, f32) -> f32
// CHECK: llvm.func @__nv_atan2(f64, f64) -> f64		// CHECK: llvm.func @__nv_atan2(f64, f64) -> f64
// CHECK-LABEL: func @gpu_atan2		// CHECK-LABEL: func @gpu_atan2
func @gpu_atan2(%arg_f16 : f16, %arg_f32 : f32, %arg_f64 : f64)		builtin.func @gpu_atan2(%arg_f16 : f16, %arg_f32 : f32, %arg_f64 : f64)
-> (f16, f32, f64) {		-> (f16, f32, f64) {
%result16 = math.atan2 %arg_f16, %arg_f16 : f16		%result16 = math.atan2 %arg_f16, %arg_f16 : f16
// CHECK: llvm.fpext %{{.*}} : f16 to f32		// CHECK: llvm.fpext %{{.*}} : f16 to f32
// CHECK: llvm.fpext %{{.*}} : f16 to f32		// CHECK: llvm.fpext %{{.*}} : f16 to f32
// CHECK-NEXT: llvm.call @__nv_atan2f(%{{.*}}) : (f32, f32) -> f32		// CHECK-NEXT: llvm.call @__nv_atan2f(%{{.*}}) : (f32, f32) -> f32
// CHECK-NEXT: llvm.fptrunc %{{.*}} : f32 to f16		// CHECK-NEXT: llvm.fptrunc %{{.*}} : f32 to f16
%result32 = math.atan2 %arg_f32, %arg_f32 : f32		%result32 = math.atan2 %arg_f32, %arg_f32 : f32
// CHECK: llvm.call @__nv_atan2f(%{{.*}}) : (f32, f32) -> f32		// CHECK: llvm.call @__nv_atan2f(%{{.*}}) : (f32, f32) -> f32
%result64 = math.atan2 %arg_f64, %arg_f64 : f64		%result64 = math.atan2 %arg_f64, %arg_f64 : f64
// CHECK: llvm.call @__nv_atan2(%{{.*}}) : (f64, f64) -> f64		// CHECK: llvm.call @__nv_atan2(%{{.*}}) : (f64, f64) -> f64
std.return %result16, %result32, %result64 : f16, f32, f64		std.return %result16, %result32, %result64 : f16, f32, f64
}		}
}		}

// -----		// -----

// Test that we handled properly operation with SymbolTable other than module op		// Test that we handled properly operation with SymbolTable other than module op
gpu.module @test_module {		gpu.module @test_module {
"test.symbol_scope"() ({		"test.symbol_scope"() ({
// CHECK: test.symbol_scope		// CHECK: test.symbol_scope
// CHECK: llvm.func @__nv_expf(f32) -> f32		// CHECK: llvm.func @__nv_expf(f32) -> f32
// CHECK: llvm.func @__nv_exp(f64) -> f64		// CHECK: llvm.func @__nv_exp(f64) -> f64
// CHECK-LABEL: func @gpu_exp		// CHECK-LABEL: func @gpu_exp
func @gpu_exp(%arg_f32 : f32, %arg_f64 : f64) -> (f32, f64) {		builtin.func @gpu_exp(%arg_f32 : f32, %arg_f64 : f64) -> (f32, f64) {
%result32 = math.exp %arg_f32 : f32		%result32 = math.exp %arg_f32 : f32
// CHECK: llvm.call @__nv_expf(%{{.*}}) : (f32) -> f32		// CHECK: llvm.call @__nv_expf(%{{.*}}) : (f32) -> f32
%result64 = math.exp %arg_f64 : f64		%result64 = math.exp %arg_f64 : f64
// CHECK: llvm.call @__nv_exp(%{{.*}}) : (f64) -> f64		// CHECK: llvm.call @__nv_exp(%{{.*}}) : (f64) -> f64
std.return %result32, %result64 : f32, f64		std.return %result32, %result64 : f32, f64
}		}
"test.finish" () : () -> ()		"test.finish" () : () -> ()
}) : () -> ()		}) : () -> ()
}		}

// -----		// -----

gpu.module @test_module {		gpu.module @test_module {
// CHECK: llvm.func @__nv_expm1f(f32) -> f32		// CHECK: llvm.func @__nv_expm1f(f32) -> f32
// CHECK: llvm.func @__nv_expm1(f64) -> f64		// CHECK: llvm.func @__nv_expm1(f64) -> f64
// CHECK-LABEL: func @gpu_expm1		// CHECK-LABEL: func @gpu_expm1
func @gpu_expm1(%arg_f32 : f32, %arg_f64 : f64) -> (f32, f64) {		builtin.func @gpu_expm1(%arg_f32 : f32, %arg_f64 : f64) -> (f32, f64) {
%result32 = math.expm1 %arg_f32 : f32		%result32 = math.expm1 %arg_f32 : f32
// CHECK: llvm.call @__nv_expm1f(%{{.*}}) : (f32) -> f32		// CHECK: llvm.call @__nv_expm1f(%{{.*}}) : (f32) -> f32
%result64 = math.expm1 %arg_f64 : f64		%result64 = math.expm1 %arg_f64 : f64
// CHECK: llvm.call @__nv_expm1(%{{.*}}) : (f64) -> f64		// CHECK: llvm.call @__nv_expm1(%{{.*}}) : (f64) -> f64
std.return %result32, %result64 : f32, f64		std.return %result32, %result64 : f32, f64
}		}
}		}

// -----		// -----

gpu.module @test_module {		gpu.module @test_module {
// CHECK: llvm.func @__nv_powf(f32, f32) -> f32		// CHECK: llvm.func @__nv_powf(f32, f32) -> f32
// CHECK: llvm.func @__nv_pow(f64, f64) -> f64		// CHECK: llvm.func @__nv_pow(f64, f64) -> f64
// CHECK-LABEL: func @gpu_pow		// CHECK-LABEL: func @gpu_pow
func @gpu_pow(%arg_f32 : f32, %arg_f64 : f64) -> (f32, f64) {		builtin.func @gpu_pow(%arg_f32 : f32, %arg_f64 : f64) -> (f32, f64) {
%result32 = math.powf %arg_f32, %arg_f32 : f32		%result32 = math.powf %arg_f32, %arg_f32 : f32
// CHECK: llvm.call @__nv_powf(%{{.}}, %{{.}}) : (f32, f32) -> f32		// CHECK: llvm.call @__nv_powf(%{{.}}, %{{.}}) : (f32, f32) -> f32
%result64 = math.powf %arg_f64, %arg_f64 : f64		%result64 = math.powf %arg_f64, %arg_f64 : f64
// CHECK: llvm.call @__nv_pow(%{{.}}, %{{.}}) : (f64, f64) -> f64		// CHECK: llvm.call @__nv_pow(%{{.}}, %{{.}}) : (f64, f64) -> f64
std.return %result32, %result64 : f32, f64		std.return %result32, %result64 : f32, f64
}		}
}		}

Show All 11 Lines

mlir/test/Conversion/GPUToNVVM/wmma-ops-to-nvvm.mlir

	// RUN: mlir-opt --convert-gpu-to-nvvm="index-bitwidth=32" --split-input-file %s \| FileCheck %s			// RUN: mlir-opt --convert-gpu-to-nvvm="index-bitwidth=32" --split-input-file %s \| FileCheck %s

	gpu.module @test_module {			gpu.module @test_module {

	// CHECK-LABEL: func @gpu_wmma_load_op() ->			// CHECK-LABEL: func @gpu_wmma_load_op() ->
	// CHECK-SAME: !llvm.struct<(vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>)> {			// CHECK-SAME: !llvm.struct<(vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>)> {
	func @gpu_wmma_load_op() -> (!gpu.mma_matrix<16x16xf16, "AOp">) {			builtin.func @gpu_wmma_load_op() -> (!gpu.mma_matrix<16x16xf16, "AOp">) {
	%wg = memref.alloca() {alignment = 32} : memref<32x32xf16, 3>			%wg = memref.alloca() {alignment = 32} : memref<32x32xf16, 3>
	%i = constant 16 : index			%i = constant 16 : index
	%j = constant 16 : index			%j = constant 16 : index
	%0 = gpu.subgroup_mma_load_matrix %wg[%i, %j] {leadDimension = 32 : index} : memref<32x32xf16, 3> -> !gpu.mma_matrix<16x16xf16, "AOp">			%0 = gpu.subgroup_mma_load_matrix %wg[%i, %j] {leadDimension = 32 : index} : memref<32x32xf16, 3> -> !gpu.mma_matrix<16x16xf16, "AOp">
	// CHECK: %[[INX:.*]] = llvm.mlir.constant(16 : index) : i32			// CHECK: %[[INX:.*]] = llvm.mlir.constant(16 : index) : i32
	// CHECK: %{{.}} = llvm.insertvalue %{{.}}, %{{.}}[{{.}}, {{.*}}]			// CHECK: %{{.}} = llvm.insertvalue %{{.}}, %{{.}}[{{.}}, {{.*}}]
	// CHECK: %[[LDM:.*]] = llvm.mlir.constant(32 : index) : i32			// CHECK: %[[LDM:.*]] = llvm.mlir.constant(32 : index) : i32
	// CHECK: %[[LI:.*]] = llvm.mul %[[LDM]], %[[INX]] : i32			// CHECK: %[[LI:.*]] = llvm.mul %[[LDM]], %[[INX]] : i32
	Show All 10 Lines
	}			}

	// -----			// -----

	gpu.module @test_module {			gpu.module @test_module {

	// CHECK-LABEL: func @gpu_wmma_store_op			// CHECK-LABEL: func @gpu_wmma_store_op
	// CHECK-SAME: (%[[D:.*]]: !llvm.struct<(vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>)>) {			// CHECK-SAME: (%[[D:.*]]: !llvm.struct<(vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>)>) {
	func @gpu_wmma_store_op(%arg0 : !gpu.mma_matrix<16x16xf16, "COp">) -> () {			builtin.func @gpu_wmma_store_op(%arg0 : !gpu.mma_matrix<16x16xf16, "COp">) -> () {
	%sg = memref.alloca(){alignment = 32} : memref<32x32xf16, 3>			%sg = memref.alloca(){alignment = 32} : memref<32x32xf16, 3>
	%i = constant 16 : index			%i = constant 16 : index
	%j = constant 16 : index			%j = constant 16 : index
	gpu.subgroup_mma_store_matrix %arg0, %sg[%i,%j] {leadDimension= 32 : index} : !gpu.mma_matrix<16x16xf16, "COp">, memref<32x32xf16, 3>			gpu.subgroup_mma_store_matrix %arg0, %sg[%i,%j] {leadDimension= 32 : index} : !gpu.mma_matrix<16x16xf16, "COp">, memref<32x32xf16, 3>
	// CHECK: %[[INX:.*]] = llvm.mlir.constant(16 : index) : i32			// CHECK: %[[INX:.*]] = llvm.mlir.constant(16 : index) : i32
	// CHECK: %{{.}} = llvm.insertvalue %{{.}}, %{{.}}[{{.}}, {{.*}}]			// CHECK: %{{.}} = llvm.insertvalue %{{.}}, %{{.}}[{{.}}, {{.*}}]
	// CHECK: %[[LDM:.*]] = llvm.mlir.constant(32 : index) : i32			// CHECK: %[[LDM:.*]] = llvm.mlir.constant(32 : index) : i32
	// CHECK: %[[LI:.*]] = llvm.mul %[[LDM]], %[[INX]] : i32			// CHECK: %[[LI:.*]] = llvm.mul %[[LDM]], %[[INX]] : i32
	Show All 14 Lines
	}			}

	// -----			// -----

	gpu.module @test_module {			gpu.module @test_module {

	// CHECK-LABEL: func @gpu_wmma_mma_op			// CHECK-LABEL: func @gpu_wmma_mma_op
	// CHECK-SAME: (%[[A:.]]: !llvm.struct<(vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>)>, %[[B:.]]: !llvm.struct<(vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>)>, %[[C:.*]]: !llvm.struct<(vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>)>)			// CHECK-SAME: (%[[A:.]]: !llvm.struct<(vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>)>, %[[B:.]]: !llvm.struct<(vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>)>, %[[C:.*]]: !llvm.struct<(vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>)>)
	func @gpu_wmma_mma_op(%A : !gpu.mma_matrix<16x16xf16, "AOp">, %B : !gpu.mma_matrix<16x16xf16, "BOp">, %C : !gpu.mma_matrix<16x16xf16, "COp">) -> (!gpu.mma_matrix<16x16xf16, "COp">) {			builtin.func @gpu_wmma_mma_op(%A : !gpu.mma_matrix<16x16xf16, "AOp">, %B : !gpu.mma_matrix<16x16xf16, "BOp">, %C : !gpu.mma_matrix<16x16xf16, "COp">) -> (!gpu.mma_matrix<16x16xf16, "COp">) {
	%D = gpu.subgroup_mma_compute %A, %B, %C : !gpu.mma_matrix<16x16xf16, "AOp">, !gpu.mma_matrix<16x16xf16, "BOp"> -> !gpu.mma_matrix<16x16xf16, "COp">			%D = gpu.subgroup_mma_compute %A, %B, %C : !gpu.mma_matrix<16x16xf16, "AOp">, !gpu.mma_matrix<16x16xf16, "BOp"> -> !gpu.mma_matrix<16x16xf16, "COp">
	// CHECK: %[[A1:.*]] = llvm.extractvalue %[[A]][0 : i32] : !llvm.struct<(vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>)>			// CHECK: %[[A1:.*]] = llvm.extractvalue %[[A]][0 : i32] : !llvm.struct<(vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>)>
	// CHECK: %[[A2:.*]] = llvm.extractvalue %[[A]][1 : i32] : !llvm.struct<(vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>)>			// CHECK: %[[A2:.*]] = llvm.extractvalue %[[A]][1 : i32] : !llvm.struct<(vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>)>
	// CHECK: %[[A3:.*]] = llvm.extractvalue %[[A]][2 : i32] : !llvm.struct<(vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>)>			// CHECK: %[[A3:.*]] = llvm.extractvalue %[[A]][2 : i32] : !llvm.struct<(vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>)>
	// CHECK: %[[A4:.*]] = llvm.extractvalue %[[A]][3 : i32] : !llvm.struct<(vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>)>			// CHECK: %[[A4:.*]] = llvm.extractvalue %[[A]][3 : i32] : !llvm.struct<(vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>)>
	// CHECK: %[[A5:.*]] = llvm.extractvalue %[[A]][4 : i32] : !llvm.struct<(vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>)>			// CHECK: %[[A5:.*]] = llvm.extractvalue %[[A]][4 : i32] : !llvm.struct<(vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>)>
	// CHECK: %[[A6:.*]] = llvm.extractvalue %[[A]][5 : i32] : !llvm.struct<(vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>)>			// CHECK: %[[A6:.*]] = llvm.extractvalue %[[A]][5 : i32] : !llvm.struct<(vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>)>
	// CHECK: %[[A7:.*]] = llvm.extractvalue %[[A]][6 : i32] : !llvm.struct<(vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>)>			// CHECK: %[[A7:.*]] = llvm.extractvalue %[[A]][6 : i32] : !llvm.struct<(vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>)>
	▲ Show 20 Lines • Show All 52 Lines • ▼ Show 20 Lines
	// CHECK: llvm.br ^bb1(%{{.*}}, %[[ACC_MUL]] : i32, !llvm.struct<(vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>)>)			// CHECK: llvm.br ^bb1(%{{.*}}, %[[ACC_MUL]] : i32, !llvm.struct<(vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>)>)
	// CHECK: ^bb3: // pred: ^bb1			// CHECK: ^bb3: // pred: ^bb1
	// CHECK: %87 = llvm.extractvalue %[[ACC]][0 : i32] : !llvm.struct<(vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>)>			// CHECK: %87 = llvm.extractvalue %[[ACC]][0 : i32] : !llvm.struct<(vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>)>
	// CHECK: %88 = llvm.extractvalue %[[ACC]][1 : i32] : !llvm.struct<(vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>)>			// CHECK: %88 = llvm.extractvalue %[[ACC]][1 : i32] : !llvm.struct<(vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>)>
	// CHECK: %89 = llvm.extractvalue %[[ACC]][2 : i32] : !llvm.struct<(vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>)>			// CHECK: %89 = llvm.extractvalue %[[ACC]][2 : i32] : !llvm.struct<(vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>)>
	// CHECK: %90 = llvm.extractvalue %[[ACC]][3 : i32] : !llvm.struct<(vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>)>			// CHECK: %90 = llvm.extractvalue %[[ACC]][3 : i32] : !llvm.struct<(vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>)>
	// CHECK: nvvm.wmma.m16n16k16.store.d.f16.row.stride %86, %87, %88, %89, %90, %79 : !llvm.ptr<i32>, vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>, i32			// CHECK: nvvm.wmma.m16n16k16.store.d.f16.row.stride %86, %87, %88, %89, %90, %79 : !llvm.ptr<i32>, vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>, i32

	func @gpu_wmma_mma_loop_op(%arg0: memref<128x128xf16>, %arg1: memref<128x128xf16>, %arg2: memref<128x128xf16>) {			builtin.func @gpu_wmma_mma_loop_op(%arg0: memref<128x128xf16>, %arg1: memref<128x128xf16>, %arg2: memref<128x128xf16>) {
	%c0 = constant 0 : index			%c0 = constant 0 : index
	%c128 = constant 128 : index			%c128 = constant 128 : index
	%c32 = constant 32 : index			%c32 = constant 32 : index
	%0 = gpu.subgroup_mma_load_matrix %arg2[%c0, %c0] {leadDimension = 128 : index} : memref<128x128xf16> -> !gpu.mma_matrix<16x16xf16, "COp">			%0 = gpu.subgroup_mma_load_matrix %arg2[%c0, %c0] {leadDimension = 128 : index} : memref<128x128xf16> -> !gpu.mma_matrix<16x16xf16, "COp">
	br ^bb1(%c0, %0 : index, !gpu.mma_matrix<16x16xf16, "COp">)			br ^bb1(%c0, %0 : index, !gpu.mma_matrix<16x16xf16, "COp">)
	^bb1(%1: index, %2: !gpu.mma_matrix<16x16xf16, "COp">): // 2 preds: ^bb0, ^bb2			^bb1(%1: index, %2: !gpu.mma_matrix<16x16xf16, "COp">): // 2 preds: ^bb0, ^bb2
	%3 = cmpi slt, %1, %c128 : index			%3 = cmpi slt, %1, %c128 : index
	cond_br %3, ^bb2, ^bb3			cond_br %3, ^bb2, ^bb3
	Show All 22 Lines
	// CHECK: %[[C1:.+]] = llvm.mlir.constant(1 : i32) : i32			// CHECK: %[[C1:.+]] = llvm.mlir.constant(1 : i32) : i32
	// CHECK: %[[V2:.+]] = llvm.insertelement %[[CST]], %[[V1]][%[[C1]] : i32] : vector<2xf16>			// CHECK: %[[V2:.+]] = llvm.insertelement %[[CST]], %[[V1]][%[[C1]] : i32] : vector<2xf16>
	// CHECK: %[[M0:.+]] = llvm.mlir.undef : !llvm.struct<(vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>)>			// CHECK: %[[M0:.+]] = llvm.mlir.undef : !llvm.struct<(vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>)>
	// CHECK: %[[M1:.+]] = llvm.insertvalue %[[V2]], %[[M0]][0 : i32] : !llvm.struct<(vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>)>			// CHECK: %[[M1:.+]] = llvm.insertvalue %[[V2]], %[[M0]][0 : i32] : !llvm.struct<(vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>)>
	// CHECK: %[[M2:.+]] = llvm.insertvalue %[[V2]], %[[M1]][1 : i32] : !llvm.struct<(vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>)>			// CHECK: %[[M2:.+]] = llvm.insertvalue %[[V2]], %[[M1]][1 : i32] : !llvm.struct<(vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>)>
	// CHECK: %[[M3:.+]] = llvm.insertvalue %[[V2]], %[[M2]][2 : i32] : !llvm.struct<(vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>)>			// CHECK: %[[M3:.+]] = llvm.insertvalue %[[V2]], %[[M2]][2 : i32] : !llvm.struct<(vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>)>
	// CHECK: %[[M4:.+]] = llvm.insertvalue %[[V2]], %[[M3]][3 : i32] : !llvm.struct<(vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>)>			// CHECK: %[[M4:.+]] = llvm.insertvalue %[[V2]], %[[M3]][3 : i32] : !llvm.struct<(vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>)>
	// CHECK: llvm.return %[[M4]] : !llvm.struct<(vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>)>			// CHECK: llvm.return %[[M4]] : !llvm.struct<(vector<2xf16>, vector<2xf16>, vector<2xf16>, vector<2xf16>)>
	func @gpu_wmma_constant_op() ->(!gpu.mma_matrix<16x16xf16, "COp">) {			builtin.func @gpu_wmma_constant_op() ->(!gpu.mma_matrix<16x16xf16, "COp">) {
	%cst = constant 1.0 : f16			%cst = constant 1.0 : f16
	%C = gpu.subgroup_mma_constant_matrix %cst : !gpu.mma_matrix<16x16xf16, "COp">			%C = gpu.subgroup_mma_constant_matrix %cst : !gpu.mma_matrix<16x16xf16, "COp">
	return %C : !gpu.mma_matrix<16x16xf16, "COp">			return %C : !gpu.mma_matrix<16x16xf16, "COp">
	}			}
	}			}

mlir/test/Conversion/GPUToROCDL/gpu-to-rocdl.mlir

// RUN: mlir-opt %s -convert-gpu-to-rocdl -split-input-file \| FileCheck %s		// RUN: mlir-opt %s -convert-gpu-to-rocdl -split-input-file \| FileCheck %s
// RUN: mlir-opt %s -convert-gpu-to-rocdl='index-bitwidth=32' -split-input-file \| FileCheck --check-prefix=CHECK32 %s		// RUN: mlir-opt %s -convert-gpu-to-rocdl='index-bitwidth=32' -split-input-file \| FileCheck --check-prefix=CHECK32 %s

gpu.module @test_module {		gpu.module @test_module {
// CHECK-LABEL: func @gpu_index_ops()		// CHECK-LABEL: func @gpu_index_ops()
// CHECK32-LABEL: func @gpu_index_ops()		// CHECK32-LABEL: func @gpu_index_ops()
func @gpu_index_ops()		builtin.func @gpu_index_ops()
-> (index, index, index, index, index, index,		-> (index, index, index, index, index, index,
index, index, index, index, index, index) {		index, index, index, index, index, index) {
// CHECK32-NOT: = llvm.sext %{{.*}} : i32 to i64		// CHECK32-NOT: = llvm.sext %{{.*}} : i32 to i64

// CHECK: rocdl.workitem.id.x : i32		// CHECK: rocdl.workitem.id.x : i32
// CHECK: = llvm.sext %{{.*}} : i32 to i64		// CHECK: = llvm.sext %{{.*}} : i32 to i64
%tIdX = "gpu.thread_id"() {dimension = "x"} : () -> (index)		%tIdX = "gpu.thread_id"() {dimension = "x"} : () -> (index)
// CHECK: rocdl.workitem.id.y : i32		// CHECK: rocdl.workitem.id.y : i32
Show All 40 Lines	gpu.module @test_module {
}		}
}		}

// -----		// -----

gpu.module @test_module {		gpu.module @test_module {
// CHECK-LABEL: func @gpu_index_comp		// CHECK-LABEL: func @gpu_index_comp
// CHECK32-LABEL: func @gpu_index_comp		// CHECK32-LABEL: func @gpu_index_comp
func @gpu_index_comp(%idx : index) -> index {		builtin.func @gpu_index_comp(%idx : index) -> index {
// CHECK: = llvm.add %{{.}}, %{{.}} : i64		// CHECK: = llvm.add %{{.}}, %{{.}} : i64
// CHECK32: = llvm.add %{{.}}, %{{.}} : i32		// CHECK32: = llvm.add %{{.}}, %{{.}} : i32
%0 = addi %idx, %idx : index		%0 = addi %idx, %idx : index
// CHECK: llvm.return %{{.*}} : i64		// CHECK: llvm.return %{{.*}} : i64
// CHECK32: llvm.return %{{.*}} : i32		// CHECK32: llvm.return %{{.*}} : i32
std.return %0 : index		std.return %0 : index
}		}
}		}

// -----		// -----

gpu.module @test_module {		gpu.module @test_module {
// CHECK-LABEL: func @gpu_sync()		// CHECK-LABEL: func @gpu_sync()
func @gpu_sync() {		builtin.func @gpu_sync() {
// CHECK: rocdl.barrier		// CHECK: rocdl.barrier
gpu.barrier		gpu.barrier
std.return		std.return
}		}
}		}

// -----		// -----

gpu.module @test_module {		gpu.module @test_module {
// CHECK: llvm.func @__ocml_fabs_f32(f32) -> f32		// CHECK: llvm.func @__ocml_fabs_f32(f32) -> f32
// CHECK: llvm.func @__ocml_fabs_f64(f64) -> f64		// CHECK: llvm.func @__ocml_fabs_f64(f64) -> f64
// CHECK-LABEL: func @gpu_fabs		// CHECK-LABEL: func @gpu_fabs
func @gpu_fabs(%arg_f32 : f32, %arg_f64 : f64) -> (f32, f64) {		builtin.func @gpu_fabs(%arg_f32 : f32, %arg_f64 : f64) -> (f32, f64) {
%result32 = std.absf %arg_f32 : f32		%result32 = std.absf %arg_f32 : f32
// CHECK: llvm.call @__ocml_fabs_f32(%{{.*}}) : (f32) -> f32		// CHECK: llvm.call @__ocml_fabs_f32(%{{.*}}) : (f32) -> f32
%result64 = std.absf %arg_f64 : f64		%result64 = std.absf %arg_f64 : f64
// CHECK: llvm.call @__ocml_fabs_f64(%{{.*}}) : (f64) -> f64		// CHECK: llvm.call @__ocml_fabs_f64(%{{.*}}) : (f64) -> f64
std.return %result32, %result64 : f32, f64		std.return %result32, %result64 : f32, f64
}		}
}		}

// -----		// -----

gpu.module @test_module {		gpu.module @test_module {
// CHECK: llvm.func @__ocml_ceil_f32(f32) -> f32		// CHECK: llvm.func @__ocml_ceil_f32(f32) -> f32
// CHECK: llvm.func @__ocml_ceil_f64(f64) -> f64		// CHECK: llvm.func @__ocml_ceil_f64(f64) -> f64
// CHECK-LABEL: func @gpu_ceil		// CHECK-LABEL: func @gpu_ceil
func @gpu_ceil(%arg_f32 : f32, %arg_f64 : f64) -> (f32, f64) {		builtin.func @gpu_ceil(%arg_f32 : f32, %arg_f64 : f64) -> (f32, f64) {
%result32 = std.ceilf %arg_f32 : f32		%result32 = std.ceilf %arg_f32 : f32
// CHECK: llvm.call @__ocml_ceil_f32(%{{.*}}) : (f32) -> f32		// CHECK: llvm.call @__ocml_ceil_f32(%{{.*}}) : (f32) -> f32
%result64 = std.ceilf %arg_f64 : f64		%result64 = std.ceilf %arg_f64 : f64
// CHECK: llvm.call @__ocml_ceil_f64(%{{.*}}) : (f64) -> f64		// CHECK: llvm.call @__ocml_ceil_f64(%{{.*}}) : (f64) -> f64
std.return %result32, %result64 : f32, f64		std.return %result32, %result64 : f32, f64
}		}
}		}

// -----		// -----

gpu.module @test_module {		gpu.module @test_module {
// CHECK: llvm.func @__ocml_floor_f32(f32) -> f32		// CHECK: llvm.func @__ocml_floor_f32(f32) -> f32
// CHECK: llvm.func @__ocml_floor_f64(f64) -> f64		// CHECK: llvm.func @__ocml_floor_f64(f64) -> f64
// CHECK-LABEL: func @gpu_floor		// CHECK-LABEL: func @gpu_floor
func @gpu_floor(%arg_f32 : f32, %arg_f64 : f64) -> (f32, f64) {		builtin.func @gpu_floor(%arg_f32 : f32, %arg_f64 : f64) -> (f32, f64) {
%result32 = std.floorf %arg_f32 : f32		%result32 = std.floorf %arg_f32 : f32
// CHECK: llvm.call @__ocml_floor_f32(%{{.*}}) : (f32) -> f32		// CHECK: llvm.call @__ocml_floor_f32(%{{.*}}) : (f32) -> f32
%result64 = std.floorf %arg_f64 : f64		%result64 = std.floorf %arg_f64 : f64
// CHECK: llvm.call @__ocml_floor_f64(%{{.*}}) : (f64) -> f64		// CHECK: llvm.call @__ocml_floor_f64(%{{.*}}) : (f64) -> f64
std.return %result32, %result64 : f32, f64		std.return %result32, %result64 : f32, f64
}		}
}		}

// -----		// -----

gpu.module @test_module {		gpu.module @test_module {
// CHECK: llvm.func @__ocml_cos_f32(f32) -> f32		// CHECK: llvm.func @__ocml_cos_f32(f32) -> f32
// CHECK: llvm.func @__ocml_cos_f64(f64) -> f64		// CHECK: llvm.func @__ocml_cos_f64(f64) -> f64
// CHECK-LABEL: func @gpu_cos		// CHECK-LABEL: func @gpu_cos
func @gpu_cos(%arg_f32 : f32, %arg_f64 : f64) -> (f32, f64) {		builtin.func @gpu_cos(%arg_f32 : f32, %arg_f64 : f64) -> (f32, f64) {
%result32 = math.cos %arg_f32 : f32		%result32 = math.cos %arg_f32 : f32
// CHECK: llvm.call @__ocml_cos_f32(%{{.*}}) : (f32) -> f32		// CHECK: llvm.call @__ocml_cos_f32(%{{.*}}) : (f32) -> f32
%result64 = math.cos %arg_f64 : f64		%result64 = math.cos %arg_f64 : f64
// CHECK: llvm.call @__ocml_cos_f64(%{{.*}}) : (f64) -> f64		// CHECK: llvm.call @__ocml_cos_f64(%{{.*}}) : (f64) -> f64
std.return %result32, %result64 : f32, f64		std.return %result32, %result64 : f32, f64
}		}
}		}

// -----		// -----

gpu.module @test_module {		gpu.module @test_module {
// CHECK: llvm.func @__ocml_exp_f32(f32) -> f32		// CHECK: llvm.func @__ocml_exp_f32(f32) -> f32
// CHECK: llvm.func @__ocml_exp_f64(f64) -> f64		// CHECK: llvm.func @__ocml_exp_f64(f64) -> f64
// CHECK-LABEL: func @gpu_exp		// CHECK-LABEL: func @gpu_exp
func @gpu_exp(%arg_f32 : f32, %arg_f64 : f64) -> (f32, f64) {		builtin.func @gpu_exp(%arg_f32 : f32, %arg_f64 : f64) -> (f32, f64) {
%exp_f32 = math.exp %arg_f32 : f32		%exp_f32 = math.exp %arg_f32 : f32
// CHECK: llvm.call @__ocml_exp_f32(%{{.*}}) : (f32) -> f32		// CHECK: llvm.call @__ocml_exp_f32(%{{.*}}) : (f32) -> f32
%result32 = math.exp %exp_f32 : f32		%result32 = math.exp %exp_f32 : f32
// CHECK: llvm.call @__ocml_exp_f32(%{{.*}}) : (f32) -> f32		// CHECK: llvm.call @__ocml_exp_f32(%{{.*}}) : (f32) -> f32
%result64 = math.exp %arg_f64 : f64		%result64 = math.exp %arg_f64 : f64
// CHECK: llvm.call @__ocml_exp_f64(%{{.*}}) : (f64) -> f64		// CHECK: llvm.call @__ocml_exp_f64(%{{.*}}) : (f64) -> f64
std.return %result32, %result64 : f32, f64		std.return %result32, %result64 : f32, f64
}		}
}		}

// -----		// -----

gpu.module @test_module {		gpu.module @test_module {
// CHECK: llvm.func @__ocml_exp2_f32(f32) -> f32		// CHECK: llvm.func @__ocml_exp2_f32(f32) -> f32
// CHECK: llvm.func @__ocml_exp2_f64(f64) -> f64		// CHECK: llvm.func @__ocml_exp2_f64(f64) -> f64
// CHECK-LABEL: func @gpu_exp2		// CHECK-LABEL: func @gpu_exp2
func @gpu_exp2(%arg_f32 : f32, %arg_f64 : f64) -> (f32, f64) {		builtin.func @gpu_exp2(%arg_f32 : f32, %arg_f64 : f64) -> (f32, f64) {
%exp2_f32 = math.exp2 %arg_f32 : f32		%exp2_f32 = math.exp2 %arg_f32 : f32
// CHECK: llvm.call @__ocml_exp2_f32(%{{.*}}) : (f32) -> f32		// CHECK: llvm.call @__ocml_exp2_f32(%{{.*}}) : (f32) -> f32
%result32 = math.exp2 %exp2_f32 : f32		%result32 = math.exp2 %exp2_f32 : f32
// CHECK: llvm.call @__ocml_exp2_f32(%{{.*}}) : (f32) -> f32		// CHECK: llvm.call @__ocml_exp2_f32(%{{.*}}) : (f32) -> f32
%result64 = math.exp2 %arg_f64 : f64		%result64 = math.exp2 %arg_f64 : f64
// CHECK: llvm.call @__ocml_exp2_f64(%{{.*}}) : (f64) -> f64		// CHECK: llvm.call @__ocml_exp2_f64(%{{.*}}) : (f64) -> f64
std.return %result32, %result64 : f32, f64		std.return %result32, %result64 : f32, f64
}		}
}		}

// -----		// -----

// Test that we handled properly operation with SymbolTable other than module op		// Test that we handled properly operation with SymbolTable other than module op
gpu.module @test_module {		gpu.module @test_module {
"test.symbol_scope"() ({		"test.symbol_scope"() ({
// CHECK: test.symbol_scope		// CHECK: test.symbol_scope
// CHECK: llvm.func @__ocml_exp_f32(f32) -> f32		// CHECK: llvm.func @__ocml_exp_f32(f32) -> f32
// CHECK: llvm.func @__ocml_exp_f64(f64) -> f64		// CHECK: llvm.func @__ocml_exp_f64(f64) -> f64
// CHECK-LABEL: func @gpu_exp		// CHECK-LABEL: func @gpu_exp
func @gpu_exp(%arg_f32 : f32, %arg_f64 : f64) -> (f32, f64) {		builtin.func @gpu_exp(%arg_f32 : f32, %arg_f64 : f64) -> (f32, f64) {
%exp_f32 = math.exp %arg_f32 : f32		%exp_f32 = math.exp %arg_f32 : f32
// CHECK: llvm.call @__ocml_exp_f32(%{{.*}}) : (f32) -> f32		// CHECK: llvm.call @__ocml_exp_f32(%{{.*}}) : (f32) -> f32
%result32 = math.exp %exp_f32 : f32		%result32 = math.exp %exp_f32 : f32
// CHECK: llvm.call @__ocml_exp_f32(%{{.*}}) : (f32) -> f32		// CHECK: llvm.call @__ocml_exp_f32(%{{.*}}) : (f32) -> f32
%result64 = math.exp %arg_f64 : f64		%result64 = math.exp %arg_f64 : f64
// CHECK: llvm.call @__ocml_exp_f64(%{{.*}}) : (f64) -> f64		// CHECK: llvm.call @__ocml_exp_f64(%{{.*}}) : (f64) -> f64
std.return %result32, %result64 : f32, f64		std.return %result32, %result64 : f32, f64
}		}
"test.finish" () : () -> ()		"test.finish" () : () -> ()
}) : () -> ()		}) : () -> ()
}		}

// -----		// -----

gpu.module @test_module {		gpu.module @test_module {
// CHECK: llvm.func @__ocml_expm1_f32(f32) -> f32		// CHECK: llvm.func @__ocml_expm1_f32(f32) -> f32
// CHECK: llvm.func @__ocml_expm1_f64(f64) -> f64		// CHECK: llvm.func @__ocml_expm1_f64(f64) -> f64
// CHECK-LABEL: func @gpu_expm1		// CHECK-LABEL: func @gpu_expm1
func @gpu_expm1(%arg_f32 : f32, %arg_f64 : f64) -> (f32, f64) {		builtin.func @gpu_expm1(%arg_f32 : f32, %arg_f64 : f64) -> (f32, f64) {
%expm1_f32 = math.expm1 %arg_f32 : f32		%expm1_f32 = math.expm1 %arg_f32 : f32
// CHECK: llvm.call @__ocml_expm1_f32(%{{.*}}) : (f32) -> f32		// CHECK: llvm.call @__ocml_expm1_f32(%{{.*}}) : (f32) -> f32
%result32 = math.expm1 %expm1_f32 : f32		%result32 = math.expm1 %expm1_f32 : f32
// CHECK: llvm.call @__ocml_expm1_f32(%{{.*}}) : (f32) -> f32		// CHECK: llvm.call @__ocml_expm1_f32(%{{.*}}) : (f32) -> f32
%result64 = math.expm1 %arg_f64 : f64		%result64 = math.expm1 %arg_f64 : f64
// CHECK: llvm.call @__ocml_expm1_f64(%{{.*}}) : (f64) -> f64		// CHECK: llvm.call @__ocml_expm1_f64(%{{.*}}) : (f64) -> f64
std.return %result32, %result64 : f32, f64		std.return %result32, %result64 : f32, f64
}		}
}		}

// -----		// -----

gpu.module @test_module {		gpu.module @test_module {
// CHECK: llvm.func @__ocml_log_f32(f32) -> f32		// CHECK: llvm.func @__ocml_log_f32(f32) -> f32
// CHECK: llvm.func @__ocml_log_f64(f64) -> f64		// CHECK: llvm.func @__ocml_log_f64(f64) -> f64
// CHECK-LABEL: func @gpu_log		// CHECK-LABEL: func @gpu_log
func @gpu_log(%arg_f32 : f32, %arg_f64 : f64) -> (f32, f64) {		builtin.func @gpu_log(%arg_f32 : f32, %arg_f64 : f64) -> (f32, f64) {
%result32 = math.log %arg_f32 : f32		%result32 = math.log %arg_f32 : f32
// CHECK: llvm.call @__ocml_log_f32(%{{.*}}) : (f32) -> f32		// CHECK: llvm.call @__ocml_log_f32(%{{.*}}) : (f32) -> f32
%result64 = math.log %arg_f64 : f64		%result64 = math.log %arg_f64 : f64
// CHECK: llvm.call @__ocml_log_f64(%{{.*}}) : (f64) -> f64		// CHECK: llvm.call @__ocml_log_f64(%{{.*}}) : (f64) -> f64
std.return %result32, %result64 : f32, f64		std.return %result32, %result64 : f32, f64
}		}
}		}

// -----		// -----

gpu.module @test_module {		gpu.module @test_module {
// CHECK: llvm.func @__ocml_log1p_f32(f32) -> f32		// CHECK: llvm.func @__ocml_log1p_f32(f32) -> f32
// CHECK: llvm.func @__ocml_log1p_f64(f64) -> f64		// CHECK: llvm.func @__ocml_log1p_f64(f64) -> f64
// CHECK-LABEL: func @gpu_log1p		// CHECK-LABEL: func @gpu_log1p
func @gpu_log1p(%arg_f32 : f32, %arg_f64 : f64) -> (f32, f64) {		builtin.func @gpu_log1p(%arg_f32 : f32, %arg_f64 : f64) -> (f32, f64) {
%result32 = math.log1p %arg_f32 : f32		%result32 = math.log1p %arg_f32 : f32
// CHECK: llvm.call @__ocml_log1p_f32(%{{.*}}) : (f32) -> f32		// CHECK: llvm.call @__ocml_log1p_f32(%{{.*}}) : (f32) -> f32
%result64 = math.log1p %arg_f64 : f64		%result64 = math.log1p %arg_f64 : f64
// CHECK: llvm.call @__ocml_log1p_f64(%{{.*}}) : (f64) -> f64		// CHECK: llvm.call @__ocml_log1p_f64(%{{.*}}) : (f64) -> f64
std.return %result32, %result64 : f32, f64		std.return %result32, %result64 : f32, f64
}		}
}		}

// -----		// -----

gpu.module @test_module {		gpu.module @test_module {
// CHECK: llvm.func @__ocml_log10_f32(f32) -> f32		// CHECK: llvm.func @__ocml_log10_f32(f32) -> f32
// CHECK: llvm.func @__ocml_log10_f64(f64) -> f64		// CHECK: llvm.func @__ocml_log10_f64(f64) -> f64
// CHECK-LABEL: func @gpu_log10		// CHECK-LABEL: func @gpu_log10
func @gpu_log10(%arg_f32 : f32, %arg_f64 : f64) -> (f32, f64) {		builtin.func @gpu_log10(%arg_f32 : f32, %arg_f64 : f64) -> (f32, f64) {
%result32 = math.log10 %arg_f32 : f32		%result32 = math.log10 %arg_f32 : f32
// CHECK: llvm.call @__ocml_log10_f32(%{{.*}}) : (f32) -> f32		// CHECK: llvm.call @__ocml_log10_f32(%{{.*}}) : (f32) -> f32
%result64 = math.log10 %arg_f64 : f64		%result64 = math.log10 %arg_f64 : f64
// CHECK: llvm.call @__ocml_log10_f64(%{{.*}}) : (f64) -> f64		// CHECK: llvm.call @__ocml_log10_f64(%{{.*}}) : (f64) -> f64
std.return %result32, %result64 : f32, f64		std.return %result32, %result64 : f32, f64
}		}
}		}

// -----		// -----

gpu.module @test_module {		gpu.module @test_module {
// CHECK: llvm.func @__ocml_log2_f32(f32) -> f32		// CHECK: llvm.func @__ocml_log2_f32(f32) -> f32
// CHECK: llvm.func @__ocml_log2_f64(f64) -> f64		// CHECK: llvm.func @__ocml_log2_f64(f64) -> f64
// CHECK-LABEL: func @gpu_log2		// CHECK-LABEL: func @gpu_log2
func @gpu_log2(%arg_f32 : f32, %arg_f64 : f64) -> (f32, f64) {		builtin.func @gpu_log2(%arg_f32 : f32, %arg_f64 : f64) -> (f32, f64) {
%result32 = math.log2 %arg_f32 : f32		%result32 = math.log2 %arg_f32 : f32
// CHECK: llvm.call @__ocml_log2_f32(%{{.*}}) : (f32) -> f32		// CHECK: llvm.call @__ocml_log2_f32(%{{.*}}) : (f32) -> f32
%result64 = math.log2 %arg_f64 : f64		%result64 = math.log2 %arg_f64 : f64
// CHECK: llvm.call @__ocml_log2_f64(%{{.*}}) : (f64) -> f64		// CHECK: llvm.call @__ocml_log2_f64(%{{.*}}) : (f64) -> f64
std.return %result32, %result64 : f32, f64		std.return %result32, %result64 : f32, f64
}		}
}		}

// -----		// -----

gpu.module @test_module {		gpu.module @test_module {
// CHECK: llvm.func @__ocml_rsqrt_f32(f32) -> f32		// CHECK: llvm.func @__ocml_rsqrt_f32(f32) -> f32
// CHECK: llvm.func @__ocml_rsqrt_f64(f64) -> f64		// CHECK: llvm.func @__ocml_rsqrt_f64(f64) -> f64
// CHECK-LABEL: func @gpu_rsqrt		// CHECK-LABEL: func @gpu_rsqrt
func @gpu_rsqrt(%arg_f16 : f16, %arg_f32 : f32, %arg_f64 : f64)		builtin.func @gpu_rsqrt(%arg_f16 : f16, %arg_f32 : f32, %arg_f64 : f64)
-> (f16, f32, f64) {		-> (f16, f32, f64) {
%result16 = math.rsqrt %arg_f16 : f16		%result16 = math.rsqrt %arg_f16 : f16
// CHECK: llvm.fpext %{{.*}} : f16 to f32		// CHECK: llvm.fpext %{{.*}} : f16 to f32
// CHECK-NEXT: llvm.call @__ocml_rsqrt_f32(%{{.*}}) : (f32) -> f32		// CHECK-NEXT: llvm.call @__ocml_rsqrt_f32(%{{.*}}) : (f32) -> f32
// CHECK-NEXT: llvm.fptrunc %{{.*}} : f32 to f16		// CHECK-NEXT: llvm.fptrunc %{{.*}} : f32 to f16
%result32 = math.rsqrt %arg_f32 : f32		%result32 = math.rsqrt %arg_f32 : f32
// CHECK: llvm.call @__ocml_rsqrt_f32(%{{.*}}) : (f32) -> f32		// CHECK: llvm.call @__ocml_rsqrt_f32(%{{.*}}) : (f32) -> f32
%result64 = math.rsqrt %arg_f64 : f64		%result64 = math.rsqrt %arg_f64 : f64
// CHECK: llvm.call @__ocml_rsqrt_f64(%{{.*}}) : (f64) -> f64		// CHECK: llvm.call @__ocml_rsqrt_f64(%{{.*}}) : (f64) -> f64
std.return %result16, %result32, %result64 : f16, f32, f64		std.return %result16, %result32, %result64 : f16, f32, f64
}		}
}		}

// -----		// -----

gpu.module @test_module {		gpu.module @test_module {
// CHECK: llvm.func @__ocml_sqrt_f32(f32) -> f32		// CHECK: llvm.func @__ocml_sqrt_f32(f32) -> f32
// CHECK: llvm.func @__ocml_sqrt_f64(f64) -> f64		// CHECK: llvm.func @__ocml_sqrt_f64(f64) -> f64
// CHECK-LABEL: func @gpu_sqrt		// CHECK-LABEL: func @gpu_sqrt
func @gpu_sqrt(%arg_f16 : f16, %arg_f32 : f32, %arg_f64 : f64)		builtin.func @gpu_sqrt(%arg_f16 : f16, %arg_f32 : f32, %arg_f64 : f64)
-> (f16, f32, f64) {		-> (f16, f32, f64) {
%result16 = math.sqrt %arg_f16 : f16		%result16 = math.sqrt %arg_f16 : f16
// CHECK: llvm.fpext %{{.*}} : f16 to f32		// CHECK: llvm.fpext %{{.*}} : f16 to f32
// CHECK-NEXT: llvm.call @__ocml_sqrt_f32(%{{.*}}) : (f32) -> f32		// CHECK-NEXT: llvm.call @__ocml_sqrt_f32(%{{.*}}) : (f32) -> f32
// CHECK-NEXT: llvm.fptrunc %{{.*}} : f32 to f16		// CHECK-NEXT: llvm.fptrunc %{{.*}} : f32 to f16
%result32 = math.sqrt %arg_f32 : f32		%result32 = math.sqrt %arg_f32 : f32
// CHECK: llvm.call @__ocml_sqrt_f32(%{{.*}}) : (f32) -> f32		// CHECK: llvm.call @__ocml_sqrt_f32(%{{.*}}) : (f32) -> f32
%result64 = math.sqrt %arg_f64 : f64		%result64 = math.sqrt %arg_f64 : f64
// CHECK: llvm.call @__ocml_sqrt_f64(%{{.*}}) : (f64) -> f64		// CHECK: llvm.call @__ocml_sqrt_f64(%{{.*}}) : (f64) -> f64
std.return %result16, %result32, %result64 : f16, f32, f64		std.return %result16, %result32, %result64 : f16, f32, f64
}		}
}		}

// -----		// -----

gpu.module @test_module {		gpu.module @test_module {
// CHECK: llvm.func @__ocml_tanh_f32(f32) -> f32		// CHECK: llvm.func @__ocml_tanh_f32(f32) -> f32
// CHECK: llvm.func @__ocml_tanh_f64(f64) -> f64		// CHECK: llvm.func @__ocml_tanh_f64(f64) -> f64
// CHECK-LABEL: func @gpu_tanh		// CHECK-LABEL: func @gpu_tanh
func @gpu_tanh(%arg_f32 : f32, %arg_f64 : f64) -> (f32, f64) {		builtin.func @gpu_tanh(%arg_f32 : f32, %arg_f64 : f64) -> (f32, f64) {
%result32 = math.tanh %arg_f32 : f32		%result32 = math.tanh %arg_f32 : f32
// CHECK: llvm.call @__ocml_tanh_f32(%{{.*}}) : (f32) -> f32		// CHECK: llvm.call @__ocml_tanh_f32(%{{.*}}) : (f32) -> f32
%result64 = math.tanh %arg_f64 : f64		%result64 = math.tanh %arg_f64 : f64
// CHECK: llvm.call @__ocml_tanh_f64(%{{.*}}) : (f64) -> f64		// CHECK: llvm.call @__ocml_tanh_f64(%{{.*}}) : (f64) -> f64
std.return %result32, %result64 : f32, f64		std.return %result32, %result64 : f32, f64
}		}
}		}

// -----		// -----

gpu.module @test_module {		gpu.module @test_module {
// CHECK: llvm.func @__ocml_atan_f32(f32) -> f32		// CHECK: llvm.func @__ocml_atan_f32(f32) -> f32
// CHECK: llvm.func @__ocml_atan_f64(f64) -> f64		// CHECK: llvm.func @__ocml_atan_f64(f64) -> f64
// CHECK-LABEL: func @gpu_atan		// CHECK-LABEL: func @gpu_atan
func @gpu_atan(%arg_f32 : f32, %arg_f64 : f64) -> (f32, f64) {		builtin.func @gpu_atan(%arg_f32 : f32, %arg_f64 : f64) -> (f32, f64) {
%result32 = math.atan %arg_f32 : f32		%result32 = math.atan %arg_f32 : f32
// CHECK: llvm.call @__ocml_atan_f32(%{{.*}}) : (f32) -> f32		// CHECK: llvm.call @__ocml_atan_f32(%{{.*}}) : (f32) -> f32
%result64 = math.atan %arg_f64 : f64		%result64 = math.atan %arg_f64 : f64
// CHECK: llvm.call @__ocml_atan_f64(%{{.*}}) : (f64) -> f64		// CHECK: llvm.call @__ocml_atan_f64(%{{.*}}) : (f64) -> f64
std.return %result32, %result64 : f32, f64		std.return %result32, %result64 : f32, f64
}		}
}		}

// -----		// -----

gpu.module @test_module {		gpu.module @test_module {
// CHECK: llvm.func @__ocml_atan2_f32(f32, f32) -> f32		// CHECK: llvm.func @__ocml_atan2_f32(f32, f32) -> f32
// CHECK: llvm.func @__ocml_atan2_f64(f64, f64) -> f64		// CHECK: llvm.func @__ocml_atan2_f64(f64, f64) -> f64
// CHECK-LABEL: func @gpu_atan2		// CHECK-LABEL: func @gpu_atan2
func @gpu_atan2(%arg_f32 : f32, %arg_f64 : f64) -> (f32, f64) {		builtin.func @gpu_atan2(%arg_f32 : f32, %arg_f64 : f64) -> (f32, f64) {
%result32 = math.atan2 %arg_f32, %arg_f32 : f32		%result32 = math.atan2 %arg_f32, %arg_f32 : f32
// CHECK: llvm.call @__ocml_atan2_f32(%{{.*}}) : (f32, f32) -> f32		// CHECK: llvm.call @__ocml_atan2_f32(%{{.*}}) : (f32, f32) -> f32
%result64 = math.atan2 %arg_f64, %arg_f64 : f64		%result64 = math.atan2 %arg_f64, %arg_f64 : f64
// CHECK: llvm.call @__ocml_atan2_f64(%{{.*}}) : (f64, f64) -> f64		// CHECK: llvm.call @__ocml_atan2_f64(%{{.*}}) : (f64, f64) -> f64
std.return %result32, %result64 : f32, f64		std.return %result32, %result64 : f32, f64
}		}
}		}

// -----		// -----

gpu.module @test_module {		gpu.module @test_module {
// CHECK: llvm.func @__ocml_pow_f32(f32, f32) -> f32		// CHECK: llvm.func @__ocml_pow_f32(f32, f32) -> f32
// CHECK: llvm.func @__ocml_pow_f64(f64, f64) -> f64		// CHECK: llvm.func @__ocml_pow_f64(f64, f64) -> f64
// CHECK-LABEL: func @gpu_pow		// CHECK-LABEL: func @gpu_pow
func @gpu_pow(%arg_f32 : f32, %arg_f64 : f64) -> (f32, f64) {		builtin.func @gpu_pow(%arg_f32 : f32, %arg_f64 : f64) -> (f32, f64) {
%result32 = math.powf %arg_f32, %arg_f32 : f32		%result32 = math.powf %arg_f32, %arg_f32 : f32
// CHECK: llvm.call @__ocml_pow_f32(%{{.}}, %{{.}}) : (f32, f32) -> f32		// CHECK: llvm.call @__ocml_pow_f32(%{{.}}, %{{.}}) : (f32, f32) -> f32
%result64 = math.powf %arg_f64, %arg_f64 : f64		%result64 = math.powf %arg_f64, %arg_f64 : f64
// CHECK: llvm.call @__ocml_pow_f64(%{{.}}, %{{.}}) : (f64, f64) -> f64		// CHECK: llvm.call @__ocml_pow_f64(%{{.}}, %{{.}}) : (f64, f64) -> f64
std.return %result32, %result64 : f32, f64		std.return %result32, %result64 : f32, f64
}		}
}		}

Show All 11 Lines

mlir/test/Conversion/VectorToROCDL/vector-to-rocdl.mlir

	// RUN: mlir-opt %s -convert-vector-to-rocdl \| FileCheck %s			// RUN: mlir-opt %s -convert-vector-to-rocdl \| FileCheck %s

	gpu.module @test_read{			gpu.module @test_read{
	func @transfer_readx2(%A : memref<?xf32>, %base: index) -> vector<2xf32> {			builtin.func @transfer_readx2(%A : memref<?xf32>, %base: index) -> vector<2xf32> {
	%f0 = constant 0.0: f32			%f0 = constant 0.0: f32
	%f = vector.transfer_read %A[%base], %f0			%f = vector.transfer_read %A[%base], %f0
	{permutation_map = affine_map<(d0) -> (d0)>} :			{permutation_map = affine_map<(d0) -> (d0)>} :
	memref<?xf32>, vector<2xf32>			memref<?xf32>, vector<2xf32>
	return %f: vector<2xf32>			return %f: vector<2xf32>
	}			}
	// CHECK-LABEL: @transfer_readx2			// CHECK-LABEL: @transfer_readx2
	// CHECK: rocdl.buffer.load {{.*}} vector<2xf32>			// CHECK: rocdl.buffer.load {{.*}} vector<2xf32>

	func @transfer_readx4(%A : memref<?xf32>, %base: index) -> vector<4xf32> {			builtin.func @transfer_readx4(%A : memref<?xf32>, %base: index) -> vector<4xf32> {
	%f0 = constant 0.0: f32			%f0 = constant 0.0: f32
	%f = vector.transfer_read %A[%base], %f0			%f = vector.transfer_read %A[%base], %f0
	{permutation_map = affine_map<(d0) -> (d0)>} :			{permutation_map = affine_map<(d0) -> (d0)>} :
	memref<?xf32>, vector<4xf32>			memref<?xf32>, vector<4xf32>
	return %f: vector<4xf32>			return %f: vector<4xf32>
	}			}
	// CHECK-LABEL: @transfer_readx4			// CHECK-LABEL: @transfer_readx4
	// CHECK: rocdl.buffer.load {{.*}} vector<4xf32>			// CHECK: rocdl.buffer.load {{.*}} vector<4xf32>

	func @transfer_read_dwordConfig(%A : memref<?xf32>, %base: index) -> vector<4xf32> {			builtin.func @transfer_read_dwordConfig(%A : memref<?xf32>, %base: index) -> vector<4xf32> {
	%f0 = constant 0.0: f32			%f0 = constant 0.0: f32
	%f = vector.transfer_read %A[%base], %f0			%f = vector.transfer_read %A[%base], %f0
	{permutation_map = affine_map<(d0) -> (d0)>} :			{permutation_map = affine_map<(d0) -> (d0)>} :
	memref<?xf32>, vector<4xf32>			memref<?xf32>, vector<4xf32>
	return %f: vector<4xf32>			return %f: vector<4xf32>
	}			}
	// CHECK-LABEL: @transfer_read_dwordConfig			// CHECK-LABEL: @transfer_read_dwordConfig
	// CHECK: %[[gep:.]] = llvm.getelementptr {{.}}			// CHECK: %[[gep:.]] = llvm.getelementptr {{.}}
	// CHECK: [0, 0, -1, 159744]			// CHECK: [0, 0, -1, 159744]
	// CHECK: %[[i64:.*]] = llvm.ptrtoint %[[gep]]			// CHECK: %[[i64:.*]] = llvm.ptrtoint %[[gep]]
	// CHECK: llvm.insertelement %[[i64]]			// CHECK: llvm.insertelement %[[i64]]
	}			}

	gpu.module @test_write{			gpu.module @test_write{
	func @transfer_writex2(%A : memref<?xf32>, %B : vector<2xf32>, %base: index) {			builtin.func @transfer_writex2(%A : memref<?xf32>, %B : vector<2xf32>, %base: index) {
	vector.transfer_write %B, %A[%base]			vector.transfer_write %B, %A[%base]
	{permutation_map = affine_map<(d0) -> (d0)>} :			{permutation_map = affine_map<(d0) -> (d0)>} :
	vector<2xf32>, memref<?xf32>			vector<2xf32>, memref<?xf32>
	return			return
	}			}
	// CHECK-LABEL: @transfer_writex2			// CHECK-LABEL: @transfer_writex2
	// CHECK: rocdl.buffer.store {{.*}} vector<2xf32>			// CHECK: rocdl.buffer.store {{.*}} vector<2xf32>

	func @transfer_writex4(%A : memref<?xf32>, %B : vector<4xf32>, %base: index) {			builtin.func @transfer_writex4(%A : memref<?xf32>, %B : vector<4xf32>, %base: index) {
	vector.transfer_write %B, %A[%base]			vector.transfer_write %B, %A[%base]
	{permutation_map = affine_map<(d0) -> (d0)>} :			{permutation_map = affine_map<(d0) -> (d0)>} :
	vector<4xf32>, memref<?xf32>			vector<4xf32>, memref<?xf32>
	return			return
	}			}
	// CHECK-LABEL: @transfer_writex4			// CHECK-LABEL: @transfer_writex4
	// CHECK: rocdl.buffer.store {{.*}} vector<4xf32>			// CHECK: rocdl.buffer.store {{.*}} vector<4xf32>

	func @transfer_write_dwordConfig(%A : memref<?xf32>, %B : vector<2xf32>, %base: index) {			builtin.func @transfer_write_dwordConfig(%A : memref<?xf32>, %B : vector<2xf32>, %base: index) {
	vector.transfer_write %B, %A[%base]			vector.transfer_write %B, %A[%base]
	{permutation_map = affine_map<(d0) -> (d0)>} :			{permutation_map = affine_map<(d0) -> (d0)>} :
	vector<2xf32>, memref<?xf32>			vector<2xf32>, memref<?xf32>
	return			return
	}			}
	// CHECK-LABEL: @transfer_write_dwordConfig			// CHECK-LABEL: @transfer_write_dwordConfig
	// CHECK: %[[gep:.]] = llvm.getelementptr {{.}}			// CHECK: %[[gep:.]] = llvm.getelementptr {{.}}
	// CHECK: [0, 0, -1, 159744]			// CHECK: [0, 0, -1, 159744]
	// CHECK: %[[i64:.*]] = llvm.ptrtoint %[[gep]]			// CHECK: %[[i64:.*]] = llvm.ptrtoint %[[gep]]
	// CHECK: llvm.insertelement %[[i64]]			// CHECK: llvm.insertelement %[[i64]]
	}			}

mlir/test/Dialect/Builtin/canonicalize.mlir

	// RUN: mlir-opt %s -canonicalize \| FileCheck %s			// RUN: mlir-opt %s -canonicalize \| FileCheck %s

	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//
	// UnrealizedConversionCastOp			// UnrealizedConversionCastOp
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	// Test folding conversion casts feeding into other casts.			// Test folding conversion casts feeding into other casts.
	// CHECK-LABEL: func @multiple_conversion_casts			// CHECK-LABEL: func @multiple_conversion_casts
	// CHECK-SAME: %[[ARG0:.]]: i32, %[[ARG1:.]]:			// CHECK-SAME: %[[ARG0:.]]: i32, %[[ARG1:.]]:
	func @multiple_conversion_casts(%arg0: i32, %arg1: i32) -> (i32, i32) {			func @multiple_conversion_casts(%arg0: i32, %arg1: i32) -> (i32, i32) {
	// CHECK-NOT: unrealized_conversion_cast			// CHECK-NOT: unrealized_conversion_cast
	// CHECK: return %[[ARG0]], %[[ARG1]]			// CHECK: return %[[ARG0]], %[[ARG1]]
	%inputs:2 = unrealized_conversion_cast %arg0, %arg1 : i32, i32 to i64, i64			%inputs:2 = builtin.unrealized_conversion_cast %arg0, %arg1 : i32, i32 to i64, i64
	%outputs:2 = unrealized_conversion_cast %inputs#0, %inputs#1 : i64, i64 to i32, i32			%outputs:2 = builtin.unrealized_conversion_cast %inputs#0, %inputs#1 : i64, i64 to i32, i32
	return %outputs#0, %outputs#1 : i32, i32			return %outputs#0, %outputs#1 : i32, i32
	}			}

	// CHECK-LABEL: func @multiple_conversion_casts			// CHECK-LABEL: func @multiple_conversion_casts
	func @multiple_conversion_casts_failure(%arg0: i32, %arg1: i32, %arg2: i64) -> (i32, i32) {			func @multiple_conversion_casts_failure(%arg0: i32, %arg1: i32, %arg2: i64) -> (i32, i32) {
	// CHECK: unrealized_conversion_cast			// CHECK: unrealized_conversion_cast
	// CHECK: unrealized_conversion_cast			// CHECK: unrealized_conversion_cast
	%inputs:2 = unrealized_conversion_cast %arg0, %arg1 : i32, i32 to i64, i64			%inputs:2 = builtin.unrealized_conversion_cast %arg0, %arg1 : i32, i32 to i64, i64
	%outputs:2 = unrealized_conversion_cast %arg2, %inputs#1 : i64, i64 to i32, i32			%outputs:2 = builtin.unrealized_conversion_cast %arg2, %inputs#1 : i64, i64 to i32, i32
	return %outputs#0, %outputs#1 : i32, i32			return %outputs#0, %outputs#1 : i32, i32
	}			}

mlir/test/Dialect/Linalg/drop-unit-extent-dims.mlir

Show First 20 Lines • Show All 776 Lines • ▼ Show 20 Lines	outs(%shape : memref<?x1x?x1x?xf32>) {
}		}
return %shape : memref<?x1x?x1x?xf32>		return %shape : memref<?x1x?x1x?xf32>
}		}

// CHECK: #[[MAP0:.]] = affine_map<(d0, d1, d2)[s0] -> (d0 s0 + d1 + d2)>		// CHECK: #[[MAP0:.]] = affine_map<(d0, d1, d2)[s0] -> (d0 s0 + d1 + d2)>
// CHECK: #[[MAP1:.*]] = affine_map<(d0, d1, d2) -> (d0, 0, d2)>		// CHECK: #[[MAP1:.*]] = affine_map<(d0, d1, d2) -> (d0, 0, d2)>
// CHECK: #[[MAP2:.*]] = affine_map<(d0, d1, d2) -> ()>		// CHECK: #[[MAP2:.*]] = affine_map<(d0, d1, d2) -> ()>
// CHECK: #[[MAP3:.*]] = affine_map<(d0, d1, d2) -> (d0, d1, d2)>		// CHECK: #[[MAP3:.*]] = affine_map<(d0, d1, d2) -> (d0, d1, d2)>
// CHECK: builtin.func @input_stays_same(		// CHECK: func @input_stays_same(
// CHECK-SAME: %[[ARG0:.*]]: memref<?x1x?xf32, #[[MAP0]]>,		// CHECK-SAME: %[[ARG0:.*]]: memref<?x1x?xf32, #[[MAP0]]>,
// CHECK-SAME: %[[ARG1:.]]: f32, %[[ARG2:.]]: memref<?x1x?x1x?xf32>)		// CHECK-SAME: %[[ARG1:.]]: f32, %[[ARG2:.]]: memref<?x1x?x1x?xf32>)
// CHECK-SAME -> memref<?x1x?x1x?xf32> {		// CHECK-SAME -> memref<?x1x?x1x?xf32> {
// CHECK: %[[OUT:.*]] = memref.collapse_shape %[[ARG2]] {{\[}}[0, 1], [2, 3], [4]]		// CHECK: %[[OUT:.*]] = memref.collapse_shape %[[ARG2]] {{\[}}[0, 1], [2, 3], [4]]
// CHECK-SAME: : memref<?x1x?x1x?xf32> into memref<?x?x?xf32>		// CHECK-SAME: : memref<?x1x?x1x?xf32> into memref<?x?x?xf32>
// CHECK: linalg.generic		// CHECK: linalg.generic
// CHECK-SAME: {indexing_maps = [#[[MAP1]], #[[MAP2]], #[[MAP3]]],		// CHECK-SAME: {indexing_maps = [#[[MAP1]], #[[MAP2]], #[[MAP3]]],
// CHECK-SAME: iterator_types = ["parallel", "parallel", "parallel"]}		// CHECK-SAME: iterator_types = ["parallel", "parallel", "parallel"]}
// CHECK-SAME: ins(%[[ARG0]], %[[ARG1]] : memref<?x1x?xf32, #[[MAP0]]>, f32)		// CHECK-SAME: ins(%[[ARG0]], %[[ARG1]] : memref<?x1x?xf32, #[[MAP0]]>, f32)
// CHECK-SAME: outs(%[[OUT]] : memref<?x?x?xf32>) {		// CHECK-SAME: outs(%[[OUT]] : memref<?x?x?xf32>) {
// CHECK: ^bb0(%{{.}}: f32, %[[ARG:.]]: f32, %{{.*}}: f32): // no predecessors		// CHECK: ^bb0(%{{.}}: f32, %[[ARG:.]]: f32, %{{.*}}: f32): // no predecessors
// CHECK: linalg.yield %[[ARG]] : f32		// CHECK: linalg.yield %[[ARG]] : f32
// CHECK: }		// CHECK: }
// CHECK: return %[[ARG2]] : memref<?x1x?x1x?xf32>		// CHECK: return %[[ARG2]] : memref<?x1x?x1x?xf32>

mlir/test/Dialect/Linalg/reshape_control_fusion.mlir

Show All 14 Lines	%1 = linalg.generic {
^bb0(%arg2 : f32, %arg3:f32, %arg4 : f32):		^bb0(%arg2 : f32, %arg3:f32, %arg4 : f32):
%2 = addf %arg2, %arg3 : f32		%2 = addf %arg2, %arg3 : f32
linalg.yield %2 : f32		linalg.yield %2 : f32
} -> tensor<?x?xf32>		} -> tensor<?x?xf32>
return %1 : tensor<?x?xf32>		return %1 : tensor<?x?xf32>
}		}
// CHECK-DAG: #[[MAP0:.+]] = affine_map<(d0, d1) -> (d0, d1)>		// CHECK-DAG: #[[MAP0:.+]] = affine_map<(d0, d1) -> (d0, d1)>
// CHECK-DAG: #[[MAP1:.+]] = affine_map<(d0, d1) -> (d1)>		// CHECK-DAG: #[[MAP1:.+]] = affine_map<(d0, d1) -> (d1)>
// CHECK: builtin.func @control_producer_reshape_fusion		// CHECK: func @control_producer_reshape_fusion
// CHECK-SAME: %[[ARG0:[a-zA-Z0-9_]+]]: tensor<?x?x?xf32>		// CHECK-SAME: %[[ARG0:[a-zA-Z0-9_]+]]: tensor<?x?x?xf32>
// CHECK-SAME: %[[ARG1:[a-zA-Z0-9_]+]]: tensor<?xf32>		// CHECK-SAME: %[[ARG1:[a-zA-Z0-9_]+]]: tensor<?xf32>
// CHECK-DAG: %[[C0:.+]] = constant 0 : index		// CHECK-DAG: %[[C0:.+]] = constant 0 : index
// CHECK-DAG: %[[C1:.+]] = constant 1 : index		// CHECK-DAG: %[[C1:.+]] = constant 1 : index
// CHECK: %[[RESHAPE:.+]] = linalg.tensor_collapse_shape %[[ARG0]]		// CHECK: %[[RESHAPE:.+]] = linalg.tensor_collapse_shape %[[ARG0]]
// CHECK-SAME: {{\[}}[0, 1], [2]{{\]}} : tensor<?x?x?xf32> into tensor<?x?xf32>		// CHECK-SAME: {{\[}}[0, 1], [2]{{\]}} : tensor<?x?x?xf32> into tensor<?x?xf32>
// CHECK: %[[RESULT:.+]] = linalg.generic		// CHECK: %[[RESULT:.+]] = linalg.generic
// CHECK-SAME: indexing_maps = [#[[MAP0]], #[[MAP1]], #[[MAP0]]]		// CHECK-SAME: indexing_maps = [#[[MAP0]], #[[MAP1]], #[[MAP0]]]
Show All 17 Lines	%fill = linalg.generic {
linalg.yield %cst : f32		linalg.yield %cst : f32
} -> tensor<?x?xf32>		} -> tensor<?x?xf32>
%0 = linalg.tensor_expand_shape %fill [[0, 1], [2]] : tensor<?x?xf32> into tensor<1x?x?xf32>		%0 = linalg.tensor_expand_shape %fill [[0, 1], [2]] : tensor<?x?xf32> into tensor<1x?x?xf32>
%1 = linalg.batch_matmul ins(%arg0, %arg1 : tensor<1x?x?xf32>, tensor<1x?x?xf32>)		%1 = linalg.batch_matmul ins(%arg0, %arg1 : tensor<1x?x?xf32>, tensor<1x?x?xf32>)
outs(%0 : tensor<1x?x?xf32>) -> tensor<1x?x?xf32>		outs(%0 : tensor<1x?x?xf32>) -> tensor<1x?x?xf32>
return %1 : tensor<1x?x?xf32>		return %1 : tensor<1x?x?xf32>
}		}
// CHECK-DAG: #[[MAP:.+]] = affine_map<(d0, d1, d2) -> (d0, d1, d2)		// CHECK-DAG: #[[MAP:.+]] = affine_map<(d0, d1, d2) -> (d0, d1, d2)
// CHECK: builtin.func @control_consumer_reshape_fusion		// CHECK: func @control_consumer_reshape_fusion
// CHECK: %[[FILL:.+]] = linalg.generic		// CHECK: %[[FILL:.+]] = linalg.generic
// CHECK-SAME: indexing_maps = [#[[MAP]]]		// CHECK-SAME: indexing_maps = [#[[MAP]]]
// CHECK-SAME: outs(%{{.+}} : tensor<1x?x?xf32>)		// CHECK-SAME: outs(%{{.+}} : tensor<1x?x?xf32>)
// CHECK: linalg.batch_matmul		// CHECK: linalg.batch_matmul
// CHECK-SAME: outs(%[[FILL]] : tensor<1x?x?xf32>)		// CHECK-SAME: outs(%[[FILL]] : tensor<1x?x?xf32>)

mlir/test/Dialect/Shape/invalid.mlir

	Show First 20 Lines • Show All 166 Lines • ▼ Show 20 Lines

	// Test using an unsupported shape.lib attribute type.			// Test using an unsupported shape.lib attribute type.

	// expected-error@+1 {{only SymbolRefAttr allowed in shape.lib attribute array}}			// expected-error@+1 {{only SymbolRefAttr allowed in shape.lib attribute array}}
	module attributes {shape.lib = [@shape_lib, "shape_lib"]} {			module attributes {shape.lib = [@shape_lib, "shape_lib"]} {

	shape.function_library @shape_lib {			shape.function_library @shape_lib {
	// Test shape function that returns the shape of input arg as result shape.			// Test shape function that returns the shape of input arg as result shape.
	func @same_result_shape(%arg: !shape.value_shape) -> !shape.shape {			builtin.func @same_result_shape(%arg: !shape.value_shape) -> !shape.shape {
	%0 = shape.shape_of %arg : !shape.value_shape -> !shape.shape			%0 = shape.shape_of %arg : !shape.value_shape -> !shape.shape
	return %0 : !shape.shape			return %0 : !shape.shape
	}			}
	} mapping {			} mapping {
	test.same_operand_result_type = @same_result_shape			test.same_operand_result_type = @same_result_shape
	}			}

	}			}

	// -----			// -----

	// Test that duplicate op to shape function mappings are flagged, this uses			// Test that duplicate op to shape function mappings are flagged, this uses
	// the same library twice for easy overlap.			// the same library twice for easy overlap.

	// expected-error@+1 {{only one op to shape mapping allowed}}			// expected-error@+1 {{only one op to shape mapping allowed}}
	module attributes {shape.lib = [@shape_lib, @shape_lib]} {			module attributes {shape.lib = [@shape_lib, @shape_lib]} {

	shape.function_library @shape_lib {			shape.function_library @shape_lib {
	// Test shape function that returns the shape of input arg as result shape.			// Test shape function that returns the shape of input arg as result shape.
	func @same_result_shape(%arg: !shape.value_shape) -> !shape.shape {			builtin.func @same_result_shape(%arg: !shape.value_shape) -> !shape.shape {
	%0 = shape.shape_of %arg : !shape.value_shape -> !shape.shape			%0 = shape.shape_of %arg : !shape.value_shape -> !shape.shape
	return %0 : !shape.shape			return %0 : !shape.shape
	}			}
	} mapping {			} mapping {
	test.same_operand_result_type = @same_result_shape			test.same_operand_result_type = @same_result_shape
	}			}

	}			}

	// -----			// -----

	// Test that duplicate op to shape function mappings are flagged (this is			// Test that duplicate op to shape function mappings are flagged (this is
	// more an invariant of using the dictionary attribute here than anything			// more an invariant of using the dictionary attribute here than anything
	// specific to function library op).			// specific to function library op).

	module attributes {shape.lib = [@shape_lib]} {			module attributes {shape.lib = [@shape_lib]} {

	shape.function_library @shape_lib {			shape.function_library @shape_lib {
	// Test shape function that returns the shape of input arg as result shape.			// Test shape function that returns the shape of input arg as result shape.
	func @same_result_shape(%arg: !shape.value_shape) -> !shape.shape {			builtin.func @same_result_shape(%arg: !shape.value_shape) -> !shape.shape {
	%0 = shape.shape_of %arg : !shape.value_shape -> !shape.shape			%0 = shape.shape_of %arg : !shape.value_shape -> !shape.shape
	return %0 : !shape.shape			return %0 : !shape.shape
	}			}
	} mapping {			} mapping {
	// expected-error @+2 {{duplicate key}}			// expected-error @+2 {{duplicate key}}
	test.same_operand_result_type = @same_result_shape,			test.same_operand_result_type = @same_result_shape,
	test.same_operand_result_type = @same_result_shape			test.same_operand_result_type = @same_result_shape
	}			}
	▲ Show 20 Lines • Show All 51 Lines • Show Last 20 Lines

mlir/test/Dialect/SparseTensor/sparse_perm.mlir

// NOTE: Assertions have been autogenerated by utils/generate-test-checks.py		// NOTE: Assertions have been autogenerated by utils/generate-test-checks.py
// RUN: mlir-opt %s -sparsification \| FileCheck %s		// RUN: mlir-opt %s -sparsification \| FileCheck %s

#X = #sparse_tensor.encoding<{		#X = #sparse_tensor.encoding<{
dimLevelType = [ "dense", "dense", "dense" ],		dimLevelType = [ "dense", "dense", "dense" ],
dimOrdering = affine_map<(i,j,k) -> (k,i,j)>		dimOrdering = affine_map<(i,j,k) -> (k,i,j)>
}>		}>

#trait = {		#trait = {
indexing_maps = [		indexing_maps = [
affine_map<(i,j,k) -> (k,i,j)>, // A (in)		affine_map<(i,j,k) -> (k,i,j)>, // A (in)
affine_map<(i,j,k) -> (i,j,k)> // X (out)		affine_map<(i,j,k) -> (i,j,k)> // X (out)
],		],
iterator_types = ["parallel", "parallel", "parallel"]		iterator_types = ["parallel", "parallel", "parallel"]
}		}

// CHECK-LABEL: builtin.func @sparse_static_dims(		// CHECK-LABEL: func @sparse_static_dims(
// CHECK-SAME: %[[VAL_0:.]]: tensor<10x20x30xf32, #sparse_tensor.encoding<{{{.}}}>>,		// CHECK-SAME: %[[VAL_0:.]]: tensor<10x20x30xf32, #sparse_tensor.encoding<{{{.}}}>>,
// CHECK-SAME: %[[VAL_1:.*]]: tensor<20x30x10xf32>) -> tensor<20x30x10xf32> {		// CHECK-SAME: %[[VAL_1:.*]]: tensor<20x30x10xf32>) -> tensor<20x30x10xf32> {
// CHECK: %[[VAL_2:.*]] = constant 20 : index		// CHECK: %[[VAL_2:.*]] = constant 20 : index
// CHECK: %[[VAL_3:.*]] = constant 30 : index		// CHECK: %[[VAL_3:.*]] = constant 30 : index
// CHECK: %[[VAL_4:.*]] = constant 10 : index		// CHECK: %[[VAL_4:.*]] = constant 10 : index
// CHECK: %[[VAL_5:.*]] = constant 0 : index		// CHECK: %[[VAL_5:.*]] = constant 0 : index
// CHECK: %[[VAL_6:.*]] = constant 1 : index		// CHECK: %[[VAL_6:.*]] = constant 1 : index
// CHECK: %[[VAL_7:.]] = sparse_tensor.values %[[VAL_0]] : tensor<10x20x30xf32, #sparse_tensor.encoding<{{{.}}}>>		// CHECK: %[[VAL_7:.]] = sparse_tensor.values %[[VAL_0]] : tensor<10x20x30xf32, #sparse_tensor.encoding<{{{.}}}>>
Show All 21 Lines	%0 = linalg.generic #trait
ins(%arga: tensor<10x20x30xf32, #X>)		ins(%arga: tensor<10x20x30xf32, #X>)
outs(%argx: tensor<20x30x10xf32>) {		outs(%argx: tensor<20x30x10xf32>) {
^bb(%a : f32, %x: f32):		^bb(%a : f32, %x: f32):
linalg.yield %a : f32		linalg.yield %a : f32
} -> tensor<20x30x10xf32>		} -> tensor<20x30x10xf32>
return %0 : tensor<20x30x10xf32>		return %0 : tensor<20x30x10xf32>
}		}

// CHECK-LABEL: builtin.func @sparse_dynamic_dims(		// CHECK-LABEL: func @sparse_dynamic_dims(
// CHECK-SAME: %[[VAL_0:.]]: tensor<?x?x?xf32, #sparse_tensor.encoding<{{{.}}}>>,		// CHECK-SAME: %[[VAL_0:.]]: tensor<?x?x?xf32, #sparse_tensor.encoding<{{{.}}}>>,
// CHECK-SAME: %[[VAL_1:.*]]: tensor<?x?x?xf32>) -> tensor<?x?x?xf32> {		// CHECK-SAME: %[[VAL_1:.*]]: tensor<?x?x?xf32>) -> tensor<?x?x?xf32> {
// CHECK: %[[VAL_2:.*]] = constant 2 : index		// CHECK: %[[VAL_2:.*]] = constant 2 : index
// CHECK: %[[VAL_3:.*]] = constant 0 : index		// CHECK: %[[VAL_3:.*]] = constant 0 : index
// CHECK: %[[VAL_4:.*]] = constant 1 : index		// CHECK: %[[VAL_4:.*]] = constant 1 : index
// CHECK: %[[VAL_5:.]] = sparse_tensor.values %[[VAL_0]] : tensor<?x?x?xf32, #sparse_tensor.encoding<{{{.}}}>>		// CHECK: %[[VAL_5:.]] = sparse_tensor.values %[[VAL_0]] : tensor<?x?x?xf32, #sparse_tensor.encoding<{{{.}}}>>
// CHECK: %[[VAL_6:.*]] = tensor.dim %[[VAL_1]], %[[VAL_3]] : tensor<?x?x?xf32>		// CHECK: %[[VAL_6:.*]] = tensor.dim %[[VAL_1]], %[[VAL_3]] : tensor<?x?x?xf32>
// CHECK: %[[VAL_7:.*]] = tensor.dim %[[VAL_1]], %[[VAL_4]] : tensor<?x?x?xf32>		// CHECK: %[[VAL_7:.*]] = tensor.dim %[[VAL_1]], %[[VAL_4]] : tensor<?x?x?xf32>
Show All 29 Lines

mlir/test/Dialect/SparseTensor/sparse_perm_lower.mlir

	Show All 10 Lines
	#trait = {			#trait = {
	indexing_maps = [			indexing_maps = [
	affine_map<(i,j,k) -> (k,i,j)>, // A (in)			affine_map<(i,j,k) -> (k,i,j)>, // A (in)
	affine_map<(i,j,k) -> ()> // X (out)			affine_map<(i,j,k) -> ()> // X (out)
	],			],
	iterator_types = ["reduction", "reduction", "reduction"]			iterator_types = ["reduction", "reduction", "reduction"]
	}			}

	// CHECK-HIR-LABEL: builtin.func @sparse_dynamic_dims(			// CHECK-HIR-LABEL: func @sparse_dynamic_dims(
	// CHECK-HIR-SAME: %[[VAL_0:.]]: tensor<?x?x?xf32, #sparse_tensor.encoding<{{{.}}}>>,			// CHECK-HIR-SAME: %[[VAL_0:.]]: tensor<?x?x?xf32, #sparse_tensor.encoding<{{{.}}}>>,
	// CHECK-HIR-SAME: %[[VAL_1:.*]]: tensor<f32>) -> tensor<f32> {			// CHECK-HIR-SAME: %[[VAL_1:.*]]: tensor<f32>) -> tensor<f32> {
	// CHECK-HIR-DAG: %[[C0:.*]] = constant 0 : index			// CHECK-HIR-DAG: %[[C0:.*]] = constant 0 : index
	// CHECK-HIR-DAG: %[[C1:.*]] = constant 1 : index			// CHECK-HIR-DAG: %[[C1:.*]] = constant 1 : index
	// CHECK-HIR-DAG: %[[C2:.*]] = constant 2 : index			// CHECK-HIR-DAG: %[[C2:.*]] = constant 2 : index
	// CHECK-HIR: %[[VAL_5:.]] = tensor.dim %[[VAL_0]], %[[C2]] : tensor<?x?x?xf32, #sparse_tensor.encoding<{{{.}}}>>			// CHECK-HIR: %[[VAL_5:.]] = tensor.dim %[[VAL_0]], %[[C2]] : tensor<?x?x?xf32, #sparse_tensor.encoding<{{{.}}}>>
	// CHECK-HIR: %[[VAL_6:.]] = tensor.dim %[[VAL_0]], %[[C0]] : tensor<?x?x?xf32, #sparse_tensor.encoding<{{{.}}}>>			// CHECK-HIR: %[[VAL_6:.]] = tensor.dim %[[VAL_0]], %[[C0]] : tensor<?x?x?xf32, #sparse_tensor.encoding<{{{.}}}>>
	// CHECK-HIR: %[[VAL_7:.]] = tensor.dim %[[VAL_0]], %[[C1]] : tensor<?x?x?xf32, #sparse_tensor.encoding<{{{.}}}>>			// CHECK-HIR: %[[VAL_7:.]] = tensor.dim %[[VAL_0]], %[[C1]] : tensor<?x?x?xf32, #sparse_tensor.encoding<{{{.}}}>>
	Show All 15 Lines
	// CHECK-HIR: }			// CHECK-HIR: }
	// CHECK-HIR: memref.store %[[VAL_23:.*]], %[[VAL_10]][] : memref<f32>			// CHECK-HIR: memref.store %[[VAL_23:.*]], %[[VAL_10]][] : memref<f32>
	// CHECK-HIR: }			// CHECK-HIR: }
	// CHECK-HIR: }			// CHECK-HIR: }
	// CHECK-HIR: %[[VAL_24:.*]] = memref.tensor_load %[[VAL_10]] : memref<f32>			// CHECK-HIR: %[[VAL_24:.*]] = memref.tensor_load %[[VAL_10]] : memref<f32>
	// CHECK-HIR: return %[[VAL_24]] : tensor<f32>			// CHECK-HIR: return %[[VAL_24]] : tensor<f32>
	// CHECK-HIR: }			// CHECK-HIR: }
	//			//
	// CHECK-MIR-LABEL: builtin.func @sparse_dynamic_dims(			// CHECK-MIR-LABEL: func @sparse_dynamic_dims(
	// CHECK-MIR-SAME: %[[VAL_0:.*]]: !llvm.ptr<i8>,			// CHECK-MIR-SAME: %[[VAL_0:.*]]: !llvm.ptr<i8>,
	// CHECK-MIR-SAME: %[[VAL_1:.*]]: tensor<f32>) -> tensor<f32> {			// CHECK-MIR-SAME: %[[VAL_1:.*]]: tensor<f32>) -> tensor<f32> {
	// CHECK-MIR-DAG: %[[C0:.*]] = constant 0 : index			// CHECK-MIR-DAG: %[[C0:.*]] = constant 0 : index
	// CHECK-MIR-DAG: %[[C1:.*]] = constant 1 : index			// CHECK-MIR-DAG: %[[C1:.*]] = constant 1 : index
	// CHECK-MIR-DAG: %[[C2:.*]] = constant 2 : index			// CHECK-MIR-DAG: %[[C2:.*]] = constant 2 : index
	// CHECK-MIR: %[[VAL_5:.*]] = call @sparseDimSize(%[[VAL_0]], %[[C0]]) : (!llvm.ptr<i8>, index) -> index			// CHECK-MIR: %[[VAL_5:.*]] = call @sparseDimSize(%[[VAL_0]], %[[C0]]) : (!llvm.ptr<i8>, index) -> index
	// CHECK-MIR: %[[VAL_6:.*]] = call @sparseDimSize(%[[VAL_0]], %[[C1]]) : (!llvm.ptr<i8>, index) -> index			// CHECK-MIR: %[[VAL_6:.*]] = call @sparseDimSize(%[[VAL_0]], %[[C1]]) : (!llvm.ptr<i8>, index) -> index
	// CHECK-MIR: %[[VAL_7:.*]] = call @sparseDimSize(%[[VAL_0]], %[[C2]]) : (!llvm.ptr<i8>, index) -> index			// CHECK-MIR: %[[VAL_7:.*]] = call @sparseDimSize(%[[VAL_0]], %[[C2]]) : (!llvm.ptr<i8>, index) -> index
	Show All 33 Lines

mlir/test/IR/invalid-func-op.mlir

	// RUN: mlir-opt %s -split-input-file -verify-diagnostics			// RUN: mlir-opt %s -split-input-file -verify-diagnostics

	// -----			// -----

	func @func_op() {			func @func_op() {
	// expected-error@+1 {{expected valid '@'-identifier for symbol name}}			// expected-error@+1 {{expected valid '@'-identifier for symbol name}}
	func missingsigil() -> (i1, index, f32)			builtin.func missingsigil() -> (i1, index, f32)
	return			return
	}			}

	// -----			// -----

	func @func_op() {			func @func_op() {
	// expected-error@+1 {{expected type instead of SSA identifier}}			// expected-error@+1 {{expected type instead of SSA identifier}}
	func @mixed_named_arguments(f32, %a : i32) {			builtin.func @mixed_named_arguments(f32, %a : i32) {
	return			return
	}			}
	return			return
	}			}

	// -----			// -----

	func @func_op() {			func @func_op() {
	// expected-error@+1 {{expected SSA identifier}}			// expected-error@+1 {{expected SSA identifier}}
	func @mixed_named_arguments(%a : i32, f32) -> () {			builtin.func @mixed_named_arguments(%a : i32, f32) -> () {
	return			return
	}			}
	return			return
	}			}

	// -----			// -----

	func @func_op() {			func @func_op() {
	// expected-error@+1 {{entry block must have 1 arguments to match function signature}}			// expected-error@+1 {{entry block must have 1 arguments to match function signature}}
	func @mixed_named_arguments(f32) {			builtin.func @mixed_named_arguments(f32) {
	^entry:			^entry:
	return			return
	}			}
	return			return
	}			}

	// -----			// -----

	func @func_op() {			func @func_op() {
	// expected-error@+1 {{type of entry block argument #0('i32') must match the type of the corresponding argument in function signature('f32')}}			// expected-error@+1 {{type of entry block argument #0('i32') must match the type of the corresponding argument in function signature('f32')}}
	func @mixed_named_arguments(f32) {			builtin.func @mixed_named_arguments(f32) {
	^entry(%arg : i32):			^entry(%arg : i32):
	return			return
	}			}
	return			return
	}			}

	// -----			// -----

	▲ Show 20 Lines • Show All 61 Lines • Show Last 20 Lines

mlir/test/IR/invalid-module-op.mlir

	// RUN: mlir-opt %s -split-input-file -verify-diagnostics			// RUN: mlir-opt %s -split-input-file -verify-diagnostics

	// -----			// -----

	func @module_op() {			func @module_op() {
	// expected-error@+1 {{Operations with a 'SymbolTable' must have exactly one block}}			// expected-error@+1 {{Operations with a 'SymbolTable' must have exactly one block}}
	module {			builtin.module {
	^bb1:			^bb1:
	"test.dummy"() : () -> ()			"test.dummy"() : () -> ()
	^bb2:			^bb2:
	"test.dummy"() : () -> ()			"test.dummy"() : () -> ()
	}			}
	return			return
	}			}

	// -----			// -----

	func @module_op() {			func @module_op() {
	// expected-error@+1 {{region should have no arguments}}			// expected-error@+1 {{region should have no arguments}}
	module {			builtin.module {
	^bb1(%arg: i32):			^bb1(%arg: i32):
	}			}
	return			return
	}			}

	// -----			// -----

	// expected-error@+1 {{can only contain attributes with dialect-prefixed names}}			// expected-error@+1 {{can only contain attributes with dialect-prefixed names}}
	module attributes {attr} {			module attributes {attr} {
	}			}

mlir/test/IR/invalid-ops.mlir

Show First 20 Lines • Show All 64 Lines • ▼ Show 20 Lines	^bb0:
%x = "affine.apply" (%i, %j) {map = affine_map<(d0, d1) -> ((d0 + 1), (d1 + 2))>} : (index,index) -> (index) // expected-error {{'affine.apply' op mapping must produce one value}}		%x = "affine.apply" (%i, %j) {map = affine_map<(d0, d1) -> ((d0 + 1), (d1 + 2))>} : (index,index) -> (index) // expected-error {{'affine.apply' op mapping must produce one value}}
return		return
}		}

// -----		// -----

func @unknown_custom_op() {		func @unknown_custom_op() {
^bb0:		^bb0:
%i = crazyThing() {value = 0} : () -> index // expected-error {{custom op 'crazyThing' is unknown}}		%i = test.crazyThing() {value = 0} : () -> index // expected-error {{custom op 'test.crazyThing' is unknown}}
return		return
}		}

// -----		// -----

func @unknown_std_op() {		func @unknown_std_op() {
// expected-error@+1 {{unregistered operation 'std.foo_bar_op' found in dialect ('std') that does not allow unknown operations}}		// expected-error@+1 {{unregistered operation 'std.foo_bar_op' found in dialect ('std') that does not allow unknown operations}}
%0 = "std.foo_bar_op"() : () -> index		%0 = "std.foo_bar_op"() : () -> index
▲ Show 20 Lines • Show All 1,160 Lines • Show Last 20 Lines

mlir/test/IR/invalid.mlir

	Show First 20 Lines • Show All 257 Lines • ▼ Show 20 Lines

	func @for_negative_stride() {			func @for_negative_stride() {
	affine.for %i = 1 to 10 step -1			affine.for %i = 1 to 10 step -1
	} // expected-error@-1 {{expected step to be representable as a positive signed integer}}			} // expected-error@-1 {{expected step to be representable as a positive signed integer}}

	// -----			// -----

	func @non_operation() {			func @non_operation() {
	asd // expected-error {{custom op 'asd' is unknown}}			test.asd // expected-error {{custom op 'test.asd' is unknown}}
	}			}

	// -----			// -----

	func @invalid_if_conditional2() {			func @invalid_if_conditional2() {
	affine.for %i = 1 to 10 {			affine.for %i = 1 to 10 {
	affine.if affine_set<(i)[N] : (i >= )> // expected-error {{expected '== 0' or '>= 0' at end of affine constraint}}			affine.if affine_set<(i)[N] : (i >= )> // expected-error {{expected '== 0' or '>= 0' at end of affine constraint}}
	}			}
	▲ Show 20 Lines • Show All 1,356 Lines • Show Last 20 Lines

mlir/test/IR/parser.mlir

Show First 20 Lines • Show All 1,301 Lines • ▼ Show 20 Lines	func @pretty_names() {

%q:3, %r = test.string_attr_pretty_name		%q:3, %r = test.string_attr_pretty_name
// CHECK: %q, %q_1, %q_2, %r = test.string_attr_pretty_name attributes {names = ["q", "q", "q", "r"]}		// CHECK: %q, %q_1, %q_2, %r = test.string_attr_pretty_name attributes {names = ["q", "q", "q", "r"]}

// CHECK: return		// CHECK: return
return		return
}		}


		// This tests the behavior of "default dialect":
		// operations like `test.default_dialect` can define a default dialect
		// used in nested region.
		// CHECK-LABEL: func @default_dialect
		func @default_dialect() {
		test.default_dialect {
		// The test dialect is the default in this region, the following two
		// operations are parsed identically.
		// CHECK-NOT: test.parse_integer_literal
		parse_integer_literal : 5
		// CHECK: parse_integer_literal : 6
		test.parse_integer_literal : 6
		// Verify that only an op prefix is stripped, not an attribute value for
		// example.
		// CHECK: "test.op_with_attr"() {test.attr = "test.value"} : () -> ()
		"test.op_with_attr"() {test.attr = "test.value"} : () -> ()
		"test.terminator"() : ()->()
		}
		return
		}

// CHECK-LABEL: func @unreachable_dominance_violation_ok		// CHECK-LABEL: func @unreachable_dominance_violation_ok
func @unreachable_dominance_violation_ok() -> i1 {		func @unreachable_dominance_violation_ok() -> i1 {
// CHECK: [[VAL:%.*]] = constant false		// CHECK: [[VAL:%.*]] = constant false
// CHECK: return [[VAL]] : i1		// CHECK: return [[VAL]] : i1
// CHECK: ^bb1: // no predecessors		// CHECK: ^bb1: // no predecessors
// CHECK: [[VAL2:%.]]:3 = "bar"([[VAL3:%.]]) : (i64) -> (i1, i1, i1)		// CHECK: [[VAL2:%.]]:3 = "bar"([[VAL3:%.]]) : (i64) -> (i1, i1, i1)
// CHECK: br ^bb3		// CHECK: br ^bb3
// CHECK: ^bb2: // pred: ^bb2		// CHECK: ^bb2: // pred: ^bb2
▲ Show 20 Lines • Show All 124 Lines • Show Last 20 Lines

mlir/test/IR/traits.mlir

	Show First 20 Lines • Show All 500 Lines • ▼ Show 20 Lines
	}			}

	// -----			// -----

	// Ensure that SSACFG regions of operations in GRAPH regions are			// Ensure that SSACFG regions of operations in GRAPH regions are
	// checked for dominance			// checked for dominance
	func @illegalInsideDominanceFreeScope() -> () {			func @illegalInsideDominanceFreeScope() -> () {
	test.graph_region {			test.graph_region {
	func @test() -> i1 {			builtin.func @test() -> i1 {
	^bb1:			^bb1:
	// expected-error @+1 {{operand #0 does not dominate this use}}			// expected-error @+1 {{operand #0 does not dominate this use}}
	%2:3 = "bar"(%1) : (i64) -> (i1,i1,i1)			%2:3 = "bar"(%1) : (i64) -> (i1,i1,i1)
	// expected-note @+1 {{operand defined here}}			// expected-note @+1 {{operand defined here}}
	%1 = "baz"(%2#0) : (i1) -> (i64)			%1 = "baz"(%2#0) : (i1) -> (i64)
	return %2#1 : i1			return %2#1 : i1
	}			}
	"terminator"() : () -> ()			"terminator"() : () -> ()
	}			}
	return			return
	}			}

	// -----			// -----

	// Ensure that SSACFG regions of operations in GRAPH regions are			// Ensure that SSACFG regions of operations in GRAPH regions are
	// checked for dominance			// checked for dominance
	func @illegalCDFGInsideDominanceFreeScope() -> () {			func @illegalCDFGInsideDominanceFreeScope() -> () {
	test.graph_region {			test.graph_region {
	func @test() -> i1 {			builtin.func @test() -> i1 {
	^bb1:			^bb1:
	// expected-error @+1 {{operand #0 does not dominate this use}}			// expected-error @+1 {{operand #0 does not dominate this use}}
	%2:3 = "bar"(%1) : (i64) -> (i1,i1,i1)			%2:3 = "bar"(%1) : (i64) -> (i1,i1,i1)
	br ^bb4			br ^bb4
	^bb2:			^bb2:
	br ^bb2			br ^bb2
	^bb4:			^bb4:
	%1 = "foo"() : ()->i64 // expected-note {{operand defined here}}			%1 = "foo"() : ()->i64 // expected-note {{operand defined here}}
	Show All 30 Lines

mlir/test/Transforms/canonicalize-dce.mlir

	Show First 20 Lines • Show All 76 Lines • ▼ Show 20 Lines

	// Test case: Recursively DCE into enclosed regions.			// Test case: Recursively DCE into enclosed regions.

	// CHECK: func @f(%arg0: f32)			// CHECK: func @f(%arg0: f32)
	// CHECK-NEXT: func @g(%arg1: f32)			// CHECK-NEXT: func @g(%arg1: f32)
	// CHECK-NEXT: return			// CHECK-NEXT: return

	func @f(%arg0: f32) {			func @f(%arg0: f32) {
	func @g(%arg1: f32) {			builtin.func @g(%arg1: f32) {
	%0 = "std.addf"(%arg1, %arg1) : (f32, f32) -> f32			%0 = "std.addf"(%arg1, %arg1) : (f32, f32) -> f32
	return			return
	}			}
	return			return
	}			}

	// -----			// -----

	▲ Show 20 Lines • Show All 82 Lines • Show Last 20 Lines

mlir/test/Transforms/canonicalize.mlir

Show First 20 Lines • Show All 406 Lines • ▼ Show 20 Lines	func @write_only_alloca_fold(%v: f32) {
%a = memref.alloca(%c4) : memref<?xf32>		%a = memref.alloca(%c4) : memref<?xf32>
memref.store %v, %a[%c0] : memref<?xf32>		memref.store %v, %a[%c0] : memref<?xf32>
return		return
}		}

// CHECK-LABEL: func @dead_block_elim		// CHECK-LABEL: func @dead_block_elim
func @dead_block_elim() {		func @dead_block_elim() {
// CHECK-NOT: ^bb		// CHECK-NOT: ^bb
func @nested() {		builtin.func @nested() {
return		return

^bb1:		^bb1:
return		return
}		}
return		return

^bb1:		^bb1:
▲ Show 20 Lines • Show All 796 Lines • Show Last 20 Lines

mlir/test/Transforms/constant-fold.mlir

	Show First 20 Lines • Show All 740 Lines • ▼ Show 20 Lines
	}			}

	// -----			// -----

	// CHECK-LABEL: func @nested_isolated_region			// CHECK-LABEL: func @nested_isolated_region
	func @nested_isolated_region() {			func @nested_isolated_region() {
	// CHECK-NEXT: func @isolated_op			// CHECK-NEXT: func @isolated_op
	// CHECK-NEXT: constant 2			// CHECK-NEXT: constant 2
	func @isolated_op() {			builtin.func @isolated_op() {
	%0 = constant 1 : i32			%0 = constant 1 : i32
	%2 = addi %0, %0 : i32			%2 = addi %0, %0 : i32
	"foo.yield"(%2) : (i32) -> ()			"foo.yield"(%2) : (i32) -> ()
	}			}

	// CHECK: "foo.unknown_region"			// CHECK: "foo.unknown_region"
	// CHECK-NEXT: constant 2			// CHECK-NEXT: constant 2
	"foo.unknown_region"() ({			"foo.unknown_region"() ({
	▲ Show 20 Lines • Show All 42 Lines • Show Last 20 Lines

mlir/test/Transforms/cse.mlir

	Show First 20 Lines • Show All 223 Lines • ▼ Show 20 Lines
	/// This test checks that nested regions that are isolated from above are			/// This test checks that nested regions that are isolated from above are
	/// properly handled.			/// properly handled.
	// CHECK-LABEL: @nested_isolated			// CHECK-LABEL: @nested_isolated
	func @nested_isolated() -> i32 {			func @nested_isolated() -> i32 {
	// CHECK-NEXT: constant 1			// CHECK-NEXT: constant 1
	%0 = constant 1 : i32			%0 = constant 1 : i32

	// CHECK-NEXT: @nested_func			// CHECK-NEXT: @nested_func
	func @nested_func() {			builtin.func @nested_func() {
	// CHECK-NEXT: constant 1			// CHECK-NEXT: constant 1
	%foo = constant 1 : i32			%foo = constant 1 : i32
	"foo.yield"(%foo) : (i32) -> ()			"foo.yield"(%foo) : (i32) -> ()
	}			}

	// CHECK: "foo.region"			// CHECK: "foo.region"
	"foo.region"() ({			"foo.region"() ({
	// CHECK-NEXT: constant 1			// CHECK-NEXT: constant 1
	Show All 27 Lines

mlir/test/Transforms/test-legalizer-full.mlir

Show All 26 Lines	func @replace_non_root_illegal_op() {
"test.return"() : () -> ()		"test.return"() : () -> ()
}		}

// -----		// -----

// Test that children of recursively legal operations are ignored.		// Test that children of recursively legal operations are ignored.
func @recursively_legal_invalid_op() {		func @recursively_legal_invalid_op() {
/// Operation that is statically legal.		/// Operation that is statically legal.
module attributes {test.recursively_legal} {		builtin.module attributes {test.recursively_legal} {
%ignored = "test.illegal_op_f"() : () -> (i32)		%ignored = "test.illegal_op_f"() : () -> (i32)
}		}
/// Operation that is dynamically legal, i.e. the function has a pattern		/// Operation that is dynamically legal, i.e. the function has a pattern
/// applied to legalize the argument type before it becomes recursively legal.		/// applied to legalize the argument type before it becomes recursively legal.
func @dynamic_func(%arg: i64) attributes {test.recursively_legal} {		builtin.func @dynamic_func(%arg: i64) attributes {test.recursively_legal} {
%ignored = "test.illegal_op_f"() : () -> (i32)		%ignored = "test.illegal_op_f"() : () -> (i32)
"test.return"() : () -> ()		"test.return"() : () -> ()
}		}

"test.return"() : () -> ()		"test.return"() : () -> ()
}		}

// -----		// -----
▲ Show 20 Lines • Show All 53 Lines • Show Last 20 Lines

mlir/test/lib/Dialect/Test/TestOps.td

Show First 20 Lines • Show All 620 Lines • ▼ Show 20 Lines

let results = (outs

Variadic<I32>:$d

);

}

// This is used to test encoding of a string attribute into an SSA name of a

// pretty printed value name.

def StringAttrPrettyNameOp

: TEST_Op<"string_attr_pretty_name",

[DeclareOpInterfaceMethods<OpAsmOpInterface>]> {

[DeclareOpInterfaceMethods<OpAsmOpInterface, ["getAsmResultNames"]>]> {

let arguments = (ins StrArrayAttr:$names);

let results = (outs Variadic<I32>:$r);

let printer = [{ return ::print(p, *this); }];

let parser = [{ return ::parse$cppClass(parser, result); }];

}

// This is used to test the OpAsmOpInterface::getDefaultDialect() feature:

rriddleUnsubmitted

Done

let parser = [{ return ::parse$cppClass(parser, result); }];

}

- // This is used to test the OpAsmOpInterface::getDefaultdialect() feature:

+ // This is used to test the OpAsmOpInterface::getDefaultDialect() feature:

// operations nested in a region under this op will drop the "test." dialect

rriddle:

// operations nested in a region under this op will drop the "test." dialect

GMNGeoffreyUnsubmitted

Done

Comment is copy-pasta :-)

GMNGeoffrey: Comment is copy-pasta :-)

// prefix.

def DefaultDialectOp : TEST_Op<"default_dialect", [OpAsmOpInterface]> {

let regions = (region AnyRegion:$body);

let extraClassDeclaration = [{

static ::llvm::StringRef getDefaultDialect() {

return "test";

}

void getAsmResultNames(::llvm::function_ref<void(::mlir::Value, ::llvm::StringRef)> setNameFn) {}

}];

let assemblyFormat = "regions attr-dict-with-keyword";

}

//===----------------------------------------------------------------------===//

// Test Locations

//===----------------------------------------------------------------------===//

def TestLocationSrcOp : TEST_Op<"loc_src"> {

let arguments = (ins I32:$input);

let results = (outs I32:$output);

}

▲ Show 20 Lines • Show All 1,532 Lines • Show Last 20 Lines

This is an archive of the discontinued LLVM Phabricator instance.

Add a new interface allowing to set a default dialect to be used for printing/parsing regionsClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 369730

mlir/include/mlir/Dialect/SPIRV/IR/SPIRVStructureOps.td

mlir/include/mlir/Dialect/Shape/IR/ShapeOps.td

mlir/include/mlir/Dialect/StandardOps/IR/Ops.td

mlir/include/mlir/IR/BuiltinOps.h

mlir/include/mlir/IR/BuiltinOps.td

mlir/include/mlir/IR/OpAsmInterface.td

mlir/include/mlir/IR/OpDefinition.h

mlir/include/mlir/IR/OperationSupport.h

mlir/lib/IR/AsmPrinter.cpp

mlir/lib/IR/Operation.cpp

mlir/lib/Parser/Parser.cpp

mlir/lib/Parser/ParserState.h

mlir/test/Analysis/test-shape-fn-report.mlir

mlir/test/Conversion/GPUToNVVM/gpu-to-nvvm.mlir

mlir/test/Conversion/GPUToNVVM/wmma-ops-to-nvvm.mlir

mlir/test/Conversion/GPUToROCDL/gpu-to-rocdl.mlir

mlir/test/Conversion/VectorToROCDL/vector-to-rocdl.mlir

mlir/test/Dialect/Builtin/canonicalize.mlir

mlir/test/Dialect/Linalg/drop-unit-extent-dims.mlir

mlir/test/Dialect/Linalg/reshape_control_fusion.mlir

mlir/test/Dialect/Shape/invalid.mlir

mlir/test/Dialect/SparseTensor/sparse_perm.mlir

mlir/test/Dialect/SparseTensor/sparse_perm_lower.mlir

mlir/test/IR/invalid-func-op.mlir

mlir/test/IR/invalid-module-op.mlir

mlir/test/IR/invalid-ops.mlir

mlir/test/IR/invalid.mlir

mlir/test/IR/parser.mlir

mlir/test/IR/traits.mlir

mlir/test/Transforms/canonicalize-dce.mlir

mlir/test/Transforms/canonicalize.mlir

mlir/test/Transforms/constant-fold.mlir

mlir/test/Transforms/cse.mlir

mlir/test/Transforms/test-legalizer-full.mlir

mlir/test/lib/Dialect/Test/TestOps.td

Add a new interface allowing to set a default dialect to be used for printing/parsing regions
ClosedPublic