This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
mlir/
-
docs/
6/7
ConversionToLLVMDialect.md
-
include/mlir/
-
mlir/
-
Conversion/StandardToLLVM/
-
StandardToLLVM/
3/3
ConvertStandardToLLVM.h
-
ConvertStandardToLLVMPass.h
-
IR/
-
FunctionSupport.h
-
lib/
-
Conversion/
-
GPUToCUDA/
1/1
ConvertLaunchFuncToCudaCalls.cpp
-
GPUToNVVM/
-
LowerGpuOpsToNVVMOps.cpp
-
LinalgToLLVM/
-
LinalgToLLVM.cpp
-
StandardToLLVM/
12/14
ConvertStandardToLLVM.cpp
-
Dialect/GPU/IR/
-
GPU/
-
IR/
-
GPUDialect.cpp
-
Transforms/
2/4
DialectConversion.cpp
-
test/
-
Conversion/
-
GPUToCUDA/
-
lower-launch-func-to-cuda.mlir
-
StandardToLLVM/
-
convert-argattrs.mlir
-
convert-dynamic-memref-ops.mlir
1/1
convert-funcs.mlir
-
convert-static-memref-ops.mlir
-
convert-to-llvmir.mlir
-
standard-to-llvm.mlir
-
Dialect/
-
GPU/
-
invalid.mlir
-
Linalg/
-
llvm.mlir
-
mlir-cpu-runner/
-
cblas_interface.cpp
-
include/
-
cblas_interface.h
-
mlir_runner_utils.h
-
mlir_runner_utils.cpp
-
mlir-cuda-runner/
-
gpu-to-cubin.mlir
-
tools/mlir-cuda-runner/
-
mlir-cuda-runner/
-
cuda-runtime-wrappers.cpp

Differential D74211

[mlir] use unpacked memref descriptors at function boundaries
ClosedPublic

Authored by ftynse on Feb 7 2020, 3:34 AM.

Download Raw Diff

Details

Reviewers

nicolasvasilache
dcaballe
mravishankar
rriddle
herhut

Summary

The existing (default) calling convention for memrefs in standard-to-LLVM
conversion was motivated by interfacing with LLVM IR produced from C sources.
In particular, it passes a pointer to the memref descriptor structure when
calling the function. Therefore, the descriptor is allocated on stack before
the call. This convention leads to several problems. PR44644 indicates a
problem with stack exhaustion when calling functions with memref-typed
arguments in a loop. Allocating outside of the loop may lead to concurrent
access problems in case the loop is parallel. When targeting GPUs, the contents
of the stack-allocated memory for the descriptor (passed by pointer) needs to
be explicitly copied to the device. Using an aggregate type makes it impossible
to attach pointer-specific argument attributes pertaining to alignment and
aliasing in the LLVM dialect.

Change the default calling convention for memrefs in standard-to-LLVM
conversion to transform a memref into a list of arguments, each of primitive
type, that are comprised in the memref descriptor. This avoids stack allocation
for ranked memrefs (and thus stack exhaustion and potential concurrent access
problems) and simplifies the device function invocation on GPUs.

Provide an option in the standard-to-LLVM conversion to generate auxiliary
wrapper function with the same interface as the previous calling convention,
compatible with LLVM IR porduced from C sources. These auxiliary functions
pack the individual values into a descriptor structure or unpack it. They also
handle descriptor stack allocation if necessary, serving as an allocation
scope: the memory reserved by alloca will be freed on exiting the auxiliary
function.

The effect of this change on MLIR-generated only LLVM IR is minimal. When
interfacing MLIR-generated LLVM IR with C-generated LLVM IR, the integration
only needs to require auxiliary functions and change the function name to call
the wrapper function instead of the original function.

This also opens the door to forwarding aliasing and alignment information from
memrefs to LLVM IR pointers in the standrd-to-LLVM conversion.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

ftynse created this revision.Feb 7 2020, 3:34 AM

Herald added a reviewer: mravishankar. · View Herald TranscriptFeb 7 2020, 3:34 AM

Herald added a reviewer: rriddle. · View Herald Transcript

Herald added a project: Restricted Project. · View Herald Transcript

Herald added subscribers: llvm-commits, Joonsoo, liufengdb and 13 others. · View Herald Transcript

Thanks Alex for digging into this and explaining in detail the tradeoffs I also had to go through.
Most importantly thanks for putting enough sweat to getting to what I think is the best think we can hope for in the absence of handling the C ABI ourselves.
Looking now, with the objective of pushing this through with hi-pri.

Minor (ultra)-nit.
Looks great to me, this should also help with @dcaballe 's issues with forcing the memref descriptor ABI on everything.

Thanks for this great patch!

mlir/docs/ConversionToLLVMDialect.md
415	typo original
mlir/include/mlir/Conversion/StandardToLLVM/ConvertStandardToLLVM.h
162	ultra-nit: extra space before each line to make the list pop out better. Or use a proper list and backticks as such: `!llvm...`
mlir/lib/Conversion/StandardToLLVM/ConvertStandardToLLVM.cpp
284	typo individual
607	call `getNumUnpackedValues` for a single source of truth.
mlir/test/Conversion/StandardToLLVM/convert-funcs.mlir
21–22	typo argument

There is a typo in the description: porduced

mlir/docs/ConversionToLLVMDialect.md
497	nit: Introducing
498	typo: minize -> minimize
mlir/include/mlir/Conversion/StandardToLLVM/ConvertStandardToLLVM.h
65	nit: ///
mlir/lib/Conversion/GPUToCUDA/ConvertLaunchFuncToCudaCalls.cpp
243	nit: /// for the comments.
mlir/lib/Conversion/StandardToLLVM/ConvertStandardToLLVM.cpp
199	///
220	///
768	I thought std.varargs was removed? Also, you can use `op->getDialectAttrs()` to filter out the ones without a prefix.
2732	getModule().emitError ?
mlir/lib/Transforms/DialectConversion.cpp
404	This is relaxed because you are inserting multiple operations to perform the conversion?

Thanks Alex! LGTM. As Nicolas mentioned, I think this should help with the aliasing issue. Once it settles down I should be able to retire the bare ptr calling convention if nobody else is interested in it.

mlir/docs/ConversionToLLVMDialect.md
339	Stide -> Stride
mlir/include/mlir/Conversion/StandardToLLVM/ConvertStandardToLLVM.h
121	constituting?
mlir/lib/Conversion/StandardToLLVM/ConvertStandardToLLVM.cpp
224	Remove `type` if it's not needed?
284	constituting
883	drop {}
931–939	drop {}

nmostafa added a subscriber: nmostafa.Feb 7 2020, 11:29 AM

nmostafa added inline comments.

mlir/docs/ConversionToLLVMDialect.md
274	Can you add an example for how UnrankedMemRef gets unpacked as well ? Also, IIUC, if we are passing UnrankedMemRefDescriptor arguments in a loop, we still might exhaust the stack, since we still alloca, copy the MemRefDescriptor and pack the rank and alloca ptr into the UnrankedMemRef struct. Correct ?

addressed most comments

mlir/docs/ConversionToLLVMDialect.md
274	Will do. Yes, unranked memref still allocates once (instead of twice). I don't see an easy way around because it requires a pointer in order to erase the actual type. There are two things we can explore: outlining the `memref_cast` from ranked to unranked to scope the allocation, using vararg functions and argument reordering.
mlir/lib/Conversion/StandardToLLVM/ConvertStandardToLLVM.cpp
768	It was still there in the code, so I decided not to touch it in this patch. Will take a look in a follow-up.
883	I prefer not to becase `if` has braces.
mlir/lib/Transforms/DialectConversion.cpp
404	Exactly.

rriddle accepted this revision.Feb 7 2020, 2:14 PM

rriddle added inline comments.

mlir/lib/Transforms/DialectConversion.cpp
404	We erase use_empty cast operations during applyRewrites. How does this interact with generating multiple operations when casting? (This doesn't have to block this revision, but just curious on your thoughts there)

This revision is now accepted and ready to land.Feb 7 2020, 2:14 PM

Added example with unranked memref to the doc.
Added a helper view class.

ftynse marked 5 inline comments as done.Feb 10 2020, 5:36 AM

ftynse added inline comments.

mlir/docs/ConversionToLLVMDialect.md
274	Can you add an example for how UnrankedMemRef gets unpacked as well ? Done.
mlir/lib/Transforms/DialectConversion.cpp
404	It erases the last operation, but keeps the rest, which are all equally dead and have no side effects. It's a variant of the problem we face a lot in rewrites: do we clean up immediately or do we expect the canonicalizer to clean up later. I don't have a good answer here, but I wouldn't go out of my way for cleaning. In general, I considered the following: since we use DialectConversionRewriter, we can use its undo stack to remove all operations introduced in the cast materialization; with multiple operations generated, I'm not convinced that we can rely only on one of them (e.g., the last one) to consider the entire conversion dead; we can have stores and other side-effecting operations; the canonicalizer is better aware of deadness, so it sounds reasonable to rely on it instead to remove dead casts; (this is similar to the decision not to implement valuesToRemoveIfDead in replaceAllUsesWith IMO) if we want the cleanup, maybe we can try and call into the canonicalization directly.

Harbormaster failed remote builds in B46072: Diff 243518!Feb 10 2020, 5:38 AM

Landed in https://github.com/llvm/llvm-project/commit/5a1778057f72b8e0444a7932144a3fa441b641bc

This is great! Thanks a lot for fixing this, and for such a detailed and helpful commit message! Once this is committed, https://github.com/tensorflow/mlir/issues/210 can also be closed (I think it was the first report filed on this issue.)

mehdi_amini added inline comments.Feb 10 2020, 8:38 PM

mlir/lib/Conversion/StandardToLLVM/ConvertStandardToLLVM.cpp
50	Seems like you didn't provide tests for this flag?

ftynse marked an inline comment as done.Feb 11 2020, 5:03 AM

ftynse added inline comments.

mlir/lib/Conversion/StandardToLLVM/ConvertStandardToLLVM.cpp
50	Indeed, forgot to git add :/ Thanks for noticing. Pushed in ea3a25e4f5166ccd5e523f0165f5270b24d71f46.

Unfortunately, this commit is breaking the lowering of the noalias attribute to LLVM when the bare pointer calling convention is used. @check_noalias test in convert-static-memref-ops.mlir passes successfully but it seems that the problem happens only when more than one argument carries the attribute. For example, this test will fail if you modify it as follows:

--- a/mlir/test/Conversion/StandardToLLVM/convert-static-memref-ops.mlir
+++ b/mlir/test/Conversion/StandardToLLVM/convert-static-memref-ops.mlir
@@ -3,8 +3,9 @@
 // RUN: mlir-opt -convert-std-to-llvm='use-bare-ptr-memref-call-conv=1' -split-input-file %s | FileCheck %s --check-prefix=BAREPTR

 // BAREPTR-LABEL: func @check_noalias
+// BAREPTR-SAME: %{{.*}}: !llvm<"float*"> {llvm.noalias = true},
 // BAREPTR-SAME: %{{.*}}: !llvm<"float*"> {llvm.noalias = true}
-func @check_noalias(%static : memref<2xf32> {llvm.noalias = true}) {
+func @check_noalias(%static1 : memref<2xf32> {llvm.noalias = true}, %static2 : memref<2xf32> {llvm.noalias = true}) {
     return
 }

It would be great if someone could have a look at this quickly or revert the commit if this is not blocking for you.
I'll try to take a look tomorrow if nobody hasn't.

Thanks!

@dcaballe I don't think reverting the commit will be a reasonable path here, a bunch of other things and integrates have switched to this.

The problem seems to be that the existing tests were not sufficiently covering the behavior you are interested in enforcing?
It seems the natural solution would be to fix those cases and add the proper tests to cover those behaviors?

In D74211#1875514, @nicolasvasilache wrote:

@dcaballe I don't think reverting the commit will be a reasonable path here, a bunch of other things and integrates have switched to this.

The problem seems to be that the existing tests were not sufficiently covering the behavior you are interested in enforcing?
It seems the natural solution would be to fix those cases and add the proper tests to cover those behaviors?

If a fix can be quickly provided by the commit author, sure. Otherwise a few days after a breakage is introduced, reverting is absolutely the correct path.
(you need to revert 696f80736b861 along the way).

Should be fixed by 39cb2a8fc79976171b20369ff756f7fa43232b50.

Reverting or fixing it yourself is a complexity trade-off. Patches that are bigger and/or stayed in the repo longer have higher chance of transitively affecting other patches up to a point that you'd need to revert dozens of commits and resolve conflicts, which may take you more time than finding a trivial problem....

Thanks for addressing this so quickly and for the feedback! I'm giving it a try. Re reverting patches, this is a very common practice. Sometimes patches are reverted without giving an opportunity to provide a fix. It shouldn't be a problem. LLVM has a no regression policy and testing goes beyond in-tree tests since many vendors have private forks. In any case, I just provided some options so that you could decide. Thanks again for addressing this so quickly!

Herald added a reviewer: herhut. · View Herald TranscriptFeb 14 2020, 10:47 AM

denis13 added a subscriber: denis13.Feb 14 2020, 11:16 AM

@ayzhuang found another case that is breaking our project using the default calling convention. The noalias attribute is now added to all the flattened arguments, including those that are not pointers. LLVM complaints about it:

<stdin>:4:3: error: llvm.noalias attribute attached to LLVM non-pointer argument
  llvm.func @check_noalias(%arg0: !llvm<"float*"> {llvm.noalias = true}, %arg1: !llvm<"float*"> {llvm.noalias = true}, %arg2: !llvm.i64 {llvm.noalias = true}, %arg3: !llvm.i64 {llvm.noalias
= true}, %arg4: !llvm.i64 {llvm.noalias = true}, %arg5: !llvm<"float*"> {llvm.noalias = true}, %arg6: !llvm<"float*"> {llvm.noalias = true}, %arg7: !llvm.i64 {llvm.noalias = true}, %arg8: !l
lvm.i64 {llvm.noalias = true}, %arg9: !llvm.i64 {llvm.noalias = true}) {
  ^

This is extending the test case to cover this new scenario:

diff --git a/mlir/test/Conversion/StandardToLLVM/convert-static-memref-ops.mlir b/mlir/test/Conversion/StandardToLLVM/convert-static-memref-ops.mlir
index 105a80dec7a..da3ae1341f0 100644
--- a/mlir/test/Conversion/StandardToLLVM/convert-static-memref-ops.mlir
+++ b/mlir/test/Conversion/StandardToLLVM/convert-static-memref-ops.mlir
@@ -2,6 +2,9 @@
 // RUN: mlir-opt -convert-std-to-llvm='use-alloca=1' %s | FileCheck %s --check-prefix=ALLOCA
 // RUN: mlir-opt -convert-std-to-llvm='use-bare-ptr-memref-call-conv=1' -split-input-file %s | FileCheck %s --check-prefix=BAREPTR

+// CHECK-LABEL: func @check_noalias
+// CHECK-SAME: %{{.*}}: !llvm<"float*"> {llvm.noalias = true}, %{{.*}}: !llvm<"float*"> {llvm.noalias = true}, %{{.*}}: !llvm.i64, %{{.*}}: !llvm.i64, %{{.*}}: !llvm.i64,
+// CHECK-SAME: %{{.*}}: !llvm<"float*"> {llvm.noalias = true}, %{{.*}}: !llvm<"float*"> {llvm.noalias = true}, %{{.*}}: !llvm.i64, %{{.*}}: !llvm.i64, %{{.*}}: !llvm.i64)
 // BAREPTR-LABEL: func @check_noalias
 // BAREPTR-SAME: %{{.*}}: !llvm<"float*"> {llvm.noalias = true}, %{{.*}}: !llvm<"float*"> {llvm.noalias = true}
 func @check_noalias(%static : memref<2xf32> {llvm.noalias = true}, %other : memref<2xf32> {llvm.noalias = true}) {

In D74211#1876818, @dcaballe wrote:

Thanks for addressing this so quickly and for the feedback! I'm giving it a try. Re reverting patches, this is a very common practice. Sometimes patches are reverted without giving an opportunity to provide a fix. It shouldn't be a problem. LLVM has a no regression policy and testing goes beyond in-tree tests since many vendors have private forks. In any case, I just provided some options so that you could decide. Thanks again for addressing this so quickly!

I know it's common. In this specific situation, you will get in a weird situation where reverting this patch could fix your project, but break other projects, so somebody would revert the revert :)

In D74211#1877293, @dcaballe wrote:

@ayzhuang found another case that is breaking our project using the default calling convention. The noalias attribute is now added to all the flattened arguments, including those that are not pointers. LLVM complaints about it:

You should not be using llvm.noalias with the default calling convention. It has not worked before, since you cannot attach noalias to structures either. So I will be surprised if you actually had a test exercising the default calling convention with llvm.noalias that used to pass and that was broken by this change.

This is extending the test case to cover this new scenario:

diff --git a/mlir/test/Conversion/StandardToLLVM/convert-static-memref-ops.mlir b/mlir/test/Conversion/StandardToLLVM/convert-static-memref-ops.mlir
index 105a80dec7a..da3ae1341f0 100644
--- a/mlir/test/Conversion/StandardToLLVM/convert-static-memref-ops.mlir
+++ b/mlir/test/Conversion/StandardToLLVM/convert-static-memref-ops.mlir
@@ -2,6 +2,9 @@
 // RUN: mlir-opt -convert-std-to-llvm='use-alloca=1' %s | FileCheck %s --check-prefix=ALLOCA
 // RUN: mlir-opt -convert-std-to-llvm='use-bare-ptr-memref-call-conv=1' -split-input-file %s | FileCheck %s --check-prefix=BAREPTR

+// CHECK-LABEL: func @check_noalias
+// CHECK-SAME: %{{.*}}: !llvm<"float*"> {llvm.noalias = true}, %{{.*}}: !llvm<"float*"> {llvm.noalias = true}, %{{.*}}: !llvm.i64, %{{.*}}: !llvm.i64, %{{.*}}: !llvm.i64,
+// CHECK-SAME: %{{.*}}: !llvm<"float*"> {llvm.noalias = true}, %{{.*}}: !llvm<"float*"> {llvm.noalias = true}, %{{.*}}: !llvm.i64, %{{.*}}: !llvm.i64, %{{.*}}: !llvm.i64)
 // BAREPTR-LABEL: func @check_noalias
 // BAREPTR-SAME: %{{.*}}: !llvm<"float*"> {llvm.noalias = true}, %{{.*}}: !llvm<"float*"> {llvm.noalias = true}
 func @check_noalias(%static : memref<2xf32> {llvm.noalias = true}, %other : memref<2xf32> {llvm.noalias = true}) {

This is wrong. The allocated and aligned pointer _do_ alias, so attaching "noalias" to them is incorrect. There is no aliasing model on memrefs. When we have one, we will see how to lower them to the LLVM dialect.

In D74211#1878702, @ftynse wrote:

In D74211#1876818, @dcaballe wrote:

Thanks for addressing this so quickly and for the feedback! I'm giving it a try. Re reverting patches, this is a very common practice. Sometimes patches are reverted without giving an opportunity to provide a fix. It shouldn't be a problem. LLVM has a no regression policy and testing goes beyond in-tree tests since many vendors have private forks. In any case, I just provided some options so that you could decide. Thanks again for addressing this so quickly!

I know it's common. In this specific situation, you will get in a weird situation where reverting this patch could fix your project, but break other projects, so somebody would revert the revert :)

I am not sure I agree with this: if other project already depended on a *feature* they don't get to revert a revert that was fixing a *bug* introduced with this new feature. They have to deal with this out-of-tree.
Of course the delay matters, in this case it was just a few days later: the revert is a no-brainer for me (unless the author can provide a quick fix, as I mentioned before, and which is what happened in this case).

So I will be surprised if you actually had a test exercising the default calling convention with llvm.noalias that used to pass and that was broken by this change.

We actually have been testing this since before memrefs were lowered to a pointer to struct. It became useless but it worked since the noalias attribute was attached to the pointer to struct. However, I think I can take care of this on our side. No problem then.
Thanks!

Revision Contents

Path

Size

mlir/

docs/

ConversionToLLVMDialect.md

276 lines

include/

mlir/

Conversion/

StandardToLLVM/

ConvertStandardToLLVM.h

138 lines

ConvertStandardToLLVMPass.h

13 lines

IR/

FunctionSupport.h

7 lines

lib/

Conversion/

GPUToCUDA/

ConvertLaunchFuncToCudaCalls.cpp

108 lines

GPUToNVVM/

LowerGpuOpsToNVVMOps.cpp

23 lines

LinalgToLLVM/

LinalgToLLVM.cpp

3 lines

StandardToLLVM/

ConvertStandardToLLVM.cpp

496 lines

Dialect/

GPU/

IR/

GPUDialect.cpp

29 lines

Transforms/

DialectConversion.cpp

3 lines

test/

Conversion/

GPUToCUDA/

lower-launch-func-to-cuda.mlir

4 lines

StandardToLLVM/

convert-argattrs.mlir

20 lines

convert-dynamic-memref-ops.mlir

92 lines

convert-funcs.mlir

6 lines

convert-static-memref-ops.mlir

83 lines

convert-to-llvmir.mlir

20 lines

standard-to-llvm.mlir

3 lines

Dialect/

GPU/

invalid.mlir

28 lines

Linalg/

llvm.mlir

27 lines

mlir-cpu-runner/

cblas_interface.cpp

37 lines

include/

cblas_interface.h

27 lines

mlir_runner_utils.h

20 lines

mlir_runner_utils.cpp

30 lines

mlir-cuda-runner/

gpu-to-cubin.mlir

7 lines

tools/

mlir-cuda-runner/

cuda-runtime-wrappers.cpp

35 lines

Diff 243518

mlir/docs/ConversionToLLVMDialect.md

Show First 20 Lines • Show All 242 Lines • ▼ Show 20 Lines	func @bar() {
// use as before		// use as before
"use_i32"(%3) : (!llvm.i32) -> ()		"use_i32"(%3) : (!llvm.i32) -> ()
"use_i64"(%4) : (!llvm.i64) -> ()		"use_i64"(%4) : (!llvm.i64) -> ()
}		}
```		```

### Calling Convention for `memref`		### Calling Convention for `memref`

For function _arguments_ of `memref` type, ranked or unranked, the type of the		Function _arguments_ of `memref` type, ranked or unranked, are _expanded_ into a
argument is a _pointer_ to the memref descriptor type defined above. The caller		list of arguments of non-aggregate types that the memref descriptor defined
of such function is required to store the descriptor in memory and guarantee		above comprises. That is, the outer struct type and the inner array types are
that the storage remains live until the callee returns. The caller can than pass		replaced with individual arguments.
the pointer to that memory as function argument. The callee loads from the
pointers it was passed as arguments in the entry block of the function, making
the descriptor passed in as argument available for use similarly to
ocally-defined descriptors.

This convention is implemented in the conversion of `std.func` and `std.call` to		This convention is implemented in the conversion of `std.func` and `std.call` to
the LLVM dialect. Conversions from other dialects should take it into account.		the LLVM dialect, with the former unpacking the descriptor into a set of
The motivation for this convention is to simplify the ABI for interfacing with		individual values and the latter packing those values back into a descriptor so
other LLVM modules, in particular those generated from C sources, while avoiding		as to make it transparently usable by other operations. Conversions from other
platform-specific aspects until MLIR has a proper ABI modeling.		dialects should take this convention into account.

Example:		This specific convention is motivated by the necessity to specify alignment and
		aliasing attributes on the raw pointers underpinning the memref.

```mlir		Examples:

func @foo(memref<?xf32>) -> () {		```mlir
%c0 = constant 0 : index		func @foo(%arg0: memref<?xf32>) -> () {
load %arg0[%c0] : memref<?xf32>		"use"(%arg0) : (memref<?xf32>) -> ()
return		return
}		}

func @bar(%arg0: index) {		// Gets converted to the following.
%0 = alloc(%arg0) : memref<?xf32>
		nmostafaUnsubmitted Done Reply Inline Actions Can you add an example for how UnrankedMemRef gets unpacked as well ? Also, IIUC, if we are passing UnrankedMemRefDescriptor arguments in a loop, we still might exhaust the stack, since we still alloca, copy the MemRefDescriptor and pack the rank and alloca ptr into the UnrankedMemRef struct. Correct ? nmostafa: Can you add an example for how UnrankedMemRef gets unpacked as well ? Also, IIUC, if we are…
		ftynseAuthorUnsubmitted Not Done Reply Inline Actions Will do. Yes, unranked memref still allocates once (instead of twice). I don't see an easy way around because it requires a pointer in order to erase the actual type. There are two things we can explore: outlining the `memref_cast` from ranked to unranked to scope the allocation, using vararg functions and argument reordering. ftynse: Will do. Yes, unranked memref still allocates once (instead of twice). I don't see an easy way…
		ftynseAuthorUnsubmitted Done Reply Inline Actions Can you add an example for how UnrankedMemRef gets unpacked as well ? Done. ftynse: > Can you add an example for how UnrankedMemRef gets unpacked as well ? Done.
		llvm.func @foo(%arg0: !llvm<"float*">, // Allocated pointer.
		%arg1: !llvm<"float*">, // Aligned pointer.
		%arg2: !llvm.i64, // Offset.
		%arg3: !llvm.i64, // Size in dim 0.
		%arg4: !llvm.i64) { // Stride in dim 0.
		// Populate memref descriptor structure.
		%0 = llvm.mlir.undef : !llvm<"{ float, float, i64, [1 x i64], [1 x i64] }">
		%1 = llvm.insertvalue %arg0, %0[0] : !llvm<"{ float, float, i64, [1 x i64], [1 x i64] }">
		%2 = llvm.insertvalue %arg1, %1[1] : !llvm<"{ float, float, i64, [1 x i64], [1 x i64] }">
		%3 = llvm.insertvalue %arg2, %2[2] : !llvm<"{ float, float, i64, [1 x i64], [1 x i64] }">
		%4 = llvm.insertvalue %arg3, %3[3, 0] : !llvm<"{ float, float, i64, [1 x i64], [1 x i64] }">
		%5 = llvm.insertvalue %arg4, %4[4, 0] : !llvm<"{ float, float, i64, [1 x i64], [1 x i64] }">

		// Descriptor is now usable as a single value.
		"use"(%5) : (!llvm<"{ float, float, i64, [1 x i64], [1 x i64] }">) -> ()
		llvm.return
		}
		```

		```mlir
		func @bar() {
		%0 = "get"() : () -> (memref<?xf32>)
call @foo(%0) : (memref<?xf32>)-> ()		call @foo(%0) : (memref<?xf32>) -> ()
return		return
}		}

// Gets converted to the following IR.		// Gets converted to the following.
// Accepts a pointer to the memref descriptor.
llvm.func @foo(!llvm<"{ float, float, i64, [1 x i64], [1 x i64] }*">) {		llvm.func @bar() {
// Loads the descriptor so that it can be used similarly to locally		%0 = "get"() : () -> !llvm<"{ float, float, i64, [1 x i64], [1 x i64] }">
// created descriptors.
%0 = llvm.load %arg0 : !llvm<"{ float, float, i64, [1 x i64], [1 x i64] }*">		// Unpack the memref descriptor.
}		%1 = llvm.extractvalue %0[0] : !llvm<"{ float, float, i64, [1 x i64], [1 x i64] }">
		%2 = llvm.extractvalue %0[1] : !llvm<"{ float, float, i64, [1 x i64], [1 x i64] }">
llvm.func @bar(%arg0: !llvm.i64) {		%3 = llvm.extractvalue %0[2] : !llvm<"{ float, float, i64, [1 x i64], [1 x i64] }">
// ... Allocation ...		%4 = llvm.extractvalue %0[3, 0] : !llvm<"{ float, float, i64, [1 x i64], [1 x i64] }">
// Definition of the descriptor.		%5 = llvm.extractvalue %0[4, 0] : !llvm<"{ float, float, i64, [1 x i64], [1 x i64] }">
%7 = llvm.mlir.undef : !llvm<"{ float, float, i64, [1 x i64], [1 x i64] }">
// ... Filling in the descriptor ...		// Pass individual values to the callee.
%14 = // The final value of the allocated descriptor.		llvm.call @foo(%1, %2, %3, %4, %5) : (!llvm<"float">, !llvm<"float">, !llvm.i64, !llvm.i64, !llvm.i64) -> ()
// Allocate the memory for the descriptor and store it.		llvm.return
%15 = llvm.mlir.constant(1 : index) : !llvm.i64		}
%16 = llvm.alloca %15 x !llvm<"{ float, float, i64, [1 x i64], [1 x i64] }">
: (!llvm.i64) -> !llvm<"{ float, float, i64, [1 x i64], [1 x i64] }*">		```
llvm.store %14, %16 : !llvm<"{ float, float, i64, [1 x i64], [1 x i64] }*">
// Pass the pointer to the function.		For unranked memrefs, the list of function arguments always contains two
llvm.call @foo(%16) : (!llvm<"{ float, float, i64, [1 x i64], [1 x i64] }*">) -> ()		elements, same as the unranked memref descriptor: an integer rank, and a
		type-erased (`!llvm<"i8*">`) pointer to the ranked memref descriptor. Note that
		while the _calling convention_ does not require stack allocation, _casting_ to
		unranked memref does since one cannot take an address of an SSA value containing
		the ranked memref. The caller is in charge of ensuring the thread safety and
		eventually removing unnecessary stack allocations in cast operations.

		Example

		```mlir
		llvm.func @foo(%arg0: memref<*xf32>) -> () {
		"use"(%arg0) : (memref<*xf32>) -> ()
		return
		}

		// Gets converted to the following.

		llvm.func @foo(%arg0: !llvm.i64 // Rank.
		%arg1: !llvm<"i8*">) { // Type-erased pointer to descriptor.
		dcaballeUnsubmitted Done Reply Inline Actions Stide -> Stride dcaballe: Stide -> Stride
		// Pack the unranked memref descriptor.
		%0 = llvm.mlir.undef : !llvm<"{ i64, i8* }">
		%1 = llvm.insertvalue %arg0, %0[0] : !llvm<"{ i64, i8* }">
		%2 = llvm.insertvalue %arg1, %1[1] : !llvm<"{ i64, i8* }">

		"use"(%2) : (!llvm<"{ i64, i8* }">) -> ()
		llvm.return
		}
		```

		```mlir
		llvm.func @bar() {
		%0 = "get"() : () -> (memref<*xf32>)
		call @foo(%0): (memref<*xf32>) -> ()
		return
		}

		// Gets converted to the following.

		llvm.func @bar() {
		%0 = "get"() : () -> (!llvm<"{ i64, i8* }">)

		// Unpack the memref descriptor.
		%1 = llvm.extractvalue %0[0] : !llvm<"{ i64, i8* }">
		%2 = llvm.extractvalue %0[1] : !llvm<"{ i64, i8* }">

		// Pass individual values to the callee.
		llvm.call @foo(%1, %2) : (!llvm.i64, !llvm<"i8*">)
llvm.return		llvm.return
}		}
```		```

*This convention may or may not apply if the conversion of MemRef types is		*This convention may or may not apply if the conversion of MemRef types is
overridden by the user.*		overridden by the user.*

		### C-compatible wrapper emission

		In practical cases, it may be desirable to have externally-facing functions
		with a single attribute corresponding to a MemRef argument. When interfacing
		with LLVM IR produced from C, the code needs to respect the corresponding
		calling convention. The conversion to the LLVM dialect provides an option to
		generate wrapper functions that take memref descriptors as pointers-to-struct
		compatible with data types produced by Clang when compiling C sources.

		More specifically, a memref argument is converted into a pointer-to-struct
		argument of type `{T, T, i64, i64[N], i64[N]}*` in the wrapper function, where
		`T` is the converted element type and `N` is the memref rank. This type is
		compatible with that produced by Clang for the following C++ structure template
		instantiations or their equivalents in C.

		```cpp
		template<typename T, size_t N>
		struct MemRefDescriptor {
		T *allocated;
		T *aligned;
		intptr_t offset;
		intptr_t sizes[N];
		intptr_t stides[N];
		};
		```

		If enabled, the option will do the following. For _external_ functions declared
		in the MLIR module.

		1. Declare a new function `_mlir_ciface_<original name>` where memref arguments
		are converted to pointer-to-struct and the remaining arguments are converted
		as usual.
		1. Add a body to the original function (making it non-external) that
		1. allocates a memref descriptor,
		1. populates it, and
		1. passes the pointer to it into the newly declared interface function
		1. collects the result of the call and returns it to the caller.

		For (non-external) functions defined in the MLIR module.

		1. Define a new function `_mlir_ciface_<original name>` where memref arguments
		nicolasvasilacheUnsubmitted Done Reply Inline Actions typo original nicolasvasilache: typo original
		are converted to pointer-to-struct and the remaining arguments are converted
		as usual.
		1. Populate the body of the newly defined function with IR that
		1. loads descriptors from pointers;
		1. unpacks descriptor into individual non-aggregate values;
		1. passes these values into the original function;
		1. collects the result of the call and returns it to the caller.

		Examples:

		```mlir

		func @qux(%arg0: memref<?x?xf32>)

		// Gets converted into the following.

		// Function with unpacked arguments.
		llvm.func @qux(%arg0: !llvm<"float">, %arg1: !llvm<"float">, %arg2: !llvm.i64,
		%arg3: !llvm.i64, %arg4: !llvm.i64, %arg5: !llvm.i64,
		%arg6: !llvm.i64) {
		// Populate memref descriptor (as per calling convention).
		%0 = llvm.mlir.undef : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }">
		%1 = llvm.insertvalue %arg0, %0[0] : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }">
		%2 = llvm.insertvalue %arg1, %1[1] : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }">
		%3 = llvm.insertvalue %arg2, %2[2] : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }">
		%4 = llvm.insertvalue %arg3, %3[3, 0] : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }">
		%5 = llvm.insertvalue %arg5, %4[4, 0] : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }">
		%6 = llvm.insertvalue %arg4, %5[3, 1] : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }">
		%7 = llvm.insertvalue %arg6, %6[4, 1] : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }">

		// Store the descriptor in a stack-allocated space.
		%8 = llvm.mlir.constant(1 : index) : !llvm.i64
		%9 = llvm.alloca %8 x !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }">
		: (!llvm.i64) -> !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }*">
		llvm.store %7, %9 : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }*">

		// Call the interface function.
		llvm.call @_mlir_ciface_qux(%9) : (!llvm<"{ float, float, i64, [2 x i64], [2 x i64] }*">) -> ()

		// The stored descriptor will be freed on return.
		llvm.return
		}

		// Interface function.
		llvm.func @_mlir_ciface_qux(!llvm<"{ float, float, i64, [2 x i64], [2 x i64] }*">)
		```

		```mlir
		func @foo(%arg0: memref<?x?xf32>) {
		return
		}

		// Gets converted into the following.

		// Function with unpacked arguments.
		llvm.func @foo(%arg0: !llvm<"float">, %arg1: !llvm<"float">, %arg2: !llvm.i64,
		%arg3: !llvm.i64, %arg4: !llvm.i64, %arg5: !llvm.i64,
		%arg6: !llvm.i64) {
		llvm.return
		}

		// Interface function callable from C.
		llvm.func @_mlir_ciface_foo(%arg0: !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }*">) {
		// Load the descriptor.
		%0 = llvm.load %arg0 : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }*">

		// Unpack the descriptor as per calling convention.
		%1 = llvm.extractvalue %0[0] : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }">
		%2 = llvm.extractvalue %0[1] : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }">
		%3 = llvm.extractvalue %0[2] : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }">
		%4 = llvm.extractvalue %0[3, 0] : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }">
		%5 = llvm.extractvalue %0[3, 1] : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }">
		%6 = llvm.extractvalue %0[4, 0] : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }">
		%7 = llvm.extractvalue %0[4, 1] : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }">
		llvm.call @foo(%1, %2, %3, %4, %5, %6, %7)
		: (!llvm<"float">, !llvm<"float">, !llvm.i64, !llvm.i64, !llvm.i64,
		!llvm.i64, !llvm.i64) -> ()
		llvm.return
		}
		```

		Rationale: Introducing auxiliary functions for C-compatible interfaces is
		rriddleUnsubmitted Done Reply Inline Actions nit: Introducing rriddle: nit: Introducing
		preferred to modifying the calling convention since it will minimize the effect
		rriddleUnsubmitted Done Reply Inline Actions typo: minize -> minimize rriddle: typo: minize -> minimize
		of C compatibility on intra-module calls or calls between MLIR-generated
		functions. In particular, when calling external functions from an MLIR module in
		a (parallel) loop, the fact of storing a memref descriptor on stack can lead to
		stack exhaustion and/or concurrent access to the same address. Auxiliary
		interface function serves as an allocation scope in this case. Furthermore, when
		targeting accelerators with separate memory spaces such as GPUs, stack-allocated
		descriptors passed by pointer would have to be transferred to the device memory,
		which introduces significant overhead. In such situations, auxiliary interface
		functions are executed on host and only pass the values through device function
		invocation mechanism.

## Repeated Successor Removal		## Repeated Successor Removal

Since the goal of the LLVM IR dialect is to reflect LLVM IR in MLIR, the dialect		Since the goal of the LLVM IR dialect is to reflect LLVM IR in MLIR, the dialect
and the conversion procedure must account for the differences between block		and the conversion procedure must account for the differences between block
arguments and LLVM IR PHI nodes. In particular, LLVM IR disallows PHI nodes with		arguments and LLVM IR PHI nodes. In particular, LLVM IR disallows PHI nodes with
different values coming from the same source. Therefore, the LLVM IR dialect		different values coming from the same source. Therefore, the LLVM IR dialect
disallows operations that have identical successors accepting arguments, which		disallows operations that have identical successors accepting arguments, which
would lead to invalid PHI nodes. The conversion process resolves the potential		would lead to invalid PHI nodes. The conversion process resolves the potential
▲ Show 20 Lines • Show All 126 Lines • Show Last 20 Lines

mlir/include/mlir/Conversion/StandardToLLVM/ConvertStandardToLLVM.h

Show All 30 Lines

namespace LLVM {		namespace LLVM {
class LLVMDialect;		class LLVMDialect;
class LLVMType;		class LLVMType;
} // namespace LLVM		} // namespace LLVM

/// Set of callbacks that allows the customization of LLVMTypeConverter.		/// Set of callbacks that allows the customization of LLVMTypeConverter.
struct LLVMTypeConverterCustomization {		struct LLVMTypeConverterCustomization {
using CustomCallback =		using CustomCallback = std::function<LogicalResult(LLVMTypeConverter &, Type,
std::function<LLVM::LLVMType(LLVMTypeConverter &, Type)>;		SmallVectorImpl<Type> &)>;

/// Customize the type conversion of function arguments.		/// Customize the type conversion of function arguments.
CustomCallback funcArgConverter;		CustomCallback funcArgConverter;

/// Initialize customization to default callbacks.		/// Initialize customization to default callbacks.
LLVMTypeConverterCustomization();		LLVMTypeConverterCustomization();
};		};

/// Callback to convert function argument types. It converts a MemRef function		/// Callback to convert function argument types. It converts a MemRef function
/// argument to a struct that contains the descriptor information. Converted		/// argument to a list of non-aggregate types containing descriptor
/// types are promoted to a pointer to the converted type.		/// information, and an UnrankedmemRef function argument to a list containing
LLVM::LLVMType structFuncArgTypeConverter(LLVMTypeConverter &converter,		/// the rank and a pointer to a descriptor struct.
Type type);		LogicalResult structFuncArgTypeConverter(LLVMTypeConverter &converter,
		Type type,
		SmallVectorImpl<Type> &result);

/// Callback to convert function argument types. It converts MemRef function		/// Callback to convert function argument types. It converts MemRef function
/// arguments to bare pointers to the MemRef element type. Converted types are		/// arguments to bare pointers to the MemRef element type.
/// not promoted to pointers.		LogicalResult barePtrFuncArgTypeConverter(LLVMTypeConverter &converter,
LLVM::LLVMType barePtrFuncArgTypeConverter(LLVMTypeConverter &converter,		Type type,
Type type);		SmallVectorImpl<Type> &result);

/// Conversion from types in the Standard dialect to the LLVM IR dialect.		/// Conversion from types in the Standard dialect to the LLVM IR dialect.
class LLVMTypeConverter : public TypeConverter {		class LLVMTypeConverter : public TypeConverter {
		/// Give structFuncArgTypeConverter access to memref-specific functions.
		rriddleUnsubmitted Done Reply Inline Actions nit: /// rriddle: nit: ///
		friend LogicalResult
		structFuncArgTypeConverter(LLVMTypeConverter &converter, Type type,
		SmallVectorImpl<Type> &result);

public:		public:
using TypeConverter::convertType;		using TypeConverter::convertType;

/// Create an LLVMTypeConverter using the default		/// Create an LLVMTypeConverter using the default
/// LLVMTypeConverterCustomization.		/// LLVMTypeConverterCustomization.
LLVMTypeConverter(MLIRContext *ctx);		LLVMTypeConverter(MLIRContext *ctx);

/// Create an LLVMTypeConverter using 'custom' customizations.		/// Create an LLVMTypeConverter using 'custom' customizations.
Show All 31 Lines	SmallVector<Value, 4> promoteMemRefDescriptors(Location loc,
OpBuilder &builder);		OpBuilder &builder);

/// Promote the LLVM struct representation of one MemRef descriptor to stack		/// Promote the LLVM struct representation of one MemRef descriptor to stack
/// and use pointer to struct to avoid the complexity of the platform-specific		/// and use pointer to struct to avoid the complexity of the platform-specific
/// C/C++ ABI lowering related to struct argument passing.		/// C/C++ ABI lowering related to struct argument passing.
Value promoteOneMemRefDescriptor(Location loc, Value operand,		Value promoteOneMemRefDescriptor(Location loc, Value operand,
OpBuilder &builder);		OpBuilder &builder);

		/// Converts the function type to a C-compatible format, in particular using
		/// pointers to memref descriptors for arguments.
		LLVM::LLVMType convertFunctionTypeCWrapper(FunctionType type);

		/// Creates descriptor structs from individual values constituting them.
		dcaballeUnsubmitted Done Reply Inline Actions constituting? dcaballe: constituting?
		Operation *materializeConversion(PatternRewriter &rewriter, Type type,
		ArrayRef<Value> values,
		Location loc) override;

protected:		protected:
/// LLVM IR module used to parse/create types.		/// LLVM IR module used to parse/create types.
llvm::Module *module;		llvm::Module *module;
LLVM::LLVMDialect *llvmDialect;		LLVM::LLVMDialect *llvmDialect;

private:		private:
Type convertStandardType(Type type);		Type convertStandardType(Type type);

Show All 10 Lines	private:
// Convert an integer type `i` to `!llvm<"i">`.		// Convert an integer type `i` to `!llvm<"i">`.
Type convertIntegerType(IntegerType type);		Type convertIntegerType(IntegerType type);

// Convert a floating point type: `f16` to `!llvm.half`, `f32` to		// Convert a floating point type: `f16` to `!llvm.half`, `f32` to
// `!llvm.float` and `f64` to `!llvm.double`. `bf16` is not supported		// `!llvm.float` and `f64` to `!llvm.double`. `bf16` is not supported
// by LLVM.		// by LLVM.
Type convertFloatType(FloatType type);		Type convertFloatType(FloatType type);

// Convert a memref type into an LLVM type that captures the relevant data.		/// Convert a memref type into an LLVM type that captures the relevant data.
// For statically-shaped memrefs, the resulting type is a pointer to the
// (converted) memref element type. For dynamically-shaped memrefs, the
// resulting type is an LLVM structure type that contains:
// 1. a pointer to the (converted) memref element type
// 2. as many index types as memref has dynamic dimensions.
Type convertMemRefType(MemRefType type);		Type convertMemRefType(MemRefType type);

		/// Convert a memref type into a list of non-aggregate LLVM IR types that
		/// contain all the relevant data. In particular, the list will contain:
		/// - two pointers to the memref element type, followed by
		/// - an integer offset, followed by
		/// - one integer size per dimension of the memref, followed by
		/// - one integer stride per dimension of the memref.
		/// For example, memref<?x?xf32> is converted to the following list:
		/// - `!llvm<"float*">` (allocated pointer),
		nicolasvasilacheUnsubmitted Done Reply Inline Actions ultra-nit: extra space before each line to make the list pop out better. Or use a proper list and backticks as such: `!llvm...` nicolasvasilache: ultra-nit: extra space before each line to make the list pop out better. Or use a proper list…
		/// - `!llvm<"float*">` (aligned pointer),
		/// - `!llvm.i64` (offset),
		/// - `!llvm.i64`, `!llvm.i64` (sizes),
		/// - `!llvm.i64`, `!llvm.i64` (strides).
		/// These types can be recomposed to a memref descriptor struct.
		SmallVector<Type, 5> convertMemRefSignature(MemRefType type);

		/// Convert an unranked memref type into a list of non-aggregate LLVM IR types
		/// that contain all the relevant data. In particular, this list contains:
		/// - an integer rank, followed by
		/// - a pointer to the memref descriptor struct.
		/// For example, memref<*xf32> is converted to the following list:
		/// !llvm.i64 (rank)
		/// !llvm<"i8*"> (type-erased pointer).
		/// These types can be recomposed to a unranked memref descriptor struct.
		SmallVector<Type, 2> convertUnrankedMemRefSignature();

// Convert an unranked memref type to an LLVM type that captures the		// Convert an unranked memref type to an LLVM type that captures the
// runtime rank and a pointer to the static ranked memref desc		// runtime rank and a pointer to the static ranked memref desc
Type convertUnrankedMemRefType(UnrankedMemRefType type);		Type convertUnrankedMemRefType(UnrankedMemRefType type);

// Convert a 1D vector type into an LLVM vector type.		// Convert a 1D vector type into an LLVM vector type.
Type convertVectorType(VectorType type);		Type convertVectorType(VectorType type);

// Get the LLVM representation of the index type based on the bitwidth of the		// Get the LLVM representation of the index type based on the bitwidth of the
Show All 23 Lines	protected:
Type structType;		Type structType;

protected:		protected:
/// Builds IR to extract a value from the struct at position pos		/// Builds IR to extract a value from the struct at position pos
Value extractPtr(OpBuilder &builder, Location loc, unsigned pos);		Value extractPtr(OpBuilder &builder, Location loc, unsigned pos);
/// Builds IR to set a value in the struct at position pos		/// Builds IR to set a value in the struct at position pos
void setPtr(OpBuilder &builder, Location loc, unsigned pos, Value ptr);		void setPtr(OpBuilder &builder, Location loc, unsigned pos, Value ptr);
};		};

/// Helper class to produce LLVM dialect operations extracting or inserting		/// Helper class to produce LLVM dialect operations extracting or inserting
/// elements of a MemRef descriptor. Wraps a Value pointing to the descriptor.		/// elements of a MemRef descriptor. Wraps a Value pointing to the descriptor.
/// The Value may be null, in which case none of the operations are valid.		/// The Value may be null, in which case none of the operations are valid.
class MemRefDescriptor : public StructBuilder {		class MemRefDescriptor : public StructBuilder {
public:		public:
/// Construct a helper for the given descriptor value.		/// Construct a helper for the given descriptor value.
explicit MemRefDescriptor(Value descriptor);		explicit MemRefDescriptor(Value descriptor);
/// Builds IR creating an `undef` value of the descriptor type.		/// Builds IR creating an `undef` value of the descriptor type.
Show All 38 Lines	public:
/// Builds IR inserting the pos-th stride into the descriptor		/// Builds IR inserting the pos-th stride into the descriptor
void setStride(OpBuilder &builder, Location loc, unsigned pos, Value stride);		void setStride(OpBuilder &builder, Location loc, unsigned pos, Value stride);
void setConstantStride(OpBuilder &builder, Location loc, unsigned pos,		void setConstantStride(OpBuilder &builder, Location loc, unsigned pos,
uint64_t stride);		uint64_t stride);

/// Returns the (LLVM) type this descriptor points to.		/// Returns the (LLVM) type this descriptor points to.
LLVM::LLVMType getElementType();		LLVM::LLVMType getElementType();

		/// Builds IR populating a MemRef descriptor structure from a list of
		/// individual values composing that descriptor, in the following order:
		/// - allocated pointer;
		/// - aligned pointer;
		/// - offset;
		/// - <rank> sizes;
		/// - <rank> shapes;
		/// where <rank> is the MemRef rank as provided in `type`.
		static Value pack(OpBuilder &builder, Location loc,
		LLVMTypeConverter &converter, MemRefType type,
		ValueRange values);

		/// Builds IR extracting individual elements of a MemRef descriptor structure
		/// and returning them as `results` list.
		static void unpack(OpBuilder &builder, Location loc, Value packed,
		MemRefType type, SmallVectorImpl<Value> &results);

		/// Returns the number of non-aggregate values that would be produced by
		/// `unpack`.
		static unsigned getNumUnpackedValues(MemRefType type);

private:		private:
// Cached index type.		// Cached index type.
Type indexType;		Type indexType;
};		};

		/// Helper class allowing the user to access a range of Values that correspond
		/// to an unpacked memref descriptor using named accessors. This does not own
		/// the values.
		class MemRefDescriptorView {
		public:
		/// Constructs the view from a range of values. Infers the rank from the size
		/// of the range.
		explicit MemRefDescriptorView(ValueRange range);

		/// Returns the allocated pointer Value.
		Value allocatedPtr();

		/// Returns the aligned pointer Value.
		Value alignedPtr();

		/// Returns the offset Value.
		Value offset();

		/// Returns the pos-th size Value.
		Value size(unsigned pos);

		/// Returns the pos-th stride Value.
		Value stride(unsigned pos);

		private:
		/// Rank of the memref the descriptor is pointing to.
		int rank;
		/// Underlying range of Values.
		ValueRange elements;
		};

class UnrankedMemRefDescriptor : public StructBuilder {		class UnrankedMemRefDescriptor : public StructBuilder {
public:		public:
/// Construct a helper for the given descriptor value.		/// Construct a helper for the given descriptor value.
explicit UnrankedMemRefDescriptor(Value descriptor);		explicit UnrankedMemRefDescriptor(Value descriptor);
/// Builds IR creating an `undef` value of the descriptor type.		/// Builds IR creating an `undef` value of the descriptor type.
static UnrankedMemRefDescriptor undef(OpBuilder &builder, Location loc,		static UnrankedMemRefDescriptor undef(OpBuilder &builder, Location loc,
Type descriptorType);		Type descriptorType);

/// Builds IR extracting the rank from the descriptor		/// Builds IR extracting the rank from the descriptor
Value rank(OpBuilder &builder, Location loc);		Value rank(OpBuilder &builder, Location loc);
/// Builds IR setting the rank in the descriptor		/// Builds IR setting the rank in the descriptor
void setRank(OpBuilder &builder, Location loc, Value value);		void setRank(OpBuilder &builder, Location loc, Value value);
/// Builds IR extracting ranked memref descriptor ptr		/// Builds IR extracting ranked memref descriptor ptr
Value memRefDescPtr(OpBuilder &builder, Location loc);		Value memRefDescPtr(OpBuilder &builder, Location loc);
/// Builds IR setting ranked memref descriptor ptr		/// Builds IR setting ranked memref descriptor ptr
void setMemRefDescPtr(OpBuilder &builder, Location loc, Value value);		void setMemRefDescPtr(OpBuilder &builder, Location loc, Value value);

		/// Builds IR populating an unranked MemRef descriptor structure from a list
		/// of individual constituent values in the following order:
		/// - rank of the memref;
		/// - pointer to the memref descriptor.
		static Value pack(OpBuilder &builder, Location loc,
		LLVMTypeConverter &converter, UnrankedMemRefType type,
		ValueRange values);

		/// Builds IR extracting individual elements that compose an unranked memref
		/// descriptor and returns them as `results` list.
		static void unpack(OpBuilder &builder, Location loc, Value packed,
		SmallVectorImpl<Value> &results);

		/// Returns the number of non-aggregate values that would be produced by
		/// `unpack`.
		static unsigned getNumUnpackedValues() { return 2; }
};		};
/// Base class for operation conversions targeting the LLVM IR dialect. Provides		/// Base class for operation conversions targeting the LLVM IR dialect. Provides
/// conversion patterns with an access to the containing LLVMLowering for the		/// conversion patterns with an access to the containing LLVMLowering for the
/// purpose of type conversions.		/// purpose of type conversions.
class LLVMOpLowering : public ConversionPattern {		class LLVMOpLowering : public ConversionPattern {
public:		public:
LLVMOpLowering(StringRef rootOpName, MLIRContext *context,		LLVMOpLowering(StringRef rootOpName, MLIRContext *context,
LLVMTypeConverter &lowering, PatternBenefit benefit = 1);		LLVMTypeConverter &lowering, PatternBenefit benefit = 1);
Show All 10 Lines

mlir/include/mlir/Conversion/StandardToLLVM/ConvertStandardToLLVMPass.h

Show All 23 Lines	void populateStdToLLVMMemoryConversionPatters(
LLVMTypeConverter &converter, OwningRewritePatternList &patterns,		LLVMTypeConverter &converter, OwningRewritePatternList &patterns,
bool useAlloca);		bool useAlloca);

/// Collect a set of patterns to convert from the Standard dialect to the LLVM		/// Collect a set of patterns to convert from the Standard dialect to the LLVM
/// dialect, excluding the memory-related operations.		/// dialect, excluding the memory-related operations.
void populateStdToLLVMNonMemoryConversionPatterns(		void populateStdToLLVMNonMemoryConversionPatterns(
LLVMTypeConverter &converter, OwningRewritePatternList &patterns);		LLVMTypeConverter &converter, OwningRewritePatternList &patterns);

/// Collect the default pattern to convert a FuncOp to the LLVM dialect.		/// Collect the default pattern to convert a FuncOp to the LLVM dialect. If
		/// `emitCWrappers` is set, the pattern will also produce functions
		/// that pass memref descriptors by pointer-to-structure in addition to the
		/// default unpacked form.
void populateStdToLLVMDefaultFuncOpConversionPattern(		void populateStdToLLVMDefaultFuncOpConversionPattern(
LLVMTypeConverter &converter, OwningRewritePatternList &patterns);		LLVMTypeConverter &converter, OwningRewritePatternList &patterns,
		bool emitCWrappers = false);

/// Collect a set of default patterns to convert from the Standard dialect to		/// Collect a set of default patterns to convert from the Standard dialect to
/// LLVM. If `useAlloca` is set, the patterns for AllocOp and DeallocOp will		/// LLVM. If `useAlloca` is set, the patterns for AllocOp and DeallocOp will
/// generate `llvm.alloca` instead of calls to "malloc".		/// generate `llvm.alloca` instead of calls to "malloc".
void populateStdToLLVMConversionPatterns(LLVMTypeConverter &converter,		void populateStdToLLVMConversionPatterns(LLVMTypeConverter &converter,
OwningRewritePatternList &patterns,		OwningRewritePatternList &patterns,
bool useAlloca = false);		bool useAlloca = false,
		bool emitCWrappers = false);

/// Collect a set of patterns to convert from the Standard dialect to		/// Collect a set of patterns to convert from the Standard dialect to
/// LLVM using the bare pointer calling convention for MemRef function		/// LLVM using the bare pointer calling convention for MemRef function
/// arguments. If `useAlloca` is set, the patterns for AllocOp and DeallocOp		/// arguments. If `useAlloca` is set, the patterns for AllocOp and DeallocOp
/// will generate `llvm.alloca` instead of calls to "malloc".		/// will generate `llvm.alloca` instead of calls to "malloc".
void populateStdToLLVMBarePtrConversionPatterns(		void populateStdToLLVMBarePtrConversionPatterns(
LLVMTypeConverter &converter, OwningRewritePatternList &patterns,		LLVMTypeConverter &converter, OwningRewritePatternList &patterns,
bool useAlloca = false);		bool useAlloca = false);

/// Creates a pass to convert the Standard dialect into the LLVMIR dialect.		/// Creates a pass to convert the Standard dialect into the LLVMIR dialect.
/// By default stdlib malloc/free are used for allocating MemRef payloads.		/// By default stdlib malloc/free are used for allocating MemRef payloads.
/// Specifying `useAlloca-true` emits stack allocations instead. In the future		/// Specifying `useAlloca-true` emits stack allocations instead. In the future
/// this may become an enum when we have concrete uses for other options.		/// this may become an enum when we have concrete uses for other options.
std::unique_ptr<OpPassBase<ModuleOp>>		std::unique_ptr<OpPassBase<ModuleOp>>
createLowerToLLVMPass(bool useAlloca = false);		createLowerToLLVMPass(bool useAlloca = false, bool emitCWrappers = false);

namespace LLVM {		namespace LLVM {
/// Make argument-taking successors of each block distinct. PHI nodes in LLVM		/// Make argument-taking successors of each block distinct. PHI nodes in LLVM
/// IR use the predecessor ID to identify which value to take. They do not		/// IR use the predecessor ID to identify which value to take. They do not
/// support different values coming from the same predecessor. If a block has		/// support different values coming from the same predecessor. If a block has
/// another block as a successor more than once with different values, insert		/// another block as a successor more than once with different values, insert
/// a new dummy block for LLVM PHI nodes to tell the sources apart.		/// a new dummy block for LLVM PHI nodes to tell the sources apart.
void ensureDistinctSuccessors(ModuleOp m);		void ensureDistinctSuccessors(ModuleOp m);
} // namespace LLVM		} // namespace LLVM

} // namespace mlir		} // namespace mlir

#endif // MLIR_CONVERSION_STANDARDTOLLVM_CONVERTSTANDARDTOLLVMPASS_H_		#endif // MLIR_CONVERSION_STANDARDTOLLVM_CONVERTSTANDARDTOLLVMPASS_H_

mlir/include/mlir/IR/FunctionSupport.h

	Show All 24 Lines
	inline StringRef getTypeAttrName() { return "type"; }			inline StringRef getTypeAttrName() { return "type"; }

	/// Return the name of the attribute used for function arguments.			/// Return the name of the attribute used for function arguments.
	inline StringRef getArgAttrName(unsigned arg, SmallVectorImpl<char> &out) {			inline StringRef getArgAttrName(unsigned arg, SmallVectorImpl<char> &out) {
	out.clear();			out.clear();
	return ("arg" + Twine(arg)).toStringRef(out);			return ("arg" + Twine(arg)).toStringRef(out);
	}			}

				/// Returns true if the given name is a valid argument attribute name.
				inline bool isArgAttrName(StringRef name) {
				APInt unused;
				return name.startswith("arg") &&
				!name.drop_front(3).getAsInteger(/Radix=/10, unused);
				}

	/// Return the name of the attribute used for function results.			/// Return the name of the attribute used for function results.
	inline StringRef getResultAttrName(unsigned arg, SmallVectorImpl<char> &out) {			inline StringRef getResultAttrName(unsigned arg, SmallVectorImpl<char> &out) {
	out.clear();			out.clear();
	return ("result" + Twine(arg)).toStringRef(out);			return ("result" + Twine(arg)).toStringRef(out);
	}			}

	/// Returns the dictionary attribute corresponding to the argument at 'index'.			/// Returns the dictionary attribute corresponding to the argument at 'index'.
	/// If there are no argument attributes at 'index', a null attribute is			/// If there are no argument attributes at 'index', a null attribute is
	▲ Show 20 Lines • Show All 499 Lines • Show Last 20 Lines

mlir/lib/Conversion/GPUToCUDA/ConvertLaunchFuncToCudaCalls.cpp

Show First 20 Lines • Show All 107 Lines • ▼ Show 20 Lines	private:
Value allocatePointer(OpBuilder &builder, Location loc) {		Value allocatePointer(OpBuilder &builder, Location loc) {
auto one = builder.create<LLVM::ConstantOp>(loc, getInt32Type(),		auto one = builder.create<LLVM::ConstantOp>(loc, getInt32Type(),
builder.getI32IntegerAttr(1));		builder.getI32IntegerAttr(1));
return builder.create<LLVM::AllocaOp>(loc, getPointerPointerType(), one,		return builder.create<LLVM::AllocaOp>(loc, getPointerPointerType(), one,
/alignment=/0);		/alignment=/0);
}		}

void declareCudaFunctions(Location loc);		void declareCudaFunctions(Location loc);
		void addParamToList(OpBuilder &builder, Location loc, Value param, Value list,
		unsigned pos, Value one);
Value setupParamsArray(gpu::LaunchFuncOp launchOp, OpBuilder &builder);		Value setupParamsArray(gpu::LaunchFuncOp launchOp, OpBuilder &builder);
Value generateKernelNameConstant(StringRef name, Location loc,		Value generateKernelNameConstant(StringRef name, Location loc,
OpBuilder &builder);		OpBuilder &builder);
void translateGpuLaunchCalls(mlir::gpu::LaunchFuncOp launchOp);		void translateGpuLaunchCalls(mlir::gpu::LaunchFuncOp launchOp);

public:		public:
// Run the dialect converter on the module.		// Run the dialect converter on the module.
void runOnModule() override {		void runOnModule() override {
▲ Show 20 Lines • Show All 102 Lines • ▼ Show 20 Lines	builder.create<LLVM::LLVMFuncOp>(
{		{
getPointerType(), /* void ptr /		getPointerType(), /* void ptr /
getInt64Type() /* int64 sizeBytes*/		getInt64Type() /* int64 sizeBytes*/
},		},
/isVarArg=/false));		/isVarArg=/false));
}		}
}		}

		/// Emits the IR with the following structure:
		///
		/// %data = llvm.alloca 1 x type-of(<param>)
		/// llvm.store <param>, %data
		/// %typeErased = llvm.bitcast %data to !llvm<"i8*">
		/// %addr = llvm.getelementptr <list>[<pos>]
		/// llvm.store %typeErased, %addr
		///
		rriddleUnsubmitted Done Reply Inline Actions nit: /// for the comments. rriddle: nit: /// for the comments.
		/// This is necessary to construct the list of arguments passed to the kernel
		/// function as accepted by cuLaunchKernel, i.e. as a void** that points to list
		/// of stack-allocated type-erased pointers to the actual arguments.
		void GpuLaunchFuncToCudaCallsPass::addParamToList(OpBuilder &builder,
		Location loc, Value param,
		Value list, unsigned pos,
		Value one) {
		auto memLocation = builder.create<LLVM::AllocaOp>(
		loc, param.getType().cast<LLVM::LLVMType>().getPointerTo(), one,
		/alignment=/1);
		builder.create<LLVM::StoreOp>(loc, param, memLocation);
		auto casted =
		builder.create<LLVM::BitcastOp>(loc, getPointerType(), memLocation);

		auto index = builder.create<LLVM::ConstantOp>(loc, getInt32Type(),
		builder.getI32IntegerAttr(pos));
		auto gep = builder.create<LLVM::GEPOp>(loc, getPointerPointerType(), list,
		ArrayRef<Value>{index});
		builder.create<LLVM::StoreOp>(loc, casted, gep);
		}

// Generates a parameters array to be used with a CUDA kernel launch call. The		// Generates a parameters array to be used with a CUDA kernel launch call. The
// arguments are extracted from the launchOp.		// arguments are extracted from the launchOp.
// The generated code is essentially as follows:		// The generated code is essentially as follows:
//		//
// %array = alloca(numparams * sizeof(void *))		// %array = alloca(numparams * sizeof(void *))
// for (i : [0, NumKernelOperands))		// for (i : [0, NumKernelOperands))
// %array[i] = cast<void*>(KernelOperand[i])		// %array[i] = cast<void*>(KernelOperand[i])
// return %array		// return %array
Value GpuLaunchFuncToCudaCallsPass::setupParamsArray(gpu::LaunchFuncOp launchOp,		Value GpuLaunchFuncToCudaCallsPass::setupParamsArray(gpu::LaunchFuncOp launchOp,
OpBuilder &builder) {		OpBuilder &builder) {

		// Get the launch target.
		auto containingModule = launchOp.getParentOfType<ModuleOp>();
		if (!containingModule)
		return {};
		auto gpuModule = containingModule.lookupSymbol<gpu::GPUModuleOp>(
		launchOp.getKernelModuleName());
		if (!gpuModule)
		return {};
		auto gpuFunc = gpuModule.lookupSymbol<LLVM::LLVMFuncOp>(launchOp.kernel());
		if (!gpuFunc)
		return {};

		unsigned numArgs = gpuFunc.getNumArguments();

auto numKernelOperands = launchOp.getNumKernelOperands();		auto numKernelOperands = launchOp.getNumKernelOperands();
Location loc = launchOp.getLoc();		Location loc = launchOp.getLoc();
auto one = builder.create<LLVM::ConstantOp>(loc, getInt32Type(),		auto one = builder.create<LLVM::ConstantOp>(loc, getInt32Type(),
builder.getI32IntegerAttr(1));		builder.getI32IntegerAttr(1));
// Provision twice as much for the `array` to allow up to one level of
// indirection for each argument.
auto arraySize = builder.create<LLVM::ConstantOp>(		auto arraySize = builder.create<LLVM::ConstantOp>(
loc, getInt32Type(), builder.getI32IntegerAttr(numKernelOperands));		loc, getInt32Type(), builder.getI32IntegerAttr(numArgs));
auto array = builder.create<LLVM::AllocaOp>(loc, getPointerPointerType(),		auto array = builder.create<LLVM::AllocaOp>(loc, getPointerPointerType(),
arraySize, /alignment=/0);		arraySize, /alignment=/0);

		unsigned pos = 0;
for (unsigned idx = 0; idx < numKernelOperands; ++idx) {		for (unsigned idx = 0; idx < numKernelOperands; ++idx) {
auto operand = launchOp.getKernelOperand(idx);		auto operand = launchOp.getKernelOperand(idx);
auto llvmType = operand.getType().cast<LLVM::LLVMType>();		auto llvmType = operand.getType().cast<LLVM::LLVMType>();
Value memLocation = builder.create<LLVM::AllocaOp>(
loc, llvmType.getPointerTo(), one, /alignment=/1);
builder.create<LLVM::StoreOp>(loc, operand, memLocation);
auto casted =
builder.create<LLVM::BitcastOp>(loc, getPointerType(), memLocation);

// Assume all struct arguments come from MemRef. If this assumption does not		// Assume all struct arguments come from MemRef. If this assumption does not
// hold anymore then we `launchOp` to lower from MemRefType and not after		// hold anymore then we `launchOp` to lower from MemRefType and not after
// LLVMConversion has taken place and the MemRef information is lost.		// LLVMConversion has taken place and the MemRef information is lost.
// Extra level of indirection in the `array`:		if (!llvmType.isStructTy()) {
// the descriptor pointer is registered via @mcuMemHostRegisterPtr		addParamToList(builder, loc, operand, array, pos++, one);
if (llvmType.isStructTy()) {		continue;
auto registerFunc =		}
getModule().lookupSymbol<LLVM::LLVMFuncOp>(kMcuMemHostRegister);
auto nullPtr = builder.create<LLVM::NullOp>(loc, llvmType.getPointerTo());		// Put individual components of a memref descriptor into the flat argument
auto gep = builder.create<LLVM::GEPOp>(loc, llvmType.getPointerTo(),		// list. We cannot use unpackMemref from LLVM lowering here because we have
ArrayRef<Value>{nullPtr, one});		// no access to MemRefType that had been lowered away.
auto size = builder.create<LLVM::PtrToIntOp>(loc, getInt64Type(), gep);		for (int32_t j = 0, ej = llvmType.getStructNumElements(); j < ej; ++j) {
builder.create<LLVM::CallOp>(loc, ArrayRef<Type>{},		auto elemType = llvmType.getStructElementType(j);
builder.getSymbolRefAttr(registerFunc),		if (elemType.isArrayTy()) {
ArrayRef<Value>{casted, size});		for (int32_t k = 0, ek = elemType.getArrayNumElements(); k < ek; ++k) {
Value memLocation = builder.create<LLVM::AllocaOp>(		Value elem = builder.create<LLVM::ExtractValueOp>(
loc, getPointerPointerType(), one, /alignment=/1);		loc, elemType.getArrayElementType(), operand,
builder.create<LLVM::StoreOp>(loc, casted, memLocation);		builder.getI32ArrayAttr({j, k}));
casted =		addParamToList(builder, loc, elem, array, pos++, one);
builder.create<LLVM::BitcastOp>(loc, getPointerType(), memLocation);		}
		} else {
		assert((elemType.isIntegerTy() \|\| elemType.isFloatTy() \|\|
		elemType.isDoubleTy() \|\| elemType.isPointerTy()) &&
		"expected scalar type");
		Value strct = builder.create<LLVM::ExtractValueOp>(
		loc, elemType, operand, builder.getI32ArrayAttr(j));
		addParamToList(builder, loc, strct, array, pos++, one);
		}
}		}

auto index = builder.create<LLVM::ConstantOp>(
loc, getInt32Type(), builder.getI32IntegerAttr(idx));
auto gep = builder.create<LLVM::GEPOp>(loc, getPointerPointerType(), array,
ArrayRef<Value>{index});
builder.create<LLVM::StoreOp>(loc, casted, gep);
}		}

return array;		return array;
}		}

// Generates an LLVM IR dialect global that contains the name of the given		// Generates an LLVM IR dialect global that contains the name of the given
// kernel function as a C string, and returns a pointer to its beginning.		// kernel function as a C string, and returns a pointer to its beginning.
// The code is essentially:		// The code is essentially:
//		//
// llvm.global constant @kernel_name("function_name\00")		// llvm.global constant @kernel_name("function_name\00")
▲ Show 20 Lines • Show All 88 Lines • ▼ Show 20 Lines	auto cuStream = builder.create<LLVM::CallOp>(
loc, ArrayRef<Type>{getPointerType()},		loc, ArrayRef<Type>{getPointerType()},
builder.getSymbolRefAttr(cuGetStreamHelper), ArrayRef<Value>{});		builder.getSymbolRefAttr(cuGetStreamHelper), ArrayRef<Value>{});
// Invoke the function with required arguments.		// Invoke the function with required arguments.
auto cuLaunchKernel =		auto cuLaunchKernel =
getModule().lookupSymbol<LLVM::LLVMFuncOp>(cuLaunchKernelName);		getModule().lookupSymbol<LLVM::LLVMFuncOp>(cuLaunchKernelName);
auto cuFunctionRef =		auto cuFunctionRef =
builder.create<LLVM::LoadOp>(loc, getPointerType(), cuFunction);		builder.create<LLVM::LoadOp>(loc, getPointerType(), cuFunction);
auto paramsArray = setupParamsArray(launchOp, builder);		auto paramsArray = setupParamsArray(launchOp, builder);
		if (!paramsArray) {
		launchOp.emitOpError() << "cannot pass given parameters to the kernel";
		return signalPassFailure();
		}
auto nullpointer =		auto nullpointer =
builder.create<LLVM::IntToPtrOp>(loc, getPointerPointerType(), zero);		builder.create<LLVM::IntToPtrOp>(loc, getPointerPointerType(), zero);
builder.create<LLVM::CallOp>(		builder.create<LLVM::CallOp>(
loc, ArrayRef<Type>{getCUResultType()},		loc, ArrayRef<Type>{getCUResultType()},
builder.getSymbolRefAttr(cuLaunchKernel),		builder.getSymbolRefAttr(cuLaunchKernel),
ArrayRef<Value>{cuFunctionRef, launchOp.getOperand(0),		ArrayRef<Value>{cuFunctionRef, launchOp.getOperand(0),
launchOp.getOperand(1), launchOp.getOperand(2),		launchOp.getOperand(1), launchOp.getOperand(2),
launchOp.getOperand(3), launchOp.getOperand(4),		launchOp.getOperand(3), launchOp.getOperand(4),
Show All 21 Lines

mlir/lib/Conversion/GPUToNVVM/LowerGpuOpsToNVVMOps.cpp

Show First 20 Lines • Show All 558 Lines • ▼ Show 20 Lines	matchAndRewrite(Operation *op, ArrayRef<Value> operands,
// Rewrite the original GPU function to an LLVM function.		// Rewrite the original GPU function to an LLVM function.
auto funcType = lowering.convertType(gpuFuncOp.getType())		auto funcType = lowering.convertType(gpuFuncOp.getType())
.cast<LLVM::LLVMType>()		.cast<LLVM::LLVMType>()
.getPointerElementTy();		.getPointerElementTy();

// Remap proper input types.		// Remap proper input types.
TypeConverter::SignatureConversion signatureConversion(		TypeConverter::SignatureConversion signatureConversion(
gpuFuncOp.front().getNumArguments());		gpuFuncOp.front().getNumArguments());
for (unsigned i = 0, e = funcType.getFunctionNumParams(); i < e; ++i)		lowering.convertFunctionSignature(gpuFuncOp.getType(), /isVariadic=/false,
signatureConversion.addInputs(i, funcType.getFunctionParamType(i));		signatureConversion);

// Create the new function operation. Only copy those attributes that are		// Create the new function operation. Only copy those attributes that are
// not specific to function modeling.		// not specific to function modeling.
SmallVector<NamedAttribute, 4> attributes;		SmallVector<NamedAttribute, 4> attributes;
for (const auto &attr : gpuFuncOp.getAttrs()) {		for (const auto &attr : gpuFuncOp.getAttrs()) {
if (attr.first.is(SymbolTable::getSymbolAttrName()) \|\|		if (attr.first.is(SymbolTable::getSymbolAttrName()) \|\|
attr.first.is(impl::getTypeAttrName()) \|\|		attr.first.is(impl::getTypeAttrName()) \|\|
attr.first.is(gpu::GPUFuncOp::getNumWorkgroupAttributionsAttrName()))		attr.first.is(gpu::GPUFuncOp::getNumWorkgroupAttributionsAttrName()))
▲ Show 20 Lines • Show All 69 Lines • ▼ Show 20 Lines	matchAndRewrite(Operation *op, ArrayRef<Value> operands,
}		}

// Move the region to the new function, update the entry block signature.		// Move the region to the new function, update the entry block signature.
rewriter.inlineRegionBefore(gpuFuncOp.getBody(), llvmFuncOp.getBody(),		rewriter.inlineRegionBefore(gpuFuncOp.getBody(), llvmFuncOp.getBody(),
llvmFuncOp.end());		llvmFuncOp.end());
rewriter.applySignatureConversion(&llvmFuncOp.getBody(),		rewriter.applySignatureConversion(&llvmFuncOp.getBody(),
signatureConversion);		signatureConversion);

{
// For memref-typed arguments, insert the relevant loads in the beginning
// of the block to comply with the LLVM dialect calling convention. This
// needs to be done after signature conversion to get the right types.
OpBuilder::InsertionGuard guard(rewriter);
Block &block = llvmFuncOp.front();
rewriter.setInsertionPointToStart(&block);

for (auto en : llvm::enumerate(gpuFuncOp.getType().getInputs())) {
if (!en.value().isa<MemRefType>() &&
!en.value().isa<UnrankedMemRefType>())
continue;

BlockArgument arg = block.getArgument(en.index());
Value loaded = rewriter.create<LLVM::LoadOp>(loc, arg);
rewriter.replaceUsesOfBlockArgument(arg, loaded);
}
}

rewriter.eraseOp(gpuFuncOp);		rewriter.eraseOp(gpuFuncOp);
return matchSuccess();		return matchSuccess();
}		}
};		};

struct GPUReturnOpLowering : public LLVMOpLowering {		struct GPUReturnOpLowering : public LLVMOpLowering {
GPUReturnOpLowering(LLVMTypeConverter &typeConverter)		GPUReturnOpLowering(LLVMTypeConverter &typeConverter)
: LLVMOpLowering(gpu::ReturnOp::getOperationName(),		: LLVMOpLowering(gpu::ReturnOp::getOperationName(),
▲ Show 20 Lines • Show All 78 Lines • Show Last 20 Lines

mlir/lib/Conversion/LinalgToLLVM/LinalgToLLVM.cpp

	Show First 20 Lines • Show All 571 Lines • ▼ Show 20 Lines
	void ConvertLinalgToLLVMPass::runOnModule() {			void ConvertLinalgToLLVMPass::runOnModule() {
	auto module = getModule();			auto module = getModule();

	// Convert to the LLVM IR dialect using the converter defined above.			// Convert to the LLVM IR dialect using the converter defined above.
	OwningRewritePatternList patterns;			OwningRewritePatternList patterns;
	LinalgTypeConverter converter(&getContext());			LinalgTypeConverter converter(&getContext());
	populateAffineToStdConversionPatterns(patterns, &getContext());			populateAffineToStdConversionPatterns(patterns, &getContext());
	populateLoopToStdConversionPatterns(patterns, &getContext());			populateLoopToStdConversionPatterns(patterns, &getContext());
	populateStdToLLVMConversionPatterns(converter, patterns);			populateStdToLLVMConversionPatterns(converter, patterns, /useAlloca=/false,
				/emitCWrappers=/true);
	populateVectorToLLVMConversionPatterns(converter, patterns);			populateVectorToLLVMConversionPatterns(converter, patterns);
	populateLinalgToStandardConversionPatterns(patterns, &getContext());			populateLinalgToStandardConversionPatterns(patterns, &getContext());
	populateLinalgToLLVMConversionPatterns(converter, patterns, &getContext());			populateLinalgToLLVMConversionPatterns(converter, patterns, &getContext());

	ConversionTarget target(getContext());			ConversionTarget target(getContext());
	target.addLegalDialect<LLVM::LLVMDialect>();			target.addLegalDialect<LLVM::LLVMDialect>();
	target.addDynamicallyLegalOp<FuncOp>(			target.addDynamicallyLegalOp<FuncOp>(
	[&](FuncOp op) { return converter.isSignatureLegal(op.getType()); });			[&](FuncOp op) { return converter.isSignatureLegal(op.getType()); });
	Show All 12 Lines

mlir/lib/Conversion/StandardToLLVM/ConvertStandardToLLVM.cpp

Show All 24 Lines
#include "mlir/Transforms/DialectConversion.h"		#include "mlir/Transforms/DialectConversion.h"
#include "mlir/Transforms/Passes.h"		#include "mlir/Transforms/Passes.h"
#include "mlir/Transforms/Utils.h"		#include "mlir/Transforms/Utils.h"

#include "llvm/IR/DerivedTypes.h"		#include "llvm/IR/DerivedTypes.h"
#include "llvm/IR/IRBuilder.h"		#include "llvm/IR/IRBuilder.h"
#include "llvm/IR/Type.h"		#include "llvm/IR/Type.h"
#include "llvm/Support/CommandLine.h"		#include "llvm/Support/CommandLine.h"
		#include "llvm/Support/FormatVariadic.h"

using namespace mlir;		using namespace mlir;

#define PASS_NAME "convert-std-to-llvm"		#define PASS_NAME "convert-std-to-llvm"

static llvm::cl::OptionCategory		static llvm::cl::OptionCategory
clOptionsCategory("Standard to LLVM lowering options");		clOptionsCategory("Standard to LLVM lowering options");

static llvm::cl::opt<bool>		static llvm::cl::opt<bool>
clUseAlloca(PASS_NAME "-use-alloca",		clUseAlloca(PASS_NAME "-use-alloca",
llvm::cl::desc("Replace emission of malloc/free by alloca"),		llvm::cl::desc("Replace emission of malloc/free by alloca"),
llvm::cl::init(false));		llvm::cl::init(false));

		static llvm::cl::opt<bool>
		clEmitCWrappers(PASS_NAME "-emit-c-wrappers",
		llvm::cl::desc("Emit C-compatible wrapper functions"),
		llvm::cl::init(false));
		mehdi_aminiUnsubmitted Not Done Reply Inline Actions Seems like you didn't provide tests for this flag? mehdi_amini: Seems like you didn't provide tests for this flag?
		ftynseAuthorUnsubmitted Done Reply Inline Actions Indeed, forgot to git add :/ Thanks for noticing. Pushed in ea3a25e4f5166ccd5e523f0165f5270b24d71f46. ftynse: Indeed, forgot to git add :/ Thanks for noticing. Pushed in…

static llvm::cl::opt<bool> clUseBarePtrCallConv(		static llvm::cl::opt<bool> clUseBarePtrCallConv(
PASS_NAME "-use-bare-ptr-memref-call-conv",		PASS_NAME "-use-bare-ptr-memref-call-conv",
llvm::cl::desc("Replace FuncOp's MemRef arguments with "		llvm::cl::desc("Replace FuncOp's MemRef arguments with "
"bare pointers to the MemRef element types"),		"bare pointers to the MemRef element types"),
llvm::cl::init(false));		llvm::cl::init(false));

// Extract an LLVM IR type from the LLVM IR dialect type.		// Extract an LLVM IR type from the LLVM IR dialect type.
static LLVM::LLVMType unwrap(Type type) {		static LLVM::LLVMType unwrap(Type type) {
if (!type)		if (!type)
return nullptr;		return nullptr;
auto *mlirContext = type.getContext();		auto *mlirContext = type.getContext();
auto wrappedLLVMType = type.dyn_cast<LLVM::LLVMType>();		auto wrappedLLVMType = type.dyn_cast<LLVM::LLVMType>();
if (!wrappedLLVMType)		if (!wrappedLLVMType)
emitError(UnknownLoc::get(mlirContext),		emitError(UnknownLoc::get(mlirContext),
"conversion resulted in a non-LLVM type");		"conversion resulted in a non-LLVM type");
return wrappedLLVMType;		return wrappedLLVMType;
}		}

/// Initialize customization to default callbacks.		/// Initialize customization to default callbacks.
LLVMTypeConverterCustomization::LLVMTypeConverterCustomization() {		LLVMTypeConverterCustomization::LLVMTypeConverterCustomization() {
funcArgConverter = structFuncArgTypeConverter;		funcArgConverter = structFuncArgTypeConverter;
}		}

// Callback to convert function argument types. It converts a MemRef function		/// Callback to convert function argument types. It converts a MemRef function
// arguments to a struct that contains the descriptor information. Converted		/// argument to a list of non-aggregate types containing descriptor
// types are promoted to a pointer to the converted type.		/// information, and an UnrankedmemRef function argument to a list containing
LLVM::LLVMType mlir::structFuncArgTypeConverter(LLVMTypeConverter &converter,		/// the rank and a pointer to a descriptor struct.
Type type) {		LogicalResult mlir::structFuncArgTypeConverter(LLVMTypeConverter &converter,
auto converted =		Type type,
converter.convertType(type).dyn_cast_or_null<LLVM::LLVMType>();		SmallVectorImpl<Type> &result) {
		if (auto memref = type.dyn_cast<MemRefType>()) {
		auto converted = converter.convertMemRefSignature(memref);
		if (converted.empty())
		return failure();
		result.append(converted.begin(), converted.end());
		return success();
		}
		if (type.isa<UnrankedMemRefType>()) {
		auto converted = converter.convertUnrankedMemRefSignature();
		if (converted.empty())
		return failure();
		result.append(converted.begin(), converted.end());
		return success();
		}
		auto converted = converter.convertType(type);
if (!converted)		if (!converted)
return {};		return failure();
if (type.isa<MemRefType>() \|\| type.isa<UnrankedMemRefType>())		result.push_back(converted);
converted = converted.getPointerTo();		return success();
return converted;
}		}

/// Convert a MemRef type to a bare pointer to the MemRef element type.		/// Convert a MemRef type to a bare pointer to the MemRef element type.
static Type convertMemRefTypeToBarePtr(LLVMTypeConverter &converter,		static Type convertMemRefTypeToBarePtr(LLVMTypeConverter &converter,
MemRefType type) {		MemRefType type) {
int64_t offset;		int64_t offset;
SmallVector<int64_t, 4> strides;		SmallVector<int64_t, 4> strides;
if (failed(getStridesAndOffset(type, strides, offset)))		if (failed(getStridesAndOffset(type, strides, offset)))
return {};		return {};

LLVM::LLVMType elementType =		LLVM::LLVMType elementType =
unwrap(converter.convertType(type.getElementType()));		unwrap(converter.convertType(type.getElementType()));
if (!elementType)		if (!elementType)
return {};		return {};
return elementType.getPointerTo(type.getMemorySpace());		return elementType.getPointerTo(type.getMemorySpace());
}		}

/// Callback to convert function argument types. It converts MemRef function		/// Callback to convert function argument types. It converts MemRef function
/// arguments to bare pointers to the MemRef element type. Converted types are		/// arguments to bare pointers to the MemRef element type.
/// not promoted to pointers.		LogicalResult mlir::barePtrFuncArgTypeConverter(LLVMTypeConverter &converter,
LLVM::LLVMType mlir::barePtrFuncArgTypeConverter(LLVMTypeConverter &converter,		Type type,
Type type) {		SmallVectorImpl<Type> &result) {
// TODO: Add support for unranked memref.		// TODO: Add support for unranked memref.
if (auto memrefTy = type.dyn_cast<MemRefType>())		if (auto memrefTy = type.dyn_cast<MemRefType>()) {
return convertMemRefTypeToBarePtr(converter, memrefTy)		auto llvmTy = convertMemRefTypeToBarePtr(converter, memrefTy);
.dyn_cast_or_null<LLVM::LLVMType>();		if (!llvmTy)
return converter.convertType(type).dyn_cast_or_null<LLVM::LLVMType>();		return failure();

		result.push_back(llvmTy);
		return success();
		}

		auto llvmTy = converter.convertType(type);
		if (!llvmTy)
		return failure();

		result.push_back(llvmTy);
		return success();
}		}

/// Create an LLVMTypeConverter using default LLVMTypeConverterCustomization.		/// Create an LLVMTypeConverter using default LLVMTypeConverterCustomization.
LLVMTypeConverter::LLVMTypeConverter(MLIRContext *ctx)		LLVMTypeConverter::LLVMTypeConverter(MLIRContext *ctx)
: LLVMTypeConverter(ctx, LLVMTypeConverterCustomization()) {}		: LLVMTypeConverter(ctx, LLVMTypeConverterCustomization()) {}

/// Create an LLVMTypeConverter using 'custom' customizations.		/// Create an LLVMTypeConverter using 'custom' customizations.
LLVMTypeConverter::LLVMTypeConverter(		LLVMTypeConverter::LLVMTypeConverter(
▲ Show 20 Lines • Show All 44 Lines • ▼ Show 20 Lines
// pointer-to-function types.		// pointer-to-function types.
Type LLVMTypeConverter::convertFunctionType(FunctionType type) {		Type LLVMTypeConverter::convertFunctionType(FunctionType type) {
SignatureConversion conversion(type.getNumInputs());		SignatureConversion conversion(type.getNumInputs());
LLVM::LLVMType converted =		LLVM::LLVMType converted =
convertFunctionSignature(type, /isVariadic=/false, conversion);		convertFunctionSignature(type, /isVariadic=/false, conversion);
return converted.getPointerTo();		return converted.getPointerTo();
}		}

		/// In signatures, MemRef descriptors are expanded into lists of non-aggregate
		rriddleUnsubmitted Done Reply Inline Actions /// rriddle: ///
		/// values.
		SmallVector<Type, 5>
		LLVMTypeConverter::convertMemRefSignature(MemRefType type) {
		SmallVector<Type, 5> results;
		assert(isStrided(type) &&
		"Non-strided layout maps must have been normalized away");

		LLVM::LLVMType elementType = unwrap(convertType(type.getElementType()));
		if (!elementType)
		return {};
		auto indexTy = getIndexType();

		results.insert(results.begin(), 2,
		elementType.getPointerTo(type.getMemorySpace()));
		results.push_back(indexTy);
		auto rank = type.getRank();
		results.insert(results.end(), 2 * rank, indexTy);
		return results;
		}

		/// In signatures, unranked MemRef descriptors are expanded into a pair "rank,
		rriddleUnsubmitted Done Reply Inline Actions /// rriddle: ///
		/// pointer to descriptor".
		SmallVector<Type, 2> LLVMTypeConverter::convertUnrankedMemRefSignature() {
		return {getIndexType(), LLVM::LLVMType::getInt8PtrTy(llvmDialect)};
		}
		dcaballeUnsubmitted Done Reply Inline Actions Remove `type` if it's not needed? dcaballe: Remove `type` if it's not needed?

// Function types are converted to LLVM Function types by recursively converting		// Function types are converted to LLVM Function types by recursively converting
// argument and result types. If MLIR Function has zero results, the LLVM		// argument and result types. If MLIR Function has zero results, the LLVM
// Function has one VoidType result. If MLIR Function has more than one result,		// Function has one VoidType result. If MLIR Function has more than one result,
// they are into an LLVM StructType in their order of appearance.		// they are into an LLVM StructType in their order of appearance.
LLVM::LLVMType LLVMTypeConverter::convertFunctionSignature(		LLVM::LLVMType LLVMTypeConverter::convertFunctionSignature(
FunctionType type, bool isVariadic,		FunctionType type, bool isVariadic,
LLVMTypeConverter::SignatureConversion &result) {		LLVMTypeConverter::SignatureConversion &result) {
// Convert argument types one by one and check for errors.		// Convert argument types one by one and check for errors.
for (auto &en : llvm::enumerate(type.getInputs())) {		for (auto &en : llvm::enumerate(type.getInputs())) {
Type type = en.value();		Type type = en.value();
auto converted = customizations.funcArgConverter(*this, type)		SmallVector<Type, 8> converted;
.dyn_cast_or_null<LLVM::LLVMType>();		if (failed(customizations.funcArgConverter(*this, type, converted)))
if (!converted)
return {};		return {};
result.addInputs(en.index(), converted);		result.addInputs(en.index(), converted);
}		}

SmallVector<LLVM::LLVMType, 8> argTypes;		SmallVector<LLVM::LLVMType, 8> argTypes;
argTypes.reserve(llvm::size(result.getConvertedTypes()));		argTypes.reserve(llvm::size(result.getConvertedTypes()));
for (Type type : result.getConvertedTypes())		for (Type type : result.getConvertedTypes())
argTypes.push_back(unwrap(type));		argTypes.push_back(unwrap(type));

// If function does not return anything, create the void result type,		// If function does not return anything, create the void result type,
// if it returns on element, convert it, otherwise pack the result types into		// if it returns on element, convert it, otherwise pack the result types into
// a struct.		// a struct.
LLVM::LLVMType resultType =		LLVM::LLVMType resultType =
type.getNumResults() == 0		type.getNumResults() == 0
? LLVM::LLVMType::getVoidTy(llvmDialect)		? LLVM::LLVMType::getVoidTy(llvmDialect)
: unwrap(packFunctionResults(type.getResults()));		: unwrap(packFunctionResults(type.getResults()));
if (!resultType)		if (!resultType)
return {};		return {};
return LLVM::LLVMType::getFunctionTy(resultType, argTypes, isVariadic);		return LLVM::LLVMType::getFunctionTy(resultType, argTypes, isVariadic);
}		}

		/// Converts the function type to a C-compatible format, in particular using
		/// pointers to memref descriptors for arguments.
		LLVM::LLVMType
		LLVMTypeConverter::convertFunctionTypeCWrapper(FunctionType type) {
		SmallVector<LLVM::LLVMType, 4> inputs;

		for (Type t : type.getInputs()) {
		auto converted = convertType(t).dyn_cast_or_null<LLVM::LLVMType>();
		if (!converted)
		return {};
		if (t.isa<MemRefType>() \|\| t.isa<UnrankedMemRefType>())
		converted = converted.getPointerTo();
		inputs.push_back(converted);
		}

		LLVM::LLVMType resultType =
		type.getNumResults() == 0
		? LLVM::LLVMType::getVoidTy(llvmDialect)
		: unwrap(packFunctionResults(type.getResults()));
		if (!resultType)
		return {};

		return LLVM::LLVMType::getFunctionTy(resultType, inputs, false);
		}

		/// Creates descriptor structs from individual values constituting them.
		nicolasvasilacheUnsubmitted Done Reply Inline Actions typo individual nicolasvasilache: typo individual
		dcaballeUnsubmitted Done Reply Inline Actions constituting dcaballe: constituting
		Operation *LLVMTypeConverter::materializeConversion(PatternRewriter &rewriter,
		Type type,
		ArrayRef<Value> values,
		Location loc) {
		if (auto unrankedMemRefType = type.dyn_cast<UnrankedMemRefType>())
		return UnrankedMemRefDescriptor::pack(rewriter, loc, *this,
		unrankedMemRefType, values)
		.getDefiningOp();

		auto memRefType = type.dyn_cast<MemRefType>();
		assert(memRefType && "1->N conversion is only supported for memrefs");
		return MemRefDescriptor::pack(rewriter, loc, *this, memRefType, values)
		.getDefiningOp();
		}

// Convert a MemRef to an LLVM type. The result is a MemRef descriptor which		// Convert a MemRef to an LLVM type. The result is a MemRef descriptor which
// contains:		// contains:
// 1. the pointer to the data buffer, followed by		// 1. the pointer to the data buffer, followed by
// 2. a lowered `index`-type integer containing the distance between the		// 2. a lowered `index`-type integer containing the distance between the
// beginning of the buffer and the first element to be accessed through the		// beginning of the buffer and the first element to be accessed through the
// view, followed by		// view, followed by
// 3. an array containing as many `index`-type integers as the rank of the		// 3. an array containing as many `index`-type integers as the rank of the
// MemRef: the array represents the size, in number of elements, of the memref		// MemRef: the array represents the size, in number of elements, of the memref
▲ Show 20 Lines • Show All 258 Lines • ▼ Show 20 Lines	setStride(builder, loc, pos,
createIndexAttrConstant(builder, loc, indexType, stride));		createIndexAttrConstant(builder, loc, indexType, stride));
}		}

LLVM::LLVMType MemRefDescriptor::getElementType() {		LLVM::LLVMType MemRefDescriptor::getElementType() {
return value.getType().cast<LLVM::LLVMType>().getStructElementType(		return value.getType().cast<LLVM::LLVMType>().getStructElementType(
kAlignedPtrPosInMemRefDescriptor);		kAlignedPtrPosInMemRefDescriptor);
}		}

		/// Creates a MemRef descriptor structure from a list of individual values
		/// composing that descriptor, in the following order:
		/// - allocated pointer;
		/// - aligned pointer;
		/// - offset;
		/// - <rank> sizes;
		/// - <rank> shapes;
		/// where <rank> is the MemRef rank as provided in `type`.
		Value MemRefDescriptor::pack(OpBuilder &builder, Location loc,
		LLVMTypeConverter &converter, MemRefType type,
		ValueRange values) {
		Type llvmType = converter.convertType(type);
		auto d = MemRefDescriptor::undef(builder, loc, llvmType);

		d.setAllocatedPtr(builder, loc, values[kAllocatedPtrPosInMemRefDescriptor]);
		d.setAlignedPtr(builder, loc, values[kAlignedPtrPosInMemRefDescriptor]);
		d.setOffset(builder, loc, values[kOffsetPosInMemRefDescriptor]);

		int64_t rank = type.getRank();
		for (unsigned i = 0; i < rank; ++i) {
		d.setSize(builder, loc, i, values[kSizePosInMemRefDescriptor + i]);
		d.setStride(builder, loc, i, values[kSizePosInMemRefDescriptor + rank + i]);
		}

		return d;
		}

		/// Builds IR extracting individual elements of a MemRef descriptor structure
		/// and returning them as `results` list.
		void MemRefDescriptor::unpack(OpBuilder &builder, Location loc, Value packed,
		MemRefType type,
		SmallVectorImpl<Value> &results) {
		int64_t rank = type.getRank();
		results.reserve(results.size() + getNumUnpackedValues(type));
		nicolasvasilacheUnsubmitted Done Reply Inline Actions call `getNumUnpackedValues` for a single source of truth. nicolasvasilache: call `getNumUnpackedValues` for a single source of truth.

		MemRefDescriptor d(packed);
		results.push_back(d.allocatedPtr(builder, loc));
		results.push_back(d.alignedPtr(builder, loc));
		results.push_back(d.offset(builder, loc));
		for (int64_t i = 0; i < rank; ++i)
		results.push_back(d.size(builder, loc, i));
		for (int64_t i = 0; i < rank; ++i)
		results.push_back(d.stride(builder, loc, i));
		}

		/// Returns the number of non-aggregate values that would be produced by
		/// `unpack`.
		unsigned MemRefDescriptor::getNumUnpackedValues(MemRefType type) {
		// Two pointers, offset, <rank> sizes, <rank> shapes.
		return 3 + 2 * type.getRank();
		}

		/============================================================================/
		/* MemRefDescriptorView implementation. */
		/============================================================================/

		MemRefDescriptorView::MemRefDescriptorView(ValueRange range)
		: rank((range.size() - kSizePosInMemRefDescriptor) / 2), elements(range) {}

		Value MemRefDescriptorView::allocatedPtr() {
		return elements[kAllocatedPtrPosInMemRefDescriptor];
		}

		Value MemRefDescriptorView::alignedPtr() {
		return elements[kAlignedPtrPosInMemRefDescriptor];
		}

		Value MemRefDescriptorView::offset() {
		return elements[kOffsetPosInMemRefDescriptor];
		}

		Value MemRefDescriptorView::size(unsigned pos) {
		return elements[kSizePosInMemRefDescriptor + pos];
		}

		Value MemRefDescriptorView::stride(unsigned pos) {
		return elements[kSizePosInMemRefDescriptor + rank + pos];
		}

/============================================================================/		/============================================================================/
/* UnrankedMemRefDescriptor implementation */		/* UnrankedMemRefDescriptor implementation */
/============================================================================/		/============================================================================/

/// Construct a helper for the given descriptor value.		/// Construct a helper for the given descriptor value.
UnrankedMemRefDescriptor::UnrankedMemRefDescriptor(Value descriptor)		UnrankedMemRefDescriptor::UnrankedMemRefDescriptor(Value descriptor)
: StructBuilder(descriptor) {}		: StructBuilder(descriptor) {}

Show All 15 Lines
Value UnrankedMemRefDescriptor::memRefDescPtr(OpBuilder &builder,		Value UnrankedMemRefDescriptor::memRefDescPtr(OpBuilder &builder,
Location loc) {		Location loc) {
return extractPtr(builder, loc, kPtrInUnrankedMemRefDescriptor);		return extractPtr(builder, loc, kPtrInUnrankedMemRefDescriptor);
}		}
void UnrankedMemRefDescriptor::setMemRefDescPtr(OpBuilder &builder,		void UnrankedMemRefDescriptor::setMemRefDescPtr(OpBuilder &builder,
Location loc, Value v) {		Location loc, Value v) {
setPtr(builder, loc, kPtrInUnrankedMemRefDescriptor, v);		setPtr(builder, loc, kPtrInUnrankedMemRefDescriptor, v);
}		}

		/// Builds IR populating an unranked MemRef descriptor structure from a list
		/// of individual constituent values in the following order:
		/// - rank of the memref;
		/// - pointer to the memref descriptor.
		Value UnrankedMemRefDescriptor::pack(OpBuilder &builder, Location loc,
		LLVMTypeConverter &converter,
		UnrankedMemRefType type,
		ValueRange values) {
		Type llvmType = converter.convertType(type);
		auto d = UnrankedMemRefDescriptor::undef(builder, loc, llvmType);

		d.setRank(builder, loc, values[kRankInUnrankedMemRefDescriptor]);
		d.setMemRefDescPtr(builder, loc, values[kPtrInUnrankedMemRefDescriptor]);
		return d;
		}

		/// Builds IR extracting individual elements that compose an unranked memref
		/// descriptor and returns them as `results` list.
		void UnrankedMemRefDescriptor::unpack(OpBuilder &builder, Location loc,
		Value packed,
		SmallVectorImpl<Value> &results) {
		UnrankedMemRefDescriptor d(packed);
		results.reserve(results.size() + 2);
		results.push_back(d.rank(builder, loc));
		results.push_back(d.memRefDescPtr(builder, loc));
		}

namespace {		namespace {
// Base class for Standard to LLVM IR op conversions. Matches the Op type		// Base class for Standard to LLVM IR op conversions. Matches the Op type
// provided as template argument. Carries a reference to the LLVM dialect in		// provided as template argument. Carries a reference to the LLVM dialect in
// case it is necessary for rewriters.		// case it is necessary for rewriters.
template <typename SourceOp>		template <typename SourceOp>
class LLVMLegalizationPattern : public LLVMOpLowering {		class LLVMLegalizationPattern : public LLVMOpLowering {
public:		public:
// Construct a conversion pattern.		// Construct a conversion pattern.
Show All 31 Lines	Value createIndexConstant(ConversionPatternRewriter &builder, Location loc,
uint64_t value) const {		uint64_t value) const {
return createIndexAttrConstant(builder, loc, getIndexType(), value);		return createIndexAttrConstant(builder, loc, getIndexType(), value);
}		}

protected:		protected:
LLVM::LLVMDialect &dialect;		LLVM::LLVMDialect &dialect;
};		};

		/// Only retain those attributes that are not constructed by
		/// `LLVMFuncOp::build`. If `filterArgAttrs` is set, also filter out argument
		/// attributes.
		static void filterFuncAttributes(ArrayRef<NamedAttribute> attrs,
		bool filterArgAttrs,
		SmallVectorImpl<NamedAttribute> &result) {
		for (const auto &attr : attrs) {
		if (attr.first.is(SymbolTable::getSymbolAttrName()) \|\|
		attr.first.is(impl::getTypeAttrName()) \|\|
		attr.first.is("std.varargs") \|\|
		rriddleUnsubmitted Done Reply Inline Actions I thought std.varargs was removed? Also, you can use `op->getDialectAttrs()` to filter out the ones without a prefix. rriddle: I thought std.varargs was removed? Also, you can use `op->getDialectAttrs()` to filter out the…
		ftynseAuthorUnsubmitted Done Reply Inline Actions It was still there in the code, so I decided not to touch it in this patch. Will take a look in a follow-up. ftynse: It was still there in the code, so I decided not to touch it in this patch. Will take a look in…
		(filterArgAttrs && impl::isArgAttrName(attr.first.strref())))
		continue;
		result.push_back(attr);
		}
		}

		/// Creates an auxiliary function with pointer-to-memref-descriptor-struct
		/// arguments instead of unpacked arguments. This function can be called from C
		/// by passing a pointer to a C struct corresponding to a memref descriptor.
		/// Internally, the auxiliary function unpacks the descriptor into individual
		/// components and forwards them to `newFuncOp`.
		static void wrapForExternalCallers(OpBuilder &rewriter, Location loc,
		LLVMTypeConverter &typeConverter,
		FuncOp funcOp, LLVM::LLVMFuncOp newFuncOp) {
		auto type = funcOp.getType();
		SmallVector<NamedAttribute, 4> attributes;
		filterFuncAttributes(funcOp.getAttrs(), /filterArgAttrs=/false, attributes);
		auto wrapperFuncOp = rewriter.create<LLVM::LLVMFuncOp>(
		loc, llvm::formatv("_mlir_ciface_{0}", funcOp.getName()).str(),
		typeConverter.convertFunctionTypeCWrapper(type), LLVM::Linkage::External,
		attributes);

		OpBuilder::InsertionGuard guard(rewriter);
		rewriter.setInsertionPointToStart(wrapperFuncOp.addEntryBlock());

		SmallVector<Value, 8> args;
		for (auto &en : llvm::enumerate(type.getInputs())) {
		Value arg = wrapperFuncOp.getArgument(en.index());
		if (auto memrefType = en.value().dyn_cast<MemRefType>()) {
		Value loaded = rewriter.create<LLVM::LoadOp>(loc, arg);
		MemRefDescriptor::unpack(rewriter, loc, loaded, memrefType, args);
		continue;
		}
		if (en.value().isa<UnrankedMemRefType>()) {
		Value loaded = rewriter.create<LLVM::LoadOp>(loc, arg);
		UnrankedMemRefDescriptor::unpack(rewriter, loc, loaded, args);
		continue;
		}

		args.push_back(wrapperFuncOp.getArgument(en.index()));
		}
		auto call = rewriter.create<LLVM::CallOp>(loc, newFuncOp, args);
		rewriter.create<LLVM::ReturnOp>(loc, call.getResults());
		}

		/// Creates an auxiliary function with pointer-to-memref-descriptor-struct
		/// arguments instead of unpacked arguments. Creates a body for the (external)
		/// `newFuncOp` that allocates a memref descriptor on stack, packs the
		/// individual arguments into this descriptor and passes a pointer to it into
		/// the auxiliary function. This auxiliary external function is now compatible
		/// with functions defined in C using pointers to C structs corresponding to a
		/// memref descriptor.
		static void wrapExternalFunction(OpBuilder &builder, Location loc,
		LLVMTypeConverter &typeConverter,
		FuncOp funcOp, LLVM::LLVMFuncOp newFuncOp) {
		OpBuilder::InsertionGuard guard(builder);

		LLVM::LLVMType wrapperType =
		typeConverter.convertFunctionTypeCWrapper(funcOp.getType());
		// This conversion can only fail if it could not convert one of the argument
		// types. But since it has been applies to a non-wrapper function before, it
		// should have failed earlier and not reach this point at all.
		assert(wrapperType && "unexpected type conversion failure");

		SmallVector<NamedAttribute, 4> attributes;
		filterFuncAttributes(funcOp.getAttrs(), /filterArgAttrs=/false, attributes);

		// Create the auxiliary function.
		auto wrapperFunc = builder.create<LLVM::LLVMFuncOp>(
		loc, llvm::formatv("_mlir_ciface_{0}", funcOp.getName()).str(),
		wrapperType, LLVM::Linkage::External, attributes);

		builder.setInsertionPointToStart(newFuncOp.addEntryBlock());

		// Get a ValueRange containing argument types. Note that ValueRange is
		// currently not constructible from a pair of iterators pointing to
		// BlockArgument.
		FunctionType type = funcOp.getType();
		SmallVector<Value, 8> args;
		args.reserve(type.getNumInputs());
		auto wrapperArgIters = newFuncOp.getArguments();
		SmallVector<Value, 8> wrapperArgs(wrapperArgIters.begin(),
		wrapperArgIters.end());
		ValueRange wrapperArgsRange(wrapperArgs);

		// Iterate over the inputs of the original function and pack values into
		// memref descriptors if the original type is a memref.
		for (auto &en : llvm::enumerate(type.getInputs())) {
		Value arg;
		int numToDrop = 1;
		auto memRefType = en.value().dyn_cast<MemRefType>();
		auto unrankedMemRefType = en.value().dyn_cast<UnrankedMemRefType>();
		if (memRefType \|\| unrankedMemRefType) {
		numToDrop = memRefType
		? MemRefDescriptor::getNumUnpackedValues(memRefType)
		: UnrankedMemRefDescriptor::getNumUnpackedValues();
		Value packed =
		memRefType
		? MemRefDescriptor::pack(builder, loc, typeConverter, memRefType,
		wrapperArgsRange.take_front(numToDrop))
		: UnrankedMemRefDescriptor::pack(
		builder, loc, typeConverter, unrankedMemRefType,
		wrapperArgsRange.take_front(numToDrop));

		auto ptrTy = packed.getType().cast<LLVM::LLVMType>().getPointerTo();
		Value one = builder.create<LLVM::ConstantOp>(
		loc, typeConverter.convertType(builder.getIndexType()),
		builder.getIntegerAttr(builder.getIndexType(), 1));
		Value allocated =
		builder.create<LLVM::AllocaOp>(loc, ptrTy, one, /alignment=/0);
		builder.create<LLVM::StoreOp>(loc, packed, allocated);
		arg = allocated;
		} else {
		arg = wrapperArgsRange[0];
		}
		dcaballeUnsubmitted Not Done Reply Inline Actions drop {} dcaballe: drop {}
		ftynseAuthorUnsubmitted Done Reply Inline Actions I prefer not to becase `if` has braces. ftynse: I prefer not to becase `if` has braces.

		args.push_back(arg);
		wrapperArgsRange = wrapperArgsRange.drop_front(numToDrop);
		}
		assert(wrapperArgsRange.empty() && "did not map some of the arguments");

		auto call = builder.create<LLVM::CallOp>(loc, wrapperFunc, args);
		builder.create<LLVM::ReturnOp>(loc, call.getResults());
		}

struct FuncOpConversionBase : public LLVMLegalizationPattern<FuncOp> {		struct FuncOpConversionBase : public LLVMLegalizationPattern<FuncOp> {
protected:		protected:
using LLVMLegalizationPattern::LLVMLegalizationPattern;		using LLVMLegalizationPattern<FuncOp>::LLVMLegalizationPattern;
using UnsignedTypePair = std::pair<unsigned, Type>;		using UnsignedTypePair = std::pair<unsigned, Type>;

// Gather the positions and types of memref-typed arguments in a given		// Gather the positions and types of memref-typed arguments in a given
// FunctionType.		// FunctionType.
void getMemRefArgIndicesAndTypes(		void getMemRefArgIndicesAndTypes(
FunctionType type, SmallVectorImpl<UnsignedTypePair> &argsInfo) const {		FunctionType type, SmallVectorImpl<UnsignedTypePair> &argsInfo) const {
argsInfo.reserve(type.getNumInputs());		argsInfo.reserve(type.getNumInputs());
for (auto en : llvm::enumerate(type.getInputs())) {		for (auto en : llvm::enumerate(type.getInputs())) {
Show All 9 Lines	convertFuncOpToLLVMFuncOp(FuncOp funcOp,
ConversionPatternRewriter &rewriter) const {		ConversionPatternRewriter &rewriter) const {
// Convert the original function arguments. They are converted using the		// Convert the original function arguments. They are converted using the
// LLVMTypeConverter provided to this legalization pattern.		// LLVMTypeConverter provided to this legalization pattern.
auto varargsAttr = funcOp.getAttrOfType<BoolAttr>("std.varargs");		auto varargsAttr = funcOp.getAttrOfType<BoolAttr>("std.varargs");
TypeConverter::SignatureConversion result(funcOp.getNumArguments());		TypeConverter::SignatureConversion result(funcOp.getNumArguments());
auto llvmType = lowering.convertFunctionSignature(		auto llvmType = lowering.convertFunctionSignature(
funcOp.getType(), varargsAttr && varargsAttr.getValue(), result);		funcOp.getType(), varargsAttr && varargsAttr.getValue(), result);

// Only retain those attributes that are not constructed by build.		// Propagate argument attributes to all converted arguments obtained after
		// converting a given original argument.
SmallVector<NamedAttribute, 4> attributes;		SmallVector<NamedAttribute, 4> attributes;
for (const auto &attr : funcOp.getAttrs()) {		filterFuncAttributes(funcOp.getAttrs(), /filterArgAttrs=/true,
if (attr.first.is(SymbolTable::getSymbolAttrName()) \|\|		attributes);
attr.first.is(impl::getTypeAttrName()) \|\|		for (unsigned i = 0, e = funcOp.getNumArguments(); i < e; ++i) {
attr.first.is("std.varargs"))		auto attr = impl::getArgAttrDict(funcOp, i);
		if (!attr)
continue;		continue;
attributes.push_back(attr);
		auto mapping = result.getInputMapping(i);
		assert(mapping.hasValue() && "unexpected deletion of function argument");

		SmallString<8> name;
		for (size_t j = mapping->inputNo; j < mapping->size; ++j) {
		impl::getArgAttrName(j, name);
		attributes.push_back(rewriter.getNamedAttr(name, attr));
		}
		dcaballeUnsubmitted Done Reply Inline Actions drop {} dcaballe: drop {}
}		}

// Create an LLVM function, use external linkage by default until MLIR		// Create an LLVM function, use external linkage by default until MLIR
// functions have linkage.		// functions have linkage.
auto newFuncOp = rewriter.create<LLVM::LLVMFuncOp>(		auto newFuncOp = rewriter.create<LLVM::LLVMFuncOp>(
funcOp.getLoc(), funcOp.getName(), llvmType, LLVM::Linkage::External,		funcOp.getLoc(), funcOp.getName(), llvmType, LLVM::Linkage::External,
attributes);		attributes);
rewriter.inlineRegionBefore(funcOp.getBody(), newFuncOp.getBody(),		rewriter.inlineRegionBefore(funcOp.getBody(), newFuncOp.getBody(),
newFuncOp.end());		newFuncOp.end());
// Tell the rewriter to convert the region signature.		// Tell the rewriter to convert the region signature.
rewriter.applySignatureConversion(&newFuncOp.getBody(), result);		rewriter.applySignatureConversion(&newFuncOp.getBody(), result);

return newFuncOp;		return newFuncOp;
}		}
};		};

/// FuncOp legalization pattern that converts MemRef arguments to pointers to		/// FuncOp legalization pattern that converts MemRef arguments to pointers to
/// MemRef descriptors (LLVM struct data types) containing all the MemRef type		/// MemRef descriptors (LLVM struct data types) containing all the MemRef type
/// information.		/// information.
struct FuncOpConversion : public FuncOpConversionBase {		struct FuncOpConversion : public FuncOpConversionBase {
using FuncOpConversionBase::FuncOpConversionBase;		FuncOpConversion(LLVM::LLVMDialect &dialect, LLVMTypeConverter &converter,
		bool emitCWrappers)
		: FuncOpConversionBase(dialect, converter), emitWrappers(emitCWrappers) {}

PatternMatchResult		PatternMatchResult
matchAndRewrite(Operation *op, ArrayRef<Value> operands,		matchAndRewrite(Operation *op, ArrayRef<Value> operands,
ConversionPatternRewriter &rewriter) const override {		ConversionPatternRewriter &rewriter) const override {
auto funcOp = cast<FuncOp>(op);		auto funcOp = cast<FuncOp>(op);

// Store the positions of memref-typed arguments so that we can emit loads
// from them to follow the calling convention.
SmallVector<UnsignedTypePair, 4> promotedArgsInfo;
getMemRefArgIndicesAndTypes(funcOp.getType(), promotedArgsInfo);

auto newFuncOp = convertFuncOpToLLVMFuncOp(funcOp, rewriter);		auto newFuncOp = convertFuncOpToLLVMFuncOp(funcOp, rewriter);
		if (emitWrappers) {
// Insert loads from memref descriptor pointers in function bodies.		if (newFuncOp.isExternal())
if (!newFuncOp.getBody().empty()) {		wrapExternalFunction(rewriter, op->getLoc(), lowering, funcOp,
Block *firstBlock = &newFuncOp.getBody().front();		newFuncOp);
rewriter.setInsertionPoint(firstBlock, firstBlock->begin());		else
for (const auto &argInfo : promotedArgsInfo) {		wrapForExternalCallers(rewriter, op->getLoc(), lowering, funcOp,
BlockArgument arg = firstBlock->getArgument(argInfo.first);		newFuncOp);
Value loaded = rewriter.create<LLVM::LoadOp>(funcOp.getLoc(), arg);
rewriter.replaceUsesOfBlockArgument(arg, loaded);
}
}		}

rewriter.eraseOp(op);		rewriter.eraseOp(op);
return matchSuccess();		return matchSuccess();
}		}

		private:
		/// If true, also create the adaptor functions having signatures compatible
		/// with those produced by clang.
		const bool emitWrappers;
};		};

/// FuncOp legalization pattern that converts MemRef arguments to bare pointers		/// FuncOp legalization pattern that converts MemRef arguments to bare pointers
/// to the MemRef element type. This will impact the calling convention and ABI.		/// to the MemRef element type. This will impact the calling convention and ABI.
struct BarePtrFuncOpConversion : public FuncOpConversionBase {		struct BarePtrFuncOpConversion : public FuncOpConversionBase {
using FuncOpConversionBase::FuncOpConversionBase;		using FuncOpConversionBase::FuncOpConversionBase;

PatternMatchResult		PatternMatchResult
▲ Show 20 Lines • Show All 1,622 Lines • ▼ Show 20 Lines	void mlir::populateStdToLLVMMemoryConversionPatters(
patterns.insert<		patterns.insert<
AllocOpLowering,		AllocOpLowering,
DeallocOpLowering>(		DeallocOpLowering>(
*converter.getDialect(), converter, useAlloca);		*converter.getDialect(), converter, useAlloca);
// clang-format on		// clang-format on
}		}

void mlir::populateStdToLLVMDefaultFuncOpConversionPattern(		void mlir::populateStdToLLVMDefaultFuncOpConversionPattern(
LLVMTypeConverter &converter, OwningRewritePatternList &patterns) {		LLVMTypeConverter &converter, OwningRewritePatternList &patterns,
patterns.insert<FuncOpConversion>(*converter.getDialect(), converter);		bool emitCWrappers) {
		patterns.insert<FuncOpConversion>(*converter.getDialect(), converter,
		emitCWrappers);
}		}

void mlir::populateStdToLLVMConversionPatterns(		void mlir::populateStdToLLVMConversionPatterns(
LLVMTypeConverter &converter, OwningRewritePatternList &patterns,		LLVMTypeConverter &converter, OwningRewritePatternList &patterns,
bool useAlloca) {		bool useAlloca, bool emitCWrappers) {
populateStdToLLVMDefaultFuncOpConversionPattern(converter, patterns);		populateStdToLLVMDefaultFuncOpConversionPattern(converter, patterns,
		emitCWrappers);
populateStdToLLVMNonMemoryConversionPatterns(converter, patterns);		populateStdToLLVMNonMemoryConversionPatterns(converter, patterns);
populateStdToLLVMMemoryConversionPatters(converter, patterns, useAlloca);		populateStdToLLVMMemoryConversionPatters(converter, patterns, useAlloca);
}		}

static void populateStdToLLVMBarePtrFuncOpConversionPattern(		static void populateStdToLLVMBarePtrFuncOpConversionPattern(
LLVMTypeConverter &converter, OwningRewritePatternList &patterns) {		LLVMTypeConverter &converter, OwningRewritePatternList &patterns) {
patterns.insert<BarePtrFuncOpConversion>(*converter.getDialect(), converter);		patterns.insert<BarePtrFuncOpConversion>(*converter.getDialect(), converter);
}		}
▲ Show 20 Lines • Show All 49 Lines • ▼ Show 20 Lines
LLVMTypeConverter::promoteMemRefDescriptors(Location loc, ValueRange opOperands,		LLVMTypeConverter::promoteMemRefDescriptors(Location loc, ValueRange opOperands,
ValueRange operands,		ValueRange operands,
OpBuilder &builder) {		OpBuilder &builder) {
SmallVector<Value, 4> promotedOperands;		SmallVector<Value, 4> promotedOperands;
promotedOperands.reserve(operands.size());		promotedOperands.reserve(operands.size());
for (auto it : llvm::zip(opOperands, operands)) {		for (auto it : llvm::zip(opOperands, operands)) {
auto operand = std::get<0>(it);		auto operand = std::get<0>(it);
auto llvmOperand = std::get<1>(it);		auto llvmOperand = std::get<1>(it);
if (!operand.getType().isa<MemRefType>() &&
!operand.getType().isa<UnrankedMemRefType>()) {		if (operand.getType().isa<UnrankedMemRefType>()) {
promotedOperands.push_back(operand);		UnrankedMemRefDescriptor::unpack(builder, loc, llvmOperand,
		promotedOperands);
continue;		continue;
}		}
promotedOperands.push_back(		if (auto memrefType = operand.getType().dyn_cast<MemRefType>()) {
promoteOneMemRefDescriptor(loc, llvmOperand, builder));		MemRefDescriptor::unpack(builder, loc, llvmOperand,
		operand.getType().cast<MemRefType>(),
		promotedOperands);
		continue;
		}

		promotedOperands.push_back(operand);
}		}
return promotedOperands;		return promotedOperands;
}		}

namespace {		namespace {
/// A pass converting MLIR operations into the LLVM IR dialect.		/// A pass converting MLIR operations into the LLVM IR dialect.
struct LLVMLoweringPass : public ModulePass<LLVMLoweringPass> {		struct LLVMLoweringPass : public ModulePass<LLVMLoweringPass> {
/// Creates an LLVM lowering pass.		/// Creates an LLVM lowering pass.
explicit LLVMLoweringPass(bool useAlloca = false,		explicit LLVMLoweringPass(bool useAlloca = false,
bool useBarePtrCallConv = false)		bool useBarePtrCallConv = false,
: useAlloca(useAlloca), useBarePtrCallConv(useBarePtrCallConv) {}		bool emitCWrappers = false)
		: useAlloca(useAlloca), useBarePtrCallConv(useBarePtrCallConv),
		emitCWrappers(emitCWrappers) {}

/// Run the dialect converter on the module.		/// Run the dialect converter on the module.
void runOnModule() override {		void runOnModule() override {
		if (useBarePtrCallConv && emitCWrappers) {
		getModule().emitError()
		rriddleUnsubmitted Done Reply Inline Actions getModule().emitError ? rriddle: getModule().emitError ?
		<< "incompatible conversion options: bare-pointer calling convention "
		"and C wrapper emission";
		signalPassFailure();
		return;
		}

ModuleOp m = getModule();		ModuleOp m = getModule();
LLVM::ensureDistinctSuccessors(m);		LLVM::ensureDistinctSuccessors(m);

LLVMTypeConverterCustomization customs;		LLVMTypeConverterCustomization customs;
customs.funcArgConverter = useBarePtrCallConv ? barePtrFuncArgTypeConverter		customs.funcArgConverter = useBarePtrCallConv ? barePtrFuncArgTypeConverter
: structFuncArgTypeConverter;		: structFuncArgTypeConverter;
LLVMTypeConverter typeConverter(&getContext(), customs);		LLVMTypeConverter typeConverter(&getContext(), customs);

OwningRewritePatternList patterns;		OwningRewritePatternList patterns;
if (useBarePtrCallConv)		if (useBarePtrCallConv)
populateStdToLLVMBarePtrConversionPatterns(typeConverter, patterns,		populateStdToLLVMBarePtrConversionPatterns(typeConverter, patterns,
useAlloca);		useAlloca);
else		else
populateStdToLLVMConversionPatterns(typeConverter, patterns, useAlloca);		populateStdToLLVMConversionPatterns(typeConverter, patterns, useAlloca,
		emitCWrappers);

ConversionTarget target(getContext());		ConversionTarget target(getContext());
target.addLegalDialect<LLVM::LLVMDialect>();		target.addLegalDialect<LLVM::LLVMDialect>();
if (failed(applyPartialConversion(m, target, patterns, &typeConverter)))		if (failed(applyPartialConversion(m, target, patterns, &typeConverter)))
signalPassFailure();		signalPassFailure();
}		}

/// Use `alloca` instead of `call @malloc` for converting std.alloc.		/// Use `alloca` instead of `call @malloc` for converting std.alloc.
bool useAlloca;		bool useAlloca;

/// Convert memrefs to bare pointers in function signatures.		/// Convert memrefs to bare pointers in function signatures.
bool useBarePtrCallConv;		bool useBarePtrCallConv;

		/// Emit wrappers for C-compatible pointer-to-struct memref descriptors.
		bool emitCWrappers;
};		};
} // end namespace		} // end namespace

std::unique_ptr<OpPassBase<ModuleOp>>		std::unique_ptr<OpPassBase<ModuleOp>>
mlir::createLowerToLLVMPass(bool useAlloca) {		mlir::createLowerToLLVMPass(bool useAlloca, bool emitCWrappers) {
return std::make_unique<LLVMLoweringPass>(useAlloca);		return std::make_unique<LLVMLoweringPass>(useAlloca, emitCWrappers);
}		}

static PassRegistration<LLVMLoweringPass>		static PassRegistration<LLVMLoweringPass>
pass("convert-std-to-llvm",		pass(PASS_NAME,
"Convert scalar and vector operations from the "		"Convert scalar and vector operations from the "
"Standard to the LLVM dialect",		"Standard to the LLVM dialect",
[] {		[] {
return std::make_unique<LLVMLoweringPass>(		return std::make_unique<LLVMLoweringPass>(
clUseAlloca.getValue(), clUseBarePtrCallConv.getValue());		clUseAlloca.getValue(), clUseBarePtrCallConv.getValue(),
		clEmitCWrappers.getValue());
});		});

mlir/lib/Dialect/GPU/IR/GPUDialect.cpp

Show First 20 Lines • Show All 84 Lines • ▼ Show 20 Lines	auto walkResult = module.walk([&module](LaunchFuncOp launchOp) -> WalkResult {
if (!kernelGPUFunction && !kernelLLVMFunction)		if (!kernelGPUFunction && !kernelLLVMFunction)
return launchOp.emitOpError("kernel function '")		return launchOp.emitOpError("kernel function '")
<< kernelName << "' is undefined";		<< kernelName << "' is undefined";
if (!kernelFunc->getAttrOfType<mlir::UnitAttr>(		if (!kernelFunc->getAttrOfType<mlir::UnitAttr>(
GPUDialect::getKernelFuncAttrName()))		GPUDialect::getKernelFuncAttrName()))
return launchOp.emitOpError("kernel function is missing the '")		return launchOp.emitOpError("kernel function is missing the '")
<< GPUDialect::getKernelFuncAttrName() << "' attribute";		<< GPUDialect::getKernelFuncAttrName() << "' attribute";

		// TODO(ntv,zinenko,herhut): if the kernel function has been converted to
		// the LLVM dialect but the caller hasn't (which happens during the
		// separate compilation), do not check type correspondance as it would
		// require the verifier to be aware of the LLVM type conversion.
		if (kernelLLVMFunction)
		return success();

unsigned actualNumArguments = launchOp.getNumKernelOperands();		unsigned actualNumArguments = launchOp.getNumKernelOperands();
unsigned expectedNumArguments = kernelLLVMFunction		unsigned expectedNumArguments = kernelGPUFunction.getNumArguments();
? kernelLLVMFunction.getNumArguments()
: kernelGPUFunction.getNumArguments();
if (expectedNumArguments != actualNumArguments)		if (expectedNumArguments != actualNumArguments)
return launchOp.emitOpError("got ")		return launchOp.emitOpError("got ")
<< actualNumArguments << " kernel operands but expected "		<< actualNumArguments << " kernel operands but expected "
<< expectedNumArguments;		<< expectedNumArguments;

// Due to the ordering of the current impl of lowering and LLVMLowering,		auto functionType = kernelGPUFunction.getType();
// type checks need to be temporarily disabled.		for (unsigned i = 0; i < expectedNumArguments; ++i) {
// TODO(ntv,zinenko,herhut): reactivate checks once "changing gpu.launchFunc		if (launchOp.getKernelOperand(i).getType() != functionType.getInput(i)) {
// to encode target module" has landed.		return launchOp.emitOpError("type of function argument ")
// auto functionType = kernelFunc.getType();		<< i << " does not match";
// for (unsigned i = 0; i < numKernelFuncArgs; ++i) {		}
// if (getKernelOperand(i).getType() != functionType.getInput(i)) {		}
// return emitOpError("type of function argument ")
// << i << " does not match";
// }
// }

return success();		return success();
});		});

return walkResult.wasInterrupted() ? failure() : success();		return walkResult.wasInterrupted() ? failure() : success();
}		}

template <typename T> static LogicalResult verifyIndexOp(T op) {		template <typename T> static LogicalResult verifyIndexOp(T op) {
▲ Show 20 Lines • Show All 667 Lines • Show Last 20 Lines

mlir/lib/Transforms/DialectConversion.cpp

Show First 20 Lines • Show All 395 Lines • ▼ Show 20 Lines	if (inputMap->size == 1) {
continue;		continue;
}		}

// Otherwise, this is a 1->N mapping. Call into the provided type converter		// Otherwise, this is a 1->N mapping. Call into the provided type converter
// to pack the new values.		// to pack the new values.
auto replArgs = newArgs.slice(inputMap->inputNo, inputMap->size);		auto replArgs = newArgs.slice(inputMap->inputNo, inputMap->size);
Operation *cast = typeConverter->materializeConversion(		Operation *cast = typeConverter->materializeConversion(
rewriter, origArg.getType(), replArgs, loc);		rewriter, origArg.getType(), replArgs, loc);
assert(cast->getNumResults() == 1 &&		assert(cast->getNumResults() == 1);
		rriddleUnsubmitted Not Done Reply Inline Actions This is relaxed because you are inserting multiple operations to perform the conversion? rriddle: This is relaxed because you are inserting multiple operations to perform the conversion?
		ftynseAuthorUnsubmitted Done Reply Inline Actions Exactly. ftynse: Exactly.
		rriddleUnsubmitted Not Done Reply Inline Actions We erase use_empty cast operations during applyRewrites. How does this interact with generating multiple operations when casting? (This doesn't have to block this revision, but just curious on your thoughts there) rriddle: We erase use_empty cast operations during applyRewrites. How does this interact with generating…
		ftynseAuthorUnsubmitted Done Reply Inline Actions It erases the last operation, but keeps the rest, which are all equally dead and have no side effects. It's a variant of the problem we face a lot in rewrites: do we clean up immediately or do we expect the canonicalizer to clean up later. I don't have a good answer here, but I wouldn't go out of my way for cleaning. In general, I considered the following: since we use DialectConversionRewriter, we can use its undo stack to remove all operations introduced in the cast materialization; with multiple operations generated, I'm not convinced that we can rely only on one of them (e.g., the last one) to consider the entire conversion dead; we can have stores and other side-effecting operations; the canonicalizer is better aware of deadness, so it sounds reasonable to rely on it instead to remove dead casts; (this is similar to the decision not to implement valuesToRemoveIfDead in replaceAllUsesWith IMO) if we want the cleanup, maybe we can try and call into the canonicalization directly. ftynse: It erases the last operation, but keeps the rest, which are all equally dead and have no side…
cast->getNumOperands() == replArgs.size());
mapping.map(origArg, cast->getResult(0));		mapping.map(origArg, cast->getResult(0));
info.argInfo[i] =		info.argInfo[i] =
ConvertedArgInfo(inputMap->inputNo, inputMap->size, cast->getResult(0));		ConvertedArgInfo(inputMap->inputNo, inputMap->size, cast->getResult(0));
}		}

// Remove the original block from the region and return the new one.		// Remove the original block from the region and return the new one.
insertConversion(newBlock, std::move(info));		insertConversion(newBlock, std::move(info));
return newBlock;		return newBlock;
▲ Show 20 Lines • Show All 1,522 Lines • Show Last 20 Lines

mlir/test/Conversion/GPUToCUDA/lower-launch-func-to-cuda.mlir

	// RUN: mlir-opt %s --launch-func-to-cuda \| FileCheck %s			// RUN: mlir-opt %s --launch-func-to-cuda \| FileCheck %s

	module attributes {gpu.container_module} {			module attributes {gpu.container_module} {

	// CHECK: llvm.mlir.global internal constant @[[kernel_name:.*]]("kernel\00")			// CHECK: llvm.mlir.global internal constant @[[kernel_name:.*]]("kernel\00")
	// CHECK: llvm.mlir.global internal constant @[[global:.*]]("CUBIN")			// CHECK: llvm.mlir.global internal constant @[[global:.*]]("CUBIN")

	gpu.module @kernel_module attributes {nvvm.cubin = "CUBIN"} {			gpu.module @kernel_module attributes {nvvm.cubin = "CUBIN"} {
	gpu.func @kernel(%arg0: !llvm.float, %arg1: !llvm<"float*">) attributes {gpu.kernel} {			llvm.func @kernel(%arg0: !llvm.float, %arg1: !llvm<"float*">) attributes {gpu.kernel} {
	gpu.return			llvm.return
	}			}
	}			}

	llvm.func @foo() {			llvm.func @foo() {
	%0 = "op"() : () -> (!llvm.float)			%0 = "op"() : () -> (!llvm.float)
	%1 = "op"() : () -> (!llvm<"float*">)			%1 = "op"() : () -> (!llvm<"float*">)
	%cst = llvm.mlir.constant(8 : index) : !llvm.i64			%cst = llvm.mlir.constant(8 : index) : !llvm.i64

	Show All 18 Lines

mlir/test/Conversion/StandardToLLVM/convert-argattrs.mlir

	// RUN: mlir-opt -convert-std-to-llvm %s \| FileCheck %s			// RUN: mlir-opt -convert-std-to-llvm %s \| FileCheck %s

				// CHECK-LABEL: func @check_attributes
	// CHECK-LABEL: func @check_attributes(%arg0: !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }*"> {dialect.a = true, dialect.b = 4 : i64}) {			// When expanding the memref to multiple arguments, argument attributes are replicated.
	// CHECK-NEXT: llvm.load %arg0 : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }*">			// CHECK-COUNT-7: {dialect.a = true, dialect.b = 4 : i64}
	func @check_attributes(%static: memref<10x20xf32> {dialect.a = true, dialect.b = 4 : i64 }) {			func @check_attributes(%static: memref<10x20xf32> {dialect.a = true, dialect.b = 4 : i64 }) {
	%c0 = constant 0 : index			%c0 = constant 0 : index
	%0 = load %static[%c0, %c0]: memref<10x20xf32>			%0 = load %static[%c0, %c0]: memref<10x20xf32>
	return			return
	}			}

	// CHECK-LABEL: func @external_func(!llvm<"{ float, float, i64, [2 x i64], [2 x i64] }*">)
	// CHECK: func @call_external(%[[arg:.]]: !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }">) {
	// CHECK: %[[ld:.]] = llvm.load %[[arg]] : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }">
	// CHECK: %[[c1:.*]] = llvm.mlir.constant(1 : index) : !llvm.i64
	// CHECK: %[[alloca:.]] = llvm.alloca %[[c1]] x !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }"> : (!llvm.i64) -> !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }">
	// CHECK: llvm.store %[[ld]], %[[alloca]] : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }*">
	// CHECK: call @external_func(%[[alloca]]) : (!llvm<"{ float, float, i64, [2 x i64], [2 x i64] }*">) -> ()
	func @external_func(memref<10x20xf32>)

	func @call_external(%static: memref<10x20xf32>) {
	call @external_func(%static) : (memref<10x20xf32>) -> ()
	return
	}

mlir/test/Conversion/StandardToLLVM/convert-dynamic-memref-ops.mlir

	// RUN: mlir-opt -convert-std-to-llvm %s \| FileCheck %s			// RUN: mlir-opt -convert-std-to-llvm %s \| FileCheck %s

	// CHECK-LABEL: func @check_strided_memref_arguments(			// CHECK-LABEL: func @check_strided_memref_arguments(
	// CHECK-COUNT-3: !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }*">			// CHECK-COUNT-2: !llvm<"float*">
				// CHECK-COUNT-5: !llvm.i64
				// CHECK-COUNT-2: !llvm<"float*">
				// CHECK-COUNT-5: !llvm.i64
				// CHECK-COUNT-2: !llvm<"float*">
				// CHECK-COUNT-5: !llvm.i64
	func @check_strided_memref_arguments(%static: memref<10x20xf32, affine_map<(i,j)->(20 * i + j + 1)>>,			func @check_strided_memref_arguments(%static: memref<10x20xf32, affine_map<(i,j)->(20 * i + j + 1)>>,
	%dynamic : memref<?x?xf32, affine_map<(i,j)[M]->(M * i + j + 1)>>,			%dynamic : memref<?x?xf32, affine_map<(i,j)[M]->(M * i + j + 1)>>,
	%mixed : memref<10x?xf32, affine_map<(i,j)[M]->(M * i + j + 1)>>) {			%mixed : memref<10x?xf32, affine_map<(i,j)[M]->(M * i + j + 1)>>) {
	return			return
	}			}

	// CHECK-LABEL: func @check_arguments(%arg0: !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }">, %arg1: !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }">, %arg2: !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }*">)			// CHECK-LABEL: func @check_arguments
				// CHECK-COUNT-2: !llvm<"float*">
				// CHECK-COUNT-5: !llvm.i64
				// CHECK-COUNT-2: !llvm<"float*">
				// CHECK-COUNT-5: !llvm.i64
				// CHECK-COUNT-2: !llvm<"float*">
				// CHECK-COUNT-5: !llvm.i64
	func @check_arguments(%static: memref<10x20xf32>, %dynamic : memref<?x?xf32>, %mixed : memref<10x?xf32>) {			func @check_arguments(%static: memref<10x20xf32>, %dynamic : memref<?x?xf32>, %mixed : memref<10x?xf32>) {
	return			return
	}			}

	// CHECK-LABEL: func @mixed_alloc(			// CHECK-LABEL: func @mixed_alloc(
	// CHECK: %[[M:.]]: !llvm.i64, %[[N:.]]: !llvm.i64) -> !llvm<"{ float, float, i64, [3 x i64], [3 x i64] }"> {			// CHECK: %[[M:.]]: !llvm.i64, %[[N:.]]: !llvm.i64) -> !llvm<"{ float, float, i64, [3 x i64], [3 x i64] }"> {
	func @mixed_alloc(%arg0: index, %arg1: index) -> memref<?x42x?xf32> {			func @mixed_alloc(%arg0: index, %arg1: index) -> memref<?x42x?xf32> {
	// CHECK-NEXT: %[[c42:.*]] = llvm.mlir.constant(42 : index) : !llvm.i64			// CHECK: %[[c42:.*]] = llvm.mlir.constant(42 : index) : !llvm.i64
	// CHECK-NEXT: llvm.mul %[[M]], %[[c42]] : !llvm.i64			// CHECK-NEXT: llvm.mul %[[M]], %[[c42]] : !llvm.i64
	// CHECK-NEXT: %[[sz:.]] = llvm.mul %{{.}}, %[[N]] : !llvm.i64			// CHECK-NEXT: %[[sz:.]] = llvm.mul %{{.}}, %[[N]] : !llvm.i64
	// CHECK-NEXT: %[[null:.]] = llvm.mlir.null : !llvm<"float">			// CHECK-NEXT: %[[null:.]] = llvm.mlir.null : !llvm<"float">
	// CHECK-NEXT: %[[one:.*]] = llvm.mlir.constant(1 : index) : !llvm.i64			// CHECK-NEXT: %[[one:.*]] = llvm.mlir.constant(1 : index) : !llvm.i64
	// CHECK-NEXT: %[[gep:.]] = llvm.getelementptr %[[null]][%[[one]]] : (!llvm<"float">, !llvm.i64) -> !llvm<"float*">			// CHECK-NEXT: %[[gep:.]] = llvm.getelementptr %[[null]][%[[one]]] : (!llvm<"float">, !llvm.i64) -> !llvm<"float*">
	// CHECK-NEXT: %[[sizeof:.]] = llvm.ptrtoint %[[gep]] : !llvm<"float"> to !llvm.i64			// CHECK-NEXT: %[[sizeof:.]] = llvm.ptrtoint %[[gep]] : !llvm<"float"> to !llvm.i64
	// CHECK-NEXT: %[[sz_bytes:.*]] = llvm.mul %[[sz]], %[[sizeof]] : !llvm.i64			// CHECK-NEXT: %[[sz_bytes:.*]] = llvm.mul %[[sz]], %[[sizeof]] : !llvm.i64
	// CHECK-NEXT: llvm.call @malloc(%[[sz_bytes]]) : (!llvm.i64) -> !llvm<"i8*">			// CHECK-NEXT: llvm.call @malloc(%[[sz_bytes]]) : (!llvm.i64) -> !llvm<"i8*">
	Show All 12 Lines
	// CHECK-NEXT: llvm.insertvalue %[[st1]], %{{.}}[4, 1] : !llvm<"{ float, float*, i64, [3 x i64], [3 x i64] }">			// CHECK-NEXT: llvm.insertvalue %[[st1]], %{{.}}[4, 1] : !llvm<"{ float, float*, i64, [3 x i64], [3 x i64] }">
	// CHECK-NEXT: llvm.insertvalue %[[N]], %{{.}}[3, 2] : !llvm<"{ float, float*, i64, [3 x i64], [3 x i64] }">			// CHECK-NEXT: llvm.insertvalue %[[N]], %{{.}}[3, 2] : !llvm<"{ float, float*, i64, [3 x i64], [3 x i64] }">
	// CHECK-NEXT: llvm.insertvalue %[[st2]], %{{.}}[4, 2] : !llvm<"{ float, float*, i64, [3 x i64], [3 x i64] }">			// CHECK-NEXT: llvm.insertvalue %[[st2]], %{{.}}[4, 2] : !llvm<"{ float, float*, i64, [3 x i64], [3 x i64] }">
	%0 = alloc(%arg0, %arg1) : memref<?x42x?xf32>			%0 = alloc(%arg0, %arg1) : memref<?x42x?xf32>
	// CHECK-NEXT: llvm.return %{{.}} : !llvm<"{ float, float*, i64, [3 x i64], [3 x i64] }">			// CHECK-NEXT: llvm.return %{{.}} : !llvm<"{ float, float*, i64, [3 x i64], [3 x i64] }">
	return %0 : memref<?x42x?xf32>			return %0 : memref<?x42x?xf32>
	}			}

	// CHECK-LABEL: func @mixed_dealloc(%arg0: !llvm<"{ float, float, i64, [3 x i64], [3 x i64] }*">) {			// CHECK-LABEL: func @mixed_dealloc
	func @mixed_dealloc(%arg0: memref<?x42x?xf32>) {			func @mixed_dealloc(%arg0: memref<?x42x?xf32>) {
	// CHECK-NEXT: %[[ld:.]] = llvm.load %{{.}} : !llvm<"{ float, float, i64, [3 x i64], [3 x i64] }*">			// CHECK: %[[ptr:.]] = llvm.extractvalue %{{.}}[0] : !llvm<"{ float, float, i64, [3 x i64], [3 x i64] }">
	// CHECK-NEXT: %[[ptr:.]] = llvm.extractvalue %[[ld]][0] : !llvm<"{ float, float*, i64, [3 x i64], [3 x i64] }">
	// CHECK-NEXT: %[[ptri8:.]] = llvm.bitcast %[[ptr]] : !llvm<"float"> to !llvm<"i8*">			// CHECK-NEXT: %[[ptri8:.]] = llvm.bitcast %[[ptr]] : !llvm<"float"> to !llvm<"i8*">
	// CHECK-NEXT: llvm.call @free(%[[ptri8]]) : (!llvm<"i8*">) -> ()			// CHECK-NEXT: llvm.call @free(%[[ptri8]]) : (!llvm<"i8*">) -> ()
	dealloc %arg0 : memref<?x42x?xf32>			dealloc %arg0 : memref<?x42x?xf32>
	// CHECK-NEXT: llvm.return			// CHECK-NEXT: llvm.return
	return			return
	}			}

	// CHECK-LABEL: func @dynamic_alloc(			// CHECK-LABEL: func @dynamic_alloc(
	// CHECK: %[[M:.]]: !llvm.i64, %[[N:.]]: !llvm.i64) -> !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }"> {			// CHECK: %[[M:.]]: !llvm.i64, %[[N:.]]: !llvm.i64) -> !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }"> {
	func @dynamic_alloc(%arg0: index, %arg1: index) -> memref<?x?xf32> {			func @dynamic_alloc(%arg0: index, %arg1: index) -> memref<?x?xf32> {
	// CHECK-NEXT: %[[sz:.*]] = llvm.mul %[[M]], %[[N]] : !llvm.i64			// CHECK: %[[sz:.*]] = llvm.mul %[[M]], %[[N]] : !llvm.i64
	// CHECK-NEXT: %[[null:.]] = llvm.mlir.null : !llvm<"float">			// CHECK-NEXT: %[[null:.]] = llvm.mlir.null : !llvm<"float">
	// CHECK-NEXT: %[[one:.*]] = llvm.mlir.constant(1 : index) : !llvm.i64			// CHECK-NEXT: %[[one:.*]] = llvm.mlir.constant(1 : index) : !llvm.i64
	// CHECK-NEXT: %[[gep:.]] = llvm.getelementptr %[[null]][%[[one]]] : (!llvm<"float">, !llvm.i64) -> !llvm<"float*">			// CHECK-NEXT: %[[gep:.]] = llvm.getelementptr %[[null]][%[[one]]] : (!llvm<"float">, !llvm.i64) -> !llvm<"float*">
	// CHECK-NEXT: %[[sizeof:.]] = llvm.ptrtoint %[[gep]] : !llvm<"float"> to !llvm.i64			// CHECK-NEXT: %[[sizeof:.]] = llvm.ptrtoint %[[gep]] : !llvm<"float"> to !llvm.i64
	// CHECK-NEXT: %[[sz_bytes:.*]] = llvm.mul %[[sz]], %[[sizeof]] : !llvm.i64			// CHECK-NEXT: %[[sz_bytes:.*]] = llvm.mul %[[sz]], %[[sizeof]] : !llvm.i64
	// CHECK-NEXT: llvm.call @malloc(%[[sz_bytes]]) : (!llvm.i64) -> !llvm<"i8*">			// CHECK-NEXT: llvm.call @malloc(%[[sz_bytes]]) : (!llvm.i64) -> !llvm<"i8*">
	// CHECK-NEXT: llvm.bitcast %{{.}} : !llvm<"i8"> to !llvm<"float*">			// CHECK-NEXT: llvm.bitcast %{{.}} : !llvm<"i8"> to !llvm<"float*">
	// CHECK-NEXT: llvm.mlir.undef : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }">			// CHECK-NEXT: llvm.mlir.undef : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }">
	// CHECK-NEXT: llvm.insertvalue %{{.}}, %{{.}}[0] : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }">			// CHECK-NEXT: llvm.insertvalue %{{.}}, %{{.}}[0] : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }">
	// CHECK-NEXT: llvm.insertvalue %{{.}}, %{{.}}[1] : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }">			// CHECK-NEXT: llvm.insertvalue %{{.}}, %{{.}}[1] : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }">
	// CHECK-NEXT: %[[off:.*]] = llvm.mlir.constant(0 : index) : !llvm.i64			// CHECK-NEXT: %[[off:.*]] = llvm.mlir.constant(0 : index) : !llvm.i64
	// CHECK-NEXT: llvm.insertvalue %[[off]], %{{.}}[2] : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">			// CHECK-NEXT: llvm.insertvalue %[[off]], %{{.}}[2] : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">
	// CHECK-NEXT: %[[st1:.*]] = llvm.mlir.constant(1 : index) : !llvm.i64			// CHECK-NEXT: %[[st1:.*]] = llvm.mlir.constant(1 : index) : !llvm.i64
	// CHECK-NEXT: %[[st0:.]] = llvm.mul %{{.}}, %[[N]] : !llvm.i64			// CHECK-NEXT: %[[st0:.]] = llvm.mul %{{.}}, %[[N]] : !llvm.i64
	// CHECK-NEXT: llvm.insertvalue %[[M]], %{{.}}[3, 0] : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">			// CHECK-NEXT: llvm.insertvalue %[[M]], %{{.}}[3, 0] : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">
	// CHECK-NEXT: llvm.insertvalue %[[st0]], %{{.}}[4, 0] : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">			// CHECK-NEXT: llvm.insertvalue %[[st0]], %{{.}}[4, 0] : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">
	// CHECK-NEXT: llvm.insertvalue %[[N]], %{{.}}[3, 1] : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">			// CHECK-NEXT: llvm.insertvalue %[[N]], %{{.}}[3, 1] : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">
	// CHECK-NEXT: llvm.insertvalue %[[st1]], %{{.}}[4, 1] : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">			// CHECK-NEXT: llvm.insertvalue %[[st1]], %{{.}}[4, 1] : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">
	%0 = alloc(%arg0, %arg1) : memref<?x?xf32>			%0 = alloc(%arg0, %arg1) : memref<?x?xf32>
	// CHECK-NEXT: llvm.return %{{.}} : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">			// CHECK-NEXT: llvm.return %{{.}} : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">
	return %0 : memref<?x?xf32>			return %0 : memref<?x?xf32>
	}			}

	// CHECK-LABEL: func @dynamic_dealloc(%arg0: !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }*">) {			// CHECK-LABEL: func @dynamic_dealloc
	func @dynamic_dealloc(%arg0: memref<?x?xf32>) {			func @dynamic_dealloc(%arg0: memref<?x?xf32>) {
	// CHECK-NEXT: %[[ld:.]] = llvm.load %{{.}} : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }*">			// CHECK: %[[ptr:.]] = llvm.extractvalue %{{.}}[0] : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }">
	// CHECK-NEXT: %[[ptr:.]] = llvm.extractvalue %[[ld]][0] : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">
	// CHECK-NEXT: %[[ptri8:.]] = llvm.bitcast %[[ptr]] : !llvm<"float"> to !llvm<"i8*">			// CHECK-NEXT: %[[ptri8:.]] = llvm.bitcast %[[ptr]] : !llvm<"float"> to !llvm<"i8*">
	// CHECK-NEXT: llvm.call @free(%[[ptri8]]) : (!llvm<"i8*">) -> ()			// CHECK-NEXT: llvm.call @free(%[[ptri8]]) : (!llvm<"i8*">) -> ()
	dealloc %arg0 : memref<?x?xf32>			dealloc %arg0 : memref<?x?xf32>
	return			return
	}			}

	// CHECK-LABEL: func @mixed_load(			// CHECK-LABEL: func @mixed_load(
	// CHECK: %[[A:.]]: !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }">, %[[I:.]]: !llvm.i64, %[[J:.]]: !llvm.i64			// CHECK-COUNT-2: !llvm<"float*">,
				// CHECK-COUNT-5: {{%[a-zA-Z0-9]*}}: !llvm.i64
				// CHECK: %[[I:.*]]: !llvm.i64,
				// CHECK: %[[J:.*]]: !llvm.i64)
	func @mixed_load(%mixed : memref<42x?xf32>, %i : index, %j : index) {			func @mixed_load(%mixed : memref<42x?xf32>, %i : index, %j : index) {
	// CHECK-NEXT: %[[ld:.]] = llvm.load %{{.}} : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }*">			// CHECK: %[[ptr:.]] = llvm.extractvalue %[[ld:.]][1] : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }">
	// CHECK-NEXT: %[[ptr:.]] = llvm.extractvalue %[[ld]][1] : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">
	// CHECK-NEXT: %[[off:.*]] = llvm.mlir.constant(0 : index) : !llvm.i64			// CHECK-NEXT: %[[off:.*]] = llvm.mlir.constant(0 : index) : !llvm.i64
	// CHECK-NEXT: %[[st0:.]] = llvm.extractvalue %[[ld]][4, 0] : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">			// CHECK-NEXT: %[[st0:.]] = llvm.extractvalue %[[ld]][4, 0] : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">
	// CHECK-NEXT: %[[offI:.*]] = llvm.mul %[[I]], %[[st0]] : !llvm.i64			// CHECK-NEXT: %[[offI:.*]] = llvm.mul %[[I]], %[[st0]] : !llvm.i64
	// CHECK-NEXT: %[[off0:.*]] = llvm.add %[[off]], %[[offI]] : !llvm.i64			// CHECK-NEXT: %[[off0:.*]] = llvm.add %[[off]], %[[offI]] : !llvm.i64
	// CHECK-NEXT: %[[st1:.*]] = llvm.mlir.constant(1 : index) : !llvm.i64			// CHECK-NEXT: %[[st1:.*]] = llvm.mlir.constant(1 : index) : !llvm.i64
	// CHECK-NEXT: %[[offJ:.*]] = llvm.mul %[[J]], %[[st1]] : !llvm.i64			// CHECK-NEXT: %[[offJ:.*]] = llvm.mul %[[J]], %[[st1]] : !llvm.i64
	// CHECK-NEXT: %[[off1:.*]] = llvm.add %[[off0]], %[[offJ]] : !llvm.i64			// CHECK-NEXT: %[[off1:.*]] = llvm.add %[[off0]], %[[offJ]] : !llvm.i64
	// CHECK-NEXT: %[[addr:.]] = llvm.getelementptr %[[ptr]][%[[off1]]] : (!llvm<"float">, !llvm.i64) -> !llvm<"float*">			// CHECK-NEXT: %[[addr:.]] = llvm.getelementptr %[[ptr]][%[[off1]]] : (!llvm<"float">, !llvm.i64) -> !llvm<"float*">
	// CHECK-NEXT: llvm.load %[[addr]] : !llvm<"float*">			// CHECK-NEXT: llvm.load %[[addr]] : !llvm<"float*">
	%0 = load %mixed[%i, %j] : memref<42x?xf32>			%0 = load %mixed[%i, %j] : memref<42x?xf32>
	return			return
	}			}

	// CHECK-LABEL: func @dynamic_load(			// CHECK-LABEL: func @dynamic_load(
	// CHECK: %[[A:.]]: !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }">, %[[I:.]]: !llvm.i64, %[[J:.]]: !llvm.i64
	func @dynamic_load(%dynamic : memref<?x?xf32>, %i : index, %j : index) {			func @dynamic_load(%dynamic : memref<?x?xf32>, %i : index, %j : index) {
	// CHECK-NEXT: %[[ld:.]] = llvm.load %{{.}} : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }*">			// CHECK: %[[ptr:.]] = llvm.extractvalue %[[ld:.]][1] : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }">
	// CHECK-NEXT: %[[ptr:.]] = llvm.extractvalue %[[ld]][1] : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">
	// CHECK-NEXT: %[[off:.*]] = llvm.mlir.constant(0 : index) : !llvm.i64			// CHECK-NEXT: %[[off:.*]] = llvm.mlir.constant(0 : index) : !llvm.i64
	// CHECK-NEXT: %[[st0:.]] = llvm.extractvalue %[[ld]][4, 0] : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">			// CHECK-NEXT: %[[st0:.]] = llvm.extractvalue %[[ld]][4, 0] : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">
	// CHECK-NEXT: %[[offI:.*]] = llvm.mul %[[I]], %[[st0]] : !llvm.i64			// CHECK-NEXT: %[[offI:.*]] = llvm.mul %[[I]], %[[st0]] : !llvm.i64
	// CHECK-NEXT: %[[off0:.*]] = llvm.add %[[off]], %[[offI]] : !llvm.i64			// CHECK-NEXT: %[[off0:.*]] = llvm.add %[[off]], %[[offI]] : !llvm.i64
	// CHECK-NEXT: %[[st1:.*]] = llvm.mlir.constant(1 : index) : !llvm.i64			// CHECK-NEXT: %[[st1:.*]] = llvm.mlir.constant(1 : index) : !llvm.i64
	// CHECK-NEXT: %[[offJ:.*]] = llvm.mul %[[J]], %[[st1]] : !llvm.i64			// CHECK-NEXT: %[[offJ:.*]] = llvm.mul %[[J]], %[[st1]] : !llvm.i64
	// CHECK-NEXT: %[[off1:.*]] = llvm.add %[[off0]], %[[offJ]] : !llvm.i64			// CHECK-NEXT: %[[off1:.*]] = llvm.add %[[off0]], %[[offJ]] : !llvm.i64
	// CHECK-NEXT: %[[addr:.]] = llvm.getelementptr %[[ptr]][%[[off1]]] : (!llvm<"float">, !llvm.i64) -> !llvm<"float*">			// CHECK-NEXT: %[[addr:.]] = llvm.getelementptr %[[ptr]][%[[off1]]] : (!llvm<"float">, !llvm.i64) -> !llvm<"float*">
	// CHECK-NEXT: llvm.load %[[addr]] : !llvm<"float*">			// CHECK-NEXT: llvm.load %[[addr]] : !llvm<"float*">
	%0 = load %dynamic[%i, %j] : memref<?x?xf32>			%0 = load %dynamic[%i, %j] : memref<?x?xf32>
	return			return
	}			}

	// CHECK-LABEL: func @prefetch			// CHECK-LABEL: func @prefetch
	func @prefetch(%A : memref<?x?xf32>, %i : index, %j : index) {			func @prefetch(%A : memref<?x?xf32>, %i : index, %j : index) {
	// CHECK-NEXT: %[[ld:.]] = llvm.load %{{.}} : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }*">			// CHECK: %[[ptr:.]] = llvm.extractvalue %[[ld:.]][1] : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }">
	// CHECK-NEXT: %[[ptr:.]] = llvm.extractvalue %[[ld]][1] : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">
	// CHECK-NEXT: %[[off:.*]] = llvm.mlir.constant(0 : index) : !llvm.i64			// CHECK-NEXT: %[[off:.*]] = llvm.mlir.constant(0 : index) : !llvm.i64
	// CHECK-NEXT: %[[st0:.]] = llvm.extractvalue %[[ld]][4, 0] : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">			// CHECK-NEXT: %[[st0:.]] = llvm.extractvalue %[[ld]][4, 0] : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">
	// CHECK-NEXT: %[[offI:.*]] = llvm.mul %[[I]], %[[st0]] : !llvm.i64			// CHECK-NEXT: %[[offI:.*]] = llvm.mul %[[I]], %[[st0]] : !llvm.i64
	// CHECK-NEXT: %[[off0:.*]] = llvm.add %[[off]], %[[offI]] : !llvm.i64			// CHECK-NEXT: %[[off0:.*]] = llvm.add %[[off]], %[[offI]] : !llvm.i64
	// CHECK-NEXT: %[[st1:.*]] = llvm.mlir.constant(1 : index) : !llvm.i64			// CHECK-NEXT: %[[st1:.*]] = llvm.mlir.constant(1 : index) : !llvm.i64
	// CHECK-NEXT: %[[offJ:.*]] = llvm.mul %[[J]], %[[st1]] : !llvm.i64			// CHECK-NEXT: %[[offJ:.*]] = llvm.mul %[[J]], %[[st1]] : !llvm.i64
	// CHECK-NEXT: %[[off1:.*]] = llvm.add %[[off0]], %[[offJ]] : !llvm.i64			// CHECK-NEXT: %[[off1:.*]] = llvm.add %[[off0]], %[[offJ]] : !llvm.i64
	// CHECK-NEXT: %[[addr:.]] = llvm.getelementptr %[[ptr]][%[[off1]]] : (!llvm<"float">, !llvm.i64) -> !llvm<"float*">			// CHECK-NEXT: %[[addr:.]] = llvm.getelementptr %[[ptr]][%[[off1]]] : (!llvm<"float">, !llvm.i64) -> !llvm<"float*">
	Show All 12 Lines
	// CHECK: [[C0_3:%.*]] = llvm.mlir.constant(0 : i32) : !llvm.i32			// CHECK: [[C0_3:%.*]] = llvm.mlir.constant(0 : i32) : !llvm.i32
	// CHECK: "llvm.intr.prefetch"(%{{.}}, [[C0_2]], [[C2]], [[C0_3]]) : (!llvm<"float">, !llvm.i32, !llvm.i32, !llvm.i32) -> ()			// CHECK: "llvm.intr.prefetch"(%{{.}}, [[C0_2]], [[C2]], [[C0_3]]) : (!llvm<"float">, !llvm.i32, !llvm.i32, !llvm.i32) -> ()
	prefetch %A[%i, %j], read, locality<2>, instr : memref<?x?xf32>			prefetch %A[%i, %j], read, locality<2>, instr : memref<?x?xf32>
	return			return
	}			}

	// CHECK-LABEL: func @dynamic_store			// CHECK-LABEL: func @dynamic_store
	func @dynamic_store(%dynamic : memref<?x?xf32>, %i : index, %j : index, %val : f32) {			func @dynamic_store(%dynamic : memref<?x?xf32>, %i : index, %j : index, %val : f32) {
	// CHECK-NEXT: %[[ld:.]] = llvm.load %{{.}} : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }*">			// CHECK: %[[ptr:.]] = llvm.extractvalue %[[ld:.]][1] : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }">
	// CHECK-NEXT: %[[ptr:.]] = llvm.extractvalue %[[ld]][1] : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">
	// CHECK-NEXT: %[[off:.*]] = llvm.mlir.constant(0 : index) : !llvm.i64			// CHECK-NEXT: %[[off:.*]] = llvm.mlir.constant(0 : index) : !llvm.i64
	// CHECK-NEXT: %[[st0:.]] = llvm.extractvalue %[[ld]][4, 0] : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">			// CHECK-NEXT: %[[st0:.]] = llvm.extractvalue %[[ld]][4, 0] : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">
	// CHECK-NEXT: %[[offI:.*]] = llvm.mul %[[I]], %[[st0]] : !llvm.i64			// CHECK-NEXT: %[[offI:.*]] = llvm.mul %[[I]], %[[st0]] : !llvm.i64
	// CHECK-NEXT: %[[off0:.*]] = llvm.add %[[off]], %[[offI]] : !llvm.i64			// CHECK-NEXT: %[[off0:.*]] = llvm.add %[[off]], %[[offI]] : !llvm.i64
	// CHECK-NEXT: %[[st1:.*]] = llvm.mlir.constant(1 : index) : !llvm.i64			// CHECK-NEXT: %[[st1:.*]] = llvm.mlir.constant(1 : index) : !llvm.i64
	// CHECK-NEXT: %[[offJ:.*]] = llvm.mul %[[J]], %[[st1]] : !llvm.i64			// CHECK-NEXT: %[[offJ:.*]] = llvm.mul %[[J]], %[[st1]] : !llvm.i64
	// CHECK-NEXT: %[[off1:.*]] = llvm.add %[[off0]], %[[offJ]] : !llvm.i64			// CHECK-NEXT: %[[off1:.*]] = llvm.add %[[off0]], %[[offJ]] : !llvm.i64
	// CHECK-NEXT: %[[addr:.]] = llvm.getelementptr %[[ptr]][%[[off1]]] : (!llvm<"float">, !llvm.i64) -> !llvm<"float*">			// CHECK-NEXT: %[[addr:.]] = llvm.getelementptr %[[ptr]][%[[off1]]] : (!llvm<"float">, !llvm.i64) -> !llvm<"float*">
	// CHECK-NEXT: llvm.store %arg3, %[[addr]] : !llvm<"float*">			// CHECK-NEXT: llvm.store %{{.}}, %[[addr]] : !llvm<"float">
	store %val, %dynamic[%i, %j] : memref<?x?xf32>			store %val, %dynamic[%i, %j] : memref<?x?xf32>
	return			return
	}			}

	// CHECK-LABEL: func @mixed_store			// CHECK-LABEL: func @mixed_store
	func @mixed_store(%mixed : memref<42x?xf32>, %i : index, %j : index, %val : f32) {			func @mixed_store(%mixed : memref<42x?xf32>, %i : index, %j : index, %val : f32) {
	// CHECK-NEXT: %[[ld:.]] = llvm.load %{{.}} : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }*">			// CHECK: %[[ptr:.]] = llvm.extractvalue %[[ld:.]][1] : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }">
	// CHECK-NEXT: %[[ptr:.]] = llvm.extractvalue %[[ld]][1] : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">
	// CHECK-NEXT: %[[off:.*]] = llvm.mlir.constant(0 : index) : !llvm.i64			// CHECK-NEXT: %[[off:.*]] = llvm.mlir.constant(0 : index) : !llvm.i64
	// CHECK-NEXT: %[[st0:.]] = llvm.extractvalue %[[ld]][4, 0] : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">			// CHECK-NEXT: %[[st0:.]] = llvm.extractvalue %[[ld]][4, 0] : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">
	// CHECK-NEXT: %[[offI:.*]] = llvm.mul %[[I]], %[[st0]] : !llvm.i64			// CHECK-NEXT: %[[offI:.*]] = llvm.mul %[[I]], %[[st0]] : !llvm.i64
	// CHECK-NEXT: %[[off0:.*]] = llvm.add %[[off]], %[[offI]] : !llvm.i64			// CHECK-NEXT: %[[off0:.*]] = llvm.add %[[off]], %[[offI]] : !llvm.i64
	// CHECK-NEXT: %[[st1:.*]] = llvm.mlir.constant(1 : index) : !llvm.i64			// CHECK-NEXT: %[[st1:.*]] = llvm.mlir.constant(1 : index) : !llvm.i64
	// CHECK-NEXT: %[[offJ:.*]] = llvm.mul %[[J]], %[[st1]] : !llvm.i64			// CHECK-NEXT: %[[offJ:.*]] = llvm.mul %[[J]], %[[st1]] : !llvm.i64
	// CHECK-NEXT: %[[off1:.*]] = llvm.add %[[off0]], %[[offJ]] : !llvm.i64			// CHECK-NEXT: %[[off1:.*]] = llvm.add %[[off0]], %[[offJ]] : !llvm.i64
	// CHECK-NEXT: %[[addr:.]] = llvm.getelementptr %[[ptr]][%[[off1]]] : (!llvm<"float">, !llvm.i64) -> !llvm<"float*">			// CHECK-NEXT: %[[addr:.]] = llvm.getelementptr %[[ptr]][%[[off1]]] : (!llvm<"float">, !llvm.i64) -> !llvm<"float*">
	// CHECK-NEXT: llvm.store %arg3, %[[addr]] : !llvm<"float*">			// CHECK-NEXT: llvm.store %{{.}}, %[[addr]] : !llvm<"float">
	store %val, %mixed[%i, %j] : memref<42x?xf32>			store %val, %mixed[%i, %j] : memref<42x?xf32>
	return			return
	}			}

	// CHECK-LABEL: func @memref_cast_static_to_dynamic			// CHECK-LABEL: func @memref_cast_static_to_dynamic
	func @memref_cast_static_to_dynamic(%static : memref<10x42xf32>) {			func @memref_cast_static_to_dynamic(%static : memref<10x42xf32>) {
	// CHECK-NEXT: %[[ld:.]] = llvm.load %{{.}} : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }*">			// CHECK: llvm.bitcast %{{.}} : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }"> to !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">
	// CHECK-NEXT: llvm.bitcast %[[ld]] : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }"> to !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }">
	%0 = memref_cast %static : memref<10x42xf32> to memref<?x?xf32>			%0 = memref_cast %static : memref<10x42xf32> to memref<?x?xf32>
	return			return
	}			}

	// CHECK-LABEL: func @memref_cast_static_to_mixed			// CHECK-LABEL: func @memref_cast_static_to_mixed
	func @memref_cast_static_to_mixed(%static : memref<10x42xf32>) {			func @memref_cast_static_to_mixed(%static : memref<10x42xf32>) {
	// CHECK-NEXT: %[[ld:.]] = llvm.load %{{.}} : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }*">			// CHECK: llvm.bitcast %{{.}} : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }"> to !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">
	// CHECK-NEXT: llvm.bitcast %[[ld]] : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }"> to !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }">
	%0 = memref_cast %static : memref<10x42xf32> to memref<?x42xf32>			%0 = memref_cast %static : memref<10x42xf32> to memref<?x42xf32>
	return			return
	}			}

	// CHECK-LABEL: func @memref_cast_dynamic_to_static			// CHECK-LABEL: func @memref_cast_dynamic_to_static
	func @memref_cast_dynamic_to_static(%dynamic : memref<?x?xf32>) {			func @memref_cast_dynamic_to_static(%dynamic : memref<?x?xf32>) {
	// CHECK-NEXT: %[[ld:.]] = llvm.load %{{.}} : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }*">			// CHECK: llvm.bitcast %{{.}} : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }"> to !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">
	// CHECK-NEXT: llvm.bitcast %[[ld]] : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }"> to !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }">
	%0 = memref_cast %dynamic : memref<?x?xf32> to memref<10x12xf32>			%0 = memref_cast %dynamic : memref<?x?xf32> to memref<10x12xf32>
	return			return
	}			}

	// CHECK-LABEL: func @memref_cast_dynamic_to_mixed			// CHECK-LABEL: func @memref_cast_dynamic_to_mixed
	func @memref_cast_dynamic_to_mixed(%dynamic : memref<?x?xf32>) {			func @memref_cast_dynamic_to_mixed(%dynamic : memref<?x?xf32>) {
	// CHECK-NEXT: %[[ld:.]] = llvm.load %{{.}} : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }*">			// CHECK: llvm.bitcast %{{.}} : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }"> to !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">
	// CHECK-NEXT: llvm.bitcast %[[ld]] : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }"> to !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }">
	%0 = memref_cast %dynamic : memref<?x?xf32> to memref<?x12xf32>			%0 = memref_cast %dynamic : memref<?x?xf32> to memref<?x12xf32>
	return			return
	}			}

	// CHECK-LABEL: func @memref_cast_mixed_to_dynamic			// CHECK-LABEL: func @memref_cast_mixed_to_dynamic
	func @memref_cast_mixed_to_dynamic(%mixed : memref<42x?xf32>) {			func @memref_cast_mixed_to_dynamic(%mixed : memref<42x?xf32>) {
	// CHECK-NEXT: %[[ld:.]] = llvm.load %{{.}} : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }*">			// CHECK: llvm.bitcast %{{.}} : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }"> to !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">
	// CHECK-NEXT: llvm.bitcast %[[ld]] : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }"> to !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }">
	%0 = memref_cast %mixed : memref<42x?xf32> to memref<?x?xf32>			%0 = memref_cast %mixed : memref<42x?xf32> to memref<?x?xf32>
	return			return
	}			}

	// CHECK-LABEL: func @memref_cast_mixed_to_static			// CHECK-LABEL: func @memref_cast_mixed_to_static
	func @memref_cast_mixed_to_static(%mixed : memref<42x?xf32>) {			func @memref_cast_mixed_to_static(%mixed : memref<42x?xf32>) {
	// CHECK-NEXT: %[[ld:.]] = llvm.load %{{.}} : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }*">			// CHECK: llvm.bitcast %{{.}} : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }"> to !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">
	// CHECK-NEXT: llvm.bitcast %[[ld]] : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }"> to !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }">
	%0 = memref_cast %mixed : memref<42x?xf32> to memref<42x1xf32>			%0 = memref_cast %mixed : memref<42x?xf32> to memref<42x1xf32>
	return			return
	}			}

	// CHECK-LABEL: func @memref_cast_mixed_to_mixed			// CHECK-LABEL: func @memref_cast_mixed_to_mixed
	func @memref_cast_mixed_to_mixed(%mixed : memref<42x?xf32>) {			func @memref_cast_mixed_to_mixed(%mixed : memref<42x?xf32>) {
	// CHECK-NEXT: %[[ld:.]] = llvm.load %{{.}} : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }*">			// CHECK: llvm.bitcast %{{.}} : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }"> to !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">
	// CHECK-NEXT: llvm.bitcast %[[ld]] : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }"> to !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }">
	%0 = memref_cast %mixed : memref<42x?xf32> to memref<?x1xf32>			%0 = memref_cast %mixed : memref<42x?xf32> to memref<?x1xf32>
	return			return
	}			}

	// CHECK-LABEL: func @memref_cast_ranked_to_unranked			// CHECK-LABEL: func @memref_cast_ranked_to_unranked
	func @memref_cast_ranked_to_unranked(%arg : memref<42x2x?xf32>) {			func @memref_cast_ranked_to_unranked(%arg : memref<42x2x?xf32>) {
	// CHECK-NEXT: %[[ld:.]] = llvm.load %{{.}} : !llvm<"{ float, float, i64, [3 x i64], [3 x i64] }*">
	// CHECK-DAG: %[[c:.*]] = llvm.mlir.constant(1 : index) : !llvm.i64			// CHECK-DAG: %[[c:.*]] = llvm.mlir.constant(1 : index) : !llvm.i64
	// CHECK-DAG: %[[p:.]] = llvm.alloca %[[c]] x !llvm<"{ float, float, i64, [3 x i64], [3 x i64] }"> : (!llvm.i64) -> !llvm<"{ float, float, i64, [3 x i64], [3 x i64] }">			// CHECK-DAG: %[[p:.]] = llvm.alloca %[[c]] x !llvm<"{ float, float, i64, [3 x i64], [3 x i64] }"> : (!llvm.i64) -> !llvm<"{ float, float, i64, [3 x i64], [3 x i64] }">
	// CHECK-DAG: llvm.store %[[ld]], %[[p]] : !llvm<"{ float, float, i64, [3 x i64], [3 x i64] }*">			// CHECK-DAG: llvm.store %{{.}}, %[[p]] : !llvm<"{ float, float, i64, [3 x i64], [3 x i64] }">
	// CHECK-DAG: %[[p2:.]] = llvm.bitcast %2 : !llvm<"{ float, float, i64, [3 x i64], [3 x i64] }"> to !llvm<"i8*">			// CHECK-DAG: %[[p2:.]] = llvm.bitcast %[[p]] : !llvm<"{ float, float, i64, [3 x i64], [3 x i64] }"> to !llvm<"i8*">
	// CHECK-DAG: %[[r:.*]] = llvm.mlir.constant(3 : i64) : !llvm.i64			// CHECK-DAG: %[[r:.*]] = llvm.mlir.constant(3 : i64) : !llvm.i64
	// CHECK : llvm.mlir.undef : !llvm<"{ i64, i8* }">			// CHECK : llvm.mlir.undef : !llvm<"{ i64, i8* }">
	// CHECK-DAG: llvm.insertvalue %[[r]], %{{.}}[0] : !llvm<"{ i64, i8 }">			// CHECK-DAG: llvm.insertvalue %[[r]], %{{.}}[0] : !llvm<"{ i64, i8 }">
	// CHECK-DAG: llvm.insertvalue %[[p2]], %{{.}}[1] : !llvm<"{ i64, i8 }">			// CHECK-DAG: llvm.insertvalue %[[p2]], %{{.}}[1] : !llvm<"{ i64, i8 }">
	%0 = memref_cast %arg : memref<42x2x?xf32> to memref<*xf32>			%0 = memref_cast %arg : memref<42x2x?xf32> to memref<*xf32>
	return			return
	}			}

	// CHECK-LABEL: func @memref_cast_unranked_to_ranked			// CHECK-LABEL: func @memref_cast_unranked_to_ranked
	func @memref_cast_unranked_to_ranked(%arg : memref<*xf32>) {			func @memref_cast_unranked_to_ranked(%arg : memref<*xf32>) {
	// CHECK: %[[ld:.]] = llvm.load %{{.}} : !llvm<"{ i64, i8* }*">			// CHECK: %[[p:.]] = llvm.extractvalue %{{.}}[1] : !llvm<"{ i64, i8* }">
	// CHECK-NEXT: %[[p:.]] = llvm.extractvalue %[[ld]][1] : !llvm<"{ i64, i8 }">
	// CHECK-NEXT: llvm.bitcast %[[p]] : !llvm<"i8"> to !llvm<"{ float, float, i64, [4 x i64], [4 x i64] }">			// CHECK-NEXT: llvm.bitcast %[[p]] : !llvm<"i8"> to !llvm<"{ float, float, i64, [4 x i64], [4 x i64] }">
	%0 = memref_cast %arg : memref<*xf32> to memref<?x?x10x2xf32>			%0 = memref_cast %arg : memref<*xf32> to memref<?x?x10x2xf32>
	return			return
	}			}

	// CHECK-LABEL: func @mixed_memref_dim(%arg0: !llvm<"{ float, float, i64, [5 x i64], [5 x i64] }*">) {			// CHECK-LABEL: func @mixed_memref_dim
	func @mixed_memref_dim(%mixed : memref<42x?x?x13x?xf32>) {			func @mixed_memref_dim(%mixed : memref<42x?x?x13x?xf32>) {
	// CHECK-NEXT: %[[ld:.]] = llvm.load %{{.}} : !llvm<"{ float, float, i64, [5 x i64], [5 x i64] }*">			// CHECK: llvm.mlir.constant(42 : index) : !llvm.i64
	// CHECK-NEXT: llvm.mlir.constant(42 : index) : !llvm.i64
	%0 = dim %mixed, 0 : memref<42x?x?x13x?xf32>			%0 = dim %mixed, 0 : memref<42x?x?x13x?xf32>
	// CHECK-NEXT: llvm.extractvalue %[[ld]][3, 1] : !llvm<"{ float, float, i64, [5 x i64], [5 x i64] }">			// CHECK-NEXT: llvm.extractvalue %[[ld:.]][3, 1] : !llvm<"{ float, float*, i64, [5 x i64], [5 x i64] }">
	%1 = dim %mixed, 1 : memref<42x?x?x13x?xf32>			%1 = dim %mixed, 1 : memref<42x?x?x13x?xf32>
	// CHECK-NEXT: llvm.extractvalue %[[ld]][3, 2] : !llvm<"{ float, float, i64, [5 x i64], [5 x i64] }">			// CHECK-NEXT: llvm.extractvalue %[[ld]][3, 2] : !llvm<"{ float, float, i64, [5 x i64], [5 x i64] }">
	%2 = dim %mixed, 2 : memref<42x?x?x13x?xf32>			%2 = dim %mixed, 2 : memref<42x?x?x13x?xf32>
	// CHECK-NEXT: llvm.mlir.constant(13 : index) : !llvm.i64			// CHECK-NEXT: llvm.mlir.constant(13 : index) : !llvm.i64
	%3 = dim %mixed, 3 : memref<42x?x?x13x?xf32>			%3 = dim %mixed, 3 : memref<42x?x?x13x?xf32>
	// CHECK-NEXT: llvm.extractvalue %[[ld]][3, 4] : !llvm<"{ float, float, i64, [5 x i64], [5 x i64] }">			// CHECK-NEXT: llvm.extractvalue %[[ld]][3, 4] : !llvm<"{ float, float, i64, [5 x i64], [5 x i64] }">
	%4 = dim %mixed, 4 : memref<42x?x?x13x?xf32>			%4 = dim %mixed, 4 : memref<42x?x?x13x?xf32>
	return			return
	}			}

mlir/test/Conversion/StandardToLLVM/convert-funcs.mlir

	Show All 12 Lines
	func @third_order(%arg0 : (() -> ()) -> (() -> ())) -> ((() -> ()) -> (() -> ()))			func @third_order(%arg0 : (() -> ()) -> (() -> ())) -> ((() -> ()) -> (() -> ()))

	//CHECK: llvm.func @fifth_order_left(!llvm<"void (void (void (void ())))">)			//CHECK: llvm.func @fifth_order_left(!llvm<"void (void (void (void ())))">)
	func @fifth_order_left(%arg0: (((() -> ()) -> ()) -> ()) -> ())			func @fifth_order_left(%arg0: (((() -> ()) -> ()) -> ()) -> ())

	//CHECK: llvm.func @fifth_order_right(!llvm<"void ()* ()* ()* ()*">)			//CHECK: llvm.func @fifth_order_right(!llvm<"void ()* ()* ()* ()*">)
	func @fifth_order_right(%arg0: () -> (() -> (() -> (() -> ()))))			func @fifth_order_right(%arg0: () -> (() -> (() -> (() -> ()))))

	// Check that memrefs are converted to pointers-to-struct if appear as function arguments.			// Check that memrefs are converted to argument packs if appear as function arguments.
	// CHECK: llvm.func @memref_call_conv(!llvm<"{ float, float, i64, [1 x i64], [1 x i64] }*">)			// CHECK: llvm.func @memref_call_conv(!llvm<"float">, !llvm<"float">, !llvm.i64, !llvm.i64, !llvm.i64)
				nicolasvasilacheUnsubmitted Done Reply Inline Actions typo argument nicolasvasilache: typo argument
	func @memref_call_conv(%arg0: memref<?xf32>)			func @memref_call_conv(%arg0: memref<?xf32>)

	// Same in nested functions.			// Same in nested functions.
	// CHECK: llvm.func @memref_call_conv_nested(!llvm<"void ({ float, float, i64, [1 x i64], [1 x i64] })">)			// CHECK: llvm.func @memref_call_conv_nested(!llvm<"void (float, float, i64, i64, i64)*">)
	func @memref_call_conv_nested(%arg0: (memref<?xf32>) -> ())			func @memref_call_conv_nested(%arg0: (memref<?xf32>) -> ())

	//CHECK-LABEL: llvm.func @pass_through(%arg0: !llvm<"void ()">) -> !llvm<"void ()"> {			//CHECK-LABEL: llvm.func @pass_through(%arg0: !llvm<"void ()">) -> !llvm<"void ()"> {
	func @pass_through(%arg0: () -> ()) -> (() -> ()) {			func @pass_through(%arg0: () -> ()) -> (() -> ()) {
	// CHECK-NEXT: llvm.br ^bb1(%arg0 : !llvm<"void ()*">)			// CHECK-NEXT: llvm.br ^bb1(%arg0 : !llvm<"void ()*">)
	br ^bb1(%arg0 : () -> ())			br ^bb1(%arg0 : () -> ())

	//CHECK-NEXT: ^bb1(%0: !llvm<"void ()*">): // pred: ^bb0			//CHECK-NEXT: ^bb1(%0: !llvm<"void ()*">): // pred: ^bb0
	Show All 26 Lines

mlir/test/Conversion/StandardToLLVM/convert-static-memref-ops.mlir

// RUN: mlir-opt -convert-std-to-llvm %s \| FileCheck %s		// RUN: mlir-opt -convert-std-to-llvm %s \| FileCheck %s
// RUN: mlir-opt -convert-std-to-llvm -convert-std-to-llvm-use-alloca=1 %s \| FileCheck %s --check-prefix=ALLOCA		// RUN: mlir-opt -convert-std-to-llvm -convert-std-to-llvm-use-alloca=1 %s \| FileCheck %s --check-prefix=ALLOCA
// RUN: mlir-opt -convert-std-to-llvm -split-input-file -convert-std-to-llvm-use-bare-ptr-memref-call-conv=1 %s \| FileCheck %s --check-prefix=BAREPTR		// RUN: mlir-opt -convert-std-to-llvm -split-input-file -convert-std-to-llvm-use-bare-ptr-memref-call-conv=1 %s \| FileCheck %s --check-prefix=BAREPTR

// BAREPTR-LABEL: func @check_noalias		// BAREPTR-LABEL: func @check_noalias
// BAREPTR-SAME: %{{.}}: !llvm<"float"> {llvm.noalias = true}		// BAREPTR-SAME: %{{.}}: !llvm<"float"> {llvm.noalias = true}
func @check_noalias(%static : memref<2xf32> {llvm.noalias = true}) {		func @check_noalias(%static : memref<2xf32> {llvm.noalias = true}) {
return		return
}		}

// -----		// -----

// CHECK-LABEL: func @check_static_return(%arg0: !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }">) -> !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }"> {		// CHECK-LABEL: func @check_static_return
		// CHECK-COUNT-2: !llvm<"float*">
		// CHECK-COUNT-5: !llvm.i64
		// CHECK-SAME: -> !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }">
// BAREPTR-LABEL: func @check_static_return		// BAREPTR-LABEL: func @check_static_return
// BAREPTR-SAME: (%[[arg:.]]: !llvm<"float">) -> !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }"> {		// BAREPTR-SAME: (%[[arg:.]]: !llvm<"float">) -> !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }"> {
func @check_static_return(%static : memref<32x18xf32>) -> memref<32x18xf32> {		func @check_static_return(%static : memref<32x18xf32>) -> memref<32x18xf32> {
// CHECK: llvm.return %{{.}} : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">		// CHECK: llvm.return %{{.}} : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">

// BAREPTR: %[[udf:.]] = llvm.mlir.undef : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">		// BAREPTR: %[[udf:.]] = llvm.mlir.undef : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">
// BAREPTR-NEXT: %[[base:.]] = llvm.insertvalue %[[arg]], %[[udf]][0] : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">		// BAREPTR-NEXT: %[[base:.]] = llvm.insertvalue %[[arg]], %[[udf]][0] : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">
// BAREPTR-NEXT: %[[aligned:.]] = llvm.insertvalue %[[arg]], %[[base]][1] : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">		// BAREPTR-NEXT: %[[aligned:.]] = llvm.insertvalue %[[arg]], %[[base]][1] : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">
▲ Show 20 Lines • Show All 49 Lines • ▼ Show 20 Lines
// BAREPTR-NEXT: %[[c0:.*]] = llvm.mlir.constant(0 : index) : !llvm.i64		// BAREPTR-NEXT: %[[c0:.*]] = llvm.mlir.constant(0 : index) : !llvm.i64
// BAREPTR-NEXT: llvm.insertvalue %[[c0]], %{{.}}[2] : !llvm<"{ float, float*, i64 }">		// BAREPTR-NEXT: llvm.insertvalue %[[c0]], %{{.}}[2] : !llvm<"{ float, float*, i64 }">
%0 = alloc() : memref<f32>		%0 = alloc() : memref<f32>
return %0 : memref<f32>		return %0 : memref<f32>
}		}

// -----		// -----

// CHECK-LABEL: func @zero_d_dealloc(%{{.}}: !llvm<"{ float, float, i64 }">) {		// CHECK-LABEL: func @zero_d_dealloc
// BAREPTR-LABEL: func @zero_d_dealloc(%{{.}}: !llvm<"float">) {		// BAREPTR-LABEL: func @zero_d_dealloc(%{{.}}: !llvm<"float">) {
func @zero_d_dealloc(%arg0: memref<f32>) {		func @zero_d_dealloc(%arg0: memref<f32>) {
// CHECK-NEXT: %[[ld:.]] = llvm.load %{{.}} : !llvm<"{ float, float, i64 }*">		// CHECK: %[[ptr:.]] = llvm.extractvalue %{{.}}[0] : !llvm<"{ float, float, i64 }">
// CHECK-NEXT: %[[ptr:.]] = llvm.extractvalue %[[ld]][0] : !llvm<"{ float, float*, i64 }">
// CHECK-NEXT: %[[bc:.]] = llvm.bitcast %[[ptr]] : !llvm<"float"> to !llvm<"i8*">		// CHECK-NEXT: %[[bc:.]] = llvm.bitcast %[[ptr]] : !llvm<"float"> to !llvm<"i8*">
// CHECK-NEXT: llvm.call @free(%[[bc]]) : (!llvm<"i8*">) -> ()		// CHECK-NEXT: llvm.call @free(%[[bc]]) : (!llvm<"i8*">) -> ()

// BAREPTR: %[[ptr:.]] = llvm.extractvalue %{{.}}[0] : !llvm<"{ float, float, i64 }">		// BAREPTR: %[[ptr:.]] = llvm.extractvalue %{{.}}[0] : !llvm<"{ float, float, i64 }">
// BAREPTR-NEXT: %[[bc:.]] = llvm.bitcast %[[ptr]] : !llvm<"float"> to !llvm<"i8*">		// BAREPTR-NEXT: %[[bc:.]] = llvm.bitcast %[[ptr]] : !llvm<"float"> to !llvm<"i8*">
// BAREPTR-NEXT: llvm.call @free(%[[bc]]) : (!llvm<"i8*">) -> ()		// BAREPTR-NEXT: llvm.call @free(%[[bc]]) : (!llvm<"i8*">) -> ()
dealloc %arg0 : memref<f32>		dealloc %arg0 : memref<f32>
return		return
}		}

// -----		// -----

// CHECK-LABEL: func @aligned_1d_alloc(		// CHECK-LABEL: func @aligned_1d_alloc(
// BAREPTR-LABEL: func @aligned_1d_alloc(		// BAREPTR-LABEL: func @aligned_1d_alloc(
func @aligned_1d_alloc() -> memref<42xf32> {		func @aligned_1d_alloc() -> memref<42xf32> {
// CHECK-NEXT: llvm.mlir.constant(42 : index) : !llvm.i64		// CHECK: llvm.mlir.constant(42 : index) : !llvm.i64
// CHECK-NEXT: %[[null:.]] = llvm.mlir.null : !llvm<"float">		// CHECK-NEXT: %[[null:.]] = llvm.mlir.null : !llvm<"float">
// CHECK-NEXT: %[[one:.*]] = llvm.mlir.constant(1 : index) : !llvm.i64		// CHECK-NEXT: %[[one:.*]] = llvm.mlir.constant(1 : index) : !llvm.i64
// CHECK-NEXT: %[[gep:.]] = llvm.getelementptr %[[null]][%[[one]]] : (!llvm<"float">, !llvm.i64) -> !llvm<"float*">		// CHECK-NEXT: %[[gep:.]] = llvm.getelementptr %[[null]][%[[one]]] : (!llvm<"float">, !llvm.i64) -> !llvm<"float*">
// CHECK-NEXT: %[[sizeof:.]] = llvm.ptrtoint %[[gep]] : !llvm<"float"> to !llvm.i64		// CHECK-NEXT: %[[sizeof:.]] = llvm.ptrtoint %[[gep]] : !llvm<"float"> to !llvm.i64
// CHECK-NEXT: llvm.mul %{{.*}}, %[[sizeof]] : !llvm.i64		// CHECK-NEXT: llvm.mul %{{.*}}, %[[sizeof]] : !llvm.i64
// CHECK-NEXT: %[[alignment:.*]] = llvm.mlir.constant(8 : index) : !llvm.i64		// CHECK-NEXT: %[[alignment:.*]] = llvm.mlir.constant(8 : index) : !llvm.i64
// CHECK-NEXT: %[[alignmentMinus1:.]] = llvm.add {{.}}, %[[alignment]] : !llvm.i64		// CHECK-NEXT: %[[alignmentMinus1:.]] = llvm.add {{.}}, %[[alignment]] : !llvm.i64
// CHECK-NEXT: %[[allocsize:.*]] = llvm.sub %[[alignmentMinus1]], %[[one]] : !llvm.i64		// CHECK-NEXT: %[[allocsize:.*]] = llvm.sub %[[alignmentMinus1]], %[[one]] : !llvm.i64
Show All 37 Lines	// BAREPTR-NEXT: llvm.insertvalue %[[c0]], %{{.}}[2] : !llvm<"{ float, float*, i64, [1 x i64], [1 x i64] }">
return %0 : memref<42xf32>		return %0 : memref<42xf32>
}		}

// -----		// -----

// CHECK-LABEL: func @static_alloc() -> !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }"> {		// CHECK-LABEL: func @static_alloc() -> !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }"> {
// BAREPTR-LABEL: func @static_alloc() -> !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }"> {		// BAREPTR-LABEL: func @static_alloc() -> !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }"> {
func @static_alloc() -> memref<32x18xf32> {		func @static_alloc() -> memref<32x18xf32> {
// CHECK-NEXT: %[[sz1:.*]] = llvm.mlir.constant(32 : index) : !llvm.i64		// CHECK: %[[sz1:.*]] = llvm.mlir.constant(32 : index) : !llvm.i64
// CHECK-NEXT: %[[sz2:.*]] = llvm.mlir.constant(18 : index) : !llvm.i64		// CHECK-NEXT: %[[sz2:.*]] = llvm.mlir.constant(18 : index) : !llvm.i64
// CHECK-NEXT: %[[num_elems:.*]] = llvm.mul %0, %1 : !llvm.i64		// CHECK-NEXT: %[[num_elems:.*]] = llvm.mul %0, %1 : !llvm.i64
// CHECK-NEXT: %[[null:.]] = llvm.mlir.null : !llvm<"float">		// CHECK-NEXT: %[[null:.]] = llvm.mlir.null : !llvm<"float">
// CHECK-NEXT: %[[one:.*]] = llvm.mlir.constant(1 : index) : !llvm.i64		// CHECK-NEXT: %[[one:.*]] = llvm.mlir.constant(1 : index) : !llvm.i64
// CHECK-NEXT: %[[gep:.]] = llvm.getelementptr %[[null]][%[[one]]] : (!llvm<"float">, !llvm.i64) -> !llvm<"float*">		// CHECK-NEXT: %[[gep:.]] = llvm.getelementptr %[[null]][%[[one]]] : (!llvm<"float">, !llvm.i64) -> !llvm<"float*">
// CHECK-NEXT: %[[sizeof:.]] = llvm.ptrtoint %[[gep]] : !llvm<"float"> to !llvm.i64		// CHECK-NEXT: %[[sizeof:.]] = llvm.ptrtoint %[[gep]] : !llvm<"float"> to !llvm.i64
// CHECK-NEXT: %[[bytes:.*]] = llvm.mul %[[num_elems]], %[[sizeof]] : !llvm.i64		// CHECK-NEXT: %[[bytes:.*]] = llvm.mul %[[num_elems]], %[[sizeof]] : !llvm.i64
// CHECK-NEXT: %[[allocated:.]] = llvm.call @malloc(%[[bytes]]) : (!llvm.i64) -> !llvm<"i8">		// CHECK-NEXT: %[[allocated:.]] = llvm.call @malloc(%[[bytes]]) : (!llvm.i64) -> !llvm<"i8">
// CHECK-NEXT: llvm.bitcast %[[allocated]] : !llvm<"i8"> to !llvm<"float">		// CHECK-NEXT: llvm.bitcast %[[allocated]] : !llvm<"i8"> to !llvm<"float">

// BAREPTR-NEXT: %[[sz1:.*]] = llvm.mlir.constant(32 : index) : !llvm.i64		// BAREPTR-NEXT: %[[sz1:.*]] = llvm.mlir.constant(32 : index) : !llvm.i64
// BAREPTR-NEXT: %[[sz2:.*]] = llvm.mlir.constant(18 : index) : !llvm.i64		// BAREPTR-NEXT: %[[sz2:.*]] = llvm.mlir.constant(18 : index) : !llvm.i64
// BAREPTR-NEXT: %[[num_elems:.*]] = llvm.mul %[[sz1]], %[[sz2]] : !llvm.i64		// BAREPTR-NEXT: %[[num_elems:.*]] = llvm.mul %[[sz1]], %[[sz2]] : !llvm.i64
// BAREPTR-NEXT: %[[null:.]] = llvm.mlir.null : !llvm<"float">		// BAREPTR-NEXT: %[[null:.]] = llvm.mlir.null : !llvm<"float">
// BAREPTR-NEXT: %[[one:.*]] = llvm.mlir.constant(1 : index) : !llvm.i64		// BAREPTR-NEXT: %[[one:.*]] = llvm.mlir.constant(1 : index) : !llvm.i64
// BAREPTR-NEXT: %[[gep:.]] = llvm.getelementptr %[[null]][%[[one]]] : (!llvm<"float">, !llvm.i64) -> !llvm<"float*">		// BAREPTR-NEXT: %[[gep:.]] = llvm.getelementptr %[[null]][%[[one]]] : (!llvm<"float">, !llvm.i64) -> !llvm<"float*">
// BAREPTR-NEXT: %[[sizeof:.]] = llvm.ptrtoint %[[gep]] : !llvm<"float"> to !llvm.i64		// BAREPTR-NEXT: %[[sizeof:.]] = llvm.ptrtoint %[[gep]] : !llvm<"float"> to !llvm.i64
// BAREPTR-NEXT: %[[bytes:.*]] = llvm.mul %[[num_elems]], %[[sizeof]] : !llvm.i64		// BAREPTR-NEXT: %[[bytes:.*]] = llvm.mul %[[num_elems]], %[[sizeof]] : !llvm.i64
// BAREPTR-NEXT: %[[allocated:.]] = llvm.call @malloc(%[[bytes]]) : (!llvm.i64) -> !llvm<"i8">		// BAREPTR-NEXT: %[[allocated:.]] = llvm.call @malloc(%[[bytes]]) : (!llvm.i64) -> !llvm<"i8">
// BAREPTR-NEXT: llvm.bitcast %[[allocated]] : !llvm<"i8"> to !llvm<"float">		// BAREPTR-NEXT: llvm.bitcast %[[allocated]] : !llvm<"i8"> to !llvm<"float">
%0 = alloc() : memref<32x18xf32>		%0 = alloc() : memref<32x18xf32>
return %0 : memref<32x18xf32>		return %0 : memref<32x18xf32>
}		}

// -----		// -----

// CHECK-LABEL: func @static_dealloc(%{{.}}: !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }">) {		// CHECK-LABEL: func @static_dealloc
// BAREPTR-LABEL: func @static_dealloc(%{{.}}: !llvm<"float">) {		// BAREPTR-LABEL: func @static_dealloc(%{{.}}: !llvm<"float">) {
func @static_dealloc(%static: memref<10x8xf32>) {		func @static_dealloc(%static: memref<10x8xf32>) {
// CHECK-NEXT: %[[ld:.]] = llvm.load %{{.}} : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }*">		// CHECK: %[[ptr:.]] = llvm.extractvalue %{{.}}[0] : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }">
// CHECK-NEXT: %[[ptr:.]] = llvm.extractvalue %[[ld]][0] : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">
// CHECK-NEXT: %[[bc:.]] = llvm.bitcast %[[ptr]] : !llvm<"float"> to !llvm<"i8*">		// CHECK-NEXT: %[[bc:.]] = llvm.bitcast %[[ptr]] : !llvm<"float"> to !llvm<"i8*">
// CHECK-NEXT: llvm.call @free(%[[bc]]) : (!llvm<"i8*">) -> ()		// CHECK-NEXT: llvm.call @free(%[[bc]]) : (!llvm<"i8*">) -> ()

// BAREPTR: %[[ptr:.]] = llvm.extractvalue %{{.}}[0] : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }">		// BAREPTR: %[[ptr:.]] = llvm.extractvalue %{{.}}[0] : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }">
// BAREPTR-NEXT: %[[bc:.]] = llvm.bitcast %[[ptr]] : !llvm<"float"> to !llvm<"i8*">		// BAREPTR-NEXT: %[[bc:.]] = llvm.bitcast %[[ptr]] : !llvm<"float"> to !llvm<"i8*">
// BAREPTR-NEXT: llvm.call @free(%[[bc]]) : (!llvm<"i8*">) -> ()		// BAREPTR-NEXT: llvm.call @free(%[[bc]]) : (!llvm<"i8*">) -> ()
dealloc %static : memref<10x8xf32>		dealloc %static : memref<10x8xf32>
return		return
}		}

// -----		// -----

// CHECK-LABEL: func @zero_d_load(%{{.}}: !llvm<"{ float, float, i64 }">) -> !llvm.float {		// CHECK-LABEL: func @zero_d_load
// BAREPTR-LABEL: func @zero_d_load(%{{.}}: !llvm<"float">) -> !llvm.float		// BAREPTR-LABEL: func @zero_d_load(%{{.}}: !llvm<"float">) -> !llvm.float
func @zero_d_load(%arg0: memref<f32>) -> f32 {		func @zero_d_load(%arg0: memref<f32>) -> f32 {
// CHECK-NEXT: %[[ld:.]] = llvm.load %{{.}} : !llvm<"{ float, float, i64 }*">		// CHECK: %[[ptr:.]] = llvm.extractvalue %{{.}}[1] : !llvm<"{ float, float, i64 }">
// CHECK-NEXT: %[[ptr:.]] = llvm.extractvalue %[[ld]][1] : !llvm<"{ float, float*, i64 }">
// CHECK-NEXT: %[[c0:.*]] = llvm.mlir.constant(0 : index) : !llvm.i64		// CHECK-NEXT: %[[c0:.*]] = llvm.mlir.constant(0 : index) : !llvm.i64
// CHECK-NEXT: %[[addr:.]] = llvm.getelementptr %[[ptr]][%[[c0]]] : (!llvm<"float">, !llvm.i64) -> !llvm<"float*">		// CHECK-NEXT: %[[addr:.]] = llvm.getelementptr %[[ptr]][%[[c0]]] : (!llvm<"float">, !llvm.i64) -> !llvm<"float*">
// CHECK-NEXT: %{{.}} = llvm.load %[[addr]] : !llvm<"float">		// CHECK-NEXT: %{{.}} = llvm.load %[[addr]] : !llvm<"float">

// BAREPTR: %[[ptr:.]] = llvm.extractvalue %{{.}}[1] : !llvm<"{ float, float, i64 }">		// BAREPTR: %[[ptr:.]] = llvm.extractvalue %{{.}}[1] : !llvm<"{ float, float, i64 }">
// BAREPTR-NEXT: %[[c0:.*]] = llvm.mlir.constant(0 : index) : !llvm.i64		// BAREPTR-NEXT: %[[c0:.*]] = llvm.mlir.constant(0 : index) : !llvm.i64
// BAREPTR-NEXT: %[[addr:.]] = llvm.getelementptr %[[ptr]][%[[c0]]] : (!llvm<"float">, !llvm.i64) -> !llvm<"float*">		// BAREPTR-NEXT: %[[addr:.]] = llvm.getelementptr %[[ptr]][%[[c0]]] : (!llvm<"float">, !llvm.i64) -> !llvm<"float*">
// BAREPTR-NEXT: llvm.load %[[addr:.]] : !llvm<"float">		// BAREPTR-NEXT: llvm.load %[[addr:.]] : !llvm<"float">
%0 = load %arg0[] : memref<f32>		%0 = load %arg0[] : memref<f32>
return %0 : f32		return %0 : f32
}		}

// -----		// -----

// CHECK-LABEL: func @static_load(		// CHECK-LABEL: func @static_load(
// CHECK-SAME: %[[A:.]]: !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }">, %[[I:.]]: !llvm.i64, %[[J:.]]: !llvm.i64		// CHECK-COUNT-2: !llvm<"float*">,
		// CHECK-COUNT-5: {{%[a-zA-Z0-9]*}}: !llvm.i64
		// CHECK: %[[I:.*]]: !llvm.i64,
		// CHECK: %[[J:.*]]: !llvm.i64)
// BAREPTR-LABEL: func @static_load		// BAREPTR-LABEL: func @static_load
// BAREPTR-SAME: (%[[A:.]]: !llvm<"float">, %[[I:.]]: !llvm.i64, %[[J:.]]: !llvm.i64) {		// BAREPTR-SAME: (%[[A:.]]: !llvm<"float">, %[[I:.]]: !llvm.i64, %[[J:.]]: !llvm.i64) {
func @static_load(%static : memref<10x42xf32>, %i : index, %j : index) {		func @static_load(%static : memref<10x42xf32>, %i : index, %j : index) {
// CHECK-NEXT: %[[ld:.]] = llvm.load %{{.}} : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }*">		// CHECK: %[[ptr:.]] = llvm.extractvalue %{{.}}[1] : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }">
// CHECK-NEXT: %[[ptr:.]] = llvm.extractvalue %[[ld]][1] : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">
// CHECK-NEXT: %[[off:.*]] = llvm.mlir.constant(0 : index) : !llvm.i64		// CHECK-NEXT: %[[off:.*]] = llvm.mlir.constant(0 : index) : !llvm.i64
// CHECK-NEXT: %[[st0:.*]] = llvm.mlir.constant(42 : index) : !llvm.i64		// CHECK-NEXT: %[[st0:.*]] = llvm.mlir.constant(42 : index) : !llvm.i64
// CHECK-NEXT: %[[offI:.*]] = llvm.mul %[[I]], %[[st0]] : !llvm.i64		// CHECK-NEXT: %[[offI:.*]] = llvm.mul %[[I]], %[[st0]] : !llvm.i64
// CHECK-NEXT: %[[off0:.*]] = llvm.add %[[off]], %[[offI]] : !llvm.i64		// CHECK-NEXT: %[[off0:.*]] = llvm.add %[[off]], %[[offI]] : !llvm.i64
// CHECK-NEXT: %[[st1:.*]] = llvm.mlir.constant(1 : index) : !llvm.i64		// CHECK-NEXT: %[[st1:.*]] = llvm.mlir.constant(1 : index) : !llvm.i64
// CHECK-NEXT: %[[offJ:.*]] = llvm.mul %[[J]], %[[st1]] : !llvm.i64		// CHECK-NEXT: %[[offJ:.*]] = llvm.mul %[[J]], %[[st1]] : !llvm.i64
// CHECK-NEXT: %[[off1:.*]] = llvm.add %[[off0]], %[[offJ]] : !llvm.i64		// CHECK-NEXT: %[[off1:.*]] = llvm.add %[[off0]], %[[offJ]] : !llvm.i64
// CHECK-NEXT: %[[addr:.]] = llvm.getelementptr %[[ptr]][%[[off1]]] : (!llvm<"float">, !llvm.i64) -> !llvm<"float*">		// CHECK-NEXT: %[[addr:.]] = llvm.getelementptr %[[ptr]][%[[off1]]] : (!llvm<"float">, !llvm.i64) -> !llvm<"float*">
// CHECK-NEXT: llvm.load %[[addr]] : !llvm<"float*">		// CHECK-NEXT: llvm.load %[[addr]] : !llvm<"float*">

// BAREPTR: %[[ptr:.]] = llvm.extractvalue %{{.}}[1] : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }">		// BAREPTR: %[[ptr:.]] = llvm.extractvalue %{{.}}[1] : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }">
// BAREPTR-NEXT: %[[off:.*]] = llvm.mlir.constant(0 : index) : !llvm.i64		// BAREPTR-NEXT: %[[off:.*]] = llvm.mlir.constant(0 : index) : !llvm.i64
// BAREPTR-NEXT: %[[st0:.*]] = llvm.mlir.constant(42 : index) : !llvm.i64		// BAREPTR-NEXT: %[[st0:.*]] = llvm.mlir.constant(42 : index) : !llvm.i64
// BAREPTR-NEXT: %[[offI:.*]] = llvm.mul %[[I]], %[[st0]] : !llvm.i64		// BAREPTR-NEXT: %[[offI:.*]] = llvm.mul %[[I]], %[[st0]] : !llvm.i64
// BAREPTR-NEXT: %[[off0:.*]] = llvm.add %[[off]], %[[offI]] : !llvm.i64		// BAREPTR-NEXT: %[[off0:.*]] = llvm.add %[[off]], %[[offI]] : !llvm.i64
// BAREPTR-NEXT: %[[st1:.*]] = llvm.mlir.constant(1 : index) : !llvm.i64		// BAREPTR-NEXT: %[[st1:.*]] = llvm.mlir.constant(1 : index) : !llvm.i64
// BAREPTR-NEXT: %[[offJ:.*]] = llvm.mul %[[J]], %[[st1]] : !llvm.i64		// BAREPTR-NEXT: %[[offJ:.*]] = llvm.mul %[[J]], %[[st1]] : !llvm.i64
// BAREPTR-NEXT: %[[off1:.*]] = llvm.add %[[off0]], %[[offJ]] : !llvm.i64		// BAREPTR-NEXT: %[[off1:.*]] = llvm.add %[[off0]], %[[offJ]] : !llvm.i64
// BAREPTR-NEXT: %[[addr:.]] = llvm.getelementptr %[[ptr]][%[[off1]]] : (!llvm<"float">, !llvm.i64) -> !llvm<"float*">		// BAREPTR-NEXT: %[[addr:.]] = llvm.getelementptr %[[ptr]][%[[off1]]] : (!llvm<"float">, !llvm.i64) -> !llvm<"float*">
// BAREPTR-NEXT: llvm.load %[[addr]] : !llvm<"float*">		// BAREPTR-NEXT: llvm.load %[[addr]] : !llvm<"float*">
%0 = load %static[%i, %j] : memref<10x42xf32>		%0 = load %static[%i, %j] : memref<10x42xf32>
return		return
}		}

// -----		// -----

// CHECK-LABEL: func @zero_d_store(%arg0: !llvm<"{ float, float, i64 }*">, %arg1: !llvm.float) {		// CHECK-LABEL: func @zero_d_store
// BAREPTR-LABEL: func @zero_d_store		// BAREPTR-LABEL: func @zero_d_store
// BAREPTR-SAME: (%[[A:.]]: !llvm<"float">, %[[val:.*]]: !llvm.float)		// BAREPTR-SAME: (%[[A:.]]: !llvm<"float">, %[[val:.*]]: !llvm.float)
func @zero_d_store(%arg0: memref<f32>, %arg1: f32) {		func @zero_d_store(%arg0: memref<f32>, %arg1: f32) {
// CHECK-NEXT: %[[ld:.]] = llvm.load %{{.}} : !llvm<"{ float, float, i64 }*">		// CHECK: %[[ptr:.]] = llvm.extractvalue %[[ld:.]][1] : !llvm<"{ float, float, i64 }">
// CHECK-NEXT: %[[ptr:.]] = llvm.extractvalue %[[ld]][1] : !llvm<"{ float, float*, i64 }">
// CHECK-NEXT: %[[off:.*]] = llvm.mlir.constant(0 : index) : !llvm.i64		// CHECK-NEXT: %[[off:.*]] = llvm.mlir.constant(0 : index) : !llvm.i64
// CHECK-NEXT: %[[addr:.]] = llvm.getelementptr %[[ptr]][%[[off]]] : (!llvm<"float">, !llvm.i64) -> !llvm<"float*">		// CHECK-NEXT: %[[addr:.]] = llvm.getelementptr %[[ptr]][%[[off]]] : (!llvm<"float">, !llvm.i64) -> !llvm<"float*">
// CHECK-NEXT: llvm.store %arg1, %[[addr]] : !llvm<"float*">		// CHECK-NEXT: llvm.store %{{.}}, %[[addr]] : !llvm<"float">

// BAREPTR: %[[ptr:.]] = llvm.extractvalue %{{.}}[1] : !llvm<"{ float, float, i64 }">		// BAREPTR: %[[ptr:.]] = llvm.extractvalue %{{.}}[1] : !llvm<"{ float, float, i64 }">
// BAREPTR-NEXT: %[[off:.*]] = llvm.mlir.constant(0 : index) : !llvm.i64		// BAREPTR-NEXT: %[[off:.*]] = llvm.mlir.constant(0 : index) : !llvm.i64
// BAREPTR-NEXT: %[[addr:.]] = llvm.getelementptr %[[ptr]][%[[off]]] : (!llvm<"float">, !llvm.i64) -> !llvm<"float*">		// BAREPTR-NEXT: %[[addr:.]] = llvm.getelementptr %[[ptr]][%[[off]]] : (!llvm<"float">, !llvm.i64) -> !llvm<"float*">
// BAREPTR-NEXT: llvm.store %[[val]], %[[addr]] : !llvm<"float*">		// BAREPTR-NEXT: llvm.store %[[val]], %[[addr]] : !llvm<"float*">
store %arg1, %arg0[] : memref<f32>		store %arg1, %arg0[] : memref<f32>
return		return
}		}

// -----		// -----

// CHECK-LABEL: func @static_store		// CHECK-LABEL: func @static_store
// BAREPTR-LABEL: func @static_store		// BAREPTR-LABEL: func @static_store
// BAREPTR-SAME: %[[A:.]]: !llvm<"float">		// BAREPTR-SAME: %[[A:.]]: !llvm<"float">
func @static_store(%static : memref<10x42xf32>, %i : index, %j : index, %val : f32) {		func @static_store(%static : memref<10x42xf32>, %i : index, %j : index, %val : f32) {
// CHECK-NEXT: %[[ld:.]] = llvm.load %{{.}} : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }*">		// CHECK: %[[ptr:.]] = llvm.extractvalue %{{.}}[1] : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }">
// CHECK-NEXT: %[[ptr:.]] = llvm.extractvalue %[[ld]][1] : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">
// CHECK-NEXT: %[[off:.*]] = llvm.mlir.constant(0 : index) : !llvm.i64		// CHECK-NEXT: %[[off:.*]] = llvm.mlir.constant(0 : index) : !llvm.i64
// CHECK-NEXT: %[[st0:.*]] = llvm.mlir.constant(42 : index) : !llvm.i64		// CHECK-NEXT: %[[st0:.*]] = llvm.mlir.constant(42 : index) : !llvm.i64
// CHECK-NEXT: %[[offI:.*]] = llvm.mul %[[I]], %[[st0]] : !llvm.i64		// CHECK-NEXT: %[[offI:.*]] = llvm.mul %[[I]], %[[st0]] : !llvm.i64
// CHECK-NEXT: %[[off0:.*]] = llvm.add %[[off]], %[[offI]] : !llvm.i64		// CHECK-NEXT: %[[off0:.*]] = llvm.add %[[off]], %[[offI]] : !llvm.i64
// CHECK-NEXT: %[[st1:.*]] = llvm.mlir.constant(1 : index) : !llvm.i64		// CHECK-NEXT: %[[st1:.*]] = llvm.mlir.constant(1 : index) : !llvm.i64
// CHECK-NEXT: %[[offJ:.*]] = llvm.mul %[[J]], %[[st1]] : !llvm.i64		// CHECK-NEXT: %[[offJ:.*]] = llvm.mul %[[J]], %[[st1]] : !llvm.i64
// CHECK-NEXT: %[[off1:.*]] = llvm.add %[[off0]], %[[offJ]] : !llvm.i64		// CHECK-NEXT: %[[off1:.*]] = llvm.add %[[off0]], %[[offJ]] : !llvm.i64
// CHECK-NEXT: %[[addr:.]] = llvm.getelementptr %[[ptr]][%[[off1]]] : (!llvm<"float">, !llvm.i64) -> !llvm<"float*">		// CHECK-NEXT: %[[addr:.]] = llvm.getelementptr %[[ptr]][%[[off1]]] : (!llvm<"float">, !llvm.i64) -> !llvm<"float*">
// CHECK-NEXT: llvm.store %arg3, %[[addr]] : !llvm<"float*">		// CHECK-NEXT: llvm.store %{{.}}, %[[addr]] : !llvm<"float">

// BAREPTR: %[[ptr:.]] = llvm.extractvalue %{{.}}[1] : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }">		// BAREPTR: %[[ptr:.]] = llvm.extractvalue %{{.}}[1] : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }">
// BAREPTR-NEXT: %[[off:.*]] = llvm.mlir.constant(0 : index) : !llvm.i64		// BAREPTR-NEXT: %[[off:.*]] = llvm.mlir.constant(0 : index) : !llvm.i64
// BAREPTR-NEXT: %[[st0:.*]] = llvm.mlir.constant(42 : index) : !llvm.i64		// BAREPTR-NEXT: %[[st0:.*]] = llvm.mlir.constant(42 : index) : !llvm.i64
// BAREPTR-NEXT: %[[offI:.*]] = llvm.mul %[[I]], %[[st0]] : !llvm.i64		// BAREPTR-NEXT: %[[offI:.*]] = llvm.mul %[[I]], %[[st0]] : !llvm.i64
// BAREPTR-NEXT: %[[off0:.*]] = llvm.add %[[off]], %[[offI]] : !llvm.i64		// BAREPTR-NEXT: %[[off0:.*]] = llvm.add %[[off]], %[[offI]] : !llvm.i64
// BAREPTR-NEXT: %[[st1:.*]] = llvm.mlir.constant(1 : index) : !llvm.i64		// BAREPTR-NEXT: %[[st1:.*]] = llvm.mlir.constant(1 : index) : !llvm.i64
// BAREPTR-NEXT: %[[offJ:.*]] = llvm.mul %[[J]], %[[st1]] : !llvm.i64		// BAREPTR-NEXT: %[[offJ:.*]] = llvm.mul %[[J]], %[[st1]] : !llvm.i64
// BAREPTR-NEXT: %[[off1:.*]] = llvm.add %[[off0]], %[[offJ]] : !llvm.i64		// BAREPTR-NEXT: %[[off1:.*]] = llvm.add %[[off0]], %[[offJ]] : !llvm.i64
// BAREPTR-NEXT: %[[addr:.]] = llvm.getelementptr %[[ptr]][%[[off1]]] : (!llvm<"float">, !llvm.i64) -> !llvm<"float*">		// BAREPTR-NEXT: %[[addr:.]] = llvm.getelementptr %[[ptr]][%[[off1]]] : (!llvm<"float">, !llvm.i64) -> !llvm<"float*">
// BAREPTR-NEXT: llvm.store %{{.}}, %[[addr]] : !llvm<"float">		// BAREPTR-NEXT: llvm.store %{{.}}, %[[addr]] : !llvm<"float">
store %val, %static[%i, %j] : memref<10x42xf32>		store %val, %static[%i, %j] : memref<10x42xf32>
return		return
}		}

// -----		// -----

// CHECK-LABEL: func @static_memref_dim(%arg0: !llvm<"{ float, float, i64, [5 x i64], [5 x i64] }*">) {		// CHECK-LABEL: func @static_memref_dim
// BAREPTR-LABEL: func @static_memref_dim(%{{.}}: !llvm<"float">) {		// BAREPTR-LABEL: func @static_memref_dim(%{{.}}: !llvm<"float">) {
func @static_memref_dim(%static : memref<42x32x15x13x27xf32>) {		func @static_memref_dim(%static : memref<42x32x15x13x27xf32>) {
// CHECK-NEXT: %[[ld:.]] = llvm.load %{{.}} : !llvm<"{ float, float, i64, [5 x i64], [5 x i64] }*">		// CHECK: llvm.mlir.constant(42 : index) : !llvm.i64
// CHECK-NEXT: llvm.mlir.constant(42 : index) : !llvm.i64
// BAREPTR: llvm.insertvalue %{{.}}, %{{.}}[4, 0] : !llvm<"{ float, float, i64, [5 x i64], [5 x i64] }">		// BAREPTR: llvm.insertvalue %{{.}}, %{{.}}[4, 0] : !llvm<"{ float, float, i64, [5 x i64], [5 x i64] }">
// BAREPTR-NEXT: llvm.mlir.constant(42 : index) : !llvm.i64		// BAREPTR-NEXT: llvm.mlir.constant(42 : index) : !llvm.i64
%0 = dim %static, 0 : memref<42x32x15x13x27xf32>		%0 = dim %static, 0 : memref<42x32x15x13x27xf32>
// CHECK-NEXT: llvm.mlir.constant(32 : index) : !llvm.i64		// CHECK-NEXT: llvm.mlir.constant(32 : index) : !llvm.i64
// BAREPTR-NEXT: llvm.mlir.constant(32 : index) : !llvm.i64		// BAREPTR-NEXT: llvm.mlir.constant(32 : index) : !llvm.i64
%1 = dim %static, 1 : memref<42x32x15x13x27xf32>		%1 = dim %static, 1 : memref<42x32x15x13x27xf32>
// CHECK-NEXT: llvm.mlir.constant(15 : index) : !llvm.i64		// CHECK-NEXT: llvm.mlir.constant(15 : index) : !llvm.i64
// BAREPTR-NEXT: llvm.mlir.constant(15 : index) : !llvm.i64		// BAREPTR-NEXT: llvm.mlir.constant(15 : index) : !llvm.i64
Show All 9 Lines

mlir/test/Conversion/StandardToLLVM/convert-to-llvmir.mlir

Show First 20 Lines • Show All 722 Lines • ▼ Show 20 Lines	func @view(%arg0 : index, %arg1 : index, %arg2 : index) {
// CHECK: llvm.insertvalue %{{.}}, %{{.}}[4, 0] : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }">		// CHECK: llvm.insertvalue %{{.}}, %{{.}}[4, 0] : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }">
%6 = view %0[%arg2][%arg0, %arg1]		%6 = view %0[%arg2][%arg0, %arg1]
: memref<2048xi8> to memref<?x?xf32, affine_map<(d0, d1)[s0, s1] -> (d0 * s0 + d1 + s1)>>		: memref<2048xi8> to memref<?x?xf32, affine_map<(d0, d1)[s0, s1] -> (d0 * s0 + d1 + s1)>>

return		return
}		}

// CHECK-LABEL: func @subview(		// CHECK-LABEL: func @subview(
// CHECK: %[[MEMREFPTR:.]]: !llvm<{{.}}>, %[[ARG0:.]]: !llvm.i64, %[[ARG1:.]]: !llvm.i64, %[[ARG2:.*]]: !llvm.i64		// CHECK-COUNT-2: !llvm<"float*">,
		// CHECK-COUNT-5: {{%[a-zA-Z0-9]*}}: !llvm.i64,
		// CHECK: %[[ARG0:[a-zA-Z0-9]*]]: !llvm.i64,
		// CHECK: %[[ARG1:[a-zA-Z0-9]*]]: !llvm.i64,
		// CHECK: %[[ARG2:.*]]: !llvm.i64)
func @subview(%0 : memref<64x4xf32, affine_map<(d0, d1) -> (d0 * 4 + d1)>>, %arg0 : index, %arg1 : index, %arg2 : index) {		func @subview(%0 : memref<64x4xf32, affine_map<(d0, d1) -> (d0 * 4 + d1)>>, %arg0 : index, %arg1 : index, %arg2 : index) {
// CHECK: %[[MEMREF:.]] = llvm.load %[[MEMREFPTR]] : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }">		// The last "insertvalue" that populates the memref descriptor from the function arguments.
		// CHECK: %[[MEMREF:.]] = llvm.insertvalue %{{.}}, %{{.*}}[4, 1]

// CHECK: %[[DESC:.]] = llvm.mlir.undef : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">		// CHECK: %[[DESC:.]] = llvm.mlir.undef : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">
// CHECK: %[[DESC0:.]] = llvm.insertvalue %{{.}}, %[[DESC]][0] : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }">		// CHECK: %[[DESC0:.]] = llvm.insertvalue %{{.}}, %[[DESC]][0] : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }">
// CHECK: %[[DESC1:.]] = llvm.insertvalue %{{.}}, %[[DESC0]][1] : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }">		// CHECK: %[[DESC1:.]] = llvm.insertvalue %{{.}}, %[[DESC0]][1] : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }">
// CHECK: %[[STRIDE0:.]] = llvm.extractvalue %[[MEMREF]][4, 0] : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">		// CHECK: %[[STRIDE0:.]] = llvm.extractvalue %[[MEMREF]][4, 0] : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">
// CHECK: %[[STRIDE1:.]] = llvm.extractvalue %[[MEMREF]][4, 1] : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">		// CHECK: %[[STRIDE1:.]] = llvm.extractvalue %[[MEMREF]][4, 1] : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">
// CHECK: %[[OFF:.]] = llvm.extractvalue %[[MEMREF]][2] : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">		// CHECK: %[[OFF:.]] = llvm.extractvalue %[[MEMREF]][2] : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">
// CHECK: %[[OFFINC:.*]] = llvm.mul %[[ARG0]], %[[STRIDE0]] : !llvm.i64		// CHECK: %[[OFFINC:.*]] = llvm.mul %[[ARG0]], %[[STRIDE0]] : !llvm.i64
// CHECK: %[[OFF1:.*]] = llvm.add %[[OFF]], %[[OFFINC]] : !llvm.i64		// CHECK: %[[OFF1:.*]] = llvm.add %[[OFF]], %[[OFFINC]] : !llvm.i64
// CHECK: %[[OFFINC1:.*]] = llvm.mul %[[ARG1]], %[[STRIDE1]] : !llvm.i64		// CHECK: %[[OFFINC1:.*]] = llvm.mul %[[ARG1]], %[[STRIDE1]] : !llvm.i64
// CHECK: %[[OFF2:.*]] = llvm.add %[[OFF1]], %[[OFFINC1]] : !llvm.i64		// CHECK: %[[OFF2:.*]] = llvm.add %[[OFF1]], %[[OFFINC1]] : !llvm.i64
// CHECK: %[[DESC2:.]] = llvm.insertvalue %[[OFF2]], %[[DESC1]][2] : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">		// CHECK: %[[DESC2:.]] = llvm.insertvalue %[[OFF2]], %[[DESC1]][2] : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">
// CHECK: %[[DESC3:.]] = llvm.insertvalue %[[ARG1]], %[[DESC2]][3, 1] : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">		// CHECK: %[[DESC3:.]] = llvm.insertvalue %[[ARG1]], %[[DESC2]][3, 1] : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">
// CHECK: %[[DESCSTRIDE1:.*]] = llvm.mul %[[ARG1]], %[[STRIDE1]] : !llvm.i64		// CHECK: %[[DESCSTRIDE1:.*]] = llvm.mul %[[ARG1]], %[[STRIDE1]] : !llvm.i64
// CHECK: %[[DESC4:.]] = llvm.insertvalue %[[DESCSTRIDE1]], %[[DESC3]][4, 1] : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">		// CHECK: %[[DESC4:.]] = llvm.insertvalue %[[DESCSTRIDE1]], %[[DESC3]][4, 1] : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">
// CHECK: %[[DESC5:.]] = llvm.insertvalue %[[ARG0]], %[[DESC4]][3, 0] : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">		// CHECK: %[[DESC5:.]] = llvm.insertvalue %[[ARG0]], %[[DESC4]][3, 0] : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">
// CHECK: %[[DESCSTRIDE0:.*]] = llvm.mul %[[ARG0]], %[[STRIDE0]] : !llvm.i64		// CHECK: %[[DESCSTRIDE0:.*]] = llvm.mul %[[ARG0]], %[[STRIDE0]] : !llvm.i64
// CHECK: llvm.insertvalue %[[DESCSTRIDE0]], %[[DESC5]][4, 0] : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }">		// CHECK: llvm.insertvalue %[[DESCSTRIDE0]], %[[DESC5]][4, 0] : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }">
%1 = subview %0[%arg0, %arg1][%arg0, %arg1][%arg0, %arg1] :		%1 = subview %0[%arg0, %arg1][%arg0, %arg1][%arg0, %arg1] :
memref<64x4xf32, affine_map<(d0, d1) -> (d0 * 4 + d1)>> to memref<?x?xf32, affine_map<(d0, d1)[s0, s1, s2] -> (d0 * s1 + d1 * s2 + s0)>>		memref<64x4xf32, affine_map<(d0, d1) -> (d0 * 4 + d1)>> to memref<?x?xf32, affine_map<(d0, d1)[s0, s1, s2] -> (d0 * s1 + d1 * s2 + s0)>>
return		return
}		}

// CHECK-LABEL: func @subview_const_size(		// CHECK-LABEL: func @subview_const_size(
// CHECK: %[[MEMREFPTR:.]]: !llvm<{{.}}>, %[[ARG0:.]]: !llvm.i64, %[[ARG1:.]]: !llvm.i64, %[[ARG2:.*]]: !llvm.i64
func @subview_const_size(%0 : memref<64x4xf32, affine_map<(d0, d1) -> (d0 * 4 + d1)>>, %arg0 : index, %arg1 : index, %arg2 : index) {		func @subview_const_size(%0 : memref<64x4xf32, affine_map<(d0, d1) -> (d0 * 4 + d1)>>, %arg0 : index, %arg1 : index, %arg2 : index) {
// CHECK: %[[MEMREF:.]] = llvm.load %[[MEMREFPTR]] : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }">		// The last "insertvalue" that populates the memref descriptor from the function arguments.
		// CHECK: %[[MEMREF:.]] = llvm.insertvalue %{{.}}, %{{.*}}[4, 1]

// CHECK: %[[DESC:.]] = llvm.mlir.undef : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">		// CHECK: %[[DESC:.]] = llvm.mlir.undef : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">
// CHECK: %[[DESC0:.]] = llvm.insertvalue %{{.}}, %[[DESC]][0] : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }">		// CHECK: %[[DESC0:.]] = llvm.insertvalue %{{.}}, %[[DESC]][0] : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }">
// CHECK: %[[DESC1:.]] = llvm.insertvalue %{{.}}, %[[DESC0]][1] : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }">		// CHECK: %[[DESC1:.]] = llvm.insertvalue %{{.}}, %[[DESC0]][1] : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }">
// CHECK: %[[STRIDE0:.]] = llvm.extractvalue %[[MEMREF]][4, 0] : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">		// CHECK: %[[STRIDE0:.]] = llvm.extractvalue %[[MEMREF]][4, 0] : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">
// CHECK: %[[STRIDE1:.]] = llvm.extractvalue %[[MEMREF]][4, 1] : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">		// CHECK: %[[STRIDE1:.]] = llvm.extractvalue %[[MEMREF]][4, 1] : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">
// CHECK: %[[CST4:.*]] = llvm.mlir.constant(4 : i64)		// CHECK: %[[CST4:.*]] = llvm.mlir.constant(4 : i64)
// CHECK: %[[CST2:.*]] = llvm.mlir.constant(2 : i64)		// CHECK: %[[CST2:.*]] = llvm.mlir.constant(2 : i64)
// CHECK: %[[OFF:.]] = llvm.extractvalue %[[MEMREF]][2] : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">		// CHECK: %[[OFF:.]] = llvm.extractvalue %[[MEMREF]][2] : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">
Show All 9 Lines	func @subview_const_size(%0 : memref<64x4xf32, affine_map<(d0, d1) -> (d0 * 4 + d1)>>, %arg0 : index, %arg1 : index, %arg2 : index) {
// CHECK: %[[DESCSTRIDE0:.*]] = llvm.mul %[[ARG0]], %[[STRIDE0]] : !llvm.i64		// CHECK: %[[DESCSTRIDE0:.*]] = llvm.mul %[[ARG0]], %[[STRIDE0]] : !llvm.i64
// CHECK: llvm.insertvalue %[[DESCSTRIDE0]], %[[DESC5]][4, 0] : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }">		// CHECK: llvm.insertvalue %[[DESCSTRIDE0]], %[[DESC5]][4, 0] : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }">
%1 = subview %0[%arg0, %arg1][][%arg0, %arg1] :		%1 = subview %0[%arg0, %arg1][][%arg0, %arg1] :
memref<64x4xf32, affine_map<(d0, d1) -> (d0 * 4 + d1)>> to memref<4x2xf32, affine_map<(d0, d1)[s0, s1, s2] -> (d0 * s1 + d1 * s2 + s0)>>		memref<64x4xf32, affine_map<(d0, d1) -> (d0 * 4 + d1)>> to memref<4x2xf32, affine_map<(d0, d1)[s0, s1, s2] -> (d0 * s1 + d1 * s2 + s0)>>
return		return
}		}

// CHECK-LABEL: func @subview_const_stride(		// CHECK-LABEL: func @subview_const_stride(
// CHECK: %[[MEMREFPTR:.]]: !llvm<{{.}}>, %[[ARG0:.]]: !llvm.i64, %[[ARG1:.]]: !llvm.i64, %[[ARG2:.*]]: !llvm.i64
func @subview_const_stride(%0 : memref<64x4xf32, affine_map<(d0, d1) -> (d0 * 4 + d1)>>, %arg0 : index, %arg1 : index, %arg2 : index) {		func @subview_const_stride(%0 : memref<64x4xf32, affine_map<(d0, d1) -> (d0 * 4 + d1)>>, %arg0 : index, %arg1 : index, %arg2 : index) {
// CHECK: %[[MEMREF:.]] = llvm.load %[[MEMREFPTR]] : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }">		// The last "insertvalue" that populates the memref descriptor from the function arguments.
		// CHECK: %[[MEMREF:.]] = llvm.insertvalue %{{.}}, %{{.*}}[4, 1]

// CHECK: %[[DESC:.]] = llvm.mlir.undef : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">		// CHECK: %[[DESC:.]] = llvm.mlir.undef : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">
// CHECK: %[[DESC0:.]] = llvm.insertvalue %{{.}}, %[[DESC]][0] : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }">		// CHECK: %[[DESC0:.]] = llvm.insertvalue %{{.}}, %[[DESC]][0] : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }">
// CHECK: %[[DESC1:.]] = llvm.insertvalue %{{.}}, %[[DESC0]][1] : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }">		// CHECK: %[[DESC1:.]] = llvm.insertvalue %{{.}}, %[[DESC0]][1] : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }">
// CHECK: %[[STRIDE0:.]] = llvm.extractvalue %[[MEMREF]][4, 0] : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">		// CHECK: %[[STRIDE0:.]] = llvm.extractvalue %[[MEMREF]][4, 0] : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">
// CHECK: %[[STRIDE1:.]] = llvm.extractvalue %[[MEMREF]][4, 1] : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">		// CHECK: %[[STRIDE1:.]] = llvm.extractvalue %[[MEMREF]][4, 1] : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">
// CHECK: %[[OFF:.]] = llvm.extractvalue %[[MEMREF]][2] : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">		// CHECK: %[[OFF:.]] = llvm.extractvalue %[[MEMREF]][2] : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">
// CHECK: %[[OFFINC:.*]] = llvm.mul %[[ARG0]], %[[STRIDE0]] : !llvm.i64		// CHECK: %[[OFFINC:.*]] = llvm.mul %[[ARG0]], %[[STRIDE0]] : !llvm.i64
// CHECK: %[[OFF1:.*]] = llvm.add %[[OFF]], %[[OFFINC]] : !llvm.i64		// CHECK: %[[OFF1:.*]] = llvm.add %[[OFF]], %[[OFFINC]] : !llvm.i64
Show All 29 Lines

mlir/test/Conversion/StandardToLLVM/standard-to-llvm.mlir

	// RUN: mlir-opt %s -convert-std-to-llvm -split-input-file -verify-diagnostics \| FileCheck %s			// RUN: mlir-opt %s -convert-std-to-llvm -split-input-file -verify-diagnostics \| FileCheck %s

	// CHECK-LABEL: func @address_space(			// CHECK-LABEL: func @address_space(
	// CHECK: %{{.}}: !llvm<"{ float addrspace(7), float addrspace(7), i64, [1 x i64], [1 x i64] }">)			// CHECK-SAME: !llvm<"float addrspace(7)*">
	// CHECK: llvm.load %{{.}} : !llvm<"{ float addrspace(7), float addrspace(7), i64, [1 x i64], [1 x i64] }">
	func @address_space(%arg0 : memref<32xf32, affine_map<(d0) -> (d0)>, 7>) {			func @address_space(%arg0 : memref<32xf32, affine_map<(d0) -> (d0)>, 7>) {
	%0 = alloc() : memref<32xf32, affine_map<(d0) -> (d0)>, 5>			%0 = alloc() : memref<32xf32, affine_map<(d0) -> (d0)>, 5>
	%1 = constant 7 : index			%1 = constant 7 : index
	// CHECK: llvm.load %{{.}} : !llvm<"float addrspace(5)">			// CHECK: llvm.load %{{.}} : !llvm<"float addrspace(5)">
	%2 = load %0[%1] : memref<32xf32, affine_map<(d0) -> (d0)>, 5>			%2 = load %0[%1] : memref<32xf32, affine_map<(d0) -> (d0)>, 5>
	std.return			std.return
	}			}

	Show All 17 Lines

mlir/test/Dialect/GPU/invalid.mlir

Show First 20 Lines • Show All 169 Lines • ▼ Show 20 Lines	"gpu.launch_func"(%sz, %sz, %sz, %sz, %sz, %sz, %arg, %arg)
: (index, index, index, index, index, index, !llvm<"float*">,		: (index, index, index, index, index, index, !llvm<"float*">,
!llvm<"float*">) -> ()		!llvm<"float*">) -> ()
return		return
}		}
}		}

// -----		// -----

		module attributes {gpu.container_module} {
gpu.module @kernels {		gpu.module @kernels {
gpu.func @kernel_1(%arg1 : !llvm<"float*">) attributes { gpu.kernel } {		gpu.func @kernel_1(%arg1 : f32) attributes { gpu.kernel } {
gpu.return		gpu.return
}		}
}		}

// Due to the ordering of the current impl of lowering and LLVMLowering, type		func @launch_func_kernel_operand_types(%sz : index, %arg : f32) {
// checks need to be temporarily disabled.		// expected-err@+1 {{type of function argument 0 does not match}}
// TODO(ntv,zinenko,herhut): reactivate checks once "changing gpu.launchFunc		"gpu.launch_func"(%sz, %sz, %sz, %sz, %sz, %sz, %arg)
// to encode target module" has landed.		{kernel = "kernel_1", kernel_module = @kernels}
// func @launch_func_kernel_operand_types(%sz : index, %arg : f32) {		: (index, index, index, index, index, index, f32) -> ()
// // expected-err@+1 {{type of function argument 0 does not match}}		return
// "gpu.launch_func"(%sz, %sz, %sz, %sz, %sz, %sz, %arg)		}
// {kernel = "kernel_1"}		}
// : (index, index, index, index, index, index, f32) -> ()
// return
// }

// -----		// -----

func @illegal_dimension() {		func @illegal_dimension() {
// expected-error@+1 {{dimension "o" is invalid}}		// expected-error@+1 {{dimension "o" is invalid}}
%tIdX = "gpu.thread_id"() {dimension = "o"} : () -> (index)		%tIdX = "gpu.thread_id"() {dimension = "o"} : () -> (index)

return		return
▲ Show 20 Lines • Show All 211 Lines • Show Last 20 Lines

mlir/test/Dialect/Linalg/llvm.mlir

Show First 20 Lines • Show All 46 Lines • ▼ Show 20 Lines
// insert size and stride		// insert size and stride
// CHECK-NEXT: llvm.insertvalue %{{.}}, %{{.}}[3, 0] : !llvm<"{ float, float, i64, [1 x i64], [1 x i64] }">		// CHECK-NEXT: llvm.insertvalue %{{.}}, %{{.}}[3, 0] : !llvm<"{ float, float, i64, [1 x i64], [1 x i64] }">
// CHECK-NEXT: llvm.insertvalue %{{.}}, %{{.}}[4, 0] : !llvm<"{ float, float, i64, [1 x i64], [1 x i64] }">		// CHECK-NEXT: llvm.insertvalue %{{.}}, %{{.}}[4, 0] : !llvm<"{ float, float, i64, [1 x i64], [1 x i64] }">

func @dot(%arg0: memref<?xf32, offset: ?, strides: [1]>, %arg1: memref<?xf32, offset: ?, strides: [1]>, %arg2: memref<f32>) {		func @dot(%arg0: memref<?xf32, offset: ?, strides: [1]>, %arg1: memref<?xf32, offset: ?, strides: [1]>, %arg2: memref<f32>) {
linalg.dot(%arg0, %arg1, %arg2) : memref<?xf32, offset: ?, strides: [1]>, memref<?xf32, offset: ?, strides: [1]>, memref<f32>		linalg.dot(%arg0, %arg1, %arg2) : memref<?xf32, offset: ?, strides: [1]>, memref<?xf32, offset: ?, strides: [1]>, memref<f32>
return		return
}		}
// CHECK-LABEL: func @dot(%{{.}}: !llvm<"{ float, float, i64, [1 x i64], [1 x i64] }">, %{{.}}: !llvm<"{ float, float, i64, [1 x i64], [1 x i64] }">, %{{.}}: !llvm<"{ float, float, i64 }">) {		// CHECK-LABEL: func @dot
// CHECK-COUNT-3: llvm.mlir.constant(1 : index){{.[[:space:]].}}llvm.alloca{{.[[:space:]].}}llvm.store		// CHECK: llvm.call @linalg_dot_viewsxf32_viewsxf32_viewf32(%{{.*}}) :
// CHECK-NEXT: llvm.call @linalg_dot_viewsxf32_viewsxf32_viewf32(%{{.}}, %{{.}}, %{{.}}) : (!llvm<"{ float, float, i64, [1 x i64], [1 x i64] }">, !llvm<"{ float, float, i64, [1 x i64], [1 x i64] }">, !llvm<"{ float, float, i64 }">) -> ()		// CHECK-SAME: !llvm<"float">, !llvm<"float">, !llvm.i64, !llvm.i64, !llvm.i64
		// CHECK-SAME: !llvm<"float">, !llvm<"float">, !llvm.i64, !llvm.i64, !llvm.i64
		// CHECK-SAME: !llvm<"float">, !llvm<"float">, !llvm.i64

func @slice_with_range_and_index(%arg0: memref<?x?xf64, offset: ?, strides: [?, 1]>) {		func @slice_with_range_and_index(%arg0: memref<?x?xf64, offset: ?, strides: [?, 1]>) {
%c0 = constant 0 : index		%c0 = constant 0 : index
%c1 = constant 1 : index		%c1 = constant 1 : index
%R = linalg.range %c0:%c1:%c1 : !linalg.range		%R = linalg.range %c0:%c1:%c1 : !linalg.range
loop.for %i0 = %c0 to %c1 step %c1 {		loop.for %i0 = %c0 to %c1 step %c1 {
%1 = linalg.slice %arg0[%i0, %R] : memref<?x?xf64, offset: ?, strides: [?, 1]>, index, !linalg.range, memref<?xf64, offset: ?, strides: [1]>		%1 = linalg.slice %arg0[%i0, %R] : memref<?x?xf64, offset: ?, strides: [?, 1]>, index, !linalg.range, memref<?xf64, offset: ?, strides: [1]>
}		}
Show All 12 Lines
// CHECK: llvm.insertvalue %{{.}}[3, 0] : !llvm<"{ double, double*, i64, [1 x i64], [1 x i64] }">		// CHECK: llvm.insertvalue %{{.}}[3, 0] : !llvm<"{ double, double*, i64, [1 x i64], [1 x i64] }">
// CHECK: llvm.insertvalue %{{.}}[4, 0] : !llvm<"{ double, double*, i64, [1 x i64], [1 x i64] }">		// CHECK: llvm.insertvalue %{{.}}[4, 0] : !llvm<"{ double, double*, i64, [1 x i64], [1 x i64] }">

func @copy(%arg0: memref<?x?x?xf32, offset: ?, strides: [?, ?, 1]>, %arg1: memref<?x?x?xf32, offset: ?, strides: [?, ?, 1]>) {		func @copy(%arg0: memref<?x?x?xf32, offset: ?, strides: [?, ?, 1]>, %arg1: memref<?x?x?xf32, offset: ?, strides: [?, ?, 1]>) {
linalg.copy(%arg0, %arg1) : memref<?x?x?xf32, offset: ?, strides: [?, ?, 1]>, memref<?x?x?xf32, offset: ?, strides: [?, ?, 1]>		linalg.copy(%arg0, %arg1) : memref<?x?x?xf32, offset: ?, strides: [?, ?, 1]>, memref<?x?x?xf32, offset: ?, strides: [?, ?, 1]>
return		return
}		}
// CHECK-LABEL: func @copy		// CHECK-LABEL: func @copy
// CHECK: llvm.call @linalg_copy_viewsxsxsxf32_viewsxsxsxf32(%{{.}}, %{{.}}) : (!llvm<"{ float, float, i64, [3 x i64], [3 x i64] }">, !llvm<"{ float, float, i64, [3 x i64], [3 x i64] }">) -> ()		// CHECK: llvm.call @linalg_copy_viewsxsxsxf32_viewsxsxsxf32({{.*}}) :
		// CHECK-SAME: !llvm<"float">, !llvm<"float">, !llvm.i64, !llvm.i64, !llvm.i64, !llvm.i64, !llvm.i64, !llvm.i64, !llvm.i64
		// CHECK-SAME: !llvm<"float">, !llvm<"float">, !llvm.i64, !llvm.i64, !llvm.i64, !llvm.i64, !llvm.i64, !llvm.i64, !llvm.i64

func @transpose(%arg0: memref<?x?x?xf32, offset: ?, strides: [?, ?, 1]>) {		func @transpose(%arg0: memref<?x?x?xf32, offset: ?, strides: [?, ?, 1]>) {
%0 = linalg.transpose %arg0 (i, j, k) -> (k, i, j) : memref<?x?x?xf32, offset: ?, strides: [?, ?, 1]>		%0 = linalg.transpose %arg0 (i, j, k) -> (k, i, j) : memref<?x?x?xf32, offset: ?, strides: [?, ?, 1]>
return		return
}		}
// CHECK-LABEL: func @transpose		// CHECK-LABEL: func @transpose
// CHECK: llvm.mlir.undef : !llvm<"{ float, float, i64, [3 x i64], [3 x i64] }">		// CHECK: llvm.mlir.undef : !llvm<"{ float, float, i64, [3 x i64], [3 x i64] }">
// CHECK: llvm.insertvalue {{.}}[0] : !llvm<"{ float, float*, i64, [3 x i64], [3 x i64] }">		// CHECK: llvm.insertvalue {{.}}[0] : !llvm<"{ float, float*, i64, [3 x i64], [3 x i64] }">
Show All 28 Lines
// CHECK: llvm.insertvalue {{.}}[1] : !llvm<"{ float, float*, i64, [3 x i64], [3 x i64] }">		// CHECK: llvm.insertvalue {{.}}[1] : !llvm<"{ float, float*, i64, [3 x i64], [3 x i64] }">
// CHECK: llvm.insertvalue {{.}}[2] : !llvm<"{ float, float*, i64, [3 x i64], [3 x i64] }">		// CHECK: llvm.insertvalue {{.}}[2] : !llvm<"{ float, float*, i64, [3 x i64], [3 x i64] }">
// CHECK: llvm.extractvalue {{.}}[3, 0] : !llvm<"{ float, float*, i64, [3 x i64], [3 x i64] }">		// CHECK: llvm.extractvalue {{.}}[3, 0] : !llvm<"{ float, float*, i64, [3 x i64], [3 x i64] }">
// CHECK: llvm.insertvalue {{.}}[3, 2] : !llvm<"{ float, float*, i64, [3 x i64], [3 x i64] }">		// CHECK: llvm.insertvalue {{.}}[3, 2] : !llvm<"{ float, float*, i64, [3 x i64], [3 x i64] }">
// CHECK: llvm.extractvalue {{.}}[3, 1] : !llvm<"{ float, float*, i64, [3 x i64], [3 x i64] }">		// CHECK: llvm.extractvalue {{.}}[3, 1] : !llvm<"{ float, float*, i64, [3 x i64], [3 x i64] }">
// CHECK: llvm.insertvalue {{.}}[3, 1] : !llvm<"{ float, float*, i64, [3 x i64], [3 x i64] }">		// CHECK: llvm.insertvalue {{.}}[3, 1] : !llvm<"{ float, float*, i64, [3 x i64], [3 x i64] }">
// CHECK: llvm.extractvalue {{.}}[3, 2] : !llvm<"{ float, float*, i64, [3 x i64], [3 x i64] }">		// CHECK: llvm.extractvalue {{.}}[3, 2] : !llvm<"{ float, float*, i64, [3 x i64], [3 x i64] }">
// CHECK: llvm.insertvalue {{.}}[3, 0] : !llvm<"{ float, float*, i64, [3 x i64], [3 x i64] }">		// CHECK: llvm.insertvalue {{.}}[3, 0] : !llvm<"{ float, float*, i64, [3 x i64], [3 x i64] }">
// Call external copy after promoting input and output structs to pointers		// Call external copy.
// CHECK-COUNT-2: llvm.mlir.constant(1 : index){{.[[:space:]].}}llvm.alloca{{.[[:space:]].}}llvm.store		// CHECK: llvm.call @linalg_copy_viewsxsxsxf32_viewsxsxsxf32
// CHECK: llvm.call @linalg_copy_viewsxsxsxf32_viewsxsxsxf32(%{{.}}, %{{.}}) : (!llvm<"{ float, float, i64, [3 x i64], [3 x i64] }">, !llvm<"{ float, float, i64, [3 x i64], [3 x i64] }">) -> ()

#matmul_accesses = [		#matmul_accesses = [
affine_map<(m, n, k) -> (m, k)>,		affine_map<(m, n, k) -> (m, k)>,
affine_map<(m, n, k) -> (k, n)>,		affine_map<(m, n, k) -> (k, n)>,
affine_map<(m, n, k) -> (m, n)>		affine_map<(m, n, k) -> (m, n)>
]		]
#matmul_trait = {		#matmul_trait = {
args_in = 2,		args_in = 2,
Show All 16 Lines	linalg.generic #matmul_trait %A, %B, %C {
^bb0(%a: !vector_type_A, %b: !vector_type_B, %c: !vector_type_C):		^bb0(%a: !vector_type_A, %b: !vector_type_B, %c: !vector_type_C):
%d = vector.outerproduct %a, %b, %c: !vector_type_A, !vector_type_B		%d = vector.outerproduct %a, %b, %c: !vector_type_A, !vector_type_B
linalg.yield %d: !vector_type_C		linalg.yield %d: !vector_type_C
} : !matrix_type_A, !matrix_type_B, !matrix_type_C		} : !matrix_type_A, !matrix_type_B, !matrix_type_C

return		return
}		}
// CHECK-LABEL: func @matmul_vec_impl(		// CHECK-LABEL: func @matmul_vec_impl(
// CHECK: llvm.call @external_outerproduct_matmul(%{{.}}) : (!llvm<"{ <4 x float>, <4 x float>, i64, [2 x i64], [2 x i64] }">, !llvm<"{ <4 x float>, <4 x float>, i64, [2 x i64], [2 x i64] }">, !llvm<"{ [4 x <4 x float>], [4 x <4 x float>], i64, [2 x i64], [2 x i64] }">) -> ()		// CHECK: llvm.call @external_outerproduct_matmul(%{{.*}}) :
		// CHECK-SAME: !llvm<"<4 x float>">, !llvm<"<4 x float>">, !llvm.i64, !llvm.i64, !llvm.i64, !llvm.i64, !llvm.i64
		// CHECK-SAME: !llvm<"<4 x float>">, !llvm<"<4 x float>">, !llvm.i64, !llvm.i64, !llvm.i64, !llvm.i64, !llvm.i64
		// CHECK-SAME: !llvm<"[4 x <4 x float>]">, !llvm<"[4 x <4 x float>]">, !llvm.i64, !llvm.i64, !llvm.i64, !llvm.i64, !llvm.i64

// LLVM-LOOPS-LABEL: func @matmul_vec_impl(		// LLVM-LOOPS-LABEL: func @matmul_vec_impl(
// LLVM-LOOPS: llvm.shufflevector {{.*}} [0 : i32, 0 : i32, 0 : i32, 0 : i32] : !llvm<"<4 x float>">, !llvm<"<4 x float>">		// LLVM-LOOPS: llvm.shufflevector {{.*}} [0 : i32, 0 : i32, 0 : i32, 0 : i32] : !llvm<"<4 x float>">, !llvm<"<4 x float>">
// LLVM-LOOPS: llvm.shufflevector {{.*}} [1 : i32, 1 : i32, 1 : i32, 1 : i32] : !llvm<"<4 x float>">, !llvm<"<4 x float>">		// LLVM-LOOPS: llvm.shufflevector {{.*}} [1 : i32, 1 : i32, 1 : i32, 1 : i32] : !llvm<"<4 x float>">, !llvm<"<4 x float>">
// LLVM-LOOPS: llvm.shufflevector {{.*}} [2 : i32, 2 : i32, 2 : i32, 2 : i32] : !llvm<"<4 x float>">, !llvm<"<4 x float>">		// LLVM-LOOPS: llvm.shufflevector {{.*}} [2 : i32, 2 : i32, 2 : i32, 2 : i32] : !llvm<"<4 x float>">, !llvm<"<4 x float>">
// LLVM-LOOPS: llvm.shufflevector {{.*}} [3 : i32, 3 : i32, 3 : i32, 3 : i32] : !llvm<"<4 x float>">, !llvm<"<4 x float>">		// LLVM-LOOPS: llvm.shufflevector {{.*}} [3 : i32, 3 : i32, 3 : i32, 3 : i32] : !llvm<"<4 x float>">, !llvm<"<4 x float>">
// LLVM-LOOPS-NEXT: llvm.extractvalue {{.*}}[3] : !llvm<"[4 x <4 x float>]">		// LLVM-LOOPS-NEXT: llvm.extractvalue {{.*}}[3] : !llvm<"[4 x <4 x float>]">
// LLVM-LOOPS-NEXT: "llvm.intr.fma"({{.*}}) : (!llvm<"<4 x float>">, !llvm<"<4 x float>">, !llvm<"<4 x float>">) -> !llvm<"<4 x float>">		// LLVM-LOOPS-NEXT: "llvm.intr.fma"({{.*}}) : (!llvm<"<4 x float>">, !llvm<"<4 x float>">, !llvm<"<4 x float>">) -> !llvm<"<4 x float>">
Show All 15 Lines	^bb0(%i: index, %j: index, %k: index,
%a: !vector_type_A, %b: !vector_type_B, %c: !vector_type_C):		%a: !vector_type_A, %b: !vector_type_B, %c: !vector_type_C):
%d = vector.outerproduct %a, %b, %c: !vector_type_A, !vector_type_B		%d = vector.outerproduct %a, %b, %c: !vector_type_A, !vector_type_B
linalg.yield %d: !vector_type_C		linalg.yield %d: !vector_type_C
} : !matrix_type_A, !matrix_type_B, !matrix_type_C		} : !matrix_type_A, !matrix_type_B, !matrix_type_C
return		return
}		}
// CHECK-LABEL: func @matmul_vec_indexed(		// CHECK-LABEL: func @matmul_vec_indexed(
// CHECK: %[[ZERO:.*]] = llvm.mlir.constant(0 : index) : !llvm.i64		// CHECK: %[[ZERO:.*]] = llvm.mlir.constant(0 : index) : !llvm.i64
// CHECK: llvm.call @external_indexed_outerproduct_matmul(%[[ZERO]], %[[ZERO]], %[[ZERO]], %{{.}}, %{{.}}, %{{.}}) : (!llvm.i64, !llvm.i64, !llvm.i64, !llvm<"{ <4 x float>, <4 x float>, i64, [2 x i64], [2 x i64] }">, !llvm<"{ <4 x float>, <4 x float>, i64, [2 x i64], [2 x i64] }">, !llvm<"{ [4 x <4 x float>], [4 x <4 x float>], i64, [2 x i64], [2 x i64] }">) -> ()		// CHECK: llvm.call @external_indexed_outerproduct_matmul(%[[ZERO]], %[[ZERO]], %[[ZERO]], %{{.*}}) :
		// CHECK-SAME: !llvm<"<4 x float>">, !llvm<"<4 x float>">, !llvm.i64, !llvm.i64, !llvm.i64, !llvm.i64, !llvm.i64
		// CHECK-SAME: !llvm<"<4 x float>">, !llvm<"<4 x float>">, !llvm.i64, !llvm.i64, !llvm.i64, !llvm.i64, !llvm.i64
		// CHECK-SAME: !llvm<"[4 x <4 x float>]">, !llvm<"[4 x <4 x float>]">, !llvm.i64, !llvm.i64, !llvm.i64, !llvm.i64, !llvm.i64

func @reshape_static(%arg0: memref<3x4x5xf32>) {		func @reshape_static(%arg0: memref<3x4x5xf32>) {
// Reshapes that expand and collapse back a contiguous tensor with some 1's.		// Reshapes that expand and collapse back a contiguous tensor with some 1's.
%0 = linalg.reshape %arg0 [affine_map<(i, j, k, l, m) -> (i, j)>,		%0 = linalg.reshape %arg0 [affine_map<(i, j, k, l, m) -> (i, j)>,
affine_map<(i, j, k, l, m) -> (k)>,		affine_map<(i, j, k, l, m) -> (k)>,
affine_map<(i, j, k, l, m) -> (l, m)>] :		affine_map<(i, j, k, l, m) -> (l, m)>] :
memref<3x4x5xf32> into memref<1x3x4x1x5xf32>		memref<3x4x5xf32> into memref<1x3x4x1x5xf32>
%r0 = linalg.reshape %0 [affine_map<(i, j, k, l, m) -> (i, j)>,		%r0 = linalg.reshape %0 [affine_map<(i, j, k, l, m) -> (i, j)>,
▲ Show 20 Lines • Show All 52 Lines • Show Last 20 Lines

mlir/test/mlir-cpu-runner/cblas_interface.cpp

	Show All 9 Lines
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	#include "include/cblas_interface.h"			#include "include/cblas_interface.h"
	#include "include/cblas.h"			#include "include/cblas.h"
	#include <assert.h>			#include <assert.h>
	#include <iostream>			#include <iostream>

	extern "C" void linalg_fill_viewf32_f32(StridedMemRefType<float, 0> *X,			extern "C" void
	float f) {			_mlir_ciface_linalg_fill_viewf32_f32(StridedMemRefType<float, 0> *X, float f) {
	X->data[X->offset] = f;			X->data[X->offset] = f;
	}			}

	extern "C" void linalg_fill_viewsxf32_f32(StridedMemRefType<float, 1> *X,			extern "C" void
				_mlir_ciface_linalg_fill_viewsxf32_f32(StridedMemRefType<float, 1> *X,
	float f) {			float f) {
	for (unsigned i = 0; i < X->sizes[0]; ++i)			for (unsigned i = 0; i < X->sizes[0]; ++i)
	(X->data + X->offset + i X->strides[0]) = f;			(X->data + X->offset + i X->strides[0]) = f;
	}			}

	extern "C" void linalg_fill_viewsxsxf32_f32(StridedMemRefType<float, 2> *X,			extern "C" void
				_mlir_ciface_linalg_fill_viewsxsxf32_f32(StridedMemRefType<float, 2> *X,
	float f) {			float f) {
	for (unsigned i = 0; i < X->sizes[0]; ++i)			for (unsigned i = 0; i < X->sizes[0]; ++i)
	for (unsigned j = 0; j < X->sizes[1]; ++j)			for (unsigned j = 0; j < X->sizes[1]; ++j)
	(X->data + X->offset + i X->strides[0] + j * X->strides[1]) = f;			(X->data + X->offset + i X->strides[0] + j * X->strides[1]) = f;
	}			}

	extern "C" void linalg_copy_viewf32_viewf32(StridedMemRefType<float, 0> *I,			extern "C" void
				_mlir_ciface_linalg_copy_viewf32_viewf32(StridedMemRefType<float, 0> *I,
	StridedMemRefType<float, 0> *O) {			StridedMemRefType<float, 0> *O) {
	O->data[O->offset] = I->data[I->offset];			O->data[O->offset] = I->data[I->offset];
	}			}

	extern "C" void			extern "C" void
	linalg_copy_viewsxf32_viewsxf32(StridedMemRefType<float, 1> *I,			_mlir_ciface_linalg_copy_viewsxf32_viewsxf32(StridedMemRefType<float, 1> *I,
	StridedMemRefType<float, 1> *O) {			StridedMemRefType<float, 1> *O) {
	if (I->sizes[0] != O->sizes[0]) {			if (I->sizes[0] != O->sizes[0]) {
	std::cerr << "Incompatible strided memrefs\n";			std::cerr << "Incompatible strided memrefs\n";
	printMemRefMetaData(std::cerr, *I);			printMemRefMetaData(std::cerr, *I);
	printMemRefMetaData(std::cerr, *O);			printMemRefMetaData(std::cerr, *O);
	return;			return;
	}			}
	for (unsigned i = 0; i < I->sizes[0]; ++i)			for (unsigned i = 0; i < I->sizes[0]; ++i)
	O->data[O->offset + i * O->strides[0]] =			O->data[O->offset + i * O->strides[0]] =
	I->data[I->offset + i * I->strides[0]];			I->data[I->offset + i * I->strides[0]];
	}			}

	extern "C" void			extern "C" void _mlir_ciface_linalg_copy_viewsxsxf32_viewsxsxf32(
	linalg_copy_viewsxsxf32_viewsxsxf32(StridedMemRefType<float, 2> *I,			StridedMemRefType<float, 2> I, StridedMemRefType<float, 2> O) {
	StridedMemRefType<float, 2> *O) {
	if (I->sizes[0] != O->sizes[0] \|\| I->sizes[1] != O->sizes[1]) {			if (I->sizes[0] != O->sizes[0] \|\| I->sizes[1] != O->sizes[1]) {
	std::cerr << "Incompatible strided memrefs\n";			std::cerr << "Incompatible strided memrefs\n";
	printMemRefMetaData(std::cerr, *I);			printMemRefMetaData(std::cerr, *I);
	printMemRefMetaData(std::cerr, *O);			printMemRefMetaData(std::cerr, *O);
	return;			return;
	}			}
	auto so0 = O->strides[0], so1 = O->strides[1];			auto so0 = O->strides[0], so1 = O->strides[1];
	auto si0 = I->strides[0], si1 = I->strides[1];			auto si0 = I->strides[0], si1 = I->strides[1];
	for (unsigned i = 0; i < I->sizes[0]; ++i)			for (unsigned i = 0; i < I->sizes[0]; ++i)
	for (unsigned j = 0; j < I->sizes[1]; ++j)			for (unsigned j = 0; j < I->sizes[1]; ++j)
	O->data[O->offset + i * so0 + j * so1] =			O->data[O->offset + i * so0 + j * so1] =
	I->data[I->offset + i * si0 + j * si1];			I->data[I->offset + i * si0 + j * si1];
	}			}

	extern "C" void			extern "C" void _mlir_ciface_linalg_dot_viewsxf32_viewsxf32_viewf32(
	linalg_dot_viewsxf32_viewsxf32_viewf32(StridedMemRefType<float, 1> *X,			StridedMemRefType<float, 1> X, StridedMemRefType<float, 1> Y,
	StridedMemRefType<float, 1> *Y,
	StridedMemRefType<float, 0> *Z) {			StridedMemRefType<float, 0> *Z) {
	if (X->strides[0] != 1 \|\| Y->strides[0] != 1 \|\| X->sizes[0] != Y->sizes[0]) {			if (X->strides[0] != 1 \|\| Y->strides[0] != 1 \|\| X->sizes[0] != Y->sizes[0]) {
	std::cerr << "Incompatible strided memrefs\n";			std::cerr << "Incompatible strided memrefs\n";
	printMemRefMetaData(std::cerr, *X);			printMemRefMetaData(std::cerr, *X);
	printMemRefMetaData(std::cerr, *Y);			printMemRefMetaData(std::cerr, *Y);
	printMemRefMetaData(std::cerr, *Z);			printMemRefMetaData(std::cerr, *Z);
	return;			return;
	}			}
	Z->data[Z->offset] +=			Z->data[Z->offset] +=
	cblas_sdot(X->sizes[0], X->data + X->offset, X->strides[0],			cblas_sdot(X->sizes[0], X->data + X->offset, X->strides[0],
	Y->data + Y->offset, Y->strides[0]);			Y->data + Y->offset, Y->strides[0]);
	}			}

	extern "C" void linalg_matmul_viewsxsxf32_viewsxsxf32_viewsxsxf32(			extern "C" void _mlir_ciface_linalg_matmul_viewsxsxf32_viewsxsxf32_viewsxsxf32(
	StridedMemRefType<float, 2> A, StridedMemRefType<float, 2> B,			StridedMemRefType<float, 2> A, StridedMemRefType<float, 2> B,
	StridedMemRefType<float, 2> *C) {			StridedMemRefType<float, 2> *C) {
	if (A->strides[1] != B->strides[1] \|\| A->strides[1] != C->strides[1] \|\|			if (A->strides[1] != B->strides[1] \|\| A->strides[1] != C->strides[1] \|\|
	A->strides[1] != 1 \|\| A->sizes[0] < A->strides[1] \|\|			A->strides[1] != 1 \|\| A->sizes[0] < A->strides[1] \|\|
	B->sizes[0] < B->strides[1] \|\| C->sizes[0] < C->strides[1] \|\|			B->sizes[0] < B->strides[1] \|\| C->sizes[0] < C->strides[1] \|\|
	C->sizes[0] != A->sizes[0] \|\| C->sizes[1] != B->sizes[1] \|\|			C->sizes[0] != A->sizes[0] \|\| C->sizes[1] != B->sizes[1] \|\|
	A->sizes[1] != B->sizes[0]) {			A->sizes[1] != B->sizes[0]) {
	printMemRefMetaData(std::cerr, *A);			printMemRefMetaData(std::cerr, *A);
	Show All 10 Lines

mlir/test/mlir-cpu-runner/include/cblas_interface.h

	Show All 19 Lines
	#define MLIR_CBLAS_INTERFACE_EXPORT __declspec(dllimport)			#define MLIR_CBLAS_INTERFACE_EXPORT __declspec(dllimport)
	#endif // cblas_interface_EXPORTS			#endif // cblas_interface_EXPORTS
	#endif // MLIR_CBLAS_INTERFACE_EXPORT			#endif // MLIR_CBLAS_INTERFACE_EXPORT
	#else			#else
	#define MLIR_CBLAS_INTERFACE_EXPORT			#define MLIR_CBLAS_INTERFACE_EXPORT
	#endif // _WIN32			#endif // _WIN32

	extern "C" MLIR_CBLAS_INTERFACE_EXPORT void			extern "C" MLIR_CBLAS_INTERFACE_EXPORT void
	linalg_fill_viewf32_f32(StridedMemRefType<float, 0> *X, float f);			_mlir_ciface_linalg_fill_viewf32_f32(StridedMemRefType<float, 0> *X, float f);

	extern "C" MLIR_CBLAS_INTERFACE_EXPORT void			extern "C" MLIR_CBLAS_INTERFACE_EXPORT void
	linalg_fill_viewsxf32_f32(StridedMemRefType<float, 1> *X, float f);			_mlir_ciface_linalg_fill_viewsxf32_f32(StridedMemRefType<float, 1> *X, float f);

	extern "C" MLIR_CBLAS_INTERFACE_EXPORT void			extern "C" MLIR_CBLAS_INTERFACE_EXPORT void
	linalg_fill_viewsxsxf32_f32(StridedMemRefType<float, 2> *X, float f);			_mlir_ciface_linalg_fill_viewsxsxf32_f32(StridedMemRefType<float, 2> *X,
				float f);

	extern "C" MLIR_CBLAS_INTERFACE_EXPORT void			extern "C" MLIR_CBLAS_INTERFACE_EXPORT void
	linalg_copy_viewf32_viewf32(StridedMemRefType<float, 0> *I,			_mlir_ciface_linalg_copy_viewf32_viewf32(StridedMemRefType<float, 0> *I,
	StridedMemRefType<float, 0> *O);			StridedMemRefType<float, 0> *O);

	extern "C" MLIR_CBLAS_INTERFACE_EXPORT void			extern "C" MLIR_CBLAS_INTERFACE_EXPORT void
	linalg_copy_viewsxf32_viewsxf32(StridedMemRefType<float, 1> *I,			_mlir_ciface_linalg_copy_viewsxf32_viewsxf32(StridedMemRefType<float, 1> *I,
	StridedMemRefType<float, 1> *O);			StridedMemRefType<float, 1> *O);

	extern "C" MLIR_CBLAS_INTERFACE_EXPORT void			extern "C" MLIR_CBLAS_INTERFACE_EXPORT void
	linalg_copy_viewsxsxf32_viewsxsxf32(StridedMemRefType<float, 2> *I,			_mlir_ciface_linalg_copy_viewsxsxf32_viewsxsxf32(
	StridedMemRefType<float, 2> *O);			StridedMemRefType<float, 2> I, StridedMemRefType<float, 2> O);

	extern "C" MLIR_CBLAS_INTERFACE_EXPORT void			extern "C" MLIR_CBLAS_INTERFACE_EXPORT void
	linalg_dot_viewsxf32_viewsxf32_viewf32(StridedMemRefType<float, 1> *X,			_mlir_ciface_linalg_dot_viewsxf32_viewsxf32_viewf32(
	StridedMemRefType<float, 1> *Y,			StridedMemRefType<float, 1> X, StridedMemRefType<float, 1> Y,
	StridedMemRefType<float, 0> *Z);			StridedMemRefType<float, 0> *Z);

	extern "C" MLIR_CBLAS_INTERFACE_EXPORT void			extern "C" MLIR_CBLAS_INTERFACE_EXPORT void
	linalg_matmul_viewsxsxf32_viewsxsxf32_viewsxsxf32(			_mlir_ciface_linalg_matmul_viewsxsxf32_viewsxsxf32_viewsxsxf32(
	StridedMemRefType<float, 2> A, StridedMemRefType<float, 2> B,			StridedMemRefType<float, 2> A, StridedMemRefType<float, 2> B,
	StridedMemRefType<float, 2> *C);			StridedMemRefType<float, 2> *C);

	#endif // MLIR_CPU_RUNNER_CBLAS_INTERFACE_H_			#endif // MLIR_CPU_RUNNER_CBLAS_INTERFACE_H_

mlir/test/mlir-cpu-runner/include/mlir_runner_utils.h

Show First 20 Lines • Show All 255 Lines • ▼ Show 20 Lines	template <typename T> void printMemRef(StridedMemRefType<T, 0> &M) {
std::cout << "]" << std::endl;		std::cout << "]" << std::endl;
}		}
} // namespace impl		} // namespace impl

////////////////////////////////////////////////////////////////////////////////		////////////////////////////////////////////////////////////////////////////////
// Currently exposed C API.		// Currently exposed C API.
////////////////////////////////////////////////////////////////////////////////		////////////////////////////////////////////////////////////////////////////////
extern "C" MLIR_RUNNER_UTILS_EXPORT void		extern "C" MLIR_RUNNER_UTILS_EXPORT void
print_memref_i8(UnrankedMemRefType<int8_t> *M);		_mlir_ciface_print_memref_i8(UnrankedMemRefType<int8_t> *M);
extern "C" MLIR_RUNNER_UTILS_EXPORT void		extern "C" MLIR_RUNNER_UTILS_EXPORT void
print_memref_f32(UnrankedMemRefType<float> *M);		_mlir_ciface_print_memref_f32(UnrankedMemRefType<float> *M);

		extern "C" MLIR_RUNNER_UTILS_EXPORT void print_memref_f32(int64_t rank,
		void *ptr);

extern "C" MLIR_RUNNER_UTILS_EXPORT void		extern "C" MLIR_RUNNER_UTILS_EXPORT void
print_memref_0d_f32(StridedMemRefType<float, 0> *M);		_mlir_ciface_print_memref_0d_f32(StridedMemRefType<float, 0> *M);
extern "C" MLIR_RUNNER_UTILS_EXPORT void		extern "C" MLIR_RUNNER_UTILS_EXPORT void
print_memref_1d_f32(StridedMemRefType<float, 1> *M);		_mlir_ciface_print_memref_1d_f32(StridedMemRefType<float, 1> *M);
extern "C" MLIR_RUNNER_UTILS_EXPORT void		extern "C" MLIR_RUNNER_UTILS_EXPORT void
print_memref_2d_f32(StridedMemRefType<float, 2> *M);		_mlir_ciface_print_memref_2d_f32(StridedMemRefType<float, 2> *M);
extern "C" MLIR_RUNNER_UTILS_EXPORT void		extern "C" MLIR_RUNNER_UTILS_EXPORT void
print_memref_3d_f32(StridedMemRefType<float, 3> *M);		_mlir_ciface_print_memref_3d_f32(StridedMemRefType<float, 3> *M);
extern "C" MLIR_RUNNER_UTILS_EXPORT void		extern "C" MLIR_RUNNER_UTILS_EXPORT void
print_memref_4d_f32(StridedMemRefType<float, 4> *M);		_mlir_ciface_print_memref_4d_f32(StridedMemRefType<float, 4> *M);

extern "C" MLIR_RUNNER_UTILS_EXPORT void		extern "C" MLIR_RUNNER_UTILS_EXPORT void
print_memref_vector_4x4xf32(StridedMemRefType<Vector2D<4, 4, float>, 2> *M);		_mlir_ciface_print_memref_vector_4x4xf32(
		StridedMemRefType<Vector2D<4, 4, float>, 2> *M);

// Small runtime support "lib" for vector.print lowering.		// Small runtime support "lib" for vector.print lowering.
extern "C" MLIR_RUNNER_UTILS_EXPORT void print_f32(float f);		extern "C" MLIR_RUNNER_UTILS_EXPORT void print_f32(float f);
extern "C" MLIR_RUNNER_UTILS_EXPORT void print_f64(double d);		extern "C" MLIR_RUNNER_UTILS_EXPORT void print_f64(double d);
extern "C" MLIR_RUNNER_UTILS_EXPORT void print_open();		extern "C" MLIR_RUNNER_UTILS_EXPORT void print_open();
extern "C" MLIR_RUNNER_UTILS_EXPORT void print_close();		extern "C" MLIR_RUNNER_UTILS_EXPORT void print_close();
extern "C" MLIR_RUNNER_UTILS_EXPORT void print_comma();		extern "C" MLIR_RUNNER_UTILS_EXPORT void print_comma();
extern "C" MLIR_RUNNER_UTILS_EXPORT void print_newline();		extern "C" MLIR_RUNNER_UTILS_EXPORT void print_newline();

#endif // MLIR_CPU_RUNNER_MLIRUTILS_H_		#endif // MLIR_CPU_RUNNER_MLIRUTILS_H_

mlir/test/mlir-cpu-runner/mlir_runner_utils.cpp

	Show All 10 Lines
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	#include "include/mlir_runner_utils.h"			#include "include/mlir_runner_utils.h"

	#include <cinttypes>			#include <cinttypes>
	#include <cstdio>			#include <cstdio>

	extern "C" void			extern "C" void _mlir_ciface_print_memref_vector_4x4xf32(
	print_memref_vector_4x4xf32(StridedMemRefType<Vector2D<4, 4, float>, 2> *M) {			StridedMemRefType<Vector2D<4, 4, float>, 2> *M) {
	impl::printMemRef(*M);			impl::printMemRef(*M);
	}			}

	#define MEMREF_CASE(TYPE, RANK) \			#define MEMREF_CASE(TYPE, RANK) \
	case RANK: \			case RANK: \
	impl::printMemRef((static_cast<StridedMemRefType<TYPE, RANK> >(ptr))); \			impl::printMemRef((static_cast<StridedMemRefType<TYPE, RANK> >(ptr))); \
	break			break

	extern "C" void print_memref_i8(UnrankedMemRefType<int8_t> *M) {			extern "C" void _mlir_ciface_print_memref_i8(UnrankedMemRefType<int8_t> *M) {
	printUnrankedMemRefMetaData(std::cout, *M);			printUnrankedMemRefMetaData(std::cout, *M);
	int rank = M->rank;			int rank = M->rank;
	void *ptr = M->descriptor;			void *ptr = M->descriptor;

	switch (rank) {			switch (rank) {
	MEMREF_CASE(int8_t, 0);			MEMREF_CASE(int8_t, 0);
	MEMREF_CASE(int8_t, 1);			MEMREF_CASE(int8_t, 1);
	MEMREF_CASE(int8_t, 2);			MEMREF_CASE(int8_t, 2);
	MEMREF_CASE(int8_t, 3);			MEMREF_CASE(int8_t, 3);
	MEMREF_CASE(int8_t, 4);			MEMREF_CASE(int8_t, 4);
	default:			default:
	assert(0 && "Unsupported rank to print");			assert(0 && "Unsupported rank to print");
	}			}
	}			}

	extern "C" void print_memref_f32(UnrankedMemRefType<float> *M) {			extern "C" void _mlir_ciface_print_memref_f32(UnrankedMemRefType<float> *M) {
	printUnrankedMemRefMetaData(std::cout, *M);			printUnrankedMemRefMetaData(std::cout, *M);
	int rank = M->rank;			int rank = M->rank;
	void *ptr = M->descriptor;			void *ptr = M->descriptor;

	switch (rank) {			switch (rank) {
	MEMREF_CASE(float, 0);			MEMREF_CASE(float, 0);
	MEMREF_CASE(float, 1);			MEMREF_CASE(float, 1);
	MEMREF_CASE(float, 2);			MEMREF_CASE(float, 2);
	MEMREF_CASE(float, 3);			MEMREF_CASE(float, 3);
	MEMREF_CASE(float, 4);			MEMREF_CASE(float, 4);
	default:			default:
	assert(0 && "Unsupported rank to print");			assert(0 && "Unsupported rank to print");
	}			}
	}			}

	extern "C" void print_memref_0d_f32(StridedMemRefType<float, 0> *M) {			extern "C" void print_memref_f32(int64_t rank, void *ptr) {
				UnrankedMemRefType<float> descriptor;
				descriptor.rank = rank;
				descriptor.descriptor = ptr;
				_mlir_ciface_print_memref_f32(&descriptor);
				}

				extern "C" void
				_mlir_ciface_print_memref_0d_f32(StridedMemRefType<float, 0> *M) {
	impl::printMemRef(*M);			impl::printMemRef(*M);
	}			}
	extern "C" void print_memref_1d_f32(StridedMemRefType<float, 1> *M) {			extern "C" void
				_mlir_ciface_print_memref_1d_f32(StridedMemRefType<float, 1> *M) {
	impl::printMemRef(*M);			impl::printMemRef(*M);
	}			}
	extern "C" void print_memref_2d_f32(StridedMemRefType<float, 2> *M) {			extern "C" void
				_mlir_ciface_print_memref_2d_f32(StridedMemRefType<float, 2> *M) {
	impl::printMemRef(*M);			impl::printMemRef(*M);
	}			}
	extern "C" void print_memref_3d_f32(StridedMemRefType<float, 3> *M) {			extern "C" void
				_mlir_ciface_print_memref_3d_f32(StridedMemRefType<float, 3> *M) {
	impl::printMemRef(*M);			impl::printMemRef(*M);
	}			}
	extern "C" void print_memref_4d_f32(StridedMemRefType<float, 4> *M) {			extern "C" void
				_mlir_ciface_print_memref_4d_f32(StridedMemRefType<float, 4> *M) {
	impl::printMemRef(*M);			impl::printMemRef(*M);
	}			}

	// Small runtime support "lib" for vector.print lowering.			// Small runtime support "lib" for vector.print lowering.
	// By providing elementary printing methods only, this			// By providing elementary printing methods only, this
	// library can remain fully unaware of low-level implementation			// library can remain fully unaware of low-level implementation
	// details of our vectors. Also useful for direct LLVM IR output.			// details of our vectors. Also useful for direct LLVM IR output.
	extern "C" void print_i32(int32_t i) { fprintf(stdout, "%" PRId32, i); }			extern "C" void print_i32(int32_t i) { fprintf(stdout, "%" PRId32, i); }
	extern "C" void print_i64(int64_t l) { fprintf(stdout, "%" PRId64, l); }			extern "C" void print_i64(int64_t l) { fprintf(stdout, "%" PRId64, l); }
	extern "C" void print_f32(float f) { fprintf(stdout, "%g", f); }			extern "C" void print_f32(float f) { fprintf(stdout, "%g", f); }
	extern "C" void print_f64(double d) { fprintf(stdout, "%lg", d); }			extern "C" void print_f64(double d) { fprintf(stdout, "%lg", d); }
	extern "C" void print_open() { fputs("( ", stdout); }			extern "C" void print_open() { fputs("( ", stdout); }
	extern "C" void print_close() { fputs(" )", stdout); }			extern "C" void print_close() { fputs(" )", stdout); }
	extern "C" void print_comma() { fputs(", ", stdout); }			extern "C" void print_comma() { fputs(", ", stdout); }
	extern "C" void print_newline() { fputc('\n', stdout); }			extern "C" void print_newline() { fputc('\n', stdout); }

mlir/test/mlir-cuda-runner/gpu-to-cubin.mlir

	Show All 11 Lines
	}			}

	// CHECK: [1, 1, 1, 1, 1]			// CHECK: [1, 1, 1, 1, 1]
	func @main() {			func @main() {
	%arg0 = alloc() : memref<5xf32>			%arg0 = alloc() : memref<5xf32>
	%21 = constant 5 : i32			%21 = constant 5 : i32
	%22 = memref_cast %arg0 : memref<5xf32> to memref<?xf32>			%22 = memref_cast %arg0 : memref<5xf32> to memref<?xf32>
	call @mcuMemHostRegisterMemRef1dFloat(%22) : (memref<?xf32>) -> ()			call @mcuMemHostRegisterMemRef1dFloat(%22) : (memref<?xf32>) -> ()
	call @print_memref_1d_f32(%22) : (memref<?xf32>) -> ()			%23 = memref_cast %22 : memref<?xf32> to memref<*xf32>
				call @print_memref_f32(%23) : (memref<*xf32>) -> ()
	%24 = constant 1.0 : f32			%24 = constant 1.0 : f32
	call @other_func(%24, %22) : (f32, memref<?xf32>) -> ()			call @other_func(%24, %22) : (f32, memref<?xf32>) -> ()
	call @print_memref_1d_f32(%22) : (memref<?xf32>) -> ()			call @print_memref_f32(%23) : (memref<*xf32>) -> ()
	return			return
	}			}

	func @mcuMemHostRegisterMemRef1dFloat(%ptr : memref<?xf32>)			func @mcuMemHostRegisterMemRef1dFloat(%ptr : memref<?xf32>)
	func @print_memref_1d_f32(memref<?xf32>)			func @print_memref_f32(%ptr : memref<*xf32>)

mlir/tools/mlir-cuda-runner/cuda-runtime-wrappers.cpp

	Show First 20 Lines • Show All 90 Lines • ▼ Show 20 Lines
	// value. Helpful until we have transfer functions implemented.			// value. Helpful until we have transfer functions implemented.
	template <typename T, int N>			template <typename T, int N>
	void mcuMemHostRegisterMemRef(const MemRefType<T, N> *arg, T value) {			void mcuMemHostRegisterMemRef(const MemRefType<T, N> *arg, T value) {
	auto count = std::accumulate(arg->sizes, arg->sizes + N, 1,			auto count = std::accumulate(arg->sizes, arg->sizes + N, 1,
	std::multiplies<int64_t>());			std::multiplies<int64_t>());
	std::fill_n(arg->data, count, value);			std::fill_n(arg->data, count, value);
	mcuMemHostRegister(arg->data, count * sizeof(T));			mcuMemHostRegister(arg->data, count * sizeof(T));
	}			}
	extern "C" void
	mcuMemHostRegisterMemRef1dFloat(const MemRefType<float, 1> *arg) {			extern "C" void mcuMemHostRegisterMemRef1dFloat(float *allocated,
	mcuMemHostRegisterMemRef(arg, 1.23f);			float *aligned, int64_t offset,
	}			int64_t size, int64_t stride) {
	extern "C" void			MemRefType<float, 1> descriptor;
	mcuMemHostRegisterMemRef3dFloat(const MemRefType<float, 3> *arg) {			descriptor.basePtr = allocated;
	mcuMemHostRegisterMemRef(arg, 1.23f);			descriptor.data = aligned;
				descriptor.offset = offset;
				descriptor.sizes[0] = size;
				descriptor.strides[0] = stride;
				mcuMemHostRegisterMemRef(&descriptor, 1.23f);
				}

				extern "C" void mcuMemHostRegisterMemRef3dFloat(float *allocated,
				float *aligned, int64_t offset,
				int64_t size0, int64_t size1,
				int64_t size2, int64_t stride0,
				int64_t stride1,
				int64_t stride2) {
				MemRefType<float, 3> descriptor;
				descriptor.basePtr = allocated;
				descriptor.data = aligned;
				descriptor.offset = offset;
				descriptor.sizes[0] = size0;
				descriptor.strides[0] = stride0;
				descriptor.sizes[1] = size1;
				descriptor.strides[1] = stride1;
				descriptor.sizes[2] = size2;
				descriptor.strides[2] = stride2;
				mcuMemHostRegisterMemRef(&descriptor, 1.23f);
	}			}

This is an archive of the discontinued LLVM Phabricator instance.

[mlir] use unpacked memref descriptors at function boundariesClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 243518

mlir/docs/ConversionToLLVMDialect.md

mlir/include/mlir/Conversion/StandardToLLVM/ConvertStandardToLLVM.h

mlir/include/mlir/Conversion/StandardToLLVM/ConvertStandardToLLVMPass.h

mlir/include/mlir/IR/FunctionSupport.h

mlir/lib/Conversion/GPUToCUDA/ConvertLaunchFuncToCudaCalls.cpp

mlir/lib/Conversion/GPUToNVVM/LowerGpuOpsToNVVMOps.cpp

mlir/lib/Conversion/LinalgToLLVM/LinalgToLLVM.cpp

mlir/lib/Conversion/StandardToLLVM/ConvertStandardToLLVM.cpp

mlir/lib/Dialect/GPU/IR/GPUDialect.cpp

mlir/lib/Transforms/DialectConversion.cpp

mlir/test/Conversion/GPUToCUDA/lower-launch-func-to-cuda.mlir

mlir/test/Conversion/StandardToLLVM/convert-argattrs.mlir

mlir/test/Conversion/StandardToLLVM/convert-dynamic-memref-ops.mlir

mlir/test/Conversion/StandardToLLVM/convert-funcs.mlir

mlir/test/Conversion/StandardToLLVM/convert-static-memref-ops.mlir

mlir/test/Conversion/StandardToLLVM/convert-to-llvmir.mlir

mlir/test/Conversion/StandardToLLVM/standard-to-llvm.mlir

mlir/test/Dialect/GPU/invalid.mlir

mlir/test/Dialect/Linalg/llvm.mlir

mlir/test/mlir-cpu-runner/cblas_interface.cpp

mlir/test/mlir-cpu-runner/include/cblas_interface.h

mlir/test/mlir-cpu-runner/include/mlir_runner_utils.h

mlir/test/mlir-cpu-runner/mlir_runner_utils.cpp

mlir/test/mlir-cuda-runner/gpu-to-cubin.mlir

mlir/tools/mlir-cuda-runner/cuda-runtime-wrappers.cpp

[mlir] use unpacked memref descriptors at function boundaries
ClosedPublic