This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
mlir/
-
include/mlir/
-
mlir/
-
Conversion/StandardToLLVM/
-
StandardToLLVM/
8/13
ConvertStandardToLLVM.h
1/6
ConvertStandardToLLVMPass.h
-
Transforms/
4/8
DialectConversion.h
-
lib/
-
Conversion/
-
GPUToNVVM/
-
LowerGpuOpsToNVVMOps.cpp
-
StandardToLLVM/
17/22
ConvertStandardToLLVM.cpp
-
Transforms/
-
DialectConversion.cpp
-
test/Conversion/StandardToLLVM/
-
Conversion/
-
StandardToLLVM/
-
convert-dynamic-memref-ops.mlir
-
convert-memref-ops.mlir
-
convert-static-memref-ops.mlir

Differential D72802

[mlir] Introduce bare ptr calling convention for MemRefs in LLVM dialect
ClosedPublic

Authored by dcaballe on Jan 15 2020, 12:52 PM.

Download Raw Diff

Details

Reviewers

ftynse
bondhugula
nicolasvasilache
rriddle
mehdi_amini

Commits

rGe5aaf30cf1ab: [mlir] Introduce bare ptr calling convention for MemRefs in LLVM dialect

Summary

This patch introduces an alternative calling convention for
MemRef function arguments in LLVM dialect. It converts MemRef
function arguments to LLVM bare pointers to the MemRef element
type instead of creating a MemRef descriptor. Bare pointers are
then promoted to a MemRef descriptors at the beginning of the
function. This calling convention is only enabled with a flag.

This is a stepping stone towards having an alternative and simpler
lowering for MemRefs when dynamic shapes are not needed. It can
also be used to temporarily overcome the issue with passing 'noalias'
attribute for MemRef arguments, discussed in [1, 2], since we can
now convert:

func @check_noalias(%static : memref<2xf32> {llvm.noalias = true}) {

return

}

into:

llvm.func @check_noalias(%arg0: !llvm<"float*"> {llvm.noalias = true}) {

%0 = llvm.mlir.undef ...
%1 = llvm.insertvalue %arg0, %0[0] ...
%2 = llvm.insertvalue %arg0, %1[1] ...
...
llvm.return

}

Related discussion:

[1] https://github.com/tensorflow/mlir/issues/309
[2] https://github.com/tensorflow/mlir/pull/337

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

dcaballe created this revision.Jan 15 2020, 12:52 PM

Herald added a project: Restricted Project. · View Herald TranscriptJan 15 2020, 12:52 PM

Herald added subscribers: llvm-commits, liufengdb, aartbik and 12 others. · View Herald Transcript

Unit tests: pass. 61906 tests passed, 0 failed and 782 were skipped.

clang-tidy: unknown.

clang-format: pass.

Build artifacts: diff.json, clang-format.patch, CMakeCache.txt, console-log.txt, test-results.xml

Harbormaster completed remote builds in B44089: Diff 238340.Jan 15 2020, 1:11 PM

rriddle added inline comments.Jan 15 2020, 1:15 PM

mlir/include/mlir/Transforms/DialectConversion.h
324	This function is already broken, I'd rather not expand its scope until its fixed.
mlir/lib/Conversion/StandardToLLVM/ConvertStandardToLLVM.cpp
261	This looks like a lot of code duplication.
650	Same in this pattern.

Thanks, River! Some comments inline.
I think I can refactor some of the code in the patterns but I'd like to know if there is any major concern with this approach before moving forward.

mlir/include/mlir/Transforms/DialectConversion.h
324	It's currently being used. Why is it broken? Maybe I could just replace all the uses (lines 683 and 687) with other utilities but I thought adding a mapping from 'from' to 'to' to pattern-rewriter would be necessary.
mlir/lib/Conversion/StandardToLLVM/ConvertStandardToLLVM.cpp
261	We discussed that concern in our initial thread (https://github.com/tensorflow/mlir/issues/309#issuecomment-568791098). However, I think that keeping a separate pattern, as suggested, despite some code duplication might be worth it to keep each pattern simple. Otherwise, we would end up with a complex pattern with many special cases. I could try to refactor some of this code to protected members in LLVMTypeConverter but I think it's going to look a bit odd since the differences between the two patterns are small and scattered.
650	Same, I could try to refactor some code to a common base class for both patterns.

flaub added a subscriber: flaub.Jan 16 2020, 1:55 AM

"I think I can refactor some of the code in the patterns but I'd like to know if there is any major concern with this approach before moving forward."

No concerns on the approach on my end, except for the current copypasta :)
Thanks for doing this!

mlir/lib/Conversion/StandardToLLVM/ConvertStandardToLLVM.cpp
261	+10 for refactoring: when things evolve we don't want to have to evolve 2 independent impls.

mehdi_amini added inline comments.Jan 16 2020, 8:56 PM

mlir/include/mlir/Conversion/StandardToLLVM/ConvertStandardToLLVMPass.h
76	I'm concerned about the scalability of this approach. You're making the type converter extensible through inheritance and overloading, which is not gonna compose well: it is impossible to another orthogonal customization point. Then extending it with a new subclass requires new header entry points: the multiple populate* functions does not seems very nice to me already, and this is making it just harder to figure out how to use this header. If we are convinced that we don't need to scale or that we won't need more customization point, then I rather not use any virtual function and look into passing a config struct with flag on the lowering behavior we want.

Address refactor comments.

dcaballe marked an inline comment as done.Jan 17 2020, 11:01 AM

dcaballe added inline comments.

mlir/include/mlir/Conversion/StandardToLLVM/ConvertStandardToLLVMPass.h
76	Thanks for the feedback, Mehdi. You're making the type converter extensible through inheritance and overloading, which is not gonna compose well: it is impossible to another orthogonal customization point. I think this looks better after the refactoring since I'm only introducing `convertArgType`, which should be a unitary piece of the type conversion and easy to compose with other customizations. If I understood correctly, the general direction is that developers may want to create custom lowerings to LLVM, even out of tree. If that is the case, we should try to make the LLVM type converter more friendly to that and facilitate code reuse as much as possible. I think that breaking LLVM type conversion into smaller unitary chucks would help and this patch goes towards that direction. If eventually the current LLVMTypeConverter makes composability "impossible to another orthogonal customization points", we could move the current "default" implementation to a sub-class and only keep the code or API that is generic enough, unitary and reusable in LLVMTypeConverter. Would that make sense? Then extending it with a new subclass requires new header entry points: the multiple populate* functions does not seems very nice to me already, and this is making it just harder to figure out how to use this header. I see your point. IIRC, the recently added public populate* functions were aimed at facilitating the reuse of existing patterns in custom lowerings, particularly out of tree. Currently, there is no other way to make these patterns available since they are private to this translation unit. If we don't want to change the latter, we could publicly expose only the populate* functions that provide all the conversion patterns (i.e., `populateStdToLLVMConversionPatterns` and `populateStdToLLVMBarePtrConversionPatterns`). Someone could always add custom patterns with more priority to override the default ones. Not a very clean solution, though. If we are convinced that we don't need to scale or that we won't need more customization point, then I rather not use any virtual function and look into passing a config struct with flag on the lowering behavior we want. It's difficult to know at this point and the feedback that I've heard so far is in the direction of having custom lowerings. I suggested an abstraction to customize memref lowering that didn't go through (https://github.com/tensorflow/mlir/pull/337) and more people are interested in lowering memrefs in different ways (https://groups.google.com/a/tensorflow.org/forum/?utm_medium=email&utm_source=footer#!msg/mlir/9UcFIefP9u0/3ujw73F8BAAJ)

Unit tests: pass. 61906 tests passed, 0 failed and 782 were skipped.

clang-tidy: unknown.

clang-format: pass.

Build artifacts: diff.json, clang-format.patch, CMakeCache.txt, console-log.txt, test-results.xml

Harbormaster completed remote builds in B44294: Diff 238826.Jan 17 2020, 11:30 AM

Move tests for static MemRefs in 'convert-memref-ops.mlir' to 'convert-static-memref-ops.mlir' and enable such tests for bare ptr calling convention.
Move file 'convert-memref-ops.mlir' to 'convert-dynamic-memref-ops.mlir'.

This patch is ready for review.

dcaballe edited the summary of this revision. (Show Details)Jan 21 2020, 3:06 PM

Unit tests: pass. 61906 tests passed, 0 failed and 782 were skipped.

clang-tidy: unknown.

clang-format: pass.

Build artifacts: diff.json, clang-format.patch, CMakeCache.txt, console-log.txt, test-results.xml

Harbormaster completed remote builds in B44533: Diff 239442.Jan 21 2020, 3:37 PM

mehdi_amini added inline comments.Jan 22 2020, 11:52 AM

mlir/include/mlir/Conversion/StandardToLLVM/ConvertStandardToLLVM.h
147	I am missing how this gets used (other than in testing with the cl::opt) right now?
mlir/include/mlir/Conversion/StandardToLLVM/ConvertStandardToLLVMPass.h
76	I think this looks better after the refactoring since I'm only introducing convertArgType, which should be a unitary piece of the type conversion and easy to compose with other customizations. I'm not sure I understand what you mean here: are you saying that LLVMTypeConverter will never ever have any other virtual method that you would need for customization? I think that breaking LLVM type conversion into smaller unitary chucks would help and this patch goes towards that direction. How is this patch going in this direction? Maybe I don't perceive the direction you're referring to with "smaller unitary chunks" here? we could publicly expose only the populate* functions that provide all the conversion patterns (i.e., populateStdToLLVMConversionPatterns and populateStdToLLVMBarePtrConversionPatterns). Again, I see some composability issue. What about the next orthogonal option? Are we gonna multiply the number of `populateStdToLLVM*ConversionPatterns` function by two? There is a combinatorial aspect that I don't understand how you see solved. Unless the "BarePtr" is the only ever customization we want to apply, I don't see how this is scaling forward right now. Have you looked into injection instead of subclassing here?

Thanks for the feedback, Mehdi!

The reason for having this upstream is because it's generic enough and there were more people interested in lowering MemRefs to bare pointers in LLVM. It's also useful to temporarily overcome the aliasing issue for function arguments, currently impacting everybody using the default LLVM lowering.
However, if you think this is too problematic I can keep this local to our nGraph repository. I would still need some minor changes upstream to enable the overloading of some methods so that we don't have to duplicate and maintain a lot of code.

Thanks,
Diego

mlir/include/mlir/Conversion/StandardToLLVM/ConvertStandardToLLVM.h
147	That's a good point. I'm currently using it locally by invoking `createLowerToLLVM` with the provided `populateStdToLLVMBarePtrConversionPatterns` `makeStandardToLLVMBarePtrTypeConverter`. I can add changes and a flag to JitRunner so that we can use it through cpu runner. What do you think?
mlir/include/mlir/Conversion/StandardToLLVM/ConvertStandardToLLVMPass.h
76	I think this looks better after the refactoring since I'm only introducing convertArgType, which should be a unitary piece of the type conversion and easy to compose with other customizations. I'm not sure I understand what you mean here: are you saying that LLVMTypeConverter will never ever have any other virtual method that you would need for customization? Not at all! IIUC, your concern about composability was on the first version of the code where I was overloading `convertFunctionSignature`, a conversion method that does quite a few simple steps altogether (convert argument types, convert return type, create a new function type, etc.). I agree that overloading that method would make composability/reuse of those simple steps difficult. In the second version, I refactored the code that converted argument types into an independent function. By refactoring code into smaller virtual functions doing only a simple thing (that's what I meant by unitary), we should be able to provide a higher level of customization and composability by just overloading the small function that refers to the part of the conversion that we want to customize. Maybe if you could provide an example of an "orthogonal customization" we could discuss based on something specific. I think that breaking LLVM type conversion into smaller unitary chucks would help and this patch goes towards that direction. How is this patch going in this direction? Maybe I don't perceive the direction you're referring to with "smaller unitary chunks" here? Hopefully it's clearer now with my answer above. we could publicly expose only the populate* functions that provide all the conversion patterns (i.e., populateStdToLLVMConversionPatterns and populateStdToLLVMBarePtrConversionPatterns). Again, I see some composability issue. What about the next orthogonal option? Are we gonna multiply the number of populateStdToLLVM*ConversionPatterns function by two? There is a combinatorial aspect that I don't understand how you see solved. This is a different composability problem but, yes, I agree with you. However, I don't think we are aiming at having every potential combination of customization available. If that were the case, we would have a combinatorial problem regardless of these interfaces being public or not. If we really need a high level of composability here, we could make all the LLVM conversion patterns public so that each client could build its own customized "populate" function. Pinging @ftynse since he also introduced some of these functions. Have you looked into injection instead of subclassing here? I assume this refers to the type conversion again. Not sure how injection could improve the design here. We would need either virtual functions or refactoring common code to utility functions. Do you see something I'm missing? Also, type converter is already injected so I think it would make usability more complicated. What do you think?
mlir/include/mlir/Transforms/DialectConversion.h
324	I tried using `replaceAllUsesWith` but it didn't work. The mapping from 'from' to 'to' is needed. I don't see any other option. Any suggestions?

mehdi_amini added inline comments.Jan 22 2020, 8:20 PM

mlir/include/mlir/Conversion/StandardToLLVM/ConvertStandardToLLVM.h
132	(Just noticed that the comment needs to be updated)
147	Unless I'm missing something, `makeStandardToLLVMBarePtrTypeConverter` isn't publicly exposed? (it is a `static` function in the implementation file)
mlir/include/mlir/Conversion/StandardToLLVM/ConvertStandardToLLVMPass.h
76	By refactoring code into smaller virtual functions doing only a simple thing (that's what I meant by unitary), we should be able to provide a higher level of customization and composability by just overloading the small function that refers to the part of the conversion that we want to customize. Maybe if you could provide an example of an "orthogonal customization" we could discuss based on something specific. Sure, let me clarify: right now you're overloading `convertArgType` by adding `BarePtrTypeConverter` , which only deals with MemRef function arguments. If later we want to have another customization point to handle conversion for other kind of argument, let say composite type. We'd have to overload the same method in another subclass `CustomCompositeTypeConverter`. But how would these two subclasses compose with each other? How do I get both the `bare` memref ABI and the custom composite types? Expand this to other customization points: assume we want to control the return type of function. We'll create another virtual function similar to `convertArgType`, let say `convertResultType`. We can overload this in subclasses, but again if one subclass is overloading `convertResultType` but I also want the `BarePtrTypeConverter`, how do I proceed? I assume this refers to the type conversion again. Not sure how injection could improve the design here. We would need either virtual functions or refactoring common code to utility functions. Do you see something I'm missing? Also, type converter is already injected so I think it would make usability more complicated. What do you think? What I'm referring to is that you instead of virtual functions, you can have a callback or a list of callbacks for each of the possible extension point. For instance instead of having: `LLVM::LLVMType convertArgType(Type type);` as a virtual method you could store: std::vector<std::function<LLVM::LLVMType(Type type)>> convertArgTypeImpls; And have: LLVM::LLVMType convertArgType(Type type) { for (auto &callback : convertArgTypeImpls) { type = callback(type); // possibly early exit on the first success instead? } return type; } This allows to inject customization both for different types and/or for different customization points (another vector of callbacks for example).

I'm supportive of this change as long as Mehdi's and River's concerns are addressed.

mlir/include/mlir/Conversion/StandardToLLVM/ConvertStandardToLLVMPass.h
76	It sounds like the scalable approach would be to implement type conversion following the similar pattern-based architecture we are building for op conversion... I considered doing that at some point. An intermediate proposition could be to do the dispatch manually instead of relying on virtual functions with something like class LLVMTypeCoverter { std::function<LLVMType(Type)> memrefConverter; std::function<LLVMType(Type)> funcArgumentConverter; // ... };
mlir/lib/Conversion/StandardToLLVM/ConvertStandardToLLVM.cpp
273	I'd rather return null here. This can happen in correct code and we shouldn't crash. Normally, the type conversion should propagate null-as-error-mark.

Thank you both for the feedback! It makes sense to me now. Let me rework this following your suggestions.
Just to recap, these are the pending issues:

LLVMTypeConverter composability/scalability: To be addressed following @mehdi_amini and @ftynse suggestions.
populate* functions scalability: No good solution for now. To be addressed in a separate commit (remove most of the populate* functions and make conversion patterns public could work)?
Enable bare pointer calling conversion in JITRunner
Do not expand scope of replaceUsesOfBlockArgument: Changes on replaceUsesOfBlockArgument seem necessary. Waiting for any other suggestions here (@rriddle?).

Please, let me know if I'm missing something or you have any other concerns before I proceed.
Thanks!

Hopefully this approach looks better now! Let's iterate on it.
Changes:

Add customization callbacks to LLVMTypeConverter.

This implementation is a stepping stone towards a pattern-based type conversion approach that allows customization of different points of the type conversion and different types for each customization point.

Re current implementation:
- It only allows customization of different points but not customization of different types for each customziation point for now but it should be easy to extend once the latter is needed.
- LLVMTypeConverterCustomization holds the callbacks for the customizations points available (only function argument types for now) and it's initialized by default to the pre-existing calling convention.
- 'makeStandardToLLVMBarePtrTypeConverter' is private on purpose to avoid a combinatorial explosion similar to that of populate* functions. Customization methods are made public to allow external clients to create their ad-hoc customizations.

Keep 'populateStdToLLVMBarePtrFuncOpConversionPattern'

This is to reduce the number of public populate* functions. Follow-up work is needed to avoid a potential combinatorial explosion. Probably making rewrite patterns public would help here.

Add test with mlir_cpu_runner

mlir-cpu-runner entry-point invocation is currently limited to a couple of function signatures. For that reason we use a () -> () entry-point function that allocates memrefs, initializes them, passes them to another function and print their output values. Therefore, no changes are needed at JitRunner level for now since the entry-point function won't have memref arguments.

Unit tests: fail. 61903 tests passed, 5 failed and 781 were skipped.

failed: libc++.std/language_support/cmp/cmp_partialord/partialord.pass.cpp
failed: libc++.std/language_support/cmp/cmp_strongeq/cmp.strongeq.pass.cpp
failed: libc++.std/language_support/cmp/cmp_strongord/strongord.pass.cpp
failed: libc++.std/language_support/cmp/cmp_weakeq/cmp.weakeq.pass.cpp
failed: libc++.std/language_support/cmp/cmp_weakord/weakord.pass.cpp

clang-tidy: pass.

clang-format: fail. Please format your changes with clang-format by running git-clang-format HEAD^ or applying this patch.

Build artifacts: diff.json, clang-tidy.txt, clang-format.patch, CMakeCache.txt, console-log.txt, test-results.xml

Pre-merge checks is in beta. Report issue. Please join beta or enable it for your project.

Harbormaster failed remote builds in B45139: Diff 240901!Jan 28 2020, 9:44 AM

This looks good to me. I don't fully understand the problem @rriddle mentioned with replaceUsesOfWith, and I wonder if we could have a solution that also avoids the need for the placeholder operation.

mlir/include/mlir/Transforms/DialectConversion.h
324	@rriddle could you please elaborate on how exactly it is broken?
mlir/lib/Conversion/StandardToLLVM/ConvertStandardToLLVM.cpp
555–561	Nit: convert input FuncOp to LLVMFuncOp
646	Could we exit early instead?
663	Should we also erase the placeholder op? This sounds there's a missing feature in the rewriter (or the IR in general) to replace all uses except the given ones...

rriddle added inline comments.Jan 29 2020, 8:30 AM

mlir/include/mlir/Transforms/DialectConversion.h
324	Sorry, I sent this when I was OOO and it got lost in my email. The current problem is that the transformation isn't "revertible". If a pattern that uses it would fail, there isn't a guarantee that the IR will be reverted to its original state. It currently internally directly replaces the uses of the argument, which isn't valid given that everything in DialectConversion has to be undoable.

Thanks. Addressing the comments:

Rebase
Addressed @ftynse comments
Revert changes on 'replaceUsesOfBlockArgument'

mlir/include/mlir/Transforms/DialectConversion.h
324	Thanks! Got it now... reverted. I can use `replaceUsesOfBlockArgument` + `rewriter.replaceOp` instead for my use case.
mlir/lib/Conversion/StandardToLLVM/ConvertStandardToLLVM.cpp
663	Should we also erase the placeholder op? I erased it in my previous version but it was a bug. The placeholder must be alive since it goes into rewriter's mapping. This sounds there's a missing feature in the rewriter (or the IR in general) to replace all uses except the given ones... We would need something like 'domInstFilter' in (https://github.com/llvm/llvm-project/blob/master/mlir/include/mlir/Transforms/Utils.h#L60) but for the rewritter. I initially considered it but I thought it would be an overkill for this case since we would need to compute dominance information and use `dominates` queries for each use, which is expensive. Also, dominance information would need to be computed at the time of materializing the replacement (?) instead of when adding the replacement to the map, which would make it more complicated. Maybe something to think about separately?

Unit tests: pass. 62310 tests passed, 0 failed and 838 were skipped.

clang-tidy: pass.

clang-format: pass.

Build artifacts: diff.json, clang-tidy.txt, clang-format.patch, CMakeCache.txt, console-log.txt, test-results.xml

Pre-merge checks is in beta. Report issue. Please join beta or enable it for your project.

Harbormaster completed remote builds in B45287: Diff 241258.Jan 29 2020, 1:03 PM

I'm fine with this, but please make sure @rriddle has no objections either.

mlir/include/mlir/Transforms/DialectConversion.h
324	I still see `replaceUsesOfValue` declared below, did you forget to upload something?
mlir/lib/Conversion/StandardToLLVM/ConvertStandardToLLVM.cpp
663	I erased it in my previous version but it was a bug. The placeholder must be alive since it goes into rewriter's mapping. Even through the rewriter? (`rewriter.erase` calls `rewriter.replaceOp` internally) We would need something like 'domInstFilter' It looks like here we are in a more specific case, which seems to come back several times on the function boundary: we need to inject a new operation that takes the region argument, and replace all existing uses of the region argument with the new op. This sounds like this should be possible with the `materializeConversion` hook (https://github.com/llvm/llvm-project/blob/master/mlir/include/mlir/Transforms/DialectConversion.h#L135). I can take a look into that, but it's better to have this landed first.

This revision is now accepted and ready to land.Jan 29 2020, 2:51 PM

Thanks, much better!

mlir/lib/Conversion/StandardToLLVM/ConvertStandardToLLVM.cpp
2342	This API seems to still be an issue for extensibility. If you create the LLVMTypeConverter instead of customizing an existing one, how will this the next extension point be implemented/used?

It would be nice to see in a future revision if we can cleanup some of the extensibility aspects of the LLVM type converter.

mlir/include/mlir/Conversion/StandardToLLVM/ConvertStandardToLLVM.h
41	using the
54	-> /
132–146	These should really be using ///
mlir/lib/Conversion/StandardToLLVM/ConvertStandardToLLVM.cpp
63	It would be much better if we could use proper pass options for these instead: https://mlir.llvm.org/docs/WritingAPass/#instance-specific-pass-options
76	Class/Function/Variable/etc. top-level comments should all be using ///
100	failed
107	Why not just return directly?
691	nit: Remove the newline
2342	+1. One direction that I intend to take TypeConverter is to allow registering additional callbacks for the type conversion similarly to how ConversionTarget is extensible. For these types of things, it is increasingly clear how clunky inheritance style modeling is.

Address feedback

Thanks for the feedback, again! I addressed it and replied inline.
This is a summary of the pending items that I would suggest addressing separately:

Utility to inject a new operation that takes the region argument, and replace all existing uses of the region argument with the new op (@ftynse).
Extensibility aspects of the LLVM type converter (@rriddle, @dcaballe can help if needed).
Use proper pass options for LLVMLoweringPass flags (@dcaballe).
Extensibility aspects of populate* functions for LLVMLoweringPass (@dcaballe will bring this to discourse).
Add support for CallOp to the bare pointer calling convention (TBD once #4 is addressed).

What do you think? Are you OK with this, @mehdi_amini?

mlir/include/mlir/Conversion/StandardToLLVM/ConvertStandardToLLVM.h
132–146	No strong opinion but just for my understanding... if I remember correctly, `///` is for Doxygen purposes so it's only needed for public interfaces and members (?). That's why `//` is used for private members and .cpp documentation. Isn't that the case? At least that's what I've seen around. LLVM Coding Standards seem a bit ambiguous here: https://llvm.org/docs/CodingStandards.html#doxygen-use-in-documentation-comments Don’t duplicate the documentation comment in the header file and in the implementation file. Put the documentation comments for public APIs into the header file. Documentation comments for private APIs can go to the implementation file. In any case, implementation files can include additional comments (not necessarily in Doxygen markup) to explain implementation details as needed.
mlir/include/mlir/Transforms/DialectConversion.h
324	Unused leftover, sorry.
mlir/lib/Conversion/StandardToLLVM/ConvertStandardToLLVM.cpp
63	I'm just trying to be consistent with the `-use-alloca` flag. I'll change this in a follow-up commit for both.
76	Same comment about Doxygen.
663	Even through the rewriter? (rewriter.erase calls rewriter.replaceOp internally) Oh, that would have worked for the first version, yes! It should be OK now since we are replacing the placeholder (line 684). Thanks! This sounds like this should be possible with the materializeConversion hook. I can take a look into that, but it's better to have this landed first. It sounds good. Thanks!
2342	I totally agree with that direction! I couldn't see how to go much further in this commit without introducing significant changes in the conversion infra. I think that should happen in a separate commit. Are you OK with the current approach for this commit, @mehdi_amini? Any specific suggestion for this commit that doesn't require too many infra changes?

rriddle added inline comments.Jan 30 2020, 5:33 PM

mlir/include/mlir/Conversion/StandardToLLVM/ConvertStandardToLLVM.h
132–146	I prefer to keep comment style consistent to make the code base uniform, and less error-prone. Refactoring can often make things public that were private, and private that were public. By having a consistent style, there isn't a need to even think about any of that stuff. So for the sake of MLIR I've been trying to make sure that everyone is consistent.

dcaballe marked an inline comment as done.Jan 30 2020, 5:43 PM

dcaballe added inline comments.

mlir/include/mlir/Conversion/StandardToLLVM/ConvertStandardToLLVM.h
132–146	Ok, that's reasonable. I'll change that before committing. Thanks!

Unit tests: fail. 62357 tests passed, 1 failed and 839 were skipped.

failed: libc++.std/containers/sequences/array/array_creation/to_array.fail.cpp

clang-tidy: pass.

clang-format: pass.

Build artifacts: diff.json, clang-tidy.txt, clang-format.patch, CMakeCache.txt, console-log.txt, test-results.xml

Pre-merge checks is in beta. Report issue. Please join beta or enable it for your project.

Harbormaster failed remote builds in B45406: Diff 241615!Jan 30 2020, 5:46 PM

mehdi_amini accepted this revision.Jan 30 2020, 7:23 PM

This is a summary of the pending items that I would suggest addressing separately:
<...>

Great, thanks for working on this! I've already started looking at possible solutions (https://reviews.llvm.org/D73702) on my side.

mlir/include/mlir/Conversion/StandardToLLVM/ConvertStandardToLLVM.h
132–146	Indeed, we have an (implicit) convention that top-level definitions and class members are commented with `///` (although not all of the code base follows, yet). Let's document it in https://mlir.llvm.org/getting_started/DeveloperGuide/#style-guide, we already have some restrictions on top of LLVM's style.

Herald added a subscriber: Joonsoo. · View Herald TranscriptJan 31 2020, 12:32 AM

ftynse marked an inline comment as done.Jan 31 2020, 12:39 AM

ftynse added inline comments.

mlir/include/mlir/Conversion/StandardToLLVM/ConvertStandardToLLVM.h
132–146	https://github.com/llvm/mlir-www/pull/8

By the way, it wasn’t announced widely but : https://mlir.llvm.org/doxygen/

Rebase to see if pre-commit test failure is gone
Fixed doxygen comments
Added -convert-loop-to-std flag to mlir-cpu-runner test. It failed without it after rebase.

Thank you all! Updating diff to see if the test failure is gone. I'll commit it in a few hours.

In D72802#1851916, @mehdi_amini wrote:

By the way, it wasn’t announced widely but : https://mlir.llvm.org/doxygen/

That's awesome! I use it a lot and had to be building it locally. Thanks!

mlir/include/mlir/Conversion/StandardToLLVM/ConvertStandardToLLVM.h
132–146	Great, thanks!

dcaballe mentioned this in D73795: [mlir] Drop customization hooks from StandardToLLVM conversion.Jan 31 2020, 12:46 PM

Unit tests: unknown.

clang-tidy: pass.

clang-format: pass.

Build artifacts: diff.json, clang-tidy.txt, clang-format.patch, CMakeCache.txt, console-log.txt

Pre-merge checks is in beta. Report issue. Please join beta or enable it for your project.

Harbormaster failed remote builds in B45478: Diff 241793!Jan 31 2020, 12:53 PM

Unit tests: unknown.

clang-tidy: pass.

clang-format: pass.

Build artifacts: diff.json, clang-tidy.txt, clang-format.patch, CMakeCache.txt, console-log.txt

Pre-merge checks is in beta. Report issue. Please join beta or enable it for your project.

Rebase

AFAIK, MLIR changes cannot affect libc++ tests so it's okay to land if those are the only problem.

Unit tests: pass. 62377 tests passed, 0 failed and 839 were skipped.

clang-tidy: pass.

clang-format: pass.

Build artifacts: diff.json, clang-tidy.txt, clang-format.patch, CMakeCache.txt, console-log.txt, test-results.xml

Pre-merge checks is in beta. Report issue. Please join beta or enable it for your project.

Harbormaster completed remote builds in B45489: Diff 241823.Jan 31 2020, 3:12 PM

Closed by commit rGe5aaf30cf1ab: [mlir] Introduce bare ptr calling convention for MemRefs in LLVM dialect (authored by dcaballe). · Explain WhyJan 31 2020, 3:22 PM

This revision was automatically updated to reflect the committed changes.

dcaballe mentioned this in D73912: [mlir] Turn flags in ConvertStandardToLLVM into pass flags.Feb 3 2020, 11:21 AM

dcaballe mentioned this in rG696f80736b86: [mlir] Turn flags in ConvertStandardToLLVM into pass flags.Feb 11 2020, 10:33 AM

Revision Contents

Path

Size

mlir/

include/

mlir/

Conversion/

StandardToLLVM/

ConvertStandardToLLVM.h

25 lines

ConvertStandardToLLVMPass.h

22 lines

Transforms/

DialectConversion.h

2 lines

lib/

Conversion/

GPUToNVVM/

LowerGpuOpsToNVVMOps.cpp

2 lines

StandardToLLVM/

ConvertStandardToLLVM.cpp

196 lines

Transforms/

DialectConversion.cpp

3 lines

test/

Conversion/

StandardToLLVM/

	convert-dynamic-memref-ops.mlir
	convert-memref-ops.mlir

173 lines

convert-memref-ops.mlir

convert-static-memref-ops.mlir

322 lines

Diff 239442

mlir/include/mlir/Conversion/StandardToLLVM/ConvertStandardToLLVM.h

Show All 32 Lines
class LLVMType;		class LLVMType;
} // namespace LLVM		} // namespace LLVM

/// Conversion from types in the Standard dialect to the LLVM IR dialect.		/// Conversion from types in the Standard dialect to the LLVM IR dialect.
class LLVMTypeConverter : public TypeConverter {		class LLVMTypeConverter : public TypeConverter {
public:		public:
using TypeConverter::convertType;		using TypeConverter::convertType;

LLVMTypeConverter(MLIRContext *ctx);		LLVMTypeConverter(MLIRContext *ctx);
		rriddleUnsubmitted Done Reply Inline Actions using the rriddle: using the

/// Convert types to LLVM IR. This calls `convertAdditionalType` to convert		/// Convert types to LLVM IR. This calls `convertAdditionalType` to convert
/// non-standard or non-builtin types.		/// non-standard or non-builtin types.
Type convertType(Type t) override;		Type convertType(Type t) override;

/// Convert a function type. The arguments and results are converted one by		/// Convert a function type. The arguments and results are converted one by
/// one and results are packed into a wrapped LLVM IR structure type. `result`		/// one and results are packed into a wrapped LLVM IR structure type. `result`
/// is populated with argument mapping.		/// is populated with argument mapping.
LLVM::LLVMType convertFunctionSignature(FunctionType type, bool isVariadic,		LLVM::LLVMType convertFunctionSignature(FunctionType type, bool isVariadic,
SignatureConversion &result);		SignatureConversion &result);

/// Convert a non-empty list of types to be returned from a function into a		/// Convert a non-empty list of types to be returned from a function into a
/// supported LLVM IR type. In particular, if more than one values is		/// supported LLVM IR type. In particular, if more than one values is
		rriddleUnsubmitted Done Reply Inline Actions -> / rriddle: // -> ///
/// returned, create an LLVM IR structure type with elements that correspond		/// returned, create an LLVM IR structure type with elements that correspond
/// to each of the MLIR types converted with `convertType`.		/// to each of the MLIR types converted with `convertType`.
Type packFunctionResults(ArrayRef<Type> types);		Type packFunctionResults(ArrayRef<Type> types);

/// Returns the LLVM context.		/// Returns the LLVM context.
llvm::LLVMContext &getLLVMContext();		llvm::LLVMContext &getLLVMContext();

/// Returns the LLVM dialect.		/// Returns the LLVM dialect.
Show All 9 Lines	public:

/// Promote the LLVM struct representation of one MemRef descriptor to stack		/// Promote the LLVM struct representation of one MemRef descriptor to stack
/// and use pointer to struct to avoid the complexity of the platform-specific		/// and use pointer to struct to avoid the complexity of the platform-specific
/// C/C++ ABI lowering related to struct argument passing.		/// C/C++ ABI lowering related to struct argument passing.
Value promoteOneMemRefDescriptor(Location loc, Value operand,		Value promoteOneMemRefDescriptor(Location loc, Value operand,
OpBuilder &builder);		OpBuilder &builder);

protected:		protected:
		/// Convert a function argument type to an LLVM type using 'convertType'.
		/// MemRef arguments are promoted to a pointer to the converted type.
		virtual LLVM::LLVMType convertArgType(Type type);

/// LLVM IR module used to parse/create types.		/// LLVM IR module used to parse/create types.
llvm::Module *module;		llvm::Module *module;
LLVM::LLVMDialect *llvmDialect;		LLVM::LLVMDialect *llvmDialect;

		// Extract an LLVM IR dialect type.
		LLVM::LLVMType unwrap(Type type);

private:		private:
Type convertStandardType(Type type);		Type convertStandardType(Type type);

// Convert a function type. The arguments and results are converted one by		// Convert a function type. The arguments and results are converted one by
// one. Additionally, if the function returns more than one value, pack the		// one. Additionally, if the function returns more than one value, pack the
// results into an LLVM IR structure type so that the converted function type		// results into an LLVM IR structure type so that the converted function type
// returns at most one result.		// returns at most one result.
Type convertFunctionType(FunctionType type);		Type convertFunctionType(FunctionType type);
Show All 23 Lines	private:
Type convertUnrankedMemRefType(UnrankedMemRefType type);		Type convertUnrankedMemRefType(UnrankedMemRefType type);

// Convert a 1D vector type into an LLVM vector type.		// Convert a 1D vector type into an LLVM vector type.
Type convertVectorType(VectorType type);		Type convertVectorType(VectorType type);

// Get the LLVM representation of the index type based on the bitwidth of the		// Get the LLVM representation of the index type based on the bitwidth of the
// pointer as defined by the data layout of the module.		// pointer as defined by the data layout of the module.
LLVM::LLVMType getIndexType();		LLVM::LLVMType getIndexType();
		};

// Extract an LLVM IR dialect type.		/// Custom LLVMTypeConverter that overrides `convertFunctionSignature` to
		mehdi_aminiUnsubmitted Done Reply Inline Actions (Just noticed that the comment needs to be updated) mehdi_amini: (Just noticed that the comment needs to be updated)
LLVM::LLVMType unwrap(Type type);		/// replace the type of MemRef function arguments with a bare pointer to the
		/// MemRef element type.
		class BarePtrTypeConverter : public mlir::LLVMTypeConverter {
		public:
		using LLVMTypeConverter::LLVMTypeConverter;

		private:
		/// Convert a function argument type to an LLVM type using 'convertType'
		/// except for MemRef arguments. MemRef types are converted to LLVM bare
		/// pointers to the MemRef element type.
		LLVM::LLVMType convertArgType(Type type) override;

		/// Converts MemRef type to an LLVM bare pointer to the MemRef element type.
		mlir::Type convertMemRefTypeToBarePtr(mlir::MemRefType type);
		rriddleUnsubmitted Not Done Reply Inline Actions These should really be using /// rriddle: These should really be using ///
		dcaballeAuthorUnsubmitted Done Reply Inline Actions No strong opinion but just for my understanding... if I remember correctly, `///` is for Doxygen purposes so it's only needed for public interfaces and members (?). That's why `//` is used for private members and .cpp documentation. Isn't that the case? At least that's what I've seen around. LLVM Coding Standards seem a bit ambiguous here: https://llvm.org/docs/CodingStandards.html#doxygen-use-in-documentation-comments Don’t duplicate the documentation comment in the header file and in the implementation file. Put the documentation comments for public APIs into the header file. Documentation comments for private APIs can go to the implementation file. In any case, implementation files can include additional comments (not necessarily in Doxygen markup) to explain implementation details as needed. dcaballe: No strong opinion but just for my understanding... if I remember correctly, `///` is for…
		rriddleUnsubmitted Not Done Reply Inline Actions I prefer to keep comment style consistent to make the code base uniform, and less error-prone. Refactoring can often make things public that were private, and private that were public. By having a consistent style, there isn't a need to even think about any of that stuff. So for the sake of MLIR I've been trying to make sure that everyone is consistent. rriddle: I prefer to keep comment style consistent to make the code base uniform, and less error-prone.
		dcaballeAuthorUnsubmitted Done Reply Inline Actions Ok, that's reasonable. I'll change that before committing. Thanks! dcaballe: Ok, that's reasonable. I'll change that before committing. Thanks!
		ftynseUnsubmitted Not Done Reply Inline Actions Indeed, we have an (implicit) convention that top-level definitions and class members are commented with `///` (although not all of the code base follows, yet). Let's document it in https://mlir.llvm.org/getting_started/DeveloperGuide/#style-guide, we already have some restrictions on top of LLVM's style. ftynse: Indeed, we have an (implicit) convention that top-level definitions and class members are…
		ftynseUnsubmitted Done Reply Inline Actions https://github.com/llvm/mlir-www/pull/8 ftynse: https://github.com/llvm/mlir-www/pull/8
		dcaballeAuthorUnsubmitted Done Reply Inline Actions Great, thanks! dcaballe: Great, thanks!
};		};
		mehdi_aminiUnsubmitted Not Done Reply Inline Actions I am missing how this gets used (other than in testing with the cl::opt) right now? mehdi_amini: I am missing how this gets used (other than in testing with the cl::opt) right now?
		dcaballeAuthorUnsubmitted Done Reply Inline Actions That's a good point. I'm currently using it locally by invoking `createLowerToLLVM` with the provided `populateStdToLLVMBarePtrConversionPatterns` `makeStandardToLLVMBarePtrTypeConverter`. I can add changes and a flag to JitRunner so that we can use it through cpu runner. What do you think? dcaballe: That's a good point. I'm currently using it locally by invoking `createLowerToLLVM` with the…
		mehdi_aminiUnsubmitted Not Done Reply Inline Actions Unless I'm missing something, `makeStandardToLLVMBarePtrTypeConverter` isn't publicly exposed? (it is a `static` function in the implementation file) mehdi_amini: Unless I'm missing something, `makeStandardToLLVMBarePtrTypeConverter` isn't publicly exposed?

/// Helper class to produce LLVM dialect operations extracting or inserting		/// Helper class to produce LLVM dialect operations extracting or inserting
/// values to a struct.		/// values to a struct.
class StructBuilder {		class StructBuilder {
public:		public:
/// Construct a helper for the given value.		/// Construct a helper for the given value.
explicit StructBuilder(Value v);		explicit StructBuilder(Value v);
/// Builds IR creating an `undef` value of the descriptor type.		/// Builds IR creating an `undef` value of the descriptor type.
▲ Show 20 Lines • Show All 110 Lines • Show Last 20 Lines

mlir/include/mlir/Conversion/StandardToLLVM/ConvertStandardToLLVMPass.h

	Show All 38 Lines

	/// Type for a callback constructing the type converter for the conversion to			/// Type for a callback constructing the type converter for the conversion to
	/// the LLVMIR dialect. The callback is expected to return an instance of the			/// the LLVMIR dialect. The callback is expected to return an instance of the
	/// converter.			/// converter.
	using LLVMTypeConverterMaker =			using LLVMTypeConverterMaker =
	std::function<std::unique_ptr<LLVMTypeConverter>(MLIRContext *)>;			std::function<std::unique_ptr<LLVMTypeConverter>(MLIRContext *)>;

	/// Collect a set of patterns to convert memory-related operations from the			/// Collect a set of patterns to convert memory-related operations from the
	/// Standard dialect to the LLVM dialect, excluding the memory-related			/// Standard dialect to the LLVM dialect, excluding non-memory-related
	/// operations.			/// operations and FuncOp.
	void populateStdToLLVMMemoryConversionPatters(			void populateStdToLLVMMemoryConversionPatters(
	LLVMTypeConverter &converter, OwningRewritePatternList &patterns);			LLVMTypeConverter &converter, OwningRewritePatternList &patterns);

	/// Collect a set of patterns to convert from the Standard dialect to the LLVM			/// Collect a set of patterns to convert from the Standard dialect to the LLVM
	/// dialect, excluding the memory-related operations.			/// dialect, excluding the memory-related operations.
	void populateStdToLLVMNonMemoryConversionPatterns(			void populateStdToLLVMNonMemoryConversionPatterns(
	LLVMTypeConverter &converter, OwningRewritePatternList &patterns);			LLVMTypeConverter &converter, OwningRewritePatternList &patterns);

	/// Collect a set of patterns to convert from the Standard dialect to LLVM.			/// Collect the default pattern to convert a FuncOp to the LLVM dialect.
				void populateStdToLLVMDefaultFuncOpConversionPattern(
				LLVMTypeConverter &converter, OwningRewritePatternList &patterns);

				/// Collect a set of default patterns to convert from the Standard dialect to
				/// LLVM.
	void populateStdToLLVMConversionPatterns(LLVMTypeConverter &converter,			void populateStdToLLVMConversionPatterns(LLVMTypeConverter &converter,
	OwningRewritePatternList &patterns);			OwningRewritePatternList &patterns);

				/// Collect the pattern to convert a FuncOp to the LLVM dialect using the bare
				/// pointer calling convertion for MemRef function arguments.
				void populateStdToLLVMBarePtrFuncOpConversionPattern(
				LLVMTypeConverter &converter, OwningRewritePatternList &patterns);

				/// Collect a set of patterns to convert from the Standard dialect to
				/// LLVM using the bare pointer calling convention for MemRef function
				/// arguments.
				void populateStdToLLVMBarePtrConversionPatterns(
				LLVMTypeConverter &converter, OwningRewritePatternList &patterns);

				mehdi_aminiUnsubmitted Not Done Reply Inline Actions I'm concerned about the scalability of this approach. You're making the type converter extensible through inheritance and overloading, which is not gonna compose well: it is impossible to another orthogonal customization point. Then extending it with a new subclass requires new header entry points: the multiple populate* functions does not seems very nice to me already, and this is making it just harder to figure out how to use this header. If we are convinced that we don't need to scale or that we won't need more customization point, then I rather not use any virtual function and look into passing a config struct with flag on the lowering behavior we want. mehdi_amini: I'm concerned about the scalability of this approach. You're making the type converter…
				dcaballeAuthorUnsubmitted Done Reply Inline Actions Thanks for the feedback, Mehdi. You're making the type converter extensible through inheritance and overloading, which is not gonna compose well: it is impossible to another orthogonal customization point. I think this looks better after the refactoring since I'm only introducing `convertArgType`, which should be a unitary piece of the type conversion and easy to compose with other customizations. If I understood correctly, the general direction is that developers may want to create custom lowerings to LLVM, even out of tree. If that is the case, we should try to make the LLVM type converter more friendly to that and facilitate code reuse as much as possible. I think that breaking LLVM type conversion into smaller unitary chucks would help and this patch goes towards that direction. If eventually the current LLVMTypeConverter makes composability "impossible to another orthogonal customization points", we could move the current "default" implementation to a sub-class and only keep the code or API that is generic enough, unitary and reusable in LLVMTypeConverter. Would that make sense? Then extending it with a new subclass requires new header entry points: the multiple populate* functions does not seems very nice to me already, and this is making it just harder to figure out how to use this header. I see your point. IIRC, the recently added public populate* functions were aimed at facilitating the reuse of existing patterns in custom lowerings, particularly out of tree. Currently, there is no other way to make these patterns available since they are private to this translation unit. If we don't want to change the latter, we could publicly expose only the populate* functions that provide all the conversion patterns (i.e., `populateStdToLLVMConversionPatterns` and `populateStdToLLVMBarePtrConversionPatterns`). Someone could always add custom patterns with more priority to override the default ones. Not a very clean solution, though. If we are convinced that we don't need to scale or that we won't need more customization point, then I rather not use any virtual function and look into passing a config struct with flag on the lowering behavior we want. It's difficult to know at this point and the feedback that I've heard so far is in the direction of having custom lowerings. I suggested an abstraction to customize memref lowering that didn't go through (https://github.com/tensorflow/mlir/pull/337) and more people are interested in lowering memrefs in different ways (https://groups.google.com/a/tensorflow.org/forum/?utm_medium=email&utm_source=footer#!msg/mlir/9UcFIefP9u0/3ujw73F8BAAJ) dcaballe: Thanks for the feedback, Mehdi. > You're making the type converter extensible through…
				mehdi_aminiUnsubmitted Not Done Reply Inline Actions I think this looks better after the refactoring since I'm only introducing convertArgType, which should be a unitary piece of the type conversion and easy to compose with other customizations. I'm not sure I understand what you mean here: are you saying that LLVMTypeConverter will never ever have any other virtual method that you would need for customization? I think that breaking LLVM type conversion into smaller unitary chucks would help and this patch goes towards that direction. How is this patch going in this direction? Maybe I don't perceive the direction you're referring to with "smaller unitary chunks" here? we could publicly expose only the populate* functions that provide all the conversion patterns (i.e., populateStdToLLVMConversionPatterns and populateStdToLLVMBarePtrConversionPatterns). Again, I see some composability issue. What about the next orthogonal option? Are we gonna multiply the number of `populateStdToLLVMConversionPatterns` function by two? There is a combinatorial aspect that I don't understand how you see solved. Unless the "BarePtr" is the only ever customization we want to apply, I don't see how this is scaling forward right now. Have you looked into injection instead of subclassing here? mehdi_amini:* > I think this looks better after the refactoring since I'm only introducing convertArgType…
				dcaballeAuthorUnsubmitted Not Done Reply Inline Actions I think this looks better after the refactoring since I'm only introducing convertArgType, which should be a unitary piece of the type conversion and easy to compose with other customizations. I'm not sure I understand what you mean here: are you saying that LLVMTypeConverter will never ever have any other virtual method that you would need for customization? Not at all! IIUC, your concern about composability was on the first version of the code where I was overloading `convertFunctionSignature`, a conversion method that does quite a few simple steps altogether (convert argument types, convert return type, create a new function type, etc.). I agree that overloading that method would make composability/reuse of those simple steps difficult. In the second version, I refactored the code that converted argument types into an independent function. By refactoring code into smaller virtual functions doing only a simple thing (that's what I meant by unitary), we should be able to provide a higher level of customization and composability by just overloading the small function that refers to the part of the conversion that we want to customize. Maybe if you could provide an example of an "orthogonal customization" we could discuss based on something specific. I think that breaking LLVM type conversion into smaller unitary chucks would help and this patch goes towards that direction. How is this patch going in this direction? Maybe I don't perceive the direction you're referring to with "smaller unitary chunks" here? Hopefully it's clearer now with my answer above. we could publicly expose only the populate* functions that provide all the conversion patterns (i.e., populateStdToLLVMConversionPatterns and populateStdToLLVMBarePtrConversionPatterns). Again, I see some composability issue. What about the next orthogonal option? Are we gonna multiply the number of populateStdToLLVMConversionPatterns function by two? There is a combinatorial aspect that I don't understand how you see solved. This is a different composability problem but, yes, I agree with you. However, I don't think we are aiming at having every potential combination of customization available. If that were the case, we would have a combinatorial problem regardless of these interfaces being public or not. If we really need a high level of composability here, we could make all the LLVM conversion patterns public so that each client could build its own customized "populate" function. Pinging @ftynse since he also introduced some of these functions. Have you looked into injection instead of subclassing here? I assume this refers to the type conversion again. Not sure how injection could improve the design here. We would need either virtual functions or refactoring common code to utility functions. Do you see something I'm missing? Also, type converter is already injected so I think it would make usability more complicated. What do you think? dcaballe:* >>I think this looks better after the refactoring since I'm only introducing convertArgType…
				mehdi_aminiUnsubmitted Not Done Reply Inline Actions By refactoring code into smaller virtual functions doing only a simple thing (that's what I meant by unitary), we should be able to provide a higher level of customization and composability by just overloading the small function that refers to the part of the conversion that we want to customize. Maybe if you could provide an example of an "orthogonal customization" we could discuss based on something specific. Sure, let me clarify: right now you're overloading `convertArgType` by adding `BarePtrTypeConverter` , which only deals with MemRef function arguments. If later we want to have another customization point to handle conversion for other kind of argument, let say composite type. We'd have to overload the same method in another subclass `CustomCompositeTypeConverter`. But how would these two subclasses compose with each other? How do I get both the `bare` memref ABI and the custom composite types? Expand this to other customization points: assume we want to control the return type of function. We'll create another virtual function similar to `convertArgType`, let say `convertResultType`. We can overload this in subclasses, but again if one subclass is overloading `convertResultType` but I also want the `BarePtrTypeConverter`, how do I proceed? I assume this refers to the type conversion again. Not sure how injection could improve the design here. We would need either virtual functions or refactoring common code to utility functions. Do you see something I'm missing? Also, type converter is already injected so I think it would make usability more complicated. What do you think? What I'm referring to is that you instead of virtual functions, you can have a callback or a list of callbacks for each of the possible extension point. For instance instead of having: `LLVM::LLVMType convertArgType(Type type);` as a virtual method you could store: std::vector<std::function<LLVM::LLVMType(Type type)>> convertArgTypeImpls; And have: LLVM::LLVMType convertArgType(Type type) { for (auto &callback : convertArgTypeImpls) { type = callback(type); // possibly early exit on the first success instead? } return type; } This allows to inject customization both for different types and/or for different customization points (another vector of callbacks for example). mehdi_amini: > By refactoring code into smaller virtual functions doing only a simple thing (that's what I…
				ftynseUnsubmitted Not Done Reply Inline Actions It sounds like the scalable approach would be to implement type conversion following the similar pattern-based architecture we are building for op conversion... I considered doing that at some point. An intermediate proposition could be to do the dispatch manually instead of relying on virtual functions with something like class LLVMTypeCoverter { std::function<LLVMType(Type)> memrefConverter; std::function<LLVMType(Type)> funcArgumentConverter; // ... }; ftynse: It sounds like the scalable approach would be to implement type conversion following the…
	/// Creates a pass to convert the Standard dialect into the LLVMIR dialect.			/// Creates a pass to convert the Standard dialect into the LLVMIR dialect.
	/// By default stdlib malloc/free are used for allocating MemRef payloads.			/// By default stdlib malloc/free are used for allocating MemRef payloads.
	/// Specifying `useAlloca-true` emits stack allocations instead. In the future			/// Specifying `useAlloca-true` emits stack allocations instead. In the future
	/// this may become an enum when we have concrete uses for other options.			/// this may become an enum when we have concrete uses for other options.
	std::unique_ptr<OpPassBase<ModuleOp>>			std::unique_ptr<OpPassBase<ModuleOp>>
	createLowerToLLVMPass(bool useAlloca = false);			createLowerToLLVMPass(bool useAlloca = false);

	/// Creates a pass to convert operations to the LLVMIR dialect. The conversion			/// Creates a pass to convert operations to the LLVMIR dialect. The conversion
	▲ Show 20 Lines • Show All 41 Lines • Show Last 20 Lines

mlir/include/mlir/Transforms/DialectConversion.h

Show First 20 Lines • Show All 315 Lines • ▼ Show 20 Lines	public:
/// Apply a signature conversion to the entry block of the given region. This		/// Apply a signature conversion to the entry block of the given region. This
/// replaces the entry block with a new block containing the updated		/// replaces the entry block with a new block containing the updated
/// signature. The new entry block to the region is returned for convenience.		/// signature. The new entry block to the region is returned for convenience.
Block *		Block *
applySignatureConversion(Region *region,		applySignatureConversion(Region *region,
TypeConverter::SignatureConversion &conversion);		TypeConverter::SignatureConversion &conversion);

/// Replace all the uses of the block argument `from` with value `to`.		/// Replace all the uses of the block argument `from` with value `to`.
void replaceUsesOfBlockArgument(BlockArgument from, Value to);		void replaceUsesOfWith(Value from, Value to);
		rriddleUnsubmitted Not Done Reply Inline Actions This function is already broken, I'd rather not expand its scope until its fixed. rriddle: This function is already broken, I'd rather not expand its scope until its fixed.
		dcaballeAuthorUnsubmitted Not Done Reply Inline Actions It's currently being used. Why is it broken? Maybe I could just replace all the uses (lines 683 and 687) with other utilities but I thought adding a mapping from 'from' to 'to' to pattern-rewriter would be necessary. dcaballe: It's currently being used. Why is it broken? Maybe I could just replace all the uses (lines 683…
		dcaballeAuthorUnsubmitted Done Reply Inline Actions I tried using `replaceAllUsesWith` but it didn't work. The mapping from 'from' to 'to' is needed. I don't see any other option. Any suggestions? dcaballe: I tried using `replaceAllUsesWith` but it didn't work. The mapping from 'from' to 'to' is…
		ftynseUnsubmitted Not Done Reply Inline Actions @rriddle could you please elaborate on how exactly it is broken? ftynse: @rriddle could you please elaborate on how exactly it is broken?
		rriddleUnsubmitted Not Done Reply Inline Actions Sorry, I sent this when I was OOO and it got lost in my email. The current problem is that the transformation isn't "revertible". If a pattern that uses it would fail, there isn't a guarantee that the IR will be reverted to its original state. It currently internally directly replaces the uses of the argument, which isn't valid given that everything in DialectConversion has to be undoable. rriddle: Sorry, I sent this when I was OOO and it got lost in my email. The current problem is that the…
		dcaballeAuthorUnsubmitted Done Reply Inline Actions Thanks! Got it now... reverted. I can use `replaceUsesOfBlockArgument` + `rewriter.replaceOp` instead for my use case. dcaballe: Thanks! Got it now... reverted. I can use `replaceUsesOfBlockArgument` + `rewriter.replaceOp`…
		ftynseUnsubmitted Done Reply Inline Actions I still see `replaceUsesOfValue` declared below, did you forget to upload something? ftynse: I still see `replaceUsesOfValue` declared below, did you forget to upload something?
		dcaballeAuthorUnsubmitted Done Reply Inline Actions Unused leftover, sorry. dcaballe: Unused leftover, sorry.

/// Return the converted value that replaces 'key'. Return 'key' if there is		/// Return the converted value that replaces 'key'. Return 'key' if there is
/// no such a converted value.		/// no such a converted value.
Value getRemappedValue(Value key);		Value getRemappedValue(Value key);

//===--------------------------------------------------------------------===//		//===--------------------------------------------------------------------===//
// PatternRewriter Hooks		// PatternRewriter Hooks
//===--------------------------------------------------------------------===//		//===--------------------------------------------------------------------===//
▲ Show 20 Lines • Show All 343 Lines • Show Last 20 Lines

mlir/lib/Conversion/GPUToNVVM/LowerGpuOpsToNVVMOps.cpp

Show First 20 Lines • Show All 661 Lines • ▼ Show 20 Lines	rewriter.applySignatureConversion(&llvmFuncOp.getBody(),

for (auto en : llvm::enumerate(gpuFuncOp.getType().getInputs())) {		for (auto en : llvm::enumerate(gpuFuncOp.getType().getInputs())) {
if (!en.value().isa<MemRefType>() &&		if (!en.value().isa<MemRefType>() &&
!en.value().isa<UnrankedMemRefType>())		!en.value().isa<UnrankedMemRefType>())
continue;		continue;

BlockArgument arg = block.getArgument(en.index());		BlockArgument arg = block.getArgument(en.index());
Value loaded = rewriter.create<LLVM::LoadOp>(loc, arg);		Value loaded = rewriter.create<LLVM::LoadOp>(loc, arg);
rewriter.replaceUsesOfBlockArgument(arg, loaded);		rewriter.replaceUsesOfWith(arg, loaded);
}		}
}		}

rewriter.eraseOp(gpuFuncOp);		rewriter.eraseOp(gpuFuncOp);
return matchSuccess();		return matchSuccess();
}		}
};		};

▲ Show 20 Lines • Show All 80 Lines • Show Last 20 Lines

mlir/lib/Conversion/StandardToLLVM/ConvertStandardToLLVM.cpp

Show All 38 Lines
static llvm::cl::OptionCategory		static llvm::cl::OptionCategory
clOptionsCategory("Standard to LLVM lowering options");		clOptionsCategory("Standard to LLVM lowering options");

static llvm::cl::opt<bool>		static llvm::cl::opt<bool>
clUseAlloca(PASS_NAME "-use-alloca",		clUseAlloca(PASS_NAME "-use-alloca",
llvm::cl::desc("Replace emission of malloc/free by alloca"),		llvm::cl::desc("Replace emission of malloc/free by alloca"),
llvm::cl::init(false));		llvm::cl::init(false));

		static llvm::cl::opt<bool> clUseBarePtrCallConv(
		PASS_NAME "-use-bare-ptr-memref-call-conv",
		llvm::cl::desc("Replace FuncOp's MemRef arguments with "
		"bare pointers to the MemRef element types"),
		llvm::cl::init(false));

LLVMTypeConverter::LLVMTypeConverter(MLIRContext *ctx)		LLVMTypeConverter::LLVMTypeConverter(MLIRContext *ctx)
: llvmDialect(ctx->getRegisteredDialect<LLVM::LLVMDialect>()) {		: llvmDialect(ctx->getRegisteredDialect<LLVM::LLVMDialect>()) {
assert(llvmDialect && "LLVM IR dialect is not registered");		assert(llvmDialect && "LLVM IR dialect is not registered");
module = &llvmDialect->getLLVMModule();		module = &llvmDialect->getLLVMModule();
}		}

// Get the LLVM context.		// Get the LLVM context.
llvm::LLVMContext &LLVMTypeConverter::getLLVMContext() {		llvm::LLVMContext &LLVMTypeConverter::getLLVMContext() {
return module->getContext();		return module->getContext();
}		}

		rriddleUnsubmitted Not Done Reply Inline Actions It would be much better if we could use proper pass options for these instead: https://mlir.llvm.org/docs/WritingAPass/#instance-specific-pass-options rriddle: It would be much better if we could use proper pass options for these instead: https://mlir.
		dcaballeAuthorUnsubmitted Done Reply Inline Actions I'm just trying to be consistent with the `-use-alloca` flag. I'll change this in a follow-up commit for both. dcaballe: I'm just trying to be consistent with the `-use-alloca` flag. I'll change this in a follow-up…
// Extract an LLVM IR type from the LLVM IR dialect type.		// Extract an LLVM IR type from the LLVM IR dialect type.
LLVM::LLVMType LLVMTypeConverter::unwrap(Type type) {		LLVM::LLVMType LLVMTypeConverter::unwrap(Type type) {
if (!type)		if (!type)
return nullptr;		return nullptr;
auto *mlirContext = type.getContext();		auto *mlirContext = type.getContext();
auto wrappedLLVMType = type.dyn_cast<LLVM::LLVMType>();		auto wrappedLLVMType = type.dyn_cast<LLVM::LLVMType>();
if (!wrappedLLVMType)		if (!wrappedLLVMType)
emitError(UnknownLoc::get(mlirContext),		emitError(UnknownLoc::get(mlirContext),
"conversion resulted in a non-LLVM type");		"conversion resulted in a non-LLVM type");
return wrappedLLVMType;		return wrappedLLVMType;
}		}

LLVM::LLVMType LLVMTypeConverter::getIndexType() {		LLVM::LLVMType LLVMTypeConverter::getIndexType() {
		rriddleUnsubmitted Not Done Reply Inline Actions Class/Function/Variable/etc. top-level comments should all be using /// rriddle: Class/Function/Variable/etc. top-level comments should all be using ///
		dcaballeAuthorUnsubmitted Done Reply Inline Actions Same comment about Doxygen. dcaballe: Same comment about Doxygen.
return LLVM::LLVMType::getIntNTy(		return LLVM::LLVMType::getIntNTy(
llvmDialect, module->getDataLayout().getPointerSizeInBits());		llvmDialect, module->getDataLayout().getPointerSizeInBits());
}		}

Type LLVMTypeConverter::convertIndexType(IndexType type) {		Type LLVMTypeConverter::convertIndexType(IndexType type) {
return getIndexType();		return getIndexType();
}		}

Type LLVMTypeConverter::convertIntegerType(IntegerType type) {		Type LLVMTypeConverter::convertIntegerType(IntegerType type) {
return LLVM::LLVMType::getIntNTy(llvmDialect, type.getWidth());		return LLVM::LLVMType::getIntNTy(llvmDialect, type.getWidth());
}		}

Type LLVMTypeConverter::convertFloatType(FloatType type) {		Type LLVMTypeConverter::convertFloatType(FloatType type) {
switch (type.getKind()) {		switch (type.getKind()) {
case mlir::StandardTypes::F32:		case mlir::StandardTypes::F32:
return LLVM::LLVMType::getFloatTy(llvmDialect);		return LLVM::LLVMType::getFloatTy(llvmDialect);
case mlir::StandardTypes::F64:		case mlir::StandardTypes::F64:
return LLVM::LLVMType::getDoubleTy(llvmDialect);		return LLVM::LLVMType::getDoubleTy(llvmDialect);
case mlir::StandardTypes::F16:		case mlir::StandardTypes::F16:
return LLVM::LLVMType::getHalfTy(llvmDialect);		return LLVM::LLVMType::getHalfTy(llvmDialect);
case mlir::StandardTypes::BF16: {		case mlir::StandardTypes::BF16: {
auto *mlirContext = llvmDialect->getContext();		auto *mlirContext = llvmDialect->getContext();
return emitError(UnknownLoc::get(mlirContext), "unsupported type: BF16"),		return emitError(UnknownLoc::get(mlirContext), "unsupported type: BF16"),
Type();		Type();
		rriddleUnsubmitted Done Reply Inline Actions failed rriddle: failed
}		}
default:		default:
llvm_unreachable("non-float type in convertFloatType");		llvm_unreachable("non-float type in convertFloatType");
}		}
}		}

// Except for signatures, MLIR function types are converted into LLVM		// Except for signatures, MLIR function types are converted into LLVM
		rriddleUnsubmitted Done Reply Inline Actions Why not just return directly? rriddle: Why not just return directly?
// pointer-to-function types.		// pointer-to-function types.
Type LLVMTypeConverter::convertFunctionType(FunctionType type) {		Type LLVMTypeConverter::convertFunctionType(FunctionType type) {
SignatureConversion conversion(type.getNumInputs());		SignatureConversion conversion(type.getNumInputs());
LLVM::LLVMType converted =		LLVM::LLVMType converted =
convertFunctionSignature(type, /isVariadic=/false, conversion);		convertFunctionSignature(type, /isVariadic=/false, conversion);
return converted.getPointerTo();		return converted.getPointerTo();
}		}

		// Convert a function argument type to an LLVM type using 'convertType'. MemRef
		// arguments are promoted to a pointer to the converted type.
		LLVM::LLVMType LLVMTypeConverter::convertArgType(Type type) {
		auto converted = convertType(type).dyn_cast_or_null<LLVM::LLVMType>();
		if (!converted)
		return {};
		if (type.isa<MemRefType>() \|\| type.isa<UnrankedMemRefType>())
		converted = converted.getPointerTo();
		return converted;
		}

// Function types are converted to LLVM Function types by recursively converting		// Function types are converted to LLVM Function types by recursively converting
// argument and result types. If MLIR Function has zero results, the LLVM		// argument and result types. If MLIR Function has zero results, the LLVM
// Function has one VoidType result. If MLIR Function has more than one result,		// Function has one VoidType result. If MLIR Function has more than one result,
// they are into an LLVM StructType in their order of appearance.		// they are into an LLVM StructType in their order of appearance.
LLVM::LLVMType LLVMTypeConverter::convertFunctionSignature(		LLVM::LLVMType LLVMTypeConverter::convertFunctionSignature(
FunctionType type, bool isVariadic,		FunctionType type, bool isVariadic,
LLVMTypeConverter::SignatureConversion &result) {		LLVMTypeConverter::SignatureConversion &result) {
// Convert argument types one by one and check for errors.		// Convert argument types one by one and check for errors.
for (auto &en : llvm::enumerate(type.getInputs())) {		for (auto &en : llvm::enumerate(type.getInputs())) {
Type type = en.value();		Type type = en.value();
auto converted = convertType(type).dyn_cast_or_null<LLVM::LLVMType>();		auto converted = convertArgType(type).dyn_cast_or_null<LLVM::LLVMType>();
if (!converted)		if (!converted)
return {};		return {};
if (type.isa<MemRefType>() \|\| type.isa<UnrankedMemRefType>())
converted = converted.getPointerTo();
result.addInputs(en.index(), converted);		result.addInputs(en.index(), converted);
}		}

SmallVector<LLVM::LLVMType, 8> argTypes;		SmallVector<LLVM::LLVMType, 8> argTypes;
argTypes.reserve(llvm::size(result.getConvertedTypes()));		argTypes.reserve(llvm::size(result.getConvertedTypes()));
for (Type type : result.getConvertedTypes())		for (Type type : result.getConvertedTypes())
argTypes.push_back(unwrap(type));		argTypes.push_back(unwrap(type));

▲ Show 20 Lines • Show All 101 Lines • ▼ Show 20 Lines	return TypeSwitch<Type, Type>(t)
.Case([&](UnrankedMemRefType type) {		.Case([&](UnrankedMemRefType type) {
return convertUnrankedMemRefType(type);		return convertUnrankedMemRefType(type);
})		})
.Case([&](VectorType type) { return convertVectorType(type); })		.Case([&](VectorType type) { return convertVectorType(type); })
.Case([](LLVM::LLVMType type) { return type; })		.Case([](LLVM::LLVMType type) { return type; })
.Default([](Type) { return Type(); });		.Default([](Type) { return Type(); });
}		}

		// Convert a function argument type to an LLVM type using 'convertType' except
		// for MemRef arguments. MemRef types are converted to LLVM bare pointers to the
		// MemRef element type.
		LLVM::LLVMType BarePtrTypeConverter::convertArgType(Type type) {
		// TODO: Add support for unranked memref.
		rriddleUnsubmitted Done Reply Inline Actions This looks like a lot of code duplication. rriddle: This looks like a lot of code duplication.
		dcaballeAuthorUnsubmitted Done Reply Inline Actions We discussed that concern in our initial thread (https://github.com/tensorflow/mlir/issues/309#issuecomment-568791098). However, I think that keeping a separate pattern, as suggested, despite some code duplication might be worth it to keep each pattern simple. Otherwise, we would end up with a complex pattern with many special cases. I could try to refactor some of this code to protected members in LLVMTypeConverter but I think it's going to look a bit odd since the differences between the two patterns are small and scattered. dcaballe: We discussed that concern in our initial thread (https://github.
		nicolasvasilacheUnsubmitted Done Reply Inline Actions +10 for refactoring: when things evolve we don't want to have to evolve 2 independent impls. nicolasvasilache: +10 for refactoring: when things evolve we don't want to have to evolve 2 independent impls.
		if (auto memrefTy = type.dyn_cast<MemRefType>())
		return convertMemRefTypeToBarePtr(memrefTy)
		.dyn_cast_or_null<LLVM::LLVMType>();
		return convertType(type).dyn_cast_or_null<LLVM::LLVMType>();
		}

		// Converts MemRef type to an LLVM bare pointer to the MemRef element type.
		Type BarePtrTypeConverter::convertMemRefTypeToBarePtr(MemRefType type) {
		int64_t offset;
		SmallVector<int64_t, 4> strides;
		bool strideSuccess = succeeded(getStridesAndOffset(type, strides, offset));
		assert(strideSuccess &&
		ftynseUnsubmitted Done Reply Inline Actions I'd rather return null here. This can happen in correct code and we shouldn't crash. Normally, the type conversion should propagate null-as-error-mark. ftynse: I'd rather return null here. This can happen in correct code and we shouldn't crash. Normally…
		"Non-strided layout maps must have been normalized away");
		(void)strideSuccess;

		LLVM::LLVMType elementType = unwrap(convertType(type.getElementType()));
		if (!elementType)
		return {};
		auto ptrTy = elementType.getPointerTo(type.getMemorySpace());
		return ptrTy;
		}

LLVMOpLowering::LLVMOpLowering(StringRef rootOpName, MLIRContext *context,		LLVMOpLowering::LLVMOpLowering(StringRef rootOpName, MLIRContext *context,
LLVMTypeConverter &lowering_,		LLVMTypeConverter &lowering_,
PatternBenefit benefit)		PatternBenefit benefit)
: ConversionPattern(rootOpName, benefit, context), lowering(lowering_) {}		: ConversionPattern(rootOpName, benefit, context), lowering(lowering_) {}

/============================================================================/		/============================================================================/
/* StructBuilder implementation */		/* StructBuilder implementation */
/============================================================================/		/============================================================================/
▲ Show 20 Lines • Show All 239 Lines • ▼ Show 20 Lines	Value createIndexConstant(ConversionPatternRewriter &builder, Location loc,
uint64_t value) const {		uint64_t value) const {
return createIndexAttrConstant(builder, loc, getIndexType(), value);		return createIndexAttrConstant(builder, loc, getIndexType(), value);
}		}

protected:		protected:
LLVM::LLVMDialect &dialect;		LLVM::LLVMDialect &dialect;
};		};

struct FuncOpConversion : public LLVMLegalizationPattern<FuncOp> {		struct FuncOpConversionBase : public LLVMLegalizationPattern<FuncOp> {
using LLVMLegalizationPattern<FuncOp>::LLVMLegalizationPattern;		protected:
		using LLVMLegalizationPattern::LLVMLegalizationPattern;
PatternMatchResult		using UnsignedTypePair = std::pair<unsigned, Type>;
matchAndRewrite(Operation *op, ArrayRef<Value> operands,
ConversionPatternRewriter &rewriter) const override {
auto funcOp = cast<FuncOp>(op);
FunctionType type = funcOp.getType();

// Store the positions of memref-typed arguments so that we can emit loads		// Gather the positions and types of memref-typed arguments in a given
// from them to follow the calling convention.		// FunctionType.
SmallVector<unsigned, 4> promotedArgIndices;		void getMemRefArgIndicesAndTypes(
promotedArgIndices.reserve(type.getNumInputs());		FunctionType type, SmallVectorImpl<UnsignedTypePair> &argsInfo) const {
		argsInfo.reserve(type.getNumInputs());
for (auto en : llvm::enumerate(type.getInputs())) {		for (auto en : llvm::enumerate(type.getInputs())) {
if (en.value().isa<MemRefType>() \|\| en.value().isa<UnrankedMemRefType>())		if (en.value().isa<MemRefType>() \|\| en.value().isa<UnrankedMemRefType>())
promotedArgIndices.push_back(en.index());		argsInfo.push_back({en.index(), en.value()});
		}
}		}

// Convert the original function arguments. Struct arguments are promoted to		// Convert input FuncOp to a new FuncOp in LLVM dialect by using the
// pointer to struct arguments to allow calling external functions with		// LLVMTypeConverter provided to this legalization pattern.
// various ABIs (e.g. compiled from C/C++ on platform X).		LLVM::LLVMFuncOp
		convertFuncOpToLLVMFuncOp(FuncOp funcOp,
		ConversionPatternRewriter &rewriter) const {
		// Convert the original function arguments. They are converted using the
		// LLVMTypeConverter provided to this legalization pattern.
		ftynseUnsubmitted Done Reply Inline Actions Nit: convert input FuncOp to LLVMFuncOp ftynse: Nit: convert input FuncOp to LLVMFuncOp
auto varargsAttr = funcOp.getAttrOfType<BoolAttr>("std.varargs");		auto varargsAttr = funcOp.getAttrOfType<BoolAttr>("std.varargs");
TypeConverter::SignatureConversion result(funcOp.getNumArguments());		TypeConverter::SignatureConversion result(funcOp.getNumArguments());
auto llvmType = lowering.convertFunctionSignature(		auto llvmType = lowering.convertFunctionSignature(
funcOp.getType(), varargsAttr && varargsAttr.getValue(), result);		funcOp.getType(), varargsAttr && varargsAttr.getValue(), result);

// Only retain those attributes that are not constructed by build.		// Only retain those attributes that are not constructed by build.
SmallVector<NamedAttribute, 4> attributes;		SmallVector<NamedAttribute, 4> attributes;
for (const auto &attr : funcOp.getAttrs()) {		for (const auto &attr : funcOp.getAttrs()) {
if (attr.first.is(SymbolTable::getSymbolAttrName()) \|\|		if (attr.first.is(SymbolTable::getSymbolAttrName()) \|\|
attr.first.is(impl::getTypeAttrName()) \|\|		attr.first.is(impl::getTypeAttrName()) \|\|
attr.first.is("std.varargs"))		attr.first.is("std.varargs"))
continue;		continue;
attributes.push_back(attr);		attributes.push_back(attr);
}		}

// Create an LLVM function, use external linkage by default until MLIR		// Create an LLVM function, use external linkage by default until MLIR
// functions have linkage.		// functions have linkage.
auto newFuncOp = rewriter.create<LLVM::LLVMFuncOp>(		auto newFuncOp = rewriter.create<LLVM::LLVMFuncOp>(
op->getLoc(), funcOp.getName(), llvmType, LLVM::Linkage::External,		funcOp.getLoc(), funcOp.getName(), llvmType, LLVM::Linkage::External,
attributes);		attributes);
rewriter.inlineRegionBefore(funcOp.getBody(), newFuncOp.getBody(),		rewriter.inlineRegionBefore(funcOp.getBody(), newFuncOp.getBody(),
newFuncOp.end());		newFuncOp.end());

// Tell the rewriter to convert the region signature.		// Tell the rewriter to convert the region signature.
rewriter.applySignatureConversion(&newFuncOp.getBody(), result);		rewriter.applySignatureConversion(&newFuncOp.getBody(), result);

		return newFuncOp;
		}
		};

		// FuncOp legalization pattern that converts MemRef arguments to pointers to
		// MemRef descriptors (LLVM struct data types) containing all the MemRef type
		// information.
		struct FuncOpConversion : public FuncOpConversionBase {
		using FuncOpConversionBase::FuncOpConversionBase;

		PatternMatchResult
		matchAndRewrite(Operation *op, ArrayRef<Value> operands,
		ConversionPatternRewriter &rewriter) const override {
		auto funcOp = cast<FuncOp>(op);

		// Store the positions of memref-typed arguments so that we can emit loads
		// from them to follow the calling convention.
		SmallVector<UnsignedTypePair, 4> promotedArgsInfo;
		getMemRefArgIndicesAndTypes(funcOp.getType(), promotedArgsInfo);

		auto newFuncOp = convertFuncOpToLLVMFuncOp(funcOp, rewriter);

// Insert loads from memref descriptor pointers in function bodies.		// Insert loads from memref descriptor pointers in function bodies.
if (!newFuncOp.getBody().empty()) {		if (!newFuncOp.getBody().empty()) {
Block *firstBlock = &newFuncOp.getBody().front();		Block *firstBlock = &newFuncOp.getBody().front();
rewriter.setInsertionPoint(firstBlock, firstBlock->begin());		rewriter.setInsertionPoint(firstBlock, firstBlock->begin());
for (unsigned idx : promotedArgIndices) {		for (const auto &argInfo : promotedArgsInfo) {
BlockArgument arg = firstBlock->getArgument(idx);		BlockArgument arg = firstBlock->getArgument(argInfo.first);
Value loaded = rewriter.create<LLVM::LoadOp>(funcOp.getLoc(), arg);		Value loaded = rewriter.create<LLVM::LoadOp>(funcOp.getLoc(), arg);
rewriter.replaceUsesOfBlockArgument(arg, loaded);		rewriter.replaceUsesOfWith(arg, loaded);
		}
		}

		rewriter.eraseOp(op);
		return matchSuccess();
		}
		};

		// FuncOp legalization pattern that converts MemRef arguments to bare pointers
		// to the MemRef element type. This will impact the calling convention and ABI.
		struct BarePtrFuncOpConversion : public FuncOpConversionBase {
		using FuncOpConversionBase::FuncOpConversionBase;

		PatternMatchResult
		matchAndRewrite(Operation *op, ArrayRef<Value> operands,
		ConversionPatternRewriter &rewriter) const override {
		auto funcOp = cast<FuncOp>(op);

		// Store the positions and type of memref-typed arguments so that we can
		// promote them to MemRef descriptor structs at the beginning of the
		// function.
		SmallVector<UnsignedTypePair, 4> promotedArgsInfo;
		getMemRefArgIndicesAndTypes(funcOp.getType(), promotedArgsInfo);

		auto newFuncOp = convertFuncOpToLLVMFuncOp(funcOp, rewriter);

		// Promote bare pointers from MemRef arguments to a MemRef descriptor struct
		// at the beginning of the function so that all the MemRefs in the function
		// have a uniform representation.
		if (!newFuncOp.getBody().empty()) {
		ftynseUnsubmitted Done Reply Inline Actions Could we exit early instead? ftynse: Could we exit early instead?
		Block *firstBlock = &newFuncOp.getBody().front();
		rewriter.setInsertionPoint(firstBlock, firstBlock->begin());
		auto funcLoc = funcOp.getLoc();
		for (const auto &argInfo : promotedArgsInfo) {
		rriddleUnsubmitted Done Reply Inline Actions Same in this pattern. rriddle: Same in this pattern.
		dcaballeAuthorUnsubmitted Done Reply Inline Actions Same, I could try to refactor some code to a common base class for both patterns. dcaballe: Same, I could try to refactor some code to a common base class for both patterns.
		// TODO: Add support for unranked MemRefs.
		if (auto memrefType = argInfo.second.dyn_cast<MemRefType>()) {
		// Replace argument with a placeholder (undef), promote argument to a
		// MemRef descriptor and replace placeholder with the last instruction
		// of the MemRef descriptor. The placeholder is needed to avoid
		// replacing argument uses in the MemRef descriptor instructions.
		BlockArgument arg = firstBlock->getArgument(argInfo.first);
		Value placeHolder =
		rewriter.create<LLVM::UndefOp>(funcLoc, arg.getType());
		rewriter.replaceUsesOfWith(arg, placeHolder);
		auto desc = MemRefDescriptor::fromStaticShape(
		rewriter, funcLoc, lowering, memrefType, arg);
		rewriter.replaceUsesOfWith(placeHolder, desc);
		ftynseUnsubmitted Not Done Reply Inline Actions Should we also erase the placeholder op? This sounds there's a missing feature in the rewriter (or the IR in general) to replace all uses except the given ones... ftynse: Should we also erase the placeholder op? This sounds there's a missing feature in the rewriter…
		dcaballeAuthorUnsubmitted Done Reply Inline Actions Should we also erase the placeholder op? I erased it in my previous version but it was a bug. The placeholder must be alive since it goes into rewriter's mapping. This sounds there's a missing feature in the rewriter (or the IR in general) to replace all uses except the given ones... We would need something like 'domInstFilter' in (https://github.com/llvm/llvm-project/blob/master/mlir/include/mlir/Transforms/Utils.h#L60) but for the rewritter. I initially considered it but I thought it would be an overkill for this case since we would need to compute dominance information and use `dominates` queries for each use, which is expensive. Also, dominance information would need to be computed at the time of materializing the replacement (?) instead of when adding the replacement to the map, which would make it more complicated. Maybe something to think about separately? dcaballe: > Should we also erase the placeholder op? I erased it in my previous version but it was a bug.
		ftynseUnsubmitted Done Reply Inline Actions I erased it in my previous version but it was a bug. The placeholder must be alive since it goes into rewriter's mapping. Even through the rewriter? (`rewriter.erase` calls `rewriter.replaceOp` internally) We would need something like 'domInstFilter' It looks like here we are in a more specific case, which seems to come back several times on the function boundary: we need to inject a new operation that takes the region argument, and replace all existing uses of the region argument with the new op. This sounds like this should be possible with the `materializeConversion` hook (https://github.com/llvm/llvm-project/blob/master/mlir/include/mlir/Transforms/DialectConversion.h#L135). I can take a look into that, but it's better to have this landed first. ftynse: > I erased it in my previous version but it was a bug. The placeholder must be alive since it…
		dcaballeAuthorUnsubmitted Done Reply Inline Actions Even through the rewriter? (rewriter.erase calls rewriter.replaceOp internally) Oh, that would have worked for the first version, yes! It should be OK now since we are replacing the placeholder (line 684). Thanks! This sounds like this should be possible with the materializeConversion hook. I can take a look into that, but it's better to have this landed first. It sounds good. Thanks! dcaballe: > Even through the rewriter? (rewriter.erase calls rewriter.replaceOp internally) Oh, that…
		placeHolder.getDefiningOp()->erase();
		}
}		}
}		}

rewriter.eraseOp(op);		rewriter.eraseOp(op);
return matchSuccess();		return matchSuccess();
}		}
};		};

Show All 9 Lines	struct NDVectorTypeInfo {
// Multiplicity of llvmArrayTy to llvmVectorTy.		// Multiplicity of llvmArrayTy to llvmVectorTy.
SmallVector<int64_t, 4> arraySizes;		SmallVector<int64_t, 4> arraySizes;
};		};
} // namespace		} // namespace

// For >1-D vector types, extracts the necessary information to iterate over all		// For >1-D vector types, extracts the necessary information to iterate over all
// 1-D subvectors in the underlying llrepresentation of the n-D vector		// 1-D subvectors in the underlying llrepresentation of the n-D vector
// Iterates on the llvm array type until we hit a non-array type (which is		// Iterates on the llvm array type until we hit a non-array type (which is
// asserted to be an llvm vector type).		// asserted to be an llvm vector type).
		rriddleUnsubmitted Done Reply Inline Actions nit: Remove the newline rriddle: nit: Remove the newline
static NDVectorTypeInfo extractNDVectorTypeInfo(VectorType vectorType,		static NDVectorTypeInfo extractNDVectorTypeInfo(VectorType vectorType,
LLVMTypeConverter &converter) {		LLVMTypeConverter &converter) {
assert(vectorType.getRank() > 1 && "expected >1D vector type");		assert(vectorType.getRank() > 1 && "expected >1D vector type");
NDVectorTypeInfo info;		NDVectorTypeInfo info;
info.llvmArrayTy =		info.llvmArrayTy =
converter.convertType(vectorType).dyn_cast<LLVM::LLVMType>();		converter.convertType(vectorType).dyn_cast<LLVM::LLVMType>();
if (!info.llvmArrayTy)		if (!info.llvmArrayTy)
return info;		return info;
▲ Show 20 Lines • Show All 1,535 Lines • ▼ Show 20 Lines	void mlir::populateStdToLLVMNonMemoryConversionPatterns(
// clang-format on		// clang-format on
}		}

void mlir::populateStdToLLVMMemoryConversionPatters(		void mlir::populateStdToLLVMMemoryConversionPatters(
LLVMTypeConverter &converter, OwningRewritePatternList &patterns) {		LLVMTypeConverter &converter, OwningRewritePatternList &patterns) {
// clang-format off		// clang-format off
patterns.insert<		patterns.insert<
DimOpLowering,		DimOpLowering,
FuncOpConversion,
LoadOpLowering,		LoadOpLowering,
MemRefCastOpLowering,		MemRefCastOpLowering,
StoreOpLowering,		StoreOpLowering,
SubViewOpLowering,		SubViewOpLowering,
ViewOpLowering>(*converter.getDialect(), converter);		ViewOpLowering>(*converter.getDialect(), converter);
patterns.insert<		patterns.insert<
AllocOpLowering,		AllocOpLowering,
DeallocOpLowering>(		DeallocOpLowering>(
*converter.getDialect(), converter, clUseAlloca.getValue());		*converter.getDialect(), converter, clUseAlloca.getValue());
// clang-format on		// clang-format on
}		}

		void mlir::populateStdToLLVMDefaultFuncOpConversionPattern(
		LLVMTypeConverter &converter, OwningRewritePatternList &patterns) {
		patterns.insert<FuncOpConversion>(*converter.getDialect(), converter);
		}

void mlir::populateStdToLLVMConversionPatterns(		void mlir::populateStdToLLVMConversionPatterns(
LLVMTypeConverter &converter, OwningRewritePatternList &patterns) {		LLVMTypeConverter &converter, OwningRewritePatternList &patterns) {
		populateStdToLLVMDefaultFuncOpConversionPattern(converter, patterns);
		populateStdToLLVMNonMemoryConversionPatterns(converter, patterns);
		populateStdToLLVMMemoryConversionPatters(converter, patterns);
		}

		void mlir::populateStdToLLVMBarePtrFuncOpConversionPattern(
		LLVMTypeConverter &converter, OwningRewritePatternList &patterns) {
		patterns.insert<BarePtrFuncOpConversion>(*converter.getDialect(), converter);
		}

		void mlir::populateStdToLLVMBarePtrConversionPatterns(
		LLVMTypeConverter &converter, OwningRewritePatternList &patterns) {
		populateStdToLLVMBarePtrFuncOpConversionPattern(converter, patterns);
populateStdToLLVMNonMemoryConversionPatterns(converter, patterns);		populateStdToLLVMNonMemoryConversionPatterns(converter, patterns);
populateStdToLLVMMemoryConversionPatters(converter, patterns);		populateStdToLLVMMemoryConversionPatters(converter, patterns);
}		}

// Convert types using the stored LLVM IR module.		// Convert types using the stored LLVM IR module.
Type LLVMTypeConverter::convertType(Type t) { return convertStandardType(t); }		Type LLVMTypeConverter::convertType(Type t) { return convertStandardType(t); }

// Create an LLVM IR structure type if there is more than one result.		// Create an LLVM IR structure type if there is more than one result.
▲ Show 20 Lines • Show All 51 Lines • ▼ Show 20 Lines	LLVMTypeConverter::promoteMemRefDescriptors(Location loc, ValueRange opOperands,
}		}
return promotedOperands;		return promotedOperands;
}		}

/// Create an instance of LLVMTypeConverter in the given context.		/// Create an instance of LLVMTypeConverter in the given context.
static std::unique_ptr<LLVMTypeConverter>		static std::unique_ptr<LLVMTypeConverter>
makeStandardToLLVMTypeConverter(MLIRContext *context) {		makeStandardToLLVMTypeConverter(MLIRContext *context) {
return std::make_unique<LLVMTypeConverter>(context);		return std::make_unique<LLVMTypeConverter>(context);
}		}
		mehdi_aminiUnsubmitted Not Done Reply Inline Actions This API seems to still be an issue for extensibility. If you create the LLVMTypeConverter instead of customizing an existing one, how will this the next extension point be implemented/used? mehdi_amini: This API seems to still be an issue for extensibility. If you create the LLVMTypeConverter…
		rriddleUnsubmitted Not Done Reply Inline Actions +1. One direction that I intend to take TypeConverter is to allow registering additional callbacks for the type conversion similarly to how ConversionTarget is extensible. For these types of things, it is increasingly clear how clunky inheritance style modeling is. rriddle: +1. One direction that I intend to take TypeConverter is to allow registering additional…
		dcaballeAuthorUnsubmitted Done Reply Inline Actions I totally agree with that direction! I couldn't see how to go much further in this commit without introducing significant changes in the conversion infra. I think that should happen in a separate commit. Are you OK with the current approach for this commit, @mehdi_amini? Any specific suggestion for this commit that doesn't require too many infra changes? dcaballe: I totally agree with that direction! I couldn't see how to go much further in this commit…

		/// Create an instance of BarePtrTypeConverter in the given context.
		static std::unique_ptr<LLVMTypeConverter>
		makeStandardToLLVMBarePtrTypeConverter(MLIRContext *context) {
		return std::make_unique<BarePtrTypeConverter>(context);
		}

namespace {		namespace {
/// A pass converting MLIR operations into the LLVM IR dialect.		/// A pass converting MLIR operations into the LLVM IR dialect.
struct LLVMLoweringPass : public ModulePass<LLVMLoweringPass> {		struct LLVMLoweringPass : public ModulePass<LLVMLoweringPass> {
// By default, the patterns are those converting Standard operations to the		// By default, the patterns are those converting Standard operations to the
// LLVMIR dialect.		// LLVMIR dialect.
explicit LLVMLoweringPass(		explicit LLVMLoweringPass(
bool useAlloca = false,		bool useAlloca = false,
LLVMPatternListFiller patternListFiller =		LLVMPatternListFiller patternListFiller =
▲ Show 20 Lines • Show All 48 Lines • ▼ Show 20 Lines
}		}

static PassRegistration<LLVMLoweringPass>		static PassRegistration<LLVMLoweringPass>
pass("convert-std-to-llvm",		pass("convert-std-to-llvm",
"Convert scalar and vector operations from the "		"Convert scalar and vector operations from the "
"Standard to the LLVM dialect",		"Standard to the LLVM dialect",
[] {		[] {
return std::make_unique<LLVMLoweringPass>(		return std::make_unique<LLVMLoweringPass>(
clUseAlloca.getValue(), populateStdToLLVMConversionPatterns,		clUseAlloca.getValue(),
makeStandardToLLVMTypeConverter);		clUseBarePtrCallConv ? populateStdToLLVMBarePtrConversionPatterns
		: populateStdToLLVMConversionPatterns,
		clUseBarePtrCallConv ? makeStandardToLLVMBarePtrTypeConverter
		: makeStandardToLLVMTypeConverter);
});		});

mlir/lib/Transforms/DialectConversion.cpp

	Show First 20 Lines • Show All 855 Lines • ▼ Show 20 Lines
	}			}

	/// Apply a signature conversion to the entry block of the given region.			/// Apply a signature conversion to the entry block of the given region.
	Block *ConversionPatternRewriter::applySignatureConversion(			Block *ConversionPatternRewriter::applySignatureConversion(
	Region *region, TypeConverter::SignatureConversion &conversion) {			Region *region, TypeConverter::SignatureConversion &conversion) {
	return impl->applySignatureConversion(region, conversion);			return impl->applySignatureConversion(region, conversion);
	}			}

	void ConversionPatternRewriter::replaceUsesOfBlockArgument(BlockArgument from,			void ConversionPatternRewriter::replaceUsesOfWith(Value from, Value to) {
	Value to) {
	for (auto &u : from.getUses()) {			for (auto &u : from.getUses()) {
	if (u.getOwner() == to.getDefiningOp())			if (u.getOwner() == to.getDefiningOp())
	continue;			continue;
	u.getOwner()->replaceUsesOfWith(from, to);			u.getOwner()->replaceUsesOfWith(from, to);
	}			}
	impl->mapping.map(impl->mapping.lookupOrDefault(from), to);			impl->mapping.map(impl->mapping.lookupOrDefault(from), to);
	}			}

	▲ Show 20 Lines • Show All 973 Lines • Show Last 20 Lines

mlir/test/Conversion/StandardToLLVM/convert-dynamic-memref-ops.mlir

This file was moved from mlir/test/Conversion/StandardToLLVM/convert-memref-ops.mlir.

	// RUN: mlir-opt -convert-std-to-llvm %s \| FileCheck %s			// RUN: mlir-opt -convert-std-to-llvm %s \| FileCheck %s
	// RUN: mlir-opt -convert-std-to-llvm -convert-std-to-llvm-use-alloca=1 %s \| FileCheck %s --check-prefix=ALLOCA

	// CHECK-LABEL: func @check_arguments(%arg0: !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }">, %arg1: !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }">, %arg2: !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }*">)
	func @check_arguments(%static: memref<10x20xf32>, %dynamic : memref<?x?xf32>, %mixed : memref<10x?xf32>) {
	return
	}

	// CHECK-LABEL: func @check_strided_memref_arguments(			// CHECK-LABEL: func @check_strided_memref_arguments(
	// CHECK-COUNT-3: !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }*">			// CHECK-COUNT-3: !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }*">
	func @check_strided_memref_arguments(%static: memref<10x20xf32, affine_map<(i,j)->(20 * i + j + 1)>>,			func @check_strided_memref_arguments(%static: memref<10x20xf32, affine_map<(i,j)->(20 * i + j + 1)>>,
	%dynamic : memref<?x?xf32, affine_map<(i,j)[M]->(M * i + j + 1)>>,			%dynamic : memref<?x?xf32, affine_map<(i,j)[M]->(M * i + j + 1)>>,
	%mixed : memref<10x?xf32, affine_map<(i,j)[M]->(M * i + j + 1)>>) {			%mixed : memref<10x?xf32, affine_map<(i,j)[M]->(M * i + j + 1)>>) {
	return			return
	}			}

	// CHECK-LABEL: func @check_static_return(%arg0: !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }">) -> !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }"> {			// CHECK-LABEL: func @check_arguments(%arg0: !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }">, %arg1: !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }">, %arg2: !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }*">)
	func @check_static_return(%static : memref<32x18xf32>) -> memref<32x18xf32> {			func @check_arguments(%static: memref<10x20xf32>, %dynamic : memref<?x?xf32>, %mixed : memref<10x?xf32>) {
	// CHECK: llvm.return %{{.}} : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">
	return %static : memref<32x18xf32>
	}

	// CHECK-LABEL: func @zero_d_alloc() -> !llvm<"{ float, float, i64 }"> {
	// ALLOCA-LABEL: func @zero_d_alloc() -> !llvm<"{ float, float, i64 }"> {
	func @zero_d_alloc() -> memref<f32> {
	// CHECK-NEXT: llvm.mlir.constant(1 : index) : !llvm.i64
	// CHECK-NEXT: %[[null:.]] = llvm.mlir.null : !llvm<"float">
	// CHECK-NEXT: %[[one:.*]] = llvm.mlir.constant(1 : index) : !llvm.i64
	// CHECK-NEXT: %[[gep:.]] = llvm.getelementptr %[[null]][%[[one]]] : (!llvm<"float">, !llvm.i64) -> !llvm<"float*">
	// CHECK-NEXT: %[[sizeof:.]] = llvm.ptrtoint %[[gep]] : !llvm<"float"> to !llvm.i64
	// CHECK-NEXT: llvm.mul %{{.*}}, %[[sizeof]] : !llvm.i64
	// CHECK-NEXT: llvm.call @malloc(%{{.}}) : (!llvm.i64) -> !llvm<"i8">
	// CHECK-NEXT: %[[ptr:.]] = llvm.bitcast %{{.}} : !llvm<"i8"> to !llvm<"float">
	// CHECK-NEXT: llvm.mlir.undef : !llvm<"{ float, float, i64 }">
	// CHECK-NEXT: llvm.insertvalue %[[ptr]], %{{.}}[0] : !llvm<"{ float, float*, i64 }">
	// CHECK-NEXT: llvm.insertvalue %[[ptr]], %{{.}}[1] : !llvm<"{ float, float*, i64 }">
	// CHECK-NEXT: %[[c0:.*]] = llvm.mlir.constant(0 : index) : !llvm.i64
	// CHECK-NEXT: llvm.insertvalue %[[c0]], %{{.}}[2] : !llvm<"{ float, float*, i64 }">

	// ALLOCA-NOT: malloc
	// ALLOCA: alloca
	// ALLOCA-NOT: malloc
	%0 = alloc() : memref<f32>
	return %0 : memref<f32>
	}

	// CHECK-LABEL: func @zero_d_dealloc(%{{.}}: !llvm<"{ float, float, i64 }">) {
	func @zero_d_dealloc(%arg0: memref<f32>) {
	// CHECK-NEXT: %[[ld:.]] = llvm.load %{{.}} : !llvm<"{ float, float, i64 }*">
	// CHECK-NEXT: %[[ptr:.]] = llvm.extractvalue %[[ld]][0] : !llvm<"{ float, float*, i64 }">
	// CHECK-NEXT: %[[bc:.]] = llvm.bitcast %[[ptr]] : !llvm<"float"> to !llvm<"i8*">
	// CHECK-NEXT: llvm.call @free(%[[bc]]) : (!llvm<"i8*">) -> ()
	dealloc %arg0 : memref<f32>
	return			return
	}			}

	// CHECK-LABEL: func @aligned_1d_alloc(
	func @aligned_1d_alloc() -> memref<42xf32> {
	// CHECK-NEXT: llvm.mlir.constant(42 : index) : !llvm.i64
	// CHECK-NEXT: %[[null:.]] = llvm.mlir.null : !llvm<"float">
	// CHECK-NEXT: %[[one:.*]] = llvm.mlir.constant(1 : index) : !llvm.i64
	// CHECK-NEXT: %[[gep:.]] = llvm.getelementptr %[[null]][%[[one]]] : (!llvm<"float">, !llvm.i64) -> !llvm<"float*">
	// CHECK-NEXT: %[[sizeof:.]] = llvm.ptrtoint %[[gep]] : !llvm<"float"> to !llvm.i64
	// CHECK-NEXT: llvm.mul %{{.*}}, %[[sizeof]] : !llvm.i64
	// CHECK-NEXT: %[[alignment:.*]] = llvm.mlir.constant(8 : index) : !llvm.i64
	// CHECK-NEXT: %[[alignmentMinus1:.]] = llvm.add {{.}}, %[[alignment]] : !llvm.i64
	// CHECK-NEXT: %[[allocsize:.*]] = llvm.sub %[[alignmentMinus1]], %[[one]] : !llvm.i64
	// CHECK-NEXT: %[[allocated:.]] = llvm.call @malloc(%[[allocsize]]) : (!llvm.i64) -> !llvm<"i8">
	// CHECK-NEXT: %[[ptr:.]] = llvm.bitcast %{{.}} : !llvm<"i8"> to !llvm<"float">
	// CHECK-NEXT: llvm.mlir.undef : !llvm<"{ float, float, i64, [1 x i64], [1 x i64] }">
	// CHECK-NEXT: llvm.insertvalue %[[ptr]], %{{.}}[0] : !llvm<"{ float, float*, i64, [1 x i64], [1 x i64] }">
	// CHECK-NEXT: %[[allocatedAsInt:.]] = llvm.ptrtoint %[[allocated]] : !llvm<"i8"> to !llvm.i64
	// CHECK-NEXT: %[[alignAdj1:.*]] = llvm.urem %[[allocatedAsInt]], %[[alignment]] : !llvm.i64
	// CHECK-NEXT: %[[alignAdj2:.*]] = llvm.sub %[[alignment]], %[[alignAdj1]] : !llvm.i64
	// CHECK-NEXT: %[[alignAdj3:.*]] = llvm.urem %[[alignAdj2]], %[[alignment]] : !llvm.i64
	// CHECK-NEXT: %[[aligned:.]] = llvm.getelementptr %9[%[[alignAdj3]]] : (!llvm<"i8">, !llvm.i64) -> !llvm<"i8*">
	// CHECK-NEXT: %[[alignedBitCast:.]] = llvm.bitcast %[[aligned]] : !llvm<"i8"> to !llvm<"float*">
	// CHECK-NEXT: llvm.insertvalue %[[alignedBitCast]], %{{.}}[1] : !llvm<"{ float, float*, i64, [1 x i64], [1 x i64] }">
	// CHECK-NEXT: %[[c0:.*]] = llvm.mlir.constant(0 : index) : !llvm.i64
	// CHECK-NEXT: llvm.insertvalue %[[c0]], %{{.}}[2] : !llvm<"{ float, float*, i64, [1 x i64], [1 x i64] }">
	%0 = alloc() {alignment = 8} : memref<42xf32>
	return %0 : memref<42xf32>
	}

	// CHECK-LABEL: func @mixed_alloc(			// CHECK-LABEL: func @mixed_alloc(
	// CHECK: %[[M:.]]: !llvm.i64, %[[N:.]]: !llvm.i64) -> !llvm<"{ float, float, i64, [3 x i64], [3 x i64] }"> {			// CHECK: %[[M:.]]: !llvm.i64, %[[N:.]]: !llvm.i64) -> !llvm<"{ float, float, i64, [3 x i64], [3 x i64] }"> {
	func @mixed_alloc(%arg0: index, %arg1: index) -> memref<?x42x?xf32> {			func @mixed_alloc(%arg0: index, %arg1: index) -> memref<?x42x?xf32> {
	// CHECK-NEXT: %[[c42:.*]] = llvm.mlir.constant(42 : index) : !llvm.i64			// CHECK-NEXT: %[[c42:.*]] = llvm.mlir.constant(42 : index) : !llvm.i64
	// CHECK-NEXT: llvm.mul %[[M]], %[[c42]] : !llvm.i64			// CHECK-NEXT: llvm.mul %[[M]], %[[c42]] : !llvm.i64
	// CHECK-NEXT: %[[sz:.]] = llvm.mul %{{.}}, %[[N]] : !llvm.i64			// CHECK-NEXT: %[[sz:.]] = llvm.mul %{{.}}, %[[N]] : !llvm.i64
	// CHECK-NEXT: %[[null:.]] = llvm.mlir.null : !llvm<"float">			// CHECK-NEXT: %[[null:.]] = llvm.mlir.null : !llvm<"float">
	// CHECK-NEXT: %[[one:.*]] = llvm.mlir.constant(1 : index) : !llvm.i64			// CHECK-NEXT: %[[one:.*]] = llvm.mlir.constant(1 : index) : !llvm.i64
	▲ Show 20 Lines • Show All 64 Lines • ▼ Show 20 Lines
	// CHECK-NEXT: %[[ld:.]] = llvm.load %{{.}} : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }*">			// CHECK-NEXT: %[[ld:.]] = llvm.load %{{.}} : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }*">
	// CHECK-NEXT: %[[ptr:.]] = llvm.extractvalue %[[ld]][0] : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">			// CHECK-NEXT: %[[ptr:.]] = llvm.extractvalue %[[ld]][0] : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">
	// CHECK-NEXT: %[[ptri8:.]] = llvm.bitcast %[[ptr]] : !llvm<"float"> to !llvm<"i8*">			// CHECK-NEXT: %[[ptri8:.]] = llvm.bitcast %[[ptr]] : !llvm<"float"> to !llvm<"i8*">
	// CHECK-NEXT: llvm.call @free(%[[ptri8]]) : (!llvm<"i8*">) -> ()			// CHECK-NEXT: llvm.call @free(%[[ptri8]]) : (!llvm<"i8*">) -> ()
	dealloc %arg0 : memref<?x?xf32>			dealloc %arg0 : memref<?x?xf32>
	return			return
	}			}

	// CHECK-LABEL: func @static_alloc() -> !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }"> {
	func @static_alloc() -> memref<32x18xf32> {
	// CHECK-NEXT: %[[sz1:.*]] = llvm.mlir.constant(32 : index) : !llvm.i64
	// CHECK-NEXT: %[[sz2:.*]] = llvm.mlir.constant(18 : index) : !llvm.i64
	// CHECK-NEXT: %[[num_elems:.*]] = llvm.mul %0, %1 : !llvm.i64
	// CHECK-NEXT: %[[null:.]] = llvm.mlir.null : !llvm<"float">
	// CHECK-NEXT: %[[one:.*]] = llvm.mlir.constant(1 : index) : !llvm.i64
	// CHECK-NEXT: %[[gep:.]] = llvm.getelementptr %[[null]][%[[one]]] : (!llvm<"float">, !llvm.i64) -> !llvm<"float*">
	// CHECK-NEXT: %[[sizeof:.]] = llvm.ptrtoint %[[gep]] : !llvm<"float"> to !llvm.i64
	// CHECK-NEXT: %[[bytes:.*]] = llvm.mul %[[num_elems]], %[[sizeof]] : !llvm.i64
	// CHECK-NEXT: %[[allocated:.]] = llvm.call @malloc(%[[bytes]]) : (!llvm.i64) -> !llvm<"i8">
	// CHECK-NEXT: llvm.bitcast %[[allocated]] : !llvm<"i8"> to !llvm<"float">
	%0 = alloc() : memref<32x18xf32>
	return %0 : memref<32x18xf32>
	}

	// CHECK-LABEL: func @static_dealloc(%{{.}}: !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }">) {
	func @static_dealloc(%static: memref<10x8xf32>) {
	// CHECK-NEXT: %[[ld:.]] = llvm.load %{{.}} : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }*">
	// CHECK-NEXT: %[[ptr:.]] = llvm.extractvalue %[[ld]][0] : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">
	// CHECK-NEXT: %[[bc:.]] = llvm.bitcast %[[ptr]] : !llvm<"float"> to !llvm<"i8*">
	// CHECK-NEXT: llvm.call @free(%[[bc]]) : (!llvm<"i8*">) -> ()
	dealloc %static : memref<10x8xf32>
	return
	}

	// CHECK-LABEL: func @zero_d_load(%{{.}}: !llvm<"{ float, float, i64 }">) -> !llvm.float {
	func @zero_d_load(%arg0: memref<f32>) -> f32 {
	// CHECK-NEXT: %[[ld:.]] = llvm.load %{{.}} : !llvm<"{ float, float, i64 }*">
	// CHECK-NEXT: %[[ptr:.]] = llvm.extractvalue %[[ld]][1] : !llvm<"{ float, float*, i64 }">
	// CHECK-NEXT: %[[c0:.*]] = llvm.mlir.constant(0 : index) : !llvm.i64
	// CHECK-NEXT: %[[addr:.]] = llvm.getelementptr %[[ptr]][%[[c0]]] : (!llvm<"float">, !llvm.i64) -> !llvm<"float*">
	// CHECK-NEXT: %{{.}} = llvm.load %[[addr]] : !llvm<"float">
	%0 = load %arg0[] : memref<f32>
	return %0 : f32
	}

	// CHECK-LABEL: func @static_load(
	// CHECK: %[[A:.]]: !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }">, %[[I:.]]: !llvm.i64, %[[J:.]]: !llvm.i64
	func @static_load(%static : memref<10x42xf32>, %i : index, %j : index) {
	// CHECK-NEXT: %[[ld:.]] = llvm.load %{{.}} : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }*">
	// CHECK-NEXT: %[[ptr:.]] = llvm.extractvalue %[[ld]][1] : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">
	// CHECK-NEXT: %[[off:.*]] = llvm.mlir.constant(0 : index) : !llvm.i64
	// CHECK-NEXT: %[[st0:.*]] = llvm.mlir.constant(42 : index) : !llvm.i64
	// CHECK-NEXT: %[[offI:.*]] = llvm.mul %[[I]], %[[st0]] : !llvm.i64
	// CHECK-NEXT: %[[off0:.*]] = llvm.add %[[off]], %[[offI]] : !llvm.i64
	// CHECK-NEXT: %[[st1:.*]] = llvm.mlir.constant(1 : index) : !llvm.i64
	// CHECK-NEXT: %[[offJ:.*]] = llvm.mul %[[J]], %[[st1]] : !llvm.i64
	// CHECK-NEXT: %[[off1:.*]] = llvm.add %[[off0]], %[[offJ]] : !llvm.i64
	// CHECK-NEXT: %[[addr:.]] = llvm.getelementptr %[[ptr]][%[[off1]]] : (!llvm<"float">, !llvm.i64) -> !llvm<"float*">
	// CHECK-NEXT: llvm.load %[[addr]] : !llvm<"float*">
	%0 = load %static[%i, %j] : memref<10x42xf32>
	return
	}

	// CHECK-LABEL: func @mixed_load(			// CHECK-LABEL: func @mixed_load(
	// CHECK: %[[A:.]]: !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }">, %[[I:.]]: !llvm.i64, %[[J:.]]: !llvm.i64			// CHECK: %[[A:.]]: !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }">, %[[I:.]]: !llvm.i64, %[[J:.]]: !llvm.i64
	func @mixed_load(%mixed : memref<42x?xf32>, %i : index, %j : index) {			func @mixed_load(%mixed : memref<42x?xf32>, %i : index, %j : index) {
	// CHECK-NEXT: %[[ld:.]] = llvm.load %{{.}} : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }*">			// CHECK-NEXT: %[[ld:.]] = llvm.load %{{.}} : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }*">
	// CHECK-NEXT: %[[ptr:.]] = llvm.extractvalue %[[ld]][1] : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">			// CHECK-NEXT: %[[ptr:.]] = llvm.extractvalue %[[ld]][1] : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">
	// CHECK-NEXT: %[[off:.*]] = llvm.mlir.constant(0 : index) : !llvm.i64			// CHECK-NEXT: %[[off:.*]] = llvm.mlir.constant(0 : index) : !llvm.i64
	// CHECK-NEXT: %[[st0:.]] = llvm.extractvalue %[[ld]][4, 0] : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">			// CHECK-NEXT: %[[st0:.]] = llvm.extractvalue %[[ld]][4, 0] : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">
	// CHECK-NEXT: %[[offI:.*]] = llvm.mul %[[I]], %[[st0]] : !llvm.i64			// CHECK-NEXT: %[[offI:.*]] = llvm.mul %[[I]], %[[st0]] : !llvm.i64
	▲ Show 20 Lines • Show All 50 Lines • ▼ Show 20 Lines
	// CHECK: [[C0_2:%.*]] = llvm.mlir.constant(0 : i32) : !llvm.i32			// CHECK: [[C0_2:%.*]] = llvm.mlir.constant(0 : i32) : !llvm.i32
	// CHECK: [[C2:%.*]] = llvm.mlir.constant(2 : i32) : !llvm.i32			// CHECK: [[C2:%.*]] = llvm.mlir.constant(2 : i32) : !llvm.i32
	// CHECK: [[C0_3:%.*]] = llvm.mlir.constant(0 : i32) : !llvm.i32			// CHECK: [[C0_3:%.*]] = llvm.mlir.constant(0 : i32) : !llvm.i32
	// CHECK: "llvm.intr.prefetch"(%{{.}}, [[C0_2]], [[C2]], [[C0_3]]) : (!llvm<"float">, !llvm.i32, !llvm.i32, !llvm.i32) -> ()			// CHECK: "llvm.intr.prefetch"(%{{.}}, [[C0_2]], [[C2]], [[C0_3]]) : (!llvm<"float">, !llvm.i32, !llvm.i32, !llvm.i32) -> ()
	prefetch %A[%i, %j], read, locality<2>, instr : memref<?x?xf32>			prefetch %A[%i, %j], read, locality<2>, instr : memref<?x?xf32>
	return			return
	}			}

	// CHECK-LABEL: func @zero_d_store(%arg0: !llvm<"{ float, float, i64 }*">, %arg1: !llvm.float) {
	func @zero_d_store(%arg0: memref<f32>, %arg1: f32) {
	// CHECK-NEXT: %[[ld:.]] = llvm.load %{{.}} : !llvm<"{ float, float, i64 }*">
	// CHECK-NEXT: %[[ptr:.]] = llvm.extractvalue %[[ld]][1] : !llvm<"{ float, float*, i64 }">
	// CHECK-NEXT: %[[off:.*]] = llvm.mlir.constant(0 : index) : !llvm.i64
	// CHECK-NEXT: %[[addr:.]] = llvm.getelementptr %[[ptr]][%[[off]]] : (!llvm<"float">, !llvm.i64) -> !llvm<"float*">
	// CHECK-NEXT: llvm.store %arg1, %[[addr]] : !llvm<"float*">
	store %arg1, %arg0[] : memref<f32>
	return
	}

	// CHECK-LABEL: func @static_store
	func @static_store(%static : memref<10x42xf32>, %i : index, %j : index, %val : f32) {
	// CHECK-NEXT: %[[ld:.]] = llvm.load %{{.}} : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }*">
	// CHECK-NEXT: %[[ptr:.]] = llvm.extractvalue %[[ld]][1] : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">
	// CHECK-NEXT: %[[off:.*]] = llvm.mlir.constant(0 : index) : !llvm.i64
	// CHECK-NEXT: %[[st0:.*]] = llvm.mlir.constant(42 : index) : !llvm.i64
	// CHECK-NEXT: %[[offI:.*]] = llvm.mul %[[I]], %[[st0]] : !llvm.i64
	// CHECK-NEXT: %[[off0:.*]] = llvm.add %[[off]], %[[offI]] : !llvm.i64
	// CHECK-NEXT: %[[st1:.*]] = llvm.mlir.constant(1 : index) : !llvm.i64
	// CHECK-NEXT: %[[offJ:.*]] = llvm.mul %[[J]], %[[st1]] : !llvm.i64
	// CHECK-NEXT: %[[off1:.*]] = llvm.add %[[off0]], %[[offJ]] : !llvm.i64
	// CHECK-NEXT: %[[addr:.]] = llvm.getelementptr %[[ptr]][%[[off1]]] : (!llvm<"float">, !llvm.i64) -> !llvm<"float*">
	// CHECK-NEXT: llvm.store %arg3, %[[addr]] : !llvm<"float*">
	store %val, %static[%i, %j] : memref<10x42xf32>
	return
	}

	// CHECK-LABEL: func @dynamic_store			// CHECK-LABEL: func @dynamic_store
	func @dynamic_store(%dynamic : memref<?x?xf32>, %i : index, %j : index, %val : f32) {			func @dynamic_store(%dynamic : memref<?x?xf32>, %i : index, %j : index, %val : f32) {
	// CHECK-NEXT: %[[ld:.]] = llvm.load %{{.}} : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }*">			// CHECK-NEXT: %[[ld:.]] = llvm.load %{{.}} : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }*">
	// CHECK-NEXT: %[[ptr:.]] = llvm.extractvalue %[[ld]][1] : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">			// CHECK-NEXT: %[[ptr:.]] = llvm.extractvalue %[[ld]][1] : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">
	// CHECK-NEXT: %[[off:.*]] = llvm.mlir.constant(0 : index) : !llvm.i64			// CHECK-NEXT: %[[off:.*]] = llvm.mlir.constant(0 : index) : !llvm.i64
	// CHECK-NEXT: %[[st0:.]] = llvm.extractvalue %[[ld]][4, 0] : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">			// CHECK-NEXT: %[[st0:.]] = llvm.extractvalue %[[ld]][4, 0] : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">
	// CHECK-NEXT: %[[offI:.*]] = llvm.mul %[[I]], %[[st0]] : !llvm.i64			// CHECK-NEXT: %[[offI:.*]] = llvm.mul %[[I]], %[[st0]] : !llvm.i64
	// CHECK-NEXT: %[[off0:.*]] = llvm.add %[[off]], %[[offI]] : !llvm.i64			// CHECK-NEXT: %[[off0:.*]] = llvm.add %[[off]], %[[offI]] : !llvm.i64
	▲ Show 20 Lines • Show All 113 Lines • ▼ Show 20 Lines
	// CHECK-NEXT: llvm.extractvalue %[[ld]][3, 2] : !llvm<"{ float, float, i64, [5 x i64], [5 x i64] }">			// CHECK-NEXT: llvm.extractvalue %[[ld]][3, 2] : !llvm<"{ float, float, i64, [5 x i64], [5 x i64] }">
	%2 = dim %mixed, 2 : memref<42x?x?x13x?xf32>			%2 = dim %mixed, 2 : memref<42x?x?x13x?xf32>
	// CHECK-NEXT: llvm.mlir.constant(13 : index) : !llvm.i64			// CHECK-NEXT: llvm.mlir.constant(13 : index) : !llvm.i64
	%3 = dim %mixed, 3 : memref<42x?x?x13x?xf32>			%3 = dim %mixed, 3 : memref<42x?x?x13x?xf32>
	// CHECK-NEXT: llvm.extractvalue %[[ld]][3, 4] : !llvm<"{ float, float, i64, [5 x i64], [5 x i64] }">			// CHECK-NEXT: llvm.extractvalue %[[ld]][3, 4] : !llvm<"{ float, float, i64, [5 x i64], [5 x i64] }">
	%4 = dim %mixed, 4 : memref<42x?x?x13x?xf32>			%4 = dim %mixed, 4 : memref<42x?x?x13x?xf32>
	return			return
	}			}

	// CHECK-LABEL: func @static_memref_dim(%arg0: !llvm<"{ float, float, i64, [5 x i64], [5 x i64] }*">) {
	func @static_memref_dim(%static : memref<42x32x15x13x27xf32>) {
	// CHECK-NEXT: %[[ld:.]] = llvm.load %{{.}} : !llvm<"{ float, float, i64, [5 x i64], [5 x i64] }*">
	// CHECK-NEXT: llvm.mlir.constant(42 : index) : !llvm.i64
	%0 = dim %static, 0 : memref<42x32x15x13x27xf32>
	// CHECK-NEXT: llvm.mlir.constant(32 : index) : !llvm.i64
	%1 = dim %static, 1 : memref<42x32x15x13x27xf32>
	// CHECK-NEXT: llvm.mlir.constant(15 : index) : !llvm.i64
	%2 = dim %static, 2 : memref<42x32x15x13x27xf32>
	// CHECK-NEXT: llvm.mlir.constant(13 : index) : !llvm.i64
	%3 = dim %static, 3 : memref<42x32x15x13x27xf32>
	// CHECK-NEXT: llvm.mlir.constant(27 : index) : !llvm.i64
	%4 = dim %static, 4 : memref<42x32x15x13x27xf32>
	return
	}

mlir/test/Conversion/StandardToLLVM/convert-memref-ops.mlir

This file was moved to mlir/test/Conversion/StandardToLLVM/convert-dynamic-memref-ops.mlir.

mlir/test/Conversion/StandardToLLVM/convert-static-memref-ops.mlir

This file was added.

				// RUN: mlir-opt -convert-std-to-llvm %s \| FileCheck %s
				// RUN: mlir-opt -convert-std-to-llvm -convert-std-to-llvm-use-alloca=1 %s \| FileCheck %s --check-prefix=ALLOCA
				// RUN: mlir-opt -convert-std-to-llvm -split-input-file -convert-std-to-llvm-use-bare-ptr-memref-call-conv=1 %s \| FileCheck %s --check-prefix=BAREPTR

				// BAREPTR-LABEL: func @check_noalias
				// BAREPTR-SAME: %{{.}}: !llvm<"float"> {llvm.noalias = true}
				func @check_noalias(%static : memref<2xf32> {llvm.noalias = true}) {
				return
				}

				// -----

				// CHECK-LABEL: func @check_static_return(%arg0: !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }">) -> !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }"> {
				// BAREPTR-LABEL: func @check_static_return
				// BAREPTR-SAME: (%[[arg:.]]: !llvm<"float">) -> !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }"> {
				func @check_static_return(%static : memref<32x18xf32>) -> memref<32x18xf32> {
				// CHECK: llvm.return %{{.}} : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">

				// BAREPTR: %[[udf:.]] = llvm.mlir.undef : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">
				// BAREPTR-NEXT: %[[base:.]] = llvm.insertvalue %[[arg]], %[[udf]][0] : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">
				// BAREPTR-NEXT: %[[aligned:.]] = llvm.insertvalue %[[arg]], %[[base]][1] : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">
				// BAREPTR-NEXT: %[[val0:.*]] = llvm.mlir.constant(0 : index) : !llvm.i64
				// BAREPTR-NEXT: %[[ins0:.]] = llvm.insertvalue %[[val0]], %[[aligned]][2] : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">
				// BAREPTR-NEXT: %[[val1:.*]] = llvm.mlir.constant(18 : index) : !llvm.i64
				// BAREPTR-NEXT: %[[ins1:.]] = llvm.insertvalue %[[val1]], %[[ins0]][3, 1] : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">
				// BAREPTR-NEXT: %[[val2:.*]] = llvm.mlir.constant(1 : index) : !llvm.i64
				// BAREPTR-NEXT: %[[ins2:.]] = llvm.insertvalue %[[val2]], %[[ins1]][4, 1] : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">
				// BAREPTR-NEXT: %[[val3:.*]] = llvm.mlir.constant(32 : index) : !llvm.i64
				// BAREPTR-NEXT: %[[ins3:.]] = llvm.insertvalue %[[val3]], %[[ins2]][3, 0] : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">
				// BAREPTR-NEXT: %[[val4:.*]] = llvm.mlir.constant(18 : index) : !llvm.i64
				// BAREPTR-NEXT: %[[ins4:.]] = llvm.insertvalue %[[val4]], %[[ins3]][4, 0] : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">
				// BAREPTR-NEXT: llvm.return %[[ins4]] : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }">
				return %static : memref<32x18xf32>
				}

				// -----

				// CHECK-LABEL: func @zero_d_alloc() -> !llvm<"{ float, float, i64 }"> {
				// ALLOCA-LABEL: func @zero_d_alloc() -> !llvm<"{ float, float, i64 }"> {
				// BAREPTR-LABEL: func @zero_d_alloc() -> !llvm<"{ float, float, i64 }"> {
				func @zero_d_alloc() -> memref<f32> {
				// CHECK-NEXT: llvm.mlir.constant(1 : index) : !llvm.i64
				// CHECK-NEXT: %[[null:.]] = llvm.mlir.null : !llvm<"float">
				// CHECK-NEXT: %[[one:.*]] = llvm.mlir.constant(1 : index) : !llvm.i64
				// CHECK-NEXT: %[[gep:.]] = llvm.getelementptr %[[null]][%[[one]]] : (!llvm<"float">, !llvm.i64) -> !llvm<"float*">
				// CHECK-NEXT: %[[sizeof:.]] = llvm.ptrtoint %[[gep]] : !llvm<"float"> to !llvm.i64
				// CHECK-NEXT: llvm.mul %{{.*}}, %[[sizeof]] : !llvm.i64
				// CHECK-NEXT: llvm.call @malloc(%{{.}}) : (!llvm.i64) -> !llvm<"i8">
				// CHECK-NEXT: %[[ptr:.]] = llvm.bitcast %{{.}} : !llvm<"i8"> to !llvm<"float">
				// CHECK-NEXT: llvm.mlir.undef : !llvm<"{ float, float, i64 }">
				// CHECK-NEXT: llvm.insertvalue %[[ptr]], %{{.}}[0] : !llvm<"{ float, float*, i64 }">
				// CHECK-NEXT: llvm.insertvalue %[[ptr]], %{{.}}[1] : !llvm<"{ float, float*, i64 }">
				// CHECK-NEXT: %[[c0:.*]] = llvm.mlir.constant(0 : index) : !llvm.i64
				// CHECK-NEXT: llvm.insertvalue %[[c0]], %{{.}}[2] : !llvm<"{ float, float*, i64 }">

				// ALLOCA-NOT: malloc
				// ALLOCA: alloca
				// ALLOCA-NOT: malloc

				// BAREPTR-NEXT: llvm.mlir.constant(1 : index) : !llvm.i64
				// BAREPTR-NEXT: %[[null:.]] = llvm.mlir.null : !llvm<"float">
				// BAREPTR-NEXT: %[[one:.*]] = llvm.mlir.constant(1 : index) : !llvm.i64
				// BAREPTR-NEXT: %[[gep:.]] = llvm.getelementptr %[[null]][%[[one]]] : (!llvm<"float">, !llvm.i64) -> !llvm<"float*">
				// BAREPTR-NEXT: %[[sizeof:.]] = llvm.ptrtoint %[[gep]] : !llvm<"float"> to !llvm.i64
				// BAREPTR-NEXT: llvm.mul %{{.*}}, %[[sizeof]] : !llvm.i64
				// BAREPTR-NEXT: llvm.call @malloc(%{{.}}) : (!llvm.i64) -> !llvm<"i8">
				// BAREPTR-NEXT: %[[ptr:.]] = llvm.bitcast %{{.}} : !llvm<"i8"> to !llvm<"float">
				// BAREPTR-NEXT: llvm.mlir.undef : !llvm<"{ float, float, i64 }">
				// BAREPTR-NEXT: llvm.insertvalue %[[ptr]], %{{.}}[0] : !llvm<"{ float, float*, i64 }">
				// BAREPTR-NEXT: llvm.insertvalue %[[ptr]], %{{.}}[1] : !llvm<"{ float, float*, i64 }">
				// BAREPTR-NEXT: %[[c0:.*]] = llvm.mlir.constant(0 : index) : !llvm.i64
				// BAREPTR-NEXT: llvm.insertvalue %[[c0]], %{{.}}[2] : !llvm<"{ float, float*, i64 }">
				%0 = alloc() : memref<f32>
				return %0 : memref<f32>
				}

				// -----

				// CHECK-LABEL: func @zero_d_dealloc(%{{.}}: !llvm<"{ float, float, i64 }">) {
				// BAREPTR-LABEL: func @zero_d_dealloc(%{{.}}: !llvm<"float">) {
				func @zero_d_dealloc(%arg0: memref<f32>) {
				// CHECK-NEXT: %[[ld:.]] = llvm.load %{{.}} : !llvm<"{ float, float, i64 }*">
				// CHECK-NEXT: %[[ptr:.]] = llvm.extractvalue %[[ld]][0] : !llvm<"{ float, float*, i64 }">
				// CHECK-NEXT: %[[bc:.]] = llvm.bitcast %[[ptr]] : !llvm<"float"> to !llvm<"i8*">
				// CHECK-NEXT: llvm.call @free(%[[bc]]) : (!llvm<"i8*">) -> ()

				// BAREPTR: %[[ptr:.]] = llvm.extractvalue %{{.}}[0] : !llvm<"{ float, float, i64 }">
				// BAREPTR-NEXT: %[[bc:.]] = llvm.bitcast %[[ptr]] : !llvm<"float"> to !llvm<"i8*">
				// BAREPTR-NEXT: llvm.call @free(%[[bc]]) : (!llvm<"i8*">) -> ()
				dealloc %arg0 : memref<f32>
				return
				}

				// -----

				// CHECK-LABEL: func @aligned_1d_alloc(
				// BAREPTR-LABEL: func @aligned_1d_alloc(
				func @aligned_1d_alloc() -> memref<42xf32> {
				// CHECK-NEXT: llvm.mlir.constant(42 : index) : !llvm.i64
				// CHECK-NEXT: %[[null:.]] = llvm.mlir.null : !llvm<"float">
				// CHECK-NEXT: %[[one:.*]] = llvm.mlir.constant(1 : index) : !llvm.i64
				// CHECK-NEXT: %[[gep:.]] = llvm.getelementptr %[[null]][%[[one]]] : (!llvm<"float">, !llvm.i64) -> !llvm<"float*">
				// CHECK-NEXT: %[[sizeof:.]] = llvm.ptrtoint %[[gep]] : !llvm<"float"> to !llvm.i64
				// CHECK-NEXT: llvm.mul %{{.*}}, %[[sizeof]] : !llvm.i64
				// CHECK-NEXT: %[[alignment:.*]] = llvm.mlir.constant(8 : index) : !llvm.i64
				// CHECK-NEXT: %[[alignmentMinus1:.]] = llvm.add {{.}}, %[[alignment]] : !llvm.i64
				// CHECK-NEXT: %[[allocsize:.*]] = llvm.sub %[[alignmentMinus1]], %[[one]] : !llvm.i64
				// CHECK-NEXT: %[[allocated:.]] = llvm.call @malloc(%[[allocsize]]) : (!llvm.i64) -> !llvm<"i8">
				// CHECK-NEXT: %[[ptr:.]] = llvm.bitcast %{{.}} : !llvm<"i8"> to !llvm<"float">
				// CHECK-NEXT: llvm.mlir.undef : !llvm<"{ float, float, i64, [1 x i64], [1 x i64] }">
				// CHECK-NEXT: llvm.insertvalue %[[ptr]], %{{.}}[0] : !llvm<"{ float, float*, i64, [1 x i64], [1 x i64] }">
				// CHECK-NEXT: %[[allocatedAsInt:.]] = llvm.ptrtoint %[[allocated]] : !llvm<"i8"> to !llvm.i64
				// CHECK-NEXT: %[[alignAdj1:.*]] = llvm.urem %[[allocatedAsInt]], %[[alignment]] : !llvm.i64
				// CHECK-NEXT: %[[alignAdj2:.*]] = llvm.sub %[[alignment]], %[[alignAdj1]] : !llvm.i64
				// CHECK-NEXT: %[[alignAdj3:.*]] = llvm.urem %[[alignAdj2]], %[[alignment]] : !llvm.i64
				// CHECK-NEXT: %[[aligned:.]] = llvm.getelementptr %9[%[[alignAdj3]]] : (!llvm<"i8">, !llvm.i64) -> !llvm<"i8*">
				// CHECK-NEXT: %[[alignedBitCast:.]] = llvm.bitcast %[[aligned]] : !llvm<"i8"> to !llvm<"float*">
				// CHECK-NEXT: llvm.insertvalue %[[alignedBitCast]], %{{.}}[1] : !llvm<"{ float, float*, i64, [1 x i64], [1 x i64] }">
				// CHECK-NEXT: %[[c0:.*]] = llvm.mlir.constant(0 : index) : !llvm.i64
				// CHECK-NEXT: llvm.insertvalue %[[c0]], %{{.}}[2] : !llvm<"{ float, float*, i64, [1 x i64], [1 x i64] }">

				// BAREPTR-NEXT: llvm.mlir.constant(42 : index) : !llvm.i64
				// BAREPTR-NEXT: %[[null:.]] = llvm.mlir.null : !llvm<"float">
				// BAREPTR-NEXT: %[[one:.*]] = llvm.mlir.constant(1 : index) : !llvm.i64
				// BAREPTR-NEXT: %[[gep:.]] = llvm.getelementptr %[[null]][%[[one]]] : (!llvm<"float">, !llvm.i64) -> !llvm<"float*">
				// BAREPTR-NEXT: %[[sizeof:.]] = llvm.ptrtoint %[[gep]] : !llvm<"float"> to !llvm.i64
				// BAREPTR-NEXT: llvm.mul %{{.*}}, %[[sizeof]] : !llvm.i64
				// BAREPTR-NEXT: %[[alignment:.*]] = llvm.mlir.constant(8 : index) : !llvm.i64
				// BAREPTR-NEXT: %[[alignmentMinus1:.]] = llvm.add {{.}}, %[[alignment]] : !llvm.i64
				// BAREPTR-NEXT: %[[allocsize:.*]] = llvm.sub %[[alignmentMinus1]], %[[one]] : !llvm.i64
				// BAREPTR-NEXT: %[[allocated:.]] = llvm.call @malloc(%[[allocsize]]) : (!llvm.i64) -> !llvm<"i8">
				// BAREPTR-NEXT: %[[ptr:.]] = llvm.bitcast %{{.}} : !llvm<"i8"> to !llvm<"float">
				// BAREPTR-NEXT: llvm.mlir.undef : !llvm<"{ float, float, i64, [1 x i64], [1 x i64] }">
				// BAREPTR-NEXT: llvm.insertvalue %[[ptr]], %{{.}}[0] : !llvm<"{ float, float*, i64, [1 x i64], [1 x i64] }">
				// BAREPTR-NEXT: %[[allocatedAsInt:.]] = llvm.ptrtoint %[[allocated]] : !llvm<"i8"> to !llvm.i64
				// BAREPTR-NEXT: %[[alignAdj1:.*]] = llvm.urem %[[allocatedAsInt]], %[[alignment]] : !llvm.i64
				// BAREPTR-NEXT: %[[alignAdj2:.*]] = llvm.sub %[[alignment]], %[[alignAdj1]] : !llvm.i64
				// BAREPTR-NEXT: %[[alignAdj3:.*]] = llvm.urem %[[alignAdj2]], %[[alignment]] : !llvm.i64
				// BAREPTR-NEXT: %[[aligned:.]] = llvm.getelementptr %9[%[[alignAdj3]]] : (!llvm<"i8">, !llvm.i64) -> !llvm<"i8*">
				// BAREPTR-NEXT: %[[alignedBitCast:.]] = llvm.bitcast %[[aligned]] : !llvm<"i8"> to !llvm<"float*">
				// BAREPTR-NEXT: llvm.insertvalue %[[alignedBitCast]], %{{.}}[1] : !llvm<"{ float, float*, i64, [1 x i64], [1 x i64] }">
				// BAREPTR-NEXT: %[[c0:.*]] = llvm.mlir.constant(0 : index) : !llvm.i64
				// BAREPTR-NEXT: llvm.insertvalue %[[c0]], %{{.}}[2] : !llvm<"{ float, float*, i64, [1 x i64], [1 x i64] }">
				%0 = alloc() {alignment = 8} : memref<42xf32>
				return %0 : memref<42xf32>
				}

				// -----

				// CHECK-LABEL: func @static_alloc() -> !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }"> {
				// BAREPTR-LABEL: func @static_alloc() -> !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }"> {
				func @static_alloc() -> memref<32x18xf32> {
				// CHECK-NEXT: %[[sz1:.*]] = llvm.mlir.constant(32 : index) : !llvm.i64
				// CHECK-NEXT: %[[sz2:.*]] = llvm.mlir.constant(18 : index) : !llvm.i64
				// CHECK-NEXT: %[[num_elems:.*]] = llvm.mul %0, %1 : !llvm.i64
				// CHECK-NEXT: %[[null:.]] = llvm.mlir.null : !llvm<"float">
				// CHECK-NEXT: %[[one:.*]] = llvm.mlir.constant(1 : index) : !llvm.i64
				// CHECK-NEXT: %[[gep:.]] = llvm.getelementptr %[[null]][%[[one]]] : (!llvm<"float">, !llvm.i64) -> !llvm<"float*">
				// CHECK-NEXT: %[[sizeof:.]] = llvm.ptrtoint %[[gep]] : !llvm<"float"> to !llvm.i64
				// CHECK-NEXT: %[[bytes:.*]] = llvm.mul %[[num_elems]], %[[sizeof]] : !llvm.i64
				// CHECK-NEXT: %[[allocated:.]] = llvm.call @malloc(%[[bytes]]) : (!llvm.i64) -> !llvm<"i8">
				// CHECK-NEXT: llvm.bitcast %[[allocated]] : !llvm<"i8"> to !llvm<"float">

				// BAREPTR-NEXT: %[[sz1:.*]] = llvm.mlir.constant(32 : index) : !llvm.i64
				// BAREPTR-NEXT: %[[sz2:.*]] = llvm.mlir.constant(18 : index) : !llvm.i64
				// BAREPTR-NEXT: %[[num_elems:.*]] = llvm.mul %[[sz1]], %[[sz2]] : !llvm.i64
				// BAREPTR-NEXT: %[[null:.]] = llvm.mlir.null : !llvm<"float">
				// BAREPTR-NEXT: %[[one:.*]] = llvm.mlir.constant(1 : index) : !llvm.i64
				// BAREPTR-NEXT: %[[gep:.]] = llvm.getelementptr %[[null]][%[[one]]] : (!llvm<"float">, !llvm.i64) -> !llvm<"float*">
				// BAREPTR-NEXT: %[[sizeof:.]] = llvm.ptrtoint %[[gep]] : !llvm<"float"> to !llvm.i64
				// BAREPTR-NEXT: %[[bytes:.*]] = llvm.mul %[[num_elems]], %[[sizeof]] : !llvm.i64
				// BAREPTR-NEXT: %[[allocated:.]] = llvm.call @malloc(%[[bytes]]) : (!llvm.i64) -> !llvm<"i8">
				// BAREPTR-NEXT: llvm.bitcast %[[allocated]] : !llvm<"i8"> to !llvm<"float">
				%0 = alloc() : memref<32x18xf32>
				return %0 : memref<32x18xf32>
				}

				// -----

				// CHECK-LABEL: func @static_dealloc(%{{.}}: !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }">) {
				// BAREPTR-LABEL: func @static_dealloc(%{{.}}: !llvm<"float">) {
				func @static_dealloc(%static: memref<10x8xf32>) {
				// CHECK-NEXT: %[[ld:.]] = llvm.load %{{.}} : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }*">
				// CHECK-NEXT: %[[ptr:.]] = llvm.extractvalue %[[ld]][0] : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">
				// CHECK-NEXT: %[[bc:.]] = llvm.bitcast %[[ptr]] : !llvm<"float"> to !llvm<"i8*">
				// CHECK-NEXT: llvm.call @free(%[[bc]]) : (!llvm<"i8*">) -> ()

				// BAREPTR: %[[ptr:.]] = llvm.extractvalue %{{.}}[0] : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }">
				// BAREPTR-NEXT: %[[bc:.]] = llvm.bitcast %[[ptr]] : !llvm<"float"> to !llvm<"i8*">
				// BAREPTR-NEXT: llvm.call @free(%[[bc]]) : (!llvm<"i8*">) -> ()
				dealloc %static : memref<10x8xf32>
				return
				}

				// -----

				// CHECK-LABEL: func @zero_d_load(%{{.}}: !llvm<"{ float, float, i64 }">) -> !llvm.float {
				// BAREPTR-LABEL: func @zero_d_load(%{{.}}: !llvm<"float">) -> !llvm.float
				func @zero_d_load(%arg0: memref<f32>) -> f32 {
				// CHECK-NEXT: %[[ld:.]] = llvm.load %{{.}} : !llvm<"{ float, float, i64 }*">
				// CHECK-NEXT: %[[ptr:.]] = llvm.extractvalue %[[ld]][1] : !llvm<"{ float, float*, i64 }">
				// CHECK-NEXT: %[[c0:.*]] = llvm.mlir.constant(0 : index) : !llvm.i64
				// CHECK-NEXT: %[[addr:.]] = llvm.getelementptr %[[ptr]][%[[c0]]] : (!llvm<"float">, !llvm.i64) -> !llvm<"float*">
				// CHECK-NEXT: %{{.}} = llvm.load %[[addr]] : !llvm<"float">

				// BAREPTR: %[[ptr:.]] = llvm.extractvalue %{{.}}[1] : !llvm<"{ float, float, i64 }">
				// BAREPTR-NEXT: %[[c0:.*]] = llvm.mlir.constant(0 : index) : !llvm.i64
				// BAREPTR-NEXT: %[[addr:.]] = llvm.getelementptr %[[ptr]][%[[c0]]] : (!llvm<"float">, !llvm.i64) -> !llvm<"float*">
				// BAREPTR-NEXT: llvm.load %[[addr:.]] : !llvm<"float">
				%0 = load %arg0[] : memref<f32>
				return %0 : f32
				}

				// -----

				// CHECK-LABEL: func @static_load(
				// CHECK-SAME: %[[A:.]]: !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }">, %[[I:.]]: !llvm.i64, %[[J:.]]: !llvm.i64
				// BAREPTR-LABEL: func @static_load
				// BAREPTR-SAME: (%[[A:.]]: !llvm<"float">, %[[I:.]]: !llvm.i64, %[[J:.]]: !llvm.i64) {
				func @static_load(%static : memref<10x42xf32>, %i : index, %j : index) {
				// CHECK-NEXT: %[[ld:.]] = llvm.load %{{.}} : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }*">
				// CHECK-NEXT: %[[ptr:.]] = llvm.extractvalue %[[ld]][1] : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">
				// CHECK-NEXT: %[[off:.*]] = llvm.mlir.constant(0 : index) : !llvm.i64
				// CHECK-NEXT: %[[st0:.*]] = llvm.mlir.constant(42 : index) : !llvm.i64
				// CHECK-NEXT: %[[offI:.*]] = llvm.mul %[[I]], %[[st0]] : !llvm.i64
				// CHECK-NEXT: %[[off0:.*]] = llvm.add %[[off]], %[[offI]] : !llvm.i64
				// CHECK-NEXT: %[[st1:.*]] = llvm.mlir.constant(1 : index) : !llvm.i64
				// CHECK-NEXT: %[[offJ:.*]] = llvm.mul %[[J]], %[[st1]] : !llvm.i64
				// CHECK-NEXT: %[[off1:.*]] = llvm.add %[[off0]], %[[offJ]] : !llvm.i64
				// CHECK-NEXT: %[[addr:.]] = llvm.getelementptr %[[ptr]][%[[off1]]] : (!llvm<"float">, !llvm.i64) -> !llvm<"float*">
				// CHECK-NEXT: llvm.load %[[addr]] : !llvm<"float*">

				// BAREPTR: %[[ptr:.]] = llvm.extractvalue %{{.}}[1] : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }">
				// BAREPTR-NEXT: %[[off:.*]] = llvm.mlir.constant(0 : index) : !llvm.i64
				// BAREPTR-NEXT: %[[st0:.*]] = llvm.mlir.constant(42 : index) : !llvm.i64
				// BAREPTR-NEXT: %[[offI:.*]] = llvm.mul %[[I]], %[[st0]] : !llvm.i64
				// BAREPTR-NEXT: %[[off0:.*]] = llvm.add %[[off]], %[[offI]] : !llvm.i64
				// BAREPTR-NEXT: %[[st1:.*]] = llvm.mlir.constant(1 : index) : !llvm.i64
				// BAREPTR-NEXT: %[[offJ:.*]] = llvm.mul %[[J]], %[[st1]] : !llvm.i64
				// BAREPTR-NEXT: %[[off1:.*]] = llvm.add %[[off0]], %[[offJ]] : !llvm.i64
				// BAREPTR-NEXT: %[[addr:.]] = llvm.getelementptr %[[ptr]][%[[off1]]] : (!llvm<"float">, !llvm.i64) -> !llvm<"float*">
				// BAREPTR-NEXT: llvm.load %[[addr]] : !llvm<"float*">
				%0 = load %static[%i, %j] : memref<10x42xf32>
				return
				}

				// -----

				// CHECK-LABEL: func @zero_d_store(%arg0: !llvm<"{ float, float, i64 }*">, %arg1: !llvm.float) {
				// BAREPTR-LABEL: func @zero_d_store
				// BAREPTR-SAME: (%[[A:.]]: !llvm<"float">, %[[val:.*]]: !llvm.float)
				func @zero_d_store(%arg0: memref<f32>, %arg1: f32) {
				// CHECK-NEXT: %[[ld:.]] = llvm.load %{{.}} : !llvm<"{ float, float, i64 }*">
				// CHECK-NEXT: %[[ptr:.]] = llvm.extractvalue %[[ld]][1] : !llvm<"{ float, float*, i64 }">
				// CHECK-NEXT: %[[off:.*]] = llvm.mlir.constant(0 : index) : !llvm.i64
				// CHECK-NEXT: %[[addr:.]] = llvm.getelementptr %[[ptr]][%[[off]]] : (!llvm<"float">, !llvm.i64) -> !llvm<"float*">
				// CHECK-NEXT: llvm.store %arg1, %[[addr]] : !llvm<"float*">

				// BAREPTR: %[[ptr:.]] = llvm.extractvalue %{{.}}[1] : !llvm<"{ float, float, i64 }">
				// BAREPTR-NEXT: %[[off:.*]] = llvm.mlir.constant(0 : index) : !llvm.i64
				// BAREPTR-NEXT: %[[addr:.]] = llvm.getelementptr %[[ptr]][%[[off]]] : (!llvm<"float">, !llvm.i64) -> !llvm<"float*">
				// BAREPTR-NEXT: llvm.store %[[val]], %[[addr]] : !llvm<"float*">
				store %arg1, %arg0[] : memref<f32>
				return
				}

				// -----

				// CHECK-LABEL: func @static_store
				// BAREPTR-LABEL: func @static_store
				// BAREPTR-SAME: %[[A:.]]: !llvm<"float">
				func @static_store(%static : memref<10x42xf32>, %i : index, %j : index, %val : f32) {
				// CHECK-NEXT: %[[ld:.]] = llvm.load %{{.}} : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }*">
				// CHECK-NEXT: %[[ptr:.]] = llvm.extractvalue %[[ld]][1] : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">
				// CHECK-NEXT: %[[off:.*]] = llvm.mlir.constant(0 : index) : !llvm.i64
				// CHECK-NEXT: %[[st0:.*]] = llvm.mlir.constant(42 : index) : !llvm.i64
				// CHECK-NEXT: %[[offI:.*]] = llvm.mul %[[I]], %[[st0]] : !llvm.i64
				// CHECK-NEXT: %[[off0:.*]] = llvm.add %[[off]], %[[offI]] : !llvm.i64
				// CHECK-NEXT: %[[st1:.*]] = llvm.mlir.constant(1 : index) : !llvm.i64
				// CHECK-NEXT: %[[offJ:.*]] = llvm.mul %[[J]], %[[st1]] : !llvm.i64
				// CHECK-NEXT: %[[off1:.*]] = llvm.add %[[off0]], %[[offJ]] : !llvm.i64
				// CHECK-NEXT: %[[addr:.]] = llvm.getelementptr %[[ptr]][%[[off1]]] : (!llvm<"float">, !llvm.i64) -> !llvm<"float*">
				// CHECK-NEXT: llvm.store %arg3, %[[addr]] : !llvm<"float*">

				// BAREPTR: %[[ptr:.]] = llvm.extractvalue %{{.}}[1] : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }">
				// BAREPTR-NEXT: %[[off:.*]] = llvm.mlir.constant(0 : index) : !llvm.i64
				// BAREPTR-NEXT: %[[st0:.*]] = llvm.mlir.constant(42 : index) : !llvm.i64
				// BAREPTR-NEXT: %[[offI:.*]] = llvm.mul %[[I]], %[[st0]] : !llvm.i64
				// BAREPTR-NEXT: %[[off0:.*]] = llvm.add %[[off]], %[[offI]] : !llvm.i64
				// BAREPTR-NEXT: %[[st1:.*]] = llvm.mlir.constant(1 : index) : !llvm.i64
				// BAREPTR-NEXT: %[[offJ:.*]] = llvm.mul %[[J]], %[[st1]] : !llvm.i64
				// BAREPTR-NEXT: %[[off1:.*]] = llvm.add %[[off0]], %[[offJ]] : !llvm.i64
				// BAREPTR-NEXT: %[[addr:.]] = llvm.getelementptr %[[ptr]][%[[off1]]] : (!llvm<"float">, !llvm.i64) -> !llvm<"float*">
				// BAREPTR-NEXT: llvm.store %{{.}}, %[[addr]] : !llvm<"float">
				store %val, %static[%i, %j] : memref<10x42xf32>
				return
				}

				// -----

				// CHECK-LABEL: func @static_memref_dim(%arg0: !llvm<"{ float, float, i64, [5 x i64], [5 x i64] }*">) {
				// BAREPTR-LABEL: func @static_memref_dim(%{{.}}: !llvm<"float">) {
				func @static_memref_dim(%static : memref<42x32x15x13x27xf32>) {
				// CHECK-NEXT: %[[ld:.]] = llvm.load %{{.}} : !llvm<"{ float, float, i64, [5 x i64], [5 x i64] }*">
				// CHECK-NEXT: llvm.mlir.constant(42 : index) : !llvm.i64
				// BAREPTR: llvm.insertvalue %{{.}}, %{{.}}[4, 0] : !llvm<"{ float, float, i64, [5 x i64], [5 x i64] }">
				// BAREPTR-NEXT: llvm.mlir.constant(42 : index) : !llvm.i64
				%0 = dim %static, 0 : memref<42x32x15x13x27xf32>
				// CHECK-NEXT: llvm.mlir.constant(32 : index) : !llvm.i64
				// BAREPTR-NEXT: llvm.mlir.constant(32 : index) : !llvm.i64
				%1 = dim %static, 1 : memref<42x32x15x13x27xf32>
				// CHECK-NEXT: llvm.mlir.constant(15 : index) : !llvm.i64
				// BAREPTR-NEXT: llvm.mlir.constant(15 : index) : !llvm.i64
				%2 = dim %static, 2 : memref<42x32x15x13x27xf32>
				// CHECK-NEXT: llvm.mlir.constant(13 : index) : !llvm.i64
				// BAREPTR-NEXT: llvm.mlir.constant(13 : index) : !llvm.i64
				%3 = dim %static, 3 : memref<42x32x15x13x27xf32>
				// CHECK-NEXT: llvm.mlir.constant(27 : index) : !llvm.i64
				// BAREPTR-NEXT: llvm.mlir.constant(27 : index) : !llvm.i64
				%4 = dim %static, 4 : memref<42x32x15x13x27xf32>
				return
				}

This is an archive of the discontinued LLVM Phabricator instance.

[mlir] Introduce bare ptr calling convention for MemRefs in LLVM dialectClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 239442

mlir/include/mlir/Conversion/StandardToLLVM/ConvertStandardToLLVM.h

mlir/include/mlir/Conversion/StandardToLLVM/ConvertStandardToLLVMPass.h

mlir/include/mlir/Transforms/DialectConversion.h

mlir/lib/Conversion/GPUToNVVM/LowerGpuOpsToNVVMOps.cpp

mlir/lib/Conversion/StandardToLLVM/ConvertStandardToLLVM.cpp

mlir/lib/Transforms/DialectConversion.cpp

mlir/test/Conversion/StandardToLLVM/convert-dynamic-memref-ops.mlir

mlir/test/Conversion/StandardToLLVM/convert-memref-ops.mlir

mlir/test/Conversion/StandardToLLVM/convert-static-memref-ops.mlir

[mlir] Introduce bare ptr calling convention for MemRefs in LLVM dialect
ClosedPublic