This is an archive of the discontinued LLVM Phabricator instance.

Differential D76602

[MLIR] Introduce std.alloca op
ClosedPublic

Authored by bondhugula on Mar 23 2020, 5:21 AM.

Details

Reviewers

mehdi_amini
rriddle
ftynse
nicolasvasilache

Commits

rG7023f4b4cb01: [MLIR] Introduce std.alloca op

Summary

Introduce the alloca op for stack memory allocation. When converting to the
LLVM dialect, this is lowered to an llvm.alloca.

Signed-off-by: Uday Bondhugula <uday@polymagelabs.com>

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

bondhugula created this revision. · Mar 23 2020, 5:21 AM

Herald added a project: Restricted Project. · Mar 23 2020, 5:21 AM

Herald added subscribers: llvm-commits, Joonsoo, liufengdb and 9 others.

nicolasvasilache requested changes to this revision. · Mar 23 2020, 5:57 AM

The amount of copy-pasta is uncanny, is there a way to factor out the 90%+ common part?
Also, please use the assemblyFormat for parsing and printing.

This revision now requires changes to proceed. · Mar 23 2020, 5:57 AM

In D76602#1936610, @nicolasvasilache wrote:

Also, please use the assemblyFormat for parsing and printing.

Can the assembly format support its parsing and printing? Neither affine.apply nor alloc uses the assembly format, and both have identical operand syntax (affine.apply was recently migrated to ODS but not to auto print / parse). All three ops require two variadic operand lists. Let me know.
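To make the "two variadic operand lists" point concrete: per the `AllocLikeOp` accessors in this diff, the op stores one flat operand list, and `getNumSymbolicOperands()` is the total operand count minus the number of dynamic dimensions of the memref type. The standalone C++ sketch below mirrors that arithmetic only. `FlatOperands` and `splitOperands` are hypothetical names, not MLIR API, and strings stand in for SSA values.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <utility>
#include <vector>

// Hypothetical stand-in for an op's flat operand list; not an MLIR type.
using FlatOperands = std::vector<std::string>;

// Splits a flat operand list into (dynamic sizes, symbol operands), mirroring
// the arithmetic of getNumSymbolicOperands(): symbols = total - dynamic dims.
std::pair<FlatOperands, FlatOperands>
splitOperands(const FlatOperands &operands, std::size_t numDynamicDims) {
  assert(numDynamicDims <= operands.size() && "too few operands");
  FlatOperands sizes(operands.begin(), operands.begin() + numDynamicDims);
  FlatOperands symbols(operands.begin() + numDynamicDims, operands.end());
  return {sizes, symbols};
}
```

For the form `alloclike(%m)[%s] : memref<8x?xf32, ...>` used in the diff, the memref type has one dynamic dimension, so `splitOperands({"%m", "%s"}, 1)` would yield `%m` as the only size operand and `%s` as the only symbol operand.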

Harbormaster failed remote builds in B50099: Diff 251999! · Mar 23 2020, 6:31 AM

In D76602#1936610, @nicolasvasilache wrote:

The amount of copy-pasta is uncanny, is there a way to factor out the 90%+ common part?
Also, please use the assemblyFormat for parsing and printing.

I was thinking the same. The duplication is unfortunate and it's all in StandardToLLVM - it's possible to factor it all out if we get rid of useAlloca because the AllocOp lowering is conditional on that. Should I remove useAlloca in this patch itself and mostly merge AllocOpLowering and AllocaOpLowering? Merging the parse/print methods is straightforward.

Share print/parse/verify between alloc/alloca.

Refactor ODS for alloc/alloca

Harbormaster completed remote builds in B50261: Diff 252332. · Mar 24 2020, 9:07 AM

Harbormaster failed remote builds in B50259: Diff 252331!

Fix format

Harbormaster failed remote builds in B50264: Diff 252338! · Mar 24 2020, 9:39 AM

In D76602#1936765, @bondhugula wrote:

In D76602#1936610, @nicolasvasilache wrote:

The amount of copy-pasta is uncanny, is there a way to factor out the 90%+ common part?

I was thinking the same. The duplication is unfortunate and it's all in StandardToLLVM - it's possible to factor it all out if we get rid of useAlloca because the AllocOp lowering is conditional on that. Should I remove useAlloca in this patch itself and mostly merge AllocOpLowering and AllocaOpLowering? Merging the parse/print methods is straightforward.

@nicolasvasilache Done with the refactoring here (except for the llvm lowering - see question above).

Harbormaster completed remote builds in B50268: Diff 252346. · Mar 24 2020, 10:12 AM

rriddle added inline comments. · Mar 24 2020, 5:33 PM

mlir/include/mlir/Dialect/StandardOps/IR/Ops.td
144	Use ValueRange instead of ArrayRef<Value> in builder methods.
328	nit: "alloca" -> `alloca`
332–334	Please use mlir code blocks for any inline code.
mlir/lib/Dialect/StandardOps/IR/Ops.cpp
250	llvm::is_one_of
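The `llvm::is_one_of` suggestion refers to an LLVM type trait that is true when a type matches any type in a list, which lets one templated helper be restricted to a fixed set of ops. The sketch below re-creates the idea with `std::disjunction` (C++17) so it builds without LLVM headers. `AllocOp`, `AllocaOp` and `LoadOp` are empty placeholder types and `describeAllocLike` is a hypothetical helper; none of this is the code from the patch.

```cpp
#include <cassert>
#include <string>
#include <type_traits>

// Local re-creation of the trait's idea: true if T is the same as any of Ts.
template <typename T, typename... Ts>
using is_one_of = std::disjunction<std::is_same<T, Ts>...>;

// Placeholder op types for illustration only; not the MLIR classes.
struct AllocOp {};
struct AllocaOp {};
struct LoadOp {};

// Hypothetical shared helper: it compiles only for the two alloc-like ops.
template <typename OpTy>
std::string describeAllocLike(OpTy) {
  static_assert(is_one_of<OpTy, AllocOp, AllocaOp>::value,
                "applies to either alloc or alloca op");
  return std::is_same<OpTy, AllocaOp>::value ? "alloca" : "alloc";
}
```

Calling `describeAllocLike(LoadOp{})` would fail at compile time with the `static_assert` message, which is the point of guarding a shared print/parse/verify helper this way.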

Address comments from @rriddle

Thanks for the review!

mlir/lib/Dialect/StandardOps/IR/Ops.cpp
250	Thanks!

Harbormaster completed remote builds in B50340: Diff 252474. · Mar 24 2020, 7:42 PM

Thanks for adding the alloca op! Really needed.
Not sure if you discussed this already but just a nit about the name: any plans on renaming alloca and alloc so that it's a bit clearer what they model? I find it a bit confusing right now. Some options that came to mind:
alloca -> salloc, salloca
alloc -> malloc, malloca, alloc

In D76602#1941785, @dcaballe wrote:

Thanks for adding the alloca op! Really needed.
Not sure if you discussed this already but just a nit about the name: any plans on renaming alloca and alloc so that it's a bit clearer what they model? I find it a bit confusing right now. Some options that came to mind:
alloca -> salloc, salloca
alloc -> malloc, malloca, alloc

alloca -> salloca sounds good to me! Note that the 'a' suffix in 'alloca' is for 'automatic' freeing (originating from early Unixes and BSDs, where it has always meant allocating from the caller's stack). So, alloc -> malloca, alloca -> salloc would be inconsistent. Since 'alloc' currently doesn't specify whether the memory comes from the stack or the heap, and specifies that it's explicitly freed via dealloc, we can leave it like that.

What is the s in salloc? Stack? Can we make it explicit then: stack_alloc?

Note that the 'a' suffix in 'alloca' is for 'automatic' freeing

Thanks for clarifying! I thought it was just short for 'allocate' :)

Can we make it explicit then: stack_alloc?

stack_alloc sounds better to me, thanks.

Since 'alloc' currently doesn't specify whether the memory comes from the stack or the heap, and specifies that it's explicitly freed via dealloc, we can leave it like that.

It sounds good. I think we are mapping 'alloc' to static allocation by using a flag in the llvm lowering. Maybe we can create a simple pass to do a more proper alloc->static_alloc conversion in the future and leave 'alloc' only for heap allocation.

@bondhugula
Thanks for the refactoring!

Dropping the LLVM flag and refactoring the impl further sounds like the right thing to do IMO.
The unit test that uses the flag can easily be updated (or deprecated in favor of your test).
The internal use case we have for this will be easy to update.

Herald added a subscriber: grosul1. · Mar 30 2020, 6:29 PM

In D76602#1951483, @nicolasvasilache wrote:

@bondhugula
Thanks for the refactoring!

Dropping the LLVM flag and refactoring the impl further sounds like the right thing to do IMO.
The unit test that uses the flag can easily be updated (or deprecated in favor of your test).
The internal use case we have for this will be easy to update.

This sounds the best to me too. I'll do it in this patch.

Refactored the std to llvm lowering for the alloc op so that it is reused for alloca. The lowering
rewrite was really long; it is now broken down into multiple helpers. This is also
ready to use aligned_alloc in place of malloc for heap allocations.

In D76602#1954032, @bondhugula wrote:

In D76602#1951483, @nicolasvasilache wrote:

@bondhugula
Thanks for the refactoring!

Dropping the LLVM flag and refactoring the impl further sounds like the right thing to do IMO.
The unit test that uses the flag can easily be updated (or deprecated in favor of your test).
The internal use case we have for this will be easy to update.

This sounds the best to me too. I'll do it in this patch.

Done @nicolasvasilache PTAL.

Update comments.

bondhugula added a child revision: D77528: [MLIR] Add support to use aligned_alloc to lower AllocOp from std to llvm. · Apr 6 2020, 12:55 AM

Harbormaster completed remote builds in B51888: Diff 255240. · Apr 6 2020, 1:36 AM

Harbormaster completed remote builds in B51887: Diff 255239.

ftynse added inline comments. · Apr 6 2020, 5:49 AM

mlir/include/mlir/Dialect/StandardOps/IR/Ops.td
328	Nit: I think the intention of the comment above was to ask you to use backticks instead of double quotes.
329	Could you please elaborate what is a stack frame in MLIR? We don't seem to have this concept defined anywhere. In particular, is it only related to `std.func`, or can one register other ops that create stack frames?
mlir/lib/Conversion/StandardToLLVM/StandardToLLVM.cpp
1349	Nit: can we rather define `one` at the call site and pass it here (and to another call) ?

bondhugula marked 5 inline comments as done. · Apr 6 2020, 7:02 AM

bondhugula added inline comments.

mlir/include/mlir/Dialect/StandardOps/IR/Ops.td
329	A stack frame here refers to the standard stack frame concept in CS that we know of! It's up to the conversion out of MLIR to realize this correctly.
mlir/lib/Conversion/StandardToLLVM/StandardToLLVM.cpp
1349	The reason I didn't do that is that the order of the instructions in that case wouldn't be natural - there would be other stuff (the size constant definitions) between the def of 'one' and its first use here. So I left it this way.

Address review comments.

ftynse added inline comments. · Apr 6 2020, 7:15 AM

mlir/include/mlir/Dialect/StandardOps/IR/Ops.td
329	MLIR makes us rethink a lot of things. Like MLIR does not have functions as a first-class concept :) I could obviously guess your intention with stack frames, but I would still insist you think it through in context of MLIR. Similarly to functions, I don't think we have a built-in memory model that has a stack... Or that `std.func` semantics says something about stack frames. IIRC, this was one of the conceptual problems of having `alloca` in the first place. What should happen if I do the following?
	my.func @foo() { alloca ... }
	Or
	std.func @foo() { %0 = my.func_that_may_or_may_not_be_a_lambda @inner() { alloca ... } }
	"Conversion out of MLIR" does not mean anything to me either. Do you mean the translation of LLVM dialect to LLVM IR? There are other passes that may be using the standard dialect that are completely unaware of that.

bondhugula marked an inline comment as done. · Apr 6 2020, 7:56 AM

bondhugula added inline comments.

mlir/include/mlir/Dialect/StandardOps/IR/Ops.td
329	Irrespective of which case it is, it depends on the ops around it and on the folks defining those ops to realize it. What you need to keep in mind is that an alloca's memory is automatically freed (i.e., you won't find a dealloc) and it disappears at the time the stack frame goes away, whenever such a concept exists. Now, one is of course free to transform it and implement the auto freeing in another way within MLIR itself. Heap allocations, e.g., get promoted to stack ones (e.g., in LLVM's passes), and for whatever reason, one could decide to switch a stack allocation to a heap one. That won't be an incorrect transform. So it ultimately may not even be realized on the stack (let alone your question of when it should be freed). The intent of the op, when you actually see it in the IR, is that of an allocation that is freed automatically when the stack frame goes away.

Harbormaster completed remote builds in B51932: Diff 255319. · Apr 6 2020, 8:06 AM

Thanks Uday, looks good!

Accepting conditioned on the LLVM options struct change.

mlir/include/mlir/Dialect/StandardOps/IR/Ops.td
144	Note: I tend to prefer extra builders that take all PODs instead of attributes. They are more natural to use and just work nicely once wrapped into EDSCs.
304	typo %d1
mlir/lib/Conversion/StandardToLLVM/StandardToLLVM.cpp
1293	I think the change in indentation (?) here and everywhere is unfortunate: it does not properly track what code moved (materialized with a yellow phab horizontal bar on the left) and makes it harder to see what changed vs what moved. I am mostly eyeballing the code after realizing most of this is just moved. Please flag specific things that need deeper review if appropriate.
2905–2907	I am concerned by the silent behavior breaking changes here and in the followup revision. I am expecting this will be painful for integrations and will repeat itself. Can we turn this into a: struct LowerToLLVMOptions { ... }

This revision is now accepted and ready to land. · Apr 6 2020, 9:06 AM

ftynse requested changes to this revision. · Apr 7 2020, 2:42 AM

ftynse added inline comments.

mlir/include/mlir/Dialect/StandardOps/IR/Ops.td
329	I don't disagree with anything you say. I am merely pointing out that it is absolutely unclear from the op definition _when_ the allocated memory will be automatically freed. And "when the stack frame goes away" is not a satisfactory definition, because it is meaningless under the current MLIR semantics as it is written. I could literally add the `stack_frame.go_away()` operation tomorrow and expect it to free the allocations... If you could instead tie it to something like "std.func" returning or the region of an operation with FunctionLike trait transferring control flow back to its enclosing op, it would be more MLIR-compatible.

This revision now requires changes to proceed. · Apr 7 2020, 2:42 AM

mehdi_amini added inline comments. · Apr 7 2020, 2:47 AM

mlir/include/mlir/Dialect/StandardOps/IR/Ops.td
329	I agree with Alex that this could deserve a more careful way of describing this. In particular the "stack" isn't very important here, we really need a scope. Could we use a trait like we have for "IsolatedFromAbove" to control this scoping? Unless we just consider every region as a new scope for these allocations?

Address @nicolasvasilache review comments.

bondhugula added inline comments. · Apr 7 2020, 3:15 AM

mlir/include/mlir/Dialect/StandardOps/IR/Ops.td
144	Hmm... I didn't change the existing alloc op builder here; just using the same for Alloca op. Yes, this can be changed to just take int64_t in another revision.
mlir/lib/Conversion/StandardToLLVM/StandardToLLVM.cpp
1293	All of this is just moved. Nothing in the alloc op lowering actually changed with this - I'm just reusing the common parts for alloca.
2905–2907	That's a concern - there are going to be silent breaking effects. I've changed it to struct here. But even with a struct, you'll have the same issue with list initialization (some of the fields will remain uninitialized / wrongly initialized depending on where the field was removed from or where the new field was added). If the new fields are always added at the end, we'll just have uninitialized fields. And with explicit field-wise init, it'll just lead to uninitialized stuff. BTW, none of the three options are documented at the declaration (but only in Passes.td).
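The hazard discussed in this thread can be illustrated in isolation. In the sketch below, the field names follow `LowerToLLVMOptions` from this revision, while `Config`, `createPassOld`, `createPassDropped` and `createPassNew` are hypothetical stand-ins that only record the values they receive. It shows a positional `bool` argument silently changing meaning once `useAlloca` is removed from the front of a parameter list, and how assigning struct fields by name avoids depending on position. Positional brace-initialization of the struct keeps the same ordering hazard, as noted in the comment above.

```cpp
#include <cassert>

// Field names follow LowerToLLVMOptions in this revision; the rest is a toy.
struct LowerToLLVMOptions {
  bool useBarePtrCallConv = false;
  bool emitCWrappers = false;
  unsigned indexBitwidth = 0;
};

// Hypothetical record of what a pass constructor would have received.
struct Config {
  bool useAlloca;
  bool useBarePtrCallConv;
  bool emitCWrappers;
};

// Old-style signature: positional parameters with useAlloca first.
Config createPassOld(bool useAlloca = false, bool useBarePtrCallConv = false,
                     bool emitCWrappers = false, unsigned indexBitwidth = 0) {
  (void)indexBitwidth;
  return {useAlloca, useBarePtrCallConv, emitCWrappers};
}

// Same signature with useAlloca removed: old call sites still compile.
Config createPassDropped(bool useBarePtrCallConv = false,
                         bool emitCWrappers = false,
                         unsigned indexBitwidth = 0) {
  (void)indexBitwidth;
  return {false, useBarePtrCallConv, emitCWrappers};
}

// Struct-based signature: callers set fields by name.
Config createPassNew(const LowerToLLVMOptions &options = {}) {
  return {false, options.useBarePtrCallConv, options.emitCWrappers};
}
```

A call such as `createPassOld(true)` asked for `useAlloca`; the textually identical `createPassDropped(true)` compiles without a diagnostic but now enables `useBarePtrCallConv`. With `createPassNew`, a caller writes `options.emitCWrappers = true;` and is unaffected by fields being added or removed elsewhere in the struct.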

bondhugula marked an inline comment as done. · Apr 7 2020, 3:29 AM

bondhugula added inline comments.

mlir/include/mlir/Dialect/StandardOps/IR/Ops.td
329	Looks like these messages were in flight while I updated and committed. The rephrasing can be addressed in a future revision while we continue discussion here.
	@ftynse wrote: the current MLIR semantics as it is written. I could literally add the stack_frame.go_away() operation tomorrow and expect it to free the allocations.
	The 'a' suffix in alloca is for automatic freeing - it can't be explicitly freed using another op. If you are imagining a low level op that exists that manipulates the stack frame, this would be a pathological case that it has freed it by circumventing things.
	@mehdi_amini wrote: In particular the "stack" isn't very important here, we really need a scope. Could we use a trait like we have for "IsolatedFromAbove" to
	Tying it to a scope like that of a closest surrounding op with a function like trait (or isolatedFromAbove) is almost fine, but ultimately not perfect since imperative function like ops like std.execute_region don't have function like traits, and we may want to imply freeing at that scope.

Harbormaster failed remote builds in B52132: Diff 255622! · Apr 7 2020, 3:45 AM

This revision was not accepted when it landed; it landed in state Needs Review. · Apr 7 2020, 3:45 AM

Closed by commit rG7023f4b4cb01: [MLIR] Introduce std.alloca op (authored by bondhugula).

This revision was automatically updated to reflect the committed changes.

ftynse added inline comments. · Apr 7 2020, 4:55 AM

mlir/include/mlir/Dialect/StandardOps/IR/Ops.td
329	@bondhugula wrote: The 'a' suffix in alloca is for automatic freeing - it can't be explicitly freed using another op.
	Why not? I would not constrain the behavior of every possible op based on a one-letter suffix of another op... "Automatic" does not mean it cannot be connected to another op. In fact, in MLIR, it will likely be connected to another op because functions are ops.
	@bondhugula wrote: Tying it to a scope like that of a closest surrounding op with a function like trait (or isolatedFromAbove) is almost fine, but ultimately not perfect since imperative function like ops like std.execute_region don't have function like traits, and we may want to imply freeing at that scope.
	IIUC, the idea is to introduce a new trait `AutomaticAllocationScope` that is orthogonal to `FunctionLike`. We can then make, e.g., `std.func`, `llvm.func` and `std.execute_region` have this trait, and let other ops opt in to the automatic allocation/deallocation behavior. This would be my ideal solution, but I hesitated to push it since it may involve larger changes to this patch, trying to find a simpler solution like attaching to the function trait first. Now that the patch has landed, I think introducing a separate trait is the right thing to do.
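The scoping rule under discussion can be modeled with a toy structure: an `alloca` is tied to the closest enclosing op that opts in to being an automatic allocation scope, and its memory is released when control leaves that op. The sketch below is a standalone illustration and not MLIR code. `ToyOp`, `isAllocationScope` and `nearestAllocationScope` are hypothetical names, and which ops opt in is an assumption taken from the examples named in this thread.

```cpp
#include <cassert>
#include <string>

// Toy model of a nested operation; not an MLIR class.
struct ToyOp {
  std::string name;
  bool isAllocationScope; // stands in for an AutomaticAllocationScope-like trait
  const ToyOp *parent;    // enclosing op, or nullptr at the top level
};

// Walks outward from `op` and returns the closest enclosing op that is an
// allocation scope, or nullptr if there is none (a verifier could reject that).
const ToyOp *nearestAllocationScope(const ToyOp &op) {
  for (const ToyOp *cur = op.parent; cur != nullptr; cur = cur->parent)
    if (cur->isAllocationScope)
      return cur;
  return nullptr;
}
```

Under this model, an `alloca` nested in a non-scope op inside `std.func` resolves to the function, while one nested in an op that opts in (for instance `std.execute_region`, if it carried the trait) resolves to that inner op instead.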

bondhugula marked 4 inline comments as done. · Apr 7 2020, 6:26 AM

bondhugula added inline comments.

mlir/include/mlir/Dialect/StandardOps/IR/Ops.td
329	@ftynse wrote: mean it cannot be connected to another op. In fact, in MLIR, it will likely be connected to another op because functions are ops.
	Automatically there implies it has to be freed automatically, not by another op. The moment you want to explicitly free that, you'll have to rewrite it to something else, e.g., an alloc/dealloc pair.
	@ftynse wrote: like attaching to the function trait first. Now that the patch has landed, I think introducing a separate trait is the right thing to do.
	Strictly speaking, I also feel adding another trait is the right option. But for now and to keep it lightweight until we have such use cases, 'freed when the closest surrounding op with FunctionLike trait returns' is a good approximation for 'stack frame being discarded'.

ftynse added inline comments. · Apr 7 2020, 6:47 AM

mlir/include/mlir/Dialect/StandardOps/IR/Ops.td
329	@bondhugula wrote: Strictly speaking, I also feel adding another trait is the right option. But for now and to keep it lightweight until we have such use cases, 'freed when the closest surrounding op with FunctionLike trait returns' is a good approximation for 'stack frame being discarded'.
	Let's have it as "freed when the closest surrounding op with FunctionLike trait has the control transferred back from its body", this avoids the potential interpretation that FunctionLike should also terminate with "std.return" and a weird-sounding "op <..> returns". We may also want to add a verifier check that such an op exists.

mehdi_amini added inline comments. · Apr 7 2020, 11:27 AM

mlir/include/mlir/Dialect/StandardOps/IR/Ops.td
329	I am not convinced that the "FunctionLike" trait is the right one here, an op like `gpu.launch` for example would create a new scope in my expectations. What do you think about the trait Uday proposed above `AutomaticAllocationScope` ?

ftynse added a subscriber: herhut. · Apr 7 2020, 11:40 AM

ftynse added inline comments.

mlir/include/mlir/Dialect/StandardOps/IR/Ops.td
329	I proposed it :) So I am obviously in favor of it. I think we can start by associating the deallocation with `FunctionLike`, it's a simple documentation+verifier change that we can land fast and avoid having contentious code upstream as well as rolling it back. Then we can implement `AutomaticAllocationScope` and let ops opt in (e.g., I'd expect @herhut to decide whether we want `alloca`s in GPU at all).

mehdi_amini added inline comments. · Apr 7 2020, 7:20 PM

mlir/include/mlir/Dialect/StandardOps/IR/Ops.td
329	@ftynse wrote: I think we can start by associating the deallocation with FunctionLike,
	I'm afraid this will lead to wrong assumptions; I would rather have code written with the right trait checked from the beginning.

bondhugula marked 6 inline comments as done. · Apr 7 2020, 9:57 PM

bondhugula added inline comments.

mlir/include/mlir/Dialect/StandardOps/IR/Ops.td
329	It is clear that using a new trait for scoped allocation will be accurate here. FunctionLike will always imply a new scope for stack allocation. Introducing the trait should be straightforward - okay to update the description when we introduce the trait.

mehdi_amini added inline comments. · Apr 8 2020, 8:54 PM

mlir/include/mlir/Dialect/StandardOps/IR/Ops.td
329	Are you adding the trait? I think this should be done now: I wouldn't want code that starts checking for "FunctionLike" where it should check for the "AutomaticAllocationScope" trait.

Herald added a subscriber: frgossen. · Apr 8 2020, 8:54 PM

bondhugula marked 2 inline comments as done. · Apr 8 2020, 9:27 PM

bondhugula added inline comments.

mlir/include/mlir/Dialect/StandardOps/IR/Ops.td
329	Alex, are you interested in / are you able to add this trait? If not, I can do it right away.

ftynse added inline comments. · Apr 9 2020, 1:23 AM

mlir/include/mlir/Dialect/StandardOps/IR/Ops.td
329	I have quite a long todo list...

bondhugula marked 3 inline comments as done. · Apr 9 2020, 3:16 AM

bondhugula added inline comments.

mlir/include/mlir/Dialect/StandardOps/IR/Ops.td
329	D77787

Revision Contents

Path	Size
mlir/include/mlir/Conversion/Passes.td	2 lines
mlir/include/mlir/Conversion/StandardToLLVM/ConvertStandardToLLVMPass.h	30 lines
mlir/include/mlir/Dialect/StandardOps/IR/Ops.td	125 lines
mlir/lib/Conversion/StandardToLLVM/StandardToLLVM.cpp	390 lines
mlir/lib/Dialect/StandardOps/IR/Ops.cpp	48 lines
mlir/test/Conversion/StandardToLLVM/convert-dynamic-memref-ops.mlir	36 lines
mlir/test/Conversion/StandardToLLVM/convert-static-memref-ops.mlir	32 lines
mlir/test/IR/memory-ops.mlir	29 lines
mlir/test/Transforms/canonicalize.mlir	15 lines

Diff 255628

mlir/include/mlir/Conversion/Passes.td

... 219 lines not shown (hunk context: let description = [{) ...
     Functions converted to LLVM IR. Function arguments types are converted
     one-to-one. Function results are converted one-to-one and, in case more than
     1 value is returned, packed into an LLVM IR struct type. Function calls and
     returns are updated accordingly. Block argument types are updated to use
     LLVM IR types.
   }];
   let constructor = "mlir::createLowerToLLVMPass()";
   let options = [
-    Option<"useAlloca", "use-alloca", "bool", /*default=*/"false",
-           "Use `alloca` instead of `call @malloc` for converting std.alloc">,
     Option<"useBarePtrCallConv", "use-bare-ptr-memref-call-conv", "bool",
            /*default=*/"false",
            "Replace FuncOp's MemRef arguments with bare pointers to the MemRef "
            "element types">,
     Option<"emitCWrappers", "emit-c-wrappers", "bool", /*default=*/"false",
            "Emit wrappers for C-compatible pointer-to-struct memref "
            "descriptors">,
     Option<"indexBitwidth", "index-bitwidth", "unsigned",
... 30 lines not shown ...

mlir/include/mlir/Conversion/StandardToLLVM/ConvertStandardToLLVMPass.h

... 15 lines not shown ...
 class ModuleOp;
 template <typename T> class OpPassBase;
 class OwningRewritePatternList;

 /// Collect a set of patterns to convert memory-related operations from the
 /// Standard dialect to the LLVM dialect, excluding non-memory-related
 /// operations and FuncOp.
 void populateStdToLLVMMemoryConversionPatters(
-    LLVMTypeConverter &converter, OwningRewritePatternList &patterns,
-    bool useAlloca);
+    LLVMTypeConverter &converter, OwningRewritePatternList &patterns);

 /// Collect a set of patterns to convert from the Standard dialect to the LLVM
 /// dialect, excluding the memory-related operations.
 void populateStdToLLVMNonMemoryConversionPatterns(
     LLVMTypeConverter &converter, OwningRewritePatternList &patterns);

 /// Collect the default pattern to convert a FuncOp to the LLVM dialect. If
 /// `emitCWrappers` is set, the pattern will also produce functions
 /// that pass memref descriptors by pointer-to-structure in addition to the
 /// default unpacked form.
 void populateStdToLLVMDefaultFuncOpConversionPattern(
     LLVMTypeConverter &converter, OwningRewritePatternList &patterns,
     bool emitCWrappers = false);

 /// Collect a set of default patterns to convert from the Standard dialect to
-/// LLVM. If `useAlloca` is set, the patterns for AllocOp and DeallocOp will
-/// generate `llvm.alloca` instead of calls to "malloc".
+/// LLVM.
 void populateStdToLLVMConversionPatterns(LLVMTypeConverter &converter,
                                          OwningRewritePatternList &patterns,
-                                         bool useAlloca = false,
                                          bool emitCWrappers = false);

 /// Collect a set of patterns to convert from the Standard dialect to
 /// LLVM using the bare pointer calling convention for MemRef function
-/// arguments. If `useAlloca` is set, the patterns for AllocOp and DeallocOp
-/// will generate `llvm.alloca` instead of calls to "malloc".
+/// arguments.
 void populateStdToLLVMBarePtrConversionPatterns(
-    LLVMTypeConverter &converter, OwningRewritePatternList &patterns,
-    bool useAlloca = false);
+    LLVMTypeConverter &converter, OwningRewritePatternList &patterns);

 /// Value to pass as bitwidth for the index type when the converter is expected
 /// to derive the bitwidth from the LLVM data layout.
 static constexpr unsigned kDeriveIndexBitwidthFromDataLayout = 0;

+struct LowerToLLVMOptions {
+  bool useBarePtrCallConv = false;
+  bool emitCWrappers = false;
+  unsigned indexBitwidth = kDeriveIndexBitwidthFromDataLayout;
+};
+
 /// Creates a pass to convert the Standard dialect into the LLVMIR dialect.
-/// By default stdlib malloc/free are used for allocating MemRef payloads.
-/// Specifying `useAlloca-true` emits stack allocations instead. In the future
-/// this may become an enum when we have concrete uses for other options.
+/// stdlib malloc/free is used for allocating memrefs allocated with std.alloc,
+/// while LLVM's alloca is used for those allocated with std.alloca.
 std::unique_ptr<OpPassBase<ModuleOp>> createLowerToLLVMPass(
-    bool useAlloca = false, bool useBarePtrCallConv = false,
-    bool emitCWrappers = false,
-    unsigned indexBitwidth = kDeriveIndexBitwidthFromDataLayout);
+    const LowerToLLVMOptions &options = {
+        /*useBarePtrCallConv=*/false, /*emitCWrappers=*/false,
+        /*indexBitwidth=*/kDeriveIndexBitwidthFromDataLayout});

 } // namespace mlir

 #endif // MLIR_CONVERSION_STANDARDTOLLVM_CONVERTSTANDARDTOLLVMPASS_H_

mlir/include/mlir/Dialect/StandardOps/IR/Ops.td

... 118 lines not shown ...
 // type, or a floating point tensor. The custom assembly form of the operation
 // is as follows
 //
 // <op>f %0, %1 : f32
 class FloatArithmeticOp<string mnemonic, list<OpTrait> traits = []> :
     ArithmeticOp<mnemonic, traits>,
     Arguments<(ins FloatLike:$lhs, FloatLike:$rhs)>;

+// Base class for memref allocating ops: alloca and alloc.
+//
+// %0 = alloclike(%m)[%s] : memref<8x?xf32, (d0, d1)[s0] -> ((d0 + s0), d1)>
+//
+class AllocLikeOp<string mnemonic, list<OpTrait> traits = []> :
+    Std_Op<mnemonic, traits> {
+
+  let arguments = (ins Variadic<Index>:$value,
+                   Confined<OptionalAttr<I64Attr>, [IntMinValue<0>]>:$alignment);
+  let results = (outs AnyMemRef);
+
+  let builders = [OpBuilder<
+    "Builder *builder, OperationState &result, MemRefType memrefType", [{
+      result.types.push_back(memrefType);
+    }]>,
+    OpBuilder<
+    "Builder *builder, OperationState &result, MemRefType memrefType, " #
+    "ValueRange operands, IntegerAttr alignment = IntegerAttr()", [{
		result.addOperands(operands);
		result.types.push_back(memrefType);
		if (alignment)
		result.addAttribute(getAlignmentAttrName(), alignment);
		}]>];

		let extraClassDeclaration = [{
		static StringRef getAlignmentAttrName() { return "alignment"; }

		MemRefType getType() { return getResult().getType().cast<MemRefType>(); }

		/// Returns the number of symbolic operands (the ones in square brackets),
		/// which bind to the symbols of the memref's layout map.
		unsigned getNumSymbolicOperands() {
		return getNumOperands() - getType().getNumDynamicDims();
		}

		/// Returns the symbolic operands (the ones in square brackets), which bind
		/// to the symbols of the memref's layout map.
		operand_range getSymbolicOperands() {
		return {operand_begin() + getType().getNumDynamicDims(), operand_end()};
		}

		/// Returns the dynamic sizes for this alloc operation if specified.
		operand_range getDynamicSizes() { return getOperands(); }
		}];

		let parser = [{ return ::parseAllocLikeOp(parser, result); }];

		let hasCanonicalizer = 1;
		}

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// AbsFOp		// AbsFOp
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

def AbsFOp : FloatUnaryOp<"absf"> {		def AbsFOp : FloatUnaryOp<"absf"> {
let summary = "floating point absolute-value operation";		let summary = "floating point absolute-value operation";
let description = [{		let description = [{
The `absf` operation computes the absolute value. It takes one operand and		The `absf` operation computes the absolute value. It takes one operand and
[... 85 lines collapsed ...]	def AddIOp : IntArithmeticOp<"addi", [Commutative]> {
}];		}];
let hasFolder = 1;		let hasFolder = 1;
}		}

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// AllocOp		// AllocOp
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

def AllocOp : Std_Op<"alloc"> {		def AllocOp : AllocLikeOp<"alloc"> {
let summary = "memory allocation operation";		let summary = "memory allocation operation";
let description = [{		let description = [{
The `alloc` operation allocates a region of memory, as specified by its		The `alloc` operation allocates a region of memory, as specified by its
memref type.		memref type.

Example:		Example:

```mlir		```mlir
%0 = alloc() : memref<8x64xf32, (d0, d1) -> (d0, d1), 1>		%0 = alloc() : memref<8x64xf32, 1>
```		```

The optional list of dimension operands are bound to the dynamic dimensions		The optional list of dimension operands are bound to the dynamic dimensions
specified in its memref type. In the example below, the ssa value '%d' is		specified in its memref type. In the example below, the ssa value '%d' is
bound to the second dimension of the memref (which is dynamic).		bound to the second dimension of the memref (which is dynamic).

```mlir		```mlir
%0 = alloc(%d) : memref<8x?xf32, (d0, d1) -> (d0, d1), 1>		%0 = alloc(%d) : memref<8x?xf32, 1>
```		```

The optional list of symbol operands are bound to the symbols of the		The optional list of symbol operands are bound to the symbols of the
memrefs affine map. In the example below, the ssa value '%s' is bound to		memrefs affine map. In the example below, the ssa value '%s' is bound to
the symbol 's0' in the affine map specified in the allocs memref type.		the symbol 's0' in the affine map specified in the allocs memref type.

```mlir		```mlir
%0 = alloc()[%s] : memref<8x64xf32, (d0, d1)[s0] -> ((d0 + s0), d1), 1>		%0 = alloc()[%s] : memref<8x64xf32,
		affine_map<(d0, d1)[s0] -> ((d0 + s0), d1)>, 1>
		nicolasvasilache: typo %d1
```		```

This operation returns a single ssa value of memref type, which can be used		This operation returns a single ssa value of memref type, which can be used
by subsequent load and store operations.		by subsequent load and store operations.

The optional `alignment` attribute may be specified to ensure that the		The optional `alignment` attribute may be specified to ensure that the
region of memory that will be indexed is aligned at the specified byte		region of memory that will be indexed is aligned at the specified byte
boundary.		boundary.

```mlir		```mlir
%0 = alloc()[%s] {alignment = 8} :		%0 = alloc()[%s] {alignment = 8} :
memref<8x64xf32, (d0, d1)[s0] -> ((d0 + s0), d1), 1>		memref<8x64xf32, affine_map<(d0, d1)[s0] -> ((d0 + s0), d1)>, 1>
```		```
}];		}];
		}

let arguments = (ins Variadic<Index>:$value,		//===----------------------------------------------------------------------===//
Confined<OptionalAttr<I64Attr>, [IntMinValue<0>]>:$alignment);		// AllocaOp
let results = (outs AnyMemRef);		//===----------------------------------------------------------------------===//

let builders = [OpBuilder<		def AllocaOp : AllocLikeOp<"alloca"> {
"Builder *builder, OperationState &result, MemRefType memrefType", [{		let summary = "stack memory allocation operation";
result.types.push_back(memrefType);		let description = [{
}]>,		The `alloca` operation allocates memory on the stack, to be automatically
		rriddle: nit: "alloca" -> `alloca`
		ftynse: Nit: I think the intention of the comment above was to ask you to use backticks instead of double quotes.
OpBuilder<		released when the stack frame is discarded. The amount of memory allocated
		ftynse: Could you please elaborate what is a stack frame in MLIR? We don't seem to have this concept defined anywhere. In particular, is it only related to `std.func`, or can one register other ops that create stack frames?
		bondhugula (author): A stack frame here refers to the standard stack frame concept in CS that we know of! It's up to the conversion out of MLIR to realize this correctly.
		ftynse: MLIR makes us rethink a lot of things. Like MLIR does not have functions as a first-class concept :) I could obviously guess your intention with stack frames, but I would still insist you think it through in the context of MLIR. Similarly to functions, I don't think we have a built-in memory model that has a stack... Or that `std.func` semantics says something about stack frames. IIRC, this was one of the conceptual problems of having `alloca` in the first place. What should happen if I do the following?
		    my.func @foo() { alloca ... }
		Or
		    std.func @foo() { %0 = my.func_that_may_or_may_not_be_a_lambda @inner() { alloca ... } }
		"Conversion out of MLIR" does not mean anything to me either. Do you mean the translation of LLVM dialect to LLVM IR? There are other passes that may be using the standard dialect that are completely unaware of that.
		bondhugula (author): Irrespective of which case it is, it depends on the ops around it and on the folks defining those ops to realize it. What you need to keep in mind is that an alloca's memory is automatically freed (i.e., you won't find a dealloc) and it disappears at the time the stack frame goes away, whenever such a concept exists. Now, one is of course free to transform it and implement the auto freeing in another way within MLIR itself. Heap allocations, for example, get promoted to stack ones (e.g., in LLVM's passes), and for whatever reason, one could decide to switch a stack allocation to a heap one. That won't be an incorrect transform. So it ultimately may not even be realized on the stack (let alone your question of when it should be freed). When you see the op in the IR, its intent is an allocation that is freed automatically when the stack frame goes away.
		ftynse: I don't disagree with anything you say. I am merely pointing out that it is absolutely unclear from the op definition _when_ the allocated memory will be automatically freed. And "when the stack frame goes away" is not a satisfactory definition, because it is meaningless under the current MLIR semantics as it is written. I could literally add the `stack_frame.go_away()` operation tomorrow and expect it to free the allocations... If you could instead tie it to something like "std.func" returning or the region of an operation with FunctionLike trait transferring control flow back to its enclosing op, it would be more MLIR-compatible.
		mehdi_amini: I agree with Alex that this could deserve a more careful way of describing this. In particular the "stack" isn't very important here, we really need a scope. Could we use a trait like we have for "IsolatedFromAbove" to control this scoping? Unless we just consider every region as a new scope for these allocations?
		bondhugula (author): Looks like these messages were in flight while I updated and committed. The rephrasing can be addressed in a future revision while we continue discussion here.
		> the current MLIR semantics as it is written. I could literally add the stack_frame.go_away() operation tomorrow and expect it to free the allocations.
		The 'a' suffix in alloca is for automatic freeing - it can't be explicitly freed using another op. If you are imagining a low level op that manipulates the stack frame, this would be a pathological case in which it has freed the memory by circumventing things.
		> In particular the "stack" isn't very important here, we really need a scope. Could we use a trait like we have for "IsolatedFromAbove" to
		Tying it to a scope like that of the closest surrounding op with a function-like trait (or IsolatedFromAbove) is almost fine, but ultimately not perfect, since imperative function-like ops such as std.execute_region don't have function-like traits, and we may want to imply freeing at that scope.
		ftynse:
		> The 'a' suffix in alloca is for automatic freeing - it can't be explicitly freed using another op.
		Why not? I would not constrain the behavior of every possible op based on a one-letter suffix of another op... "Automatic" does not mean it cannot be connected to another op. In fact, in MLIR, it will likely be connected to another op because functions are ops.
		> Tying it to a scope like that of a closest surrounding op with a function like trait (or isolatedFromAbove) is almost fine, but ultimately not perfect since imperative function like ops like std.execute_region don't have function like traits, and we may want to imply freeing at that scope.
		IIUC, the idea is to introduce a new trait `AutomaticAllocationScope` that is orthogonal to `FunctionLike`. We can then make, e.g., `std.func`, `llvm.func` and `std.execute_region` have this trait, and let other ops opt in to the automatic allocation/deallocation behavior. This would be my ideal solution, but I hesitated to push it since it may involve larger changes to this patch, and tried to find a simpler solution like attaching to the function trait first. Now that the patch has landed, I think introducing a separate trait is the right thing to do.
		bondhugula (author):
		> mean it cannot be connected to another op. In fact, in MLIR, it will likely be connected to another op because functions are ops.
		"Automatically" there implies it has to be freed automatically, not by another op. The moment you want to explicitly free it, you'll have to rewrite it to something else, for example an alloc/dealloc pair.
		> like attaching to the function trait first. Now that the patch has landed, I think introducing a separate trait is the right thing to do.
		Strictly speaking, I also feel adding another trait is the right option. But for now and to keep it lightweight until we have such use cases, 'freed when the closest surrounding op with FunctionLike trait returns' is a good approximation for 'stack frame being discarded'.
		ftynse:
		> Strictly speaking, I also feel adding another trait is the right option. But for now and to keep it lightweight until we have such use cases, 'freed when the closest surrounding op with FunctionLike trait returns' is a good approximation for 'stack frame being discarded'.
		Let's have it as "freed when the closest surrounding op with FunctionLike trait has the control transferred back from its body"; this avoids the potential interpretation that FunctionLike should also terminate with "std.return" and a weird-sounding "op <..> returns". We may also want to add a verifier check that such an op exists.
		mehdi_amini: I am not convinced that the "FunctionLike" trait is the right one here; an op like `gpu.launch`, for example, would create a new scope in my expectations. What do you think about the trait Uday proposed above, `AutomaticAllocationScope`?
		ftynse: I proposed it :) So I am obviously in favor of it. I think we can start by associating the deallocation with `FunctionLike`; it's a simple documentation+verifier change that we can land fast and avoid having contentious code upstream as well as rolling it back. Then we can implement `AutomaticAllocationScope` and let ops opt in (e.g., I'd expect @herhut to decide whether we want `alloca`s in GPU at all).
		mehdi_amini:
		> I think we can start by associating the deallocation with FunctionLike,
		I'm afraid this will lead to wrong assumptions; I'd rather have code written with the right trait checked from the beginning.
		bondhugula (author): It is clear that using a new trait for scoped allocation will be accurate here. FunctionLike will always imply a new scope for stack allocation. Introducing the trait should be straightforward - okay to update the description when we introduce the trait.
		mehdi_amini: Are you adding the trait? I think this should be done now: I wouldn't want code that starts checking for "FunctionLike" where it should check for the "AutomaticAllocationScope" trait.
		bondhugula (author): Alex, are you interested in / are you able to add this trait? If not, I can do it right away.
		ftynse: I have quite a long todo list...
		bondhugula (author): D77787
"Builder *builder, OperationState &result, MemRefType memrefType, " #		is specified by its memref and additional operands. For example:
"ArrayRef<Value> operands, IntegerAttr alignment = IntegerAttr()", [{
result.addOperands(operands);
result.types.push_back(memrefType);
if (alignment)
result.addAttribute(getAlignmentAttrName(), alignment);
}]>];

let extraClassDeclaration = [{		```mlir
static StringRef getAlignmentAttrName() { return "alignment"; }		%0 = alloca() : memref<8x64xf32>
		```
		rriddle: Please use mlir code blocks for any inline code.

MemRefType getType() { return getResult().getType().cast<MemRefType>(); }		The optional list of dimension operands are bound to the dynamic dimensions
		specified in its memref type. In the example below, the SSA value '%d' is
		bound to the second dimension of the memref (which is dynamic).

/// Returns the number of symbolic operands (the ones in square brackets),		```mlir
/// which bind to the symbols of the memref's layout map.		%0 = alloca(%d) : memref<8x?xf32>
unsigned getNumSymbolicOperands() {		```
return getNumOperands() - getType().getNumDynamicDims();
}

/// Returns the symbolic operands (the ones in square brackets), which bind		The optional list of symbol operands are bound to the symbols of the
/// to the symbols of the memref's layout map.		memref's affine map. In the example below, the SSA value '%s' is bound to
operand_range getSymbolicOperands() {		the symbol 's0' in the affine map specified in the alloca's memref type.
return {operand_begin() + getType().getNumDynamicDims(), operand_end()};
}

/// Returns the dynamic sizes for this alloc operation if specified.		```mlir
operand_range getDynamicSizes() { return getOperands(); }		%0 = alloca()[%s] : memref<8x64xf32,
}];		affine_map<(d0, d1)[s0] -> ((d0 + s0), d1)>>
		```

let hasCanonicalizer = 1;		This operation returns a single SSA value of memref type, which can be used
		by subsequent load and store operations. An optional alignment attribute, if
		specified, guarantees alignment at least to that boundary. If not specified,
		an alignment on any convenient boundary compatible with the type will be
		chosen.
		}];
}		}

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// AndOp		// AndOp
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

def AndOp : IntArithmeticOp<"and", [Commutative]> {		def AndOp : IntArithmeticOp<"and", [Commutative]> {
let summary = "integer binary and";		let summary = "integer binary and";
[... 2,476 lines collapsed ...]
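
Editorial sketch (not part of the patch): the `extraClassDeclaration` above splits the op's operands into dynamic sizes followed by symbol operands, with the number of symbol operands computed as the operand count minus the number of dynamic dimensions. The standalone C++ below mimics that arithmetic on a plain shape vector. The names `kDynamic`, `numDynamicDims`, and `numSymbolicOperands` are hypothetical stand-ins, not MLIR API, and the `-1` marker is taken from the `s == -1` check in the lowering.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical marker for a dynamic dimension ('?' in memref<8x?xf32>),
// assumed to be -1 as in the `s == -1` check of the lowering.
constexpr int64_t kDynamic = -1;

// Counts the dynamic dimensions of a shape, e.g. 1 for {8, kDynamic}.
std::size_t numDynamicDims(const std::vector<int64_t> &shape) {
  std::size_t count = 0;
  for (int64_t dim : shape)
    if (dim == kDynamic)
      ++count;
  return count;
}

// Operands are laid out as [dynamic sizes..., symbols...], so whatever is
// left after the dynamic sizes binds to the symbols of the layout map.
std::size_t numSymbolicOperands(const std::vector<int64_t> &shape,
                                std::size_t numOperands) {
  std::size_t dynamicDims = numDynamicDims(shape);
  assert(numOperands >= dynamicDims && "too few operands for dynamic dims");
  return numOperands - dynamicDims;
}
```

For `alloca(%d) : memref<8x?xf32>` this gives one dynamic size and zero symbol operands; for `alloca()[%s]` with a static `8x64` shape it gives zero dynamic sizes and one symbol operand.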

mlir/lib/Conversion/StandardToLLVM/StandardToLLVM.cpp

//===- StandardToLLVM.cpp - Standard to LLVM dialect conversion -----------===//		//===- StandardToLLVM.cpp - Standard to LLVM dialect conversion -----------===//
//		//
// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.		// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
// See https://llvm.org/LICENSE.txt for license information.		// See https://llvm.org/LICENSE.txt for license information.
// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception		// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
//		//
// This file implements a pass to convert MLIR standard and builtin dialects		// This file implements a pass to convert MLIR standard and builtin dialects
// into the LLVM IR dialect.		// into the LLVM IR dialect.
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#include "mlir/Conversion/StandardToLLVM/ConvertStandardToLLVM.h"
#include "mlir/ADT/TypeSwitch.h"		#include "mlir/ADT/TypeSwitch.h"
		#include "mlir/Conversion/StandardToLLVM/ConvertStandardToLLVM.h"
#include "mlir/Conversion/StandardToLLVM/ConvertStandardToLLVMPass.h"		#include "mlir/Conversion/StandardToLLVM/ConvertStandardToLLVMPass.h"
#include "mlir/Dialect/LLVMIR/LLVMDialect.h"		#include "mlir/Dialect/LLVMIR/LLVMDialect.h"
#include "mlir/Dialect/StandardOps/IR/Ops.h"		#include "mlir/Dialect/StandardOps/IR/Ops.h"
#include "mlir/IR/Attributes.h"		#include "mlir/IR/Attributes.h"
#include "mlir/IR/Builders.h"		#include "mlir/IR/Builders.h"
#include "mlir/IR/MLIRContext.h"		#include "mlir/IR/MLIRContext.h"
#include "mlir/IR/Module.h"		#include "mlir/IR/Module.h"
#include "mlir/IR/PatternMatch.h"		#include "mlir/IR/PatternMatch.h"
[... 1,209 lines collapsed ...]
// Check if the MemRefType `type` is supported by the lowering. We currently		// Check if the MemRefType `type` is supported by the lowering. We currently
// only support memrefs with identity maps.		// only support memrefs with identity maps.
static bool isSupportedMemRefType(MemRefType type) {		static bool isSupportedMemRefType(MemRefType type) {
return type.getAffineMaps().empty() \|\|		return type.getAffineMaps().empty() \|\|
llvm::all_of(type.getAffineMaps(),		llvm::all_of(type.getAffineMaps(),
[](AffineMap map) { return map.isIdentity(); });		[](AffineMap map) { return map.isIdentity(); });
}		}

// An `alloc` is converted into a definition of a memref descriptor value and		/// Lowering for AllocOp and AllocaOp.
// a call to `malloc` to allocate the underlying data buffer. The memref		template <typename AllocLikeOp>
// descriptor is of the LLVM structure type where:		struct AllocLikeOpLowering : public ConvertOpToLLVMPattern<AllocLikeOp> {
// 1. the first element is a pointer to the allocated (typed) data buffer,		using ConvertOpToLLVMPattern<AllocLikeOp>::ConvertOpToLLVMPattern;
// 2. the second element is a pointer to the (typed) payload, aligned to the		using Base = AllocLikeOpLowering<AllocLikeOp>;
// specified alignment,		using ConvertOpToLLVMPattern<AllocLikeOp>::createIndexConstant;
// 3. the remaining elements serve to store all the sizes and strides of the		using ConvertOpToLLVMPattern<AllocLikeOp>::getIndexType;
// memref using LLVM-converted `index` type.		using ConvertOpToLLVMPattern<AllocLikeOp>::typeConverter;
//		using ConvertOpToLLVMPattern<AllocLikeOp>::getVoidPtrType;
// Alignment is obtained by allocating `alignment - 1` more bytes than requested
// and shifting the aligned pointer relative to the allocated memory. If
// alignment is unspecified, the two pointers are equal.
struct AllocOpLowering : public ConvertOpToLLVMPattern<AllocOp> {
using ConvertOpToLLVMPattern<AllocOp>::ConvertOpToLLVMPattern;

explicit AllocOpLowering(LLVMTypeConverter &converter, bool useAlloca = false)		explicit AllocLikeOpLowering(LLVMTypeConverter &converter)
: ConvertOpToLLVMPattern<AllocOp>(converter), useAlloca(useAlloca) {}		: ConvertOpToLLVMPattern<AllocLikeOp>(converter) {}

LogicalResult match(Operation *op) const override {		LogicalResult match(Operation *op) const override {
MemRefType type = cast<AllocOp>(op).getType();		MemRefType memRefType = cast<AllocLikeOp>(op).getType();
if (isSupportedMemRefType(type))		if (isSupportedMemRefType(memRefType))
return success();		return success();

int64_t offset;		int64_t offset;
SmallVector<int64_t, 4> strides;		SmallVector<int64_t, 4> strides;
auto successStrides = getStridesAndOffset(type, strides, offset);		auto successStrides = getStridesAndOffset(memRefType, strides, offset);
if (failed(successStrides))		if (failed(successStrides))
return failure();		return failure();

// Dynamic strides are ok if they can be deduced from dynamic sizes (which		// Dynamic strides are ok if they can be deduced from dynamic sizes (which
// is guaranteed when succeeded(successStrides)). Dynamic offset however can		// is guaranteed when succeeded(successStrides)). Dynamic offset however can
// never be alloc'ed.		// never be alloc'ed.
if (offset == MemRefType::getDynamicStrideOrOffset())		if (offset == MemRefType::getDynamicStrideOrOffset())
return failure();		return failure();

return success();		return success();
}		}

void rewrite(Operation *op, ArrayRef<Value> operands,		/// Creates and populates the memref descriptor struct given all its fields.
ConversionPatternRewriter &rewriter) const override {		/// This method also performs any post allocation alignment needed for heap
auto loc = op->getLoc();		/// allocations when `accessAlignment` is non null. This is used with
auto allocOp = cast<AllocOp>(op);		/// allocators that do not support alignment.
MemRefType type = allocOp.getType();		MemRefDescriptor createMemRefDescriptor(
		Location loc, ConversionPatternRewriter &rewriter, MemRefType memRefType,
		Value allocatedTypePtr, Value allocatedBytePtr, Value accessAlignment,
		uint64_t offset, ArrayRef<int64_t> strides, ArrayRef<Value> sizes) const {
		auto elementPtrType = getElementPtrType(memRefType);
		auto structType = typeConverter.convertType(memRefType);
		auto memRefDescriptor = MemRefDescriptor::undef(rewriter, loc, structType);

// Get actual sizes of the memref as values: static sizes are constant		// Field 1: Allocated pointer, used for malloc/free.
// values and dynamic sizes are passed to 'alloc' as operands. In case of		memRefDescriptor.setAllocatedPtr(rewriter, loc, allocatedTypePtr);
// zero-dimensional memref, assume a scalar (size 1).
SmallVector<Value, 4> sizes;		// Field 2: Actual aligned pointer to payload.
sizes.reserve(type.getRank());		Value alignedBytePtr = allocatedTypePtr;
		if (accessAlignment) {
		// offset = (align - (ptr % align))% align
		Value intVal = rewriter.create<LLVM::PtrToIntOp>(
		nicolasvasilache: I think the change in indentation (?) here and everywhere is unfortunate; it does not properly track what code moved (materialized with a yellow phab horizontal bar on the left) and makes it harder to see what changed vs what moved. I am mostly eyeballing the code after realizing most of this is just moved. Please flag specific things that need deeper review if appropriate.
		bondhugula (author): All of this is just moved. Nothing in the alloc op lowering actually changed with this - I'm just reusing the common parts for alloca.
		loc, this->getIndexType(), allocatedBytePtr);
		Value ptrModAlign =
		rewriter.create<LLVM::URemOp>(loc, intVal, accessAlignment);
		Value subbed =
		rewriter.create<LLVM::SubOp>(loc, accessAlignment, ptrModAlign);
		Value offset =
		rewriter.create<LLVM::URemOp>(loc, subbed, accessAlignment);
		Value aligned = rewriter.create<LLVM::GEPOp>(
		loc, allocatedBytePtr.getType(), allocatedBytePtr, offset);
		alignedBytePtr = rewriter.create<LLVM::BitcastOp>(
		loc, elementPtrType, ArrayRef<Value>(aligned));
		}
		memRefDescriptor.setAlignedPtr(rewriter, loc, alignedBytePtr);

		// Field 3: Offset in aligned pointer.
		memRefDescriptor.setOffset(rewriter, loc,
		createIndexConstant(rewriter, loc, offset));

		if (memRefType.getRank() == 0)
		// No size/stride descriptor in memref, return the descriptor value.
		return memRefDescriptor;

		// Fields 4 and 5: sizes and strides of the strided MemRef.
		// Store all sizes in the descriptor. Only dynamic sizes are passed in as
		// operands to AllocOp.
		Value runningStride = nullptr;
		// Iterate strides in reverse order, compute runningStride and strideValues.
		auto nStrides = strides.size();
		SmallVector<Value, 4> strideValues(nStrides, nullptr);
		for (unsigned i = 0; i < nStrides; ++i) {
		int64_t index = nStrides - 1 - i;
		if (strides[index] == MemRefType::getDynamicStrideOrOffset())
		// Identity layout map is enforced in the match function, so we compute:
		// `runningStride *= sizes[index + 1]`
		runningStride = runningStride
		? rewriter.create<LLVM::MulOp>(loc, runningStride,
		sizes[index + 1])
		: createIndexConstant(rewriter, loc, 1);
		else
		runningStride = createIndexConstant(rewriter, loc, strides[index]);
		strideValues[index] = runningStride;
		}
		// Fill size and stride descriptors in memref.
		for (auto indexedSize : llvm::enumerate(sizes)) {
		int64_t index = indexedSize.index();
		memRefDescriptor.setSize(rewriter, loc, index, indexedSize.value());
		memRefDescriptor.setStride(rewriter, loc, index, strideValues[index]);
		}
		return memRefDescriptor;
		}
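
Editorial sketch (not part of the patch): `createMemRefDescriptor` emits two pieces of arithmetic as LLVM dialect ops, the post-allocation alignment offset `(align - (ptr % align)) % align` and the strides of an identity layout. The plain C++ below evaluates the same formulas on integers so they can be checked by hand. It is a simplification under stated assumptions: all sizes are known up front, and the helper names `alignmentOffset` and `identityStrides` are hypothetical, not taken from the patch.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Number of bytes to add to `ptr` to reach the next multiple of `align`.
// Same formula as the comment in the lowering:
//   offset = (align - (ptr % align)) % align
// The outer modulo makes the offset 0 when `ptr` is already aligned.
uint64_t alignmentOffset(uint64_t ptr, uint64_t align) {
  assert(align > 0 && "alignment must be positive");
  return (align - (ptr % align)) % align;
}

// Strides of a row-major (identity layout) buffer: the innermost stride is 1
// and each outer stride is the product of all sizes to its right, computed
// by walking the dimensions in reverse with a running product.
std::vector<int64_t> identityStrides(const std::vector<int64_t> &sizes) {
  std::vector<int64_t> strides(sizes.size(), 1);
  int64_t running = 1;
  for (std::size_t i = sizes.size(); i-- > 0;) {
    strides[i] = running;
    running *= sizes[i];
  }
  return strides;
}
```

The offset is always smaller than `align`, which matches the removed comment's note that the `malloc` path over-allocates by `alignment - 1` bytes before shifting the pointer.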

		/// Determines sizes to be used in the memref descriptor.
		void getSizes(Location loc, MemRefType memRefType, ArrayRef<Value> operands,
		ConversionPatternRewriter &rewriter,
		SmallVectorImpl<Value> &sizes, Value &cumulativeSize,
		Value &one) const {
		ftynse: Nit: can we rather define `one` at the call site and pass it here (and to another call)?
		bondhugula (author): The reason I didn't do that is that the order of the instructions in that case wouldn't be natural - there would be other stuff (the size constant definitions) between the def of 'one' and its first use here. So I left it this way.
		sizes.reserve(memRefType.getRank());
unsigned i = 0;		unsigned i = 0;
for (int64_t s : type.getShape())		for (int64_t s : memRefType.getShape())
sizes.push_back(s == -1 ? operands[i++]		sizes.push_back(s == -1 ? operands[i++]
: createIndexConstant(rewriter, loc, s));		: createIndexConstant(rewriter, loc, s));
if (sizes.empty())		if (sizes.empty())
sizes.push_back(createIndexConstant(rewriter, loc, 1));		sizes.push_back(createIndexConstant(rewriter, loc, 1));

// Compute the total number of memref elements.		// Compute the total number of memref elements.
Value cumulativeSize = sizes.front();		cumulativeSize = sizes.front();
for (unsigned i = 1, e = sizes.size(); i < e; ++i)		for (unsigned i = 1, e = sizes.size(); i < e; ++i)
cumulativeSize = rewriter.create<LLVM::MulOp>(		cumulativeSize = rewriter.create<LLVM::MulOp>(
loc, getIndexType(), ArrayRef<Value>{cumulativeSize, sizes[i]});		loc, getIndexType(), ArrayRef<Value>{cumulativeSize, sizes[i]});

// Compute the size of an individual element. This emits the MLIR equivalent		// Compute the size of an individual element. This emits the MLIR equivalent
// of the following sizeof(...) implementation in LLVM IR:		// of the following sizeof(...) implementation in LLVM IR:
// %0 = getelementptr %elementType* null, %indexType 1		// %0 = getelementptr %elementType* null, %indexType 1
// %1 = ptrtoint %elementType* %0 to %indexType		// %1 = ptrtoint %elementType* %0 to %indexType
// which is a common pattern of getting the size of a type in bytes.		// which is a common pattern of getting the size of a type in bytes.
auto elementType = type.getElementType();		auto elementType = memRefType.getElementType();
auto convertedPtrType = typeConverter.convertType(elementType)		auto convertedPtrType = typeConverter.convertType(elementType)
.cast<LLVM::LLVMType>()		.template cast<LLVM::LLVMType>()
.getPointerTo();		.getPointerTo();
auto nullPtr = rewriter.create<LLVM::NullOp>(loc, convertedPtrType);		auto nullPtr = rewriter.create<LLVM::NullOp>(loc, convertedPtrType);
auto one = createIndexConstant(rewriter, loc, 1);		one = createIndexConstant(rewriter, loc, 1);
auto gep = rewriter.create<LLVM::GEPOp>(loc, convertedPtrType,		auto gep = rewriter.create<LLVM::GEPOp>(loc, convertedPtrType,
ArrayRef<Value>{nullPtr, one});		ArrayRef<Value>{nullPtr, one});
auto elementSize =		auto elementSize =
rewriter.create<LLVM::PtrToIntOp>(loc, getIndexType(), gep);		rewriter.create<LLVM::PtrToIntOp>(loc, getIndexType(), gep);
cumulativeSize = rewriter.create<LLVM::MulOp>(		cumulativeSize = rewriter.create<LLVM::MulOp>(
loc, getIndexType(), ArrayRef<Value>{cumulativeSize, elementSize});		loc, getIndexType(), ArrayRef<Value>{cumulativeSize, elementSize});
		}

// Allocate the underlying buffer and store a pointer to it in the MemRef		/// Returns the type of a pointer to an element of the memref.
// descriptor.		Type getElementPtrType(MemRefType memRefType) const {
Value allocated = nullptr;		auto elementType = memRefType.getElementType();
int alignment = 0;		auto structElementType = typeConverter.convertType(elementType);
Value alignmentValue = nullptr;		return structElementType.template cast<LLVM::LLVMType>().getPointerTo(
if (auto alignAttr = allocOp.alignment())		memRefType.getMemorySpace());
alignment = alignAttr.getValue().getSExtValue();		}

if (useAlloca) {		/// Allocates the underlying buffer using the right call. `allocatedBytePtr`
allocated = rewriter.create<LLVM::AllocaOp>(loc, getVoidPtrType(),		/// is set to null for stack allocations. `accessAlignment` is set if
cumulativeSize, alignment);		/// alignment is neeeded post allocation (for eg. in conjunction with malloc).
} else {		/// TODO(bondhugula): next revision will support std lib func aligned_alloc.
// Insert the `malloc` declaration if it is not already present.		Value allocateBuffer(Location loc, Value cumulativeSize, Operation *op,
auto module = op->getParentOfType<ModuleOp>();		MemRefType memRefType, Value one, Value &accessAlignment,
auto mallocFunc = module.lookupSymbol<LLVM::LLVMFuncOp>("malloc");		Value &allocatedBytePtr,
if (!mallocFunc) {		ConversionPatternRewriter &rewriter) const {
OpBuilder moduleBuilder(		auto elementPtrType = getElementPtrType(memRefType);
op->getParentOfType<ModuleOp>().getBodyRegion());
mallocFunc = moduleBuilder.create<LLVM::LLVMFuncOp>(		// Whether to use std lib function aligned_alloc that supports alignment.
rewriter.getUnknownLoc(), "malloc",		Optional<APInt> allocationAlignment = cast<AllocLikeOp>(op).alignment();
LLVM::LLVMType::getFunctionTy(getVoidPtrType(), getIndexType(),
		// With alloca, one gets a pointer to the element type right away.
		bool onStack = isa<AllocaOp>(op);
		if (onStack) {
		allocatedBytePtr = nullptr;
		accessAlignment = nullptr;
		return rewriter.create<LLVM::AllocaOp>(
		loc, elementPtrType, cumulativeSize,
		allocationAlignment ? allocationAlignment.getValue().getSExtValue()
		: 0);
		}

		// Use malloc. Insert the malloc declaration if it is not already present.
		auto allocFuncName = "malloc";
		AllocOp allocOp = cast<AllocOp>(op);
		auto module = allocOp.getParentOfType<ModuleOp>();
		auto allocFunc = module.lookupSymbol<LLVM::LLVMFuncOp>(allocFuncName);
		if (!allocFunc) {
		OpBuilder moduleBuilder(op->getParentOfType<ModuleOp>().getBodyRegion());
		SmallVector<LLVM::LLVMType, 2> callArgTypes = {getIndexType()};
		allocFunc = moduleBuilder.create<LLVM::LLVMFuncOp>(
		rewriter.getUnknownLoc(), allocFuncName,
		LLVM::LLVMType::getFunctionTy(getVoidPtrType(), callArgTypes,
/*isVarArg=*/false));		/*isVarArg=*/false));
}		}
if (alignment != 0) {
alignmentValue = createIndexConstant(rewriter, loc, alignment);		// Allocate the underlying buffer and store a pointer to it in the MemRef
		// descriptor.
		SmallVector<Value, 2> callArgs;
		// Adjust the allocation size to consider alignment.
		if (allocOp.alignment()) {
		accessAlignment = createIndexConstant(
		rewriter, loc, allocOp.alignment().getValue().getSExtValue());
cumulativeSize = rewriter.create<LLVM::SubOp>(		cumulativeSize = rewriter.create<LLVM::SubOp>(
loc,		loc,
rewriter.create<LLVM::AddOp>(loc, cumulativeSize, alignmentValue),		rewriter.create<LLVM::AddOp>(loc, cumulativeSize, accessAlignment),
one);		one);
}		}
allocated = rewriter		callArgs.push_back(cumulativeSize);
.create<LLVM::CallOp>(		auto allocFuncSymbol = rewriter.getSymbolRefAttr(allocFunc);
loc, getVoidPtrType(),		allocatedBytePtr = rewriter
rewriter.getSymbolRefAttr(mallocFunc), cumulativeSize)		.create<LLVM::CallOp>(loc, getVoidPtrType(),
		allocFuncSymbol, callArgs)
.getResult(0);		.getResult(0);
		// For heap allocations, the allocated pointer is a cast of the byte pointer
		// to the type pointer.
		return rewriter.create<LLVM::BitcastOp>(loc, elementPtrType,
		allocatedBytePtr);
}		}

auto structElementType = typeConverter.convertType(elementType);		// An `alloc` is converted into a definition of a memref descriptor value and
auto elementPtrType = structElementType.cast<LLVM::LLVMType>().getPointerTo(		// a call to `malloc` to allocate the underlying data buffer. The memref
type.getMemorySpace());		// descriptor is of the LLVM structure type where:
Value bitcastAllocated = rewriter.create<LLVM::BitcastOp>(		// 1. the first element is a pointer to the allocated (typed) data buffer,
loc, elementPtrType, ArrayRef<Value>(allocated));		// 2. the second element is a pointer to the (typed) payload, aligned to the
		// specified alignment,
		// 3. the remaining elements serve to store all the sizes and strides of the
		// memref using LLVM-converted `index` type.
		//
		// Alignment is performed by allocating `alignment - 1` more bytes than
		// requested and shifting the aligned pointer relative to the allocated
		// memory. If alignment is unspecified, the two pointers are equal.

		// An `alloca` is converted into a definition of a memref descriptor value and
		// an llvm.alloca to allocate the underlying data buffer.
		void rewrite(Operation *op, ArrayRef<Value> operands,
		ConversionPatternRewriter &rewriter) const override {
		MemRefType memRefType = cast<AllocLikeOp>(op).getType();
		auto loc = op->getLoc();

		// Get actual sizes of the memref as values: static sizes are constant
		// values and dynamic sizes are passed to 'alloc' as operands. In case of
		// zero-dimensional memref, assume a scalar (size 1).
		SmallVector<Value, 4> sizes;
		Value cumulativeSize, one;
		getSizes(loc, memRefType, operands, rewriter, sizes, cumulativeSize, one);

		// Allocate the underlying buffer.
		// Value holding the alignment that has to be performed post allocation
		// (in conjunction with allocators that do not support alignment, e.g.
		// malloc); nullptr if no such adjustment needs to be performed.
		Value accessAlignment;
		// Byte pointer to the allocated buffer.
		Value allocatedBytePtr;
		Value allocatedTypePtr =
		allocateBuffer(loc, cumulativeSize, op, memRefType, one,
		accessAlignment, allocatedBytePtr, rewriter);

int64_t offset;		int64_t offset;
SmallVector<int64_t, 4> strides;		SmallVector<int64_t, 4> strides;
auto successStrides = getStridesAndOffset(type, strides, offset);		auto successStrides = getStridesAndOffset(memRefType, strides, offset);
assert(succeeded(successStrides) && "unexpected non-strided memref");
(void)successStrides;		(void)successStrides;
		assert(succeeded(successStrides) && "unexpected non-strided memref");
assert(offset != MemRefType::getDynamicStrideOrOffset() &&		assert(offset != MemRefType::getDynamicStrideOrOffset() &&
"unexpected dynamic offset");		"unexpected dynamic offset");

// 0-D memref corner case: they have size 1 ...		// 0-D memref corner case: they have size 1.
assert(((type.getRank() == 0 && strides.empty() && sizes.size() == 1) ||		assert(
		((memRefType.getRank() == 0 && strides.empty() && sizes.size() == 1) ||
(strides.size() == sizes.size())) &&		(strides.size() == sizes.size())) &&
"unexpected number of strides");		"unexpected number of strides");

// Create the MemRef descriptor.		// Create the MemRef descriptor.
auto structType = typeConverter.convertType(type);		auto memRefDescriptor = createMemRefDescriptor(
auto memRefDescriptor = MemRefDescriptor::undef(rewriter, loc, structType);		loc, rewriter, memRefType, allocatedTypePtr, allocatedBytePtr,
// Field 1: Allocated pointer, used for malloc/free.		accessAlignment, offset, strides, sizes);
memRefDescriptor.setAllocatedPtr(rewriter, loc, bitcastAllocated);

// Field 2: Actual aligned pointer to payload.
Value bitcastAligned = bitcastAllocated;
if (!useAlloca && alignment != 0) {
assert(alignmentValue);
// offset = (align - (ptr % align))% align
Value intVal = rewriter.create<LLVM::PtrToIntOp>(
loc, this->getIndexType(), allocated);
Value ptrModAlign =
rewriter.create<LLVM::URemOp>(loc, intVal, alignmentValue);
Value subbed =
rewriter.create<LLVM::SubOp>(loc, alignmentValue, ptrModAlign);
Value offset = rewriter.create<LLVM::URemOp>(loc, subbed, alignmentValue);
Value aligned = rewriter.create<LLVM::GEPOp>(loc, allocated.getType(),
allocated, offset);
bitcastAligned = rewriter.create<LLVM::BitcastOp>(
loc, elementPtrType, ArrayRef<Value>(aligned));
}
memRefDescriptor.setAlignedPtr(rewriter, loc, bitcastAligned);

// Field 3: Offset in aligned pointer.
memRefDescriptor.setOffset(rewriter, loc,
createIndexConstant(rewriter, loc, offset));

if (type.getRank() == 0)
// No size/stride descriptor in memref, return the descriptor value.
return rewriter.replaceOp(op, {memRefDescriptor});

// Fields 4 and 5: Sizes and strides of the strided MemRef.
// Store all sizes in the descriptor. Only dynamic sizes are passed in as
// operands to AllocOp.
Value runningStride = nullptr;
// Iterate strides in reverse order, compute runningStride and strideValues.
auto nStrides = strides.size();
SmallVector<Value, 4> strideValues(nStrides, nullptr);
for (unsigned i = 0; i < nStrides; ++i) {
int64_t index = nStrides - 1 - i;
if (strides[index] == MemRefType::getDynamicStrideOrOffset())
// Identity layout map is enforced in the match function, so we compute:
// `runningStride *= sizes[index + 1]`
runningStride = runningStride
? rewriter.create<LLVM::MulOp>(loc, runningStride,
sizes[index + 1])
: createIndexConstant(rewriter, loc, 1);
else
runningStride = createIndexConstant(rewriter, loc, strides[index]);
strideValues[index] = runningStride;
}
// Fill size and stride descriptors in memref.
for (auto indexedSize : llvm::enumerate(sizes)) {
int64_t index = indexedSize.index();
memRefDescriptor.setSize(rewriter, loc, index, indexedSize.value());
memRefDescriptor.setStride(rewriter, loc, index, strideValues[index]);
}

// Return the final value of the descriptor.		// Return the final value of the descriptor.
rewriter.replaceOp(op, {memRefDescriptor});		rewriter.replaceOp(op, {memRefDescriptor});
}		}
		};

bool useAlloca;		struct AllocOpLowering : public AllocLikeOpLowering<AllocOp> {
		using Base::Base;
		};
		struct AllocaOpLowering : public AllocLikeOpLowering<AllocaOp> {
		using Base::Base;
};		};
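
The comments in the lowering above describe the heap alignment scheme: allocate `alignment - 1` extra bytes, then shift the pointer by `offset = (align - (ptr % align)) % align`. A minimal sketch of that arithmetic in plain C++ follows. It assumes nothing about the MLIR API; the names `AlignedBuffer`, `alignmentOffset` and `allocateAligned` are hypothetical and exist only for illustration.

```cpp
#include <cassert>
#include <cstdint>
#include <cstdlib>

// Hypothetical pair mirroring the first two memref descriptor fields.
struct AlignedBuffer {
  void *allocated; // pointer returned by malloc; this is what free() takes
  void *aligned;   // pointer to the aligned payload inside the allocation
};

// offset = (align - (ptr % align)) % align, as in the lowering's comment.
inline std::uintptr_t alignmentOffset(std::uintptr_t ptr,
                                      std::uintptr_t align) {
  assert(align != 0 && "alignment must be non-zero");
  return (align - (ptr % align)) % align;
}

inline AlignedBuffer allocateAligned(std::size_t bytes, std::size_t align) {
  assert(align != 0 && "alignment must be non-zero");
  // Over-allocate by `align - 1` bytes so an aligned address always fits.
  void *raw = std::malloc(bytes + align - 1);
  if (!raw)
    return {nullptr, nullptr};
  auto address = reinterpret_cast<std::uintptr_t>(raw);
  void *aligned = static_cast<char *>(raw) + alignmentOffset(address, align);
  return {raw, aligned};
}
```

The two pointers are kept separate because `free` must receive the original `malloc` result, which is why the descriptor stores both an allocated and an aligned pointer. When no alignment is requested the two are equal.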

// A CallOp automatically promotes MemRefType to a sequence of alloca/store and		// A CallOp automatically promotes MemRefType to a sequence of alloca/store and
// passes the pointer to the MemRef across function boundaries.		// passes the pointer to the MemRef across function boundaries.
template <typename CallOpType>		template <typename CallOpType>
struct CallOpInterfaceLowering : public ConvertOpToLLVMPattern<CallOpType> {		struct CallOpInterfaceLowering : public ConvertOpToLLVMPattern<CallOpType> {
using ConvertOpToLLVMPattern<CallOpType>::ConvertOpToLLVMPattern;		using ConvertOpToLLVMPattern<CallOpType>::ConvertOpToLLVMPattern;
using Super = CallOpInterfaceLowering<CallOpType>;		using Super = CallOpInterfaceLowering<CallOpType>;
[... 62 lines not shown ...]
};		};

// A `dealloc` is converted into a call to `free` on the underlying data buffer.		// A `dealloc` is converted into a call to `free` on the underlying data buffer.
// The memref descriptor being an SSA value, there is no need to clean it up		// The memref descriptor being an SSA value, there is no need to clean it up
// in any way.		// in any way.
struct DeallocOpLowering : public ConvertOpToLLVMPattern<DeallocOp> {		struct DeallocOpLowering : public ConvertOpToLLVMPattern<DeallocOp> {
using ConvertOpToLLVMPattern<DeallocOp>::ConvertOpToLLVMPattern;		using ConvertOpToLLVMPattern<DeallocOp>::ConvertOpToLLVMPattern;

explicit DeallocOpLowering(LLVMTypeConverter &converter,		explicit DeallocOpLowering(LLVMTypeConverter &converter)
bool useAlloca = false)		: ConvertOpToLLVMPattern<DeallocOp>(converter) {}
: ConvertOpToLLVMPattern<DeallocOp>(converter), useAlloca(useAlloca) {}

LogicalResult		LogicalResult
matchAndRewrite(Operation *op, ArrayRef<Value> operands,		matchAndRewrite(Operation *op, ArrayRef<Value> operands,
ConversionPatternRewriter &rewriter) const override {		ConversionPatternRewriter &rewriter) const override {
if (useAlloca)
return rewriter.eraseOp(op), success();

assert(operands.size() == 1 && "dealloc takes one operand");		assert(operands.size() == 1 && "dealloc takes one operand");
OperandAdaptor<DeallocOp> transformed(operands);		OperandAdaptor<DeallocOp> transformed(operands);

// Insert the `free` declaration if it is not already present.		// Insert the `free` declaration if it is not already present.
auto freeFunc =		auto freeFunc =
op->getParentOfType<ModuleOp>().lookupSymbol<LLVM::LLVMFuncOp>("free");		op->getParentOfType<ModuleOp>().lookupSymbol<LLVM::LLVMFuncOp>("free");
if (!freeFunc) {		if (!freeFunc) {
OpBuilder moduleBuilder(op->getParentOfType<ModuleOp>().getBodyRegion());		OpBuilder moduleBuilder(op->getParentOfType<ModuleOp>().getBodyRegion());
freeFunc = moduleBuilder.create<LLVM::LLVMFuncOp>(		freeFunc = moduleBuilder.create<LLVM::LLVMFuncOp>(
rewriter.getUnknownLoc(), "free",		rewriter.getUnknownLoc(), "free",
LLVM::LLVMType::getFunctionTy(getVoidType(), getVoidPtrType(),		LLVM::LLVMType::getFunctionTy(getVoidType(), getVoidPtrType(),
/*isVarArg=*/false));		/*isVarArg=*/false));
}		}

MemRefDescriptor memref(transformed.memref());		MemRefDescriptor memref(transformed.memref());
Value casted = rewriter.create<LLVM::BitcastOp>(		Value casted = rewriter.create<LLVM::BitcastOp>(
op->getLoc(), getVoidPtrType(),		op->getLoc(), getVoidPtrType(),
memref.allocatedPtr(rewriter, op->getLoc()));		memref.allocatedPtr(rewriter, op->getLoc()));
rewriter.replaceOpWithNewOp<LLVM::CallOp>(		rewriter.replaceOpWithNewOp<LLVM::CallOp>(
op, ArrayRef<Type>(), rewriter.getSymbolRefAttr(freeFunc), casted);		op, ArrayRef<Type>(), rewriter.getSymbolRefAttr(freeFunc), casted);
return success();		return success();
}		}

bool useAlloca;
};		};

// A `rsqrt` is converted into `1 / sqrt`.		// A `rsqrt` is converted into `1 / sqrt`.
struct RsqrtOpLowering : public ConvertOpToLLVMPattern<RsqrtOp> {		struct RsqrtOpLowering : public ConvertOpToLLVMPattern<RsqrtOp> {
using ConvertOpToLLVMPattern<RsqrtOp>::ConvertOpToLLVMPattern;		using ConvertOpToLLVMPattern<RsqrtOp>::ConvertOpToLLVMPattern;

LogicalResult		LogicalResult
matchAndRewrite(Operation *op, ArrayRef<Value> operands,		matchAndRewrite(Operation *op, ArrayRef<Value> operands,
[... 1,046 lines not shown ...]
void mlir::populateStdToLLVMNonMemoryConversionPatterns(		void mlir::populateStdToLLVMNonMemoryConversionPatterns(
LLVMTypeConverter &converter, OwningRewritePatternList &patterns) {		LLVMTypeConverter &converter, OwningRewritePatternList &patterns) {
// FIXME: this should be tablegen'ed		// FIXME: this should be tablegen'ed
// clang-format off		// clang-format off
patterns.insert<		patterns.insert<
AbsFOpLowering,		AbsFOpLowering,
AddFOpLowering,		AddFOpLowering,
AddIOpLowering,		AddIOpLowering,
		AllocaOpLowering,
AndOpLowering,		AndOpLowering,
AtomicCmpXchgOpLowering,		AtomicCmpXchgOpLowering,
AtomicRMWOpLowering,		AtomicRMWOpLowering,
BranchOpLowering,		BranchOpLowering,
CallIndirectOpLowering,		CallIndirectOpLowering,
CallOpLowering,		CallOpLowering,
CeilFOpLowering,		CeilFOpLowering,
CmpFOpLowering,		CmpFOpLowering,
[... 37 lines not shown ...]	patterns.insert<
UnsignedRemIOpLowering,		UnsignedRemIOpLowering,
UnsignedShiftRightOpLowering,		UnsignedShiftRightOpLowering,
XOrOpLowering,		XOrOpLowering,
ZeroExtendIOpLowering>(converter);		ZeroExtendIOpLowering>(converter);
// clang-format on		// clang-format on
}		}

void mlir::populateStdToLLVMMemoryConversionPatters(		void mlir::populateStdToLLVMMemoryConversionPatters(
LLVMTypeConverter &converter, OwningRewritePatternList &patterns,		LLVMTypeConverter &converter, OwningRewritePatternList &patterns) {
bool useAlloca) {
// clang-format off		// clang-format off
patterns.insert<		patterns.insert<
AssumeAlignmentOpLowering,		AssumeAlignmentOpLowering,
DimOpLowering,		DimOpLowering,
LoadOpLowering,		LoadOpLowering,
MemRefCastOpLowering,		MemRefCastOpLowering,
StoreOpLowering,		StoreOpLowering,
SubViewOpLowering,		SubViewOpLowering,
ViewOpLowering>(converter);		ViewOpLowering>(converter);
patterns.insert<		patterns.insert<
AllocOpLowering,		AllocOpLowering,
DeallocOpLowering>(converter, useAlloca);		DeallocOpLowering>(converter);
// clang-format on		// clang-format on
}		}

void mlir::populateStdToLLVMDefaultFuncOpConversionPattern(		void mlir::populateStdToLLVMDefaultFuncOpConversionPattern(
LLVMTypeConverter &converter, OwningRewritePatternList &patterns,		LLVMTypeConverter &converter, OwningRewritePatternList &patterns,
bool emitCWrappers) {		bool emitCWrappers) {
patterns.insert<FuncOpConversion>(converter, emitCWrappers);		patterns.insert<FuncOpConversion>(converter, emitCWrappers);
}		}

void mlir::populateStdToLLVMConversionPatterns(		void mlir::populateStdToLLVMConversionPatterns(
LLVMTypeConverter &converter, OwningRewritePatternList &patterns,		LLVMTypeConverter &converter, OwningRewritePatternList &patterns,
bool useAlloca, bool emitCWrappers) {		bool emitCWrappers) {
populateStdToLLVMDefaultFuncOpConversionPattern(converter, patterns,		populateStdToLLVMDefaultFuncOpConversionPattern(converter, patterns,
emitCWrappers);		emitCWrappers);
populateStdToLLVMNonMemoryConversionPatterns(converter, patterns);		populateStdToLLVMNonMemoryConversionPatterns(converter, patterns);
populateStdToLLVMMemoryConversionPatters(converter, patterns, useAlloca);		populateStdToLLVMMemoryConversionPatters(converter, patterns);
}		}

static void populateStdToLLVMBarePtrFuncOpConversionPattern(		static void populateStdToLLVMBarePtrFuncOpConversionPattern(
LLVMTypeConverter &converter, OwningRewritePatternList &patterns) {		LLVMTypeConverter &converter, OwningRewritePatternList &patterns) {
patterns.insert<BarePtrFuncOpConversion>(converter);		patterns.insert<BarePtrFuncOpConversion>(converter);
}		}

void mlir::populateStdToLLVMBarePtrConversionPatterns(		void mlir::populateStdToLLVMBarePtrConversionPatterns(
LLVMTypeConverter &converter, OwningRewritePatternList &patterns,		LLVMTypeConverter &converter, OwningRewritePatternList &patterns) {
bool useAlloca) {
populateStdToLLVMBarePtrFuncOpConversionPattern(converter, patterns);		populateStdToLLVMBarePtrFuncOpConversionPattern(converter, patterns);
populateStdToLLVMNonMemoryConversionPatterns(converter, patterns);		populateStdToLLVMNonMemoryConversionPatterns(converter, patterns);
populateStdToLLVMMemoryConversionPatters(converter, patterns, useAlloca);		populateStdToLLVMMemoryConversionPatters(converter, patterns);
}		}

// Create an LLVM IR structure type if there is more than one result.		// Create an LLVM IR structure type if there is more than one result.
Type LLVMTypeConverter::packFunctionResults(ArrayRef<Type> types) {		Type LLVMTypeConverter::packFunctionResults(ArrayRef<Type> types) {
assert(!types.empty() && "expected non-empty list of type");		assert(!types.empty() && "expected non-empty list of type");

if (types.size() == 1)		if (types.size() == 1)
return convertType(types.front());		return convertType(types.front());
[... 57 lines not shown ...]
namespace {		namespace {
/// A pass converting MLIR operations into the LLVM IR dialect.		/// A pass converting MLIR operations into the LLVM IR dialect.
struct LLVMLoweringPass : public ModulePass<LLVMLoweringPass> {		struct LLVMLoweringPass : public ModulePass<LLVMLoweringPass> {
/// Include the generated pass utilities.		/// Include the generated pass utilities.
#define GEN_PASS_ConvertStandardToLLVM		#define GEN_PASS_ConvertStandardToLLVM
#include "mlir/Conversion/Passes.h.inc"		#include "mlir/Conversion/Passes.h.inc"

/// Creates an LLVM lowering pass.		/// Creates an LLVM lowering pass.
LLVMLoweringPass(bool useAlloca, bool useBarePtrCallConv, bool emitCWrappers,		LLVMLoweringPass(bool useBarePtrCallConv, bool emitCWrappers,
unsigned indexBitwidth) {		unsigned indexBitwidth) {
this->useAlloca = useAlloca;
this->useBarePtrCallConv = useBarePtrCallConv;		this->useBarePtrCallConv = useBarePtrCallConv;
this->emitCWrappers = emitCWrappers;		this->emitCWrappers = emitCWrappers;
this->indexBitwidth = indexBitwidth;		this->indexBitwidth = indexBitwidth;
}		}
explicit LLVMLoweringPass() {}		explicit LLVMLoweringPass() {}
LLVMLoweringPass(const LLVMLoweringPass &pass) {}		LLVMLoweringPass(const LLVMLoweringPass &pass) {}

/// Run the dialect converter on the module.		/// Run the dialect converter on the module.
[... 11 lines not shown ...]	void runOnModule() override {
LLVMTypeConverterCustomization customs;		LLVMTypeConverterCustomization customs;
customs.funcArgConverter = useBarePtrCallConv ? barePtrFuncArgTypeConverter		customs.funcArgConverter = useBarePtrCallConv ? barePtrFuncArgTypeConverter
: structFuncArgTypeConverter;		: structFuncArgTypeConverter;
customs.indexBitwidth = indexBitwidth;		customs.indexBitwidth = indexBitwidth;
LLVMTypeConverter typeConverter(&getContext(), customs);		LLVMTypeConverter typeConverter(&getContext(), customs);

OwningRewritePatternList patterns;		OwningRewritePatternList patterns;
if (useBarePtrCallConv)		if (useBarePtrCallConv)
populateStdToLLVMBarePtrConversionPatterns(typeConverter, patterns,		populateStdToLLVMBarePtrConversionPatterns(typeConverter, patterns);
useAlloca);
else		else
populateStdToLLVMConversionPatterns(typeConverter, patterns, useAlloca,		populateStdToLLVMConversionPatterns(typeConverter, patterns,
emitCWrappers);		emitCWrappers);

LLVMConversionTarget target(getContext());		LLVMConversionTarget target(getContext());
if (failed(applyPartialConversion(m, target, patterns, &typeConverter)))		if (failed(applyPartialConversion(m, target, patterns, &typeConverter)))
signalPassFailure();		signalPassFailure();
}		}
};		};
} // end namespace		} // end namespace

mlir::LLVMConversionTarget::LLVMConversionTarget(MLIRContext &ctx)		mlir::LLVMConversionTarget::LLVMConversionTarget(MLIRContext &ctx)
: ConversionTarget(ctx) {		: ConversionTarget(ctx) {
this->addLegalDialect<LLVM::LLVMDialect>();		this->addLegalDialect<LLVM::LLVMDialect>();
this->addIllegalOp<LLVM::DialectCastOp>();		this->addIllegalOp<LLVM::DialectCastOp>();
this->addIllegalOp<TanhOp>();		this->addIllegalOp<TanhOp>();
}		}

std::unique_ptr<OpPassBase<ModuleOp>>		std::unique_ptr<OpPassBase<ModuleOp>>
mlir::createLowerToLLVMPass(bool useAlloca, bool useBarePtrCallConv,		mlir::createLowerToLLVMPass(const LowerToLLVMOptions &options) {
bool emitCWrappers, unsigned indexBitwidth) {		return std::make_unique<LLVMLoweringPass>(
return std::make_unique<LLVMLoweringPass>(useAlloca, useBarePtrCallConv,		options.useBarePtrCallConv, options.emitCWrappers, options.indexBitwidth);
		nicolasvasilache: I am concerned by the silent behavior-breaking changes here and in the follow-up revision. I am expecting this will be painful for integrations and will repeat itself. Can we turn this into a: struct LowerToLLVMOptions { ... }
		bondhugula: That's a concern - there are going to be silent breaking effects. I've changed it to a struct here. But even with a struct, you'll have the same issue with list initialization (some of the fields will remain uninitialized / wrongly initialized depending on where the field was removed from or where the new field was added). If the new fields are always added at the end, we'll just have uninitialized fields. And with explicit field-wise init, it'll just lead to uninitialized stuff. BTW, none of the three options are documented at the declaration (but only in Passes.td).
emitCWrappers, indexBitwidth);
}		}
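
The inline discussion above concerns how removing `useAlloca` from a list of positional `bool` parameters can silently change the meaning of existing call sites. The sketch below illustrates that concern and the struct-based alternative. It is not the code from this revision: `ResolvedConfig`, `configurePositional`, `configureFromOptions`, `staleCaller` and `namedCaller` are hypothetical, and the default values in `LowerToLLVMOptions` are assumed for illustration.

```cpp
#include <cassert>

// Hypothetical options struct in the spirit of the review suggestion. The
// field names follow the pass options in this diff; the defaults are assumed.
struct LowerToLLVMOptions {
  bool useBarePtrCallConv = false;
  bool emitCWrappers = false;
  unsigned indexBitwidth = 64;
};

// What the pass would end up configured with.
struct ResolvedConfig {
  bool useBarePtrCallConv;
  bool emitCWrappers;
  unsigned indexBitwidth;
};

// Positional style after a leading `useAlloca` flag has been removed. A stale
// call site written for the old argument order can still compile.
inline ResolvedConfig configurePositional(bool useBarePtrCallConv = false,
                                          bool emitCWrappers = false,
                                          unsigned indexBitwidth = 64) {
  return {useBarePtrCallConv, emitCWrappers, indexBitwidth};
}

// Struct style: each field is read by name, so order no longer matters.
inline ResolvedConfig configureFromOptions(const LowerToLLVMOptions &options) {
  assert(options.indexBitwidth != 0 && "index bitwidth must be non-zero");
  return {options.useBarePtrCallConv, options.emitCWrappers,
          options.indexBitwidth};
}

// Stale caller: the arguments were meant as
// (useAlloca = false, useBarePtrCallConv = true).
inline ResolvedConfig staleCaller() { return configurePositional(false, true); }

// Field-wise initialization names the option being set.
inline ResolvedConfig namedCaller() {
  LowerToLLVMOptions options;
  options.useBarePtrCallConv = true;
  return configureFromOptions(options);
}
```

In this sketch `staleCaller()` ends up with `useBarePtrCallConv == false` and `emitCWrappers == true`, the opposite of what the caller intended, and the compiler accepts it without complaint. As the author points out, brace initialization such as `LowerToLLVMOptions{false, true}` would reintroduce the same positional dependence, so the benefit comes from field-wise assignment.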

mlir/lib/Dialect/StandardOps/IR/Ops.cpp

[... 236 lines not shown ...]	OpFoldResult AddIOp::fold(ArrayRef<Attribute> operands) {
if (matchPattern(rhs(), m_Zero()))		if (matchPattern(rhs(), m_Zero()))
return lhs();		return lhs();

return constFoldBinaryOp<IntegerAttr>(operands,		return constFoldBinaryOp<IntegerAttr>(operands,
[](APInt a, APInt b) { return a + b; });		[](APInt a, APInt b) { return a + b; });
}		}

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// AllocOp		// AllocOp / AllocaOp
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

static void print(OpAsmPrinter &p, AllocOp op) {		template <typename AllocLikeOp>
p << "alloc";		static void printAllocLikeOp(OpAsmPrinter &p, AllocLikeOp op, StringRef name) {
		static_assert(llvm::is_one_of<AllocLikeOp, AllocOp, AllocaOp>::value,
		rriddle: llvm::is_one_of
		bondhugula: Thanks!
		"applies to only alloc or alloca");
		p << name;

// Print dynamic dimension operands.		// Print dynamic dimension operands.
MemRefType type = op.getType();		MemRefType type = op.getType();
printDimAndSymbolList(op.operand_begin(), op.operand_end(),		printDimAndSymbolList(op.operand_begin(), op.operand_end(),
type.getNumDynamicDims(), p);		type.getNumDynamicDims(), p);
p.printOptionalAttrDict(op.getAttrs(), /*elidedAttrs=*/{"map"});		p.printOptionalAttrDict(op.getAttrs(), /*elidedAttrs=*/{"map"});
p << " : " << type;		p << " : " << type;
}		}

static ParseResult parseAllocOp(OpAsmParser &parser, OperationState &result) {		static void print(OpAsmPrinter &p, AllocOp op) {
		printAllocLikeOp(p, op, "alloc");
		}

		static void print(OpAsmPrinter &p, AllocaOp op) {
		printAllocLikeOp(p, op, "alloca");
		}

		static ParseResult parseAllocLikeOp(OpAsmParser &parser,
		OperationState &result) {
MemRefType type;		MemRefType type;

// Parse the dimension operands and optional symbol operands, followed by a		// Parse the dimension operands and optional symbol operands, followed by a
// memref type.		// memref type.
unsigned numDimOperands;		unsigned numDimOperands;
if (parseDimAndSymbolList(parser, result.operands, numDimOperands) ||		if (parseDimAndSymbolList(parser, result.operands, numDimOperands) ||
parser.parseOptionalAttrDict(result.attributes) ||		parser.parseOptionalAttrDict(result.attributes) ||
parser.parseColonType(type))		parser.parseColonType(type))
return failure();		return failure();

// Check numDynamicDims against number of question marks in memref type.		// Check numDynamicDims against number of question marks in memref type.
// Note: this check remains here (instead of in verify()), because the		// Note: this check remains here (instead of in verify()), because the
// partition between dim operands and symbol operands is lost after parsing.		// partition between dim operands and symbol operands is lost after parsing.
// Verification still checks that the total number of operands matches		// Verification still checks that the total number of operands matches
// the number of symbols in the affine map, plus the number of dynamic		// the number of symbols in the affine map, plus the number of dynamic
// dimensions in the memref.		// dimensions in the memref.
if (numDimOperands != type.getNumDynamicDims())		if (numDimOperands != type.getNumDynamicDims())
return parser.emitError(parser.getNameLoc())		return parser.emitError(parser.getNameLoc())
<< "dimension operand count does not equal memref dynamic dimension "		<< "dimension operand count does not equal memref dynamic dimension "
"count";		"count";
result.types.push_back(type);		result.types.push_back(type);
return success();		return success();
}		}

static LogicalResult verify(AllocOp op) {		template <typename AllocLikeOp>
auto memRefType = op.getResult().getType().dyn_cast<MemRefType>();		static LogicalResult verify(AllocLikeOp op) {
		static_assert(std::is_same<AllocLikeOp, AllocOp>::value ||
		std::is_same<AllocLikeOp, AllocaOp>::value,
		"applies to only alloc or alloca");
		auto memRefType = op.getResult().getType().template dyn_cast<MemRefType>();
if (!memRefType)		if (!memRefType)
return op.emitOpError("result must be a memref");		return op.emitOpError("result must be a memref");

unsigned numSymbols = 0;		unsigned numSymbols = 0;
if (!memRefType.getAffineMaps().empty()) {		if (!memRefType.getAffineMaps().empty()) {
// Store number of symbols used in affine map (used in subsequent check).		// Store number of symbols used in affine map (used in subsequent check).
AffineMap affineMap = memRefType.getAffineMaps()[0];		AffineMap affineMap = memRefType.getAffineMaps()[0];
numSymbols = affineMap.getNumSymbols();		numSymbols = affineMap.getNumSymbols();
[... 10 lines not shown ...]	static LogicalResult verify(AllocLikeOp op) {
// Verify that all operands are of type Index.		// Verify that all operands are of type Index.
for (auto operandType : op.getOperandTypes())		for (auto operandType : op.getOperandTypes())
if (!operandType.isIndex())		if (!operandType.isIndex())
return op.emitOpError("requires operands to be of type Index");		return op.emitOpError("requires operands to be of type Index");
return success();		return success();
}		}
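
The templated helpers above restrict `AllocLikeOp` to the two allocation ops with a `static_assert`, written once with `llvm::is_one_of` and once with a chain of `std::is_same`. The following self-contained sketch shows the same guard without LLVM headers. The empty `AllocOp`, `AllocaOp` and `LoadOp` structs are stand-ins for the real op classes, `allocLikeName` is a hypothetical helper, and the local `is_one_of` alias is a minimal C++17 equivalent rather than the LLVM definition itself.

```cpp
#include <cassert>
#include <string_view>
#include <type_traits>

// Stand-ins for the real op classes; only their identity as types matters.
struct AllocOp {};
struct AllocaOp {};
struct LoadOp {};

// Minimal equivalent of llvm::is_one_of: true if T is the same as any of Ts.
template <typename T, typename... Ts>
using is_one_of = std::disjunction<std::is_same<T, Ts>...>;

// Hypothetical helper guarded the same way as printAllocLikeOp and verify.
template <typename AllocLikeOp>
constexpr std::string_view allocLikeName() {
  static_assert(is_one_of<AllocLikeOp, AllocOp, AllocaOp>::value,
                "applies to only alloc or alloca");
  return std::is_same<AllocLikeOp, AllocOp>::value ? "alloc" : "alloca";
}
```

Instantiating `allocLikeName<LoadOp>()` fails at compile time with the assertion message, which is the point of the guard: misuse of the shared template is rejected before any op is printed or verified.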

namespace {		namespace {
/// Fold constant dimensions into an alloc operation.		/// Fold constant dimensions into an alloc-like operation.
struct SimplifyAllocConst : public OpRewritePattern<AllocOp> {		template <typename AllocLikeOp>
using OpRewritePattern<AllocOp>::OpRewritePattern;		struct SimplifyAllocConst : public OpRewritePattern<AllocLikeOp> {
		using OpRewritePattern<AllocLikeOp>::OpRewritePattern;

LogicalResult matchAndRewrite(AllocOp alloc,		LogicalResult matchAndRewrite(AllocLikeOp alloc,
PatternRewriter &rewriter) const override {		PatternRewriter &rewriter) const override {
// Check to see if any dimensions operands are constants. If so, we can		// Check to see if any dimensions operands are constants. If so, we can
// substitute and drop them.		// substitute and drop them.
if (llvm::none_of(alloc.getOperands(), [](Value operand) {		if (llvm::none_of(alloc.getOperands(), [](Value operand) {
return matchPattern(operand, m_ConstantIndex());		return matchPattern(operand, m_ConstantIndex());
}))		}))
return failure();		return failure();

[... 27 lines not shown ...]	LogicalResult matchAndRewrite(AllocLikeOp alloc,

// Create new memref type (which will have fewer dynamic dimensions).		// Create new memref type (which will have fewer dynamic dimensions).
MemRefType newMemRefType =		MemRefType newMemRefType =
MemRefType::Builder(memrefType).setShape(newShapeConstants);		MemRefType::Builder(memrefType).setShape(newShapeConstants);
assert(static_cast<int64_t>(newOperands.size()) ==		assert(static_cast<int64_t>(newOperands.size()) ==
newMemRefType.getNumDynamicDims());		newMemRefType.getNumDynamicDims());

// Create and insert the alloc op for the new memref.		// Create and insert the alloc op for the new memref.
auto newAlloc = rewriter.create<AllocOp>(alloc.getLoc(), newMemRefType,		auto newAlloc = rewriter.create<AllocLikeOp>(alloc.getLoc(), newMemRefType,
newOperands, IntegerAttr());		newOperands, IntegerAttr());
// Insert a cast so we have the same type as the old alloc.		// Insert a cast so we have the same type as the old alloc.
auto resultCast = rewriter.create<MemRefCastOp>(alloc.getLoc(), newAlloc,		auto resultCast = rewriter.create<MemRefCastOp>(alloc.getLoc(), newAlloc,
alloc.getType());		alloc.getType());

rewriter.replaceOp(alloc, {resultCast});		rewriter.replaceOp(alloc, {resultCast});
return success();		return success();
}		}
};		};
[... 11 lines not shown ...]	LogicalResult matchAndRewrite(AllocOp alloc,
}		}
return failure();		return failure();
}		}
};		};
} // end anonymous namespace.		} // end anonymous namespace.

 void AllocOp::getCanonicalizationPatterns(OwningRewritePatternList &results,
                                           MLIRContext *context) {
-  results.insert<SimplifyAllocConst, SimplifyDeadAlloc>(context);
+  results.insert<SimplifyAllocConst<AllocOp>, SimplifyDeadAlloc>(context);
+}
+
+void AllocaOp::getCanonicalizationPatterns(OwningRewritePatternList &results,
+                                           MLIRContext *context) {
+  results.insert<SimplifyAllocConst<AllocaOp>>(context);
 }

 //===----------------------------------------------------------------------===//
 // AndOp
 //===----------------------------------------------------------------------===//

 OpFoldResult AndOp::fold(ArrayRef<Attribute> operands) {
   /// and(x, 0) -> 0
@@ 2,123 lines not shown @@
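The change above turns `SimplifyAllocConst` into a template over the op type so that `AllocOp` and `AllocaOp` can share one pattern, while `SimplifyDeadAlloc` stays registered for `AllocOp` only. The following is a minimal standalone sketch of that registration shape. It does not use MLIR: the two op structs, `describe()`, and `collectPatterns()` are hypothetical stand-ins introduced purely for illustration.

```cpp
#include <cassert>
#include <string>
#include <vector>

// Hypothetical stand-ins for the two ops; these are not the MLIR classes.
struct AllocOp {
  static const char *name() { return "std.alloc"; }
};
struct AllocaOp {
  static const char *name() { return "std.alloca"; }
};

// One pattern template shared by both ops, in the spirit of
// SimplifyAllocConst<AllocLikeOp> in the diff.
template <typename AllocLikeOp>
struct SimplifyAllocConst {
  static std::string describe() {
    return std::string("SimplifyAllocConst<") + AllocLikeOp::name() + ">";
  }
};

// A pattern that is not templated and is registered for one op only.
struct SimplifyDeadAlloc {
  static std::string describe() { return "SimplifyDeadAlloc"; }
};

// Hypothetical analogue of results.insert<Patterns...>(context): collect a
// description of every pattern type in the pack.
template <typename... Patterns>
std::vector<std::string> collectPatterns() {
  return {Patterns::describe()...};
}

// Mirrors the two getCanonicalizationPatterns bodies in the diff.
std::vector<std::string> allocPatterns() {
  return collectPatterns<SimplifyAllocConst<AllocOp>, SimplifyDeadAlloc>();
}
std::vector<std::string> allocaPatterns() {
  return collectPatterns<SimplifyAllocConst<AllocaOp>>();
}
```

The point of the sketch is only that a single class template, instantiated once per op type, replaces two near-identical pattern classes.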

mlir/test/Conversion/StandardToLLVM/convert-dynamic-memref-ops.mlir

@@ 87 lines not shown @@
 // CHECK-NEXT: llvm.insertvalue %[[st0]], %{{.*}}[4, 0] : !llvm<"{ float*, float*, i64, [2 x i64], [2 x i64] }">
 // CHECK-NEXT: llvm.insertvalue %[[N]], %{{.*}}[3, 1] : !llvm<"{ float*, float*, i64, [2 x i64], [2 x i64] }">
 // CHECK-NEXT: llvm.insertvalue %[[st1]], %{{.*}}[4, 1] : !llvm<"{ float*, float*, i64, [2 x i64], [2 x i64] }">
 %0 = alloc(%arg0, %arg1) : memref<?x?xf32>
 // CHECK-NEXT: llvm.return %{{.*}} : !llvm<"{ float*, float*, i64, [2 x i64], [2 x i64] }">
 return %0 : memref<?x?xf32>
 }

+// -----
+
+// CHECK-LABEL: func @dynamic_alloca
+// CHECK: %[[M:.*]]: !llvm.i64, %[[N:.*]]: !llvm.i64) -> !llvm<"{ float*, float*, i64, [2 x i64], [2 x i64] }"> {
+func @dynamic_alloca(%arg0: index, %arg1: index) -> memref<?x?xf32> {
+// CHECK: %[[num_elems:.*]] = llvm.mul %[[M]], %[[N]] : !llvm.i64
+// CHECK-NEXT: %[[null:.*]] = llvm.mlir.null : !llvm<"float*">
+// CHECK-NEXT: %[[one:.*]] = llvm.mlir.constant(1 : index) : !llvm.i64
+// CHECK-NEXT: %[[gep:.*]] = llvm.getelementptr %[[null]][%[[one]]] : (!llvm<"float*">, !llvm.i64) -> !llvm<"float*">
+// CHECK-NEXT: %[[sizeof:.*]] = llvm.ptrtoint %[[gep]] : !llvm<"float*"> to !llvm.i64
+// CHECK-NEXT: %[[sz_bytes:.*]] = llvm.mul %[[num_elems]], %[[sizeof]] : !llvm.i64
+// CHECK-NEXT: %[[allocated:.*]] = llvm.alloca %[[sz_bytes]] x !llvm.float : (!llvm.i64) -> !llvm<"float*">
+// CHECK-NEXT: llvm.mlir.undef : !llvm<"{ float*, float*, i64, [2 x i64], [2 x i64] }">
+// CHECK-NEXT: llvm.insertvalue %[[allocated]], %{{.*}}[0] : !llvm<"{ float*, float*, i64, [2 x i64], [2 x i64] }">
+// CHECK-NEXT: llvm.insertvalue %[[allocated]], %{{.*}}[1] : !llvm<"{ float*, float*, i64, [2 x i64], [2 x i64] }">
+// CHECK-NEXT: %[[off:.*]] = llvm.mlir.constant(0 : index) : !llvm.i64
+// CHECK-NEXT: llvm.insertvalue %[[off]], %{{.*}}[2] : !llvm<"{ float*, float*, i64, [2 x i64], [2 x i64] }">
+// CHECK-NEXT: %[[st1:.*]] = llvm.mlir.constant(1 : index) : !llvm.i64
+// CHECK-NEXT: %[[st0:.*]] = llvm.mul %{{.*}}, %[[N]] : !llvm.i64
+// CHECK-NEXT: llvm.insertvalue %[[M]], %{{.*}}[3, 0] : !llvm<"{ float*, float*, i64, [2 x i64], [2 x i64] }">
+// CHECK-NEXT: llvm.insertvalue %[[st0]], %{{.*}}[4, 0] : !llvm<"{ float*, float*, i64, [2 x i64], [2 x i64] }">
+// CHECK-NEXT: llvm.insertvalue %[[N]], %{{.*}}[3, 1] : !llvm<"{ float*, float*, i64, [2 x i64], [2 x i64] }">
+// CHECK-NEXT: llvm.insertvalue %[[st1]], %{{.*}}[4, 1] : !llvm<"{ float*, float*, i64, [2 x i64], [2 x i64] }">
+%0 = alloca(%arg0, %arg1) : memref<?x?xf32>
+
+// Test with explicitly specified alignment. llvm.alloca takes care of the
+// alignment. The same pointer is thus used for allocation and aligned
+// accesses.
+// CHECK: %[[alloca_aligned:.*]] = llvm.alloca %{{.*}} x !llvm.float {alignment = 32 : i64} : (!llvm.i64) -> !llvm<"float*">
+// CHECK: %[[desc:.*]] = llvm.mlir.undef : !llvm<"{ float*, float*, i64, [2 x i64], [2 x i64] }">
+// CHECK: %[[desc1:.*]] = llvm.insertvalue %[[alloca_aligned]], %[[desc]][0] : !llvm<"{ float*, float*, i64, [2 x i64], [2 x i64] }">
+// CHECK: llvm.insertvalue %[[alloca_aligned]], %[[desc1]][1] : !llvm<"{ float*, float*, i64, [2 x i64], [2 x i64] }">
+alloca(%arg0, %arg1) {alignment = 32} : memref<?x?xf32>
+return %0 : memref<?x?xf32>
+}
+
 // CHECK-LABEL: func @dynamic_dealloc
 func @dynamic_dealloc(%arg0: memref<?x?xf32>) {
 // CHECK: %[[ptr:.*]] = llvm.extractvalue %{{.*}}[0] : !llvm<"{ float*, float*, i64, [2 x i64], [2 x i64] }">
 // CHECK-NEXT: %[[ptri8:.*]] = llvm.bitcast %[[ptr]] : !llvm<"float*"> to !llvm<"i8*">
 // CHECK-NEXT: llvm.call @free(%[[ptri8]]) : (!llvm<"i8*">) -> ()
 dealloc %arg0 : memref<?x?xf32>
 return
 }
@@ 183 lines not shown @@
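The `@dynamic_alloca` CHECK lines describe how the rank-2 descriptor `{ float*, float*, i64, [2 x i64], [2 x i64] }` is filled in: the same pointer goes into slots 0 and 1, the offset is 0, the sizes are `[M, N]`, and the strides are `[N, 1]`. The sketch below restates that arithmetic in plain C++. The struct and function names are hypothetical and are not part of MLIR; only the field order and values are taken from the test.

```cpp
#include <array>
#include <cassert>
#include <cstdint>

// Hypothetical mirror of the lowered struct
// { float*, float*, i64, [2 x i64], [2 x i64] } for a rank-2 memref.
struct MemRefDescriptor2D {
  float *allocated;                    // slot [0]
  float *aligned;                      // slot [1]
  std::int64_t offset;                 // slot [2]
  std::array<std::int64_t, 2> sizes;   // slots [3, 0] and [3, 1]
  std::array<std::int64_t, 2> strides; // slots [4, 0] and [4, 1]
};

// num_elems * sizeof(float), matching the two llvm.mul CHECK lines.
inline std::int64_t byteSize(std::int64_t m, std::int64_t n) {
  return m * n * static_cast<std::int64_t>(sizeof(float));
}

// Fill the descriptor the way the CHECK lines expect for the alloca case:
// one pointer used twice, zero offset, row-major strides.
inline MemRefDescriptor2D makeDescriptor(float *buffer, std::int64_t m,
                                         std::int64_t n) {
  return MemRefDescriptor2D{buffer, buffer, 0, {m, n}, {n, 1}};
}
```

For a 2x3 buffer this gives sizes `[2, 3]` and strides `[3, 1]`, so element `(i, j)` sits at linear index `i * 3 + j`.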

mlir/test/Conversion/StandardToLLVM/convert-static-memref-ops.mlir

 // RUN: mlir-opt -convert-std-to-llvm %s | FileCheck %s
-// RUN: mlir-opt -convert-std-to-llvm='use-alloca=1' %s | FileCheck %s --check-prefix=ALLOCA
 // RUN: mlir-opt -convert-std-to-llvm='use-bare-ptr-memref-call-conv=1' -split-input-file %s | FileCheck %s --check-prefix=BAREPTR

 // BAREPTR-LABEL: func @check_noalias
 // BAREPTR-SAME: %{{.*}}: !llvm<"float*"> {llvm.noalias = true}, %{{.*}}: !llvm<"float*"> {llvm.noalias = true}
 func @check_noalias(%static : memref<2xf32> {llvm.noalias = true}, %other : memref<2xf32> {llvm.noalias = true}) {
 return
 }

@@ 51 lines not shown @@
 // BAREPTR-NEXT: %[[ins4:.*]] = llvm.insertvalue %[[val4]], %[[ins3]][4, 1] : !llvm<"{ float*, float*, i64, [2 x i64], [2 x i64] }">
 // BAREPTR-NEXT: llvm.return %[[ins4]] : !llvm<"{ float*, float*, i64, [2 x i64], [2 x i64] }">
 return %static : memref<32x18xf32, offset:7, strides:[22,1]>
 }

 // -----

 // CHECK-LABEL: func @zero_d_alloc() -> !llvm<"{ float*, float*, i64 }"> {
-// ALLOCA-LABEL: func @zero_d_alloc() -> !llvm<"{ float*, float*, i64 }"> {
 // BAREPTR-LABEL: func @zero_d_alloc() -> !llvm<"{ float*, float*, i64 }"> {
 func @zero_d_alloc() -> memref<f32> {
 // CHECK-NEXT: llvm.mlir.constant(1 : index) : !llvm.i64
 // CHECK-NEXT: %[[null:.*]] = llvm.mlir.null : !llvm<"float*">
 // CHECK-NEXT: %[[one:.*]] = llvm.mlir.constant(1 : index) : !llvm.i64
 // CHECK-NEXT: %[[gep:.*]] = llvm.getelementptr %[[null]][%[[one]]] : (!llvm<"float*">, !llvm.i64) -> !llvm<"float*">
 // CHECK-NEXT: %[[sizeof:.*]] = llvm.ptrtoint %[[gep]] : !llvm<"float*"> to !llvm.i64
 // CHECK-NEXT: llvm.mul %{{.*}}, %[[sizeof]] : !llvm.i64
 // CHECK-NEXT: llvm.call @malloc(%{{.*}}) : (!llvm.i64) -> !llvm<"i8*">
 // CHECK-NEXT: %[[ptr:.*]] = llvm.bitcast %{{.*}} : !llvm<"i8*"> to !llvm<"float*">
 // CHECK-NEXT: llvm.mlir.undef : !llvm<"{ float*, float*, i64 }">
 // CHECK-NEXT: llvm.insertvalue %[[ptr]], %{{.*}}[0] : !llvm<"{ float*, float*, i64 }">
 // CHECK-NEXT: llvm.insertvalue %[[ptr]], %{{.*}}[1] : !llvm<"{ float*, float*, i64 }">
 // CHECK-NEXT: %[[c0:.*]] = llvm.mlir.constant(0 : index) : !llvm.i64
 // CHECK-NEXT: llvm.insertvalue %[[c0]], %{{.*}}[2] : !llvm<"{ float*, float*, i64 }">

-// ALLOCA-NOT: malloc
-// ALLOCA: alloca
-// ALLOCA-NOT: malloc
-
 // BAREPTR-NEXT: llvm.mlir.constant(1 : index) : !llvm.i64
 // BAREPTR-NEXT: %[[null:.*]] = llvm.mlir.null : !llvm<"float*">
 // BAREPTR-NEXT: %[[one:.*]] = llvm.mlir.constant(1 : index) : !llvm.i64
 // BAREPTR-NEXT: %[[gep:.*]] = llvm.getelementptr %[[null]][%[[one]]] : (!llvm<"float*">, !llvm.i64) -> !llvm<"float*">
 // BAREPTR-NEXT: %[[sizeof:.*]] = llvm.ptrtoint %[[gep]] : !llvm<"float*"> to !llvm.i64
 // BAREPTR-NEXT: llvm.mul %{{.*}}, %[[sizeof]] : !llvm.i64
 // BAREPTR-NEXT: llvm.call @malloc(%{{.*}}) : (!llvm.i64) -> !llvm<"i8*">
 // BAREPTR-NEXT: %[[ptr:.*]] = llvm.bitcast %{{.*}} : !llvm<"i8*"> to !llvm<"float*">
@@ 103 lines not shown @@
 // BAREPTR-NEXT: %[[allocated:.*]] = llvm.call @malloc(%[[bytes]]) : (!llvm.i64) -> !llvm<"i8*">
 // BAREPTR-NEXT: llvm.bitcast %[[allocated]] : !llvm<"i8*"> to !llvm<"float*">
 %0 = alloc() : memref<32x18xf32>
 return %0 : memref<32x18xf32>
 }

 // -----

+// CHECK-LABEL: func @static_alloca() -> !llvm<"{ float*, float*, i64, [2 x i64], [2 x i64] }"> {
+func @static_alloca() -> memref<32x18xf32> {
+// CHECK-NEXT: %[[sz1:.*]] = llvm.mlir.constant(32 : index) : !llvm.i64
+// CHECK-NEXT: %[[sz2:.*]] = llvm.mlir.constant(18 : index) : !llvm.i64
+// CHECK-NEXT: %[[num_elems:.*]] = llvm.mul %0, %1 : !llvm.i64
+// CHECK-NEXT: %[[null:.*]] = llvm.mlir.null : !llvm<"float*">
+// CHECK-NEXT: %[[one:.*]] = llvm.mlir.constant(1 : index) : !llvm.i64
+// CHECK-NEXT: %[[gep:.*]] = llvm.getelementptr %[[null]][%[[one]]] : (!llvm<"float*">, !llvm.i64) -> !llvm<"float*">
+// CHECK-NEXT: %[[sizeof:.*]] = llvm.ptrtoint %[[gep]] : !llvm<"float*"> to !llvm.i64
+// CHECK-NEXT: %[[bytes:.*]] = llvm.mul %[[num_elems]], %[[sizeof]] : !llvm.i64
+// CHECK-NEXT: %[[allocated:.*]] = llvm.alloca %[[bytes]] x !llvm.float : (!llvm.i64) -> !llvm<"float*">
+%0 = alloca() : memref<32x18xf32>
+
+// Test with explicitly specified alignment. llvm.alloca takes care of the
+// alignment. The same pointer is thus used for allocation and aligned
+// accesses.
+// CHECK: %[[alloca_aligned:.*]] = llvm.alloca %{{.*}} x !llvm.float {alignment = 32 : i64} : (!llvm.i64) -> !llvm<"float*">
+// CHECK: %[[desc:.*]] = llvm.mlir.undef : !llvm<"{ float*, float*, i64, [2 x i64], [2 x i64] }">
+// CHECK: %[[desc1:.*]] = llvm.insertvalue %[[alloca_aligned]], %[[desc]][0] : !llvm<"{ float*, float*, i64, [2 x i64], [2 x i64] }">
+// CHECK: llvm.insertvalue %[[alloca_aligned]], %[[desc1]][1] : !llvm<"{ float*, float*, i64, [2 x i64], [2 x i64] }">
+alloca() {alignment = 32} : memref<32x18xf32>
+return %0 : memref<32x18xf32>
+}
+
+// -----
+
 // CHECK-LABEL: func @static_dealloc
 // BAREPTR-LABEL: func @static_dealloc(%{{.*}}: !llvm<"float*">) {
 func @static_dealloc(%static: memref<10x8xf32>) {
 // CHECK: %[[ptr:.*]] = llvm.extractvalue %{{.*}}[0] : !llvm<"{ float*, float*, i64, [2 x i64], [2 x i64] }">
 // CHECK-NEXT: %[[bc:.*]] = llvm.bitcast %[[ptr]] : !llvm<"float*"> to !llvm<"i8*">
 // CHECK-NEXT: llvm.call @free(%[[bc]]) : (!llvm<"i8*">) -> ()

 // BAREPTR: %[[ptr:.*]] = llvm.extractvalue %{{.*}}[0] : !llvm<"{ float*, float*, i64, [2 x i64], [2 x i64] }">
@@ 132 lines not shown @@
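The comment in `@static_alloca` says that `llvm.alloca` handles the alignment itself, so the allocated pointer and the aligned pointer in the descriptor are the same value. The sketch below illustrates that idea in plain C++ under stated assumptions: `alignUp`, `isAligned`, and `StackBuffer` are hypothetical names, and `alignas(32)` on a local object stands in for `alloca() {alignment = 32}`. Rounding a pointer up with `alignUp` is what an allocator without alignment support would need to do; it is included for contrast and is not taken from the diff.

```cpp
#include <cassert>
#include <cstdint>

// Round an address up to the next multiple of `alignment`, which is assumed
// to be a power of two.
inline std::uintptr_t alignUp(std::uintptr_t address,
                              std::uintptr_t alignment) {
  return (address + alignment - 1) & ~(alignment - 1);
}

// True when the pointer value is a multiple of `alignment`.
inline bool isAligned(const void *p, std::uintptr_t alignment) {
  return reinterpret_cast<std::uintptr_t>(p) % alignment == 0;
}

// Stack storage loosely analogous to
//   alloca() {alignment = 32} : memref<32x18xf32>
// The alignment is requested from the compiler, so no manual rounding of the
// pointer is needed afterwards.
struct alignas(32) StackBuffer {
  float data[32 * 18];
};
```

With an aligned stack object, `alignUp` applied to its address returns the address unchanged, which is the situation the test comment describes: one pointer serves as both the allocated and the aligned pointer.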

mlir/test/IR/memory-ops.mlir

@@ 27 lines not shown @@ ^bb0:
 // b/116054838 Parser crash while parsing ill-formed AllocOp
 // CHECK: %4 = alloc() : memref<2xi32>
 %4 = alloc() : memref<2 x i32>

 // CHECK: return
 return
 }

+// CHECK-LABEL: func @alloca() {
+func @alloca() {
+^bb0:
+// Test simple alloc.
+// CHECK: %0 = alloca() : memref<1024x64xf32, 1>
+%0 = alloca() : memref<1024x64xf32, affine_map<(d0, d1) -> (d0, d1)>, 1>
+
+%c0 = "std.constant"() {value = 0: index} : () -> index
+%c1 = "std.constant"() {value = 1: index} : () -> index
+
+// Test alloca with dynamic dimensions.
+// CHECK: %1 = alloca(%c0, %c1) : memref<?x?xf32, 1>
+%1 = alloca(%c0, %c1) : memref<?x?xf32, affine_map<(d0, d1) -> (d0, d1)>, 1>
+
+// Test alloca with no dynamic dimensions and one symbol.
+// CHECK: %2 = alloca()[%c0] : memref<2x4xf32, #map0, 1>
+%2 = alloca()[%c0] : memref<2x4xf32, affine_map<(d0, d1)[s0] -> ((d0 + s0), d1)>, 1>
+
+// Test alloca with dynamic dimensions and one symbol.
+// CHECK: %3 = alloca(%c1)[%c0] : memref<2x?xf32, #map0, 1>
+%3 = alloca(%c1)[%c0] : memref<2x?xf32, affine_map<(d0, d1)[s0] -> (d0 + s0, d1)>, 1>
+
+// Alloca with no mappings, but with alignment.
+// CHECK: %4 = alloca() {alignment = 64 : i64} : memref<2xi32>
+%4 = alloca() {alignment = 64} : memref<2 x i32>
+
+return
+}
+
 // CHECK-LABEL: func @dealloc() {
 func @dealloc() {
 ^bb0:
 // CHECK: %0 = alloc() : memref<1024x64xf32>
 %0 = alloc() : memref<1024x64xf32, affine_map<(d0, d1) -> (d0, d1)>, 0>

 // CHECK: dealloc %0 : memref<1024x64xf32>
 dealloc %0 : memref<1024x64xf32, affine_map<(d0, d1) -> (d0, d1)>, 0>
@@ 46 lines not shown @@
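The `@alloca` cases exercise the two operand lists of the op: the operands in parentheses supply one size per `?` dimension, and the operands in square brackets supply one value per symbol of the layout map. A minimal sketch of that counting rule follows. The function name and the `-1` encoding for a dynamic dimension are assumptions made for this example and are not copied from the MLIR verifier.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Assumed encoding for a '?' dimension in this sketch.
constexpr std::int64_t kDynamicDim = -1;

// Returns true when the operand lists of an alloc-like op are consistent with
// its result type: one dimension operand per dynamic dimension, and one
// symbol operand per symbol in the layout map.
inline bool operandCountsMatch(const std::vector<std::int64_t> &shape,
                               std::size_t numLayoutSymbols,
                               std::size_t numDimOperands,
                               std::size_t numSymbolOperands) {
  const std::size_t numDynamic = static_cast<std::size_t>(
      std::count(shape.begin(), shape.end(), kDynamicDim));
  return numDimOperands == numDynamic &&
         numSymbolOperands == numLayoutSymbols;
}
```

For example, `alloca(%c1)[%c0] : memref<2x?xf32, #map0, 1>` has one dynamic dimension and a one-symbol map, so it takes exactly one operand in each list.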

mlir/test/Transforms/canonicalize.mlir

@@ 386 lines not shown @@
 func @dyn_shape_fold(%L : index, %M : index) -> (memref<? x ? x i32>, memref<? x ? x f32>) {
 // CHECK: %c0 = constant 0 : index
 %zero = constant 0 : index
 // The constants below disappear after they propagate into shapes.
 %nine = constant 9 : index
 %N = constant 1024 : index
 %K = constant 512 : index

-// CHECK-NEXT: %0 = alloc(%arg0) : memref<?x1024xf32>
+// CHECK-NEXT: alloc(%arg0) : memref<?x1024xf32>
 %a = alloc(%L, %N) : memref<? x ? x f32>

-// CHECK-NEXT: %1 = alloc(%arg1) : memref<4x1024x8x512x?xf32>
+// CHECK-NEXT: alloc(%arg1) : memref<4x1024x8x512x?xf32>
 %b = alloc(%N, %K, %M) : memref<4 x ? x 8 x ? x ? x f32>

-// CHECK-NEXT: %2 = alloc() : memref<512x1024xi32>
+// CHECK-NEXT: alloc() : memref<512x1024xi32>
 %c = alloc(%K, %N) : memref<? x ? x i32>

+// CHECK: alloc() : memref<9x9xf32>
+%d = alloc(%nine, %nine) : memref<? x ? x f32>
+
+// CHECK: alloca(%arg1) : memref<4x1024x8x512x?xf32>
+%e = alloca(%N, %K, %M) : memref<4 x ? x 8 x ? x ? x f32>
+
 // CHECK: affine.for
 affine.for %i = 0 to %L {
 // CHECK-NEXT: affine.for
 affine.for %j = 0 to 10 {
 // CHECK-NEXT: load %0[%arg2, %arg3] : memref<?x1024xf32>
 // CHECK-NEXT: store %{{.*}}, %1[%c0, %c0, %arg2, %arg3, %c0] : memref<4x1024x8x512x?xf32>
 %v = load %a[%i, %j] : memref<?x?xf32>
 store %v, %b[%zero, %zero, %i, %j, %zero] : memref<4x?x8x?x?xf32>
 }
 }

-// CHECK: alloc() : memref<9x9xf32>
-%d = alloc(%nine, %nine) : memref<? x ? x f32>
-
 return %c, %d : memref<? x ? x i32>, memref<? x ? x f32>
 }

 #map1 = affine_map<(d0, d1)[s0, s1, s2] -> (d0 * s1 + d1 * s2 + s0)>
 #map2 = affine_map<(d0, d1, d2)[s0, s1, s2] -> (d0 * s2 + d1 * s1 + d2 + s0)>

 // CHECK-LABEL: func @dim_op_fold(%arg0: index, %arg1: index, %arg2: index,
 func @dim_op_fold(%arg0: index, %arg1: index, %arg2: index, %BUF: memref<?xi8>, %M : index, %N : index, %K : index) {
@@ 494 lines not shown @@
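The `@dyn_shape_fold` expectations show what the constant-folding pattern does to an alloc-like op: every `?` dimension whose size operand is a constant is written into the result shape, and only the non-constant operands remain. A standalone sketch of that shape computation is given below. It is not the MLIR pattern itself: `SizeOperand`, `FoldedAlloc`, `foldConstantDims`, and the `-1` encoding of a dynamic dimension are hypothetical names and conventions chosen for the example.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Assumed encoding for a '?' dimension in this sketch.
constexpr std::int64_t kDynamic = -1;

// A size operand: either a known constant or an opaque runtime value.
struct SizeOperand {
  bool isConstant;
  std::int64_t value; // meaningful only when isConstant is true
};

struct FoldedAlloc {
  bool changed;                          // false when no operand was constant
  std::vector<std::int64_t> shape;       // shape after folding
  std::vector<std::size_t> keptOperands; // indices of operands still needed
};

// Walk the shape; each dynamic dimension consumes the next operand. Constant
// operands become static sizes, all others stay dynamic and are kept.
inline FoldedAlloc foldConstantDims(const std::vector<std::int64_t> &shape,
                                    const std::vector<SizeOperand> &operands) {
  FoldedAlloc result{false, {}, {}};
  std::size_t next = 0;
  for (std::int64_t dim : shape) {
    if (dim != kDynamic) {
      result.shape.push_back(dim);
      continue;
    }
    assert(next < operands.size() && "one operand per dynamic dimension");
    const SizeOperand &operand = operands[next];
    if (operand.isConstant) {
      result.shape.push_back(operand.value);
      result.changed = true;
    } else {
      result.shape.push_back(kDynamic);
      result.keptOperands.push_back(next);
    }
    ++next;
  }
  return result;
}
```

Applied to `alloca(%N, %K, %M) : memref<4 x ? x 8 x ? x ? x f32>` with `%N = 1024` and `%K = 512`, the sketch yields the shape `4x1024x8x512x?` and keeps only the third operand, which matches the `alloca(%arg1) : memref<4x1024x8x512x?xf32>` CHECK line. The pattern in the diff then inserts a `memref_cast` back to the original type so existing uses still type-check.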

Revision Contents

Diff 255628
