This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
mlir/
-
include/mlir/Dialect/StandardOps/IR/
-
mlir/
-
Dialect/
-
StandardOps/
-
IR/
25/25
Ops.td
-
lib/
-
Conversion/StandardToLLVM/
-
StandardToLLVM/
6/6
StandardToLLVM.cpp
-
Dialect/StandardOps/IR/
-
StandardOps/
-
IR/
2/2
Ops.cpp
-
test/
-
Conversion/StandardToLLVM/
-
StandardToLLVM/
-
convert-dynamic-memref-ops.mlir
-
convert-static-memref-ops.mlir
-
IR/
-
memory-ops.mlir
-
Transforms/
-
canonicalize.mlir

Differential D76602

[MLIR] Introduce std.alloca op
ClosedPublic

Authored by bondhugula on Mar 23 2020, 5:21 AM.

Download Raw Diff

Details

Reviewers

mehdi_amini
rriddle
ftynse
nicolasvasilache

Commits

rG7023f4b4cb01: [MLIR] Introduce std.alloca op

Summary

Introduce the alloca op for stack memory allocation. When converting to the
LLVM dialect, this is lowered to an llvm.alloca.

Signed-off-by: Uday Bondhugula <uday@polymagelabs.com>

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

bondhugula created this revision.Mar 23 2020, 5:21 AM

Herald added a project: Restricted Project. · View Herald TranscriptMar 23 2020, 5:21 AM

Herald added subscribers: llvm-commits, Joonsoo, liufengdb and 9 others. · View Herald Transcript

nicolasvasilache requested changes to this revision.Mar 23 2020, 5:57 AM

The amount of copy-pasta is uncanny, is there a way to factor out the 90%+ common part?
Also, please use the assemblyFormat for parsing and printing,

This revision now requires changes to proceed.Mar 23 2020, 5:57 AM

In D76602#1936610, @nicolasvasilache wrote:

Also, please use the assemblyFormat for parsing and printing,

Can the assembly format support its parsing and printing? Both affine.apply and alloc don't use the assembly format and have identical operand syntax (affine.apply was recently migrated to ODS but not auto print / parse). All these three require two variadic operand lists. Let me know.

Harbormaster failed remote builds in B50099: Diff 251999!Mar 23 2020, 6:31 AM

In D76602#1936610, @nicolasvasilache wrote:

The amount of copy-pasta is uncanny, is there a way to factor out the 90%+ common part?
Also, please use the assemblyFormat for parsing and printing,

I was thinking the same. The duplication is unfortunate and it's all in StandardToLLVM - it's possible to factor it all out if we get rid of useAlloca because the AllocOp lowering is conditional on that. Should I remove useAlloca in this patch itself and mostly merge AllocOpLowering and AllocaOpLowering? Merging the parse/print methods is straightforward.

Share print/parse/verify b/w alloc/alloca.

Refactor ODS for alloc/alloca

Harbormaster completed remote builds in B50261: Diff 252332.Mar 24 2020, 9:07 AM

Harbormaster failed remote builds in B50259: Diff 252331!

Fix format

Harbormaster failed remote builds in B50264: Diff 252338!Mar 24 2020, 9:39 AM

In D76602#1936765, @bondhugula wrote:

In D76602#1936610, @nicolasvasilache wrote:

The amount of copy-pasta is uncanny, is there a way to factor out the 90%+ common part?

I was thinking the same. The duplication is unfortunate and it's all in StandardToLLVM - it's possible to factor it all out if we get rid of useAlloca because the AllocOp lowering is conditional on that. Should I remove useAlloca in this patch itself and mostly merge AllocOpLowering and AllocaOpLowering? Merging the parse/print methods is straightforward.

@nicolasvasilache Done with the refactoring here (except for the llvm lowering - see question above).

Harbormaster completed remote builds in B50268: Diff 252346.Mar 24 2020, 10:12 AM

rriddle added inline comments.Mar 24 2020, 5:33 PM

mlir/include/mlir/Dialect/StandardOps/IR/Ops.td
144	Use ValueRange instead of ArrayRef<Value> in builder mehtods.
252	nit: "alloca" -> `alloca`
256	Please use mlir code blocks for any inline code.
mlir/lib/Dialect/StandardOps/IR/Ops.cpp
250	llvm::is_one_of

Address comments from @rriddle

Thanks for the review!

mlir/lib/Dialect/StandardOps/IR/Ops.cpp
250	Thanks!

Harbormaster completed remote builds in B50340: Diff 252474.Mar 24 2020, 7:42 PM

Thanks for adding the alloca op! Really needed.
Not sure if you discussed this already but just a nit about the name: any plans on renaming alloca and alloc so that it's a bit clearer what they model? I find it a bit confusing right now. Some options that came to mind:
alloca -> salloc, salloca
alloc -> malloc, malloca, alloc

In D76602#1941785, @dcaballe wrote:

Thanks for adding the alloca op! Really needed.
Not sure if you discussed this already but just a nit about the name: any plans on renaming alloca and alloc so that it's a bit clearer what they model? I find it a bit confusing right now. Some options that came to mind:
alloca -> salloc, salloca
alloc -> malloc, malloca, alloc

alloca -> salloca sounds good to me! Note that the 'a' suffix in 'alloca' is for 'automatic' freeing (originating from early Unix's and BSDs and it has always meant allocating from the caller's stack). So, alloc -> malloca, alloca -> salloc would be inconsistent. Since 'alloc' currently doesn't specify where it's from the stack/heap and specifies it's explicitly freed via dealloc, we can leave it like that.

What is the s in salloc? Stack? Can we make it explicit then: stack_alloc?

Note that the 'a' suffix in 'alloca' is for 'automatic' freeing

Thanks for clarifying! I thought it was just a short for 'allocate' :)

Can we make it explicit then: stack_alloc?

stack_alloc sounds better to me, thanks.

Since 'alloc' currently doesn't specify where it's from the stack/heap and specifies it's explicitly freed via dealloc, we can leave it like that.

It sounds good. I think we are mapping 'alloc' to static allocation by using a flag in the llvm lowering. Maybe can create a simple pass to do a more proper alloc->static_alloc conversion in the future and leave 'alloc' only for heap allocation.

@bondhugula
Thanks for the refactoring!

Dropping the LLVM flag and refactoring the impl further sounds like the right thing to do IMO.
The unit test that uses the flag can easily be updated (or deprecated in favor of your test).
The internal use case we have for this will be easy to update.

Herald added a subscriber: grosul1. · View Herald TranscriptMar 30 2020, 6:29 PM

In D76602#1951483, @nicolasvasilache wrote:

@bondhugula
Thanks for the refactoring!

Dropping the LLVM flag and refactoring the impl further sounds like the right thing to do IMO.
The unit test that uses the flag can easily be updated (or deprecated in favor of your test).
The internal use case we have for this will be easy to update.

This sounds the best to me too. I'll do it in this patch.

Refactored std to llvm lowering for alloc op to reuse for alloca. The lowering
rewrite was really long. Broken down with multiple helpers now: this is also
ready to now use aligned_alloc in place of malloc for heap allocations.

In D76602#1954032, @bondhugula wrote:

In D76602#1951483, @nicolasvasilache wrote:

@bondhugula
Thanks for the refactoring!

Dropping the LLVM flag and refactoring the impl further sounds like the right thing to do IMO.
The unit test that uses the flag can easily be updated (or deprecated in favor of your test).
The internal use case we have for this will be easy to update.

This sounds the best to me too. I'll do it in this patch.

Done @nicolasvasilache PTAL.

Update comments.

bondhugula added a child revision: D77528: [MLIR] Add support to use aligned_alloc to lower AllocOp from std to llvm.Apr 6 2020, 12:55 AM

Harbormaster completed remote builds in B51888: Diff 255240.Apr 6 2020, 1:36 AM

Harbormaster completed remote builds in B51887: Diff 255239.

ftynse added inline comments.Apr 6 2020, 5:49 AM

mlir/include/mlir/Dialect/StandardOps/IR/Ops.td
252	Nit: I think the intention of the comment above was to ask you to use backticks instead of double quotes.
253	Could you please elaborate what is a stack frame in MLIR? We don't seem to have this concept defined anywhere. In particular, is it only related to `std.func`, or can one register other ops that create stack frames?
mlir/lib/Conversion/StandardToLLVM/StandardToLLVM.cpp
1568	Nit: can we rather define `one` at the call site and pass it here (and to another call) ?

bondhugula marked 5 inline comments as done.Apr 6 2020, 7:02 AM

bondhugula added inline comments.

mlir/include/mlir/Dialect/StandardOps/IR/Ops.td
253	A stack frame here refers to the standard stack frame concept in CS that we know of! It's up to the conversion out of MLIR to realize this correctly.
mlir/lib/Conversion/StandardToLLVM/StandardToLLVM.cpp
1568	The reason I didn't do that is that the order of the instructions in that case wouldn't be natural - there would be other stuff (the size constant definitions) between the def of 'one' and its first use here. So I left it this way.

Address review comments.

ftynse added inline comments.Apr 6 2020, 7:15 AM

mlir/include/mlir/Dialect/StandardOps/IR/Ops.td
253	MLIR makes us rethink a lot of things. Like MLIR does not have functions as a first-class concept :) I could obviously guess your intention with stack frames, but I would still insist you think it through in context of MLIR. Similarly to functions, I don't think we have a built-in memory model that has a stack... Or that `std.func` semantics says something about stack frames. IIRC, this was one of the conceptual problems of having `alloca` in the first place. What should happen if I do the following? my.func @foo() { alloca ... } Or std.func @foo() { %0 = my.func_that_may_or_may_not_be_a_lambda @inner() { alloca ... } } "Conversion out of MLIR" does not mean anything to me either. Do you mean the translation of LLVM dialect to LLVM IR? There are other passes that may be using the standard dialect that are completely unaware of that.

bondhugula marked an inline comment as done.Apr 6 2020, 7:56 AM

bondhugula added inline comments.

mlir/include/mlir/Dialect/StandardOps/IR/Ops.td
253	Irrespective of which case it is, it depends on the ops around it and for the folks defining those ops to realize it. What you need to keep in mind is that an alloca's memory is automatically freed (i.e., you won't find a dealloc) and it disappears at the time the stack frame goes away whenever such a concept exists. Now, one is of course free to transform it and implement the auto freeing in another way within MLIR itself. Heap allocations for eg. get promoted to stack ones (eg. in LLVM's passes), and for whatever reason, one could decide to switch a stack allocation to a heap one. That won't be an incorrect transform. So it ultimately may not even be realized on the stack (let alone your question of when it should be freed). It's the intent of the op when you actually see in the IR that is of an allocation that is freed automatically when the stack frame goes away.

Harbormaster completed remote builds in B51932: Diff 255319.Apr 6 2020, 8:06 AM

Thanks Uday, looks good!

Accepting conditioned on the LLVM options struct change.

mlir/include/mlir/Dialect/StandardOps/IR/Ops.td
144	Note: I tend to prefer extra builders that take all PODs instead of attributes. They are more natural to use and just work nicely once wrapped into EDSCs.
231	typo %d1
mlir/lib/Conversion/StandardToLLVM/StandardToLLVM.cpp
1513	I think the change in indentation (?) here and everywhere is unfortunate, it does not properly track what code moved (materialized with a yellow phab horizontal bar on the left) and make it harder to see what changed vs what moved. I am mostly eyeballing the code after realizing most of this is just moved. Please flag specific things need deeper review if appropriate.
3178	I am concerned by the silent behavior breaking changes here and in the followup revision. I am expecting this will be painful for integrations and will repeat itself. Can we turn this into a: struct LowerToLLVMOptions { ... }

This revision is now accepted and ready to land.Apr 6 2020, 9:06 AM

ftynse requested changes to this revision.Apr 7 2020, 2:42 AM

ftynse added inline comments.

mlir/include/mlir/Dialect/StandardOps/IR/Ops.td
253	I don't disagree with anything you say. I am merely pointing out that it is absolutely unclear from the op definition _when_ the allocated memory will be automatically freed. And "when the stack frame goes away" is not a satisfactory definition, because it is meaningless under the current MLIR semantics as it is written. I could literally add the `stack_frame.go_away()` operation tomorrow and expect it to free the allocations... If you could instead tie it to something like "std.func" returning or the region of an operation with FunctionLike trait transferring control flow back to its enclosing op, it would be more MLIR-compatible.

This revision now requires changes to proceed.Apr 7 2020, 2:42 AM

mehdi_amini added inline comments.Apr 7 2020, 2:47 AM

mlir/include/mlir/Dialect/StandardOps/IR/Ops.td
253	I agree with Alex that this could deserve a more careful way of describing this. In particular the "stack" isn't very important here, we really need a scope. Could we use a trait like we have for "IsolatedFromAbove" to control this scoping? Unless we just consider every region as a new scope for these allocations?

Address @nicolasvasilache review comments.

bondhugula added inline comments.Apr 7 2020, 3:15 AM

mlir/include/mlir/Dialect/StandardOps/IR/Ops.td
144	Hmm... I didn't change the existing alloc op builder here; just using the same for Alloca op. Yes, this can be changed to just take int64_t in another revision.
mlir/lib/Conversion/StandardToLLVM/StandardToLLVM.cpp
1513	All of this is just moved. Nothing in the alloc op lowering actually changed with this - I'm just reusing the common parts for alloca.
3178	That's a concern - there are going to be silent breaking effects. I've changed it to struct here. But even with a struct, you'll have the same issue with list initialization (some of the fields will remain uninitialized / wrongly initialized depending on where the field was removed from or where the new field was added). If the new fields are always added at the end, we'll just have uninitialized fields. And with explicit field-wise init, it'll just lead to uninitialized stuff. BTW, none of the three options are documented at the declaration (but only in Passes.td).

bondhugula marked an inline comment as done.Apr 7 2020, 3:29 AM

bondhugula added inline comments.

mlir/include/mlir/Dialect/StandardOps/IR/Ops.td
253	Looks like these messages were in flight while I updated and committed. The rephrasing can be addressed in a future revision while we continue discussion here. the current MLIR semantics as it is written. I could literally add the stack_frame.go_away() operation tomorrow and expect it to free the allocations. The 'a' suffix in alloca is for automatic freeing - it can't be explicitly freed using another op. If you are imagining a low level op that exists that manipulates the stack frame, this would be a pathological case that it has freed it by circumventing things. In particular the "stack" isn't very important here, we really need a scope. Could we use a trait like we have for IsolatedFromAbove" to Tying it to a scope like that of a closest surrounding op with a function like trait (or isolatedFromAbove) is almost fine, but ultimately not perfect since imperative function like ops like std.execute_region don't have function like traits, and we may want to imply freeing at that scope.

Harbormaster failed remote builds in B52132: Diff 255622!Apr 7 2020, 3:45 AM

This revision was not accepted when it landed; it landed in state Needs Review.Apr 7 2020, 3:45 AM

Closed by commit rG7023f4b4cb01: [MLIR] Introduce std.alloca op (authored by bondhugula). · Explain Why

This revision was automatically updated to reflect the committed changes.

ftynse added inline comments.Apr 7 2020, 4:55 AM

mlir/include/mlir/Dialect/StandardOps/IR/Ops.td
253	The 'a' suffix in alloca is for automatic freeing - it can't be explicitly freed using another op. Why not? I would not constraint the behavior of every possible op based on a one-letter suffix of another op... "Automatic" does not mean it cannot be connected to another op. In fact, in MLIR, it will likely be connected to another op because functions are ops. Tying it to a scope like that of a closest surrounding op with a function like trait (or isolatedFromAbove) is almost fine, but ultimately not perfect since imperative function like ops like std.execute_region don't have function like traits, and we may want to imply freeing at that scope. IIUC, the idea is to introduce a new trait `AutomaticAllocationScope` that is orthogonal to `FunctionLike`. We can then make, e.g., `std.func`, `llvm.func` and `std.execute_region` have this trait, and let other ops opt-in to the automatic allocation/deallocation behavior. This would be my ideal solution, but I hesitated to push it since it may involve larger changes to this patch, trying to find a simpler solution like attaching to the function trait first. Now that the patch has landed, I think introducing a separate trait is the right thing to do.

bondhugula marked 4 inline comments as done.Apr 7 2020, 6:26 AM

bondhugula added inline comments.

mlir/include/mlir/Dialect/StandardOps/IR/Ops.td
253	mean it cannot be connected to another op. In fact, in MLIR, it will likely be connected to another op because functions are ops. Automatically there implies it has to be freed automatically, not by another op. The moment you want to explicitly free that, you'll have to rewrite it to something else - for eg. alloc/dealloc pair. like attaching to the function trait first. Now that the patch has landed, I think introducing a separate trait is the right thing to do. Strictly speaking, I also feel adding another trait is the right option. But for now and to keep it lightweight until we have such use cases, 'freed when the closest surrounding op with FunctionLike trait returns' is a good approximation for 'stack frame being discarded'.

ftynse added inline comments.Apr 7 2020, 6:47 AM

mlir/include/mlir/Dialect/StandardOps/IR/Ops.td
253	Strictly speaking, I also feel adding another trait is the right option. But for now and to keep it lightweight until we have such use cases, 'freed when the closest surrounding op with FunctionLike trait returns' is a good approximation for 'stack frame being discarded'. Let's have it as "freed when the closest surrounding op with FunctionLike trait has the control transferred back from its body", this avoids the potential interpretation that FunctionLike should also terminate with "std.return" and a weird-sounding "op <..> returns". We may also want to add a verifier check that such an op exists.

mehdi_amini added inline comments.Apr 7 2020, 11:27 AM

mlir/include/mlir/Dialect/StandardOps/IR/Ops.td
253	I am not convinced that the "FunctionLike" trait is the right one here, an op like `gpu.launch` for example would create a new scope in my expectations. What do you think about the trait Uday proposed above `AutomaticAllocationScope` ?

ftynse added a subscriber: herhut.Apr 7 2020, 11:40 AM

ftynse added inline comments.

mlir/include/mlir/Dialect/StandardOps/IR/Ops.td
253	I proposed it :) So I am obviously in favor of it. I think we can start by associating the deallocation with `FunctionLike`, it's a simple documentation+verifier change that we can land fast and avoid having contentious code upstream as well as rolling it back. Then we can implement `AutomaticAllocationScope` and let ops to opt-in (e.g., I'd expect @herhut to decide whether we want `alloca`s in GPU at all).

mehdi_amini added inline comments.Apr 7 2020, 7:20 PM

mlir/include/mlir/Dialect/StandardOps/IR/Ops.td
253	I think we can start by associating the deallocation with FunctionLike, I'm afraid this will lead to wrong assumption, I rather have code written with the right trait checked from the beginning.

bondhugula marked 6 inline comments as done.Apr 7 2020, 9:57 PM

bondhugula added inline comments.

mlir/include/mlir/Dialect/StandardOps/IR/Ops.td
253	It is clear that using a new trait for scoped allocation will be accurate here. FunctionLike will always imply a new scope for stack allocation. Introducing the trait should be straightforward - okay to update the description when we introduce the trait.

mehdi_amini added inline comments.Apr 8 2020, 8:54 PM

mlir/include/mlir/Dialect/StandardOps/IR/Ops.td
253	Are you adding the trait? I think this should be done now: I wouldn't want code that start checking for "FunctionLike" where it should check for the "AutomaticAllocationScope" trait.

Herald added a subscriber: frgossen. · View Herald TranscriptApr 8 2020, 8:54 PM

bondhugula marked 2 inline comments as done.Apr 8 2020, 9:27 PM

bondhugula added inline comments.

mlir/include/mlir/Dialect/StandardOps/IR/Ops.td
253	Alex, are you interested in / are you able to add this trait? If not, I can do it right away.

ftynse added inline comments.Apr 9 2020, 1:23 AM

mlir/include/mlir/Dialect/StandardOps/IR/Ops.td
253	I have quite a long todo list...

bondhugula marked 3 inline comments as done.Apr 9 2020, 3:16 AM

bondhugula added inline comments.

mlir/include/mlir/Dialect/StandardOps/IR/Ops.td
253	D77787

Revision Contents

Path

Size

mlir/

include/

mlir/

Dialect/

StandardOps/

IR/

Ops.td

109 lines

lib/

Conversion/

StandardToLLVM/

StandardToLLVM.cpp

147 lines

Dialect/

StandardOps/

IR/

Ops.cpp

47 lines

test/

Conversion/

StandardToLLVM/

convert-dynamic-memref-ops.mlir

36 lines

convert-static-memref-ops.mlir

26 lines

IR/

memory-ops.mlir

29 lines

Transforms/

canonicalize.mlir

15 lines

Diff 252338

mlir/include/mlir/Dialect/StandardOps/IR/Ops.td

Show First 20 Lines • Show All 118 Lines • ▼ Show 20 Lines
// type, or a floating point tensor. The custom assembly form of the operation		// type, or a floating point tensor. The custom assembly form of the operation
// is as follows		// is as follows
//		//
// <op>f %0, %1 : f32		// <op>f %0, %1 : f32
class FloatArithmeticOp<string mnemonic, list<OpTrait> traits = []> :		class FloatArithmeticOp<string mnemonic, list<OpTrait> traits = []> :
ArithmeticOp<mnemonic, traits>,		ArithmeticOp<mnemonic, traits>,
Arguments<(ins FloatLike:$lhs, FloatLike:$rhs)>;		Arguments<(ins FloatLike:$lhs, FloatLike:$rhs)>;

		// Base class for memref allocating ops: alloca and alloc.
		//
		// %0 = alloclike(%m)[%s] : memref<8x?xf32, (d0, d1)[s0] -> ((d0 + s0), d1)>
		//
		class AllocLikeOp<string mnemonic, list<OpTrait> traits = []> :
		Std_Op<mnemonic, traits> {

		let arguments = (ins Variadic<Index>:$value,
		Confined<OptionalAttr<I64Attr>, [IntMinValue<0>]>:$alignment);
		let results = (outs AnyMemRef);

		let builders = [OpBuilder<
		"Builder *builder, OperationState &result, MemRefType memrefType", [{
		result.types.push_back(memrefType);
		}]>,
		OpBuilder<
		"Builder *builder, OperationState &result, MemRefType memrefType, " #
		"ArrayRef<Value> operands, IntegerAttr alignment = IntegerAttr()", [{
		rriddleUnsubmitted Done Reply Inline Actions Use ValueRange instead of ArrayRef<Value> in builder mehtods. rriddle: Use ValueRange instead of ArrayRef<Value> in builder mehtods.
		nicolasvasilacheUnsubmitted Done Reply Inline Actions Note: I tend to prefer extra builders that take all PODs instead of attributes. They are more natural to use and just work nicely once wrapped into EDSCs. nicolasvasilache: Note: I tend to prefer extra builders that take all PODs instead of attributes. They are more…
		bondhugulaAuthorUnsubmitted Done Reply Inline Actions Hmm... I didn't change the existing alloc op builder here; just using the same for Alloca op. Yes, this can be changed to just take int64_t in another revision. bondhugula: Hmm... I didn't change the existing alloc op builder here; just using the same for Alloca op.
		result.addOperands(operands);
		result.types.push_back(memrefType);
		if (alignment)
		result.addAttribute(getAlignmentAttrName(), alignment);
		}]>];

		let extraClassDeclaration = [{
		static StringRef getAlignmentAttrName() { return "alignment"; }

		MemRefType getType() { return getResult().getType().cast<MemRefType>(); }

		/// Returns the number of symbolic operands (the ones in square brackets),
		/// which bind to the symbols of the memref's layout map.
		unsigned getNumSymbolicOperands() {
		return getNumOperands() - getType().getNumDynamicDims();
		}

		/// Returns the symbolic operands (the ones in square brackets), which bind
		/// to the symbols of the memref's layout map.
		operand_range getSymbolicOperands() {
		return {operand_begin() + getType().getNumDynamicDims(), operand_end()};
		}

		/// Returns the dynamic sizes for this alloc operation if specified.
		operand_range getDynamicSizes() { return getOperands(); }
		}];

		let parser = [{ return ::parseAllocLikeOp(parser, result); }];

		let hasCanonicalizer = 1;
		}

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// AbsFOp		// AbsFOp
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

def AbsFOp : FloatUnaryOp<"absf"> {		def AbsFOp : FloatUnaryOp<"absf"> {
let summary = "floating point absolute-value operation";		let summary = "floating point absolute-value operation";
let description = [{		let description = [{
The `absf` operation computes the absolute value. It takes one operand and		The `absf` operation computes the absolute value. It takes one operand and
Show All 20 Lines	def AddIOp : IntArithmeticOp<"addi", [Commutative]> {
let summary = "integer addition operation";		let summary = "integer addition operation";
let hasFolder = 1;		let hasFolder = 1;
}		}

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// AllocOp		// AllocOp
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

def AllocOp : Std_Op<"alloc"> {		def AllocOp : AllocLikeOp<"alloc"> {
let summary = "memory allocation operation";		let summary = "memory allocation operation";
let description = [{		let description = [{
The "alloc" operation allocates a region of memory, as specified by its		The "alloc" operation allocates a region of memory, as specified by its
memref type. For example:		memref type. For example:

%0 = alloc() : memref<8x64xf32, (d0, d1) -> (d0, d1), 1>		%0 = alloc() : memref<8x64xf32, (d0, d1) -> (d0, d1), 1>

The optional list of dimension operands are bound to the dynamic dimensions		The optional list of dimension operands are bound to the dynamic dimensions
specified in its memref type. In the example below, the ssa value '%d' is		specified in its memref type. In the example below, the ssa value '%d' is
bound to the second dimension of the memref (which is dynamic).		bound to the second dimension of the memref (which is dynamic).

%0 = alloc(%d) : memref<8x?xf32, (d0, d1) -> (d0, d1), 1>		%0 = alloc(%d) : memref<8x?xf32, (d0, d1) -> (d0, d1), 1>

The optional list of symbol operands are bound to the symbols of the		The optional list of symbol operands are bound to the symbols of the
memrefs affine map. In the example below, the ssa value '%s' is bound to		memrefs affine map. In the example below, the ssa value '%s' is bound to
the symbol 's0' in the affine map specified in the allocs memref type.		the symbol 's0' in the affine map specified in the allocs memref type.

%0 = alloc()[%s] : memref<8x64xf32, (d0, d1)[s0] -> ((d0 + s0), d1), 1>		%0 = alloc()[%s] : memref<8x64xf32, (d0, d1)[s0] -> ((d0 + s0), d1), 1>
		nicolasvasilacheUnsubmitted Done Reply Inline Actions typo %d1 nicolasvasilache: typo %d1

This operation returns a single ssa value of memref type, which can be used		This operation returns a single ssa value of memref type, which can be used
by subsequent load and store operations.		by subsequent load and store operations.

The optional `alignment` attribute may be specified to ensure that the		The optional `alignment` attribute may be specified to ensure that the
region of memory that will be indexed is aligned at the specified byte		region of memory that will be indexed is aligned at the specified byte
boundary. TODO(b/144281289) optional alignment attribute to MemRefType.		boundary. TODO(b/144281289) optional alignment attribute to MemRefType.

%0 = alloc()[%s] {alignment = 8} :		%0 = alloc()[%s] {alignment = 8} :
memref<8x64xf32, (d0, d1)[s0] -> ((d0 + s0), d1), 1>		memref<8x64xf32, (d0, d1)[s0] -> ((d0 + s0), d1), 1>
}];		}];
		}

let arguments = (ins Variadic<Index>:$value,		//===----------------------------------------------------------------------===//
Confined<OptionalAttr<I64Attr>, [IntMinValue<0>]>:$alignment);		// AllocaOp
let results = (outs AnyMemRef);		//===----------------------------------------------------------------------===//

let builders = [OpBuilder<		def AllocaOp : AllocLikeOp<"alloca"> {
"Builder *builder, OperationState &result, MemRefType memrefType", [{		let summary = "stack memory allocation operation";
result.types.push_back(memrefType);		let description = [{
}]>,		The "alloca" operation allocates memory on the stack, to be automatically
		rriddleUnsubmitted Done Reply Inline Actions nit: "alloca" -> `alloca` rriddle: nit: "alloca" -> `alloca`
		ftynseUnsubmitted Done Reply Inline Actions Nit: I think the intention of the comment above was to ask you to use backticks instead of double quotes. ftynse: Nit: I think the intention of the comment above was to ask you to use backticks instead of…
OpBuilder<		released when the stack frame is discarded. The amount of memory allocated
		ftynseUnsubmitted Done Reply Inline Actions Could you please elaborate what is a stack frame in MLIR? We don't seem to have this concept defined anywhere. In particular, is it only related to `std.func`, or can one register other ops that create stack frames? ftynse: Could you please elaborate what is a stack frame in MLIR? We don't seem to have this concept…
		bondhugulaAuthorUnsubmitted Done Reply Inline Actions A stack frame here refers to the standard stack frame concept in CS that we know of! It's up to the conversion out of MLIR to realize this correctly. bondhugula: A stack frame here refers to the standard stack frame concept in CS that we know of! It's up to…
		ftynseUnsubmitted Done Reply Inline Actions MLIR makes us rethink a lot of things. Like MLIR does not have functions as a first-class concept :) I could obviously guess your intention with stack frames, but I would still insist you think it through in context of MLIR. Similarly to functions, I don't think we have a built-in memory model that has a stack... Or that `std.func` semantics says something about stack frames. IIRC, this was one of the conceptual problems of having `alloca` in the first place. What should happen if I do the following? my.func @foo() { alloca ... } Or std.func @foo() { %0 = my.func_that_may_or_may_not_be_a_lambda @inner() { alloca ... } } "Conversion out of MLIR" does not mean anything to me either. Do you mean the translation of LLVM dialect to LLVM IR? There are other passes that may be using the standard dialect that are completely unaware of that. ftynse: MLIR makes us rethink a lot of things. Like MLIR does not have functions as a first-class…
		bondhugulaAuthorUnsubmitted Done Reply Inline Actions Irrespective of which case it is, it depends on the ops around it and for the folks defining those ops to realize it. What you need to keep in mind is that an alloca's memory is automatically freed (i.e., you won't find a dealloc) and it disappears at the time the stack frame goes away whenever such a concept exists. Now, one is of course free to transform it and implement the auto freeing in another way within MLIR itself. Heap allocations for eg. get promoted to stack ones (eg. in LLVM's passes), and for whatever reason, one could decide to switch a stack allocation to a heap one. That won't be an incorrect transform. So it ultimately may not even be realized on the stack (let alone your question of when it should be freed). It's the intent of the op when you actually see in the IR that is of an allocation that is freed automatically when the stack frame goes away. bondhugula: Irrespective of which case it is, it depends on the ops around it and for the folks defining…
		ftynseUnsubmitted Done Reply Inline Actions I don't disagree with anything you say. I am merely pointing out that it is absolutely unclear from the op definition _when_ the allocated memory will be automatically freed. And "when the stack frame goes away" is not a satisfactory definition, because it is meaningless under the current MLIR semantics as it is written. I could literally add the `stack_frame.go_away()` operation tomorrow and expect it to free the allocations... If you could instead tie it to something like "std.func" returning or the region of an operation with FunctionLike trait transferring control flow back to its enclosing op, it would be more MLIR-compatible. ftynse: I don't disagree with anything you say. I am merely pointing out that it is absolutely unclear…
		mehdi_aminiUnsubmitted Done Reply Inline Actions I agree with Alex that this could deserve a more careful way of describing this. In particular the "stack" isn't very important here, we really need a scope. Could we use a trait like we have for "IsolatedFromAbove" to control this scoping? Unless we just consider every region as a new scope for these allocations? mehdi_amini: I agree with Alex that this could deserve a more careful way of describing this. In particular…
		bondhugulaAuthorUnsubmitted Done Reply Inline Actions Looks like these messages were in flight while I updated and committed. The rephrasing can be addressed in a future revision while we continue discussion here. the current MLIR semantics as it is written. I could literally add the stack_frame.go_away() operation tomorrow and expect it to free the allocations. The 'a' suffix in alloca is for automatic freeing - it can't be explicitly freed using another op. If you are imagining a low level op that exists that manipulates the stack frame, this would be a pathological case that it has freed it by circumventing things. In particular the "stack" isn't very important here, we really need a scope. Could we use a trait like we have for IsolatedFromAbove" to Tying it to a scope like that of a closest surrounding op with a function like trait (or isolatedFromAbove) is almost fine, but ultimately not perfect since imperative function like ops like std.execute_region don't have function like traits, and we may want to imply freeing at that scope. bondhugula: Looks like these messages were in flight while I updated and committed. The rephrasing can be…
		ftynseUnsubmitted Done Reply Inline Actions The 'a' suffix in alloca is for automatic freeing - it can't be explicitly freed using another op. Why not? I would not constraint the behavior of every possible op based on a one-letter suffix of another op... "Automatic" does not mean it cannot be connected to another op. In fact, in MLIR, it will likely be connected to another op because functions are ops. Tying it to a scope like that of a closest surrounding op with a function like trait (or isolatedFromAbove) is almost fine, but ultimately not perfect since imperative function like ops like std.execute_region don't have function like traits, and we may want to imply freeing at that scope. IIUC, the idea is to introduce a new trait `AutomaticAllocationScope` that is orthogonal to `FunctionLike`. We can then make, e.g., `std.func`, `llvm.func` and `std.execute_region` have this trait, and let other ops opt-in to the automatic allocation/deallocation behavior. This would be my ideal solution, but I hesitated to push it since it may involve larger changes to this patch, trying to find a simpler solution like attaching to the function trait first. Now that the patch has landed, I think introducing a separate trait is the right thing to do. ftynse: > The 'a' suffix in alloca is for automatic freeing - it can't be explicitly freed using…
		bondhugulaAuthorUnsubmitted Done Reply Inline Actions mean it cannot be connected to another op. In fact, in MLIR, it will likely be connected to another op because functions are ops. Automatically there implies it has to be freed automatically, not by another op. The moment you want to explicitly free that, you'll have to rewrite it to something else - for eg. alloc/dealloc pair. like attaching to the function trait first. Now that the patch has landed, I think introducing a separate trait is the right thing to do. Strictly speaking, I also feel adding another trait is the right option. But for now and to keep it lightweight until we have such use cases, 'freed when the closest surrounding op with FunctionLike trait returns' is a good approximation for 'stack frame being discarded'. bondhugula: >mean it cannot be connected to another op. In fact, in MLIR, it will >likely be connected to…
		ftynseUnsubmitted Done Reply Inline Actions Strictly speaking, I also feel adding another trait is the right option. But for now and to keep it lightweight until we have such use cases, 'freed when the closest surrounding op with FunctionLike trait returns' is a good approximation for 'stack frame being discarded'. Let's have it as "freed when the closest surrounding op with FunctionLike trait has the control transferred back from its body", this avoids the potential interpretation that FunctionLike should also terminate with "std.return" and a weird-sounding "op <..> returns". We may also want to add a verifier check that such an op exists. ftynse: > Strictly speaking, I also feel adding another trait is the right option. But for now and to…
		mehdi_aminiUnsubmitted Done Reply Inline Actions I am not convinced that the "FunctionLike" trait is the right one here, an op like `gpu.launch` for example would create a new scope in my expectations. What do you think about the trait Uday proposed above `AutomaticAllocationScope` ? mehdi_amini: I am not convinced that the "FunctionLike" trait is the right one here, an op like `gpu.launch`…
		ftynseUnsubmitted Done Reply Inline Actions I proposed it :) So I am obviously in favor of it. I think we can start by associating the deallocation with `FunctionLike`, it's a simple documentation+verifier change that we can land fast and avoid having contentious code upstream as well as rolling it back. Then we can implement `AutomaticAllocationScope` and let ops to opt-in (e.g., I'd expect @herhut to decide whether we want `alloca`s in GPU at all). ftynse: I proposed it :) So I am obviously in favor of it. I think we can start by associating the…
		mehdi_aminiUnsubmitted Done Reply Inline Actions I think we can start by associating the deallocation with FunctionLike, I'm afraid this will lead to wrong assumption, I rather have code written with the right trait checked from the beginning. mehdi_amini: > I think we can start by associating the deallocation with FunctionLike, I'm afraid this…
		bondhugulaAuthorUnsubmitted Done Reply Inline Actions It is clear that using a new trait for scoped allocation will be accurate here. FunctionLike will always imply a new scope for stack allocation. Introducing the trait should be straightforward - okay to update the description when we introduce the trait. bondhugula: It is clear that using a new trait for scoped allocation will be accurate here. FunctionLike…
		mehdi_aminiUnsubmitted Done Reply Inline Actions Are you adding the trait? I think this should be done now: I wouldn't want code that start checking for "FunctionLike" where it should check for the "AutomaticAllocationScope" trait. mehdi_amini: Are you adding the trait? I think this should be done now: I wouldn't want code that start…
		bondhugulaAuthorUnsubmitted Done Reply Inline Actions Alex, are you interested in / are you able to add this trait? If not, I can do it right away. bondhugula: Alex, are you interested in / are you able to add this trait? If not, I can do it right away.
		ftynseUnsubmitted Not Done Reply Inline Actions I have quite a long todo list... ftynse: I have quite a long todo list...
		bondhugulaAuthorUnsubmitted Done Reply Inline Actions D77787 bondhugula: D77787
"Builder *builder, OperationState &result, MemRefType memrefType, " #		is specified by its memref and additional operands. For example:
"ArrayRef<Value> operands, IntegerAttr alignment = IntegerAttr()", [{
result.addOperands(operands);
result.types.push_back(memrefType);
if (alignment)
result.addAttribute(getAlignmentAttrName(), alignment);
}]>];

let extraClassDeclaration = [{		%0 = alloca() : memref<8x64xf32>
		rriddleUnsubmitted Done Reply Inline Actions Please use mlir code blocks for any inline code. rriddle: Please use mlir code blocks for any inline code.
static StringRef getAlignmentAttrName() { return "alignment"; }

MemRefType getType() { return getResult().getType().cast<MemRefType>(); }		The optional list of dimension operands are bound to the dynamic dimensions
		specified in its memref type. In the example below, the SSA value '%d' is
		bound to the second dimension of the memref (which is dynamic).

/// Returns the number of symbolic operands (the ones in square brackets),		%0 = alloca(%d) : memref<8x?xf32>
/// which bind to the symbols of the memref's layout map.
unsigned getNumSymbolicOperands() {
return getNumOperands() - getType().getNumDynamicDims();
}

/// Returns the symbolic operands (the ones in square brackets), which bind		The optional list of symbol operands are bound to the symbols of the
/// to the symbols of the memref's layout map.		memref's affine map. In the example below, the SSA value '%s' is bound to
operand_range getSymbolicOperands() {		the symbol 's0' in the affine map specified in the allocs memref type.
return {operand_begin() + getType().getNumDynamicDims(), operand_end()};
}

/// Returns the dynamic sizes for this alloc operation if specified.		%0 = alloca()[%s] : memref<8x64xf32, (d0, d1)[s0] -> ((d0 + s0), d1)>
operand_range getDynamicSizes() { return getOperands(); }
}];

let hasCanonicalizer = 1;		This operation returns a single SSA value of memref type, which can be used
		by subsequent load and store operations. An optional alignment attribute, if
		specified, guarantees alignment at least to that boundary. If not specified,
		an alignment on any convenient boundary compatible with the type will be
		chosen.
		}];
}		}

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// AndOp		// AndOp
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

def AndOp : IntArithmeticOp<"and", [Commutative]> {		def AndOp : IntArithmeticOp<"and", [Commutative]> {
let summary = "integer binary and";		let summary = "integer binary and";
▲ Show 20 Lines • Show All 1,853 Lines • Show Last 20 Lines

mlir/lib/Conversion/StandardToLLVM/StandardToLLVM.cpp

Show First 20 Lines • Show All 1,504 Lines • ▼ Show 20 Lines	void rewrite(Operation *op, ArrayRef<Value> operands,
// Field 1: Allocated pointer, used for malloc/free.		// Field 1: Allocated pointer, used for malloc/free.
memRefDescriptor.setAllocatedPtr(rewriter, loc, bitcastAllocated);		memRefDescriptor.setAllocatedPtr(rewriter, loc, bitcastAllocated);

// Field 2: Actual aligned pointer to payload.		// Field 2: Actual aligned pointer to payload.
Value bitcastAligned = bitcastAllocated;		Value bitcastAligned = bitcastAllocated;
if (!useAlloca && alignment != 0) {		if (!useAlloca && alignment != 0) {
assert(alignmentValue);		assert(alignmentValue);
// offset = (align - (ptr % align))% align		// offset = (align - (ptr % align))% align
Value intVal = rewriter.create<LLVM::PtrToIntOp>(		Value intVal = rewriter.create<LLVM::PtrToIntOp>(
		nicolasvasilacheUnsubmitted Done Reply Inline Actions I think the change in indentation (?) here and everywhere is unfortunate, it does not properly track what code moved (materialized with a yellow phab horizontal bar on the left) and make it harder to see what changed vs what moved. I am mostly eyeballing the code after realizing most of this is just moved. Please flag specific things need deeper review if appropriate. nicolasvasilache: I think the change in indentation (?) here and everywhere is unfortunate, it does not properly…
		bondhugulaAuthorUnsubmitted Done Reply Inline Actions All of this is just moved. Nothing in the alloc op lowering actually changed with this - I'm just reusing the common parts for alloca. bondhugula: All of this is just moved. Nothing in the alloc op lowering actually changed with this - I'm…
loc, this->getIndexType(), allocated);		loc, this->getIndexType(), allocated);
Value ptrModAlign =		Value ptrModAlign =
rewriter.create<LLVM::URemOp>(loc, intVal, alignmentValue);		rewriter.create<LLVM::URemOp>(loc, intVal, alignmentValue);
Value subbed =		Value subbed =
rewriter.create<LLVM::SubOp>(loc, alignmentValue, ptrModAlign);		rewriter.create<LLVM::SubOp>(loc, alignmentValue, ptrModAlign);
Value offset = rewriter.create<LLVM::URemOp>(loc, subbed, alignmentValue);		Value offset = rewriter.create<LLVM::URemOp>(loc, subbed, alignmentValue);
Value aligned = rewriter.create<LLVM::GEPOp>(loc, allocated.getType(),		Value aligned = rewriter.create<LLVM::GEPOp>(loc, allocated.getType(),
allocated, offset);		allocated, offset);
Show All 38 Lines	void rewrite(Operation *op, ArrayRef<Value> operands,
}		}

// Return the final value of the descriptor.		// Return the final value of the descriptor.
rewriter.replaceOp(op, {memRefDescriptor});		rewriter.replaceOp(op, {memRefDescriptor});
}		}

bool useAlloca;		bool useAlloca;
};		};

		ftynseUnsubmitted Done Reply Inline Actions Nit: can we rather define `one` at the call site and pass it here (and to another call) ? ftynse: Nit: can we rather define `one` at the call site and pass it here (and to another call) ?
		bondhugulaAuthorUnsubmitted Done Reply Inline Actions The reason I didn't do that is that the order of the instructions in that case wouldn't be natural - there would be other stuff (the size constant definitions) between the def of 'one' and its first use here. So I left it this way. bondhugula: The reason I didn't do that is that the order of the instructions in that case wouldn't be…
		// An `alloca` is converted into a definition of a memref descriptor value and
		// an llvm.alloca to allocate the underlying data buffer. The memref descriptor
		// is of the LLVM structure type where the first element is a pointer to the
		// (typed) data buffer, and the remaining elements serve to store offset, sizes,
		// and strides of the memref using LLVM-converted `index` type.
		struct AllocaOpLowering : public LLVMLegalizationPattern<AllocaOp> {
		using LLVMLegalizationPattern<AllocaOp>::LLVMLegalizationPattern;

		LogicalResult match(Operation *op) const override {
		MemRefType type = cast<AllocaOp>(op).getType();
		if (isSupportedMemRefType(type))
		return success();

		int64_t offset;
		SmallVector<int64_t, 4> strides;
		if (failed(getStridesAndOffset(type, strides, offset)))
		return failure();

		// Dynamic strides are ok if they can be deduced from dynamic sizes (which
		// is guaranteed if getStridesAndOffset succeeds. Dynamic offset however can
		// never be alloc'ed.
		if (offset == MemRefType::getDynamicStrideOrOffset())
		return failure();

		return success();
		}

		void rewrite(Operation *op, ArrayRef<Value> operands,
		ConversionPatternRewriter &rewriter) const override {
		auto allocaOp = cast<AllocaOp>(op);
		auto loc = op->getLoc();
		MemRefType type = allocaOp.getType();

		// Get actual sizes of the memref as values: static sizes are constant
		// values and dynamic sizes are passed to 'alloc' as operands. In case of
		// zero-dimensional memref, assume a scalar (size 1).
		SmallVector<Value, 4> sizes;
		sizes.reserve(type.getRank());
		unsigned i = 0;
		for (int64_t s : type.getShape())
		sizes.push_back(s == -1 ? operands[i++]
		: createIndexConstant(rewriter, loc, s));
		if (sizes.empty())
		sizes.push_back(createIndexConstant(rewriter, loc, 1));

		// Compute the total number of memref elements.
		Value cumulativeSize = sizes.front();
		for (unsigned i = 1, e = sizes.size(); i < e; ++i)
		cumulativeSize = rewriter.create<LLVM::MulOp>(
		loc, getIndexType(), ArrayRef<Value>{cumulativeSize, sizes[i]});

		// Compute the size of an individual element. This emits the MLIR equivalent
		// of the following sizeof(...) implementation in LLVM IR:
		// %0 = getelementptr %elementType* null, %indexType 1
		// %1 = ptrtoint %elementType* %0 to %indexType
		// which is a common pattern of getting the size of a type in bytes.
		auto elementType = type.getElementType();
		auto convertedPtrType = typeConverter.convertType(elementType)
		.cast<LLVM::LLVMType>()
		.getPointerTo();
		auto nullPtr = rewriter.create<LLVM::NullOp>(loc, convertedPtrType);
		auto one = createIndexConstant(rewriter, loc, 1);
		auto gep = rewriter.create<LLVM::GEPOp>(loc, convertedPtrType,
		ArrayRef<Value>{nullPtr, one});
		auto elementSize =
		rewriter.create<LLVM::PtrToIntOp>(loc, getIndexType(), gep);
		cumulativeSize = rewriter.create<LLVM::MulOp>(
		loc, getIndexType(), ArrayRef<Value>{cumulativeSize, elementSize});

		Optional<int> alignment;
		if (auto attr = allocaOp.alignment())
		alignment = attr.getValue().getSExtValue();

		// Allocate the underlying buffer and store a pointer to it in the MemRef
		// descriptor.
		auto structElementType = typeConverter.convertType(elementType);
		auto elementPtrType = structElementType.cast<LLVM::LLVMType>().getPointerTo(
		type.getMemorySpace());
		Value allocated = rewriter.create<LLVM::AllocaOp>(
		loc, elementPtrType, cumulativeSize,
		alignment.hasValue() ? alignment.getValue() : 0);

		int64_t offset;
		SmallVector<int64_t, 4> strides;
		auto successStrides = getStridesAndOffset(type, strides, offset);
		assert(succeeded(successStrides) && "unexpected non-strided memref");
		(void)successStrides;
		assert(offset != MemRefType::getDynamicStrideOrOffset() &&
		"unexpected dynamic offset");

		// 0-D memref corner case: they have size 1.
		assert(((type.getRank() == 0 && strides.empty() && sizes.size() == 1) \|\|
		(strides.size() == sizes.size())) &&
		"unexpected number of strides");

		// Create the MemRef descriptor.
		auto structType = typeConverter.convertType(type);
		auto memRefDescriptor = MemRefDescriptor::undef(rewriter, loc, structType);

		// Field 1: Allocated pointer.
		memRefDescriptor.setAllocatedPtr(rewriter, loc, allocated);

		// Field 2: The aligned pointer is the same as the allocated one here since
		// the underlying alloca supports alignment.
		memRefDescriptor.setAlignedPtr(rewriter, loc, allocated);

		// Field 3: Offset in aligned pointer.
		memRefDescriptor.setOffset(rewriter, loc,
		createIndexConstant(rewriter, loc, offset));

		if (type.getRank() == 0)
		// No size/stride descriptor in memref, return the descriptor value.
		return rewriter.replaceOp(op, {memRefDescriptor});

		// Store all sizes in the descriptor. Only dynamic sizes are passed in as
		// operands to AllocOp.
		Value runningStride = nullptr;
		// Iterate strides in reverse order, compute runningStride and strideValues.
		auto nStrides = strides.size();
		SmallVector<Value, 4> strideValues(nStrides, nullptr);
		for (auto indexedStride : llvm::enumerate(llvm::reverse(strides))) {
		int64_t index = nStrides - 1 - indexedStride.index();
		if (strides[index] == MemRefType::getDynamicStrideOrOffset())
		// Identity layout map is enforced in the match function, so we compute:
		// `runningStride *= sizes[index + 1]`
		runningStride = runningStride
		? rewriter.create<LLVM::MulOp>(
		op->getLoc(), runningStride, sizes[index + 1])
		: createIndexConstant(rewriter, op->getLoc(), 1);
		else
		runningStride =
		createIndexConstant(rewriter, op->getLoc(), strides[index]);
		strideValues[index] = runningStride;
		}
		// Fill size and stride descriptors in memref.
		for (auto indexedSize : llvm::enumerate(sizes)) {
		int64_t index = indexedSize.index();
		memRefDescriptor.setSize(rewriter, loc, index, indexedSize.value());
		memRefDescriptor.setStride(rewriter, loc, index, strideValues[index]);
		}

		// Return the final value of the descriptor.
		rewriter.replaceOp(op, {memRefDescriptor});
		}
		};

// A CallOp automatically promotes MemRefType to a sequence of alloca/store and		// A CallOp automatically promotes MemRefType to a sequence of alloca/store and
// passes the pointer to the MemRef across function boundaries.		// passes the pointer to the MemRef across function boundaries.
template <typename CallOpType>		template <typename CallOpType>
struct CallOpInterfaceLowering : public LLVMLegalizationPattern<CallOpType> {		struct CallOpInterfaceLowering : public LLVMLegalizationPattern<CallOpType> {
using LLVMLegalizationPattern<CallOpType>::LLVMLegalizationPattern;		using LLVMLegalizationPattern<CallOpType>::LLVMLegalizationPattern;
using Super = CallOpInterfaceLowering<CallOpType>;		using Super = CallOpInterfaceLowering<CallOpType>;
using Base = LLVMLegalizationPattern<CallOpType>;		using Base = LLVMLegalizationPattern<CallOpType>;

▲ Show 20 Lines • Show All 1,209 Lines • ▼ Show 20 Lines
void mlir::populateStdToLLVMNonMemoryConversionPatterns(		void mlir::populateStdToLLVMNonMemoryConversionPatterns(
LLVMTypeConverter &converter, OwningRewritePatternList &patterns) {		LLVMTypeConverter &converter, OwningRewritePatternList &patterns) {
// FIXME: this should be tablegen'ed		// FIXME: this should be tablegen'ed
// clang-format off		// clang-format off
patterns.insert<		patterns.insert<
AbsFOpLowering,		AbsFOpLowering,
AddFOpLowering,		AddFOpLowering,
AddIOpLowering,		AddIOpLowering,
		AllocaOpLowering,
AndOpLowering,		AndOpLowering,
AtomicCmpXchgOpLowering,		AtomicCmpXchgOpLowering,
AtomicRMWOpLowering,		AtomicRMWOpLowering,
BranchOpLowering,		BranchOpLowering,
CallIndirectOpLowering,		CallIndirectOpLowering,
CallOpLowering,		CallOpLowering,
CeilFOpLowering,		CeilFOpLowering,
CmpFOpLowering,		CmpFOpLowering,
▲ Show 20 Lines • Show All 221 Lines • ▼ Show 20 Lines	mlir::LLVMConversionTarget::LLVMConversionTarget(MLIRContext &ctx)
this->addLegalDialect<LLVM::LLVMDialect>();		this->addLegalDialect<LLVM::LLVMDialect>();
this->addIllegalOp<LLVM::DialectCastOp>();		this->addIllegalOp<LLVM::DialectCastOp>();
}		}

std::unique_ptr<OpPassBase<ModuleOp>>		std::unique_ptr<OpPassBase<ModuleOp>>
mlir::createLowerToLLVMPass(bool useAlloca, bool useBarePtrCallConv,		mlir::createLowerToLLVMPass(bool useAlloca, bool useBarePtrCallConv,
bool emitCWrappers) {		bool emitCWrappers) {
return std::make_unique<LLVMLoweringPass>(useAlloca, useBarePtrCallConv,		return std::make_unique<LLVMLoweringPass>(useAlloca, useBarePtrCallConv,
emitCWrappers);		emitCWrappers);
		nicolasvasilacheUnsubmitted Done Reply Inline Actions I am concerned by the silent behavior breaking changes here and in the followup revision. I am expecting this will be painful for integrations and will repeat itself. Can we turn this into a: struct LowerToLLVMOptions { ... } nicolasvasilache: I am concerned by the silent behavior breaking changes here and in the followup revision. I am…
		bondhugulaAuthorUnsubmitted Done Reply Inline Actions That's a concern - there are going to be silent breaking effects. I've changed it to struct here. But even with a struct, you'll have the same issue with list initialization (some of the fields will remain uninitialized / wrongly initialized depending on where the field was removed from or where the new field was added). If the new fields are always added at the end, we'll just have uninitialized fields. And with explicit field-wise init, it'll just lead to uninitialized stuff. BTW, none of the three options are documented at the declaration (but only in Passes.td). bondhugula: That's a concern - there are going to be silent breaking effects. I've changed it to struct…
}		}

static PassRegistration<LLVMLoweringPass>		static PassRegistration<LLVMLoweringPass>
pass(PASS_NAME, "Convert scalar and vector operations from the "		pass(PASS_NAME, "Convert scalar and vector operations from the "
"Standard to the LLVM dialect");		"Standard to the LLVM dialect");

mlir/lib/Dialect/StandardOps/IR/Ops.cpp

Show First 20 Lines • Show All 236 Lines • ▼ Show 20 Lines	OpFoldResult AddIOp::fold(ArrayRef<Attribute> operands) {
if (matchPattern(rhs(), m_Zero()))		if (matchPattern(rhs(), m_Zero()))
return lhs();		return lhs();

return constFoldBinaryOp<IntegerAttr>(operands,		return constFoldBinaryOp<IntegerAttr>(operands,
[](APInt a, APInt b) { return a + b; });		[](APInt a, APInt b) { return a + b; });
}		}

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// AllocOp		// AllocOp / AllocaOp
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

static void print(OpAsmPrinter &p, AllocOp op) {		template <typename AllocLikeOp>
p << "alloc";		static void printAllocLikeOp(OpAsmPrinter &p, AllocLikeOp op, StringRef name) {
		static_assert(std::is_same<AllocLikeOp, AllocOp>::value \|\|
		rriddleUnsubmitted Done Reply Inline Actions llvm::is_one_of rriddle: llvm::is_one_of
		bondhugulaAuthorUnsubmitted Done Reply Inline Actions Thanks! bondhugula: Thanks!
		std::is_same<AllocLikeOp, AllocaOp>::value,
		"applies to only alloc or alloca");
		p << name;

// Print dynamic dimension operands.		// Print dynamic dimension operands.
MemRefType type = op.getType();		MemRefType type = op.getType();
printDimAndSymbolList(op.operand_begin(), op.operand_end(),		printDimAndSymbolList(op.operand_begin(), op.operand_end(),
type.getNumDynamicDims(), p);		type.getNumDynamicDims(), p);
p.printOptionalAttrDict(op.getAttrs(), /elidedAttrs=/{"map"});		p.printOptionalAttrDict(op.getAttrs(), /elidedAttrs=/{"map"});
p << " : " << type;		p << " : " << type;
}		}

static ParseResult parseAllocOp(OpAsmParser &parser, OperationState &result) {		static void print(OpAsmPrinter &p, AllocOp op) {
		printAllocLikeOp(p, op, "alloc");
		}

		static ParseResult parseAllocLikeOp(OpAsmParser &parser,
		OperationState &result) {
MemRefType type;		MemRefType type;

// Parse the dimension operands and optional symbol operands, followed by a		// Parse the dimension operands and optional symbol operands, followed by a
// memref type.		// memref type.
unsigned numDimOperands;		unsigned numDimOperands;
if (parseDimAndSymbolList(parser, result.operands, numDimOperands) \|\|		if (parseDimAndSymbolList(parser, result.operands, numDimOperands) \|\|
parser.parseOptionalAttrDict(result.attributes) \|\|		parser.parseOptionalAttrDict(result.attributes) \|\|
parser.parseColonType(type))		parser.parseColonType(type))
return failure();		return failure();

// Check numDynamicDims against number of question marks in memref type.		// Check numDynamicDims against number of question marks in memref type.
// Note: this check remains here (instead of in verify()), because the		// Note: this check remains here (instead of in verify()), because the
// partition between dim operands and symbol operands is lost after parsing.		// partition between dim operands and symbol operands is lost after parsing.
// Verification still checks that the total number of operands matches		// Verification still checks that the total number of operands matches
// the number of symbols in the affine map, plus the number of dynamic		// the number of symbols in the affine map, plus the number of dynamic
// dimensions in the memref.		// dimensions in the memref.
if (numDimOperands != type.getNumDynamicDims())		if (numDimOperands != type.getNumDynamicDims())
return parser.emitError(parser.getNameLoc())		return parser.emitError(parser.getNameLoc())
<< "dimension operand count does not equal memref dynamic dimension "		<< "dimension operand count does not equal memref dynamic dimension "
"count";		"count";
result.types.push_back(type);		result.types.push_back(type);
return success();		return success();
}		}

static LogicalResult verify(AllocOp op) {		template <typename AllocLikeOp>
auto memRefType = op.getResult().getType().dyn_cast<MemRefType>();		static LogicalResult verify(AllocLikeOp op) {
		static_assert(std::is_same<AllocLikeOp, AllocOp>::value \|\|
		std::is_same<AllocLikeOp, AllocaOp>::value,
		"applies to only alloc or alloca");
		auto memRefType = op.getResult().getType().template dyn_cast<MemRefType>();
if (!memRefType)		if (!memRefType)
return op.emitOpError("result must be a memref");		return op.emitOpError("result must be a memref");

unsigned numSymbols = 0;		unsigned numSymbols = 0;
if (!memRefType.getAffineMaps().empty()) {		if (!memRefType.getAffineMaps().empty()) {
// Store number of symbols used in affine map (used in subsequent check).		// Store number of symbols used in affine map (used in subsequent check).
AffineMap affineMap = memRefType.getAffineMaps()[0];		AffineMap affineMap = memRefType.getAffineMaps()[0];
numSymbols = affineMap.getNumSymbols();		numSymbols = affineMap.getNumSymbols();
Show All 9 Lines	static LogicalResult verify(AllocLikeOp op) {

// Verify that all operands are of type Index.		// Verify that all operands are of type Index.
for (auto operandType : op.getOperandTypes())		for (auto operandType : op.getOperandTypes())
if (!operandType.isIndex())		if (!operandType.isIndex())
return op.emitOpError("requires operands to be of type Index");		return op.emitOpError("requires operands to be of type Index");
return success();		return success();
}		}

		static void print(OpAsmPrinter &p, AllocaOp op) {
		printAllocLikeOp(p, op, "alloca");
		}

namespace {		namespace {
/// Fold constant dimensions into an alloc operation.		/// Fold constant dimensions into an alloc like operation.
struct SimplifyAllocConst : public OpRewritePattern<AllocOp> {		template <typename AllocLikeOp>
using OpRewritePattern<AllocOp>::OpRewritePattern;		struct SimplifyAllocConst : public OpRewritePattern<AllocLikeOp> {
		using OpRewritePattern<AllocLikeOp>::OpRewritePattern;

LogicalResult matchAndRewrite(AllocOp alloc,		LogicalResult matchAndRewrite(AllocLikeOp alloc,
PatternRewriter &rewriter) const override {		PatternRewriter &rewriter) const override {
// Check to see if any dimensions operands are constants. If so, we can		// Check to see if any dimensions operands are constants. If so, we can
// substitute and drop them.		// substitute and drop them.
if (llvm::none_of(alloc.getOperands(), [](Value operand) {		if (llvm::none_of(alloc.getOperands(), [](Value operand) {
return matchPattern(operand, m_ConstantIndex());		return matchPattern(operand, m_ConstantIndex());
}))		}))
return failure();		return failure();

Show All 27 Lines	LogicalResult matchAndRewrite(AllocLikeOp alloc,

// Create new memref type (which will have fewer dynamic dimensions).		// Create new memref type (which will have fewer dynamic dimensions).
MemRefType newMemRefType =		MemRefType newMemRefType =
MemRefType::Builder(memrefType).setShape(newShapeConstants);		MemRefType::Builder(memrefType).setShape(newShapeConstants);
assert(static_cast<int64_t>(newOperands.size()) ==		assert(static_cast<int64_t>(newOperands.size()) ==
newMemRefType.getNumDynamicDims());		newMemRefType.getNumDynamicDims());

// Create and insert the alloc op for the new memref.		// Create and insert the alloc op for the new memref.
auto newAlloc = rewriter.create<AllocOp>(alloc.getLoc(), newMemRefType,		auto newAlloc = rewriter.create<AllocLikeOp>(alloc.getLoc(), newMemRefType,
newOperands, IntegerAttr());		newOperands, IntegerAttr());
		Lint: Pre-merge checks Inline Actions clang-format: please reformat the code - newOperands, IntegerAttr()); + newOperands, IntegerAttr()); Lint: Pre-merge checks: clang-format: please reformat the code ``` - …
// Insert a cast so we have the same type as the old alloc.		// Insert a cast so we have the same type as the old alloc.
auto resultCast = rewriter.create<MemRefCastOp>(alloc.getLoc(), newAlloc,		auto resultCast = rewriter.create<MemRefCastOp>(alloc.getLoc(), newAlloc,
alloc.getType());		alloc.getType());

rewriter.replaceOp(alloc, {resultCast});		rewriter.replaceOp(alloc, {resultCast});
return success();		return success();
}		}
};		};
Show All 11 Lines	LogicalResult matchAndRewrite(AllocOp alloc,
}		}
return failure();		return failure();
}		}
};		};
} // end anonymous namespace.		} // end anonymous namespace.

void AllocOp::getCanonicalizationPatterns(OwningRewritePatternList &results,		void AllocOp::getCanonicalizationPatterns(OwningRewritePatternList &results,
MLIRContext *context) {		MLIRContext *context) {
results.insert<SimplifyAllocConst, SimplifyDeadAlloc>(context);		results.insert<SimplifyAllocConst<AllocOp>, SimplifyDeadAlloc>(context);
		}

		void AllocaOp::getCanonicalizationPatterns(OwningRewritePatternList &results,
		MLIRContext *context) {
		results.insert<SimplifyAllocConst<AllocaOp>>(context);
}		}

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// AndOp		// AndOp
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

OpFoldResult AndOp::fold(ArrayRef<Attribute> operands) {		OpFoldResult AndOp::fold(ArrayRef<Attribute> operands) {
/// and(x, 0) -> 0		/// and(x, 0) -> 0
▲ Show 20 Lines • Show All 2,123 Lines • Show Last 20 Lines

mlir/test/Conversion/StandardToLLVM/convert-dynamic-memref-ops.mlir

	Show First 20 Lines • Show All 87 Lines • ▼ Show 20 Lines
	// CHECK-NEXT: llvm.insertvalue %[[st0]], %{{.}}[4, 0] : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">			// CHECK-NEXT: llvm.insertvalue %[[st0]], %{{.}}[4, 0] : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">
	// CHECK-NEXT: llvm.insertvalue %[[N]], %{{.}}[3, 1] : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">			// CHECK-NEXT: llvm.insertvalue %[[N]], %{{.}}[3, 1] : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">
	// CHECK-NEXT: llvm.insertvalue %[[st1]], %{{.}}[4, 1] : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">			// CHECK-NEXT: llvm.insertvalue %[[st1]], %{{.}}[4, 1] : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">
	%0 = alloc(%arg0, %arg1) : memref<?x?xf32>			%0 = alloc(%arg0, %arg1) : memref<?x?xf32>
	// CHECK-NEXT: llvm.return %{{.}} : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">			// CHECK-NEXT: llvm.return %{{.}} : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">
	return %0 : memref<?x?xf32>			return %0 : memref<?x?xf32>
	}			}

				// -----

				// CHECK-LABEL: func @dynamic_alloca
				// CHECK: %[[M:.]]: !llvm.i64, %[[N:.]]: !llvm.i64) -> !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }"> {
				func @dynamic_alloca(%arg0: index, %arg1: index) -> memref<?x?xf32> {
				// CHECK: %[[num_elems:.*]] = llvm.mul %[[M]], %[[N]] : !llvm.i64
				// CHECK-NEXT: %[[null:.]] = llvm.mlir.null : !llvm<"float">
				// CHECK-NEXT: %[[one:.*]] = llvm.mlir.constant(1 : index) : !llvm.i64
				// CHECK-NEXT: %[[gep:.]] = llvm.getelementptr %[[null]][%[[one]]] : (!llvm<"float">, !llvm.i64) -> !llvm<"float*">
				// CHECK-NEXT: %[[sizeof:.]] = llvm.ptrtoint %[[gep]] : !llvm<"float"> to !llvm.i64
				// CHECK-NEXT: %[[sz_bytes:.*]] = llvm.mul %[[num_elems]], %[[sizeof]] : !llvm.i64
				// CHECK-NEXT: %[[allocated:.]] = llvm.alloca %[[sz_bytes]] x !llvm.float : (!llvm.i64) -> !llvm<"float">
				// CHECK-NEXT: llvm.mlir.undef : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }">
				// CHECK-NEXT: llvm.insertvalue %[[allocated]], %{{.}}[0] : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">
				// CHECK-NEXT: llvm.insertvalue %[[allocated]], %{{.}}[1] : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">
				// CHECK-NEXT: %[[off:.*]] = llvm.mlir.constant(0 : index) : !llvm.i64
				// CHECK-NEXT: llvm.insertvalue %[[off]], %{{.}}[2] : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">
				// CHECK-NEXT: %[[st1:.*]] = llvm.mlir.constant(1 : index) : !llvm.i64
				// CHECK-NEXT: %[[st0:.]] = llvm.mul %{{.}}, %[[N]] : !llvm.i64
				// CHECK-NEXT: llvm.insertvalue %[[M]], %{{.}}[3, 0] : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">
				// CHECK-NEXT: llvm.insertvalue %[[st0]], %{{.}}[4, 0] : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">
				// CHECK-NEXT: llvm.insertvalue %[[N]], %{{.}}[3, 1] : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">
				// CHECK-NEXT: llvm.insertvalue %[[st1]], %{{.}}[4, 1] : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">
				%0 = alloca(%arg0, %arg1) : memref<?x?xf32>

				// Test with explicitly specified alignment. llvm.alloca takes care of the
				// alignment. The same pointer is thus used for allocation and aligned
				// accesses.
				// CHECK: %[[alloca_aligned:.]] = llvm.alloca %{{.}} x !llvm.float {alignment = 32 : i64} : (!llvm.i64) -> !llvm<"float*">
				// CHECK: %[[desc:.]] = llvm.mlir.undef : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">
				// CHECK: %[[desc1:.]] = llvm.insertvalue %[[alloca_aligned]], %[[desc]][0] : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">
				// CHECK: llvm.insertvalue %[[alloca_aligned]], %[[desc1]][1] : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }">
				alloca(%arg0, %arg1) {alignment = 32} : memref<?x?xf32>
				return %0 : memref<?x?xf32>
				}

	// CHECK-LABEL: func @dynamic_dealloc			// CHECK-LABEL: func @dynamic_dealloc
	func @dynamic_dealloc(%arg0: memref<?x?xf32>) {			func @dynamic_dealloc(%arg0: memref<?x?xf32>) {
	// CHECK: %[[ptr:.]] = llvm.extractvalue %{{.}}[0] : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }">			// CHECK: %[[ptr:.]] = llvm.extractvalue %{{.}}[0] : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }">
	// CHECK-NEXT: %[[ptri8:.]] = llvm.bitcast %[[ptr]] : !llvm<"float"> to !llvm<"i8*">			// CHECK-NEXT: %[[ptri8:.]] = llvm.bitcast %[[ptr]] : !llvm<"float"> to !llvm<"i8*">
	// CHECK-NEXT: llvm.call @free(%[[ptri8]]) : (!llvm<"i8*">) -> ()			// CHECK-NEXT: llvm.call @free(%[[ptri8]]) : (!llvm<"i8*">) -> ()
	dealloc %arg0 : memref<?x?xf32>			dealloc %arg0 : memref<?x?xf32>
	return			return
	}			}
	▲ Show 20 Lines • Show All 183 Lines • Show Last 20 Lines

mlir/test/Conversion/StandardToLLVM/convert-static-memref-ops.mlir

	Show First 20 Lines • Show All 201 Lines • ▼ Show 20 Lines
	// BAREPTR-NEXT: %[[allocated:.]] = llvm.call @malloc(%[[bytes]]) : (!llvm.i64) -> !llvm<"i8">			// BAREPTR-NEXT: %[[allocated:.]] = llvm.call @malloc(%[[bytes]]) : (!llvm.i64) -> !llvm<"i8">
	// BAREPTR-NEXT: llvm.bitcast %[[allocated]] : !llvm<"i8"> to !llvm<"float">			// BAREPTR-NEXT: llvm.bitcast %[[allocated]] : !llvm<"i8"> to !llvm<"float">
	%0 = alloc() : memref<32x18xf32>			%0 = alloc() : memref<32x18xf32>
	return %0 : memref<32x18xf32>			return %0 : memref<32x18xf32>
	}			}

	// -----			// -----

				// CHECK-LABEL: func @static_alloca() -> !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }"> {
				func @static_alloca() -> memref<32x18xf32> {
				// CHECK-NEXT: %[[sz1:.*]] = llvm.mlir.constant(32 : index) : !llvm.i64
				// CHECK-NEXT: %[[sz2:.*]] = llvm.mlir.constant(18 : index) : !llvm.i64
				// CHECK-NEXT: %[[num_elems:.*]] = llvm.mul %0, %1 : !llvm.i64
				// CHECK-NEXT: %[[null:.]] = llvm.mlir.null : !llvm<"float">
				// CHECK-NEXT: %[[one:.*]] = llvm.mlir.constant(1 : index) : !llvm.i64
				// CHECK-NEXT: %[[gep:.]] = llvm.getelementptr %[[null]][%[[one]]] : (!llvm<"float">, !llvm.i64) -> !llvm<"float*">
				// CHECK-NEXT: %[[sizeof:.]] = llvm.ptrtoint %[[gep]] : !llvm<"float"> to !llvm.i64
				// CHECK-NEXT: %[[bytes:.*]] = llvm.mul %[[num_elems]], %[[sizeof]] : !llvm.i64
				// CHECK-NEXT: %[[allocated:.]] = llvm.alloca %[[bytes]] x !llvm.float : (!llvm.i64) -> !llvm<"float">
				%0 = alloca() : memref<32x18xf32>

				// Test with explicitly specified alignment. llvm.alloca takes care of the
				// alignment. The same pointer is thus used for allocation and aligned
				// accesses.
				// CHECK: %[[alloca_aligned:.]] = llvm.alloca %{{.}} x !llvm.float {alignment = 32 : i64} : (!llvm.i64) -> !llvm<"float*">
				// CHECK: %[[desc:.]] = llvm.mlir.undef : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">
				// CHECK: %[[desc1:.]] = llvm.insertvalue %[[alloca_aligned]], %[[desc]][0] : !llvm<"{ float, float*, i64, [2 x i64], [2 x i64] }">
				// CHECK: llvm.insertvalue %[[alloca_aligned]], %[[desc1]][1] : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }">
				alloca() {alignment = 32} : memref<32x18xf32>
				return %0 : memref<32x18xf32>
				}

				// -----

	// CHECK-LABEL: func @static_dealloc			// CHECK-LABEL: func @static_dealloc
	// BAREPTR-LABEL: func @static_dealloc(%{{.}}: !llvm<"float">) {			// BAREPTR-LABEL: func @static_dealloc(%{{.}}: !llvm<"float">) {
	func @static_dealloc(%static: memref<10x8xf32>) {			func @static_dealloc(%static: memref<10x8xf32>) {
	// CHECK: %[[ptr:.]] = llvm.extractvalue %{{.}}[0] : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }">			// CHECK: %[[ptr:.]] = llvm.extractvalue %{{.}}[0] : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }">
	// CHECK-NEXT: %[[bc:.]] = llvm.bitcast %[[ptr]] : !llvm<"float"> to !llvm<"i8*">			// CHECK-NEXT: %[[bc:.]] = llvm.bitcast %[[ptr]] : !llvm<"float"> to !llvm<"i8*">
	// CHECK-NEXT: llvm.call @free(%[[bc]]) : (!llvm<"i8*">) -> ()			// CHECK-NEXT: llvm.call @free(%[[bc]]) : (!llvm<"i8*">) -> ()

	// BAREPTR: %[[ptr:.]] = llvm.extractvalue %{{.}}[0] : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }">			// BAREPTR: %[[ptr:.]] = llvm.extractvalue %{{.}}[0] : !llvm<"{ float, float, i64, [2 x i64], [2 x i64] }">
	▲ Show 20 Lines • Show All 132 Lines • Show Last 20 Lines

mlir/test/IR/memory-ops.mlir

Show All 27 Lines	^bb0:
// b/116054838 Parser crash while parsing ill-formed AllocOp		// b/116054838 Parser crash while parsing ill-formed AllocOp
// CHECK: %4 = alloc() : memref<2xi32>		// CHECK: %4 = alloc() : memref<2xi32>
%4 = alloc() : memref<2 x i32>		%4 = alloc() : memref<2 x i32>

// CHECK: return		// CHECK: return
return		return
}		}

		// CHECK-LABEL: func @alloca() {
		func @alloca() {
		^bb0:
		// Test simple alloc.
		// CHECK: %0 = alloca() : memref<1024x64xf32, 1>
		%0 = alloca() : memref<1024x64xf32, affine_map<(d0, d1) -> (d0, d1)>, 1>

		%c0 = "std.constant"() {value = 0: index} : () -> index
		%c1 = "std.constant"() {value = 1: index} : () -> index

		// Test alloca with dynamic dimensions.
		// CHECK: %1 = alloca(%c0, %c1) : memref<?x?xf32, 1>
		%1 = alloca(%c0, %c1) : memref<?x?xf32, affine_map<(d0, d1) -> (d0, d1)>, 1>

		// Test alloca with no dynamic dimensions and one symbol.
		// CHECK: %2 = alloca()[%c0] : memref<2x4xf32, #map0, 1>
		%2 = alloca()[%c0] : memref<2x4xf32, affine_map<(d0, d1)[s0] -> ((d0 + s0), d1)>, 1>

		// Test alloca with dynamic dimensions and one symbol.
		// CHECK: %3 = alloca(%c1)[%c0] : memref<2x?xf32, #map0, 1>
		%3 = alloca(%c1)[%c0] : memref<2x?xf32, affine_map<(d0, d1)[s0] -> (d0 + s0, d1)>, 1>

		// Alloca with no mappings, but with alignment.
		// CHECK: %4 = alloca() {alignment = 64 : i64} : memref<2xi32>
		%4 = alloca() {alignment = 64} : memref<2 x i32>

		return
		}

// CHECK-LABEL: func @dealloc() {		// CHECK-LABEL: func @dealloc() {
func @dealloc() {		func @dealloc() {
^bb0:		^bb0:
// CHECK: %0 = alloc() : memref<1024x64xf32>		// CHECK: %0 = alloc() : memref<1024x64xf32>
%0 = alloc() : memref<1024x64xf32, affine_map<(d0, d1) -> (d0, d1)>, 0>		%0 = alloc() : memref<1024x64xf32, affine_map<(d0, d1) -> (d0, d1)>, 0>

// CHECK: dealloc %0 : memref<1024x64xf32>		// CHECK: dealloc %0 : memref<1024x64xf32>
dealloc %0 : memref<1024x64xf32, affine_map<(d0, d1) -> (d0, d1)>, 0>		dealloc %0 : memref<1024x64xf32, affine_map<(d0, d1) -> (d0, d1)>, 0>
▲ Show 20 Lines • Show All 46 Lines • Show Last 20 Lines

mlir/test/Transforms/canonicalize.mlir

	Show First 20 Lines • Show All 386 Lines • ▼ Show 20 Lines
	func @dyn_shape_fold(%L : index, %M : index) -> (memref<? x ? x i32>, memref<? x ? x f32>) {			func @dyn_shape_fold(%L : index, %M : index) -> (memref<? x ? x i32>, memref<? x ? x f32>) {
	// CHECK: %c0 = constant 0 : index			// CHECK: %c0 = constant 0 : index
	%zero = constant 0 : index			%zero = constant 0 : index
	// The constants below disappear after they propagate into shapes.			// The constants below disappear after they propagate into shapes.
	%nine = constant 9 : index			%nine = constant 9 : index
	%N = constant 1024 : index			%N = constant 1024 : index
	%K = constant 512 : index			%K = constant 512 : index

	// CHECK-NEXT: %0 = alloc(%arg0) : memref<?x1024xf32>			// CHECK-NEXT: alloc(%arg0) : memref<?x1024xf32>
	%a = alloc(%L, %N) : memref<? x ? x f32>			%a = alloc(%L, %N) : memref<? x ? x f32>

	// CHECK-NEXT: %1 = alloc(%arg1) : memref<4x1024x8x512x?xf32>			// CHECK-NEXT: alloc(%arg1) : memref<4x1024x8x512x?xf32>
	%b = alloc(%N, %K, %M) : memref<4 x ? x 8 x ? x ? x f32>			%b = alloc(%N, %K, %M) : memref<4 x ? x 8 x ? x ? x f32>

	// CHECK-NEXT: %2 = alloc() : memref<512x1024xi32>			// CHECK-NEXT: alloc() : memref<512x1024xi32>
	%c = alloc(%K, %N) : memref<? x ? x i32>			%c = alloc(%K, %N) : memref<? x ? x i32>

				// CHECK: alloc() : memref<9x9xf32>
				%d = alloc(%nine, %nine) : memref<? x ? x f32>

				// CHECK: alloca(%arg1) : memref<4x1024x8x512x?xf32>
				%e = alloca(%N, %K, %M) : memref<4 x ? x 8 x ? x ? x f32>

	// CHECK: affine.for			// CHECK: affine.for
	affine.for %i = 0 to %L {			affine.for %i = 0 to %L {
	// CHECK-NEXT: affine.for			// CHECK-NEXT: affine.for
	affine.for %j = 0 to 10 {			affine.for %j = 0 to 10 {
	// CHECK-NEXT: load %0[%arg2, %arg3] : memref<?x1024xf32>			// CHECK-NEXT: load %0[%arg2, %arg3] : memref<?x1024xf32>
	// CHECK-NEXT: store %{{.*}}, %1[%c0, %c0, %arg2, %arg3, %c0] : memref<4x1024x8x512x?xf32>			// CHECK-NEXT: store %{{.*}}, %1[%c0, %c0, %arg2, %arg3, %c0] : memref<4x1024x8x512x?xf32>
	%v = load %a[%i, %j] : memref<?x?xf32>			%v = load %a[%i, %j] : memref<?x?xf32>
	store %v, %b[%zero, %zero, %i, %j, %zero] : memref<4x?x8x?x?xf32>			store %v, %b[%zero, %zero, %i, %j, %zero] : memref<4x?x8x?x?xf32>
	}			}
	}			}

	// CHECK: alloc() : memref<9x9xf32>
	%d = alloc(%nine, %nine) : memref<? x ? x f32>

	return %c, %d : memref<? x ? x i32>, memref<? x ? x f32>			return %c, %d : memref<? x ? x i32>, memref<? x ? x f32>
	}			}

	#map1 = affine_map<(d0, d1)[s0, s1, s2] -> (d0 * s1 + d1 * s2 + s0)>			#map1 = affine_map<(d0, d1)[s0, s1, s2] -> (d0 * s1 + d1 * s2 + s0)>
	#map2 = affine_map<(d0, d1, d2)[s0, s1, s2] -> (d0 * s2 + d1 * s1 + d2 + s0)>			#map2 = affine_map<(d0, d1, d2)[s0, s1, s2] -> (d0 * s2 + d1 * s1 + d2 + s0)>

	// CHECK-LABEL: func @dim_op_fold(%arg0: index, %arg1: index, %arg2: index,			// CHECK-LABEL: func @dim_op_fold(%arg0: index, %arg1: index, %arg2: index,
	func @dim_op_fold(%arg0: index, %arg1: index, %arg2: index, %BUF: memref<?xi8>, %M : index, %N : index, %K : index) {			func @dim_op_fold(%arg0: index, %arg1: index, %arg2: index, %BUF: memref<?xi8>, %M : index, %N : index, %K : index) {
	▲ Show 20 Lines • Show All 472 Lines • Show Last 20 Lines

This is an archive of the discontinued LLVM Phabricator instance.

[MLIR] Introduce std.alloca opClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 252338

mlir/include/mlir/Dialect/StandardOps/IR/Ops.td

mlir/lib/Conversion/StandardToLLVM/StandardToLLVM.cpp

mlir/lib/Dialect/StandardOps/IR/Ops.cpp

mlir/test/Conversion/StandardToLLVM/convert-dynamic-memref-ops.mlir

mlir/test/Conversion/StandardToLLVM/convert-static-memref-ops.mlir

mlir/test/IR/memory-ops.mlir

mlir/test/Transforms/canonicalize.mlir

[MLIR] Introduce std.alloca op
ClosedPublic