Download Raw Diff

Details

Reviewers

ftynse

Commits

rGfdc0d4360b4e: Introduce alloca_scope op

Summary

Introduction

This proposal describes the new op to be added to the std (and later moved memref)
dialect called alloca_scope.

Motivation

Alloca operations are easy to misuse, especially if one relies on it while doing
rewriting/conversion passes. For example let's consider a simple example of two
independent dialects, one defines an op that wants to allocate on-stack and
another defines a construct that corresponds to some form of looping:

dialect1.looping_op {
  %x = dialect2.stack_allocating_op
}

Since the dialects might not know about each other they are going to define a
lowering to std/scf/etc independently:

scf.for … {
   %x_temp = std.alloca …
   … // do some domain-specific work using %x_temp buffer 
   … // and store the result into %result
   %x = %result
}

Later on the scf and std.alloca is going to be lowered to llvm using a
combination of llvm.alloca and unstructured control flow.

At this point the use of %x_temp is bound to either be either optimized by
llvm (for example using mem2reg) or in the worst case: perform an independent
stack allocation on each iteration of the loop. While the llvm optimizations are
likely to succeed they are not guaranteed to do so, and they provide
opportunities for surprising issues with unexpected use of stack size.

Proposal

We propose a new operation that defines a finer-grain allocation scope for the
alloca-allocated memory called alloca_scope:

alloca_scope {
   %x_temp = alloca …
   ...
}

Here the lifetime of %x_temp is going to be bound to the narrow annotated
region within alloca_scope. Moreover, one can also return values out of the
alloca_scope with an accompanying alloca_scope.return op (that behaves
similarly to scf.yield):

%result = alloca_scope {
   %x_temp = alloca …
   …
   alloca_scope.return %myvalue
}

Under the hood the alloca_scope is going to lowered to a combination of
llvm.intr.stacksave and llvm.intr.strackrestore that are going to be invoked
automatically as control-flow enters and leaves the body of the alloca_scope.

The key value of the new op is to allow deterministic guaranteed stack use
through an explicit annotation in the code which is finer-grain than the
function-level scope of AutomaticAllocationScope interface. alloca_scope
can be inserted at arbitrary locations and doesn’t require non-trivial
transformations such as outlining.

Which dialect

Before memref dialect is split, alloca_scope can temporarily reside in std
dialect, and later on be moved to memref together with the rest of
memory-related operations.

Implementation

An implementation of the op is available here.

Original commits:

Add initial scaffolding for alloca_scope op
Add alloca_scope.return op
Add no region arguments and variadic results
Add op descriptions
Add failing test case
Add another failing test
Initial implementation of lowering for std.alloca_scope
Fix backticks
Fix getSuccessorRegions implementation

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

shabalin created this revision.Mar 2 2021, 6:45 AM

Herald added a reviewer: ftynse. · View Herald TranscriptMar 2 2021, 6:45 AM

Herald added subscribers: cota, teijeong, rdzhabarov and 15 others. · View Herald Transcript

shabalin requested review of this revision.Mar 2 2021, 6:45 AM

Herald added a project: Restricted Project. · View Herald TranscriptMar 2 2021, 6:45 AM

Herald added subscribers: stephenneuendorffer, nicolasvasilache. · View Herald Transcript

Harbormaster completed remote builds in B91562: Diff 327439.Mar 2 2021, 7:34 AM

Can you cleanup the summary and provide an actual contextual description instead? In particular the motivation for this op would be interesting to provide here.
(see https://chris.beams.io/posts/git-commit/ for general guidelines)

mlir/include/mlir/Dialect/StandardOps/IR/Ops.td
469 ↗	(On Diff #327439)	delimited

@mehdi_amini I was going to post a longer post describing the context and link it here on discourse but it seems to have been lost somewhere in moderation. I'll update update the commit with more info before it shows up.

@mehdi_amini Detailed description posted above.

shabalin edited the summary of this revision. (Show Details)Mar 2 2021, 12:16 PM

In D97768#2598196, @shabalin wrote:

@mehdi_amini Detailed description posted above.

Thanks! I'll wait for the RFC to show up on Discourse and we can discuss it here :)

I just in general look at the Phabricator description to be what will be committed, and thus follow roughly the guidelines I linked above.

shabalin updated this revision to Diff 327539.Mar 2 2021, 12:20 PM

This comment was removed by shabalin.

Fixed typo in the description

@mehdi_amini Sounds good to me! Sorry for the initial poor description without any context.

Harbormaster completed remote builds in B91639: Diff 327539.Mar 2 2021, 3:08 PM

Harbormaster completed remote builds in B91640: Diff 327540.Mar 2 2021, 3:10 PM

ftynse added inline comments.Mar 3 2021, 1:29 AM

mlir/include/mlir/Dialect/StandardOps/IR/Ops.td
500 ↗	(On Diff #327540)	I'm not convinced we want to restrict the body region to only have one block.
mlir/lib/Conversion/StandardToLLVM/StandardToLLVM.cpp
2055–2056	Currently, only one block is allowed so beforeBody==afterBody always. I think we should relax that and allow an arbitrary CFG in the body. In which case, we probably want to handle multiple blocks terminated with `alloca_scope.return`.
mlir/test/Conversion/StandardToLLVM/convert-alloca-scope.mlir
6	Nit: we indent `// CHECK` lines
mlir/test/Dialect/Standard/ops.mlir
103 ↗	(On Diff #327540)	Hmm, don't we actually want to `// CHECK` the printed op?
107 ↗	(On Diff #327540)	Could we also have the test for `alloca_scope.return` with operands?

Rebase on top of latest main

Herald added a subscriber: dcaballe. · View Herald TranscriptJun 10 2021, 4:50 AM

I suppose this now belongs to the memref dialect.

Harbormaster completed remote builds in B108595: Diff 351134.Jun 10 2021, 5:00 AM

Move the op to memref dialect.

ftynse accepted this revision.Jun 11 2021, 4:12 AM

ftynse added inline comments.

mlir/include/mlir/Dialect/MemRef/IR/MemRefOps.td
213	Here and below `std` prefix needs to be updated to `memref`.

This revision is now accepted and ready to land.Jun 11 2021, 4:12 AM

Harbormaster completed remote builds in B108774: Diff 351394.Jun 11 2021, 4:29 AM

bondhugula added a subscriber: bondhugula.Jun 11 2021, 4:47 AM

bondhugula added inline comments.

mlir/include/mlir/Dialect/MemRef/IR/MemRefOps.td
260	be to -> to be
262	It is customary to include an example here showing the syntax.
mlir/lib/Conversion/StandardToLLVM/StandardToLLVM.cpp
2032–2037	You don't need the `mlir::` here I think.
mlir/lib/Dialect/MemRef/IR/MemRefOps.cpp
260	Typo: Parse

Fix doc strings and typos

Harbormaster completed remote builds in B108797: Diff 351427.Jun 11 2021, 6:59 AM

Closed by commit rGfdc0d4360b4e: Introduce alloca_scope op (authored by shabalin, committed by ftynse). · Explain WhyJun 11 2021, 10:28 AM

This revision was automatically updated to reflect the committed changes.

ftynse added a commit: rGfdc0d4360b4e: Introduce alloca_scope op.

Diff 351493

mlir/include/mlir/Dialect/MemRef/IR/MemRef.h

	//===- MemRef.h - MemRef dialect --------------------------------- C++ --===//			//===- MemRef.h - MemRef dialect --------------------------------- C++ --===//
	//			//
	// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.			// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
	// See https://llvm.org/LICENSE.txt for license information.			// See https://llvm.org/LICENSE.txt for license information.
	// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception			// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	#ifndef MLIR_DIALECT_MEMREF_IR_MEMREF_H_			#ifndef MLIR_DIALECT_MEMREF_IR_MEMREF_H_
	#define MLIR_DIALECT_MEMREF_IR_MEMREF_H_			#define MLIR_DIALECT_MEMREF_IR_MEMREF_H_

	#include "mlir/IR/Dialect.h"			#include "mlir/IR/Dialect.h"
	#include "mlir/Interfaces/CallInterfaces.h"			#include "mlir/Interfaces/CallInterfaces.h"
	#include "mlir/Interfaces/CastInterfaces.h"			#include "mlir/Interfaces/CastInterfaces.h"
				#include "mlir/Interfaces/ControlFlowInterfaces.h"
	#include "mlir/Interfaces/CopyOpInterface.h"			#include "mlir/Interfaces/CopyOpInterface.h"
	#include "mlir/Interfaces/SideEffectInterfaces.h"			#include "mlir/Interfaces/SideEffectInterfaces.h"
	#include "mlir/Interfaces/ViewLikeInterface.h"			#include "mlir/Interfaces/ViewLikeInterface.h"

	namespace mlir {			namespace mlir {

	class Location;			class Location;
	class OpBuilder;			class OpBuilder;
	▲ Show 20 Lines • Show All 228 Lines • Show Last 20 Lines

mlir/include/mlir/Dialect/MemRef/IR/MemRefOps.td

//===- MemRefOps.td - MemRef op definitions ----------------- tablegen --===//		//===- MemRefOps.td - MemRef op definitions ----------------- tablegen --===//
//		//
// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.		// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
// See https://llvm.org/LICENSE.txt for license information.		// See https://llvm.org/LICENSE.txt for license information.
// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception		// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#ifndef MEMREF_OPS		#ifndef MEMREF_OPS
#define MEMREF_OPS		#define MEMREF_OPS

		include "mlir/Interfaces/ControlFlowInterfaces.td"
include "mlir/Dialect/MemRef/IR/MemRefBase.td"		include "mlir/Dialect/MemRef/IR/MemRefBase.td"
include "mlir/IR/OpBase.td"		include "mlir/IR/OpBase.td"
include "mlir/Interfaces/CastInterfaces.td"		include "mlir/Interfaces/CastInterfaces.td"
include "mlir/Interfaces/CopyOpInterface.td"		include "mlir/Interfaces/CopyOpInterface.td"
include "mlir/Interfaces/SideEffectInterfaces.td"		include "mlir/Interfaces/SideEffectInterfaces.td"
include "mlir/Interfaces/ViewLikeInterface.td"		include "mlir/Interfaces/ViewLikeInterface.td"
include "mlir/IR/SymbolInterfaces.td"		include "mlir/IR/SymbolInterfaces.td"

▲ Show 20 Lines • Show All 174 Lines • ▼ Show 20 Lines	let description = [{
by subsequent load and store operations. An optional alignment attribute, if		by subsequent load and store operations. An optional alignment attribute, if
specified, guarantees alignment at least to that boundary. If not specified,		specified, guarantees alignment at least to that boundary. If not specified,
an alignment on any convenient boundary compatible with the type will be		an alignment on any convenient boundary compatible with the type will be
chosen.		chosen.
}];		}];
}		}

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
		// AllocaScopeOp
		//===----------------------------------------------------------------------===//

		def MemRef_AllocaScopeOp : MemRef_Op<"alloca_scope",
		[DeclareOpInterfaceMethods<RegionBranchOpInterface>,
		SingleBlockImplicitTerminator<"AllocaScopeReturnOp">,
		RecursiveSideEffects,
		NoRegionArguments]> {
		let summary = "explicitly delimited scope for stack allocation";
		let description = [{
		The `memref.alloca_scope` operation represents an explicitly-delimited
		ftynseUnsubmitted Done Reply Inline Actions Here and below `std` prefix needs to be updated to `memref`. ftynse: Here and below `std` prefix needs to be updated to `memref`.
		scope for the alloca allocations. Any `memref.alloca` operations that are
		used within this scope are going to be cleaned up automatically once
		the control-flow exits the nested region. For example:

		```mlir
		memref.alloca_scope {
		%myalloca = memref.alloca(): memref<4x3xf32>
		...
		}
		```

		Here, `%myalloca` memref is valid within the explicitly delimited scope
		and is automatically deallocated at the end of the given region.

		`memref.alloca_scope` may also return results that are defined in the nested
		region. To return a value, one should use `memref.alloca_scope.return`
		operation:

		```mlir
		%result = memref.alloca_scope {
		...
		memref.alloca_scope.return %value
		}
		```

		If `memref.alloca_scope` returns no value, the `memref.alloca_scope.return ` can
		be left out, and will be inserted implicitly.
		}];

		let results = (outs Variadic<AnyType>:$results);
		let regions = (region SizedRegion<1>:$bodyRegion);
		}

		//===----------------------------------------------------------------------===//
		// AllocaScopeReturnOp
		//===----------------------------------------------------------------------===//

		def MemRef_AllocaScopeReturnOp : MemRef_Op<"alloca_scope.return",
		[HasParent<"AllocaScopeOp">,
		NoSideEffect,
		ReturnLike,
		Terminator]> {
		let summary = "terminator for alloca_scope operation";
		let description = [{
		`memref.alloca_scope.return` operation returns zero or more SSA values
		from the region within `memref.alloca_scope`. If no values are returned,
		the return operation may be omitted. Otherwise, it has to be present
		bondhugulaUnsubmitted Done Reply Inline Actions be to -> to be bondhugula: be to -> to be
		to indicate which values are going to be returned. For example:

		bondhugulaUnsubmitted Done Reply Inline Actions It is customary to include an example here showing the syntax. bondhugula: It is customary to include an example here showing the syntax.
		```mlir
		memref.alloca_scope.return %value
		```
		}];

		let arguments = (ins Variadic<AnyType>:$results);
		let builders = [OpBuilder<(ins), [{ /nothing to do / }]>];

		let assemblyFormat =
		[{ attr-dict ($results^ `:` type($results))? }];

		// No custom verification needed.
		let verifier = ?;
		}



		//===----------------------------------------------------------------------===//
// BufferCastOp		// BufferCastOp
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

def MemRef_BufferCastOp : MemRef_Op<"buffer_cast",		def MemRef_BufferCastOp : MemRef_Op<"buffer_cast",
[SameOperandsAndResultShape, SameOperandsAndResultElementType, NoSideEffect,		[SameOperandsAndResultShape, SameOperandsAndResultElementType, NoSideEffect,
TypesMatchWith<"type of 'tensor' is the tensor equivalent of 'memref'",		TypesMatchWith<"type of 'tensor' is the tensor equivalent of 'memref'",
"memref", "tensor",		"memref", "tensor",
"getTensorTypeFromMemRefType($_self)">]> {		"getTensorTypeFromMemRefType($_self)">]> {
▲ Show 20 Lines • Show All 1,098 Lines • Show Last 20 Lines

mlir/lib/Conversion/StandardToLLVM/StandardToLLVM.cpp

Show First 20 Lines • Show All 2,022 Lines • ▼ Show 20 Lines	std::tuple<Value, Value> allocateBuffer(ConversionPatternRewriter &rewriter,
auto allocatedElementPtr = rewriter.create<LLVM::AllocaOp>(		auto allocatedElementPtr = rewriter.create<LLVM::AllocaOp>(
loc, elementPtrType, sizeBytes,		loc, elementPtrType, sizeBytes,
allocaOp.alignment() ? *allocaOp.alignment() : 0);		allocaOp.alignment() ? *allocaOp.alignment() : 0);

return std::make_tuple(allocatedElementPtr, allocatedElementPtr);		return std::make_tuple(allocatedElementPtr, allocatedElementPtr);
}		}
};		};

		struct AllocaScopeOpLowering
		: public ConvertOpToLLVMPattern<memref::AllocaScopeOp> {
		using ConvertOpToLLVMPattern<memref::AllocaScopeOp>::ConvertOpToLLVMPattern;

		LogicalResult
		matchAndRewrite(memref::AllocaScopeOp allocaScopeOp, ArrayRef<Value> operands,
		ConversionPatternRewriter &rewriter) const override {
		bondhugulaUnsubmitted Done Reply Inline Actions You don't need the `mlir::` here I think. bondhugula: You don't need the `mlir::` here I think.
		OpBuilder::InsertionGuard guard(rewriter);
		Location loc = allocaScopeOp.getLoc();

		// Split the current block before the AllocaScopeOp to create the inlining
		// point.
		auto *currentBlock = rewriter.getInsertionBlock();
		auto *remainingOpsBlock =
		rewriter.splitBlock(currentBlock, rewriter.getInsertionPoint());
		Block *continueBlock;
		if (allocaScopeOp.getNumResults() == 0) {
		continueBlock = remainingOpsBlock;
		} else {
		continueBlock = rewriter.createBlock(remainingOpsBlock,
		allocaScopeOp.getResultTypes());
		rewriter.create<BranchOp>(loc, remainingOpsBlock);
		}

		// Inline body region.
		Block *beforeBody = &allocaScopeOp.bodyRegion().front();
		ftynseUnsubmitted Not Done Reply Inline Actions Currently, only one block is allowed so beforeBody==afterBody always. I think we should relax that and allow an arbitrary CFG in the body. In which case, we probably want to handle multiple blocks terminated with `alloca_scope.return`. ftynse: Currently, only one block is allowed so beforeBody==afterBody always. I think we should relax…
		Block *afterBody = &allocaScopeOp.bodyRegion().back();
		rewriter.inlineRegionBefore(allocaScopeOp.bodyRegion(), continueBlock);

		// Save stack and then branch into the body of the region.
		rewriter.setInsertionPointToEnd(currentBlock);
		auto stackSaveOp =
		rewriter.create<LLVM::StackSaveOp>(loc, getVoidPtrType());
		rewriter.create<BranchOp>(loc, beforeBody);

		// Replace the alloca_scope return with a branch that jumps out of the body.
		// Stack restore before leaving the body region.
		rewriter.setInsertionPointToEnd(afterBody);
		auto returnOp =
		cast<memref::AllocaScopeReturnOp>(afterBody->getTerminator());
		auto branchOp = rewriter.replaceOpWithNewOp<BranchOp>(
		returnOp, continueBlock, returnOp.results());

		// Insert stack restore before jumping out the body of the region.
		rewriter.setInsertionPoint(branchOp);
		rewriter.create<LLVM::StackRestoreOp>(loc, stackSaveOp);

		// Replace the op with values return from the body region.
		rewriter.replaceOp(allocaScopeOp, continueBlock->getArguments());

		return success();
		}
		};

/// Copies the shaped descriptor part to (if `toDynamic` is set) or from		/// Copies the shaped descriptor part to (if `toDynamic` is set) or from
/// (otherwise) the dynamically allocated memory for any operands that were		/// (otherwise) the dynamically allocated memory for any operands that were
/// unranked descriptors originally.		/// unranked descriptors originally.
static LogicalResult copyUnrankedDescriptors(OpBuilder &builder, Location loc,		static LogicalResult copyUnrankedDescriptors(OpBuilder &builder, Location loc,
LLVMTypeConverter &typeConverter,		LLVMTypeConverter &typeConverter,
TypeRange origTypes,		TypeRange origTypes,
SmallVectorImpl<Value> &operands,		SmallVectorImpl<Value> &operands,
bool toDynamic) {		bool toDynamic) {
▲ Show 20 Lines • Show All 1,841 Lines • ▼ Show 20 Lines	void mlir::populateStdToLLVMNonMemoryConversionPatterns(
LLVMTypeConverter &converter, RewritePatternSet &patterns) {		LLVMTypeConverter &converter, RewritePatternSet &patterns) {
// FIXME: this should be tablegen'ed		// FIXME: this should be tablegen'ed
// clang-format off		// clang-format off
patterns.add<		patterns.add<
AbsFOpLowering,		AbsFOpLowering,
AddFOpLowering,		AddFOpLowering,
AddIOpLowering,		AddIOpLowering,
AllocaOpLowering,		AllocaOpLowering,
		AllocaScopeOpLowering,
AndOpLowering,		AndOpLowering,
AssertOpLowering,		AssertOpLowering,
AtomicRMWOpLowering,		AtomicRMWOpLowering,
BranchOpLowering,		BranchOpLowering,
CallIndirectOpLowering,		CallIndirectOpLowering,
CallOpLowering,		CallOpLowering,
CeilFOpLowering,		CeilFOpLowering,
CmpFOpLowering,		CmpFOpLowering,
▲ Show 20 Lines • Show All 303 Lines • Show Last 20 Lines

mlir/lib/Dialect/MemRef/IR/MemRefOps.cpp

	Show First 20 Lines • Show All 224 Lines • ▼ Show 20 Lines

	void AllocaOp::getCanonicalizationPatterns(RewritePatternSet &results,			void AllocaOp::getCanonicalizationPatterns(RewritePatternSet &results,
	MLIRContext *context) {			MLIRContext *context) {
	results.add<SimplifyAllocConst<AllocaOp>, SimplifyDeadAlloc<AllocaOp>>(			results.add<SimplifyAllocConst<AllocaOp>, SimplifyDeadAlloc<AllocaOp>>(
	context);			context);
	}			}

	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//
				// AllocaScopeOp
				//===----------------------------------------------------------------------===//

				static void print(OpAsmPrinter &p, AllocaScopeOp &op) {
				bool printBlockTerminators = false;

				p << AllocaScopeOp::getOperationName() << " ";
				if (!op.results().empty()) {
				p << " -> (" << op.getResultTypes() << ")";
				printBlockTerminators = true;
				}
				p.printRegion(op.bodyRegion(),
				/printEntryBlockArgs=/false,
				/printBlockTerminators=/printBlockTerminators);
				p.printOptionalAttrDict(op->getAttrs());
				}

				static ParseResult parseAllocaScopeOp(OpAsmParser &parser,
				OperationState &result) {
				// Create a region for the body.
				result.regions.reserve(1);
				Region *bodyRegion = result.addRegion();

				// Parse optional results type list.
				if (parser.parseOptionalArrowTypeList(result.types))
				return failure();

				// Parse the body region.
				bondhugulaUnsubmitted Done Reply Inline Actions Typo: Parse bondhugula: Typo: Parse
				if (parser.parseRegion(bodyRegion, /arguments=/{}, /argTypes=*/{}))
				return failure();
				AllocaScopeOp::ensureTerminator(*bodyRegion, parser.getBuilder(),
				result.location);

				// Parse the optional attribute list.
				if (parser.parseOptionalAttrDict(result.attributes))
				return failure();

				return success();
				}

				static LogicalResult verify(AllocaScopeOp op) {
				if (failed(RegionBranchOpInterface::verifyTypes(op)))
				return failure();

				return success();
				}

				void AllocaScopeOp::getSuccessorRegions(
				Optional<unsigned> index, ArrayRef<Attribute> operands,
				SmallVectorImpl<RegionSuccessor> &regions) {
				if (index.hasValue()) {
				regions.push_back(RegionSuccessor(getResults()));
				return;
				}

				regions.push_back(RegionSuccessor(&bodyRegion()));
				}

				//===----------------------------------------------------------------------===//
	// AssumeAlignmentOp			// AssumeAlignmentOp
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	static LogicalResult verify(AssumeAlignmentOp op) {			static LogicalResult verify(AssumeAlignmentOp op) {
	unsigned alignment = op.alignment();			unsigned alignment = op.alignment();
	if (!llvm::isPowerOf2_32(alignment))			if (!llvm::isPowerOf2_32(alignment))
	return op.emitOpError("alignment must be power of 2");			return op.emitOpError("alignment must be power of 2");
	return success();			return success();
	▲ Show 20 Lines • Show All 2,015 Lines • Show Last 20 Lines

mlir/test/Conversion/StandardToLLVM/convert-alloca-scope.mlir

This file was added.

				// RUN: mlir-opt -convert-std-to-llvm %s \| FileCheck %s

				// CHECK-LABEL: llvm.func @empty
				func @empty() {
				// CHECK: llvm.intr.stacksave
				// CHECK: llvm.br
				ftynseUnsubmitted Not Done Reply Inline Actions Nit: we indent `// CHECK` lines ftynse: Nit: we indent `// CHECK` lines
				memref.alloca_scope {
				memref.alloca_scope.return
				}
				// CHECK: llvm.intr.stackrestore
				// CHECK: llvm.br
				// CHECK: llvm.return
				return
				}

				// CHECK-LABEL: llvm.func @returns_nothing
				func @returns_nothing(%b: f32) {
				%a = constant 10.0 : f32
				// CHECK: llvm.intr.stacksave
				memref.alloca_scope {
				%c = std.addf %a, %b : f32
				memref.alloca_scope.return
				}
				// CHECK: llvm.intr.stackrestore
				// CHECK: llvm.return
				return
				}

				// CHECK-LABEL: llvm.func @returns_one_value
				func @returns_one_value(%b: f32) -> f32 {
				%a = constant 10.0 : f32
				// CHECK: llvm.intr.stacksave
				%result = memref.alloca_scope -> f32 {
				%c = std.addf %a, %b : f32
				memref.alloca_scope.return %c: f32
				}
				// CHECK: llvm.intr.stackrestore
				// CHECK: llvm.return
				return %result : f32
				}

				// CHECK-LABEL: llvm.func @returns_multiple_values
				func @returns_multiple_values(%b: f32) -> f32 {
				%a = constant 10.0 : f32
				// CHECK: llvm.intr.stacksave
				%result1, %result2 = memref.alloca_scope -> (f32, f32) {
				%c = std.addf %a, %b : f32
				%d = std.subf %a, %b : f32
				memref.alloca_scope.return %c, %d: f32, f32
				}
				// CHECK: llvm.intr.stackrestore
				// CHECK: llvm.return
				%result = std.addf %result1, %result2 : f32
				return %result : f32
				}

mlir/test/Dialect/MemRef/ops.mlir

	Show First 20 Lines • Show All 70 Lines • ▼ Show 20 Lines

	// CHECK-LABEL: func @memref_dealloc			// CHECK-LABEL: func @memref_dealloc
	func @memref_dealloc() {			func @memref_dealloc() {
	%0 = memref.alloc() : memref<2xf32>			%0 = memref.alloc() : memref<2xf32>
	%1 = memref.cast %0 : memref<2xf32> to memref<*xf32>			%1 = memref.cast %0 : memref<2xf32> to memref<*xf32>
	memref.dealloc %1 : memref<*xf32>			memref.dealloc %1 : memref<*xf32>
	return			return
	}			}


				// CHECK-LABEL: func @memref_alloca_scope
				func @memref_alloca_scope() {
				memref.alloca_scope {
				memref.alloca_scope.return
				}
				return
				}

This is an archive of the discontinued LLVM Phabricator instance.

Introduce alloca_scope op
ClosedPublic

Details

Introduction

Motivation

Proposal

Which dialect

Implementation

Diff Detail

Event Timeline

Revision Contents

Diff 351493

mlir/include/mlir/Dialect/MemRef/IR/MemRef.h

mlir/include/mlir/Dialect/MemRef/IR/MemRefOps.td

mlir/lib/Conversion/StandardToLLVM/StandardToLLVM.cpp

mlir/lib/Dialect/MemRef/IR/MemRefOps.cpp

mlir/test/Conversion/StandardToLLVM/convert-alloca-scope.mlir

mlir/test/Dialect/MemRef/ops.mlir

This is an archive of the discontinued LLVM Phabricator instance.

Introduce alloca_scope opClosedPublic

Details

Introduction

Motivation

Proposal

Which dialect

Implementation

Diff Detail

Event Timeline

Revision Contents

Diff 351493

mlir/include/mlir/Dialect/MemRef/IR/MemRef.h

mlir/include/mlir/Dialect/MemRef/IR/MemRefOps.td

mlir/lib/Conversion/StandardToLLVM/StandardToLLVM.cpp

mlir/lib/Dialect/MemRef/IR/MemRefOps.cpp

mlir/test/Conversion/StandardToLLVM/convert-alloca-scope.mlir

mlir/test/Dialect/MemRef/ops.mlir

Introduce alloca_scope op
ClosedPublic