This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
mlir/
-
include/mlir/
-
mlir/
-
Dialect/
-
ArmSME/
-
CMakeLists.txt
-
Transforms/
-
CMakeLists.txt
-
Passes.h
-
Passes.td
1/1
CMakeLists.txt
-
InitAllPasses.h
-
lib/Dialect/
-
Dialect/
-
ArmSME/
-
CMakeLists.txt
-
Transforms/
-
CMakeLists.txt
-
EnableArmStreaming.cpp
1/1
CMakeLists.txt
-
test/Dialect/ArmSME/
-
Dialect/
-
ArmSME/
-
enable-arm-streaming.mlir

Differential D150934

[mlir] Add pass to enable Armv9 Streaming SVE mode
ClosedPublic

Authored by c-rhodes on May 18 2023, 11:26 PM.

Download Raw Diff

Details

Reviewers

awarzynski
dcaballe
WanderAway
nicolasvasilache

Commits

rG12648492998b: [mlir] Add pass to enable Armv9 Streaming SVE mode

Summary

This patch adds a pass 'enable-arm-streaming' that enables the Armv9
Scalable Matrix Extension (SME) Streaming SVE (SSVE) mode [1] by adding
either of the following attributes to 'func.func' ops:

arm_streaming (default)
arm_locally_streaming

PATCH [2 / 2] in series for RFC: https://discourse.llvm.org/t/rfc-supporting-armv9-scalable-matrix-extension-sme-streaming-sve-ssve-mode-in-mlir/70678

[1] https://developer.arm.com/documentation/ddi0616/aa

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

c-rhodes created this revision.May 18 2023, 11:26 PM

Herald added a project: Restricted Project. · View Herald TranscriptMay 18 2023, 11:26 PM

Herald added subscribers: bviyer, Moerafaat, zero9178 and 24 others. · View Herald Transcript

c-rhodes requested review of this revision.May 18 2023, 11:26 PM

Herald added a reviewer: nicolasvasilache. · View Herald TranscriptMay 18 2023, 11:26 PM

Herald added a project: Restricted Project. · View Herald Transcript

Herald added subscribers: alextsao1999, stephenneuendorffer, nicolasvasilache. · View Herald Transcript

c-rhodes added a parent revision: D150933: [mlir][func] refactor namespace to support passing options to passes.May 18 2023, 11:26 PM

c-rhodes added a parent revision: D150932: [mlir][llvm] Add arm_streaming LLVM function attributes.

Harbormaster completed remote builds in B233092: Diff 523665.May 18 2023, 11:43 PM

awarzynski mentioned this in D150933: [mlir][func] refactor namespace to support passing options to passes.May 19 2023, 7:16 AM

Great stuff, thank you! LGTM

This revision is now accepted and ready to land.May 19 2023, 7:21 AM

Updated namespace to remove dependent pass.

Harbormaster completed remote builds in B233184: Diff 523783.May 19 2023, 8:12 AM

Matt added a subscriber: Matt.May 19 2023, 8:24 AM

I'll land this tomorrow unless there's any further comments by then.

Thanks for moving forward with the implementation! I think we should move this outside of the Func dialect. Since an SME dialect is in the horizon, I would suggest that we move the implementation to Dialect/SME/Transforms and the lit tests to similar Dialect/SME folders.
We currently don't have a good place for target specific transformations without a dialect and introducing something like {lib/include}/Target/... would probably need much more discussion.

This revision now requires changes to proceed.May 23 2023, 5:58 PM

Move to ArmSME dialect.

In D150934#4366544, @dcaballe wrote:

Thanks for moving forward with the implementation! I think we should move this outside of the Func dialect. Since an SME dialect is in the horizon, I would suggest that we move the implementation to Dialect/SME/Transforms and the lit tests to similar Dialect/SME folders.

Thanks for reviewing. I've moved the implementation to a minimal SME dialect.

We currently don't have a good place for target specific transformations without a dialect and introducing something like {lib/include}/Target/... would probably need much more discussion.

I think a single dialect for each target (e.g. Arm, x86, ...) much like target in LLVM backend would be better than having dialects for each target feature. There's a fair amount of boilerplate in these dialects.

Harbormaster completed remote builds in B234145: Diff 525100.May 24 2023, 4:21 AM

LGTM, thanks!

I think a single dialect for each target (e.g. Arm, x86, ...) much like target in LLVM backend would be better than having dialects for each target feature. There's a fair amount of boilerplate in these dialects.

That's fair. Others can correct me if I'm wrong but I think the current "trend" in MLIR (as this is something that has been changing over time) leans more towards more minimalist dialects with a small and well-defined scopes (e.g., 'func' dialect). I think this is, to some extent, motivated by the infra as, for example, registering ops that are not used will have a negative impact in compile time. We also have mechanisms to make a dialect legal/illegal after a conversion which I think also fosters this kind of small logical grouping (e.g., it would be tedious if we had a single ARM dialect and we had to mark as legal/illegal a large part of it, one instruction at a time). There are also clear exceptions to this, such as the Vector dialect, where we have multiple layers of abstraction altogether. I think the idea of "sub-dialects" or some kind of logical grouping within a dialect was discussed at some point. That would also make sense for target-like dialects and their features. At the end of the day, all of this is flexible and it mostly depends on how we use the operations in practice.

mlir/include/mlir/Dialect/CMakeLists.txt
8	alphabetically sorted?
mlir/lib/Dialect/CMakeLists.txt
7	same

This revision is now accepted and ready to land.May 24 2023, 9:10 AM

This revision was landed with ongoing or failed builds.May 25 2023, 2:22 AM

Closed by commit rG12648492998b: [mlir] Add pass to enable Armv9 Streaming SVE mode (authored by c-rhodes). · Explain Why

This revision was automatically updated to reflect the committed changes.

c-rhodes marked 2 inline comments as done.

c-rhodes added a commit: rG12648492998b: [mlir] Add pass to enable Armv9 Streaming SVE mode.

In D150934#4368724, @dcaballe wrote:

LGTM, thanks!

I think a single dialect for each target (e.g. Arm, x86, ...) much like target in LLVM backend would be better than having dialects for each target feature. There's a fair amount of boilerplate in these dialects.

That's fair. Others can correct me if I'm wrong but I think the current "trend" in MLIR (as this is something that has been changing over time) leans more towards more minimalist dialects with a small and well-defined scopes (e.g., 'func' dialect). I think this is, to some extent, motivated by the infra as, for example, registering ops that are not used will have a negative impact in compile time. We also have mechanisms to make a dialect legal/illegal after a conversion which I think also fosters this kind of small logical grouping (e.g., it would be tedious if we had a single ARM dialect and we had to mark as legal/illegal a large part of it, one instruction at a time). There are also clear exceptions to this, such as the Vector dialect, where we have multiple layers of abstraction altogether. I think the idea of "sub-dialects" or some kind of logical grouping within a dialect was discussed at some point. That would also make sense for target-like dialects and their features. At the end of the day, all of this is flexible and it mostly depends on how we use the operations in practice.

Ah ok, I wasn't aware of most of that, thanks for explaining.

Revision Contents

Path

Size

mlir/

include/

mlir/

Dialect/

ArmSME/

CMakeLists.txt

1 line

Transforms/

5 lines

43 lines

40 lines

1 line

2 lines

lib/

Dialect/

ArmSME/

CMakeLists.txt

1 line

Transforms/

CMakeLists.txt

13 lines

EnableArmStreaming.cpp

75 lines

CMakeLists.txt

1 line

test/

Dialect/

ArmSME/

enable-arm-streaming.mlir

8 lines

Diff 525500

mlir/include/mlir/Dialect/ArmSME/CMakeLists.txt

This file was added.

add_subdirectory(Transforms)

mlir/include/mlir/Dialect/ArmSME/Transforms/CMakeLists.txt

This file was added.

				set(LLVM_TARGET_DEFINITIONS Passes.td)
				mlir_tablegen(Passes.h.inc -gen-pass-decls -name ArmSME)
				add_public_tablegen_target(MLIRArmSMETransformsIncGen)

				add_mlir_doc(Passes ArmSMEPasses ./ -gen-pass-doc)

mlir/include/mlir/Dialect/ArmSME/Transforms/Passes.h

This file was added.

				//===- Passes.h - Pass Entrypoints ------------------------------- C++ --===//
				//
				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
				//
				//===----------------------------------------------------------------------===//

				#ifndef MLIR_DIALECT_ARMSME_TRANSFORMS_PASSES_H
				#define MLIR_DIALECT_ARMSME_TRANSFORMS_PASSES_H

				#include "mlir/Pass/Pass.h"

				namespace mlir {

				class RewritePatternSet;

				namespace arm_sme {
				// Options for Armv9 Streaming SVE mode. By default, streaming-mode is part of
				// the function interface (ABI) and the caller manages PSTATE.SM on entry/exit.
				// In a locally streaming function PSTATE.SM is kept internal and the callee
				// manages it on entry/exit.
				enum class ArmStreaming { Default = 0, Locally = 1 };

				#define GEN_PASS_DECL
				#include "mlir/Dialect/ArmSME/Transforms/Passes.h.inc"

				/// Pass to enable Armv9 Streaming SVE mode.
				std::unique_ptr<Pass>
				createEnableArmStreamingPass(const ArmStreaming mode = ArmStreaming::Default);

				//===----------------------------------------------------------------------===//
				// Registration
				//===----------------------------------------------------------------------===//

				/// Generate the code for registering passes.
				#define GEN_PASS_REGISTRATION
				#include "mlir/Dialect/ArmSME/Transforms/Passes.h.inc"

				} // namespace arm_sme
				} // namespace mlir

				#endif // MLIR_DIALECT_ARMSME_TRANSFORMS_PASSES_H

mlir/include/mlir/Dialect/ArmSME/Transforms/Passes.td

This file was added.

				//===-- Passes.td - ArmSME pass definition file ------------- tablegen --===//
				//
				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
				//
				//===----------------------------------------------------------------------===//

				#ifndef MLIR_DIALECT_ARMSME_TRANSFORMS_PASSES_TD
				#define MLIR_DIALECT_ARMSME_TRANSFORMS_PASSES_TD

				include "mlir/Pass/PassBase.td"

				def EnableArmStreaming
				: Pass<"enable-arm-streaming", "mlir::func::FuncOp"> {
				let summary = "Enable Armv9 Streaming SVE mode";
				let description = [{
				Enables the Armv9 Streaming SVE mode [1] for func.func ops by annotating
				them with attributes. See options for more details.

				[1] https://developer.arm.com/documentation/ddi0616/aa
				}];
				let constructor = "mlir::arm_sme::createEnableArmStreamingPass()";
				let options = [
				Option<"mode", "mode", "mlir::arm_sme::ArmStreaming",
				/default=/"mlir::arm_sme::ArmStreaming::Default",
				"Select how streaming-mode is managed at the function-level.",
				[{::llvm::cl::values(
				clEnumValN(mlir::arm_sme::ArmStreaming::Default, "default",
				"Streaming mode is part of the function interface "
				"(ABI), caller manages PSTATE.SM on entry/exit."),
				clEnumValN(mlir::arm_sme::ArmStreaming::Locally, "locally",
				"Streaming mode is internal to the function, callee "
				"manages PSTATE.SM on entry/exit.")
				)}]>,
				];
				let dependentDialects = ["func::FuncDialect"];
				}

				#endif // MLIR_DIALECT_ARMSME_TRANSFORMS_PASSES_TD

mlir/include/mlir/Dialect/CMakeLists.txt

	add_subdirectory(AMDGPU)			add_subdirectory(AMDGPU)
	add_subdirectory(AMX)			add_subdirectory(AMX)
	add_subdirectory(Affine)			add_subdirectory(Affine)
	add_subdirectory(Arith)			add_subdirectory(Arith)
	add_subdirectory(ArmNeon)			add_subdirectory(ArmNeon)
				add_subdirectory(ArmSME)
	add_subdirectory(ArmSVE)			add_subdirectory(ArmSVE)
	add_subdirectory(Async)			add_subdirectory(Async)
				dcaballeUnsubmitted Done Reply Inline Actions alphabetically sorted? dcaballe: alphabetically sorted?
	add_subdirectory(Bufferization)			add_subdirectory(Bufferization)
	add_subdirectory(Complex)			add_subdirectory(Complex)
	add_subdirectory(ControlFlow)			add_subdirectory(ControlFlow)
	add_subdirectory(DLTI)			add_subdirectory(DLTI)
	add_subdirectory(EmitC)			add_subdirectory(EmitC)
	add_subdirectory(Func)			add_subdirectory(Func)
	add_subdirectory(GPU)			add_subdirectory(GPU)
	add_subdirectory(Index)			add_subdirectory(Index)
	Show All 22 Lines

mlir/include/mlir/InitAllPasses.h

Show All 12 Lines

#ifndef MLIR_INITALLPASSES_H_		#ifndef MLIR_INITALLPASSES_H_
#define MLIR_INITALLPASSES_H_		#define MLIR_INITALLPASSES_H_

#include "mlir/Conversion/Passes.h"		#include "mlir/Conversion/Passes.h"
#include "mlir/Dialect/AMDGPU/Transforms/Passes.h"		#include "mlir/Dialect/AMDGPU/Transforms/Passes.h"
#include "mlir/Dialect/Affine/Passes.h"		#include "mlir/Dialect/Affine/Passes.h"
#include "mlir/Dialect/Arith/Transforms/Passes.h"		#include "mlir/Dialect/Arith/Transforms/Passes.h"
		#include "mlir/Dialect/ArmSME/Transforms/Passes.h"
#include "mlir/Dialect/Async/Passes.h"		#include "mlir/Dialect/Async/Passes.h"
#include "mlir/Dialect/Bufferization/Transforms/Passes.h"		#include "mlir/Dialect/Bufferization/Transforms/Passes.h"
#include "mlir/Dialect/Func/Transforms/Passes.h"		#include "mlir/Dialect/Func/Transforms/Passes.h"
#include "mlir/Dialect/GPU/Transforms/Passes.h"		#include "mlir/Dialect/GPU/Transforms/Passes.h"
#include "mlir/Dialect/LLVMIR/Transforms/Passes.h"		#include "mlir/Dialect/LLVMIR/Transforms/Passes.h"
#include "mlir/Dialect/Linalg/Passes.h"		#include "mlir/Dialect/Linalg/Passes.h"
#include "mlir/Dialect/MemRef/Transforms/Passes.h"		#include "mlir/Dialect/MemRef/Transforms/Passes.h"
#include "mlir/Dialect/NVGPU/Passes.h"		#include "mlir/Dialect/NVGPU/Passes.h"
▲ Show 20 Lines • Show All 43 Lines • ▼ Show 20 Lines	inline void registerAllPasses() {
memref::registerMemRefPasses();		memref::registerMemRefPasses();
registerSCFPasses();		registerSCFPasses();
registerShapePasses();		registerShapePasses();
spirv::registerSPIRVPasses();		spirv::registerSPIRVPasses();
tensor::registerTensorPasses();		tensor::registerTensorPasses();
tosa::registerTosaOptPasses();		tosa::registerTosaOptPasses();
transform::registerTransformPasses();		transform::registerTransformPasses();
vector::registerVectorPasses();		vector::registerVectorPasses();
		arm_sme::registerArmSMEPasses();

// Dialect pipelines		// Dialect pipelines
sparse_tensor::registerSparseTensorPipelines();		sparse_tensor::registerSparseTensorPipelines();
}		}

} // namespace mlir		} // namespace mlir

#endif // MLIR_INITALLPASSES_H_		#endif // MLIR_INITALLPASSES_H_

mlir/lib/Dialect/ArmSME/CMakeLists.txt

This file was added.

add_subdirectory(Transforms)

mlir/lib/Dialect/ArmSME/Transforms/CMakeLists.txt

This file was added.

				add_mlir_dialect_library(MLIRArmSMETransforms
				EnableArmStreaming.cpp

				ADDITIONAL_HEADER_DIRS
				${MLIR_MAIN_INCLUDE_DIR}/mlir/Dialect/ArmSME/Transforms

				DEPENDS
				MLIRArmSMETransformsIncGen

				LINK_LIBS PUBLIC
				MLIRFuncDialect
				MLIRPass
				)

mlir/lib/Dialect/ArmSME/Transforms/EnableArmStreaming.cpp

This file was added.

				//===- EnableArmStreaming.cpp - Enable Armv9 Streaming SVE mode -----------===//
				//
				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
				//
				//===----------------------------------------------------------------------===//
				//
				// This pass enables the Armv9 Scalable Matrix Extension (SME) Streaming SVE
				// (SSVE) mode [1][2] by adding either of the following attributes to
				// 'func.func' ops:
				//
				// * 'arm_streaming' (default)
				// * 'arm_locally_streaming'
				//
				// Streaming-mode is part of the interface (ABI) for functions with the
				// first attribute and it's the responsibility of the caller to manage
				// PSTATE.SM on entry/exit to functions with this attribute [3]. The LLVM
				// backend will emit 'smstart sm' / 'smstop sm' [4] around calls to
				// streaming functions.
				//
				// In locally streaming functions PSTATE.SM is kept internal and managed by
				// the callee on entry/exit. The LLVM backend will emit 'smstart sm' /
				// 'smstop sm' in the prologue / epilogue for functions with this
				// attribute.
				//
				// [1] https://developer.arm.com/documentation/ddi0616/aa
				// [2] https://llvm.org/docs/AArch64SME.html
				// [3] https://github.com/ARM-software/abi-aa/blob/main/aapcs64/aapcs64.rst#671pstatesm-interfaces
				// [4] https://developer.arm.com/documentation/ddi0602/2023-03/Base-Instructions/SMSTART--Enables-access-to-Streaming-SVE-mode-and-SME-architectural-state--an-alias-of-MSR--immediate--
				//
				//===----------------------------------------------------------------------===//

				#include "mlir/Dialect/ArmSME/Transforms/Passes.h"

				#include "mlir/Dialect/Func/IR/FuncOps.h"

				#define DEBUG_TYPE "enable-arm-streaming"

				namespace mlir {
				namespace arm_sme {
				#define GEN_PASS_DEF_ENABLEARMSTREAMING
				#include "mlir/Dialect/ArmSME/Transforms/Passes.h.inc"
				} // namespace arm_sme
				} // namespace mlir

				using namespace mlir;
				using namespace mlir::arm_sme;

				static constexpr char kArmStreamingAttr[] = "arm_streaming";
				static constexpr char kArmLocallyStreamingAttr[] = "arm_locally_streaming";

				namespace {
				struct EnableArmStreamingPass
				: public arm_sme::impl::EnableArmStreamingBase<EnableArmStreamingPass> {
				EnableArmStreamingPass(ArmStreaming mode) { this->mode = mode; }
				void runOnOperation() override {
				std::string attr;
				switch (mode) {
				case ArmStreaming::Default:
				attr = kArmStreamingAttr;
				break;
				case ArmStreaming::Locally:
				attr = kArmLocallyStreamingAttr;
				break;
				}
				getOperation()->setAttr(attr, UnitAttr::get(&getContext()));
				}
				};
				} // namespace

				std::unique_ptr<Pass>
				mlir::arm_sme::createEnableArmStreamingPass(const ArmStreaming mode) {
				return std::make_unique<EnableArmStreamingPass>(mode);
				}

mlir/lib/Dialect/CMakeLists.txt

	add_subdirectory(Affine)			add_subdirectory(Affine)
	add_subdirectory(AMDGPU)			add_subdirectory(AMDGPU)
	add_subdirectory(Arith)			add_subdirectory(Arith)
	add_subdirectory(ArmNeon)			add_subdirectory(ArmNeon)
				add_subdirectory(ArmSME)
	add_subdirectory(ArmSVE)			add_subdirectory(ArmSVE)
	add_subdirectory(Async)			add_subdirectory(Async)
				dcaballeUnsubmitted Done Reply Inline Actions same dcaballe: same
	add_subdirectory(AMX)			add_subdirectory(AMX)
	add_subdirectory(Bufferization)			add_subdirectory(Bufferization)
	add_subdirectory(Complex)			add_subdirectory(Complex)
	add_subdirectory(ControlFlow)			add_subdirectory(ControlFlow)
	add_subdirectory(DLTI)			add_subdirectory(DLTI)
	add_subdirectory(EmitC)			add_subdirectory(EmitC)
	add_subdirectory(Func)			add_subdirectory(Func)
	add_subdirectory(GPU)			add_subdirectory(GPU)
	Show All 37 Lines

mlir/test/Dialect/ArmSME/enable-arm-streaming.mlir

This file was added.

				// RUN: mlir-opt %s -enable-arm-streaming -verify-diagnostics \| FileCheck %s
				// RUN: mlir-opt %s -enable-arm-streaming=mode=locally -verify-diagnostics \| FileCheck %s -check-prefix=CHECK-LOCALLY

				// CHECK-LABEL: @arm_streaming
				// CHECK-SAME: attributes {arm_streaming}
				// CHECK-LOCALLY-LABEL: @arm_streaming
				// CHECK-LOCALLY-SAME: attributes {arm_locally_streaming}
				func.func @arm_streaming() { return }

This is an archive of the discontinued LLVM Phabricator instance.

[mlir] Add pass to enable Armv9 Streaming SVE modeClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 525500

mlir/include/mlir/Dialect/ArmSME/CMakeLists.txt

mlir/include/mlir/Dialect/ArmSME/Transforms/CMakeLists.txt

mlir/include/mlir/Dialect/ArmSME/Transforms/Passes.h

mlir/include/mlir/Dialect/ArmSME/Transforms/Passes.td

mlir/include/mlir/Dialect/CMakeLists.txt

mlir/include/mlir/InitAllPasses.h

mlir/lib/Dialect/ArmSME/CMakeLists.txt

mlir/lib/Dialect/ArmSME/Transforms/CMakeLists.txt

mlir/lib/Dialect/ArmSME/Transforms/EnableArmStreaming.cpp

mlir/lib/Dialect/CMakeLists.txt

mlir/test/Dialect/ArmSME/enable-arm-streaming.mlir

[mlir] Add pass to enable Armv9 Streaming SVE mode
ClosedPublic