Download Raw Diff

Details

Reviewers

ftynse
nicolasvasilache

Commits

rG20daedacca80: 2d Arm Neon sdot op, and lowering to the intrinsic.

Summary

This adds Sdot2d op, which is similar to the usual Neon
intrinsic except that it takes 2d vector operands, reflecting the
structure of the arithmetic that it's performing: 4 separate
4-dimensional dot products, whence the vector<4x4xi8> shape.

This also adds a new pass, arm-neon-2d-to-intr, lowering
this new 2d op to the 1d intrinsic.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

Benoit created this revision.May 14 2021, 8:58 AM

Herald added a reviewer: ftynse. · View Herald TranscriptMay 14 2021, 8:58 AM

Herald added subscribers: dcaballe, cota, teijeong and 18 others. · View Herald Transcript

Benoit requested review of this revision.May 14 2021, 8:58 AM

Herald added a project: Restricted Project. · View Herald TranscriptMay 14 2021, 8:58 AM

Herald added subscribers: stephenneuendorffer, nicolasvasilache. · View Herald Transcript

Benoit added a reviewer: nicolasvasilache.May 14 2021, 9:00 AM

Cool ! Accepting conditioned on these minor changes.

mlir/include/mlir/Conversion/ArmNeonStructuredToIntr/ArmNeonStructuredToIntr.h
11 ↗	(On Diff #345453)	some minor comments + spacing prefixed with `///` please
mlir/include/mlir/Dialect/ArmNeon/ArmNeon.td
127	So structured op has a specific meaning in a bunch of MLIR places related to Linalg and Vector. Could we call this MamtulSomethingOp ?
135	Not sure there is tablegen support for that. Just add it in C++ with a custom verifier ? e.g. https://github.com/llvm/llvm-project/blob/main/mlir/include/mlir/Dialect/X86Vector/X86Vector.td#L78
mlir/lib/Conversion/ArmNeonStructuredToIntr/ArmNeonStructuredToIntr.cpp
2 ↗	(On Diff #345453)	80 cols plz
24 ↗	(On Diff #345453)	Note that one of the remaining challenges is to properly propagate the shape casts you introduce here all the way to memory operations and ideally have them fold into reindexings. Add a TODO along those lines ?
29 ↗	(On Diff #345453)	There is non LLVM type here, just 1-D vector type that map 1-1 with LLVM (unlike the 2+D vectors that don't). Can you please rephrase a bit? I'd just say "convert to 1-D vector type + attribute to match the neon.intr.xxx operation requirements". Those intr. ops are the separation of concern between LLVM and MLIR.
36 ↗	(On Diff #345453)	camelcase everywhere plz

This revision is now accepted and ready to land.May 14 2021, 9:30 AM

Harbormaster completed remote builds in B104511: Diff 345453.May 14 2021, 9:41 AM

Benoit planned changes to this revision.May 14 2021, 10:15 AM

Applied review comments

This revision is now accepted and ready to land.May 14 2021, 1:20 PM

Please take a look - nontrivial enough changes that I'd like your opinion again.

mlir/include/mlir/Dialect/ArmNeon/ArmNeon.td
127	I renamed 'Structured' to '2d' everywhere. This is the best descriptive term that I could think of. 'Matmul' would be both overly specific to certain Neon intrinsic, and not even well descriptive of the present intrinsic: there are 2 forms of ARM SDOT instruction, only one of them is a 4x4x1 matmul, and the one that LLVM has as an intrinsic (which we are dealing with here) is the other one. It's not a 4x4x1 matmul, it's 4 completely independent dot products of separate 4d vectors.

There is one of your comments which I haven't applied yet. It's the one about writing a TODO about the handling of these reshapes. Part of the problem is I didn't understand 100% of what you were saying, and part is that this topic is going to get more subtle still in the following commit, adding the lowering from vector.contract, which will involve a broadcast to map the contract to the flavor of SDOT that we have here, which (as said in the other comment) is actually 4 separate vector dot products. So if you're OK, I would suggest to do nothing about this in this CL, discuss with you next week, decide something for the next commit.

Benoit retitled this revision from structured 2d Arm Neon sdot op, and lowering to the intrinsic. to 2d Arm Neon sdot op, and lowering to the intrinsic..May 14 2021, 1:30 PM

Benoit edited the summary of this revision. (Show Details)

Harbormaster completed remote builds in B104579: Diff 345541.May 14 2021, 2:22 PM

please add an invalid.mlir in the armneon dialect tests, one per verifier error that is not automatically produced by tablegen (we test those separately), look at other invalid.mlir to see how they are structured.

mlir/include/mlir/Dialect/ArmNeon/ArmNeon.td
127	sgtm, thanks!
mlir/lib/Dialect/ArmNeon/IR/ArmNeonDialect.cpp
28	with custom verifiers we also add invalid tests in an invalid.mlir that exercise each of the expected errors.

ftynse added inline comments.May 17 2021, 7:07 AM

mlir/include/mlir/Dialect/ArmNeon/ArmNeon.td
123–125
mlir/lib/Conversion/ArmNeon2dToIntr/ArmNeon2dToIntr.cpp
31	Please expand `auto` here and below. MLIR uses `auto` when the type is either obvious from context (e.g., there's a `cast` on the RHS) or prohibitively long/impossible to express.
41	Nit: consider `op.res()` instead of `op->getResult(0)` to avoid the generic API with magic numbers.
mlir/lib/Dialect/ArmNeon/IR/ArmNeonDialect.cpp
32–36	These seem to be already covered by the type constraint in ODS so no need to check them manually. If you follow Nicolas' advice from above, you'll notice which messages are never emitted.
33	Nit: `op.emitOpError()` / `op.emitError()` spare you the manual fetching of location and produce more semantically-meaningful errors.

Addressed review comments. Predicates now implemented using CPred.

addressed ftynse's review comments

one last review comment

one more whitespace fix

Benoit marked an inline comment as done.Jun 9 2021, 12:27 PM

nicolasvasilache accepted this revision.Jun 9 2021, 1:01 PM

nicolasvasilache added inline comments.

mlir/lib/Conversion/ArmNeon2dToIntr/ArmNeon2dToIntr.cpp
33	You could give the magic constant a meaningful name and expose it as a static extra method of the op. You could also use that static method in the .td and then everything gets normalized.
39	just a warning, a lot of the canonicalization work to push these ShapeCastOp all the way to address computations is missing. So we'll prob. need to iterate when you start looking at a real kernel from memory and the quality of the assembly that gets generated.

Harbormaster completed remote builds in B108472: Diff 350973.Jun 9 2021, 1:16 PM

Benoit marked an inline comment as done.Jun 9 2021, 1:49 PM

Benoit added inline comments.

mlir/lib/Conversion/ArmNeon2dToIntr/ArmNeon2dToIntr.cpp
33	Sorry, I don't know how to add such a static method or constant to a class defined in tablegen?
39	ah ok, thanks for the note.

nicolasvasilache added inline comments.Jun 9 2021, 2:24 PM

mlir/lib/Conversion/ArmNeon2dToIntr/ArmNeon2dToIntr.cpp
33	You could do something like this: https://sourcegraph.com/github.com/llvm/llvm-project/-/blob/mlir/include/mlir/Dialect/Linalg/IR/LinalgOps.td#L61 let extraClassDeclarations = [] ...

one more review comment

Benoit marked 2 inline comments as done.Jun 9 2021, 7:34 PM

Harbormaster completed remote builds in B108528: Diff 351044.Jun 9 2021, 8:12 PM

Great, let's land this!

I don't think that I have write access. I read https://llvm.org/docs/Phabricator.html#committing-a-change but it's not very clear to me. As far as I can see all I can do here is write this message asking if someone can submit for me? I don't see something else than this free-text field to input to (so is this process dependent on contributors freely browsing open reviews for ones that need submitting?)

Closed by commit rG20daedacca80: 2d Arm Neon sdot op, and lowering to the intrinsic. (authored by Benoit, committed by asaadaldien). · Explain WhyJun 10 2021, 2:36 PM

This revision was automatically updated to reflect the committed changes.

asaadaldien added a commit: rG20daedacca80: 2d Arm Neon sdot op, and lowering to the intrinsic..

Diff 345541

mlir/include/mlir/Conversion/ArmNeon2dToIntr/ArmNeon2dToIntr.h

This file was added.

				//===- ArmNeon2dToIntr.h - convert Arm Neon 2d ops to intrinsics ----------===//
				//
				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
				//
				//===----------------------------------------------------------------------===//

				#ifndef MLIR_CONVERSION_ARMNEON2DTOINTR_ARMNEON2DTOINTR_H_
				#define MLIR_CONVERSION_ARMNEON2DTOINTR_ARMNEON2DTOINTR_H_

				#include "mlir/Pass/Pass.h"

				namespace mlir {
				class FuncOp;
				template <typename T>
				class OperationPass;

				/// Populates patterns for the lowering of Arm NEON 2D ops to intrinsics.
				/// See createConvertArmNeon2dToIntrPass.
				void populateConvertArmNeon2dToIntrPatterns(RewritePatternSet &patterns);

				/// Creates a pass to lower Arm NEON 2D ops to intrinsics, i.e.
				/// equivalent ops operating on flattened 1D vectors and mapping more
				/// directly to the corresponding Arm NEON instruction.
				std::unique_ptr<OperationPass<FuncOp>> createConvertArmNeon2dToIntrPass();

				} // namespace mlir

				#endif // MLIR_CONVERSION_ARMNEON2DTOINTR_ARMNEON2DTOINTR_H_

mlir/include/mlir/Conversion/Passes.h

	//===- Passes.h - Conversion Pass Construction and Registration -----------===//			//===- Passes.h - Conversion Pass Construction and Registration -----------===//
	//			//
	// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.			// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
	// See https://llvm.org/LICENSE.txt for license information.			// See https://llvm.org/LICENSE.txt for license information.
	// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception			// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	#ifndef MLIR_CONVERSION_PASSES_H			#ifndef MLIR_CONVERSION_PASSES_H
	#define MLIR_CONVERSION_PASSES_H			#define MLIR_CONVERSION_PASSES_H

	#include "mlir/Conversion/AffineToStandard/AffineToStandard.h"			#include "mlir/Conversion/AffineToStandard/AffineToStandard.h"
				#include "mlir/Conversion/ArmNeon2dToIntr/ArmNeon2dToIntr.h"
	#include "mlir/Conversion/AsyncToLLVM/AsyncToLLVM.h"			#include "mlir/Conversion/AsyncToLLVM/AsyncToLLVM.h"
	#include "mlir/Conversion/ComplexToLLVM/ComplexToLLVM.h"			#include "mlir/Conversion/ComplexToLLVM/ComplexToLLVM.h"
	#include "mlir/Conversion/ComplexToStandard/ComplexToStandard.h"			#include "mlir/Conversion/ComplexToStandard/ComplexToStandard.h"
	#include "mlir/Conversion/GPUCommon/GPUCommonPass.h"			#include "mlir/Conversion/GPUCommon/GPUCommonPass.h"
	#include "mlir/Conversion/GPUToNVVM/GPUToNVVMPass.h"			#include "mlir/Conversion/GPUToNVVM/GPUToNVVMPass.h"
	#include "mlir/Conversion/GPUToROCDL/GPUToROCDLPass.h"			#include "mlir/Conversion/GPUToROCDL/GPUToROCDLPass.h"
	#include "mlir/Conversion/GPUToSPIRV/GPUToSPIRVPass.h"			#include "mlir/Conversion/GPUToSPIRV/GPUToSPIRVPass.h"
	#include "mlir/Conversion/GPUToVulkan/ConvertGPUToVulkanPass.h"			#include "mlir/Conversion/GPUToVulkan/ConvertGPUToVulkanPass.h"
	Show All 32 Lines

mlir/include/mlir/Conversion/Passes.td

	Show First 20 Lines • Show All 586 Lines • ▼ Show 20 Lines
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	def ConvertVectorToSPIRV : Pass<"convert-vector-to-spirv", "ModuleOp"> {			def ConvertVectorToSPIRV : Pass<"convert-vector-to-spirv", "ModuleOp"> {
	let summary = "Convert Vector dialect to SPIR-V dialect";			let summary = "Convert Vector dialect to SPIR-V dialect";
	let constructor = "mlir::createConvertVectorToSPIRVPass()";			let constructor = "mlir::createConvertVectorToSPIRVPass()";
	let dependentDialects = ["spirv::SPIRVDialect"];			let dependentDialects = ["spirv::SPIRVDialect"];
	}			}

				//===----------------------------------------------------------------------===//
				// ArmNeon2dToIntr
				//===----------------------------------------------------------------------===//

				def ConvertArmNeon2dToIntr : Pass<"arm-neon-2d-to-intr", "FuncOp"> {
				let summary = "Convert Arm NEON structured ops to intrinsics";
				let constructor = "mlir::createConvertArmNeon2dToIntrPass()";
				let dependentDialects = ["arm_neon::ArmNeonDialect", "vector::VectorDialect"];
				}


	#endif // MLIR_CONVERSION_PASSES			#endif // MLIR_CONVERSION_PASSES

mlir/include/mlir/Dialect/ArmNeon/ArmNeon.td

Show All 9 Lines

//===----------------------------------------------------------------------===//

#ifndef ARMNEON_OPS

#define ARMNEON_OPS

include "mlir/Dialect/LLVMIR/LLVMOpBase.td"

include "mlir/Interfaces/SideEffectInterfaces.td"

include "mlir/IR/OpBase.td"

//===----------------------------------------------------------------------===//

// ArmNeon dialect definition

//===----------------------------------------------------------------------===//

def ArmNeon_Dialect : Dialect {

let name = "arm_neon";

let cppNamespace = "::mlir::arm_neon";

▲ Show 20 Lines • Show All 86 Lines • ▼ Show 20 Lines

def SdotOp : ArmNeon_OverloadedOperandsWithOneResultIntrOp<"sdot",[1], [

let arguments = (ins VectorOfLengthAndType<[4, 2], [I32]>:$a,

VectorOfLengthAndType<[16, 8], [I8]>:$b,

VectorOfLengthAndType<[16, 8], [I8]>:$c);

let results = (outs VectorOfLengthAndType<[4, 2], [I32]>:$res);

let assemblyFormat =

"$a `,` $b `,` $c attr-dict `:` type($b) `,` type($c) `to` type($res)";

}

class ArmNeon_2dOp<string mnemonic,

list<OpTrait> traits = []>

: Op</*dialect=*/ArmNeon_Dialect,

/*opName=*/"2d." # mnemonic,

/*traits=*/traits>;

ftynseUnsubmitted

Done

list<OpTrait> traits = []>

: Op</*dialect=*/ArmNeon_Dialect,

- /*opName=*/"2d." # mnemonic,

- /*traits=*/traits>;

+ /*opName=*/"2d." # mnemonic,

+ /*traits=*/traits>;

def Sdot2dOp : ArmNeon_2dOp<"sdot", [

ftynse:

def Sdot2dOp : ArmNeon_2dOp<"sdot", [

nicolasvasilacheUnsubmitted

Done

So structured op has a specific meaning in a bunch of MLIR places related to Linalg and Vector.
Could we call this MamtulSomethingOp ?

nicolasvasilache: So structured op has a specific meaning in a bunch of MLIR places related to Linalg and Vector.

BenoitAuthorUnsubmitted

Done

I renamed 'Structured' to '2d' everywhere.
This is the best descriptive term that I could think of.
'Matmul' would be both overly specific to certain Neon intrinsic, and not even well descriptive of the present intrinsic: there are 2 forms of ARM SDOT instruction, only one of them is a 4x4x1 matmul, and the one that LLVM has as an intrinsic (which we are dealing with here) is the other one. It's not a 4x4x1 matmul, it's 4 completely independent dot products of separate 4d vectors.

Benoit: I renamed 'Structured' to '2d' everywhere. This is the best descriptive term that I could think…

nicolasvasilacheUnsubmitted

Done

sgtm, thanks!

nicolasvasilache: sgtm, thanks!

NoSideEffect,

AllTypesMatch<["b", "c"]>,

AllTypesMatch<["a", "res"]>,

TypesMatchWith<"res has the same number of elements as operand b",

"b", "res",

"VectorType::get({$_self.cast<VectorType>().getShape()[0]},"

"IntegerType::get($_self.getContext(), 32))">]

> {

nicolasvasilacheUnsubmitted

Done

Not sure there is tablegen support for that.
Just add it in C++ with a custom verifier ? e.g. https://github.com/llvm/llvm-project/blob/main/mlir/include/mlir/Dialect/X86Vector/X86Vector.td#L78

nicolasvasilache: Not sure there is tablegen support for that. Just add it in C++ with a custom verifier ? e.g.

let summary = "sdot op";

let description = [{

The two input vectors `b` and `c` have a 2D shape, consisting of either 2

or 4 rows, each row having length 4. This operation computes the pair-wise

dot-products of the rows of `b` and `c` and accumulates them with the

corresponding entry of `a`:

```

res[i] := a[i] + dot_product(b[i, ...], c[i, ...])

```

}];

// Supports either:

// (vector<2xi32>, vector<2x4xi8>, vector<2x4xi8>) -> vector<2xi32>

// (vector<4xi32>, vector<4x4xi8>, vector<4x4xi8>) -> vector<4xi32>

// TODO: how do we express 2D shape requirements here?

let arguments = (ins VectorOfLengthAndType<[4, 2], [I32]>:$a,

VectorOfLengthAndType<[16, 8], [I8]>:$b,

VectorOfLengthAndType<[16, 8], [I8]>:$c);

let results = (outs VectorOfLengthAndType<[4, 2], [I32]>:$res);

let assemblyFormat =

"$a `,` $b `,` $c attr-dict `:` type($b) `,` type($c) `to` type($res)";

let verifier = [{ return ::verify(*this); }];

}

#endif // ARMNEON_OPS

mlir/lib/Conversion/ArmNeon2dToIntr/ArmNeon2dToIntr.cpp

This file was added.

				//===- ArmNeon2dToIntr.cpp - convert Arm Neon 2d ops to intrinsics --------===//
				//
				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
				//
				//===----------------------------------------------------------------------===//

				#include "mlir/Conversion/ArmNeon2dToIntr/ArmNeon2dToIntr.h"
				#include "../PassDetail.h"
				#include "mlir/Dialect/ArmNeon/ArmNeonDialect.h"
				#include "mlir/Dialect/Vector/VectorOps.h"
				#include "mlir/IR/PatternMatch.h"
				#include "mlir/Pass/Pass.h"
				#include "mlir/Pass/PassRegistry.h"
				#include "mlir/Transforms/GreedyPatternRewriteDriver.h"

				using namespace mlir;
				using namespace mlir::arm_neon;

				namespace {

				class Sdot2dLoweringPattern : public OpRewritePattern<Sdot2dOp> {
				public:
				using OpRewritePattern::OpRewritePattern;

				/// Convert to 1-dimensional vector type to match the requirements of
				/// arm.neon.intr.sdot
				LogicalResult matchAndRewrite(Sdot2dOp op,
				PatternRewriter &rewriter) const override {
				auto elemType = op.b().getType().cast<VectorType>().getElementType();
				ftynseUnsubmitted Done Reply Inline Actions Please expand `auto` here and below. MLIR uses `auto` when the type is either obvious from context (e.g., there's a `cast` on the RHS) or prohibitively long/impossible to express. ftynse: Please expand `auto` here and below. MLIR uses `auto` when the type is either obvious from…
				int length = op.b().getType().cast<VectorType>().getShape()[0] * 4;
				auto flattenedVectorType = VectorType::get({length}, elemType);
				nicolasvasilacheUnsubmitted Done Reply Inline Actions You could give the magic constant a meaningful name and expose it as a static extra method of the op. You could also use that static method in the .td and then everything gets normalized. nicolasvasilache: You could give the magic constant a meaningful name and expose it as a static extra method of…
				BenoitAuthorUnsubmitted Done Reply Inline Actions Sorry, I don't know how to add such a static method or constant to a class defined in tablegen? Benoit: Sorry, I don't know how to add such a static method or constant to a class defined in tablegen?
				nicolasvasilacheUnsubmitted Done Reply Inline Actions You could do something like this: https://sourcegraph.com/github.com/llvm/llvm-project/-/blob/mlir/include/mlir/Dialect/Linalg/IR/LinalgOps.td#L61 let extraClassDeclarations = [] ... nicolasvasilache: You could do something like this: https://sourcegraph.com/github.com/llvm/llvm-project/…
				auto b2d = op.b();
				auto c2d = op.c();
				auto loc = op.getLoc();
				auto b1d =
				rewriter.create<vector::ShapeCastOp>(loc, flattenedVectorType, b2d);
				auto c1d =
				nicolasvasilacheUnsubmitted Done Reply Inline Actions just a warning, a lot of the canonicalization work to push these ShapeCastOp all the way to address computations is missing. So we'll prob. need to iterate when you start looking at a real kernel from memory and the quality of the assembly that gets generated. nicolasvasilache: just a warning, a lot of the canonicalization work to push these ShapeCastOp all the way to…
				BenoitAuthorUnsubmitted Done Reply Inline Actions ah ok, thanks for the note. Benoit: ah ok, thanks for the note.
				rewriter.create<vector::ShapeCastOp>(loc, flattenedVectorType, c2d);
				Value newOp = rewriter.create<SdotOp>(loc, op->getResult(0).getType(),
				ftynseUnsubmitted Done Reply Inline Actions Nit: consider `op.res()` instead of `op->getResult(0)` to avoid the generic API with magic numbers. ftynse: Nit: consider `op.res()` instead of `op->getResult(0)` to avoid the generic API with magic…
				op.a(), b1d, c1d);
				rewriter.replaceOp(op, {newOp});
				return success();
				}
				};

				class ConvertArmNeon2dToIntr
				: public ConvertArmNeon2dToIntrBase<ConvertArmNeon2dToIntr> {
				void runOnOperation() override {
				auto func = getOperation();
				auto *context = &getContext();

				RewritePatternSet patterns(context);
				populateConvertArmNeon2dToIntrPatterns(patterns);

				if (failed(applyPatternsAndFoldGreedily(func, std::move(patterns))))
				return signalPassFailure();
				}
				};

				} // namespace

				namespace mlir {

				void populateConvertArmNeon2dToIntrPatterns(RewritePatternSet &patterns) {
				patterns.add<Sdot2dLoweringPattern>(patterns.getContext());
				}

				std::unique_ptr<OperationPass<FuncOp>> createConvertArmNeon2dToIntrPass() {
				return std::make_unique<ConvertArmNeon2dToIntr>();
				}

				} // namespace mlir

mlir/lib/Conversion/ArmNeon2dToIntr/CMakeLists.txt

This file was added.

				add_mlir_conversion_library(MLIRArmNeon2dToIntr
				ArmNeon2dToIntr.cpp

				ADDITIONAL_HEADER_DIRS
				${MLIR_MAIN_INCLUDE_DIR}/mlir/Conversion/ArmNeon2dToIntr

				DEPENDS
				MLIRConversionPassIncGen

				LINK_COMPONENTS
				Core

				LINK_LIBS PUBLIC
				MLIRArmNeon
				MLIRPass
				MLIRTransforms
				MLIRIR
				)

mlir/lib/Conversion/CMakeLists.txt

	add_subdirectory(AffineToStandard)			add_subdirectory(AffineToStandard)
				add_subdirectory(ArmNeon2dToIntr)
	add_subdirectory(AsyncToLLVM)			add_subdirectory(AsyncToLLVM)
	add_subdirectory(ComplexToLLVM)			add_subdirectory(ComplexToLLVM)
	add_subdirectory(ComplexToStandard)			add_subdirectory(ComplexToStandard)
	add_subdirectory(GPUCommon)			add_subdirectory(GPUCommon)
	add_subdirectory(GPUToNVVM)			add_subdirectory(GPUToNVVM)
	add_subdirectory(GPUToROCDL)			add_subdirectory(GPUToROCDL)
	add_subdirectory(GPUToSPIRV)			add_subdirectory(GPUToSPIRV)
	add_subdirectory(GPUToVulkan)			add_subdirectory(GPUToVulkan)
	Show All 22 Lines

mlir/lib/Conversion/PassDetail.h

	Show First 20 Lines • Show All 70 Lines • ▼ Show 20 Lines
	namespace tosa {			namespace tosa {
	class TosaDialect;			class TosaDialect;
	} // end namespace tosa			} // end namespace tosa

	namespace vector {			namespace vector {
	class VectorDialect;			class VectorDialect;
	} // end namespace vector			} // end namespace vector

				namespace arm_neon {
				class ArmNeonDialect;
				} // end namespace arm_neon

	#define GEN_PASS_CLASSES			#define GEN_PASS_CLASSES
	#include "mlir/Conversion/Passes.h.inc"			#include "mlir/Conversion/Passes.h.inc"

	} // end namespace mlir			} // end namespace mlir

	#endif // CONVERSION_PASSDETAIL_H_			#endif // CONVERSION_PASSDETAIL_H_

mlir/lib/Dialect/ArmNeon/IR/ArmNeonDialect.cpp

	Show All 19 Lines

	void arm_neon::ArmNeonDialect::initialize() {			void arm_neon::ArmNeonDialect::initialize() {
	addOperations<			addOperations<
	#define GET_OP_LIST			#define GET_OP_LIST
	#include "mlir/Dialect/ArmNeon/ArmNeon.cpp.inc"			#include "mlir/Dialect/ArmNeon/ArmNeon.cpp.inc"
	>();			>();
	}			}

				static LogicalResult verify(arm_neon::Sdot2dOp op) {
				nicolasvasilacheUnsubmitted Not Done Reply Inline Actions with custom verifiers we also add invalid tests in an invalid.mlir that exercise each of the expected errors. nicolasvasilache: with custom verifiers we also add invalid tests in an invalid.mlir that exercise each of the…
				auto shapeA = op.a().getType().cast<VectorType>().getShape();
				auto shapeB = op.b().getType().cast<VectorType>().getShape();

				if (shapeA.size() != 1)
				return emitError(op.getLoc(), "Operand a should be a 1-dimensional vector");
				ftynseUnsubmitted Not Done Reply Inline Actions Nit: `op.emitOpError()` / `op.emitError()` spare you the manual fetching of location and produce more semantically-meaningful errors. ftynse: Nit: `op.emitOpError()` / `op.emitError()` spare you the manual fetching of location and…

				if (shapeA[0] != 2 && shapeA[0] != 4)
				return emitError(op.getLoc(), "Operand a should have length 2 or 4");
				ftynseUnsubmitted Not Done Reply Inline Actions These seem to be already covered by the type constraint in ODS so no need to check them manually. If you follow Nicolas' advice from above, you'll notice which messages are never emitted. ftynse: These seem to be already covered by the type constraint in ODS so no need to check them…

				if (shapeB.size() != 2)
				return emitError(op.getLoc(), "Operand b should be a 2-dimensional vector");

				if (shapeB[1] != 4)
				return emitError(
				op.getLoc(),
				"The inner size of the 2-dimensional operand b should be 4");

				if (shapeB[0] != shapeA[0])
				return emitError(op.getLoc(),
				"The outer size of the 2-dimensional operand b should "
				"equal the size of 1-dimensional operand a");

				return success();
				}

	#define GET_OP_CLASSES			#define GET_OP_CLASSES
	#include "mlir/Dialect/ArmNeon/ArmNeon.cpp.inc"			#include "mlir/Dialect/ArmNeon/ArmNeon.cpp.inc"

mlir/test/Target/LLVMIR/arm-neon-2d.mlir

This file was added.

				// RUN: mlir-opt -arm-neon-2d-to-intr %s \| FileCheck %s

				// CHECK-LABEL: arm_neon_sdot2d_4x4_i8i8
				func @arm_neon_sdot2d_4x4_i8i8(%a: vector<4xi32>, %b: vector<4x4xi8>, %c: vector<4x4xi8>) -> vector<4xi32> {
				// CHECK: arm_neon.intr.sdot %{{.}}, %{{.}}, %{{.*}} : vector<16xi8>, vector<16xi8> to vector<4xi32>
				// CHECK-NEXT: return %{{.*}} : vector<4xi32>
				%0 = arm_neon.2d.sdot %a, %b, %c : vector<4x4xi8>, vector<4x4xi8> to vector<4xi32>
				return %0 : vector<4xi32>
				}

				// CHECK-LABEL: arm_neon_sdot2d_2x4_i8i8
				func @arm_neon_sdot2d_2x4_i8i8(%a: vector<2xi32>, %b: vector<2x4xi8>, %c: vector<2x4xi8>) -> vector<2xi32> {
				// CHECK: arm_neon.intr.sdot %{{.}}, %{{.}}, %{{.*}} : vector<8xi8>, vector<8xi8> to vector<2xi32>
				// CHECK-NEXT: return %{{.*}} : vector<2xi32>
				%0 = arm_neon.2d.sdot %a, %b, %c : vector<2x4xi8>, vector<2x4xi8> to vector<2xi32>
				return %0 : vector<2xi32>
				}

This is an archive of the discontinued LLVM Phabricator instance.

2d Arm Neon sdot op, and lowering to the intrinsic.
ClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 345541

mlir/include/mlir/Conversion/ArmNeon2dToIntr/ArmNeon2dToIntr.h

mlir/include/mlir/Conversion/Passes.h

mlir/include/mlir/Conversion/Passes.td

mlir/include/mlir/Dialect/ArmNeon/ArmNeon.td

mlir/lib/Conversion/ArmNeon2dToIntr/ArmNeon2dToIntr.cpp

mlir/lib/Conversion/ArmNeon2dToIntr/CMakeLists.txt

mlir/lib/Conversion/CMakeLists.txt

mlir/lib/Conversion/PassDetail.h

mlir/lib/Dialect/ArmNeon/IR/ArmNeonDialect.cpp

mlir/test/Target/LLVMIR/arm-neon-2d.mlir

This is an archive of the discontinued LLVM Phabricator instance.

2d Arm Neon sdot op, and lowering to the intrinsic.ClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 345541

mlir/include/mlir/Conversion/ArmNeon2dToIntr/ArmNeon2dToIntr.h

mlir/include/mlir/Conversion/Passes.h

mlir/include/mlir/Conversion/Passes.td

mlir/include/mlir/Dialect/ArmNeon/ArmNeon.td

mlir/lib/Conversion/ArmNeon2dToIntr/ArmNeon2dToIntr.cpp

mlir/lib/Conversion/ArmNeon2dToIntr/CMakeLists.txt

mlir/lib/Conversion/CMakeLists.txt

mlir/lib/Conversion/PassDetail.h

mlir/lib/Dialect/ArmNeon/IR/ArmNeonDialect.cpp

mlir/test/Target/LLVMIR/arm-neon-2d.mlir

2d Arm Neon sdot op, and lowering to the intrinsic.
ClosedPublic