This is an archive of the discontinued LLVM Phabricator instance.

[mlir][Vector]Lower vector.contract to llvm.intr.matrix_multiply
ClosedPublic

Authored by nicolasvasilache on Mar 11 2020, 11:15 AM.

Download Raw Diff

Details

Reviewers

aartbik
fhahn

Commits

rGbbf3ef854116: [mlir][Vector]Lower vector.contract to llvm.intr.matrix_multiply

Summary

This revision adds lowering of vector.contract to llvm.intr.matrix_multiply.
Note that there is currently a mismatch between the MLIR vector dialect which expects row-major layout and the LLVM matrix intrinsics which expect column major layout.

As a consequence, we currently only match a vector.contract with indexing maps
that express column-major matrix multiplication.
Other cases would require additional transposes and it is better to wait for
LLVM intrinsics to provide a per-operation attribute that would specify which layout is expected.

A separate integration test, not submitted to MLIR core, has independently verified that correct execution occurs on a 2x2x2 matrix multiplication.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

nicolasvasilache created this revision.Mar 11 2020, 11:15 AM

Herald added a project: Restricted Project. · View Herald TranscriptMar 11 2020, 11:16 AM

Herald added subscribers: llvm-commits, Joonsoo, liufengdb and 9 others. · View Herald Transcript

Harbormaster completed remote builds in B48849: Diff 249693.Mar 11 2020, 12:29 PM

aartbik added inline comments.Mar 11 2020, 12:58 PM

mlir/include/mlir/Dialect/Utils/StructuredOpsUtils.h
42	note that we also have two "transposed" versions, ie. with either {m,n} or {n.m} but then the position of the k's swapped
mlir/lib/Dialect/VectorOps/VectorTransforms.cpp
970	can this test every fail? if so, shouldn't we move it a bit up to avoid doing some work first?
mlir/test/Dialect/VectorOps/vector-contract-transforms.mlir
1–2	note that in the pending CL, I have renamed this flag and file, since it started to become less and less about contract only :-) One of us will have to rebase and merge

rriddle added inline comments.Mar 11 2020, 1:13 PM

mlir/lib/Dialect/VectorOps/VectorTransforms.cpp
46	Can we please avoid global cl opts?
971	This check is duplicated, are you trying to check elementType.isa<IntegerType>()

nicolasvasilache marked 7 inline comments as done.Mar 12 2020, 7:31 AM

nicolasvasilache added inline comments.

mlir/include/mlir/Dialect/Utils/StructuredOpsUtils.h
42	I didn't want to invest too much in column major patterns in this PR, can we keep as a followup to add more patterns on a per need basis? We can also do much more with transposes once we have them in MLIR + retargeting LLVM intrinsics.
mlir/test/Dialect/VectorOps/vector-contract-transforms.mlir
1–2	Ack

Address review comments.

Thanks for the reviews!

Harbormaster completed remote builds in B48987: Diff 249932.Mar 12 2020, 9:46 AM

aartbik accepted this revision.Mar 12 2020, 1:42 PM

This revision is now accepted and ready to land.Mar 12 2020, 1:42 PM

Closed by commit rGbbf3ef854116: [mlir][Vector]Lower vector.contract to llvm.intr.matrix_multiply (authored by nicolasvasilache). · Explain WhyMar 13 2020, 2:01 PM

This revision was automatically updated to reflect the committed changes.

Revision Contents

Path

Size

mlir/

include/

mlir/

Dialect/

Utils/

StructuredOpsUtils.h

22 lines

lib/

Conversion/

VectorToLLVM/

ConvertVectorToLLVM.cpp

1 line

Dialect/

Linalg/

Transforms/

LinalgTransforms.cpp

16 lines

VectorOps/

VectorTransforms.cpp

40 lines

test/

Dialect/

VectorOps/

vector-contract-transforms.mlir

45 lines

Diff 250298

mlir/include/mlir/Dialect/Utils/StructuredOpsUtils.h

	Show All 11 Lines
	// information about their semantics (e.g. type of iterators like parallel,			// information about their semantics (e.g. type of iterators like parallel,
	// reduction, etc..) as attributes.			// reduction, etc..) as attributes.
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	#ifndef MLIR_DIALECT_UTILS_STRUCTUREDOPSUTILS_H			#ifndef MLIR_DIALECT_UTILS_STRUCTUREDOPSUTILS_H
	#define MLIR_DIALECT_UTILS_STRUCTUREDOPSUTILS_H			#define MLIR_DIALECT_UTILS_STRUCTUREDOPSUTILS_H

				#include "mlir/IR/AffineMap.h"
	#include "mlir/IR/Attributes.h"			#include "mlir/IR/Attributes.h"
	#include "mlir/Support/LLVM.h"			#include "mlir/Support/LLVM.h"
	#include "llvm/ADT/StringRef.h"			#include "llvm/ADT/StringRef.h"

	namespace mlir {			namespace mlir {

				inline bool isRowMajorMatmul(ArrayAttr indexingMaps) {
				AffineExpr m, n, k;
				bindDims(indexingMaps.getContext(), m, n, k);
				auto mapA = AffineMapAttr::get(AffineMap::get(3, 0, {m, k}));
				auto mapB = AffineMapAttr::get(AffineMap::get(3, 0, {k, n}));
				auto mapC = AffineMapAttr::get(AffineMap::get(3, 0, {m, n}));
				auto maps = ArrayAttr::get({mapA, mapB, mapC}, indexingMaps.getContext());
				return indexingMaps == maps;
				}

				inline bool isColumnMajorMatmul(ArrayAttr indexingMaps) {
				AffineExpr m, n, k;
				bindDims(indexingMaps.getContext(), m, n, k);
				auto mapA = AffineMapAttr::get(AffineMap::get(3, 0, {k, n}));
				auto mapB = AffineMapAttr::get(AffineMap::get(3, 0, {m, k}));
				auto mapC = AffineMapAttr::get(AffineMap::get(3, 0, {n, m}));
				aartbikUnsubmitted Done Reply Inline Actions note that we also have two "transposed" versions, ie. with either {m,n} or {n.m} but then the position of the k's swapped aartbik: note that we also have two "transposed" versions, ie. with either {m,n} or {n.m} but then the…
				nicolasvasilacheAuthorUnsubmitted Done Reply Inline Actions I didn't want to invest too much in column major patterns in this PR, can we keep as a followup to add more patterns on a per need basis? We can also do much more with transposes once we have them in MLIR + retargeting LLVM intrinsics. nicolasvasilache: I didn't want to invest too much in column major patterns in this PR, can we keep as a followup…
				auto maps = ArrayAttr::get({mapA, mapB, mapC}, indexingMaps.getContext());
				return indexingMaps == maps;
				}

	/// Attribute name for the AffineArrayAttr which encodes the relationship			/// Attribute name for the AffineArrayAttr which encodes the relationship
	/// between a structured op iterators' and its operands.			/// between a structured op iterators' and its operands.
	constexpr StringRef getIndexingMapsAttrName() { return "indexing_maps"; }			constexpr StringRef getIndexingMapsAttrName() { return "indexing_maps"; }

	/// Attribute name for the StrArrayAttr which encodes the type of a structured			/// Attribute name for the StrArrayAttr which encodes the type of a structured
	/// op's iterators.			/// op's iterators.
	constexpr StringRef getIteratorTypesAttrName() { return "iterator_types"; }			constexpr StringRef getIteratorTypesAttrName() { return "iterator_types"; }

	▲ Show 20 Lines • Show All 70 Lines • Show Last 20 Lines

mlir/lib/Conversion/VectorToLLVM/ConvertVectorToLLVM.cpp

Show First 20 Lines • Show All 1,132 Lines • ▼ Show 20 Lines	// all contraction operations. Also applies folding and DCE.
applyPatternsGreedily(getModule(), patterns);		applyPatternsGreedily(getModule(), patterns);
}		}

// Convert to the LLVM IR dialect.		// Convert to the LLVM IR dialect.
LLVMTypeConverter converter(&getContext());		LLVMTypeConverter converter(&getContext());
OwningRewritePatternList patterns;		OwningRewritePatternList patterns;
populateVectorToLLVMMatrixConversionPatterns(converter, patterns);		populateVectorToLLVMMatrixConversionPatterns(converter, patterns);
populateVectorToLLVMConversionPatterns(converter, patterns);		populateVectorToLLVMConversionPatterns(converter, patterns);
		populateVectorToLLVMMatrixConversionPatterns(converter, patterns);
populateStdToLLVMConversionPatterns(converter, patterns);		populateStdToLLVMConversionPatterns(converter, patterns);

LLVMConversionTarget target(getContext());		LLVMConversionTarget target(getContext());
target.addDynamicallyLegalOp<FuncOp>(		target.addDynamicallyLegalOp<FuncOp>(
[&](FuncOp op) { return converter.isSignatureLegal(op.getType()); });		[&](FuncOp op) { return converter.isSignatureLegal(op.getType()); });
if (failed(		if (failed(
applyPartialConversion(getModule(), target, patterns, &converter))) {		applyPartialConversion(getModule(), target, patterns, &converter))) {
signalPassFailure();		signalPassFailure();
Show All 10 Lines

mlir/lib/Dialect/Linalg/Transforms/LinalgTransforms.cpp

Show All 9 Lines
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#include "mlir/Dialect/Linalg/Transforms/LinalgTransforms.h"		#include "mlir/Dialect/Linalg/Transforms/LinalgTransforms.h"
#include "mlir/Dialect/Linalg/Analysis/DependenceAnalysis.h"		#include "mlir/Dialect/Linalg/Analysis/DependenceAnalysis.h"
#include "mlir/Dialect/Linalg/IR/LinalgOps.h"		#include "mlir/Dialect/Linalg/IR/LinalgOps.h"
#include "mlir/Dialect/Linalg/Utils/Utils.h"		#include "mlir/Dialect/Linalg/Utils/Utils.h"
#include "mlir/Dialect/StandardOps/EDSC/Intrinsics.h"		#include "mlir/Dialect/StandardOps/EDSC/Intrinsics.h"
		#include "mlir/Dialect/Utils/StructuredOpsUtils.h"
#include "mlir/Dialect/VectorOps/VectorOps.h"		#include "mlir/Dialect/VectorOps/VectorOps.h"
#include "mlir/IR/AffineExpr.h"		#include "mlir/IR/AffineExpr.h"
#include "mlir/IR/Matchers.h"		#include "mlir/IR/Matchers.h"
#include "mlir/IR/PatternMatch.h"		#include "mlir/IR/PatternMatch.h"
#include "mlir/Pass/Pass.h"		#include "mlir/Pass/Pass.h"
#include "mlir/Support/LLVM.h"		#include "mlir/Support/LLVM.h"
#include "llvm/Support/Debug.h"		#include "llvm/Support/Debug.h"
#include "llvm/Support/raw_ostream.h"		#include "llvm/Support/raw_ostream.h"
▲ Show 20 Lines • Show All 113 Lines • ▼ Show 20 Lines	static bool hasMultiplyAddBody(linalg::GenericOp op) {
auto pattern3 = m_Op<YieldOp>(m_Op<AddFOp>(m_Op<MulFOp>(b, a), c));		auto pattern3 = m_Op<YieldOp>(m_Op<AddFOp>(m_Op<MulFOp>(b, a), c));
auto pattern4 = m_Op<YieldOp>(m_Op<AddFOp>(c, m_Op<MulFOp>(b, a)));		auto pattern4 = m_Op<YieldOp>(m_Op<AddFOp>(c, m_Op<MulFOp>(b, a)));
return pattern1.match(&ops.back()) \|\| pattern2.match(&ops.back()) \|\|		return pattern1.match(&ops.back()) \|\| pattern2.match(&ops.back()) \|\|
pattern3.match(&ops.back()) \|\| pattern4.match(&ops.back());		pattern3.match(&ops.back()) \|\| pattern4.match(&ops.back());
}		}

// TODO(ntv) should be Tablegen'd from a single source that generates the op		// TODO(ntv) should be Tablegen'd from a single source that generates the op
// itself.		// itself.
static bool isMatmul(linalg::GenericOp genericOp) {		static bool isRowMajorMatmul(linalg::GenericOp genericOp) {
auto *ctx = genericOp.getContext();
auto m = getAffineDimExpr(0, ctx);
auto n = getAffineDimExpr(1, ctx);
auto k = getAffineDimExpr(2, ctx);
auto mapA = AffineMapAttr::get(AffineMap::get(3, 0, {m, k}));
auto mapB = AffineMapAttr::get(AffineMap::get(3, 0, {k, n}));
auto mapC = AffineMapAttr::get(AffineMap::get(3, 0, {m, n}));
auto maps = ArrayAttr::get({mapA, mapB, mapC}, ctx);
return genericOp.getNumInputs() == 2 && genericOp.getNumOutputs() == 1 &&		return genericOp.getNumInputs() == 2 && genericOp.getNumOutputs() == 1 &&
genericOp.indexing_maps() == maps && hasMultiplyAddBody(genericOp);		isRowMajorMatmul(genericOp.indexing_maps()) &&
		hasMultiplyAddBody(genericOp);
}		}

// TODO(ntv, ataei): This is in fact much more general than just vectorization		// TODO(ntv, ataei): This is in fact much more general than just vectorization
// for matmul and fill ops.		// for matmul and fill ops.
LogicalResult mlir::linalg::vectorizeLinalgOpPrecondition(Operation *op) {		LogicalResult mlir::linalg::vectorizeLinalgOpPrecondition(Operation *op) {
auto linalgOp = cast<linalg::LinalgOp>(op);		auto linalgOp = cast<linalg::LinalgOp>(op);
// All types must be static shape to go to vector.		// All types must be static shape to go to vector.
for (Value operand : linalgOp.getInputsAndOutputBuffers())		for (Value operand : linalgOp.getInputsAndOutputBuffers())
if (!operand.getType().cast<ShapedType>().hasStaticShape())		if (!operand.getType().cast<ShapedType>().hasStaticShape())
return failure();		return failure();
for (Type outputTensorType : linalgOp.getOutputTensorTypes())		for (Type outputTensorType : linalgOp.getOutputTensorTypes())
if (!outputTensorType.cast<ShapedType>().hasStaticShape())		if (!outputTensorType.cast<ShapedType>().hasStaticShape())
return failure();		return failure();
if (isa<linalg::MatmulOp>(op) \|\| isa<linalg::FillOp>(op))		if (isa<linalg::MatmulOp>(op) \|\| isa<linalg::FillOp>(op))
return success();		return success();

auto genericOp = dyn_cast<linalg::GenericOp>(op);		auto genericOp = dyn_cast<linalg::GenericOp>(op);
if (!genericOp \|\| !isMatmul(genericOp))		if (!genericOp \|\| !::isRowMajorMatmul(genericOp))
return failure();		return failure();

// TODO(ntv): non-identity layout.		// TODO(ntv): non-identity layout.
auto isStaticMemRefWithIdentityLayout = [](Value v) {		auto isStaticMemRefWithIdentityLayout = [](Value v) {
auto m = v.getType().dyn_cast<MemRefType>();		auto m = v.getType().dyn_cast<MemRefType>();
if (!m \|\| !m.hasStaticShape() \|\| !m.getAffineMaps().empty())		if (!m \|\| !m.hasStaticShape() \|\| !m.getAffineMaps().empty())
return false;		return false;
return true;		return true;
▲ Show 20 Lines • Show All 133 Lines • Show Last 20 Lines

mlir/lib/Dialect/VectorOps/VectorTransforms.cpp

Show All 36 Lines
#include "llvm/Support/raw_ostream.h"		#include "llvm/Support/raw_ostream.h"

#define DEBUG_TYPE "vector-to-vector"		#define DEBUG_TYPE "vector-to-vector"

using namespace mlir;		using namespace mlir;
using llvm::dbgs;		using llvm::dbgs;
using mlir::functional::zipMap;		using mlir::functional::zipMap;

		static llvm::cl::OptionCategory clOptionsCategory(DEBUG_TYPE " options");

		rriddleUnsubmitted Done Reply Inline Actions Can we please avoid global cl opts? rriddle: Can we please avoid global cl opts?
		static llvm::cl::opt<bool> lowerToLLVMMatrixIntrinsics(
		"vector-lower-matrix-intrinsics",
		llvm::cl::desc("Lower vector.contract to llvm.intr.matrix.multiply"),
		llvm::cl::init(false), llvm::cl::cat(clOptionsCategory));

/// Given a shape with sizes greater than 0 along all dimensions,		/// Given a shape with sizes greater than 0 along all dimensions,
/// returns the distance, in number of elements, between a slice in a dimension		/// returns the distance, in number of elements, between a slice in a dimension
/// and the next slice in the same dimension.		/// and the next slice in the same dimension.
/// e.g. shape[3, 4, 5] -> linearization_basis[20, 5, 1]		/// e.g. shape[3, 4, 5] -> linearization_basis[20, 5, 1]
static SmallVector<int64_t, 8> computeStrides(ArrayRef<int64_t> shape) {		static SmallVector<int64_t, 8> computeStrides(ArrayRef<int64_t> shape) {
if (shape.empty())		if (shape.empty())
return {};		return {};
SmallVector<int64_t, 8> tmp;		SmallVector<int64_t, 8> tmp;
▲ Show 20 Lines • Show All 877 Lines • ▼ Show 20 Lines	public:
using OpRewritePattern<vector::ContractionOp>::OpRewritePattern;		using OpRewritePattern<vector::ContractionOp>::OpRewritePattern;

PatternMatchResult matchAndRewrite(vector::ContractionOp op,		PatternMatchResult matchAndRewrite(vector::ContractionOp op,
PatternRewriter &rewriter) const override {		PatternRewriter &rewriter) const override {
// TODO(ajcbik): implement masks		// TODO(ajcbik): implement masks
if (llvm::size(op.masks()) != 0)		if (llvm::size(op.masks()) != 0)
return matchFailure();		return matchFailure();

		// TODO(ntv, ajcbik): implement benefits, cost models, separate this out in
		// a new pattern.
		// TODO(ntv, fhahn): once row-major mode is available in LLVM's matrix
		// intrinsics, use that.
		if (lowerToLLVMMatrixIntrinsics &&
		isColumnMajorMatmul(op.indexing_maps())) {
		VectorType lhsType = op.getLhsType();
		VectorType rhsType = op.getRhsType();
		Type flattenedLHSType =
		VectorType::get(lhsType.getNumElements(), lhsType.getElementType());
		Type flattenedRHSType =
		VectorType::get(rhsType.getNumElements(), rhsType.getElementType());
		auto lhs = rewriter.create<vector::ShapeCastOp>(
		op.getLoc(), flattenedLHSType, op.lhs());
		auto rhs = rewriter.create<vector::ShapeCastOp>(
		op.getLoc(), flattenedRHSType, op.rhs());

		unsigned lhsRows = op.getLhsType().getShape()[0];
		unsigned lhsColumns = op.getLhsType().getShape()[1];
		unsigned rhsColumns = op.getRhsType().getShape()[1];
		Value mul = rewriter.create<vector::MatmulOp>(
		op.getLoc(), lhs, rhs, lhsRows, lhsColumns, rhsColumns);
		mul = rewriter.create<vector::ShapeCastOp>(op.getLoc(),
		op.acc().getType(), mul);
		Type elementType = op.getLhsType().getElementType();
		assert(elementType.isIntOrFloat());
		aartbikUnsubmitted Done Reply Inline Actions can this test every fail? if so, shouldn't we move it a bit up to avoid doing some work first? aartbik: can this test every fail? if so, shouldn't we move it a bit up to avoid doing some work first?
		if (elementType.isa<IntegerType>())
		rriddleUnsubmitted Done Reply Inline Actions This check is duplicated, are you trying to check elementType.isa<IntegerType>() rriddle: This check is duplicated, are you trying to check elementType.isa<IntegerType>()
		rewriter.replaceOpWithNewOp<AddIOp>(op, op.acc(), mul);
		else
		rewriter.replaceOpWithNewOp<AddFOp>(op, op.acc(), mul);
		return matchSuccess();
		}

// Find first batch dimension in LHS/RHS, and lower when found.		// Find first batch dimension in LHS/RHS, and lower when found.
std::vector<std::pair<int64_t, int64_t>> batchDimMap = op.getBatchDimMap();		std::vector<std::pair<int64_t, int64_t>> batchDimMap = op.getBatchDimMap();
if (!batchDimMap.empty()) {		if (!batchDimMap.empty()) {
int64_t lhsIndex = batchDimMap[0].first;		int64_t lhsIndex = batchDimMap[0].first;
int64_t rhsIndex = batchDimMap[0].second;		int64_t rhsIndex = batchDimMap[0].second;
rewriter.replaceOp(op, lowerParallel(op, lhsIndex, rhsIndex, rewriter));		rewriter.replaceOp(op, lowerParallel(op, lhsIndex, rhsIndex, rewriter));
return matchSuccess();		return matchSuccess();
}		}
▲ Show 20 Lines • Show All 364 Lines • Show Last 20 Lines

mlir/test/Dialect/VectorOps/vector-contract-transforms.mlir

// RUN: mlir-opt %s -test-vector-contraction-conversion \| FileCheck %s		// RUN: mlir-opt %s -test-vector-contraction-conversion \| FileCheck %s
		// RUN: mlir-opt %s -test-vector-contraction-conversion -vector-lower-matrix-intrinsics \| FileCheck %s --check-prefix=MATRIX
		aartbikUnsubmitted Done Reply Inline Actions note that in the pending CL, I have renamed this flag and file, since it started to become less and less about contract only :-) One of us will have to rebase and merge aartbik: note that in the pending CL, I have renamed this flag and file, since it started to become…
		nicolasvasilacheAuthorUnsubmitted Done Reply Inline Actions Ack nicolasvasilache: Ack

#dotp_accesses = [		#dotp_accesses = [
affine_map<(i) -> (i)>,		affine_map<(i) -> (i)>,
affine_map<(i) -> (i)>,		affine_map<(i) -> (i)>,
affine_map<(i) -> ()>		affine_map<(i) -> ()>
]		]
#dotp_trait = {		#dotp_trait = {
indexing_maps = #dotp_accesses,		indexing_maps = #dotp_accesses,
▲ Show 20 Lines • Show All 318 Lines • ▼ Show 20 Lines	func @shape_casts(%a: vector<2x2xf32>) -> (vector<4xf32>, vector<2x2xf32>) {
//		//
// CHECK: %[[res1:.*]] = vector.insert %[[s2]], %[[res0]] [1] :		// CHECK: %[[res1:.*]] = vector.insert %[[s2]], %[[res0]] [1] :
// CHECK-SAME: vector<2xf32> into vector<2x2xf32>		// CHECK-SAME: vector<2xf32> into vector<2x2xf32>
//		//
%1 = vector.shape_cast %r0 : vector<4xf32> to vector<2x2xf32>		%1 = vector.shape_cast %r0 : vector<4xf32> to vector<2x2xf32>
// CHECK: return %[[add]], %[[res1]] : vector<4xf32>, vector<2x2xf32>		// CHECK: return %[[add]], %[[res1]] : vector<4xf32>, vector<2x2xf32>
return %r0, %1 : vector<4xf32>, vector<2x2xf32>		return %r0, %1 : vector<4xf32>, vector<2x2xf32>
}		}

		// MATRIX-LABEL: func @column_major_matmul
		// MATRIX-SAME: %[[A:[a-zA-Z0-9]*]]: vector<4x3xf32>,
		// MATRIX-SAME: %[[B:[a-zA-Z0-9]*]]: vector<2x4xf32>,
		// MATRIX-SAME: %[[C:[a-zA-Z0-9]*]]: vector<3x2xf32>
		// MATRIX: %[[vcst:.*]] = constant dense<0.000000e+00> : vector<12xf32>
		// MATRIX: %[[vcst_0:.*]] = constant dense<0.000000e+00> : vector<8xf32>
		// MATRIX: %[[vcst_1:.*]] = constant dense<0.000000e+00> : vector<3x2xf32>
		// MATRIX: %[[a0:.*]] = vector.extract %[[A]][0] : vector<4x3xf32>
		// MATRIX: %[[a1:.*]] = vector.insert_strided_slice %[[a0]], %[[vcst]] {offsets = [0], strides = [1]} : vector<3xf32> into vector<12xf32>
		// MATRIX: %[[a2:.*]] = vector.extract %[[A]][1] : vector<4x3xf32>
		// MATRIX: %[[a3:.*]] = vector.insert_strided_slice %[[a2]], %[[a1]] {offsets = [3], strides = [1]} : vector<3xf32> into vector<12xf32>
		// MATRIX: %[[a4:.*]] = vector.extract %[[A]][2] : vector<4x3xf32>
		// MATRIX: %[[a5:.*]] = vector.insert_strided_slice %[[a4]], %[[a3]] {offsets = [6], strides = [1]} : vector<3xf32> into vector<12xf32>
		// MATRIX: %[[a6:.*]] = vector.extract %[[A]][3] : vector<4x3xf32>
		// MATRIX: %[[a7:.*]] = vector.insert_strided_slice %[[a6]], %[[a5]] {offsets = [9], strides = [1]} : vector<3xf32> into vector<12xf32>
		// MATRIX: %[[b8:.*]] = vector.extract %[[B]][0] : vector<2x4xf32>
		// MATRIX: %[[b9:.*]] = vector.insert_strided_slice %[[b8]], %[[vcst_0]] {offsets = [0], strides = [1]} : vector<4xf32> into vector<8xf32>
		// MATRIX: %[[b10:.*]] = vector.extract %[[B]][1] : vector<2x4xf32>
		// MATRIX: %[[b11:.*]] = vector.insert_strided_slice %[[b10]], %[[b9]] {offsets = [4], strides = [1]} : vector<4xf32> into vector<8xf32>
		// MATRIX: %[[mm12:.*]] = vector.matrix_multiply %[[a7]], %[[b11]] {lhs_columns = 3 : i32, lhs_rows = 4 : i32, rhs_columns = 4 : i32} : (vector<12xf32>, vector<8xf32>) -> vector<12xf32>
		// MATRIX: %[[mm13:.*]] = vector.strided_slice %[[mm12]] {offsets = [0], sizes = [2], strides = [1]} : vector<12xf32> to vector<2xf32>
		// MATRIX: %[[mm14:.*]] = vector.insert %[[mm13]], %[[vcst_1]] [0] : vector<2xf32> into vector<3x2xf32>
		// MATRIX: %[[mm15:.*]] = vector.strided_slice %[[mm12]] {offsets = [2], sizes = [2], strides = [1]} : vector<12xf32> to vector<2xf32>
		// MATRIX: %[[mm16:.*]] = vector.insert %[[mm15]], %[[mm14]] [1] : vector<2xf32> into vector<3x2xf32>
		// MATRIX: %[[mm17:.*]] = vector.strided_slice %[[mm12]] {offsets = [4], sizes = [2], strides = [1]} : vector<12xf32> to vector<2xf32>
		// MATRIX: %[[mm18:.*]] = vector.insert %[[mm17]], %[[mm16]] [2] : vector<2xf32> into vector<3x2xf32>
		// MATRIX: %[[mm19:.*]] = addf %[[C]], %[[mm18]] : vector<3x2xf32>
		#column_major_matmat_accesses = [
		affine_map<(i, j, k) -> (k, j)>,
		affine_map<(i, j, k) -> (i, k)>,
		affine_map<(i, j, k) -> (j, i)>
		]
		#column_major_matmat_trait = {
		indexing_maps = #column_major_matmat_accesses,
		iterator_types = ["parallel", "parallel", "reduction"]
		}
		func @column_major_matmul(%arg0: vector<4x3xf32>,
		%arg1: vector<2x4xf32>,
		%arg2: vector<3x2xf32>) -> vector<3x2xf32> {
		%0 = vector.contract #column_major_matmat_trait %arg0, %arg1, %arg2
		: vector<4x3xf32>, vector<2x4xf32> into vector<3x2xf32>
		return %0 : vector<3x2xf32>
		}

This is an archive of the discontinued LLVM Phabricator instance.

[mlir][Vector]Lower vector.contract to llvm.intr.matrix_multiplyClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 250298

mlir/include/mlir/Dialect/Utils/StructuredOpsUtils.h

mlir/lib/Conversion/VectorToLLVM/ConvertVectorToLLVM.cpp

mlir/lib/Dialect/Linalg/Transforms/LinalgTransforms.cpp

mlir/lib/Dialect/VectorOps/VectorTransforms.cpp

mlir/test/Dialect/VectorOps/vector-contract-transforms.mlir

[mlir][Vector]Lower vector.contract to llvm.intr.matrix_multiply
ClosedPublic