This is an archive of the discontinued LLVM Phabricator instance.

[flang][hlfir] add hlfir.matmul_transpose operation
ClosedPublic

Authored by tblah on Mar 13 2023, 10:06 AM.

Download Raw Diff

Details

Reviewers

jeanPerier
clementval
sscalpone
vzakhari

Commits

rG49bd444fc361: [flang][hlfir] add hlfir.matmul_transpose operation

Summary

This operation will be used to transform MATMUL(TRANSPOSE(a), b). The
transformation will go in the following stages:

Lowering to hlfir.transpose and hlfir.matmul
Canonicalise to hlfir.matmul_transpose
hlfir.matmul_transpose will be lowered to FIR as a new runtime library call

Step 2 (and this operation) are included for consistency with the other
hlfir intrinsic operations and to avoid mixing concerns in the intrinsic
lowering pass.

In step 3, a new runtime library call is used because this operation is
most easily implemented in one go (the transposed indexing actually
makes the indexing simpler than for a normal matrix multiplication). In
the long run, it is intended that HLFIR will allow the same buffer
to be shared between different runtime calls without temporary
allocations, but in this specific case we can do even better than that
with a dedicated implementation.

This should speed up galgel from SPEC2000 (but this hadn't been tested
yet). The optimization was implemented in Classic Flang.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

tblah created this revision.Mar 13 2023, 10:06 AM

Herald added a reviewer: sscalpone. · View Herald TranscriptMar 13 2023, 10:06 AM

Herald added projects: Restricted Project, Restricted Project. · View Herald Transcript

Herald added subscribers: sunshaoce, mehdi_amini. · View Herald Transcript

tblah requested review of this revision.Mar 13 2023, 10:06 AM

Herald added a subscriber: jdoerfert. · View Herald TranscriptMar 13 2023, 10:06 AM

tblah added a child revision: D145959: [flang][hlfir] add matmul canonicalizer.Mar 13 2023, 10:08 AM

Harbormaster completed remote builds in B219090: Diff 504728.Mar 13 2023, 1:29 PM

tschuett added a subscriber: tschuett.Mar 14 2023, 1:29 AM

jeanPerier added a reviewer: vzakhari.Mar 14 2023, 2:54 AM

vzakhari accepted this revision.Mar 16 2023, 9:48 AM

This revision is now accepted and ready to land.Mar 16 2023, 9:48 AM

Closed by commit rG49bd444fc361: [flang][hlfir] add hlfir.matmul_transpose operation (authored by tblah). · Explain WhyMar 17 2023, 2:31 AM

This revision was automatically updated to reflect the committed changes.

tblah added a commit: rG49bd444fc361: [flang][hlfir] add hlfir.matmul_transpose operation.

Revision Contents

Path

Size

flang/

docs/

HighLevelFIR.md

34 lines

include/

flang/

Optimizer/

HLFIR/

HLFIROps.td

23 lines

lib/

Optimizer/

HLFIR/

IR/

HLFIROps.cpp

65 lines

test/

HLFIR/

invalid.fir

42 lines

matmul_transpose.fir

87 lines

Diff 506017

flang/docs/HighLevelFIR.md

	Show First 20 Lines • Show All 646 Lines • ▼ Show 20 Lines

	hlfir.apply indices will be one based to make further lowering simpler.			hlfir.apply indices will be one based to make further lowering simpler.

	Syntax:			Syntax:
	```			```
	%element = hlfir.apply %array_expr %i, %j: (hlfir.expr<?x?xi32>) -> i32			%element = hlfir.apply %array_expr %i, %j: (hlfir.expr<?x?xi32>) -> i32
	```			```


	#### Introducing operations for transformational intrinsic functions			#### Introducing operations for transformational intrinsic functions

	Motivation: Represent transformational intrinsics functions at a high-level so			Motivation: Represent transformational intrinsics functions at a high-level so
	that they can be manipulated easily by the optimizer, and do not require			that they can be manipulated easily by the optimizer, and do not require
	materializing the result as a temporary in lowering.			materializing the result as a temporary in lowering.

	An operation will be added for each Fortran transformational functions (SUM,			An operation will be added for each Fortran transformational functions (SUM,
	MATMUL, TRANSPOSE....). It translates the Fortran expression verbatim: it takes			MATMUL, TRANSPOSE....). It translates the Fortran expression verbatim: it takes
	Show All 32 Lines
	For the following transformational intrinsics, the current lowering to runtime			For the following transformational intrinsics, the current lowering to runtime
	call will probably be used since there is little point to keep them high level:			call will probably be used since there is little point to keep them high level:
	- command_argument_count, get_team, null, num_images, team_number, this_image			- command_argument_count, get_team, null, num_images, team_number, this_image
	that are more program related (and cannot appear for instance in constant			that are more program related (and cannot appear for instance in constant
	expressions)			expressions)
	- selected_char_kind, selected_int_kind, selected_real_kind that returns scalar			- selected_char_kind, selected_int_kind, selected_real_kind that returns scalar
	integers			integers

				#### Introducing operations for composed intrinsic functions

				Motivation: optimize commonly composed intrinsic functions (e.g.
				MATMUL(TRANSPOSE(a), b)). This optimization is implemented in Classic Flang.

				An operation and runtime function will be added for each commonly used
				composition of intrinsic functions. The operation will be the canonical way to
				write this chained operation (the MLIR canonicalization pass will rewrite the
				operations for the composed intrinsics into this one operation).

				These new operations will be treated as though they were standard
				transformational intrinsic functions.

				The composed intrinsic operation will return a hlfir.expr<T>. The arguments
				may be hlfir.expr<T>, boxed arrays, simple scalar types (e.g. i32, f32), or
				variables.

				To keep things simple, these operations will only match one form of the composed
				intrinsic functions: therefore there will be no optional arguments.

				Syntax:
				```
				%res = hlfir."intrinsic_name" %expr_or_var, ...
				```

				The composed intrinsic operation will be lowered to a `fir.call` to the newly
				added runtime implementation of the operation.

				These operations should not be added where the only improvement is to avoid
				creating a temporary intermediate buffer which would otherwise be removed by
				intelligent bufferization of a hlfir.expr. Similarly, these should not replace
				profitable uses of hlfir.elemental.

	#### Introducing operations for character operations and elemental intrinsic functions			#### Introducing operations for character operations and elemental intrinsic functions


	Motivation: represent character operations without requiring the operand and			Motivation: represent character operations without requiring the operand and
	results to be materialized in memory.			results to be materialized in memory.

	fir.char_op is intended to represent:			fir.char_op is intended to represent:
	- Character concatenation (//)			- Character concatenation (//)
	▲ Show 20 Lines • Show All 735 Lines • Show Last 20 Lines

flang/include/flang/Optimizer/HLFIR/HLFIROps.td

Show First 20 Lines • Show All 396 Lines • ▼ Show 20 Lines	def hlfir_TransposeOp : hlfir_Op<"transpose", []> {

let assemblyFormat = [{		let assemblyFormat = [{
$array attr-dict `:` functional-type(operands, results)		$array attr-dict `:` functional-type(operands, results)
}];		}];

let hasVerifier = 1;		let hasVerifier = 1;
}		}

		def hlfir_MatmulTransposeOp : hlfir_Op<"matmul_transpose",
		[DeclareOpInterfaceMethods<ArithFastMathInterface>]> {
		let summary = "Optimized MATMUL(TRANSPOSE(...), ...)";
		let description = [{
		Matrix multiplication where the left hand side is transposed
		}];

		let arguments = (ins
		AnyFortranNumericalOrLogicalArrayObject:$lhs,
		AnyFortranNumericalOrLogicalArrayObject:$rhs,
		DefaultValuedAttr<Arith_FastMathAttr,
		"::mlir::arith::FastMathFlags::none">:$fastmath
		);

		let results = (outs hlfir_ExprType);

		let assemblyFormat = [{
		$lhs $rhs attr-dict `:` functional-type(operands, results)
		}];

		let hasVerifier = 1;
		}

def hlfir_AssociateOp : hlfir_Op<"associate", [AttrSizedOperandSegments,		def hlfir_AssociateOp : hlfir_Op<"associate", [AttrSizedOperandSegments,
DeclareOpInterfaceMethods<fir_FortranVariableOpInterface>]> {		DeclareOpInterfaceMethods<fir_FortranVariableOpInterface>]> {
let summary = "Create a variable from an expression value";		let summary = "Create a variable from an expression value";
let description = [{		let description = [{
Create a variable from an expression value.		Create a variable from an expression value.
For expressions, this operation is an incentive to re-use the expression		For expressions, this operation is an incentive to re-use the expression
storage, if any, after the bufferization pass when possible (if the		storage, if any, after the bufferization pass when possible (if the
expression is not used afterwards).		expression is not used afterwards).
▲ Show 20 Lines • Show All 328 Lines • Show Last 20 Lines

flang/lib/Optimizer/HLFIR/IR/HLFIROps.cpp

Show First 20 Lines • Show All 663 Lines • ▼ Show 20 Lines	mlir::LogicalResult hlfir::TransposeOp::verify() {
if (eleTy != resultEleTy)		if (eleTy != resultEleTy)
return emitOpError(		return emitOpError(
"input and output arrays should have the same element type");		"input and output arrays should have the same element type");

return mlir::success();		return mlir::success();
}		}

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
		// MatmulTransposeOp
		//===----------------------------------------------------------------------===//

		mlir::LogicalResult hlfir::MatmulTransposeOp::verify() {
		mlir::Value lhs = getLhs();
		mlir::Value rhs = getRhs();
		fir::SequenceType lhsTy =
		hlfir::getFortranElementOrSequenceType(lhs.getType())
		.cast<fir::SequenceType>();
		fir::SequenceType rhsTy =
		hlfir::getFortranElementOrSequenceType(rhs.getType())
		.cast<fir::SequenceType>();
		llvm::ArrayRef<int64_t> lhsShape = lhsTy.getShape();
		llvm::ArrayRef<int64_t> rhsShape = rhsTy.getShape();
		std::size_t lhsRank = lhsShape.size();
		std::size_t rhsRank = rhsShape.size();
		mlir::Type lhsEleTy = lhsTy.getEleTy();
		mlir::Type rhsEleTy = rhsTy.getEleTy();
		hlfir::ExprType resultTy = getResult().getType().cast<hlfir::ExprType>();
		llvm::ArrayRef<int64_t> resultShape = resultTy.getShape();
		mlir::Type resultEleTy = resultTy.getEleTy();

		// lhs must have rank 2 for the transpose to be valid
		if ((lhsRank != 2) \|\| ((rhsRank != 1) && (rhsRank != 2)))
		return emitOpError("array must have either rank 1 or rank 2");

		if (mlir::isa<fir::LogicalType>(lhsEleTy) !=
		mlir::isa<fir::LogicalType>(rhsEleTy))
		return emitOpError("if one array is logical, so should the other be");

		// for matmul we compare the last dimension of lhs with the first dimension of
		// rhs, but for MatmulTranspose, dimensions of lhs are inverted by the
		// transpose
		int64_t firstLhsDim = lhsShape[0];
		int64_t firstRhsDim = rhsShape[0];
		constexpr int64_t unknownExtent = fir::SequenceType::getUnknownExtent();
		if (firstLhsDim != firstRhsDim)
		if ((firstLhsDim != unknownExtent) && (firstRhsDim != unknownExtent))
		return emitOpError(
		"the first dimension of LHS should match the first dimension of RHS");

		if (mlir::isa<fir::LogicalType>(lhsEleTy) !=
		mlir::isa<fir::LogicalType>(resultEleTy))
		return emitOpError("the result type should be a logical only if the "
		"argument types are logical");

		llvm::SmallVector<int64_t, 2> expectedResultShape;
		if (rhsRank == 2) {
		expectedResultShape.push_back(lhsShape[1]);
		expectedResultShape.push_back(rhsShape[1]);
		} else {
		// rhsRank == 1
		expectedResultShape.push_back(lhsShape[1]);
		}
		if (resultShape.size() != expectedResultShape.size())
		return emitOpError("incorrect result shape");
		if (resultShape[0] != expectedResultShape[0])
		return emitOpError("incorrect result shape");
		if (resultShape.size() == 2 && resultShape[1] != expectedResultShape[1])
		return emitOpError("incorrect result shape");

		return mlir::success();
		}

		//===----------------------------------------------------------------------===//
// AssociateOp		// AssociateOp
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

void hlfir::AssociateOp::build(mlir::OpBuilder &builder,		void hlfir::AssociateOp::build(mlir::OpBuilder &builder,
mlir::OperationState &result, mlir::Value source,		mlir::OperationState &result, mlir::Value source,
llvm::StringRef uniq_name, mlir::Value shape,		llvm::StringRef uniq_name, mlir::Value shape,
mlir::ValueRange typeparams,		mlir::ValueRange typeparams,
fir::FortranVariableFlagsAttr fortran_attrs) {		fir::FortranVariableFlagsAttr fortran_attrs) {
▲ Show 20 Lines • Show All 99 Lines • Show Last 20 Lines

flang/test/HLFIR/invalid.fir

	Show First 20 Lines • Show All 392 Lines • ▼ Show 20 Lines
	// -----			// -----
	func.func @bad_transpose3(%arg0: !hlfir.expr<2x3xi32>) {			func.func @bad_transpose3(%arg0: !hlfir.expr<2x3xi32>) {
	// expected-error@+1 {{'hlfir.transpose' op input and output arrays should have the same element type}}			// expected-error@+1 {{'hlfir.transpose' op input and output arrays should have the same element type}}
	%0 = hlfir.transpose %arg0 : (!hlfir.expr<2x3xi32>) -> !hlfir.expr<3x2xf64>			%0 = hlfir.transpose %arg0 : (!hlfir.expr<2x3xi32>) -> !hlfir.expr<3x2xf64>
	return			return
	}			}

	// -----			// -----
				func.func @bad_matmultranspose1(%arg0: !hlfir.expr<?x?x?xi32>, %arg1: !hlfir.expr<?x?xi32>) {
				// expected-error@+1 {{'hlfir.matmul_transpose' op array must have either rank 1 or rank 2}}
				%0 = hlfir.matmul_transpose %arg0 %arg1 : (!hlfir.expr<?x?x?xi32>, !hlfir.expr<?x?xi32>) -> !hlfir.expr<?x?xi32>
				return
				}

				// -----
				func.func @bad_matmultranspose2(%arg0: !hlfir.expr<?xi32>, %arg1: !hlfir.expr<?xi32>) {
				// expected-error@+1 {{'hlfir.matmul_transpose' op array must have either rank 1 or rank 2}}
				%0 = hlfir.matmul_transpose %arg0 %arg1 : (!hlfir.expr<?xi32>, !hlfir.expr<?xi32>) -> !hlfir.expr<?x?xi32>
				return
				}

				// -----
				func.func @bad_matmultranspose3(%arg0: !hlfir.expr<?x?x!fir.logical<4>>, %arg1: !hlfir.expr<?x?xi32>) {
				// expected-error@+1 {{'hlfir.matmul_transpose' op if one array is logical, so should the other be}}
				%0 = hlfir.matmul_transpose %arg0 %arg1 : (!hlfir.expr<?x?x!fir.logical<4>>, !hlfir.expr<?x?xi32>) -> !hlfir.expr<?x?xi32>
				return
				}

				// -----
				func.func @bad_matmultranspose5(%arg0: !hlfir.expr<?x?xi32>, %arg1: !hlfir.expr<?x?xi32>) {
				// expected-error@+1 {{'hlfir.matmul_transpose' op the result type should be a logical only if the argument types are logical}}
				%0 = hlfir.matmul_transpose %arg0 %arg1 : (!hlfir.expr<?x?xi32>, !hlfir.expr<?x?xi32>) -> !hlfir.expr<?x?x!fir.logical<4>>
				return
				}

				// -----
				func.func @bad_matmultranspose6(%arg0: !hlfir.expr<2x1xi32>, %arg1: !hlfir.expr<2x3xi32>) {
				// expected-error@+1 {{'hlfir.matmul_transpose' op incorrect result shape}}
				%0 = hlfir.matmul_transpose %arg0 %arg1 : (!hlfir.expr<2x1xi32>, !hlfir.expr<2x3xi32>) -> !hlfir.expr<10x30xi32>
				return
				}

				// -----
				func.func @bad_matmultranspose7(%arg0: !hlfir.expr<2x1xi32>, %arg1: !hlfir.expr<2xi32>) {
				// expected-error@+1 {{'hlfir.matmul_transpose' op incorrect result shape}}
				%0 = hlfir.matmul_transpose %arg0 %arg1 : (!hlfir.expr<2x1xi32>, !hlfir.expr<2xi32>) -> !hlfir.expr<1x3xi32>
				return
				}

				// -----
	func.func @bad_assign_1(%arg0: !fir.box<!fir.array<?xi32>>, %arg1: !fir.box<!fir.array<?xi32>>) {			func.func @bad_assign_1(%arg0: !fir.box<!fir.array<?xi32>>, %arg1: !fir.box<!fir.array<?xi32>>) {
	// expected-error@+1 {{'hlfir.assign' op lhs must be an allocatable when `realloc` is set}}			// expected-error@+1 {{'hlfir.assign' op lhs must be an allocatable when `realloc` is set}}
	hlfir.assign %arg1 to %arg0 realloc : !fir.box<!fir.array<?xi32>>, !fir.box<!fir.array<?xi32>>			hlfir.assign %arg1 to %arg0 realloc : !fir.box<!fir.array<?xi32>>, !fir.box<!fir.array<?xi32>>
	return			return
	}			}

	// -----			// -----
	func.func @bad_assign_2(%arg0: !fir.ref<!fir.box<!fir.heap<!fir.array<?xi32>>>>, %arg1: !fir.box<!fir.array<?xi32>>) {			func.func @bad_assign_2(%arg0: !fir.ref<!fir.box<!fir.heap<!fir.array<?xi32>>>>, %arg1: !fir.box<!fir.array<?xi32>>) {
	▲ Show 20 Lines • Show All 52 Lines • Show Last 20 Lines

flang/test/HLFIR/matmul_transpose.fir

This file was added.

				// Test hlfir.matmul_transpose operation parse, verify (no errors), and unparse

				// RUN: fir-opt %s \| fir-opt \| FileCheck %s

				// arguments are expressions of known shape
				func.func @matmul_transpose0(%arg0: !hlfir.expr<2x2xi32>, %arg1: !hlfir.expr<2x2xi32>) {
				%res = hlfir.matmul_transpose %arg0 %arg1 : (!hlfir.expr<2x2xi32>, !hlfir.expr<2x2xi32>) -> !hlfir.expr<2x2xi32>
				return
				}
				// CHECK-LABEL: func.func @matmul_transpose0
				// CHECK: %[[ARG0:.*]]: !hlfir.expr<2x2xi32>,
				// CHECK: %[[ARG1:.*]]: !hlfir.expr<2x2xi32>) {
				// CHECK-NEXT: %[[RES:.*]] = hlfir.matmul_transpose %[[ARG0]] %[[ARG1]] : (!hlfir.expr<2x2xi32>, !hlfir.expr<2x2xi32>) -> !hlfir.expr<2x2xi32>
				// CHECK-NEXT: return
				// CHECK-NEXT: }

				// arguments are expressions of assumed shape
				func.func @matmul_transpose1(%arg0: !hlfir.expr<?x?xi32>, %arg1: !hlfir.expr<?x?xi32>) {
				%res = hlfir.matmul_transpose %arg0 %arg1 : (!hlfir.expr<?x?xi32>, !hlfir.expr<?x?xi32>) -> !hlfir.expr<?x?xi32>
				return
				}
				// CHECK-LABEL: func.func @matmul_transpose1
				// CHECK: %[[ARG0:.*]]: !hlfir.expr<?x?xi32>,
				// CHECK: %[[ARG1:.*]]: !hlfir.expr<?x?xi32>) {
				// CHECK-NEXT: %[[RES:.*]] = hlfir.matmul_transpose %[[ARG0]] %[[ARG1]] : (!hlfir.expr<?x?xi32>, !hlfir.expr<?x?xi32>) -> !hlfir.expr<?x?xi32>
				// CHECK-NEXT: return
				// CHECK-NEXT: }

				// arguments are expressions where only some dimensions are known #1
				func.func @matmul_transpose2(%arg0: !hlfir.expr<?x2xi32>, %arg1: !hlfir.expr<?x2xi32>) {
				%res = hlfir.matmul_transpose %arg0 %arg1 : (!hlfir.expr<?x2xi32>, !hlfir.expr<?x2xi32>) -> !hlfir.expr<2x2xi32>
				return
				}
				// CHECK-LABEL: func.func @matmul_transpose2
				// CHECK: %[[ARG0:.*]]: !hlfir.expr<?x2xi32>,
				// CHECK: %[[ARG1:.*]]: !hlfir.expr<?x2xi32>) {
				// CHECK-NEXT: %[[RES:.*]] = hlfir.matmul_transpose %[[ARG0]] %[[ARG1]] : (!hlfir.expr<?x2xi32>, !hlfir.expr<?x2xi32>) -> !hlfir.expr<2x2xi32>
				// CHECK-NEXT: return
				// CHECK-NEXT: }

				// arguments are expressions where only some dimensions are known #2
				func.func @matmul_transpose3(%arg0: !hlfir.expr<2x?xi32>, %arg1: !hlfir.expr<2x?xi32>) {
				%res = hlfir.matmul_transpose %arg0 %arg1 : (!hlfir.expr<2x?xi32>, !hlfir.expr<2x?xi32>) -> !hlfir.expr<?x?xi32>
				return
				}
				// CHECK-LABEL: func.func @matmul_transpose3
				// CHECK: %[[ARG0:.*]]: !hlfir.expr<2x?xi32>,
				// CHECK: %[[ARG1:.*]]: !hlfir.expr<2x?xi32>) {
				// CHECK-NEXT: %[[RES:.*]] = hlfir.matmul_transpose %[[ARG0]] %[[ARG1]] : (!hlfir.expr<2x?xi32>, !hlfir.expr<2x?xi32>) -> !hlfir.expr<?x?xi32>
				// CHECK-NEXT: return
				// CHECK-NEXT: }

				// arguments are logicals
				func.func @matmul_transpose4(%arg0: !hlfir.expr<?x?x!fir.logical<4>>, %arg1: !hlfir.expr<?x?x!fir.logical<4>>) {
				%res = hlfir.matmul_transpose %arg0 %arg1 : (!hlfir.expr<?x?x!fir.logical<4>>, !hlfir.expr<?x?x!fir.logical<4>>) -> !hlfir.expr<?x?x!fir.logical<4>>
				return
				}
				// CHECK-LABEL: func.func @matmul_transpose4
				// CHECK: %[[ARG0:.*]]: !hlfir.expr<?x?x!fir.logical<4>>,
				// CHECK: %[[ARG1:.*]]: !hlfir.expr<?x?x!fir.logical<4>>) {
				// CHECK-NEXT: %[[RES:.*]] = hlfir.matmul_transpose %[[ARG0]] %[[ARG1]] : (!hlfir.expr<?x?x!fir.logical<4>>, !hlfir.expr<?x?x!fir.logical<4>>) -> !hlfir.expr<?x?x!fir.logical<4>>
				// CHECK-NEXT: return
				// CHECK-NEXT: }

				// rhs is rank 1
				func.func @matmul_transpose6(%arg0: !hlfir.expr<?x?xi32>, %arg1: !hlfir.expr<?xi32>) {
				%res = hlfir.matmul_transpose %arg0 %arg1 : (!hlfir.expr<?x?xi32>, !hlfir.expr<?xi32>) -> !hlfir.expr<?xi32>
				return
				}
				// CHECK-LABEL: func.func @matmul_transpose6
				// CHECK: %[[ARG0:.*]]: !hlfir.expr<?x?xi32>,
				// CHECK: %[[ARG1:.*]]: !hlfir.expr<?xi32>) {
				// CHECK-NEXT: %[[RES:.*]] = hlfir.matmul_transpose %[[ARG0]] %[[ARG1]] : (!hlfir.expr<?x?xi32>, !hlfir.expr<?xi32>) -> !hlfir.expr<?xi32>
				// CHECK-NEXT: return
				// CHECK-NEXT: }

				// arguments are boxed arrays
				func.func @matmul_transpose7(%arg0: !fir.box<!fir.array<2x2xf32>>, %arg1: !fir.box<!fir.array<2x2xf32>>) {
				%res = hlfir.matmul_transpose %arg0 %arg1 : (!fir.box<!fir.array<2x2xf32>>, !fir.box<!fir.array<2x2xf32>>) -> !hlfir.expr<2x2xf32>
				return
				}
				// CHECK-LABEL: func.func @matmul_transpose7
				// CHECK: %[[ARG0:.*]]: !fir.box<!fir.array<2x2xf32>>,
				// CHECK: %[[ARG1:.*]]: !fir.box<!fir.array<2x2xf32>>) {
				// CHECK-NEXT: %[[RES:.*]] = hlfir.matmul_transpose %[[ARG0]] %[[ARG1]] : (!fir.box<!fir.array<2x2xf32>>, !fir.box<!fir.array<2x2xf32>>) -> !hlfir.expr<2x2xf32>
				// CHECK-NEXT: return
				// CHECK-NEXT: }

This is an archive of the discontinued LLVM Phabricator instance.

[flang][hlfir] add hlfir.matmul_transpose operationClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 506017

flang/docs/HighLevelFIR.md

flang/include/flang/Optimizer/HLFIR/HLFIROps.td

flang/lib/Optimizer/HLFIR/IR/HLFIROps.cpp

flang/test/HLFIR/invalid.fir

flang/test/HLFIR/matmul_transpose.fir

[flang][hlfir] add hlfir.matmul_transpose operation
ClosedPublic