This is an archive of the discontinued LLVM Phabricator instance.

[MLIR][Linalg] Named op 'add' element-wise
ClosedPublic

Authored by rengolin on Jul 5 2023, 5:20 AM.

Download Raw Diff

Details

Reviewers

ftynse
nicolasvasilache
harsh
chelini
adam-smnk

Commits

rG7e486d5c2d26: [MLIR][Linalg] Named op 'add' element-wise

Summary

This adds the first strict element-wise named op to Linalg.

The semantics here is to not allow auto-cast, broadcast semantics and to
restrict the operations only to identical types. The remaining semantics
must come in the form of surrounding operations on operands, to avoid
ambiguity.

Examples:

// Cast int-to-fp
%0 = linalg.copy ins(%in: tensor<32x32xi32>)
                 outs(%out: tensor<32x32xf32>)
%1 = linalg.add  ins(%arg, %0: tensor<32x32xf32>, tensor<32x32xf32>)
                 outs(%0: tensor<32x32xf32>)

// This can be lowered to
%1 = linalg.generic {...}
          ins(%arg, %in: tensor<32x32xf32>, tensor<32x32xi32>)
          outs(%0: tensor<32x32xf32>) {
  ^bb0(%a: f32, %i: i32, %out: f32):
    %f = arith.uitofp %i : f32
    %0 = arith.addf %a, %f : f32
    linalg.yield %0 : f32
}

// Broadcast
%0 = linalg.broadcast ins(%in: tensor<32xf32>)
                      init(%out: tensor<32x32xf32>)
%1 = linalg.add  ins(%arg, %0: tensor<32x32xf32>, tensor<32x32xf32>)
                 outs(%0: tensor<32x32xf32>)

// This can be lowered to
#bcast_map = affine_map<(d0, d1) -> (d0)>
%1 = linalg.generic {... #bcast_map] }
          ins(%arg, %in: tensor<32x32xf32>, tensor<32xf32>)
          outs(%0: tensor<32x32xf32>) {
  ^bb0(%a: f32, %b: f32, %out: f32):
    %0 = arith.addf %a, %b : f32
    linalg.yield %0 : f32
}

Once this gets accepted, other arithmetic and maths operations will be
added accordingly, with the same semantics.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

rengolin created this revision.Jul 5 2023, 5:20 AM

Herald added a project: Restricted Project. · View Herald TranscriptJul 5 2023, 5:20 AM

Herald added subscribers: bviyer, Moerafaat, bzcheeseman and 25 others. · View Herald Transcript

rengolin requested review of this revision.Jul 5 2023, 5:20 AM

Herald added a project: Restricted Project. · View Herald TranscriptJul 5 2023, 5:20 AM

Herald added subscribers: limo1996, stephenneuendorffer. · View Herald Transcript

rengolin added a reviewer: adam-smnk.Jul 5 2023, 5:24 AM

Harbormaster completed remote builds in B243187: Diff 537308.Jul 5 2023, 5:35 AM

It is very timely you are looking at this, the implicit casting rules in this part of the world have been problematic and revisiting from stricter starting semantics makes a lot of sense.
Thanks for pushing this!

This revision is now accepted and ready to land.Jul 5 2023, 5:56 AM

Closed by commit rG7e486d5c2d26: [MLIR][Linalg] Named op 'add' element-wise (authored by rengolin). · Explain WhyJul 5 2023, 8:37 AM

This revision was automatically updated to reflect the committed changes.

rengolin added a commit: rG7e486d5c2d26: [MLIR][Linalg] Named op 'add' element-wise.

Revision Contents

Path

Size

mlir/

include/

mlir/

Dialect/

Linalg/

IR/

LinalgNamedStructuredOps.yaml

49 lines

python/

mlir/

dialects/

linalg/

opdsl/

ops/

core_named_ops.py

19 lines

test/

Dialect/

Linalg/

generalize-named-ops.mlir

25 lines

named-ops-fail.mlir

16 lines

named-ops.mlir

34 lines

Diff 537370

mlir/include/mlir/Dialect/Linalg/IR/LinalgNamedStructuredOps.yaml

Show First 20 Lines • Show All 151 Lines • ▼ Show 20 Lines	value: !ScalarExpression
kind: type		kind: type
attr_name: cast		attr_name: cast
type_var: U		type_var: U
operands:		operands:
- !ScalarExpression		- !ScalarExpression
scalar_arg: rhs		scalar_arg: rhs
--- !LinalgOpConfig		--- !LinalgOpConfig
metadata: !LinalgOpMetadata		metadata: !LinalgOpMetadata
		name: add
		cpp_class_name: AddOp
		doc: \|-
		Adds two tensors elementwise.

		The shapes and element types must be identical. The appropriate casts,
		broadcasts and reductions should be done previously to calling this op.

		This means reduction/broadcast/element cast semantics is explicit. Further
		passes can take that into account when lowering this code. For example,
		a `linalg.broadcast` + `linalg.add` sequence can be lowered to a
		`linalg.generic` with different affine maps for the two operands.
		structured_op: !LinalgStructuredOpConfig
		args:
		- !LinalgOperandDefConfig
		name: lhs
		kind: input_tensor
		type_var: T
		shape_map: affine_map<() -> ()>
		- !LinalgOperandDefConfig
		name: rhs
		kind: input_tensor
		type_var: T
		shape_map: affine_map<() -> ()>
		- !LinalgOperandDefConfig
		name: out
		kind: output_tensor
		type_var: T
		shape_map: affine_map<() -> ()>
		indexing_maps: !LinalgIndexingMapsConfig
		static_indexing_maps:
		- affine_map<() -> ()>
		- affine_map<() -> ()>
		- affine_map<() -> ()>
		iterator_types: []
		assignments:
		- !ScalarAssign
		arg: out
		value: !ScalarExpression
		scalar_fn:
		kind: binary
		fn_name: add
		operands:
		- !ScalarExpression
		scalar_arg: lhs
		- !ScalarExpression
		scalar_arg: rhs
		--- !LinalgOpConfig
		metadata: !LinalgOpMetadata
name: matmul		name: matmul
cpp_class_name: MatmulOp		cpp_class_name: MatmulOp
doc: \|-		doc: \|-
Performs a matrix multiplication of two 2D inputs.		Performs a matrix multiplication of two 2D inputs.

Numeric casting is performed on the operands to the inner multiply, promoting		Numeric casting is performed on the operands to the inner multiply, promoting
them to the same data type as the accumulator/output.		them to the same data type as the accumulator/output.
implements:		implements:
▲ Show 20 Lines • Show All 4,983 Lines • Show Last 20 Lines

mlir/python/mlir/dialects/linalg/opdsl/ops/core_named_ops.py

Show First 20 Lines • Show All 46 Lines • ▼ Show 20 Lines	):

Numeric casting is performed on the input operand, promoting it to the same		Numeric casting is performed on the input operand, promoting it to the same
data type as the accumulator/output.		data type as the accumulator/output.
"""		"""
O[None] = fun(cast(U, lhs[None]), cast(U, rhs[None]))		O[None] = fun(cast(U, lhs[None]), cast(U, rhs[None]))


@linalg_structured_op		@linalg_structured_op
		def add(
		lhs=TensorDef(T1),
		rhs=TensorDef(T1),
		O=TensorDef(T1, output=True),
		):
		""" Adds two tensors elementwise.

		The shapes and element types must be identical. The appropriate casts,
		broadcasts and reductions should be done previously to calling this op.

		This means reduction/broadcast/element cast semantics is explicit. Further
		passes can take that into account when lowering this code. For example,
		a `linalg.broadcast` + `linalg.add` sequence can be lowered to a
		`linalg.generic` with different affine maps for the two operands.
		"""
		O[None] = lhs[None] + rhs[None]


		@linalg_structured_op
def matmul(		def matmul(
A=TensorDef(T1, S.M, S.K),		A=TensorDef(T1, S.M, S.K),
B=TensorDef(T2, S.K, S.N),		B=TensorDef(T2, S.K, S.N),
C=TensorDef(U, S.M, S.N, output=True),		C=TensorDef(U, S.M, S.N, output=True),
cast=TypeFnAttrDef(default=TypeFn.cast_signed),		cast=TypeFnAttrDef(default=TypeFn.cast_signed),
):		):
"""Performs a matrix multiplication of two 2D inputs.		"""Performs a matrix multiplication of two 2D inputs.

▲ Show 20 Lines • Show All 1,325 Lines • Show Last 20 Lines

mlir/test/Dialect/Linalg/generalize-named-ops.mlir

Show First 20 Lines • Show All 280 Lines • ▼ Show 20 Lines	func.func @generalize_linalg_map(%arg0: memref<1x8x8x8xf32>) {
// CHECK: linalg.map		// CHECK: linalg.map
// CHECK-NOT: linalg.generic		// CHECK-NOT: linalg.generic
linalg.map outs(%arg0 : memref<1x8x8x8xf32>)		linalg.map outs(%arg0 : memref<1x8x8x8xf32>)
() {		() {
linalg.yield %cst : f32		linalg.yield %cst : f32
}		}
return		return
}		}

		// -----

		func.func @generalize_add(%lhs: memref<7x14x21xf32>, %rhs: memref<7x14x21xf32>,
		%out: memref<7x14x21xf32>) {
		linalg.add ins(%lhs, %rhs : memref<7x14x21xf32>, memref<7x14x21xf32>)
		outs(%out : memref<7x14x21xf32>)
		return
		}

		// CHECK-DAG: #[[MAP:.+]] = affine_map<(d0, d1, d2) -> (d0, d1, d2)>

		// CHECK: func @generalize_add
		// CHECK-SAME: (%[[LHS:.+]]: memref<7x14x21xf32>, %[[RHS:.+]]: memref<7x14x21xf32>,
		// CHECK-SAME: %[[OUT:.+]]: memref<7x14x21xf32>)

		// CHECK: linalg.generic
		// CHECK-SAME: indexing_maps = [#[[MAP]], #[[MAP]], #[[MAP]]]
		// CHECK-SAME: iterator_types = ["parallel", "parallel", "parallel"]}
		// CHECK-SAME: ins(%[[LHS]], %[[RHS]] : memref<7x14x21xf32>, memref<7x14x21xf32>)
		// CHECK-SAME: outs(%[[OUT]] : memref<7x14x21xf32>)

		// CHECK: ^{{.+}}(%[[BBARG0:.+]]: f32, %[[BBARG1:.+]]: f32, %[[BBARG2:.+]]: f32)
		// CHECK-NEXT: %[[SUM:.+]] = arith.addf %[[BBARG0]], %[[BBARG1]] : f32
		// CHECK-NEXT: linalg.yield %[[SUM]] : f32

mlir/test/Dialect/Linalg/named-ops-fail.mlir

This file was added.

				// RUN: not mlir-opt -split-input-file -verify-diagnostics %s 2>&1 \| FileCheck %s

				func.func @add_type_cast(%arg0: memref<4x8x16xf32>, %arg1: memref<4x8x16xf16>, %arg2: memref<4x8x16xf32>) {
				// CHECK: op requires the same type for all operands and results
				linalg.add ins(%arg0, %arg1 : memref<4x8x16xf32>, memref<4x8x16xf16>) outs(%arg2: memref<4x8x16xf32>)
				return
				}

				// -----

				func.func @add_broadcast(%arg0: memref<8x16xf32>, %arg1: memref<4x8x16xf32>, %arg2: memref<4x8x16xf32>) {
				// CHECK: op expected operand rank (2) to match the result rank of indexing_map #0 (3)
				linalg.add ins(%arg0, %arg1 : memref<8x16xf32>, memref<4x8x16xf32>) outs(%arg2: memref<4x8x16xf32>)
				return
				}

mlir/test/Dialect/Linalg/named-ops.mlir

	Show First 20 Lines • Show All 1,178 Lines • ▼ Show 20 Lines
	// CHECK-LABEL: func @batchmatmul_transpose_b			// CHECK-LABEL: func @batchmatmul_transpose_b
	// CHECK: linalg.batch_matmul_transpose_b			// CHECK: linalg.batch_matmul_transpose_b
	// CHECK-SAME: ins(%{{.+}}, %{{.+}} : memref<2x3x5xf32>, memref<2x7x5xf32>)			// CHECK-SAME: ins(%{{.+}}, %{{.+}} : memref<2x3x5xf32>, memref<2x7x5xf32>)
	// CHECK-SAME: outs(%{{.+}} : memref<2x3x7xf32>)			// CHECK-SAME: outs(%{{.+}} : memref<2x3x7xf32>)
	func.func @batchmatmul_transpose_b(%arg0: memref<2x3x5xf32>, %arg1: memref<2x7x5xf32>, %arg2: memref<2x3x7xf32>) {			func.func @batchmatmul_transpose_b(%arg0: memref<2x3x5xf32>, %arg1: memref<2x7x5xf32>, %arg2: memref<2x3x7xf32>) {
	linalg.batch_matmul_transpose_b ins(%arg0, %arg1 : memref<2x3x5xf32>, memref<2x7x5xf32>) outs(%arg2: memref<2x3x7xf32>)			linalg.batch_matmul_transpose_b ins(%arg0, %arg1 : memref<2x3x5xf32>, memref<2x7x5xf32>) outs(%arg2: memref<2x3x7xf32>)
	return			return
	}			}

				// -----

				// CHECK-LABEL: func @add_dynamic
				func.func @add_dynamic(%arg0: memref<?x?x?xf32>, %arg1: memref<?x?x?xf32>, %arg2: memref<?x?x?xf32>) {
				// CHECK: linalg.add
				// CHECK-SAME: ins(%{{.+}}, %{{.+}} : memref<?x?x?xf32>, memref<?x?x?xf32>)
				// CHECK-SAME: outs(%{{.+}} : memref<?x?x?xf32>)
				linalg.add ins(%arg0, %arg1 : memref<?x?x?xf32>, memref<?x?x?xf32>) outs(%arg2: memref<?x?x?xf32>)
				return
				}

				// -----

				// CHECK-LABEL: func @add_static
				func.func @add_static(%arg0: memref<4x8x16xf32>, %arg1: memref<4x8x16xf32>, %arg2: memref<4x8x16xf32>) {
				// CHECK: linalg.add
				// CHECK-SAME: ins(%{{.+}}, %{{.+}} : memref<4x8x16xf32>, memref<4x8x16xf32>)
				// CHECK-SAME: outs(%{{.+}} : memref<4x8x16xf32>)
				linalg.add ins(%arg0, %arg1 : memref<4x8x16xf32>, memref<4x8x16xf32>) outs(%arg2: memref<4x8x16xf32>)
				return
				}

				// -----

				// CHECK-LABEL: func @add_tensor
				func.func @add_tensor(%arg0: tensor<4x8x16xf32>, %arg1: tensor<4x8x16xf32>) -> tensor<4x8x16xf32> {
				%0 = tensor.empty() : tensor<4x8x16xf32>
				// CHECK: linalg.add
				// CHECK-SAME: ins(%{{.+}}, %{{.+}} : tensor<4x8x16xf32>, tensor<4x8x16xf32>)
				// CHECK-SAME: outs(%{{.+}} : tensor<4x8x16xf32>)
				%1 = linalg.add ins(%arg0, %arg1 : tensor<4x8x16xf32>, tensor<4x8x16xf32>) outs(%0: tensor<4x8x16xf32>) -> tensor<4x8x16xf32>
				return %1 : tensor<4x8x16xf32>
				}

This is an archive of the discontinued LLVM Phabricator instance.

[MLIR][Linalg] Named op 'add' element-wiseClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 537370

mlir/include/mlir/Dialect/Linalg/IR/LinalgNamedStructuredOps.yaml

mlir/python/mlir/dialects/linalg/opdsl/ops/core_named_ops.py

mlir/test/Dialect/Linalg/generalize-named-ops.mlir

mlir/test/Dialect/Linalg/named-ops-fail.mlir

mlir/test/Dialect/Linalg/named-ops.mlir

[MLIR][Linalg] Named op 'add' element-wise
ClosedPublic