This is an archive of the discontinued LLVM Phabricator instance.

Differential D95105

Implement constant folding for PowFOp
Needs Review, Public

Authored by jacksonfellows on Jan 20 2021, 4:22 PM.

This revision needs review, but there are no reviewers specified.

Details

Reviewers: None

Summary

Add a constant folder for PowFOp. Analogous to existing folders for
floating point operators, but instead of using an APFloat method to
perform its operation it first converts the operands to doubles, calls
the built-in pow function, and then converts the result back into the
proper floating point type. This is necessary since APFloat lacks a
pow method. This behavior matches how constant folding is implemented
for the pow intrinsic in
LLVM (https://github.com/llvm/llvm-project/blob/689de5841c1c4c9b0fe711b61d26f7425cf99423/llvm/lib/Analysis/ConstantFolding.cpp#L2373).
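
The round trip described above can be sketched without LLVM or MLIR. This is a minimal standalone illustration, assuming plain `float` stands in for the narrower floating point type; the name `foldPowViaDouble` is hypothetical and is not part of the patch.

```cpp
#include <cmath>

// Hypothetical helper, not part of the patch: mimics the fold's strategy of
// widening to double, calling the C library pow, and narrowing the result.
float foldPowViaDouble(float a, float b) {
  // Widening is exact: every float value is representable as a double.
  double wideA = static_cast<double>(a);
  double wideB = static_cast<double>(b);
  double wideResult = std::pow(wideA, wideB);
  // Narrowing rounds according to the current rounding mode, which by
  // default is round-to-nearest, ties-to-even (comparable to the
  // APFloat::rmNearestTiesToEven used in the patch).
  return static_cast<float>(wideResult);
}
```

With this sketch, `foldPowViaDouble(4.5f, 2.0f)` yields 20.25, the same value the lit test in this revision expects for `powf` on 4.5 and 2.0.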

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

jacksonfellows created this revision. Jan 20 2021, 4:22 PM

Herald added subscribers: teijeong, rdzhabarov, tatianashp and 14 others. Jan 20 2021, 4:22 PM

jacksonfellows requested review of this revision. Jan 20 2021, 4:22 PM

Herald added a project: Restricted Project. Jan 20 2021, 4:22 PM

Herald added subscribers: stephenneuendorffer, nicolasvasilache.

Can you add a lit test for this?

In D95105#2511374, @mehdi_amini wrote:

Can you add a lit test for this?

Should I add it to this file: https://github.com/llvm/llvm-project/blob/main/mlir/test/Transforms/constant-fold.mlir?
Also should the cases mirror what already exists?

rriddle added inline comments. Jan 20 2021, 4:41 PM

mlir/lib/Dialect/StandardOps/IR/Ops.cpp
2276	How does this work for floating point types with greater than 64 bits? e.g. FP80/FP128/etc.

As far as I can tell they would lose precision on the conversion. It seems like LLVM avoids this by only folding halfs, floats, and doubles. We could do the same by checking the operand types first, which is probably the right solution.

Since MLIR is also handling tensors and vectors, I'm not entirely sure how this check would be implemented (i.e. the operands can't always be cast to FloatAttr). The function constFoldBinaryOp currently handles these cases, so we'd have to re-implement some of that logic. Does someone with more experience with the existing folders have any good ideas about this?
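
The precision concern can be illustrated outside of LLVM with a small sketch. It assumes the host's `long double` is wider than `double` (true for x86 80-bit extended precision, but not on every platform); the names `survivesDoubleRoundTrip`, `kLongDoubleIsWider`, and `justAboveOne` are hypothetical.

```cpp
#include <limits>

// Hypothetical helper: true if x is unchanged after being converted to
// double and back, i.e. double can hold it without losing information.
bool survivesDoubleRoundTrip(long double x) {
  double narrowed = static_cast<double>(x);
  return static_cast<long double>(narrowed) == x;
}

// True when long double carries more significand bits than double on this
// host. When it is false, the two types hold the same set of values.
constexpr bool kLongDoubleIsWider =
    std::numeric_limits<long double>::digits >
    std::numeric_limits<double>::digits;

// Smallest long double strictly greater than 1.0. When kLongDoubleIsWider
// holds, this value needs significand bits that double does not have.
long double justAboveOne() {
  return 1.0L + std::numeric_limits<long double>::epsilon();
}
```

On a host where `kLongDoubleIsWider` is true, `survivesDoubleRoundTrip(justAboveOne())` is false while `survivesDoubleRoundTrip(4.5L)` is true, which is the kind of silent loss a fold through `double` would introduce for FP80 or FP128 operands.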

In D95105#2511444, @jacksonfellows wrote:

Since MLIR is also handling tensors and vectors, I'm not entirely sure how this check would be implemented (i.e. the operands can't always be cast to FloatAttr). The function constFoldBinaryOp currently handles these cases, so we'd have to re-implement some of that logic. Does someone with more experience with the existing folders have any good ideas about this?

Can you just add a check before calling constFoldBinaryOp that the result type of the powf when fed to mlir::getElementTypeOrSelf is one of the valid types?

Add check to only fold the pow of halfs, floats, and doubles.

This matches the behavior present in LLVM.

Add lit tests for powf constant folding.

Thanks for adding the folding!

mlir/lib/Dialect/StandardOps/IR/Ops.cpp
2276	I think we can include BF16 here as well.
2277	I wouldn't say "following LLVM" here. The limitation is due to how the folding is implemented, i.e. it goes to double which naturally can't support larger representations.
2281	This comment shouldn't be necessary given that the element types are already guaranteed to be the same.

Add BF16 to allowed FP types

Harbormaster completed remote builds in B86125: Diff 318251. Jan 21 2021, 11:40 AM

Harbormaster completed remote builds in B86126: Diff 318254. Jan 21 2021, 12:11 PM

Harbormaster completed remote builds in B86133: Diff 318268. Jan 21 2021, 12:23 PM

Revision Contents

Path	Size
mlir/include/mlir/Dialect/StandardOps/IR/Ops.td	1 line
mlir/lib/Dialect/StandardOps/IR/Ops.cpp	27 lines
mlir/test/Transforms/constant-fold.mlir	44 lines

Diff 318268

mlir/include/mlir/Dialect/StandardOps/IR/Ops.td

(2,333 lines not shown; enclosing context: let description = [{)

        // SIMD pointwise vector exponentiation
        %f = powf %g, %h : vector<4xf32>

        // Tensor pointwise exponentiation.
        %x = powf %y, %z : tensor<4x?xbf16>
        ```
      }];
+     let hasFolder = 1;
    }

    //===----------------------------------------------------------------------===//
    // PrefetchOp
    //===----------------------------------------------------------------------===//

    def PrefetchOp : Std_Op<"prefetch"> {
      let summary = "prefetch operation";

(1,511 lines not shown)

mlir/lib/Dialect/StandardOps/IR/Ops.cpp

(2,258 lines not shown; enclosing context: OpFoldResult OrOp::fold(ArrayRef<Attribute> operands) {)

      if (lhs() == rhs())
        return rhs();

      return constFoldBinaryOp<IntegerAttr>(operands,
                                            [](APInt a, APInt b) { return a | b; });
    }

    //===----------------------------------------------------------------------===//
+   // PowFOp
+   //===----------------------------------------------------------------------===//
+
+   OpFoldResult PowFOp::fold(ArrayRef<Attribute> operands) {
+     assert(operands.size() == 2 && "binary op takes two operands");
+     if (operands[0] && operands[1] &&
+         operands[0].getType() == operands[1].getType()) {
+       // make sure that types match (similar check performed in constFoldBinaryOp)
+       Type ty = getElementTypeOrSelf(operands[0]);
+       if (ty.isBF16() || ty.isF16() || ty.isF32() || ty.isF64()) {
+         // only fold the pow of floating point types that can be represented as a
+         // double without losing information
+         return constFoldBinaryOp<FloatAttr>(operands, [](APFloat a, APFloat b) {
+           bool unused;
+           // assume a and b are the same floating point type (i.e. share the same
+           // semantics)
+           APFloat res = APFloat(pow(FloatAttr::getValueAsDouble(a),
+                                     FloatAttr::getValueAsDouble(b)));
+           res.convert(a.getSemantics(), APFloat::rmNearestTiesToEven, &unused);
+           return res;
+         });
+       }
+     }
+     return {};
+   }
+
+   //===----------------------------------------------------------------------===//
    // PrefetchOp
    //===----------------------------------------------------------------------===//

    static void print(OpAsmPrinter &p, PrefetchOp op) {
      p << PrefetchOp::getOperationName() << " " << op.memref() << '[';
      p.printOperands(op.indices());
      p << ']' << ", " << (op.isWrite() ? "write" : "read");
      p << ", locality<" << op.localityHint();

(1,670 lines not shown)

mlir/test/Transforms/constant-fold.mlir

(791 lines not shown)

    // -----

    // CHECK-LABEL: func @subview_scalar_fold
    func @subview_scalar_fold(%arg0: memref<f32>) -> memref<f32> {
      // CHECK-NOT: subview
      %c = subview %arg0[] [] [] : memref<f32> to memref<f32>
      return %c : memref<f32>
    }
+
+   // -----
+
+   // CHECK-LABEL: func @simple_powf
+   func @simple_powf() -> f32 {
+     %0 = constant 4.5 : f32
+     %1 = constant 2.0 : f32
+
+     // CHECK-NEXT: [[C:%.+]] = constant 2.025{{0*}}e+01 : f32
+     %2 = powf %0, %1 : f32
+
+     // CHECK-NEXT: return [[C]]
+     return %2 : f32
+   }
+
+   // -----
+
+   // CHECK-LABEL: func @powf_splat_tensor
+   func @powf_splat_tensor() -> tensor<4xf32> {
+     %0 = constant dense<4.5> : tensor<4xf32>
+     %1 = constant dense<2.0> : tensor<4xf32>
+
+     // CHECK-NEXT: [[C:%.+]] = constant dense<2.025{{0*}}e+01> : tensor<4xf32>
+     %2 = powf %0, %1 : tensor<4xf32>
+
+     // CHECK-NEXT: return [[C]]
+     return %2 : tensor<4xf32>
+   }
+
+   // -----
+
+   // CHECK-LABEL: func @powf_dont_fold_f128
+   func @powf_dont_fold_f128() -> f128 {
+     // CHECK-NEXT: [[A:%.+]] = constant 4.5{{0*}}e+00 : f128
+     %0 = constant 4.5 : f128
+     // CHECK-NEXT: [[B:%.+]] = constant 2.0{{0*}}e+00 : f128
+     %1 = constant 2.0 : f128
+
+     // CHECK-NEXT: %0 = powf [[A]], [[B]] : f128
+     %2 = powf %0, %1 : f128
+
+     // CHECK-NEXT: return %0
+     return %2 : f128
+   }