Details

Reviewers

kuhar
nicolasvasilache
mravishankar

Commits

rG8d175b3577cb: [mlir][complex] Canonicalize complex.mul with 1 and 0

Summary

We can fold complex.mul when the right-hand operand is a constant 1 or 0.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

Lewuathe created this revision. Jun 22 2023, 8:43 PM

Herald added a project: Restricted Project. Jun 22 2023, 8:43 PM

Herald added subscribers: bviyer, Moerafaat, zero9178 and 21 others.

Lewuathe requested review of this revision. Jun 22 2023, 8:43 PM

Herald added a subscriber: stephenneuendorffer.

Harbormaster completed remote builds in B240688: Diff 533853. Jun 22 2023, 9:00 PM

Does complex allow for special fp values like NaN, +/- Inf, or -0? I remember this being the case for complex in C. If yes, is the '* 0' fold valid when the non-constant values are NaN/Inf/-0/etc.?
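The concern can be checked with ordinary doubles. The following is a minimal sketch, not the dialect's implementation: `Complex`, `mulNaive`, and `mulByZeroIsPlainZero` are hypothetical names, and the product uses the textbook formula rather than whatever lowering complex.mul actually receives.

```cpp
#include <cassert>
#include <cmath>
#include <limits>

// Hypothetical stand-in for a complex<f64> value.
struct Complex {
  double re;
  double im;
};

// Textbook product: (a+bi)(c+di) = (ac - bd) + (ad + bc)i.
Complex mulNaive(Complex x, Complex y) {
  return {x.re * y.re - x.im * y.im, x.re * y.im + x.im * y.re};
}

// True when x * (0+0i) yields exactly +0 in both parts, i.e. when
// replacing the product with the zero constant would change nothing.
bool mulByZeroIsPlainZero(Complex x) {
  Complex r = mulNaive(x, {0.0, 0.0});
  return r.re == 0.0 && r.im == 0.0 && !std::signbit(r.re) &&
         !std::signbit(r.im);
}
```

Under this formula a finite, non-negative input folds cleanly, while a NaN or infinite component produces NaN and a negative real part produces -0, so the unconditional `* 0` fold is not value-preserving for those inputs.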

mlir/lib/Dialect/Complex/IR/ComplexOps.cpp
256	Prefer early exits: https://llvm.org/docs/CodingStandards.html#use-early-exits-and-continue-to-simplify-code. Also in the code below.
257–259	nit: I would prefer to see the exact types spelled out here, it's not immediately obvious what each `.getValue();` returns.

As @kuhar mentioned, the result of multiplying by zero isn't necessarily zero unless the appropriate fastmath flags are set. (LLVM won't simplify multiplication of scalar values by zero unless the nnan and nsz flags are set.)

I would think the "right" thing to do here is add fastmath flags to the complex dialect operations and make the fold conditional.
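A conditional fold along those lines might look like the sketch below. Everything here is hypothetical: the complex dialect had no fastmath attribute at the time of this review, so `FastMathFlag`, `Complex`, and `canFoldMulByZero` are illustrative names, and the choice of required flags simply mirrors the nnan and nsz condition quoted for scalar LLVM multiplication.

```cpp
#include <cassert>
#include <cmath>
#include <cstdint>

// Hypothetical flag bits, named after LLVM's nnan and nsz fast-math flags.
enum FastMathFlag : std::uint8_t {
  kNone = 0,
  kNoNaNs = 1 << 0,
  kNoSignedZeros = 1 << 1,
};

// Hypothetical stand-in for a complex<f64> constant.
struct Complex {
  double re;
  double im;
};

bool isPositiveZero(double v) { return v == 0.0 && !std::signbit(v); }

// Returns true when `x * rhs` may be replaced by `rhs` for any x:
// rhs must be the (+0, +0) constant and both flags must be present.
bool canFoldMulByZero(Complex rhs, std::uint8_t flags) {
  const bool rhsIsZero = isPositiveZero(rhs.re) && isPositiveZero(rhs.im);
  const std::uint8_t required = kNoNaNs | kNoSignedZeros;
  return rhsIsZero && (flags & required) == required;
}
```

With no flags set the function always refuses the fold, which matches the default behavior being asked for.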

@kuhar @jfurtek Thank you for the comments. I found a related discussion here for the arithmetic dialect. (You are already leading that one, @jfurtek.)

https://discourse.llvm.org/t/rfc-fastmath-flags-support-in-mlir-arith-dialect/6049

I think we can add a flag similar to the one in the arith dialect, right?

IIRC the mul_one case should not be affected by the fast math flags. Can I split mul_zero out of this patch so that we can work on it after fast math flags are supported?

In D153606#4445249, @Lewuathe wrote:

@kuhar @jfurtek Thank you for the comments. I found a related discussion here for the arithmetic dialect. (You are already leading that one, @jfurtek.)

IIRC the mul_one case should not be affected by the fast math flags. Can I split mul_zero out of this patch so that we can work on it after fast math flags are supported?

Limiting the scope SGTM. We should make sure to add test cases with those special fp values to show that these folds handle them as intended, provided they won't be matched against directly in the fold.

Add test cases for multiple floating point precisions.

Harbormaster completed remote builds in B241095: Diff 534435. Jun 26 2023, 12:11 AM

kuhar added inline comments. Jun 26 2023, 10:37 AM

mlir/lib/Dialect/Complex/IR/ComplexOps.cpp
261–262	nit: we don't need `llvm::` with casts, I believe it started appearing in the code base because it was easier to migrate from member casts this way
264–265	Consider adding an early exit `if (real != APFloat(...)) return {};` to simplify the switch below

Post review follow-up

Lewuathe marked 4 inline comments as done. Jun 26 2023, 4:17 PM

Harbormaster completed remote builds in B241329: Diff 534779. Jun 26 2023, 4:42 PM

kuhar added inline comments. Jun 26 2023, 5:13 PM

mlir/lib/Dialect/Complex/IR/ComplexOps.cpp
257–259	The type of `arrayAttr` is not obvious to me based on the RHS. Could we spell it out instead of using auto?
264–265	Not done

Lewuathe added inline comments. Jun 26 2023, 7:15 PM

mlir/lib/Dialect/Complex/IR/ComplexOps.cpp
264–265	@kuhar Sorry, I might have missed what you meant. Which part should get the early exit in this case? To do an `if (real != APFloat(...))` style check, we need to check the size in bits first (a 64-bit real can never match `APFloat(1.0f)`, so the `real != APFloat(1.0f)` check only works in the 32-bit case).

kuhar added inline comments. Jun 26 2023, 11:26 PM

mlir/lib/Dialect/Complex/IR/ComplexOps.cpp
264–265	I was thinking of something like:
    if (!imag.isZero())
      return {};
    if (real != APFloat(1.0))
      return {};
    if (!llvm::is_contained({32, 64}, real.getSizeInBits(real.getSemantics())))
      return {};
    return getLhs();

kuhar added inline comments. Jun 26 2023, 11:31 PM

mlir/lib/Dialect/Complex/IR/ComplexOps.cpp
264–265	Gah, I'm sorry, I missed the `1.0` vs. `1.0f` distinction. In this case I think we can use the `m_OneFloat` matcher: https://mlir.llvm.org/doxygen/namespacemlir.html#af0495d84f34cf3238a7741fa6974a485

Lewuathe added inline comments. Jun 27 2023, 12:36 AM

mlir/lib/Dialect/Complex/IR/ComplexOps.cpp
264–265	Thank you. Let me try!

Lewuathe added inline comments. Jun 27 2023, 11:50 PM

mlir/lib/Dialect/Complex/IR/ComplexOps.cpp
264–265	Apparently, `m_OneFloat` (or any other matcher) does not work with an APFloat derived from the attribute. It only works with the value type. https://discord.com/channels/636084430946959380/642426447167881246/1123498353129426994

Early exit for non-zero imaginary part.

Lewuathe marked 3 inline comments as done and an inline comment as not done. Jun 28 2023, 12:03 AM

Lewuathe marked an inline comment as done.

Lewuathe added inline comments.

mlir/lib/Dialect/Complex/IR/ComplexOps.cpp
264–265	I changed to exit earlier in the case of non-zero imaginary part instead.

kuhar added inline comments. Jun 28 2023, 12:10 AM

mlir/lib/Dialect/Complex/IR/ComplexOps.cpp
264–265	In this case, should we use the same implementation as they do to support any bitwidth?
    inline detail::constant_float_predicate_matcher m_OneFloat() {
      return {[](const APFloat &value) {
        return APFloat(value.getSemantics(), 1) == value;
      }};
    }

Support any bitwidth (f16, f80 and f128)
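The idea behind the any-bitwidth check is to build the constant 1 in the same floating-point format as the value being tested, instead of hard-coding `1.0` or `1.0f`. The sketch below is only an analogy using built-in C++ types: `Complex` and `isComplexOne` are hypothetical names, and the template parameter plays the role that `value.getSemantics()` plays for APFloat at run time.

```cpp
#include <cassert>
#include <type_traits>

// Hypothetical stand-in for a complex constant with element type T.
template <typename T>
struct Complex {
  T re;
  T im;
};

// True when c is the multiplicative identity (1, 0). The constants are
// built as T(1) and T(0), so no per-width case analysis is needed.
template <typename T>
bool isComplexOne(Complex<T> c) {
  static_assert(std::is_floating_point<T>::value,
                "element type must be a floating-point type");
  return c.im == T(0) && c.re == T(1);
}
```

The same predicate then covers `float`, `double`, and `long double` without a switch over the size in bits.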

LGTM, thanks for the fixes!

This revision is now accepted and ready to land. Jun 28 2023, 12:19 AM

Lewuathe marked an inline comment as done. Jun 28 2023, 12:20 AM

Harbormaster completed remote builds in B241701: Diff 535258. Jun 28 2023, 12:27 AM

Closed by commit rG8d175b3577cb: [mlir][complex] Canonicalize complex.mul with 1 and 0 (authored by Lewuathe). Jun 28 2023, 12:29 AM

This revision was automatically updated to reflect the committed changes.

Lewuathe added a commit: rG8d175b3577cb: [mlir][complex] Canonicalize complex.mul with 1 and 0.

Diff 533853

mlir/include/mlir/Dialect/Complex/IR/ComplexOps.td

 def MulOp : ComplexArithmeticOp<"mul"> {
   let summary = "complex multiplication";
   let description = [{
     The `mul` operation takes two complex numbers and returns their product:

     ```mlir
     %a = complex.mul %b, %c : complex<f32>
     ```
   }];
+
+  let hasFolder = 1;
 }

 //===----------------------------------------------------------------------===//
 // NegOp
 //===----------------------------------------------------------------------===//

 def NegOp : ComplexUnaryOp<"neg", [SameOperandsAndResultType]> {
   let summary = "Negation operator";

mlir/lib/Dialect/Complex/IR/ComplexOps.cpp

 OpFoldResult ConjOp::fold(FoldAdaptor adaptor) {
   // complex.conj(complex.conj(a)) -> a
   if (auto conjOp = getOperand().getDefiningOp<ConjOp>())
     return conjOp.getOperand();

   return {};
 }

 //===----------------------------------------------------------------------===//
+// MulOp
+//===----------------------------------------------------------------------===//
+
+OpFoldResult MulOp::fold(FoldAdaptor adaptor) {
+  if (auto constant = getRhs().getDefiningOp<ConstantOp>()) {
+    auto arrayAttr = constant.getValue();
+    auto real = llvm::cast<FloatAttr>(arrayAttr[0]).getValue();
+    auto imag = llvm::cast<FloatAttr>(arrayAttr[1]).getValue();
+
+    if (imag.isZero()) {
+      // complex.mul(a, complex.constant<0.0, 0.0>)
+      // -> complex.constant<0.0, 0.0>
+      if (real.isZero())
+        return getRhs();
+
+      // complex.mul(a, complex.constant<1.0, 0.0>) -> a
+      switch (real.getSizeInBits(real.getSemantics())) {
+      case 64:
+        if (real == APFloat(1.0))
+          return getLhs();
+        return {};
+      case 32:
+        if (real == APFloat(1.0f))
+          return getLhs();
+        return {};
+      default:
+        return {};
+      }
+    }
+  }
+
+  return {};
+}
+
+//===----------------------------------------------------------------------===//
 // TableGen'd op method definitions
 //===----------------------------------------------------------------------===//

 #define GET_OP_CLASSES
 #include "mlir/Dialect/Complex/IR/ComplexOps.cpp.inc"

mlir/test/Dialect/Complex/canonicalize.mlir

 func.func @im_neg(%arg0: f32, %arg1: f32) -> f32 {
   %create = complex.create %arg0, %arg1: complex<f32>
   // CHECK: %[[NEG:.*]] = arith.negf %[[ARG1]]
   %neg = complex.neg %create : complex<f32>
   %im = complex.im %neg : complex<f32>
   // CHECK-NEXT: return %[[NEG]]
   return %im : f32
 }
+
+// CHECK-LABEL: func @mul_zero
+// CHECK-SAME: (%[[ARG0:.*]]: f32, %[[ARG1:.*]]: f32) -> complex<f32>
+func.func @mul_zero(%arg0: f32, %arg1: f32) -> complex<f32> {
+  %create = complex.create %arg0, %arg1: complex<f32>
+  %zero = complex.constant [0.0 : f32, 0.0 : f32] : complex<f32>
+  %mul = complex.mul %create, %zero : complex<f32>
+  // CHECK: %[[ZERO:.*]] = complex.constant [0.000000e+00 : f32, 0.000000e+00 : f32] : complex<f32>
+  // CHECK-NEXT: return %[[ZERO]]
+  return %mul : complex<f32>
+}
+
+// CHECK-LABEL: func @mul_one_f32
+// CHECK-SAME: (%[[ARG0:.*]]: f32, %[[ARG1:.*]]: f32) -> complex<f32>
+func.func @mul_one_f32(%arg0: f32, %arg1: f32) -> complex<f32> {
+  %create = complex.create %arg0, %arg1: complex<f32>
+  %one = complex.constant [1.0 : f32, 0.0 : f32] : complex<f32>
+  %mul = complex.mul %create, %one : complex<f32>
+  // CHECK: %[[CREATE:.*]] = complex.create %[[ARG0]], %[[ARG1]] : complex<f32>
+  // CHECK-NEXT: return %[[CREATE]]
+  return %mul : complex<f32>
+}
+
+// CHECK-LABEL: func @mul_one_f64
+// CHECK-SAME: (%[[ARG0:.*]]: f64, %[[ARG1:.*]]: f64) -> complex<f64>
+func.func @mul_one_f64(%arg0: f64, %arg1: f64) -> complex<f64> {
+  %create = complex.create %arg0, %arg1: complex<f64>
+  %one = complex.constant [1.0 : f64, 0.0 : f64] : complex<f64>
+  %mul = complex.mul %create, %one : complex<f64>
+  // CHECK: %[[CREATE:.*]] = complex.create %[[ARG0]], %[[ARG1]] : complex<f64>
+  // CHECK-NEXT: return %[[CREATE]]
+  return %mul : complex<f64>
+}

This is an archive of the discontinued LLVM Phabricator instance.

[mlir][complex] Canonicalize complex.mul with 1 and 0
ClosedPublic
