This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
lib/Transforms/InstCombine/
-
Transforms/
-
InstCombine/
3/6
InstCombineMulDivRem.cpp
-
test/Transforms/InstCombine/
-
Transforms/
-
InstCombine/
1
fmul-pow.ll

Differential D41608

[InstCombine] Missed optimization in math expression: pow multiplications
AbandonedPublic

Authored by spatel on Dec 27 2017, 11:30 PM.

Download Raw Diff

Details

Reviewers

hfinkel
efriedma
Quolyk
lebedev.ri

Summary

This patch enables folding following expressions under -ffast-math flag:
pow(a, b) * a -> pow(a, b+1)
(1/a) * pow(a, b) -> pow(a, b-1)
pow(a, b) * pow(c, b) -> pow(a*c, b)
pow(a, b) * pow(a, c) -> pow(a, b+c)
Motivation: https://bugs.llvm.org/show_bug.cgi?id=35595.

Diff Detail

Repository

rL LLVM

Build Status

Buildable 27662
Build 27661: arc lint + arc unit

Event Timeline

Quolyk created this revision.Dec 27 2017, 11:30 PM

I'm slightly worried about all this bunch of missing instcombines added, as InstCombine is already really really slow.

That said, this one is probably one we really want. I skimmed the code quickly and I think it's correct, but please wait for somebody else to take a look

Why is this patch WIP? The mentioned case of 'pow(a, x) * a * a * a * a -> pow(a, x+4)' is handled already by this patch.

lib/Transforms/InstCombine/InstCombineMulDivRem.cpp
377–380	You mentioned that you see warnings without explicitly setting these to nullptr in Xcode. I use Xcode as an IDE too, but I don't see any warnings like that when I remove the nullptrs. If this is really a problem, then you must be getting hundreds of these warnings for existing code that does not initialize things like this? I'd prefer not to bloat the code unnecessarily.
385	Need to handle commuted versions too (please add a test): define double @pow_ab_x_a_fast_commute(double %a, double %b) { %c = fdiv double 1.0, %a ; defeat complexity-based canonicalization of operands %p = call fast double @llvm.pow.f64(double %a, double %b) %mul = fmul fast double %c, %p ret double %mul }
386–387	Here and below: use m_Specific instead of the trailing check for equality?
395–396	Please use variable names that match the formulas in the code comments for better readability.

pow(a, x) * a * a * a * a emits to

define double @pow_ab_x_aaaa_fast(double %a, double %x) {
  %1 = call fast double @llvm.pow.f64(double %a, double %x)
  %2 = fmul fast double %a, %a
  %3 = fmul fast double %2, %2
  %mul4 = fmul fast double %3, %1
  ret double %mul4
}

I don't see obvious ways to fold these instructions. I Would appreciate if somebody could help me with this.

Quolyk marked 4 inline comments as done.Jan 7 2018, 9:54 AM

Quolyk retitled this revision from [WIP][InstCombine] Missed optimization in math expression: aggressive optimization with pow to [InstCombine] Missed optimization in math expression: aggressive optimization with pow.Jan 14 2018, 11:58 PM

Quolyk added a reviewer: efriedma.

In D41608#969442, @Quolyk wrote:
pow(a, x) * a * a * a * a emits to
define double @pow_ab_x_aaaa_fast(double %a, double %x) {
  %1 = call fast double @llvm.pow.f64(double %a, double %x)
  %2 = fmul fast double %a, %a
  %3 = fmul fast double %2, %2
  %mul4 = fmul fast double %3, %1
  ret double %mul4
}
I don't see obvious ways to fold these instructions. I Would appreciate if somebody could help me with this.

This looks like it went through -reassociate first? I was checking with a straight IR translation, so we're just multiplying by 'a' over and over. This would require more complex logic, so that's a different patch (and I'm not sure where it would belong).

lib/Transforms/InstCombine/InstCombineMulDivRem.cpp
385	This comment was marked 'Done', but I don't see code to account for this or the test that I suggested.

Quolyk marked an inline comment as not done.Jan 17 2018, 11:01 AM

Quolyk added inline comments.

lib/Transforms/InstCombine/InstCombineMulDivRem.cpp
385	My bad, I thought fdiv_pow_ab_a test would be enough.

Quolyk mentioned this in rL353037: Commit tests for changes in revision D41608.Feb 4 2019, 2:32 AM

Quolyk mentioned this in rG3643cbbf9ca1: Commit tests for changes in revision D41608.

Update tests. Apply patch only for pow multiplications. Pow divisions will be considered in different patch.

Herald added a project: Restricted Project. · View Herald TranscriptFeb 4 2019, 3:33 AM

Harbormaster completed remote builds in B27662: Diff 185021.Feb 4 2019, 3:33 AM

Quolyk retitled this revision from [InstCombine] Missed optimization in math expression: aggressive optimization with pow to [InstCombine] Missed optimization in math expression: pow multiplications.Feb 4 2019, 3:34 AM

Quolyk edited the summary of this revision. (Show Details)

Quolyk added a reviewer: lebedev.ri.

Quolyk removed rL LLVM as the repository for this revision.

This patch is trying to do too many things at once. Please split only the 1st transform (pow(X, Y) * X -> pow(X, Y+1)) into its own patch, and let's continue looking at that alone as a 1st step. You should consider (and include tests for) these variations:

Commuted operands for fmul.
Extra uses of the pow result.
Vector types.
Additional FMF.

We don't necessarily need tests for every permutation of those, but there needs to be more coverage than what we see here currently.

test/Transforms/InstCombine/fmul-pow.ll

28–29

This is a misleading test name. This isn't the commuted version of the previous test - the fmul does not have a common operand with the pow().

I think you want something like this:

declare double @call_f64(double)
define double @pow_ab_a_reassoc(double %p, double %b) {
  %a = call double @call_f64(double %p)  ; thwart complexity-based canonicalization
  %pow = call double @llvm.pow.f64(double %a, double %b)
  %mul = fmul reassoc double %a, %pow
  ret double %mul
}

davide removed a reviewer: davide.Feb 4 2019, 7:16 AM

lebedev.ri requested changes to this revision.Jun 21 2019, 10:44 AM

This revision now requires changes to proceed.Jun 21 2019, 10:44 AM

This review seems to be stuck/dead, consider abandoning if no longer relevant.

This revision now requires review to proceed.Jan 12 2023, 4:42 PM

Herald added a project: Restricted Project. · View Herald TranscriptJan 12 2023, 4:42 PM

Herald added a subscriber: StephenFan. · View Herald Transcript

The folds shown here have all been added now, so this patch can be abandoned:
7736c1936a93d7
914576c1f0b669
61af2ab68142729
072b03c4714ea4

spatel commandeered this revision.Jan 14 2023, 5:13 AM

spatel edited reviewers, added: Quolyk; removed: spatel.

Herald added a subscriber: mcrosier. · View Herald TranscriptJan 14 2023, 5:13 AM

spatel abandoned this revision.Jan 14 2023, 5:13 AM

Revision Contents

Path

Size

lib/

Transforms/

InstCombine/

InstCombineMulDivRem.cpp

34 lines

test/

Transforms/

InstCombine/

fmul-pow.ll

27 lines

Diff 185021

lib/Transforms/InstCombine/InstCombineMulDivRem.cpp

Show First 20 Lines • Show All 368 Lines • ▼ Show 20 Lines	if (match(Op0, m_FNeg(m_Value(X))) && match(Op1, m_FNeg(m_Value(Y))))
return BinaryOperator::CreateFMulFMF(X, Y, &I);		return BinaryOperator::CreateFMulFMF(X, Y, &I);

// -X * C --> X * -C		// -X * C --> X * -C
Constant *C;		Constant *C;
if (match(Op0, m_FNeg(m_Value(X))) && match(Op1, m_Constant(C)))		if (match(Op0, m_FNeg(m_Value(X))) && match(Op1, m_Constant(C)))
return BinaryOperator::CreateFMulFMF(X, ConstantExpr::getFNeg(C), &I);		return BinaryOperator::CreateFMulFMF(X, ConstantExpr::getFNeg(C), &I);

// Sink negation: -X * Y --> -(X * Y)		// Sink negation: -X * Y --> -(X * Y)
if (match(Op0, m_OneUse(m_FNeg(m_Value(X)))))		if (match(Op0, m_OneUse(m_FNeg(m_Value(X)))))
return BinaryOperator::CreateFNegFMF(Builder.CreateFMulFMF(X, Op1, &I), &I);		return BinaryOperator::CreateFNegFMF(Builder.CreateFMulFMF(X, Op1, &I), &I);

// Sink negation: Y * -X --> -(X * Y)		// Sink negation: Y * -X --> -(X * Y)
		spatelAuthorUnsubmitted Done Reply Inline Actions You mentioned that you see warnings without explicitly setting these to nullptr in Xcode. I use Xcode as an IDE too, but I don't see any warnings like that when I remove the nullptrs. If this is really a problem, then you must be getting hundreds of these warnings for existing code that does not initialize things like this? I'd prefer not to bloat the code unnecessarily. spatel: You mentioned that you see warnings without explicitly setting these to nullptr in Xcode. I use…
if (match(Op1, m_OneUse(m_FNeg(m_Value(X)))))		if (match(Op1, m_OneUse(m_FNeg(m_Value(X)))))
return BinaryOperator::CreateFNegFMF(Builder.CreateFMulFMF(X, Op0, &I), &I);		return BinaryOperator::CreateFNegFMF(Builder.CreateFMulFMF(X, Op0, &I), &I);

// fabs(X) * fabs(X) -> X * X		// fabs(X) * fabs(X) -> X * X
if (Op0 == Op1 && match(Op0, m_Intrinsic<Intrinsic::fabs>(m_Value(X))))		if (Op0 == Op1 && match(Op0, m_Intrinsic<Intrinsic::fabs>(m_Value(X))))
		spatelAuthorUnsubmitted Not Done Reply Inline Actions Need to handle commuted versions too (please add a test): define double @pow_ab_x_a_fast_commute(double %a, double %b) { %c = fdiv double 1.0, %a ; defeat complexity-based canonicalization of operands %p = call fast double @llvm.pow.f64(double %a, double %b) %mul = fmul fast double %c, %p ret double %mul } spatel: Need to handle commuted versions too (please add a test): ``` define double…
		spatelAuthorUnsubmitted Not Done Reply Inline Actions This comment was marked 'Done', but I don't see code to account for this or the test that I suggested. spatel: This comment was marked 'Done', but I don't see code to account for this or the test that I…
		QuolykUnsubmitted Not Done Reply Inline Actions My bad, I thought fdiv_pow_ab_a test would be enough. Quolyk: My bad, I thought fdiv_pow_ab_a test would be enough.
return BinaryOperator::CreateFMulFMF(X, X, &I);		return BinaryOperator::CreateFMulFMF(X, X, &I);

		spatelAuthorUnsubmitted Done Reply Inline Actions Here and below: use m_Specific instead of the trailing check for equality? spatel: Here and below: use m_Specific instead of the trailing check for equality?
// (select A, B, C) * (select A, D, E) --> select A, (BD), (CE)		// (select A, B, C) * (select A, D, E) --> select A, (BD), (CE)
if (Value *V = SimplifySelectsFeedingBinaryOp(I, Op0, Op1))		if (Value *V = SimplifySelectsFeedingBinaryOp(I, Op0, Op1))
return replaceInstUsesWith(I, V);		return replaceInstUsesWith(I, V);

if (I.hasAllowReassoc()) {		if (I.hasAllowReassoc()) {
// Reassociate constant RHS with another constant to form constant		// Reassociate constant RHS with another constant to form constant
// expression.		// expression.
if (match(Op1, m_Constant(C)) && C->isFiniteNonZeroFP()) {		if (match(Op1, m_Constant(C)) && C->isFiniteNonZeroFP()) {
Constant *C1;		Constant *C1;
		spatelAuthorUnsubmitted Done Reply Inline Actions Please use variable names that match the formulas in the code comments for better readability. spatel: Please use variable names that match the formulas in the code comments for better readability.
if (match(Op0, m_OneUse(m_FDiv(m_Constant(C1), m_Value(X))))) {		if (match(Op0, m_OneUse(m_FDiv(m_Constant(C1), m_Value(X))))) {
// (C1 / X) * C --> (C * C1) / X		// (C1 / X) * C --> (C * C1) / X
Constant *CC1 = ConstantExpr::getFMul(C, C1);		Constant *CC1 = ConstantExpr::getFMul(C, C1);
if (CC1->isNormalFP())		if (CC1->isNormalFP())
return BinaryOperator::CreateFDivFMF(CC1, X, &I);		return BinaryOperator::CreateFDivFMF(CC1, X, &I);
}		}
if (match(Op0, m_FDiv(m_Value(X), m_Constant(C1)))) {		if (match(Op0, m_FDiv(m_Value(X), m_Constant(C1)))) {
// (X / C1) * C --> X * (C / C1)		// (X / C1) * C --> X * (C / C1)
▲ Show 20 Lines • Show All 51 Lines • ▼ Show 20 Lines	if (I.hasAllowReassoc()) {
if (match(Op0, m_Intrinsic<Intrinsic::exp2>(m_Value(X))) &&		if (match(Op0, m_Intrinsic<Intrinsic::exp2>(m_Value(X))) &&
match(Op1, m_Intrinsic<Intrinsic::exp2>(m_Value(Y))) &&		match(Op1, m_Intrinsic<Intrinsic::exp2>(m_Value(Y))) &&
(Op0->hasOneUse() \|\| Op1->hasOneUse())) {		(Op0->hasOneUse() \|\| Op1->hasOneUse())) {
Value *XY = Builder.CreateFAddFMF(X, Y, &I);		Value *XY = Builder.CreateFAddFMF(X, Y, &I);
Value *Exp2 = Builder.CreateUnaryIntrinsic(Intrinsic::exp2, XY, &I);		Value *Exp2 = Builder.CreateUnaryIntrinsic(Intrinsic::exp2, XY, &I);
return replaceInstUsesWith(I, Exp2);		return replaceInstUsesWith(I, Exp2);
}		}

		// pow(X, Y) * X -> pow(X, Y+1)
		if (match(Op0, m_Intrinsic<Intrinsic::pow>(m_Specific(Op1), m_Value(Y)))) {
		Value *One = ConstantFP::get(Op0->getType(), 1.0);
		Value *YIncr = Builder.CreateFAddFMF(Y, One, &I);
		Value *Pow = Builder.CreateBinaryIntrinsic(Intrinsic::pow, Op1, YIncr, &I);
		return replaceInstUsesWith(I, Pow);
		}

		// (1/X) * pow(X, Y) -> pow(X, Y-1)
		if (match(Op0, m_OneUse(m_FDiv(m_SpecificFP(1.0), m_Value(X)))) &&
		match(Op1, m_Intrinsic<Intrinsic::pow>(m_Specific(X), m_Value(Y)))) {
		Value *One = ConstantFP::get(Op0->getType(), 1.0);
		Value *YDecr = Builder.CreateFSubFMF(Y, One, &I);
		Value *Pow = Builder.CreateBinaryIntrinsic(Intrinsic::pow, X, YDecr, &I);
		return replaceInstUsesWith(I, Pow);
		}

		Value *Z;
		// pow(X, Y) * pow(Z, Y) -> pow(X*Z, Y)
		if (match(Op0, m_Intrinsic<Intrinsic::pow>(m_Value(X), m_Value(Y))) &&
		match(Op1, m_Intrinsic<Intrinsic::pow>(m_Value(Z), m_Specific(Y)))) {
		Value *XZ = Builder.CreateFMulFMF(X, Z, &I);
		Value *Pow = Builder.CreateBinaryIntrinsic(Intrinsic::pow, XZ, Y, &I);
		return replaceInstUsesWith(I, Pow);
		}

		// pow(X, Y) * pow(X, Z) -> pow(X, Y+Z)
		if (match(Op0, m_Intrinsic<Intrinsic::pow>(m_Value(X), m_Value(Y))) &&
		match(Op1, m_Intrinsic<Intrinsic::pow>(m_Specific(X), m_Value(Z)))) {
		Value *YZ = Builder.CreateFAddFMF(Y, Z, &I);
		Value *Pow = Builder.CreateBinaryIntrinsic(Intrinsic::pow, X, YZ, &I);
		return replaceInstUsesWith(I, Pow);
		}

// (XY) X => (XX) Y where Y != X		// (XY) X => (XX) Y where Y != X
// The purpose is two-fold:		// The purpose is two-fold:
// 1) to form a power expression (of X).		// 1) to form a power expression (of X).
// 2) potentially shorten the critical path: After transformation, the		// 2) potentially shorten the critical path: After transformation, the
// latency of the instruction Y is amortized by the expression of X*X,		// latency of the instruction Y is amortized by the expression of X*X,
// and therefore Y is in a "less critical" position compared to what it		// and therefore Y is in a "less critical" position compared to what it
// was before the transformation.		// was before the transformation.
if (match(Op0, m_OneUse(m_c_FMul(m_Specific(Op1), m_Value(Y)))) &&		if (match(Op0, m_OneUse(m_c_FMul(m_Specific(Op1), m_Value(Y)))) &&
▲ Show 20 Lines • Show All 917 Lines • Show Last 20 Lines

test/Transforms/InstCombine/fmul-pow.ll

	Show All 10 Lines
	;			;
	%1 = call double @llvm.pow.f64(double %a, double %b)			%1 = call double @llvm.pow.f64(double %a, double %b)
	%mul = fmul double %1, %a			%mul = fmul double %1, %a
	ret double %mul			ret double %mul
	}			}

	define double @pow_ab_a_reassoc(double %a, double %b) {			define double @pow_ab_a_reassoc(double %a, double %b) {
	; CHECK-LABEL: @pow_ab_a_reassoc(			; CHECK-LABEL: @pow_ab_a_reassoc(
	; CHECK-NEXT: [[TMP1:%.]] = call double @llvm.pow.f64(double [[A:%.]], double [[B:%.*]])			; CHECK-NEXT: [[TMP1:%.]] = fadd reassoc double [[B:%.]], 1.000000e+00
	; CHECK-NEXT: [[MUL:%.*]] = fmul reassoc double [[TMP1]], [[A]]			; CHECK-NEXT: [[TMP2:%.]] = call reassoc double @llvm.pow.f64(double [[A:%.]], double [[TMP1]])
	; CHECK-NEXT: ret double [[MUL]]			; CHECK-NEXT: ret double [[TMP2]]
	;			;
	%1 = call double @llvm.pow.f64(double %a, double %b)			%1 = call double @llvm.pow.f64(double %a, double %b)
	%mul = fmul reassoc double %1, %a			%mul = fmul reassoc double %1, %a
	ret double %mul			ret double %mul
	}			}

	define double @pow_ab_a_reassoc_commute(double %a, double %b) {			define double @pow_ab_a_reassoc_commute(double %a, double %b) {
	; CHECK-LABEL: @pow_ab_a_reassoc_commute(			; CHECK-LABEL: @pow_ab_a_reassoc_commute(
				spatelAuthorUnsubmitted Not Done Reply Inline Actions This is a misleading test name. This isn't the commuted version of the previous test - the fmul does not have a common operand with the pow(). I think you want something like this: declare double @call_f64(double) define double @pow_ab_a_reassoc(double %p, double %b) { %a = call double @call_f64(double %p) ; thwart complexity-based canonicalization %pow = call double @llvm.pow.f64(double %a, double %b) %mul = fmul reassoc double %a, %pow ret double %mul } spatel: This is a misleading test name. This isn't the commuted version of the previous test - the fmul…
	; CHECK-NEXT: [[TMP1:%.]] = fdiv double 1.000000e+00, [[A:%.]]			; CHECK-NEXT: [[TMP1:%.]] = fadd reassoc double [[B:%.]], -1.000000e+00
	; CHECK-NEXT: [[TMP2:%.]] = call double @llvm.pow.f64(double [[A]], double [[B:%.]])			; CHECK-NEXT: [[TMP2:%.]] = call reassoc double @llvm.pow.f64(double [[A:%.]], double [[TMP1]])
	; CHECK-NEXT: [[MUL:%.*]] = fmul reassoc double [[TMP1]], [[TMP2]]			; CHECK-NEXT: ret double [[TMP2]]
	; CHECK-NEXT: ret double [[MUL]]
	;			;
	%1 = fdiv double 1.0, %a			%1 = fdiv double 1.0, %a
	%2 = call double @llvm.pow.f64(double %a, double %b)			%2 = call double @llvm.pow.f64(double %a, double %b)
	%mul = fmul reassoc double %1, %2			%mul = fmul reassoc double %1, %2
	ret double %mul			ret double %mul
	}			}

	define double @pow_ab_pow_cb(double %a, double %b, double %c) {			define double @pow_ab_pow_cb(double %a, double %b, double %c) {
	; CHECK-LABEL: @pow_ab_pow_cb(			; CHECK-LABEL: @pow_ab_pow_cb(
	; CHECK-NEXT: [[TMP1:%.]] = call double @llvm.pow.f64(double [[A:%.]], double [[B:%.*]])			; CHECK-NEXT: [[TMP1:%.]] = call double @llvm.pow.f64(double [[A:%.]], double [[B:%.*]])
	; CHECK-NEXT: [[TMP2:%.]] = call double @llvm.pow.f64(double [[C:%.]], double [[B]])			; CHECK-NEXT: [[TMP2:%.]] = call double @llvm.pow.f64(double [[C:%.]], double [[B]])
	; CHECK-NEXT: [[MUL:%.*]] = fmul double [[TMP2]], [[TMP1]]			; CHECK-NEXT: [[MUL:%.*]] = fmul double [[TMP2]], [[TMP1]]
	; CHECK-NEXT: ret double [[MUL]]			; CHECK-NEXT: ret double [[MUL]]
	;			;
	%1 = call double @llvm.pow.f64(double %a, double %b)			%1 = call double @llvm.pow.f64(double %a, double %b)
	%2 = call double @llvm.pow.f64(double %c, double %b)			%2 = call double @llvm.pow.f64(double %c, double %b)
	%mul = fmul double %2, %1			%mul = fmul double %2, %1
	ret double %mul			ret double %mul
	}			}

	define double @pow_ab_pow_cb_reassoc(double %a, double %b, double %c) {			define double @pow_ab_pow_cb_reassoc(double %a, double %b, double %c) {
	; CHECK-LABEL: @pow_ab_pow_cb_reassoc(			; CHECK-LABEL: @pow_ab_pow_cb_reassoc(
	; CHECK-NEXT: [[TMP1:%.]] = call double @llvm.pow.f64(double [[A:%.]], double [[B:%.*]])			; CHECK-NEXT: [[TMP1:%.]] = fmul reassoc double [[C:%.]], [[A:%.*]]
	; CHECK-NEXT: [[TMP2:%.]] = call double @llvm.pow.f64(double [[C:%.]], double [[B]])			; CHECK-NEXT: [[TMP2:%.]] = call reassoc double @llvm.pow.f64(double [[TMP1]], double [[B:%.]])
	; CHECK-NEXT: [[MUL:%.*]] = fmul reassoc double [[TMP2]], [[TMP1]]			; CHECK-NEXT: ret double [[TMP2]]
	; CHECK-NEXT: ret double [[MUL]]
	;			;
	%1 = call double @llvm.pow.f64(double %a, double %b)			%1 = call double @llvm.pow.f64(double %a, double %b)
	%2 = call double @llvm.pow.f64(double %c, double %b)			%2 = call double @llvm.pow.f64(double %c, double %b)
	%mul = fmul reassoc double %2, %1			%mul = fmul reassoc double %2, %1
	ret double %mul			ret double %mul
	}			}

	define double @pow_ab_pow_ac(double %a, double %b, double %c) {			define double @pow_ab_pow_ac(double %a, double %b, double %c) {
	; CHECK-LABEL: @pow_ab_pow_ac(			; CHECK-LABEL: @pow_ab_pow_ac(
	; CHECK-NEXT: [[TMP1:%.]] = call double @llvm.pow.f64(double [[A:%.]], double [[B:%.*]])			; CHECK-NEXT: [[TMP1:%.]] = call double @llvm.pow.f64(double [[A:%.]], double [[B:%.*]])
	; CHECK-NEXT: [[TMP2:%.]] = call double @llvm.pow.f64(double [[A]], double [[C:%.]])			; CHECK-NEXT: [[TMP2:%.]] = call double @llvm.pow.f64(double [[A]], double [[C:%.]])
	; CHECK-NEXT: [[MUL:%.*]] = fmul double [[TMP2]], [[TMP1]]			; CHECK-NEXT: [[MUL:%.*]] = fmul double [[TMP2]], [[TMP1]]
	; CHECK-NEXT: ret double [[MUL]]			; CHECK-NEXT: ret double [[MUL]]
	;			;
	%1 = call double @llvm.pow.f64(double %a, double %b)			%1 = call double @llvm.pow.f64(double %a, double %b)
	%2 = call double @llvm.pow.f64(double %a, double %c)			%2 = call double @llvm.pow.f64(double %a, double %c)
	%mul = fmul double %2, %1			%mul = fmul double %2, %1
	ret double %mul			ret double %mul
	}			}

	define double @pow_ab_x_pow_ac_reassoc(double %a, double %b, double %c) {			define double @pow_ab_x_pow_ac_reassoc(double %a, double %b, double %c) {
	; CHECK-LABEL: @pow_ab_x_pow_ac_reassoc(			; CHECK-LABEL: @pow_ab_x_pow_ac_reassoc(
	; CHECK-NEXT: [[TMP1:%.]] = call double @llvm.pow.f64(double [[A:%.]], double [[B:%.*]])			; CHECK-NEXT: [[TMP1:%.]] = fadd reassoc double [[C:%.]], [[B:%.*]]
	; CHECK-NEXT: [[TMP2:%.]] = call double @llvm.pow.f64(double [[A]], double [[C:%.]])			; CHECK-NEXT: [[TMP2:%.]] = call reassoc double @llvm.pow.f64(double [[A:%.]], double [[TMP1]])
	; CHECK-NEXT: [[MUL:%.*]] = fmul reassoc double [[TMP2]], [[TMP1]]			; CHECK-NEXT: ret double [[TMP2]]
	; CHECK-NEXT: ret double [[MUL]]
	;			;
	%1 = call double @llvm.pow.f64(double %a, double %b)			%1 = call double @llvm.pow.f64(double %a, double %b)
	%2 = call double @llvm.pow.f64(double %a, double %c)			%2 = call double @llvm.pow.f64(double %a, double %c)
	%mul = fmul reassoc double %2, %1			%mul = fmul reassoc double %2, %1
	ret double %mul			ret double %mul
	}			}

This is an archive of the discontinued LLVM Phabricator instance.

[InstCombine] Missed optimization in math expression: pow multiplicationsAbandonedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 185021

lib/Transforms/InstCombine/InstCombineMulDivRem.cpp

test/Transforms/InstCombine/fmul-pow.ll

[InstCombine] Missed optimization in math expression: pow multiplications
AbandonedPublic