This is an archive of the discontinued LLVM Phabricator instance.

Quolyk retitled this revision from [InstCombine] Missed optimization in math expression: sin(x) / cos(x) => tan(x) to [WIP][InstCombine] Missed optimization in math expression: sin(x) / cos(x) => tan(x).

Quolyk added a reviewer: davide.

Quolyk mentioned this in D41381: [InstSimplify] Missed optimization in math expression: squashing exp(log), log(exp).Dec 19 2017, 2:21 AM

hfinkel added inline comments.Dec 19 2017, 8:10 PM

lib/Transforms/InstCombine/InstCombineMulDivRem.cpp
1474	Use `match` here?
1481	For the name here... Predicate the transform on TLI->has(LibFunc_tan) (or tanf, tanl, depending on the type). Use TLI->getName(LibFunc_tan) (or tanf, tanl, depending on the type).
1482	Needs BuilderTy::FastMathFlagGuard Guard(Builder); above this.

Quolyk updated this revision to Diff 127683.Dec 20 2017, 4:29 AM

Quolyk retitled this revision from [WIP][InstCombine] Missed optimization in math expression: sin(x) / cos(x) => tan(x) to [InstCombine] Missed optimization in math expression: sin(x) / cos(x) => tan(x).

@hfinkel I'd like to thank you for your help and patience reviewing my patches, as I'm new to the community. I hope it's ok when I include you as a reviewer.

Quolyk marked 3 inline comments as done.Dec 20 2017, 4:32 AM

In D41286#960694, @Quolyk wrote:

@hfinkel I'd like to thank you for your help and patience reviewing my patches, as I'm new to the community. I hope it's ok when I include you as a reviewer.

It's certainly fine to include me as a review. Welcome to the community!

Quolyk updated this revision to Diff 128599.Jan 4 2018, 12:52 AM

Quolyk edited the summary of this revision. (Show Details)

spatel added inline comments.Jan 5 2018, 10:43 AM

lib/Transforms/InstCombine/InstCombineMulDivRem.cpp
561–564	Can you move and reuse the very similar function that's currently in SimplifyLibCalls.cpp? static bool hasUnaryFloatFn(const TargetLibraryInfo TLI, Type Ty, LibFunc DoubleFn, LibFunc FloatFn, LibFunc LongDoubleFn) { See also: /// Emit a call to the unary function named 'Name' (e.g. 'floor'). This /// function is known to take a single of type matching 'Op' and returns one /// value with the same type. If 'Op' is a long double, 'l' is added as the /// suffix of name, if 'Op' is a float, we add a 'f' suffix. Value emitUnaryFloatFnCall(Value Op, StringRef Name, IRBuilder<> &B, const AttributeList &Attrs); ...in BuildLibCalls.h
1475–1478	Use m_Specific to simplify this: // sin(a) / cos(a) -> tan(a) Value *A; if (match(Op0, m_Intrinsic<Intrinsic::sin>(m_Value(A))) && match(Op1, m_Intrinsic<Intrinsic::cos>(m_Specific(A)))) {
test/Transforms/InstCombine/fdiv.ll
90–92 ↗	(On Diff #128599)	Please vary the fast-ness in these tests or add test(s) that show some variation. I'd prefer to see at least one test where we show the minimum case as we showed in one of the related patches - only the fdiv has relaxed math while the trig calls are strict.

Quolyk updated this revision to Diff 128867.Jan 7 2018, 12:49 AM

Quolyk marked 3 inline comments as done.

spatel added inline comments.Jan 8 2018, 8:05 AM

lib/Transforms/InstCombine/InstCombineMulDivRem.cpp
1472	Should we also handle: cos(a) / sin(a) -> 1 / tan(a) ?
test/Transforms/InstCombine/fdiv.ll
161–163 ↗	(On Diff #128867)	Do we need to declare tan functions for these tests?

@scanon should sign off this.

lib/Transforms/InstCombine/InstCombineMulDivRem.cpp
1472	Please wait for @scanon opinion before implementing every possible 10th grade trigonometrical identity.

This revision now requires changes to proceed.Jan 8 2018, 8:07 AM

For reference. this is what GCC generates (although it's unclear whether it's a good idea to follow them)
https://godbolt.org/g/YUUKGE

spatel added inline comments.Jan 8 2018, 10:57 AM

lib/Transforms/InstCombine/InstCombineMulDivRem.cpp
1472	Do you have some specific numerical concern here? As you've noted, this is a well-known math transform. We can make cos(a) / sin(a) a 'TODO' if you think we should use a different transform. https://stackoverflow.com/questions/3738384/stable-cotangent

davide added inline comments.Jan 8 2018, 11:02 AM

lib/Transforms/InstCombine/InstCombineMulDivRem.cpp
1472	The concern is that those transformation can overflow quite dramatically, even for `-ffast-math`. The other concern is that I'm not a numerical expert, so I'd love to have this signed off from somebody who knows better than me. The last concern is, again, we shouldn't pattern match every possible thing just because, it slows down the compiler without real benefit, so, do you know how this pattern is frequent?

spatel added inline comments.Jan 8 2018, 11:55 AM

lib/Transforms/InstCombine/InstCombineMulDivRem.cpp
1472	The compile-time concern is misguided. This pattern, like every other "10th grade trigonometrical identity", should be optimized by an optimizing compiler because that's the job of an optimizing compiler. These patterns can occur by way of templated code, inlining, or because the programmer may not be a computer performance expert. Think: scientists who are trying to model/simulate some math problem, but don't know much about perf...because again: that's the optimizing compiler's job. If this patch or pass is causing a compile-time problem for you, please point to or file a bug. Obstructing patches like this is doubly bad when you're undermining the efforts of new contributors.

davide added inline comments.Jan 8 2018, 12:28 PM

lib/Transforms/InstCombine/InstCombineMulDivRem.cpp
1472	I'm afraid this is not entirely correct. The job of an optimizing compiler is that of making tradeoff on what to optimize, based on cost. I'm not obstructing this patch, I'm asking for a second opinion. If you have a numerical explanation of why this patch can go in, I'll be happy to accept, otherwise I'll defer the review to @scanon.

spatel added inline comments.Jan 8 2018, 1:34 PM

lib/Transforms/InstCombine/InstCombineMulDivRem.cpp
1475–1476	We probably don't want to do this transform if both of the existing values have more than one use; we'd be trading a division for a libcall. If only one value has >1 use, it's probably still ok. Add more tests. :)

If you have a numerical explanation of why this patch can go in, I'll be happy to accept,

I'm not sure what you're looking for. Brute force sin/cos is equal to tan for every possible value?

#include <math.h>
#include <stdio.h>
#include <string.h>

int main() {
  unsigned int i;
  for (i = 0x00000000; i<0x7f800002; i++) {
    float f;
    memcpy(&f, &i, 4);
    float slow = sin(f) / cos(f);
    float fast = tan(f);
    if (slow != fast) {
      printf("\nsin(%f)/cos(%f) = %f, tan(%f) = %f\n", f, f, slow, f, fast);
    }
    if (i % (1024*1024*256) == 0) printf("0x%x...\n", i); 
  }

  return 0;
}

$  clang -O2 tan_checker.c ; ./a.out 
0x0...
0x10000000...
0x20000000...
0x30000000...
0x40000000...
0x50000000...
0x60000000...
0x70000000...

sin(inf)/cos(inf) = nan, tan(inf) = nan

sin(nan)/cos(nan) = nan, tan(nan) = nan

Note that I'm testing on macOS x86 with strict math (the sin and cos calls are replaced by __sincos_stret).
Double-check to make sure I haven't screwed anything up there, but this suggests we could do this transform without -ffast-math?

In D41286#970342, @spatel wrote:

If you have a numerical explanation of why this patch can go in, I'll be happy to accept,

I'm not sure what you're looking for. Brute force sin/cos is equal to tan for every possible value?

I don't think this is feasible (for 64-bit values). -ffast-math is a little fun and complicated, e.g.

log(pow(x, y)) -> y*log(x)

which seems perfectly fine on paper, for x = -1, y = 4

log(pow(-1, 4)) -> 0
4*log(-1) -> NaN.

(courtesy of Steven)

My point is that reasoning about seemingly innocuous algebraic simplification turns out to be harder than expected, and therefore we should make a conscious choice on whether implement these after careful numerical analysis.

In D41286#970343, @davide wrote:

log(pow(x, y)) -> y*log(x)

How is this relevant? Log is only defined for positive numbers, while sin/cos/tan are valid across all numerical inputs.

I don't think there's anything I more I can say here; sorry @Quolyk , I tried. If @hfinkel , @efriedma , @andrew.w.kaylor or anyone else would like to comment that would be great.

I don't feel qualified enough to say whether this can go in, somebody with fast-math experience should comment.

In D41286#970372, @spatel wrote:

In D41286#970343, @davide wrote:

log(pow(x, y)) -> y*log(x)

How is this relevant? Log is only defined for positive numbers, while sin/cos/tan are valid across all numerical inputs.

My point is that fast-math implications can be non-trivial.

davide removed a reviewer: davide.Jan 8 2018, 3:50 PM

Quolyk updated this revision to Diff 129042.Jan 9 2018, 1:04 AM

Quolyk edited the summary of this revision. (Show Details)

I changed the brute force float checker to test 1/tan(x), and it matches cos(x)/sin(x) in all cases on macOS 10.13. This is also true on Ubuntu 17.10 x86-64.

LGTM.

lib/Transforms/InstCombine/InstCombineMulDivRem.cpp
1492–1493	Could hoist this check to the first 'if' to reduce the duplication.

This revision is now accepted and ready to land.Jan 10 2018, 12:45 PM

Quolyk closed this revision.Jan 10 2018, 10:34 PM

@spatel @davide thanks for review and comments

For reference, @bkramer improved (thanks!) the attribute propagation:
rL322284
rL322285

spatel mentioned this in D41283: [InstCombine] Missed optimization in math expression: tan(a) * cos(a) == sin(a).Jan 12 2018, 10:09 AM

spatel mentioned this in rL325247: [InstCombine] allow sin/cos transforms with 'reassoc'.Feb 15 2018, 7:10 AM

Revision Contents

Path

Size

include/

llvm/

Transforms/

Utils/

BuildLibCalls.h

7 lines

lib/

Transforms/

InstCombine/

InstCombineMulDivRem.cpp

35 lines

Utils/

BuildLibCalls.cpp

13 lines

SimplifyLibCalls.cpp

15 lines

test/

Transforms/

InstCombine/

fdiv-cos-sin.ll

113 lines

fdiv-sin-cos.ll

108 lines

Diff 129042

include/llvm/Transforms/Utils/BuildLibCalls.h

	Show All 9 Lines
	// This file exposes an interface to build some C language libcalls for			// This file exposes an interface to build some C language libcalls for
	// optimization passes that need to call the various functions.			// optimization passes that need to call the various functions.
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	#ifndef LLVM_TRANSFORMS_UTILS_BUILDLIBCALLS_H			#ifndef LLVM_TRANSFORMS_UTILS_BUILDLIBCALLS_H
	#define LLVM_TRANSFORMS_UTILS_BUILDLIBCALLS_H			#define LLVM_TRANSFORMS_UTILS_BUILDLIBCALLS_H

				#include "llvm/Analysis/TargetLibraryInfo.h"
	#include "llvm/IR/IRBuilder.h"			#include "llvm/IR/IRBuilder.h"

	namespace llvm {			namespace llvm {
	class Value;			class Value;
	class DataLayout;			class DataLayout;
	class TargetLibraryInfo;			class TargetLibraryInfo;

	/// Analyze the name and prototype of the given function and set any			/// Analyze the name and prototype of the given function and set any
	/// applicable attributes.			/// applicable attributes.
	/// If the library function is unavailable, this doesn't modify it.			/// If the library function is unavailable, this doesn't modify it.
	///			///
	/// Returns true if any attributes were set and false otherwise.			/// Returns true if any attributes were set and false otherwise.
	bool inferLibFuncAttributes(Function &F, const TargetLibraryInfo &TLI);			bool inferLibFuncAttributes(Function &F, const TargetLibraryInfo &TLI);

				/// Check whether the overloaded unary floating point function
				/// corresponding to \a Ty is available.
				bool hasUnaryFloatFn(const TargetLibraryInfo TLI, Type Ty,
				LibFunc DoubleFn, LibFunc FloatFn,
				LibFunc LongDoubleFn);

	/// Return V if it is an i8, otherwise cast it to i8.			/// Return V if it is an i8, otherwise cast it to i8.
	Value castToCStr(Value V, IRBuilder<> &B);			Value castToCStr(Value V, IRBuilder<> &B);

	/// Emit a call to the strlen function to the builder, for the specified			/// Emit a call to the strlen function to the builder, for the specified
	/// pointer. Ptr is required to be some pointer type, and the return value has			/// pointer. Ptr is required to be some pointer type, and the return value has
	/// 'intptr_t' type.			/// 'intptr_t' type.
	Value emitStrLen(Value Ptr, IRBuilder<> &B, const DataLayout &DL,			Value emitStrLen(Value Ptr, IRBuilder<> &B, const DataLayout &DL,
	const TargetLibraryInfo *TLI);			const TargetLibraryInfo *TLI);
	▲ Show 20 Lines • Show All 79 Lines • Show Last 20 Lines

lib/Transforms/InstCombine/InstCombineMulDivRem.cpp

Show All 27 Lines
#include "llvm/IR/Operator.h"		#include "llvm/IR/Operator.h"
#include "llvm/IR/PatternMatch.h"		#include "llvm/IR/PatternMatch.h"
#include "llvm/IR/Type.h"		#include "llvm/IR/Type.h"
#include "llvm/IR/Value.h"		#include "llvm/IR/Value.h"
#include "llvm/Support/Casting.h"		#include "llvm/Support/Casting.h"
#include "llvm/Support/ErrorHandling.h"		#include "llvm/Support/ErrorHandling.h"
#include "llvm/Support/KnownBits.h"		#include "llvm/Support/KnownBits.h"
#include "llvm/Transforms/InstCombine/InstCombineWorklist.h"		#include "llvm/Transforms/InstCombine/InstCombineWorklist.h"
		#include "llvm/Transforms/Utils/BuildLibCalls.h"
#include <cassert>		#include <cassert>
#include <cstddef>		#include <cstddef>
#include <cstdint>		#include <cstdint>
#include <utility>		#include <utility>

using namespace llvm;		using namespace llvm;
using namespace PatternMatch;		using namespace PatternMatch;

▲ Show 20 Lines • Show All 508 Lines • ▼ Show 20 Lines	static bool isFMulOrFDivWithConstant(Value *V) {
if (C0 && C1)		if (C0 && C1)
return false;		return false;

return (C0 && isFiniteNonZeroFp(C0)) \|\| (C1 && isFiniteNonZeroFp(C1));		return (C0 && isFiniteNonZeroFp(C0)) \|\| (C1 && isFiniteNonZeroFp(C1));
}		}

/// foldFMulConst() is a helper routine of InstCombiner::visitFMul().		/// foldFMulConst() is a helper routine of InstCombiner::visitFMul().
/// The input \p FMulOrDiv is a FMul/FDiv with one and only one operand		/// The input \p FMulOrDiv is a FMul/FDiv with one and only one operand
/// being a constant (i.e. isFMulOrFDivWithConstant(FMulOrDiv) == true).		/// being a constant (i.e. isFMulOrFDivWithConstant(FMulOrDiv) == true).
/// This function is to simplify "FMulOrDiv * C" and returns the		/// This function is to simplify "FMulOrDiv * C" and returns the
/// resulting expression. Note that this function could return NULL in		/// resulting expression. Note that this function could return NULL in
/// case the constants cannot be folded into a normal floating-point.		/// case the constants cannot be folded into a normal floating-point.
		spatelUnsubmitted Done Reply Inline Actions Can you move and reuse the very similar function that's currently in SimplifyLibCalls.cpp? static bool hasUnaryFloatFn(const TargetLibraryInfo TLI, Type Ty, LibFunc DoubleFn, LibFunc FloatFn, LibFunc LongDoubleFn) { See also: /// Emit a call to the unary function named 'Name' (e.g. 'floor'). This /// function is known to take a single of type matching 'Op' and returns one /// value with the same type. If 'Op' is a long double, 'l' is added as the /// suffix of name, if 'Op' is a float, we add a 'f' suffix. Value emitUnaryFloatFnCall(Value Op, StringRef Name, IRBuilder<> &B, const AttributeList &Attrs); ...in BuildLibCalls.h spatel: Can you move and reuse the very similar function that's currently in SimplifyLibCalls.cpp?
Value InstCombiner::foldFMulConst(Instruction FMulOrDiv, Constant *C,		Value InstCombiner::foldFMulConst(Instruction FMulOrDiv, Constant *C,
Instruction *InsertBefore) {		Instruction *InsertBefore) {
assert(isFMulOrFDivWithConstant(FMulOrDiv) && "V is invalid");		assert(isFMulOrFDivWithConstant(FMulOrDiv) && "V is invalid");

Value *Opnd0 = FMulOrDiv->getOperand(0);		Value *Opnd0 = FMulOrDiv->getOperand(0);
Value *Opnd1 = FMulOrDiv->getOperand(1);		Value *Opnd1 = FMulOrDiv->getOperand(1);

Constant *C0 = dyn_cast<Constant>(Opnd0);		Constant *C0 = dyn_cast<Constant>(Opnd0);
▲ Show 20 Lines • Show All 891 Lines • ▼ Show 20 Lines	if (AllowReassociate) {
if (NewInst) {		if (NewInst) {
if (Instruction *T = dyn_cast<Instruction>(NewInst))		if (Instruction *T = dyn_cast<Instruction>(NewInst))
T->setDebugLoc(I.getDebugLoc());		T->setDebugLoc(I.getDebugLoc());
SimpR->setFastMathFlags(I.getFastMathFlags());		SimpR->setFastMathFlags(I.getFastMathFlags());
return SimpR;		return SimpR;
}		}
}		}

		if (AllowReassociate &&
		spatelUnsubmitted Not Done Reply Inline Actions Should we also handle: cos(a) / sin(a) -> 1 / tan(a) ? spatel: Should we also handle: cos(a) / sin(a) -> 1 / tan(a) ?
		davideUnsubmitted Not Done Reply Inline Actions Please wait for @scanon opinion before implementing every possible 10th grade trigonometrical identity. davide: Please wait for @scanon opinion before implementing every possible 10th grade trigonometrical…
		spatelUnsubmitted Not Done Reply Inline Actions Do you have some specific numerical concern here? As you've noted, this is a well-known math transform. We can make cos(a) / sin(a) a 'TODO' if you think we should use a different transform. https://stackoverflow.com/questions/3738384/stable-cotangent spatel: Do you have some specific numerical concern here? As you've noted, this is a well-known math…
		davideUnsubmitted Not Done Reply Inline Actions The concern is that those transformation can overflow quite dramatically, even for `-ffast-math`. The other concern is that I'm not a numerical expert, so I'd love to have this signed off from somebody who knows better than me. The last concern is, again, we shouldn't pattern match every possible thing just because, it slows down the compiler without real benefit, so, do you know how this pattern is frequent? davide: The concern is that those transformation can overflow quite dramatically, even for `-ffast…
		spatelUnsubmitted Not Done Reply Inline Actions The compile-time concern is misguided. This pattern, like every other "10th grade trigonometrical identity", should be optimized by an optimizing compiler because that's the job of an optimizing compiler. These patterns can occur by way of templated code, inlining, or because the programmer may not be a computer performance expert. Think: scientists who are trying to model/simulate some math problem, but don't know much about perf...because again: that's the optimizing compiler's job. If this patch or pass is causing a compile-time problem for you, please point to or file a bug. Obstructing patches like this is doubly bad when you're undermining the efforts of new contributors. spatel: The compile-time concern is misguided. This pattern, like every other "10th grade…
		davideUnsubmitted Not Done Reply Inline Actions I'm afraid this is not entirely correct. The job of an optimizing compiler is that of making tradeoff on what to optimize, based on cost. I'm not obstructing this patch, I'm asking for a second opinion. If you have a numerical explanation of why this patch can go in, I'll be happy to accept, otherwise I'll defer the review to @scanon. davide: I'm afraid this is not entirely correct. The job of an optimizing compiler is that of making…
		Op0->hasOneUse() && Op1->hasOneUse()) {
		Value *A;
		hfinkelUnsubmitted Done Reply Inline Actions Use `match` here? hfinkel: Use `match` here?
		// sin(a) / cos(a) -> tan(a)
		if (match(Op0, m_Intrinsic<Intrinsic::sin>(m_Value(A))) &&
		spatelUnsubmitted Not Done Reply Inline Actions We probably don't want to do this transform if both of the existing values have more than one use; we'd be trading a division for a libcall. If only one value has >1 use, it's probably still ok. Add more tests. :) spatel: We probably don't want to do this transform if both of the existing values have more than one…
		match(Op1, m_Intrinsic<Intrinsic::cos>(m_Specific(A)))) {
		if (hasUnaryFloatFn(&TLI, I.getType(), LibFunc_tan,
		spatelUnsubmitted Done Reply Inline Actions Use m_Specific to simplify this: // sin(a) / cos(a) -> tan(a) Value A; if (match(Op0, m_Intrinsic<Intrinsic::sin>(m_Value(A))) && match(Op1, m_Intrinsic<Intrinsic::cos>(m_Specific(A)))) { spatel:* Use m_Specific to simplify this: // sin(a) / cos(a) -> tan(a) Value *A; if (match(Op0…
		LibFunc_tanf, LibFunc_tanl)) {
		IRBuilder<> B(&I);
		IRBuilder<>::FastMathFlagGuard Guard(B);
		hfinkelUnsubmitted Done Reply Inline Actions For the name here... Predicate the transform on TLI->has(LibFunc_tan) (or tanf, tanl, depending on the type). Use TLI->getName(LibFunc_tan) (or tanf, tanl, depending on the type). hfinkel: For the name here... 1. Predicate the transform on TLI->has(LibFunc_tan) (or tanf, tanl…
		B.setFastMathFlags(I.getFastMathFlags());
		hfinkelUnsubmitted Done Reply Inline Actions Needs BuilderTy::FastMathFlagGuard Guard(Builder); above this. hfinkel: Needs BuilderTy::FastMathFlagGuard Guard(Builder); above this.
		Value *Tan = emitUnaryFloatFnCall(A, TLI.getName(LibFunc_tan),
		B, I.getFunction()->getAttributes());
		return replaceInstUsesWith(I, Tan);
		}
		}

		// cos(a) / sin(a) -> 1/tan(a)
		if (match(Op0, m_Intrinsic<Intrinsic::cos>(m_Value(A))) &&
		match(Op1, m_Intrinsic<Intrinsic::sin>(m_Specific(A)))) {
		if (hasUnaryFloatFn(&TLI, I.getType(), LibFunc_tan,
		LibFunc_tanf, LibFunc_tanl)) {
		spatelUnsubmitted Not Done Reply Inline Actions Could hoist this check to the first 'if' to reduce the duplication. spatel: Could hoist this check to the first 'if' to reduce the duplication.
		IRBuilder<> B(&I);
		IRBuilder<>::FastMathFlagGuard Guard(B);
		B.setFastMathFlags(I.getFastMathFlags());
		Value *Tan = emitUnaryFloatFnCall(A, TLI.getName(LibFunc_tan),
		B, I.getFunction()->getAttributes());
		Value *One = ConstantFP::get(Tan->getType(), 1.0);
		Value *Div = B.CreateFDiv(One, Tan);
		return replaceInstUsesWith(I, Div);
		}
		}
		}

Value *LHS;		Value *LHS;
Value *RHS;		Value *RHS;

// -x / -y -> x / y		// -x / -y -> x / y
if (match(Op0, m_FNeg(m_Value(LHS))) && match(Op1, m_FNeg(m_Value(RHS)))) {		if (match(Op0, m_FNeg(m_Value(LHS))) && match(Op1, m_FNeg(m_Value(RHS)))) {
I.setOperand(0, LHS);		I.setOperand(0, LHS);
I.setOperand(1, RHS);		I.setOperand(1, RHS);
return &I;		return &I;
▲ Show 20 Lines • Show All 174 Lines • Show Last 20 Lines

lib/Transforms/Utils/BuildLibCalls.cpp

Show First 20 Lines • Show All 703 Lines • ▼ Show 20 Lines	bool llvm::inferLibFuncAttributes(Function &F, const TargetLibraryInfo &TLI) {

default:		default:
// FIXME: It'd be really nice to cover all the library functions we're		// FIXME: It'd be really nice to cover all the library functions we're
// aware of here.		// aware of here.
return false;		return false;
}		}
}		}

		bool llvm::hasUnaryFloatFn(const TargetLibraryInfo TLI, Type Ty,
		LibFunc DoubleFn, LibFunc FloatFn,
		LibFunc LongDoubleFn) {
		switch (Ty->getTypeID()) {
		case Type::FloatTyID:
		return TLI->has(FloatFn);
		case Type::DoubleTyID:
		return TLI->has(DoubleFn);
		default:
		return TLI->has(LongDoubleFn);
		}
		}

//- Emit LibCalls ------------------------------------------------------------//		//- Emit LibCalls ------------------------------------------------------------//

Value llvm::castToCStr(Value V, IRBuilder<> &B) {		Value llvm::castToCStr(Value V, IRBuilder<> &B) {
unsigned AS = V->getType()->getPointerAddressSpace();		unsigned AS = V->getType()->getPointerAddressSpace();
return B.CreateBitCast(V, B.getInt8PtrTy(AS), "cstr");		return B.CreateBitCast(V, B.getInt8PtrTy(AS), "cstr");
}		}

Value llvm::emitStrLen(Value Ptr, IRBuilder<> &B, const DataLayout &DL,		Value llvm::emitStrLen(Value Ptr, IRBuilder<> &B, const DataLayout &DL,
▲ Show 20 Lines • Show All 296 Lines • Show Last 20 Lines

lib/Transforms/Utils/SimplifyLibCalls.cpp

	Show First 20 Lines • Show All 98 Lines • ▼ Show 20 Lines
	}			}

	static bool callHasFloatingPointArgument(const CallInst *CI) {			static bool callHasFloatingPointArgument(const CallInst *CI) {
	return any_of(CI->operands(), [](const Use &OI) {			return any_of(CI->operands(), [](const Use &OI) {
	return OI->getType()->isFloatingPointTy();			return OI->getType()->isFloatingPointTy();
	});			});
	}			}

	/// \brief Check whether the overloaded unary floating point function
	/// corresponding to \a Ty is available.
	static bool hasUnaryFloatFn(const TargetLibraryInfo TLI, Type Ty,
	LibFunc DoubleFn, LibFunc FloatFn,
	LibFunc LongDoubleFn) {
	switch (Ty->getTypeID()) {
	case Type::FloatTyID:
	return TLI->has(FloatFn);
	case Type::DoubleTyID:
	return TLI->has(DoubleFn);
	default:
	return TLI->has(LongDoubleFn);
	}
	}

	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//
	// String and Memory Library Call Optimizations			// String and Memory Library Call Optimizations
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	Value LibCallSimplifier::optimizeStrCat(CallInst CI, IRBuilder<> &B) {			Value LibCallSimplifier::optimizeStrCat(CallInst CI, IRBuilder<> &B) {
	// Extract some information from the instruction			// Extract some information from the instruction
	Value *Dst = CI->getArgOperand(0);			Value *Dst = CI->getArgOperand(0);
	Value *Src = CI->getArgOperand(1);			Value *Src = CI->getArgOperand(1);
	▲ Show 20 Lines • Show All 2,402 Lines • Show Last 20 Lines

test/Transforms/InstCombine/fdiv-cos-sin.ll

This file was added.

				; NOTE: Assertions have been autogenerated by utils/update_test_checks.py
				; RUN: opt -S -instcombine < %s \| FileCheck %s

				define double @fdiv_cos_sin(double %a) {
				; CHECK-LABEL: @fdiv_cos_sin(
				; CHECK-NEXT: [[TMP1:%.]] = call double @llvm.cos.f64(double [[A:%.]])
				; CHECK-NEXT: [[TMP2:%.*]] = call double @llvm.sin.f64(double [[A]])
				; CHECK-NEXT: [[DIV:%.*]] = fdiv double [[TMP1]], [[TMP2]]
				; CHECK-NEXT: ret double [[DIV]]
				;
				%1 = call double @llvm.cos.f64(double %a)
				%2 = call double @llvm.sin.f64(double %a)
				%div = fdiv double %1, %2
				ret double %div
				}

				define double @fdiv_strict_cos_strict_sin_fast(double %a) {
				; CHECK-LABEL: @fdiv_strict_cos_strict_sin_fast(
				; CHECK-NEXT: [[TMP1:%.]] = call double @llvm.cos.f64(double [[A:%.]])
				; CHECK-NEXT: [[TMP2:%.*]] = call fast double @llvm.sin.f64(double [[A]])
				; CHECK-NEXT: [[DIV:%.*]] = fdiv double [[TMP1]], [[TMP2]]
				; CHECK-NEXT: ret double [[DIV]]
				;
				%1 = call double @llvm.cos.f64(double %a)
				%2 = call fast double @llvm.sin.f64(double %a)
				%div = fdiv double %1, %2
				ret double %div
				}

				define double @fdiv_fast_cos_strict_sin_strict(double %a) {
				; CHECK-LABEL: @fdiv_fast_cos_strict_sin_strict(
				; CHECK-NEXT: [[TAN:%.]] = call fast double @tan(double [[A:%.]])
				; CHECK-NEXT: [[TMP1:%.*]] = fdiv fast double 1.000000e+00, [[TAN]]
				; CHECK-NEXT: ret double [[TMP1]]
				;
				%1 = call double @llvm.cos.f64(double %a)
				%2 = call double @llvm.sin.f64(double %a)
				%div = fdiv fast double %1, %2
				ret double %div
				}

				define double @fdiv_fast_cos_fast_sin_strict(double %a) {
				; CHECK-LABEL: @fdiv_fast_cos_fast_sin_strict(
				; CHECK-NEXT: [[TAN:%.]] = call fast double @tan(double [[A:%.]])
				; CHECK-NEXT: [[TMP1:%.*]] = fdiv fast double 1.000000e+00, [[TAN]]
				; CHECK-NEXT: ret double [[TMP1]]
				;
				%1 = call fast double @llvm.cos.f64(double %a)
				%2 = call double @llvm.sin.f64(double %a)
				%div = fdiv fast double %1, %2
				ret double %div
				}

				define double @fdiv_cos_sin_fast_multiple_uses(double %a) {
				; CHECK-LABEL: @fdiv_cos_sin_fast_multiple_uses(
				; CHECK-NEXT: [[TMP1:%.]] = call fast double @llvm.cos.f64(double [[A:%.]])
				; CHECK-NEXT: [[TMP2:%.*]] = call fast double @llvm.sin.f64(double [[A]])
				; CHECK-NEXT: [[DIV:%.*]] = fdiv fast double [[TMP1]], [[TMP2]]
				; CHECK-NEXT: call void @use(double [[TMP2]])
				; CHECK-NEXT: ret double [[DIV]]
				;
				%1 = call fast double @llvm.cos.f64(double %a)
				%2 = call fast double @llvm.sin.f64(double %a)
				%div = fdiv fast double %1, %2
				call void @use(double %2)
				ret double %div
				}

				define double @fdiv_cos_sin_fast(double %a) {
				; CHECK-LABEL: @fdiv_cos_sin_fast(
				; CHECK-NEXT: [[TAN:%.]] = call fast double @tan(double [[A:%.]])
				; CHECK-NEXT: [[TMP1:%.*]] = fdiv fast double 1.000000e+00, [[TAN]]
				; CHECK-NEXT: ret double [[TMP1]]
				;
				%1 = call fast double @llvm.cos.f64(double %a)
				%2 = call fast double @llvm.sin.f64(double %a)
				%div = fdiv fast double %1, %2
				ret double %div
				}

				define float @fdiv_cosf_sinf_fast(float %a) {
				; CHECK-LABEL: @fdiv_cosf_sinf_fast(
				; CHECK-NEXT: [[TANF:%.]] = call fast float @tanf(float [[A:%.]])
				; CHECK-NEXT: [[TMP1:%.*]] = fdiv fast float 1.000000e+00, [[TANF]]
				; CHECK-NEXT: ret float [[TMP1]]
				;
				%1 = call fast float @llvm.cos.f32(float %a)
				%2 = call fast float @llvm.sin.f32(float %a)
				%div = fdiv fast float %1, %2
				ret float %div
				}

				define fp128 @fdiv_cosfp128_sinfp128_fast(fp128 %a) {
				; CHECK-LABEL: @fdiv_cosfp128_sinfp128_fast(
				; CHECK-NEXT: [[TANL:%.]] = call fast fp128 @tanl(fp128 [[A:%.]])
				; CHECK-NEXT: [[TMP1:%.*]] = fdiv fast fp128 0xL00000000000000003FFF000000000000, [[TANL]]
				; CHECK-NEXT: ret fp128 [[TMP1]]
				;
				%1 = call fast fp128 @llvm.cos.fp128(fp128 %a)
				%2 = call fast fp128 @llvm.sin.fp128(fp128 %a)
				%div = fdiv fast fp128 %1, %2
				ret fp128 %div
				}

				declare double @llvm.cos.f64(double)
				declare float @llvm.cos.f32(float)
				declare fp128 @llvm.cos.fp128(fp128)

				declare double @llvm.sin.f64(double)
				declare float @llvm.sin.f32(float)
				declare fp128 @llvm.sin.fp128(fp128)

				declare void @use(double)

test/Transforms/InstCombine/fdiv-sin-cos.ll

This file was added.

				; NOTE: Assertions have been autogenerated by utils/update_test_checks.py
				; RUN: opt -S -instcombine < %s \| FileCheck %s

				define double @fdiv_sin_cos(double %a) {
				; CHECK-LABEL: @fdiv_sin_cos(
				; CHECK-NEXT: [[TMP1:%.]] = call double @llvm.sin.f64(double [[A:%.]])
				; CHECK-NEXT: [[TMP2:%.*]] = call double @llvm.cos.f64(double [[A]])
				; CHECK-NEXT: [[DIV:%.*]] = fdiv double [[TMP1]], [[TMP2]]
				; CHECK-NEXT: ret double [[DIV]]
				;
				%1 = call double @llvm.sin.f64(double %a)
				%2 = call double @llvm.cos.f64(double %a)
				%div = fdiv double %1, %2
				ret double %div
				}

				define double @fdiv_strict_sin_strict_cos_fast(double %a) {
				; CHECK-LABEL: @fdiv_strict_sin_strict_cos_fast(
				; CHECK-NEXT: [[TMP1:%.]] = call double @llvm.sin.f64(double [[A:%.]])
				; CHECK-NEXT: [[TMP2:%.*]] = call fast double @llvm.cos.f64(double [[A]])
				; CHECK-NEXT: [[DIV:%.*]] = fdiv double [[TMP1]], [[TMP2]]
				; CHECK-NEXT: ret double [[DIV]]
				;
				%1 = call double @llvm.sin.f64(double %a)
				%2 = call fast double @llvm.cos.f64(double %a)
				%div = fdiv double %1, %2
				ret double %div
				}

				define double @fdiv_fast_sin_strict_cos_strict(double %a) {
				; CHECK-LABEL: @fdiv_fast_sin_strict_cos_strict(
				; CHECK-NEXT: [[TAN:%.]] = call fast double @tan(double [[A:%.]])
				; CHECK-NEXT: ret double [[TAN]]
				;
				%1 = call double @llvm.sin.f64(double %a)
				%2 = call double @llvm.cos.f64(double %a)
				%div = fdiv fast double %1, %2
				ret double %div
				}

				define double @fdiv_fast_sin_fast_cos_strict(double %a) {
				; CHECK-LABEL: @fdiv_fast_sin_fast_cos_strict(
				; CHECK-NEXT: [[TAN:%.]] = call fast double @tan(double [[A:%.]])
				; CHECK-NEXT: ret double [[TAN]]
				;
				%1 = call fast double @llvm.sin.f64(double %a)
				%2 = call double @llvm.cos.f64(double %a)
				%div = fdiv fast double %1, %2
				ret double %div
				}

				define double @fdiv_sin_cos_fast_multiple_uses(double %a) {
				; CHECK-LABEL: @fdiv_sin_cos_fast_multiple_uses(
				; CHECK-NEXT: [[TMP1:%.]] = call fast double @llvm.sin.f64(double [[A:%.]])
				; CHECK-NEXT: [[TMP2:%.*]] = call fast double @llvm.cos.f64(double [[A]])
				; CHECK-NEXT: [[DIV:%.*]] = fdiv fast double [[TMP1]], [[TMP2]]
				; CHECK-NEXT: call void @use(double [[TMP2]])
				; CHECK-NEXT: ret double [[DIV]]
				;
				%1 = call fast double @llvm.sin.f64(double %a)
				%2 = call fast double @llvm.cos.f64(double %a)
				%div = fdiv fast double %1, %2
				call void @use(double %2)
				ret double %div
				}

				define double @fdiv_sin_cos_fast(double %a) {
				; CHECK-LABEL: @fdiv_sin_cos_fast(
				; CHECK-NEXT: [[TMP1:%.]] = call fast double @tan(double [[A:%.]])
				; CHECK-NEXT: ret double [[TMP1]]
				;
				%1 = call fast double @llvm.sin.f64(double %a)
				%2 = call fast double @llvm.cos.f64(double %a)
				%div = fdiv fast double %1, %2
				ret double %div
				}

				define float @fdiv_sinf_cosf_fast(float %a) {
				; CHECK-LABEL: @fdiv_sinf_cosf_fast(
				; CHECK-NEXT: [[TMP1:%.]] = call fast float @tanf(float [[A:%.]])
				; CHECK-NEXT: ret float [[TMP1]]
				;
				%1 = call fast float @llvm.sin.f32(float %a)
				%2 = call fast float @llvm.cos.f32(float %a)
				%div = fdiv fast float %1, %2
				ret float %div
				}

				define fp128 @fdiv_sinfp128_cosfp128_fast(fp128 %a) {
				; CHECK-LABEL: @fdiv_sinfp128_cosfp128_fast(
				; CHECK-NEXT: [[TMP0:%.]] = call fast fp128 @tanl(fp128 [[A:%.]])
				; CHECK-NEXT: ret fp128 [[TMP0]]
				;
				%1 = call fast fp128 @llvm.sin.fp128(fp128 %a)
				%2 = call fast fp128 @llvm.cos.fp128(fp128 %a)
				%div = fdiv fast fp128 %1, %2
				ret fp128 %div
				}

				declare double @llvm.sin.f64(double)
				declare float @llvm.sin.f32(float)
				declare fp128 @llvm.sin.fp128(fp128)

				declare double @llvm.cos.f64(double)
				declare float @llvm.cos.f32(float)
				declare fp128 @llvm.cos.fp128(fp128)

				declare void @use(double)

This is an archive of the discontinued LLVM Phabricator instance.

[InstCombine] Missed optimization in math expression: sin(x) / cos(x) => tan(x)ClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 129042

include/llvm/Transforms/Utils/BuildLibCalls.h

lib/Transforms/InstCombine/InstCombineMulDivRem.cpp

lib/Transforms/Utils/BuildLibCalls.cpp

lib/Transforms/Utils/SimplifyLibCalls.cpp

test/Transforms/InstCombine/fdiv-cos-sin.ll

test/Transforms/InstCombine/fdiv-sin-cos.ll

[InstCombine] Missed optimization in math expression: sin(x) / cos(x) => tan(x)
ClosedPublic