This is an archive of the discontinued LLVM Phabricator instance.

Differential D18513

Simplify isfinite/isnan/isinf in finite-math-only mode
Abandoned · Public

Authored by hfinkel on Mar 28 2016, 2:53 AM.

Details

Reviewers

spatel
mclow.lists
chandlerc
scanon
joerg
beanz
lhames

Summary

This patch adds simplification for isfinite, isnan and isinf when we know that we don't have NaNs or Infs (based on the corresponding function attributes). Doing this requires a small infrastructure improvement to TargetLibraryInfo, as I'll explain below.
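As an illustration only (this is not the patch's code), the folding rule described above can be sketched as a small decision function. The names `FPClassQuery`, `CallerFPAssumptions`, and `tryFoldClassification` are hypothetical; the two flags stand in for the caller's "no-nans-fp-math" and "no-infs-fp-math" function attributes.

```cpp
// Hypothetical stand-ins for the three classification queries; these are
// illustrative names, not LLVM's LibFunc enumerators.
enum class FPClassQuery { IsFinite, IsNaN, IsInf };

// Models the "no-nans-fp-math" / "no-infs-fp-math" attributes of the caller.
struct CallerFPAssumptions {
  bool NoNaNs;
  bool NoInfs;
};

// Returns true and sets Result to the constant the call folds to, or returns
// false when the caller's assumptions do not decide the answer.
bool tryFoldClassification(FPClassQuery Q, CallerFPAssumptions A, int &Result) {
  switch (Q) {
  case FPClassQuery::IsFinite:
    // "Finite" means neither NaN nor Inf, so both assumptions are required.
    if (!A.NoNaNs || !A.NoInfs)
      return false;
    Result = 1;
    return true;
  case FPClassQuery::IsNaN:
    if (!A.NoNaNs)
      return false;
    Result = 0;
    return true;
  case FPClassQuery::IsInf:
    if (!A.NoInfs)
      return false;
    Result = 0;
    return true;
  }
  return false;
}
```

Note that isfinite needs both assumptions, whereas isnan and isinf each need only one.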

C/POSIX specify that math.h provides a set of macros, of which these are a subset:

int isfinite(x);
int isnan(x);
int isinf(x);

where x is some floating-point type (float, double, long double). When we're compiling with -ffast-math (or specifically with -ffinite-math-only), it is profitable to statically simplify calls to these functions. For a motivating use case, consider compiling this code using libc++:

#include <complex>
using namespace std;

complex<float> bar(complex<float> C);
complex<float> foo(complex<float> C) {
  return bar(C)*C;
}

and you'll quickly see that, even at -O3 -ffast-math, we produce a mess of code including calls to __isnanf and __isinff. Why those functions? This comes down to how glibc implements these macros:

# define isnan(x) \
     (sizeof (x) == sizeof (float)                \
      ? __isnanf (x)                              \
      : sizeof (x) == sizeof (double)             \
      ? __isnan (x) : __isnanl (x))
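To make the dispatch concrete, here is a minimal sketch of the same sizeof-based selection. The three `classify*` functions and the `CLASSIFY_BY_SIZE` macro are hypothetical stand-ins for the per-type library entry points; each returns a tag so the selected callee is observable.

```cpp
// Hypothetical stand-ins for the per-type library entry points. Each returns
// a tag identifying which one was selected.
inline char classifyFloat(float) { return 'f'; }
inline char classifyDouble(double) { return 'd'; }
inline char classifyLongDouble(long double) { return 'l'; }

// Same shape as the header macros: the size of the argument, not its type,
// selects the callee. Only the selected branch is evaluated, but all three
// calls must type-check for every argument.
#define CLASSIFY_BY_SIZE(x)                                            \
  (sizeof(x) == sizeof(float)    ? classifyFloat((float)(x))           \
   : sizeof(x) == sizeof(double) ? classifyDouble((double)(x))         \
                                 : classifyLongDouble((long double)(x)))
```

Because the selection happens in the preprocessor-expanded expression, the compiler only ever sees a call to the suffixed function, which is why the optimizer has to recognize those names rather than `isnan` itself. On targets where long double has the same size as double, a long double argument would take the double path.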

Other systems use similar macro expansions, with some variation in the names of the underlying functions. OSX here has a split system (or at least used to). When using finite-math-only mode, you get function calls:

#define isfinite(x)                                               \
    ( sizeof(x) == sizeof(float)  ? __isfinitef((float)(x))       \
    : sizeof(x) == sizeof(double) ? __isfinited((double)(x))      \
                                  : __isfinitel((long double)(x)))

but when in IEEE-conforming mode, you get faster inline implementations:

#define isfinite(x)                                                      \
    ( sizeof(x) == sizeof(float)  ? __inline_isfinitef((float)(x))       \
    : sizeof(x) == sizeof(double) ? __inline_isfinited((double)(x))      \
                                  : __inline_isfinitel((long double)(x)))

where the headers define things like:

__header_always_inline int __inline_isfinitef(float __x) {
    return __x == __x && __builtin_fabsf(__x) != __builtin_inff();
}

__header_always_inline int __inline_isinff(float __x) {
    return __builtin_fabsf(__x) == __builtin_inff();
}

__header_always_inline int __inline_isnanf(float __x) {
    return __x != __x;
}

so some effort has been made to preserve the full functioning of these calls even when otherwise compiling in finite-math-only mode. This optimization would purposely break that feature (in favor of lowering abstraction penalties). If we do this and a user wishes to check his or her inputs for NaNs, Infs, etc., the user must do so in a translation unit where such values are permitted to exist. To be fair, gcc's manual does not define finite-math-only mode in this way, but rather:

Allow optimizations for floating-point arithmetic that assume that arguments and results are not NaNs or +-Infs.

perhaps implying that it is fine to check numbers for NaN/Inf that you did not compute via some arithmetic operation. We could certainly do it this way (i.e., based on an operand's fast-math flags, although the implementation is not completely trivial because we need to look through PHIs, not just at direct function arguments), but that has the obvious problem of users being surprised by the effects of function inlining. Also, we have -fno-builtin-foo, although it would need some enhancements to work easily in this case because of the macros.

As another data point, FreeBSD seems to always use an inline version of isnan, but has function calls for isinf/isfinite.

Some alternatives (not all mutually exclusive with this one):

1. As mentioned above, base the folding decision on the fast-math flags of the inputs (looking through phis) instead of the caller's function attributes
2. In non-finite-math-only mode, replace the calls with a direct implementation (i.e. have the compiler do on all platforms what the OSX math.h header does)
3. Always replace the calls with inline versions, but mark the instructions somehow so that they are not removed by later optimizations
4. Enhance libc++ to contain some __FINITE_MATH_ONLY__ ifdefs

Regarding the infrastructure enhancement, we currently check for known library calls by name like this in SimplifyLibCalls:

if (TLI->getLibFunc(FuncName, Func) && TLI->has(Func)) {

but currently this does not work if a library function's name is not the default name, but rather one substituted with TLI.setAvailableWithName. This is because we simply never search these custom names when looking for known functions by name (we know only to generate the custom name if we already have its LibFunc identifier). To make this work (necessary for this case because systems disagree on finite vs. isfinite vs. __isfinited, etc.) I've added an additional StringMap to TargetLibraryInfo used to look up LibFunc identifiers based on custom names. As it turns out, this also requires adding a copy constructor to StringMap (D18506).
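The lookup problem can be modeled outside LLVM with standard containers. The following is a toy sketch, not the patch itself: the class `LibFuncNames` is hypothetical and only loosely mirrors TargetLibraryInfoImpl, with `std::unordered_map` standing in for DenseMap and StringMap.

```cpp
#include <string>
#include <unordered_map>
#include <utility>
#include <vector>

// Toy model of name <-> identifier lookup with per-target custom names.
class LibFuncNames {
  std::vector<std::string> StandardNames;                    // index == function id
  std::unordered_map<unsigned, std::string> CustomNames;     // id -> custom name
  std::unordered_map<std::string, unsigned> CustomNameFuncs; // custom name -> id

public:
  explicit LibFuncNames(std::vector<std::string> Std)
      : StandardNames(std::move(Std)) {}

  // Registers a target-specific name for function F.
  void setAvailableWithName(unsigned F, const std::string &Name) {
    // Drop any earlier custom name so the reverse map cannot go stale.
    // (This cleanup is an addition of this sketch.)
    auto Old = CustomNames.find(F);
    if (Old != CustomNames.end()) {
      CustomNameFuncs.erase(Old->second);
      CustomNames.erase(Old);
    }
    if (StandardNames[F] == Name)
      return;
    CustomNames[F] = Name;
    CustomNameFuncs[Name] = F; // the reverse mapping that makes lookup work
  }

  // Finds the identifier for Name, trying standard names first and then
  // the custom names registered for this target.
  bool getLibFunc(const std::string &Name, unsigned &F) const {
    for (unsigned I = 0; I < StandardNames.size(); ++I)
      if (StandardNames[I] == Name) {
        F = I;
        return true;
      }
    auto It = CustomNameFuncs.find(Name);
    if (It == CustomNameFuncs.end())
      return false;
    F = It->second;
    return true;
  }
};
```

Without the `CustomNameFuncs` map, a query for a name such as "__finite" would fail even after it had been registered, which is the gap described above.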

Diff Detail

Event Timeline

hfinkel updated this revision to Diff 51760. Mar 28 2016, 2:53 AM

hfinkel retitled this revision to Simplify isfinite/isnan/isinf in finite-math-only mode.

hfinkel updated this object.

hfinkel added reviewers: scanon, chandlerc, spatel, beanz, mclow.lists, joerg, lhames.

hfinkel added a subscriber: llvm-commits.

Herald added subscribers: mcrosier, emaste. Mar 28 2016, 2:53 AM

Updated library function names for Darwin (and NetBSD).

hfinkel added a parent revision: D18506: Add a copy constructor to StringMap. Mar 28 2016, 3:34 AM

hfinkel mentioned this in D18506: Add a copy constructor to StringMap. Mar 28 2016, 4:04 AM

So, here are my thoughts on this. I'm curious what you, Steven, and others
think though....

My initial feeling is that -ffinite-math-only should apply to *math* and
not to *tests*. That is, we should be free to transform and optimize math
assuming finite operands, but we can't make any assumptions about the
results of testing for finite values. This means that the implementation of
complex gets to leverage finite-math-only, but we can't nuke tests for
infinities. I also think that we should aggressively optimize how the tests
are done while preserving their functionality. So I guess I'm suggesting
#2, #3, and #4 from your email as the path forward.

However, I can see an argument that forcing users to leverage
__FINITE_MATH_ONLY__ in their library code is really annoying. So I would
also be happy having two different mechanisms for testing for infinities
(and NaNs, I'm just using the inf case as my example) -- one which is
folded under finite-math-only, and one which survives. I'm not sure what to
call the two interfaces though. Ideas?

In either case, I think we should definitely replace calls to functions
with fast, inline, and ensured correct (according to whatever rules are
appropriate in the particular case) implementations of these tests.

Also in either case, I think we should change the frontend and/or headers
to add *call* attributes (on the actual call instruction) marking these
operations as finite-math-only rather than relying on *caller* attributes
or having to chase operands through phi nodes. This should essentially
follow the same model we use for tagging floating point operations that can
be optimized.

-Chandler

spatel mentioned this in D18648: make __builtin_isfinite more efficient (PR27145). Mar 31 2016, 8:28 AM

Marking this code itself as awaiting update based on direction comments.

This revision now requires changes to proceed. Apr 7 2016, 12:19 AM

spatel mentioned this in rL265675: make __builtin_isfinite more efficient (PR27145). Apr 7 2016, 7:34 AM

I haven't thought about the behavioral question raised by -ffinite-math-only, but I wanted to know what the code to perform these ops might look like with default settings.

For reference, these bugs are byproducts of that investigation (although not all will be directly applicable to isfinite/isnan/isinf codegen):
https://llvm.org/bugs/show_bug.cgi?id=27105
https://llvm.org/bugs/show_bug.cgi?id=27145
https://llvm.org/bugs/show_bug.cgi?id=27164
https://llvm.org/bugs/show_bug.cgi?id=27202
https://llvm.org/bugs/show_bug.cgi?id=27203

Breaking isnan, isinf, etc. is a non-starter. I know it's appealing from a consistent formal model viewpoint, but in practice it breaks a lot of code (this is why we have the call fallbacks for iOS/OSX).

I would be delighted to have a finite-math-safe builtin to avoid the calls, but #2 and #4 both seem reasonable to me. #4 should probably be done regardless of the resolution of this conversation.

I agree with Steve here. Even code that is built with -ffast-math has to live in a reality where NaNs, Infs, etc. are sometimes generated by external function calls, provided as user input, etc. It's critical that this code be able to filter out these invalid values, precisely because the body of the code will be optimized on the assumption that those values did not need to be considered.
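One way such filtering could be written so that it does not depend on floating-point comparisons is to test the bit pattern with integer operations. This is a sketch under stated assumptions, not something proposed in this thread: it assumes float is IEEE-754 binary32, and the helper names are hypothetical. Whether a given compiler leaves it intact under -ffinite-math-only is not guaranteed here.

```cpp
#include <cstdint>
#include <cstring>

// Copies the object representation of a float into a 32-bit integer.
inline std::uint32_t floatBits(float X) {
  static_assert(sizeof(std::uint32_t) == sizeof(float), "expects 32-bit float");
  std::uint32_t Bits;
  std::memcpy(&Bits, &X, sizeof(Bits));
  return Bits;
}

// Inverse of floatBits, convenient for building special values in tests.
inline float floatFromBits(std::uint32_t Bits) {
  float X;
  std::memcpy(&X, &Bits, sizeof(X));
  return X;
}

// Exponent all ones and a non-zero mantissa: NaN.
inline bool bitsIsNaN(float X) {
  return (floatBits(X) & 0x7FFFFFFFu) > 0x7F800000u;
}

// Exponent all ones and a zero mantissa: +Inf or -Inf.
inline bool bitsIsInf(float X) {
  return (floatBits(X) & 0x7FFFFFFFu) == 0x7F800000u;
}

// Any exponent other than all ones: finite.
inline bool bitsIsFinite(float X) {
  return (floatBits(X) & 0x7F800000u) != 0x7F800000u;
}
```

The tests use only integer masks and comparisons, so they carry no fast-math flags of their own; a value such as `floatFromBits(0x7FC00000u)` is a quiet NaN and `floatFromBits(0xFF800000u)` is negative infinity.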

In D18513#394642, @scanon wrote:

Breaking isnan, isinf, etc. is a non-starter. I know it's appealing from a consistent formal model viewpoint, but in practice it breaks a lot of code (this is why we have the call fallbacks for iOS/OSX).

I would be delighted to have a finite-math-safe builtin to avoid the calls, but #2 and #4 both seem reasonable to me. #4 should probably be done regardless of the resolution of this conversation.

In D18513#394647, @resistor wrote:

I agree with Steve here. Even code that is built with -ffast-math has to live in a reality where NaNs, Infs, etc. are sometimes generated by external function calls, provided as user input, etc. It's critical that this code be able to filter out these invalid values, precisely because the body of the code will be optimized on the assumption that those values did not need to be considered.

I don't really disagree with any of this. For now, I'll abandon this patch (although I might revive the TLI infrastructure bits, as they're generally useful). I submitted D18639 to fix up libc++ <complex> in this regard.

sepavloff mentioned this in D104854: Introduce intrinsic llvm.isnan. Jun 24 2021, 6:16 AM

sepavloff mentioned this in rG16ff91ebccda: Introduce intrinsic llvm.isnan. Aug 4 2021, 1:28 AM

Revision Contents

Path                                                  Size
include/llvm/Analysis/TargetLibraryInfo.h             3 lines
include/llvm/Analysis/TargetLibraryInfo.def           27 lines
include/llvm/Transforms/Utils/SimplifyLibCalls.h      2 lines
lib/Analysis/TargetLibraryInfo.cpp                    27 lines
lib/Transforms/Utils/SimplifyLibCalls.cpp             58 lines
test/Transforms/InstCombine/fp-classify-libcalls.ll   85 lines

Diff 51774

include/llvm/Analysis/TargetLibraryInfo.h

 //===-- TargetLibraryInfo.h - Library information ---------------*- C++ -*-===//
 //
 //                     The LLVM Compiler Infrastructure
 //
 // This file is distributed under the University of Illinois Open Source
 // License. See LICENSE.TXT for details.
 //
 //===----------------------------------------------------------------------===//

 #ifndef LLVM_ANALYSIS_TARGETLIBRARYINFO_H
 #define LLVM_ANALYSIS_TARGETLIBRARYINFO_H

 #include "llvm/ADT/DenseMap.h"
 #include "llvm/ADT/ArrayRef.h"
 #include "llvm/ADT/Optional.h"
+#include "llvm/ADT/StringMap.h"
 #include "llvm/ADT/Triple.h"
 #include "llvm/IR/Function.h"
 #include "llvm/IR/Module.h"
 #include "llvm/IR/PassManager.h"
 #include "llvm/Pass.h"

 namespace llvm {
 /// VecDesc - Describes a possible vectorization of a function.
[... 20 lines not shown ...]
 /// make it available. However, it is somewhat expensive to compute and only
 /// depends on the triple. So users typically interact with the \c
 /// TargetLibraryInfo wrapper below.
 class TargetLibraryInfoImpl {
   friend class TargetLibraryInfo;

   unsigned char AvailableArray[(LibFunc::NumLibFuncs+3)/4];
   llvm::DenseMap<unsigned, std::string> CustomNames;
+  llvm::StringMap<unsigned> CustomNameFuncs;
   static const char *const StandardNames[LibFunc::NumLibFuncs];

   enum AvailabilityState {
     StandardName = 3, // (memset to all ones)
     CustomName = 1,
     Unavailable = 0 // (memset to all zeros)
   };
   void setState(LibFunc::Func F, AvailabilityState State) {
[... 50 lines not shown; context: "public:" ...]

   /// \brief Forces a function to be marked as available and provide an
   /// alternate name that must be used.
   void setAvailableWithName(LibFunc::Func F, StringRef Name) {
     if (StandardNames[F] != Name) {
       setState(F, CustomName);
       CustomNames[F] = Name;
       assert(CustomNames.find(F) != CustomNames.end());
+      CustomNameFuncs[Name] = F;
     } else {
       setState(F, StandardName);
     }
   }

   /// \brief Disables all builtins.
   ///
   /// This can be used for options like -fno-builtin.
[... 198 lines not shown ...]

include/llvm/Analysis/TargetLibraryInfo.def

[... 174 lines not shown ...]
 TLI_DEFINE_ENUM_INTERNAL(cxa_guard_abort)
 TLI_DEFINE_STRING_INTERNAL("__cxa_guard_abort")
 /// int __cxa_guard_acquire(guard_t *guard);
 TLI_DEFINE_ENUM_INTERNAL(cxa_guard_acquire)
 TLI_DEFINE_STRING_INTERNAL("__cxa_guard_acquire")
 /// void __cxa_guard_release(guard_t *guard);
 TLI_DEFINE_ENUM_INTERNAL(cxa_guard_release)
 TLI_DEFINE_STRING_INTERNAL("__cxa_guard_release")
+/// int __isfinite(double x);
+TLI_DEFINE_ENUM_INTERNAL(isfinite)
+TLI_DEFINE_STRING_INTERNAL("__isfinite")
+/// int __isfinitef(float x);
+TLI_DEFINE_ENUM_INTERNAL(isfinitef)
+TLI_DEFINE_STRING_INTERNAL("__isfinitef")
+/// int __isfinitel(long double x);
+TLI_DEFINE_ENUM_INTERNAL(isfinitel)
+TLI_DEFINE_STRING_INTERNAL("__isfinitel")
+/// int __isinf(double x);
+TLI_DEFINE_ENUM_INTERNAL(isinf)
+TLI_DEFINE_STRING_INTERNAL("__isinf")
+/// int __isinff(float x);
+TLI_DEFINE_ENUM_INTERNAL(isinff)
+TLI_DEFINE_STRING_INTERNAL("__isinff")
+/// int __isinfl(long double x);
+TLI_DEFINE_ENUM_INTERNAL(isinfl)
+TLI_DEFINE_STRING_INTERNAL("__isinfl")
+/// int __isnan(double x);
+TLI_DEFINE_ENUM_INTERNAL(isnan)
+TLI_DEFINE_STRING_INTERNAL("__isnan")
+/// int __isnanf(float x);
+TLI_DEFINE_ENUM_INTERNAL(isnanf)
+TLI_DEFINE_STRING_INTERNAL("__isnanf")
+/// int __isnanl(long double x);
+TLI_DEFINE_ENUM_INTERNAL(isnanl)
+TLI_DEFINE_STRING_INTERNAL("__isnanl")
 /// int __isoc99_scanf (const char *format, ...)
 TLI_DEFINE_ENUM_INTERNAL(dunder_isoc99_scanf)
 TLI_DEFINE_STRING_INTERNAL("__isoc99_scanf")
 /// int __isoc99_sscanf(const char *s, const char *format, ...)
 TLI_DEFINE_ENUM_INTERNAL(dunder_isoc99_sscanf)
 TLI_DEFINE_STRING_INTERNAL("__isoc99_sscanf")
 /// void *__memcpy_chk(void *s1, const void *s2, size_t n, size_t s1size);
 TLI_DEFINE_ENUM_INTERNAL(memcpy_chk)
[... 929 lines not shown ...]

include/llvm/Transforms/Utils/SimplifyLibCalls.h

[... 124 lines not shown; context: "private:" ...]
   Value *optimizeStringMemoryLibCall(CallInst *CI, IRBuilder<> &B);

   // Math Library Optimizations
   Value *optimizeCos(CallInst *CI, IRBuilder<> &B);
   Value *optimizePow(CallInst *CI, IRBuilder<> &B);
   Value *optimizeExp2(CallInst *CI, IRBuilder<> &B);
   Value *optimizeFabs(CallInst *CI, IRBuilder<> &B);
   Value *optimizeFMinFMax(CallInst *CI, IRBuilder<> &B);
+  Value *optimizeFPClassification(CallInst *CI, IRBuilder<> &B,
+                                  LibFunc::Func Func);
   Value *optimizeLog(CallInst *CI, IRBuilder<> &B);
   Value *optimizeSqrt(CallInst *CI, IRBuilder<> &B);
   Value *optimizeSinCosPi(CallInst *CI, IRBuilder<> &B);
   Value *optimizeTan(CallInst *CI, IRBuilder<> &B);

   // Integer Library Call Optimizations
   Value *optimizeFFS(CallInst *CI, IRBuilder<> &B);
   Value *optimizeAbs(CallInst *CI, IRBuilder<> &B);
[... 32 lines not shown ...]

lib/Analysis/TargetLibraryInfo.cpp

[... 379 lines not shown; context: "if (!T.isOSLinux()) {" ...]
     TLI.setUnavailable(LibFunc::ftello64);
     TLI.setUnavailable(LibFunc::lstat64);
     TLI.setUnavailable(LibFunc::open64);
     TLI.setUnavailable(LibFunc::stat64);
     TLI.setUnavailable(LibFunc::statvfs64);
     TLI.setUnavailable(LibFunc::tmpfile64);
   }

+  if (T.isOSLinux()) {
+    // On Linux (GLIBC), __isfinite* is just __finite*.
+    TLI.setAvailableWithName(LibFunc::isfinite, "__finite");
+    TLI.setAvailableWithName(LibFunc::isfinitef, "__finitef");
+    TLI.setAvailableWithName(LibFunc::isfinitel, "__finitel");
+  } else if (T.isOSDarwin() || T.isOSNetBSD()) {
+    // On Darwin and NetBSD, the double-precision FP classification functions
+    // end with a 'd'.
+    TLI.setAvailableWithName(LibFunc::isfinite, "__isfinited");
+    TLI.setAvailableWithName(LibFunc::isinf, "__isinfd");
+    TLI.setAvailableWithName(LibFunc::isnan, "__isnand");
+  }
+
   // As currently implemented in clang, NVPTX code has no standard library to
   // speak of. Headers provide a standard-ish library implementation, but many
   // of the signatures are wrong -- for example, many libm functions are not
   // extern "C".
   //
   // libdevice, an IR library provided by nvidia, is linked in by the front-end,
   // but only used functions are provided to llvm. Moreover, most of the
   // functions in libdevice don't map precisely to standard library functions.
[... 16 lines not shown ...]
 TargetLibraryInfoImpl::TargetLibraryInfoImpl(const Triple &T) {
   // Default to everything being available.
   memset(AvailableArray, -1, sizeof(AvailableArray));

   initialize(*this, T, StandardNames);
 }

 TargetLibraryInfoImpl::TargetLibraryInfoImpl(const TargetLibraryInfoImpl &TLI)
-    : CustomNames(TLI.CustomNames) {
+    : CustomNames(TLI.CustomNames), CustomNameFuncs(TLI.CustomNameFuncs) {
   memcpy(AvailableArray, TLI.AvailableArray, sizeof(AvailableArray));
   VectorDescs = TLI.VectorDescs;
   ScalarDescs = TLI.ScalarDescs;
 }

 TargetLibraryInfoImpl::TargetLibraryInfoImpl(TargetLibraryInfoImpl &&TLI)
-    : CustomNames(std::move(TLI.CustomNames)) {
+    : CustomNames(std::move(TLI.CustomNames)),
+      CustomNameFuncs(std::move(TLI.CustomNameFuncs)) {
   std::move(std::begin(TLI.AvailableArray), std::end(TLI.AvailableArray),
             AvailableArray);
   VectorDescs = TLI.VectorDescs;
   ScalarDescs = TLI.ScalarDescs;
 }

 TargetLibraryInfoImpl &TargetLibraryInfoImpl::operator=(const TargetLibraryInfoImpl &TLI) {
   CustomNames = TLI.CustomNames;
+  CustomNameFuncs = TLI.CustomNameFuncs;
   memcpy(AvailableArray, TLI.AvailableArray, sizeof(AvailableArray));
   return *this;
 }

 TargetLibraryInfoImpl &TargetLibraryInfoImpl::operator=(TargetLibraryInfoImpl &&TLI) {
   CustomNames = std::move(TLI.CustomNames);
+  CustomNameFuncs = std::move(TLI.CustomNameFuncs);
   std::move(std::begin(TLI.AvailableArray), std::end(TLI.AvailableArray),
             AvailableArray);
   return *this;
 }

 static StringRef sanitizeFunctionName(StringRef funcName) {
   // Filter out empty names and names containing null bytes, those can't be in
   // our table.
[... 17 lines not shown; context: "bool TargetLibraryInfoImpl::getLibFunc(StringRef funcName," ...]
   const char *const *I = std::lower_bound(
       Start, End, funcName, [](const char *LHS, StringRef RHS) {
         return std::strncmp(LHS, RHS.data(), RHS.size()) < 0;
       });
   if (I != End && *I == funcName) {
     F = (LibFunc::Func)(I - Start);
     return true;
   }

+  auto CNFI = CustomNameFuncs.find(funcName);
+  if (CNFI != CustomNameFuncs.end()) {
+    F = (LibFunc::Func) CNFI->second;
+    return true;
+  }
+
   return false;
 }

 void TargetLibraryInfoImpl::disableAllFunctions() {
   memset(AvailableArray, 0, sizeof(AvailableArray));
 }

 static bool compareByScalarFnName(const VecDesc &LHS, const VecDesc &RHS) {
[... 164 lines not shown ...]

lib/Transforms/Utils/SimplifyLibCalls.cpp

[... 1,387 lines not shown; context: "Value *LibCallSimplifier::optimizeFMinFMax(CallInst *CI, IRBuilder<> &B) {" ...]
   // exceptions, because fmin/fmax do not have those.
   Value *Op0 = CI->getArgOperand(0);
   Value *Op1 = CI->getArgOperand(1);
   Value *Cmp = Callee->getName().startswith("fmin") ?
     B.CreateFCmpOLT(Op0, Op1) : B.CreateFCmpOGT(Op0, Op1);
   return B.CreateSelect(Cmp, Op0, Op1);
 }

+Value *LibCallSimplifier::optimizeFPClassification(CallInst *CI,
+                                                   IRBuilder<> &B,
+                                                   LibFunc::Func Func) {
+  // isfinite, isnan, isinf* all take one floating-point argument and return
+  // an integer.
+  Function *Callee = CI->getCalledFunction();
+  FunctionType *FT = Callee->getFunctionType();
+  if (!FT->getReturnType()->isIntegerTy(32))
+    return nullptr;
+  if (FT->getNumParams() != 1)
+    return nullptr;
+  if (!FT->getParamType(0)->isFloatingPointTy())
+    return nullptr;
+
+  bool HasFunNoNaNAttr = false, HasFunNoInfAttr = false;
+  Function &F = *B.GetInsertBlock()->getParent();
+  if (F.hasFnAttribute("no-nans-fp-math"))
+    HasFunNoNaNAttr =
+      F.getFnAttribute("no-nans-fp-math").getValueAsString() == "true";
+  if (F.hasFnAttribute("no-infs-fp-math"))
+    HasFunNoInfAttr =
+      F.getFnAttribute("no-infs-fp-math").getValueAsString() == "true";
+
+  switch (Func) {
+  default: llvm_unreachable("Unknown FP classification function");
+  case LibFunc::isfinitef:
+  case LibFunc::isfinite:
+  case LibFunc::isfinitel:
+    if (!HasFunNoNaNAttr || !HasFunNoInfAttr)
+      return nullptr;
+    return ConstantInt::get(B.getInt32Ty(), 1);
+  case LibFunc::isnanf:
+  case LibFunc::isnan:
+  case LibFunc::isnanl:
+    if (!HasFunNoNaNAttr)
+      return nullptr;
+    break;
+  case LibFunc::isinff:
+  case LibFunc::isinf:
+  case LibFunc::isinfl:
+    if (!HasFunNoInfAttr)
+      return nullptr;
+    break;
+  }
+
+  return ConstantInt::get(B.getInt32Ty(), 0);
+}
+
 Value *LibCallSimplifier::optimizeLog(CallInst *CI, IRBuilder<> &B) {
   Function *Callee = CI->getCalledFunction();
   if (!matchesFPLibFunctionSignature(Callee, 1, false))
     return nullptr;

   Value *Ret = nullptr;
   StringRef Name = Callee->getName();
   if (UnsafeFPShrink && hasFloatVersion(Name))
[... 980 lines not shown; context: "case LibFunc::copysign:" ...]
       return nullptr;
     case LibFunc::fminf:
     case LibFunc::fmin:
     case LibFunc::fminl:
     case LibFunc::fmaxf:
     case LibFunc::fmax:
     case LibFunc::fmaxl:
       return optimizeFMinFMax(CI, Builder);
+    case LibFunc::isfinitef:
+    case LibFunc::isfinite:
+    case LibFunc::isfinitel:
+    case LibFunc::isnanf:
+    case LibFunc::isnan:
+    case LibFunc::isnanl:
+    case LibFunc::isinff:
+    case LibFunc::isinf:
+    case LibFunc::isinfl:
+      return optimizeFPClassification(CI, Builder, Func);
     default:
       return nullptr;
     }
   }
   return nullptr;
 }

 LibCallSimplifier::LibCallSimplifier(
[... 243 lines not shown ...]

test/Transforms/InstCombine/fp-classify-libcalls.ll

This file was added.

; RUN: opt -S < %s -instcombine | FileCheck %s
target datalayout = "E-m:e-i64:64-n32:64"
target triple = "powerpc64-unknown-linux-gnu"

; Function Attrs: nounwind readnone
define zeroext i1 @_Z2t1f(float %x) #0 {
entry:
  %call = tail call signext i32 @__finitef(float %x) #1
  %tobool = icmp ne i32 %call, 0
  ret i1 %tobool
; CHECK-LABEL: @_Z2t1f
; CHECK: ret i1 true
}

; Function Attrs: nounwind readnone
declare signext i32 @__finitef(float) #0

; Function Attrs: nounwind readnone
define zeroext i1 @_Z2t2f(float %x) #0 {
entry:
  %call = tail call signext i32 @__isnanf(float %x) #1
  %tobool = icmp ne i32 %call, 0
  ret i1 %tobool
; CHECK-LABEL: @_Z2t2f
; CHECK: ret i1 false
}

; Function Attrs: nounwind readnone
declare signext i32 @__isnanf(float) #0

; Function Attrs: nounwind readnone
define zeroext i1 @_Z2t3f(float %x) #0 {
entry:
  %call = tail call signext i32 @__isinff(float %x) #1
  %tobool = icmp ne i32 %call, 0
  ret i1 %tobool
; CHECK-LABEL: @_Z2t3f
; CHECK: ret i1 false
}

; Function Attrs: nounwind readnone
declare signext i32 @__isinff(float) #0

; Function Attrs: nounwind readnone
define zeroext i1 @_Z3t1dd(double %x) #0 {
entry:
  %call = tail call signext i32 @__finite(double %x) #1
  %tobool = icmp ne i32 %call, 0
  ret i1 %tobool
; CHECK-LABEL: @_Z3t1dd
; CHECK: ret i1 true
}

; Function Attrs: nounwind readnone
declare signext i32 @__finite(double) #0

; Function Attrs: nounwind readnone
define zeroext i1 @_Z3t2dd(double %x) #0 {
entry:
  %call = tail call signext i32 @__isnan(double %x) #1
  %tobool = icmp ne i32 %call, 0
  ret i1 %tobool
; CHECK-LABEL: @_Z3t2dd
; CHECK: ret i1 false
}

; Function Attrs: nounwind readnone
declare signext i32 @__isnan(double) #0

; Function Attrs: nounwind readnone
define zeroext i1 @_Z3t3dd(double %x) #0 {
entry:
  %call = tail call signext i32 @__isinf(double %x) #1
  %tobool = icmp ne i32 %call, 0
  ret i1 %tobool
; CHECK-LABEL: @_Z3t3dd
; CHECK: ret i1 false
}

; Function Attrs: nounwind readnone
declare signext i32 @__isinf(double) #0

attributes #0 = { nounwind readnone "no-infs-fp-math"="true" "no-nans-fp-math"="true" "unsafe-fp-math"="true" }
attributes #1 = { nounwind readnone }
