This is an archive of the discontinued LLVM Phabricator instance.

Differential D96011

[NFC][Analysis] Change struct VecDesc to use ElementCount
Closed · Public

Authored by david-arm on Feb 4 2021, 3:19 AM.

Details

Reviewers

sdesmalen
kmclaughlin
spatel
fpetrogalli

Commits

rG01b87444cb02: [NFC][Analysis] Change struct VecDesc to use ElementCount

Summary

This patch changes the VecDesc struct to use ElementCount
instead of an unsigned VF value, in preparation for
future work that adds support for vectorized versions of
math functions using scalable vectors. Since all I'm doing
in this patch is switching the type I believe it's a
non-functional change. I changed getWidestVF to now return
both the widest fixed-width and scalable VF values, but
currently the widest scalable value will be zero.
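
As a rough illustration of what the type switch means for lookups, here is a minimal sketch. The `ElementCount` struct is a hypothetical stand-in for `llvm::ElementCount` (reduced to a lane count plus a scalable flag), the linear-scan lookup and the `makeTable` helper are assumptions made for illustration, and only the two table entries are taken from the libmvec section of the diff.

```cpp
#include <cassert>
#include <string>
#include <vector>

// Hypothetical stand-in for llvm::ElementCount: a minimum lane count plus a
// flag saying whether that count is multiplied by the runtime value vscale.
struct ElementCount {
  unsigned MinVal;
  bool Scalable;

  static ElementCount getFixed(unsigned N) { return ElementCount{N, false}; }
  static ElementCount getScalable(unsigned N) { return ElementCount{N, true}; }
  bool operator==(const ElementCount &RHS) const {
    return MinVal == RHS.MinVal && Scalable == RHS.Scalable;
  }
};

// Simplified VecDesc: the vectorization factor is an ElementCount rather
// than a plain unsigned.
struct VecDesc {
  std::string ScalarFnName;
  std::string VectorFnName;
  ElementCount VectorizationFactor;
};

// Illustrative linear-scan lookup (an assumption, not the real search).
// Returns the empty string when no mapping exists.
std::string getVectorizedFunction(const std::vector<VecDesc> &Table,
                                  const std::string &F, ElementCount VF) {
  for (const VecDesc &D : Table)
    if (D.ScalarFnName == F && D.VectorizationFactor == VF)
      return D.VectorFnName;
  return std::string();
}

// Hypothetical helper building a two-entry table from the libmvec entries.
std::vector<VecDesc> makeTable() {
  return {{"sinf", "_ZGVbN4v_sinf", ElementCount::getFixed(4)},
          {"sinf", "_ZGVdN8v_sinf", ElementCount::getFixed(8)}};
}
```

In this sketch a fixed VF of 4 and a scalable VF of 4 are different keys, so a table that holds only fixed-width entries never matches a scalable query.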

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

david-arm created this revision. Feb 4 2021, 3:19 AM

Herald added a subscriber: hiraditya. Feb 4 2021, 3:19 AM

david-arm requested review of this revision. Feb 4 2021, 3:19 AM

Herald added a project: Restricted Project. Feb 4 2021, 3:19 AM

Herald added a subscriber: llvm-commits.

Harbormaster completed remote builds in B87865: Diff 321366. Feb 4 2021, 3:19 AM

david-arm added a parent revision: D95934: [Analysis] Remove unused functions from TargetLibraryInfo. Feb 4 2021, 3:38 AM

sdesmalen added inline comments. Feb 8 2021, 7:49 AM

llvm/include/llvm/Analysis/VectorUtils.h
194 ↗	(On Diff #321366)	Can you pull this change out into a separate patch? Then the remainder of this patch becomes NFC (especially when you add asserts that the VF must be fixed-width for the entries).
llvm/lib/Analysis/TargetLibraryInfo.cpp
1673–1674	Can you define these as ElementCount (and change the interface to take an ElementCount instead of unsigned), and do the initialization before the call to getWidestVF?
1681–1684	nit: `ElementCount *Ptr = VF.isScalable() ? &ScalableVF : &FixedVF; if (ElementCount::isKnownGT(VF, *Ptr)) *Ptr = VF;`
llvm/lib/Transforms/Utils/InjectTLIMappings.cpp
109	nit: unnecessary curly braces.
121	This needs an assert to make sure WidestScalableVF is not set, so that it is not silently ignoring any scalable entries.
llvm/lib/Transforms/Vectorize/LoopVectorizationLegality.cpp
604–607	This can probably return false directly when WidestScalableVF is > 0, because there is no way to scalarize for scalable VFs.
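
The pointer-selection nit for `getWidestVF` can be sketched as follows. This is a self-contained approximation under stated assumptions: `ElementCount` is a hypothetical stand-in for the LLVM class, the table is passed in explicitly instead of living inside `TargetLibraryInfoImpl`, and the starting values (fixed 1, scalable 0) are an assumption consistent with the summary's note that the widest scalable value is currently zero.

```cpp
#include <cassert>
#include <string>
#include <vector>

// Hypothetical stand-in for llvm::ElementCount.
struct ElementCount {
  unsigned MinVal;
  bool Scalable;

  static ElementCount getFixed(unsigned N) { return ElementCount{N, false}; }
  static ElementCount getScalable(unsigned N) { return ElementCount{N, true}; }
  bool isScalable() const { return Scalable; }
  // A "known" comparison only holds between counts of the same kind.
  static bool isKnownGT(const ElementCount &L, const ElementCount &R) {
    return L.Scalable == R.Scalable && L.MinVal > R.MinVal;
  }
};

struct VecDesc {
  std::string ScalarFnName;
  std::string VectorFnName;
  ElementCount VectorizationFactor;
};

// Reports the widest fixed-width and widest scalable VF for ScalarF through
// two out-parameters, mirroring the interface shape introduced by the patch.
void getWidestVF(const std::vector<VecDesc> &Table, const std::string &ScalarF,
                 ElementCount &FixedVF, ElementCount &ScalableVF) {
  FixedVF = ElementCount::getFixed(1);       // assumed starting value
  ScalableVF = ElementCount::getScalable(0); // zero means "no scalable entry"
  for (const VecDesc &D : Table) {
    if (D.ScalarFnName != ScalarF)
      continue;
    const ElementCount &VF = D.VectorizationFactor;
    // Pick the running maximum that matches the entry's kind, then update it.
    ElementCount *Ptr = VF.isScalable() ? &ScalableVF : &FixedVF;
    if (ElementCount::isKnownGT(VF, *Ptr))
      *Ptr = VF;
  }
}
```

Selecting the target through a pointer keeps a single comparison and assignment for both kinds, instead of duplicating the update in two branches.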

david-arm added inline comments. Feb 8 2021, 7:59 AM

llvm/lib/Analysis/TargetLibraryInfo.cpp
1673–1674	Hi @sdesmalen, I did that initially but then I thought it seemed a bit unnecessary to have them as ElementCounts because the interface clearly defines one as fixed and the other as scalable. I can change them back to ElementCount - it just felt like we were doing redundant extra work that's all.

sdesmalen added inline comments. Feb 8 2021, 8:15 AM

llvm/lib/Analysis/TargetLibraryInfo.cpp
1673–1674	What is the redundant extra work? The only difference is in where the initialization is done. Now all uses of FixedVF coming from `getWidestVF` have to write explicit ElementCount::getFixed(). Just FYI, I had to solve a similar problem for returning max-VFs in D96025, for which I added a new class `OptionalVFCandidates`, and in a subsequent patch (not yet posted) added a function that generates all power-of-two VFs up to the max-VFs.

david-arm added inline comments. Feb 9 2021, 5:35 AM

llvm/lib/Transforms/Vectorize/LoopVectorizationLegality.cpp
604–607	I thought about that as well when I first did the patch, but I don't think that's right. This function is not asking if something should be scalarised, but instead asking if there is at least one variant of the function that is vectorisable. If we return false here when WidestScalableVF > 0 then that's a bug I think, because that means we're saying we can definitely vectorise at least one function, which may not be true. As an alternative I could add an assert like this: `assert(WidestScalableVF.isZero() || !Scalarize);` Also, returning true from this function doesn't mean we are going to scalarise using a scalable VF, even if the widest fixed and scalable VFs are both non-zero. It just means that no vectorisable variants could be found and therefore the caller must decide what to do and which variant to scalarise - fixed or scalable?
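
The check described in this comment might look roughly like the sketch below. It is not the code under review: `ElementCount` is a hypothetical stand-in, `mustScalarize` and `HasVariantFn` are invented names, and the callback replaces the real `TargetLibraryInfo` query. The scalable loop starts at a minimum of 1 lane, which is an assumption of this sketch.

```cpp
#include <cassert>
#include <functional>

// Hypothetical stand-in for llvm::ElementCount, reduced to what is needed.
struct ElementCount {
  unsigned MinVal;
  bool Scalable;

  static ElementCount getFixed(unsigned N) { return ElementCount{N, false}; }
  static ElementCount getScalable(unsigned N) { return ElementCount{N, true}; }
  bool isZero() const { return MinVal == 0; }
  // A "known" comparison only holds between counts of the same kind.
  static bool isKnownLE(const ElementCount &L, const ElementCount &R) {
    return L.Scalable == R.Scalable && L.MinVal <= R.MinVal;
  }
  ElementCount &operator*=(unsigned RHS) {
    MinVal *= RHS;
    return *this;
  }
};

// Hypothetical callback: true if a vector variant exists for the given VF.
using HasVariantFn = std::function<bool(ElementCount)>;

// Returns true only when no power-of-two VF up to the widest fixed or
// scalable VF has a vector variant, i.e. the call can only be scalarised.
bool mustScalarize(ElementCount WidestFixedVF, ElementCount WidestScalableVF,
                   const HasVariantFn &HasVariant) {
  bool Scalarize = true;
  for (ElementCount VF = ElementCount::getFixed(2);
       ElementCount::isKnownLE(VF, WidestFixedVF); VF *= 2)
    Scalarize = Scalarize && !HasVariant(VF);
  for (ElementCount VF = ElementCount::getScalable(1);
       ElementCount::isKnownLE(VF, WidestScalableVF); VF *= 2)
    Scalarize = Scalarize && !HasVariant(VF);
  // A non-zero widest scalable VF implies some scalable variant was found,
  // so the function must not also claim that everything is scalarised.
  assert((WidestScalableVF.isZero() || !Scalarize) &&
         "caller would have to scalarise a scalable-VF variant");
  return Scalarize;
}
```

With a zero widest scalable VF the second loop never runs, so the result depends only on the fixed-width entries, which matches the non-functional intent of the patch.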

david-arm updated this revision to Diff 322377. Feb 9 2021, 6:33 AM

david-arm added a parent revision: D96338: [Analysis] Change VFABI::mangleTLIVectorName to use ElementCount.

david-arm marked 6 inline comments as done.

sdesmalen added inline comments. Feb 11 2021, 1:40 AM

llvm/lib/Analysis/TargetLibraryInfo.cpp
1611–1613	nit: `const ElementCount &`
llvm/lib/Transforms/Utils/InjectTLIMappings.cpp
114	nit: s/unsupported/not yet supported/
llvm/lib/Transforms/Vectorize/LoopVectorizationLegality.cpp
598	Should we just add a constructor that takes no arguments and does the same as getNull?
603–608	This should be `ElementCount::getScalable(1)` because `<vscale x 1 x <eltty>>` may well be a valid type for some scalable vector architectures. For SVE we don't want loops to be vectorized with this VF because it is expensive to legalize, but we shouldn't work with that assumption for generic code.

david-arm updated this revision to Diff 322999. Feb 11 2021, 7:17 AM

Herald added a subscriber: dexonsmith. Feb 11 2021, 7:17 AM

david-arm marked 4 inline comments as done. Feb 11 2021, 7:18 AM

Thanks for the changes, LGTM.

llvm/lib/Transforms/Utils/InjectTLIMappings.cpp
91–94	nit: can you move this closer to the for loop?

This revision is now accepted and ready to land. Feb 11 2021, 7:26 AM

Closed by commit rG01b87444cb02: [NFC][Analysis] Change struct VecDesc to use ElementCount (authored by david-arm). Feb 12 2021, 3:08 AM

This revision was automatically updated to reflect the committed changes.

david-arm added a commit: rG01b87444cb02: [NFC][Analysis] Change struct VecDesc to use ElementCount.

spatel mentioned this in D95373: Replace vector intrinsics with call to vector library. Feb 12 2021, 6:52 AM

Revision Contents

Path	Size
llvm/include/llvm/Analysis/TargetLibraryInfo.h	18 lines
llvm/include/llvm/Analysis/VecFuncs.def	581 lines
llvm/include/llvm/Support/TypeSize.h	1 line
llvm/lib/Analysis/TargetLibraryInfo.cpp	23 lines
llvm/lib/Transforms/Utils/InjectTLIMappings.cpp	24 lines
llvm/lib/Transforms/Vectorize/LoopVectorizationLegality.cpp	15 lines

Diff 323278

llvm/include/llvm/Analysis/TargetLibraryInfo.h

[... 22 lines not shown ...]
class Triple;		class Triple;

/// Describes a possible vectorization of a function.		/// Describes a possible vectorization of a function.
/// Function 'VectorFnName' is equivalent to 'ScalarFnName' vectorized		/// Function 'VectorFnName' is equivalent to 'ScalarFnName' vectorized
/// by a factor 'VectorizationFactor'.		/// by a factor 'VectorizationFactor'.
struct VecDesc {		struct VecDesc {
StringRef ScalarFnName;		StringRef ScalarFnName;
StringRef VectorFnName;		StringRef VectorFnName;
unsigned VectorizationFactor;		ElementCount VectorizationFactor;
};		};

enum LibFunc : unsigned {		enum LibFunc : unsigned {
#define TLI_DEFINE_ENUM		#define TLI_DEFINE_ENUM
#include "llvm/Analysis/TargetLibraryInfo.def"		#include "llvm/Analysis/TargetLibraryInfo.def"

NumLibFuncs,		NumLibFuncs,
NotLibFunc		NotLibFunc
[... 107 lines not shown ...]	public:
void addVectorizableFunctions(ArrayRef<VecDesc> Fns);		void addVectorizableFunctions(ArrayRef<VecDesc> Fns);

/// Calls addVectorizableFunctions with a known preset of functions for the		/// Calls addVectorizableFunctions with a known preset of functions for the
/// given vector library.		/// given vector library.
void addVectorizableFunctionsFromVecLib(enum VectorLibrary VecLib);		void addVectorizableFunctionsFromVecLib(enum VectorLibrary VecLib);

/// Return true if the function F has a vector equivalent with vectorization		/// Return true if the function F has a vector equivalent with vectorization
/// factor VF.		/// factor VF.
bool isFunctionVectorizable(StringRef F, unsigned VF) const {		bool isFunctionVectorizable(StringRef F, const ElementCount &VF) const {
return !getVectorizedFunction(F, VF).empty();		return !getVectorizedFunction(F, VF).empty();
}		}

/// Return true if the function F has a vector equivalent with any		/// Return true if the function F has a vector equivalent with any
/// vectorization factor.		/// vectorization factor.
bool isFunctionVectorizable(StringRef F) const;		bool isFunctionVectorizable(StringRef F) const;

/// Return the name of the equivalent of F, vectorized with factor VF. If no		/// Return the name of the equivalent of F, vectorized with factor VF. If no
/// such mapping exists, return the empty string.		/// such mapping exists, return the empty string.
StringRef getVectorizedFunction(StringRef F, unsigned VF) const;		StringRef getVectorizedFunction(StringRef F, const ElementCount &VF) const;

/// Set to true iff i32 parameters to library functions should have signext		/// Set to true iff i32 parameters to library functions should have signext
/// or zeroext attributes if they correspond to C-level int or unsigned int,		/// or zeroext attributes if they correspond to C-level int or unsigned int,
/// respectively.		/// respectively.
void setShouldExtI32Param(bool Val) {		void setShouldExtI32Param(bool Val) {
ShouldExtI32Param = Val;		ShouldExtI32Param = Val;
}		}

[... 11 lines not shown ...]	public:
}		}

/// Returns the size of the wchar_t type in bytes or 0 if the size is unknown.		/// Returns the size of the wchar_t type in bytes or 0 if the size is unknown.
/// This queries the 'wchar_size' metadata.		/// This queries the 'wchar_size' metadata.
unsigned getWCharSize(const Module &M) const;		unsigned getWCharSize(const Module &M) const;

/// Returns the largest vectorization factor used in the list of		/// Returns the largest vectorization factor used in the list of
/// vector functions.		/// vector functions.
unsigned getWidestVF(StringRef ScalarF) const;		void getWidestVF(StringRef ScalarF, ElementCount &FixedVF,
		ElementCount &Scalable) const;
};		};

/// Provides information about what library functions are available for		/// Provides information about what library functions are available for
/// the current target.		/// the current target.
///		///
/// This both allows optimizations to handle them specially and frontends to		/// This both allows optimizations to handle them specially and frontends to
/// disable such optimizations through -fno-builtin etc.		/// disable such optimizations through -fno-builtin etc.
class TargetLibraryInfo {		class TargetLibraryInfo {
[... 98 lines not shown ...]	if (OverrideAsUnavailable[F])
return TargetLibraryInfoImpl::Unavailable;		return TargetLibraryInfoImpl::Unavailable;
return Impl->getState(F);		return Impl->getState(F);
}		}

/// Tests whether a library function is available.		/// Tests whether a library function is available.
bool has(LibFunc F) const {		bool has(LibFunc F) const {
return getState(F) != TargetLibraryInfoImpl::Unavailable;		return getState(F) != TargetLibraryInfoImpl::Unavailable;
}		}
bool isFunctionVectorizable(StringRef F, unsigned VF) const {		bool isFunctionVectorizable(StringRef F, const ElementCount &VF) const {
return Impl->isFunctionVectorizable(F, VF);		return Impl->isFunctionVectorizable(F, VF);
}		}
bool isFunctionVectorizable(StringRef F) const {		bool isFunctionVectorizable(StringRef F) const {
return Impl->isFunctionVectorizable(F);		return Impl->isFunctionVectorizable(F);
}		}
StringRef getVectorizedFunction(StringRef F, unsigned VF) const {		StringRef getVectorizedFunction(StringRef F, const ElementCount &VF) const {
return Impl->getVectorizedFunction(F, VF);		return Impl->getVectorizedFunction(F, VF);
}		}

/// Tests if the function is both available and a candidate for optimized code		/// Tests if the function is both available and a candidate for optimized code
/// generation.		/// generation.
bool hasOptimizedCodeGen(LibFunc F) const {		bool hasOptimizedCodeGen(LibFunc F) const {
if (getState(F) == TargetLibraryInfoImpl::Unavailable)		if (getState(F) == TargetLibraryInfoImpl::Unavailable)
return false;		return false;
[... 69 lines not shown ...]	bool invalidate(Module &, const PreservedAnalyses &,
return false;		return false;
}		}
bool invalidate(Function &, const PreservedAnalyses &,		bool invalidate(Function &, const PreservedAnalyses &,
FunctionAnalysisManager::Invalidator &) {		FunctionAnalysisManager::Invalidator &) {
return false;		return false;
}		}
/// Returns the largest vectorization factor used in the list of		/// Returns the largest vectorization factor used in the list of
/// vector functions.		/// vector functions.
unsigned getWidestVF(StringRef ScalarF) const {		void getWidestVF(StringRef ScalarF, ElementCount &FixedVF,
return Impl->getWidestVF(ScalarF);		ElementCount &ScalableVF) const {
		Impl->getWidestVF(ScalarF, FixedVF, ScalableVF);
}		}

/// Check if the function "F" is listed in a library known to LLVM.		/// Check if the function "F" is listed in a library known to LLVM.
bool isKnownVectorFunctionInLibrary(StringRef F) const {		bool isKnownVectorFunctionInLibrary(StringRef F) const {
return this->isFunctionVectorizable(F);		return this->isFunctionVectorizable(F);
}		}
};		};

[... 51 lines not shown ...]

llvm/include/llvm/Analysis/VecFuncs.def

	[... 11 lines not shown ...]
	// This .def file also allows creating an array of vector functions supported in			// This .def file also allows creating an array of vector functions supported in
	// the specified framework or library.			// the specified framework or library.

	#if defined(TLI_DEFINE_MASSV_VECFUNCS_NAMES)			#if defined(TLI_DEFINE_MASSV_VECFUNCS_NAMES)
	#define TLI_DEFINE_MASSV_VECFUNCS			#define TLI_DEFINE_MASSV_VECFUNCS
	#define TLI_DEFINE_VECFUNC(SCAL, VEC, VF) VEC,			#define TLI_DEFINE_VECFUNC(SCAL, VEC, VF) VEC,
	#endif			#endif

				#define FIXED(NL) ElementCount::getFixed(NL)
				#define SCALABLE(NL) ElementCount::getScalable(NL)

	#if !(defined(TLI_DEFINE_VECFUNC))			#if !(defined(TLI_DEFINE_VECFUNC))
	#define TLI_DEFINE_VECFUNC(SCAL, VEC, VF) {SCAL, VEC, VF},			#define TLI_DEFINE_VECFUNC(SCAL, VEC, VF) {SCAL, VEC, VF},
	#endif			#endif

	#if defined(TLI_DEFINE_ACCELERATE_VECFUNCS)			#if defined(TLI_DEFINE_ACCELERATE_VECFUNCS)
	// Accelerate framework's Vector Functions			// Accelerate framework's Vector Functions

	// Floating-Point Arithmetic and Auxiliary Functions			// Floating-Point Arithmetic and Auxiliary Functions
	TLI_DEFINE_VECFUNC("ceilf", "vceilf", 4)			TLI_DEFINE_VECFUNC("ceilf", "vceilf", FIXED(4))
	TLI_DEFINE_VECFUNC("fabsf", "vfabsf", 4)			TLI_DEFINE_VECFUNC("fabsf", "vfabsf", FIXED(4))
	TLI_DEFINE_VECFUNC("llvm.fabs.f32", "vfabsf", 4)			TLI_DEFINE_VECFUNC("llvm.fabs.f32", "vfabsf", FIXED(4))
	TLI_DEFINE_VECFUNC("floorf", "vfloorf", 4)			TLI_DEFINE_VECFUNC("floorf", "vfloorf", FIXED(4))
	TLI_DEFINE_VECFUNC("sqrtf", "vsqrtf", 4)			TLI_DEFINE_VECFUNC("sqrtf", "vsqrtf", FIXED(4))
	TLI_DEFINE_VECFUNC("llvm.sqrt.f32", "vsqrtf", 4)			TLI_DEFINE_VECFUNC("llvm.sqrt.f32", "vsqrtf", FIXED(4))

	// Exponential and Logarithmic Functions			// Exponential and Logarithmic Functions
	TLI_DEFINE_VECFUNC("expf", "vexpf", 4)			TLI_DEFINE_VECFUNC("expf", "vexpf", FIXED(4))
	TLI_DEFINE_VECFUNC("llvm.exp.f32", "vexpf", 4)			TLI_DEFINE_VECFUNC("llvm.exp.f32", "vexpf", FIXED(4))
	TLI_DEFINE_VECFUNC("expm1f", "vexpm1f", 4)			TLI_DEFINE_VECFUNC("expm1f", "vexpm1f", FIXED(4))
	TLI_DEFINE_VECFUNC("logf", "vlogf", 4)			TLI_DEFINE_VECFUNC("logf", "vlogf", FIXED(4))
	TLI_DEFINE_VECFUNC("llvm.log.f32", "vlogf", 4)			TLI_DEFINE_VECFUNC("llvm.log.f32", "vlogf", FIXED(4))
	TLI_DEFINE_VECFUNC("log1pf", "vlog1pf", 4)			TLI_DEFINE_VECFUNC("log1pf", "vlog1pf", FIXED(4))
	TLI_DEFINE_VECFUNC("log10f", "vlog10f", 4)			TLI_DEFINE_VECFUNC("log10f", "vlog10f", FIXED(4))
	TLI_DEFINE_VECFUNC("llvm.log10.f32", "vlog10f", 4)			TLI_DEFINE_VECFUNC("llvm.log10.f32", "vlog10f", FIXED(4))
	TLI_DEFINE_VECFUNC("logbf", "vlogbf", 4)			TLI_DEFINE_VECFUNC("logbf", "vlogbf", FIXED(4))

	// Trigonometric Functions			// Trigonometric Functions
	TLI_DEFINE_VECFUNC("sinf", "vsinf", 4)			TLI_DEFINE_VECFUNC("sinf", "vsinf", FIXED(4))
	TLI_DEFINE_VECFUNC("llvm.sin.f32", "vsinf", 4)			TLI_DEFINE_VECFUNC("llvm.sin.f32", "vsinf", FIXED(4))
	TLI_DEFINE_VECFUNC("cosf", "vcosf", 4)			TLI_DEFINE_VECFUNC("cosf", "vcosf", FIXED(4))
	TLI_DEFINE_VECFUNC("llvm.cos.f32", "vcosf", 4)			TLI_DEFINE_VECFUNC("llvm.cos.f32", "vcosf", FIXED(4))
	TLI_DEFINE_VECFUNC("tanf", "vtanf", 4)			TLI_DEFINE_VECFUNC("tanf", "vtanf", FIXED(4))
	TLI_DEFINE_VECFUNC("asinf", "vasinf", 4)			TLI_DEFINE_VECFUNC("asinf", "vasinf", FIXED(4))
	TLI_DEFINE_VECFUNC("acosf", "vacosf", 4)			TLI_DEFINE_VECFUNC("acosf", "vacosf", FIXED(4))
	TLI_DEFINE_VECFUNC("atanf", "vatanf", 4)			TLI_DEFINE_VECFUNC("atanf", "vatanf", FIXED(4))

	// Hyperbolic Functions			// Hyperbolic Functions
	TLI_DEFINE_VECFUNC("sinhf", "vsinhf", 4)			TLI_DEFINE_VECFUNC("sinhf", "vsinhf", FIXED(4))
	TLI_DEFINE_VECFUNC("coshf", "vcoshf", 4)			TLI_DEFINE_VECFUNC("coshf", "vcoshf", FIXED(4))
	TLI_DEFINE_VECFUNC("tanhf", "vtanhf", 4)			TLI_DEFINE_VECFUNC("tanhf", "vtanhf", FIXED(4))
	TLI_DEFINE_VECFUNC("asinhf", "vasinhf", 4)			TLI_DEFINE_VECFUNC("asinhf", "vasinhf", FIXED(4))
	TLI_DEFINE_VECFUNC("acoshf", "vacoshf", 4)			TLI_DEFINE_VECFUNC("acoshf", "vacoshf", FIXED(4))
	TLI_DEFINE_VECFUNC("atanhf", "vatanhf", 4)			TLI_DEFINE_VECFUNC("atanhf", "vatanhf", FIXED(4))


	#elif defined(TLI_DEFINE_LIBMVEC_X86_VECFUNCS)			#elif defined(TLI_DEFINE_LIBMVEC_X86_VECFUNCS)
	// GLIBC Vector math Functions			// GLIBC Vector math Functions

	TLI_DEFINE_VECFUNC("sin", "_ZGVbN2v_sin", 2)			TLI_DEFINE_VECFUNC("sin", "_ZGVbN2v_sin", FIXED(2))
	TLI_DEFINE_VECFUNC("sin", "_ZGVdN4v_sin", 4)			TLI_DEFINE_VECFUNC("sin", "_ZGVdN4v_sin", FIXED(4))

	TLI_DEFINE_VECFUNC("sinf", "_ZGVbN4v_sinf", 4)			TLI_DEFINE_VECFUNC("sinf", "_ZGVbN4v_sinf", FIXED(4))
	TLI_DEFINE_VECFUNC("sinf", "_ZGVdN8v_sinf", 8)			TLI_DEFINE_VECFUNC("sinf", "_ZGVdN8v_sinf", FIXED(8))

	TLI_DEFINE_VECFUNC("llvm.sin.f64", "_ZGVbN2v_sin", 2)			TLI_DEFINE_VECFUNC("llvm.sin.f64", "_ZGVbN2v_sin", FIXED(2))
	TLI_DEFINE_VECFUNC("llvm.sin.f64", "_ZGVdN4v_sin", 4)			TLI_DEFINE_VECFUNC("llvm.sin.f64", "_ZGVdN4v_sin", FIXED(4))

	TLI_DEFINE_VECFUNC("llvm.sin.f32", "_ZGVbN4v_sinf", 4)			TLI_DEFINE_VECFUNC("llvm.sin.f32", "_ZGVbN4v_sinf", FIXED(4))
	TLI_DEFINE_VECFUNC("llvm.sin.f32", "_ZGVdN8v_sinf", 8)			TLI_DEFINE_VECFUNC("llvm.sin.f32", "_ZGVdN8v_sinf", FIXED(8))

	TLI_DEFINE_VECFUNC("cos", "_ZGVbN2v_cos", 2)			TLI_DEFINE_VECFUNC("cos", "_ZGVbN2v_cos", FIXED(2))
	TLI_DEFINE_VECFUNC("cos", "_ZGVdN4v_cos", 4)			TLI_DEFINE_VECFUNC("cos", "_ZGVdN4v_cos", FIXED(4))

	TLI_DEFINE_VECFUNC("cosf", "_ZGVbN4v_cosf", 4)			TLI_DEFINE_VECFUNC("cosf", "_ZGVbN4v_cosf", FIXED(4))
	TLI_DEFINE_VECFUNC("cosf", "_ZGVdN8v_cosf", 8)			TLI_DEFINE_VECFUNC("cosf", "_ZGVdN8v_cosf", FIXED(8))

	TLI_DEFINE_VECFUNC("llvm.cos.f64", "_ZGVbN2v_cos", 2)			TLI_DEFINE_VECFUNC("llvm.cos.f64", "_ZGVbN2v_cos", FIXED(2))
	TLI_DEFINE_VECFUNC("llvm.cos.f64", "_ZGVdN4v_cos", 4)			TLI_DEFINE_VECFUNC("llvm.cos.f64", "_ZGVdN4v_cos", FIXED(4))

	TLI_DEFINE_VECFUNC("llvm.cos.f32", "_ZGVbN4v_cosf", 4)			TLI_DEFINE_VECFUNC("llvm.cos.f32", "_ZGVbN4v_cosf", FIXED(4))
	TLI_DEFINE_VECFUNC("llvm.cos.f32", "_ZGVdN8v_cosf", 8)			TLI_DEFINE_VECFUNC("llvm.cos.f32", "_ZGVdN8v_cosf", FIXED(8))

	TLI_DEFINE_VECFUNC("pow", "_ZGVbN2vv_pow", 2)			TLI_DEFINE_VECFUNC("pow", "_ZGVbN2vv_pow", FIXED(2))
	TLI_DEFINE_VECFUNC("pow", "_ZGVdN4vv_pow", 4)			TLI_DEFINE_VECFUNC("pow", "_ZGVdN4vv_pow", FIXED(4))

	TLI_DEFINE_VECFUNC("powf", "_ZGVbN4vv_powf", 4)			TLI_DEFINE_VECFUNC("powf", "_ZGVbN4vv_powf", FIXED(4))
	TLI_DEFINE_VECFUNC("powf", "_ZGVdN8vv_powf", 8)			TLI_DEFINE_VECFUNC("powf", "_ZGVdN8vv_powf", FIXED(8))

	TLI_DEFINE_VECFUNC("__pow_finite", "_ZGVbN2vv___pow_finite", 2)			TLI_DEFINE_VECFUNC("__pow_finite", "_ZGVbN2vv___pow_finite", FIXED(2))
	TLI_DEFINE_VECFUNC("__pow_finite", "_ZGVdN4vv___pow_finite", 4)			TLI_DEFINE_VECFUNC("__pow_finite", "_ZGVdN4vv___pow_finite", FIXED(4))

	TLI_DEFINE_VECFUNC("__powf_finite", "_ZGVbN4vv___powf_finite", 4)			TLI_DEFINE_VECFUNC("__powf_finite", "_ZGVbN4vv___powf_finite", FIXED(4))
	TLI_DEFINE_VECFUNC("__powf_finite", "_ZGVdN8vv___powf_finite", 8)			TLI_DEFINE_VECFUNC("__powf_finite", "_ZGVdN8vv___powf_finite", FIXED(8))

	TLI_DEFINE_VECFUNC("llvm.pow.f64", "_ZGVbN2vv_pow", 2)			TLI_DEFINE_VECFUNC("llvm.pow.f64", "_ZGVbN2vv_pow", FIXED(2))
	TLI_DEFINE_VECFUNC("llvm.pow.f64", "_ZGVdN4vv_pow", 4)			TLI_DEFINE_VECFUNC("llvm.pow.f64", "_ZGVdN4vv_pow", FIXED(4))

	TLI_DEFINE_VECFUNC("llvm.pow.f32", "_ZGVbN4vv_powf", 4)			TLI_DEFINE_VECFUNC("llvm.pow.f32", "_ZGVbN4vv_powf", FIXED(4))
	TLI_DEFINE_VECFUNC("llvm.pow.f32", "_ZGVdN8vv_powf", 8)			TLI_DEFINE_VECFUNC("llvm.pow.f32", "_ZGVdN8vv_powf", FIXED(8))

	TLI_DEFINE_VECFUNC("exp", "_ZGVbN2v_exp", 2)			TLI_DEFINE_VECFUNC("exp", "_ZGVbN2v_exp", FIXED(2))
	TLI_DEFINE_VECFUNC("exp", "_ZGVdN4v_exp", 4)			TLI_DEFINE_VECFUNC("exp", "_ZGVdN4v_exp", FIXED(4))

	TLI_DEFINE_VECFUNC("expf", "_ZGVbN4v_expf", 4)			TLI_DEFINE_VECFUNC("expf", "_ZGVbN4v_expf", FIXED(4))
	TLI_DEFINE_VECFUNC("expf", "_ZGVdN8v_expf", 8)			TLI_DEFINE_VECFUNC("expf", "_ZGVdN8v_expf", FIXED(8))

	TLI_DEFINE_VECFUNC("__exp_finite", "_ZGVbN2v___exp_finite", 2)			TLI_DEFINE_VECFUNC("__exp_finite", "_ZGVbN2v___exp_finite", FIXED(2))
	TLI_DEFINE_VECFUNC("__exp_finite", "_ZGVdN4v___exp_finite", 4)			TLI_DEFINE_VECFUNC("__exp_finite", "_ZGVdN4v___exp_finite", FIXED(4))

	TLI_DEFINE_VECFUNC("__expf_finite", "_ZGVbN4v___expf_finite", 4)			TLI_DEFINE_VECFUNC("__expf_finite", "_ZGVbN4v___expf_finite", FIXED(4))
	TLI_DEFINE_VECFUNC("__expf_finite", "_ZGVdN8v___expf_finite", 8)			TLI_DEFINE_VECFUNC("__expf_finite", "_ZGVdN8v___expf_finite", FIXED(8))

	TLI_DEFINE_VECFUNC("llvm.exp.f64", "_ZGVbN2v_exp", 2)			TLI_DEFINE_VECFUNC("llvm.exp.f64", "_ZGVbN2v_exp", FIXED(2))
	TLI_DEFINE_VECFUNC("llvm.exp.f64", "_ZGVdN4v_exp", 4)			TLI_DEFINE_VECFUNC("llvm.exp.f64", "_ZGVdN4v_exp", FIXED(4))

	TLI_DEFINE_VECFUNC("llvm.exp.f32", "_ZGVbN4v_expf", 4)			TLI_DEFINE_VECFUNC("llvm.exp.f32", "_ZGVbN4v_expf", FIXED(4))
	TLI_DEFINE_VECFUNC("llvm.exp.f32", "_ZGVdN8v_expf", 8)			TLI_DEFINE_VECFUNC("llvm.exp.f32", "_ZGVdN8v_expf", FIXED(8))

	TLI_DEFINE_VECFUNC("log", "_ZGVbN2v_log", 2)			TLI_DEFINE_VECFUNC("log", "_ZGVbN2v_log", FIXED(2))
	TLI_DEFINE_VECFUNC("log", "_ZGVdN4v_log", 4)			TLI_DEFINE_VECFUNC("log", "_ZGVdN4v_log", FIXED(4))

	TLI_DEFINE_VECFUNC("logf", "_ZGVbN4v_logf", 4)			TLI_DEFINE_VECFUNC("logf", "_ZGVbN4v_logf", FIXED(4))
	TLI_DEFINE_VECFUNC("logf", "_ZGVdN8v_logf", 8)			TLI_DEFINE_VECFUNC("logf", "_ZGVdN8v_logf", FIXED(8))

	TLI_DEFINE_VECFUNC("__log_finite", "_ZGVbN2v___log_finite", 2)			TLI_DEFINE_VECFUNC("__log_finite", "_ZGVbN2v___log_finite", FIXED(2))
	TLI_DEFINE_VECFUNC("__log_finite", "_ZGVdN4v___log_finite", 4)			TLI_DEFINE_VECFUNC("__log_finite", "_ZGVdN4v___log_finite", FIXED(4))

	TLI_DEFINE_VECFUNC("__logf_finite", "_ZGVbN4v___logf_finite", 4)			TLI_DEFINE_VECFUNC("__logf_finite", "_ZGVbN4v___logf_finite", FIXED(4))
	TLI_DEFINE_VECFUNC("__logf_finite", "_ZGVdN8v___logf_finite", 8)			TLI_DEFINE_VECFUNC("__logf_finite", "_ZGVdN8v___logf_finite", FIXED(8))

	TLI_DEFINE_VECFUNC("llvm.log.f64", "_ZGVbN2v_log", 2)			TLI_DEFINE_VECFUNC("llvm.log.f64", "_ZGVbN2v_log", FIXED(2))
	TLI_DEFINE_VECFUNC("llvm.log.f64", "_ZGVdN4v_log", 4)			TLI_DEFINE_VECFUNC("llvm.log.f64", "_ZGVdN4v_log", FIXED(4))

	TLI_DEFINE_VECFUNC("llvm.log.f32", "_ZGVbN4v_logf", 4)			TLI_DEFINE_VECFUNC("llvm.log.f32", "_ZGVbN4v_logf", FIXED(4))
	TLI_DEFINE_VECFUNC("llvm.log.f32", "_ZGVdN8v_logf", 8)			TLI_DEFINE_VECFUNC("llvm.log.f32", "_ZGVdN8v_logf", FIXED(8))

	#elif defined(TLI_DEFINE_MASSV_VECFUNCS)			#elif defined(TLI_DEFINE_MASSV_VECFUNCS)
	// IBM MASS library's vector Functions			// IBM MASS library's vector Functions

	// Floating-Point Arithmetic and Auxiliary Functions			// Floating-Point Arithmetic and Auxiliary Functions
	TLI_DEFINE_VECFUNC("cbrt", "__cbrtd2_massv", 2)			TLI_DEFINE_VECFUNC("cbrt", "__cbrtd2_massv", FIXED(2))
	TLI_DEFINE_VECFUNC("cbrtf", "__cbrtf4_massv", 4)			TLI_DEFINE_VECFUNC("cbrtf", "__cbrtf4_massv", FIXED(4))
	TLI_DEFINE_VECFUNC("pow", "__powd2_massv", 2)			TLI_DEFINE_VECFUNC("pow", "__powd2_massv", FIXED(2))
	TLI_DEFINE_VECFUNC("llvm.pow.f64", "__powd2_massv", 2)			TLI_DEFINE_VECFUNC("llvm.pow.f64", "__powd2_massv", FIXED(2))
	TLI_DEFINE_VECFUNC("powf", "__powf4_massv", 4)			TLI_DEFINE_VECFUNC("powf", "__powf4_massv", FIXED(4))
	TLI_DEFINE_VECFUNC("llvm.pow.f32", "__powf4_massv", 4)			TLI_DEFINE_VECFUNC("llvm.pow.f32", "__powf4_massv", FIXED(4))
	TLI_DEFINE_VECFUNC("sqrt", "__sqrtd2_massv", 2)			TLI_DEFINE_VECFUNC("sqrt", "__sqrtd2_massv", FIXED(2))
	TLI_DEFINE_VECFUNC("llvm.sqrt.f64", "__sqrtd2_massv", 2)			TLI_DEFINE_VECFUNC("llvm.sqrt.f64", "__sqrtd2_massv", FIXED(2))
	TLI_DEFINE_VECFUNC("sqrtf", "__sqrtf4_massv", 4)			TLI_DEFINE_VECFUNC("sqrtf", "__sqrtf4_massv", FIXED(4))
	TLI_DEFINE_VECFUNC("llvm.sqrt.f32", "__sqrtf4_massv", 4)			TLI_DEFINE_VECFUNC("llvm.sqrt.f32", "__sqrtf4_massv", FIXED(4))

	// Exponential and Logarithmic Functions			// Exponential and Logarithmic Functions
	TLI_DEFINE_VECFUNC("exp", "__expd2_massv", 2)			TLI_DEFINE_VECFUNC("exp", "__expd2_massv", FIXED(2))
	TLI_DEFINE_VECFUNC("llvm.exp.f64", "__expd2_massv", 2)			TLI_DEFINE_VECFUNC("llvm.exp.f64", "__expd2_massv", FIXED(2))
	TLI_DEFINE_VECFUNC("expf", "__expf4_massv", 4)			TLI_DEFINE_VECFUNC("expf", "__expf4_massv", FIXED(4))
	TLI_DEFINE_VECFUNC("llvm.exp.f32", "__expf4_massv", 4)			TLI_DEFINE_VECFUNC("llvm.exp.f32", "__expf4_massv", FIXED(4))
	TLI_DEFINE_VECFUNC("exp2", "__exp2d2_massv", 2)			TLI_DEFINE_VECFUNC("exp2", "__exp2d2_massv", FIXED(2))
	TLI_DEFINE_VECFUNC("llvm.exp2.f64", "__exp2d2_massv", 2)			TLI_DEFINE_VECFUNC("llvm.exp2.f64", "__exp2d2_massv", FIXED(2))
	TLI_DEFINE_VECFUNC("exp2f", "__exp2f4_massv", 4)			TLI_DEFINE_VECFUNC("exp2f", "__exp2f4_massv", FIXED(4))
	TLI_DEFINE_VECFUNC("llvm.exp2.f32", "__exp2f4_massv", 4)			TLI_DEFINE_VECFUNC("llvm.exp2.f32", "__exp2f4_massv", FIXED(4))
	TLI_DEFINE_VECFUNC("expm1", "__expm1d2_massv", 2)			TLI_DEFINE_VECFUNC("expm1", "__expm1d2_massv", FIXED(2))
	TLI_DEFINE_VECFUNC("expm1f", "__expm1f4_massv", 4)			TLI_DEFINE_VECFUNC("expm1f", "__expm1f4_massv", FIXED(4))
	TLI_DEFINE_VECFUNC("log", "__logd2_massv", 2)			TLI_DEFINE_VECFUNC("log", "__logd2_massv", FIXED(2))
	TLI_DEFINE_VECFUNC("llvm.log.f64", "__logd2_massv", 2)			TLI_DEFINE_VECFUNC("llvm.log.f64", "__logd2_massv", FIXED(2))
	TLI_DEFINE_VECFUNC("logf", "__logf4_massv", 4)			TLI_DEFINE_VECFUNC("logf", "__logf4_massv", FIXED(4))
	TLI_DEFINE_VECFUNC("llvm.log.f32", "__logf4_massv", 4)			TLI_DEFINE_VECFUNC("llvm.log.f32", "__logf4_massv", FIXED(4))
	TLI_DEFINE_VECFUNC("log1p", "__log1pd2_massv", 2)			TLI_DEFINE_VECFUNC("log1p", "__log1pd2_massv", FIXED(2))
	TLI_DEFINE_VECFUNC("log1pf", "__log1pf4_massv", 4)			TLI_DEFINE_VECFUNC("log1pf", "__log1pf4_massv", FIXED(4))
	TLI_DEFINE_VECFUNC("log10", "__log10d2_massv", 2)			TLI_DEFINE_VECFUNC("log10", "__log10d2_massv", FIXED(2))
	TLI_DEFINE_VECFUNC("llvm.log10.f64", "__log10d2_massv", 2)			TLI_DEFINE_VECFUNC("llvm.log10.f64", "__log10d2_massv", FIXED(2))
	TLI_DEFINE_VECFUNC("log10f", "__log10f4_massv", 4)			TLI_DEFINE_VECFUNC("log10f", "__log10f4_massv", FIXED(4))
	TLI_DEFINE_VECFUNC("llvm.log10.f32", "__log10f4_massv", 4)			TLI_DEFINE_VECFUNC("llvm.log10.f32", "__log10f4_massv", FIXED(4))
	TLI_DEFINE_VECFUNC("log2", "__log2d2_massv", 2)			TLI_DEFINE_VECFUNC("log2", "__log2d2_massv", FIXED(2))
	TLI_DEFINE_VECFUNC("llvm.log2.f64", "__log2d2_massv", 2)			TLI_DEFINE_VECFUNC("llvm.log2.f64", "__log2d2_massv", FIXED(2))
	TLI_DEFINE_VECFUNC("log2f", "__log2f4_massv", 4)			TLI_DEFINE_VECFUNC("log2f", "__log2f4_massv", FIXED(4))
	TLI_DEFINE_VECFUNC("llvm.log2.f32", "__log2f4_massv", 4)			TLI_DEFINE_VECFUNC("llvm.log2.f32", "__log2f4_massv", FIXED(4))

	// Trigonometric Functions			// Trigonometric Functions
	TLI_DEFINE_VECFUNC("sin", "__sind2_massv", 2)			TLI_DEFINE_VECFUNC("sin", "__sind2_massv", FIXED(2))
	TLI_DEFINE_VECFUNC("llvm.sin.f64", "__sind2_massv", 2)			TLI_DEFINE_VECFUNC("llvm.sin.f64", "__sind2_massv", FIXED(2))
	TLI_DEFINE_VECFUNC("sinf", "__sinf4_massv", 4)			TLI_DEFINE_VECFUNC("sinf", "__sinf4_massv", FIXED(4))
	TLI_DEFINE_VECFUNC("llvm.sin.f32", "__sinf4_massv", 4)			TLI_DEFINE_VECFUNC("llvm.sin.f32", "__sinf4_massv", FIXED(4))
	TLI_DEFINE_VECFUNC("cos", "__cosd2_massv", 2)			TLI_DEFINE_VECFUNC("cos", "__cosd2_massv", FIXED(2))
	TLI_DEFINE_VECFUNC("llvm.cos.f64", "__cosd2_massv", 2)			TLI_DEFINE_VECFUNC("llvm.cos.f64", "__cosd2_massv", FIXED(2))
	TLI_DEFINE_VECFUNC("cosf", "__cosf4_massv", 4)			TLI_DEFINE_VECFUNC("cosf", "__cosf4_massv", FIXED(4))
	TLI_DEFINE_VECFUNC("llvm.cos.f32", "__cosf4_massv", 4)			TLI_DEFINE_VECFUNC("llvm.cos.f32", "__cosf4_massv", FIXED(4))
	TLI_DEFINE_VECFUNC("tan", "__tand2_massv", 2)			TLI_DEFINE_VECFUNC("tan", "__tand2_massv", FIXED(2))
	TLI_DEFINE_VECFUNC("tanf", "__tanf4_massv", 4)			TLI_DEFINE_VECFUNC("tanf", "__tanf4_massv", FIXED(4))
	TLI_DEFINE_VECFUNC("asin", "__asind2_massv", 2)			TLI_DEFINE_VECFUNC("asin", "__asind2_massv", FIXED(2))
	TLI_DEFINE_VECFUNC("asinf", "__asinf4_massv", 4)			TLI_DEFINE_VECFUNC("asinf", "__asinf4_massv", FIXED(4))
	TLI_DEFINE_VECFUNC("acos", "__acosd2_massv", 2)			TLI_DEFINE_VECFUNC("acos", "__acosd2_massv", FIXED(2))
	TLI_DEFINE_VECFUNC("acosf", "__acosf4_massv", 4)			TLI_DEFINE_VECFUNC("acosf", "__acosf4_massv", FIXED(4))
	TLI_DEFINE_VECFUNC("atan", "__atand2_massv", 2)			TLI_DEFINE_VECFUNC("atan", "__atand2_massv", FIXED(2))
	TLI_DEFINE_VECFUNC("atanf", "__atanf4_massv", 4)			TLI_DEFINE_VECFUNC("atanf", "__atanf4_massv", FIXED(4))
	TLI_DEFINE_VECFUNC("atan2", "__atan2d2_massv", 2)			TLI_DEFINE_VECFUNC("atan2", "__atan2d2_massv", FIXED(2))
	TLI_DEFINE_VECFUNC("atan2f", "__atan2f4_massv", 4)			TLI_DEFINE_VECFUNC("atan2f", "__atan2f4_massv", FIXED(4))

	// Hyperbolic Functions			// Hyperbolic Functions
	TLI_DEFINE_VECFUNC("sinh", "__sinhd2_massv", 2)			TLI_DEFINE_VECFUNC("sinh", "__sinhd2_massv", FIXED(2))
	TLI_DEFINE_VECFUNC("sinhf", "__sinhf4_massv", 4)			TLI_DEFINE_VECFUNC("sinhf", "__sinhf4_massv", FIXED(4))
	TLI_DEFINE_VECFUNC("cosh", "__coshd2_massv", 2)			TLI_DEFINE_VECFUNC("cosh", "__coshd2_massv", FIXED(2))
	TLI_DEFINE_VECFUNC("coshf", "__coshf4_massv", 4)			TLI_DEFINE_VECFUNC("coshf", "__coshf4_massv", FIXED(4))
	TLI_DEFINE_VECFUNC("tanh", "__tanhd2_massv", 2)			TLI_DEFINE_VECFUNC("tanh", "__tanhd2_massv", FIXED(2))
	TLI_DEFINE_VECFUNC("tanhf", "__tanhf4_massv", 4)			TLI_DEFINE_VECFUNC("tanhf", "__tanhf4_massv", FIXED(4))
	TLI_DEFINE_VECFUNC("asinh", "__asinhd2_massv", 2)			TLI_DEFINE_VECFUNC("asinh", "__asinhd2_massv", FIXED(2))
	TLI_DEFINE_VECFUNC("asinhf", "__asinhf4_massv", 4)			TLI_DEFINE_VECFUNC("asinhf", "__asinhf4_massv", FIXED(4))
	TLI_DEFINE_VECFUNC("acosh", "__acoshd2_massv", 2)			TLI_DEFINE_VECFUNC("acosh", "__acoshd2_massv", FIXED(2))
	TLI_DEFINE_VECFUNC("acoshf", "__acoshf4_massv", 4)			TLI_DEFINE_VECFUNC("acoshf", "__acoshf4_massv", FIXED(4))
	TLI_DEFINE_VECFUNC("atanh", "__atanhd2_massv", 2)			TLI_DEFINE_VECFUNC("atanh", "__atanhd2_massv", FIXED(2))
	TLI_DEFINE_VECFUNC("atanhf", "__atanhf4_massv", 4)			TLI_DEFINE_VECFUNC("atanhf", "__atanhf4_massv", FIXED(4))


	#elif defined(TLI_DEFINE_SVML_VECFUNCS)			#elif defined(TLI_DEFINE_SVML_VECFUNCS)
	// Intel SVM library's Vector Functions			// Intel SVM library's Vector Functions

	TLI_DEFINE_VECFUNC("sin", "__svml_sin2", 2)			TLI_DEFINE_VECFUNC("sin", "__svml_sin2", FIXED(2))
	TLI_DEFINE_VECFUNC("sin", "__svml_sin4", 4)			TLI_DEFINE_VECFUNC("sin", "__svml_sin4", FIXED(4))
	TLI_DEFINE_VECFUNC("sin", "__svml_sin8", 8)			TLI_DEFINE_VECFUNC("sin", "__svml_sin8", FIXED(8))

	TLI_DEFINE_VECFUNC("sinf", "__svml_sinf4", 4)			TLI_DEFINE_VECFUNC("sinf", "__svml_sinf4", FIXED(4))
	TLI_DEFINE_VECFUNC("sinf", "__svml_sinf8", 8)			TLI_DEFINE_VECFUNC("sinf", "__svml_sinf8", FIXED(8))
	TLI_DEFINE_VECFUNC("sinf", "__svml_sinf16", 16)			TLI_DEFINE_VECFUNC("sinf", "__svml_sinf16", FIXED(16))

	TLI_DEFINE_VECFUNC("llvm.sin.f64", "__svml_sin2", 2)			TLI_DEFINE_VECFUNC("llvm.sin.f64", "__svml_sin2", FIXED(2))
	TLI_DEFINE_VECFUNC("llvm.sin.f64", "__svml_sin4", 4)			TLI_DEFINE_VECFUNC("llvm.sin.f64", "__svml_sin4", FIXED(4))
	TLI_DEFINE_VECFUNC("llvm.sin.f64", "__svml_sin8", 8)			TLI_DEFINE_VECFUNC("llvm.sin.f64", "__svml_sin8", FIXED(8))

	TLI_DEFINE_VECFUNC("llvm.sin.f32", "__svml_sinf4", 4)			TLI_DEFINE_VECFUNC("llvm.sin.f32", "__svml_sinf4", FIXED(4))
	TLI_DEFINE_VECFUNC("llvm.sin.f32", "__svml_sinf8", 8)			TLI_DEFINE_VECFUNC("llvm.sin.f32", "__svml_sinf8", FIXED(8))
	TLI_DEFINE_VECFUNC("llvm.sin.f32", "__svml_sinf16", 16)			TLI_DEFINE_VECFUNC("llvm.sin.f32", "__svml_sinf16", FIXED(16))

	TLI_DEFINE_VECFUNC("cos", "__svml_cos2", 2)			TLI_DEFINE_VECFUNC("cos", "__svml_cos2", FIXED(2))
	TLI_DEFINE_VECFUNC("cos", "__svml_cos4", 4)			TLI_DEFINE_VECFUNC("cos", "__svml_cos4", FIXED(4))
	TLI_DEFINE_VECFUNC("cos", "__svml_cos8", 8)			TLI_DEFINE_VECFUNC("cos", "__svml_cos8", FIXED(8))

	TLI_DEFINE_VECFUNC("cosf", "__svml_cosf4", 4)			TLI_DEFINE_VECFUNC("cosf", "__svml_cosf4", FIXED(4))
	TLI_DEFINE_VECFUNC("cosf", "__svml_cosf8", 8)			TLI_DEFINE_VECFUNC("cosf", "__svml_cosf8", FIXED(8))
	TLI_DEFINE_VECFUNC("cosf", "__svml_cosf16", 16)			TLI_DEFINE_VECFUNC("cosf", "__svml_cosf16", FIXED(16))

	TLI_DEFINE_VECFUNC("llvm.cos.f64", "__svml_cos2", 2)			TLI_DEFINE_VECFUNC("llvm.cos.f64", "__svml_cos2", FIXED(2))
	TLI_DEFINE_VECFUNC("llvm.cos.f64", "__svml_cos4", 4)			TLI_DEFINE_VECFUNC("llvm.cos.f64", "__svml_cos4", FIXED(4))
	TLI_DEFINE_VECFUNC("llvm.cos.f64", "__svml_cos8", 8)			TLI_DEFINE_VECFUNC("llvm.cos.f64", "__svml_cos8", FIXED(8))

	TLI_DEFINE_VECFUNC("llvm.cos.f32", "__svml_cosf4", 4)			TLI_DEFINE_VECFUNC("llvm.cos.f32", "__svml_cosf4", FIXED(4))
	TLI_DEFINE_VECFUNC("llvm.cos.f32", "__svml_cosf8", 8)			TLI_DEFINE_VECFUNC("llvm.cos.f32", "__svml_cosf8", FIXED(8))
	TLI_DEFINE_VECFUNC("llvm.cos.f32", "__svml_cosf16", 16)			TLI_DEFINE_VECFUNC("llvm.cos.f32", "__svml_cosf16", FIXED(16))

	TLI_DEFINE_VECFUNC("pow", "__svml_pow2", 2)			TLI_DEFINE_VECFUNC("pow", "__svml_pow2", FIXED(2))
	TLI_DEFINE_VECFUNC("pow", "__svml_pow4", 4)			TLI_DEFINE_VECFUNC("pow", "__svml_pow4", FIXED(4))
	TLI_DEFINE_VECFUNC("pow", "__svml_pow8", 8)			TLI_DEFINE_VECFUNC("pow", "__svml_pow8", FIXED(8))

	TLI_DEFINE_VECFUNC("powf", "__svml_powf4", 4)			TLI_DEFINE_VECFUNC("powf", "__svml_powf4", FIXED(4))
	TLI_DEFINE_VECFUNC("powf", "__svml_powf8", 8)			TLI_DEFINE_VECFUNC("powf", "__svml_powf8", FIXED(8))
	TLI_DEFINE_VECFUNC("powf", "__svml_powf16", 16)			TLI_DEFINE_VECFUNC("powf", "__svml_powf16", FIXED(16))

	TLI_DEFINE_VECFUNC("__pow_finite", "__svml_pow2", 2)			TLI_DEFINE_VECFUNC("__pow_finite", "__svml_pow2", FIXED(2))
	TLI_DEFINE_VECFUNC("__pow_finite", "__svml_pow4", 4)			TLI_DEFINE_VECFUNC("__pow_finite", "__svml_pow4", FIXED(4))
	TLI_DEFINE_VECFUNC("__pow_finite", "__svml_pow8", 8)			TLI_DEFINE_VECFUNC("__pow_finite", "__svml_pow8", FIXED(8))

	TLI_DEFINE_VECFUNC("__powf_finite", "__svml_powf4", 4)			TLI_DEFINE_VECFUNC("__powf_finite", "__svml_powf4", FIXED(4))
	TLI_DEFINE_VECFUNC("__powf_finite", "__svml_powf8", 8)			TLI_DEFINE_VECFUNC("__powf_finite", "__svml_powf8", FIXED(8))
	TLI_DEFINE_VECFUNC("__powf_finite", "__svml_powf16", 16)			TLI_DEFINE_VECFUNC("__powf_finite", "__svml_powf16", FIXED(16))

	TLI_DEFINE_VECFUNC("llvm.pow.f64", "__svml_pow2", 2)			TLI_DEFINE_VECFUNC("llvm.pow.f64", "__svml_pow2", FIXED(2))
	TLI_DEFINE_VECFUNC("llvm.pow.f64", "__svml_pow4", 4)			TLI_DEFINE_VECFUNC("llvm.pow.f64", "__svml_pow4", FIXED(4))
	TLI_DEFINE_VECFUNC("llvm.pow.f64", "__svml_pow8", 8)			TLI_DEFINE_VECFUNC("llvm.pow.f64", "__svml_pow8", FIXED(8))

	TLI_DEFINE_VECFUNC("llvm.pow.f32", "__svml_powf4", 4)			TLI_DEFINE_VECFUNC("llvm.pow.f32", "__svml_powf4", FIXED(4))
	TLI_DEFINE_VECFUNC("llvm.pow.f32", "__svml_powf8", 8)			TLI_DEFINE_VECFUNC("llvm.pow.f32", "__svml_powf8", FIXED(8))
	TLI_DEFINE_VECFUNC("llvm.pow.f32", "__svml_powf16", 16)			TLI_DEFINE_VECFUNC("llvm.pow.f32", "__svml_powf16", FIXED(16))

	TLI_DEFINE_VECFUNC("exp", "__svml_exp2", 2)			TLI_DEFINE_VECFUNC("exp", "__svml_exp2", FIXED(2))
	TLI_DEFINE_VECFUNC("exp", "__svml_exp4", 4)			TLI_DEFINE_VECFUNC("exp", "__svml_exp4", FIXED(4))
	TLI_DEFINE_VECFUNC("exp", "__svml_exp8", 8)			TLI_DEFINE_VECFUNC("exp", "__svml_exp8", FIXED(8))

	TLI_DEFINE_VECFUNC("expf", "__svml_expf4", 4)			TLI_DEFINE_VECFUNC("expf", "__svml_expf4", FIXED(4))
	TLI_DEFINE_VECFUNC("expf", "__svml_expf8", 8)			TLI_DEFINE_VECFUNC("expf", "__svml_expf8", FIXED(8))
	TLI_DEFINE_VECFUNC("expf", "__svml_expf16", 16)			TLI_DEFINE_VECFUNC("expf", "__svml_expf16", FIXED(16))

	TLI_DEFINE_VECFUNC("__exp_finite", "__svml_exp2", 2)			TLI_DEFINE_VECFUNC("__exp_finite", "__svml_exp2", FIXED(2))
	TLI_DEFINE_VECFUNC("__exp_finite", "__svml_exp4", 4)			TLI_DEFINE_VECFUNC("__exp_finite", "__svml_exp4", FIXED(4))
	TLI_DEFINE_VECFUNC("__exp_finite", "__svml_exp8", 8)			TLI_DEFINE_VECFUNC("__exp_finite", "__svml_exp8", FIXED(8))

	TLI_DEFINE_VECFUNC("__expf_finite", "__svml_expf4", 4)			TLI_DEFINE_VECFUNC("__expf_finite", "__svml_expf4", FIXED(4))
	TLI_DEFINE_VECFUNC("__expf_finite", "__svml_expf8", 8)			TLI_DEFINE_VECFUNC("__expf_finite", "__svml_expf8", FIXED(8))
	TLI_DEFINE_VECFUNC("__expf_finite", "__svml_expf16", 16)			TLI_DEFINE_VECFUNC("__expf_finite", "__svml_expf16", FIXED(16))

	TLI_DEFINE_VECFUNC("llvm.exp.f64", "__svml_exp2", 2)			TLI_DEFINE_VECFUNC("llvm.exp.f64", "__svml_exp2", FIXED(2))
	TLI_DEFINE_VECFUNC("llvm.exp.f64", "__svml_exp4", 4)			TLI_DEFINE_VECFUNC("llvm.exp.f64", "__svml_exp4", FIXED(4))
	TLI_DEFINE_VECFUNC("llvm.exp.f64", "__svml_exp8", 8)			TLI_DEFINE_VECFUNC("llvm.exp.f64", "__svml_exp8", FIXED(8))

	TLI_DEFINE_VECFUNC("llvm.exp.f32", "__svml_expf4", 4)			TLI_DEFINE_VECFUNC("llvm.exp.f32", "__svml_expf4", FIXED(4))
	TLI_DEFINE_VECFUNC("llvm.exp.f32", "__svml_expf8", 8)			TLI_DEFINE_VECFUNC("llvm.exp.f32", "__svml_expf8", FIXED(8))
	TLI_DEFINE_VECFUNC("llvm.exp.f32", "__svml_expf16", 16)			TLI_DEFINE_VECFUNC("llvm.exp.f32", "__svml_expf16", FIXED(16))

	TLI_DEFINE_VECFUNC("log", "__svml_log2", 2)			TLI_DEFINE_VECFUNC("log", "__svml_log2", FIXED(2))
	TLI_DEFINE_VECFUNC("log", "__svml_log4", 4)			TLI_DEFINE_VECFUNC("log", "__svml_log4", FIXED(4))
	TLI_DEFINE_VECFUNC("log", "__svml_log8", 8)			TLI_DEFINE_VECFUNC("log", "__svml_log8", FIXED(8))

	TLI_DEFINE_VECFUNC("logf", "__svml_logf4", 4)			TLI_DEFINE_VECFUNC("logf", "__svml_logf4", FIXED(4))
	TLI_DEFINE_VECFUNC("logf", "__svml_logf8", 8)			TLI_DEFINE_VECFUNC("logf", "__svml_logf8", FIXED(8))
	TLI_DEFINE_VECFUNC("logf", "__svml_logf16", 16)			TLI_DEFINE_VECFUNC("logf", "__svml_logf16", FIXED(16))

	TLI_DEFINE_VECFUNC("__log_finite", "__svml_log2", 2)			TLI_DEFINE_VECFUNC("__log_finite", "__svml_log2", FIXED(2))
	TLI_DEFINE_VECFUNC("__log_finite", "__svml_log4", 4)			TLI_DEFINE_VECFUNC("__log_finite", "__svml_log4", FIXED(4))
	TLI_DEFINE_VECFUNC("__log_finite", "__svml_log8", 8)			TLI_DEFINE_VECFUNC("__log_finite", "__svml_log8", FIXED(8))

	TLI_DEFINE_VECFUNC("__logf_finite", "__svml_logf4", 4)			TLI_DEFINE_VECFUNC("__logf_finite", "__svml_logf4", FIXED(4))
	TLI_DEFINE_VECFUNC("__logf_finite", "__svml_logf8", 8)			TLI_DEFINE_VECFUNC("__logf_finite", "__svml_logf8", FIXED(8))
	TLI_DEFINE_VECFUNC("__logf_finite", "__svml_logf16", 16)			TLI_DEFINE_VECFUNC("__logf_finite", "__svml_logf16", FIXED(16))

	TLI_DEFINE_VECFUNC("llvm.log.f64", "__svml_log2", 2)			TLI_DEFINE_VECFUNC("llvm.log.f64", "__svml_log2", FIXED(2))
	TLI_DEFINE_VECFUNC("llvm.log.f64", "__svml_log4", 4)			TLI_DEFINE_VECFUNC("llvm.log.f64", "__svml_log4", FIXED(4))
	TLI_DEFINE_VECFUNC("llvm.log.f64", "__svml_log8", 8)			TLI_DEFINE_VECFUNC("llvm.log.f64", "__svml_log8", FIXED(8))

	TLI_DEFINE_VECFUNC("llvm.log.f32", "__svml_logf4", 4)			TLI_DEFINE_VECFUNC("llvm.log.f32", "__svml_logf4", FIXED(4))
	TLI_DEFINE_VECFUNC("llvm.log.f32", "__svml_logf8", 8)			TLI_DEFINE_VECFUNC("llvm.log.f32", "__svml_logf8", FIXED(8))
	TLI_DEFINE_VECFUNC("llvm.log.f32", "__svml_logf16", 16)			TLI_DEFINE_VECFUNC("llvm.log.f32", "__svml_logf16", FIXED(16))

	TLI_DEFINE_VECFUNC("log2", "__svml_log22", 2)			TLI_DEFINE_VECFUNC("log2", "__svml_log22", FIXED(2))
	TLI_DEFINE_VECFUNC("log2", "__svml_log24", 4)			TLI_DEFINE_VECFUNC("log2", "__svml_log24", FIXED(4))
	TLI_DEFINE_VECFUNC("log2", "__svml_log28", 8)			TLI_DEFINE_VECFUNC("log2", "__svml_log28", FIXED(8))

	TLI_DEFINE_VECFUNC("log2f", "__svml_log2f4", 4)			TLI_DEFINE_VECFUNC("log2f", "__svml_log2f4", FIXED(4))
	TLI_DEFINE_VECFUNC("log2f", "__svml_log2f8", 8)			TLI_DEFINE_VECFUNC("log2f", "__svml_log2f8", FIXED(8))
	TLI_DEFINE_VECFUNC("log2f", "__svml_log2f16", 16)			TLI_DEFINE_VECFUNC("log2f", "__svml_log2f16", FIXED(16))

	TLI_DEFINE_VECFUNC("__log2_finite", "__svml_log22", 2)			TLI_DEFINE_VECFUNC("__log2_finite", "__svml_log22", FIXED(2))
	TLI_DEFINE_VECFUNC("__log2_finite", "__svml_log24", 4)			TLI_DEFINE_VECFUNC("__log2_finite", "__svml_log24", FIXED(4))
	TLI_DEFINE_VECFUNC("__log2_finite", "__svml_log28", 8)			TLI_DEFINE_VECFUNC("__log2_finite", "__svml_log28", FIXED(8))

	TLI_DEFINE_VECFUNC("__log2f_finite", "__svml_log2f4", 4)			TLI_DEFINE_VECFUNC("__log2f_finite", "__svml_log2f4", FIXED(4))
	TLI_DEFINE_VECFUNC("__log2f_finite", "__svml_log2f8", 8)			TLI_DEFINE_VECFUNC("__log2f_finite", "__svml_log2f8", FIXED(8))
	TLI_DEFINE_VECFUNC("__log2f_finite", "__svml_log2f16", 16)			TLI_DEFINE_VECFUNC("__log2f_finite", "__svml_log2f16", FIXED(16))

	TLI_DEFINE_VECFUNC("llvm.log2.f64", "__svml_log22", 2)			TLI_DEFINE_VECFUNC("llvm.log2.f64", "__svml_log22", FIXED(2))
	TLI_DEFINE_VECFUNC("llvm.log2.f64", "__svml_log24", 4)			TLI_DEFINE_VECFUNC("llvm.log2.f64", "__svml_log24", FIXED(4))
	TLI_DEFINE_VECFUNC("llvm.log2.f64", "__svml_log28", 8)			TLI_DEFINE_VECFUNC("llvm.log2.f64", "__svml_log28", FIXED(8))

	TLI_DEFINE_VECFUNC("llvm.log2.f32", "__svml_log2f4", 4)			TLI_DEFINE_VECFUNC("llvm.log2.f32", "__svml_log2f4", FIXED(4))
	TLI_DEFINE_VECFUNC("llvm.log2.f32", "__svml_log2f8", 8)			TLI_DEFINE_VECFUNC("llvm.log2.f32", "__svml_log2f8", FIXED(8))
	TLI_DEFINE_VECFUNC("llvm.log2.f32", "__svml_log2f16", 16)			TLI_DEFINE_VECFUNC("llvm.log2.f32", "__svml_log2f16", FIXED(16))

	TLI_DEFINE_VECFUNC("log10", "__svml_log102", 2)			TLI_DEFINE_VECFUNC("log10", "__svml_log102", FIXED(2))
	TLI_DEFINE_VECFUNC("log10", "__svml_log104", 4)			TLI_DEFINE_VECFUNC("log10", "__svml_log104", FIXED(4))
	TLI_DEFINE_VECFUNC("log10", "__svml_log108", 8)			TLI_DEFINE_VECFUNC("log10", "__svml_log108", FIXED(8))

	TLI_DEFINE_VECFUNC("log10f", "__svml_log10f4", 4)			TLI_DEFINE_VECFUNC("log10f", "__svml_log10f4", FIXED(4))
	TLI_DEFINE_VECFUNC("log10f", "__svml_log10f8", 8)			TLI_DEFINE_VECFUNC("log10f", "__svml_log10f8", FIXED(8))
	TLI_DEFINE_VECFUNC("log10f", "__svml_log10f16", 16)			TLI_DEFINE_VECFUNC("log10f", "__svml_log10f16", FIXED(16))

	TLI_DEFINE_VECFUNC("__log10_finite", "__svml_log102", 2)			TLI_DEFINE_VECFUNC("__log10_finite", "__svml_log102", FIXED(2))
	TLI_DEFINE_VECFUNC("__log10_finite", "__svml_log104", 4)			TLI_DEFINE_VECFUNC("__log10_finite", "__svml_log104", FIXED(4))
	TLI_DEFINE_VECFUNC("__log10_finite", "__svml_log108", 8)			TLI_DEFINE_VECFUNC("__log10_finite", "__svml_log108", FIXED(8))

	TLI_DEFINE_VECFUNC("__log10f_finite", "__svml_log10f4", 4)			TLI_DEFINE_VECFUNC("__log10f_finite", "__svml_log10f4", FIXED(4))
	TLI_DEFINE_VECFUNC("__log10f_finite", "__svml_log10f8", 8)			TLI_DEFINE_VECFUNC("__log10f_finite", "__svml_log10f8", FIXED(8))
	TLI_DEFINE_VECFUNC("__log10f_finite", "__svml_log10f16", 16)			TLI_DEFINE_VECFUNC("__log10f_finite", "__svml_log10f16", FIXED(16))

	TLI_DEFINE_VECFUNC("llvm.log10.f64", "__svml_log102", 2)			TLI_DEFINE_VECFUNC("llvm.log10.f64", "__svml_log102", FIXED(2))
	TLI_DEFINE_VECFUNC("llvm.log10.f64", "__svml_log104", 4)			TLI_DEFINE_VECFUNC("llvm.log10.f64", "__svml_log104", FIXED(4))
	TLI_DEFINE_VECFUNC("llvm.log10.f64", "__svml_log108", 8)			TLI_DEFINE_VECFUNC("llvm.log10.f64", "__svml_log108", FIXED(8))

	TLI_DEFINE_VECFUNC("llvm.log10.f32", "__svml_log10f4", 4)			TLI_DEFINE_VECFUNC("llvm.log10.f32", "__svml_log10f4", FIXED(4))
	TLI_DEFINE_VECFUNC("llvm.log10.f32", "__svml_log10f8", 8)			TLI_DEFINE_VECFUNC("llvm.log10.f32", "__svml_log10f8", FIXED(8))
	TLI_DEFINE_VECFUNC("llvm.log10.f32", "__svml_log10f16", 16)			TLI_DEFINE_VECFUNC("llvm.log10.f32", "__svml_log10f16", FIXED(16))

	TLI_DEFINE_VECFUNC("sqrt", "__svml_sqrt2", 2)			TLI_DEFINE_VECFUNC("sqrt", "__svml_sqrt2", FIXED(2))
	TLI_DEFINE_VECFUNC("sqrt", "__svml_sqrt4", 4)			TLI_DEFINE_VECFUNC("sqrt", "__svml_sqrt4", FIXED(4))
	TLI_DEFINE_VECFUNC("sqrt", "__svml_sqrt8", 8)			TLI_DEFINE_VECFUNC("sqrt", "__svml_sqrt8", FIXED(8))

	TLI_DEFINE_VECFUNC("sqrtf", "__svml_sqrtf4", 4)			TLI_DEFINE_VECFUNC("sqrtf", "__svml_sqrtf4", FIXED(4))
	TLI_DEFINE_VECFUNC("sqrtf", "__svml_sqrtf8", 8)			TLI_DEFINE_VECFUNC("sqrtf", "__svml_sqrtf8", FIXED(8))
	TLI_DEFINE_VECFUNC("sqrtf", "__svml_sqrtf16", 16)			TLI_DEFINE_VECFUNC("sqrtf", "__svml_sqrtf16", FIXED(16))

	TLI_DEFINE_VECFUNC("__sqrt_finite", "__svml_sqrt2", 2)			TLI_DEFINE_VECFUNC("__sqrt_finite", "__svml_sqrt2", FIXED(2))
	TLI_DEFINE_VECFUNC("__sqrt_finite", "__svml_sqrt4", 4)			TLI_DEFINE_VECFUNC("__sqrt_finite", "__svml_sqrt4", FIXED(4))
	TLI_DEFINE_VECFUNC("__sqrt_finite", "__svml_sqrt8", 8)			TLI_DEFINE_VECFUNC("__sqrt_finite", "__svml_sqrt8", FIXED(8))

	TLI_DEFINE_VECFUNC("__sqrtf_finite", "__svml_sqrtf4", 4)			TLI_DEFINE_VECFUNC("__sqrtf_finite", "__svml_sqrtf4", FIXED(4))
	TLI_DEFINE_VECFUNC("__sqrtf_finite", "__svml_sqrtf8", 8)			TLI_DEFINE_VECFUNC("__sqrtf_finite", "__svml_sqrtf8", FIXED(8))
	TLI_DEFINE_VECFUNC("__sqrtf_finite", "__svml_sqrtf16", 16)			TLI_DEFINE_VECFUNC("__sqrtf_finite", "__svml_sqrtf16", FIXED(16))

	TLI_DEFINE_VECFUNC("exp2", "__svml_exp22", 2)			TLI_DEFINE_VECFUNC("exp2", "__svml_exp22", FIXED(2))
	TLI_DEFINE_VECFUNC("exp2", "__svml_exp24", 4)			TLI_DEFINE_VECFUNC("exp2", "__svml_exp24", FIXED(4))
	TLI_DEFINE_VECFUNC("exp2", "__svml_exp28", 8)			TLI_DEFINE_VECFUNC("exp2", "__svml_exp28", FIXED(8))

	TLI_DEFINE_VECFUNC("exp2f", "__svml_exp2f4", 4)			TLI_DEFINE_VECFUNC("exp2f", "__svml_exp2f4", FIXED(4))
	TLI_DEFINE_VECFUNC("exp2f", "__svml_exp2f8", 8)			TLI_DEFINE_VECFUNC("exp2f", "__svml_exp2f8", FIXED(8))
	TLI_DEFINE_VECFUNC("exp2f", "__svml_exp2f16", 16)			TLI_DEFINE_VECFUNC("exp2f", "__svml_exp2f16", FIXED(16))

	TLI_DEFINE_VECFUNC("llvm.exp2.f64", "__svml_exp22", 2)			TLI_DEFINE_VECFUNC("llvm.exp2.f64", "__svml_exp22", FIXED(2))
	TLI_DEFINE_VECFUNC("llvm.exp2.f64", "__svml_exp24", 4)			TLI_DEFINE_VECFUNC("llvm.exp2.f64", "__svml_exp24", FIXED(4))
	TLI_DEFINE_VECFUNC("llvm.exp2.f64", "__svml_exp28", 8)			TLI_DEFINE_VECFUNC("llvm.exp2.f64", "__svml_exp28", FIXED(8))

	TLI_DEFINE_VECFUNC("llvm.exp2.f32", "__svml_exp2f4", 4)			TLI_DEFINE_VECFUNC("llvm.exp2.f32", "__svml_exp2f4", FIXED(4))
	TLI_DEFINE_VECFUNC("llvm.exp2.f32", "__svml_exp2f8", 8)			TLI_DEFINE_VECFUNC("llvm.exp2.f32", "__svml_exp2f8", FIXED(8))
	TLI_DEFINE_VECFUNC("llvm.exp2.f32", "__svml_exp2f16", 16)			TLI_DEFINE_VECFUNC("llvm.exp2.f32", "__svml_exp2f16", FIXED(16))

	TLI_DEFINE_VECFUNC("__exp2_finite", "__svml_exp22", 2)			TLI_DEFINE_VECFUNC("__exp2_finite", "__svml_exp22", FIXED(2))
	TLI_DEFINE_VECFUNC("__exp2_finite", "__svml_exp24", 4)			TLI_DEFINE_VECFUNC("__exp2_finite", "__svml_exp24", FIXED(4))
	TLI_DEFINE_VECFUNC("__exp2_finite", "__svml_exp28", 8)			TLI_DEFINE_VECFUNC("__exp2_finite", "__svml_exp28", FIXED(8))

	TLI_DEFINE_VECFUNC("__exp2f_finite", "__svml_exp2f4", 4)			TLI_DEFINE_VECFUNC("__exp2f_finite", "__svml_exp2f4", FIXED(4))
	TLI_DEFINE_VECFUNC("__exp2f_finite", "__svml_exp2f8", 8)			TLI_DEFINE_VECFUNC("__exp2f_finite", "__svml_exp2f8", FIXED(8))
	TLI_DEFINE_VECFUNC("__exp2f_finite", "__svml_exp2f16", 16)			TLI_DEFINE_VECFUNC("__exp2f_finite", "__svml_exp2f16", FIXED(16))

	#else			#else
	#error "Must choose which vector library functions are to be defined."			#error "Must choose which vector library functions are to be defined."
	#endif			#endif

	#undef TLI_DEFINE_VECFUNC			#undef TLI_DEFINE_VECFUNC
	#undef TLI_DEFINE_ACCELERATE_VECFUNCS			#undef TLI_DEFINE_ACCELERATE_VECFUNCS
	#undef TLI_DEFINE_LIBMVEC_X86_VECFUNCS			#undef TLI_DEFINE_LIBMVEC_X86_VECFUNCS
	#undef TLI_DEFINE_MASSV_VECFUNCS			#undef TLI_DEFINE_MASSV_VECFUNCS
	#undef TLI_DEFINE_SVML_VECFUNCS			#undef TLI_DEFINE_SVML_VECFUNCS
	#undef TLI_DEFINE_MASSV_VECFUNCS_NAMES			#undef TLI_DEFINE_MASSV_VECFUNCS_NAMES

llvm/include/llvm/Support/TypeSize.h

	class ElementCount;			class ElementCount;
	template <> struct LinearPolyBaseTypeTraits<ElementCount> {			template <> struct LinearPolyBaseTypeTraits<ElementCount> {
	using ScalarTy = unsigned;			using ScalarTy = unsigned;
	static constexpr unsigned Dimensions = 2;			static constexpr unsigned Dimensions = 2;
	};			};

	class ElementCount : public LinearPolySize<ElementCount> {			class ElementCount : public LinearPolySize<ElementCount> {
	public:			public:
				ElementCount() : LinearPolySize(LinearPolySize::getNull()) {}

	ElementCount(const LinearPolySize<ElementCount> &V) : LinearPolySize(V) {}			ElementCount(const LinearPolySize<ElementCount> &V) : LinearPolySize(V) {}

	/// Counting predicates.			/// Counting predicates.
	///			///
	///@{ Number of elements..			///@{ Number of elements..
	/// Exactly one element.			/// Exactly one element.
	bool isScalar() const { return !isScalable() && getKnownMinValue() == 1; }			bool isScalar() const { return !isScalable() && getKnownMinValue() == 1; }
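The new default constructor lets callers declare an `ElementCount` before passing it to an out-parameter interface. A rough sketch of the intended behaviour follows, using a simplified stand-in type rather than the real class; it assumes the null value is a fixed count of zero, which is therefore neither scalable nor scalar.

```cpp
#include <cassert>

// Simplified stand-in for llvm::ElementCount (assumption, not the real class).
struct ElementCount {
  unsigned MinVal;
  bool Scalable;

  // Mirrors the added default constructor: assumed to yield a fixed zero count.
  ElementCount() : MinVal(0), Scalable(false) {}
  ElementCount(unsigned N, bool S) : MinVal(N), Scalable(S) {}

  static ElementCount getFixed(unsigned N) { return ElementCount(N, false); }
  static ElementCount getScalable(unsigned N) { return ElementCount(N, true); }

  bool isScalable() const { return Scalable; }
  unsigned getKnownMinValue() const { return MinVal; }
  bool isZero() const { return MinVal == 0; }

  // Same predicate as in the diff: exactly one element and not scalable.
  bool isScalar() const { return !isScalable() && getKnownMinValue() == 1; }
};
```

Note that `<vscale x 1 x ElTy>` is not a scalar under this predicate, which is why the patch uses a scalable count of zero, not one, to mean "no scalable variant".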

llvm/lib/Analysis/TargetLibraryInfo.cpp

bool TargetLibraryInfoImpl::isFunctionVectorizable(StringRef funcName) const {		bool TargetLibraryInfoImpl::isFunctionVectorizable(StringRef funcName) const {
if (funcName.empty())		if (funcName.empty())
return false;		return false;

std::vector<VecDesc>::const_iterator I =		std::vector<VecDesc>::const_iterator I =
llvm::lower_bound(VectorDescs, funcName, compareWithScalarFnName);		llvm::lower_bound(VectorDescs, funcName, compareWithScalarFnName);
return I != VectorDescs.end() && StringRef(I->ScalarFnName) == funcName;		return I != VectorDescs.end() && StringRef(I->ScalarFnName) == funcName;
}		}

StringRef TargetLibraryInfoImpl::getVectorizedFunction(StringRef F,		StringRef
unsigned VF) const {		TargetLibraryInfoImpl::getVectorizedFunction(StringRef F,
		const ElementCount &VF) const {
		sdesmalen: nit: `const ElementCount &`
F = sanitizeFunctionName(F);		F = sanitizeFunctionName(F);
if (F.empty())		if (F.empty())
return F;		return F;
std::vector<VecDesc>::const_iterator I =		std::vector<VecDesc>::const_iterator I =
llvm::lower_bound(VectorDescs, F, compareWithScalarFnName);		llvm::lower_bound(VectorDescs, F, compareWithScalarFnName);
while (I != VectorDescs.end() && StringRef(I->ScalarFnName) == F) {		while (I != VectorDescs.end() && StringRef(I->ScalarFnName) == F) {
if (I->VectorizationFactor == VF)		if (I->VectorizationFactor == VF)
return I->VectorFnName;		return I->VectorFnName;
Show All 37 Lines

// Register the basic pass.		// Register the basic pass.
INITIALIZE_PASS(TargetLibraryInfoWrapperPass, "targetlibinfo",		INITIALIZE_PASS(TargetLibraryInfoWrapperPass, "targetlibinfo",
"Target Library Information", false, true)		"Target Library Information", false, true)
char TargetLibraryInfoWrapperPass::ID = 0;		char TargetLibraryInfoWrapperPass::ID = 0;

void TargetLibraryInfoWrapperPass::anchor() {}		void TargetLibraryInfoWrapperPass::anchor() {}

unsigned TargetLibraryInfoImpl::getWidestVF(StringRef ScalarF) const {		void TargetLibraryInfoImpl::getWidestVF(StringRef ScalarF,
		ElementCount &FixedVF,
		ElementCount &ScalableVF) const {
ScalarF = sanitizeFunctionName(ScalarF);		ScalarF = sanitizeFunctionName(ScalarF);
		// Use '0' here because a type of the form <vscale x 1 x ElTy> is not the
		// same as a scalar.
		ScalableVF = ElementCount::getScalable(0);
		FixedVF = ElementCount::getFixed(1);
		sdesmalen: Can you define these as ElementCount (and change the interface to take an ElementCount instead of unsigned), and do the initialization before the call to getWidestVF?
		david-arm: Hi @sdesmalen, I did that initially but then I thought it seemed a bit unnecessary to have them as ElementCounts because the interface clearly defines one as fixed and the other as scalable. I can change them back to ElementCount; it just felt like we were doing redundant extra work, that's all.
		sdesmalen: What is the redundant extra work? The only difference is in where the initialization is done. Now all uses of FixedVF coming from `getWidestVF` have to write explicit ElementCount::getFixed(). Just FYI, I had to solve a similar problem for returning max-VFs in D96025, for which I added a new class `OptionalVFCandidates`, and in a subsequent patch (not yet posted) added a function that generates all power-of-two VFs up to the max-VFs.
if (ScalarF.empty())		if (ScalarF.empty())
return 1;		return;

unsigned VF = 1;
std::vector<VecDesc>::const_iterator I =		std::vector<VecDesc>::const_iterator I =
llvm::lower_bound(VectorDescs, ScalarF, compareWithScalarFnName);		llvm::lower_bound(VectorDescs, ScalarF, compareWithScalarFnName);
while (I != VectorDescs.end() && StringRef(I->ScalarFnName) == ScalarF) {		while (I != VectorDescs.end() && StringRef(I->ScalarFnName) == ScalarF) {
if (I->VectorizationFactor > VF)		ElementCount *VF =
VF = I->VectorizationFactor;		I->VectorizationFactor.isScalable() ? &ScalableVF : &FixedVF;
		if (ElementCount::isKnownGT(I->VectorizationFactor, *VF))
		*VF = I->VectorizationFactor;
		sdesmalen: nit: `ElementCount *Ptr = VF.isScalable() ? &ScalableVF : &FixedVF; if (ElementCount::isKnownGT(VF, *Ptr)) *Ptr = VF;`
++I;		++I;
}		}
return VF;
}		}
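The reworked `getWidestVF` tracks two maxima at once by pointing at whichever output matches the entry's kind. A self-contained sketch of that selection logic is given below. It is not the LLVM implementation: the types are simplified stand-ins, the lookup is a linear scan in place of `llvm::lower_bound` over a sorted table, and `isKnownGT` is reduced to comparing counts of the same kind only.

```cpp
#include <cassert>
#include <string>
#include <vector>

// Simplified stand-in for llvm::ElementCount (assumption, not the real class).
struct ElementCount {
  unsigned MinVal;
  bool Scalable;
  static ElementCount getFixed(unsigned N) { return {N, false}; }
  static ElementCount getScalable(unsigned N) { return {N, true}; }
  bool isScalable() const { return Scalable; }
  // Reduced comparison: only counts of the same kind are ordered here.
  static bool isKnownGT(const ElementCount &L, const ElementCount &R) {
    return L.Scalable == R.Scalable && L.MinVal > R.MinVal;
  }
};

struct VecDesc {
  std::string ScalarFnName;
  std::string VectorFnName;
  ElementCount VectorizationFactor;
};

// Hypothetical free-function version of getWidestVF using a linear scan.
void getWidestVF(const std::vector<VecDesc> &Descs, const std::string &ScalarF,
                 ElementCount &FixedVF, ElementCount &ScalableVF) {
  // Zero means "no scalable variant"; <vscale x 1 x ElTy> is not a scalar.
  ScalableVF = ElementCount::getScalable(0);
  FixedVF = ElementCount::getFixed(1);
  for (const VecDesc &D : Descs) {
    if (D.ScalarFnName != ScalarF)
      continue;
    // Pick the running maximum that matches this entry's kind.
    ElementCount *VF =
        D.VectorizationFactor.isScalable() ? &ScalableVF : &FixedVF;
    if (ElementCount::isKnownGT(D.VectorizationFactor, *VF))
      *VF = D.VectorizationFactor;
  }
}
```

With only fixed-width entries in the table, as in this patch, the scalable output stays at zero, which matches the summary's note that the widest scalable value is currently zero.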

llvm/lib/Transforms/Utils/InjectTLIMappings.cpp

Show All 34 Lines
STATISTIC(NumCompUsedAdded,		STATISTIC(NumCompUsedAdded,
"Number of `@llvm.compiler.used` operands that have been added.");		"Number of `@llvm.compiler.used` operands that have been added.");

/// A helper function that adds the vector function declaration that		/// A helper function that adds the vector function declaration that
/// vectorizes the CallInst CI with a vectorization factor of VF		/// vectorizes the CallInst CI with a vectorization factor of VF
/// lanes. The TLI assumes that all parameters and the return type of		/// lanes. The TLI assumes that all parameters and the return type of
/// CI (other than void) need to be widened to a VectorType of VF		/// CI (other than void) need to be widened to a VectorType of VF
/// lanes.		/// lanes.
static void addVariantDeclaration(CallInst &CI, const unsigned VF,		static void addVariantDeclaration(CallInst &CI, const ElementCount &VF,
const StringRef VFName) {		const StringRef VFName) {
Module *M = CI.getModule();		Module *M = CI.getModule();

// Add function declaration.		// Add function declaration.
Type *RetTy = ToVectorTy(CI.getType(), VF);		Type *RetTy = ToVectorTy(CI.getType(), VF);
SmallVector<Type *, 4> Tys;		SmallVector<Type *, 4> Tys;
for (Value *ArgOperand : CI.arg_operands())		for (Value *ArgOperand : CI.arg_operands())
Tys.push_back(ToVectorTy(ArgOperand->getType(), VF));		Tys.push_back(ToVectorTy(ArgOperand->getType(), VF));
Show All 31 Lines	static void addMappingsFromTLI(const TargetLibraryInfo &TLI, CallInst &CI) {
// Nothing to be done if the TLI thinks the function is not		// Nothing to be done if the TLI thinks the function is not
// vectorizable.		// vectorizable.
if (!TLI.isFunctionVectorizable(ScalarName))		if (!TLI.isFunctionVectorizable(ScalarName))
return;		return;
SmallVector<std::string, 8> Mappings;		SmallVector<std::string, 8> Mappings;
VFABI::getVectorVariantNames(CI, Mappings);		VFABI::getVectorVariantNames(CI, Mappings);
Module *M = CI.getModule();		Module *M = CI.getModule();
const SetVector<StringRef> OriginalSetOfMappings(Mappings.begin(),		const SetVector<StringRef> OriginalSetOfMappings(Mappings.begin(),
Mappings.end());		Mappings.end());
// All VFs in the TLI are powers of 2.
for (unsigned VF = 2, WidestVF = TLI.getWidestVF(ScalarName); VF <= WidestVF;		auto AddVariantDecl = [&](const ElementCount &VF) {
VF *= 2) {
const std::string TLIName =		const std::string TLIName =
		sdesmalen: nit: can you move this closer to the for loop?
std::string(TLI.getVectorizedFunction(ScalarName, VF));		std::string(TLI.getVectorizedFunction(ScalarName, VF));
if (!TLIName.empty()) {		if (!TLIName.empty()) {
std::string MangledName = VFABI::mangleTLIVectorName(		std::string MangledName = VFABI::mangleTLIVectorName(
TLIName, ScalarName, CI.getNumArgOperands(),		TLIName, ScalarName, CI.getNumArgOperands(), VF);
ElementCount::getFixed(VF));
if (!OriginalSetOfMappings.count(MangledName)) {		if (!OriginalSetOfMappings.count(MangledName)) {
Mappings.push_back(MangledName);		Mappings.push_back(MangledName);
++NumCallInjected;		++NumCallInjected;
}		}
Function *VariantF = M->getFunction(TLIName);		Function *VariantF = M->getFunction(TLIName);
if (!VariantF)		if (!VariantF)
addVariantDeclaration(CI, VF, TLIName);		addVariantDeclaration(CI, VF, TLIName);
}		}
}		};

		// All VFs in the TLI are powers of 2.
		sdesmalen: nit: unnecessary curly braces.
		ElementCount WidestFixedVF, WidestScalableVF;
		TLI.getWidestVF(ScalarName, WidestFixedVF, WidestScalableVF);

		for (ElementCount VF = ElementCount::getFixed(2);
		ElementCount::isKnownLE(VF, WidestFixedVF); VF *= 2)
		sdesmalen: nit: s/unsupported/not yet supported/
		AddVariantDecl(VF);

		// TODO: Add scalable variants once we're able to test them.
		assert(WidestScalableVF.isZero() &&
		"Scalable vector mappings not yet supported");

VFABI::setVectorVariantNames(&CI, Mappings);		VFABI::setVectorVariantNames(&CI, Mappings);
		sdesmalen: This needs an assert to make sure WidestScalableVF is not set, so that it is not silently ignoring any scalable entries.
}		}

static bool runImpl(const TargetLibraryInfo &TLI, Function &F) {		static bool runImpl(const TargetLibraryInfo &TLI, Function &F) {
for (auto &I : instructions(F))		for (auto &I : instructions(F))
if (auto CI = dyn_cast<CallInst>(&I))		if (auto CI = dyn_cast<CallInst>(&I))
addMappingsFromTLI(TLI, *CI);		addMappingsFromTLI(TLI, *CI);
// Even if the pass adds IR attributes, the analyses are preserved.		// Even if the pass adds IR attributes, the analyses are preserved.
return false;		return false;

llvm/lib/Transforms/Vectorize/LoopVectorizationLegality.cpp

	/// const VecDesc VecIntrinsics[] = {			/// const VecDesc VecIntrinsics[] = {
	/// {"llvm.phx.abs.i32", "", 4}			/// {"llvm.phx.abs.i32", "", 4}
	/// };			/// };
	static bool isTLIScalarize(const TargetLibraryInfo &TLI, const CallInst &CI) {			static bool isTLIScalarize(const TargetLibraryInfo &TLI, const CallInst &CI) {
	const StringRef ScalarName = CI.getCalledFunction()->getName();			const StringRef ScalarName = CI.getCalledFunction()->getName();
	bool Scalarize = TLI.isFunctionVectorizable(ScalarName);			bool Scalarize = TLI.isFunctionVectorizable(ScalarName);
	// Check that all known VFs are not associated to a vector			// Check that all known VFs are not associated to a vector
	// function, i.e. the vector name is emty.			// function, i.e. the vector name is emty.
	if (Scalarize)			if (Scalarize) {
	for (unsigned VF = 2, WidestVF = TLI.getWidestVF(ScalarName);			ElementCount WidestFixedVF, WidestScalableVF;
				sdesmalen: Should we just add a constructor that takes no arguments and does the same as getNull?
	VF <= WidestVF; VF *= 2) {			TLI.getWidestVF(ScalarName, WidestFixedVF, WidestScalableVF);
				for (ElementCount VF = ElementCount::getFixed(2);
				ElementCount::isKnownLE(VF, WidestFixedVF); VF *= 2)
	Scalarize &= !TLI.isFunctionVectorizable(ScalarName, VF);			Scalarize &= !TLI.isFunctionVectorizable(ScalarName, VF);
				for (ElementCount VF = ElementCount::getScalable(1);
				ElementCount::isKnownLE(VF, WidestScalableVF); VF *= 2)
				Scalarize &= !TLI.isFunctionVectorizable(ScalarName, VF);
				assert((WidestScalableVF.isZero() || !Scalarize) &&
				"Caller may decide to scalarize a variant using a scalable VF");
				sdesmalen: This can probably return false directly when WidestScalableVF is > 0, because there is no way to scalarize for scalable VFs.
				david-arm: I thought about that as well when I first did the patch, but I don't think that's right. This function is not asking if something should be scalarised, but instead asking if there is at least one variant of the function that is vectorisable. If we return false here when WidestScalableVF > 0 then that's a bug I think, because that means we're saying we can definitely vectorise at least one function, which may not be true. As an alternative I could add an assert like this: `assert(WidestScalableVF.isZero() || !Scalarize);` Also, returning true from this function doesn't mean we are going to scalarise using a scalable VF, even if widest fixed and scalable VFs are both non-zero. It just means that no vectorisable variants could be found and therefore the caller must decide what to do and which variant to scalarise: fixed or scalable?
	}			}
				sdesmalen: This should be `ElementCount::getScalable(1)` because `<vscale x 1 x <eltty>>` may well be a valid type for some scalable vector architectures. For SVE we don't want loops to be vectorized with this VF because it is expensive to legalize, but we shouldn't work with that assumption for generic code.
	return Scalarize;			return Scalarize;
	}			}
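Both `addMappingsFromTLI` and `isTLIScalarize` now walk power-of-two candidates with `ElementCount::isKnownLE` and `VF *= 2` where they previously compared plain integers. The sketch below shows that iteration pattern in isolation. The `ElementCount` type is a simplified stand-in, `candidateVFs` is a hypothetical helper that does not exist in LLVM, and `isKnownLE` is reduced to comparing counts of the same kind.

```cpp
#include <cassert>
#include <vector>

// Simplified stand-in for llvm::ElementCount (assumption, not the real class).
struct ElementCount {
  unsigned MinVal;
  bool Scalable;
  static ElementCount getFixed(unsigned N) { return {N, false}; }
  static ElementCount getScalable(unsigned N) { return {N, true}; }
  ElementCount &operator*=(unsigned RHS) {
    MinVal *= RHS;
    return *this;
  }
  // Reduced comparison: only counts of the same kind are ordered here.
  static bool isKnownLE(const ElementCount &L, const ElementCount &R) {
    return L.Scalable == R.Scalable && L.MinVal <= R.MinVal;
  }
};

// Hypothetical helper: every candidate Start, 2*Start, 4*Start, ... that is
// known to be no larger than Widest.
std::vector<ElementCount> candidateVFs(ElementCount Start,
                                       const ElementCount &Widest) {
  assert(Start.MinVal != 0 && "a zero start would never advance");
  std::vector<ElementCount> Out;
  for (ElementCount VF = Start; ElementCount::isKnownLE(VF, Widest); VF *= 2)
    Out.push_back(VF);
  return Out;
}
```

Starting the scalable walk at `getScalable(1)` against a widest value of `getScalable(0)` yields no candidates, so the scalable loop in `isTLIScalarize` is a no-op until scalable entries are added to the table.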

	bool LoopVectorizationLegality::canVectorizeInstrs() {			bool LoopVectorizationLegality::canVectorizeInstrs() {
	BasicBlock *Header = TheLoop->getHeader();			BasicBlock *Header = TheLoop->getHeader();

	// For each block in the loop.			// For each block in the loop.
	for (BasicBlock *BB : TheLoop->blocks()) {			for (BasicBlock *BB : TheLoop->blocks()) {