This is an archive of the discontinued LLVM Phabricator instance.

Added modf for NVPTX and AMDGPU targets to implement 'libmgpu.a' for math on the GPU
Abandoned · Public

Authored by AntonRydahl on Jun 9 2023, 11:53 AM.

Details

Summary

This patch is a follow-up to D152468. It replaces calls to modf with __nv_modf or __ocml_modf_f64. If this approach looks right, I will continue to submit equivalent changes for the other math functions.
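
For illustration, a minimal sketch of what such a vendor wrapper looks like, assuming a libdevice-style declaration of __nv_modf and the LLVM_LIBC_FUNCTION entrypoint macro from the parent patch (treat the exact header path and macro spelling as assumptions):

    #include "src/__support/common.h" // assumed home of LLVM_LIBC_FUNCTION

    // Vendor symbol, resolved against NVIDIA's libdevice at link time.
    extern "C" double __nv_modf(double x, double *iptr);

    // modf splits x into fractional and integral parts; the libc entrypoint
    // simply forwards to the vendor implementation.
    LLVM_LIBC_FUNCTION(double, modf, (double x, double *iptr)) {
      return __nv_modf(x, iptr);
    }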

Diff Detail

Event Timeline

AntonRydahl created this revision. · Jun 9 2023, 11:53 AM
Herald added projects: Restricted Project, Restricted Project. · View Herald Transcript · Jun 9 2023, 11:53 AM
AntonRydahl requested review of this revision. · Jun 9 2023, 11:53 AM

Remember to run clang-format via git clang-format HEAD~1, then git add -A; git commit --amend to update the commit if anything changed.

libc/src/math/gpu/CMakeLists.txt
47 ↗(On Diff #530045)

Don't need to copy this comment everywhere.

libc/src/math/gpu/amdgpu/amdgpu.h
18 ↗(On Diff #530045)

This should be internal; I forgot about that when I updated.

24 ↗(On Diff #530045)

We're not using OpenMP, so these won't work, nor should they be necessary. Address space five maps to local (thread-private) memory, which should be the default here for a variable on the stack. The only reason this was necessary for OpenMP is the memory model OpenMP uses to mimic the CPU: CPU threads can share stack data by default, but GPU threads can't, so by default we don't put things on the stack on the GPU. Since this is compiled directly for the GPU, it should already be stack memory. Ditto, the cast shouldn't be necessary.
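
To make the point concrete, a hypothetical sketch: compiled directly for amdgcn-amd-amdhsa, an ordinary automatic variable already lives in the thread's private stack, so neither an address-space attribute nor a cast should be needed (the plain-pointer signature for __ocml_modf_f64 is an assumption here):

    // Vendor symbol from ROCm's OCML device library.
    extern "C" double __ocml_modf_f64(double x, double *iptr);

    double frac_part(double x) {
      double integral; // plain stack variable; no __attribute__((address_space(5)))
      return __ocml_modf_f64(x, &integral); // and no cast at the call site
    }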

Updating D152575: Added modf for NVPTX and AMDGPU targets to implement 'libmgpu.a' for math on the GPU

The OpenMP specifics have been removed, the namespace changed to internal, and git clang-format has been run.

Make sure to amend your new changes into your last commit and then arc diff. This revision only contains your new edits.

Thanks, Ethan! I will see if I know how to roll it back. Do you think I should use arc diff HEAD~2 --update D152575?

That should work, I think.

This comment was removed by AntonRydahl.

Squashed the two commits into one.

Updated this patch in accordance with the changes to the parent patch, https://reviews.llvm.org/D152486.

This update contains more functions following the same template.

The boilerplate-to-functionality ratio is pretty extreme here. How does the libc subproject feel about code generators?

If we can run Python scripts to generate the code at build time, great.

For an out-of-tree target, I remember generating source code in CMake because people didn't like running Python at build time.

If neither of those is acceptable, perhaps we should do the X-macro thing.
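
For reference, a hypothetical sketch of the X-macro option: one table of (libc name, vendor symbol) pairs that expands into all the unary wrappers at once. The macro names are illustrative, not anything from this patch:

    // Vendor declarations, resolved against libdevice at link time.
    extern "C" double __nv_sin(double);
    extern "C" double __nv_cos(double);

    // One row per function; X decides what each row expands to.
    #define LIBC_GPU_UNARY_MATH(X) \
      X(sin, __nv_sin)             \
      X(cos, __nv_cos)

    // Expand the table into wrapper definitions.
    #define DEFINE_VENDOR_WRAPPER(name, vendor) \
      double name(double x) { return vendor(x); }
    LIBC_GPU_UNARY_MATH(DEFINE_VENDOR_WRAPPER)
    #undef DEFINE_VENDOR_WRAPPER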

I would rather implement the functions added by this patch as builtin wrappers instead of vendor wrappers. See the round example in D152468.
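
For context, the round example referred to is essentially a one-line forward to a compiler builtin; a sketch, with the same assumptions about the entrypoint macro as above:

    #include "src/__support/common.h" // assumed home of LLVM_LIBC_FUNCTION

    // __builtin_round lowers directly to the target's rounding instruction,
    // so no vendor library is involved at all.
    LLVM_LIBC_FUNCTION(double, round, (double x)) {
      return __builtin_round(x);
    }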

> I would rather implement the functions added by this patch as builtin wrappers instead of vendor wrappers. See the round example in D152468.

I think we're missing a canonicalisation opportunity in libm. It's never totally clear whether an optimisation should target a C function with a known name or an intrinsic. There's a fair chance -O0 C++ produces a wrapper that handles the overload under a different name. OpenMP variants introduce another name-mangling scheme. Then we have the __nv_sin / __ocml_cos set. I don't know what Fortran calls it, maybe _sin.

I think we should have an IR intrinsic for each libm function (or at least most of them) and transform the various source names to the intrinsics in the front end, or generally as aggressively as we can. Then optimise them: at least constant fold, but trig identities might be fair game as well. Then lower to whatever mix of libm functions and native instructions the backend sees fit.

Errno is a pain here, but even if errno is not disabled we can still constant-fold the cases that don't set it. Likewise, there are various fast-math flags, which are a mess, but it seems more likely that we can handle them consistently and correctly if it's all localised in one place.

This diff isn't the right venue to propose that; I should probably try the forum that replaced the mailing lists.

On the discussion forum, it has been proposed that we should test which versions perform better on GPUs: https://discourse.llvm.org/t/libm-conformance-and-timing-ci-for-gpus/71362

Maybe that would be a better fit for this discussion.

> I would rather implement the functions added by this patch as builtin wrappers instead of vendor wrappers. See the round example in D152468.

Should I include both the vendor and builtin functions in this patch?

That's a good question. We'd probably like to have both if we want to run performance tests, but I'm not sure that's a good reason to keep them in-tree if we have a suitable alternative. @sivachandra What do you think?

Do you reckon it would be good to keep the vendor definitions as a fallback if LIBC_HAS_BUILTIN doesn't return true?
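
A hypothetical sketch of that fallback, assuming LIBC_HAS_BUILTIN is a thin wrapper over the compiler's __has_builtin check:

    extern "C" double __nv_modf(double, double *); // vendor fallback

    double modf_impl(double x, double *iptr) {
    #if __has_builtin(__builtin_modf)
      return __builtin_modf(x, iptr); // preferred: the compiler builtin
    #else
      return __nv_modf(x, iptr);      // fallback: the vendor library
    #endif
    }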

JonChesterfield added a comment. · Edited · Jun 14 2023, 11:39 AM

The default is a good question. Libm isn't _that_ big a library. Getting the functions right is difficult, but relatively well established by this point; they've been implemented many times. Getting them fast and right is probably always a vendor-specific thing: you need to know the ISA you're targeting and put more effort in.

I think we could reasonably aspire to have something slow and wrong for every function implemented in-tree. I'm personally OK with "return 0" levels of wrong in the first instance, but others may disagree.

If we're aiming at slow+wrong+complete in the first instance, the default can reasonably be to use the libc implementation unless specified otherwise. Vendors will always want to use their own implementations, and that's fine; they can use the same hook provided for users who want to bring their own.

Fun question: do we want to aspire to finer-grained replacement than wholesale? I claim _no_, on the grounds that we won't test every permutation that results, and we shouldn't ship something untested.

(My side interest here is to end up with something that Fortran can use basically unchanged, which is not the case for libm-via-header-files)

libc/src/math/gpu/vendor/nvptx/nvptx.h
19

Why is this stuff all in a header? It means it has to be different for different targets even though the interface is the same.

I was hoping for float remainderf(float x, float y); in a header and the NVPTX-specific implementation in a source file.
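
In other words, something like the following split (file names and namespace are illustrative):

    // math_decls.h -- one target-agnostic interface shared by every target.
    namespace internal {
    float remainderf(float x, float y);
    } // namespace internal

    // nvptx/remainderf.cpp -- the NVPTX-specific definition.
    extern "C" float __nv_remainderf(float, float); // from libdevice
    namespace internal {
    float remainderf(float x, float y) { return __nv_remainderf(x, y); }
    } // namespace internal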

> That's a good question. We'd probably like to have both if we want to run performance tests, but I'm not sure that's a good reason to keep them in-tree if we have a suitable alternative. @sivachandra What do you think?

Per my understanding of the earlier discussion, we will add a vendor wrapper only if the in-tree implementation is not proven to be a good enough replacement for the vendor implementation. If that is correct, I would like to stick to that. You shouldn't need a wrapper in the libc for differential testing.

That said, will it ever be the case that a builtin for a floating-point primitive is not available? If yes, the ideal approach is to "fix the compiler". If that is not practical, we can take up adding vendor wrappers at that time with reduced scope (as in, that option is taken only if the builtin is not available).

> Per my understanding of the earlier discussion, we will add a vendor wrapper only if the in-tree implementation is not proven to be a good enough replacement for the vendor implementation. If that is correct, I would like to stick to that. You shouldn't need a wrapper in the libc for differential testing.

It's not strictly necessary, but it will make it a lot easier if we can simply enable or disable a flag to check the performance delta, either in the implementation itself or in an application. My proposal is that we implement all the vendor targets at once to get a mostly complete facade of a libm.a that we can test. This will mostly just be copying the existing headers at https://github.com/llvm/llvm-project/blob/main/clang/lib/Headers/__clang_cuda_math.h and https://github.com/llvm/llvm-project/blob/main/clang/lib/Headers/__clang_hip_math.h. The expectation is that the default order of preference will be native GPU implementation > generic implementation > vendor implementation, with an extra configuration variable. That is, since we have a native implementation of sinf, we will only use the vendor version of sinf if the user specified it in some list like LIBC_GPU_VENDOR_MATH=sinf. The only burden with this approach is carrying around a few extra entrypoints that are only enabled if specified by the user, but in exchange we get something that's functional immediately and can be incrementally improved. However, I do think that anything that can be replaced with a builtin shouldn't be considered a 'vendor' implementation and can simply be placed in the regular source.
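
A hypothetical sketch of that selection order for sinf, where a per-function macro would be derived from a build list such as LIBC_GPU_VENDOR_MATH=sinf (all names below are illustrative):

    extern "C" float __nv_sinf(float); // vendor symbol, used only on request

    namespace internal {
    float generic_sinf(float x); // assumed in-tree implementation
    } // namespace internal

    float sinf_impl(float x) {
    #ifdef LIBC_GPU_VENDOR_SINF         // defined when the user opts in
      return __nv_sinf(x);              // vendor implementation
    #else
      return internal::generic_sinf(x); // native/generic default
    #endif
    }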

> That said, will it ever be the case that a builtin for a floating-point primitive is not available? If yes, the ideal approach is to "fix the compiler". If that is not practical, we can take up adding vendor wrappers at that time with reduced scope (as in, that option is taken only if the builtin is not available).

Yes, fixing the compiler will be the preferred approach here. Since we *only* build the GPU libraries with an up-to-date clang we can always patch the compiler in parallel with the GPU implementation.

Thanks for being understanding with this whole project. The GPU implementation brings in a lot of non-standard requirements, but I think it's a compelling project, so I'm glad you've stuck with it thus far.

sivachandra added a comment. · Edited · Jun 14 2023, 11:58 PM

> It's not strictly necessary, but it will make it a lot easier if we can simply enable or disable a flag to check the performance delta, either in the implementation itself or in an application.

I would expect that a libc implementation of a GPU math function, even if it is just a builtin wrapper, is being added because it is expected/proven to be either better than or equivalent to the vendor implementation. It is the burden of the GPU libc developer to verify that, and making that convenient is definitely something that can be accommodated in the libc project. We do have a number of such conveniences for comparison against the system libc. But all of them live outside of the libc (as in, there are no alternate wrapper entrypoints); in fact, for many functions the wrapper indirection adds close to 50% overhead, so it is not meaningful to compare against wrappers.

About builtin wrappers: I will be surprised if vendor libraries are doing anything different to get better performance. If that is really the case, the builtins should be fixed.

> My proposal is that we implement all the vendor targets at once to get a mostly complete facade of a libm.a that we can test. This will mostly just be copying the existing headers at https://github.com/llvm/llvm-project/blob/main/clang/lib/Headers/__clang_cuda_math.h and https://github.com/llvm/llvm-project/blob/main/clang/lib/Headers/__clang_hip_math.h. The expectation is that the default order of preference will be native GPU implementation > generic implementation > vendor implementation, with an extra configuration variable. That is, since we have a native implementation of sinf, we will only use the vendor version of sinf if the user specified it in some list like LIBC_GPU_VENDOR_MATH=sinf. The only burden with this approach is carrying around a few extra entrypoints that are only enabled if specified by the user, but in exchange we get something that's functional immediately and can be incrementally improved. However, I do think that anything that can be replaced with a builtin shouldn't be considered a 'vendor' implementation and can simply be placed in the regular source.

I thought we had agreed on this plan already. So, as a first step, add builtin wrappers wherever possible. Next, add vendor wrappers and their libc-implementation alternates when available. Then comes the long tail of actually comparing/verifying/improving the libc implementations to make them the default. I would really like it if the libc implementations were the default to begin with, but as a practical decision in these initial stages we will make the vendor wrappers the default. But vendor wrappers should be added only if there are no builtins that implement the corresponding floating-point operations.

> I would expect that a libc implementation of a GPU math function, even if it is just a builtin wrapper, is being added because it is expected/proven to be either better than or equivalent to the vendor implementation. It is the burden of the GPU libc developer to verify that, and making that convenient is definitely something that can be accommodated in the libc project. We do have a number of such conveniences for comparison against the system libc. But all of them live outside of the libc (as in, there are no alternate wrapper entrypoints); in fact, for many functions the wrapper indirection adds close to 50% overhead, so it is not meaningful to compare against wrappers.

That's reasonable; we can bias towards in-tree implementations as long as the performance is somewhat on par.

> About builtin wrappers: I will be surprised if vendor libraries are doing anything different to get better performance. If that is really the case, the builtins should be fixed.

So, the vendor libraries are presented as LLVM IR. This means they will ultimately use the same intrinsics and be compiled by the exact same compiler, so there should be no difference. We should probably scan for any vendor implementation that can be replaced with one of the __nvvm or __builtin functions. However, we should keep in mind that the floating-point behavior of these functions will probably diverge somewhat from libc's correct-rounding guarantee; we'll need to say as much when we document this.

> I thought we had agreed on this plan already. So, as a first step, add builtin wrappers wherever possible. Next, add vendor wrappers and their libc-implementation alternates when available. Then comes the long tail of actually comparing/verifying/improving the libc implementations to make them the default. I would really like it if the libc implementations were the default to begin with, but as a practical decision in these initial stages we will make the vendor wrappers the default. But vendor wrappers should be added only if there are no builtins that implement the corresponding floating-point operations.

Yes, that's very reasonable. There's no reason to even compare the performance if it just lowers to an intrinsic, as I mentioned above. So we should add all vendor functions except the ones that can be trivially recreated without calling into the accompanying library.

I will make a new patch where I test whether the __builtin math functions compile for --target=amdgcn-amd-amdhsa and --target=nvptx64-nvidia-cuda. If a builtin compiles successfully, I will add a __builtin wrapper; if not, I will add __ocml and __nv wrappers. Does that sound good to you, @sivachandra?
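
A minimal probe for that test might look like the following; compile it for each triple and check whether the builtin lowers inline or leaves a call to an undefined libm symbol (the exact invocation is an assumption):

    // builtin_probe.cpp -- e.g.
    //   clang++ --target=nvptx64-nvidia-cuda -S -o - builtin_probe.cpp
    //   clang++ --target=amdgcn-amd-amdhsa  -S -o - builtin_probe.cpp
    // A leftover call to an undefined 'round' or 'modf' symbol means the
    // builtin is not usable on that target.
    double probe_round(double x) { return __builtin_round(x); }
    double probe_modf(double x, double *iptr) { return __builtin_modf(x, iptr); }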

> I will make a new patch where I test whether the __builtin math functions compile for --target=amdgcn-amd-amdhsa and --target=nvptx64-nvidia-cuda. If a builtin compiles successfully, I will add a __builtin wrapper; if not, I will add __ocml and __nv wrappers. Does that sound good to you, @sivachandra?

SGTM

AntonRydahl abandoned this revision. · Jul 27 2023, 10:16 AM