This patch fixes an issue in which the __clang_cuda_cmath.h header was being included even when neither cmath nor math.h was included.
Details
- Reviewers: jdoerfert, ABataev, hfinkel, caomhin, tra
- Commits:
  - rZORGc2401f8391f0: [OpenMP][Clang][BugFix] Split declares and math functions inclusion.
  - rZORGe082bc297abe: [OpenMP][Clang][BugFix] Split declares and math functions inclusion.
  - rGc2401f8391f0: [OpenMP][Clang][BugFix] Split declares and math functions inclusion.
  - rGe082bc297abe: [OpenMP][Clang][BugFix] Split declares and math functions inclusion.
  - rG946957189d6b: [OpenMP][Clang][BugFix] Split declares and math functions inclusion.
  - rL360626: [OpenMP][Clang][BugFix] Split declares and math functions inclusion.
  - rC360626: [OpenMP][Clang][BugFix] Split declares and math functions inclusion.
Diff Detail
- Repository: rC Clang
Event Timeline
This always includes the declare file but not the define file, correct?
Could we have 4 tests that are compiled in target mode with:

```
// with and without math.h/cmath (clang/clang++)
#include <math.h>
long abs(long __i) { return (__i < 0 ? -__i : __i); }
```
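For illustration, one of the four variants might look roughly like the following; the file name and compile line are assumptions, not part of this review:

```
// A rough sketch of one of the four requested variants: user code that
// defines its own abs(long), compiled in target mode with something like
//   clang++ -fopenmp -fopenmp-targets=nvptx64-nvidia-cuda abs_overload.cpp
// The other variants drop the math.h include and/or compile as C with clang.
#include <math.h>

long abs(long __i) { return (__i < 0 ? -__i : __i); }

int main() { return (int)abs(-3L) - 3; }  // expect exit code 0
```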
lib/Headers/openmp_wrappers/__clang_openmp_math_declares.h:18
I thought the "not defining abs" in __clang_cuda_math_forward_declares.h was the solution?
lib/Headers/__clang_cuda_math_forward_declares.h:30–37
I'm not quite sure what the idea is here. It may be worth adding a comment. It could also be expressed somewhat more simply:

```
#if !(defined(_OPENMP) && defined(__cplusplus))
...
#endif
```
lib/Headers/openmp_wrappers/__clang_openmp_math_declares.h:11
You may want to add include guards. I'd also make inclusion of the file in a non-OpenMP compilation an error, if it makes sense for OpenMP. It does for CUDA.
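A minimal sketch of what that could look like, assuming the guard-macro name and error text (the actual header may differ):

```
#ifndef __CLANG_OPENMP_MATH_DECLARES_H__
#define __CLANG_OPENMP_MATH_DECLARES_H__

/* Refuse to be included in anything other than an OpenMP compilation. */
#ifndef _OPENMP
#error "This file is for OpenMP compilation only."
#endif

/* ... forward declarations of the device math functions go here ... */

#endif /* __CLANG_OPENMP_MATH_DECLARES_H__ */
```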
lib/Headers/__clang_cuda_math_forward_declares.h:30–37
When these two function definitions are here or in the __clang_cuda_cmath.h header, I get the following error (adapted for the __clang_cuda_cmath.h case):

```
/usr/lib/gcc/ppc64le-redhat-linux/4.8.5/../../../../include/c++/4.8.5/cstdlib:166:3: error: declaration conflicts with target of using declaration already in scope
  abs(long __i) { return __builtin_labs(__i); }
  ^
/autofs/home/gbercea/patch-compiler/obj-release/lib/clang/9.0.0/include/__clang_cuda_cmath.h:40:17: note: target of using declaration
__DEVICE__ long abs(long __n) { return ::labs(__n); }
                ^
/usr/lib/gcc/ppc64le-redhat-linux/4.8.5/../../../../include/c++/4.8.5/cstdlib:122:11: note: using declaration
  using ::abs;
          ^
/usr/lib/gcc/ppc64le-redhat-linux/4.8.5/../../../../include/c++/4.8.5/cstdlib:174:3: error: declaration conflicts with target of using declaration already in scope
  abs(long long __x) { return __builtin_llabs (__x); }
  ^
/autofs/home/gbercea/patch-compiler/obj-release/lib/clang/9.0.0/include/__clang_cuda_cmath.h:39:22: note: target of using declaration
__DEVICE__ long long abs(long long __n) { return ::llabs(__n); }
                     ^
/usr/lib/gcc/ppc64le-redhat-linux/4.8.5/../../../../include/c++/4.8.5/cstdlib:122:11: note: using declaration
  using ::abs;
```
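For reference, the diagnostic does not need the wrapper headers at all; a plain host C++ sketch along the following lines (the names are stand-ins, not the real declarations) runs into the same rule:

```
// Once ::abs(long) has been pulled into namespace std by a using-declaration,
// a direct declaration of std::abs(long) is ill-formed. That is what
// libstdc++'s <cstdlib> runs into when __clang_cuda_cmath.h has already
// declared a plain (non-__device__) ::abs(long).
long abs(long __n);    // stands in for __clang_cuda_cmath.h's declaration

namespace std {
using ::abs;           // stands in for <cstdlib>'s using ::abs;
long abs(long __i);    // stands in for libstdc++'s std::abs(long): error here
} // namespace std
```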
Last issue I have (in addition to the check @tra suggested) is the order in which we include math.h and cstdlib. Can you flip it in one of the test cases?
lib/Headers/__clang_cuda_math_forward_declares.h:30–37
Long story short: we currently cannot use the overload trick through __device__, and therefore *replace* (rather than augment) the host math headers with the CUDA versions, which unfortunately mix std math functions with other functions that we don't want/need.
lib/Headers/openmp_wrappers/__clang_openmp_math_declares.h:11
That is something we should be able to do (error out if _OPENMP is not defined, I mean).
lib/Headers/__clang_cuda_math_forward_declares.h:30–37
This doesn't seem to be happening in the CUDA case. My suspicion is that it's because of the device attribute.
lib/Headers/__clang_cuda_math_forward_declares.h:30–37
It looks like, until OpenMP supports some sort of target-based overloading, this will not play nicely with libstdc++.
lib/Headers/__clang_cuda_math_forward_declares.h:30–37
Correct. __device__ functions overload whatever (implicitly __host__) functions are declared by the standard library, so they coexist without problems. Usually. Host/device implementation nuances are still observable.
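For readers following along, a minimal CUDA sketch of that overload trick (an illustration, not the actual wrapper headers); it relies on clang's target-based overloading, which nvcc does not accept:

```
#include <cstdio>

// Host and device overloads with the same signature coexist; clang picks the
// candidate that matches the side of the compilation. This is the mechanism
// that lets the CUDA wrapper headers add __device__ math overloads next to
// the host library's declarations instead of replacing them.
__host__ long twice(long n) { return 2 * n; }
__device__ long twice(long n) { return n + n; }

__global__ void kernel(long *out) {
  *out = twice(21);                 // resolves to the __device__ overload
}

int main() {
  std::printf("%ld\n", twice(21));  // resolves to the __host__ overload
  return 0;
}
```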
lib/Headers/__clang_cuda_math_forward_declares.h:30–37
Just did a few quick tests with libstdc++ and it was all good.
lib/Headers/__clang_cuda_math_forward_declares.h:30–37
How about libc++? The idea is to make sure the change works with both libraries.
Two small changes and then it is fine with me. @tra?
- We need to use an ifdef to not define clock (see the sketch after this list).
- Can you switch the include order in test/Headers/nvptx_device_math_functions.cpp?
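A hypothetical illustration of the clock point; the guard macro and the declaration below are assumptions rather than the actual patch:

```
#include <time.h>   /* for clock_t in this standalone sketch */

/* Only declare the device-side clock() for a CUDA compilation, so that an
   OpenMP compilation reusing this header does not collide with the host's
   clock() from <time.h>. */
#ifdef __CUDA__
__device__ clock_t clock();
#endif
```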
P.S. I'm currently at the OpenMP standard meeting to get the OpenMP variants fixed.
Once done, we should prioritize the implementation.
Excluding non-math functions in the CUDA headers is not perfect...
As soon as libc++ is used, the limits header included at __clang_cuda_cmath.h:15 is not found:

```
__clang_cuda_cmath.h:15:10: fatal error: 'limits' file not found
#include <limits>
```

Not even CUDA works, actually, so I'm not sure what the best answer to this problem is.
Could you give me more details on how you got this error?
If this change breaks CUDA compilation with libc++, that's going to be a problem. Currently, CUDA and the clang headers we ship do work with both libc++ and a few versions of libstdc++.
E.g.: http://lab.llvm.org:8011/builders/clang-cuda-build/builds/33364/steps/ninja%20build%20simple%20CUDA%20tests/logs/stdio
It's an error on my side: I don't have libc++ installed, so trying to use it comes up with header-not-found errors.
LGTM for CUDA. I'll leave the question of testing with libc++ to someone more familiar with OpenMP.
LGTM from my side. I don't have strong feelings about testing libc++ now, though it is probably a good idea to have such a testbed.
I agree this should not interfere with CUDA (at least that is our intention).
P.S.
A ticket to add the OpenMP variant support we need (similar to __device__ in CUDA) will be written by Tom Scogland and me, for a first vote already at this meeting.
That is what we will then need to implement.