This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/runtimes/
-
runtimes/
4/8
CMakeLists.txt
-
openmp/libomptarget/deviceRTLs/nvptx/
-
libomptarget/
-
deviceRTLs/
-
nvptx/
2/5
CMakeLists.txt

Differential D101663

[OpenMP] Avoid unintentional use of host compiler as bclib compiler.
AbandonedPublic

Authored by Meinersbur on Apr 30 2021, 1:50 PM.

Download Raw Diff

Details

Reviewers

tianshilei1992
jdoerfert
protze.joachim

Summary

Before this patch, libomptarget assumed that if the host compiler is clang, then it is also suitable for compile CUDA to LLVM-IR. The intention was that if libomptarget is compiled using LLVM_ENABLE_RUMTIMES=openmp, the host compiler will be clang from the same propository.

However, the host compiler may also be clang when compiling as a LLVM_ENABLE_PROJECTS=openmp or standalone build if the user explicitly set the tool chain or, clang is the system's default compiler (MacOS, most BSDs). Various vendor compilers such as Apple's clang, Intel's dpcpp/icx, IBM's xlclang, etc. may be identified as 'clang' by CMake. However, these potentially use an incompatible IR, or stripped CUDA support, with unpredictable results.

Fix by explicitly set LIBOMPTARGET_NVPTX_CUDA_COMPILER (and LIBOMPTARGET_NVPTX_BC_LINKER) when using LLVM_ENABLE_RUMTIMES=openmp and do not try to use the host compiler.

Implementation of discussion in D101265. Compilation result: http://meinersbur.de:8011/#/builders/143/builds/375

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

Meinersbur created this revision.Apr 30 2021, 1:50 PM

Herald added subscribers: guansong, yaxunl, mgorny. · View Herald TranscriptApr 30 2021, 1:50 PM

Meinersbur requested review of this revision.Apr 30 2021, 1:50 PM

Herald added a project: Restricted Project. · View Herald TranscriptApr 30 2021, 1:50 PM

Herald added subscribers: llvm-commits, sstefan1. · View Herald Transcript

tianshilei1992 requested changes to this revision.Apr 30 2021, 1:58 PM

tianshilei1992 added inline comments.

llvm/runtimes/CMakeLists.txt
238	I don't think it's a good idea to "pollute" LLVM CMake files for this purpose. There are plenty of ways to tell whether OpenMP is built via `LLVM_ENABLE_RUNTIMES`. I'd set the two CMake variables in OpenMP by checking whether we're in `LLVM_ENABLE_RUNTIMES`.
openmp/libomptarget/deviceRTLs/nvptx/CMakeLists.txt
36	Removing this can cause issue if I compile OpenMP standalone. We cannot assume people all compile OpenMP along with LLVM either with `LLVM_ENABLE_RUNTIMES` or `LLVM_ENABLE_PROJECTS`. Like I said in your previous patch, we need a mechanism to check whether the provided `clang` is qualified.

This revision now requires changes to proceed.Apr 30 2021, 1:58 PM

Harbormaster completed remote builds in B102011: Diff 342024.Apr 30 2021, 3:58 PM

Meinersbur added inline comments.May 3 2021, 7:52 AM

llvm/runtimes/CMakeLists.txt
238	Adding project-specific options is already done for `COMPILER_RT`. This seems to be the established approach.
openmp/libomptarget/deviceRTLs/nvptx/CMakeLists.txt
36	As mentioned in the summary, automatically using the host compiler may result in unpredictable LLVM-IR that e.g. include vendor extensions. That is, the C/C++ to host-assembly compiler is just the wrong tool for CUDA-to-LLVM-IR compilation. I think the only clang we would want to support is the clang from that same git commit.

Meinersbur added inline comments.May 3 2021, 8:04 AM

llvm/runtimes/CMakeLists.txt
238	Btw, PASSTHROUGH_PREFIXES, will pass all `OPENMP_` options (even those that are left to their defaults) to the nested CMake configuration. It will not do so with `LIBOMPTARGET_NVPTX_CUDA_COMPILER` because the prefix for the openmp project is assumed to be `OPENMP_` and `LIBOMPTARGET_` is missing.

tianshilei1992 added inline comments.May 3 2021, 11:22 AM

llvm/runtimes/CMakeLists.txt
238	I mean, you could do something in CUDA's CMake file in the following way (pseudo code): if (BUILD_VIA_RUNTIME) set(LIBOMPTARGET_NVPTX_CUDA_COMPILER target clang) set(BC_LINKER target llvm-link) endif() where `BUILD_VIA_RUNTIME` can be detected. You don't have to do that in LLVM's CMake file. Btw, PASSTHROUGH_PREFIXES, will pass all OPENMP_ options (even those that are left to their defaults) to the nested CMake configuration. It will not do so with LIBOMPTARGET_NVPTX_CUDA_COMPILER because the prefix for the openmp project is assumed to be OPENMP_ and LIBOMPTARGET_ is missing. This is not true. See line 178 at `llvm/runtimes/CMakeLists.txt`.
openmp/libomptarget/deviceRTLs/nvptx/CMakeLists.txt
36	That doesn't make sense. It's fine that if building in-tree, use the one from same commit, but it should never be the only way. Again (I have said that for three times), we need to check if the clang is qualified. Users can of course set `LIBOMPTARGET_NVPTX_CUDA_COMPILER` to a random clang. This change doesn't solve the root problem: is the compiler qualified? What we really need is, no matter where the compiler is from (it can be the host compiler detected by CMake, it can also be the one specified by users), check its qualification before use it.

Meinersbur added inline comments.May 5 2021, 8:19 AM

llvm/runtimes/CMakeLists.txt
238	How would `BUILD_VIA_RUNTIME` be detected? `target clang`/`target llvm-link` are not targets in the runtimes build CMake configurations. I think setting `LIBOMPTARGET_NVPTX_CUDA_COMPILER` is exactly the right option because this is the variable a user would need to set if they want a standalone build without using `LLVM_ENABLE_RUNTIMES`. Not additional magic needed. This is not true. See line 178 at llvm/runtimes/CMakeLists.txt. Correct, did not see that. Makes me wonder why this kind if pollution OK, but setting `LIBOMPTARGET_NVPTX_CUDA_COMPILER` is not.
openmp/libomptarget/deviceRTLs/nvptx/CMakeLists.txt
36	The user sets `LIBOMPTARGET_NVPTX_CUDA_COMPILER` manually, it is their responsibility. If they do not specify `LIBOMPTARGET_NVPTX_CUDA_COMPILER`, but gets a broken build due to the default being inadequate we cannot blame the user. The only adequacy test I can think of that cannot result in a broken build is to execute `clang --version` and compare it the commit hash.

tianshilei1992 added inline comments.May 5 2021, 10:26 AM

llvm/runtimes/CMakeLists.txt
238	How would `BUILD_VIA_RUNTIME` be detected? `target clang`/`target llvm-link` are not targets in the runtimes build CMake configurations. You could refer to line 7 at `openmp/CMakeLists.txt`. That's what we used to determine if it is standalone build. If `OPENMP_STANDALONE_BUILD` is false, then we could set the two variables accordingly because in that case, it is either in runtime build, or project build, where in both cases you wanna use the in-tree build clang. Makes me wonder why this kind if pollution OK, but setting LIBOMPTARGET_NVPTX_CUDA_COMPILER is not. That changes apply for all OpenMP arguments, `libomp`, `libomptarget`, plugins, device runtimes, tests, you name it. It's not just for the one single NVIDIA device runtime. What's more important, we obviously have a more elegant way to do it in project's own directory to avoid leaking those project specific arguments all over the place.
openmp/libomptarget/deviceRTLs/nvptx/CMakeLists.txt
36	The only adequacy test I can think of that cannot result in a broken build is to execute `clang --version` and compare it the commit hash. That is not sufficient. What if users build LLVM w/o expected targets? You can still get a "valid" clang (in terms of version) and use it for NVPTX, but in fact it doesn't support it at all. We did encounter this once when AMD offloading was enabled by default previously. We need to check the eligibility before including corresponding directories in CMake. That's the ultimate solution. We know our device runtime is using X, Y, Z features. We check whether the compiler can work properly. That's how `autoconfig` works. If in the future new features are being used, we simply update the checker.

josemonsalve2 added a subscriber: josemonsalve2.Jul 29 2021, 11:49 AM

josemonsalve2 added inline comments.

llvm/runtimes/CMakeLists.txt
238	I know this is been a while since this happened. But I was looking at this code and found this revision. I just wanted to add. Even though they have `compiler-rt` specific code, they recognize it is something that would need to be fixed. see `llvm-project/runtimes/CMakeLists.txt:110` or nearby. The comment says: # TODO: compiler-rt has to use standalone build for now. We tried to remove # this in D57992 but this broke the build because compiler-rt assumes that # LLVM and Clang are configured in the same build to set up dependencies. We # should clean up the compiler-rt build and remove this eventually.

Meinersbur added inline comments.Jul 29 2021, 12:37 PM

llvm/runtimes/CMakeLists.txt
238	I still have the opinion that we should not assume that `CMAKE_CXX_COMPILER` (C++-to-`.o`) is also the compiler to cross-compile to `.bc` even more than why `CMAKE_C_COMPILER` is also different from `CMAKE_CXX_COMPILER`, `CMAKE_CUDA_COMPILER` or `CMAKE_CUDA_HOST_COMPILER` (even though they probably are all able to compile C-code to the host target), and you need a different `CMAKE_C_COMPILER` when cross-compiling. What CMake variable specifies what compiler to use as "OpenMP/CUDA-to-bc" compiler is not libomptarget-specific. It could also be named `CMAKE_OPENMP_<TARGET>_CROSSCOMPILER`. Alternatively, the deviceRTL could be built in another CMake build configuration that is configured to cross-compiler to NVPTX (in the same sense that `compiler-rt` is configured to cross-compile to the architecture(s) that just-built Clang targets), but this requires much more effort.

Meinersbur mentioned this in D108534: [OpenMP][Docs] add clang to LLVM_ENABLE_PROJECTS in build instructions.Aug 23 2021, 4:29 PM

Superseded by D111983 which stops using CMAKE_C_COMPILER as device compiler (using clang in LLVM_DIR instead)

Herald added a subscriber: asavonic. · View Herald TranscriptOct 29 2021, 6:15 AM

Revision Contents

Path

Size

llvm/

runtimes/

CMakeLists.txt

4 lines

openmp/

libomptarget/

deviceRTLs/

nvptx/

CMakeLists.txt

9 lines

Diff 342024

llvm/runtimes/CMakeLists.txt

Show First 20 Lines • Show All 229 Lines • ▼ Show 20 Lines	llvm_ExternalProject_Add(runtimes
CMAKE_ARGS -DCOMPILER_RT_BUILD_BUILTINS=Off		CMAKE_ARGS -DCOMPILER_RT_BUILD_BUILTINS=Off
-DLLVM_INCLUDE_TESTS=${LLVM_INCLUDE_TESTS}		-DLLVM_INCLUDE_TESTS=${LLVM_INCLUDE_TESTS}
-DLLVM_DEFAULT_TARGET_TRIPLE=${TARGET_TRIPLE}		-DLLVM_DEFAULT_TARGET_TRIPLE=${TARGET_TRIPLE}
-DLLVM_ENABLE_PROJECTS_USED=${LLVM_ENABLE_PROJECTS_USED}		-DLLVM_ENABLE_PROJECTS_USED=${LLVM_ENABLE_PROJECTS_USED}
-DLLVM_ENABLE_PER_TARGET_RUNTIME_DIR=${LLVM_ENABLE_PER_TARGET_RUNTIME_DIR_default}		-DLLVM_ENABLE_PER_TARGET_RUNTIME_DIR=${LLVM_ENABLE_PER_TARGET_RUNTIME_DIR_default}
-DCMAKE_C_COMPILER_WORKS=ON		-DCMAKE_C_COMPILER_WORKS=ON
-DCMAKE_CXX_COMPILER_WORKS=ON		-DCMAKE_CXX_COMPILER_WORKS=ON
-DCMAKE_ASM_COMPILER_WORKS=ON		-DCMAKE_ASM_COMPILER_WORKS=ON
		-DLIBOMPTARGET_NVPTX_CUDA_COMPILER=$<TARGET_FILE:clang>
		tianshilei1992Unsubmitted Not Done Reply Inline Actions I don't think it's a good idea to "pollute" LLVM CMake files for this purpose. There are plenty of ways to tell whether OpenMP is built via `LLVM_ENABLE_RUNTIMES`. I'd set the two CMake variables in OpenMP by checking whether we're in `LLVM_ENABLE_RUNTIMES`. tianshilei1992: I don't think it's a good idea to "pollute" LLVM CMake files for this purpose. There are plenty…
		MeinersburAuthorUnsubmitted Done Reply Inline Actions Adding project-specific options is already done for `COMPILER_RT`. This seems to be the established approach. Meinersbur: Adding project-specific options is already done for `COMPILER_RT`. This seems to be the…
		MeinersburAuthorUnsubmitted Done Reply Inline Actions Btw, PASSTHROUGH_PREFIXES, will pass all `OPENMP_` options (even those that are left to their defaults) to the nested CMake configuration. It will not do so with `LIBOMPTARGET_NVPTX_CUDA_COMPILER` because the prefix for the openmp project is assumed to be `OPENMP_` and `LIBOMPTARGET_` is missing. Meinersbur: Btw, PASSTHROUGH_PREFIXES, will pass all `OPENMP_` options (even those that are left to their…
		tianshilei1992Unsubmitted Not Done Reply Inline Actions I mean, you could do something in CUDA's CMake file in the following way (pseudo code): if (BUILD_VIA_RUNTIME) set(LIBOMPTARGET_NVPTX_CUDA_COMPILER target clang) set(BC_LINKER target llvm-link) endif() where `BUILD_VIA_RUNTIME` can be detected. You don't have to do that in LLVM's CMake file. Btw, PASSTHROUGH_PREFIXES, will pass all OPENMP_ options (even those that are left to their defaults) to the nested CMake configuration. It will not do so with LIBOMPTARGET_NVPTX_CUDA_COMPILER because the prefix for the openmp project is assumed to be OPENMP_ and LIBOMPTARGET_ is missing. This is not true. See line 178 at `llvm/runtimes/CMakeLists.txt`. tianshilei1992: I mean, you could do something in CUDA's CMake file in the following way (pseudo code): ``` if…
		MeinersburAuthorUnsubmitted Done Reply Inline Actions How would `BUILD_VIA_RUNTIME` be detected? `target clang`/`target llvm-link` are not targets in the runtimes build CMake configurations. I think setting `LIBOMPTARGET_NVPTX_CUDA_COMPILER` is exactly the right option because this is the variable a user would need to set if they want a standalone build without using `LLVM_ENABLE_RUNTIMES`. Not additional magic needed. This is not true. See line 178 at llvm/runtimes/CMakeLists.txt. Correct, did not see that. Makes me wonder why this kind if pollution OK, but setting `LIBOMPTARGET_NVPTX_CUDA_COMPILER` is not. Meinersbur: How would `BUILD_VIA_RUNTIME` be detected? `target clang`/`target llvm-link` are not targets in…
		tianshilei1992Unsubmitted Not Done Reply Inline Actions How would `BUILD_VIA_RUNTIME` be detected? `target clang`/`target llvm-link` are not targets in the runtimes build CMake configurations. You could refer to line 7 at `openmp/CMakeLists.txt`. That's what we used to determine if it is standalone build. If `OPENMP_STANDALONE_BUILD` is false, then we could set the two variables accordingly because in that case, it is either in runtime build, or project build, where in both cases you wanna use the in-tree build clang. Makes me wonder why this kind if pollution OK, but setting LIBOMPTARGET_NVPTX_CUDA_COMPILER is not. That changes apply for all OpenMP arguments, `libomp`, `libomptarget`, plugins, device runtimes, tests, you name it. It's not just for the one single NVIDIA device runtime. What's more important, we obviously have a more elegant way to do it in project's own directory to avoid leaking those project specific arguments all over the place. tianshilei1992: > How would `BUILD_VIA_RUNTIME` be detected? `target clang`/`target llvm-link` are not targets…
		josemonsalve2Unsubmitted Not Done Reply Inline Actions I know this is been a while since this happened. But I was looking at this code and found this revision. I just wanted to add. Even though they have `compiler-rt` specific code, they recognize it is something that would need to be fixed. see `llvm-project/runtimes/CMakeLists.txt:110` or nearby. The comment says: # TODO: compiler-rt has to use standalone build for now. We tried to remove # this in D57992 but this broke the build because compiler-rt assumes that # LLVM and Clang are configured in the same build to set up dependencies. We # should clean up the compiler-rt build and remove this eventually. josemonsalve2: I know this is been a while since this happened. But I was looking at this code and found this…
		MeinersburAuthorUnsubmitted Done Reply Inline Actions I still have the opinion that we should not assume that `CMAKE_CXX_COMPILER` (C++-to-`.o`) is also the compiler to cross-compile to `.bc` even more than why `CMAKE_C_COMPILER` is also different from `CMAKE_CXX_COMPILER`, `CMAKE_CUDA_COMPILER` or `CMAKE_CUDA_HOST_COMPILER` (even though they probably are all able to compile C-code to the host target), and you need a different `CMAKE_C_COMPILER` when cross-compiling. What CMake variable specifies what compiler to use as "OpenMP/CUDA-to-bc" compiler is not libomptarget-specific. It could also be named `CMAKE_OPENMP_<TARGET>_CROSSCOMPILER`. Alternatively, the deviceRTL could be built in another CMake build configuration that is configured to cross-compiler to NVPTX (in the same sense that `compiler-rt` is configured to cross-compile to the architecture(s) that just-built Clang targets), but this requires much more effort. Meinersbur: I still have the opinion that we should not assume that `CMAKE_CXX_COMPILER` (C++-to-`.o`) is…
		-DLIBOMPTARGET_NVPTX_BC_LINKER=$<TARGET_FILE:llvm-link>
${RUNTIMES_CMAKE_ARGS}		${RUNTIMES_CMAKE_ARGS}
PASSTHROUGH_PREFIXES LLVM_ENABLE_RUNTIMES		PASSTHROUGH_PREFIXES LLVM_ENABLE_RUNTIMES
${ARG_PREFIXES}		${ARG_PREFIXES}
EXTRA_TARGETS ${extra_targets}		EXTRA_TARGETS ${extra_targets}
${test_targets}		${test_targets}
${SUB_COMPONENTS}		${SUB_COMPONENTS}
${SUB_CHECK_TARGETS}		${SUB_CHECK_TARGETS}
${SUB_INSTALL_TARGETS}		${SUB_INSTALL_TARGETS}
▲ Show 20 Lines • Show All 84 Lines • ▼ Show 20 Lines	llvm_ExternalProject_Add(runtimes-${name}
CMAKE_ARGS -DCOMPILER_RT_BUILD_BUILTINS=Off		CMAKE_ARGS -DCOMPILER_RT_BUILD_BUILTINS=Off
-DLLVM_INCLUDE_TESTS=${LLVM_INCLUDE_TESTS}		-DLLVM_INCLUDE_TESTS=${LLVM_INCLUDE_TESTS}
-DLLVM_DEFAULT_TARGET_TRIPLE=${target}		-DLLVM_DEFAULT_TARGET_TRIPLE=${target}
-DLLVM_ENABLE_PROJECTS_USED=${LLVM_ENABLE_PROJECTS_USED}		-DLLVM_ENABLE_PROJECTS_USED=${LLVM_ENABLE_PROJECTS_USED}
-DLLVM_ENABLE_PER_TARGET_RUNTIME_DIR=ON		-DLLVM_ENABLE_PER_TARGET_RUNTIME_DIR=ON
-DCMAKE_C_COMPILER_WORKS=ON		-DCMAKE_C_COMPILER_WORKS=ON
-DCMAKE_CXX_COMPILER_WORKS=ON		-DCMAKE_CXX_COMPILER_WORKS=ON
-DCMAKE_ASM_COMPILER_WORKS=ON		-DCMAKE_ASM_COMPILER_WORKS=ON
		-DLIBOMPTARGET_NVPTX_CUDA_COMPILER=$<TARGET_FILE:clang>
		-DLIBOMPTARGET_NVPTX_BC_LINKER=$<TARGET_FILE:llvm-link>
-DCOMPILER_RT_DEFAULT_TARGET_ONLY=ON		-DCOMPILER_RT_DEFAULT_TARGET_ONLY=ON
-DLLVM_RUNTIMES_TARGET=${name}		-DLLVM_RUNTIMES_TARGET=${name}
${${name}_extra_args}		${${name}_extra_args}
EXTRA_TARGETS ${${name}_extra_targets}		EXTRA_TARGETS ${${name}_extra_targets}
${${name}_test_targets}		${${name}_test_targets}
USE_TOOLCHAIN		USE_TOOLCHAIN
TARGET_TRIPLE ${target}		TARGET_TRIPLE ${target}
${EXTRA_ARGS})		${EXTRA_ARGS})
▲ Show 20 Lines • Show All 113 Lines • Show Last 20 Lines

openmp/libomptarget/deviceRTLs/nvptx/CMakeLists.txt

	Show All 27 Lines
	set(LIBOMPTARGET_NVPTX_BC_LINKER "" CACHE STRING			set(LIBOMPTARGET_NVPTX_BC_LINKER "" CACHE STRING
	"Location of a linker capable of linking LLVM bitcode objects.")			"Location of a linker capable of linking LLVM bitcode objects.")

	if (NOT LIBOMPTARGET_NVPTX_CUDA_COMPILER STREQUAL "")			if (NOT LIBOMPTARGET_NVPTX_CUDA_COMPILER STREQUAL "")
	set(cuda_compiler ${LIBOMPTARGET_NVPTX_CUDA_COMPILER})			set(cuda_compiler ${LIBOMPTARGET_NVPTX_CUDA_COMPILER})
	elseif (LLVM_TOOL_CLANG_BUILD AND NOT CMAKE_CROSSCOMPILING)			elseif (LLVM_TOOL_CLANG_BUILD AND NOT CMAKE_CROSSCOMPILING)
	# Compile the deviceRTL with the clang that is built in the project.			# Compile the deviceRTL with the clang that is built in the project.
	set(cuda_compiler "$<TARGET_FILE:clang>")			set(cuda_compiler "$<TARGET_FILE:clang>")
	elseif(${CMAKE_C_COMPILER_ID} STREQUAL "Clang")
	tianshilei1992Unsubmitted Not Done Reply Inline Actions Removing this can cause issue if I compile OpenMP standalone. We cannot assume people all compile OpenMP along with LLVM either with `LLVM_ENABLE_RUNTIMES` or `LLVM_ENABLE_PROJECTS`. Like I said in your previous patch, we need a mechanism to check whether the provided `clang` is qualified. tianshilei1992: Removing this can cause issue if I compile OpenMP standalone. We cannot assume people all…
	MeinersburAuthorUnsubmitted Done Reply Inline Actions As mentioned in the summary, automatically using the host compiler may result in unpredictable LLVM-IR that e.g. include vendor extensions. That is, the C/C++ to host-assembly compiler is just the wrong tool for CUDA-to-LLVM-IR compilation. I think the only clang we would want to support is the clang from that same git commit. Meinersbur: As mentioned in the summary, automatically using the host compiler may result in unpredictable…
	tianshilei1992Unsubmitted Not Done Reply Inline Actions That doesn't make sense. It's fine that if building in-tree, use the one from same commit, but it should never be the only way. Again (I have said that for three times), we need to check if the clang is qualified. Users can of course set `LIBOMPTARGET_NVPTX_CUDA_COMPILER` to a random clang. This change doesn't solve the root problem: is the compiler qualified? What we really need is, no matter where the compiler is from (it can be the host compiler detected by CMake, it can also be the one specified by users), check its qualification before use it. tianshilei1992: That doesn't make sense. It's fine that if building in-tree, use the one from same commit, but…
	MeinersburAuthorUnsubmitted Done Reply Inline Actions The user sets `LIBOMPTARGET_NVPTX_CUDA_COMPILER` manually, it is their responsibility. If they do not specify `LIBOMPTARGET_NVPTX_CUDA_COMPILER`, but gets a broken build due to the default being inadequate we cannot blame the user. The only adequacy test I can think of that cannot result in a broken build is to execute `clang --version` and compare it the commit hash. Meinersbur: The user sets `LIBOMPTARGET_NVPTX_CUDA_COMPILER` manually, it is their responsibility. If they…
	tianshilei1992Unsubmitted Not Done Reply Inline Actions The only adequacy test I can think of that cannot result in a broken build is to execute `clang --version` and compare it the commit hash. That is not sufficient. What if users build LLVM w/o expected targets? You can still get a "valid" clang (in terms of version) and use it for NVPTX, but in fact it doesn't support it at all. We did encounter this once when AMD offloading was enabled by default previously. We need to check the eligibility before including corresponding directories in CMake. That's the ultimate solution. We know our device runtime is using X, Y, Z features. We check whether the compiler can work properly. That's how `autoconfig` works. If in the future new features are being used, we simply update the checker. tianshilei1992: > The only adequacy test I can think of that cannot result in a broken build is to execute…
	# Compile the device runtime with the compiler that OpenMP is built with.
	# This is the case with LLVM_ENABLE_RUNTIMES=openmp.
	# FIXME: This is unreliable; the compiler can be on older version of clang
	# that does not support compiling CUDA, or only an older version of it. The
	# risk is especially high on sytems where clang is the default compiler
	# (MacOS, BSDs). LLVM_ENABLE_RUNTIMES=openmp should itself set
	# LIBOMPTARGET_NVPTX_CUDA_COMPILER instead.
	set(cuda_compiler ${CMAKE_C_COMPILER})
	else()			else()
	libomptarget_say("Not building NVPTX deviceRTL: clang not found")			libomptarget_say("Not building NVPTX deviceRTL: clang not found")
	return()			return()
	endif()			endif()

	# Get compiler directory to try to locate a suitable linker.			# Get compiler directory to try to locate a suitable linker.
	get_filename_component(compiler_dir ${cuda_compiler} DIRECTORY)			get_filename_component(compiler_dir ${cuda_compiler} DIRECTORY)
	set(llvm_link "${compiler_dir}/llvm-link")			set(llvm_link "${compiler_dir}/llvm-link")
	▲ Show 20 Lines • Show All 191 Lines • Show Last 20 Lines