This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
libc/
-
cmake/modules/
-
modules/
5/5
LLVMLibCObjectRules.cmake
-
prepare_libc_gpu_build.cmake
-
src/__support/
-
__support/
-
common.h

Differential D143089

[libc] Remove OpenMP and build the GPU libc directly
ClosedPublic

Authored by jhuber6 on Feb 1 2023, 9:47 AM.

Download Raw Diff

Details

Reviewers

sivachandra
lntue
michaelrj
jdoerfert

Commits

rG6d0e1373589a: [libc] Remove OpenMP and build the GPU libc directly

Summary

The current libcgpu.a is actually an archive of fatbinaries. The host
file contains nothing but a section called LLVM_OFFLOADING that
contains embedded device code. This used to be handled implicitly by
borrowing the OpenMP toolchain, which did this packaging internally.
Passing the OpenMP flags causes problems with trying to move to testing.
This patch pulls this logic out into the CMake and handles it manually.

This patch is a lot of noise, but it fundamentally comes down to the
following changes.

Build the source for every GPU architecture (GPU architectures are generally not backwards compatible)
Combine all of these files into a single binary blob
Embed that binary blob into a host file
Package these host files into a .a archive.
The device code will be extracted and managed by the offloading linker.

Another important point. Right now we are maintaining an important
distinction with the GPU build. That is, when we build the exported
library we will build for many GPU architectures. However, the internal
version will only be built for a single GPU architecture, one that was
found on the user's system. This is intended to be used for internal
testing, very similar to the current path where libc is compiled for a
single target triple.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

jhuber6 created this revision.Feb 1 2023, 9:47 AM

Herald added projects: Restricted Project, Restricted Project. · View Herald TranscriptFeb 1 2023, 9:47 AM

Herald added subscribers: libc-commits, ecnelises, tschuett and 2 others. · View Herald Transcript

jhuber6 requested review of this revision.Feb 1 2023, 9:47 AM

Herald added a reviewer: jdoerfert. · View Herald TranscriptFeb 1 2023, 9:47 AM

Herald added a subscriber: sstefan1. · View Herald Transcript

Harbormaster completed remote builds in B211262: Diff 493990.Feb 1 2023, 9:54 AM

Mostly LGTM but I have left a few nits and request for one TODO. I will approve as soon as I can see the TODO comment.

libc/cmake/modules/LLVMLibCObjectRules.cmake
157	`gpu` can potentially be directory name. So, can we use a suffix like, `.__gpu__`?
158	Can this be given a suffix of `.gpu.bin`?
163	Can this be moved into the `foreach` block above?
175	Can you add a TODO explaining how this will evolve?
204	Can this be moved to the `if` block where the internal target is actually added?

Addressing comments.

jhuber6 marked 5 inline comments as done.Feb 2 2023, 5:37 AM

Harbormaster completed remote builds in B211466: Diff 494271.Feb 2 2023, 5:43 AM

sivachandra accepted this revision.Feb 2 2023, 7:04 AM

This revision is now accepted and ready to land.Feb 2 2023, 7:04 AM

Closed by commit rG6d0e1373589a: [libc] Remove OpenMP and build the GPU libc directly (authored by jhuber6). · Explain WhyFeb 2 2023, 7:47 AM

This revision was automatically updated to reflect the committed changes.

jhuber6 added a commit: rG6d0e1373589a: [libc] Remove OpenMP and build the GPU libc directly.

Revision Contents

Path

Size

libc/

cmake/

modules/

LLVMLibCObjectRules.cmake

243 lines

prepare_libc_gpu_build.cmake

23 lines

src/

__support/

common.h

12 lines

Diff 494313

libc/cmake/modules/LLVMLibCObjectRules.cmake

Show First 20 Lines • Show All 46 Lines • ▼ Show 20 Lines	function(_get_common_compile_options output_var flags)
elseif(MSVC)		elseif(MSVC)
list(APPEND compile_options "/EHs-c-")		list(APPEND compile_options "/EHs-c-")
list(APPEND compile_options "/GR-")		list(APPEND compile_options "/GR-")
if(ADD_FMA_FLAG)		if(ADD_FMA_FLAG)
list(APPEND compile_options "/arch:AVX2")		list(APPEND compile_options "/arch:AVX2")
endif()		endif()
endif()		endif()
if (LIBC_TARGET_ARCHITECTURE_IS_GPU)		if (LIBC_TARGET_ARCHITECTURE_IS_GPU)
list(APPEND compile_options "-fopenmp")
list(APPEND compile_options "-fopenmp-cuda-mode")
foreach(gpu_arch ${LIBC_GPU_ARCHITECTURES})
list(APPEND compile_options "--offload-arch=${gpu_arch}")
endforeach()
list(APPEND compile_options "-nogpulib")		list(APPEND compile_options "-nogpulib")
list(APPEND compile_options "-nogpuinc")
list(APPEND compile_options "-fvisibility=hidden")		list(APPEND compile_options "-fvisibility=hidden")
list(APPEND compile_options "-foffload-lto")
endif()		endif()
set(${output_var} ${compile_options} PARENT_SCOPE)		set(${output_var} ${compile_options} PARENT_SCOPE)
endfunction()		endfunction()

		# Builds the entrypoint target for the GPU.
		# Usage:
		# _build_gpu_entrypoint_objects(
		# <target_name>
		# SRCS <list of .cpp files>
		# HDRS <list of .h files>
		# DEPENDS <list of dependencies>
		# COMPILE_OPTIONS <optional list of special compile options for this target>
		# FLAGS <optional list of flags>
		# )
		function(_build_gpu_entrypoint_objects fq_target_name)
		cmake_parse_arguments(
		"ADD_GPU_ENTRYPOINT_OBJ"
		"" # No optional arguments
		"NAME;CXX_STANDARD" # Single value arguments
		"SRCS;HDRS;DEPENDS;COMPILE_OPTIONS;FLAGS" # Multi value arguments
		${ARGN}
		)

		# The packaged version will be built for every target GPU architecture. We do
		# this so we can support multiple accelerators on the same machine.
		foreach(gpu_arch ${all_gpu_architectures})
		set(gpu_target_name ${fq_target_name}.${gpu_arch})
		set(compile_options ${ADD_GPU_ENTRYPOINT_OBJ_COMPILE_OPTIONS})
		# Derive the triple from the specified architecture.
		if("${gpu_arch}" IN_LIST all_amdgpu_architectures)
		set(gpu_target_triple "amdgcn-amd-amdhsa")
		list(APPEND compile_options "-mcpu=${gpu_arch}")
		elseif("${gpu_arch}" IN_LIST all_nvptx_architectures)
		set(gpu_target_triple "nvptx64-nvidia-cuda")
		list(APPEND compile_options "-march=${gpu_arch}")
		else()
		message(FATAL_ERROR "Unknown GPU architecture '${gpu_arch}'")
		endif()
		list(APPEND compile_options "--target=${gpu_target_triple}")
		list(APPEND compile_options "-emit-llvm")

		# Build the library for this target architecture. We always emit LLVM-IR for
		# packaged GPU binaries.
		add_library(${gpu_target_name}
		EXCLUDE_FROM_ALL
		OBJECT
		${ADD_GPU_ENTRYPOINT_OBJ_SRCS}
		${ADD_GPU_ENTRYPOINT_OBJ_HDRS}
		)

		target_compile_options(${gpu_target_name} PRIVATE ${compile_options})
		target_include_directories(${gpu_target_name} PRIVATE ${include_dirs})
		add_dependencies(${gpu_target_name} ${ADD_GPU_ENTRYPOINT_OBJ_DEPENDS})
		target_compile_definitions(${gpu_target_name} PRIVATE LLVM_LIBC_PUBLIC_PACKAGING)

		# Append this target to a list of images to package into a single binary.
		set(input_file $<TARGET_OBJECTS:${gpu_target_name}>)
		list(APPEND packager_images
		--image=file=${input_file},arch=${gpu_arch},triple=${gpu_target_triple})
		list(APPEND gpu_target_names ${gpu_target_name})
		endforeach()

		# After building the target for the desired GPUs we must package the output
		# into a fatbinary, see https://clang.llvm.org/docs/OffloadingDesign.html for
		# more information.
		set(packaged_target_name ${fq_target_name}.__gpu__)
		set(packaged_output_name ${CMAKE_CURRENT_BINARY_DIR}/${fq_target_name}.gpubin)

		add_custom_command(OUTPUT ${packaged_output_name}
		COMMAND ${LIBC_CLANG_OFFLOAD_PACKAGER}
		${packager_images} -o ${packaged_output_name}
		DEPENDS ${gpu_target_names}
		COMMENT "Packaging LLVM offloading binary")
		add_custom_target(${packaged_target_name} DEPENDS ${packaged_output_name})

		# We create an empty 'stub' file for the host to contain the embedded device
		# code. This will be packaged into 'libcgpu.a'.
		# TODO: In the future we will want to combine every architecture for a target
		# into a single bitcode file and use that. For now we simply build for
		# every single one and let the offloading linker handle it.
		get_filename_component(stub_filename ${ADD_GPU_ENTRYPOINT_OBJ_SRCS} NAME)
		file(WRITE ${CMAKE_CURRENT_BINARY_DIR}/${stub_filename} "// Empty file.\n")
		add_library(
		${fq_target_name}
		# We want an object library as the objects will eventually get packaged into
		# an archive (like libcgpu.a).
		EXCLUDE_FROM_ALL
		OBJECT
		"${CMAKE_CURRENT_BINARY_DIR}/${stub_filename}"
		)
		target_compile_options(${fq_target_name} BEFORE PRIVATE ${common_compile_options}
		-DLLVM_LIBC_PUBLIC_PACKAGING
		-nostdlib -Xclang -fembed-offload-object=${packaged_output_name})
		target_include_directories(${fq_target_name} PRIVATE ${include_dirs})
		add_dependencies(${fq_target_name} ${full_deps_list} ${packaged_target_name})

		set_target_properties(
		${fq_target_name}
		PROPERTIES
		ENTRYPOINT_NAME ${ADD_ENTRYPOINT_OBJ_NAME}
		TARGET_TYPE ${ENTRYPOINT_OBJ_TARGET_TYPE}
		sivachandraUnsubmitted Done Reply Inline Actions `gpu` can potentially be directory name. So, can we use a suffix like, `.__gpu__`? sivachandra: `gpu` can potentially be directory name. So, can we use a suffix like, `.__gpu__`?
		OBJECT_FILE "$<TARGET_OBJECTS:${fq_target_name}>"
		sivachandraUnsubmitted Done Reply Inline Actions Can this be given a suffix of `.gpu.bin`? sivachandra: Can this be given a suffix of `.gpu.bin`?
		CXX_STANDARD ${ADD_ENTRYPOINT_OBJ_CXX_STANDARD}
		DEPS "${fq_deps_list}"
		FLAGS "${ADD_ENTRYPOINT_OBJ_FLAGS}"
		)

		sivachandraUnsubmitted Done Reply Inline Actions Can this be moved into the `foreach` block above? sivachandra: Can this be moved into the `foreach` block above?
		# We only build the internal target for a single supported architecture.
		set(internal_target_name ${fq_target_name}.__internal__)
		set(include_dirs ${LIBC_BUILD_DIR}/include ${LIBC_SOURCE_DIR} ${LIBC_BUILD_DIR})
		if(LIBC_GPU_TARGET_ARCHITECTURE_IS_AMDGPU OR
		LIBC_GPU_TARGET_ARCHITECTURE_IS_NVPTX)
		add_library(
		${internal_target_name}
		EXCLUDE_FROM_ALL
		OBJECT
		${ADD_ENTRYPOINT_OBJ_SRCS}
		${ADD_ENTRYPOINT_OBJ_HDRS}
		)
		sivachandraUnsubmitted Done Reply Inline Actions Can you add a TODO explaining how this will evolve? sivachandra: Can you add a TODO explaining how this will evolve?
		target_compile_options(${internal_target_name} BEFORE PRIVATE
		${common_compile_options} --target=${LIBC_GPU_TARGET_TRIPLE})
		if(LIBC_GPU_TARGET_ARCHITECTURE_IS_AMDGPU)
		target_compile_options(${internal_target_name} PRIVATE -mcpu=${LIBC_GPU_TARGET_ARCHITECTURE})
		elseif(LIBC_GPU_TARGET_ARCHITECTURE_IS_NVPTX)
		target_compile_options(${internal_target_name} PRIVATE -march=${LIBC_GPU_TARGET_ARCHITECTURE})
		endif()
		target_include_directories(${internal_target_name} PRIVATE ${include_dirs})
		add_dependencies(${internal_target_name} ${full_deps_list})
		set_target_properties(
		${internal_target_name}
		PROPERTIES
		CXX_STANDARD ${ADD_ENTRYPOINT_OBJ_CXX_STANDARD}
		FLAGS "${ADD_ENTRYPOINT_OBJ_FLAGS}"
		)
		set_target_properties(
		${fq_target_name}
		PROPERTIES OBJECT_FILE_RAW "$<TARGET_OBJECTS:${internal_target_name}>"
		)
		endif()
		endfunction()

# Rule which is essentially a wrapper over add_library to compile a set of		# Rule which is essentially a wrapper over add_library to compile a set of
# sources to object files.		# sources to object files.
# Usage:		# Usage:
# add_object_library(		# add_object_library(
# <target_name>		# <target_name>
# HDRS <list of header files>		# HDRS <list of header files>
# SRCS <list of source files>		# SRCS <list of source files>
		sivachandraUnsubmitted Done Reply Inline Actions Can this be moved to the `if` block where the internal target is actually added? sivachandra: Can this be moved to the `if` block where the internal target is actually added?
# DEPENDS <list of dependencies>		# DEPENDS <list of dependencies>
# COMPILE_OPTIONS <optional list of special compile options for this target>		# COMPILE_OPTIONS <optional list of special compile options for this target>
# FLAGS <optional list of flags>		# FLAGS <optional list of flags>
function(create_object_library fq_target_name)		function(create_object_library fq_target_name)
cmake_parse_arguments(		cmake_parse_arguments(
"ADD_OBJECT"		"ADD_OBJECT"
"" # No optional arguments		"" # No optional arguments
"CXX_STANDARD" # Single value arguments		"CXX_STANDARD" # Single value arguments
Show All 39 Lines	function(create_object_library fq_target_name)

if(fq_deps_list)		if(fq_deps_list)
add_dependencies(${fq_target_name} ${fq_deps_list})		add_dependencies(${fq_target_name} ${fq_deps_list})
endif()		endif()

if(NOT ADD_OBJECT_CXX_STANDARD)		if(NOT ADD_OBJECT_CXX_STANDARD)
set(ADD_OBJECT_CXX_STANDARD ${CMAKE_CXX_STANDARD})		set(ADD_OBJECT_CXX_STANDARD ${CMAKE_CXX_STANDARD})
endif()		endif()

set_target_properties(		set_target_properties(
${fq_target_name}		${fq_target_name}
PROPERTIES		PROPERTIES
TARGET_TYPE ${OBJECT_LIBRARY_TARGET_TYPE}		TARGET_TYPE ${OBJECT_LIBRARY_TARGET_TYPE}
OBJECT_FILES "$<TARGET_OBJECTS:${fq_target_name}>"		OBJECT_FILES "$<TARGET_OBJECTS:${fq_target_name}>"
CXX_STANDARD ${ADD_OBJECT_CXX_STANDARD}		CXX_STANDARD ${ADD_OBJECT_CXX_STANDARD}
DEPS "${fq_deps_list}"		DEPS "${fq_deps_list}"
FLAGS "${ADD_OBJECT_FLAGS}"		FLAGS "${ADD_OBJECT_FLAGS}"
▲ Show 20 Lines • Show All 206 Lines • ▼ Show 20 Lines	if(SHOW_INTERMEDIATE_OBJECTS)
message(STATUS "Adding entrypoint object ${fq_target_name}")		message(STATUS "Adding entrypoint object ${fq_target_name}")
if(${SHOW_INTERMEDIATE_OBJECTS} STREQUAL "DEPS")		if(${SHOW_INTERMEDIATE_OBJECTS} STREQUAL "DEPS")
foreach(dep IN LISTS ADD_OBJECT_DEPENDS)		foreach(dep IN LISTS ADD_OBJECT_DEPENDS)
message(STATUS " ${fq_target_name} depends on ${dep}")		message(STATUS " ${fq_target_name} depends on ${dep}")
endforeach()		endforeach()
endif()		endif()
endif()		endif()

		# GPU builds require special handling for the objects because we want to
		# export several different targets at once, e.g. for both Nvidia and AMD.
		if(LIBC_TARGET_ARCHITECTURE_IS_GPU)
		_build_gpu_entrypoint_objects(
		${fq_target_name}
		SRCS ${ADD_ENTRYPOINT_OBJ_SRCS}
		HDRS ${ADD_ENTRYPOINT_OBJ_HDRS}
		COMPILE_OPTIONS ${common_compile_options}
		DEPENDS ${full_deps_list}
		CXX_STANDARD ${ADD_ENTRYPOINT_OBJ_CXX_STANDARD}
		FLAGS "${ADD_ENTRYPOINT_OBJ_FLAGS}"
		)
		else()
add_library(		add_library(
${internal_target_name}		${internal_target_name}
# TODO: We don't need an object library for internal consumption.		# TODO: We don't need an object library for internal consumption.
# A future change should switch this to a normal static library.		# A future change should switch this to a normal static library.
EXCLUDE_FROM_ALL		EXCLUDE_FROM_ALL
OBJECT		OBJECT
${ADD_ENTRYPOINT_OBJ_SRCS}		${ADD_ENTRYPOINT_OBJ_SRCS}
${ADD_ENTRYPOINT_OBJ_HDRS}		${ADD_ENTRYPOINT_OBJ_HDRS}
)		)
target_compile_options(${internal_target_name} BEFORE PRIVATE ${common_compile_options})		target_compile_options(${internal_target_name} BEFORE PRIVATE ${common_compile_options})
target_include_directories(${internal_target_name} PRIVATE ${include_dirs})		target_include_directories(${internal_target_name} PRIVATE ${include_dirs})
add_dependencies(${internal_target_name} ${full_deps_list})		add_dependencies(${internal_target_name} ${full_deps_list})
set_target_properties(		set_target_properties(
${internal_target_name}		${internal_target_name}
PROPERTIES		PROPERTIES
CXX_STANDARD ${ADD_ENTRYPOINT_OBJ_CXX_STANDARD}		CXX_STANDARD ${ADD_ENTRYPOINT_OBJ_CXX_STANDARD}
FLAGS "${ADD_ENTRYPOINT_OBJ_FLAGS}"		FLAGS "${ADD_ENTRYPOINT_OBJ_FLAGS}"
)		)

add_library(		add_library(
${fq_target_name}		${fq_target_name}
# We want an object library as the objects will eventually get packaged into		# We want an object library as the objects will eventually get packaged into
# an archive (like libc.a).		# an archive (like libc.a).
EXCLUDE_FROM_ALL		EXCLUDE_FROM_ALL
OBJECT		OBJECT
${ADD_ENTRYPOINT_OBJ_SRCS}		${ADD_ENTRYPOINT_OBJ_SRCS}
${ADD_ENTRYPOINT_OBJ_HDRS}		${ADD_ENTRYPOINT_OBJ_HDRS}
)		)
target_compile_options(${fq_target_name} BEFORE PRIVATE ${common_compile_options} -DLLVM_LIBC_PUBLIC_PACKAGING)		target_compile_options(${fq_target_name} BEFORE PRIVATE ${common_compile_options} -DLLVM_LIBC_PUBLIC_PACKAGING)
target_include_directories(${fq_target_name} PRIVATE ${include_dirs})		target_include_directories(${fq_target_name} PRIVATE ${include_dirs})
add_dependencies(${fq_target_name} ${full_deps_list})		add_dependencies(${fq_target_name} ${full_deps_list})

set_target_properties(		set_target_properties(
${fq_target_name}		${fq_target_name}
PROPERTIES		PROPERTIES
ENTRYPOINT_NAME ${ADD_ENTRYPOINT_OBJ_NAME}		ENTRYPOINT_NAME ${ADD_ENTRYPOINT_OBJ_NAME}
TARGET_TYPE ${ENTRYPOINT_OBJ_TARGET_TYPE}		TARGET_TYPE ${ENTRYPOINT_OBJ_TARGET_TYPE}
OBJECT_FILE "$<TARGET_OBJECTS:${fq_target_name}>"		OBJECT_FILE "$<TARGET_OBJECTS:${fq_target_name}>"
# TODO: We don't need to list internal object files if the internal		# TODO: We don't need to list internal object files if the internal
# target is a normal static library.		# target is a normal static library.
OBJECT_FILE_RAW "$<TARGET_OBJECTS:${internal_target_name}>"		OBJECT_FILE_RAW "$<TARGET_OBJECTS:${internal_target_name}>"
CXX_STANDARD ${ADD_ENTRYPOINT_OBJ_CXX_STANDARD}		CXX_STANDARD ${ADD_ENTRYPOINT_OBJ_CXX_STANDARD}
DEPS "${fq_deps_list}"		DEPS "${fq_deps_list}"
FLAGS "${ADD_ENTRYPOINT_OBJ_FLAGS}"		FLAGS "${ADD_ENTRYPOINT_OBJ_FLAGS}"
)		)
		endif()

if(LLVM_LIBC_ENABLE_LINTING)		if(LLVM_LIBC_ENABLE_LINTING AND TARGET ${internal_target_name})
if(NOT LLVM_LIBC_CLANG_TIDY)		if(NOT LLVM_LIBC_CLANG_TIDY)
message(FATAL_ERROR "Something is wrong! LLVM_LIBC_ENABLE_LINTING is "		message(FATAL_ERROR "Something is wrong! LLVM_LIBC_ENABLE_LINTING is "
"ON but LLVM_LIBC_CLANG_TIDY is not set.")		"ON but LLVM_LIBC_CLANG_TIDY is not set.")
endif()		endif()

# We only want a second invocation of clang-tidy to run		# We only want a second invocation of clang-tidy to run
# restrict-system-libc-headers if the compiler-resource-dir was set in		# restrict-system-libc-headers if the compiler-resource-dir was set in
# order to prevent false-positives due to a mismatch between the host		# order to prevent false-positives due to a mismatch between the host
▲ Show 20 Lines • Show All 206 Lines • Show Last 20 Lines

libc/cmake/modules/prepare_libc_gpu_build.cmake

	if(NOT LIBC_TARGET_ARCHITECTURE_IS_GPU)			if(NOT LIBC_TARGET_ARCHITECTURE_IS_GPU)
	message(FATAL_ERROR			message(FATAL_ERROR
	"libc build: Invalid attempt to set up GPU architectures.")			"libc build: Invalid attempt to set up GPU architectures.")
	endif()			endif()

	# Set up the target architectures to build the GPU libc for.			# Set up the target architectures to build the GPU libc for.
	set(all_gpu_architectures "sm_35;sm_37;sm_50;sm_52;sm_53;sm_60;sm_61;sm_62;"			set(all_amdgpu_architectures "gfx700;gfx701;gfx801;gfx803;gfx900;gfx902;gfx906;"
	"sm_70;sm_72;sm_75;sm_80;sm_86;gfx700;gfx701;gfx801;"			"gfx908;gfx90a;gfx90c;gfx940;gfx1010;gfx1030;"
	"gfx803;gfx900;gfx902;gfx906;gfx908;gfx90a;gfx90c;"			"gfx1031;gfx1032;gfx1033;gfx1034;gfx1035;gfx1036;"
	"gfx940;gfx1010;gfx1030;gfx1031;gfx1032;gfx1033;"			"gfx1100;gfx1101;gfx1102;gfx1103")
	"gfx1034;gfx1035;gfx1036;gfx1100;gfx1101;gfx1102;"			set(all_nvptx_architectures "sm_35;sm_37;sm_50;sm_52;sm_53;sm_60;sm_61;sm_62;"
	"gfx1103")			"sm_70;sm_72;sm_75;sm_80;sm_86")
				set(all_gpu_architectures
				"${all_amdgpu_architectures};${all_nvptx_architectures}")
	set(LIBC_GPU_ARCHITECTURES ${all_gpu_architectures} CACHE STRING			set(LIBC_GPU_ARCHITECTURES ${all_gpu_architectures} CACHE STRING
	"List of GPU architectures to build the libc for.")			"List of GPU architectures to build the libc for.")
	if(LIBC_GPU_ARCHITECTURES STREQUAL "all")			if(LIBC_GPU_ARCHITECTURES STREQUAL "all")
	set(LIBC_GPU_ARCHITECTURES ${all_gpu_architectures} FORCE)			set(LIBC_GPU_ARCHITECTURES ${all_gpu_architectures} FORCE)
	endif()			endif()

	# Ensure the compiler is a valid clang when building the GPU target.			# Ensure the compiler is a valid clang when building the GPU target.
	set(req_ver "${LLVM_VERSION_MAJOR}.${LLVM_VERSION_MINOR}.${LLVM_VERSION_PATCH}")			set(req_ver "${LLVM_VERSION_MAJOR}.${LLVM_VERSION_MINOR}.${LLVM_VERSION_PATCH}")
	if(NOT (CMAKE_CXX_COMPILER_ID MATCHES "[Cc]lang" AND			if(NOT (CMAKE_CXX_COMPILER_ID MATCHES "[Cc]lang" AND
	${CMAKE_CXX_COMPILER_VERSION} VERSION_EQUAL "${req_ver}"))			${CMAKE_CXX_COMPILER_VERSION} VERSION_EQUAL "${req_ver}"))
	message(FATAL_ERROR "Cannot build libc for GPU. CMake compiler "			message(FATAL_ERROR "Cannot build libc for GPU. CMake compiler "
	"'${CMAKE_CXX_COMPILER_ID} ${CMAKE_CXX_COMPILER_VERSION}' "			"'${CMAKE_CXX_COMPILER_ID} ${CMAKE_CXX_COMPILER_VERSION}' "
	" is not `Clang ${req_ver}.")			" is not `Clang ${req_ver}.")
	endif()			endif()
	if(NOT LLVM_LIBC_FULL_BUILD)			if(NOT LLVM_LIBC_FULL_BUILD)
	message(FATAL_ERROR "LLVM_LIBC_FULL_BUILD must be enabled to build libc for "			message(FATAL_ERROR "LLVM_LIBC_FULL_BUILD must be enabled to build libc for "
	"GPU.")			"GPU.")
	endif()			endif()

				# Identify the program used to package multiple images into a single binary.
				find_program(LIBC_CLANG_OFFLOAD_PACKAGER
				NAMES clang-offload-packager
				PATHS ${LLVM_BINARY_DIR}/bin)
				if(NOT LIBC_CLANG_OFFLOAD_PACKAGER)
				message(FATAL_ERROR "Cannot find the 'clang-offload-packager' for the GPU "
				"build")
				endif()

	# Identify any locally installed AMD GPUs on the system to use for testing.			# Identify any locally installed AMD GPUs on the system to use for testing.
	find_program(LIBC_AMDGPU_ARCH			find_program(LIBC_AMDGPU_ARCH
	NAMES amdgpu-arch			NAMES amdgpu-arch
	PATHS ${LLVM_BINARY_DIR}/bin /opt/rocm/llvm/bin/)			PATHS ${LLVM_BINARY_DIR}/bin /opt/rocm/llvm/bin/)
	if(LIBC_AMDGPU_ARCH)			if(LIBC_AMDGPU_ARCH)
	execute_process(COMMAND ${LIBC_AMDGPU_ARCH}			execute_process(COMMAND ${LIBC_AMDGPU_ARCH}
	OUTPUT_VARIABLE LIBC_AMDGPU_ARCH_OUTPUT			OUTPUT_VARIABLE LIBC_AMDGPU_ARCH_OUTPUT
	OUTPUT_STRIP_TRAILING_WHITESPACE)			OUTPUT_STRIP_TRAILING_WHITESPACE)
	Show All 39 Lines

libc/src/__support/common.h

	Show All 23 Lines
	#ifndef LLVM_LIBC_FUNCTION_ATTR			#ifndef LLVM_LIBC_FUNCTION_ATTR
	#define LLVM_LIBC_FUNCTION_ATTR			#define LLVM_LIBC_FUNCTION_ATTR
	#endif			#endif

	#ifndef LIBC_INLINE			#ifndef LIBC_INLINE
	#define LIBC_INLINE inline			#define LIBC_INLINE inline
	#endif			#endif

	// We use OpenMP to declare these functions on the device.			#if defined(__AMDGPU__) \|\| defined(__NVPTX__)
	#define STR(X) #X			#define PACKAGE_FOR_GPU
	#define LLVM_LIBC_DECLARE_DEVICE(name) \			#endif
	_Pragma(STR(omp declare target to(name) device_type(nohost)))

	// GPU targets do not support aliasing and must be declared on the device.			// GPU targets do not support aliasing.
	#if defined(LLVM_LIBC_PUBLIC_PACKAGING) && defined(_OPENMP)			#if defined(LLVM_LIBC_PUBLIC_PACKAGING) && defined(PACKAGE_FOR_GPU)
	#define LLVM_LIBC_FUNCTION(type, name, arglist) \			#define LLVM_LIBC_FUNCTION(type, name, arglist) \
	LLVM_LIBC_FUNCTION_ATTR decltype(__llvm_libc::name) \			LLVM_LIBC_FUNCTION_ATTR decltype(__llvm_libc::name) \
	__##name##_impl__ __asm__(#name); \			__##name##_impl__ __asm__(#name); \
	LLVM_LIBC_DECLARE_DEVICE(__##name##_impl__) \
	type __##name##_impl__ arglist			type __##name##_impl__ arglist
	// MacOS needs to be excluded because it does not support aliasing.			// MacOS needs to be excluded because it does not support aliasing.
	#elif defined(LLVM_LIBC_PUBLIC_PACKAGING) && (!defined(__APPLE__))			#elif defined(LLVM_LIBC_PUBLIC_PACKAGING) && (!defined(__APPLE__))
	#define LLVM_LIBC_FUNCTION(type, name, arglist) \			#define LLVM_LIBC_FUNCTION(type, name, arglist) \
	LLVM_LIBC_FUNCTION_ATTR decltype(__llvm_libc::name) \			LLVM_LIBC_FUNCTION_ATTR decltype(__llvm_libc::name) \
	__##name##_impl__ __asm__(#name); \			__##name##_impl__ __asm__(#name); \
	decltype(__llvm_libc::name) name [[gnu::alias(#name)]]; \			decltype(__llvm_libc::name) name [[gnu::alias(#name)]]; \
	type __##name##_impl__ arglist			type __##name##_impl__ arglist
	Show All 28 Lines