This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
CMakeLists.txt
-
cmake/caches/
-
caches/
9
linux-toolchain.cmake

Differential D41660

[cmake] Add new linux toolchain file
AbandonedPublic

Authored by hintonda on Jan 2 2018, 1:30 AM.

Download Raw Diff

Details

Reviewers

compnerd
beanz
phosek

Summary

Add new linux toolchain file that allows cross compiling to
linux from other systems, e.g., Darwin.

Also, add a new variable, ADDITIONAL_CLANG_BOOTSTRAP_DEPS, which
allows adding additional dependencies to clang-bootstrap-deps.

Diff Detail

Repository

rC Clang

Build Status

Buildable 13466
Build 13466: arc lint + arc unit

Event Timeline

hintonda created this revision.Jan 2 2018, 1:30 AM

Harbormaster completed remote builds in B13462: Diff 128398.Jan 2 2018, 1:30 AM

Herald added a subscriber: mgorny. · View Herald TranscriptJan 2 2018, 1:30 AM

Why is this a cache file rather than a toolchain file (but passing itself as a toolchain file to CMake under some circumstances?) Aren't toolchain files traditionally used for cross-compilation?

cmake/caches/linux-toolchain.cmake
21	Typo: patches

In D41660#965656, @smeenai wrote:

Why is this a cache file rather than a toolchain file (but passing itself as a toolchain file to CMake under some circumstances?) Aren't toolchain files traditionally used for cross-compilation?

Thanks for taking a look.

Yes, this is for cross-compiling clang+llvm for Linux on Darwin -- and possibly Windows to Linux, but that hasn't been tested -- or Linux to Linux if you have completely different system files. It enforces using --sysroot to find the targets headers and libraries.

Cache files are preferred since they are only loaded once, but toolchain files are more flexible -- particularly when setting -target and --sysroot. Users shouldn't care if the cache file reloads itself as a toolchain file, and keeping everything is one file makes it easier to understand. This version doesn't include arch-specific builtins and runtimes, but that could easily be added.

Also, I'm happy to rename it if that would help.

Use CMAKE_(C|CXX)_COMPILER_TARGET instead of
CMAKE_(C|CXX)_COMPILER_ARG1, and pass all target variables via
CLANG_BOOTSTRAP_CMAKE_ARGS.

Add variable tests and Fix/update comments.

Harbormaster completed remote builds in B13466: Diff 128409.Jan 2 2018, 4:22 AM

In D41660#965686, @hintonda wrote:

In D41660#965656, @smeenai wrote:

Cache files are preferred since they are only loaded once

Isn't that precisely the behavior needed for cross-compilation though? You want all of your CMake configuration checks (which are independent CMake configures) to load your toolchain file, which is what you get automatically (and cache files don't behave that way).

From what I understand, the if part of the top-level if(DEFINED SYSROOT) is essentially functioning as a cache file to set up the stage2 build, and the else part is used as a toolchain file for that build. I think it would be cleaner to separate the two out; other cache files seem to be split out into stage1 and stage2 caches, for example (over here it would be stage1 cache and a stage2 toolchain, but the concept is similar).

cmake/caches/linux-toolchain.cmake
2	Cross-compilation terminology is kinda weird, and traditionally, the "host" is actually the system the built binaries will be run on (Linux in this case), whereas the build machine is the "build" (but of course that word is super ambiguous). I think LLVM generally sticks to that terminology though, e.g. `LLVM_HOST_TRIPLE`.
84	Nit: write this out as a list instead of a string with semicolons? (I know they're equivalent, but the list reads nicer IMO.)
88	Not exactly related, but I wonder why the LLVM build needs ranlib (rather than just invoking ar appropriately).
102	The CMake documentation for CMAKE_SYSTEM_NAME says CMAKE_SYSTEM_VERSION should also be set when cross-compiling (though I haven't seen any ill effects from not doing so). Setting CMAKE_SYSTEM_PROCESSOR probably doesn't hurt either.

You should split the CMake cache file you created into two files, (1) a CMake Cache to manage the build configuration and (2) a tool chain file for targeting Linux. As @semeenai pointed out we absolutly want the behavior of the toolchain file being loaded multiple times. That is the correct way this build should work.

For bootstrap builds where you want the stage1 to run on your build host, you should be able to set BOOTSTRAP_CMAKE_TOOLCHAIN_FILE in the first stage build, to signal to the first stage build that the second stage will be cross-compiled, and we can customize the multi-stage dependencies correctly based on that. That avoids the need for the ADDITIONAL_CLANG_BOOTSTRAP_DEPS variable, which feels a bit hacky to me.

In D41660#965877, @beanz wrote:

You should split the CMake cache file you created into two files, (1) a CMake Cache to manage the build configuration and (2) a tool chain file for targeting Linux. As @semeenai pointed out we absolutly want the behavior of the toolchain file being loaded multiple times. That is the correct way this build should work.

I really like keeping this in a single file, but will break it up if necessary.

The if part of the if/else is used in stage1 as a cache file, and the else part used in stage2 (and as you said, is loaded many times). Splitting this into two files won't make much difference in that regard.

For bootstrap builds where you want the stage1 to run on your build host, you should be able to set BOOTSTRAP_CMAKE_TOOLCHAIN_FILE in the first stage build, to signal to the first stage build that the second stage will be cross-compiled, and we can customize the multi-stage dependencies correctly based on that. That avoids the need for the ADDITIONAL_CLANG_BOOTSTRAP_DEPS variable, which feels a bit hacky to me.

Unless there's another way to do it, It's not hacky. I believe the term is escape hatch.

While I'm happy to use BOOTSTRAP_CMAKE_TOOLCHAIN_FILE instead of passing -DCMAKE_TOOLCHAIN_FILE=${CMAKE_CURRENT_LIST_FILE}, I do not see how it helps with this problem. When running ninja stage2, I need to insure that the dependancies where built. BOOTSTRAP_LLVM_ENABLE_LLD can be used to add lld to the dependency list, but since I'm setting CLANG_DEFAULT_LINKER=llb, I don't want clang adding -fuse-ld.

If I run this on Linux for both stages, it doesn't matter, because clang/CMakeLists.txt add llvm-ar and llvm-ranlib automatically, but since I'm on APPLE (see clang/CMakeLists.txt:559), they don't get added.

So, how else would I add them?

cmake/caches/linux-toolchain.cmake
2	I'll work on cleaning up this comment, but the idea is that we cross compile on any host system, e.g., Linux, Darwin, Windows, etc., and target Linux.
84	Perhaps, but this is the style used throughout the clang+llvm cmake files.
88	Darwin version of ranlib doesn't like elf binaries, so we need the one we build in stage1.
102	These can be passed to stage1 as BOOTSTRAP_CMAKE_SYSTEM_VERSION, etc., allowing the user full control. I'll add a note to the comments up top.

In D41660#965935, @hintonda wrote:

In D41660#965877, @beanz wrote:

You should split the CMake cache file you created into two files, (1) a CMake Cache to manage the build configuration and (2) a tool chain file for targeting Linux. As @semeenai pointed out we absolutly want the behavior of the toolchain file being loaded multiple times. That is the correct way this build should work.

I really like keeping this in a single file, but will break it up if necessary.

The if part of the if/else is used in stage1 as a cache file, and the else part used in stage2 (and as you said, is loaded many times). Splitting this into two files won't make much difference in that regard.

For bootstrap builds where you want the stage1 to run on your build host, you should be able to set BOOTSTRAP_CMAKE_TOOLCHAIN_FILE in the first stage build, to signal to the first stage build that the second stage will be cross-compiled, and we can customize the multi-stage dependencies correctly based on that. That avoids the need for the ADDITIONAL_CLANG_BOOTSTRAP_DEPS variable, which feels a bit hacky to me.

Unless there's another way to do it, It's not hacky. I believe the term is escape hatch.

While I'm happy to use BOOTSTRAP_CMAKE_TOOLCHAIN_FILE instead of passing -DCMAKE_TOOLCHAIN_FILE=${CMAKE_CURRENT_LIST_FILE}, I do not see how it helps with this problem. When running ninja stage2, I need to insure that the dependancies where built. BOOTSTRAP_LLVM_ENABLE_LLD can be used to add lld to the dependency list, but since I'm setting CLANG_DEFAULT_LINKER=llb, I don't want clang adding -fuse-ld.

Did a quick test and setting BOOTSTRAP_CMAKE_TOOLCHAIN_FILE does not work in this case.

If I run this on Linux for both stages, it doesn't matter, because clang/CMakeLists.txt add llvm-ar and llvm-ranlib automatically, but since I'm on APPLE (see clang/CMakeLists.txt:559), they don't get added.

So, how else would I add them?

@hintonda I think this should be a platform file in https://github.com/llvm-mirror/llvm/tree/master/cmake/platforms rather than Clang cache file. Platform files are concerned with the host platform (including cross-compilation), while cache files are related to the distribution setup. What you're trying to do is the former rather than the latter. Some of the aspects of your setup like the bootstrap tool setup is already handled by the 2-stage build so you should use that rather than reimplementing your own solution.

In D41660#966023, @phosek wrote:

@hintonda I think this should be a platform file in https://github.com/llvm-mirror/llvm/tree/master/cmake/platforms rather than Clang cache file. Platform files are concerned with the host platform (including cross-compilation), while cache files are related to the distribution setup. What you're trying to do is the former rather than the latter. Some of the aspects of your setup like the bootstrap tool setup is already handled by the 2-stage build so you should use that rather than reimplementing your own solution.

Thanks for the pointer. I'll rework the patch along the lines you suggest.

Thanks for all your suggestions.

Planning to rework and move the toolchain specific part to llvm/cmake/platform and abandon the cache part altogether.

Revision Contents

Path

Size

CMakeLists.txt

4 lines

cmake/

caches/

linux-toolchain.cmake

144 lines

Diff 128409

CMakeLists.txt

Show First 20 Lines • Show All 544 Lines • ▼ Show 20 Lines	if (CLANG_ENABLE_BOOTSTRAP)

set(STAMP_DIR ${CMAKE_CURRENT_BINARY_DIR}/${NEXT_CLANG_STAGE}-stamps/)		set(STAMP_DIR ${CMAKE_CURRENT_BINARY_DIR}/${NEXT_CLANG_STAGE}-stamps/)
set(BINARY_DIR ${CMAKE_CURRENT_BINARY_DIR}/${NEXT_CLANG_STAGE}-bins/)		set(BINARY_DIR ${CMAKE_CURRENT_BINARY_DIR}/${NEXT_CLANG_STAGE}-bins/)

if(BOOTSTRAP_LLVM_ENABLE_LLD)		if(BOOTSTRAP_LLVM_ENABLE_LLD)
add_dependencies(clang-bootstrap-deps lld)		add_dependencies(clang-bootstrap-deps lld)
endif()		endif()

		if(ADDITIONAL_CLANG_BOOTSTRAP_DEPS)
		add_dependencies(clang-bootstrap-deps ${ADDITIONAL_CLANG_BOOTSTRAP_DEPS})
		endif()

# If the next stage is LTO we need to depend on LTO and possibly lld or LLVMgold		# If the next stage is LTO we need to depend on LTO and possibly lld or LLVMgold
if(BOOTSTRAP_LLVM_ENABLE_LTO OR LLVM_ENABLE_LTO AND NOT LLVM_BUILD_INSTRUMENTED)		if(BOOTSTRAP_LLVM_ENABLE_LTO OR LLVM_ENABLE_LTO AND NOT LLVM_BUILD_INSTRUMENTED)
if(APPLE)		if(APPLE)
add_dependencies(clang-bootstrap-deps LTO)		add_dependencies(clang-bootstrap-deps LTO)
# on Darwin we need to set DARWIN_LTO_LIBRARY so that -flto will work		# on Darwin we need to set DARWIN_LTO_LIBRARY so that -flto will work
# using the just-built compiler, and we need to override DYLD_LIBRARY_PATH		# using the just-built compiler, and we need to override DYLD_LIBRARY_PATH
# so that the host object file tools will use the just-built libLTO.		# so that the host object file tools will use the just-built libLTO.
# However if System Integrity Protection is enabled the DYLD variables		# However if System Integrity Protection is enabled the DYLD variables
▲ Show 20 Lines • Show All 185 Lines • Show Last 20 Lines

cmake/caches/linux-toolchain.cmake

This file was added.

				# This file is intended to support cross compiling a linux toolchain
				# on any host system, includind Darwin.
				smeenaiUnsubmitted Not Done Reply Inline Actions Cross-compilation terminology is kinda weird, and traditionally, the "host" is actually the system the built binaries will be run on (Linux in this case), whereas the build machine is the "build" (but of course that word is super ambiguous). I think LLVM generally sticks to that terminology though, e.g. `LLVM_HOST_TRIPLE`. smeenai: Cross-compilation terminology is kinda weird, and traditionally, the "host" is actually the…
				hintondaAuthorUnsubmitted Not Done Reply Inline Actions I'll work on cleaning up this comment, but the idea is that we cross compile on any host system, e.g., Linux, Darwin, Windows, etc., and target Linux. hintonda: I'll work on cleaning up this comment, but the idea is that we cross compile on any host system…
				#
				# Usage: cmake -GNinja -DSYSROOT=<path> [OPTIONS] -C ../clang/cmake/caches/linux-toolchain.cmake ../llvm
				#
				# OPTIONS:
				# Regular options apply to stage1.
				# BOOTSTRAP_ options apply to stage2, and should use the BOOTSTRAP_ prefix.
				#
				# Then run "ninja stage2" to cross compile. Bins and libs can be
				# found in tools/clang/stage2-bins.
				#
				# Known issues:
				#
				# 1) This toolchain assumes a flat, mono-repo layout.
				#
				# 2) Several sub-projects, including libcxx, libcxxabi, and
				# libunwind, use FIND_PATH() to find the source path for headers
				# of other sub-projects on the host system, but fail because they
				# don't specify NO_CMAKE_FIND_ROOT_PATH. The following patches are
				# reqired to build these projects:
				smeenaiUnsubmitted Not Done Reply Inline Actions Typo: patches smeenai: Typo: patches
				#
				# libcxx https://reviews.llvm.org/D41622
				# libcxxabi https://reviews.llvm.org/D41623
				# libunwind https://reviews.llvm.org/D41621
				#
				# 3) Libraries in the compiler-rt sub-project fail to link with the
				# following error:
				#
				# bin/ld.lld: error:
				# projects/compiler-rt/lib/sanitizer_common/CMakeFiles/RTSanitizerCommon.x86_64.dir/sanitizer_linux_x86_64.S.o:
				# invalid data encoding
				#
				# 4) FIND_PACKAGE can fail if the package file uses PkgConfig, e.g.,
				# FindLibXml2.cmake, since the pkg-config program runs on the
				# host, and the variables set by PKG_CHECK_MODULES reference the
				# host system, not the target.
				#
				# 5) Stage2 configuration fails for several sub-projects, including
				# libcxx, libcxxabi, and libunwind, with the following error:
				#
				# CMake Error at .../llvm_project/libunwind/src/CMakeLists.txt:110 (add_library):
				# The install of the unwind_shared target requires changing an
				# RPATH from the build tree, but this is not supported with the
				# Ninja generator unless on an ELF-based platform. The
				# CMAKE_BUILD_WITH_INSTALL_RPATH variable may be set to avoid
				# this relinking step.
				#

				if(NOT DEFINED SYSROOT AND NOT DEFINED CMAKE_SYSROOT)
				message(FATAL_ERROR "Missing required option -DSYSROOT=<sysroot path>.")
				endif()

				if(DEFINED SYSROOT)
				# Since we just want to build a bootstrap compiler, turn off as much
				# as possible.
				set(LLVM_INCLUDE_DOCS OFF CACHE BOOL "")
				set(LLVM_INCLUDE_EXAMPLES OFF CACHE BOOL "")
				set(LLVM_INCLUDE_RUNTIMES OFF CACHE BOOL "")
				set(LLVM_INCLUDE_TESTS OFF CACHE BOOL "")
				set(LLVM_INCLUDE_UTILS OFF CACHE BOOL "")
				set(CLANG_BUILD_TOOLS OFF CACHE BOOL "")
				set(CLANG_ENABLE_ARCMT OFF CACHE BOOL "")
				set(CLANG_ENABLE_STATIC_ANALYZER OFF CACHE BOOL "")

				set(LLVM_TARGETS_TO_BUILD Native CACHE STRING "")
				set(CMAKE_BUILD_TYPE RELEASE CACHE STRING "")
				set(LLVM_ENABLE_ASSERTIONS ON CACHE BOOL "")

				# Make sure at least clang and lld are included.
				list(APPEND LLVM_ENABLE_PROJECTS clang lld)
				set(LLVM_ENABLE_PROJECTS ${LLVM_ENABLE_PROJECTS} CACHE STRING "")

				# Passing -fuse-ld=lld is hard for cmake to handle correctly, so
				# make lld the default linker.
				set(CLANG_DEFAULT_LINKER lld CACHE STRING "" FORCE)

				# Since LLVM_ENABLE_PROJECTS gets passed automatically to the next
				# stage, use another variable to pass the desired projects to stage2.
				if(DEFINED BOOTSTRAP_LLVM_ENABLE_PROJECTS)
				set(BOOTSTRAP_STAGE2_PROJECTS ${BOOTSTRAP_LLVM_ENABLE_PROJECTS} CACHE STRING "" FORCE)
				unset(BOOTSTRAP_LLVM_ENABLE_PROJECTS CACHE)
				else()
				set(BOOTSTRAP_STAGE2_PROJECTS "clang;libcxx;libcxxabi;libunwind" CACHE STRING "" FORCE)
				smeenaiUnsubmitted Not Done Reply Inline Actions Nit: write this out as a list instead of a string with semicolons? (I know they're equivalent, but the list reads nicer IMO.) smeenai: Nit: write this out as a list instead of a string with semicolons? (I know they're equivalent…
				hintondaAuthorUnsubmitted Not Done Reply Inline Actions Perhaps, but this is the style used throughout the clang+llvm cmake files. hintonda: Perhaps, but this is the style used throughout the clang+llvm cmake files.
				endif()

				# Required on non-elf hosts.
				set(ADDITIONAL_CLANG_BOOTSTRAP_DEPS "lld;llvm-ar;llvm-ranlib" CACHE STRING "")
				smeenaiUnsubmitted Not Done Reply Inline Actions Not exactly related, but I wonder why the LLVM build needs ranlib (rather than just invoking ar appropriately). smeenai: Not exactly related, but I wonder why the LLVM build needs ranlib (rather than just invoking ar…
				hintondaAuthorUnsubmitted Not Done Reply Inline Actions Darwin version of ranlib doesn't like elf binaries, so we need the one we build in stage1. hintonda: Darwin version of ranlib doesn't like elf binaries, so we need the one we build in stage1.

				if(NOT DEFINED TRIPLE)
				set(TRIPLE "x86_64-unknown-linux-gnu")
				endif()

				set(CLANG_ENABLE_BOOTSTRAP ON CACHE BOOL "")
				set(CLANG_BOOTSTRAP_CMAKE_ARGS
				-DCMAKE_SYSROOT=${SYSROOT}
				-DLLVM_DEFAULT_TARGET_TRIPLE=${TRIPLE}
				-DCMAKE_C_COMPILER_TARGET=${TRIPLE}
				-DCMAKE_CXX_COMPILER_TARGET=${TRIPLE}
				-DCMAKE_TOOLCHAIN_FILE=${CMAKE_CURRENT_LIST_FILE} CACHE STRING "")
				else()
				set(CMAKE_SYSTEM_NAME Linux CACHE STRING "" FORCE)
				smeenaiUnsubmitted Not Done Reply Inline Actions The CMake documentation for CMAKE_SYSTEM_NAME says CMAKE_SYSTEM_VERSION should also be set when cross-compiling (though I haven't seen any ill effects from not doing so). Setting CMAKE_SYSTEM_PROCESSOR probably doesn't hurt either. smeenai: The CMake documentation for CMAKE_SYSTEM_NAME says CMAKE_SYSTEM_VERSION should also be set when…
				hintondaAuthorUnsubmitted Not Done Reply Inline Actions These can be passed to stage1 as BOOTSTRAP_CMAKE_SYSTEM_VERSION, etc., allowing the user full control. I'll add a note to the comments up top. hintonda: These can be passed to stage1 as BOOTSTRAP_CMAKE_SYSTEM_VERSION, etc., allowing the user full…

				# Set default, but allow overries.
				if(NOT DEFINED CMAKE_BUILD_TYPE)
				set(CMAKE_BUILD_TYPE Release CACHE STRING "")
				endif()

				# Always use STAGE2_PROJECTS Use FORCE to override passed vaiable.
				set(LLVM_ENABLE_PROJECTS ${STAGE2_PROJECTS} CACHE STRING "" FORCE)

				# Use bootstrap tools.
				get_filename_component(BASE_PATH "${CMAKE_C_COMPILER}" DIRECTORY CACHE)
				set(CMAKE_AR "${BASE_PATH}/llvm-ar" CACHE STRING "")
				set(CMAKE_RANLIB "${BASE_PATH}/llvm-ranlib" CACHE STRING "")
				set(CLANG_TABLEGEN "${BASE_PATH}/clang-tblgen" CACHE STRING "")
				set(LLVM_TABLEGEN "${BASE_PATH}/llvm-tblgen" CACHE STRING "")
				set(_LLVM_CONFIG_EXE "${BASE_PATH}/llvm-config" CACHE STRING "")
				set(CMAKE_LINKER "${BASE_PATH}/lld" CACHE STRING "")

				# Make sure static libs use the gnu format.
				set(CMAKE_STATIC_LINKER_FLAGS "-format gnu" CACHE STRING "")

				# Force clang to look for gcc at runtime -- otherwise it will
				# default to 4.2.1.
				set(GCC_INSTALL_PREFIX "/usr" CACHE STRING "")

				# Changing an RPATH from the build tree is not supported with the
				# Ninja generator unless on an ELF-based platform. This might
				# require changes to llvm cmake files at some point.
				if(CMAKE_HOST_APPLE AND CMAKE_GENERATOR STREQUAL "Ninja")
				set(CMAKE_BUILD_WITH_INSTALL_RPATH ON CACHE BOOL "")
				endif()

				# Use CMAKE_SYSROOT prefix for FIND_XXX() commands.
				SET(CMAKE_FIND_ROOT_PATH "${CMAKE_SYSROOT}" CACHE STRING "")

				# Adjust the default behaviour of the FIND_XXX() commands:
				# search headers and libraries in the target environment, search
				# programs in the host environment.
				set(CMAKE_FIND_ROOT_PATH_MODE_PROGRAM NEVER CACHE STRING "")
				set(CMAKE_FIND_ROOT_PATH_MODE_LIBRARY ONLY CACHE STRING "")
				set(CMAKE_FIND_ROOT_PATH_MODE_INCLUDE ONLY CACHE STRING "")
				endif()

This is an archive of the discontinued LLVM Phabricator instance.

[cmake] Add new linux toolchain fileAbandonedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 128409

CMakeLists.txt

cmake/caches/linux-toolchain.cmake

[cmake] Add new linux toolchain file
AbandonedPublic