This is an archive of the discontinued LLVM Phabricator instance.

Differential D150136

[Clang] Change default triple to LLVM_HOST_TRIPLE for the CUDA toolchain
ClosedPublic

Authored by jhuber6 on May 8 2023, 12:06 PM.

Details

Reviewers

tra
yaxunl

Commits

rGc2c917f7f668: [Clang] Change default triple to LLVM_HOST_TRIPLE for the CUDA toolchain

Summary

When cross-compiling NVPTX we use the triple to indicate which paths to
search for the CUDA toolchain. Currently this uses the default target
triple. This might not be exactly correct, as this is the default triple
used to compile binaries, not the host system. We want the host triple
because it indicates which folders should hold CUDA.
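The distinction in the summary can be sketched in a few lines. This is a minimal illustration, not the Clang implementation: `CompilerBuildConfig`, `tripleForCudaSearchOld`, and `tripleForCudaSearchNew` are hypothetical names, and plain strings stand in for the values that the patch takes from `llvm::sys::getDefaultTargetTriple()` (before) and `LLVM_HOST_TRIPLE` (after).

```cpp
#include <string>

// Hypothetical stand-in for the two build-time values discussed in the
// summary. Plain strings are used here so the sketch needs no LLVM headers.
struct CompilerBuildConfig {
  std::string HostTriple;          // machine the compiler itself runs on
  std::string DefaultTargetTriple; // target used when the user names none
};

// Before the patch: the CUDA search was keyed on the default target triple.
inline const std::string &
tripleForCudaSearchOld(const CompilerBuildConfig &Config) {
  return Config.DefaultTargetTriple;
}

// After the patch: the CUDA search is keyed on the host triple, because the
// CUDA installation lives on the machine that runs the compiler.
inline const std::string &
tripleForCudaSearchNew(const CompilerBuildConfig &Config) {
  return Config.HostTriple;
}
```

For a compiler that runs on Linux but was configured with a Windows default target, the old selection yields the Windows triple and the new one yields the Linux triple, which is the mismatch the patch is meant to avoid.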

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

jhuber6 created this revision. May 8 2023, 12:06 PM

Herald added a project: Restricted Project. May 8 2023, 12:06 PM

Herald added a subscriber: mattd.

jhuber6 requested review of this revision. May 8 2023, 12:06 PM

Herald added subscribers: cfe-commits, MaskRay.

The change may be an improvement, but we may still have a potential issue here.

E.g. ideally we may want to be able to cross-compile a CUDA app on a powerpc or ARM build host, targeting an NVIDIA GPU on an x86 host. So, the compilation tools would need to be found for the powerpc/arm host, but the pair of triples used during compilation would have to be x86 and nvptx.

In this situation the LLVM_HOST_TRIPLE would not be the right triple at all. Does OpenMP currently handle the cross-compilation scenario above?

In D150136#4327570, @tra wrote:

The change may be an improvement, but we may still have a potential issue here.

E.g. ideally we may want to be able to cross-compile a CUDA app on a powerpc or ARM build host, targeting an NVIDIA GPU on an x86 host. So, the compilation tools would need to be found for the powerpc/arm host, but the pair of triples used during compilation would have to be x86 and nvptx.

So, this triple is only used for locating the CUDA library itself. In that case it's generally assumed that it will match whatever file structure the host computer is using. Specifically, right now all it's used for is HostTriple.isOSWindows().

In this situation the LLVM_HOST_TRIPLE would not be the right triple at all. Does OpenMP currently handle the cross-compilation scenario above?

I don't think anyone's tried OpenMP with cross compilation. Most likely because it's only supported on Linux currently. I actually don't know what would happen if you tried.
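The point that the triple only feeds an `isOSWindows()` check can be illustrated with a small sketch. Everything here is an assumption made for illustration: `osComponent`, `isOSWindows`, and `defaultCudaCandidates` are hypothetical free functions, the parsing is far simpler than what `llvm::Triple` does, and the directory names are placeholders rather than the list Clang actually searches.

```cpp
#include <cstddef>
#include <string>
#include <vector>

// Simplified, hypothetical parsing: return the third '-'-separated field of
// an "arch-vendor-os[-env]" string. llvm::Triple accepts many more forms.
inline std::string osComponent(const std::string &Triple) {
  const std::size_t First = Triple.find('-');
  if (First == std::string::npos)
    return "";
  const std::size_t Second = Triple.find('-', First + 1);
  if (Second == std::string::npos)
    return "";
  const std::size_t Third = Triple.find('-', Second + 1);
  if (Third == std::string::npos)
    return Triple.substr(Second + 1);
  return Triple.substr(Second + 1, Third - Second - 1);
}

// Stand-in for the HostTriple.isOSWindows() check mentioned in the thread.
inline bool isOSWindows(const std::string &Triple) {
  const std::string OS = osComponent(Triple);
  return OS.rfind("windows", 0) == 0 || OS.rfind("win32", 0) == 0;
}

// Choose default search directories from the host's OS. The paths are
// illustrative placeholders, not Clang's real candidate list.
inline std::vector<std::string>
defaultCudaCandidates(const std::string &HostTriple) {
  if (isOSWindows(HostTriple))
    return {}; // no Unix-style default directories on a Windows host
  return {"/usr/local/cuda", "/opt/cuda"};
}
```

Because only the OS field is consulted in this sketch, the architecture of the host triple does not change the result, which matches the observation that the triple matters here only for the file layout of the machine running the compiler.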

right now all it's used for is HostTriple.isOSWindows()

OK.

In that case we may want to rename the parameter to BuildHostTriple to make it clear which host we have in mind.

This revision is now accepted and ready to land. May 8 2023, 12:58 PM

This revision was landed with ongoing or failed builds. May 8 2023, 1:55 PM

Closed by commit rGc2c917f7f668: [Clang] Change default triple to LLVM_HOST_TRIPLE for the CUDA toolchain (authored by jhuber6).

This revision was automatically updated to reflect the committed changes.

jhuber6 added a commit: rGc2c917f7f668: [Clang] Change default triple to LLVM_HOST_TRIPLE for the CUDA toolchain.

Harbormaster completed remote builds in B230689: Diff 520459. May 8 2023, 2:02 PM

Revision Contents

Path: clang/lib/Driver/ToolChains/Cuda.cpp
Size: 3 lines

Diff 520477

clang/lib/Driver/ToolChains/Cuda.cpp

[705 lines not shown; enclosing context: NVPTXToolChain::NVPTXToolChain(const Driver &D, const llvm::Triple &Triple,]

   // discover the 'nvptx-arch' executable.
   getProgramPaths().push_back(getDriver().Dir);
 }

 /// We only need the host triple to locate the CUDA binary utilities, use the
 /// system's default triple if not provided.
 NVPTXToolChain::NVPTXToolChain(const Driver &D, const llvm::Triple &Triple,
                                const ArgList &Args)
-    : NVPTXToolChain(D, Triple,
-                     llvm::Triple(llvm::sys::getDefaultTargetTriple()), Args,
+    : NVPTXToolChain(D, Triple, llvm::Triple(LLVM_HOST_TRIPLE), Args,
                      /*Freestanding=*/true) {}

 llvm::opt::DerivedArgList *
 NVPTXToolChain::TranslateArgs(const llvm::opt::DerivedArgList &Args,
                               StringRef BoundArch,
                               Action::OffloadKind DeviceOffloadKind) const {
   DerivedArgList *DAL =
       ToolChain::TranslateArgs(Args, BoundArch, DeviceOffloadKind);

[303 lines not shown]