This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
clang/
-
lib/Basic/Targets/
-
Basic/
-
Targets/
1/2
NVPTX.cpp
-
test/Frontend/
-
Frontend/
-
standalone-nvptx-macros.c

Differential D146975

[NVPTX] Add __CUDA_ARCH__ macro to standalone NVPTX compilations
ClosedPublic

Authored by jhuber6 on Mar 27 2023, 8:20 AM.

Download Raw Diff

Details

Reviewers

tra
tianshilei1992
ye-luo
jdoerfert

Commits

rGbed7005eb4d4: [NVPTX] Add __CUDA_ARCH__ macro to standalone NVPTX compilations

Summary

We can now target the NVPTX architecture directly via
--target=nvptx64-nvidia-cuda. This currently does not define the
__CUDA_ARCH__ macro with is used to allow code to target different
codes based on support. This patch simply adds this support.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

jhuber6 created this revision.Mar 27 2023, 8:20 AM

Herald added a project: Restricted Project. · View Herald TranscriptMar 27 2023, 8:20 AM

Herald added subscribers: mattd, gchakrabarti, asavonic, yaxunl. · View Herald Transcript

jhuber6 requested review of this revision.Mar 27 2023, 8:20 AM

Herald added a project: Restricted Project. · View Herald TranscriptMar 27 2023, 8:20 AM

Herald added subscribers: cfe-commits, jplehr, sstefan1, jholewinski. · View Herald Transcript

This revision is now accepted and ready to land.Mar 27 2023, 10:12 AM

tra accepted this revision.Mar 27 2023, 10:22 AM

tra added inline comments.

clang/lib/Basic/Targets/NVPTX.cpp
171	Wouldn't just `if(!HostTarget)` be sufficient here?
clang/test/Frontend/nvptx-macros.c
1 ↗	(On Diff #508672)	I'd rename the file to make it more obvious that it deals with a standalone compilation. `standalone-nvptx-macros.c` ?

jhuber6 marked an inline comment as done.Mar 27 2023, 10:23 AM

jhuber6 added inline comments.

clang/lib/Basic/Targets/NVPTX.cpp
171	`HostTarget` is the host toolchain, provided via `-aux-triple`. So it's set for OpenMP and CUDA but not for a standalone.
clang/test/Frontend/nvptx-macros.c
1 ↗	(On Diff #508672)	I can do that.

This revision was landed with ongoing or failed builds.Mar 27 2023, 4:08 PM

Closed by commit rGbed7005eb4d4: [NVPTX] Add __CUDA_ARCH__ macro to standalone NVPTX compilations (authored by jhuber6). · Explain Why

This revision was automatically updated to reflect the committed changes.

jhuber6 marked an inline comment as done.

jhuber6 added a commit: rGbed7005eb4d4: [NVPTX] Add __CUDA_ARCH__ macro to standalone NVPTX compilations.

Harbormaster completed remote builds in B222031: Diff 508672.Mar 27 2023, 9:14 PM

Revision Contents

Path

Size

clang/

lib/

Basic/

Targets/

NVPTX.cpp

2 lines

test/

Frontend/

standalone-nvptx-macros.c

5 lines

Diff 508828

clang/lib/Basic/Targets/NVPTX.cpp

Show First 20 Lines • Show All 162 Lines • ▼ Show 20 Lines	return llvm::StringSwitch<bool>(Feature)
.Cases("ptx", "nvptx", true)		.Cases("ptx", "nvptx", true)
.Default(false);		.Default(false);
}		}

void NVPTXTargetInfo::getTargetDefines(const LangOptions &Opts,		void NVPTXTargetInfo::getTargetDefines(const LangOptions &Opts,
MacroBuilder &Builder) const {		MacroBuilder &Builder) const {
Builder.defineMacro("__PTX__");		Builder.defineMacro("__PTX__");
Builder.defineMacro("__NVPTX__");		Builder.defineMacro("__NVPTX__");
if (Opts.CUDAIsDevice \|\| Opts.OpenMPIsDevice) {		if (Opts.CUDAIsDevice \|\| Opts.OpenMPIsDevice \|\| !HostTarget) {
		traUnsubmitted Not Done Reply Inline Actions Wouldn't just `if(!HostTarget)` be sufficient here? tra: Wouldn't just `if(!HostTarget)` be sufficient here?
		jhuber6AuthorUnsubmitted Done Reply Inline Actions `HostTarget` is the host toolchain, provided via `-aux-triple`. So it's set for OpenMP and CUDA but not for a standalone. jhuber6: `HostTarget` is the host toolchain, provided via `-aux-triple`. So it's set for OpenMP and CUDA…
// Set __CUDA_ARCH__ for the GPU specified.		// Set __CUDA_ARCH__ for the GPU specified.
std::string CUDAArchCode = [this] {		std::string CUDAArchCode = [this] {
switch (GPU) {		switch (GPU) {
case CudaArch::GFX600:		case CudaArch::GFX600:
case CudaArch::GFX601:		case CudaArch::GFX601:
case CudaArch::GFX602:		case CudaArch::GFX602:
case CudaArch::GFX700:		case CudaArch::GFX700:
case CudaArch::GFX701:		case CudaArch::GFX701:
▲ Show 20 Lines • Show All 91 Lines • Show Last 20 Lines

clang/test/Frontend/standalone-nvptx-macros.c

This file was added.

				// REQUIRES: nvptx-registered-target

				// RUN: %clang %s -c -E -dM --target=nvptx64-nvidia-cuda -march=sm_70 -o - \| \
				// RUN: FileCheck --check-prefix=CHECK-CUDA-ARCH %s
				// CHECK-CUDA-ARCH: #define __CUDA_ARCH__ 700