This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
clang/
-
lib/Basic/Targets/
-
Basic/
-
Targets/
-
NVPTX.cpp
-
test/OpenMP/
-
OpenMP/
-
driver-openmp-target.c

Differential D125256

[OpenMP] Add `__CUDA_ARCH__` definition when offloading with OpenMP
ClosedPublic

Authored by jhuber6 on May 9 2022, 12:11 PM.

Download Raw Diff

Details

Reviewers

jdoerfert
tra
tianshilei1992

Commits

rG002a63f937d9: [OpenMP] Add `__CUDA_ARCH__` definition when offloading with OpenMP

Summary

Currently we define the __CUDA_ARCH__ macro only in CUDA mode. This
patch allows us to use this macro in OpenMP-offloading mode when
targeting NVPTX.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

jhuber6 created this revision.May 9 2022, 12:11 PM

Herald added a project: Restricted Project. · View Herald TranscriptMay 9 2022, 12:11 PM

Herald added subscribers: mattd, gchakrabarti, asavonic and 3 others. · View Herald Transcript

jhuber6 requested review of this revision.May 9 2022, 12:11 PM

Herald added a project: Restricted Project. · View Herald TranscriptMay 9 2022, 12:11 PM

Herald added subscribers: cfe-commits, sstefan1. · View Herald Transcript

jhuber6 retitled this revision from [OpenMP] Add __CUDA_ARCH__ definition when offloading with OpenMP to [OpenMP] Add `__CUDA_ARCH__` definition when offloading with OpenMP.May 9 2022, 12:13 PM

jhuber6 edited the summary of this revision. (Show Details)

This revision is now accepted and ready to land.May 9 2022, 1:00 PM

tra accepted this revision.May 9 2022, 1:08 PM

Harbormaster completed remote builds in B163536: Diff 428155.May 9 2022, 2:22 PM

jhuber6 added a child revision: D125315: [Libomptarget] Build the device runtime as a static library.May 10 2022, 7:04 AM

Closed by commit rG002a63f937d9: [OpenMP] Add `__CUDA_ARCH__` definition when offloading with OpenMP (authored by jhuber6). · Explain WhyMay 13 2022, 11:39 AM

This revision was automatically updated to reflect the committed changes.

jhuber6 added a commit: rG002a63f937d9: [OpenMP] Add `__CUDA_ARCH__` definition when offloading with OpenMP.

@jhuber6 I think this or one of your other openmp commits has caused the Driver/cuda-openmp-driver.cu test failure here: https://lab.llvm.org/buildbot/#/builders/214/builds/1274/steps/6/logs/stdio

In D125256#3513596, @RKSimon wrote:

@jhuber6 I think this or one of your other openmp commits has caused the Driver/cuda-openmp-driver.cu test failure here: https://lab.llvm.org/buildbot/#/builders/214/builds/1274/steps/6/logs/stdio

Is that still failing? I saw another build-bot fail on that test as well, so I pushed a quick change and it went green. When I check a more recent build there it doesn't show the test failing.

Sorry - my mistake - its a different test failure now! Nothing to do with openmp.

Revision Contents

Path

Size

clang/

lib/

Basic/

Targets/

NVPTX.cpp

2 lines

test/

OpenMP/

driver-openmp-target.c

4 lines

Diff 429305

clang/lib/Basic/Targets/NVPTX.cpp

Show First 20 Lines • Show All 173 Lines • ▼ Show 20 Lines	return llvm::StringSwitch<bool>(Feature)
.Cases("ptx", "nvptx", true)		.Cases("ptx", "nvptx", true)
.Default(false);		.Default(false);
}		}

void NVPTXTargetInfo::getTargetDefines(const LangOptions &Opts,		void NVPTXTargetInfo::getTargetDefines(const LangOptions &Opts,
MacroBuilder &Builder) const {		MacroBuilder &Builder) const {
Builder.defineMacro("__PTX__");		Builder.defineMacro("__PTX__");
Builder.defineMacro("__NVPTX__");		Builder.defineMacro("__NVPTX__");
if (Opts.CUDAIsDevice) {		if (Opts.CUDAIsDevice \|\| Opts.OpenMPIsDevice) {
// Set __CUDA_ARCH__ for the GPU specified.		// Set __CUDA_ARCH__ for the GPU specified.
std::string CUDAArchCode = [this] {		std::string CUDAArchCode = [this] {
switch (GPU) {		switch (GPU) {
case CudaArch::GFX600:		case CudaArch::GFX600:
case CudaArch::GFX601:		case CudaArch::GFX601:
case CudaArch::GFX602:		case CudaArch::GFX602:
case CudaArch::GFX700:		case CudaArch::GFX700:
case CudaArch::GFX701:		case CudaArch::GFX701:
▲ Show 20 Lines • Show All 85 Lines • Show Last 20 Lines

clang/test/OpenMP/driver-openmp-target.c

	// REQUIRES: x86-registered-target			// REQUIRES: x86-registered-target
				// REQUIRES: nvptx-registered-target
	// REQUIRES: clang-target-64-bits			// REQUIRES: clang-target-64-bits

	// RUN: %clang %s -c -E -dM -fopenmp=libomp -fopenmp-version=45 -fopenmp-targets=x86_64-unknown-unknown -o - \| FileCheck --check-prefix=CHECK-45-VERSION %s			// RUN: %clang %s -c -E -dM -fopenmp=libomp -fopenmp-version=45 -fopenmp-targets=x86_64-unknown-unknown -o - \| FileCheck --check-prefix=CHECK-45-VERSION %s
	// CHECK-45-VERSION: #define _OPENMP 201511			// CHECK-45-VERSION: #define _OPENMP 201511
				// RUN: %clang %s -c -E -dM -fopenmp=libomp -nogpulib --offload-arch=sm_70 --offload-device-only -o - \| FileCheck --check-prefix=CHECK-CUDA-ARCH %s
				// CHECK-CUDA-ARCH: #define __CUDA_ARCH__ 700