This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
openmp/libomptarget/
-
libomptarget/
-
plugins/cuda/
-
cuda/
-
dynamic_cuda/
-
cuda.h
-
src/
-
rtl.cpp
-
test/offloading/
-
offloading/
-
cuda_no_devices.c

Differential D130371

[Libomptarget] Don't report lack of CUDA devices
ClosedPublic

Authored by jdenny on Jul 22 2022, 10:13 AM.

Download Raw Diff

Details

Reviewers

jdoerfert
jhuber6
RaviNarayanaswamy
tianshilei1992
JonChesterfield

Commits

rGcfa6e79df30c: [Libomptarget] Don't report lack of CUDA devices

Summary

Sometimes libomptarget's CUDA plugin produces unhelpful diagnostics
about a lack of CUDA devices before an application runs:

$ clang -fopenmp -fopenmp-targets=amdgcn-amd-amdhsa hello-world.c
$ ./a.out
CUDA error: Error returned from cuInit
CUDA error: no CUDA-capable device is detected
Hello World: 4

This can happen when the CUDA plugin was built but all CUDA devices
are currently disabled in some manner, perhaps because
CUDA_VISIBLE_DEVICES is set to the empty string. As shown in the
above example, it can even happen when we haven't compiled the
application for offloading to CUDA.

The following code from openmp/libomptarget/plugins/cuda/src/rtl.cpp
appears to be intended to handle this case, and it chooses not to
write a diagnostic to stderr unless debugging is enabled:

if (NumberOfDevices == 0) {
  DP("There are no devices supporting CUDA.\n");
  return;
}

The problem is that the above code is never reached because the
earlier cuInit returns CUDA_ERROR_NO_DEVICE. This patch handles
that cuInit case in the same manner as the above code handles the
NumberOfDevices == 0 case.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

jdenny created this revision.Jul 22 2022, 10:13 AM

Herald added a project: Restricted Project. · View Herald TranscriptJul 22 2022, 10:13 AM

Herald added subscribers: kosarev, mattd, yaxunl. · View Herald Transcript

jdenny requested review of this revision.Jul 22 2022, 10:13 AM

Herald added a subscriber: sstefan1. · View Herald TranscriptJul 22 2022, 10:13 AM

Harbormaster completed remote builds in B177048: Diff 446863.Jul 22 2022, 10:20 AM

LGTM Thanks for the improvement!

This revision is now accepted and ready to land.Jul 22 2022, 10:34 AM

This revision was landed with ongoing or failed builds.Jul 22 2022, 11:50 AM

Closed by commit rGcfa6e79df30c: [Libomptarget] Don't report lack of CUDA devices (authored by jdenny). · Explain Why

This revision was automatically updated to reflect the committed changes.

jdenny added a commit: rGcfa6e79df30c: [Libomptarget] Don't report lack of CUDA devices.

Thanks for the quick review.

I guess you machine has the nvidia driver installed but there is no GPU.
When there is no nvidia driver,

Libomptarget --> Loading library 'libomptarget.rtl.cuda.so'...
Libomptarget --> Unable to load library 'libomptarget.rtl.cuda.so': libcuda.so.1: cannot open shared object file: No such file or directory!

In D130371#3700740, @ye-luo wrote:

I guess you machine has the nvidia driver installed but there is no GPU.

On my laptop, I saw the problem when I just disabled the (discrete) nvidia gpu in favor of integrated graphics... or when I set CUDA_VISIBLE_DEVICES to the empty string.

The machine that originally motivated this change has cuda installed but not the nvidia driver. This patch helped that case too. However, that machine also experienced other strange behavior I don't have any more time right now to pursue, and I ultimately recommended -DLIBOMPTARGET_BUILD_CUDA_PLUGIN=False to get around it. (I would have reported the behavior upstream, but it might be specific to Clacc.) Anyway, my point is that I'm not sure yet that things always work right with a disabled nvidia driver.

In D130371#3706544, @jdenny wrote:

In D130371#3700740, @ye-luo wrote:

I guess you machine has the nvidia driver installed but there is no GPU.

On my laptop, I saw the problem when I just disabled the (discrete) nvidia gpu in favor of integrated graphics... or when I set CUDA_VISIBLE_DEVICES to the empty string.

The machine that originally motivated this change has cuda installed but not the nvidia driver. This patch helped that case too. However, that machine also experienced other strange behavior I don't have any more time right now to pursue, and I ultimately recommended -DLIBOMPTARGET_BUILD_CUDA_PLUGIN=False to get around it. (I would have reported the behavior upstream, but it might be specific to Clacc.) Anyway, my point is that I'm not sure yet that things always work right with a disabled nvidia driver.

Thanks for the info. With your patch
clang++ -fopenmp --offload-arch=sm_80,gfx906 main.cpp
CUDA_VISIBLE_DEVICES="" ./a.out # runs fine on the AMD GPU.
so it is good.

Revision Contents

Path

Size

openmp/

libomptarget/

plugins/

cuda/

dynamic_cuda/

cuda.h

1 line

src/

rtl.cpp

4 lines

test/

offloading/

cuda_no_devices.c

20 lines

Diff 446917

openmp/libomptarget/plugins/cuda/dynamic_cuda/cuda.h

	Show All 21 Lines
	typedef struct CUctx_st *CUcontext;			typedef struct CUctx_st *CUcontext;
	typedef struct CUfunc_st *CUfunction;			typedef struct CUfunc_st *CUfunction;
	typedef struct CUstream_st *CUstream;			typedef struct CUstream_st *CUstream;
	typedef struct CUevent_st *CUevent;			typedef struct CUevent_st *CUevent;

	typedef enum cudaError_enum {			typedef enum cudaError_enum {
	CUDA_SUCCESS = 0,			CUDA_SUCCESS = 0,
	CUDA_ERROR_INVALID_VALUE = 1,			CUDA_ERROR_INVALID_VALUE = 1,
				CUDA_ERROR_NO_DEVICE = 100,
	CUDA_ERROR_INVALID_HANDLE = 400,			CUDA_ERROR_INVALID_HANDLE = 400,
	} CUresult;			} CUresult;

	typedef enum CUstream_flags_enum {			typedef enum CUstream_flags_enum {
	CU_STREAM_DEFAULT = 0x0,			CU_STREAM_DEFAULT = 0x0,
	CU_STREAM_NON_BLOCKING = 0x1,			CU_STREAM_NON_BLOCKING = 0x1,
	} CUstream_flags;			} CUstream_flags;

	▲ Show 20 Lines • Show All 228 Lines • Show Last 20 Lines

openmp/libomptarget/plugins/cuda/src/rtl.cpp

Show First 20 Lines • Show All 501 Lines • ▼ Show 20 Lines	DeviceRTLTy()
DP("Start initializing CUDA\n");		DP("Start initializing CUDA\n");

CUresult Err = cuInit(0);		CUresult Err = cuInit(0);
if (Err == CUDA_ERROR_INVALID_HANDLE) {		if (Err == CUDA_ERROR_INVALID_HANDLE) {
// Can't call cuGetErrorString if dlsym failed		// Can't call cuGetErrorString if dlsym failed
DP("Failed to load CUDA shared library\n");		DP("Failed to load CUDA shared library\n");
return;		return;
}		}
		if (Err == CUDA_ERROR_NO_DEVICE) {
		DP("There are no devices supporting CUDA.\n");
		return;
		}
if (!checkResult(Err, "Error returned from cuInit\n")) {		if (!checkResult(Err, "Error returned from cuInit\n")) {
return;		return;
}		}

Err = cuDeviceGetCount(&NumberOfDevices);		Err = cuDeviceGetCount(&NumberOfDevices);
if (!checkResult(Err, "Error returned from cuDeviceGetCount\n"))		if (!checkResult(Err, "Error returned from cuDeviceGetCount\n"))
return;		return;

▲ Show 20 Lines • Show All 1,341 Lines • Show Last 20 Lines

openmp/libomptarget/test/offloading/cuda_no_devices.c

This file was added.

				// The CUDA plugin used to complain on stderr when no CUDA devices were enabled,
				// and then it let the application run anyway. Check that there's no such
				// complaint anymore, especially when the user isn't targeting CUDA.

				// RUN: %libomptarget-compile-generic
				// RUN: env CUDA_VISIBLE_DEVICES= \
				// RUN: %libomptarget-run-generic 2>&1 \| %fcheck-generic

				#include <stdio.h>

				// CHECK-NOT: {{.}}
				// CHECK: Hello World: 4
				// CHECK-NOT: {{.}}
				int main() {
				int x = 0;
				#pragma omp target teams num_teams(2) reduction(+:x)
				x += 2;
				printf("Hello World: %d\n", x);
				return 0;
				}