Asm is a gnu extension for C, so at present -fopenmp -std=c99
and similar fail to compile on nvptx, bug 51344
Changing to __asm__ or __asm works for openmp, all three appear to work
for cuda. Suggesting __asm__ here as __asm is used by MSVC with different
syntax, so this should make for better error diagnostics if the header is
passed to a compiler other than clang.
clang-tidy: error: This file is intended to be used with CUDA-9+ only. [clang-diagnostic-error]
not useful