Move the -emit-pch option to the clang driver flags, so that HIP can emit PCH files when requested in the driver. This works on both the device and host paths. Introduces a new TY_HIPHeader type, which is used for the Precompile phase list.
Need tests for --cuda-host-only and default host/device compilation, and tests for C/C++.
clang/include/clang/Driver/Types.def
  Line 47: We should not pollute the normal phases for a HIP program with Precompile. BTW, can you fix the extensions for HIP and HIP_DEVICE, which should be "hip" instead of "cu"?
  Line 64: I would suggest adding "hip-header" and "hip-header-cpp-output", like "c++-header" and "c++-header-cpp-output". Basically, we will only allow -emit-pch on "hip-header".
clang/lib/Driver/Driver.cpp
  Line 2816: typo: Actoms
  Line 3625: We need to support -emit-pch in host-only compilation too. And not just for HIP, but for C and C++ too, since it is supposed to be a generic option.
clang/test/Driver/hip-device-compile.hip
  Line 31 (On Diff #290572): Please remove the --hip-device-lib options and add -nogpulib.
Looking into the C/C++ tests.
clang/include/clang/Driver/Types.def
  Line 64: Sounds good. I had tried this approach earlier, and the new diff contains this implementation.
Can you elaborate on the use case of PCH files for CUDA/HIP?
Is -emit-pch only expected to work with a single sub-compilation, similarly to how we handle -S? The tests appear to imply so. If that's the case, it would be great to add a test verifying that -emit-pch fails if it's used with more than one sub-compilation.
What will happen if I attempt to use PCH compiled for one GPU variant during compilation targeting a different variant? I know that clang does complain if the compilation uses different options (some of them?) compared to the options used during compilation that produced the PCH. I'm not sure whether it will be sufficient to prevent use of PCH for a wrong GPU. It would be great to have a test for that, too.
I believe one use-case for PCH is for common include headers such as hip_runtime.h which is being re-used in many application source files. To improve the performance, we can pre-compile the header and re-use it during online compilation.
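To make that workflow concrete, here is a sketch of how a shared header could be precompiled once and then reused. The file names and the gfx900 arch are illustrative (not from the patch's tests), and the exact spelling of the input type (-x hip here vs. the hip-header type proposed elsewhere in this review) is still under discussion:

```shell
# Precompile a shared header once, for a single device sub-compilation
# (hypothetical file names; requires a HIP/ROCm toolchain).
clang++ -x hip --cuda-device-only --cuda-gpu-arch=gfx900 \
  -emit-pch common_includes.h -o common_includes.pch

# Reuse the PCH when compiling application sources for the same arch.
clang++ -x hip --cuda-device-only --cuda-gpu-arch=gfx900 \
  -include-pch common_includes.pch -c app.hip -o app.o
```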
Is -emit-pch only expected to work with a single sub-compilation, similarly to how we handle -S? The tests appear to imply so. If that's the case, it would be great to add a test verifying that -emit-pch fails if it's used with more than one sub-compilation.
This patch seems to only work for a single sub-compilation. I've found that using -emit-pch for host+device generates a PCH wrapped by clang-offload-bundler. The ASTReader cannot understand the clang-offload-bundler output; it only looks for the PCH magic number (handling that could be future work). So for now, this will only work when the PCH is generated under the --cuda-host-only or --cuda-device-only options. I will update the tests to reflect this. We can, however, generate multiple PCH files for multiple device archs.
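The supported vs. unsupported modes described above can be sketched as follows (file names are illustrative):

```shell
# Supported: one sub-compilation per -emit-pch run.
clang++ -x hip --cuda-host-only -emit-pch a.hip -o a_host.pch
clang++ -x hip --cuda-device-only --cuda-gpu-arch=gfx900 \
  -emit-pch a.hip -o a_gfx900.pch

# Not supported by this patch: the default combined host+device run,
# which would wrap the output in a clang-offload-bundler bundle that
# the ASTReader cannot read (it only recognizes the PCH magic number).
```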
What will happen if I attempt to use PCH compiled for one GPU variant during compilation targeting a different variant? I know that clang does complain if the compilation uses different options (some of them?) compared to the options used during compilation that produced the PCH. I'm not sure whether it will be sufficient to prevent use of PCH for a wrong GPU. It would be great to have a test for that, too.
Right, clang will complain if the PCH used was compiled for a different GPU. I will add a test to check for this. We will see an error like:
error: PCH file was compiled for the target CPU 'gfx900' but the current translation unit is being compiled for target 'gfx803'
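The negative test can be sketched like this (illustrative file names; the error text is the one quoted above):

```shell
# Build a PCH targeting gfx900...
clang++ -x hip --cuda-device-only --cuda-gpu-arch=gfx900 \
  -emit-pch a.hip -o a.pch

# ...then try to consume it while targeting gfx803; clang rejects it:
clang++ -x hip --cuda-device-only --cuda-gpu-arch=gfx803 \
  -include-pch a.pch -c b.hip
# error: PCH file was compiled for the target CPU 'gfx900' but the
# current translation unit is being compiled for target 'gfx803'
```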
That would be potentially useful if it could be used from a normal compilation, but it can't. Single sub-compilation is a very, very small niche.
I'm OK with making -emit-pch work for GPUs, but considering the very limited use case and the fact that the generated PCH will be wrong more often than not (i.e., it will be usable for only 1 out of N sub-compilations for a particular TU), I would rather keep -emit-pch a CC1-only option. Those who need it should be able to use it via -Xclang -emit-pch, and for most regular users it does not matter.
@rsmith - WDYT?
Updated the tests to use the --cuda-host-only or --cuda-device-only options when using -emit-pch. Added more tests for compilation when using -include-pch. Also added a negative test for using a PCH built for a different GPU variant during compilation.
Having -emit-pch in the clang driver is useful because it doesn't require users to specify the standard C++ include paths, clang include paths, and CUDA/HIP wrapper headers needed by CC1. Doing that manually is error-prone for the user.
Also, this device compilation is not a niche: it is needed for nvrtc/hiprtc, and HIP applications can perform device-only compilations at either compile time or run time.
I didn't mean to invoke -cc1 directly, but rather to pass -emit-pch via -Xclang -emit-pch. No need to provide *all* CC1 options manually.
Also, this device compilation is not a niche: it is needed for nvrtc/hiprtc, and HIP applications can perform device-only compilations at either compile time or run time.
I'll leave it up to @rsmith. I'm not quite convinced that making -emit-pch a top-level option is the right thing to do yet.
Added a test checking for an error when the -o option is used for a multi-device -emit-pch run.
Added a diagnostic for mixing the device and host paths with -emit-pch. Currently we don't support this path, since the generated PCH would be a clang-offload-bundler bundle (supporting that would require the ASTReader to understand clang offload bundles, which is outside the scope of this patch). Added tests checking that the error diagnostic is reported when running both paths with -emit-pch.
@tra, I tried to invoke clang with -Xclang -emit-pch, but the -x hip path doesn't know about the Precompile phase. It does pass -emit-pch to the cc1 command, but it is overridden by the default -emit-obj flag passed to cc1 in the compiler phase. Also, from there the compilation goes through the backend, assembler, and linker phases, which we would need to disable for this method to work. Here is the experiment:
root@e6915ef660c7:~/llvm-project/build_rel# ./bin/clang++ -x hip --cuda-device-only --cuda-gpu-arch=gfx803 -Xclang -emit-pch a.hip -o a.hip.pch -ccc-print-bindings
# "amdgcn-amd-amdhsa" - "clang", inputs: ["a.hip"], output: "/tmp/a-e15936.o"
# "amdgcn-amd-amdhsa" - "AMDGCN::Linker", inputs: ["/tmp/a-e15936.o"], output: "/tmp/a-b03f5d.out"
# "amdgcn-amd-amdhsa" - "AMDGCN::Linker", inputs: ["/tmp/a-b03f5d.out"], output: "a.hip.pch"
clang-12: warning: argument unused during compilation: '-Xclang -emit-pch' [-Wunused-command-line-argument]

root@e6915ef660c7:~/llvm-project/build_rel# ./bin/clang++ -x hip --cuda-device-only --cuda-gpu-arch=gfx803 -Xclang -emit-pch a.hip -o a.hip.pch -ccc-print-phases
+- 0: input, "a.hip", hip, (device-hip, gfx803)
+- 1: preprocessor, {0}, hip-cpp-output, (device-hip, gfx803)
+- 2: compiler, {1}, ir, (device-hip, gfx803)
+- 3: backend, {2}, assembler, (device-hip, gfx803)
+- 4: assembler, {3}, object, (device-hip, gfx803)
+- 5: linker, {4}, image, (device-hip, gfx803)
+- 6: offload, "device-hip (amdgcn-amd-amdhsa:gfx803)" {5}, image
+- 7: linker, {6}, hip-fatbin, (device-hip, unknown)
8: offload, "device-hip (amdgcn-amd-amdhsa:unknown)" {7}, hip-fatbin

root@e6915ef660c7:~/llvm-project/build_rel# ./bin/clang++ -x hip --cuda-device-only --cuda-gpu-arch=gfx803 -Xclang -emit-pch a.hip -o a.hip.pch -###
clang version 12.0.0 (https://github.com/llvm/llvm-project.git 18698802f1075c3dbb8d51ed6c2e59c2108bf260)
Target: x86_64-unknown-linux-gnu
Thread model: posix
InstalledDir: /root/llvm-project/build_rel/./bin
"/root/llvm-project/build_rel/bin/clang-12" "-cc1" "-triple" "amdgcn-amd-amdhsa" "-aux-triple" "x86_64-unknown-linux-gnu" "-emit-obj" "-mrelax-all" "--mrelax-relocations" "-disable-free" "-main-file-name" "a.hip" "-mrelocation-model" "pic" "-pic-level" "1" "-mframe-pointer=all" "-fdenormal-fp-math-f32=preserve-sign,preserve-sign" "-fno-rounding-math" "-mconstructor-aliases" "-aux-target-cpu" "x86-64" "-fcuda-is-device" "-mllvm" "-amdgpu-internalize-symbols" "-fcuda-allow-variadic-functions" "-fvisibility" "hidden" "-fapply-global-visibility-to-externs" "-mlink-builtin-bitcode" "/opt/rocm/lib/hip.amdgcn.bc" "-mlink-builtin-bitcode" "/opt/rocm/lib/ocml.amdgcn.bc" "-mlink-builtin-bitcode" "/opt/rocm/lib/ockl.amdgcn.bc" "-mlink-builtin-bitcode" "/opt/rocm/lib/oclc_daz_opt_on.amdgcn.bc" "-mlink-builtin-bitcode" "/opt/rocm/lib/oclc_unsafe_math_off.amdgcn.bc" "-mlink-builtin-bitcode" "/opt/rocm/lib/oclc_finite_only_off.amdgcn.bc" "-mlink-builtin-bitcode" "/opt/rocm/lib/oclc_correctly_rounded_sqrt_on.amdgcn.bc" "-mlink-builtin-bitcode" "/opt/rocm/lib/oclc_wavefrontsize64_on.amdgcn.bc" "-mlink-builtin-bitcode" "/opt/rocm/lib/oclc_isa_version_803.amdgcn.bc" "-target-cpu" "gfx803" "-fno-split-dwarf-inlining" "-debugger-tuning=gdb" "-resource-dir" "/root/llvm-project/build_rel/lib/clang/12.0.0" "-internal-isystem" "/root/llvm-project/build_rel/lib/clang/12.0.0/include/cuda_wrappers" "-internal-isystem" "/opt/rocm/include" "-include" "__clang_hip_runtime_wrapper.h" "-internal-isystem" "/usr/lib/gcc/x86_64-linux-gnu/7.5.0/../../../../include/c++/7.5.0" "-internal-isystem" "/usr/lib/gcc/x86_64-linux-gnu/7.5.0/../../../../include/x86_64-linux-gnu/c++/7.5.0" "-internal-isystem" "/usr/lib/gcc/x86_64-linux-gnu/7.5.0/../../../../include/x86_64-linux-gnu/c++/7.5.0" "-internal-isystem" "/usr/lib/gcc/x86_64-linux-gnu/7.5.0/../../../../include/c++/7.5.0/backward" "-internal-isystem" "/usr/lib/gcc/x86_64-linux-gnu/7.5.0/../../../../include/c++/7.5.0" "-internal-isystem" "/usr/lib/gcc/x86_64-linux-gnu/7.5.0/../../../../include/x86_64-linux-gnu/c++/7.5.0" "-internal-isystem" "/usr/lib/gcc/x86_64-linux-gnu/7.5.0/../../../../include/x86_64-linux-gnu/c++/7.5.0" "-internal-isystem" "/usr/lib/gcc/x86_64-linux-gnu/7.5.0/../../../../include/c++/7.5.0/backward" "-internal-isystem" "/usr/local/include" "-internal-isystem" "/root/llvm-project/build_rel/lib/clang/12.0.0/include" "-internal-externc-isystem" "/usr/include/x86_64-linux-gnu" "-internal-externc-isystem" "/include" "-internal-externc-isystem" "/usr/include" "-internal-isystem" "/usr/local/include" "-internal-isystem" "/root/llvm-project/build_rel/lib/clang/12.0.0/include" "-internal-externc-isystem" "/usr/include/x86_64-linux-gnu" "-internal-externc-isystem" "/include" "-internal-externc-isystem" "/usr/include" "-std=c++11" "-fdeprecated-macro" "-fno-autolink" "-fdebug-compilation-dir" "/root/llvm-project/build_rel" "-ferror-limit" "19" "-fhip-new-launch-api" "-fgnuc-version=4.2.1" "-fcxx-exceptions" "-fexceptions" "-fcolor-diagnostics" "-emit-pch" "-fcuda-allow-variadic-functions" "-faddrsig" "-o" "/tmp/a-fb5984.o" "-x" "hip" "a.hip"
"/root/llvm-project/build_rel/./bin/lld" "-flavor" "gnu" "--no-undefined" "-shared" "-plugin-opt=-amdgpu-internalize-symbols" "-plugin-opt=mcpu=gfx803" "-o" "/tmp/a-f7f74d.out" "/tmp/a-fb5984.o"
"/root/llvm-project/build_rel/./bin/clang-offload-bundler" "-type=o" "-targets=host-x86_64-unknown-linux,hip-amdgcn-amd-amdhsa-gfx803" "-inputs=/dev/null,/tmp/a-f7f74d.out" "-outputs=a.hip.pch"

root@e6915ef660c7:~/llvm-project/build_rel# ./bin/clang++ -x hip --cuda-device-only --cuda-gpu-arch=gfx803 -Xclang -emit-pch a.hip -o a.hip.pch
lld: error: /tmp/a-2d2a47.o:22693: unclosed quote
clang-12: error: amdgcn-link command failed with exit code 1 (use -v to see invocation)
If you add -S and remove -ccc-print-bindings, this command does produce a PCH.
-Xclang -emit-pch does not change what the top-level driver does. You do need to tell the driver not to do too much. -S prevents the additional bundling/linking steps. For regular C++ compilation, -c would work, too.
Caveat -- while the command does produce the PCH, I have no idea whether that's the correct way to do it.
@tra, thanks. I've tried the options -S -Xclang -emit-pch and -c -emit-llvm -Xclang -emit-pch, and they do generate a PCH. This workaround seems to work, and I am trying to test it.
But, same as you, I'm not sure this is the correct way to do it. It feels a bit hacky, and it's uncertain whether this will continue to work in the future.
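For reference, the two workaround invocations tried above look like this (illustrative input/output names; both stop the driver before the bundling/linking steps so the cc1 -emit-pch output survives):

```shell
# Workaround 1: stop after the compile phase with -S.
clang++ -x hip --cuda-device-only --cuda-gpu-arch=gfx803 \
  -S -Xclang -emit-pch a.hip -o a.hip.pch

# Workaround 2: same idea with -c -emit-llvm.
clang++ -x hip --cuda-device-only --cuda-gpu-arch=gfx803 \
  -c -emit-llvm -Xclang -emit-pch a.hip -o a.hip.pch
```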