This is an archive of the discontinued LLVM Phabricator instance.

Differential D116541

[OpenMP] Introduce new flag to change offloading driver pipeline
ClosedPublic

Authored by jhuber6 on Jan 3 2022, 9:40 AM.

Details

Reviewers

jdoerfert
gregrodgers
JonChesterfield
ronlieb

Commits

rG2f9ace9e9a58: [OpenMP] Introduce new flag to change offloading driver pipeline

Summary

This patch introduces the -fopenmp-new-driver option which instructs
the compiler to use a new driver scheme for producing offloading code.
In this scheme we create a complete offloading object file and then pass
it as input to the host compilation phase. This will allow us to embed
the object code in the backend phase.

This is the start of a series of commits to rework the OpenMP offloading driver
pipeline. The goal of this is to simplify the steps required for creating an
offloading program. This patch changes the driver's configuration to simply pass
the device file back to the host as an input so it can be embedded as an LLVM IR
global during the backend, then simply passes that object file to the linker.

This driver implementation currently creates the following phases:

$ clang input.c -fopenmp -fopenmp-targets=nvptx64 -fopenmp-new-driver -ccc-print-phases
               +- 0: input, "input.c", c, (host-openmp)
            +- 1: preprocessor, {0}, cpp-output, (host-openmp)
         +- 2: compiler, {1}, ir, (host-openmp)
         |        |     +- 3: input, "input.c", c, (device-openmp)
         |        |  +- 4: preprocessor, {3}, cpp-output, (device-openmp)
         |        |- 5: compiler, {4}, ir, (device-openmp)
         |     +- 6: offload, "host-openmp (x86_64-unknown-linux-gnu)" {2}, "device-openmp (nvptx64)" {5}, ir
         |  +- 7: backend, {6}, assembler, (device-openmp)
         |- 8: assembler, {7}, object, (device-openmp)
      +- 9: offload, "host-openmp (x86_64-unknown-linux-gnu)" {2}, "device-openmp (nvptx64)" {8}, ir
   +- 10: backend, {9}, assembler, (host-openmp)
+- 11: assembler, {10}, object, (host-openmp)
12: clang-linker-wrapper, {11}, image, (host-openmp)

These phases map to the following bindings:

# "x86_64-unknown-linux-gnu" - "clang", inputs: ["input.c"], output: "/tmp/input-bae62e.bc"
# "nvptx64" - "clang", inputs: ["input.c", "/tmp/input-bae62e.bc"], output: "/tmp/input-76784e.s"
# "nvptx64" - "NVPTX::Assembler", inputs: ["/tmp/input-76784e.s"], output: "/tmp/input-8f29db.o"
# "x86_64-unknown-linux-gnu" - "clang", inputs: ["/tmp/input-bae62e.bc", "/tmp/input-8f29db.o"], output: "/tmp/input-545450.o"
# "x86_64-unknown-linux-gnu" - "Offload::Linker", inputs: ["/tmp/input-545450.o"], output: "a.out"

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

jhuber6 created this revision. · Jan 3 2022, 9:40 AM

Herald added subscribers: ormris, dexonsmith, dang and 4 others. · Jan 3 2022, 9:40 AM

jhuber6 requested review of this revision. · Jan 3 2022, 9:40 AM

Herald added projects: Restricted Project, Restricted Project. · Jan 3 2022, 9:40 AM

Herald added subscribers: llvm-commits, cfe-commits, sstefan1.

Updating to only contain this commit.

jhuber6 edited the summary of this revision. · Jan 3 2022, 9:53 AM

Herald added a subscriber: pengfei. · Jan 3 2022, 9:53 AM

jhuber6 added a child revision: D116542: [OpenMP] Add a flag for embedding a file into the module. · Jan 3 2022, 9:55 AM

Harbormaster completed remote builds in B141347: Diff 397089. · Jan 3 2022, 10:16 AM

ormris removed a subscriber: ormris. · Jan 18 2022, 10:18 AM

This looks reasonable to me, though I'd prefer we keep the forward declare. The scalar->vector transform is mechanical and the if (Args.hasArg(options::OPT_fopenmp_new_driver)) cruft will disappear when we return to a single driver implementation.

clang/include/clang/Driver/Driver.h
45	This looks like it should be a breaking change - InputInfo is no longer forward declared. Would it be reasonable to keep the forward declaration and put the typedef between class statements and the LTOKind enum?

This revision is now accepted and ready to land. · Jan 26 2022, 9:55 AM

jhuber6 added inline comments. · Jan 26 2022, 11:16 AM

clang/include/clang/Driver/Driver.h
45	We can't forward declare a struct and use it by value in a container; I would need to change it to a pointer. It's doable, but I don't think it's ideal. I'm not sure why this would break anything: the forward declarations were simply there to avoid including more files here. I could be wrong, however.
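The general C++ rule behind this exchange can be shown with a minimal sketch. It assumes a hypothetical `InlineList` template that, like a small-size-optimized vector, stores its elements by value inside the container object; `InlineList`, `Info`, `HoldsPointers`, and `HoldsValues` are illustrative names, not clang driver types. A member of such a container needs the complete element type, while a container of pointers works with only a forward declaration.

```cpp
#include <array>
#include <cassert>
#include <cstddef>
#include <string>

// Hypothetical container with inline, by-value storage (illustration only;
// this is not llvm::SmallVector).
template <typename T, std::size_t N> struct InlineList {
  std::array<T, N> Items{}; // stored by value, so sizeof(T) must be known
  std::size_t Size = 0;

  // Appends V if there is room; returns false when the list is full.
  bool push_back(const T &V) {
    if (Size == N)
      return false;
    Items[Size++] = V;
    return true;
  }
  const T &operator[](std::size_t I) const { return Items[I]; }
};

struct Info; // forward declaration only: Info is incomplete here

// Fine: a pointer to an incomplete type has a known size.
struct HoldsPointers {
  InlineList<const Info *, 4> Refs;
};

// Would not compile at this point, because Info is still incomplete:
//   struct HoldsValues { InlineList<Info, 4> Values; };

struct Info {
  std::string Filename;
};

// Fine now: Info is complete, so the by-value storage can be laid out.
struct HoldsValues {
  InlineList<Info, 4> Values;
};
```

In this sketch the choice is the one described in the comment above: either include the full definition before the by-value member, or keep the forward declaration and store pointers instead.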

dexonsmith removed a subscriber: dexonsmith. · Jan 31 2022, 11:21 AM

jhuber6 added a child revision: D118637: [Libomptarget] Run GPU offloading tests using the new driver. · Jan 31 2022, 11:43 AM

This revision was landed with ongoing or failed builds. · Jan 31 2022, 12:56 PM

Closed by commit rG2f9ace9e9a58: [OpenMP] Introduce new flag to change offloading driver pipeline (authored by jhuber6).

This revision was automatically updated to reflect the committed changes.

jhuber6 added a commit: rG2f9ace9e9a58: [OpenMP] Introduce new flag to change offloading driver pipeline.

Looks like this breaks tests on macOS: http://45.33.8.238/macm1/26856/step_7.txt

Please take a look and revert for now if it takes a while to fix (maybe just needs an explicit triple?)

In D116541#3285455, @thakis wrote:

Looks like this breaks tests on macOS: http://45.33.8.238/macm1/26856/step_7.txt

Please take a look and revert for now if it takes a while to fix (maybe just needs an explicit triple?)

Not sure what I expected when I hard-coded the host-triple in the test. I pushed a change in rGb79e2a1ccd3b, can you check it again?

Still failing: http://45.33.8.238/macm1/26873/step_7.txt

In D116541#3285927, @thakis wrote:

Still failing: http://45.33.8.238/macm1/26873/step_7.txt

Weird, can you show me what -fopenmp -fopenmp-targets=nvptx64 -fopenmp-new-driver -ccc-print-bindings looks like there? It doesn't seem to be getting one of the expected arguments, but I'm not sure why or how to reproduce it.

In D116541#3285927, @thakis wrote:

Still failing: http://45.33.8.238/macm1/26873/step_7.txt

It seems what's happening here is that we are building the host.bc twice. This will work fine but isn't ideal. I prevent this manually by checking the cache to see whether one of the jobs was already created, but for some reason that doesn't seem to be happening here. I'll need to figure out how to reproduce this.
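The cache check described here can be sketched as a memoized lookup. This is a simplified illustration, not the driver's code: `Action`, `OutputList`, `JobBuilder`, and `buildJobs` are hypothetical stand-ins, and the only detail taken from the diff is that cached results are stored in a `std::map` keyed on a `(const Action *, std::string)` pair. How the real driver builds the string part of the key is not shown in this revision, so a plain triple string is assumed.

```cpp
#include <cassert>
#include <map>
#include <string>
#include <utility>
#include <vector>

// Hypothetical, simplified stand-ins; not the clang driver's real types.
struct Action {
  std::string Name;
};
using OutputList = std::vector<std::string>;
using JobCache = std::map<std::pair<const Action *, std::string>, OutputList>;

struct JobBuilder {
  JobCache Cache;
  int JobsBuilt = 0; // counts how often work was actually performed

  // Returns the outputs for (A, Triple), building them only on the first
  // request and reusing the cached list afterwards.
  const OutputList &buildJobs(const Action &A, const std::string &Triple) {
    auto Key = std::make_pair(&A, Triple);
    auto It = Cache.find(Key);
    if (It != Cache.end())
      return It->second; // cache hit: no second job is created

    ++JobsBuilt;
    OutputList Out{A.Name + "-" + Triple + ".bc"};
    return Cache.emplace(Key, std::move(Out)).first->second;
  }
};
```

In this sketch, a second request with the same action and key returns the first result. If the key differs between two requests for what is logically the same job, the lookup misses and the work is repeated, which is one way a file could end up being built twice.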

Tests have been failing on Mac for over 20 hours now. Time to revert and fix async?

 % bin/clang -### -fopenmp=libomp -fopenmp-targets=nvptx64-nvidia-cuda -fopenmp-new-driver -no-canonical-prefixes -ccc-print-bindings /Users/thakis/src/llvm-project/clang/test/Driver/openmp-offload-gpu.c -o openmp-offload-gpu -fopenmp -fopenmp-targets=nvptx64 -fopenmp-new-driver -ccc-print-bindings
clang version 14.0.0
Target: x86_64-apple-darwin19.6.0
Thread model: posix
InstalledDir: bin
# "x86_64-apple-darwin19.6.0" - "clang", inputs: ["/Users/thakis/src/llvm-project/clang/test/Driver/openmp-offload-gpu.c"], output: "/var/folders/qt/hxckwtm545l643cnk200wzt00000gn/T/openmp-offload-gpu-729b05.bc"
# "nvptx64-nvidia-cuda" - "clang", inputs: ["/Users/thakis/src/llvm-project/clang/test/Driver/openmp-offload-gpu.c", "/var/folders/qt/hxckwtm545l643cnk200wzt00000gn/T/openmp-offload-gpu-729b05.bc"], output: "/var/folders/qt/hxckwtm545l643cnk200wzt00000gn/T/openmp-offload-gpu-a35969.s"
# "nvptx64-nvidia-cuda" - "NVPTX::Assembler", inputs: ["/var/folders/qt/hxckwtm545l643cnk200wzt00000gn/T/openmp-offload-gpu-a35969.s"], output: "/var/folders/qt/hxckwtm545l643cnk200wzt00000gn/T/openmp-offload-gpu-a3f7f0.o"
# "x86_64-apple-darwin19.6.0" - "clang", inputs: ["/Users/thakis/src/llvm-project/clang/test/Driver/openmp-offload-gpu.c", "/var/folders/qt/hxckwtm545l643cnk200wzt00000gn/T/openmp-offload-gpu-a3f7f0.o"], output: "/var/folders/qt/hxckwtm545l643cnk200wzt00000gn/T/openmp-offload-gpu-f53552.bc"
# "x86_64-apple-darwin19.6.0" - "clang", inputs: ["/var/folders/qt/hxckwtm545l643cnk200wzt00000gn/T/openmp-offload-gpu-f53552.bc"], output: "/var/folders/qt/hxckwtm545l643cnk200wzt00000gn/T/openmp-offload-gpu-86f846.o"
# "x86_64-apple-darwin19.6.0" - "Offload::Linker", inputs: ["/var/folders/qt/hxckwtm545l643cnk200wzt00000gn/T/openmp-offload-gpu-86f846.o"], output: "openmp-offload-gpu"

In D116541#3287330, @thakis wrote:

Tests have been failing on Mac for over 20 hours now. Time to revert and fix async?

I'd rather just disable the test; this patch has about 20 others that depend on it, so we'd need to revert all of those. It's definitely not grabbing the bitcode from the cache as expected. I can disable this part of the test for now, but do you know how I could reproduce this? I've been tracking some other buildbots and they don't seem to have the same issue, so I'm not sure what's special about this one.

In D116541#3287342, @jhuber6 wrote:

In D116541#3287330, @thakis wrote:

Tests have been failing on Mac for over 20 hours now. Time to revert and fix async?

Just build and run tests on any mac. This fails on 3 different macs I tried (2x arm, 1x intel), in a bunch of different build configs.

For the particular build I sent the output from, the cmake invocation looked like /Applications/CMake.app/Contents/bin/cmake -GNinja -DCMAKE_BUILD_TYPE=Release -DLLVM_ENABLE_ASSERTIONS=ON -DLLVM_ENABLE_PROJECTS='compiler-rt;libcxx;clang' -DLLVM_APPEND_VC_REV=NO -DCMAKE_C_COMPILER=$HOME/src/chrome/src/third_party/llvm-build/Release+Asserts/bin/clang -DCMAKE_CXX_COMPILER=$HOME/src/chrome/src/third_party/llvm-build/Release+Asserts/bin/clang++ -DCMAKE_OSX_SYSROOT=/Users/thakis/src/llvm-project/sysroot/MacOSX.sdk -DDARWIN_macosx_CACHED_SYSROOT=/Users/thakis/src/llvm-project/sysroot/MacOSX.sdk -DDARWIN_iphoneos_CACHED_SYSROOT=/Users/thakis/src/llvm-project/sysroot/iPhoneOS.sdk -DDARWIN_iphonesimulator_CACHED_SYSROOT=/Users/thakis/src/llvm-project/sysroot/iPhoneSimulator.sdk ../llvm

PS: "This is inconvenient to revert since 20 patches landed" means you're landing too many patches too quickly. This landed less than 24h ago, so if it's difficult to revert, that's a bit of a problem for the project, right?

In D116541#3287379, @thakis wrote:

Just build and run tests on any mac. This fails on 3 different macs I tried (2x arm, 1x intel), in a bunch of different build configs.

I don't have access to a Mac right now. I'm just going to remove the problematic check line so this passes and add it back later once I figure out what's going on here. The output you're getting should still work; it's just not ideal because we're regenerating the bitcode file, so this isn't a breaking issue.

Removed the two lines in rG28c15341368b; let me know if this lets the tests pass. I'll look into getting access to a Mac somehow so I can reproduce this and figure it out.

Tests are passing again.

Revision Contents

Path                                      Size
clang/include/clang/Driver/Driver.h       51 lines
clang/include/clang/Driver/Options.td     2 lines
clang/lib/Driver/Driver.cpp               153 lines
clang/lib/Driver/ToolChains/Clang.cpp     4 lines
clang/test/Driver/openmp-offload-gpu.c    10 lines
Diff 404684

clang/include/clang/Driver/Driver.h

 //===--- Driver.h - Clang GCC Compatible Driver -----------------*- C++ -*-===//
 //
 // Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
 // See https://llvm.org/LICENSE.txt for license information.
 // SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
 //
 //===----------------------------------------------------------------------===//

 #ifndef LLVM_CLANG_DRIVER_DRIVER_H
 #define LLVM_CLANG_DRIVER_DRIVER_H

 #include "clang/Basic/Diagnostic.h"
 #include "clang/Basic/LLVM.h"
 #include "clang/Driver/Action.h"
+#include "clang/Driver/InputInfo.h"
 #include "clang/Driver/Options.h"
 #include "clang/Driver/Phases.h"
 #include "clang/Driver/ToolChain.h"
 #include "clang/Driver/Types.h"
 #include "clang/Driver/Util.h"
 #include "llvm/ADT/StringMap.h"
 #include "llvm/ADT/StringRef.h"
 #include "llvm/Option/Arg.h"
[10 unchanged lines not shown]
 class FileSystem;
 }
 } // namespace llvm

 namespace clang {

 namespace driver {

+typedef SmallVector<InputInfo, 4> InputInfoList;
+
 class Command;
 class Compilation;
-class InputInfo;
 class JobList;
 class JobAction;
 class SanitizerArgs;
 class ToolChain;

 /// Describes the kind of LTO mode selected via -f(no-)?lto(=.*)? options.
 enum LTOKind {
   LTOK_None,
   LTOK_Full,
   LTOK_Thin,
   LTOK_Unknown
 };
[110 unchanged lines not shown; context: public:]
   std::string CCPrintOptionsFilename;

   /// The file to log CC_PRINT_HEADERS output to, if enabled.
   std::string CCPrintHeadersFilename;

   /// The file to log CC_LOG_DIAGNOSTICS output to, if enabled.
   std::string CCLogDiagnosticsFilename;

+  /// An input type and its arguments.
+  using InputTy = std::pair<types::ID, const llvm::opt::Arg *>;
+
   /// A list of inputs and their types for the given arguments.
-  typedef SmallVector<std::pair<types::ID, const llvm::opt::Arg *>, 16>
-      InputList;
+  using InputList = SmallVector<InputTy, 16>;

   /// Whether the driver should follow g++ like behavior.
   bool CCCIsCXX() const { return Mode == GXXMode; }

   /// Whether the driver is just the preprocessor.
   bool CCCIsCPP() const { return Mode == CPPMode; }

   /// Whether the driver should follow gcc like behavior.
[223 unchanged lines not shown; context: public:]
   /// BuildUniversalActions - Construct the list of actions to perform
   /// for the given arguments, which may require a universal build.
   ///
   /// \param C - The compilation that is being built.
   /// \param TC - The default host tool chain.
   void BuildUniversalActions(Compilation &C, const ToolChain &TC,
                              const InputList &BAInputs) const;

+  /// BuildOffloadingActions - Construct the list of actions to perform for the
+  /// offloading toolchain that will be embedded in the host.
+  ///
+  /// \param C - The compilation that is being built.
+  /// \param Args - The input arguments.
+  /// \param Input - The input type and arguments
+  /// \param HostAction - The host action used in the offloading toolchain.
+  Action *BuildOffloadingActions(Compilation &C,
+                                 llvm::opt::DerivedArgList &Args,
+                                 const InputTy &Input,
+                                 Action *HostAction) const;
+
   /// Check that the file referenced by Value exists. If it doesn't,
   /// issue a diagnostic and return false.
   /// If TypoCorrect is true and the file does not exist, see if it looks
   /// like a likely typo for a flag and if so print a "did you mean" blurb.
   bool DiagnoseInputExistence(const llvm::opt::DerivedArgList &Args,
                               StringRef Value, types::ID Ty,
                               bool TypoCorrect) const;

[74 unchanged lines not shown; context: public:]
   Action *ConstructPhaseAction(
       Compilation &C, const llvm::opt::ArgList &Args, phases::ID Phase,
       Action *Input,
       Action::OffloadKind TargetDeviceOffloadKind = Action::OFK_None) const;

   /// BuildJobsForAction - Construct the jobs to perform for the action \p A and
   /// return an InputInfo for the result of running \p A. Will only construct
   /// jobs for a given (Action, ToolChain, BoundArch, DeviceKind) tuple once.
-  InputInfo
-  BuildJobsForAction(Compilation &C, const Action *A, const ToolChain *TC,
-                     StringRef BoundArch, bool AtTopLevel, bool MultipleArchs,
-                     const char *LinkingOutput,
-                     std::map<std::pair<const Action *, std::string>, InputInfo>
-                         &CachedResults,
-                     Action::OffloadKind TargetDeviceOffloadKind) const;
+  InputInfoList BuildJobsForAction(
+      Compilation &C, const Action *A, const ToolChain *TC, StringRef BoundArch,
+      bool AtTopLevel, bool MultipleArchs, const char *LinkingOutput,
+      std::map<std::pair<const Action *, std::string>, InputInfoList>
+          &CachedResults,
+      Action::OffloadKind TargetDeviceOffloadKind) const;

   /// Returns the default name for linked images (e.g., "a.out").
   const char *getDefaultImageName() const;

   /// GetNamedOutputPath - Return the name to use for the output of
   /// the action \p JA. The result is appended to the compilation's
   /// list of temporary or result files, as appropriate.
   ///
[91 unchanged lines not shown; context: private:]

   /// Get bitmasks for which option flags to include and exclude based on
   /// the driver mode.
   std::pair<unsigned, unsigned> getIncludeExcludeOptionFlagMasks(bool IsClCompatMode) const;

   /// Helper used in BuildJobsForAction. Doesn't use the cache when building
   /// jobs specifically for the given action, but will use the cache when
   /// building jobs for the Action's inputs.
-  InputInfo BuildJobsForActionNoCache(
+  InputInfoList BuildJobsForActionNoCache(
       Compilation &C, const Action *A, const ToolChain *TC, StringRef BoundArch,
       bool AtTopLevel, bool MultipleArchs, const char *LinkingOutput,
-      std::map<std::pair<const Action *, std::string>, InputInfo>
+      std::map<std::pair<const Action *, std::string>, InputInfoList>
           &CachedResults,
       Action::OffloadKind TargetDeviceOffloadKind) const;

 public:
   /// GetReleaseVersion - Parse (([0-9]+)(.([0-9]+)(.([0-9]+)?))?)? and
   /// return the grouped values as integers. Numbers which are not
   /// provided are set to 0.
   ///
[40 unchanged lines not shown]

clang/include/clang/Driver/Options.td

[2,466 unchanged lines not shown; context: defm openmp_target_new_runtime: BoolFOption<"openmp-target-new-runtime",]
   LangOpts<"OpenMPTargetNewRuntime">, DefaultTrue,
   PosFlag<SetTrue, [CC1Option], "Use the new bitcode library for OpenMP offloading">,
   NegFlag<SetFalse>>;
 defm openmp_optimistic_collapse : BoolFOption<"openmp-optimistic-collapse",
   LangOpts<"OpenMPOptimisticCollapse">, DefaultFalse,
   PosFlag<SetTrue, [CC1Option]>, NegFlag<SetFalse>, BothFlags<[NoArgumentUnused, HelpHidden]>>;
 def static_openmp: Flag<["-"], "static-openmp">,
   HelpText<"Use the static host OpenMP runtime while linking.">;
+def fopenmp_new_driver : Flag<["-"], "fopenmp-new-driver">, Flags<[CC1Option]>, Group<Action_Group>,
+  HelpText<"Use the new driver for OpenMP offloading.">;
 def fno_optimize_sibling_calls : Flag<["-"], "fno-optimize-sibling-calls">, Group<f_Group>;
 def foptimize_sibling_calls : Flag<["-"], "foptimize-sibling-calls">, Group<f_Group>;
 defm escaping_block_tail_calls : BoolFOption<"escaping-block-tail-calls",
   CodeGenOpts<"NoEscapingBlockTailCalls">, DefaultFalse,
   NegFlag<SetTrue, [CC1Option]>, PosFlag<SetFalse>>;
 def force__cpusubtype__ALL : Flag<["-"], "force_cpusubtype_ALL">;
 def force__flat__namespace : Flag<["-"], "force_flat_namespace">;
 def force__load : Separate<["-"], "force_load">;
[4,042 unchanged lines not shown]

clang/lib/Driver/Driver.cpp

[3,824 unchanged lines not shown; context: if (Arg *A = Args.getLastArg(options::OPT__SLASH_o)) {]
     }
   }

   handleArguments(C, Args, Inputs, Actions);

   // Builder to be used to build offloading actions.
   OffloadingActionBuilder OffloadBuilder(C, Args, Inputs);

+  // Offload kinds active for this compilation.
+  unsigned OffloadKinds = Action::OFK_None;
+  if (C.hasOffloadToolChain<Action::OFK_OpenMP>())
+    OffloadKinds |= Action::OFK_OpenMP;
+
   // Construct the actions to perform.
   HeaderModulePrecompileJobAction *HeaderModuleAction = nullptr;
   ActionList LinkerInputs;
   ActionList MergerInputs;

   for (auto &I : Inputs) {
     types::ID InputType = I.first;
     const Arg *InputArg = I.second;

     auto PL = types::getCompilationPhases(*this, Args, InputType);
     if (PL.empty())
       continue;

     auto FullPL = types::getCompilationPhases(InputType);

     // Build the pipeline for this file.
     Action *Current = C.MakeAction<InputAction>(*InputArg, InputType);

     // Use the current host action in any of the offloading actions, if
     // required.
-    if (OffloadBuilder.addHostDependenceToDeviceActions(Current, InputArg))
-      break;
+    if (!Args.hasArg(options::OPT_fopenmp_new_driver))
+      if (OffloadBuilder.addHostDependenceToDeviceActions(Current, InputArg))
+        break;

     for (phases::ID Phase : PL) {

       // Add any offload action the host action depends on.
-      Current = OffloadBuilder.addDeviceDependencesToHostAction(
-          Current, InputArg, Phase, PL.back(), FullPL);
+      if (!Args.hasArg(options::OPT_fopenmp_new_driver))
+        Current = OffloadBuilder.addDeviceDependencesToHostAction(
+            Current, InputArg, Phase, PL.back(), FullPL);
       if (!Current)
         break;

       // Queue linker inputs.
       if (Phase == phases::Link) {
         assert(Phase == PL.back() && "linking must be final compilation step.");
         LinkerInputs.push_back(Current);
         Current = nullptr;
[16 unchanged lines not shown; context: for (phases::ID Phase : PL) {]
       // separate PCH.
       if (Phase == phases::Precompile && HeaderModuleAction &&
           getPrecompiledType(InputType) == types::TY_PCH) {
         HeaderModuleAction->addModuleHeaderInput(Current);
         Current = nullptr;
         break;
       }

+      // Try to build the offloading actions and add the result as a dependency
+      // to the host.
+      if (Args.hasArg(options::OPT_fopenmp_new_driver))
+        Current = BuildOffloadingActions(C, Args, I, Current);
+
       // FIXME: Should we include any prior module file outputs as inputs of
       // later actions in the same command line?

       // Otherwise construct the appropriate action.
       Action *NewCurrent = ConstructPhaseAction(C, Args, Phase, Current);

       // We didn't create a new action, so we will just move to the next phase.
       if (NewCurrent == Current)
         continue;

       if (auto *HMA = dyn_cast<HeaderModulePrecompileJobAction>(NewCurrent))
         HeaderModuleAction = HMA;

       Current = NewCurrent;

       // Use the current host action in any of the offloading actions, if
       // required.
-      if (OffloadBuilder.addHostDependenceToDeviceActions(Current, InputArg))
-        break;
+      if (!Args.hasArg(options::OPT_fopenmp_new_driver))
+        if (OffloadBuilder.addHostDependenceToDeviceActions(Current, InputArg))
+          break;

       if (Current->getType() == types::TY_Nothing)
         break;
     }

     // If we ended with something, add to the output list.
     if (Current)
       Actions.push_back(Current);

     // Add any top level actions generated for offloading.
		if (!Args.hasArg(options::OPT_fopenmp_new_driver))
OffloadBuilder.appendTopLevelActions(Actions, Current, InputArg);		OffloadBuilder.appendTopLevelActions(Actions, Current, InputArg);
		else if (Current)
		Current->propagateHostOffloadInfo(OffloadKinds,
		/*BoundArch=*/nullptr);
}		}

// Add a link action if necessary.		// Add a link action if necessary.

if (LinkerInputs.empty()) {		if (LinkerInputs.empty()) {
Arg *FinalPhaseArg;		Arg *FinalPhaseArg;
if (getFinalPhase(Args, &FinalPhaseArg) == phases::Link)		if (getFinalPhase(Args, &FinalPhaseArg) == phases::Link)
OffloadBuilder.appendDeviceLinkActions(Actions);		OffloadBuilder.appendDeviceLinkActions(Actions);
}		}

if (!LinkerInputs.empty()) {		if (!LinkerInputs.empty()) {
		if (!Args.hasArg(options::OPT_fopenmp_new_driver))
if (Action *Wrapper = OffloadBuilder.makeHostLinkAction())		if (Action *Wrapper = OffloadBuilder.makeHostLinkAction())
LinkerInputs.push_back(Wrapper);		LinkerInputs.push_back(Wrapper);
Action *LA;		Action *LA;
// Check if this Linker Job should emit a static library.		// Check if this Linker Job should emit a static library.
if (ShouldEmitStaticLibrary(Args)) {		if (ShouldEmitStaticLibrary(Args)) {
LA = C.MakeAction<StaticLibJobAction>(LinkerInputs, types::TY_Image);		LA = C.MakeAction<StaticLibJobAction>(LinkerInputs, types::TY_Image);
} else {		} else {
LA = C.MakeAction<LinkJobAction>(LinkerInputs, types::TY_Image);		LA = C.MakeAction<LinkJobAction>(LinkerInputs, types::TY_Image);
}		}
		if (!Args.hasArg(options::OPT_fopenmp_new_driver))
LA = OffloadBuilder.processHostLinkAction(LA);		LA = OffloadBuilder.processHostLinkAction(LA);
		if (Args.hasArg(options::OPT_fopenmp_new_driver))
		LA->propagateHostOffloadInfo(OffloadKinds,
		/*BoundArch=*/nullptr);
Actions.push_back(LA);		Actions.push_back(LA);
}		}

// Add an interface stubs merge action if necessary.		// Add an interface stubs merge action if necessary.
if (!MergerInputs.empty())		if (!MergerInputs.empty())
Actions.push_back(		Actions.push_back(
C.MakeAction<IfsMergeJobAction>(MergerInputs, types::TY_Image));		C.MakeAction<IfsMergeJobAction>(MergerInputs, types::TY_Image));

▲ Show 20 Lines • Show All 69 Lines • ▼ Show 20 Lines	void Driver::BuildActions(Compilation &C, DerivedArgList &Args,
Args.ClaimAllArgs(options::OPT_cl_ignored_Group);		Args.ClaimAllArgs(options::OPT_cl_ignored_Group);

// Claim --cuda-host-only and --cuda-compile-host-device, which may be passed		// Claim --cuda-host-only and --cuda-compile-host-device, which may be passed
// to non-CUDA compilations and should not trigger warnings there.		// to non-CUDA compilations and should not trigger warnings there.
Args.ClaimAllArgs(options::OPT_cuda_host_only);		Args.ClaimAllArgs(options::OPT_cuda_host_only);
Args.ClaimAllArgs(options::OPT_cuda_compile_host_device);		Args.ClaimAllArgs(options::OPT_cuda_compile_host_device);
}		}

		Action *Driver::BuildOffloadingActions(Compilation &C,
		llvm::opt::DerivedArgList &Args,
		const InputTy &Input,
		Action *HostAction) const {
		if (!isa<CompileJobAction>(HostAction))
		return HostAction;

		SmallVector<const ToolChain *, 2> ToolChains;
		ActionList DeviceActions;

		types::ID InputType = Input.first;
		const Arg *InputArg = Input.second;

		auto OpenMPTCRange = C.getOffloadToolChains<Action::OFK_OpenMP>();
		for (auto TI = OpenMPTCRange.first, TE = OpenMPTCRange.second; TI != TE; ++TI)
		ToolChains.push_back(TI->second);

		for (unsigned I = 0; I < ToolChains.size(); ++I)
		DeviceActions.push_back(C.MakeAction<InputAction>(*InputArg, InputType));

		if (DeviceActions.empty())
		return HostAction;

		auto PL = types::getCompilationPhases(*this, Args, InputType);

		for (phases::ID Phase : PL) {
		if (Phase == phases::Link) {
		assert(Phase == PL.back() && "linking must be final compilation step.");
		break;
		}

		auto TC = ToolChains.begin();
		for (Action *&A : DeviceActions) {
		A = ConstructPhaseAction(C, Args, Phase, A);

		if (isa<CompileJobAction>(A)) {
		HostAction->setCannotBeCollapsedWithNextDependentAction();
		OffloadAction::HostDependence HDep(
		*HostAction, *C.getSingleOffloadToolChain<Action::OFK_Host>(),
		/*BoundArch=*/nullptr, Action::OFK_OpenMP);
		OffloadAction::DeviceDependences DDep;
		DDep.add(*A, **TC, /*BoundArch=*/nullptr, Action::OFK_OpenMP);
		A = C.MakeAction<OffloadAction>(HDep, DDep);
		}
		++TC;
		}
		}

		OffloadAction::DeviceDependences DDeps;

		auto TC = ToolChains.begin();
		for (Action *A : DeviceActions) {
		DDeps.add(*A, **TC, /*BoundArch=*/nullptr, Action::OFK_OpenMP);
		TC++;
		}

		OffloadAction::HostDependence HDep(
		*HostAction, *C.getSingleOffloadToolChain<Action::OFK_Host>(),
		/*BoundArch=*/nullptr, DDeps);
		return C.MakeAction<OffloadAction>(HDep, DDeps);
		}
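
Reviewer-style note on BuildOffloadingActions above: the control flow may be easier to follow in a reduced model. The following is only a sketch under stated assumptions, not the clang implementation. Node, Graph, and buildOffloading are hypothetical stand-ins for Action, Compilation::MakeAction, and the driver method; phases and triples are plain strings; and the intermediate OffloadAction wrapping that the real code performs at the compile phase is deliberately left out.

```cpp
#include <cassert>
#include <memory>
#include <string>
#include <utility>
#include <vector>

// Hypothetical stand-in for clang's Action: a named node with inputs.
struct Node {
  std::string Kind;                 // e.g. "input", "compiler", "offload"
  std::string Target;               // toolchain triple this node is bound to
  std::vector<const Node *> Inputs; // dependences of this node
};

// Owns every node, loosely modeled on Compilation::MakeAction.
struct Graph {
  std::vector<std::unique_ptr<Node>> Storage;
  const Node *make(std::string Kind, std::string Target,
                   std::vector<const Node *> Inputs = {}) {
    Storage.push_back(std::make_unique<Node>(
        Node{std::move(Kind), std::move(Target), std::move(Inputs)}));
    return Storage.back().get();
  }
};

// Run one device pipeline per offload target, stopping before "link",
// then attach every device result to the host node as a dependence.
const Node *buildOffloading(Graph &G, const Node *Host,
                            const std::vector<std::string> &DeviceTargets,
                            const std::vector<std::string> &Phases) {
  assert(Host && "a host action is required");
  if (DeviceTargets.empty())
    return Host; // nothing to offload: the host action is returned unchanged

  // One fresh input node per device toolchain.
  std::vector<const Node *> Device;
  for (const std::string &T : DeviceTargets)
    Device.push_back(G.make("input", T));

  // Advance all device pipelines phase by phase.
  for (const std::string &Phase : Phases) {
    if (Phase == "link")
      break; // device code is not linked in this step
    for (const Node *&A : Device)
      A = G.make(Phase, A->Target, {A});
  }

  // The returned node depends on the host action plus every device result.
  std::vector<const Node *> Deps{Host};
  Deps.insert(Deps.end(), Device.begin(), Device.end());
  return G.make("offload", Host->Target, Deps);
}
```

In this model the caller would pass the host compile node and receive a combined node whose first input is the host action and whose remaining inputs are the final device nodes, one per target.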

Action *Driver::ConstructPhaseAction(		Action *Driver::ConstructPhaseAction(
Compilation &C, const ArgList &Args, phases::ID Phase, Action *Input,		Compilation &C, const ArgList &Args, phases::ID Phase, Action *Input,
Action::OffloadKind TargetDeviceOffloadKind) const {		Action::OffloadKind TargetDeviceOffloadKind) const {
llvm::PrettyStackTraceString CrashInfo("Constructing phase actions");		llvm::PrettyStackTraceString CrashInfo("Constructing phase actions");

// Some types skip the assembler phase (e.g., llvm-bc), but we can't		// Some types skip the assembler phase (e.g., llvm-bc), but we can't
// encode this in the steps because the intermediate type depends on		// encode this in the steps because the intermediate type depends on
// arguments. Just special case here.		// arguments. Just special case here.
▲ Show 20 Lines • Show All 146 Lines • ▼ Show 20 Lines	void Driver::BuildJobs(Compilation &C) const {
// Collect the list of architectures.		// Collect the list of architectures.
llvm::StringSet<> ArchNames;		llvm::StringSet<> ArchNames;
if (RawTriple.isOSBinFormatMachO())		if (RawTriple.isOSBinFormatMachO())
for (const Arg *A : C.getArgs())		for (const Arg *A : C.getArgs())
if (A->getOption().matches(options::OPT_arch))		if (A->getOption().matches(options::OPT_arch))
ArchNames.insert(A->getValue());		ArchNames.insert(A->getValue());

// Set of (Action, canonical ToolChain triple) pairs we've built jobs for.		// Set of (Action, canonical ToolChain triple) pairs we've built jobs for.
std::map<std::pair<const Action *, std::string>, InputInfo> CachedResults;		std::map<std::pair<const Action *, std::string>, InputInfoList> CachedResults;
for (Action *A : C.getActions()) {		for (Action *A : C.getActions()) {
// If we are linking an image for multiple archs then the linker wants		// If we are linking an image for multiple archs then the linker wants
// -arch_multiple and -final_output <final image name>. Unfortunately, this		// -arch_multiple and -final_output <final image name>. Unfortunately, this
// doesn't fit in cleanly because we have to pass this information down.		// doesn't fit in cleanly because we have to pass this information down.
//		//
// FIXME: This is a hack; find a cleaner way to integrate this into the		// FIXME: This is a hack; find a cleaner way to integrate this into the
// process.		// process.
const char *LinkingOutput = nullptr;		const char *LinkingOutput = nullptr;
▲ Show 20 Lines • Show All 440 Lines • ▼ Show 20 Lines	if (!BoundArch.empty()) {
TriplePlusArch += "-";		TriplePlusArch += "-";
TriplePlusArch += BoundArch;		TriplePlusArch += BoundArch;
}		}
TriplePlusArch += "-";		TriplePlusArch += "-";
TriplePlusArch += Action::GetOffloadKindName(OffloadKind);		TriplePlusArch += Action::GetOffloadKindName(OffloadKind);
return TriplePlusArch;		return TriplePlusArch;
}		}

InputInfo Driver::BuildJobsForAction(		InputInfoList Driver::BuildJobsForAction(
Compilation &C, const Action *A, const ToolChain *TC, StringRef BoundArch,		Compilation &C, const Action *A, const ToolChain *TC, StringRef BoundArch,
bool AtTopLevel, bool MultipleArchs, const char *LinkingOutput,		bool AtTopLevel, bool MultipleArchs, const char *LinkingOutput,
std::map<std::pair<const Action *, std::string>, InputInfo> &CachedResults,		std::map<std::pair<const Action *, std::string>, InputInfoList>
		&CachedResults,
Action::OffloadKind TargetDeviceOffloadKind) const {		Action::OffloadKind TargetDeviceOffloadKind) const {
std::pair<const Action *, std::string> ActionTC = {		std::pair<const Action *, std::string> ActionTC = {
A, GetTriplePlusArchString(TC, BoundArch, TargetDeviceOffloadKind)};		A, GetTriplePlusArchString(TC, BoundArch, TargetDeviceOffloadKind)};
auto CachedResult = CachedResults.find(ActionTC);		auto CachedResult = CachedResults.find(ActionTC);
if (CachedResult != CachedResults.end()) {		if (CachedResult != CachedResults.end()) {
return CachedResult->second;		return CachedResult->second;
}		}
InputInfo Result = BuildJobsForActionNoCache(		InputInfoList Result = BuildJobsForActionNoCache(
C, A, TC, BoundArch, AtTopLevel, MultipleArchs, LinkingOutput,		C, A, TC, BoundArch, AtTopLevel, MultipleArchs, LinkingOutput,
CachedResults, TargetDeviceOffloadKind);		CachedResults, TargetDeviceOffloadKind);
CachedResults[ActionTC] = Result;		CachedResults[ActionTC] = Result;
return Result;		return Result;
}		}

InputInfo Driver::BuildJobsForActionNoCache(		InputInfoList Driver::BuildJobsForActionNoCache(
Compilation &C, const Action *A, const ToolChain *TC, StringRef BoundArch,		Compilation &C, const Action *A, const ToolChain *TC, StringRef BoundArch,
bool AtTopLevel, bool MultipleArchs, const char *LinkingOutput,		bool AtTopLevel, bool MultipleArchs, const char *LinkingOutput,
std::map<std::pair<const Action *, std::string>, InputInfo> &CachedResults,		std::map<std::pair<const Action *, std::string>, InputInfoList>
		&CachedResults,
Action::OffloadKind TargetDeviceOffloadKind) const {		Action::OffloadKind TargetDeviceOffloadKind) const {
llvm::PrettyStackTraceString CrashInfo("Building compilation jobs");		llvm::PrettyStackTraceString CrashInfo("Building compilation jobs");

InputInfoList OffloadDependencesInputInfo;		InputInfoList OffloadDependencesInputInfo;
bool BuildingForOffloadDevice = TargetDeviceOffloadKind != Action::OFK_None;		bool BuildingForOffloadDevice = TargetDeviceOffloadKind != Action::OFK_None;
if (const OffloadAction *OA = dyn_cast<OffloadAction>(A)) {		if (const OffloadAction *OA = dyn_cast<OffloadAction>(A)) {
// The 'Darwin' toolchain is initialized only when its arguments are		// The 'Darwin' toolchain is initialized only when its arguments are
// computed. Get the default arguments for OFK_None to ensure that		// computed. Get the default arguments for OFK_None to ensure that
Show All 21 Lines	if (const OffloadAction *OA = dyn_cast<OffloadAction>(A)) {
//		//
// For a) and b), we just return the job generated for the dependence. For		// For a) and b), we just return the job generated for the dependence. For
// c) and d) we override the current action with the host/device dependence		// c) and d) we override the current action with the host/device dependence
// if the current toolchain is host/device and set the offload dependences		// if the current toolchain is host/device and set the offload dependences
// info with the jobs obtained from the device/host dependence(s).		// info with the jobs obtained from the device/host dependence(s).

// If there is a single device option, just generate the job for it.		// If there is a single device option, just generate the job for it.
if (OA->hasSingleDeviceDependence()) {		if (OA->hasSingleDeviceDependence()) {
InputInfo DevA;		InputInfoList DevA;
OA->doOnEachDeviceDependence([&](Action *DepA, const ToolChain *DepTC,		OA->doOnEachDeviceDependence([&](Action *DepA, const ToolChain *DepTC,
const char *DepBoundArch) {		const char *DepBoundArch) {
DevA =		DevA =
BuildJobsForAction(C, DepA, DepTC, DepBoundArch, AtTopLevel,		BuildJobsForAction(C, DepA, DepTC, DepBoundArch, AtTopLevel,
/*MultipleArchs*/ !!DepBoundArch, LinkingOutput,		/*MultipleArchs*/ !!DepBoundArch, LinkingOutput,
CachedResults, DepA->getOffloadingDeviceKind());		CachedResults, DepA->getOffloadingDeviceKind());
});		});
return DevA;		return DevA;
}		}

// If 'Action 2' is host, we generate jobs for the device dependences and		// If 'Action 2' is host, we generate jobs for the device dependences and
// override the current action with the host dependence. Otherwise, we		// override the current action with the host dependence. Otherwise, we
// generate the host dependences and override the action with the device		// generate the host dependences and override the action with the device
// dependence. The dependences can't therefore be a top-level action.		// dependence. The dependences can't therefore be a top-level action.
OA->doOnEachDependence(		OA->doOnEachDependence(
/*IsHostDependence=*/BuildingForOffloadDevice,		/*IsHostDependence=*/BuildingForOffloadDevice,
[&](Action *DepA, const ToolChain *DepTC, const char *DepBoundArch) {		[&](Action *DepA, const ToolChain *DepTC, const char *DepBoundArch) {
OffloadDependencesInputInfo.push_back(BuildJobsForAction(		OffloadDependencesInputInfo.append(BuildJobsForAction(
C, DepA, DepTC, DepBoundArch, /*AtTopLevel=*/false,		C, DepA, DepTC, DepBoundArch, /*AtTopLevel=*/false,
/*MultipleArchs*/ !!DepBoundArch, LinkingOutput, CachedResults,		/*MultipleArchs*/ !!DepBoundArch, LinkingOutput, CachedResults,
DepA->getOffloadingDeviceKind()));		DepA->getOffloadingDeviceKind()));
});		});

A = BuildingForOffloadDevice		A = BuildingForOffloadDevice
? OA->getSingleDeviceDependence(/*DoNotConsiderHostActions=*/true)		? OA->getSingleDeviceDependence(/*DoNotConsiderHostActions=*/true)
: OA->getHostDependence();		: OA->getHostDependence();

		// We may have already built this action as a part of the offloading
		// toolchain, return the cached input if so.
		std::pair<const Action *, std::string> ActionTC = {
		OA->getHostDependence(),
		GetTriplePlusArchString(TC, BoundArch, TargetDeviceOffloadKind)};
		if (CachedResults.find(ActionTC) != CachedResults.end()) {
		InputInfoList Inputs = CachedResults[ActionTC];
		Inputs.append(OffloadDependencesInputInfo);
		return Inputs;
		}
}		}

if (const InputAction *IA = dyn_cast<InputAction>(A)) {		if (const InputAction *IA = dyn_cast<InputAction>(A)) {
// FIXME: It would be nice to not claim this here; maybe the old scheme of		// FIXME: It would be nice to not claim this here; maybe the old scheme of
// just using Args was better?		// just using Args was better?
const Arg &Input = IA->getInputArg();		const Arg &Input = IA->getInputArg();
Input.claim();		Input.claim();
if (Input.getOption().matches(options::OPT_INPUT)) {		if (Input.getOption().matches(options::OPT_INPUT)) {
const char *Name = Input.getValue();		const char *Name = Input.getValue();
return InputInfo(A, Name, /* _BaseInput = */ Name);		return {InputInfo(A, Name, /* _BaseInput = */ Name)};
}		}
return InputInfo(A, &Input, /* _BaseInput = */ "");		return {InputInfo(A, &Input, /* _BaseInput = */ "")};
}		}

if (const BindArchAction *BAA = dyn_cast<BindArchAction>(A)) {		if (const BindArchAction *BAA = dyn_cast<BindArchAction>(A)) {
const ToolChain *TC;		const ToolChain *TC;
StringRef ArchName = BAA->getArchName();		StringRef ArchName = BAA->getArchName();

if (!ArchName.empty())		if (!ArchName.empty())
TC = &getToolChain(C.getArgs(),		TC = &getToolChain(C.getArgs(),
Show All 13 Lines	InputInfoList Driver::BuildJobsForActionNoCache(
const JobAction *JA = cast<JobAction>(A);		const JobAction *JA = cast<JobAction>(A);
ActionList CollapsedOffloadActions;		ActionList CollapsedOffloadActions;

ToolSelector TS(JA, *TC, C, isSaveTempsEnabled(),		ToolSelector TS(JA, *TC, C, isSaveTempsEnabled(),
embedBitcodeInObject() && !isUsingLTO());		embedBitcodeInObject() && !isUsingLTO());
const Tool *T = TS.getTool(Inputs, CollapsedOffloadActions);		const Tool *T = TS.getTool(Inputs, CollapsedOffloadActions);

if (!T)		if (!T)
return InputInfo();		return {InputInfo()};

if (BuildingForOffloadDevice &&		if (BuildingForOffloadDevice &&
A->getOffloadingDeviceKind() == Action::OFK_OpenMP) {		A->getOffloadingDeviceKind() == Action::OFK_OpenMP) {
if (TC->getTriple().isAMDGCN()) {		if (TC->getTriple().isAMDGCN()) {
// AMDGCN treats backend and assemble actions as no-op because		// AMDGCN treats backend and assemble actions as no-op because
// linker does not support object files.		// linker does not support object files.
if (const BackendJobAction *BA = dyn_cast<BackendJobAction>(A)) {		if (const BackendJobAction *BA = dyn_cast<BackendJobAction>(A)) {
return BuildJobsForAction(C, *BA->input_begin(), TC, BoundArch,		return BuildJobsForAction(C, *BA->input_begin(), TC, BoundArch,
Show All 10 Lines	InputInfoList Driver::BuildJobsForActionNoCache(
}		}

// If we've collapsed action list that contained OffloadAction we		// If we've collapsed action list that contained OffloadAction we
// need to build jobs for host/device-side inputs it may have held.		// need to build jobs for host/device-side inputs it may have held.
for (const auto *OA : CollapsedOffloadActions)		for (const auto *OA : CollapsedOffloadActions)
cast<OffloadAction>(OA)->doOnEachDependence(		cast<OffloadAction>(OA)->doOnEachDependence(
/*IsHostDependence=*/BuildingForOffloadDevice,		/*IsHostDependence=*/BuildingForOffloadDevice,
[&](Action *DepA, const ToolChain *DepTC, const char *DepBoundArch) {		[&](Action *DepA, const ToolChain *DepTC, const char *DepBoundArch) {
OffloadDependencesInputInfo.push_back(BuildJobsForAction(		OffloadDependencesInputInfo.append(BuildJobsForAction(
C, DepA, DepTC, DepBoundArch, /* AtTopLevel */ false,		C, DepA, DepTC, DepBoundArch, /* AtTopLevel */ false,
/*MultipleArchs=*/!!DepBoundArch, LinkingOutput, CachedResults,		/*MultipleArchs=*/!!DepBoundArch, LinkingOutput, CachedResults,
DepA->getOffloadingDeviceKind()));		DepA->getOffloadingDeviceKind()));
});		});

// Only use pipes when there is exactly one input.		// Only use pipes when there is exactly one input.
InputInfoList InputInfos;		InputInfoList InputInfos;
for (const Action *Input : Inputs) {		for (const Action *Input : Inputs) {
// Treat dsymutil and verify sub-jobs as being at the top-level too, they		// Treat dsymutil and verify sub-jobs as being at the top-level too, they
// shouldn't get temporary output names.		// shouldn't get temporary output names.
// FIXME: Clean this up.		// FIXME: Clean this up.
bool SubJobAtTopLevel =		bool SubJobAtTopLevel =
AtTopLevel && (isa<DsymutilJobAction>(A) || isa<VerifyJobAction>(A));		AtTopLevel && (isa<DsymutilJobAction>(A) || isa<VerifyJobAction>(A));
InputInfos.push_back(BuildJobsForAction(		InputInfos.append(BuildJobsForAction(
C, Input, TC, BoundArch, SubJobAtTopLevel, MultipleArchs, LinkingOutput,		C, Input, TC, BoundArch, SubJobAtTopLevel, MultipleArchs, LinkingOutput,
CachedResults, A->getOffloadingDeviceKind()));		CachedResults, A->getOffloadingDeviceKind()));
}		}

// Always use the first file input as the base input.		// Always use the first file input as the base input.
const char *BaseInput = InputInfos[0].getBaseInput();		const char *BaseInput = InputInfos[0].getBaseInput();
for (auto &Info : InputInfos) {		for (auto &Info : InputInfos) {
if (Info.isFilename()) {		if (Info.isFilename()) {
▲ Show 20 Lines • Show All 67 Lines • ▼ Show 20 Lines	for (auto &UI : UA->getDependentActionsInfo()) {
if (UI.DependentOffloadKind == Action::OFK_Host)		if (UI.DependentOffloadKind == Action::OFK_Host)
Arch = StringRef();		Arch = StringRef();
else		else
Arch = UI.DependentBoundArch;		Arch = UI.DependentBoundArch;
} else		} else
Arch = BoundArch;		Arch = BoundArch;

CachedResults[{A, GetTriplePlusArchString(UI.DependentToolChain, Arch,		CachedResults[{A, GetTriplePlusArchString(UI.DependentToolChain, Arch,
UI.DependentOffloadKind)}] =		UI.DependentOffloadKind)}] = {
CurI;		CurI};
}		}

// Now that we have all the results generated, select the one that should be		// Now that we have all the results generated, select the one that should be
// returned for the current depending action.		// returned for the current depending action.
std::pair<const Action *, std::string> ActionTC = {		std::pair<const Action *, std::string> ActionTC = {
A, GetTriplePlusArchString(TC, BoundArch, TargetDeviceOffloadKind)};		A, GetTriplePlusArchString(TC, BoundArch, TargetDeviceOffloadKind)};
assert(CachedResults.find(ActionTC) != CachedResults.end() &&		assert(CachedResults.find(ActionTC) != CachedResults.end() &&
"Result does not exist??");		"Result does not exist??");
Result = CachedResults[ActionTC];		Result = CachedResults[ActionTC].front();
} else if (JA->getType() == types::TY_Nothing)		} else if (JA->getType() == types::TY_Nothing)
Result = InputInfo(A, BaseInput);		Result = {InputInfo(A, BaseInput)};
else {		else {
// We only have to generate a prefix for the host if this is not a top-level		// We only have to generate a prefix for the host if this is not a top-level
// action.		// action.
std::string OffloadingPrefix = Action::GetOffloadingFileNamePrefix(		std::string OffloadingPrefix = Action::GetOffloadingFileNamePrefix(
A->getOffloadingDeviceKind(), TC->getTriple().normalize(),		A->getOffloadingDeviceKind(), TC->getTriple().normalize(),
/*CreatePrefixForHost=*/!!A->getOffloadingHostActiveKinds() &&		/*CreatePrefixForHost=*/!!A->getOffloadingHostActiveKinds() &&
!AtTopLevel);		!AtTopLevel);
if (isa<OffloadWrapperJobAction>(JA)) {		if (isa<OffloadWrapperJobAction>(JA)) {
Show All 36 Lines	if (UnbundlingResults.empty())
C.getArgsForToolChain(TC, BoundArch, JA->getOffloadingDeviceKind()),		C.getArgsForToolChain(TC, BoundArch, JA->getOffloadingDeviceKind()),
LinkingOutput);		LinkingOutput);
else		else
T->ConstructJobMultipleOutputs(		T->ConstructJobMultipleOutputs(
C, *JA, UnbundlingResults, InputInfos,		C, *JA, UnbundlingResults, InputInfos,
C.getArgsForToolChain(TC, BoundArch, JA->getOffloadingDeviceKind()),		C.getArgsForToolChain(TC, BoundArch, JA->getOffloadingDeviceKind()),
LinkingOutput);		LinkingOutput);
}		}
return Result;		return {Result};
}		}
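
Reviewer-style note on the InputInfo to InputInfoList change above: the cache now maps each (action, triple-arch-kind) key to a list of results, and callers flatten a dependence's list into their own with append rather than push_back. The following is a reduced sketch of that pattern under stated assumptions, not the driver code. Info, InfoList, Key, Cache, buildCached, and appendAll are hypothetical names; the action is modeled as an integer id and each result as a file name.

```cpp
#include <cassert>
#include <map>
#include <string>
#include <utility>
#include <vector>

// Hypothetical stand-in for clang's InputInfo: just an output file name.
using Info = std::string;
using InfoList = std::vector<Info>;
// Key modeled on (Action, "triple-arch-kind"); the action is an integer id.
using Key = std::pair<int, std::string>;
using Cache = std::map<Key, InfoList>;

// Return the cached list if present; otherwise compute, store, and return it.
template <typename Fn>
InfoList buildCached(Cache &C, const Key &K, Fn Compute) {
  auto It = C.find(K);
  if (It != C.end())
    return It->second;
  InfoList Result = Compute();
  C[K] = Result;
  return Result;
}

// Flatten a dependence's results into the caller's list, which is what the
// switch from push_back to append achieves once results are lists.
void appendAll(InfoList &Dst, const InfoList &Src) {
  assert(&Dst != &Src && "self-append would invalidate the source range");
  Dst.insert(Dst.end(), Src.begin(), Src.end());
}
```

With this shape, a job that yields both a host file and an embedded device file can hand both back to its consumer through a single cache entry.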

const char *Driver::getDefaultImageName() const {		const char *Driver::getDefaultImageName() const {
llvm::Triple Target(llvm::Triple::normalize(TargetTriple));		llvm::Triple Target(llvm::Triple::normalize(TargetTriple));
return Target.isOSWindows() ? "a.exe" : "a.out";		return Target.isOSWindows() ? "a.exe" : "a.out";
}		}

/// Create output filename based on ArgValue, which could either be a		/// Create output filename based on ArgValue, which could either be a

clang/lib/Driver/ToolChains/Clang.cpp


Show First 20 Lines • Show All 4,345 Lines • ▼ Show 20 Lines	void Clang::ConstructJob(Compilation &C, const JobAction &JA,
// second input. Module precompilation accepts a list of header files to		// second input. Module precompilation accepts a list of header files to
// include as part of the module. All other jobs are expected to have exactly		// include as part of the module. All other jobs are expected to have exactly
// one input.		// one input.
bool IsCuda = JA.isOffloading(Action::OFK_Cuda);		bool IsCuda = JA.isOffloading(Action::OFK_Cuda);
bool IsCudaDevice = JA.isDeviceOffloading(Action::OFK_Cuda);		bool IsCudaDevice = JA.isDeviceOffloading(Action::OFK_Cuda);
bool IsHIP = JA.isOffloading(Action::OFK_HIP);		bool IsHIP = JA.isOffloading(Action::OFK_HIP);
bool IsHIPDevice = JA.isDeviceOffloading(Action::OFK_HIP);		bool IsHIPDevice = JA.isDeviceOffloading(Action::OFK_HIP);
bool IsOpenMPDevice = JA.isDeviceOffloading(Action::OFK_OpenMP);		bool IsOpenMPDevice = JA.isDeviceOffloading(Action::OFK_OpenMP);
		bool IsOpenMPHost = JA.isHostOffloading(Action::OFK_OpenMP);
bool IsHeaderModulePrecompile = isa<HeaderModulePrecompileJobAction>(JA);		bool IsHeaderModulePrecompile = isa<HeaderModulePrecompileJobAction>(JA);
bool IsDeviceOffloadAction = !(JA.isDeviceOffloading(Action::OFK_None) ||		bool IsDeviceOffloadAction = !(JA.isDeviceOffloading(Action::OFK_None) ||
JA.isDeviceOffloading(Action::OFK_Host));		JA.isDeviceOffloading(Action::OFK_Host));
bool IsUsingLTO = D.isUsingLTO(IsDeviceOffloadAction);		bool IsUsingLTO = D.isUsingLTO(IsDeviceOffloadAction);
auto LTOMode = D.getLTOMode(IsDeviceOffloadAction);		auto LTOMode = D.getLTOMode(IsDeviceOffloadAction);

// A header module compilation doesn't have a main input file, so invent a		// A header module compilation doesn't have a main input file, so invent a
// fake one as a placeholder.		// fake one as a placeholder.
const char *ModuleName = [&]{		const char *ModuleName = [&]{
auto *ModuleNameArg = Args.getLastArg(options::OPT_fmodule_name_EQ);		auto *ModuleNameArg = Args.getLastArg(options::OPT_fmodule_name_EQ);
return ModuleNameArg ? ModuleNameArg->getValue() : "";		return ModuleNameArg ? ModuleNameArg->getValue() : "";
}();		}();
InputInfo HeaderModuleInput(Inputs[0].getType(), ModuleName, ModuleName);		InputInfo HeaderModuleInput(Inputs[0].getType(), ModuleName, ModuleName);

const InputInfo &Input =		const InputInfo &Input =
IsHeaderModulePrecompile ? HeaderModuleInput : Inputs[0];		IsHeaderModulePrecompile ? HeaderModuleInput : Inputs[0];

InputInfoList ModuleHeaderInputs;		InputInfoList ModuleHeaderInputs;
const InputInfo *CudaDeviceInput = nullptr;		const InputInfo *CudaDeviceInput = nullptr;
const InputInfo *OpenMPDeviceInput = nullptr;		const InputInfo *OpenMPDeviceInput = nullptr;
		const InputInfo *OpenMPHostInput = nullptr;
for (const InputInfo &I : Inputs) {		for (const InputInfo &I : Inputs) {
if (&I == &Input) {		if (&I == &Input) {
// This is the primary input.		// This is the primary input.
} else if (IsHeaderModulePrecompile &&		} else if (IsHeaderModulePrecompile &&
types::getPrecompiledType(I.getType()) == types::TY_PCH) {		types::getPrecompiledType(I.getType()) == types::TY_PCH) {
types::ID Expected = HeaderModuleInput.getType();		types::ID Expected = HeaderModuleInput.getType();
if (I.getType() != Expected) {		if (I.getType() != Expected) {
D.Diag(diag::err_drv_module_header_wrong_kind)		D.Diag(diag::err_drv_module_header_wrong_kind)
<< I.getFilename() << types::getTypeName(I.getType())		<< I.getFilename() << types::getTypeName(I.getType())
<< types::getTypeName(Expected);		<< types::getTypeName(Expected);
}		}
ModuleHeaderInputs.push_back(I);		ModuleHeaderInputs.push_back(I);
} else if ((IsCuda || IsHIP) && !CudaDeviceInput) {		} else if ((IsCuda || IsHIP) && !CudaDeviceInput) {
CudaDeviceInput = &I;		CudaDeviceInput = &I;
} else if (IsOpenMPDevice && !OpenMPDeviceInput) {		} else if (IsOpenMPDevice && !OpenMPDeviceInput) {
OpenMPDeviceInput = &I;		OpenMPDeviceInput = &I;
		} else if (IsOpenMPHost && !OpenMPHostInput) {
		OpenMPHostInput = &I;
} else {		} else {
llvm_unreachable("unexpectedly given multiple inputs");		llvm_unreachable("unexpectedly given multiple inputs");
}		}
}		}
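
Reviewer-style note on the input loop above: with the new driver a host compile job can receive the device object as a second input, so the loop gains an OpenMPHostInput slot next to OpenMPDeviceInput. The following is a reduced sketch of that slot assignment under stated assumptions, not the code in Clang.cpp. ExtraInputs and classify are hypothetical names, inputs are plain strings, and the CUDA, HIP, and header-module branches are omitted.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <vector>

// Hypothetical, simplified view of the extra inputs a compile job may carry.
struct ExtraInputs {
  const std::string *OpenMPDevice = nullptr;
  const std::string *OpenMPHost = nullptr;
};

// Inputs[0] is the primary input. At most one further input is accepted, and
// the slot it lands in depends on whether the job is device- or host-side.
ExtraInputs classify(const std::vector<std::string> &Inputs,
                     bool IsOpenMPDevice, bool IsOpenMPHost) {
  assert(!Inputs.empty() && "a job needs a primary input");
  ExtraInputs E;
  for (std::size_t I = 1; I < Inputs.size(); ++I) {
    if (IsOpenMPDevice && !E.OpenMPDevice)
      E.OpenMPDevice = &Inputs[I];
    else if (IsOpenMPHost && !E.OpenMPHost)
      E.OpenMPHost = &Inputs[I];
    else
      assert(false && "unexpectedly given multiple inputs");
  }
  return E;
}
```

For a host job given a host bitcode file and a device object, the second input would land in the OpenMPHost slot in this model; for a device job, it would land in the OpenMPDevice slot.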

const llvm::Triple *AuxTriple =		const llvm::Triple *AuxTriple =
(IsCuda || IsHIP) ? TC.getAuxTriple() : nullptr;		(IsCuda || IsHIP) ? TC.getAuxTriple() : nullptr;
bool IsWindowsMSVC = RawTriple.isWindowsMSVCEnvironment();		bool IsWindowsMSVC = RawTriple.isWindowsMSVCEnvironment();

clang/test/Driver/openmp-offload-gpu.c

	// SAVE_TEMPS_NAMES-NOT: "GNU::Linker"{{.*}}["[[SAVE_TEMPS_INPUT1:.*\.o]]", "[[SAVE_TEMPS_INPUT1]]"]			// SAVE_TEMPS_NAMES-NOT: "GNU::Linker"{{.*}}["[[SAVE_TEMPS_INPUT1:.*\.o]]", "[[SAVE_TEMPS_INPUT1]]"]

	// RUN: %clang -### -fopenmp=libomp -fopenmp-targets=nvptx64 -Xopenmp-target=nvptx64 -march=sm_35 \			// RUN: %clang -### -fopenmp=libomp -fopenmp-targets=nvptx64 -Xopenmp-target=nvptx64 -march=sm_35 \
	// RUN: -save-temps -no-canonical-prefixes %s -o openmp-offload-gpu 2>&1 \			// RUN: -save-temps -no-canonical-prefixes %s -o openmp-offload-gpu 2>&1 \
	// RUN: | FileCheck -check-prefix=TRIPLE %s			// RUN: | FileCheck -check-prefix=TRIPLE %s

	// TRIPLE: "-triple" "nvptx64-nvidia-cuda"			// TRIPLE: "-triple" "nvptx64-nvidia-cuda"
	// TRIPLE: "-target-cpu" "sm_35"			// TRIPLE: "-target-cpu" "sm_35"

				// RUN: %clang -### -fopenmp=libomp -fopenmp-targets=nvptx64-nvidia-cuda \
				// RUN: -fopenmp-new-driver -no-canonical-prefixes -ccc-print-bindings %s -o openmp-offload-gpu 2>&1 \
				// RUN: | FileCheck -check-prefix=NEW_DRIVER %s

				// NEW_DRIVER: "x86_64-unknown-linux-gnu" - "clang", inputs: ["[[HOST_INPUT:.+]]"], output: "[[HOST_BC:.+]]"
				// NEW_DRIVER: "nvptx64-nvidia-cuda" - "clang", inputs: ["[[DEVICE_INPUT:.+]]", "[[HOST_BC]]"], output: "[[DEVICE_ASM:.+]]"
				// NEW_DRIVER: "nvptx64-nvidia-cuda" - "NVPTX::Assembler", inputs: ["[[DEVICE_ASM]]"], output: "[[DEVICE_OBJ:.+]]"
				// NEW_DRIVER: "x86_64-unknown-linux-gnu" - "clang", inputs: ["[[HOST_BC]]", "[[DEVICE_OBJ]]"], output: "[[HOST_OBJ:.+]]"
				// NEW_DRIVER: "x86_64-unknown-linux-gnu" - "[[LINKER:.+]]", inputs: ["[[HOST_OBJ]]"], output: "openmp-offload-gpu"

Diff 404684
