Diff 426064

clang/include/clang/Driver/Options.td

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 900 Lines • ▼ Show 20 Lines
def c : Flag<["-"], "c">, Flags<[NoXarchOption, FlangOption]>, Group<Action_Group>,		def c : Flag<["-"], "c">, Flags<[NoXarchOption, FlangOption]>, Group<Action_Group>,
HelpText<"Only run preprocess, compile, and assemble steps">;		HelpText<"Only run preprocess, compile, and assemble steps">;
def fconvergent_functions : Flag<["-"], "fconvergent-functions">, Group<f_Group>, Flags<[CC1Option]>,		def fconvergent_functions : Flag<["-"], "fconvergent-functions">, Group<f_Group>, Flags<[CC1Option]>,
HelpText<"Assume functions may be convergent">;		HelpText<"Assume functions may be convergent">;

def gpu_use_aux_triple_only : Flag<["--"], "gpu-use-aux-triple-only">,		def gpu_use_aux_triple_only : Flag<["--"], "gpu-use-aux-triple-only">,
InternalDriverOpt, HelpText<"Prepare '-aux-triple' only without populating "		InternalDriverOpt, HelpText<"Prepare '-aux-triple' only without populating "
"'-aux-target-cpu' and '-aux-target-feature'.">;		"'-aux-target-cpu' and '-aux-target-feature'.">;
def cuda_device_only : Flag<["--"], "cuda-device-only">,
HelpText<"Compile CUDA code for device only">;
def cuda_host_only : Flag<["--"], "cuda-host-only">,
HelpText<"Compile CUDA code for host only. Has no effect on non-CUDA "
"compilations.">;
def cuda_compile_host_device : Flag<["--"], "cuda-compile-host-device">,
HelpText<"Compile CUDA code for both host and device (default). Has no "
"effect on non-CUDA compilations.">;
def cuda_include_ptx_EQ : Joined<["--"], "cuda-include-ptx=">, Flags<[NoXarchOption]>,		def cuda_include_ptx_EQ : Joined<["--"], "cuda-include-ptx=">, Flags<[NoXarchOption]>,
HelpText<"Include PTX for the following GPU architecture (e.g. sm_35) or 'all'. May be specified more than once.">;		HelpText<"Include PTX for the following GPU architecture (e.g. sm_35) or 'all'. May be specified more than once.">;
def no_cuda_include_ptx_EQ : Joined<["--"], "no-cuda-include-ptx=">, Flags<[NoXarchOption]>,		def no_cuda_include_ptx_EQ : Joined<["--"], "no-cuda-include-ptx=">, Flags<[NoXarchOption]>,
HelpText<"Do not include PTX for the following GPU architecture (e.g. sm_35) or 'all'. May be specified more than once.">;		HelpText<"Do not include PTX for the following GPU architecture (e.g. sm_35) or 'all'. May be specified more than once.">;
def offload_arch_EQ : Joined<["--"], "offload-arch=">, Flags<[NoXarchOption]>,		def offload_arch_EQ : Joined<["--"], "offload-arch=">, Flags<[NoXarchOption]>,
HelpText<"CUDA offloading device architecture (e.g. sm_35), or HIP offloading target ID in the form of a "		HelpText<"CUDA offloading device architecture (e.g. sm_35), or HIP offloading target ID in the form of a "
"device architecture followed by target ID features delimited by a colon. Each target ID feature "		"device architecture followed by target ID features delimited by a colon. Each target ID feature "
"is a pre-defined string followed by a plus or minus sign (e.g. gfx908:xnack+:sramecc-). May be "		"is a pre-defined string followed by a plus or minus sign (e.g. gfx908:xnack+:sramecc-). May be "
▲ Show 20 Lines • Show All 1,604 Lines • ▼ Show 20 Lines	def fopenmp_target_new_runtime : Flag<["-"], "fopenmp-target-new-runtime">,
Group<f_Group>, Flags<[CC1Option, HelpHidden]>;		Group<f_Group>, Flags<[CC1Option, HelpHidden]>;
def fno_openmp_target_new_runtime : Flag<["-"], "fno-openmp-target-new-runtime">,		def fno_openmp_target_new_runtime : Flag<["-"], "fno-openmp-target-new-runtime">,
Group<f_Group>, Flags<[CC1Option, HelpHidden]>;		Group<f_Group>, Flags<[CC1Option, HelpHidden]>;
defm openmp_optimistic_collapse : BoolFOption<"openmp-optimistic-collapse",		defm openmp_optimistic_collapse : BoolFOption<"openmp-optimistic-collapse",
LangOpts<"OpenMPOptimisticCollapse">, DefaultFalse,		LangOpts<"OpenMPOptimisticCollapse">, DefaultFalse,
PosFlag<SetTrue, [CC1Option]>, NegFlag<SetFalse>, BothFlags<[NoArgumentUnused, HelpHidden]>>;		PosFlag<SetTrue, [CC1Option]>, NegFlag<SetFalse>, BothFlags<[NoArgumentUnused, HelpHidden]>>;
def static_openmp: Flag<["-"], "static-openmp">,		def static_openmp: Flag<["-"], "static-openmp">,
HelpText<"Use the static host OpenMP runtime while linking.">;		HelpText<"Use the static host OpenMP runtime while linking.">;
def offload_new_driver : Flag<["--"], "offload-new-driver">, Flags<[CC1Option]>, Group<Action_Group>,		def offload_new_driver : Flag<["--"], "offload-new-driver">, Flags<[CC1Option]>, Group<Action_Group>,
		traUnsubmitted Not Done Reply Inline Actions We should be using "--" for the new options. tra: We should be using "--" for the new options.
		jhuber6AuthorUnsubmitted Done Reply Inline Actions What's the reason for preferring `--` over `-`? Is it just because `-o` can bind to output? jhuber6: What's the reason for preferring `--` over `-`? Is it just because `-o` can bind to output?
		traUnsubmitted Not Done Reply Inline Actions Convention, I guess. Legacy (e.g. `-nostdlib` ) and single-letter options (e.g. `-o` `-m`, `-f` ) use single dash. Long options typically use double-dash. tra: Convention, I guess. Legacy (e.g. `-nostdlib` ) and single-letter options (e.g. `-o` `-m`, `…
HelpText<"Use the new driver for offloading compilation.">;		HelpText<"Use the new driver for offloading compilation.">;
def no_offload_new_driver : Flag<["--"], "no-offload-new-driver">, Flags<[CC1Option]>, Group<Action_Group>,		def no_offload_new_driver : Flag<["--"], "no-offload-new-driver">, Flags<[CC1Option]>, Group<Action_Group>,
HelpText<"Don't Use the new driver for offloading compilation.">;		HelpText<"Don't Use the new driver for offloading compilation.">;
		traUnsubmitted Not Done Reply Inline Actions We should probably alias `--cuda-host-only`/`--cuda-device-only` to these. tra: We should probably alias `--cuda-host-only`/`--cuda-device-only` to these.
		jhuber6AuthorUnsubmitted Done Reply Inline Actions Unfortunately I can't make these a strict alias for `-foffload-host-only`/`-foffload-device-only` without breaking their current usage. I could just check them along with these new versions, but that makes it a little cluttered. jhuber6: Unfortunately I can't make these a strict alias for `-foffload-host-only`/`-foffload-device…
		traUnsubmitted Not Done Reply Inline Actions Could you give me an example why we can use just one set of options to control host-only and device-only compilation? Is that because we also have `--cuda-compile-host-device`? Or is it because `-f` options have `-fno-` variants to override them, while `--cuda-host/device-only` don't, so we can't just mark them as an alias in Options.td? To think of it, those are just facets of the same issue -- difference in the options syntax. I do not see semantic differences between the existing options and -foffload--only. E.g. I believe you could've used --cuda-host-only and --cuda-device-only instead of -foffload--only without the loss of functionality of your patch. tra: Could you give me an example why we can use just one set of options to control host-only and…
		jhuber6AuthorUnsubmitted Done Reply Inline Actions I could have used those options, but this applies in general to the new driver, so it wouldn't be intuitive if we used `--cuda-device-only` when offloading to x86_64 with OpenMP for example. The problem is that the `--cuda-device-only` is already used in the current offloading driver so if I just make it an alias to this option it won't work. I was hoping to avoid touching the old driver, but I guess I could go back and replace all uses of `--cuda-host/device-only` with these new options. Also I did forget to add the `fno-` variants for these and use them correctly. I'm not sure if there's an explicit reason to, but everything else in the OpenMP world uses `-f` prefixes so I'm mostly just sticking with that. jhuber6: I could have used those options, but this applies in general to the new driver, so it wouldn't…
		traUnsubmitted Not Done Reply Inline Actions it wouldn't be intuitive if we used --cuda-device-only when offloading to x86_64 with OpenMP On the other hand, adding another set of options replicating the functionality creates more opportunities for conflicting usage. E.g. what should we do if user specifies `--cuda-device-only` and `-foffload-device-only` ? We then need to diagnose that in a sensible way, at the very least. In general, we have been transitioning some options that used to be CUDA-specific into more general variants. Usually `--cuda` -> `--offload` (--cuda-gpu-arch->--offload-arch) or `--gpu`. (-fcuda-rdc -> -fgpu-rdc) Picking the offloading side should follow the same pattern. Speaking of names and renaming. Perhaps `-f` is not the best choice here. `-f` options are conventionally used to control code generation parameters. `-foffload--only` mostly controls the driver behavior. Perhaps it would make sense to call them `--offload-host-only`, `--offload-device-only` and `--offload-host-device` and alias them to the matching `--cuda` counterparts (or vice-versa + search/replace their use in the code). tra:* > it wouldn't be intuitive if we used --cuda-device-only when offloading to x86_64 with…
		def offload_device_only : Flag<["--"], "offload-device-only">,
		HelpText<"Only compile for the offloading device.">;
		def offload_host_only : Flag<["--"], "offload-host-only">,
		HelpText<"Only compile for the offloading host.">;
		def offload_host_device : Flag<["--"], "offload-host-device">,
		HelpText<"Only compile for the offloading host.">;
		def cuda_device_only : Flag<["--"], "cuda-device-only">, Alias<offload_device_only>,
		HelpText<"Compile CUDA code for device only">;
		def cuda_host_only : Flag<["--"], "cuda-host-only">, Alias<offload_host_only>,
		HelpText<"Compile CUDA code for host only. Has no effect on non-CUDA compilations.">;
		def cuda_compile_host_device : Flag<["--"], "cuda-compile-host-device">, Alias<offload_host_device>,
		HelpText<"Compile CUDA code for both host and device (default). Has no "
		"effect on non-CUDA compilations.">;
def fopenmp_new_driver : Flag<["-"], "fopenmp-new-driver">, Flags<[CC1Option]>, Group<Action_Group>,		def fopenmp_new_driver : Flag<["-"], "fopenmp-new-driver">, Flags<[CC1Option]>, Group<Action_Group>,
HelpText<"Use the new driver for OpenMP offloading.">;		HelpText<"Use the new driver for OpenMP offloading.">;
def fno_openmp_new_driver : Flag<["-"], "fno-openmp-new-driver">, Flags<[CC1Option]>, Group<Action_Group>,		def fno_openmp_new_driver : Flag<["-"], "fno-openmp-new-driver">, Flags<[CC1Option]>, Group<Action_Group>,
Alias<no_offload_new_driver>, HelpText<"Don't use the new driver for OpenMP offloading.">;		Alias<no_offload_new_driver>, HelpText<"Don't use the new driver for OpenMP offloading.">;
def fno_optimize_sibling_calls : Flag<["-"], "fno-optimize-sibling-calls">, Group<f_Group>, Flags<[CC1Option]>,		def fno_optimize_sibling_calls : Flag<["-"], "fno-optimize-sibling-calls">, Group<f_Group>, Flags<[CC1Option]>,
HelpText<"Disable tail call optimization, keeping the call stack accurate">,		HelpText<"Disable tail call optimization, keeping the call stack accurate">,
MarshallingInfoFlag<CodeGenOpts<"DisableTailCalls">>;		MarshallingInfoFlag<CodeGenOpts<"DisableTailCalls">>;
def foptimize_sibling_calls : Flag<["-"], "foptimize-sibling-calls">, Group<f_Group>;		def foptimize_sibling_calls : Flag<["-"], "foptimize-sibling-calls">, Group<f_Group>;
▲ Show 20 Lines • Show All 4,208 Lines • Show Last 20 Lines

clang/lib/Driver/Driver.cpp

Show First 20 Lines • Show All 2,862 Lines • ▼ Show 20 Lines	bool initialize() override {
}		}

ToolChains.push_back(		ToolChains.push_back(
AssociatedOffloadKind == Action::OFK_Cuda		AssociatedOffloadKind == Action::OFK_Cuda
? C.getSingleOffloadToolChain<Action::OFK_Cuda>()		? C.getSingleOffloadToolChain<Action::OFK_Cuda>()
: C.getSingleOffloadToolChain<Action::OFK_HIP>());		: C.getSingleOffloadToolChain<Action::OFK_HIP>());

Arg *PartialCompilationArg = Args.getLastArg(		Arg *PartialCompilationArg = Args.getLastArg(
options::OPT_cuda_host_only, options::OPT_cuda_device_only,		options::OPT_offload_host_only, options::OPT_offload_device_only,
options::OPT_cuda_compile_host_device);		options::OPT_offload_host_device);
CompileHostOnly = PartialCompilationArg &&		CompileHostOnly =
PartialCompilationArg->getOption().matches(		PartialCompilationArg && PartialCompilationArg->getOption().matches(
options::OPT_cuda_host_only);		options::OPT_offload_host_only);
CompileDeviceOnly = PartialCompilationArg &&		CompileDeviceOnly =
PartialCompilationArg->getOption().matches(		PartialCompilationArg && PartialCompilationArg->getOption().matches(
options::OPT_cuda_device_only);		options::OPT_offload_device_only);
EmitLLVM = Args.getLastArg(options::OPT_emit_llvm);		EmitLLVM = Args.getLastArg(options::OPT_emit_llvm);
EmitAsm = Args.getLastArg(options::OPT_S);		EmitAsm = Args.getLastArg(options::OPT_S);
FixedCUID = Args.getLastArgValue(options::OPT_cuid_EQ);		FixedCUID = Args.getLastArgValue(options::OPT_cuid_EQ);
if (Arg *A = Args.getLastArg(options::OPT_fuse_cuid_EQ)) {		if (Arg *A = Args.getLastArg(options::OPT_fuse_cuid_EQ)) {
StringRef UseCUIDStr = A->getValue();		StringRef UseCUIDStr = A->getValue();
UseCUID = llvm::StringSwitch<UseCUIDKind>(UseCUIDStr)		UseCUID = llvm::StringSwitch<UseCUIDKind>(UseCUIDStr)
.Case("hash", CUID_Hash)		.Case("hash", CUID_Hash)
.Case("random", CUID_Random)		.Case("random", CUID_Random)
▲ Show 20 Lines • Show All 1,163 Lines • ▼ Show 20 Lines	for (phases::ID Phase : PL) {
}		}

if (Phase == phases::Precompile && ExtractAPIAction) {		if (Phase == phases::Precompile && ExtractAPIAction) {
ExtractAPIAction->addHeaderInput(Current);		ExtractAPIAction->addHeaderInput(Current);
Current = nullptr;		Current = nullptr;
break;		break;
}		}

// Try to build the offloading actions and add the result as a dependency
// to the host.
if (UseNewOffloadingDriver)
Current = BuildOffloadingActions(C, Args, I, Current);

// FIXME: Should we include any prior module file outputs as inputs of		// FIXME: Should we include any prior module file outputs as inputs of
// later actions in the same command line?		// later actions in the same command line?

// Otherwise construct the appropriate action.		// Otherwise construct the appropriate action.
Action *NewCurrent = ConstructPhaseAction(C, Args, Phase, Current);		Action *NewCurrent = ConstructPhaseAction(C, Args, Phase, Current);

// We didn't create a new action, so we will just move to the next phase.		// We didn't create a new action, so we will just move to the next phase.
if (NewCurrent == Current)		if (NewCurrent == Current)
continue;		continue;

if (auto *HMA = dyn_cast<HeaderModulePrecompileJobAction>(NewCurrent))		if (auto *HMA = dyn_cast<HeaderModulePrecompileJobAction>(NewCurrent))
HeaderModuleAction = HMA;		HeaderModuleAction = HMA;
else if (auto *EAA = dyn_cast<ExtractAPIJobAction>(NewCurrent))		else if (auto *EAA = dyn_cast<ExtractAPIJobAction>(NewCurrent))
ExtractAPIAction = EAA;		ExtractAPIAction = EAA;

Current = NewCurrent;		Current = NewCurrent;

// Use the current host action in any of the offloading actions, if		// Use the current host action in any of the offloading actions, if
// required.		// required.
if (!UseNewOffloadingDriver)		if (!UseNewOffloadingDriver)
if (OffloadBuilder.addHostDependenceToDeviceActions(Current, InputArg))		if (OffloadBuilder.addHostDependenceToDeviceActions(Current, InputArg))
break;		break;

		// Try to build the offloading actions and add the result as a dependency
		// to the host.
		if (UseNewOffloadingDriver)
		Current = BuildOffloadingActions(C, Args, I, Current);

if (Current->getType() == types::TY_Nothing)		if (Current->getType() == types::TY_Nothing)
break;		break;
}		}

// If we ended with something, add to the output list.		// If we ended with something, add to the output list.
if (Current)		if (Current)
Actions.push_back(Current);		Actions.push_back(Current);

▲ Show 20 Lines • Show All 104 Lines • ▼ Show 20 Lines	if (Arg *A = Args.getLastArg(options::OPT_print_supported_cpus)) {
Actions.push_back(		Actions.push_back(
C.MakeAction<PrecompileJobAction>(InputAc, types::TY_Nothing));		C.MakeAction<PrecompileJobAction>(InputAc, types::TY_Nothing));
for (auto &I : Inputs)		for (auto &I : Inputs)
I.second->claim();		I.second->claim();
}		}

// Claim ignored clang-cl options.		// Claim ignored clang-cl options.
Args.ClaimAllArgs(options::OPT_cl_ignored_Group);		Args.ClaimAllArgs(options::OPT_cl_ignored_Group);

// Claim --cuda-host-only and --cuda-compile-host-device, which may be passed		// Claim --offload-host-only and --offload-compile-host-device, which may be
		traUnsubmitted Not Done Reply Inline Actions Comment may need updating. tra: Comment may need updating.
// to non-CUDA compilations and should not trigger warnings there.		// passed to non-CUDA compilations and should not trigger warnings there.
Args.ClaimAllArgs(options::OPT_cuda_host_only);		Args.ClaimAllArgs(options::OPT_offload_host_only);
Args.ClaimAllArgs(options::OPT_cuda_compile_host_device);		Args.ClaimAllArgs(options::OPT_offload_host_device);
}		}

/// Returns the canonical name for the offloading architecture when using HIP or		/// Returns the canonical name for the offloading architecture when using HIP or
/// CUDA.		/// CUDA.
static StringRef getCanonicalArchString(Compilation &C,		static StringRef getCanonicalArchString(Compilation &C,
llvm::opt::DerivedArgList &Args,		llvm::opt::DerivedArgList &Args,
StringRef ArchStr,		StringRef ArchStr,
Action::OffloadKind Kind) {		Action::OffloadKind Kind) {
▲ Show 20 Lines • Show All 85 Lines • ▼ Show 20 Lines	getOffloadArchs(Compilation &C, llvm::opt::DerivedArgList &Args,

return Archs;		return Archs;
}		}

Action *Driver::BuildOffloadingActions(Compilation &C,		Action *Driver::BuildOffloadingActions(Compilation &C,
llvm::opt::DerivedArgList &Args,		llvm::opt::DerivedArgList &Args,
const InputTy &Input,		const InputTy &Input,
Action *HostAction) const {		Action *HostAction) const {
if (!isa<CompileJobAction>(HostAction))		const Arg *Mode = Args.getLastArg(options::OPT_offload_host_only,
		options::OPT_offload_device_only,
		options::OPT_offload_host_device);
		const bool HostOnly =
		Mode && Mode->getOption().matches(options::OPT_offload_host_only);
		const bool DeviceOnly =
		Mode && Mode->getOption().matches(options::OPT_offload_device_only);

		// Don't build offloading actions if explicitly disabled or we do not have a
		// compile action to embed it in. If preprocessing only ignore embedding.
		if (HostOnly \|\| !(isa<CompileJobAction>(HostAction) \|\|
		getFinalPhase(Args) == phases::Preprocess))
		traUnsubmitted Not Done Reply Inline Actions This will not always be correct. E.g. `--offload-host-only --offload-host-device` would be true here, but we would still want to compile for both and device. Is there a reason we can no longer rely on `HostAction`? tra: This will not always be correct. E.g. `--offload-host-only --offload-host-device` would be true…
		jhuber6AuthorUnsubmitted Done Reply Inline Actions Guess I should just use the last argument. Host action is used below, can probably merge these if statements. jhuber6: Guess I should just use the last argument. Host action is used below, can probably merge these…
		traUnsubmitted Not Done Reply Inline Actions I guess the general idea is to avoid the ambiguity about what controls the behavior of the function. Is that the `Args`, or the `HostAction`? Ideally I'd prefer to parse command line options once, save results somewhere we could use them and then use those flargs to control the behavior, regardless of which options were used to specify it. E.g. CUDA/HIPActionBuilder classes have these member fields: bool CompileHostOnly = false; bool CompileDeviceOnly = false; ... Arg PartialCompilationArg = Args.getLastArg( options::OPT_cuda_host_only, options::OPT_cuda_device_only, options::OPT_cuda_compile_host_device); CompileHostOnly = PartialCompilationArg && PartialCompilationArg->getOption().matches( options::OPT_cuda_host_only); CompileDeviceOnly = PartialCompilationArg && PartialCompilationArg->getOption().matches( options::OPT_cuda_device_only); ... tra:* I guess the general idea is to avoid the ambiguity about what controls the behavior of the…
		jhuber6AuthorUnsubmitted Done Reply Inline Actions I more or less copied that below, but I suppose we are recalculating them each phase. We don't have a monolithic class with my new implementation, but I suppose I could add these to the Compilation or something if we don't want to recalculate it. Beside that, does anything else seem amiss? jhuber6: I more or less copied that below, but I suppose we are recalculating them each phase. We don't…
return HostAction;		return HostAction;

OffloadAction::DeviceDependences DDeps;		OffloadAction::DeviceDependences DDeps;

types::ID InputType = Input.first;
const Arg *InputArg = Input.second;

const Action::OffloadKind OffloadKinds[] = {		const Action::OffloadKind OffloadKinds[] = {
Action::OFK_OpenMP, Action::OFK_Cuda, Action::OFK_HIP};		Action::OFK_OpenMP, Action::OFK_Cuda, Action::OFK_HIP};

		traUnsubmitted Not Done Reply Inline Actions This could use a comment. I don't quite understand what we're doing here and why. Only doing host-side preprocessing if -E is passed? tra: This could use a comment. I don't quite understand what we're doing here and why. Only doing…
		jhuber6AuthorUnsubmitted Done Reply Inline Actions OpenMP requires host IR for the device compile, but `-E` doesn't generate one, but we don't use it for preprocessing. jhuber6: OpenMP requires host IR for the device compile, but `-E` doesn't generate one, but we don't use…
for (Action::OffloadKind Kind : OffloadKinds) {		for (Action::OffloadKind Kind : OffloadKinds) {
SmallVector<const ToolChain *, 2> ToolChains;		SmallVector<const ToolChain *, 2> ToolChains;
		traUnsubmitted Not Done Reply Inline Actions Nit: `(!isa<CompileJobAction>(HostAction) && PL.back() != phases::Preprocess)` -> `!(isa<CompileJobAction>(HostAction) \|\| PL.back() == phases::Preprocess)` It's a bit easier to understand that way, IMO. We could also return early if `HostOnly` is set and make this condition simpler. tra: Nit: `(!isa<CompileJobAction>(HostAction) && PL.back() != phases::Preprocess)` -> `!
ActionList DeviceActions;		ActionList DeviceActions;

auto TCRange = C.getOffloadToolChains(Kind);		auto TCRange = C.getOffloadToolChains(Kind);
for (auto TI = TCRange.first, TE = TCRange.second; TI != TE; ++TI)		for (auto TI = TCRange.first, TE = TCRange.second; TI != TE; ++TI)
ToolChains.push_back(TI->second);		ToolChains.push_back(TI->second);

if (ToolChains.empty())		if (ToolChains.empty())
continue;		continue;

		types::ID InputType = Input.first;
		const Arg *InputArg = Input.second;

// Get the product of all bound architectures and toolchains.		// Get the product of all bound architectures and toolchains.
SmallVector<std::pair<const ToolChain *, StringRef>> TCAndArchs;		SmallVector<std::pair<const ToolChain *, StringRef>> TCAndArchs;
for (const ToolChain *TC : ToolChains)		for (const ToolChain *TC : ToolChains)
for (StringRef Arch : getOffloadArchs(C, Args, Kind))		for (StringRef Arch : getOffloadArchs(C, Args, Kind))
TCAndArchs.push_back(std::make_pair(TC, Arch));		TCAndArchs.push_back(std::make_pair(TC, Arch));

for (unsigned I = 0, E = TCAndArchs.size(); I != E; ++I)		for (unsigned I = 0, E = TCAndArchs.size(); I != E; ++I)
DeviceActions.push_back(C.MakeAction<InputAction>(*InputArg, InputType));		DeviceActions.push_back(C.MakeAction<InputAction>(*InputArg, InputType));

if (DeviceActions.empty())		if (DeviceActions.empty())
return HostAction;		return HostAction;

auto PL = types::getCompilationPhases(*this, Args, InputType);		auto PL = types::getCompilationPhases(*this, Args, InputType);

for (phases::ID Phase : PL) {		for (phases::ID Phase : PL) {
if (Phase == phases::Link) {		if (Phase == phases::Link) {
assert(Phase == PL.back() && "linking must be final compilation step.");		assert(Phase == PL.back() && "linking must be final compilation step.");
break;		break;
}		}

auto TCAndArch = TCAndArchs.begin();		auto TCAndArch = TCAndArchs.begin();
for (Action *&A : DeviceActions) {		for (Action *&A : DeviceActions) {
A = ConstructPhaseAction(C, Args, Phase, A, Kind);		A = ConstructPhaseAction(C, Args, Phase, A, Kind);

if (isa<CompileJobAction>(A) && Kind == Action::OFK_OpenMP) {		if (isa<CompileJobAction>(A) && isa<CompileJobAction>(HostAction) &&
		Kind == Action::OFK_OpenMP) {
// OpenMP offloading has a dependency on the host compile action to		// OpenMP offloading has a dependency on the host compile action to
// identify which declarations need to be emitted. This shouldn't be		// identify which declarations need to be emitted. This shouldn't be
// collapsed with any other actions so we can use it in the device.		// collapsed with any other actions so we can use it in the device.
HostAction->setCannotBeCollapsedWithNextDependentAction();		HostAction->setCannotBeCollapsedWithNextDependentAction();
OffloadAction::HostDependence HDep(		OffloadAction::HostDependence HDep(
HostAction, C.getSingleOffloadToolChain<Action::OFK_Host>(),		HostAction, C.getSingleOffloadToolChain<Action::OFK_Host>(),
/BoundArch=/nullptr, Kind);		/BoundArch=/nullptr, Kind);
OffloadAction::DeviceDependences DDep;		OffloadAction::DeviceDependences DDep;
Show All 17 Lines	for (Action::OffloadKind Kind : OffloadKinds) {

auto TCAndArch = TCAndArchs.begin();		auto TCAndArch = TCAndArchs.begin();
for (Action *A : DeviceActions) {		for (Action *A : DeviceActions) {
DDeps.add(A, TCAndArch->first, TCAndArch->second.data(), Kind);		DDeps.add(A, TCAndArch->first, TCAndArch->second.data(), Kind);
++TCAndArch;		++TCAndArch;
}		}
}		}

		if (DeviceOnly)
		traUnsubmitted Not Done Reply Inline Actions Same problem as with the host-checking above. tra: Same problem as with the host-checking above.
		return C.MakeAction<OffloadAction>(DDeps, types::TY_Nothing);

OffloadAction::HostDependence HDep(		OffloadAction::HostDependence HDep(
HostAction, C.getSingleOffloadToolChain<Action::OFK_Host>(),		HostAction, C.getSingleOffloadToolChain<Action::OFK_Host>(),
/BoundArch=/nullptr, DDeps);		/BoundArch=/nullptr, DDeps);
return C.MakeAction<OffloadAction>(HDep, DDeps);		return C.MakeAction<OffloadAction>(HDep, DDeps);
}		}

Action *Driver::ConstructPhaseAction(		Action *Driver::ConstructPhaseAction(
Compilation &C, const ArgList &Args, phases::ID Phase, Action *Input,		Compilation &C, const ArgList &Args, phases::ID Phase, Action *Input,
▲ Show 20 Lines • Show All 1,785 Lines • Show Last 20 Lines

clang/test/Driver/cuda-openmp-driver.cu

	Show All 10 Lines
	// BINDINGS-NEXT: "nvptx64-nvidia-cuda" - "clang", inputs: ["[[INPUT]]"], output: "[[PTX_SM_70:.+]]"			// BINDINGS-NEXT: "nvptx64-nvidia-cuda" - "clang", inputs: ["[[INPUT]]"], output: "[[PTX_SM_70:.+]]"
	// BINDINGS-NEXT: "nvptx64-nvidia-cuda" - "NVPTX::Assembler", inputs: ["[[PTX_SM_70:.+]]"], output: "[[CUBIN_SM_70:.+]]"			// BINDINGS-NEXT: "nvptx64-nvidia-cuda" - "NVPTX::Assembler", inputs: ["[[PTX_SM_70:.+]]"], output: "[[CUBIN_SM_70:.+]]"
	// BINDINGS-NEXT: "nvptx64-nvidia-cuda" - "NVPTX::Linker", inputs: ["[[CUBIN_SM_70]]", "[[PTX_SM_70:.+]]"], output: "[[FATBIN_SM_70:.+]]"			// BINDINGS-NEXT: "nvptx64-nvidia-cuda" - "NVPTX::Linker", inputs: ["[[CUBIN_SM_70]]", "[[PTX_SM_70:.+]]"], output: "[[FATBIN_SM_70:.+]]"
	// BINDINGS-NEXT: "x86_64-unknown-linux-gnu" - "clang", inputs: ["[[INPUT]]", "[[FATBIN_SM_35]]", "[[FATBIN_SM_70]]"], output: "[[HOST_OBJ:.+]]"			// BINDINGS-NEXT: "x86_64-unknown-linux-gnu" - "clang", inputs: ["[[INPUT]]", "[[FATBIN_SM_35]]", "[[FATBIN_SM_70]]"], output: "[[HOST_OBJ:.+]]"
	// BINDINGS-NEXT: "x86_64-unknown-linux-gnu" - "Offload::Linker", inputs: ["[[HOST_OBJ]]"], output: "a.out"			// BINDINGS-NEXT: "x86_64-unknown-linux-gnu" - "Offload::Linker", inputs: ["[[HOST_OBJ]]"], output: "a.out"

	// RUN: %clang -### -nocudalib --offload-new-driver %s 2>&1 \| FileCheck -check-prefix RDC %s			// RUN: %clang -### -nocudalib --offload-new-driver %s 2>&1 \| FileCheck -check-prefix RDC %s
	// RDC: error: Using '--offload-new-driver' requires '-fgpu-rdc'			// RDC: error: Using '--offload-new-driver' requires '-fgpu-rdc'

				// RUN: %clang -### -target x86_64-linux-gnu -nocudalib -ccc-print-bindings -fgpu-rdc \
				// RUN: --offload-new-driver --offload-arch=sm_35 --offload-arch=sm_70 %s 2>&1 \
				// RUN: \| FileCheck -check-prefix BINDINGS-HOST %s

				// BINDINGS-HOST: # "x86_64-unknown-linux-gnu" - "clang", inputs: ["[[INPUT:.+]]"], output: "[[OUTPUT:.+]]"
				// BINDINGS-HOST: # "x86_64-unknown-linux-gnu" - "Offload::Linker", inputs: ["[[OUTPUT]]"], output: "a.out"

				// RUN: %clang -### -target x86_64-linux-gnu -nocudalib -ccc-print-bindings -fgpu-rdc \
				// RUN: --offload-new-driver --offload-arch=sm_35 --offload-arch=sm_70 %s 2>&1 \
				// RUN: \| FileCheck -check-prefix BINDINGS-DEVICE %s

				// BINDINGS-DEVICE: # "nvptx64-nvidia-cuda" - "clang", inputs: ["[[INPUT:.+]]"], output: "[[PTX:.+]]"
				// BINDINGS-DEVICE: # "nvptx64-nvidia-cuda" - "NVPTX::Assembler", inputs: ["[[PTX]]"], output: "[[CUBIN:.+]]"
				// BINDINGS-DEVICE: # "nvptx64-nvidia-cuda" - "NVPTX::Linker", inputs: ["[[CUBIN]]", "[[PTX]]"], output: "{{.*}}.fatbin"

clang/test/Driver/openmp-offload-gpu-new.c

	///			///
	/// Perform several driver tests for OpenMP offloading			/// Perform several driver tests for OpenMP offloading
	///			///

	// REQUIRES: x86-registered-target			// REQUIRES: x86-registered-target
	// REQUIRES: powerpc-registered-target
	// REQUIRES: nvptx-registered-target			// REQUIRES: nvptx-registered-target
	// REQUIRES: amdgpu-registered-target			// REQUIRES: amdgpu-registered-target

	// RUN: %clang -### --target=x86_64-unknown-linux-gnu -fopenmp -fopenmp-targets=nvptx64-nvidia-cuda \			// RUN: %clang -### --target=x86_64-unknown-linux-gnu -fopenmp -fopenmp-targets=nvptx64-nvidia-cuda \
	// RUN: -Xopenmp-target=nvptx64-nvidia-cuda -march=sm_52 \			// RUN: -Xopenmp-target=nvptx64-nvidia-cuda -march=sm_52 \
	// RUN: --libomptarget-nvptx-bc-path=%S/Inputs/libomptarget/libomptarget-nvptx-test.bc %s 2>&1 \			// RUN: --libomptarget-nvptx-bc-path=%S/Inputs/libomptarget/libomptarget-nvptx-test.bc %s 2>&1 \
	// RUN: \| FileCheck %s			// RUN: \| FileCheck %s

	Show All 30 Lines
	// CHECK-EMIT-LLVM-IR: clang{{.}}"-cc1"{{.}}"-triple" "nvptx64-nvidia-cuda"{{.*}}"-emit-llvm"			// CHECK-EMIT-LLVM-IR: clang{{.}}"-cc1"{{.}}"-triple" "nvptx64-nvidia-cuda"{{.*}}"-emit-llvm"

	// RUN: %clang -### -fopenmp=libomp -fopenmp-targets=nvptx64-nvidia-cuda -Xopenmp-target=nvptx64-nvida-cuda -march=sm_70 \			// RUN: %clang -### -fopenmp=libomp -fopenmp-targets=nvptx64-nvidia-cuda -Xopenmp-target=nvptx64-nvida-cuda -march=sm_70 \
	// RUN: --libomptarget-nvptx-bc-path=%S/Inputs/libomptarget/libomptarget-new-nvptx-test.bc \			// RUN: --libomptarget-nvptx-bc-path=%S/Inputs/libomptarget/libomptarget-new-nvptx-test.bc \
	// RUN: -no-canonical-prefixes -nogpulib %s -o openmp-offload-gpu 2>&1 \			// RUN: -no-canonical-prefixes -nogpulib %s -o openmp-offload-gpu 2>&1 \
	// RUN: \| FileCheck -check-prefix=DRIVER_EMBEDDING %s			// RUN: \| FileCheck -check-prefix=DRIVER_EMBEDDING %s

	// DRIVER_EMBEDDING: -fembed-offload-object=[[CUBIN:.*\.cubin]],openmp,nvptx64-nvidia-cuda,sm_70			// DRIVER_EMBEDDING: -fembed-offload-object=[[CUBIN:.*\.cubin]],openmp,nvptx64-nvidia-cuda,sm_70

				// RUN: %clang -### --target=x86_64-unknown-linux-gnu -ccc-print-bindings -fopenmp -fopenmp-targets=nvptx64-nvidia-cuda \
				// RUN: --offload-host-only -nogpulib %s 2>&1 \| FileCheck %s --check-prefix=CHECK-HOST-ONLY
				// CHECK-HOST-ONLY: "x86_64-unknown-linux-gnu" - "clang", inputs: ["[[INPUT:.]]"], output: "[[OUTPUT:.]]"
				// CHECK-HOST-ONLY: "x86_64-unknown-linux-gnu" - "Offload::Linker", inputs: ["[[OUTPUT]]"], output: "a.out"

				// RUN: %clang -### --target=x86_64-unknown-linux-gnu -ccc-print-bindings -fopenmp -fopenmp-targets=nvptx64-nvidia-cuda \
				// RUN: --offload-device-only -nogpulib %s 2>&1 \| FileCheck %s --check-prefix=CHECK-DEVICE-ONLY
				// CHECK-DEVICE-ONLY: "x86_64-unknown-linux-gnu" - "clang", inputs: ["[[INPUT:.]]"], output: "[[HOST_BC:.]]"
				// CHECK-DEVICE-ONLY: "nvptx64-nvidia-cuda" - "clang", inputs: ["[[INPUT]]", "[[HOST_BC]]"], output: "[[DEVICE_ASM:.*]]"
				// CHECK-DEVICE-ONLY: "nvptx64-nvidia-cuda" - "NVPTX::Assembler", inputs: ["[[DEVICE_ASM]]"], output: "{{.*}}-openmp-nvptx64-nvidia-cuda.o"

				// RUN: %clang -### --target=x86_64-unknown-linux-gnu -ccc-print-bindings -fopenmp -fopenmp-targets=nvptx64-nvidia-cuda \
				// RUN: --offload-device-only -E -nogpulib %s 2>&1 \| FileCheck %s --check-prefix=CHECK-DEVICE-ONLY-PP
				// CHECK-DEVICE-ONLY-PP: "nvptx64-nvidia-cuda" - "clang", inputs: ["[[INPUT:.*]]"], output: "-"

This is an archive of the discontinued LLVM Phabricator instance.

[OpenMP] Add options to only compile the host or device when offloading
ClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 426064

clang/include/clang/Driver/Options.td

clang/lib/Driver/Driver.cpp

clang/test/Driver/cuda-openmp-driver.cu

clang/test/Driver/openmp-offload-gpu-new.c

This is an archive of the discontinued LLVM Phabricator instance.

[OpenMP] Add options to only compile the host or device when offloadingClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 426064

clang/include/clang/Driver/Options.td

clang/lib/Driver/Driver.cpp

clang/test/Driver/cuda-openmp-driver.cu

clang/test/Driver/openmp-offload-gpu-new.c

[OpenMP] Add options to only compile the host or device when offloading
ClosedPublic