This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
clang/
-
lib/Driver/
-
Driver/
-
Driver.cpp
-
test/Driver/
-
Driver/
-
openmp-offload-gpu.c

Differential D97273

OpenMP: Fix object clobbering issue when using save-temps
ClosedPublic

Authored by pdhaliwal on Feb 23 2021, 5:05 AM.

Download Raw Diff

Details

Reviewers

jdoerfert
JonChesterfield
ronlieb
tianshilei1992
sdmitriev

Commits

rG99951aa68da3: OpenMP: Fix object clobbering issue when using save-temps

Summary

There are two preconditions to reproduce the issue,

Use -save-temps option
Provide the -o option with name equal to the input file name without the file extension. For e.g. clang a.c -o a

With the -o specified, the AssembleJobAction after OffloadWrapperJobAction
will produce the object file with same name as host code object file.
Due to this clash, the OffloadWrapperAction overwrites the initial host
object file, which results in lld error. This also fixes the multiple definition of __dummy.omp_offloading.entry' issue in D96769 .

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

pdhaliwal created this revision.Feb 23 2021, 5:05 AM

Herald added subscribers: guansong, yaxunl. · View Herald TranscriptFeb 23 2021, 5:05 AM

pdhaliwal requested review of this revision.Feb 23 2021, 5:05 AM

Herald added a project: Restricted Project. · View Herald TranscriptFeb 23 2021, 5:06 AM

Herald added subscribers: cfe-commits, sstefan1. · View Herald Transcript

pdhaliwal edited the summary of this revision. (Show Details)Feb 23 2021, 5:48 AM

pdhaliwal edited the summary of this revision. (Show Details)Feb 23 2021, 6:11 AM

Harbormaster completed remote builds in B90375: Diff 325746.Feb 23 2021, 6:35 AM

Here's a bit of background,
OffloadingPrefix was not getting properly set in the dependent actions of OffloadWrapperJobAction (which are backend [11] and assemble [12]). Since backend [11] and assemble [12] host-wrapper actions have same logic to the other host actions (3 & 4), those will overwrite the previous generated files from host-only actions.

For e.g. following were the names generated for output files previously (marked as bold). (clang -fopenmp -fopenmp-targets=nvptx64-nvidia-cuda -save-temps -ccc-print-bindings helloworld.c -o helloworld)

"x86_64-unknown-linux-gnu" - "clang", inputs: ["helloworld.c"], output: "helloworld-host-x86_64-unknown-linux-gnu.i"
"x86_64-unknown-linux-gnu" - "clang", inputs: ["helloworld-host-x86_64-unknown-linux-gnu.i"], output: "helloworld-host-x86_64-unknown-linux-gnu.bc"
"x86_64-unknown-linux-gnu" - "clang", inputs: ["helloworld-host-x86_64-unknown-linux-gnu.bc"], output: "helloworld-host-x86_64-unknown-linux-gnu.s"
"x86_64-unknown-linux-gnu" - "clang::as", inputs: ["helloworld-host-x86_64-unknown-linux-gnu.s"], output: "helloworld-host-x86_64-unknown-linux-gnu.o"
"nvptx64-nvidia-cuda" - "clang", inputs: ["helloworld.c"], output: "helloworld-openmp-nvptx64-nvidia-cuda.i"
"nvptx64-nvidia-cuda" - "clang", inputs: ["helloworld-openmp-nvptx64-nvidia-cuda.i", "helloworld-host-x86_64-unknown-linux-gnu.bc"], output: "helloworld-openmp-nvptx64-nvidia-cuda.bc"
"nvptx64-nvidia-cuda" - "clang", inputs: ["helloworld-openmp-nvptx64-nvidia-cuda.bc"], output: "helloworld-openmp-nvptx64-nvidia-cuda.s"
"nvptx64-nvidia-cuda" - "NVPTX::Assembler", inputs: ["helloworld-openmp-nvptx64-nvidia-cuda.s"], output: "helloworld-openmp-nvptx64-nvidia-cuda.o"
"nvptx64-nvidia-cuda" - "NVPTX::OpenMPLinker", inputs: ["helloworld-openmp-nvptx64-nvidia-cuda.o"], output: "a.out-openmp-nvptx64-nvidia-cuda"
"x86_64-unknown-linux-gnu" - "offload wrapper", inputs: ["a.out-openmp-nvptx64-nvidia-cuda"], output: "helloworld-host-x86_64-unknown-linux-gnu-wrapper.bc"
"x86_64-unknown-linux-gnu" - "clang", inputs: ["helloworld-host-x86_64-unknown-linux-gnu-wrapper.bc"], output: "helloworld-host-x86_64-unknown-linux-gnu.s"
"x86_64-unknown-linux-gnu" - "clang::as", inputs: ["helloworld-host-x86_64-unknown-linux-gnu.s"], output: "helloworld-host-x86_64-unknown-linux-gnu.o"
"x86_64-unknown-linux-gnu" - "GNU::Linker", inputs: ["helloworld-host-x86_64-unknown-linux-gnu.o", "helloworld-host-x86_64-unknown-linux-gnu.o"], output: "helloworld"

And here are names generated after this patch applied,

"x86_64-unknown-linux-gnu" - "clang", inputs: ["helloworld.c"], output: "helloworld-host-x86_64-unknown-linux-gnu.i"
"x86_64-unknown-linux-gnu" - "clang", inputs: ["helloworld-host-x86_64-unknown-linux-gnu.i"], output: "helloworld-host-x86_64-unknown-linux-gnu.bc"
"x86_64-unknown-linux-gnu" - "clang", inputs: ["helloworld-host-x86_64-unknown-linux-gnu.bc"], output: "helloworld-host-x86_64-unknown-linux-gnu.s"
"x86_64-unknown-linux-gnu" - "clang::as", inputs: ["helloworld-host-x86_64-unknown-linux-gnu.s"], output: "helloworld-host-x86_64-unknown-linux-gnu.o"
"nvptx64-nvidia-cuda" - "clang", inputs: ["helloworld.c"], output: "helloworld-openmp-nvptx64-nvidia-cuda.i"
"nvptx64-nvidia-cuda" - "clang", inputs: ["helloworld-openmp-nvptx64-nvidia-cuda.i", "helloworld-host-x86_64-unknown-linux-gnu.bc"], output: "helloworld-openmp-nvptx64-nvidia-cuda.bc"
"nvptx64-nvidia-cuda" - "clang", inputs: ["helloworld-openmp-nvptx64-nvidia-cuda.bc"], output: "helloworld-openmp-nvptx64-nvidia-cuda.s"
"nvptx64-nvidia-cuda" - "NVPTX::Assembler", inputs: ["helloworld-openmp-nvptx64-nvidia-cuda.s"], output: "helloworld-openmp-nvptx64-nvidia-cuda.o"
"nvptx64-nvidia-cuda" - "NVPTX::OpenMPLinker", inputs: ["helloworld-openmp-nvptx64-nvidia-cuda.o"], output: "a.out-openmp-nvptx64-nvidia-cuda"
"x86_64-unknown-linux-gnu" - "offload wrapper", inputs: ["a.out-openmp-nvptx64-nvidia-cuda"], output: "helloworld-wrapper-host-x86_64-unknown-linux-gnu.bc"
"x86_64-unknown-linux-gnu" - "clang", inputs: ["helloworld-wrapper-host-x86_64-unknown-linux-gnu.bc"], output: "helloworld-wrapper-host-x86_64-unknown-linux-gnu.s"
"x86_64-unknown-linux-gnu" - "clang::as", inputs: ["helloworld-wrapper-host-x86_64-unknown-linux-gnu.s"], output: "helloworld-wrapper-host-x86_64-unknown-linux-gnu.o"
"x86_64-unknown-linux-gnu" - "GNU::Linker", inputs: ["helloworld-host-x86_64-unknown-linux-gnu.o", "helloworld-wrapper-host-x86_64-unknown-linux-gnu.o"], output: "helloworld"

So for having OffloadingPrefix different for 11 & 12 would require to distinguish latter from 3 & 4 which I don't think is possible. However, the changes to BaseInput in OffloadWrapperJobAction [10] will also reflect in the dependent backend [11] and assemble [12] actions as BaseInput is present in InputInfo of the next actions (line number 4696).

pdhaliwal added a reviewer: sdmitriev.Feb 23 2021, 8:07 AM

LGTM, assuming it doesn't break support the reasoning makes sense.

This revision is now accepted and ready to land.Feb 24 2021, 9:28 AM

Works everywhere we have tried it. Fundamentally it renames a temporary file, so shouldn't break anything. Will be great to have -save-temps working for nvptx.

Closed by commit rG99951aa68da3: OpenMP: Fix object clobbering issue when using save-temps (authored by pdhaliwal). · Explain WhyFeb 24 2021, 9:51 PM

This revision was automatically updated to reflect the committed changes.

pdhaliwal added a commit: rG99951aa68da3: OpenMP: Fix object clobbering issue when using save-temps.

Revision Contents

Path

Size

clang/

lib/

Driver/

Driver.cpp

3 lines

test/

Driver/

openmp-offload-gpu.c

6 lines

Diff 326281

clang/lib/Driver/Driver.cpp

Show First 20 Lines • Show All 4,669 Lines • ▼ Show 20 Lines	InputInfo Driver::BuildJobsForActionNoCache(
else {		else {
// We only have to generate a prefix for the host if this is not a top-level		// We only have to generate a prefix for the host if this is not a top-level
// action.		// action.
std::string OffloadingPrefix = Action::GetOffloadingFileNamePrefix(		std::string OffloadingPrefix = Action::GetOffloadingFileNamePrefix(
A->getOffloadingDeviceKind(), TC->getTriple().normalize(),		A->getOffloadingDeviceKind(), TC->getTriple().normalize(),
/CreatePrefixForHost=/!!A->getOffloadingHostActiveKinds() &&		/CreatePrefixForHost=/!!A->getOffloadingHostActiveKinds() &&
!AtTopLevel);		!AtTopLevel);
if (isa<OffloadWrapperJobAction>(JA)) {		if (isa<OffloadWrapperJobAction>(JA)) {
OffloadingPrefix += "-wrapper";
if (Arg *FinalOutput = C.getArgs().getLastArg(options::OPT_o))		if (Arg *FinalOutput = C.getArgs().getLastArg(options::OPT_o))
BaseInput = FinalOutput->getValue();		BaseInput = FinalOutput->getValue();
else		else
BaseInput = getDefaultImageName();		BaseInput = getDefaultImageName();
		BaseInput =
		C.getArgs().MakeArgString(std::string(BaseInput) + "-wrapper");
}		}
Result = InputInfo(A, GetNamedOutputPath(C, *JA, BaseInput, BoundArch,		Result = InputInfo(A, GetNamedOutputPath(C, *JA, BaseInput, BoundArch,
AtTopLevel, MultipleArchs,		AtTopLevel, MultipleArchs,
OffloadingPrefix),		OffloadingPrefix),
BaseInput);		BaseInput);
}		}

if (CCCPrintBindings && !CCGenDiagnostics) {		if (CCCPrintBindings && !CCGenDiagnostics) {
▲ Show 20 Lines • Show All 776 Lines • Show Last 20 Lines

clang/test/Driver/openmp-offload-gpu.c

	Show First 20 Lines • Show All 304 Lines • ▼ Show 20 Lines
	// RUN: \| FileCheck -check-prefix=CUDA_RED_RECS %s			// RUN: \| FileCheck -check-prefix=CUDA_RED_RECS %s
	// CUDA_RED_RECS: clang{{.}}"-cc1"{{.}}"-triple" "nvptx64-nvidia-cuda"			// CUDA_RED_RECS: clang{{.}}"-cc1"{{.}}"-triple" "nvptx64-nvidia-cuda"
	// CUDA_RED_RECS-SAME: "-fopenmp-cuda-teams-reduction-recs-num=2048"			// CUDA_RED_RECS-SAME: "-fopenmp-cuda-teams-reduction-recs-num=2048"

	// RUN: %clang -### -no-canonical-prefixes -fopenmp=libomp -fopenmp-targets=nvptx64-nvidia-cuda %s 2>&1 \			// RUN: %clang -### -no-canonical-prefixes -fopenmp=libomp -fopenmp-targets=nvptx64-nvidia-cuda %s 2>&1 \
	// RUN: \| FileCheck -check-prefix=OPENMP_NVPTX_WRAPPERS %s			// RUN: \| FileCheck -check-prefix=OPENMP_NVPTX_WRAPPERS %s
	// OPENMP_NVPTX_WRAPPERS: clang{{.}}"-cc1"{{.}}"-triple" "nvptx64-nvidia-cuda"			// OPENMP_NVPTX_WRAPPERS: clang{{.}}"-cc1"{{.}}"-triple" "nvptx64-nvidia-cuda"
	// OPENMP_NVPTX_WRAPPERS-SAME: "-internal-isystem" "{{.*}}openmp_wrappers"			// OPENMP_NVPTX_WRAPPERS-SAME: "-internal-isystem" "{{.*}}openmp_wrappers"

				// RUN: %clang -### -fopenmp=libomp -fopenmp-targets=nvptx64-nvidia-cuda \
				// RUN: -save-temps -no-canonical-prefixes -ccc-print-bindings %s -o openmp-offload-gpu 2>&1 \
				// RUN: \| FileCheck -check-prefix=SAVE_TEMPS_NAMES %s

				// SAVE_TEMPS_NAMES-NOT: "GNU::Linker"{{.}}["[[SAVE_TEMPS_INPUT1:.\.o]]", "[[SAVE_TEMPS_INPUT1]]"]