This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
clang/
-
lib/Driver/ToolChains/
-
Driver/
-
ToolChains/
1/2
AMDGPU.cpp
-
test/Driver/
-
Driver/
1/2
amdgpu-toolchain.c

Differential D144505

[Clang] Add options in LTO mode when cross compiling for AMDGPU
ClosedPublic

Authored by jhuber6 on Feb 21 2023, 9:45 AM.

Download Raw Diff

Details

Reviewers

jdoerfert
arsenm
yaxunl
JonChesterfield
tianshilei1992

Commits

rGc45d2df05e0e: [Clang] Add options in LTO mode when cross compiling for AMDGPU

Summary

The AMDGPU toolchain support directly compiling GPU images using
cross-compilation such as clang --target=amdgcn-amd-amdhsa foo.c.
However, when attempting to link bitcode this does not work because the
-mcpu options are not forwarded to the linker among others. This patch
simply adds them so that clang --target=amdgcn-amd-amdhsa foo.c -flto
works correctly.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

jhuber6 created this revision.Feb 21 2023, 9:45 AM

Herald added a project: Restricted Project. · View Herald TranscriptFeb 21 2023, 9:45 AM

Herald added subscribers: kosarev, kerbowa, inglorion and 4 others. · View Herald Transcript

jhuber6 requested review of this revision.Feb 21 2023, 9:45 AM

Herald added a project: Restricted Project. · View Herald TranscriptFeb 21 2023, 9:45 AM

Herald added subscribers: cfe-commits, MaskRay, wdng. · View Herald Transcript

Harbormaster completed remote builds in B215055: Diff 499216.Feb 21 2023, 2:07 PM

JonChesterfield added inline comments.Feb 22 2023, 8:05 AM

clang/lib/Driver/ToolChains/AMDGPU.cpp
545	What's the test against OK_Thin for? ThinLTO is a thing but I don't know if it exists (/works) on amdgpu, is this that?

NVM the above, all the other call sites to addLTOOptions have that test in them so it must be fine

void tools::addLTOOptions(const ToolChain &ToolChain, const ArgList &Args,
                          ArgStringList &CmdArgs, const InputInfo &Output,
                          const InputInfo &Input, bool IsThinLTO);

seems strange that it's a bool argument and not the result of getLTOMode, but pre-existing feature.

This revision is now accepted and ready to land.Feb 22 2023, 8:06 AM

jhuber6 added inline comments.Feb 22 2023, 8:10 AM

clang/lib/Driver/ToolChains/AMDGPU.cpp
545	AMDGPU's linker is `lld` so it works "in theory". But in practice, separate linking of object files isn't fully supported by the ABI. So I consider it a "your mileage may vary" scenario. FWIW, it functioned properly with my experimental libc setup $ cat main.cpp int foo(); int main() { return foo(); } $ cat foo.cpp int foo() { return 3; } $ clang++ crt1.o main.cpp foo.cpp --target=amdgcn-amd-amdhsa -mcpu=gfx1030 -Wl,-plugin-opt=mcpu=gfx1030 -nogpulib -o image -flto=thin $ amdhsa_loader image; echo $? 3

Closed by commit rGc45d2df05e0e: [Clang] Add options in LTO mode when cross compiling for AMDGPU (authored by jhuber6). · Explain WhyFeb 22 2023, 8:14 AM

This revision was automatically updated to reflect the committed changes.

jhuber6 added a commit: rGc45d2df05e0e: [Clang] Add options in LTO mode when cross compiling for AMDGPU.

jhuber6 mentioned this in rG6641f8da73ae: [libc] Fix amdgpu startup code flags.Feb 22 2023, 9:38 AM

arsenm added inline comments.Feb 22 2023, 10:17 AM

clang/test/Driver/amdgpu-toolchain.c
7	should add a test for thinlto?

jhuber6 added inline comments.Feb 22 2023, 10:19 AM

clang/test/Driver/amdgpu-toolchain.c
7	Only thing it would change is adding `-plugin-opt=thinlto`. Not sure if it's worth a test since it's probably checked by other uses of `addLTOOptions`.

Revision Contents

Path

Size

clang/

lib/

Driver/

ToolChains/

AMDGPU.cpp

3 lines

test/

Driver/

amdgpu-toolchain.c

2 lines

Diff 499517

clang/lib/Driver/ToolChains/AMDGPU.cpp

Show First 20 Lines • Show All 534 Lines • ▼ Show 20 Lines	void amdgpu::Linker::ConstructJob(Compilation &C, const JobAction &JA,
const InputInfoList &Inputs,		const InputInfoList &Inputs,
const ArgList &Args,		const ArgList &Args,
const char *LinkingOutput) const {		const char *LinkingOutput) const {

std::string Linker = getToolChain().GetProgramPath(getShortName());		std::string Linker = getToolChain().GetProgramPath(getShortName());
ArgStringList CmdArgs;		ArgStringList CmdArgs;
addLinkerCompressDebugSectionsOption(getToolChain(), Args, CmdArgs);		addLinkerCompressDebugSectionsOption(getToolChain(), Args, CmdArgs);
AddLinkerInputs(getToolChain(), Inputs, Args, CmdArgs, JA);		AddLinkerInputs(getToolChain(), Inputs, Args, CmdArgs, JA);
		if (C.getDriver().isUsingLTO())
		addLTOOptions(getToolChain(), Args, CmdArgs, Output, Inputs[0],
		C.getDriver().getLTOMode() == LTOK_Thin);
		JonChesterfieldUnsubmitted Not Done Reply Inline Actions What's the test against OK_Thin for? ThinLTO is a thing but I don't know if it exists (/works) on amdgpu, is this that? JonChesterfield: What's the test against OK_Thin for? ThinLTO is a thing but I don't know if it exists (/works)…
		jhuber6AuthorUnsubmitted Done Reply Inline Actions AMDGPU's linker is `lld` so it works "in theory". But in practice, separate linking of object files isn't fully supported by the ABI. So I consider it a "your mileage may vary" scenario. FWIW, it functioned properly with my experimental libc setup $ cat main.cpp int foo(); int main() { return foo(); } $ cat foo.cpp int foo() { return 3; } $ clang++ crt1.o main.cpp foo.cpp --target=amdgcn-amd-amdhsa -mcpu=gfx1030 -Wl,-plugin-opt=mcpu=gfx1030 -nogpulib -o image -flto=thin $ amdhsa_loader image; echo $? 3 jhuber6: AMDGPU's linker is `lld` so it works "in theory". But in practice, separate linking of object…
CmdArgs.push_back("-shared");		CmdArgs.push_back("-shared");
CmdArgs.push_back("-o");		CmdArgs.push_back("-o");
CmdArgs.push_back(Output.getFilename());		CmdArgs.push_back(Output.getFilename());
C.addCommand(std::make_unique<Command>(		C.addCommand(std::make_unique<Command>(
JA, *this, ResponseFileSupport::AtFileCurCP(), Args.MakeArgString(Linker),		JA, *this, ResponseFileSupport::AtFileCurCP(), Args.MakeArgString(Linker),
CmdArgs, Inputs, Output));		CmdArgs, Inputs, Output));
}		}

▲ Show 20 Lines • Show All 356 Lines • Show Last 20 Lines

clang/test/Driver/amdgpu-toolchain.c

	// RUN: %clang -### --target=amdgcn--amdhsa -x assembler -mcpu=kaveri %s 2>&1 \| FileCheck -check-prefix=AS_LINK %s			// RUN: %clang -### --target=amdgcn--amdhsa -x assembler -mcpu=kaveri %s 2>&1 \| FileCheck -check-prefix=AS_LINK %s
	// RUN: %clang -### -g --target=amdgcn--amdhsa -mcpu=kaveri %s 2>&1 \| FileCheck -check-prefix=DWARF_VER %s			// RUN: %clang -### -g --target=amdgcn--amdhsa -mcpu=kaveri %s 2>&1 \| FileCheck -check-prefix=DWARF_VER %s
	// RUN: %clang -### --target=amdgcn-amd-amdpal -x assembler -mcpu=kaveri %s 2>&1 \| FileCheck -check-prefix=AS_LINK %s			// RUN: %clang -### --target=amdgcn-amd-amdpal -x assembler -mcpu=kaveri %s 2>&1 \| FileCheck -check-prefix=AS_LINK %s
	// RUN: %clang -### -g --target=amdgcn-amd-amdpal -mcpu=kaveri %s 2>&1 \| FileCheck -check-prefix=DWARF_VER %s			// RUN: %clang -### -g --target=amdgcn-amd-amdpal -mcpu=kaveri %s 2>&1 \| FileCheck -check-prefix=DWARF_VER %s
	// RUN: %clang -### --target=amdgcn-mesa-mesa3d -x assembler -mcpu=kaveri %s 2>&1 \| FileCheck -check-prefix=AS_LINK %s			// RUN: %clang -### --target=amdgcn-mesa-mesa3d -x assembler -mcpu=kaveri %s 2>&1 \| FileCheck -check-prefix=AS_LINK %s
	// RUN: %clang -### -g --target=amdgcn-mesa-mesa3d -mcpu=kaveri %s 2>&1 \| FileCheck -check-prefix=DWARF_VER %s			// RUN: %clang -### -g --target=amdgcn-mesa-mesa3d -mcpu=kaveri %s 2>&1 \| FileCheck -check-prefix=DWARF_VER %s

				arsenmUnsubmitted Not Done Reply Inline Actions should add a test for thinlto? arsenm: should add a test for thinlto?
				jhuber6AuthorUnsubmitted Done Reply Inline Actions Only thing it would change is adding `-plugin-opt=thinlto`. Not sure if it's worth a test since it's probably checked by other uses of `addLTOOptions`. jhuber6: Only thing it would change is adding `-plugin-opt=thinlto`. Not sure if it's worth a test since…
	// AS_LINK: "-cc1as"			// AS_LINK: "-cc1as"
	// AS_LINK: ld.lld{{.*}} "-shared"			// AS_LINK: ld.lld{{.*}} "-shared"

	// DWARF_VER: "-dwarf-version=5"			// DWARF_VER: "-dwarf-version=5"

	// RUN: %clang -### --target=amdgcn-amd-amdhsa -mcpu=gfx906 -nogpulib \			// RUN: %clang -### --target=amdgcn-amd-amdhsa -mcpu=gfx906 -nogpulib \
	// RUN: -flto %s 2>&1 \| FileCheck -check-prefix=LTO %s			// RUN: -flto %s 2>&1 \| FileCheck -check-prefix=LTO %s
	// LTO: clang{{.*}} "-flto=full"			// LTO: clang{{.*}} "-flto=full"
	// LTO: ld.lld{{.*}}			// LTO: ld.lld{{.*}}-plugin-opt=mcpu=gfx906