This is an archive of the discontinued LLVM Phabricator instance.


Differential D106315

[HIP] Preserve ASAN bitcode library functions
ClosedPublic

Authored by yaxunl on Jul 19 2021, 2:36 PM.


Details

Reviewers

tra
b-sumner

Commits

rG44dbbe61060a: [HIP] Preserve ASAN bitcode library functions

Summary

Address sanitizer passes may generate calls to ASAN bitcode library
functions after bitcode linking in lld, so lld cannot add those
symbols since it does not know they will be used later.

To solve this issue, clang emits a reference to a bitcode library
function which calls all ASAN functions that need to be
preserved. This effectively forces all ASAN functions to be
linked in.
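
As a rough illustration of the pattern described in the summary, the following host-side C++ sketch models both halves: a library function that calls every function to be preserved, and a weak pointer to that function standing in for the reference clang emits. All demo_* names are hypothetical. The real ASAN entry points and the body of __amdgpu_device_library_preserve_asan_functions live in the device library and are not shown in this review. __attribute__((used)) is only a source-level approximation of adding the variable to @llvm.compiler.used.

```cpp
// Hypothetical stand-ins for ASAN runtime entry points in the device
// bitcode library; the real names and signatures are assumptions here.
static int demo_calls = 0;
extern "C" void demo_asan_report_load(const void *) { ++demo_calls; }
extern "C" void demo_asan_report_store(const void *) { ++demo_calls; }

// Library side: a single function whose only purpose is to reference
// everything that must survive linking.
extern "C" void demo_preserve_asan_functions() {
  demo_asan_report_load(nullptr);
  demo_asan_report_store(nullptr);
}

// Compiler side (approximation): a weak pointer to the function above,
// marked "used" so the toolchain does not discard it. Keeping the pointer
// alive keeps the function alive, which keeps its callees alive.
extern "C" {
__attribute__((weak, used)) void (*demo_preserve_asan_functions_ptr)() =
    demo_preserve_asan_functions;
}
```

Calling through demo_preserve_asan_functions_ptr reaches both stand-in functions, which is the dependency chain the linker follows. In the actual patch the pointer is never called; it exists only as a reference.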

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

yaxunl created this revision. Jul 19 2021, 2:36 PM

Herald added subscribers: kerbowa, nhaehnle, jvesely. Jul 19 2021, 2:36 PM

yaxunl requested review of this revision. Jul 19 2021, 2:36 PM

Harbormaster completed remote builds in B114951: Diff 359917. Jul 19 2021, 3:42 PM

LGTM in general.

One question -- does it have to be a function calling other functions just for the sake of preserving them?
Can it be a flat array of pointers to the functions you need to keep around?

This revision is now accepted and ready to land. Jul 22 2021, 4:56 PM

In D106315#2898536, @tra wrote:

LGTM in general.

One question -- does it have to be a function calling other functions just for the sake of preserving them?
Can it be a flat array of pointers to the functions you need to keep around?

Yes, that's possible. However, that would require the FE to know these functions and declare them, whereas the current approach leaves that concern to the device library.

This revision was landed with ongoing or failed builds. Jul 23 2021, 7:36 AM

Closed by commit rG44dbbe61060a: [HIP] Preserve ASAN bitcode library functions (authored by yaxunl).

This revision was automatically updated to reflect the committed changes.

yaxunl added a commit: rG44dbbe61060a: [HIP] Preserve ASAN bitcode library functions.

Herald added a project: Restricted Project. Jul 23 2021, 7:36 AM

In D106315#2899928, @yaxunl wrote:

Yes, that's possible. However, that would require the FE to know these functions and declare them, whereas the current approach leaves that concern to the device library.

I was thinking of just adding a pointer to an array of pointers to @llvm.compiler.used.
The array itself would come from the bitcode library and would be populated there. I'm not sure if it's doable without knowing the array size, though.

e.g.

; Added by compiler
@llvm.compiler.used = appending global [1 x i8*] [i8* bitcast ([16 x i8*]* @_ZL5funcs to i8*)], section "llvm.metadata"

; comes from the bitcode library, initialized with the pointers to functions you need to keep.
@_ZL5funcs = internal global [16 x i8*] zeroinitializer, align 4

It should be possible to make the pointer to the array opaque with an intermediate variable in the library.
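
A source-level sketch of this alternative might look like the following. It assumes hypothetical demo_* functions in place of the real ASAN entry points, and uses __attribute__((used)) as a stand-in for the compiler adding the anchor to @llvm.compiler.used. The intermediate demo_funcs_anchor variable is what would let the compiler reference the table without knowing its size.

```cpp
#include <cstddef>

using DemoFn = void (*)();

// Hypothetical functions the library wants to keep.
static int demo_hits = 0;
extern "C" void demo_keep_a() { ++demo_hits; }
extern "C" void demo_keep_b() { ++demo_hits; }

// Library side: the table is populated where the function list is known.
static const DemoFn demo_funcs[] = {demo_keep_a, demo_keep_b};
static const std::size_t demo_funcs_count =
    sizeof(demo_funcs) / sizeof(demo_funcs[0]);

// Intermediate variable: exposes the table as an untyped pointer, so a
// consumer needs neither the element type nor the element count.
extern "C" {
__attribute__((used)) const void *demo_funcs_anchor = demo_funcs;
}

// Demonstration helper only: walks the table and calls each entry.
inline int demo_call_all() {
  for (std::size_t i = 0; i < demo_funcs_count; ++i)
    demo_funcs[i]();
  return demo_hits;
}
```

Compared with a dummy function that calls everything, the table never executes the preserved functions and carries no call-site code, at the cost of one extra global.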

In D106315#2900882, @tra wrote:
In D106315#2899928, @yaxunl wrote:

Yes, that's possible. However, that would require the FE to know these functions and declare them, whereas the current approach leaves that concern to the device library.

I was thinking of just adding a pointer to an array of pointers to @llvm.compiler.used.
The array itself would come from the bitcode library and would be populated there. I'm not sure if it's doable without knowing the array size, though.

e.g.

; Added by compiler
@llvm.compiler.used = appending global [1 x i8*] [i8* bitcast ([16 x i8*]* @_ZL5funcs to i8*)], section "llvm.metadata"

; comes from the bitcode library, initialized with the pointers to functions you need to keep.
@_ZL5funcs = internal global [16 x i8*] zeroinitializer, align 4

It should be possible to make the pointer to the array opaque with an intermediate variable in the library.

Yes, that is possible. Actually, it can be simpler: we can add __attribute__((used)) to the dummy function in the bitcode lib, and it will be kept when linked by -mlink-builtin-bitcode.

However, it has a drawback compared to the current approach: there is no control over when the dummy function is kept, whereas the current approach only keeps it for -fsanitize=address.

Revision Contents

Path                                          Size
clang/lib/CodeGen/CodeGenModule.cpp           16 lines
clang/lib/Driver/Driver.cpp                   11 lines
clang/test/CodeGenCUDA/amdgpu-asan.cu         15 lines
clang/test/Driver/hip-sanitize-options.hip    2 lines

Diff 361202

clang/lib/CodeGen/CodeGenModule.cpp

   if (CodeGenOpts.SanitizeCfiCrossDso) {
     CodeGenFunction(*this).EmitCfiCheckFail();
     CodeGenFunction(*this).EmitCfiCheckStub();
   }
   emitAtAvailableLinkGuard();
   if (Context.getTargetInfo().getTriple().isWasm() &&
       !Context.getTargetInfo().getTriple().isOSEmscripten()) {
     EmitMainVoidAlias();
   }
+
+  // Emit reference of __amdgpu_device_library_preserve_asan_functions to
+  // preserve ASAN functions in bitcode libraries.
+  if (LangOpts.Sanitize.has(SanitizerKind::Address) && getTriple().isAMDGPU()) {
+    auto *FT = llvm::FunctionType::get(VoidTy, {});
+    auto *F = llvm::Function::Create(
+        FT, llvm::GlobalValue::ExternalLinkage,
+        "__amdgpu_device_library_preserve_asan_functions", &getModule());
+    auto *Var = new llvm::GlobalVariable(
+        getModule(), FT->getPointerTo(),
+        /*isConstant=*/true, llvm::GlobalValue::WeakAnyLinkage, F,
+        "__amdgpu_device_library_preserve_asan_functions_ptr", nullptr,
+        llvm::GlobalVariable::NotThreadLocal);
+    addCompilerUsedGlobal(Var);
+  }
+
   emitLLVMUsed();
   if (SanStats)
     SanStats->finish();

   if (CodeGenOpts.Autolink &&
       (Context.getLangOpts().Modules || !LinkerOptionsMetadata.empty())) {
     EmitModuleLinkOptions();
   }

clang/lib/Driver/Driver.cpp

Context: getDeviceDependences(OffloadAction::DeviceDependences &DA,

       if (!Relocatable && CurPhase == phases::Backend && !EmitLLVM &&
           !EmitAsm) {
         // If we are in backend phase, we attempt to generate the fat binary.
         // We compile each arch to IR and use a link action to generate code
         // object containing ISA. Then we use a special "link" action to create
         // a fat binary containing all the code objects for different GPU's.
         // The fat binary is then an input to the host action.
         for (unsigned I = 0, E = GpuArchList.size(); I != E; ++I) {
-          if (GPUSanitize || C.getDriver().isUsingLTO(/*IsOffload=*/true)) {
-            // When GPU sanitizer is enabled, since we need to link in the
-            // the sanitizer runtime library after the sanitize pass, we have
-            // to skip the backend and assemble phases and use lld to link
-            // the bitcode. The same happens if users request to use LTO
-            // explicitly.
+          if (C.getDriver().isUsingLTO(/*IsOffload=*/true)) {
+            // When LTO is enabled, skip the backend and assemble phases and
+            // use lld to link the bitcode.
             ActionList AL;
             AL.push_back(CudaDeviceActions[I]);
             // Create a link action to link device IR with device library
             // and generate ISA.
             CudaDeviceActions[I] =
                 C.MakeAction<LinkJobAction>(AL, types::TY_Image);
           } else {
-            // When GPU sanitizer is not enabled, we follow the conventional
+            // When LTO is not enabled, we follow the conventional
             // compiler phases, including backend and assemble phases.
             ActionList AL;
             auto BackendAction = C.getDriver().ConstructPhaseAction(
                 C, Args, phases::Backend, CudaDeviceActions[I],
                 AssociatedOffloadKind);
             auto AssembleAction = C.getDriver().ConstructPhaseAction(
                 C, Args, phases::Assemble, BackendAction,
                 AssociatedOffloadKind);

clang/test/CodeGenCUDA/amdgpu-asan.cu

This file was added.

+// RUN: %clang_cc1 %s -emit-llvm -o - -triple=amdgcn-amd-amdhsa \
+// RUN: -fcuda-is-device -target-cpu gfx906 -fsanitize=address \
+// RUN: -x hip | FileCheck -check-prefix=ASAN %s
+
+// RUN: %clang_cc1 %s -emit-llvm -o - -triple=amdgcn-amd-amdhsa \
+// RUN: -fcuda-is-device -target-cpu gfx906 -x hip \
+// RUN: | FileCheck %s
+
+// REQUIRES: amdgpu-registered-target
+
+// ASAN-DAG: declare void @__amdgpu_device_library_preserve_asan_functions()
+// ASAN-DAG: @__amdgpu_device_library_preserve_asan_functions_ptr = weak addrspace(1) constant void ()* @__amdgpu_device_library_preserve_asan_functions
+// ASAN-DAG: @llvm.compiler.used = {{.*}}@__amdgpu_device_library_preserve_asan_functions_ptr
+
+// CHECK-NOT: @__amdgpu_device_library_preserve_asan_functions_ptr

clang/test/Driver/hip-sanitize-options.hip

 // RUN: -fsanitize=address -fgpu-sanitize \
 // RUN: -nogpuinc --rocm-path=%S/Inputs/rocm \
 // RUN: %s 2>&1 | FileCheck -check-prefix=XNACK %s

 // CHECK-NOT: {{"[^"]*clang[^"]*".* "-fcuda-is-device".* "-fsanitize=address"}}
 // CHECK-NOT: {{"[^"]*lld(\.exe){0,1}".* ".*hip.bc"}}
 // CHECK: {{"[^"]*clang[^"]*".* "-triple" "x86_64-unknown-linux-gnu".* "-fsanitize=address"}}

-// NORDC: {{"[^"]*clang[^"]*".* "-fcuda-is-device".* "-fsanitize=address".*}} "-o" "[[OUT:[^"]*.bc]]"
+// NORDC: {{"[^"]*clang[^"]*".* "-emit-obj".* "-fcuda-is-device".* "-fsanitize=address".*}} "-o" "[[OUT:[^"]*.o]]"
 // NORDC: {{"[^"]*lld(\.exe){0,1}".*}} "[[OUT]]" {{".*asanrtl.bc" ".*hip.bc"}}
 // NORDC: {{"[^"]*clang[^"]*".* "-triple" "x86_64-unknown-linux-gnu".* "-fsanitize=address"}}

 // RDC: {{"[^"]*clang[^"]*".* "-triple" "x86_64-unknown-linux-gnu".* "-fsanitize=address"}}
 // RDC: {{"[^"]*clang[^"]*".* "-emit-llvm-bc".* "-fcuda-is-device".* "-fsanitize=address".*}} "-o" "[[OUT:[^"]*.bc]]"
 // RDC: {{"[^"]*lld(\.exe){0,1}".*}} "[[OUT]]" {{".*asanrtl.bc" ".*hip.bc"}}

 // FAIL: AMDGPU address sanitizer runtime library (asanrtl) is not found. Please install ROCm device library which supports address sanitizer

 // XNACK: error: '-fgpu-sanitize' is not compatible with offload arch 'gfx900:xnack-'. Use an offload arch without 'xnack-' instead