This is an archive of the discontinued LLVM Phabricator instance.

Differential D52891

[AMDGPU] Add -fvisibility-amdgpu-non-kernel-functions
Abandoned, Public

Authored by scott.linder on Oct 4 2018, 9:20 AM.

Details

Reviewers

yaxunl
kzhuravl
arsenm

Summary

Controls the visibility of non-kernel functions when compiling for AMDGPU targets. Defaults to the value of the default visibility (-fvisibility); the AMDGPU toolchain sets it to "hidden" unless it is given explicitly.

This is a more fine-grained knob than "-fvisibility hidden", and allows the default visibility of functions to be controlled independently of kernels. This is useful for languages like OpenCL where kernel symbols are externally visible, but function symbols are not.
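As a rough illustration of the rules in the summary, here is a minimal sketch of how the effective visibility of a function definition would be chosen at the cc1 level. It is a toy model, not Clang code: `Visibility`, `VisibilityOptions`, `FunctionInfo`, `nonKernelMode` and `effectiveVisibility` are all hypothetical names introduced for this example, and the driver-side "hidden" default is deliberately left out.

```cpp
// Toy model of the visibility rules described in the summary.
// Every name below is illustrative and is not a Clang API.
enum class Visibility { Default, Hidden, Protected };

// Hypothetical stand-in for the relevant cc1 options.
struct VisibilityOptions {
  Visibility Value;     // value of -fvisibility
  bool HasNonKernel;    // was -fvisibility-amdgpu-non-kernel-functions given?
  Visibility NonKernel; // its value, meaningful only if HasNonKernel is true
};

// Hypothetical stand-in for a function definition.
struct FunctionInfo {
  bool IsKernel;       // uses the AMDGPU kernel calling convention
  bool HasExplicit;    // carries __attribute__((visibility(...)))
  Visibility Explicit; // the attribute's value, meaningful only if HasExplicit
};

// The non-kernel mode falls back to the -fvisibility value when unset.
constexpr Visibility nonKernelMode(const VisibilityOptions &O) {
  return O.HasNonKernel ? O.NonKernel : O.Value;
}

// An explicit attribute always wins; otherwise kernels follow -fvisibility
// and all other functions follow the non-kernel mode.
constexpr Visibility effectiveVisibility(const FunctionInfo &F,
                                         const VisibilityOptions &O) {
  return F.HasExplicit ? F.Explicit
         : F.IsKernel  ? O.Value
                       : nonKernelMode(O);
}
```

With `-fvisibility hidden` and the non-kernel option set to `default`, this model yields a hidden kernel and a default-visibility function, which matches the FVIS_AMDGPU_OVERRIDE expectations in the CodeGen test of this revision.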

Event Timeline

scott.linder created this revision. Oct 4 2018, 9:20 AM

Herald added subscribers: cfe-commits, Anastasia, tpr and 4 others. Oct 4 2018, 9:20 AM

I don't know who else to add as a reviewer; Sam, is there someone else outside of AMD that would be interested in reviewing this?

Update docs

I think the name needs work, but I'm not sure what it should be. I think it should avoid using "non" and "amdgpu"

Tests should also include some global variables

In D52891#1256207, @arsenm wrote:

I think the name needs work, but I'm not sure what it should be. I think it should avoid using "non" and "amdgpu"

I think dropping amdgpu is fine since we can add (AMDGPU only) to the description of the option, following the precedent of

-ffixed-r9              Reserve the r9 register (ARM only)

However it is difficult to coin a different term for 'non-kernel-function'. Also, I saw a precedent for using 'non' in an option name:

-objcmt-ns-nonatomic-iosonly

So, probably we could use -fvisibility-nonkernel-function?

Can you also fix HIP toolchain? It is in HIPToolChain::addClangTargetOptions. Thanks.

Use of the word "kernel" might confuse general users. Maybe the option needs to specify OpenCL, but it also applies to HIP/CUDA.

Another word commonly used across languages is "offload".

I will update the patch to modify the HIP toolchain and to add tests for global variables.

As far as the semantics are concerned, are we OK with this being AMDGPU only? I do not see a means of determining what is a "kernel" in a language-agnostic way other than checking for our AMDGPU-specific calling convention. If there is a more general mechanism, this could be implemented in LinkageComputer::getLVForNamespaceScopeDecl instead. As it stands, it sounds like being AMDGPU specific, but omitting amdgpu from the option name is preferred?

What about:

-fvisibility-non-offload-functions=<arg>

Set the default symbol visibility for non-offload function declarations (AMDGPU only)

I cannot think of a way to avoid non or something similar ending up in the name.

In D52891#1258070, @scott.linder wrote:

I will update the patch to modify the HIP toolchain and to add tests for global variables.

As far as the semantics are concerned, are we OK with this being AMDGPU only? I do not see a means of determining what is a "kernel" in a language-agnostic way other than checking for our AMDGPU-specific calling convention. If there is a more general mechanism, this could be implemented in LinkageComputer::getLVForNamespaceScopeDecl instead. As it stands, it sounds like being AMDGPU specific, but omitting amdgpu from the option name is preferred?

The checking of kernel functions can be made target independent. For now we only need to consider OpenCL and CUDA/HIP. We can check function attribute AT_CUDAGlobal and AT_OpenCLKernel. Then this option can be made target independent. HCC can add its own check out of tree.
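A minimal sketch of the attribute-based check suggested above, written against a toy declaration type rather than Clang's real AST. `FunctionAttrs` and `isKernelFunction` are hypothetical names; in Clang the corresponding test would presumably be an attribute query along the lines of the `FD->hasAttr<OpenCLKernelAttr>()` call that the TargetInfo.cpp hunk in this revision already uses, with a CUDA/HIP counterpart assumed.

```cpp
// Hypothetical stand-in for the attributes carried by a function
// declaration; this is not Clang's FunctionDecl.
struct FunctionAttrs {
  bool HasOpenCLKernelAttr; // models an OpenCL `kernel` function
  bool HasCUDAGlobalAttr;   // models a CUDA/HIP `__global__` function
};

// A function counts as a kernel if either language marks it as one,
// independent of the target's calling convention.
constexpr bool isKernelFunction(const FunctionAttrs &A) {
  return A.HasOpenCLKernelAttr || A.HasCUDAGlobalAttr;
}
```

The point of the design is that the decision depends only on source-level attributes, so it would not need the AMDGPU-specific calling-convention check.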

What about:

-fvisibility-non-offload-functions=<arg>

This name looks good to me.

Offload to me sounds like it decided to extract out a section of the program for offload, which is not how OpenCL works

Will be superseded by either https://reviews.llvm.org/D53153 or https://reviews.llvm.org/D56871

Revision Contents

Path                                                     Size
docs/ClangCommandLineReference.rst                       4 lines
include/clang/Basic/LangOptions.def                      2 lines
include/clang/Driver/CC1Options.td                       2 lines
include/clang/Driver/Options.td                          2 lines
lib/CodeGen/TargetInfo.cpp                               7 lines
lib/Driver/ToolChains/AMDGPU.cpp                         7 lines
lib/Driver/ToolChains/Clang.cpp                          6 lines
lib/Frontend/CompilerInvocation.cpp                      11 lines
test/CodeGen/visibility-amdgpu-non-kernel-functions.cl   46 lines
test/Driver/amdgpu-visibility.cl                         14 lines

Diff 168314

docs/ClangCommandLineReference.rst

 .. option:: -fvisibility-ms-compat

 Give global types 'default' visibility and global functions and variables 'hidden' visibility by default

 .. option:: -fvisibility=<arg>

 Set the default symbol visibility for all global declarations

+.. option:: -fvisibility-amdgpu-non-kernel-functions=<arg>
+
+Set the default symbol visibility for non-kernel function declarations for AMDGPU targets
+
 .. option:: -fwhole-program-vtables, -fno-whole-program-vtables

 Enables whole-program vtable optimization. Requires -flto

 .. option:: -fwrapv, -fno-wrapv

 Treat signed integer overflow as two's complement

include/clang/Basic/LangOptions.def

@@ LANGOPT(
         "Require member pointer base types to be complete at the point where the "
         "type's inheritance model would be determined under the Microsoft ABI")

 ENUM_LANGOPT(GC, GCMode, 2, NonGC, "Objective-C Garbage Collection mode")
 ENUM_LANGOPT(ValueVisibilityMode, Visibility, 3, DefaultVisibility,
              "value symbol visibility")
 ENUM_LANGOPT(TypeVisibilityMode, Visibility, 3, DefaultVisibility,
              "type symbol visibility")
+ENUM_LANGOPT(AMDGPUNonKernelFunctionVisibilityMode, Visibility, 3,
+             DefaultVisibility, "non-kernel function visibility")
 ENUM_LANGOPT(StackProtector, StackProtectorMode, 2, SSPOff,
              "stack protector mode")
 ENUM_LANGOPT(SignedOverflowBehavior, SignedOverflowBehaviorTy, 2, SOB_Undefined,
              "signed integer overflow handling")

 BENIGN_LANGOPT(ArrowDepth, 32, 256,
                "maximum number of operator->s to follow")
 BENIGN_LANGOPT(InstantiationDepth, 32, 1024,

include/clang/Driver/CC1Options.td

 def static_define : Flag<["-"], "static-define">,
   HelpText<"Should __STATIC__ be defined">;
 def stack_protector : Separate<["-"], "stack-protector">,
   HelpText<"Enable stack protectors">;
 def stack_protector_buffer_size : Separate<["-"], "stack-protector-buffer-size">,
   HelpText<"Lower bound for a buffer to be considered for stack protection">;
 def fvisibility : Separate<["-"], "fvisibility">,
   HelpText<"Default type and symbol visibility">;
+def fvisibility_amdgpu_non_kernel_functions : Separate<["-"], "fvisibility-amdgpu-non-kernel-functions">,
+  HelpText<"Default non-kernel function symbol visibility for AMDGPU targets">;
 def ftype_visibility : Separate<["-"], "ftype-visibility">,
   HelpText<"Default type visibility">;
 def ftemplate_depth : Separate<["-"], "ftemplate-depth">,
   HelpText<"Maximum depth of recursive template instantiation">;
 def foperator_arrow_depth : Separate<["-"], "foperator-arrow-depth">,
   HelpText<"Maximum number of 'operator->'s to call for a member access">;
 def fconstexpr_depth : Separate<["-"], "fconstexpr-depth">,
   HelpText<"Maximum depth of recursive constexpr function calls">;

include/clang/Driver/Options.td

 def fregister_global_dtors_with_atexit : Flag<["-"], "fregister-global-dtors-with-atexit">, Group<f_Group>, Flags<[CC1Option]>,
   HelpText<"Use atexit or __cxa_atexit to register global destructors">;
 def fuse_init_array : Flag<["-"], "fuse-init-array">, Group<f_Group>, Flags<[CC1Option]>,
   HelpText<"Use .init_array instead of .ctors">;
 def fno_var_tracking : Flag<["-"], "fno-var-tracking">, Group<clang_ignored_f_Group>;
 def fverbose_asm : Flag<["-"], "fverbose-asm">, Group<f_Group>;
 def fvisibility_EQ : Joined<["-"], "fvisibility=">, Group<f_Group>,
   HelpText<"Set the default symbol visibility for all global declarations">, Values<"hidden,default">;
+def fvisibility_amdgpu_non_kernel_functions_EQ : Joined<["-"], "fvisibility-amdgpu-non-kernel-functions=">, Group<f_Group>,
+  HelpText<"Set the default symbol visibility for non-kernel function declarations for AMDGPU targets">, Values<"hidden,default">;
 def fvisibility_inlines_hidden : Flag<["-"], "fvisibility-inlines-hidden">, Group<f_Group>,
   HelpText<"Give inline C++ member functions hidden visibility by default">,
   Flags<[CC1Option]>;
 def fvisibility_ms_compat : Flag<["-"], "fvisibility-ms-compat">, Group<f_Group>,
   HelpText<"Give global types 'default' visibility and global functions and "
            "variables 'hidden' visibility by default">;
 def fwhole_program_vtables : Flag<["-"], "fwhole-program-vtables">, Group<f_Group>,
   Flags<[CoreOption, CC1Option]>,

lib/CodeGen/TargetInfo.cpp

@@ void AMDGPUTargetCodeGenInfo::setTargetAttributes(
   if (GV->isDeclaration())
     return;
   const FunctionDecl *FD = dyn_cast_or_null<FunctionDecl>(D);
   if (!FD)
     return;

   llvm::Function *F = cast<llvm::Function>(GV);

+  if (!FD->getExplicitVisibility(FunctionDecl::VisibilityForValue) &&
+      F->getCallingConv() != llvm::CallingConv::AMDGPU_KERNEL) {
+    GV->setVisibility(M.GetLLVMVisibility(
+        M.getLangOpts().getAMDGPUNonKernelFunctionVisibilityMode()));
+    M.setDSOLocal(GV);
+  }
+
   const auto *ReqdWGS = M.getLangOpts().OpenCL ?
       FD->getAttr<ReqdWorkGroupSizeAttr>() : nullptr;

   if (M.getLangOpts().OpenCL && FD->hasAttr<OpenCLKernelAttr>() &&
       (M.getTriple().getOS() == llvm::Triple::AMDHSA))
     F->addFnAttr("amdgpu-implicitarg-num-bytes", "48");

   const auto *FlatWGS = FD->getAttr<AMDGPUFlatWorkGroupSizeAttr>();

lib/Driver/ToolChains/AMDGPU.cpp

 }

 void AMDGPUToolChain::addClangTargetOptions(
     const llvm::opt::ArgList &DriverArgs,
     llvm::opt::ArgStringList &CC1Args,
     Action::OffloadKind DeviceOffloadingKind) const {
   // Default to "hidden" visibility, as object level linking will not be
   // supported for the forseeable future.
-  if (!DriverArgs.hasArg(options::OPT_fvisibility_EQ,
-                         options::OPT_fvisibility_ms_compat)) {
-    CC1Args.push_back("-fvisibility");
-    CC1Args.push_back("hidden");
+  if (!DriverArgs.hasArg(options::OPT_fvisibility_amdgpu_non_kernel_functions_EQ,
+                         options::OPT_fvisibility_amdgpu_non_kernel_functions)) {
+    CC1Args.append({"-fvisibility-amdgpu-non-kernel-functions", "hidden"});
   }
 }
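The driver-side behaviour in this hunk, together with the option forwarding, can be modelled roughly as below. This is only a sketch under simplifying assumptions: `VisArg`, `DriverArgs`, `CC1Args` and `toCC1` are hypothetical names, `-fvisibility-ms-compat` is ignored, and `-fvisibility` is assumed to be forwarded unchanged when present.

```cpp
// Toy model of the driver-to-cc1 translation; not Clang's real option types.
enum class VisArg { Unset, Default, Hidden, Protected };

// What the user passed to the driver.
struct DriverArgs {
  VisArg Visibility; // -fvisibility=<arg>, or Unset
  VisArg NonKernel;  // -fvisibility-amdgpu-non-kernel-functions=<arg>, or Unset
};

// What ends up on the cc1 command line.
struct CC1Args {
  VisArg Visibility; // -fvisibility <arg>, or Unset if nothing is passed
  VisArg NonKernel;  // -fvisibility-amdgpu-non-kernel-functions <arg>
};

// With this patch the toolchain no longer forces "-fvisibility hidden";
// it only defaults the non-kernel option to "hidden" when the user did not
// set that option explicitly.
constexpr CC1Args toCC1(const DriverArgs &D) {
  return CC1Args{D.Visibility, D.NonKernel == VisArg::Unset ? VisArg::Hidden
                                                            : D.NonKernel};
}
```

Note that in this model a driver-level `-fvisibility=hidden` alone still produces a non-kernel value of `hidden`, but only because of the toolchain default, which is what the VISIBILITY-HIDDEN check in test/Driver/amdgpu-visibility.cl expects.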

lib/Driver/ToolChains/Clang.cpp

@@ if (A->getOption().matches(options::OPT_fvisibility_EQ)) {
       assert(A->getOption().matches(options::OPT_fvisibility_ms_compat));
       CmdArgs.push_back("-fvisibility");
       CmdArgs.push_back("hidden");
       CmdArgs.push_back("-ftype-visibility");
       CmdArgs.push_back("default");
     }
   }

+  if (const Arg *A = Args.getLastArg(
+          options::OPT_fvisibility_amdgpu_non_kernel_functions_EQ)) {
+    CmdArgs.push_back("-fvisibility-amdgpu-non-kernel-functions");
+    CmdArgs.push_back(A->getValue());
+  }
+
   Args.AddLastArg(CmdArgs, options::OPT_fvisibility_inlines_hidden);

   Args.AddLastArg(CmdArgs, options::OPT_ftlsmodel_EQ);

   // Forward -f (flag) options which we can pass directly.
   Args.AddLastArg(CmdArgs, options::OPT_femit_all_decls);
   Args.AddLastArg(CmdArgs, options::OPT_fheinous_gnu_extensions);
   Args.AddLastArg(CmdArgs, options::OPT_fdigraphs, options::OPT_fno_digraphs);

lib/Frontend/CompilerInvocation.cpp

@@ #include "clang/Frontend/LangStandards.def"

   // The type-visibility mode defaults to the value-visibility mode.
   if (Arg *typeVisOpt = Args.getLastArg(OPT_ftype_visibility)) {
     Opts.setTypeVisibilityMode(parseVisibility(typeVisOpt, Args, Diags));
   } else {
     Opts.setTypeVisibilityMode(Opts.getValueVisibilityMode());
   }

+  // The amdgpu-non-kernel-function-visibility mode defaults to the
+  // value-visibility mode.
+  if (Arg *amdgpuVisOpt =
+          Args.getLastArg(OPT_fvisibility_amdgpu_non_kernel_functions)) {
+    Opts.setAMDGPUNonKernelFunctionVisibilityMode(
+        parseVisibility(amdgpuVisOpt, Args, Diags));
+  } else {
+    Opts.setAMDGPUNonKernelFunctionVisibilityMode(
+        Opts.getValueVisibilityMode());
+  }
+
   if (Args.hasArg(OPT_fvisibility_inlines_hidden))
     Opts.InlineVisibilityHidden = 1;

   if (Args.hasArg(OPT_ftrapv)) {
     Opts.setSignedOverflowBehavior(LangOptions::SOB_Trapping);
     // Set the handler, if one is specified.
     Opts.OverflowHandler =
         Args.getLastArgValue(OPT_ftrapv_handler);

test/CodeGen/visibility-amdgpu-non-kernel-functions.cl

This file was added.

// RUN: %clang_cc1 -fvisibility hidden -triple amdgcn-unknown-unknown -S -emit-llvm -o - %s | FileCheck --check-prefix=FVIS_HIDDEN %s
// RUN: %clang_cc1 -fvisibility-amdgpu-non-kernel-functions hidden -triple amdgcn-unknown-unknown -S -emit-llvm -o - %s | FileCheck --check-prefix=FVIS_AMDGPU_HIDDEN %s
// RUN: %clang_cc1 -fvisibility-amdgpu-non-kernel-functions default -triple amdgcn-unknown-unknown -S -emit-llvm -o - %s | FileCheck --check-prefix=FVIS_AMDGPU_DEFAULT %s
// RUN: %clang_cc1 -fvisibility hidden -fvisibility-amdgpu-non-kernel-functions default -triple amdgcn-unknown-unknown -S -emit-llvm -o - %s | FileCheck --check-prefix=FVIS_AMDGPU_OVERRIDE %s

// FVIS_HIDDEN: define hidden amdgpu_kernel void @kern()
// FVIS_AMDGPU_HIDDEN: define amdgpu_kernel void @kern()
// FVIS_AMDGPU_DEFAULT: define amdgpu_kernel void @kern()
// FVIS_AMDGPU_OVERRIDE: define hidden amdgpu_kernel void @kern()
kernel void kern() {}
// FVIS_HIDDEN: define amdgpu_kernel void @default_kern()
// FVIS_AMDGPU_HIDDEN: define amdgpu_kernel void @default_kern()
// FVIS_AMDGPU_DEFAULT: define amdgpu_kernel void @default_kern()
// FVIS_AMDGPU_OVERRIDE: define amdgpu_kernel void @default_kern()
__attribute__((visibility("default"))) kernel void default_kern() {}
// FVIS_HIDDEN: define hidden amdgpu_kernel void @hidden_kern()
// FVIS_AMDGPU_HIDDEN: define hidden amdgpu_kernel void @hidden_kern()
// FVIS_AMDGPU_DEFAULT: define hidden amdgpu_kernel void @hidden_kern()
// FVIS_AMDGPU_OVERRIDE: define hidden amdgpu_kernel void @hidden_kern()
__attribute__((visibility("hidden"))) kernel void hidden_kern() {}
// FVIS_HIDDEN: define protected amdgpu_kernel void @protected_kern()
// FVIS_AMDGPU_HIDDEN: define protected amdgpu_kernel void @protected_kern()
// FVIS_AMDGPU_DEFAULT: define protected amdgpu_kernel void @protected_kern()
// FVIS_AMDGPU_OVERRIDE: define protected amdgpu_kernel void @protected_kern()
__attribute__((visibility("protected"))) kernel void protected_kern() {}

// FVIS_HIDDEN: define hidden void @func()
// FVIS_AMDGPU_HIDDEN: define hidden void @func()
// FVIS_AMDGPU_DEFAULT: define void @func()
// FVIS_AMDGPU_OVERRIDE: define void @func()
void func() {}
// FVIS_HIDDEN: define void @default_func()
// FVIS_AMDGPU_HIDDEN: define void @default_func()
// FVIS_AMDGPU_DEFAULT: define void @default_func()
// FVIS_AMDGPU_OVERRIDE: define void @default_func()
__attribute__((visibility("default"))) void default_func() {}
// FVIS_HIDDEN: define hidden void @hidden_func()
// FVIS_AMDGPU_HIDDEN: define hidden void @hidden_func()
// FVIS_AMDGPU_DEFAULT: define hidden void @hidden_func()
// FVIS_AMDGPU_OVERRIDE: define hidden void @hidden_func()
__attribute__((visibility("hidden"))) void hidden_func() {}
// FVIS_HIDDEN: define protected void @protected_func()
// FVIS_AMDGPU_HIDDEN: define protected void @protected_func()
// FVIS_AMDGPU_DEFAULT: define protected void @protected_func()
// FVIS_AMDGPU_OVERRIDE: define protected void @protected_func()
__attribute__((visibility("protected"))) void protected_func() {}

test/Driver/amdgpu-visibility.cl

 // RUN: %clang -### -target amdgcn-amd-amdhsa -x cl -c -emit-llvm %s 2>&1 | FileCheck -check-prefix=DEFAULT %s
-// RUN: %clang -### -target amdgcn-amd-amdhsa -x cl -c -emit-llvm -fvisibility=protected %s 2>&1 | FileCheck -check-prefix=OVERRIDE-PROTECTED %s
-// RUN: %clang -### -target amdgcn-amd-amdhsa -x cl -c -emit-llvm -fvisibility-ms-compat %s 2>&1 | FileCheck -check-prefix=OVERRIDE-MS %s
+// RUN: %clang -### -target amdgcn-amd-amdhsa -x cl -c -emit-llvm -fvisibility=hidden %s 2>&1 | FileCheck -check-prefix=VISIBILITY-HIDDEN %s
+// RUN: %clang -### -target amdgcn-amd-amdhsa -x cl -c -emit-llvm -fvisibility-amdgpu-non-kernel-functions=hidden %s 2>&1 | FileCheck -check-prefix=AMDGPU-HIDDEN %s
+// RUN: %clang -### -target amdgcn-amd-amdhsa -x cl -c -emit-llvm -fvisibility-amdgpu-non-kernel-functions=default %s 2>&1 | FileCheck -check-prefix=AMDGPU-DEFAULT %s
+// RUN: %clang -### -target amdgcn-amd-amdhsa -x cl -c -emit-llvm -fvisibility=hidden -fvisibility-amdgpu-non-kernel-functions=default %s 2>&1 | FileCheck -check-prefix=VISIBILITY-HIDDEN-AMDGPU-DEFAULT %s

-// DEFAULT: "-fvisibility" "hidden"
-// OVERRIDE-PROTECTED: "-fvisibility" "protected"
-// OVERRIDE-MS: "-fvisibility" "hidden" "-ftype-visibility" "default"
+// DEFAULT: "-fvisibility-amdgpu-non-kernel-functions" "hidden"
+// VISIBILITY-HIDDEN: "-fvisibility-amdgpu-non-kernel-functions" "hidden"
+// AMDGPU-HIDDEN: "-fvisibility-amdgpu-non-kernel-functions" "hidden"
+// AMDGPU-DEFAULT: "-fvisibility-amdgpu-non-kernel-functions" "default"
+// VISIBILITY-HIDDEN-AMDGPU-DEFAULT: "-fvisibility-amdgpu-non-kernel-functions" "default"
