This is an archive of the discontinued LLVM Phabricator instance.

Differential D40573

[NVPTX] Assign valid global names
ClosedPublic

Authored by Hahnfeld on Nov 28 2017, 12:21 PM.

Details

Reviewers

tra
rafael
hfinkel
rnk

Commits

rG5db24d7c2290: [NVPTX] Assign valid global names
rL319657: [NVPTX] Assign valid global names

Summary

PTX requires that identifiers consist only of [a-zA-Z0-9_$]. The
existing pass already ensured this for globals and this patch adds
the cleanup for functions with local linkage.

However, there was a different problem in the case of collisions
of the adjusted name: The ValueSymbolTable then automatically
appended ".N" with increasing Ns to get a unique name while helping
the ABI demangling. Special case this behavior to omit the dots and
append N directly. This will always give us legal names according
to the PTX requirements.
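
As a rough illustration of the sanitizing half of the summary, here is a minimal sketch. It is not the pass's actual cleanUpName(), whose body is not shown in this diff. sanitizeForPTX is a hypothetical name, and the "_$_" replacement marker is an assumption inferred from the names expected in symbol-naming.ll (for example, .str becoming _$_str).

```cpp
#include <cctype>
#include <string>

// Hypothetical helper, NOT the pass's real cleanUpName(): every character
// outside [A-Za-z0-9_$] is replaced by the marker "_$_" (an assumption
// based on the names expected in the test file).
std::string sanitizeForPTX(const std::string &Name) {
  std::string Valid;
  for (unsigned char C : Name) {
    if (std::isalnum(C) || C == '_' || C == '$')
      Valid += static_cast<char>(C); // already legal in a PTX identifier
    else
      Valid += "_$_";                // replace anything else
  }
  return Valid;
}
```

Note that sanitizing can map a name onto one that already exists (a module may contain both .str and _$_str), which is why the collision handling described in the second paragraph matters.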

Event Timeline

Hahnfeld created this revision. Nov 28 2017, 12:21 PM

Herald added a subscriber: jholewinski. Nov 28 2017, 12:21 PM

This is essentially the fourth attempt after https://reviews.llvm.org/D17738, https://reviews.llvm.org/D29883, and https://reviews.llvm.org/D39005.

Let me come back to some of the previous comments:

In D17738#416060, @rnk wrote:

The problem there is that we would have to make the IR symbol renamer not append '.N' when renaming symbols with name collisions. Until then, we need something like this.

In D17738#661115, @hfinkel wrote:

It is also true that we need some way to generate unique symbol names that does not interfere with existing mangling schemes. This means that we need to have some character that is not in [A-Za-z0-9_], and is not @ (because that's used by ELF symbol versioning). ?$@' are used by MSVC mangling. PTX allows only [A-Za-z0-9_$] (and also %, but only as the first character, so that doesn't help). Unfortunately, this does not seem to leave anything we can use (we can use '$' for PTX, but not for MSVC).

One option is that we add a function to LLVM to get an available separator character, which can default to '.', but which we set to '$' for nvptx, and use that for generating new names at the IR level. Thoughts?

I opted to implement this approach as a special case in ValueSymbolTable::makeUniqueName(). That way we can't get an "invalid" result from setName() which avoids the loop that the previous patches used for cleanup.
I basically followed @hfinkel's analysis of which characters we can use which doesn't leave much choice.

In D17738#661168, @rnk wrote:

This seems practical. Perhaps it could be part of the name mangling scheme already encoded in DataLayout?

In D39005#904580, @hfinkel wrote:

DataLayout generally holds information that the target-independent optimizer needs in order to simplify the IR into our canonical form. This is as opposed to TargetTransformInfo, which provides data necessary to optimize the IR in target-aware ways (e.g., do things that are orthogonal to canonicalization such as inlining and vectorization). It is also as opposed to external utility functions that might be used by the frontend (e.g., llvm::sys::getHostCPUName()). If I recall correctly, this is information that would be used by the frontend when generating the IR, and the function results are controlled by the triple. As a result, I think that a general utility function somewhere would be fine.

I explicitly chose not to encode that information in the DataLayout:

I'm not sure the core IR library has access to the currently used DataLayout.
If I understood the code correctly, the DataLayout is set by the frontend, probably Clang in most cases. I don't think we should teach the frontend about a (correctness) decision that the backend can get right based on the selected target.

Hahnfeld added a parent revision: D40572: [OpenMP] Make test robust against quotation, NFC. Nov 28 2017, 12:24 PM

There must be some truth in the saying "naming is one of the hardest problems in computer science". :-/

lib/IR/ValueSymbolTable.cpp
54–59	This patch addresses "we can't compile generated PTX because LLVM uses illegal characters", but exposes another issue: having potentially different names on host and device is a problem for CUDA. For some objects, the host side may need to know what the object is called on the device side. We need that in order to access it from the host (e.g. cudaMemcpyToSymbol(), or initializing static variables), and we currently assume that the names are the same. If such a symbol gets different names on host and device, compilation will succeed, but we'll have problems at runtime. Does "." have any special meaning? Can we skip the unique delimiter altogether? If we can't find a suitable way to guarantee identical naming, we'll need a reliable way to determine the name used on the other side of the compilation.

In D40573#939676, @tra wrote:

There must be some truth in the saying "naming is one of the hardest problems in computer science". :-/

Indeed, and the conflicting requirements listed by Hal make it even harder to be correct for all cases :-(

lib/IR/ValueSymbolTable.cpp
54–59	So the interesting question is: When will this code ever be hit? Most programming languages (including C and C++) obviously don't allow multiple variables of the same name; how would the compiler say which symbol you meant? The use case I've mostly seen is compiler-generated functions, `omp_outlined` for example. These can be generated multiple times in the same translation unit and have to get unique names. Do you have another example where this could happen? I'm not really sure '.' has a special meaning. Maybe @rafael can help because one of his old commits (https://reviews.llvm.org/rL253804) says `For globals it is important to use "foo.1" to help C++ name demangling.`

tra added inline comments. Nov 29 2017, 2:25 PM

lib/IR/ValueSymbolTable.cpp
54–59	I vaguely recall that '.' was an indication for demangler that it should not proceed further. I.e. a sort-of-special character to indicate the end of the C++-mangled part of the symbol name. If name mangling can't be made identical (and it looks like it may be the case), we can probably work around it. I.e. for symbols that must have identical names on both sides we can generate a unique alias that's identical on both sides and use it instead when CUDA needs it.

Hahnfeld added inline comments. Nov 29 2017, 3:13 PM

lib/IR/ValueSymbolTable.cpp
54–59	But why is LLVM responsible for mangling? Shouldn't this be done by the Clang frontend? I've found this test in `libcxxabi/test/test_demangle.pass.cpp`: {"_ZNK10__cxxabiv111__libcxxabi5__sub20first_demangled_nameEPc.eh", "__cxxabiv1::__libcxxabi::__sub::first_demangled_name(char*) const (.eh)"}, As said, I can't imagine a user defined value where LLVM needs to generate a unique name, so this should be fine for the CUDA functions you mentioned because they operate on variables.

tra added inline comments. Nov 30 2017, 10:12 AM

lib/IR/ValueSymbolTable.cpp
54–59	Sorry. I should've said "the way LLVM creates unique names". C++ mangling is indeed handled by clang. AFAICT, we do need to consider the possibility of LLVM generating a unique name whenever Clang calls LLVM's Value::setName(). The API explicitly guarantees that the name will be unique. Another case would be an identifier with a unicode symbol in it. NVPTX would have to sanitize it, with the result potentially clashing with an existing name, similar to the @.str test case below. This is hypothetical at the moment as clang does not support unicode (yet?).

Hahnfeld added inline comments. Nov 30 2017, 10:26 AM

lib/IR/ValueSymbolTable.cpp
54–59	All right, then the next natural question is: Do we need to keep the C++ mangling in mind when creating the unique name? Because that is currently the reason that there is a dot separator. If we don't need it we could just drop the dot and every target would be happy. In addition this would guarantee that LLVM will produce a "valid" name after NVPTX sanitized the name and there was a clash.

tra added inline comments. Nov 30 2017, 10:49 AM

lib/IR/ValueSymbolTable.cpp
54–59	Alas dropping the dot does mess with the mangling. In that respect '$' would be lesser evil, at least we'll get the pre-uniquification C++ name demangled correctly. I'm not sure what effect that would have on demangling on windows.

    $ c++filt _Z1fv _Z1fv.1 _Z1fv$2 _Z1fv3
    f()
    f() [clone .1]
    f()
    _Z1fv3
    $ bin/llvm-cxxfilt _Z1fv _Z1fv.1 _Z1fv$2 _Z1fv3
    f()
    f() (.1)
    f()
    _Z1fv3

It looks like we'll have to use '$' for uniquification in nvptx as your patch does. That, at least, will deal with the part of the issue we need right now: generate PTX which ptxas can compile and keep demanglers working. The cases where uniquification mismatch would cause problems should be rare. We can solve that problem separately. It's not perfect, but it looks like we don't have many options here. I don't have any better ideas. @hfinkel, @rnk -- your thoughts?

Hahnfeld added inline comments. Nov 30 2017, 11:26 AM

lib/IR/ValueSymbolTable.cpp
54–59	Hehe, I guess you executed this on a shell? Then `$2` is an (empty) variable and the right commands are:

    $ c++filt _Z1fv _Z1fv.1 _Z1fv\$2 _Z1fv3
    f()
    f() [clone .1]
    _Z1fv$2
    _Z1fv3
    $ bin/llvm-cxxfilt _Z1fv _Z1fv.1 _Z1fv\$2 _Z1fv3
    f()
    f() (.1)
    _Z1fv$2
    _Z1fv3

(at least the two demanglers agree...) So a dollar sign doesn't help much which means we could also drop it entirely.
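
The same check can be made without a shell in the way, which sidesteps the `$2` expansion entirely. The following is only a sketch: demangleOrKeep is a hypothetical helper, and it assumes a toolchain that ships <cxxabi.h> with abi::__cxa_demangle (as GCC and Clang do for the Itanium C++ ABI).

```cpp
#include <cxxabi.h>
#include <cstdlib>
#include <string>

// Hypothetical helper: returns the demangled form of Mangled, or the input
// unchanged when the demangler rejects it (which is what c++filt prints).
std::string demangleOrKeep(const std::string &Mangled) {
  int Status = 0;
  // With a null output buffer, __cxa_demangle mallocs the result itself.
  char *Out = abi::__cxa_demangle(Mangled.c_str(), nullptr, nullptr, &Status);
  if (Status != 0 || Out == nullptr)
    return Mangled; // not a valid mangled name
  std::string Result(Out);
  std::free(Out);   // the caller owns the malloc'ed buffer
  return Result;
}
```

Calling it on "_Z1fv$2" or "_Z1fv3" should hand the input back unchanged, in line with the corrected transcripts above. The text produced for a ".1" suffix depends on the demangler implementation, as the two transcripts show.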

tra added inline comments. Nov 30 2017, 11:56 AM

lib/IR/ValueSymbolTable.cpp
54–59	Oops. :-( Well, bummer. I guess we'll have to live with un-demangle-able unique names on NVPTX side. It's still an improvement over not being able to compile anything with such names.

Hahnfeld added inline comments. Nov 30 2017, 11:59 AM

lib/IR/ValueSymbolTable.cpp
54–59	Ok, I'll update the patch to not even use `$` which saves us quotation (see D40572).

Drop '$' and use no separator as it doesn't help with ABI demangling.
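
For readers who want to see what "no separator" means in practice, here is a minimal model of the uniquing loop. It is a sketch under stated assumptions, not LLVM's ValueSymbolTable: NameTable and insertUnique are hypothetical names, and a std::set stands in for the real symbol map.

```cpp
#include <set>
#include <string>

// Hypothetical stand-in for a symbol table; NOT LLVM's ValueSymbolTable.
class NameTable {
  std::set<std::string> Names;
  unsigned LastUnique = 0; // one counter shared by all names in the table

public:
  // Inserts Base if it is free. Otherwise appends an increasing counter
  // directly (no '.' or '$' in between) until the result is unused.
  std::string insertUnique(const std::string &Base) {
    if (Names.insert(Base).second)
      return Base;
    while (true) {
      std::string Candidate = Base + std::to_string(++LastUnique);
      if (Names.insert(Candidate).second)
        return Candidate;
    }
  }
};
```

Because only digits are appended, a base name made of [A-Za-z0-9_$] stays within that set after uniquing. In this model, inserting "_$_str" twice yields "_$_str" and then "_$_str1".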

I am OK with the change, but please wait a bit in case @rnk or @hfinkel have further comments.

This revision is now accepted and ready to land. Nov 30 2017, 2:11 PM

lgtm

This is silly. This bug has been open for so long that nvidia could've just fixed their toolchain by now to accept dots in symbol names. =p

I defer to the others here.

Closed by commit rL319657: [NVPTX] Assign valid global names (authored by Hahnfeld). Dec 4 2017, 6:20 AM

This revision was automatically updated to reflect the committed changes.

Revision Contents

Path                                                Size
lib/IR/ValueSymbolTable.cpp                         14 lines
lib/Target/NVPTX/NVPTXAssignValidGlobalNames.cpp    6 lines
test/CodeGen/NVPTX/symbol-naming.ll                 33 lines

Diff 124616

lib/IR/ValueSymbolTable.cpp

	//===- ValueSymbolTable.cpp - Implement the ValueSymbolTable class --------===//			//===- ValueSymbolTable.cpp - Implement the ValueSymbolTable class --------===//
	//			//
	// The LLVM Compiler Infrastructure			// The LLVM Compiler Infrastructure
	//			//
	// This file is distributed under the University of Illinois Open Source			// This file is distributed under the University of Illinois Open Source
	// License. See LICENSE.TXT for details.			// License. See LICENSE.TXT for details.
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//
	//			//
	// This file implements the ValueSymbolTable class for the IR library.			// This file implements the ValueSymbolTable class for the IR library.
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	#include "llvm/IR/ValueSymbolTable.h"			#include "llvm/IR/ValueSymbolTable.h"
	#include "llvm/ADT/SmallString.h"			#include "llvm/ADT/SmallString.h"
				#include "llvm/ADT/Triple.h"
	#include "llvm/IR/GlobalValue.h"			#include "llvm/IR/GlobalValue.h"
				#include "llvm/IR/Module.h"
	#include "llvm/IR/Type.h"			#include "llvm/IR/Type.h"
	#include "llvm/IR/Value.h"			#include "llvm/IR/Value.h"
	#include "llvm/Support/Casting.h"			#include "llvm/Support/Casting.h"
	#include "llvm/Support/Compiler.h"			#include "llvm/Support/Compiler.h"
	#include "llvm/Support/Debug.h"			#include "llvm/Support/Debug.h"
	#include "llvm/Support/raw_ostream.h"			#include "llvm/Support/raw_ostream.h"
	#include <cassert>			#include <cassert>
	#include <utility>			#include <utility>
	[... 15 lines not shown ...]

	ValueName *ValueSymbolTable::makeUniqueName(Value *V,			ValueName *ValueSymbolTable::makeUniqueName(Value *V,
	SmallString<256> &UniqueName) {			SmallString<256> &UniqueName) {
	unsigned BaseSize = UniqueName.size();			unsigned BaseSize = UniqueName.size();
	while (true) {			while (true) {
	// Trim any suffix off and append the next number.			// Trim any suffix off and append the next number.
	UniqueName.resize(BaseSize);			UniqueName.resize(BaseSize);
	raw_svector_ostream S(UniqueName);			raw_svector_ostream S(UniqueName);
	if (isa<GlobalValue>(V))			if (auto *GV = dyn_cast<GlobalValue>(V)) {
				const Module *M = GV->getParent();
				// On NVPTX we cannot use a dot because PTX only allows [A-Za-z0-9_$] for
				// identifiers. Instead we use a dollar sign which wouldn't be legal for
				// MSVC name mangling but is fine for PTX.
				if (M && Triple(M->getTargetTriple()).isNVPTX())
				S << "$";
				else
	S << ".";			S << ".";
				}
	S << ++LastUnique;			S << ++LastUnique;

	// Try insert the vmap entry with this suffix.			// Try insert the vmap entry with this suffix.
	auto IterBool = vmap.insert(std::make_pair(UniqueName, V));			auto IterBool = vmap.insert(std::make_pair(UniqueName, V));
	if (IterBool.second)			if (IterBool.second)
	return &*IterBool.first;			return &*IterBool.first;
	}			}
	}			}
	[... 57 lines not shown ...]

lib/Target/NVPTX/NVPTXAssignValidGlobalNames.cpp

[... 12 lines not shown ...]
// Currently NVPTX, like other backends, relies on generic symbol name		// Currently NVPTX, like other backends, relies on generic symbol name
// sanitizing done by MC. However, the ptxas assembler is more stringent and		// sanitizing done by MC. However, the ptxas assembler is more stringent and
// disallows some additional characters in symbol names. This pass makes sure		// disallows some additional characters in symbol names. This pass makes sure
// such names do not reach MC at all.		// such names do not reach MC at all.
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#include "NVPTX.h"		#include "NVPTX.h"
		#include "llvm/IR/Function.h"
#include "llvm/IR/GlobalVariable.h"		#include "llvm/IR/GlobalVariable.h"
#include "llvm/IR/LegacyPassManager.h"		#include "llvm/IR/LegacyPassManager.h"
#include "llvm/IR/Module.h"		#include "llvm/IR/Module.h"
#include "llvm/Support/raw_ostream.h"		#include "llvm/Support/raw_ostream.h"
#include <string>		#include <string>

using namespace llvm;		using namespace llvm;

[... 27 lines not shown ...]
if (GV.hasLocalLinkage()) {		if (GV.hasLocalLinkage()) {
// setName doesn't do extra work if the name does not change.		// setName doesn't do extra work if the name does not change.
// Note: this does not create collisions - if setName is asked to set the		// Note: this does not create collisions - if setName is asked to set the
// name to something that already exists, it adds a proper postfix to		// name to something that already exists, it adds a proper postfix to
// avoid collisions.		// avoid collisions.
GV.setName(cleanUpName(GV.getName()));		GV.setName(cleanUpName(GV.getName()));
}		}
}		}

		// Do the same for local functions.
		for (Function &F : M.functions())
		if (F.hasLocalLinkage())
		F.setName(cleanUpName(F.getName()));

return true;		return true;
}		}

std::string NVPTXAssignValidGlobalNames::cleanUpName(StringRef Name) {		std::string NVPTXAssignValidGlobalNames::cleanUpName(StringRef Name) {
std::string ValidName;		std::string ValidName;
raw_string_ostream ValidNameStream(ValidName);		raw_string_ostream ValidNameStream(ValidName);
for (unsigned I = 0, E = Name.size(); I != E; ++I) {		for (unsigned I = 0, E = Name.size(); I != E; ++I) {
char C = Name[I];		char C = Name[I];
[... 13 lines not shown ...]

test/CodeGen/NVPTX/symbol-naming.ll

	; RUN: llc < %s -march=nvptx -mcpu=sm_20 | FileCheck %s --check-prefix=PTX32			; RUN: llc < %s -march=nvptx -mcpu=sm_20 | FileCheck %s
	; RUN: llc < %s -march=nvptx64 -mcpu=sm_20 | FileCheck %s --check-prefix=PTX64			; RUN: llc < %s -march=nvptx64 -mcpu=sm_20 | FileCheck %s

	; Verify that the NVPTX target removes invalid symbol names prior to emitting			; Verify that the NVPTX target removes invalid symbol names prior to emitting
	; PTX.			; PTX.

	; PTX32-NOT: .str			; CHECK-NOT: .str
	; PTX64-NOT: .str			; CHECK-NOT: .function.

	; PTX32-DAG: _$_str.1			; CHECK-DAG: _$_str
	; PTX32-DAG: _$_str			; CHECK-DAG: _$_str$1

	; PTX64-DAG: _$_str.1			; CHECK-DAG: _$_function_$_
	; PTX64-DAG: _$_str			; CHECK-DAG: _$_function_$_$2

	target datalayout = "e-i64:64-v16:16-v32:32-n16:32:64"			target datalayout = "e-i64:64-v16:16-v32:32-n16:32:64"
	target triple = "nvptx64-unknown-unknown"			target triple = "nvptx64-unknown-unknown"


	@.str = private unnamed_addr constant [13 x i8] c"%d %f %c %d\0A\00", align 1			@.str = private unnamed_addr constant [13 x i8] c"%d %f %c %d\0A\00", align 1
	@_$_str = private unnamed_addr constant [13 x i8] c"%d %f %c %d\0A\00", align 1			@_$_str = private unnamed_addr constant [13 x i8] c"%d %f %c %d\0A\00", align 1


	; Function Attrs: nounwind			; Function Attrs: nounwind
	define void @foo(i32 %a, float %b, i8 signext %c, i32 %e) {			define internal void @.function.() {
	entry:			entry:
	%call = call i32 (i8*, ...) @printf(i8* getelementptr inbounds ([13 x i8], [13 x i8]* @.str, i32 0, i32 0))			%call = call i32 (i8*, ...) @printf(i8* getelementptr inbounds ([13 x i8], [13 x i8]* @.str, i32 0, i32 0))
	ret void			ret void
	}			}

				; Function Attrs: nounwind
				define internal void @_$_function_$_() {
				entry:
				%call = call i32 (i8*, ...) @printf(i8* getelementptr inbounds ([13 x i8], [13 x i8]* @_$_str, i32 0, i32 0))
				ret void
				}

				; Function Attrs: nounwind
				define void @global_function() {
				entry:
				call void @.function.()
				call void @_$_function_$_()
				ret void
				}

	declare i32 @printf(i8*, ...)			declare i32 @printf(i8*, ...)