This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/
-
lib/Transforms/IPO/
-
Transforms/
-
IPO/
5/16
ThinLTOBitcodeWriter.cpp
-
test/
-
ThinLTO/X86/
-
X86/
-
devirt2.ll
-
Transforms/ThinLTOBitcodeWriter/
-
ThinLTOBitcodeWriter/
-
cfi-icall-static-inline-asm.ll
-
split-internal2.ll
-
split-vfunc-internal.ll

Differential D104058

ThinLTO: Fix inline assembly references to static functions with CFI
ClosedPublic

Authored by samitolvanen on Jun 10 2021, 1:44 PM.

Download Raw Diff

Details

Reviewers

nickdesaulniers
pcc
tejohnson
kees
eugenis

Commits

rG7ce1c4da7726: ThinLTO: Fix inline assembly references to static functions with CFI
rG700d07f8ce6f: ThinLTO: Fix inline assembly references to static functions with CFI
rG8e3b5cb39eef: ThinLTO: Fix inline assembly references to static functions with CFI
rGe3d24b45b8f8: ThinLTO: Fix inline assembly references to static functions with CFI
rG4474958d3a97: ThinLTO: Fix inline assembly references to static functions with CFI

Summary

Create an internal alias with the original name for static functions
that are renamed in promoteInternals to avoid breaking inline
assembly references to them.

Link: https://github.com/ClangBuiltLinux/linux/issues/1354

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

samitolvanen created this revision.Jun 10 2021, 1:44 PM

Herald added subscribers: ormris, steven_wu, hiraditya, inglorion. · View Herald TranscriptJun 10 2021, 1:44 PM

samitolvanen requested review of this revision.Jun 10 2021, 1:44 PM

Herald added projects: Restricted Project, Restricted Project. · View Herald TranscriptJun 10 2021, 1:44 PM

Herald added subscribers: llvm-commits, cfe-commits. · View Herald Transcript

samitolvanen added reviewers: nickdesaulniers, pcc, tejohnson, kees, eugenis.Jun 10 2021, 1:47 PM

samitolvanen added inline comments.

llvm/lib/Transforms/IPO/ThinLTOBitcodeWriter.cpp
93	Note that I'm adding the alias to llvm.compiler.used because it's otherwise removed during optimization. Is there are better way to accomplish this?

Harbormaster completed remote builds in B108682: Diff 351253.Jun 10 2021, 2:52 PM

LGTM

clang/test/CodeGen/thinlto-cfi-icall-static-inline-asm.c
3 ↗	(On Diff #351253)	Can the test be moved to `llvm/test/Transforms/ThinLTOBitcodeWriter` and made to use `opt -thinlto-bc`?
llvm/lib/Transforms/IPO/ThinLTOBitcodeWriter.cpp
93	Not as far as I know, that's what I'd recommend.

This revision is now accepted and ready to land.Jun 17 2021, 9:16 AM

Moved the test to llvm/test/Transforms/ThinLTOBitcodeWriter.

Harbormaster completed remote builds in B109792: Diff 352825.Jun 18 2021, 3:47 AM

Closed by commit rG4474958d3a97: ThinLTO: Fix inline assembly references to static functions with CFI (authored by samitolvanen). · Explain WhyJun 22 2021, 10:02 AM

This revision was automatically updated to reflect the committed changes.

samitolvanen added a commit: rG4474958d3a97: ThinLTO: Fix inline assembly references to static functions with CFI.

Looks like this breaks check-llvm on mac, see http://45.33.8.238/mac/32814/summary.html

Please take a look and revert for now if it takes a while to fix.

samitolvanen added a reverting change: rG33c9438f1166: Revert "ThinLTO: Fix inline assembly references to static functions with CFI".Jun 22 2021, 12:11 PM

samitolvanen reopened this revision.Jun 22 2021, 2:34 PM

This revision is now accepted and ready to land.Jun 22 2021, 2:34 PM

Fix a use-of-uninitialized-value error.

Harbormaster completed remote builds in B110508: Diff 353797.Jun 22 2021, 3:49 PM

nickdesaulniers added inline comments.Jun 22 2021, 6:09 PM

llvm/lib/Transforms/IPO/ThinLTOBitcodeWriter.cpp
71–93	Can you avoid making a copy of the OldName by doing the `appendToCompilerUsed` BEFORE making the dangling reference via `ExportGV.setName(NewName);`?

samitolvanen added inline comments.Jun 23 2021, 8:24 AM

llvm/lib/Transforms/IPO/ThinLTOBitcodeWriter.cpp
71–93	No, I have to rename the existing function before I can create an alias with the same name, and as `ExportGV.setName()` invalidates `Name`, I need to create a copy first.

nickdesaulniers accepted this revision.Jun 23 2021, 10:03 AM

This revision was landed with ongoing or failed builds.Jun 23 2021, 11:10 AM

Closed by commit rGe3d24b45b8f8: ThinLTO: Fix inline assembly references to static functions with CFI (authored by samitolvanen). · Explain Why

This revision was automatically updated to reflect the committed changes.

samitolvanen added a commit: rGe3d24b45b8f8: ThinLTO: Fix inline assembly references to static functions with CFI.

Hi, this caused compiler crash: "Assertion `materialized_use_empty() && "Uses remain when a value is destroyed!"'" on chromium build https://ci.chromium.org/ui/p/chromium/builders/try/linux-official/151/overview.
Here is the error message:

While deleting: void ()* %
Use still stuck around after Def is destroyed:i8* bitcast (void ()* <badref> to i8*)
While deleting: void ()* %
Use still stuck around after Def is destroyed:i8* bitcast (voidld.lld: /usr/local/google/home/zequanwu/llvm-project/llvm/lib/IR/Value.cpp:103: llvm::Value::~Value(): Assertion `materialized_use_empty() && "Uses remain when a value is destroyed!"' failed.
 ()* <badref> to i8*)

zequanwu added a reverting change: rG9393894331e9: Revert "ThinLTO: Fix inline assembly references to static functions with CFI".Jun 23 2021, 7:25 PM

Thanks for the revert, I'll take a look.

samitolvanen reopened this revision.Jul 13 2021, 11:19 AM

This revision is now accepted and ready to land.Jul 13 2021, 11:19 AM

Moved the alias creation to module level inline assembly to avoid issues with LowerTypeTestsModule, based on pcc's suggestion.

Harbormaster completed remote builds in B113802: Diff 358364.Jul 13 2021, 12:59 PM

Change LGTM, but I don't understand why the following tests are modified:

llvm/test/ThinLTO/X86/devirt2.ll
llvm/test/Transforms/ThinLTOBitcodeWriter/split-internal2.ll
llvm/test/Transforms/ThinLTOBitcodeWriter/split-vfunc-internal.ll

In D104058#2877631, @nickdesaulniers wrote:

Change LGTM, but I don't understand why the following tests are modified:

llvm/test/ThinLTO/X86/devirt2.ll

This is needed to fix two missing symbol resolution errors that are caused by the aliases we added.

llvm/test/Transforms/ThinLTOBitcodeWriter/split-internal2.ll

llvm/test/Transforms/ThinLTOBitcodeWriter/split-vfunc-internal.ll

And for these, we need to specify a target triple to use module inline assembly. According to pcc, there shouldn't be a real-world situation where the triple is missing, but these two tests don't currently specify one.

In D104058#2878083, @samitolvanen wrote:

In D104058#2877631, @nickdesaulniers wrote:

Change LGTM, but I don't understand why the following tests are modified:

llvm/test/ThinLTO/X86/devirt2.ll

This is needed to fix two missing symbol resolution errors that are caused by the aliases we added.

I'm curious if this will lead to breakages with LTO in general? I suppose not, since it's llvm-lto2 that needs the explicit list of symbols that can be linked against.

llvm/test/Transforms/ThinLTOBitcodeWriter/split-internal2.ll

llvm/test/Transforms/ThinLTOBitcodeWriter/split-vfunc-internal.ll

And for these, we need to specify a target triple to use module inline assembly. According to pcc, there shouldn't be a real-world situation where the triple is missing, but these two tests don't currently specify one.

Neither of these tests use module inline assembly though, AFAICT?

In D104058#2883804, @nickdesaulniers wrote:

In D104058#2878083, @samitolvanen wrote:

In D104058#2877631, @nickdesaulniers wrote:

Change LGTM, but I don't understand why the following tests are modified:

llvm/test/ThinLTO/X86/devirt2.ll

This is needed to fix two missing symbol resolution errors that are caused by the aliases we added.

I'm curious if this will lead to breakages with LTO in general? I suppose not, since it's llvm-lto2 that needs the explicit list of symbols that can be linked against.

No, this won't break LTO.

llvm/test/Transforms/ThinLTOBitcodeWriter/split-internal2.ll

llvm/test/Transforms/ThinLTOBitcodeWriter/split-vfunc-internal.ll

And for these, we need to specify a target triple to use module inline assembly. According to pcc, there shouldn't be a real-world situation where the triple is missing, but these two tests don't currently specify one.

Neither of these tests use module inline assembly though, AFAICT?

The tests don't, but we add module inline assembly to create the aliases, which means that IR with type metadata will need a target triple for ThinLTO.

nickdesaulniers accepted this revision.Jul 16 2021, 11:08 AM

LGTM

This revision was landed with ongoing or failed builds.Jul 16 2021, 2:34 PM

Closed by commit rG8e3b5cb39eef: ThinLTO: Fix inline assembly references to static functions with CFI (authored by samitolvanen). · Explain Why

This revision was automatically updated to reflect the committed changes.

samitolvanen added a commit: rG8e3b5cb39eef: ThinLTO: Fix inline assembly references to static functions with CFI.

samitolvanen added a reverting change: rG0ad1d9fdf22d: Revert "ThinLTO: Fix inline assembly references to static functions with CFI".Jul 16 2021, 2:48 PM

The tests that now specify a target triple also need ; REQUIRES: x86-registered-target or they will obviously fail. I'll upload another revision.

This revision is now accepted and ready to land.Jul 16 2021, 2:55 PM

Added REQUIRES: x86-registered-target to tests.

Sorry, I always forget when these are necessary.

Harbormaster completed remote builds in B114607: Diff 359463.Jul 16 2021, 4:03 PM

This revision was landed with ongoing or failed builds.Jul 20 2021, 10:30 AM

Closed by commit rG700d07f8ce6f: ThinLTO: Fix inline assembly references to static functions with CFI (authored by samitolvanen). · Explain Why

This revision was automatically updated to reflect the committed changes.

samitolvanen added a commit: rG700d07f8ce6f: ThinLTO: Fix inline assembly references to static functions with CFI.

This patch broke the sanitizer-windows bot: https://lab.llvm.org/buildbot/#/builders/127/builds/14257

Failed Tests (4):
  cfi-devirt-lld-thinlto-x86_64 :: anon-namespace.cpp
  cfi-devirt-lld-thinlto-x86_64 :: simple-pass.cpp
  cfi-standalone-lld-thinlto-x86_64 :: anon-namespace.cpp
  cfi-standalone-lld-thinlto-x86_64 :: simple-pass.cpp

Please revert or fix.

In D104058#2891551, @morehouse wrote:
This patch broke the sanitizer-windows bot: https://lab.llvm.org/buildbot/#/builders/127/builds/14257
Failed Tests (4):
  cfi-devirt-lld-thinlto-x86_64 :: anon-namespace.cpp
  cfi-devirt-lld-thinlto-x86_64 :: simple-pass.cpp
  cfi-standalone-lld-thinlto-x86_64 :: anon-namespace.cpp
  cfi-standalone-lld-thinlto-x86_64 :: simple-pass.cpp
Please revert or fix.

This should fix the -msvc targets: https://reviews.llvm.org/D106392

samitolvanen added a reverting change: rGe901e581ef45: Revert "ThinLTO: Fix inline assembly references to static functions with CFI".Jul 20 2021, 2:00 PM

samitolvanen reopened this revision.Jul 20 2021, 2:07 PM

This revision is now accepted and ready to land.Jul 20 2021, 2:07 PM

Folded in D106392.

nickdesaulniers added a subscriber: rnk.Jul 20 2021, 2:09 PM

nickdesaulniers added inline comments.

llvm/lib/Transforms/IPO/ThinLTOBitcodeWriter.cpp
90	Is there more information about "promotion aliases with x86_64-pc-windows-msvc" from D106392? Can we quote these only for msvc target triples? Can we add a comment about the quoting be necessary for those targets? It's still not clear to me how tests using explicit -linux-gnu triples could fail on -mscv hosts.

nickdesaulniers added inline comments.Jul 20 2021, 2:15 PM

llvm/lib/Transforms/IPO/ThinLTOBitcodeWriter.cpp
90	Also, I think the quoting hurts the readability of the generated asm. Maybe that doesn't matter for LTO, but I'd be curious if we could do such escaping only when necessary? Perhaps that's only when targeting -msvc triples?

rnk added inline comments.Jul 20 2021, 2:37 PM

llvm/lib/Transforms/IPO/ThinLTOBitcodeWriter.cpp
90	I was going to say, the proper way to do this is what `MCSymbol::print` does: https://github.com/llvm/llvm-project/blob/main/llvm/lib/MC/MCSymbol.cpp#L59 That code doesn't seem exhaustive, but at least it escapes quotes. We can't call MC from here due to library layering. The LLVM IR readability is poor, but inline asm in IR is already hard to read. I wouldn't worry about that, I'd mainly worry about the output of -S. Quoting always seems fine to me, I guess.

Harbormaster completed remote builds in B115185: Diff 360267.Jul 20 2021, 3:35 PM

samitolvanen added inline comments.Jul 20 2021, 3:41 PM

llvm/lib/Transforms/IPO/ThinLTOBitcodeWriter.cpp
90	Is there more information about "promotion aliases with x86_64-pc-windows-msvc" from D106392? Yes, the -msvc targets use Visual C++ compatible name mangling, which requires quotes when referred to in inline assembly. Here's a trivial reproducer: $ cat test.cpp static void a(void) {} void* b(void) { return (void *)a; } $ clang -flto=thin -fvisibility=default -fsanitize=cfi -c test.cpp $ clang -flto=thin -fvisibility=default -fsanitize=cfi -target x86_64-pc-windows-msvc -c test.cpp Either SourceMgr should be available UNREACHABLE executed at llvm-project/llvm/lib/MC/MCContext.cpp:913! ... Here's the inline assembly generated when we compile the above example for the -msvc target: .set ?a@@YAXXZ,?a@@YAXXZ.d7b56b39ccc53bc7515ae1b2533f1e3d Can we quote these only for msvc target triples? Can we add a comment about the quoting be necessary for those targets? I assume we could limit this only to -msvc targets, but I feel like that would unnecessarily complicate the code as there's no harm in quoting the names always. It's still not clear to me how tests using explicit -linux-gnu triples could fail on -mscv hosts. These tests don't specify a `-linux-gnu` triple. They are C++ and end up using the default target, which in this case is `-msvc`.

samitolvanen added inline comments.Jul 20 2021, 4:32 PM

llvm/lib/Transforms/IPO/ThinLTOBitcodeWriter.cpp
90	I was going to say, the proper way to do this is what `MCSymbol::print` does: https://github.com/llvm/llvm-project/blob/main/llvm/lib/MC/MCSymbol.cpp#L59 That code doesn't seem exhaustive, but at least it escapes quotes. We can't call MC from here due to library layering. Is it actually possible to have function names that contain quotes? If so, I suppose we need to do something similar here and escape any quotes in the names. The LLVM IR readability is poor, but inline asm in IR is already hard to read. I wouldn't worry about that, I'd mainly worry about the output of -S. The promotion happens only when we write bitcode, so the aliases won't yet exist in the `-S` output.

samitolvanen planned changes to this revision.Jul 20 2021, 4:59 PM

As we only care about fixing inline assembly references, mangled names are
not that important in the first place. This version skips any functions
that have unusual characters in their names that would otherwise require
quotes, which includes any functions with MSVC compatible name mangling.

This revision is now accepted and ready to land.Jul 21 2021, 3:17 PM

Harbormaster completed remote builds in B115430: Diff 360615.Jul 21 2021, 5:09 PM

nickdesaulniers added inline comments.Jul 30 2021, 11:08 AM

llvm/lib/Transforms/IPO/ThinLTOBitcodeWriter.cpp
39	What about `MCAsmInfoXCOFF::isAcceptableChar`? I guess we could be targeting a XCOFF object file format with LTO?
45	Can llvm::any_of or llvm::none_of be used here? llvm/ADT/STLExtras.h

samitolvanen added inline comments.Jul 30 2021, 11:21 AM

llvm/lib/Transforms/IPO/ThinLTOBitcodeWriter.cpp
39	Sure, I'll drop '$' and '@' from the list to play nice with XCOFF.
45	Maybe, but I don't see how they would make this function any cleaner. Did you have something specific in mind?

Also skip functions with names incompatible with XCOFF.

nickdesaulniers added inline comments.Jul 30 2021, 11:45 AM

llvm/lib/Transforms/IPO/ThinLTOBitcodeWriter.cpp
45	Something like? return any_of(Name, [](const char &C) { return isAlnum(C) \|\| C == '_' \|\| C == '.'; } or maybe we need !none_of(...)? (not sure if characters of a string can be enumerated this way)

Harbormaster completed remote builds in B117236: Diff 363166.Jul 30 2021, 12:20 PM

samitolvanen added inline comments.Jul 30 2021, 12:20 PM

llvm/lib/Transforms/IPO/ThinLTOBitcodeWriter.cpp
45	Since we want to ensure all the characters in the string match the predicate, I believe it would make sense to use `all_of()` instead. However, I don't see the point of introducing additional complexity to such a trivial function to shave off a couple of lines of code. While it might not actually matter here, using `all_of()` also seems to generate ~5x as many instructions to execute: https://godbolt.org/z/Pndfxj6rM If you feel like the current version is too long, I can drop one line by changing the loop to use: if (!isAlnum(C) && C != '_' && C != '.') return false; I initially wanted to keep the test identical to `MCAsmInfo::isAcceptableChar()` to make it easier to see it actually matches, but since it's no longer identical, I suppose that doesn't matter. Thoughts?

I think the new approach of skipping non C-ish identifier names is reasonable. Looks good to me, but wait for the more active reviewers to stamp it.

nickdesaulniers added inline comments.Jul 30 2021, 12:49 PM

llvm/lib/Transforms/IPO/ThinLTOBitcodeWriter.cpp
45	using all_of() also seems to generate ~5x as many instructions to execute: Hard to argue with that. Might be nice to add some test coverage of this code with an mscv target triple though.

nickdesaulniers accepted this revision.Jul 30 2021, 12:49 PM

Closed by commit rG7ce1c4da7726: ThinLTO: Fix inline assembly references to static functions with CFI (authored by samitolvanen). · Explain WhyAug 3 2021, 11:48 AM

This revision was automatically updated to reflect the committed changes.

samitolvanen added a commit: rG7ce1c4da7726: ThinLTO: Fix inline assembly references to static functions with CFI.

Mentioning it here in case others run into the same thing: We bisected a 7x (!) binary size regression to this. Details at https://bugs.chromium.org/p/chromium/issues/detail?id=1261715

samitolvanen mentioned this in D112761: cfi-icall: Add -fsanitize-cfi-promotion-aliases.Oct 28 2021, 3:08 PM

samitolvanen mentioned this in D113613: [ThinLTO][MC] Use conditional assignments for promotion aliases.Nov 10 2021, 2:12 PM

samitolvanen mentioned this in rG9a74c753fe3f: [ThinLTO][MC] Use conditional assignments for promotion aliases.Dec 10 2021, 12:33 PM

samitolvanen mentioned this in D119296: KCFI sanitizer.Apr 20 2022, 2:25 PM

Revision Contents

Path

Size

llvm/

lib/

Transforms/

IPO/

ThinLTOBitcodeWriter.cpp

21 lines

test/

ThinLTO/

X86/

devirt2.ll

4 lines

Transforms/

ThinLTOBitcodeWriter/

cfi-icall-static-inline-asm.ll

22 lines

split-internal2.ll

3 lines

split-vfunc-internal.ll

3 lines

Diff 363827

llvm/lib/Transforms/IPO/ThinLTOBitcodeWriter.cpp

	Show All 27 Lines
	#include "llvm/Transforms/IPO/FunctionImport.h"			#include "llvm/Transforms/IPO/FunctionImport.h"
	#include "llvm/Transforms/IPO/LowerTypeTests.h"			#include "llvm/Transforms/IPO/LowerTypeTests.h"
	#include "llvm/Transforms/Utils/Cloning.h"			#include "llvm/Transforms/Utils/Cloning.h"
	#include "llvm/Transforms/Utils/ModuleUtils.h"			#include "llvm/Transforms/Utils/ModuleUtils.h"
	using namespace llvm;			using namespace llvm;

	namespace {			namespace {

				// Determine if a promotion alias should be created for a symbol name.
				static bool allowPromotionAlias(const std::string &Name) {
				// Promotion aliases are used only in inline assembly. It's safe to
				// simply skip unusual names. Subset of MCAsmInfo::isAcceptableChar()
				nickdesaulniersUnsubmitted Not Done Reply Inline Actions What about `MCAsmInfoXCOFF::isAcceptableChar`? I guess we could be targeting a XCOFF object file format with LTO? nickdesaulniers: What about `MCAsmInfoXCOFF::isAcceptableChar`? I guess we could be targeting a XCOFF object…
				samitolvanenAuthorUnsubmitted Done Reply Inline Actions Sure, I'll drop '$' and '@' from the list to play nice with XCOFF. samitolvanen: Sure, I'll drop '$' and '@' from the list to play nice with XCOFF.
				// and MCAsmInfoXCOFF::isAcceptableChar().
				for (const char &C : Name) {
				if (isAlnum(C) \|\| C == '_' \|\| C == '.')
				continue;
				return false;
				}
				nickdesaulniersUnsubmitted Not Done Reply Inline Actions Can llvm::any_of or llvm::none_of be used here? llvm/ADT/STLExtras.h nickdesaulniers: Can llvm::any_of or llvm::none_of be used here? llvm/ADT/STLExtras.h
				samitolvanenAuthorUnsubmitted Not Done Reply Inline Actions Maybe, but I don't see how they would make this function any cleaner. Did you have something specific in mind? samitolvanen: Maybe, but I don't see how they would make this function any cleaner. Did you have something…
				nickdesaulniersUnsubmitted Not Done Reply Inline Actions Something like? return any_of(Name, [](const char &C) { return isAlnum(C) \|\| C == '_' \|\| C == '.'; } or maybe we need !none_of(...)? (not sure if characters of a string can be enumerated this way) nickdesaulniers: Something like? return any_of(Name, [](const char &C) { return isAlnum(C) \|\| C == '_' \|\| C ==…
				samitolvanenAuthorUnsubmitted Not Done Reply Inline Actions Since we want to ensure all the characters in the string match the predicate, I believe it would make sense to use `all_of()` instead. However, I don't see the point of introducing additional complexity to such a trivial function to shave off a couple of lines of code. While it might not actually matter here, using `all_of()` also seems to generate ~5x as many instructions to execute: https://godbolt.org/z/Pndfxj6rM If you feel like the current version is too long, I can drop one line by changing the loop to use: if (!isAlnum(C) && C != '_' && C != '.') return false; I initially wanted to keep the test identical to `MCAsmInfo::isAcceptableChar()` to make it easier to see it actually matches, but since it's no longer identical, I suppose that doesn't matter. Thoughts? samitolvanen: Since we want to ensure all the characters in the string match the predicate, I believe it…
				nickdesaulniersUnsubmitted Not Done Reply Inline Actions using all_of() also seems to generate ~5x as many instructions to execute: Hard to argue with that. Might be nice to add some test coverage of this code with an mscv target triple though. nickdesaulniers: > using all_of() also seems to generate ~5x as many instructions to execute: Hard to argue…
				return true;
				}

	// Promote each local-linkage entity defined by ExportM and used by ImportM by			// Promote each local-linkage entity defined by ExportM and used by ImportM by
	// changing visibility and appending the given ModuleId.			// changing visibility and appending the given ModuleId.
	void promoteInternals(Module &ExportM, Module &ImportM, StringRef ModuleId,			void promoteInternals(Module &ExportM, Module &ImportM, StringRef ModuleId,
	SetVector<GlobalValue *> &PromoteExtra) {			SetVector<GlobalValue *> &PromoteExtra) {
	DenseMap<const Comdat , Comdat > RenamedComdats;			DenseMap<const Comdat , Comdat > RenamedComdats;
	for (auto &ExportGV : ExportM.global_values()) {			for (auto &ExportGV : ExportM.global_values()) {
	if (!ExportGV.hasLocalLinkage())			if (!ExportGV.hasLocalLinkage())
	continue;			continue;

	auto Name = ExportGV.getName();			auto Name = ExportGV.getName();
	GlobalValue *ImportGV = nullptr;			GlobalValue *ImportGV = nullptr;
	if (!PromoteExtra.count(&ExportGV)) {			if (!PromoteExtra.count(&ExportGV)) {
	ImportGV = ImportM.getNamedValue(Name);			ImportGV = ImportM.getNamedValue(Name);
	if (!ImportGV)			if (!ImportGV)
	continue;			continue;
	ImportGV->removeDeadConstantUsers();			ImportGV->removeDeadConstantUsers();
	if (ImportGV->use_empty()) {			if (ImportGV->use_empty()) {
	ImportGV->eraseFromParent();			ImportGV->eraseFromParent();
	continue;			continue;
	}			}
	}			}

				std::string OldName = Name.str();
	std::string NewName = (Name + ModuleId).str();			std::string NewName = (Name + ModuleId).str();

	if (const auto *C = ExportGV.getComdat())			if (const auto *C = ExportGV.getComdat())
	if (C->getName() == Name)			if (C->getName() == Name)
	RenamedComdats.try_emplace(C, ExportM.getOrInsertComdat(NewName));			RenamedComdats.try_emplace(C, ExportM.getOrInsertComdat(NewName));

	ExportGV.setName(NewName);			ExportGV.setName(NewName);
	ExportGV.setLinkage(GlobalValue::ExternalLinkage);			ExportGV.setLinkage(GlobalValue::ExternalLinkage);
	ExportGV.setVisibility(GlobalValue::HiddenVisibility);			ExportGV.setVisibility(GlobalValue::HiddenVisibility);

	if (ImportGV) {			if (ImportGV) {
	ImportGV->setName(NewName);			ImportGV->setName(NewName);
	ImportGV->setVisibility(GlobalValue::HiddenVisibility);			ImportGV->setVisibility(GlobalValue::HiddenVisibility);
	}			}

				if (isa<Function>(&ExportGV) && allowPromotionAlias(OldName)) {
				// Create a local alias with the original name to avoid breaking
				// references from inline assembly.
				std::string Alias = ".set " + OldName + "," + NewName + "\n";
				nickdesaulniersUnsubmitted Not Done Reply Inline Actions Is there more information about "promotion aliases with x86_64-pc-windows-msvc" from D106392? Can we quote these only for msvc target triples? Can we add a comment about the quoting be necessary for those targets? It's still not clear to me how tests using explicit -linux-gnu triples could fail on -mscv hosts. nickdesaulniers: Is there more information about "promotion aliases with x86_64-pc-windows-msvc" from D106392?
				nickdesaulniersUnsubmitted Not Done Reply Inline Actions Also, I think the quoting hurts the readability of the generated asm. Maybe that doesn't matter for LTO, but I'd be curious if we could do such escaping only when necessary? Perhaps that's only when targeting -msvc triples? nickdesaulniers: Also, I think the quoting hurts the readability of the generated asm. Maybe that doesn't…
				rnkUnsubmitted Not Done Reply Inline Actions I was going to say, the proper way to do this is what `MCSymbol::print` does: https://github.com/llvm/llvm-project/blob/main/llvm/lib/MC/MCSymbol.cpp#L59 That code doesn't seem exhaustive, but at least it escapes quotes. We can't call MC from here due to library layering. The LLVM IR readability is poor, but inline asm in IR is already hard to read. I wouldn't worry about that, I'd mainly worry about the output of -S. Quoting always seems fine to me, I guess. rnk: I was going to say, the proper way to do this is what `MCSymbol::print` does: https://github.
				samitolvanenAuthorUnsubmitted Done Reply Inline Actions I was going to say, the proper way to do this is what `MCSymbol::print` does: https://github.com/llvm/llvm-project/blob/main/llvm/lib/MC/MCSymbol.cpp#L59 That code doesn't seem exhaustive, but at least it escapes quotes. We can't call MC from here due to library layering. Is it actually possible to have function names that contain quotes? If so, I suppose we need to do something similar here and escape any quotes in the names. The LLVM IR readability is poor, but inline asm in IR is already hard to read. I wouldn't worry about that, I'd mainly worry about the output of -S. The promotion happens only when we write bitcode, so the aliases won't yet exist in the `-S` output. samitolvanen: > I was going to say, the proper way to do this is what `MCSymbol::print` does: > https…
				samitolvanenAuthorUnsubmitted Done Reply Inline Actions Is there more information about "promotion aliases with x86_64-pc-windows-msvc" from D106392? Yes, the -msvc targets use Visual C++ compatible name mangling, which requires quotes when referred to in inline assembly. Here's a trivial reproducer: $ cat test.cpp static void a(void) {} void* b(void) { return (void )a; } $ clang -flto=thin -fvisibility=default -fsanitize=cfi -c test.cpp $ clang -flto=thin -fvisibility=default -fsanitize=cfi -target x86_64-pc-windows-msvc -c test.cpp Either SourceMgr should be available UNREACHABLE executed at llvm-project/llvm/lib/MC/MCContext.cpp:913! ... Here's the inline assembly generated when we compile the above example for the -msvc target: .set ?a@@YAXXZ,?a@@YAXXZ.d7b56b39ccc53bc7515ae1b2533f1e3d Can we quote these only for msvc target triples? Can we add a comment about the quoting be necessary for those targets? I assume we could limit this only to -msvc targets, but I feel like that would unnecessarily complicate the code as there's no harm in quoting the names always. It's still not clear to me how tests using explicit -linux-gnu triples could fail on -mscv hosts. These tests don't specify a `-linux-gnu` triple. They are C++ and end up using the default target, which in this case is `-msvc`. samitolvanen:* > Is there more information about "promotion aliases with x86_64-pc-windows-msvc" from D106392?
				ExportM.appendModuleInlineAsm(Alias);
				}
	}			}
				samitolvanenAuthorUnsubmitted Done Reply Inline Actions Note that I'm adding the alias to llvm.compiler.used because it's otherwise removed during optimization. Is there are better way to accomplish this? samitolvanen: Note that I'm adding the alias to llvm.compiler.used because it's otherwise removed during…
				pccUnsubmitted Not Done Reply Inline Actions Not as far as I know, that's what I'd recommend. pcc: Not as far as I know, that's what I'd recommend.
				nickdesaulniersUnsubmitted Not Done Reply Inline Actions Can you avoid making a copy of the OldName by doing the `appendToCompilerUsed` BEFORE making the dangling reference via `ExportGV.setName(NewName);`? nickdesaulniers: Can you avoid making a copy of the OldName by doing the `appendToCompilerUsed` BEFORE making…
				samitolvanenAuthorUnsubmitted Done Reply Inline Actions No, I have to rename the existing function before I can create an alias with the same name, and as `ExportGV.setName()` invalidates `Name`, I need to create a copy first. samitolvanen: No, I have to rename the existing function before I can create an alias with the same name, and…

	if (!RenamedComdats.empty())			if (!RenamedComdats.empty())
	for (auto &GO : ExportM.global_objects())			for (auto &GO : ExportM.global_objects())
	if (auto *C = GO.getComdat()) {			if (auto *C = GO.getComdat()) {
	auto Replacement = RenamedComdats.find(C);			auto Replacement = RenamedComdats.find(C);
	if (Replacement != RenamedComdats.end())			if (Replacement != RenamedComdats.end())
	GO.setComdat(Replacement->second);			GO.setComdat(Replacement->second);
	}			}
	▲ Show 20 Lines • Show All 498 Lines • Show Last 20 Lines

llvm/test/ThinLTO/X86/devirt2.ll

	Show First 20 Lines • Show All 125 Lines • ▼ Show 20 Lines
	; RUN: -r=%t1.o,_ZTV1B, \			; RUN: -r=%t1.o,_ZTV1B, \
	; RUN: -r=%t1.o,_ZTV1C, \			; RUN: -r=%t1.o,_ZTV1C, \
	; RUN: -r=%t1.o,_ZTV1D, \			; RUN: -r=%t1.o,_ZTV1D, \
	; RUN: -r=%t1.o,_ZTV1D, \			; RUN: -r=%t1.o,_ZTV1D, \
	; RUN: -r=%t1.o,_ZN1D1mEi, \			; RUN: -r=%t1.o,_ZN1D1mEi, \
	; RUN: -r=%t1.o,_ZN1D1mEi, \			; RUN: -r=%t1.o,_ZN1D1mEi, \
	; RUN: -r=%t1.o,test2, \			; RUN: -r=%t1.o,test2, \
	; RUN: -r=%t2.o,_ZN1A1nEi,p \			; RUN: -r=%t2.o,_ZN1A1nEi,p \
				; RUN: -r=%t2.o,_ZN1A1nEi, \
	; RUN: -r=%t2.o,_ZN1B1fEi,p \			; RUN: -r=%t2.o,_ZN1B1fEi,p \
	; RUN: -r=%t2.o,_ZN1C1fEi,p \			; RUN: -r=%t2.o,_ZN1C1fEi,p \
	; RUN: -r=%t2.o,_ZN1D1mEi,p \			; RUN: -r=%t2.o,_ZN1D1mEi,p \
	; RUN: -r=%t2.o,_ZN1E1mEi,p \			; RUN: -r=%t2.o,_ZN1E1mEi,p \
				; RUN: -r=%t2.o,_ZN1E1mEi, \
	; RUN: -r=%t2.o,_ZTV1B, \			; RUN: -r=%t2.o,_ZTV1B, \
	; RUN: -r=%t2.o,_ZTV1C, \			; RUN: -r=%t2.o,_ZTV1C, \
	; RUN: -r=%t2.o,_ZTV1D, \			; RUN: -r=%t2.o,_ZTV1D, \
	; RUN: -r=%t2.o,_ZTV1E, \			; RUN: -r=%t2.o,_ZTV1E, \
	; RUN: -r=%t2.o,test2,px \			; RUN: -r=%t2.o,test2,px \
	; RUN: -r=%t2.o,_ZN1A1nEi, \			; RUN: -r=%t2.o,_ZN1A1nEi, \
	; RUN: -r=%t2.o,_ZN1B1fEi, \			; RUN: -r=%t2.o,_ZN1B1fEi, \
	; RUN: -r=%t2.o,_ZN1C1fEi, \			; RUN: -r=%t2.o,_ZN1C1fEi, \
	Show All 16 Lines
	; RUN: -r=%t1.o,_ZTV1B, \			; RUN: -r=%t1.o,_ZTV1B, \
	; RUN: -r=%t1.o,_ZTV1C, \			; RUN: -r=%t1.o,_ZTV1C, \
	; RUN: -r=%t1.o,_ZTV1D, \			; RUN: -r=%t1.o,_ZTV1D, \
	; RUN: -r=%t1.o,_ZTV1D, \			; RUN: -r=%t1.o,_ZTV1D, \
	; RUN: -r=%t1.o,_ZN1D1mEi, \			; RUN: -r=%t1.o,_ZN1D1mEi, \
	; RUN: -r=%t1.o,_ZN1D1mEi, \			; RUN: -r=%t1.o,_ZN1D1mEi, \
	; RUN: -r=%t1.o,test2, \			; RUN: -r=%t1.o,test2, \
	; RUN: -r=%t2.o,_ZN1A1nEi,p \			; RUN: -r=%t2.o,_ZN1A1nEi,p \
				; RUN: -r=%t2.o,_ZN1A1nEi, \
	; RUN: -r=%t2.o,_ZN1B1fEi,p \			; RUN: -r=%t2.o,_ZN1B1fEi,p \
	; RUN: -r=%t2.o,_ZN1C1fEi,p \			; RUN: -r=%t2.o,_ZN1C1fEi,p \
	; RUN: -r=%t2.o,_ZN1D1mEi,p \			; RUN: -r=%t2.o,_ZN1D1mEi,p \
	; RUN: -r=%t2.o,_ZN1E1mEi,p \			; RUN: -r=%t2.o,_ZN1E1mEi,p \
				; RUN: -r=%t2.o,_ZN1E1mEi, \
	; RUN: -r=%t2.o,_ZTV1B, \			; RUN: -r=%t2.o,_ZTV1B, \
	; RUN: -r=%t2.o,_ZTV1C, \			; RUN: -r=%t2.o,_ZTV1C, \
	; RUN: -r=%t2.o,_ZTV1D, \			; RUN: -r=%t2.o,_ZTV1D, \
	; RUN: -r=%t2.o,_ZTV1E, \			; RUN: -r=%t2.o,_ZTV1E, \
	; RUN: -r=%t2.o,test2,px \			; RUN: -r=%t2.o,test2,px \
	; RUN: -r=%t2.o,_ZN1A1nEi, \			; RUN: -r=%t2.o,_ZN1A1nEi, \
	; RUN: -r=%t2.o,_ZN1B1fEi, \			; RUN: -r=%t2.o,_ZN1B1fEi, \
	; RUN: -r=%t2.o,_ZN1C1fEi, \			; RUN: -r=%t2.o,_ZN1C1fEi, \
	▲ Show 20 Lines • Show All 104 Lines • Show Last 20 Lines

llvm/test/Transforms/ThinLTOBitcodeWriter/cfi-icall-static-inline-asm.ll

This file was added.

				; REQUIRES: x86-registered-target
				; RUN: opt -thinlto-bc -thinlto-split-lto-unit -o - %s \| llvm-modextract -b -n 0 -o - \| llvm-dis \| FileCheck %s

				target triple = "x86_64-unknown-linux-gnu"

				; CHECK: module asm ".set a,a.[[HASH:[0-9a-f]+]]"

				define void @b() {
				%f = alloca void ()*, align 8
				; CHECK: store{{.}} @a.[[HASH]],{{.}} %f
				store void ()* @a, void ()** %f, align 8
				; CHECK: %1 = call void ()* asm sideeffect "leaq a(%rip)
				%1 = call void ()* asm sideeffect "leaq a(%rip), $0\0A\09", "=r,~{dirflag},~{fpsr},~{flags}"()
				ret void
				}

				; CHECK: define{{.}} @a.[[HASH]](){{.}} !type
				define internal void @a() !type !0 {
				ret void
				}

				!0 = !{i64 0, !"typeid1"}

llvm/test/Transforms/ThinLTOBitcodeWriter/split-internal2.ll

				; REQUIRES: x86-registered-target
	; RUN: opt -thinlto-bc -thinlto-split-lto-unit -o %t %s			; RUN: opt -thinlto-bc -thinlto-split-lto-unit -o %t %s
	; RUN: llvm-modextract -b -n 0 -o %t0 %t			; RUN: llvm-modextract -b -n 0 -o %t0 %t
	; RUN: llvm-modextract -b -n 1 -o %t1 %t			; RUN: llvm-modextract -b -n 1 -o %t1 %t
	; RUN: not llvm-modextract -b -n 2 -o - %t 2>&1 \| FileCheck --check-prefix=ERROR %s			; RUN: not llvm-modextract -b -n 2 -o - %t 2>&1 \| FileCheck --check-prefix=ERROR %s
	; RUN: llvm-dis -o - %t0 \| FileCheck --check-prefix=M0 %s			; RUN: llvm-dis -o - %t0 \| FileCheck --check-prefix=M0 %s
	; RUN: llvm-dis -o - %t1 \| FileCheck --check-prefix=M1 %s			; RUN: llvm-dis -o - %t1 \| FileCheck --check-prefix=M1 %s
	; RUN: llvm-bcanalyzer -dump %t0 \| FileCheck --check-prefix=BCA0 %s			; RUN: llvm-bcanalyzer -dump %t0 \| FileCheck --check-prefix=BCA0 %s
	; RUN: llvm-bcanalyzer -dump %t1 \| FileCheck --check-prefix=BCA1 %s			; RUN: llvm-bcanalyzer -dump %t1 \| FileCheck --check-prefix=BCA1 %s

				target triple = "x86_64-unknown-linux-gnu"

	; ERROR: llvm-modextract: error: module index out of range; bitcode file contains 2 module(s)			; ERROR: llvm-modextract: error: module index out of range; bitcode file contains 2 module(s)

	; BCA0: <GLOBALVAL_SUMMARY_BLOCK			; BCA0: <GLOBALVAL_SUMMARY_BLOCK
	; BCA1-NOT: <GLOBALVAL_SUMMARY_BLOCK			; BCA1-NOT: <GLOBALVAL_SUMMARY_BLOCK

	; M0: @g = external global void ()*{{$}}			; M0: @g = external global void ()*{{$}}
	; M1: @g = global void ()* @f.13757e0fb71915e385efa4dc9d1e08fd, !type !0			; M1: @g = global void ()* @f.13757e0fb71915e385efa4dc9d1e08fd, !type !0
	@g = global void ()* @f, !type !0			@g = global void ()* @f, !type !0
	Show All 15 Lines

llvm/test/Transforms/ThinLTOBitcodeWriter/split-vfunc-internal.ll

				; REQUIRES: x86-registered-target
	; RUN: opt -thinlto-bc -thinlto-split-lto-unit -o %t %s			; RUN: opt -thinlto-bc -thinlto-split-lto-unit -o %t %s
	; RUN: llvm-modextract -b -n 0 -o - %t \| llvm-dis \| FileCheck --check-prefix=M0 %s			; RUN: llvm-modextract -b -n 0 -o - %t \| llvm-dis \| FileCheck --check-prefix=M0 %s
	; RUN: llvm-modextract -b -n 1 -o - %t \| llvm-dis \| FileCheck --check-prefix=M1 %s			; RUN: llvm-modextract -b -n 1 -o - %t \| llvm-dis \| FileCheck --check-prefix=M1 %s

				target triple = "x86_64-unknown-linux-gnu"

	define [1 x i8] @source() {			define [1 x i8] @source() {
	ret [1 x i8] @g			ret [1 x i8] @g
	}			}

	; M0: @g.84f59439b469192440047efc8de357fb = external hidden constant [1 x i8*]{{$}}			; M0: @g.84f59439b469192440047efc8de357fb = external hidden constant [1 x i8*]{{$}}
	; M1: @g.84f59439b469192440047efc8de357fb = hidden constant [1 x i8] [i8 bitcast (i64 (i8) @ok.84f59439b469192440047efc8de357fb to i8*)]			; M1: @g.84f59439b469192440047efc8de357fb = hidden constant [1 x i8] [i8 bitcast (i64 (i8) @ok.84f59439b469192440047efc8de357fb to i8*)]
	@g = internal constant [1 x i8*] [			@g = internal constant [1 x i8*] [
	i8* bitcast (i64 (i8) @ok to i8*)			i8* bitcast (i64 (i8) @ok to i8*)
	Show All 9 Lines

This is an archive of the discontinued LLVM Phabricator instance.

ThinLTO: Fix inline assembly references to static functions with CFIClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 363827

llvm/lib/Transforms/IPO/ThinLTOBitcodeWriter.cpp

llvm/test/ThinLTO/X86/devirt2.ll

llvm/test/Transforms/ThinLTOBitcodeWriter/cfi-icall-static-inline-asm.ll

llvm/test/Transforms/ThinLTOBitcodeWriter/split-internal2.ll

llvm/test/Transforms/ThinLTOBitcodeWriter/split-vfunc-internal.ll

ThinLTO: Fix inline assembly references to static functions with CFI
ClosedPublic