This is an archive of the discontinued LLVM Phabricator instance.

Differential D112059

Fix inline builtin handling in case of redefinition
Closed · Public

Authored by serge-sans-paille on Oct 19 2021, 2:33 AM.

Details

Reviewers

nickdesaulniers
manojgupta
aaron.ballman

Commits

rG6bfc85c217e4: Fix inline builtin handling in case of redefinition

Summary

Basically, inline builtin definitions are shadowed by an externally visible
redefinition. This matches GCC behavior.

Diff Detail

Unit Tests: Failed

Time	Test
2,190 ms	x64 debian > SanitizerCommon-asan-x86_64-Linux.SanitizerCommon-asan-x86_64-Linux::onprint.cpp
660 ms	x64 debian > SanitizerCommon-lsan-x86_64-Linux.SanitizerCommon-lsan-x86_64-Linux::onprint.cpp
970 ms	x64 debian > SanitizerCommon-msan-x86_64-Linux.SanitizerCommon-msan-x86_64-Linux::onprint.cpp
2,029 ms	x64 debian > SanitizerCommon-tsan-x86_64-Linux.SanitizerCommon-tsan-x86_64-Linux::onprint.cpp
1,310 ms	x64 debian > SanitizerCommon-ubsan-x86_64-Linux.SanitizerCommon-ubsan-x86_64-Linux::onprint.cpp

Event Timeline

serge-sans-paille requested review of this revision. · Oct 19 2021, 2:33 AM

serge-sans-paille created this revision.

Herald added a project: Restricted Project. · Oct 19 2021, 2:33 AM

Herald added a subscriber: cfe-commits.

serge-sans-paille mentioned this in D111009: Update inline builtin handling to honor gnu inline attribute. · Oct 19 2021, 2:34 AM

Harbormaster completed remote builds in B129502: Diff 380619. · Oct 19 2021, 3:01 AM

thanks, I can verify that it fixes the crash we were seeing.

Yes; GCC does behave this way. It does not consider a non-gnu-inline redefinition an error, and it does seem to prefer the non-gnu-inline redeclaration when both are present, AFAICT. The test is verifying that behavior correctly. This patch fixes the test case, the reduced cases reported by @manojgupta (link), @nathanchance (link), and myself (link), and the kernel builds (link).

Further, I ran build and boot tests of:

mainline x86_64 + CONFIG_FORTIFY_SOURCE=y
mainline x86_64 + CONFIG_FORTIFY_SOURCE=y + CONFIG_LTO_CLANG_FULL=y
mainline x86_64 + CONFIG_FORTIFY_SOURCE=y + CONFIG_LTO_CLANG_THIN=y

clang/lib/CodeGen/CodeGenFunction.cpp
1318	Do we really want to be iterating over every redeclaration like this, even for non-inline builtin declarations? Is there a way to avoid the loop below for most functions? Perhaps we should be doing this when we hit a redeclaration instead? Or wrap all this work in a check that there's a builtin ID associated with the FD?
1328	If there are multiple redeclarations, do we want to be erasing the clone each time, or can we `break` out of this loop?
clang/test/CodeGen/strlen-inline-builtin-redecl.c
11–13	Has this example been formatted? Does rotating the attributes to the front help?
43	I think this line can be dropped without affecting the test. Or consider using any of the further reduced test cases?

Reduce the number of times we would walk redecls.
Simplify test case

Avoid walking redecls

serge-sans-paille marked 4 inline comments as done. · Oct 20 2021, 10:31 AM

ychen added a subscriber: ychen. · Oct 20 2021, 10:54 AM

Harbormaster completed remote builds in B129776: Diff 381014. · Oct 20 2021, 11:11 AM

nickdesaulniers added subscribers: aaron.ballman, rsmith. · Oct 20 2021, 11:42 AM

nickdesaulniers added inline comments.

clang/lib/CodeGen/CodeGenFunction.cpp
1301–1302	I don't think we want to do all this work whenever `Fn` is non-null; i.e., create a new `std::string` with the `.inline` suffix for every function we're going to generate code (IR) for. How about we add an additional unlikely guard: `if (FD->getBuiltinID() && Fn) {` Because in the usual case, `FD` both has a builtin ID and is an inline builtin declaration, while in the exceptional case that this patch addresses, `FD` has a builtin ID but is not an inline builtin declaration.
1315–1317	Perhaps in `Sema::CheckFunctionDeclaration`? I see there is where we detect redeclarations. The calls from there to `FunctionDecl::setPreviousDeclaration()` seem to set up the redecl chain. Perhaps this exceptional case (or both cases, even) would be handled better there? cc @rsmith @aaron.ballman in case they have feedback/tips/cycles.
clang/test/CodeGen/strlen-inline-builtin-redecl.c
9	unused decl
11	Do you mind wrapping this to 80 chars wide? I suspect if you put the two `__attribute__`s first, then the formatter will do a better job. You can also combine these, a la `__attribute__((always_inline, gnu_inline))` to cut down on line length.

aaron.ballman added inline comments. · Oct 21 2021, 5:11 AM

clang/lib/CodeGen/CodeGenFunction.cpp
1301–1302	Is it correct to gate this on whether it's a builtin or not? I thought that builtin-like (e.g., the usual pile of attributes) user code should also have the same effect, shouldn't it?
1315–1317	I don't know that it's a good idea to modify the redeclaration chain in this case. The comments on the chain are pretty clear that it's a temporal chain where "previous" means previously declared in relation to the current declaration. @rsmith may feel differently, however.

Bumping for an update here. We can tolerate a build breakage for our older kernels over the weekend, but we should really try to get this resolved by EOW, otherwise we need to look into reverting:

3d6f49a56995b845c40be5827ded5d1e3f692cec Tue Sep 28 13:24:25 2021 +0200 (breakage)
bd379915de38a9af3d65e19075a6a64ebbb8d6db Tue Sep 28 16:07:33 2021 +0200 (attempted fix forward of 3d6f49a56995b845)
0d76d4833dd2815e0b1c786250f474d222f6a0a1 Tue Sep 28 11:30:37 2021 -0700 (revert of 3d6f49a56995b845)
c3717b6858d32d64514a187ede1a77be8ba4e542 Tue Sep 28 21:00:47 2021 +0200 (reland, introduced kernel breakage)
0f0e31cf511def3e92244e615b2646c1fd0df0cd Mon Oct 4 22:26:25 2021 +0200 (fix forward)

clang/lib/CodeGen/CodeGenFunction.cpp
1301–1302	What do you mean? I'm sorry, I don't quite follow.
1315–1317	Sorry, I don't quite follow whether you're approving of the current approach or dismissing it?

aaron.ballman added inline comments. · Oct 25 2021, 11:02 AM

clang/lib/CodeGen/CodeGenFunction.cpp
1301–1302	From the test cases below:

extern inline __attribute__((always_inline)) __attribute__((gnu_inline)) unsigned long strlen(const char *p) { return 1; }
unsigned long mystrlen(char const *s) { return strlen(s); }
unsigned long strlen(const char *s) { return 2; }

These redeclarations are resolved a particular way by GCC and this patch is intending to match that behavior. My question ultimately boils down to whether that also happens for this code, where the function is not a builtin but looks like one due to the attributes:

extern inline __attribute__((always_inline)) __attribute__((gnu_inline)) unsigned long wahoo(const char *p) { return 1; }
unsigned long mywahoo(char const *s) { return wahoo(s); }
unsigned long wahoo(const char *s) { return 2; }

If this also reorders, then I don't think we can look at whether `FD->getBuiltinID() != 0` to decide whether to do the reordering dance because arbitrary user functions aren't Clang builtins and so they'd not get the correct behavior. Does that make sense?
1315–1317	I don't think we should modify the redecl chain from `CheckFunctionDeclaration()` -- this case would create a redeclaration chain whose previous link was not temporally the previous declaration. There might be another approach so we can avoid `replaceAllUsesWith()`. One possibility (no idea how feasible or what explodes) is to modify `FunctionDecl::getDefinition()` to look through the chain to return the best definition when there are multiple definitions to pick from.

nickdesaulniers added inline comments. · Oct 25 2021, 11:26 AM

clang/lib/CodeGen/CodeGenFunction.cpp
1301–1302	> If this also reorders

It does; https://godbolt.org/z/bbrox7f6e.

> Does that make sense?

Yes, thanks for the additional info. In this case, I guess we can disregard my feedback that started this thread, marking it as done? Perhaps @serge-sans-paille should add such a non-builtin test case as part of the change?
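For readers following the thread, the non-builtin redeclaration pattern can be sketched as a small standalone C file. This is only an illustration, assuming GCC or Clang (the `gnu_inline` attribute is a GNU extension); it drops `always_inline` for simplicity, and the names `call_before`, `call_after`, and `check_redefinition_wins` are hypothetical, not taken from the patch.

```c
#include <assert.h>

/* gnu_inline "extern inline" definition: only a candidate body for inlining;
 * no standalone function is emitted for it. */
extern inline __attribute__((gnu_inline)) unsigned long wahoo(const char *p) {
  (void)p;
  return 1;
}

/* Caller that appears before the redefinition. */
unsigned long call_before(const char *s) { return wahoo(s); }

/* Externally visible redefinition: per the thread, this is the body that
 * GCC prefers once both definitions are present. */
unsigned long wahoo(const char *s) {
  (void)s;
  return 2;
}

/* Caller that appears after the redefinition. */
unsigned long call_after(const char *s) { return wahoo(s); }

/* Hypothetical self-check: a call made after the redefinition sees the
 * externally visible body. */
void check_redefinition_wins(void) { assert(call_after("x") == 2); }
```

According to the discussion above, `call_before` is also expected to resolve to the externally visible definition, which is the "reordering" being asked about.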

aaron.ballman added inline comments. · Oct 25 2021, 11:59 AM

clang/lib/CodeGen/CodeGenFunction.cpp
1301–1302	I think you have a valid concern about the extra allocations, but I'm not certain of a better predicate to use. My intuition is that the overhead here won't be unacceptable, but it'd be good to know we're not regressing compile time performance significantly. Additional test coverage with a comment as to why we're testing it is a good idea!

nickdesaulniers added inline comments. · Oct 25 2021, 12:21 PM

clang/lib/CodeGen/CodeGenFunction.cpp
1301–1302	Perhaps we can first test whether this FunctionDecl is a redecl, then do the allocation, then check for the `.inline` suffix? That way we avoid creating the new string unless we're codegen'ing a redecl, which should be uncommon in practice.

aaron.ballman added inline comments. · Oct 25 2021, 12:26 PM

clang/lib/CodeGen/CodeGenFunction.cpp
1301–1302	That could save us a bit of perf, but I think redecls are actually quite common because a definition is itself a declaration, so having a decl in a header file and defn in a source file will produce a redecl chain.

nickdesaulniers added inline comments. · Oct 25 2021, 12:43 PM

clang/lib/CodeGen/CodeGenFunction.cpp
1301–1302	Ah, ok then.

> Additional test coverage with a comment as to why we're testing it is a good idea!

Yeah, with that and fixes to the other small nits in the test case, then I think this is ready to land.

Add a test case to ensure we keep the right behavior for non-intrinsic gnu inline
walk the redecl chain before doing an extra string alloc

Formatting nits

LGTM, though codegen is not my area of expertise.

This revision is now accepted and ready to land. · Oct 26 2021, 8:50 AM

Harbormaster completed remote builds in B130721: Diff 382344. · Oct 26 2021, 9:15 AM

Thank you for fixing this terrible edge case; LGTM.

Actually, it looks like:

diff --git a/clang/lib/Sema/SemaDecl.cpp b/clang/lib/Sema/SemaDecl.cpp
index 69d2ef631872..8e77cdef2ed5 100644
--- a/clang/lib/Sema/SemaDecl.cpp
+++ b/clang/lib/Sema/SemaDecl.cpp
@@ -10927,6 +10927,10 @@ bool Sema::CheckFunctionDeclaration(Scope *S, FunctionDecl *NewFD,
   }
 
   if (Redeclaration) {
+    if (cast<FunctionDecl>(OldDecl)->isInlineBuiltinDeclaration() && !NewFD->isInlineBuiltinDeclaration()) {
+      // Set a flag on NewFD that it's a shadowed gnu_inline that should be
+      // emitted, or on OldDecl that it should not be emitted?
+    }
     // NewFD and OldDecl represent declarations that need to be
     // merged.
     if (MergeFunctionDecl(NewFD, OldDecl, S, MergeTypeWithPrevious)) {

might detect what you need. Can we use those checks there (where we have already detected a redeclaration) to set new members (to be added) on FunctionDecl that record whether a declaration should be emitted in this weird case? Then CodeGenFunction::GenerateCode can simply check that flag, then erase the existing function (or not generate it in the first place, perhaps by setting a flag on OldDecl). No redecl walking for every FunctionDecl required.
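The two strategies being weighed here (walk the redeclaration chain at codegen time, or record a flag once when the redeclaration is detected) can be contrasted with a toy model. This is a sketch only: `struct toy_decl` and every function below are hypothetical stand-ins written for illustration, not Clang's `FunctionDecl` or its API.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Toy stand-in for a function declaration; NOT Clang's FunctionDecl. */
struct toy_decl {
  const struct toy_decl *previous; /* earlier declaration, or NULL */
  bool is_inline_builtin;          /* models isInlineBuiltinDeclaration() */
  bool shadows_inline_builtin;     /* flag recorded at redeclaration time */
};

/* Approach A: at codegen time, walk the chain of earlier declarations and
 * look for an inline builtin definition that this declaration shadows. */
bool shadows_by_walking(const struct toy_decl *d) {
  if (d->is_inline_builtin)
    return false;
  for (const struct toy_decl *p = d->previous; p != NULL; p = p->previous)
    if (p->is_inline_builtin)
      return true;
  return false;
}

/* Approach B: when the redeclaration is detected, compute the answer once
 * and store it, so codegen only has to read a single bit. */
void link_redeclaration(struct toy_decl *new_decl,
                        const struct toy_decl *old_decl) {
  new_decl->previous = old_decl;
  new_decl->shadows_inline_builtin =
      !new_decl->is_inline_builtin &&
      (old_decl->is_inline_builtin || old_decl->shadows_inline_builtin);
}

/* Hypothetical self-check: both approaches agree on a two-element chain. */
void check_toy_model(void) {
  struct toy_decl inline_def = {NULL, true, false};
  struct toy_decl redefinition = {NULL, false, false};
  link_redeclaration(&redefinition, &inline_def);
  assert(shadows_by_walking(&redefinition));
  assert(redefinition.shadows_inline_builtin);
}
```

The trade-off in the toy model is the same one debated in this review: approach A costs a walk per generated function, while approach B costs one extra bit of storage per declaration.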

nickdesaulniers added inline comments. · Oct 26 2021, 4:21 PM

clang/test/CodeGen/user-func-gnu-inline-redecl.c
20 ↗	(On Diff #382344)	This test passes before this patch is applied; I wonder if we have existing coverage in tree for this case? Surprisingly, I don't think we do. Perhaps `gnu-inline-redecl.c` might be a more concise test name? I can't shake the feeling that the builtin id stuff is a degenerate case of how GCC treats redeclarations of extern inline (gnu_inline) functions and that perhaps by solving just that, fixing the case of builtins might just "fall out" from that.

Use a FunctionDecl Attribute to store the shadowed inline redecl status

serge-sans-paille added inline comments. · Oct 27 2021, 7:48 AM

clang/test/CodeGen/user-func-gnu-inline-redecl.c
20 ↗	(On Diff #382344)	Clang indeed naturally handles gnu_inline in a decent way. The problem we're trying to solve now is a side effect of the premature renaming of function call site when we think it's a direct call to inline builtin. I've updated the implementation to avoid walking redecls.

In D112059#3090466, @serge-sans-paille wrote:

Use a FunctionDecl Attribute to store the shadowed inline redecl status

The downside to this approach is that we can now handle fewer ctor initializers because we need to steal a bit from there. We're going from allowing 2,097,152 ctor inits to 1,048,576, which is still a pretty large number of ctor inits, so I think this new approach is defensible. However, this does make it that much harder to add any new bits in the future. I agree that walking the redecl chain could potentially cause a performance issue, but that's speculative without some measurements. Because we don't have those measurements, I'm actually a bit more comfortable with walking the redecl chain than the current approach -- if we measure performance and find that walking this chain is a bottleneck, then it makes it more obvious that the new approach is worthwhile.
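The bit budget behind those numbers can be checked with a few constants. This is a sketch rather than Clang code: the field widths 27/28 and 21/20 come from the DeclBase.h hunk in this revision, while the 13 bits attributed to `DeclContext` is only inferred from the source comment that the constructor bitfields take exactly 64 bits. All identifiers below are hypothetical.

```c
#include <assert.h>
#include <stdint.h>

enum {
  TOTAL_BITS = 64,
  FUNCTION_DECL_BITS_BEFORE = 27,
  FUNCTION_DECL_BITS_AFTER = 28,
  CTOR_INIT_BITS_BEFORE = 21,
  CTOR_INIT_BITS_AFTER = 20,
  /* IsInheritingConstructor, HasTrailingExplicitSpecifier, IsSimpleExplicit */
  CTOR_FLAG_BITS = 3,
  /* Inferred from "exactly 64 bits", not quoted from the patch. */
  DECL_CONTEXT_BITS = TOTAL_BITS - FUNCTION_DECL_BITS_BEFORE -
                      CTOR_INIT_BITS_BEFORE - CTOR_FLAG_BITS
};

/* Number of distinct values an unsigned bit-field of `width` bits can hold. */
uint64_t bitfield_value_count(unsigned width) { return UINT64_C(1) << width; }

/* Adding one FunctionDecl bit forces NumCtorInitializers to give one up. */
static_assert(DECL_CONTEXT_BITS + FUNCTION_DECL_BITS_AFTER +
                      CTOR_INIT_BITS_AFTER + CTOR_FLAG_BITS == TOTAL_BITS,
              "bitfields must still fit in 64 bits");
static_assert((UINT64_C(1) << CTOR_INIT_BITS_BEFORE) == 2097152,
              "21-bit field holds 2,097,152 values");
static_assert((UINT64_C(1) << CTOR_INIT_BITS_AFTER) == 1048576,
              "20-bit field holds 1,048,576 values");
```

Each bit taken from `NumCtorInitializers` halves the representable count, which is why further additions become progressively harder to justify.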

Harbormaster completed remote builds in B130960: Diff 382671. · Oct 27 2021, 9:11 AM

I second @aaron.ballman there. I compiled the sqlite3.c amalgamation at -O0 with both approaches, measuring the number of instructions as gathered by valgrind --tool=callgrind:

when walking redecls: 9001630039 instructions (9001628850 after I changed the implementation a bit)
when storing redecl state: 9000816370 instructions

Here's a [hastily and poorly written] script to measure the average cycle counts for 30 invocations using linux perf: https://gist.github.com/nickdesaulniers/4a20ba10c26ac2ad02cb0425b8b0f826

For Diff 382671 (latest; storing redecl state), builds of the linux kernel x86_64 defconfig+CONFIG_FORTIFY_SOURCE=y:

$ /tmp/measure_30.sh 'make LLVM=1 -j72' 'make LLVM=1 -j72 clean'
...
Average of 30 runs: 10780685107280.83 cycles

For Diff 382344 (earlier; walking redecl chain), builds of the linux kernel x86_64 defconfig+CONFIG_FORTIFY_SOURCE=y:

$ /tmp/measure_30.sh 'make LLVM=1 -j72' 'make LLVM=1 -j72 clean'
...
Average of 30 runs: 10745227016663.00 cycles

Damn, so what I proposed was slower, at least for the major case that I care about. I suspect that perhaps there's more forward declarations than actual functions we end up generating IR for, perhaps...either way, I'm sorry for suggesting the "storing redecl state" approach. If we want to go back to the earlier version in Diff 382344, at this point, I'd be happy to accept that revision. Sorry for causing whiplash on this.
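To put the measurements in proportion, the relative differences can be computed directly from the figures quoted in this thread. The helper below is a hypothetical sketch; only the four constants come from the comments above.

```c
#include <assert.h>

/* Relative change of `candidate` with respect to `baseline`
 * (0.01 means the candidate is 1% larger). */
double relative_change(double baseline, double candidate) {
  return (candidate - baseline) / baseline;
}

/* Linux kernel builds, average cycles over 30 runs. */
const double kernel_cycles_walking = 10745227016663.00; /* Diff 382344 */
const double kernel_cycles_flag = 10780685107280.83;    /* Diff 382671 */

/* sqlite3.c at -O0, callgrind instruction counts. */
const double sqlite_instr_walking = 9001630039.0;
const double sqlite_instr_flag = 9000816370.0;

/* Hypothetical self-check: the stored-flag build used more cycles on the
 * kernel and slightly fewer instructions on sqlite3.c. */
void check_measurements(void) {
  assert(relative_change(kernel_cycles_walking, kernel_cycles_flag) > 0.0);
  assert(relative_change(sqlite_instr_walking, sqlite_instr_flag) < 0.0);
}
```

With these inputs the stored-flag version comes out at roughly 0.33% more cycles on the kernel build and roughly 0.009% fewer instructions on sqlite3.c, so both differences are small and they point in opposite directions.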

Thank you for measuring this as well! And no worries on the whiplash (easy for me to say as a reviewer, hah) -- I think it was a reasonable thought to explore and measure the performance of. FWIW, I'd be happy accepting the earlier revision as well.

Re-uploading previous version that walks redef, with a slight change in the walking algorithm.

Harbormaster completed remote builds in B131350: Diff 383241. · Oct 29 2021, 12:53 AM

LGTM aside from a formatting nit. I don't think the precommit CI failures are related to your patch from what I was seeing, but may be worth keeping an eye on once you land just in case.

clang/lib/CodeGen/CodeGenFunction.cpp
1323	Please fix the formatting.

thanks again for all of the work that went into this!

This revision was landed with ongoing or failed builds. · Nov 2 2021, 1:54 AM

Closed by commit rG6bfc85c217e4: Fix inline builtin handling in case of redefinition (authored by serge-sans-paille).

This revision was automatically updated to reflect the committed changes.

serge-sans-paille added a commit: rG6bfc85c217e4: Fix inline builtin handling in case of redefinition.

Revision Contents

Path	Size
clang/include/clang/AST/Decl.h	10 lines
clang/include/clang/AST/DeclBase.h	7 lines
clang/lib/AST/Decl.cpp	1 line
clang/lib/CodeGen/CodeGenFunction.cpp	38 lines
clang/lib/Sema/SemaDecl.cpp	5 lines
clang/test/CodeGen/gnu-inline-redecl.c	20 lines
clang/test/CodeGen/strlen-inline-builtin-redecl.c	20 lines

Diff 382671

clang/include/clang/AST/Decl.h

@@ ... @@ public:
   /// Determine whether the function was declared in source context
   /// that requires constrained FP intrinsics
   bool UsesFPIntrin() const { return FunctionDeclBits.UsesFPIntrin; }
 
   /// Set whether the function was declared in source context
   /// that requires constrained FP intrinsics
   void setUsesFPIntrin(bool I) { FunctionDeclBits.UsesFPIntrin = I; }
 
+  /// Determine whether the function shadows an inline builtin definition.
+  bool shadowsGNUInlineIntrinsic() const {
+    return FunctionDeclBits.ShadowsGNUInlineIntrinsic;
+  }
+
+  /// Set whether the function shadows an inline builtin definition.
+  void setShadowsGNUInlineIntrinsic(bool I) {
+    FunctionDeclBits.ShadowsGNUInlineIntrinsic = I;
+  }
+
   /// Flag that this function is implicitly inline.
   void setImplicitlyInline(bool I = true) { FunctionDeclBits.IsInline = I; }
 
   /// Determine whether this function should be inlined, because it is
   /// either marked "inline" or "constexpr" or is a member function of a class
   /// that was defined in the class body.
   bool isInlined() const { return FunctionDeclBits.IsInline; }

clang/include/clang/AST/DeclBase.h

@@ ... @@ class FunctionDeclBitfields {
     /// deduction candidate' (is used during overload resolution).
     uint64_t IsCopyDeductionCandidate : 1;
 
     /// Store the ODRHash after first calculation.
     uint64_t HasODRHash : 1;
 
     /// Indicates if the function uses Floating Point Constrained Intrinsics
     uint64_t UsesFPIntrin : 1;
+
+    /// Indicates that this function shadows a gnu_inline intrinsic redefinition
+    uint64_t ShadowsGNUInlineIntrinsic : 1;
   };
 
   /// Number of non-inherited bits in FunctionDeclBitfields.
-  enum { NumFunctionDeclBits = 27 };
+  enum { NumFunctionDeclBits = 28 };
 
   /// Stores the bits used by CXXConstructorDecl. If modified
   /// NumCXXConstructorDeclBits and the accessor
   /// methods in CXXConstructorDecl should be updated appropriately.
   class CXXConstructorDeclBitfields {
     friend class CXXConstructorDecl;
     /// For the bits in DeclContextBitfields.
     uint64_t : NumDeclContextBits;
     /// For the bits in FunctionDeclBitfields.
     uint64_t : NumFunctionDeclBits;
 
     /// 24 bits to fit in the remaining available space.
     /// Note that this makes CXXConstructorDeclBitfields take
     /// exactly 64 bits and thus the width of NumCtorInitializers
     /// will need to be shrunk if some bit is added to NumDeclContextBitfields,
     /// NumFunctionDeclBitfields or CXXConstructorDeclBitfields.
-    uint64_t NumCtorInitializers : 21;
+    uint64_t NumCtorInitializers : 20;
     uint64_t IsInheritingConstructor : 1;
 
     /// Whether this constructor has a trail-allocated explicit specifier.
     uint64_t HasTrailingExplicitSpecifier : 1;
     /// If this constructor does't have a trail-allocated explicit specifier.
     /// Whether this constructor is explicit specified.
     uint64_t IsSimpleExplicit : 1;
   };

clang/lib/AST/Decl.cpp

@@ ... @@ FunctionDecl::FunctionDecl(Kind DK, ASTContext &C, DeclContext *DC,
   FunctionDeclBits.InstantiationIsPending = false;
   FunctionDeclBits.UsesSEHTry = false;
   FunctionDeclBits.UsesFPIntrin = UsesFPIntrin;
   FunctionDeclBits.HasSkippedBody = false;
   FunctionDeclBits.WillHaveBody = false;
   FunctionDeclBits.IsMultiVersion = false;
   FunctionDeclBits.IsCopyDeductionCandidate = false;
   FunctionDeclBits.HasODRHash = false;
+  FunctionDeclBits.ShadowsGNUInlineIntrinsic = false;
   if (TrailingRequiresClause)
     setTrailingRequiresClause(TrailingRequiresClause);
 }
 
 void FunctionDecl::getNameForDiagnostic(
     raw_ostream &OS, const PrintingPolicy &Policy, bool Qualified) const {
   NamedDecl::getNameForDiagnostic(OS, Policy, Qualified);
   const TemplateArgumentList *TemplateArgs = getTemplateSpecializationArgs();

clang/lib/CodeGen/CodeGenFunction.cpp

Show First 20 Lines • Show All 1,292 Lines • ▼ Show 20 Lines	void CodeGenFunction::GenerateCode(GlobalDecl GD, llvm::Function *Fn,
const FunctionDecl *FD = cast<FunctionDecl>(GD.getDecl());		const FunctionDecl *FD = cast<FunctionDecl>(GD.getDecl());
CurGD = GD;		CurGD = GD;

FunctionArgList Args;		FunctionArgList Args;
QualType ResTy = BuildFunctionArgList(GD, Args);		QualType ResTy = BuildFunctionArgList(GD, Args);

// When generating code for a builtin with an inline declaration, use a		// When generating code for a builtin with an inline declaration, use a
// mangled name to hold the actual body, while keeping an external definition		// mangled name to hold the actual body, while keeping an external definition
// in case the function pointer is referenced somewhere.		// in case the function pointer is referenced somewhere.
if (FD->isInlineBuiltinDeclaration() && Fn) {		if (Fn) {
		nickdesaulniersUnsubmitted Not Done Reply Inline Actions I don't think we want to do all this work if just `Fn`; ie. create a new `std::string` with `.inline` suffix for every function we're going to generate code (IR) for. How about we add an additional unlikely guard: `if (FD->getBuiltinID() && FN) {` Because in the usual case, `FD` both has a builtin ID and is an inline builtin declaration, while in the exceptional case that this patch addresses, `FD` has a builtin ID but is not an inline builtin declaration. nickdesaulniers: I don't think we want to do all this work if just `Fn`; ie. create a new `std::string` with `.
		aaron.ballmanUnsubmitted Not Done Reply Inline Actions Is it correct to gate this on whether it's a builtin or not? I thought that builtin-like (e.g., the usual pile of attributes) user code should also have the same effect, shouldn't it? aaron.ballman: Is it correct to gate this on whether it's a builtin or not? I thought that builtin-like (e.g.
		nickdesaulniersUnsubmitted Not Done Reply Inline Actions What do you mean? I'm sorry, I don't quite follow. nickdesaulniers: What do you mean? I'm sorry, I don't quite follow.
		aaron.ballmanUnsubmitted Not Done Reply Inline Actions From the test cases below: extern inline __attribute__((always_inline)) __attribute__((gnu_inline)) unsigned long strlen(const char p) { return 1; } unsigned long mystrlen(char const s) { return strlen(s); } unsigned long strlen(const char s) { return 2; } These redeclarations resolve a particular way by GCC and this patch is intending to match that behavior. My question ultimately boils down to whether that also happens for this code, where the function is not a builtin but looks like one due to the attributes: extern inline __attribute__((always_inline)) __attribute__((gnu_inline)) unsigned long wahoo(const char p) { return 1; } unsigned long mywahoo(char const s) { return wahoo(s); } unsigned long wahoo(const char s) { return 2; } If this also reorders, then I don't think we can look at whether `FD->getBuiltinID() != 0` to decide whether to do the reordering dance because arbitrary user functions aren't Clang builtins and so they'd not get the correct behavior. Does that make sense? aaron.ballman: From the test cases below: ``` extern inline __attribute__((always_inline)) __attribute__…
		nickdesaulniersUnsubmitted Not Done Reply Inline Actions If this also reorders It does; https://godbolt.org/z/bbrox7f6e. Does that make sense? Yes, thanks for the additional info. In this case, I guess we can disregard my feedback that started this thread, marking it as done? Perhaps @serge-sans-paille should add such a non-builtin test case as part of the change? nickdesaulniers: > If this also reorders It does; https://godbolt.org/z/bbrox7f6e. > Does that make sense?
		aaron.ballmanUnsubmitted Not Done Reply Inline Actions I think you have a valid concern about the extra allocations, but I'm not certain of a better predicate to use. My intuition is that the overhead here won't be unacceptable, but it'd be good to know we're not regressing compile time performance significantly. Additional test coverage with a comment as to why we're testing it is a good idea! aaron.ballman: I think you have a valid concern about the extra allocations, but I'm not certain of a better…
		nickdesaulniersUnsubmitted Not Done Reply Inline Actions Perhaps we can first test whether this FuctionDecl is a redecl, then do the allocation, then check if the `.inline` suffix? That way we avoid creating the new string unless we're codgen'ing a redecl, which should be uncommon in practice. nickdesaulniers: Perhaps we can first test whether this FuctionDecl is a redecl, then do the allocation, then…
		aaron.ballmanUnsubmitted Not Done Reply Inline Actions That could save us a bit of perf, but I think redecls are actually quite common because a definition is itself a declaration, so having a decl in a header file and defn in a source file will produce a redecl chain. aaron.ballman: That could save us a bit of perf, but I think redecls are actually quite common because a…
		nickdesaulniersUnsubmitted Not Done Reply Inline Actions Ah, ok then. Additional test coverage with a comment as to why we're testing it is a good idea! Yeah, with that and fixes to the other small nits in the test case, then I think this is ready to land. nickdesaulniers: Ah, ok then. > Additional test coverage with a comment as to why we're testing it is a good…
		if (FD->isInlineBuiltinDeclaration()) {
std::string FDInlineName = (Fn->getName() + ".inline").str();		std::string FDInlineName = (Fn->getName() + ".inline").str();
llvm::Module *M = Fn->getParent();		llvm::Module *M = Fn->getParent();
llvm::Function *Clone = M->getFunction(FDInlineName);		llvm::Function *Clone = M->getFunction(FDInlineName);
if (!Clone) {		if (!Clone) {
Clone = llvm::Function::Create(Fn->getFunctionType(),		Clone = llvm::Function::Create(Fn->getFunctionType(),
llvm::GlobalValue::InternalLinkage,		llvm::GlobalValue::InternalLinkage,
Fn->getAddressSpace(), FDInlineName, M);		Fn->getAddressSpace(), FDInlineName, M);
Clone->addFnAttr(llvm::Attribute::AlwaysInline);		Clone->addFnAttr(llvm::Attribute::AlwaysInline);
}		}
Fn->setLinkage(llvm::GlobalValue::ExternalLinkage);		Fn->setLinkage(llvm::GlobalValue::ExternalLinkage);
Fn = Clone;		Fn = Clone;
}		}

		// Detect the unusual situation where an inline version is shadowed by a
		nickdesaulniersUnsubmitted Not Done Reply Inline Actions Perhaps in `Sema::CheckFunctionDeclaration`? I see there is where we detect redeclarations. The calls from there to `FunctionDecl::setPreviousDeclaration()` seem to set up the redecl chain. Perhaps this exceptional case (or both cases, even) would be handled better there? cc @rsmith @aaron.ballman in case they have feedback/tips/cycles. nickdesaulniers: Perhaps in `Sema::CheckFunctionDeclaration`? I see there is where we detect redeclarations.
		aaron.ballmanUnsubmitted Not Done Reply Inline Actions I don't know that it's a good idea to modify the redeclaration chain in this case. The comments on the chain are pretty clear that it's a temporal chain where "previous" means previously declared in relation to the current declaration. @rsmith may feel differently, however. aaron.ballman: I don't know that it's a good idea to modify the redeclaration chain in this case. The comments…
		nickdesaulniersUnsubmitted Not Done Reply Inline Actions Sorry, I don't quite follow whether your approving of the current approach or dismissive? nickdesaulniers: Sorry, I don't quite follow whether your approving of the current approach or dismissive?
		aaron.ballmanUnsubmitted Not Done Reply Inline Actions I don't think we should modify the redecl chain from `CheckFunctionDeclaration()` -- this case would create a redeclaration chain whose previous link was not temporally the previous declaration. There might be another approach so we can avoid `replaceAllUsesWith()`. One possibility (no idea how feasible or what explodes) is to modify `FunctionDecl::getDefinition()` to look through the chain to return the best definition when there are multiple definitions to pick from. aaron.ballman: I don't think we should modify the redecl chain from `CheckFunctionDeclaration()` -- this case…
		// non-inline version. In that case we should pick the external one
		nickdesaulniers (Done): Do we really want to be iterating every redeclaration like this, even for non-inline builtin declarations? Is there a way to avoid the below loop for most functions? Perhaps we should be doing this when we hit a redeclaration instead? Or wrap all this work in a check that there's a builtin ID associated with the FD?
		// everywhere. That's GCC behavior too. Unfortunately, I cannot find a way
		// to detect that situation before we reach codegen, so do some late
		// replacement.
		else if (FD->shadowsGNUInlineIntrinsic()) {
		std::string FDInlineName = (Fn->getName() + ".inline").str();
		aaron.ballman (Not Done): Please fix the formatting.
		llvm::Module *M = Fn->getParent();
		if (llvm::Function *Clone = M->getFunction(FDInlineName)) {
		Clone->replaceAllUsesWith(Fn);
		Clone->eraseFromParent();
		}
		nickdesaulniers (Done): If there are multiple redeclarations, do we want to be erasing the clone each time, or can we `break` out of this loop?
		}
		}

// Check if we should generate debug info for this function.
if (FD->hasAttr<NoDebugAttr>()) {
// Clear non-distinct debug info that was possibly attached to the function
// due to an earlier declaration without the nodebug attribute
if (Fn)
Fn->setSubprogram(nullptr);
// Disable debug info indefinitely for this function
DebugInfo = nullptr;
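As a rough illustration of the late replacement in the hunk above, the following toy model (plain C++, not the LLVM API) retargets every call from a `<name>.inline` clone to the external definition and then erases the clone. `ToyFunction`, `ToyCall`, `ToyModule`, and `dropShadowedInlineClone` are hypothetical names invented for this sketch; only the `.inline` suffix and the replace-then-erase order are taken from the patch.

```cpp
#include <cassert>
#include <map>
#include <string>
#include <vector>

// Toy stand-ins for llvm::Function and llvm::Module. These are assumptions
// made for illustration and do not reflect the real LLVM classes.
struct ToyFunction {
  std::string Name;
  int ReturnValue; // what the body returns in this model
};

struct ToyCall {
  std::string Caller;
  std::string Callee; // name of the function this call site targets
};

struct ToyModule {
  std::map<std::string, ToyFunction> Functions;
  std::vector<ToyCall> Calls;

  // Retarget every call to From so that it calls To, then drop From.
  // Loosely mirrors replaceAllUsesWith() followed by eraseFromParent().
  void replaceAndErase(const std::string &From, const std::string &To) {
    for (ToyCall &C : Calls)
      if (C.Callee == From)
        C.Callee = To;
    Functions.erase(From);
  }
};

// When the external definition of Name is emitted, remove a previously
// emitted "<Name>.inline" clone, if any. Returns true if a clone was removed.
bool dropShadowedInlineClone(ToyModule &M, const std::string &Name) {
  const std::string InlineName = Name + ".inline";
  if (M.Functions.find(InlineName) == M.Functions.end())
    return false;
  M.replaceAndErase(InlineName, Name);
  return true;
}
```

In this model, a caller emitted before the external definition (targeting `strlen.inline`) and a caller emitted after it (targeting `strlen`) both end up calling `strlen`, which is the outcome the new tests check for.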

clang/lib/Sema/SemaDecl.cpp



	Diag(MD->getLocation(), diag::warn_cxx14_compat_constexpr_not_const)
	<< FixItHint::CreateInsertion(AddConstLoc, " const");
	}
	}
	}

	if (Redeclaration) {
				FunctionDecl *OldFDecl = dyn_cast<FunctionDecl>(OldDecl);
				if (OldFDecl && OldFDecl->isInlineBuiltinDeclaration() &&
				!NewFD->isInlineBuiltinDeclaration())
				NewFD->setShadowsGNUInlineIntrinsic(true);

	// NewFD and OldDecl represent declarations that need to be
	// merged.
	if (MergeFunctionDecl(NewFD, OldDecl, S, MergeTypeWithPrevious)) {
	NewFD->setInvalidDecl();
	return Redeclaration;
	}

	Previous.clear();
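The condition added to the `if (Redeclaration)` branch above can be sketched in isolation as follows. This is a toy model only: `ToyDecl` and `markIfShadowing` are hypothetical names, and the two boolean fields stand in for `isInlineBuiltinDeclaration()` and the new `setShadowsGNUInlineIntrinsic()` flag from the patch.

```cpp
#include <cassert>

// Hypothetical stand-in for clang::FunctionDecl, reduced to the two bits
// this check reads and writes.
struct ToyDecl {
  bool IsInlineBuiltinDeclaration = false;
  bool ShadowsGNUInlineIntrinsic = false;
};

// Mark New as shadowing when the previous declaration is an inline builtin
// declaration and the new one is not. Old may be null, matching the
// dyn_cast result in the patch.
void markIfShadowing(const ToyDecl *Old, ToyDecl &New) {
  if (Old && Old->IsInlineBuiltinDeclaration &&
      !New.IsInlineBuiltinDeclaration)
    New.ShadowsGNUInlineIntrinsic = true;
}
```

The flag is only ever set, never cleared, so a redeclaration that is itself an inline builtin declaration leaves the new declaration unmarked.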

clang/test/CodeGen/gnu-inline-redecl.c

This file was added.

				// RUN: %clang_cc1 -triple x86_64 -S -emit-llvm -O1 -o - %s | FileCheck %s
				//
				// Verifies that the gnu_inline version is ignored in favor of the redecl

				extern inline __attribute__((gnu_inline)) unsigned long some_size(int c) {
				return 1;
				}
				unsigned long mycall(int s) {
				// CHECK-LABEL: i64 @mycall
				// CHECK: ret i64 2
				return some_size(s);
				}
				unsigned long some_size(int c) {
				return 2;
				}
				unsigned long yourcall(int s) {
				// CHECK-LABEL: i64 @yourcall
				// CHECK: ret i64 2
				return some_size(s);
				}

clang/test/CodeGen/strlen-inline-builtin-redecl.c

This file was added.

				// RUN: %clang_cc1 -triple x86_64 -S -emit-llvm -disable-llvm-passes -o - %s | FileCheck %s
				//
				// Verifies that clang-generated *.inline are removed when shadowed by an external definition

				// CHECK-NOT: strlen.inline

				unsigned long strnlen(const char *, unsigned long);

				extern inline __attribute__((always_inline)) __attribute__((gnu_inline)) unsigned long strlen(const char *p) {
				nickdesaulniers (Not Done): unused decl
				return 1;
				}
				nickdesaulniers (Not Done): Do you mind wrapping this to 80 chars wide? I suspect if you put the two `__attribute__`s first, then the formatter will do a better job. You can also combine these, a la `__attribute__((always_inline, gnu_inline))` to cut down on line length.
				unsigned long mystrlen(char const *s) {
				return strlen(s);
				nickdesaulniers (Done): Has this example been formatted? Does rotating the attributes to the front help?
				}
				unsigned long strlen(const char *s) {
				return 2;
				}
				unsigned long yourstrlen(char const *s) {
				return strlen(s);
				}
				nickdesaulniers (Done): I think this line can be dropped without affecting the test. Or consider using any of the further reduced test cases?