This is an archive of the discontinued LLVM Phabricator instance.

Differential D126341

Order implicitly instantiated global variable's initializer by the reverse instantiation order
Abandoned · Public

Authored by ychen on May 24 2022, 4:01 PM.

Details

Reviewers

rnk
rsmith
aaron.ballman

Group Reviewers

Restricted Project

Summary

Per the standard (https://eel.is/c++draft/basic.start#dynamic-1), the initialization of
implicitly instantiated global variables is unordered. However, GCC has
the *intuitive behavior* for the two test cases in https://clang.godbolt.org/z/MPdhYTqhK.

The underlying problem is basically wg21.link/cwg362, which has no consensus yet.

I wish both cases could work, as they do with GCC. However, to make the original cwg362 test case work,
I need an extra data structure to track the instantiation order of implicitly
instantiated global variables. So the current patch only works for the first test case.

Will the reviewers be supportive if I make the original cwg362 test case work too?

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

ychen created this revision. May 24 2022, 4:01 PM

Herald added a project: Restricted Project. May 24 2022, 4:01 PM

ychen requested review of this revision. May 24 2022, 4:01 PM

Herald added a project: Restricted Project. May 24 2022, 4:01 PM

Herald added a subscriber: cfe-commits.

rebase

Harbormaster completed remote builds in B166161: Diff 431825. May 24 2022, 5:22 PM

Adding the language WG as a reviewer in case others have opinions.

The underlying problem is basically wg21.link/cwg362, which has no consensus yet.

According to https://www.open-std.org/jtc1/sc22/wg21/docs/cwg_defects.html#362 this was resolved in CD1 and we track it as being not applicable to us (https://clang.llvm.org/cxx_dr_status.html#362). (Is our status actually correct for this?)

Will the reviewers be supportive if I make the original cwg362 test case work too?

To me, it depends on what it does to compile times and memory overhead for the compiler when run on large projects. If the extra tracking is cheap and doesn't really impact anything, I think it's reasonable to want to match GCC's behavior. If it turns out this is expensive, I'm less keen on matching GCC.

In D126341#3537675, @aaron.ballman wrote:

Adding the language WG as a reviewer in case others have opinions.

The underlying problem is basically wg21.link/cwg362, which has no consensus yet.

According to https://www.open-std.org/jtc1/sc22/wg21/docs/cwg_defects.html#362 this was resolved in CD1 and we track it as being not applicable to us (https://clang.llvm.org/cxx_dr_status.html#362). (Is our status actually correct for this?)

Will the reviewers be supportive if I make the original cwg362 test case work too?

To me, it depends on what it does to compile times and memory overhead for the compiler when run on large projects. If the extra tracking is cheap and doesn't really impact anything, I think it's reasonable to want to match GCC's behavior. If it turns out this is expensive, I'm less keen on matching GCC.

Thanks for the opinion @aaron.ballman. I think the cost would be low. I'll get back with some numbers.

I'm somewhat supportive of the goal here, but I think there are still some underlying issues.

First, why should these guarantees be limited to instantiations and not inline variables? Such as:

int f();
inline int gv1 = f();
inline int gv2 = gv1 + 1; // rely on previous

Second, LLVM doesn't guarantee that global_ctors at the same priority execute in order. See the langref: https://llvm.org/docs/LangRef.html#the-llvm-global-ctors-global-variable So, without a guarantee from LLVM, Clang can't rely on this behavior. LLVM relies on this lack of an ordering guarantee to power globalopt.

Last, what happens when the same global is implicitly instantiated in some other TU? Won't that disrupt the ordering?

+@alexander-shaposhnikov, who is working on global opt changes.

In D126341#3537947, @rnk wrote:
I'm somewhat supportive of the goal here, but I think there are still some underlying issues.

First, why should these guarantees be limited to instantiations and not inline variables? Such as:
int f();
inline int gv1 = f();
inline int gv2 = gv1 + 1; // rely on previous

I think this is because (as discussed in cwg362, https://eel.is/c++draft/lex.phases#1.8) instantiations happen in instantiation units, where no order is specified. Inline variables live in a TU, where the order is specified.

Second, LLVM doesn't guarantee that global_ctors at the same priority execute in order. See the langref: https://llvm.org/docs/LangRef.html#the-llvm-global-ctors-global-variable So, without a guarantee from LLVM, Clang can't rely on this behavior. LLVM relies on this lack of an ordering guarantee to power globalopt.

Yeah, I noticed that too. In practice, it looks like we do rely on the order in llvm.global_ctors to implement the static init order (for the above inline variable case, https://clang.godbolt.org/z/G3YoY5bc9). If globalopt decides to reorder the two init functions, the result would be wrong.

Last, what happens when the same global is implicitly instantiated in some other TU? Won't that disrupt the ordering?

+@alexander-shaposhnikov, who is working on global opt changes.

Hmm, that's a great point. My thinking is that it would not, unless the linker is weird or inconsistent about which COMDAT to pick. In each TU, a global variable has all the global variables it depends on initialized in the correct order in the init array. Say two TUs both have T<8>: no matter which T<8> the linker picks, it would pick all of T<8>'s dependencies according to the same criteria, so the init order is kept. I wouldn't say I'm 100% sure, but that is how it appears to me at the moment. I'll do some experiments locally.

The underlying problem is basically wg21.link/cwg362, which has no consensus yet.

CWG362 has a clear consensus and has been closed with that consensus for 18 years. The consensus, per that issue, is:

We discussed this and agreed that we really do mean the the order is unspecified.

which seems clear that our current behavior is a valid choice.

In D126341#3537947, @rnk wrote:
First, why should these guarantees be limited to instantiations and not inline variables? Such as:
int f();
inline int gv1 = f();
inline int gv2 = gv1 + 1; // rely on previous

Inline variables have initialization order guarantees already. An inline variable B can rely on a prior inline variable A being initialized first if A is defined before B in every TU where B is defined. And a non-inline variable C can rely on a prior inline variable B being initialized first if B is defined before C.
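The guarantee described above can be illustrated with a minimal single-TU sketch. The helpers `init_log` and `record` and the names `var_a`, `var_b`, `var_c` are hypothetical and exist only for illustration; the sketch assumes C++17 or later and an implementation that runs these initializers before `main`.

```cpp
#include <cassert>
#include <string>
#include <vector>

// Hypothetical helper (not from the review): a log of initializer names,
// constructed on first use so it exists before any recorded initializer runs.
inline std::vector<std::string>& init_log() {
  static std::vector<std::string> log;
  return log;
}

// Hypothetical helper: note that `name` was initialized, then yield `value`.
inline int record(const char* name, int value) {
  init_log().push_back(name);
  return value;
}

// var_a is defined before var_b in every TU that defines var_b,
// so var_b's initializer may read var_a.
inline int var_a = record("var_a", 41);
inline int var_b = record("var_b", var_a + 1);

// A non-inline variable defined after var_b may rely on var_b.
int var_c = record("var_c", var_b + 1);
```

If `var_a` and `var_b` were instead implicit instantiations of a variable template, the standard would give no comparable ordering between them, which is the case this revision is about.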

There is no comparable guarantee for instantiated variables, in part because an intended implementation strategy is to instantiate them separately, and combine those instantiation units with the compilation units at link time in an arbitrary order, and in part because instantiation order could in general be different in different TUs.

Second, LLVM doesn't guarantee that global_ctors at the same priority execute in order. See the langref: https://llvm.org/docs/LangRef.html#the-llvm-global-ctors-global-variable So, without a guarantee from LLVM, Clang can't rely on this behavior. LLVM relies on this lack of an ordering guarantee to power globalopt.

It doesn't seem reasonable for us to provide a guarantee here, and in fact, we can't provide a guarantee in general (once there's more than one TU, a consistent global total order might not exist). I think the question is, should we make some minor effort to make the broken code that's relying on initialization order of unordered variables happen to do the right thing most of the time, or should we keep the status quo that is likely to result in problems happening earlier? I don't think it's clear whether the superficial increase in compatibility with GCC is a net positive or negative -- giving the surprising initialization order may counter-intuitively help people to find ordering bugs faster.
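To make the "a consistent global total order might not exist" point concrete, here is a small sketch that checks whether the per-TU orders admit one total order. It is a toy model: `TuOrder` and `has_consistent_total_order` are hypothetical names, and each TU is assumed to be fully described by the sequence in which it instantiates its variables.

```cpp
#include <cassert>
#include <cstddef>
#include <map>
#include <queue>
#include <set>
#include <string>
#include <vector>

// One TU's instantiation order, as a list of variable names (toy model).
using TuOrder = std::vector<std::string>;

// Returns true if a single total order agrees with every TU's order, i.e. the
// "X before Y" constraints form an acyclic graph (checked with Kahn's algorithm).
bool has_consistent_total_order(const std::vector<TuOrder>& tus) {
  std::map<std::string, std::set<std::string>> successors;
  std::map<std::string, int> indegree;
  for (const TuOrder& tu : tus) {
    for (std::size_t i = 0; i < tu.size(); ++i) {
      successors[tu[i]];  // make sure every variable is a node
      indegree[tu[i]];    // a new node starts with indegree 0
      // Adjacent pairs are enough; longer-range constraints follow transitively.
      if (i + 1 < tu.size() && successors[tu[i]].insert(tu[i + 1]).second)
        ++indegree[tu[i + 1]];
    }
  }
  std::queue<std::string> ready;
  for (const auto& entry : indegree)
    if (entry.second == 0) ready.push(entry.first);
  std::size_t placed = 0;
  while (!ready.empty()) {
    std::string v = ready.front();
    ready.pop();
    ++placed;
    for (const std::string& next : successors[v])
      if (--indegree[next] == 0) ready.push(next);
  }
  // Any variable left unplaced sits on a cycle of conflicting constraints.
  return placed == indegree.size();
}
```

Two TUs that instantiate `X` and `Y` in opposite orders make the check fail, so no ordering scheme could honor both.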

clang/lib/CodeGen/CGDeclCXX.cpp
Line 582: This is effectively initializing instantiated variables in reverse instantiation order. That seems like it'll make things worse as much as it makes things better. For example, given:

#include <iostream>
template<typename T> int a = (std::cout << "hello, ", 0);
template<typename T> int b = (std::cout << "world", 0);
int main() {
  (void)a<int>;
  (void)b<int>;
}

... we currently print `"hello, world"`, but with this change we'll print `"worldhello, "`.

If we want a sensible initialization order, I think we need a different strategy, that will probably require `Sema` to be a lot more careful about what order it instantiates variables in and what order it passes them to the AST consumer: if an instantiation A triggers another instantiation B, we should defer passing A to the consumer until B has been instantiated and passed to the consumer. That's probably not too hard to implement, by adding an entry to the pending instantiation list to say "now pass this to the consumer" in the case where one instantiation triggers another. But I do wonder whether that level of complexity is worthwhile, given that code relying on this behavior is broken.

clang/lib/CodeGen/CodeGenModule.cpp
Lines 1562–1563: This is quadratic in the number of instantiated variables. I don't think that's acceptable.

rnk added inline comments. May 26 2022, 2:20 PM

clang/lib/CodeGen/CGDeclCXX.cpp
Line 563: @rsmith, if inline global variable initialization has ordering requirements, we have a bug, because I believe this GVA_DiscardableODR codepath handles them, and we come through here to give them separate initializers in llvm.global_ctors. See the example with two separate global_ctors entries on godbolt: https://gcc.godbolt.org/z/5d577snqb

As long as LLVM doesn't provide ordering guarantees about same priority initializers in global_ctors, inline globals have the same problems as template instantiations.

IMO whatever solution we use to order inline globals should be used for template instantiations. Intuitively, that means LLVM should promise to run global_ctors in left-to-right order, and if all TUs instantiate initializers in the same order, everything should behave intuitively.

The question then becomes, why doesn't this work already?

rsmith added inline comments. May 26 2022, 4:02 PM

clang/lib/CodeGen/CGDeclCXX.cpp
Line 563:

Intuitively, that means LLVM should promise to run global_ctors in left-to-right order

I don't think that's sufficient, due to the way we use COMDATs to discard duplicate global initializers. Consider:

TU 1 defines inline variable A.
TU 2 defines inline variable A and then inline variable B.

The standard guarantees that A is initialized before B in this scenario. But if (somehow) the linker picks the definition of A from TU 2, but orders the initializers from TU 1 first, then the resulting global_ctors order will be B then A, which is not allowed.

Either we need a guarantee that linkers will use the same ordering between objects when picking COMDATs as when concatenating `.init_array`, or we need to stop using the COMDAT trick for them (and either make LLVM respect the `@llvm.global_ctors` order or coalesce all inline variable initializers into the same function we run non-inline initializers from or something). Getting that guarantee seems like the best path to me, since the other option will presumably mean we check the guard variable once on startup for each TU that defines the variable, and it's something I expect linkers already happen to guarantee in practice.

IMO whatever solution we use to order inline globals should be used for template instantiations.

That sounds like it would regress what globalopt is able to optimize, for no gain in conformance nor perhaps any real gain in initialization order guarantees. The inline variable case is different, both because we can guarantee an initialization order and because the standard requires us to do so. Should we add complexity to the compiler whose only purpose is to mask bugs, if that complexity doesn't actually define away the possibility of bad behavior?

If we can actually describe a rule that we provide for initialization order of instantiated variables, and we can easily implement that rule and be confident we won't want to substantially weaken it later, and we can thereby assure our users that we will satisfy that rule, then I think that could be interesting, but anything less than that doesn't seem worthwhile to me.

The question then becomes, why doesn't this work already?

It looks like it mostly does. One factor here is instantiation order. When we instantiate a variable, we add the variables that it references to our "to be instantiated" list, which is processed later:

template<int N> int Fib = Fib<N - 2> + Fib<N - 1>;
template<> int Fib<0> = 0;
template<> int Fib<1> = 1;

- instantiating `Fib<5>` will append `Fib<3>` and `Fib<4>` to the list
- then we visit `Fib<3>` and append `Fib<1>` and `Fib<2>` to the list
- then we visit `Fib<4>` and add no new entries

We pass declarations to the consumer when we're done with the instantiation step. Sometimes this includes instantiating variables referenced by that variable, and sometimes it doesn't. The difference is whether we're performing "recursive" instantiation or not.

- When we're performing immediate instantiation of a variable (either because it was explicitly instantiated, or because we might need its value immediately because it might be usable in constant expressions), our instantiation step is non-recursive. We just add declarations to `Sema`'s "to be instantiated at end of TU" list. This is at least a little important semantically: we allow a matching specialization to be declared after the first use wherever possible. In that case, we'll pass a declaration to the consumer before we've instantiated the things it references.
- When we're performing the end-of-TU instantiation of all referenced template specializations, we do that recursively, and that means that we will instantiate any referenced variables before we pass the referencing variable to the AST consumer.

You can see this happening here: https://godbolt.org/z/4sj1Y7W4G (look at the order in which we get the warnings: for `Fib<5>` then `x` then `Fib<3>`, then `Fib<2>`, then `Fib<4>`), and note that we initialize `Fib<5>` first, because it's the first thing added to the consumer. Then `Fib<2>`, `Fib<3>`, and `Fib<4>` get passed to the consumer in that order, because that is the order in which the instantiations of their definitions finish. But `Fib<5>`'s instantiation finishes first, because that's a non-recursive instantiation.

Without the explicit instantiation, we pass the variables to the AST consumer in the order `Fib<2>`, `Fib<3>`, `Fib<4>`, `Fib<5>`: https://godbolt.org/z/sax1dvh7z leading to an "intuitive" result. They end up dependency-ordered because that's the order in which their instantiations happen to finish. (Example with a static data member: https://godbolt.org/z/9fsbPvWTP)

Another factor (and the reason why my examples are working and @ychen's very similar examples are not) is `CodeGenModule`'s deferred emission of variables. If an instantiated variable has an initializer with no side effects, `CodeGenModule` won't emit it unless it emits a reference to it (there's no point emitting something that will just be discarded by the linker). And `CodeGenModule`'s process for emitting deferred declarations does all kinds of reordering. The way this works is: `CodeGenModule` is handed the globals, sees they don't need to be emitted, and ignores them. Then:

- it emits a reference to `Fib<5>` and decides that `Fib<5>` needs to be emitted and adds it to a queue of deferred declarations to emit
- it emits the deferred declaration `Fib<5>`, which adds `Fib<3>` and `Fib<4>` to a new queue for things used by `Fib<5>`
- it emits the things used by `Fib<5>`: `Fib<3>` and `Fib<4>`
  - emitting `Fib<3>` adds `Fib<2>` to a new queue for things used by `Fib<3>`, so `Fib<2>` is emitted next
  - emitting `Fib<4>` adds nothing new to the queue

So when the initializers don't have side-effects, the variables are initialized in the order `Fib<5>`, `Fib<3>`, `Fib<2>`, `Fib<4>`.

So the "reversing" has at least two sources (beyond anything that globalopt or the linker might do):

- non-recursive instantiations in `Sema` will cause an instantiated variable to be passed to the consumer before the things it references; this can mostly be avoided by not using explicit instantiations
- instantiated variables with side-effect-free initializers have their initializers emitted on use rather than in instantiation order; this can be avoided by building with `-femit-all-decls` or by adding a dummy side-effect to the initializer

If we want to fix just the `CodeGen` side of this, I think the thing to do would be to follow the model that `CodeGen` uses for ordered initialization (it tracks a `DelayedCXXInitPosition` map giving the order in which the variables should be initialized, if their initializers are actually emitted). We could do the same thing for instantiated variables, allocating each one handed to CodeGen a slot which either gets filled in with that initializer, or doesn't get emitted if the variable is not emitted. But, as noted above, I'm not convinced it's worth it unless this leads to some actual user-facing behavior guarantee.

Line 582: Please see my other (long) comment; `Sema` actually already does what I describe here, except in the cases where it does an eager, non-recursive instantiation.
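The two orders described in the long comment above can be reproduced with a toy model of the `Fib` example. This is only a sketch, not Clang's implementation: `Deps`, `fib_deps`, `recursive_instantiation_order`, `deferred_emission_order`, and `order_by_slot` are hypothetical names, and the explicit specializations `Fib<0>` and `Fib<1>` are assumed to drop out of the dependency lists.

```cpp
#include <cassert>
#include <map>
#include <set>
#include <vector>

// Toy model: deps[N] lists the implicitly instantiated variables that Fib<N>
// references (Fib<N - 2>, then Fib<N - 1>), minus the explicit specializations.
using Deps = std::map<int, std::vector<int>>;

inline Deps fib_deps() {
  return {{5, {3, 4}}, {4, {2, 3}}, {3, {2}}, {2, {}}};
}

// Recursive instantiation: a variable reaches the consumer only after
// everything it references has (post-order traversal).
inline void visit_post_order(int n, const Deps& deps, std::set<int>& seen,
                             std::vector<int>& out) {
  if (!seen.insert(n).second) return;
  for (int d : deps.at(n)) visit_post_order(d, deps, seen, out);
  out.push_back(n);
}

// Deferred emission on use: a variable is emitted first, and the things it
// uses are emitted right after it (pre-order traversal).
inline void visit_pre_order(int n, const Deps& deps, std::set<int>& seen,
                            std::vector<int>& out) {
  if (!seen.insert(n).second) return;
  out.push_back(n);
  for (int d : deps.at(n)) visit_pre_order(d, deps, seen, out);
}

inline std::vector<int> recursive_instantiation_order(int root, const Deps& deps) {
  std::set<int> seen;
  std::vector<int> out;
  visit_post_order(root, deps, seen, out);
  return out;
}

inline std::vector<int> deferred_emission_order(int root, const Deps& deps) {
  std::set<int> seen;
  std::vector<int> out;
  visit_pre_order(root, deps, seen, out);
  return out;
}

// Slot idea: every variable gets a position when it is handed over, and the
// initializers that are actually emitted run in slot order.
inline std::vector<int> order_by_slot(const std::vector<int>& handed_over,
                                      const std::vector<int>& emitted) {
  std::set<int> emitted_set(emitted.begin(), emitted.end());
  std::vector<int> out;
  for (int v : handed_over)
    if (emitted_set.count(v)) out.push_back(v);
  return out;
}
```

In this model the post-order walk yields the dependency order, the pre-order walk yields the order quoted for side-effect-free initializers, and assigning slots at hand-off time restores the hand-off order whatever the emission order was.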

rnk added inline comments. May 27 2022, 11:12 AM

clang/lib/CodeGen/CGDeclCXX.cpp
Line 563:

TU 1 defines inline variable A.
TU 2 defines inline variable A and then inline variable B.

The standard guarantees that A is initialized before B in this scenario. But if (somehow) the linker picks the definition of A from TU 2, but orders the initializers from TU 1 first, then the resulting global_ctors order will be B then A, which is not allowed.

Yes, but we only get the ordering guarantee if A is defined before B in all TUs, and both MSVC and Clang seem to emit all inline globals with dynamic initializers, even if they are not ODR used: https://gcc.godbolt.org/z/MhKvGqTez

So, in your example, TU1 must not have a definition of B, meaning there is no guarantee of ordering.

All we need to do to fix the inline variable conformance bug is for LLVM to guarantee an initialization order in global_ctors. However, the bug isn't observable in practice because inline globals require guard variables, and globalopt can't optimize those yet.

I don't want to get too deep into the clang implementation details. I trust you that it's complicated. My perspective is just that, if we have a working ordering solution for inline variables, we should use it, if it is simple and low cost, for instantiated variables. It sounds to me like ordering instantiations is not cheap and easy, so we shouldn't do it. And, this change in particular doesn't address many cases in practice.

rsmith added inline comments. May 27 2022, 2:14 PM

clang/lib/CodeGen/CGDeclCXX.cpp
Line 563:

TU 1 defines inline variable A.
TU 2 defines inline variable A and then inline variable B.

The standard guarantees that A is initialized before B in this scenario. But if (somehow) the linker picks the definition of A from TU 2, but orders the initializers from TU 1 first, then the resulting global_ctors order will be B then A, which is not allowed.

Yes, but we only get the ordering guarantee if A is defined before B in all TUs,

That's not quite right. We get the guarantee if A is defined before B in all TUs where B is defined. It's OK for there to be TUs where A is defined but B is not. (See the relevant language rule, which requires only that "for every definition of [B] there exists a definition of [A]" and not the other way around.)

The intended user model is that it's OK for B to rely on A if they're defined in the same header file, or if A is defined in a header file that B includes. And it's OK if there's a source file that includes A's header but not B's.

The intended implementation model is that inline variables are initialized (as if) in definition order within each TU they are defined in, with a guard variable preventing repeated initialization.
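The intended implementation model can be sketched as a small simulation in which each TU runs its own initializers in definition order and a guard flag makes repeated attempts harmless. The `Program` struct and the `run` helper are hypothetical and model only the A/B scenario from this thread.

```cpp
#include <cassert>
#include <string>
#include <vector>

// Toy model of two TUs sharing inline variables A and B.
struct Program {
  std::vector<std::string> log;  // order in which initializers actually ran
  bool guard_a = false;          // stands in for A's guard variable
  bool guard_b = false;          // stands in for B's guard variable

  void init_a() {
    if (!guard_a) { guard_a = true; log.push_back("A"); }
  }
  void init_b() {
    if (!guard_b) { guard_b = true; log.push_back("B"); }
  }

  // TU 1 defines only A; TU 2 defines A and then B.
  void run_tu1_inits() { init_a(); }
  void run_tu2_inits() { init_a(); init_b(); }
};

// Runs both TUs' initializers in the requested order and returns the log.
inline std::vector<std::string> run(bool tu1_first) {
  Program p;
  if (tu1_first) {
    p.run_tu1_inits();
    p.run_tu2_inits();
  } else {
    p.run_tu2_inits();
    p.run_tu1_inits();
  }
  return p.log;
}
```

Whichever TU runs first, A is initialized before B in this model, at the cost of one guard check per TU that defines the variable.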

dblaikie added a subscriber: dblaikie. May 30 2022, 5:26 PM

Well, I guess we're out of luck, but that seems like a very poorly considered requirement from the standard. If we can't use comdats for inline variables, every time you include a header with a dynamically initialized variable, it will generate extra initialization code in every TU that cannot be optimized away. This reminds me of the problems people used to have where every TU including <iostream> emitted extra initialization code.

I think we have two options:

1. Full conformance: Stop using comdats altogether and suffer costs in code size and startup time
2. Partial / compromised conformance: Provide ordering guarantees between global_ctors entries so that we can ensure that inline variables in headers have the expected initialization order

In D126341#3554286, @rnk wrote:

Well, I guess we're out of luck, but that seems like a very poorly considered requirement from the standard. If we can't use comdats for inline variables, every time you include a header with a dynamically initialized variable, it will generate extra initialization code in every TU that cannot be optimized away. This reminds me of the problems people used to have where every TU including <iostream> emitted extra initialization code.

I think we have two options:

1. Full conformance: Stop using comdats altogether and suffer costs in code size and startup time

2. Partial / compromised conformance: Provide ordering guarantees between global_ctors entries so that we can ensure that inline variables in headers have the expected initialization order

I'd prefer option 2 due to the code size/startup time cost. About enforcing the order in global_ctors (and maybe also global_dtors): how about adding an integer order field to each entry in global_ctors, and then letting the IR verifier check that the order is non-descending (passes would be allowed to reorder entries with the same order)? Another guarantee needed for option 2 is that the linker consistently picks the first COMDAT. I think this is getting out of our hands, but this assumption *could* be made in practice? In very rare cases, if a linker changes this behavior, we'll have to rely on user reports to find out.
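As a rough sketch of the proposed check, assuming a hypothetical entry layout: the `order` field below is taken from the proposal in this comment and is not an existing LLVM field, and `CtorEntry` and `order_is_non_descending` are illustrative names rather than LLVM APIs.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <vector>

// Hypothetical shape of a global_ctors entry extended with the proposed field.
struct CtorEntry {
  int priority;    // existing notion of initializer priority
  int order;       // proposed ordering field (an assumption of this sketch)
  std::string fn;  // name of the initializer function
};

// Verifier-style check: the order field must never decrease from one entry to
// the next. Entries with equal order may appear in any relative position.
inline bool order_is_non_descending(const std::vector<CtorEntry>& entries) {
  for (std::size_t i = 1; i < entries.size(); ++i)
    if (entries[i].order < entries[i - 1].order) return false;
  return true;
}
```

A pass that swaps two entries with the same `order` value still passes this check, which matches the freedom the proposal wants to leave to optimizations.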

clang/lib/CodeGen/CGDeclCXX.cpp
Line 563:

TU 1 defines inline variable A.
TU 2 defines inline variable A and then inline variable B.

The standard guarantees that A is initialized before B in this scenario. But if (somehow) the linker picks the definition of A from TU 2, but orders the initializers from TU 1 first, then the resulting global_ctors order will be B then A, which is not allowed.

If I understand this example correctly, a consistent linker that always picks the last COMDAT would also produce the wrong result.
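The interaction between COMDAT selection and initializer concatenation can be explored with a toy linker model. It assumes each TU contributes its initializers in definition order, exactly one copy per variable survives, and the surviving copies run in link order; `TuInits` and `link_init_order` are hypothetical names.

```cpp
#include <cassert>
#include <cstddef>
#include <map>
#include <string>
#include <vector>

// One TU's inline-variable initializers, in definition order (toy model).
using TuInits = std::vector<std::string>;

// Returns the resulting initialization order when the linker keeps either the
// first or the last copy of each variable's initializer, then concatenates the
// surviving initializers in link order.
inline std::vector<std::string> link_init_order(const std::vector<TuInits>& tus,
                                                bool pick_first) {
  std::map<std::string, std::size_t> chosen;  // variable -> index of surviving TU
  for (std::size_t t = 0; t < tus.size(); ++t) {
    for (const std::string& v : tus[t]) {
      if (pick_first)
        chosen.emplace(v, t);  // no effect if v already has a surviving copy
      else
        chosen[v] = t;         // later TUs overwrite earlier ones
    }
  }
  std::vector<std::string> out;
  for (std::size_t t = 0; t < tus.size(); ++t)
    for (const std::string& v : tus[t])
      if (chosen.at(v) == t) out.push_back(v);
  return out;
}
```

In this model, keeping the first copy preserves "A before B" for either link order of the two TUs, while keeping the last copy yields B before A when TU 2 is linked ahead of TU 1.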

ychen mentioned this in D127233: [CodeGen] Sort llvm.global_ctors by lexing order before emission. Jun 7 2022, 10:34 AM

ychen mentioned this in D127259: [CodeGen] guarantee templated static variables are initialized in the reverse instantiation order. Jun 7 2022, 3:35 PM

ychen mentioned this in rGf9969a3d28e7: [CodeGen] Sort llvm.global_ctors by lexing order before emission. Aug 22 2022, 4:00 PM

ychen mentioned this in rGe423885e272c: [CodeGen] guarantee templated static variables are initialized in the reverse…. Mar 3 2023, 12:13 AM

Superseded by D127233, D127259, rG7f8d844df5e9

Revision Contents

Path                                                                      Size
clang/lib/CodeGen/CGDeclCXX.cpp                                           6 lines
clang/lib/CodeGen/CodeGenModule.h                                         3 lines
clang/lib/CodeGen/CodeGenModule.cpp                                       8 lines
clang/test/CodeGenCXX/aix-static-init-temp-spec-and-inline-var.cpp        2 lines
clang/test/CodeGenCXX/microsoft-abi-static-initializers.cpp               4 lines
clang/test/CodeGenCXX/static-member-variable-explicit-specialization.cpp  53 lines
clang/test/Modules/initializers.cpp                                       10 lines

Diff 431825

clang/lib/CodeGen/CGDeclCXX.cpp

(538 lines not shown)
   llvm::Function *Fn = CreateGlobalInitOrCleanUpFunction(
       FTy, FnName.str(), getTypes().arrangeNullaryFunction(), D->getLocation());

   auto *ISA = D->getAttr<InitSegAttr>();
   CodeGenFunction(*this).GenerateCXXGlobalVarDeclInitFunc(Fn, D, Addr,
                                                           PerformInit);

   llvm::GlobalVariable *COMDATKey =
       supportsCOMDAT() && D->isExternallyVisible() ? Addr : nullptr;
+  bool IsInstantiation =
+      isTemplateInstantiation(D->getTemplateSpecializationKind());

   if (D->getTLSKind()) {
     // FIXME: Should we support init_priority for thread_local?
     // FIXME: We only need to register one __cxa_thread_atexit function for the
     // entire TU.
     CXXThreadLocalInits.push_back(Fn);
     CXXThreadLocalInitVars.push_back(D);
   } else if (PerformInit && ISA) {
     EmitPointerToInitFunc(D, Addr, Fn, ISA);
   } else if (auto *IPA = D->getAttr<InitPriorityAttr>()) {
     OrderGlobalInitsOrStermFinalizers Key(IPA->getPriority(),
                                           PrioritizedCXXGlobalInits.size());
     PrioritizedCXXGlobalInits.push_back(std::make_pair(Key, Fn));
-  } else if (isTemplateInstantiation(D->getTemplateSpecializationKind()) ||
+  } else if (IsInstantiation ||
              getContext().GetGVALinkageForVariable(D) == GVA_DiscardableODR ||
		rnkUnsubmitted Not Done Reply Inline Actions @rsmith , if inline global variable initialization has ordering requirements, we have a bug, because I believe this GVA_DiscardableODR codepath handles them, and we come through here to give them separate initializers in llvm.global_ctors. See the example with two separate global_ctors entries on godbolt: https://gcc.godbolt.org/z/5d577snqb As long as LLVM doesn't provide ordering guarantees about same priority initializers in global_ctors, inline globals have the same problems as template instantiations. IMO whatever solution we use to order inline globals should be used for template instantiations. Intuitively, that means LLVM should promise to run global_ctors in left-to-right order, and if all TUs instantiate initializers in the same order, everything should behave intuitively. The question then becomes, why doesn't this work already? rnk: @rsmith , if inline global variable initialization has ordering requirements, we have a bug…
		rsmithUnsubmitted Not Done Reply Inline Actions Intuitively, that means LLVM should promise to run global_ctors in left-to-right order I don't think that's sufficient, due to the way we use COMDATs to discard duplicate global initializers. Consider: TU 1 defines inline variable A. TU 2 defines inline variable A and then inline variable B. The standard guarantees that A is initialized before B in this scenario. But if (somehow) the linker picks the definition of A from TU 2, but orders the initializers from TU 1 first, then the resulting global_ctors order will be B then A, which is not allowed. Either we need a guarantee that linkers will use the same ordering between objects when picking COMDATs as when concatenating `.init_array`, or we need to stop using the COMDAT trick for them (and either make LLVM respect the `@llvm.global_ctors` order or coalesce all inline variable initializers into the same function we run non-inline initializers from or something). Getting that guarantee seems like the best path to me, since the other option will presumably mean we check the guard variable once on startup for each TU that defines the variable, and it's something I expect linkers already happen to guarantee in practice. IMO whatever solution we use to order inline globals should be used for template instantiations. That sounds like it would regress what globalopt is able to optimize, for no gain in conformance nor perhaps any real gain in initialization order guarantees. The inline variable case is different, both because we can guarantee an initialization order and because the standard requires us to do so. Should we add complexity to the compiler whose only purpose is to mask bugs, if that complexity doesn't actually define away the possibility of bad behavior? 
If we can actually describe a rule that we provide for initialization order of instantiated variables, and we can easily implement that rule and be confident we won't want to substantially weaken it later, and we can thereby assure our users that we will satisfy that rule, then I think that could be interesting, but anything less than that doesn't seem worthwhile to me. The question then becomes, why doesn't this work already? It looks like it mostly does. One factor here is instantiation order. When we instantiate a variable, we add the variables that it references to our "to be instantiated" list, which is processed later: template<int N> int Fib = Fib<N - 2> + Fib<N - 1>; template<> int Fib<0> = 0; template<> int Fib<1> = 1; instantiating `Fib<5>` will append `Fib<3>` and `Fib<4>` to the list then we visit `Fib<3>` and append `Fib<1>` and `Fib<2>` to the list then we visit `Fib<4>` and add no new entries We pass declarations to the consumer when we're done with the instantiation step. Sometimes this includes instantiating variables referenced by that variable, and sometimes it doesn't. The difference is whether we're performing "recursive" instantiation or not. When we're performing immediate instantiation of a variable (either because it was explicitly instantiated, or because we might need its value immediately because it might be usable in constant expressions), our instantiation step is non-recursive. We just add declarations to `Sema`'s "to be instantiated at end of TU" list. This is at least a little important semantically: we allow a matching specialization to be declared after the first use wherever possible. In that case, we'll pass a declaration to the consumer before we've instantiated the things it references. When we're performing the end-of-TU instantiation of all referenced template specializations, we do that recursively, and that means that we will instantiate any referenced variables before we pass the referencing variable to the AST consumer. 
You can see this happening here: https://godbolt.org/z/4sj1Y7W4G (look at the order in which we get the warnings: for `Fib<5>` then `x` then `Fib<3>`, then `Fib<2>`, then `Fib<4>`, and note that we initialize `Fib<5>` first, because it's the first thing added to the consumer. Then `Fib<2>`, `Fib<3>`, and `Fib<4>` get passed to the consumer in that order, because that is the order in which the instantiations of their definitions finish. But `Fib<5>`'s instantiation finishes first, because that's a non-recursive instantiation. Without the explicit instantiation, we pass the variables to the AST consumer in the order `Fib<2>`, `Fib<3>`, `Fib<4>`, `Fib<5>`: https://godbolt.org/z/sax1dvh7z leading to an "intuitive" result. They end up dependency-ordered because that's the order in which their instantiations happen to finish. (Example with a static data member: https://godbolt.org/z/9fsbPvWTP) Another factor (and the reason why my examples are working and @ychen's very similar examples are not) is `CodeGenModule`'s deferred emission of variables. If an instantiated variable has an initializer with no side effects, `CodeGenModule` won't emit it unless it emits a reference to it (there's no point emitting something that will just be discarded by the linker). And `CodeGenModule`'s process for emitting deferred declarations does all kinds of reordering. The way this works is: `CodeGenModule` is handed the globals, sees they don't need to be emitted, and ignores them. 
Then:

- it emits a reference to `Fib<5>` and decides that `Fib<5>` needs to be emitted and adds it to a queue of deferred declarations to emit
- it emits the deferred declaration `Fib<5>`, which adds `Fib<3>` and `Fib<4>` to a new queue for things used by `Fib<5>`
- it emits the things used by `Fib<5>`: `Fib<3>` and `Fib<4>`
  - emitting `Fib<3>` adds `Fib<2>` to a new queue for things used by `Fib<3>`, so `Fib<2>` is emitted next
  - emitting `Fib<4>` adds nothing new to the queue

So when the initializers don't have side-effects, the variables are initialized in the order `Fib<5>`, `Fib<3>`, `Fib<2>`, `Fib<4>`.

So the "reversing" has at least two sources (beyond anything that globalopt or the linker might do):

- non-recursive instantiations in `Sema` will cause an instantiated variable to be passed to the consumer before the things it references; this can mostly be avoided by not using explicit instantiations
- instantiated variables with side-effect-free initializers have their initializers emitted on use rather than in instantiation order; this can be avoided by building with `-femit-all-decls` or by adding a dummy side-effect to the initializer

If we want to fix just the `CodeGen` side of this, I think the thing to do would be to follow the model that `CodeGen` uses for ordered initialization (it tracks a `DelayedCXXInitPosition` map giving the order in which the variables should be initialized, if their initializers are actually emitted). We could do the same thing for instantiated variables, allocating each one handed to `CodeGen` a slot which either gets filled in with that initializer, or doesn't get emitted if the variable is not emitted.

But, as noted above, I'm not convinced it's worth it unless this leads to some actual user-facing behavior guarantee.
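Editorial sketch: the deferred-emission walk described in the comment above can be modelled as a small simulation. This is not `CodeGenModule` code; `DepGraph`, `emitDeferred`, `emissionOrder` and `fibGraph` are hypothetical names, and the model assumes that each emitted variable gets a fresh queue holding only the references that were not queued before.

```cpp
#include <map>
#include <set>
#include <string>
#include <vector>

// Hypothetical model: each instantiated variable maps to the instantiated
// variables its initializer references, in source order. The explicit
// specializations Fib<0> and Fib<1> are left out of the graph.
using DepGraph = std::map<std::string, std::vector<std::string>>;

// Emit `name`, then emit everything it newly references, depth first.
// `seen` records every variable that has already been queued.
void emitDeferred(const DepGraph &deps, const std::string &name,
                  std::set<std::string> &seen,
                  std::vector<std::string> &order) {
  order.push_back(name);
  // Fresh queue holding only references that were not queued before.
  std::vector<std::string> queue;
  auto it = deps.find(name);
  if (it != deps.end())
    for (const std::string &dep : it->second)
      if (seen.insert(dep).second)
        queue.push_back(dep);
  for (const std::string &dep : queue)
    emitDeferred(deps, dep, seen, order);
}

// Order in which initializers would be emitted once `root` is referenced.
std::vector<std::string> emissionOrder(const DepGraph &deps,
                                       const std::string &root) {
  std::set<std::string> seen{root};
  std::vector<std::string> order;
  emitDeferred(deps, root, seen, order);
  return order;
}

// The Fib example from the comment, restricted to implicit instantiations.
DepGraph fibGraph() {
  return {{"Fib<5>", {"Fib<3>", "Fib<4>"}},
          {"Fib<4>", {"Fib<2>", "Fib<3>"}},
          {"Fib<3>", {"Fib<2>"}},
          {"Fib<2>", {}}};
}
```

Under these assumptions, `emissionOrder(fibGraph(), "Fib<5>")` yields `Fib<5>`, `Fib<3>`, `Fib<2>`, `Fib<4>`, the same order the comment reports for side-effect-free initializers.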
rnk:

> TU 1 defines inline variable A.
> TU 2 defines inline variable A and then inline variable B.
> The standard guarantees that A is initialized before B in this scenario. But if (somehow) the linker picks the definition of A from TU 2, but orders the initializers from TU 1 first, then the resulting global_ctors order will be B then A, which is not allowed.

Yes, but we only get the ordering guarantee if A is defined before B in all TUs, and both MSVC and Clang seem to emit all inline globals with dynamic initializers, even if they are not ODR used: https://gcc.godbolt.org/z/MhKvGqTez

So, in your example, TU1 must not have a definition of B, meaning there is no guarantee of ordering. All we need to do to fix the inline variable conformance bug is for LLVM to guarantee an initialization order in global_ctors. However, the bug isn't observable in practice because inline globals require guard variables, and globalopt can't optimize those yet.

I don't want to get too deep into the clang implementation details. I trust you that it's complicated. My perspective is just that, if we have a working ordering solution for inline variables, we should use it, if it is simple and low cost, for instantiated variables. It sounds to me like ordering instantiations is not cheap and easy, so we shouldn't do it. And, this change in particular doesn't address many cases in practice.
rsmith:

> > TU 1 defines inline variable A.
> > TU 2 defines inline variable A and then inline variable B.
> > The standard guarantees that A is initialized before B in this scenario. But if (somehow) the linker picks the definition of A from TU 2, but orders the initializers from TU 1 first, then the resulting global_ctors order will be B then A, which is not allowed.
>
> Yes, but we only get the ordering guarantee if A is defined before B in all TUs,

That's not quite right. We get the guarantee if A is defined before B in all TUs where B is defined. It's OK for there to be TUs where A is defined but B is not. (See the relevant language rule, which requires only that "for every definition of [B] there exists a definition of [A]" and not the other way around.)

The intended user model is that it's OK for B to rely on A if they're defined in the same header file, or if A is defined in a header file that B includes. And it's OK if there's a source file that includes A's header but not B's.

The intended implementation model is that inline variables are initialized (as if) in definition order within each TU they are defined in, with a guard variable preventing repeated initialization.
ychen (Author):

> TU 1 defines inline variable A.
> TU 2 defines inline variable A and then inline variable B.
> The standard guarantees that A is initialized before B in this scenario. But if (somehow) the linker picks the definition of A from TU 2, but orders the initializers from TU 1 first, then the resulting global_ctors order will be B then A, which is not allowed.

If I understand this example correctly, a consistent linker that always picks the last COMDAT would also produce the wrong result.
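Editorial sketch: the "intended implementation model" from this thread (each TU initializes its inline variables in definition order, with a shared guard preventing repeated initialization) can be illustrated with a toy simulation. `GuardedInit`, `runTU` and `simulate` are hypothetical names, and the model assumes that each TU's initializers run as one contiguous block, which is exactly the property the thread says the linker must preserve.

```cpp
#include <string>
#include <vector>

// Hypothetical model of one inline variable's guarded initializer.
struct GuardedInit {
  std::string name;
  bool *guard;  // shared by all TUs that define the variable
};

// Run one TU's initializers in definition order, skipping any variable
// whose guard shows that another TU already initialized it.
void runTU(const std::vector<GuardedInit> &tu, std::vector<std::string> &log) {
  for (const GuardedInit &init : tu) {
    if (*init.guard)
      continue;
    *init.guard = true;
    log.push_back(init.name);
  }
}

// TU 1 defines A; TU 2 defines A and then B. Returns the order in which
// the variables are actually initialized for the chosen TU order.
std::vector<std::string> simulate(bool tu2First) {
  bool guardA = false, guardB = false;
  std::vector<GuardedInit> tu1{{"A", &guardA}};
  std::vector<GuardedInit> tu2{{"A", &guardA}, {"B", &guardB}};
  std::vector<std::string> log;
  if (tu2First) {
    runTU(tu2, log);
    runTU(tu1, log);
  } else {
    runTU(tu1, log);
    runTU(tu2, log);
  }
  return log;
}
```

In this model A is initialized before B whichever TU runs first. The failure discussed in the thread corresponds to interleaving entries from different TUs, which the sketch deliberately does not do.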
D->hasAttr<SelectAnyAttr>()) {		D->hasAttr<SelectAnyAttr>()) {
// C++ [basic.start.init]p2:		// C++ [basic.start.init]p2:
// Definitions of explicitly specialized class template static data		// Definitions of explicitly specialized class template static data
// members have ordered initialization. Other class template static data		// members have ordered initialization. Other class template static data
// members (i.e., implicitly or explicitly instantiated specializations)		// members (i.e., implicitly or explicitly instantiated specializations)
// have unordered initialization.		// have unordered initialization.
//		//
// As a consequence, we can put them into their own llvm.global_ctors entry.		// As a consequence, we can put them into their own llvm.global_ctors entry.
//		//
// If the global is externally visible, put the initializer into a COMDAT		// If the global is externally visible, put the initializer into a COMDAT
// group with the global being initialized. On most platforms, this is a		// group with the global being initialized. On most platforms, this is a
// minor startup time optimization. In the MS C++ ABI, there are no guard		// minor startup time optimization. In the MS C++ ABI, there are no guard
// variables, so this COMDAT key is required for correctness.		// variables, so this COMDAT key is required for correctness.
//		//
// SelectAny globals will be comdat-folded. Put the initializer into a		// SelectAny globals will be comdat-folded. Put the initializer into a
// COMDAT group associated with the global, so the initializers get folded		// COMDAT group associated with the global, so the initializers get folded
// too.		// too.

AddGlobalCtor(Fn, 65535, COMDATKey);		AddGlobalCtor(Fn, 65535, COMDATKey, IsInstantiation);
rsmith:

This is effectively initializing instantiated variables in reverse instantiation order. That seems like it'll make things worse as much as it makes things better. For example, given:

    #include <iostream>
    template<typename T> int a = (std::cout << "hello, ", 0);
    template<typename T> int b = (std::cout << "world", 0);
    int main() {
      (void)a<int>;
      (void)b<int>;
    }

... we currently print `"hello, world"`, but with this change we'll print `"worldhello, "`.

If we want a sensible initialization order, I think we need a different strategy, that will probably require `Sema` to be a lot more careful about what order it instantiates variables in and what order it passes them to the AST consumer: if an instantiation A triggers another instantiation B, we should defer passing A to the consumer until B has been instantiated and passed to the consumer.

That's probably not too hard to implement, by adding an entry to the pending instantiation list to say "now pass this to the consumer" in the case where one instantiation triggers another. But I do wonder whether that level of complexity is worthwhile, given that code relying on this behavior is broken.
rsmith:

Please see my other (long) comment; `Sema` actually already does what I describe here, except in the cases where it does an eager, non-recursive instantiation.
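Editorial sketch: the rule described in this thread (an instantiation is passed to the consumer only after every instantiation it triggers has been passed) amounts to a post-order walk of the reference graph. The code below is a toy model, not `Sema` code; `InstGraph`, `instantiate` and `consumerOrder` are hypothetical names.

```cpp
#include <map>
#include <set>
#include <string>
#include <vector>

// Hypothetical model: variable name -> instantiated variables it references.
using InstGraph = std::map<std::string, std::vector<std::string>>;

// Recursively instantiate `name`; it is handed to the consumer only after
// everything it references has been handed over (post-order).
void instantiate(const InstGraph &deps, const std::string &name,
                 std::set<std::string> &started,
                 std::vector<std::string> &passedToConsumer) {
  if (!started.insert(name).second)
    return;  // already instantiated, or instantiation is in progress
  auto it = deps.find(name);
  if (it != deps.end())
    for (const std::string &dep : it->second)
      instantiate(deps, dep, started, passedToConsumer);
  passedToConsumer.push_back(name);
}

// Order in which variables reach the consumer for a list of first uses.
std::vector<std::string> consumerOrder(const InstGraph &deps,
                                       const std::vector<std::string> &uses) {
  std::set<std::string> started;
  std::vector<std::string> out;
  for (const std::string &use : uses)
    instantiate(deps, use, started, out);
  return out;
}
```

With the `Fib` references from the longer comment, a single use of `Fib<5>` gives `Fib<2>`, `Fib<3>`, `Fib<4>`, `Fib<5>`, and two independent variables such as `a<int>` and `b<int>` keep their order of first use.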
if (COMDATKey && (getTriple().isOSBinFormatELF() ||		if (COMDATKey && (getTriple().isOSBinFormatELF() ||
getTarget().getCXXABI().isMicrosoft())) {		getTarget().getCXXABI().isMicrosoft())) {
// When COMDAT is used on ELF or in the MS C++ ABI, the key must be in		// When COMDAT is used on ELF or in the MS C++ ABI, the key must be in
// llvm.used to prevent linker GC.		// llvm.used to prevent linker GC.
addUsedGlobal(COMDATKey);		addUsedGlobal(COMDATKey);
}		}

// If we used a COMDAT key for the global ctor, the init function can be		// If we used a COMDAT key for the global ctor, the init function can be

clang/lib/CodeGen/CodeGenModule.h

Show First 20 Lines • Show All 1,538 Lines • ▼ Show 20 Lines
void EmitCXXGlobalVarDeclInitFunc(const VarDecl *D,		void EmitCXXGlobalVarDeclInitFunc(const VarDecl *D,
llvm::GlobalVariable *Addr,		llvm::GlobalVariable *Addr,
bool PerformInit);		bool PerformInit);

void EmitPointerToInitFunc(const VarDecl *VD, llvm::GlobalVariable *Addr,		void EmitPointerToInitFunc(const VarDecl *VD, llvm::GlobalVariable *Addr,
llvm::Function *InitFunc, InitSegAttr *ISA);		llvm::Function *InitFunc, InitSegAttr *ISA);

// FIXME: Hardcoding priority here is gross.		// FIXME: Hardcoding priority here is gross.
void AddGlobalCtor(llvm::Function *Ctor, int Priority = 65535,		void AddGlobalCtor(llvm::Function *Ctor, int Priority = 65535,
llvm::Constant *AssociatedData = nullptr);		llvm::Constant *AssociatedData = nullptr,
		bool InsertFront = false);
void AddGlobalDtor(llvm::Function *Dtor, int Priority = 65535,		void AddGlobalDtor(llvm::Function *Dtor, int Priority = 65535,
bool IsDtorAttrFunc = false);		bool IsDtorAttrFunc = false);

/// EmitCtorList - Generates a global array of functions and priorities using		/// EmitCtorList - Generates a global array of functions and priorities using
/// the given list and name. This array will have appending linkage and is		/// the given list and name. This array will have appending linkage and is
/// suitable for use as a LLVM constructor or destructor array. Clears Fns.		/// suitable for use as a LLVM constructor or destructor array. Clears Fns.
void EmitCtorList(CtorList &Fns, const char *GlobalName);		void EmitCtorList(CtorList &Fns, const char *GlobalName);


clang/lib/CodeGen/CodeGenModule.cpp


	Show First 20 Lines • Show All 1,550 Lines • ▼ Show 20 Lines

	llvm::GlobalValue *CodeGenModule::GetGlobalValue(StringRef Name) {			llvm::GlobalValue *CodeGenModule::GetGlobalValue(StringRef Name) {
	return getModule().getNamedValue(Name);			return getModule().getNamedValue(Name);
	}			}

	/// AddGlobalCtor - Add a function to the list that will be called before			/// AddGlobalCtor - Add a function to the list that will be called before
	/// main() runs.			/// main() runs.
	void CodeGenModule::AddGlobalCtor(llvm::Function *Ctor, int Priority,			void CodeGenModule::AddGlobalCtor(llvm::Function *Ctor, int Priority,
	llvm::Constant *AssociatedData) {			llvm::Constant *AssociatedData,
				bool InsertFront) {
	// FIXME: Type coercion of void()* types.			// FIXME: Type coercion of void()* types.
	GlobalCtors.push_back(Structor(Priority, Ctor, AssociatedData));			if (InsertFront)
				GlobalCtors.emplace(GlobalCtors.begin(), Priority, Ctor, AssociatedData);
rsmith: This is quadratic in the number of instantiated variables. I don't think that's acceptable.
				else
				GlobalCtors.emplace_back(Priority, Ctor, AssociatedData);
	}			}

	/// AddGlobalDtor - Add a function to the list that will be called			/// AddGlobalDtor - Add a function to the list that will be called
	/// when the module is unloaded.			/// when the module is unloaded.
	void CodeGenModule::AddGlobalDtor(llvm::Function *Dtor, int Priority,			void CodeGenModule::AddGlobalDtor(llvm::Function *Dtor, int Priority,
	bool IsDtorAttrFunc) {			bool IsDtorAttrFunc) {
	if (CodeGenOpts.RegisterGlobalDtorsWithAtExit &&			if (CodeGenOpts.RegisterGlobalDtorsWithAtExit &&
	(!getContext().getTargetInfo().getTriple().isOSAIX() || IsDtorAttrFunc)) {			(!getContext().getTargetInfo().getTriple().isOSAIX() || IsDtorAttrFunc)) {
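Editorial sketch: rsmith's inline comment on `AddGlobalCtor` points out that inserting at the front of a vector costs time proportional to its length, so n front insertions are quadratic. One way to get the same final order in linear time is to append front-inserted entries to a separate list and reverse that list once. This is an illustration only; it is neither what the patch does nor what the reviewers proposed, and `CtorListModel`, `add` and `finalize` are hypothetical names (the real list stores `Structor` entries, not strings).

```cpp
#include <string>
#include <vector>

// Hypothetical stand-in for the global ctor list.
struct CtorListModel {
  std::vector<std::string> ordered;       // entries added at the back
  std::vector<std::string> instantiated;  // entries that would go to the front

  // Amortized O(1) per call, unlike inserting at begin() of one vector.
  void add(const std::string &ctor, bool insertFront) {
    (insertFront ? instantiated : ordered).push_back(ctor);
  }

  // Single O(n) pass: front-inserted entries in reverse order of addition,
  // followed by the back-inserted entries in order of addition.
  std::vector<std::string> finalize() const {
    std::vector<std::string> result(instantiated.rbegin(),
                                    instantiated.rend());
    result.insert(result.end(), ordered.begin(), ordered.end());
    return result;
  }
};
```

Adding `x1` (front), `o1` (back) and `x2` (front) and then calling `finalize()` gives `x2`, `x1`, `o1`, which is the sequence that repeated front insertion would produce.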

clang/test/CodeGenCXX/aix-static-init-temp-spec-and-inline-var.cpp

	Show First 20 Lines • Show All 41 Lines • ▼ Show 20 Lines
	}			}
	template <>			template <>
	A<int> A<int>::instance = bar();			A<int> A<int>::instance = bar();
	} // namespace test2			} // namespace test2

	// CHECK: @_ZGVN5test12t2E = linkonce_odr global i64 0, align 8			// CHECK: @_ZGVN5test12t2E = linkonce_odr global i64 0, align 8
	// CHECK: @_ZGVN5test21AIvE8instanceE = weak_odr global i64 0, align 8			// CHECK: @_ZGVN5test21AIvE8instanceE = weak_odr global i64 0, align 8
	// CHECK: @_ZGVN5test12t1IiEE = linkonce_odr global i64 0, align 8			// CHECK: @_ZGVN5test12t1IiEE = linkonce_odr global i64 0, align 8
	// CHECK: @llvm.global_ctors = appending global [4 x { i32, void ()*, i8* }] [{ i32, void ()*, i8* } { i32 65535, void ()* @__cxx_global_var_init.1, i8* null }, { i32, void ()*, i8* } { i32 65535, void ()* @__cxx_global_var_init.2, i8* null }, { i32, void ()*, i8* } { i32 65535, void ()* @__cxx_global_var_init.4, i8* null }, { i32, void ()*, i8* } { i32 65535, void ()* @_GLOBAL__sub_I__, i8* null }]			// CHECK: @llvm.global_ctors = appending global [4 x { i32, void ()*, i8* }] [{ i32, void ()*, i8* } { i32 65535, void ()* @__cxx_global_var_init.4, i8* null }, { i32, void ()*, i8* } { i32 65535, void ()* @__cxx_global_var_init.2, i8* null }, { i32, void ()*, i8* } { i32 65535, void ()* @__cxx_global_var_init.1, i8* null }, { i32, void ()*, i8* } { i32 65535, void ()* @_GLOBAL__sub_I__, i8* null }]
	// CHECK: @llvm.global_dtors = appending global [4 x { i32, void ()*, i8* }] [{ i32, void ()*, i8* } { i32 65535, void ()* @__finalize__ZN5test12t2E, i8* null }, { i32, void ()*, i8* } { i32 65535, void ()* @__finalize__ZN5test21AIvE8instanceE, i8* null }, { i32, void ()*, i8* } { i32 65535, void ()* @__finalize__ZN5test12t1IiEE, i8* null }, { i32, void ()*, i8* } { i32 65535, void ()* @_GLOBAL__D_a, i8* null }]			// CHECK: @llvm.global_dtors = appending global [4 x { i32, void ()*, i8* }] [{ i32, void ()*, i8* } { i32 65535, void ()* @__finalize__ZN5test12t2E, i8* null }, { i32, void ()*, i8* } { i32 65535, void ()* @__finalize__ZN5test21AIvE8instanceE, i8* null }, { i32, void ()*, i8* } { i32 65535, void ()* @__finalize__ZN5test12t1IiEE, i8* null }, { i32, void ()*, i8* } { i32 65535, void ()* @_GLOBAL__D_a, i8* null }]

	// CHECK: define internal void @__cxx_global_var_init() [[ATTR:#[0-9]+]] {			// CHECK: define internal void @__cxx_global_var_init() [[ATTR:#[0-9]+]] {
	// CHECK: entry:			// CHECK: entry:
	// CHECK32: call void @_ZN5test15Test1C1Ei(%"struct.test1::Test1"* noundef{{[^,]*}} @_ZN5test12t0E, i32 noundef 2)			// CHECK32: call void @_ZN5test15Test1C1Ei(%"struct.test1::Test1"* noundef{{[^,]*}} @_ZN5test12t0E, i32 noundef 2)
	// CHECK64: call void @_ZN5test15Test1C1Ei(%"struct.test1::Test1"* noundef{{[^,]*}} @_ZN5test12t0E, i32 noundef signext 2)			// CHECK64: call void @_ZN5test15Test1C1Ei(%"struct.test1::Test1"* noundef{{[^,]*}} @_ZN5test12t0E, i32 noundef signext 2)
	// CHECK: %0 = call i32 @atexit(void ()* @__dtor__ZN5test12t0E)			// CHECK: %0 = call i32 @atexit(void ()* @__dtor__ZN5test12t0E)
	// CHECK: ret void			// CHECK: ret void

clang/test/CodeGenCXX/microsoft-abi-static-initializers.cpp

	// RUN: %clang_cc1 -no-opaque-pointers -fms-extensions -fno-threadsafe-statics -emit-llvm %s -o - -mconstructor-aliases -triple=i386-pc-win32 | FileCheck %s			// RUN: %clang_cc1 -no-opaque-pointers -fms-extensions -fno-threadsafe-statics -emit-llvm %s -o - -mconstructor-aliases -triple=i386-pc-win32 | FileCheck %s

	// CHECK: @llvm.global_ctors = appending global [5 x { i32, void ()*, i8* }] [			// CHECK: @llvm.global_ctors = appending global [5 x { i32, void ()*, i8* }] [
				// CHECK: { i32, void ()*, i8* } { i32 65535, void ()* @"??__E?foo@?$B@H@@2VA@@A@@YAXXZ", i8* bitcast (%class.A* @"?foo@?$B@H@@2VA@@A" to i8*) },
				// CHECK: { i32, void ()*, i8* } { i32 65535, void ()* @"??__E?s@?$ExportedTemplate@H@@2US@@A@@YAXXZ", i8* getelementptr inbounds (%struct.S, %struct.S* @"?s@?$ExportedTemplate@H@@2US@@A", i32 0, i32 0) },
	// CHECK: { i32, void ()*, i8* } { i32 65535, void ()* @"??__Eselectany1@@YAXXZ", i8* getelementptr inbounds (%struct.S, %struct.S* @"?selectany1@@3US@@A", i32 0, i32 0) },			// CHECK: { i32, void ()*, i8* } { i32 65535, void ()* @"??__Eselectany1@@YAXXZ", i8* getelementptr inbounds (%struct.S, %struct.S* @"?selectany1@@3US@@A", i32 0, i32 0) },
	// CHECK: { i32, void ()*, i8* } { i32 65535, void ()* @"??__Eselectany2@@YAXXZ", i8* getelementptr inbounds (%struct.S, %struct.S* @"?selectany2@@3US@@A", i32 0, i32 0) },			// CHECK: { i32, void ()*, i8* } { i32 65535, void ()* @"??__Eselectany2@@YAXXZ", i8* getelementptr inbounds (%struct.S, %struct.S* @"?selectany2@@3US@@A", i32 0, i32 0) },
	// CHECK: { i32, void ()*, i8* } { i32 65535, void ()* @"??__E?s@?$ExportedTemplate@H@@2US@@A@@YAXXZ", i8* getelementptr inbounds (%struct.S, %struct.S* @"?s@?$ExportedTemplate@H@@2US@@A", i32 0, i32 0) },
	// CHECK: { i32, void ()*, i8* } { i32 65535, void ()* @"??__E?foo@?$B@H@@2VA@@A@@YAXXZ", i8* bitcast (%class.A* @"?foo@?$B@H@@2VA@@A" to i8*) },
	// CHECK: { i32, void ()*, i8* } { i32 65535, void ()* @_GLOBAL__sub_I_microsoft_abi_static_initializers.cpp, i8* null }			// CHECK: { i32, void ()*, i8* } { i32 65535, void ()* @_GLOBAL__sub_I_microsoft_abi_static_initializers.cpp, i8* null }
	// CHECK: ]			// CHECK: ]

	struct S {			struct S {
	S();			S();
	~S();			~S();
	};			};


clang/test/CodeGenCXX/static-member-variable-explicit-specialization.cpp

	Show All 23 Lines
	template<typename T> int A<T>::a = foo();			template<typename T> int A<T>::a = foo();

	// ALLK-NOT: @_ZN1AIcE1aE			// ALLK-NOT: @_ZN1AIcE1aE
	template<> int A<char>::a;			template<> int A<char>::a;

	// ALL: @_ZN1AIbE1aE ={{.*}} global i32 10			// ALL: @_ZN1AIbE1aE ={{.*}} global i32 10
	template<> int A<bool>::a = 10;			template<> int A<bool>::a = 10;

	// ALL: @llvm.global_ctors = appending global [8 x { i32, void ()*, i8* }]			// ALL: @llvm.global_ctors = appending global [11 x { i32, void ()*, i8* }]

	// ELF: [{ i32, void ()*, i8* } { i32 65535, void ()* @[[unordered1:[^,]*]], i8* bitcast (i32* @_ZN1AIsE1aE to i8*) },			// ELF: [{ i32, void ()*, i8* } { i32 65535, void ()* @[[unordered10:[^,]*]], i8* bitcast (i32* @_ZN1RILi1EE1aE to i8*) },
	// MACHO: [{ i32, void ()*, i8* } { i32 65535, void ()* @[[unordered1:[^,]*]], i8* null },			// MACHO: [{ i32, void ()*, i8* } { i32 65535, void ()* @[[unordered10:[^,]*]], i8* null },

	// ELF: { i32, void ()*, i8* } { i32 65535, void ()* @[[unordered2:[^,]*]], i8* bitcast (i16* @_Z1xIsE to i8*) },			// ELF: { i32, void ()*, i8* } { i32 65535, void ()* @[[unordered9:[^,]*]], i8* bitcast (i32* @_ZN1RILi2EE1aE to i8*) },
	// MACHO: { i32, void ()*, i8* } { i32 65535, void ()* @[[unordered2:[^,]*]], i8* null },			// MACHO: { i32, void ()*, i8* } { i32 65535, void ()* @[[unordered9:[^,]*]], i8* null },

	// ELF: { i32, void ()*, i8* } { i32 65535, void ()* @[[unordered3:[^,]*]], i8* bitcast (i32* @_ZN2ns1aIiE1iE to i8*) },			// ELF: { i32, void ()*, i8* } { i32 65535, void ()* @[[unordered8:[^,]*]], i8* bitcast (i32* @_ZN1RILi3EE1aE to i8*) },
	// MACHO: { i32, void ()*, i8* } { i32 65535, void ()* @[[unordered3:[^,]*]], i8* null },			// MACHO: { i32, void ()*, i8* } { i32 65535, void ()* @[[unordered8:[^,]*]], i8* null },

	// ELF: { i32, void ()*, i8* } { i32 65535, void ()* @[[unordered4:[^,]*]], i8* bitcast (i32* @_ZN2ns1b1iIiEE to i8*) },			// ALL: { i32, void ()*, i8* } { i32 65535, void ()* @[[unordered7:[^,]*]], i8* null },
	// MACHO: { i32, void ()*, i8* } { i32 65535, void ()* @[[unordered4:[^,]*]], i8* null },
				// ELF: { i32, void ()*, i8* } { i32 65535, void ()* @[[unordered6:[^,]*]], i8* @_Z1xIcE },
				// MACHO: { i32, void ()*, i8* } { i32 65535, void ()* @[[unordered6:[^,]*]], i8* null },

	// ELF: { i32, void ()*, i8* } { i32 65535, void ()* @[[unordered5:[^,]*]], i8* bitcast (i32* @_ZN1AIvE1aE to i8*) },			// ELF: { i32, void ()*, i8* } { i32 65535, void ()* @[[unordered5:[^,]*]], i8* bitcast (i32* @_ZN1AIvE1aE to i8*) },
	// MACHO: { i32, void ()*, i8* } { i32 65535, void ()* @[[unordered5:[^,]*]], i8* null },			// MACHO: { i32, void ()*, i8* } { i32 65535, void ()* @[[unordered5:[^,]*]], i8* null },

	// ELF: { i32, void ()*, i8* } { i32 65535, void ()* @[[unordered6:[^,]*]], i8* @_Z1xIcE },			// ELF: { i32, void ()*, i8* } { i32 65535, void ()* @[[unordered4:[^,]*]], i8* bitcast (i32* @_ZN2ns1b1iIiEE to i8*) },
	// MACHO: { i32, void ()*, i8* } { i32 65535, void ()* @[[unordered6:[^,]*]], i8* null },			// MACHO: { i32, void ()*, i8* } { i32 65535, void ()* @[[unordered4:[^,]*]], i8* null },

	// ALL: { i32, void ()*, i8* } { i32 65535, void ()* @[[unordered7:[^,]*]], i8* null },			// ELF: { i32, void ()*, i8* } { i32 65535, void ()* @[[unordered3:[^,]*]], i8* bitcast (i32* @_ZN2ns1aIiE1iE to i8*) },
				// MACHO: { i32, void ()*, i8* } { i32 65535, void ()* @[[unordered3:[^,]*]], i8* null },

				// ELF: { i32, void ()*, i8* } { i32 65535, void ()* @[[unordered2:[^,]*]], i8* bitcast (i16* @_Z1xIsE to i8*) },
				// MACHO: { i32, void ()*, i8* } { i32 65535, void ()* @[[unordered2:[^,]*]], i8* null },

				// ELF: { i32, void ()*, i8* } { i32 65535, void ()* @[[unordered1:[^,]*]], i8* bitcast (i32* @_ZN1AIsE1aE to i8*) },
				// MACHO: { i32, void ()*, i8* } { i32 65535, void ()* @[[unordered1:[^,]*]], i8* null },

	// ALL: { i32, void ()*, i8* } { i32 65535, void ()* @_GLOBAL__sub_I_static_member_variable_explicit_specialization.cpp, i8* null }]			// ALL: { i32, void ()*, i8* } { i32 65535, void ()* @_GLOBAL__sub_I_static_member_variable_explicit_specialization.cpp, i8* null }]

	/// llvm.used ensures SHT_INIT_ARRAY in a section group cannot be GCed.			/// llvm.used ensures SHT_INIT_ARRAY in a section group cannot be GCed.
	// ELF: @llvm.used = appending global [6 x i8*] [i8* bitcast (i32* @_ZN1AIsE1aE to i8*), i8* bitcast (i16* @_Z1xIsE to i8*), i8* bitcast (i32* @_ZN2ns1aIiE1iE to i8*), i8* bitcast (i32* @_ZN2ns1b1iIiEE to i8*), i8* bitcast (i32* @_ZN1AIvE1aE to i8*), i8* @_Z1xIcE]			// ELF: @llvm.used = appending global [9 x i8*] [i8* bitcast (i32* @_ZN1AIsE1aE to i8*), i8* bitcast (i16* @_Z1xIsE to i8*), i8* bitcast (i32* @_ZN2ns1aIiE1iE to i8*), i8* bitcast (i32* @_ZN2ns1b1iIiEE to i8*), i8* bitcast (i32* @_ZN1AIvE1aE to i8*), i8* @_Z1xIcE, i8* bitcast (i32* @_ZN1RILi3EE1aE to i8*), i8* bitcast (i32* @_ZN1RILi2EE1aE to i8*), i8* bitcast (i32* @_ZN1RILi1EE1aE to i8*)]

	template int A<short>::a; // Unordered			template int A<short>::a; // Unordered
	int b = foo();			int b = foo();
	int c = foo();			int c = foo();
	int d = A<void>::a; // Unordered			int d = A<void>::a; // Unordered

	// An explicit specialization is ordered, and goes in __GLOBAL_sub_I_static_member_variable_explicit_specialization.cpp.			// An explicit specialization is ordered, and goes in __GLOBAL_sub_I_static_member_variable_explicit_specialization.cpp.
	template<> struct A<int> { static int a; };			template<> struct A<int> { static int a; };
	Show All 23 Lines
	}			}

	namespace {			namespace {
	template<typename T> struct Internal { static int a; };			template<typename T> struct Internal { static int a; };
	template<typename T> int Internal<T>::a = foo();			template<typename T> int Internal<T>::a = foo();
	}			}
	int *use_internal_a = &Internal<int>::a;			int *use_internal_a = &Internal<int>::a;

				template<int n> struct R { static int a; };
				template<> int R<0>::a = 0;
				template<int n> int R<n>::a = R<n - 1>::a + 1;
				int f = R<3>::a;

	#endif			#endif

	// ALL: define internal void @[[unordered1]](			// ALL: define internal void @[[unordered1]](
	// ALL: call i32 @foo()			// ALL: call i32 @foo()
	// ALL: store {{.*}} @_ZN1AIsE1aE			// ALL: store {{.*}} @_ZN1AIsE1aE
	// ALL: ret			// ALL: ret

	// ALL: define internal void @[[unordered2]](			// ALL: define internal void @[[unordered2]](
	Show All 21 Lines
	// ALL: store {{.*}} @_Z1xIcE			// ALL: store {{.*}} @_Z1xIcE
	// ALL: ret			// ALL: ret

	// ALL: define internal void @[[unordered7]](			// ALL: define internal void @[[unordered7]](
	// ALL: call i32 @foo()			// ALL: call i32 @foo()
	// ALL: store {{.*}} @_ZN12_GLOBAL__N_18InternalIiE1aE			// ALL: store {{.*}} @_ZN12_GLOBAL__N_18InternalIiE1aE
	// ALL: ret			// ALL: ret

				// ALL: define internal void @[[unordered8]](
				// ALL: store {{.*}} @_ZGVN1RILi3EE1aE
				// ALL: ret

				// ALL: define internal void @[[unordered9]](
				// ALL: store {{.*}} @_ZGVN1RILi2EE1aE
				// ALL: ret

				// ALL: define internal void @[[unordered10]](
				// ALL: store {{.*}} @_ZGVN1RILi1EE1aE
				// ALL: ret

	// ALL: define internal void @_GLOBAL__sub_I_static_member_variable_explicit_specialization.cpp()			// ALL: define internal void @_GLOBAL__sub_I_static_member_variable_explicit_specialization.cpp()
	// We call unique stubs for every ordered dynamic initializer in the TU.			// We call unique stubs for every ordered dynamic initializer in the TU.
	// ALL: call			// ALL: call
	// ALL: call			// ALL: call
	// ALL: call			// ALL: call
	// ALL: call			// ALL: call
	// ALL: call			// ALL: call
	// ALL: call			// ALL: call
	// ALL: call			// ALL: call
	// ALL: call			// ALL: call
				// ALL: call
	// ALL-NOT: call			// ALL-NOT: call
	// ALL: ret			// ALL: ret

clang/test/Modules/initializers.cpp

	Show First 20 Lines • Show All 148 Lines • ▼ Show 20 Lines
	// CHECK-DAG: @[[XD:_ZN(2ns)?1XIiE1dE]] = linkonce_odr thread_local global i32 0, comdat, align 4			// CHECK-DAG: @[[XD:_ZN(2ns)?1XIiE1dE]] = linkonce_odr thread_local global i32 0, comdat, align 4
	// CHECK-DAG: @[[XE:_ZN(2ns)?1XIiE1eIiEE]] = linkonce_odr global i32 0, comdat, align 4			// CHECK-DAG: @[[XE:_ZN(2ns)?1XIiE1eIiEE]] = linkonce_odr global i32 0, comdat, align 4
	// CHECK-DAG: @[[XF:_ZN(2ns)?1XIiE1fIiEE]] = linkonce_odr global i32 0, comdat, align 4			// CHECK-DAG: @[[XF:_ZN(2ns)?1XIiE1fIiEE]] = linkonce_odr global i32 0, comdat, align 4
	// CHECK-DAG: @[[XG:_ZN(2ns)?1XIiE1gIiEE]] = linkonce_odr thread_local global i32 0, comdat, align 4			// CHECK-DAG: @[[XG:_ZN(2ns)?1XIiE1gIiEE]] = linkonce_odr thread_local global i32 0, comdat, align 4
	// CHECK-DAG: @[[XH:_ZN(2ns)?1XIiE1hIiEE]] = linkonce_odr thread_local global i32 0, comdat, align 4			// CHECK-DAG: @[[XH:_ZN(2ns)?1XIiE1hIiEE]] = linkonce_odr thread_local global i32 0, comdat, align 4

	// It's OK if the order of the first 6 of these changes.			// It's OK if the order of the first 6 of these changes.
	// CHECK: @llvm.global_ctors = appending global			// CHECK: @llvm.global_ctors = appending global
	// CHECK-SAME: @[[E_INIT:[^,]*]], {{[^@]*}} @[[E]]
	// CHECK-SAME: @[[F_INIT:[^,]*]], {{[^@]*}} @[[F]]
	// CHECK-SAME: @[[XA_INIT:[^,]*]], {{[^@]*}} @[[XA]]
	// CHECK-SAME: @[[XE_INIT:[^,]*]], {{[^@]*}} @[[XE]]
	// CHECK-SAME: @[[XF_INIT:[^,]*]], {{[^@]*}} @[[XF]]
	// CHECK-SAME: @[[XB_INIT:[^,]*]], {{[^@]*}} @[[XB]]			// CHECK-SAME: @[[XB_INIT:[^,]*]], {{[^@]*}} @[[XB]]
				// CHECK-SAME: @[[XF_INIT:[^,]*]], {{[^@]*}} @[[XF]]
				// CHECK-SAME: @[[XE_INIT:[^,]*]], {{[^@]*}} @[[XE]]
				// CHECK-SAME: @[[XA_INIT:[^,]*]], {{[^@]*}} @[[XA]]
				// CHECK-SAME: @[[F_INIT:[^,]*]], {{[^@]*}} @[[F]]
				// CHECK-SAME: @[[E_INIT:[^,]*]], {{[^@]*}} @[[E]]
	// CHECK-IMPORT-SAME: @[[TU_INIT:[^,]*]], i8* null }]			// CHECK-IMPORT-SAME: @[[TU_INIT:[^,]*]], i8* null }]

	// FIXME: Should this use __cxa_guard_acquire?			// FIXME: Should this use __cxa_guard_acquire?
	// CHECK: define {{.*}} @[[E_INIT]]()			// CHECK: define {{.*}} @[[E_INIT]]()
	// CHECK: load {{.*}} (i64* @_ZGV			// CHECK: load {{.*}} (i64* @_ZGV
	// CHECK: store {{.*}}, i32* @[[E]],			// CHECK: store {{.*}}, i32* @[[E]],

	// FIXME: Should this use __cxa_guard_acquire?			// FIXME: Should this use __cxa_guard_acquire?
