This is an archive of the discontinued LLVM Phabricator instance.


Differential D155659

[WPD][LLD] Add option to validate RTTI is enabled on all native types and prevent devirtualization on types with native RTTI
ClosedPublic

Authored by modimo on Jul 18 2023, 4:01 PM.

Details

Reviewers

MaskRay
tejohnson

Commits

rG272bd6f9cc86: [WPD][LLD] Add option to validate RTTI is enabled on all native types and…

Summary

Discussion about this approach: https://discourse.llvm.org/t/rfc-safer-whole-program-class-hierarchy-analysis/65144/18

When enabling WPD in an environment where native binaries are present, types we want to optimize can be derived from inside these native files, and devirtualizing them can lead to correctness issues. RTTI can be used to determine all such types in native files and exclude them from WPD, providing a safe, checked way to enable WPD.

The approach is:

1. In the linker, identify whether RTTI is available for all native types. If it is not, then under --lto-validate-all-vtables-have-type-infos, --lto-whole-program-visibility is automatically disabled. This is done by examining all .symtab symbols in object files and .dynsym symbols in DSOs for vtable (_ZTV) and typeinfo (_ZTI) symbols and ensuring there's always a match for every vtable symbol.
2. During thinlink, if --lto-validate-all-vtables-have-type-infos is set and RTTI is available for all native types, identify all typename (_ZTS) symbols via their corresponding typeinfo (_ZTI) symbols that are used natively or outside of our summary and exclude them from WPD.
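
The linker-side check in the first step can be sketched roughly as follows. This is an illustrative sketch, not the code from the patch: `vtablesMissingTypeInfo` is a hypothetical helper, and symbol names are passed in as plain strings rather than read from `.symtab` or `.dynsym`. It pairs each `_ZTV<type>` symbol with `_ZTI<type>` by comparing the mangled suffix.

```cpp
#include <set>
#include <string>
#include <vector>

// Hypothetical helper (not from the patch): returns every vtable symbol
// (_ZTV...) for which no typeinfo symbol (_ZTI...) of the same type exists.
std::vector<std::string>
vtablesMissingTypeInfo(const std::vector<std::string> &symbols) {
  const std::string vtablePrefix = "_ZTV";
  const std::string typeInfoPrefix = "_ZTI";

  // Collect the mangled type names that have a typeinfo symbol.
  std::set<std::string> typesWithTypeInfo;
  for (const std::string &name : symbols)
    if (name.compare(0, typeInfoPrefix.size(), typeInfoPrefix) == 0)
      typesWithTypeInfo.insert(name.substr(typeInfoPrefix.size()));

  // Report each vtable whose type has no typeinfo symbol.
  std::vector<std::string> missing;
  for (const std::string &name : symbols) {
    if (name.compare(0, vtablePrefix.size(), vtablePrefix) != 0)
      continue;
    if (!typesWithTypeInfo.count(name.substr(vtablePrefix.size())))
      missing.push_back(name);
  }
  return missing;
}
```

An empty result corresponds to "RTTI is available for all native types". Type infos without a vtable are tolerated, since only vtables are required to have a match.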

Testing:
ninja check-all
large Meta service that uses boost, glog and libstdc++.so runs successfully with WPD via --lto-whole-program-visibility. Previously, native types in boost caused incorrect devirtualization that led to crashes.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

modimo created this revision. Jul 18 2023, 4:01 PM

Herald added a reviewer: MaskRay. Jul 18 2023, 4:01 PM

Herald added a project: Restricted Project.

Herald added subscribers: hoy, ormris, wenlei and 5 others.

modimo requested review of this revision. Jul 18 2023, 4:01 PM

Herald added a project: Restricted Project. Jul 18 2023, 4:01 PM

Herald added a subscriber: llvm-commits.

modimo edited the summary of this revision. Jul 18 2023, 4:16 PM

modimo added a reviewer: tejohnson.

This is currently only implemented for ThinLTO, as there is no monoLTO use case for it at Meta. If design-wise we want to keep parity, I'm happy to add the pieces to support this in monoLTO/hybrid as well.

Harbormaster completed remote builds in B246384: Diff 541772. Jul 18 2023, 9:47 PM

Thanks for the patch. I have some initial comments below. From my reading, I guess any native object with a vtable and no RTTI will disable WPD globally, which is unfortunate. Although I do have a suggestion below for making this slightly less pessimistic. I'm curious to give this a try internally on a few codes and see how frequently it ends up disabling WPD.

llvm/lib/LTO/LTO.cpp
1733	Would be better to have a more specific name, since this is only queried with type names. I.e. local symbols are not visible outside the summary but don't have a GlobalResolution entry. But you aren't calling this lambda in that case (but that isn't clear, where the lambda is defined). Before I suggest a name, I have a question about the usage of this lambda down in the WPD code.
llvm/lib/Transforms/IPO/WholeProgramDevirt.cpp
2455	Rather than doing this down here in LTO/WPD could the linker simply unset the HasWholeProgramVisibility config flag? That would also allow WPD to proceed on types with hidden LTO visibility. This early return would prevent any and all WPD which seems overly conservative in the case of hidden LTO visibility classes.
2521	Can we ever get here if RTTI is not enabled? My understanding of the change to line 2413 is that we return early in that case. Given that early return, aren't we guaranteed that the typename symbol has a GlobalResolution if it is non-local? Oh - I guess we are only early returning if RTTI is off in native objects, so you could get here if RTTI is only disabled in bitcode objects? And we need to be conservative for any typenames for vtables defined in bitcode objects with RTTI off? I didn't see a test for this case, can you add one (or did I miss it)?

Thanks for taking a look!

In D155659#4529471, @tejohnson wrote:

Thanks for the patch. I have some initial comments below. From my reading, I guess any native object with a vtable and no RTTI will disable WPD globally, which is unfortunate.

It makes sense from a correctness point of view but yeah it is a stringent requirement. The linker warning does keep this from being silent although how much churn this causes remains to be seen.

llvm/lib/Transforms/IPO/WholeProgramDevirt.cpp
2455	That makes sense although it does tie this flag's functionality to requiring `--lto-whole-program-visibility`. Doing that though means we can instead pass the blocklist to `updateVCallVisibilityInIndex`/`updateVCallVisibilityInModule` similarly to how D91583 does it for dynamically exported symbols which would be cleaner. Thoughts on that approach?
2521	"Oh - I guess we are only early returning if RTTI is off in native objects, so you could get here if RTTI is only disabled in bitcode objects?" Yep! "And we need to be conservative for any typenames for vtables defined in bitcode objects with RTTI off?" This is primarily an implementation detail with how resolutions are only provided for IR symbols. If we instead pass more information from the linker (like resolutions for these summary symbols or the whole list of typenames) we can support bitcode files with RTTI off. There's not a correctness issue at play here since the summary information is a superset of RTTI. "I didn't see a test for this case, can you add one (or did I miss it)?" Good catch, will add a test case.

tejohnson added inline comments. Jul 24 2023, 2:58 PM

llvm/lib/Transforms/IPO/WholeProgramDevirt.cpp
2455	"That makes sense although it does tie this flag's functionality to requiring --lto-whole-program-visibility." What would be the use case of the proposed handling without --lto-whole-program-visibility? Are you saying that there are cases where the normal LTO visibility is incorrect? "Doing that though means we can instead pass the blocklist to updateVCallVisibilityInIndex/updateVCallVisibilityInModule similarly to how D91583 does it for dynamically exported symbols which would be cleaner. Thoughts on that approach?" Yep I think that would be cleaner.

modimo added inline comments. Jul 24 2023, 3:59 PM

llvm/lib/Transforms/IPO/WholeProgramDevirt.cpp
2455	"What would be the use case of the proposed handling without --lto-whole-program-visibility? Are you saying that there are cases where the normal LTO visibility is incorrect?" I don't have a known case so this is more theoretical. Currently there's an assertion that it's on the user to make sure LTO visibility is correct, but in this case and in D91583 we can catch violations and prevent them from causing problems. How much this should also apply to normal LTO visibility is a question but, thinking more about it, that is orthogonal to this change.

modimo added inline comments. Jul 24 2023, 7:04 PM

llvm/lib/Transforms/IPO/WholeProgramDevirt.cpp
2455	Ah right, the issue with doing this in `updateVCallVisibilityInIndex`/`updateVCallVisibilityInModule` is that vcall visibility is keyed off the vtable symbol. However, TypeID and RTTI are both keyed off of the typename symbol. There's not always a translation from typename to vtable since abstract base classes wouldn't have vtables and by the time we get the association we're in the same place the logic is right now. Given that, I think I'll keep the logic as-is.

Add test case for no RTTI in LTO Unit and explicitly hidden LTO types. Change logic to disable --lto-whole-program-visibility on validation failure. Change isVisibleOutsideSummary to typeInfoVisibleOutsideSummary.
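
As a rough illustration of the revised logic (a sketch under assumptions, not the code in lld/ELF/Driver.cpp): when validation is requested and some native vtable has no matching type info, whole-program visibility is switched off before LTO runs. `LinkOptions`, `applyVtableValidation`, and the diagnostic wording are hypothetical names introduced only for this example.

```cpp
#include <string>
#include <vector>

// Hypothetical mirror of the two linker options discussed in this review.
struct LinkOptions {
  bool validateAllVtablesHaveTypeInfos = false; // --lto-validate-all-vtables-have-type-infos
  bool wholeProgramVisibility = false;          // --lto-whole-program-visibility
};

// Disables whole-program visibility when validation fails and returns one
// diagnostic line per offending vtable. The wording is illustrative only.
std::vector<std::string>
applyVtableValidation(LinkOptions &opts,
                      const std::vector<std::string> &vtablesWithoutTypeInfo) {
  std::vector<std::string> diagnostics;
  // Nothing to do if validation is off or every vtable had a type info.
  if (!opts.validateAllVtablesHaveTypeInfos || vtablesWithoutTypeInfo.empty())
    return diagnostics;

  opts.wholeProgramVisibility = false;
  for (const std::string &vtable : vtablesWithoutTypeInfo)
    diagnostics.push_back("vtable without matching type info: " + vtable);
  return diagnostics;
}
```

Unsetting the option, rather than returning early from the devirtualization pass, leaves WPD free to proceed on types with hidden LTO visibility, which is the point raised in the inline comment on line 2455.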

Harbormaster completed remote builds in B247850: Diff 543786. Jul 25 2023, 4:08 AM

In D155659#4529471, @tejohnson wrote:

I'm curious to give this a try internally on a few codes and see how frequently it ends up disabling WPD.

I cranked through a bunch of builds with this change and thankfully while they all do have at least one vtable from an -fno-rtti native object, there are only a handful of unique symbols (which all appear safe), so we could consider using --lto-known-safe-vtables to allowlist them. I did find a couple that seem spurious (see comment inline below about this).

lld/ELF/Driver.cpp
1054	Does this get both defs and refs? I think the latter as I am seeing a case where we are linking a bitcode object that contains both the vtable and typename defs but are disabling --lto-whole-program-visibility, and the reason seems to be a reference to the vtable from a native object.
1114	Prefer message() over warn() because the latter causes builds using -Werror to fail.
llvm/lib/LTO/LTO.cpp
1728–1729	Can you do the same for updateVCallVisibilityInModule to get this fix to apply to regular LTO?
1733	nit: lambda name should be upper camel case. Also, can you add a comment here that this will return true for either the case where name is a local or where it is not defined, and so the expectation is that it will not be queried for local symbols.
llvm/lib/Transforms/IPO/WholeProgramDevirt.cpp
2455	"There's not always a translation from typename to vtable since abstract base classes wouldn't have vtables and by the time we get the association we're in the same place the logic is right now." Can you clarify the case you are concerned about and how this is handled in the lld code that expects a translation from vtable to typename? I tried an example with an abstract base class and do get a vtable and typename.

In D155659#4533387, @tejohnson wrote:

In D155659#4529471, @tejohnson wrote:

I'm curious to give this a try internally on a few codes and see how frequently it ends up disabling WPD.

I cranked through a bunch of builds with this change and thankfully while they all do have at least one vtable from an -fno-rtti native object, there are only a handful of unique symbols (which all appear safe), so we could consider using --lto-known-safe-vtables to allowlist them. I did find a couple that seem spurious (see comment inline below about this).

Nice! That mirrors my experience as well.

lld/ELF/Driver.cpp

1114

Is that the case for lld? Looking through the code, the equivalent functionality is under --fatal-warnings, and on a small example -Werror doesn't affect this flag or cause the build to fail.

llvm/lib/Transforms/IPO/WholeProgramDevirt.cpp

2455

Inputs/devirt_validate_vtable_typeinfos.ll has A as an abstract base class and Native deriving from it. Building down to an object file we get a vtable/typeinfo/typeid symbol for Native but only the typeinfo/typeid symbol for A:

~/llvm-project/lld/test/ELF/lto# ~/llvm-project/build-rel/bin/llc -filetype=obj Inputs/devirt_validate_vtable_typeinfos.ll -o devirt_validate_vtable_typeinfos.o
~/llvm-project/lld/test/ELF/lto# readelf -Ws devirt_validate_vtable_typeinfos.o | grep ZT
     5: 0000000000000000    16 OBJECT  WEAK   DEFAULT    3 _ZTVN10__cxxabiv117__class_type_infoE
     6: 0000000000000010    16 OBJECT  WEAK   DEFAULT    3 _ZTVN10__cxxabiv120__si_class_type_infoE
     7: 0000000000000020    32 OBJECT  WEAK   DEFAULT    3 _ZTV6Native
     8: 0000000000000050    24 OBJECT  WEAK   DEFAULT    3 _ZTI6Native
     9: 0000000000000040     8 OBJECT  WEAK   DEFAULT    3 _ZTS6Native
    10: 0000000000000070    16 OBJECT  WEAK   DEFAULT    3 _ZTI1A
    11: 0000000000000068     3 OBJECT  WEAK   DEFAULT    3 _ZTS1A

In LLD we're ensuring there's a map from every vtable symbol to its type information but we expect additional type information without vtables for these cases.

modimo added inline comments. Jul 25 2023, 5:32 PM

llvm/lib/Transforms/IPO/WholeProgramDevirt.cpp
2455	That being said, the information to map typeid->[associated vtables] is `typeIdCompatibleVtableMap` for thinLTO and in full LTO we have the actual vtable global variables which contains `!type` metadata that maps back to typeid. For thinLTO, we can save a scan of `typeIdCompatibleVtableMap` for `updateVCallVisibilityInIndex` by currently combining it with the scan in `DevirtIndex::run` however that's probably not a big deal. For full LTO the information is better passed through `updateVCallVisibilityInModule` since we don't build the corresponding `TypeIDMap` until codegen and carrying this information around until then is too much. I think I've come back around to doing it in `updateVCallVisibilityInIndex`/`updateVCallVisibilityInModule`. A little less efficient for thinLTO but keeps consistency with full LTO.
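
The typeid-to-vtables translation described in this comment can be sketched as below. This is a simplified illustration, not the patch's implementation: `TypeIdToVtables` is a hypothetical stand-in for the summary's `typeIdCompatibleVtableMap`, and `vtablesReferencedOutsideWPD` is an invented name. Given the typenames seen in native objects, it collects every compatible vtable whose vcall visibility should not be upgraded.

```cpp
#include <map>
#include <set>
#include <string>
#include <vector>

// Hypothetical stand-in for the typeid -> compatible vtables map kept in
// the ThinLTO summary. Keys are typename symbols, values are vtable symbols.
using TypeIdToVtables = std::map<std::string, std::vector<std::string>>;

// Returns the vtables compatible with any typename that is visible to
// native (regular) objects. These are the vtables to leave un-upgraded.
std::set<std::string>
vtablesReferencedOutsideWPD(const TypeIdToVtables &typeIdMap,
                            const std::set<std::string> &nativeTypeNames) {
  std::set<std::string> blocked;
  for (const auto &entry : typeIdMap) {
    // Skip type ids that no native object refers to.
    if (!nativeTypeNames.count(entry.first))
      continue;
    blocked.insert(entry.second.begin(), entry.second.end());
  }
  return blocked;
}
```

Because the result is keyed by vtable symbol, it lines up with how vcall visibility is tracked, which is what makes the `updateVCallVisibilityInIndex`/`updateVCallVisibilityInModule` placement workable.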

modimo added inline comments. Jul 25 2023, 11:17 PM

lld/ELF/Driver.cpp
1054	"Does this get both defs and refs?" This goes through the entirety of .symtab so both defs and refs. "I think the latter as I am seeing a case where we are linking a bitcode object that contains both the vtable and typename defs but are disabling --lto-whole-program-visibility, and the reason seems to be a reference to the vtable from a native object." Mocking up a forward class declaration in the native object where the definition is in bitcode I see --lto-whole-program-visibility get disabled. Only looking at vtable defs makes sense, I'll make the change and add a test.

tejohnson added inline comments. Jul 26 2023, 9:06 AM

lld/ELF/Driver.cpp
1114	Sorry, you are correct. We use --fatal-warnings for this on linker actions (and -Werror on compile actions). In general, I think this should be an informational message, not a warning, since it is being handled automatically.
llvm/lib/Transforms/IPO/WholeProgramDevirt.cpp
2455	Ok, thanks. I think I have lost track of what the change will be - is it to replace passing down a global flag AllVtablesHaveTypeInfos, or is it to replace what is being done below in DevirtIndex::run()? For the former alone it doesn't seem worth it, but it would be nice to move the handling from DevirtIndex::run() into the vcall_visibility updates.

modimo added inline comments. Jul 26 2023, 10:58 AM

lld/ELF/Driver.cpp
1114	The case I want to catch is when an existing project changes its native dependencies and disables the optimization which would better fit a warning. In the general case agreed that this isn't a warning. I don't have much experience in linker protocol here, @MaskRay thoughts?
llvm/lib/Transforms/IPO/WholeProgramDevirt.cpp
2455	Ah sorry yeah there's 2 different things. For passing down `AllVtablesHaveTypeInfos` I like the approach of unsetting --lto-whole-program-visibility instead. For moving the safety logic out of DevirtIndex::run() making this work for full LTO wants the logic to be in vcall_visibility and it makes sense to be consistent even if slightly less efficient for thinLTO.

Move logic to updateVCallVisibility*, apply change to full and split LTO, check only defs in linker.

modimo marked 4 inline comments as done. Jul 27 2023, 2:09 PM

Remove unintentional change to WholeProgramDevirt.cpp

Harbormaster completed remote builds in B248687: Diff 544926. Jul 27 2023, 4:30 PM

tejohnson added inline comments. Jul 29 2023, 7:51 AM

lld/ELF/Driver.cpp
1114	The problem is if it is a warning, then we have to go in and manually change options or builds will fail. Can you intercept message output, which should always be emitted (i.e. don't need verbose options).
lld/test/ELF/lto/devirt_validate_vtable_typeinfos_no_rtti.ll
9	The earlier version didn't have this second input file - why is it needed now for this test?
llvm/include/llvm/LTO/Config.h
85	This would read better like "If all native vtables have corresponding type infos, allow usage..."
llvm/include/llvm/LTO/LTO.h
367 ↗	(On Diff #544926)	What's the downside in practice with using VisibleOutsideSummary?
llvm/lib/Transforms/IPO/WholeProgramDevirt.cpp
799	Just early return true here, and return false below the loop.
898	Suggest naming this VisibleToRegularObjVTables.
915	Hmm, now that I think about it, shouldn't local symbols have gotten VCallVisibilityTranslationUnit from clang? Same question in the regularLTO handling.

In D155659#4533387, @tejohnson wrote:

In D155659#4529471, @tejohnson wrote:

I'm curious to give this a try internally on a few codes and see how frequently it ends up disabling WPD.

I cranked through a bunch of builds with this change and thankfully while they all do have at least one vtable from an -fno-rtti native object, there are only a handful of unique symbols (which all appear safe), so we could consider using --lto-known-safe-vtables to allowlist them. I did find a couple that seem spurious (see comment inline below about this).

For clarification, were the builds only to validate the linker check? If so, are there plans to try out the E2E solution?

lld/ELF/Driver.cpp
1114	Sure, changed to message.
lld/test/ELF/lto/devirt_validate_vtable_typeinfos_no_rtti.ll
9	Good catch, I re-used the index/hybrid/full commands from `devirt_validate_vtable_typeinfos.ll` and that came along for the ride, removed.
llvm/include/llvm/LTO/LTO.h
367 ↗	(On Diff #544926)	It doesn't extend to Full LTO. However, the same functionality for full LTO is captured with `GlobalResolutions[name].Partition == GlobalResolution::External` so this doesn't need to be broken out separately.
llvm/lib/Transforms/IPO/WholeProgramDevirt.cpp
898	Another area that's unknown for thinLTO are the types used in full LTO where `VisibleOutsideSummary` is set to false and vice-versa with full LTO where types used in thinLTO get `GlobalResolution::External` partition. I'm thinking then to categorize vtables we want to not upgrade as something like `RefOutsideWPD` instead of `VisibleToRegularObj`. WDYT?
915	Good point. Originally excluding based on type name had to explicitly take into account local vcall_visibility. Now, it's a test setup where `VCallVisibilityTranslationUnit` was only used for one of the local types. I'll remove this check and the type that doesn't have the proper vcall_visibility in the tests.

Review Feedback

Harbormaster completed remote builds in B249334: Diff 545835. Jul 31 2023, 4:59 PM

Exclude .virtual typeIDs

Harbormaster completed remote builds in B249358: Diff 545868. Jul 31 2023, 6:12 PM

In D155659#4548653, @modimo wrote:

In D155659#4533387, @tejohnson wrote:

In D155659#4529471, @tejohnson wrote:

I'm curious to give this a try internally on a few codes and see how frequently it ends up disabling WPD.

I cranked through a bunch of builds with this change and thankfully while they all do have at least one vtable from an -fno-rtti native object, there are only a handful of unique symbols (which all appear safe), so we could consider using --lto-known-safe-vtables to allowlist them. I did find a couple that seem spurious (see comment inline below about this).

For clarification, were the builds only to validate the linker check? If so, are there plans to try out the E2E solution?

I was just looking for the linker check messages, I didn't enable WPD. With this solution in place we can hopefully try it out internally, but it might not be immediate. However, I'm keen to have this solution available so we can move forward on WPD!

llvm/lib/Transforms/IPO/WholeProgramDevirt.cpp
898	RegularLTO summaries are added to the combined index used by ThinLTO, but it looks like the vtable summaries aren't currently created for them. I think you are right in that there is a potential hole here for ThinLTO WPD if linked with a regular LTO object containing an override. Can you test this case to confirm? If that is an issue, then I guess we do need another GlobalRes field. Maybe VisibleOutsideLTOUnit or something like that?

In D155659#4555162, @tejohnson wrote:

In D155659#4548653, @modimo wrote:

In D155659#4533387, @tejohnson wrote:

In D155659#4529471, @tejohnson wrote:

I'm curious to give this a try internally on a few codes and see how frequently it ends up disabling WPD.

I cranked through a bunch of builds with this change and thankfully while they all do have at least one vtable from an -fno-rtti native object, there are only a handful of unique symbols (which all appear safe), so we could consider using --lto-known-safe-vtables to allowlist them. I did find a couple that seem spurious (see comment inline below about this).

For clarification, were the builds only to validate the linker check? If so, are there plans to try out the E2E solution?

I was just looking for the linker check messages, I didn't enable WPD. With this solution in place we can hopefully try it out internally, but it might not be immediate. However, I'm keen to have this solution available so we can move forward on WPD!

Thanks for the clarification! Definitely very interested in your results internally when this finalizes.

llvm/lib/Transforms/IPO/WholeProgramDevirt.cpp
898	Added `devirt_validate_vtable_typeinfos_mixed_lto.ll` to test mixing LTO modes: RegularLTO without summary indeed does not export vtable summaries. With validation, because the type is present in a ThinLTO module the partition is set to `GlobalResolution::External` this type does not get its visibility upgraded in RegularLTO and with `VisibleOutsideSummary` also set to true this type does not get its visibility upgraded in ThinLTO either. RegularLTO with summary we do get all the vtable summaries in the combined index. With validation, `GlobalResolution::External` blocks RegularLTO visibility upgrade but since `VisibleOutsideSummary` is not set everything is optimized in the combined index. For the purposes of validation I think this is the behavior we want. It does seem like we may want to fix this hole even without validation enabled although that seems more of a separate change since it alters baseline behavior.

modimo updated this revision to Diff 547631. Aug 6 2023, 8:04 PM

Add devirt_validate_vtable_typeinfos_mixed_lto.ll, minor test changes

Harbormaster completed remote builds in B250666: Diff 547631. Aug 6 2023, 10:31 PM

tejohnson added inline comments. Aug 10 2023, 1:08 PM

lld/test/ELF/lto/devirt_validate_vtable_typeinfos_mixed_lto.ll
118	Both this and the CHECK-SUMMARY-IR case below are incorrect devirtualizations, right? Is this another case that we are not doing correctly without the validation options in this patch?
124	I think we only get the vtable summary from the regular LTO object because it doesn't have the EnableSplitLTOUnit module flag set in the IR here. Normally, this is added by clang when building -flto. And this currently prevents vtable summaries being added to the LTO summary (https://github.com/llvm/llvm-project/blob/8a15bdb5e637f81041591d97bea0267b5f053f16/llvm/lib/Analysis/ModuleSummaryAnalysis.cpp#L734-L736). When I added that guard, it was because I didn't think we needed these summaries when splitting was enabled, as I was thinking of either the everything-is-regular LTO case or the -fsplit-lto-unit case that you get by default with -flto=thin -fwhole-program-vtables, where all the vtables are placed in the regular LTO split modules. It's possible that we could remove that guard, but with it I think this case would do the wrong thing if the regular IR was built from clang with -flto.

Headed out on PTO and will be back on the 24th, will pick this back up then!

lld/test/ELF/lto/devirt_validate_vtable_typeinfos_mixed_lto.ll
124	I see, so the `BASE` scenario being tested here is already guarded by EnableSplitLTOUnit since ThinLTO would have EnableSplitLTOUnit=0 and RegularLTO would have EnableSplitLTOUnit=1. Is this the scenario described in the previous comment? "RegularLTO summaries are added to the combined index used by ThinLTO, but it looks like the vtable summaries aren't currently created for them. I think you are right in that there is a potential hole here for ThinLTO WPD if linked with a regular LTO object containing an override. Can you test this case to confirm? If that is an issue, then I guess we do need another GlobalRes field. Maybe VisibleOutsideLTOUnit or something like that?" The mixed case then would be RegularLTO combined with ThinLTO + -split-lto-unit where neither generate `typeidCompatibleVTable` and all the analysis is done on the combined RegularLTO module.

Looking at https://github.com/llvm/llvm-project/blob/9b6b6bb/clang/lib/CodeGen/BackendUtil.cpp#L170-L173, RegularLTO will have a summary attached through Clang by default except for ld64 targets. The mixed case then reduces to RegularLTO+Summary and ThinLTO+Split since EnableSplitLTOUnit must be consistent so everything is done by DevirtModule::run and there's no mixture with DevirtIndex::run. The mechanism for IsVisibleToRegularObj for RegularLTO also ends up being identical to that of ThinLTO since all symbols will be present in the combined summary.

For ld64 targets there is a potential hole mixing RegularLTO+NoSummary with ThinLTO which would be caught with the current validation scheme. However, the validation functionality is only for the lld ELF target so the ld64 targets are unchanged.

Change mixed scenario to RegularLTO+Summary and ThinLTO+Split. Modify other tests to have RegularLTO+Summary.

modimo added inline comments. Aug 25 2023, 1:53 PM

lld/test/ELF/lto/devirt_validate_vtable_typeinfos.ll
7	Appending module flags so RegularLTO correctly generates its summary without `typeidCompatibleVTable` means the test can be re-used. However I think duplicating the tests is reasonable as well and could be cleaner, WDYT?

Set EnableSplitLTOUnit=1 for RegularLTO tests as well

Harbormaster completed remote builds in B254987: Diff 553610. Aug 25 2023, 6:19 PM

lgtm with a couple comments/suggestions below. Thanks!

In D155659#4618301, @modimo wrote:

Looking at https://github.com/llvm/llvm-project/blob/9b6b6bb/clang/lib/CodeGen/BackendUtil.cpp#L170-L173, RegularLTO will have a summary attached through Clang by default except for ld64 targets. The mixed case then reduces to RegularLTO+Summary and ThinLTO+Split since EnableSplitLTOUnit must be consistent so everything is done by DevirtModule::run and there's no mixture with DevirtIndex::run. The mechanism for IsVisibleToRegularObj for RegularLTO also ends up being identical to that of ThinLTO since all symbols will be present in the combined summary.

Ok, yes - I think the scenario I was worried about is caught by the existing verification that EnableSplitLTOUnit is set consistently.

lld/test/ELF/lto/devirt_validate_vtable_typeinfos.ll
7	Do we need these module flags for correct operation of this test (ditto for the similar no_rtti one later)? If not, then probably don't bother adding in these tests (I think these may only be needed in practice for the hybrid testing). If they are now needed for correct operation of the regular LTO testing, then I am ok with the approach here as I think it is probably better to reduce duplication of nearly identical IR tests (and I see this approach used in other tests too).
llvm/lib/Transforms/IPO/WholeProgramDevirt.cpp
893	nit: the braces can be removed from the for loop

This revision is now accepted and ready to land. Aug 29 2023, 9:26 AM

Thanks for the patch. Will try to read through this :)

Fix nit: braces

Thanks for the review!

lld/test/ELF/lto/devirt_validate_vtable_typeinfos.ll
7	"Do we need these module flags for correct operation of this test (ditto for the similar no_rtti one later)?" Yeah, triggering summary generation on the RegularLTO pipeline requires these module flags. "If they are now needed for correct operation of the regular LTO testing, then I am ok with the approach here as I think it is probably better to reduce duplication of nearly identical IR tests (and I see this approach used in other tests too)." Sounds good, I'll leave it as is.

Will study...

lld/ELF/Driver.cpp
1114	This long list here makes me nervous. Will try to learn it.
lld/ELF/Options.td
607	New options use `EEq` to disallow single-dash long options, to not conflict with `-l`.
lld/test/ELF/lto/Inputs/devirt_validate_vtable_typeinfos.ll
5	`grtev4` => `unknown`
llvm/tools/opt/opt.cpp
573	The prevailing and recommended style liked by clang-format and clang-tidy is `/*WholeProgramVisibilityEnabledInLTO=*/false`

Review Feedback

modimo marked 3 inline comments as done. Aug 31 2023, 11:30 AM

modimo added inline comments. Aug 31 2023, 3:41 PM

lld/ELF/Driver.cpp
1114	Taking a look again I don't think these names need special exclusion. They're defined in libstdc++/libc++abi and the release packages in my scenarios have RTTI enabled meaning their type info symbols are present during linking. Testing E2E linking on some of our large services with these removed succeeds. If RTTI is disabled on these libraries considerably more symbols would not have matching type infos and --lto-whole-program-visibility should be disabled. Looking back I think these exclusions came about when I was only examining the .symtab/.dynsym of individual object/shared files which didn't take into account that these symbols would be resolved by libstdc++/libc++abi.

Remove explicit knownSafeVtableNames entries

modimo edited the summary of this revision. Sep 5 2023, 3:42 PM

Gentle ping @MaskRay

In D155659#4640159, @modimo wrote:

Gentle ping @MaskRay

Sorry for the delay. (There were Phabricator issues to handle beside work...)

I will need to re-read https://discourse.llvm.org/t/rfc-safer-whole-program-class-hierarchy-analysis/65144 and an internal discussion in May 2022 when I was thinking about _ZTI.

-frtti is discouraged by https://google.github.io/styleguide/cppguide.html#Run-Time_Type_Information__RTTI_ , so I think it may not benefit us... but this feature is still useful. I need to read these discussions...

lld/ELF/Driver.cpp
1046	omit braces for single-line single-statement body
1076	drop the two nested braces
1080
1083	If not starting with `_ZTV`, consider reporting an error?

I wonder why the following example is still incorrect. I haven't carefully studied the LTO part of code. It seems that you do handle isUsedInRegularObj.

cat > a.h <<'eof'
struct A { virtual int foo(); };
int bar(A *a);
eof
cat > a.cc <<'eof'
#include "a.h"
int A::foo() { return 1; }
int bar(A *a) { return a->foo(); }
eof
cat > b.cc <<'eof'
#include "a.h"
struct B : A { int foo() { return 2; } };
int baz() { B b; return bar(&b); }
eof
cat > main.cc <<'eof'
#include "a.h"
#include <stdio.h>
extern int baz();
int main() {
  A a;
  printf("%d %d\n", bar(&a), baz());
}
eof

clang++ -c -flto=thin -fwhole-program-vtables -O main.cc a.cc b.cc
clang++ -c -O b.cc -o b0.o

% clang++ -flto=thin -Wl,--lto-whole-program-visibility -fuse-ld=lld main.o a.o b.o && ./a.out
1 2
% clang++ -Wl,--lto-validate-all-vtables-have-type-infos -flto=thin -Wl,--lto-whole-program-visibility -fuse-ld=lld main.o a.o b0.o && ./a.out
1 1

lld/ELF/Config.h
251	Move `ltoAllVtablesHaveTypeInfos` (not an option) to `Ctx`
lld/ELF/Driver.cpp
1055	`getGlobalELFSyms` to skip local symbols
1087	and delete the assignment below
2879
lld/test/ELF/lto/devirt_validate_vtable_typeinfos.ll
14	You can remove the relocation-model=static object file as there is no testable difference. Then, consider renaming `%t2_pic.o` to `%t2.o`
85	; VALIDATE-NOT: single-impl:
	; VALIDATE: single-impl: devirtualized a call to _ZN1D1mEi
	; VALIDATE-NOT: single-impl:
164	Consider pasting the source code as well for readability and upgradability? I haven't carefully studied the tests yet...
llvm/lib/Transforms/IPO/WholeProgramDevirt.cpp
795	typo: `will will` Add a period.
888	typo: `will will` append a period.
893	delete braces in this nested case when the only body has just one line.

MaskRay requested changes to this revision. Sep 6 2023, 9:24 PM

MaskRay added inline comments.

lld/ELF/Driver.cpp
1089	the order is not guaranteed to be deterministic. Consider a SmallSetVector with inline size=0. `auto &` => `StringRef`
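
To illustrate the determinism concern above: iterating a hash-based set visits elements in an order that depends on hashing, so diagnostics printed from such a loop can change between runs or builds. An insertion-ordered set avoids this. The following is a minimal sketch using only standard containers; `OrderedStringSet` is a hypothetical stand-in for the idea behind `llvm::SetVector`, not LLVM's implementation.

```cpp
#include <string>
#include <unordered_set>
#include <vector>

// Hypothetical, simplified insertion-ordered set. It pairs a hash set (for
// fast duplicate detection) with a vector (for a stable iteration order).
class OrderedStringSet {
public:
  // Returns true if the value was newly inserted, false if already present.
  bool insert(const std::string &value) {
    if (!seen.insert(value).second)
      return false;
    order.push_back(value);
    return true;
  }

  // Iteration follows insertion order and does not depend on hashing.
  const std::vector<std::string> &items() const { return order; }

private:
  std::unordered_set<std::string> seen;
  std::vector<std::string> order;
};
```

With this shape, a loop that reports each vtable lacking RTTI prints the names in the order the symbols were first encountered, which is stable for a given set of inputs.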

This revision now requires changes to proceed. Sep 6 2023, 9:24 PM

In D155659#4640392, @MaskRay wrote:

In D155659#4640159, @modimo wrote:

Gentle ping @MaskRay

Sorry for the delay. (There were Phabricator issues to handle beside work...)

I will need to re-read https://discourse.llvm.org/t/rfc-safer-whole-program-class-hierarchy-analysis/65144 and an internal discussion in May 2022 when I was thinking about _ZTI.

-frtti is discouraged by https://google.github.io/styleguide/cppguide.html#Run-Time_Type_Information__RTTI_ , so I think it may not benefit us... but this feature is still useful. I need to read these discussions...

I looked at this more closely after this patch was mailed. While use of RTTI is discouraged, building with -fno-rtti isn't specifically encouraged afaict, and when I built a large number of important binaries internally with this patch in validation mode, it turned out that there were only a few objects/symbols affected, and we could allowlist them to enable WPD. See my earlier comment from July 25:

I cranked through a bunch of builds with this change and thankfully while they all do have at least one vtable from an -fno-rtti native object, there are only a handful of unique symbols (which all appear safe), so we could consider using --lto-known-safe-vtables to allowlist them.

I wonder why the following example is still incorrect.

Hmm, that seems like exactly the case that should be caught and handled automatically by this patch. Oh, I just compiled the same source code to a native object and nm shows a reference to the typeinfo for A, but no type name for A (_ZTS1A):

$ nm b.o
                 U _Z3barP1A
0000000000000000 T _Z3bazv
0000000000000000 W _ZN1B3fooEv
                 U _ZTI1A
0000000000000000 V _ZTI1B
0000000000000000 V _ZTS1B
0000000000000000 V _ZTV1B
                 U _ZTVN10__cxxabiv120__si_class_type_infoE

The _ZTS symbol is the one referenced by the type metadata, and what this patch is going to look for being referenced in native objects. Not familiar with the rules around when that symbol ends up being used, but if it isn't consistently referenced from native objects then this solution isn't going to work as well as hoped. Should it be looking for either the type name or the corresponding type info?

Good catch with the example! Looks like this is an interaction with class A having a key function (https://lld.llvm.org/missingkeyfunction.html) defined in a.cc, so b.cc doesn't generate a vtable symbol for class A and RTTI only emits a reference to _ZTI1A. The Itanium C++ ABI mandates the type name as a field of every type info (https://itanium-cxx-abi.github.io/cxx-abi/abi.html#rtti), but because b.o only carries a reference to _ZTI1A rather than its definition, _ZTS1A doesn't come along for the ride. The vtable for class A being defined only inside the LTO unit then means there's no native reference to key off of.

I think this means native symbol lookup should be keyed off the type info symbol corresponding to the type name we have in metadata. The layout of RTTI guarantees we'll have the base type info symbol(s) but as seen here not necessarily the type name symbol. WDYT?
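
The proposal above can be sketched as follows. This is only an illustration under stated assumptions: `typeNameToTypeInfo` and `visibleToNativeObjects` are hypothetical names, and the native symbol set is modeled as a plain `std::set` rather than lld's symbol table. The idea is to take the type name from metadata (`_ZTS...`), derive the matching type info symbol (`_ZTI...`), and check whether any native object mentions it.

```cpp
#include <set>
#include <string>
#include <string_view>

// Hypothetical helper: turn an Itanium type name symbol such as "_ZTS1A"
// into the matching type info symbol "_ZTI1A". Returns an empty string if
// the input does not start with "_ZTS".
std::string typeNameToTypeInfo(std::string_view typeName) {
  constexpr std::string_view prefix = "_ZTS";
  if (typeName.substr(0, prefix.size()) != prefix)
    return {};
  return "_ZTI" + std::string(typeName.substr(prefix.size()));
}

// Hypothetical check: treat a type as visible to native code if any native
// object defines or references its type info symbol.
bool visibleToNativeObjects(std::string_view typeName,
                            const std::set<std::string> &nativeSymbols) {
  std::string typeInfo = typeNameToTypeInfo(typeName);
  return !typeInfo.empty() && nativeSymbols.count(typeInfo) != 0;
}
```

Applied to the `nm b.o` listing above, `_ZTS1A` is absent from the native object but `_ZTI1A` is referenced, so keying off the type info symbol would flag A as visible to native code.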

Review Feedback

In D155659#4641354, @modimo wrote:

Good catch with the example! Looks like this is an interaction with class A having a key function (https://lld.llvm.org/missingkeyfunction.html) defined in a.cc, so b.cc doesn't generate a vtable symbol for class A and RTTI only emits a reference to _ZTI1A. The Itanium C++ ABI mandates the type name as a field of every type info (https://itanium-cxx-abi.github.io/cxx-abi/abi.html#rtti), but because b.o only carries a reference to _ZTI1A rather than its definition, _ZTS1A doesn't come along for the ride. The vtable for class A being defined only inside the LTO unit then means there's no native reference to key off of.

I think this means native symbol lookup should be keyed off the type info symbol corresponding to the type name we have in metadata. The layout of RTTI guarantees we'll have the base type info symbol(s) but as seen here not necessarily the type name symbol. WDYT?

This seems ok to me - there is already code in the lld part of this patch that maps _ZTV to _ZTI, so mapping _ZTS to _ZTI is not significantly different. Can you also add this as a test case (suggest including the original c++ code in a comment).

modimo marked an inline comment as not done. Sep 8 2023, 6:15 PM

modimo added inline comments.

lld/test/ELF/lto/devirt_validate_vtable_typeinfos.ll

164

I pulled the base IR from lld/test/ELF/lto/devirt_vcall_vis_export_dynamic.ll and modified it so no direct source.

The test case is effectively:

struct A {
  virtual int f(int) = 0;
  virtual int n(int) { return 0; }
};

struct B : A {
  int f(int) override { return 0; }
};

struct C : A {
  int f(int) override { return 0; }
};

namespace {
  struct D {
    virtual int m(int) { return 0; }
  };
}

int _start(A *obj, D *obj2, int a) {
  int call = obj->n(a);       // single implementation in A, devirtualize unless a native type derives from A
  int call2 = obj->f(call);   // multiple implementations in B and C, never devirtualize
  int call3 = obj2->m(call2); // local type, always devirtualize
  return call3;
}

Key off of type info (_ZTI) symbols, add test case

Move symbol check to common function

Gentle ping @MaskRay

Looks great with some nits! Checking _ZTS in TypeIDVisibleToRegularObj (switch to typeIDVisibleToRegularObj) looks good to me. Sorry for the delay.
There are a number of resolved comments you may want to mark as done.

lld/ELF/Driver.cpp
1042	The conventional style in lld/ omits `llvm::` for DenseSet.
1051
1073	This still relies on the iteration order of `DenseSet vtableSymbols`. We need SetVector for `vtableSymbols` as well. `auto &s` => `StringRef s`
2879	Not done.
lld/ELF/Options.td
607	`When --lto-validate-all-vtables-have-type-infos is enabled, skip validation on these vtables (_ZTV symbols)`
llvm/lib/LTO/LTO.cpp
1282	redundant hash table lookup here. Better to use `find(name)` with slightly more code
llvm/lib/Transforms/IPO/WholeProgramDevirt.cpp
785	Perhaps we should fix these functions to follow https://llvm.org/docs/CodingStandards.html#use-namespace-qualifiers-to-implement-previously-declared-functions . Pushed b4d4146db3b9a29773259c8b8a6cb7c98da90e73 and you'll need a rebase.
796	Use `consume_front` here to avoid `consume_front` below.
805	to avoid constructing a possibly heap-allocated std::string twice.
809	`skipUpdateDueToValidation`

This revision is now accepted and ready to land. Sep 17 2023, 7:34 PM

MaskRay added inline comments. Sep 17 2023, 7:34 PM

lld/ELF/Config.h
481	We need `ltoAllVtablesHaveTypeInfos = false` in `reset`

In D155659#4647141, @MaskRay wrote:

Looks great with some nits! Checking _ZTS in TypeIDVisibleToRegularObj (switch to typeIDVisibleToRegularObj) looks good to me. Sorry for the delay.
There are a number of resolved comments you may want to mark as done.

Appreciate the thorough review! Good callout on the resolved comments, keeping those up to date keeps the context clearer for everyone involved--will keep them updated in the future.

llvm/lib/LTO/LTO.cpp
1282	Good call, other places here also use the find pattern.
llvm/lib/Transforms/IPO/WholeProgramDevirt.cpp
785	Sounds good, I'll follow up with the correct style on the rebase
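
The "find pattern" mentioned for llvm/lib/LTO/LTO.cpp line 1282 can be illustrated with a standard container. This is a generic sketch, not the code from the patch: `lookupTwice` and `lookupOnce` are hypothetical names, and `std::unordered_map` stands in for whichever LLVM map the real code uses.

```cpp
#include <string>
#include <unordered_map>

// Two hash lookups: count() hashes the key once, at() hashes it again.
int lookupTwice(const std::unordered_map<std::string, int> &m,
                const std::string &name) {
  if (m.count(name))
    return m.at(name);
  return -1;
}

// One hash lookup: find() returns an iterator that is reused for the value.
int lookupOnce(const std::unordered_map<std::string, int> &m,
               const std::string &name) {
  auto it = m.find(name);
  return it != m.end() ? it->second : -1;
}
```

Both functions return the same results; the second is slightly longer but avoids hashing the key twice.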

Review Feedback

Rebase

modimo edited the summary of this revision. Sep 18 2023, 3:51 PM

This revision was landed with ongoing or failed builds. Sep 18 2023, 3:52 PM

Closed by commit rG272bd6f9cc86: [WPD][LLD] Add option to validate RTTI is enabled on all native types and… (authored by modimo).

This revision was automatically updated to reflect the committed changes.

modimo added a commit: rG272bd6f9cc86: [WPD][LLD] Add option to validate RTTI is enabled on all native types and….

Harbormaster completed remote builds in B257363: Diff 556978. Sep 18 2023, 5:20 PM

Revision Contents

Path	Size
lld/ELF/Config.h	4 lines
lld/ELF/Driver.cpp	65 lines
lld/ELF/LTO.cpp	3 lines
lld/ELF/Options.td	5 lines
lld/test/ELF/lto/Inputs/devirt_validate_vtable_typeinfos.ll	26 lines
lld/test/ELF/lto/Inputs/devirt_validate_vtable_typeinfos_no_rtti.ll	19 lines
lld/test/ELF/lto/Inputs/devirt_validate_vtable_typeinfos_ref.ll	68 lines
lld/test/ELF/lto/Inputs/devirt_validate_vtable_typeinfos_undef.ll	16 lines
lld/test/ELF/lto/devirt_validate_vtable_typeinfos.ll	263 lines
lld/test/ELF/lto/devirt_validate_vtable_typeinfos_mixed_lto.ll	183 lines
lld/test/ELF/lto/devirt_validate_vtable_typeinfos_no_rtti.ll	136 lines
lld/test/ELF/lto/devirt_validate_vtable_typeinfos_ref.ll	130 lines
llvm/include/llvm/LTO/Config.h	6 lines
llvm/include/llvm/Transforms/IPO/WholeProgramDevirt.h	12 lines
llvm/lib/LTO/LTO.cpp	55 lines
llvm/lib/LTO/LTOCodeGenerator.cpp	13 lines
llvm/lib/LTO/ThinLTOCodeGenerator.cpp	9 lines
llvm/lib/Transforms/IPO/WholeProgramDevirt.cpp	76 lines
llvm/tools/opt/opt.cpp	11 lines

Diff 556979

lld/ELF/Config.h

[241 lines hidden] struct Config {
   bool hasDynSymTab;
   bool ignoreDataAddressEquality;
   bool ignoreFunctionAddressEquality;
   bool ltoCSProfileGenerate;
   bool ltoPGOWarnMismatch;
   bool ltoDebugPassManager;
   bool ltoEmitAsm;
   bool ltoUniqueBasicBlockSectionNames;
+  bool ltoValidateAllVtablesHaveTypeInfos;
   bool ltoWholeProgramVisibility;

MaskRay: Move `ltoAllVtablesHaveTypeInfos` (not an option) to `Ctx`

   bool mergeArmExidx;
   bool mipsN32Abi = false;
   bool mmapOutputFile;
   bool nmagic;
   bool noDynamicLinker = false;
   bool noinhibitExec;
   bool nostdlib;
   bool oFormatBinary;
[211 lines hidden] llvm::DenseMap<const Symbol *,
       backwardReferences;
   llvm::SmallSet<llvm::StringRef, 0> auxiliaryFiles;
   // True if SHT_LLVM_SYMPART is used.
   std::atomic<bool> hasSympart{false};
   // True if there are TLS IE relocations. Set DF_STATIC_TLS if -shared.
   std::atomic<bool> hasTlsIe{false};
   // True if we need to reserve two .got entries for local-dynamic TLS model.
   std::atomic<bool> needsTlsLd{false};
+  // True if all native vtable symbols have corresponding type info symbols
+  // during LTO.
+  bool ltoAllVtablesHaveTypeInfos;

MaskRay: We need `ltoAllVtablesHaveTypeInfos = false` in `reset`

   // Each symbol assignment and DEFINED(sym) reference is assigned an increasing
   // order. Each DEFINED(sym) evaluation checks whether the reference happens
   // before a possible `sym = expr;`.
   unsigned scriptSymOrderCounter = 1;
   llvm::DenseMap<const Symbol *, unsigned> scriptSymOrder;

   void reset();
[22 lines hidden]

lld/ELF/Driver.cpp

[104 lines hidden] void Ctx::reset() {
   whyExtractRecords.clear();
   backwardReferences.clear();
   auxiliaryFiles.clear();
   hasSympart.store(false, std::memory_order_relaxed);
   hasTlsIe.store(false, std::memory_order_relaxed);
   needsTlsLd.store(false, std::memory_order_relaxed);
   scriptSymOrderCounter = 1;
   scriptSymOrder.clear();
+  ltoAllVtablesHaveTypeInfos = false;
 }

 llvm::raw_fd_ostream Ctx::openAuxiliaryFile(llvm::StringRef filename,
                                             std::error_code &ec) {
   using namespace llvm::sys::fs;
   OpenFlags flags =
       auxiliaryFiles.insert(filename).second ? OF_None : OF_Append;
   return {filename, ec, flags};
[910 lines hidden] for (uint32_t i = 0, size = cgProfile.size(); i < size; ++i) {
     auto *from = dyn_cast_or_null<InputSectionBase>(fromSym->section);
     auto *to = dyn_cast_or_null<InputSectionBase>(toSym->section);
     if (from && to)
       config->callGraphProfile[{from, to}] += cgpe.cgp_weight;
   }
 }

+template <class ELFT>
+static void ltoValidateAllVtablesHaveTypeInfos(opt::InputArgList &args) {
+  DenseSet<StringRef> typeInfoSymbols;

MaskRay: The conventional style in lld/ omits `llvm::` for DenseSet.

+  SmallSetVector<StringRef, 0> vtableSymbols;
+  auto processVtableAndTypeInfoSymbols = [&](StringRef name) {
+    if (name.consume_front("_ZTI"))
+      typeInfoSymbols.insert(name);

MaskRay: omit braces for single-line single-statement body

+    else if (name.consume_front("_ZTV"))
+      vtableSymbols.insert(name);
+  };
+
+  // Examine all native symbol tables.

MaskRay (suggested edit):
       vtableSymbols.insert(name);
   };
-  // Examine all native symbol tables
+  // Examine all native symbol tables.
   for (ELFFileBase *f : ctx.objectFiles) {

+  for (ELFFileBase *f : ctx.objectFiles) {
+    using Elf_Sym = typename ELFT::Sym;
+    for (const Elf_Sym &s : f->template getGlobalELFSyms<ELFT>()) {

tejohnson: Does this get both defs and refs? I think the latter as I am seeing a case where we are linking a bitcode object that contains both the vtable and typename defs but are disabling --lto-whole-program-visibility, and the reason seems to be a reference to the vtable from a native object.

modimo (author):
> Does this get both defs and refs?
This goes through the entirety of .symtab so both defs and refs.
> I think the latter as I am seeing a case where we are linking a bitcode object that contains both the vtable and typename defs but are disabling --lto-whole-program-visibility, and the reason seems to be a reference to the vtable from a native object.
Mocking up a forward class declaration in the native object where the definition is in bitcode I see --lto-whole-program-visibility get disabled. Only looking at vtable defs makes sense, I'll make the change and add a test.

+      if (s.st_shndx != SHN_UNDEF) {

MaskRay: `getGlobalELFSyms` to skip local symbols

+        StringRef name = check(s.getName(f->getStringTable()));
+        processVtableAndTypeInfoSymbols(name);
+      }
+    }
+  }
+
+  for (SharedFile *f : ctx.sharedFiles) {
+    using Elf_Sym = typename ELFT::Sym;
+    for (const Elf_Sym &s : f->template getELFSyms<ELFT>()) {
+      if (s.st_shndx != SHN_UNDEF) {
+        StringRef name = check(s.getName(f->getStringTable()));
+        processVtableAndTypeInfoSymbols(name);
+      }
+    }
+  }
+
+  SmallSetVector<StringRef, 0> vtableSymbolsWithNoRTTI;
+  for (StringRef s : vtableSymbols)

MaskRay: This still relies on the iteration order of `DenseSet vtableSymbols`. We need SetVector for `vtableSymbols` as well. `auto &s` => `StringRef s`

+    if (!typeInfoSymbols.count(s))
+      vtableSymbolsWithNoRTTI.insert(s);

MaskRay: drop the two nested braces

+
+  // Remove known safe symbols.
+  for (auto *arg : args.filtered(OPT_lto_known_safe_vtables)) {
+    StringRef knownSafeName = arg->getValue();
+    if (!knownSafeName.consume_front("_ZTV"))

MaskRay (suggested edit):
   }
-  // Remove known safe symbols
+  // Remove known safe symbols.
   for (auto *arg : args.filtered(OPT_lto_known_safe_vtables)) {

+      error("--lto-known-safe-vtables=: expected symbol to start with _ZTV, "
+            "but got " +
+            knownSafeName);

MaskRay: If not starting with `_ZTV`, consider reporting an error?

+    vtableSymbolsWithNoRTTI.remove(knownSafeName);
+  }
+
+  ctx.ltoAllVtablesHaveTypeInfos = vtableSymbolsWithNoRTTI.empty();

MaskRay (suggested edit):
     vtableSymbolsWithNoRTTI.erase(knownSafeName);
   }
-  config->ltoAllVtablesHaveTypeInfos = true;
-  // Check for unmatched RTTI symbols
+  config->ltoAllVtablesHaveTypeInfos = vtableSymbolsWithNoRTTI.empty();
+  // Check for unmatched RTTI symbols
and delete the assignment below

+  // Check for unmatched RTTI symbols
+  for (StringRef s : vtableSymbolsWithNoRTTI) {

MaskRay: the order is not guaranteed to be deterministic. Consider a SmallSetVector with inline size=0. `auto &` => `StringRef`

+    message(
+        "--lto-validate-all-vtables-have-type-infos: RTTI missing for vtable "
+        "_ZTV" +
+        s + ", --lto-whole-program-visibility disabled");
+  }
+}

 static DebugCompressionType getCompressionType(StringRef s, StringRef option) {
   DebugCompressionType type = StringSwitch<DebugCompressionType>(s)
                                   .Case("zlib", DebugCompressionType::Zlib)
                                   .Case("zstd", DebugCompressionType::Zstd)
                                   .Default(DebugCompressionType::None);
   if (type == DebugCompressionType::None) {
     if (s != "none")
       error("unknown " + option + " value: " + s);
   } else if (const char *reason = compression::getReasonIfUnsupported(
                  compression::formatFor(type))) {
     error(option + ": " + reason);
   }
   return type;
 }

 static StringRef getAliasSpelling(opt::Arg *arg) {
   if (const opt::Arg *alias = arg->getAlias())
     return alias->getSpelling();

tejohnson: Prefer message() over warn() because the latter causes builds using -Werror to fail.

modimo (author): Is that the case for lld? Looking through the equivalent functionality is under --fatal-warnings and on a small example -Werror doesn't affect this flag/cause the build to fail.

tejohnson: Sorry, you are correct. We use --fatal-warnings for this on linker actions (and -Werror on compile actions). In general, I think this should be an informational message, not a warning, since it is being handled automatically.

modimo (author): The case I want to catch is when an existing project changes its native dependencies and disables the optimization which would better fit a warning. In the general case agreed that this isn't a warning. I don't have much experience in linker protocol here, @MaskRay thoughts?

tejohnson: The problem is if it is a warning, then we have to go in and manually change options or builds will fail. Can you intercept message output, which should always be emitted (i.e. don't need verbose options).

modimo (author): Sure, changed to message.

MaskRay: This long list here makes me nervous. Will try to learn it.

modimo (author): Taking a look again I don't think these names need special exclusion. They're defined in libstdc++/libc++abi and the release packages in my scenarios have RTTI enabled meaning their type info symbols are present during linking. Testing E2E linking on some of our large services with these removed succeeds.
If RTTI is disabled on these libraries considerably more symbols would not have matching type infos and --lto-whole-program-visibility should be disabled. Looking back I think these exclusions came about when I was only examining the .symtab/.dynsym of individual object/shared files which didn't take into account that these symbols would be resolved by libstdc++/libc++abi.

   return arg->getSpelling();
 }

 static std::pair<StringRef, StringRef> getOldNewOptions(opt::InputArgList &args,
                                                         unsigned id) {
   auto *arg = args.getLastArg(id);
   if (!arg)
     return {"", ""};

[166 lines hidden] static void readConfigs(opt::InputArgList &args) {
   config->ltoPGOWarnMismatch = args.hasFlag(OPT_lto_pgo_warn_mismatch,
                                             OPT_no_lto_pgo_warn_mismatch, true);
   config->ltoDebugPassManager = args.hasArg(OPT_lto_debug_pass_manager);
   config->ltoEmitAsm = args.hasArg(OPT_lto_emit_asm);
   config->ltoNewPmPasses = args.getLastArgValue(OPT_lto_newpm_passes);
   config->ltoWholeProgramVisibility =
       args.hasFlag(OPT_lto_whole_program_visibility,
                    OPT_no_lto_whole_program_visibility, false);
+  config->ltoValidateAllVtablesHaveTypeInfos =
+      args.hasFlag(OPT_lto_validate_all_vtables_have_type_infos,
+                   OPT_no_lto_validate_all_vtables_have_type_infos, false);
   config->ltoo = args::getInteger(args, OPT_lto_O, 2);
   if (config->ltoo > 3)
     error("invalid optimization level for LTO: " + Twine(config->ltoo));
   unsigned ltoCgo =
       args::getInteger(args, OPT_lto_CGO, args::getCGOptLevel(config->ltoo));
   if (auto level = CodeGenOpt::getLevel(ltoCgo))
     config->ltoCgo = *level;
   else

[1,563 lines hidden] void LinkerDriver::link(opt::InputArgList &args) {
   // compileBitcodeFiles, so we are done afterwards. --plugin-opt=emit-llvm and
   // --plugin-opt=emit-asm create output files in bitcode or assembly code,
   // respectively. When only certain thinLTO modules are specified for
   // compilation, the intermediate object file are the expected output.
   const bool skipLinkedOutput = config->thinLTOIndexOnly || config->emitLLVM ||
                                 config->ltoEmitAsm ||
                                 !config->thinLTOModulesToCompile.empty();

+  // Handle --lto-validate-all-vtables-have-type-infos.

MaskRay (suggested edit):
                                 !config->thinLTOModulesToCompile.empty();
-  // Handle -lto-validate-all-vtables-had-type-infos
+  // Handle --lto-validate-all-vtables-have-type-infos.
   if (config->ltoValidateAllVtablesHaveTypeInfos)

MaskRay: Not done.

+  if (config->ltoValidateAllVtablesHaveTypeInfos)
+    invokeELFT(ltoValidateAllVtablesHaveTypeInfos, args);

   // Do link-time optimization if given files are LLVM bitcode files.
   // This compiles bitcode files into real object files.
   //
   // With this the symbol table should be complete. After this, no new names
   // except a few linker-synthesized ones will be added to the symbol table.
   const size_t numObjsBeforeLTO = ctx.objectFiles.size();
   invokeELFT(compileBitcodeFiles, skipLinkedOutput);
[194 lines hidden]
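
As a reading aid for the `ltoValidateAllVtablesHaveTypeInfos` function in the Driver.cpp diff above, the matching logic can be sketched outside of lld. This is a simplified model under stated assumptions: `vtablesWithoutTypeInfo` is a hypothetical name, the input is a flat list of defined native symbol names rather than ELF symbol tables, and standard containers replace `DenseSet` and `SmallSetVector`.

```cpp
#include <set>
#include <string>
#include <string_view>
#include <vector>

// Returns the vtable symbols (with the "_ZTV" prefix restored) that have no
// matching "_ZTI" symbol and are not allowlisted, in first-seen order.
std::vector<std::string>
vtablesWithoutTypeInfo(const std::vector<std::string> &definedSymbols,
                       const std::set<std::string> &knownSafeVtables) {
  std::set<std::string> typeInfos;   // mangled type names seen after "_ZTI"
  std::set<std::string> seenVtables; // used only to drop duplicates
  std::vector<std::string> vtables;  // insertion order kept for determinism

  for (const std::string &sym : definedSymbols) {
    std::string_view name = sym;
    if (name.substr(0, 4) == "_ZTI") {
      typeInfos.insert(std::string(name.substr(4)));
    } else if (name.substr(0, 4) == "_ZTV") {
      std::string type(name.substr(4));
      if (seenVtables.insert(type).second)
        vtables.push_back(type);
    }
  }

  std::vector<std::string> missing;
  for (const std::string &type : vtables)
    if (!typeInfos.count(type) && !knownSafeVtables.count("_ZTV" + type))
      missing.push_back("_ZTV" + type);
  return missing;
}
```

In this model, an empty result corresponds to `ltoAllVtablesHaveTypeInfos` being true, and each returned name corresponds to one "RTTI missing for vtable" message.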

lld/ELF/LTO.cpp

[146 lines hidden] static lto::Config createConfig() {
   c.SampleProfile = std::string(config->ltoSampleProfile);
   for (StringRef pluginFn : config->passPlugins)
     c.PassPlugins.push_back(std::string(pluginFn));
   c.DebugPassManager = config->ltoDebugPassManager;
   c.DwoDir = std::string(config->dwoDir);

   c.HasWholeProgramVisibility = config->ltoWholeProgramVisibility;
+  c.ValidateAllVtablesHaveTypeInfos =
+      config->ltoValidateAllVtablesHaveTypeInfos;
+  c.AllVtablesHaveTypeInfos = ctx.ltoAllVtablesHaveTypeInfos;
   c.AlwaysEmitRegularLTOObj = !config->ltoObjPath.empty();

   for (const llvm::StringRef &name : config->thinLTOModulesToCompile)
     c.ThinLTOModulesToCompile.emplace_back(name);

   c.TimeTraceEnabled = config->timeTraceEnabled;
   c.TimeTraceGranularity = config->timeTraceGranularity;

[243 lines hidden]

lld/ELF/Options.td

Show First 20 Lines • Show All 598 Lines • ▼ Show 20 Lines	def lto_partitions: JJ<"lto-partitions=">,
HelpText<"Number of LTO codegen partitions">;		HelpText<"Number of LTO codegen partitions">;
def lto_cs_profile_generate: FF<"lto-cs-profile-generate">,		def lto_cs_profile_generate: FF<"lto-cs-profile-generate">,
HelpText<"Perform context sensitive PGO instrumentation">;		HelpText<"Perform context sensitive PGO instrumentation">;
def lto_cs_profile_file: JJ<"lto-cs-profile-file=">,		def lto_cs_profile_file: JJ<"lto-cs-profile-file=">,
HelpText<"Context sensitive profile file path">;		HelpText<"Context sensitive profile file path">;
defm lto_pgo_warn_mismatch: BB<"lto-pgo-warn-mismatch",		defm lto_pgo_warn_mismatch: BB<"lto-pgo-warn-mismatch",
"turn on warnings about profile cfg mismatch (default)",		"turn on warnings about profile cfg mismatch (default)",
"turn off warnings about profile cfg mismatch">;		"turn off warnings about profile cfg mismatch">;
		defm lto_known_safe_vtables : EEq<"lto-known-safe-vtables",
		MaskRayUnsubmitted Done Reply Inline Actions New options use `EEq` to disallow single-dash long options, to not conflict with `-l`. MaskRay: New options use `EEq` to disallow single-dash long options, to not conflict with `-l`.
		MaskRayUnsubmitted Done Reply Inline Actions `When --lto-validate-all-vtables-have-type-infos is enabled, skip validation on these vtables (_ZTV symbols)` MaskRay: `When --lto-validate-all-vtables-have-type-infos is enabled, skip validation on these vtables…
		"When --lto-validate-all-vtables-have-type-infos is enabled, skip validation on these vtables (_ZTV symbols)">;
def lto_obj_path_eq: JJ<"lto-obj-path=">;		def lto_obj_path_eq: JJ<"lto-obj-path=">;
def lto_sample_profile: JJ<"lto-sample-profile=">,		def lto_sample_profile: JJ<"lto-sample-profile=">,
HelpText<"Sample profile file path">;		HelpText<"Sample profile file path">;
		defm lto_validate_all_vtables_have_type_infos: BB<"lto-validate-all-vtables-have-type-infos",
		"Validate that all vtables have type infos for LTO link",
		"Do not validate that all vtables have type infos for LTO link">;
defm lto_whole_program_visibility: BB<"lto-whole-program-visibility",		defm lto_whole_program_visibility: BB<"lto-whole-program-visibility",
"Asserts that the LTO link has whole program visibility",		"Asserts that the LTO link has whole program visibility",
"Asserts that the LTO link does not have whole program visibility">;		"Asserts that the LTO link does not have whole program visibility">;
def disable_verify: F<"disable-verify">;		def disable_verify: F<"disable-verify">;
defm mllvm: Eq<"mllvm", "Additional arguments to forward to LLVM's option processing">;		defm mllvm: Eq<"mllvm", "Additional arguments to forward to LLVM's option processing">;
def opt_remarks_filename: Separate<["--"], "opt-remarks-filename">,		def opt_remarks_filename: Separate<["--"], "opt-remarks-filename">,
HelpText<"YAML output file for optimization remarks">;		HelpText<"YAML output file for optimization remarks">;
defm opt_remarks_hotness_threshold: EEq<"opt-remarks-hotness-threshold",		defm opt_remarks_hotness_threshold: EEq<"opt-remarks-hotness-threshold",
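The check that these options control can be sketched as follows. This is a minimal illustration, not lld's implementation: `ValidationResult`, `validateVtables`, and `startsWith` are hypothetical names, the known-safe list is matched by exact name for simplicity, and the input is assumed to be the set of defined symbol names gathered from native object files and DSOs.

```cpp
#include <set>
#include <string>
#include <vector>

// Hypothetical result type for illustration; lld's real data structures differ.
struct ValidationResult {
  bool wholeProgramVisibilityOk = true;  // false once a vtable lacks RTTI
  std::vector<std::string> missingRtti;  // _ZTV names with no matching _ZTI
  std::set<std::string> nativeTypeIds;   // _ZTS names of types seen in native files
};

static bool startsWith(const std::string &s, const std::string &prefix) {
  return s.compare(0, prefix.size(), prefix) == 0;
}

// Assumption: definedSyms holds only symbols *defined* in native files;
// undefined references are excluded by the caller.
ValidationResult validateVtables(const std::vector<std::string> &definedSyms,
                                 const std::set<std::string> &knownSafe) {
  std::set<std::string> defined(definedSyms.begin(), definedSyms.end());
  ValidationResult r;
  for (const std::string &name : defined) {
    // Every native typeinfo marks its type as one WPD should leave alone.
    if (startsWith(name, "_ZTI"))
      r.nativeTypeIds.insert("_ZTS" + name.substr(4));
    // Only vtable definitions not on the known-safe list are validated.
    if (!startsWith(name, "_ZTV") || knownSafe.count(name))
      continue;
    // A vtable without a matching typeinfo is a hole in our knowledge.
    if (!defined.count("_ZTI" + name.substr(4))) {
      r.wholeProgramVisibilityOk = false;
      r.missingRtti.push_back(name);
    }
  }
  return r;
}
```

Under these assumptions, passing `{"_ZTV6Native", "_ZTI6Native", "_ZTI1A"}` keeps whole-program visibility enabled and records `_ZTS1A` as a native type, while passing only `{"_ZTV6Native"}` reports the vtable as missing RTTI unless it is on the known-safe list.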

lld/test/ELF/lto/Inputs/devirt_validate_vtable_typeinfos.ll

This file was added.

				; REQUIRES: x86

				target datalayout = "e-m:e-p270:32:32-p271:32:32-p272:64:64-i64:64-f80:128-n8:16:32:64-S128"
				target triple = "x86_64-unknown-linux-gnu"

				MaskRay [Done]: `grtev4` => `unknown`
				%struct.A = type { ptr }
				%struct.Native = type { %struct.A }

				@_ZTV6Native = linkonce_odr unnamed_addr constant { [4 x ptr] } { [4 x ptr] [ptr null, ptr @_ZTI6Native, ptr @_ZN1A1nEi, ptr @_ZN6Native1fEi] }
				@_ZTS6Native = linkonce_odr constant [8 x i8] c"6Native\00"
				@_ZTI6Native = linkonce_odr constant { ptr, ptr, ptr } { ptr null, ptr @_ZTS6Native, ptr @_ZTI1A }

				; Base type A does not need to emit a vtable if it's never instantiated. However, RTTI still gets generated
				@_ZTS1A = linkonce_odr constant [3 x i8] c"1A\00"
				@_ZTI1A = linkonce_odr constant { ptr, ptr } { ptr null, ptr @_ZTS1A }


				define linkonce_odr i32 @_ZN6Native1fEi(ptr %this, i32 %a) #0 {
				ret i32 1;
				}

				define linkonce_odr i32 @_ZN1A1nEi(ptr %this, i32 %a) #0 {
				ret i32 0;
				}

				attributes #0 = { noinline optnone }

lld/test/ELF/lto/Inputs/devirt_validate_vtable_typeinfos_no_rtti.ll

This file was added.

				; REQUIRES: x86

				target datalayout = "e-m:e-p270:32:32-p271:32:32-p272:64:64-i64:64-f80:128-n8:16:32:64-S128"
				target triple = "x86_64-unknown-linux-gnu"

				%struct.A = type { ptr }
				%struct.Native = type { %struct.A }

				@_ZTV6Native = linkonce_odr unnamed_addr constant { [4 x ptr] } { [4 x ptr] [ptr null, ptr null, ptr @_ZN1A1nEi, ptr @_ZN6Native1fEi] }

				define linkonce_odr i32 @_ZN6Native1fEi(ptr %this, i32 %a) #0 {
				ret i32 1;
				}

				define linkonce_odr i32 @_ZN1A1nEi(ptr %this, i32 %a) #0 {
				ret i32 0;
				}

				attributes #0 = { noinline optnone }

lld/test/ELF/lto/Inputs/devirt_validate_vtable_typeinfos_ref.ll

This file was added.

				;; Source code:
				;; cat > a.h <<'eof'
				;; struct A { virtual int foo(); };
				;; int bar(A *a);
				;; eof
				;; cat > b.cc <<'eof'
				;; #include "a.h"
				;; struct B : A { int foo() { return 2; } };
				;; int baz() { B b; return bar(&b); }
				;; eof
				;; clang++ -flto=thin b.cc -c

				target datalayout = "e-m:e-p270:32:32-p271:32:32-p272:64:64-i64:64-f80:128-n8:16:32:64-S128"
				target triple = "x86_64-unknown-linux-gnu"

				%struct.B = type { %struct.A }
				%struct.A = type { ptr }

				@_ZTV1B = linkonce_odr dso_local unnamed_addr constant { [3 x ptr] } { [3 x ptr] [ptr null, ptr @_ZTI1B, ptr @_ZN1B3fooEv] }, !type !0, !type !1, !type !2, !type !3
				@_ZTS1B = linkonce_odr dso_local constant [3 x i8] c"1B\00"
				@_ZTI1A = external constant ptr
				@_ZTI1B = linkonce_odr dso_local constant { ptr, ptr, ptr } { ptr null, ptr @_ZTS1B, ptr @_ZTI1A }
				@_ZTV1A = external unnamed_addr constant { [3 x ptr] }

				define dso_local noundef i32 @_Z3bazv() #0 {
				entry:
				%b = alloca %struct.B
				call void @_ZN1BC2Ev(ptr noundef nonnull align 8 dereferenceable(8) %b)
				%call = call noundef i32 @_Z3barP1A(ptr noundef %b)
				ret i32 %call
				}

				define linkonce_odr dso_local void @_ZN1BC2Ev(ptr noundef nonnull align 8 dereferenceable(8) %this) #0 {
				entry:
				%this.addr = alloca ptr
				store ptr %this, ptr %this.addr
				%this1 = load ptr, ptr %this.addr
				call void @_ZN1AC2Ev(ptr noundef nonnull align 8 dereferenceable(8) %this1)
				store ptr getelementptr inbounds ({ [3 x ptr] }, ptr @_ZTV1B, i32 0, inrange i32 0, i32 2), ptr %this1
				ret void
				}

				declare i32 @_Z3barP1A(ptr noundef)

				define linkonce_odr dso_local void @_ZN1AC2Ev(ptr noundef nonnull align 8 dereferenceable(8) %this) #0 {
				entry:
				%this.addr = alloca ptr
				store ptr %this, ptr %this.addr
				%this1 = load ptr, ptr %this.addr
				store ptr getelementptr inbounds ({ [3 x ptr] }, ptr @_ZTV1A, i32 0, inrange i32 0, i32 2), ptr %this1
				ret void
				}

				define linkonce_odr i32 @_ZN1B3fooEv(ptr noundef nonnull align 8 dereferenceable(8) %this) #0 {
				entry:
				%this.addr = alloca ptr
				store ptr %this, ptr %this.addr
				%this1 = load ptr, ptr %this.addr
				ret i32 2
				}

				;; Make sure we don't inline or otherwise optimize out the direct calls.
				attributes #0 = { noinline optnone }

				!0 = !{i64 16, !"_ZTS1A"}
				!1 = !{i64 16, !"_ZTSM1AFivE.virtual"}
				!2 = !{i64 16, !"_ZTS1B"}
				!3 = !{i64 16, !"_ZTSM1BFivE.virtual"}

lld/test/ELF/lto/Inputs/devirt_validate_vtable_typeinfos_undef.ll

This file was added.

				; REQUIRES: x86

				target datalayout = "e-m:e-p270:32:32-p271:32:32-p272:64:64-i64:64-f80:128-n8:16:32:64-S128"
				target triple = "x86_64-unknown-linux-gnu"

				@_ZTV1B = external unnamed_addr constant { [4 x ptr] }

				define linkonce_odr void @_ZN1BC2Ev(ptr %this) #0 {
				%this.addr = alloca ptr, align 8
				store ptr %this, ptr %this.addr, align 8
				%this1 = load ptr, ptr %this.addr, align 8
				store ptr getelementptr inbounds ({ [4 x ptr] }, ptr @_ZTV1B, i32 0, inrange i32 0, i32 2), ptr %this1, align 8
				ret void
				}

				attributes #0 = { noinline optnone }

lld/test/ELF/lto/devirt_validate_vtable_typeinfos.ll

This file was added.

				; REQUIRES: x86

				;; Common artifacts
				; RUN: opt --thinlto-bc -o %t1.o %s
				; RUN: opt --thinlto-bc --thinlto-split-lto-unit -o %t1_hybrid.o %s
				; RUN: cp %s %t1_regular.ll
				; RUN: echo '!llvm.module.flags = !{!12, !13}' >> %t1_regular.ll
				modimo (Author) [Done]: Appending module flags so RegularLTO correctly generates its summary without `typeidCompatibleVTable` means the test can be re-used. However, I think duplicating the tests is reasonable as well and could be cleaner, WDYT?
				tejohnson [Done]: Do we need these module flags for correct operation of this test (ditto for the similar no_rtti one later)? If not, then probably don't bother adding in these tests (I think these may only be needed in practice for the hybrid testing). If they are now needed for correct operation of the regular LTO testing, then I am ok with the approach here as I think it is probably better to reduce duplication of nearly identical IR tests (and I see this approach used in other tests too).
				modimo (Author) [Done]:
				> Do we need these module flags for correct operation of this test (ditto for the similar no_rtti one later)?
				Yes, triggering summary generation on the RegularLTO pipeline requires these module flags.
				> If they are now needed for correct operation of the regular LTO testing, then I am ok with the approach here as I think it is probably better to reduce duplication of nearly identical IR tests (and I see this approach used in other tests too).
				Sounds good, I'll leave it as is.
				; RUN: echo '!12 = !{i32 1, !"ThinLTO", i32 0}' >> %t1_regular.ll
				; RUN: echo '!13 = !{i32 1, !"EnableSplitLTOUnit", i32 1}' >> %t1_regular.ll
				; RUN: opt -module-summary -o %t1_regular.o %t1_regular.ll

				; RUN: llvm-as %S/Inputs/devirt_validate_vtable_typeinfos.ll -o %t2.bc
				; RUN: llc -relocation-model=pic -filetype=obj %t2.bc -o %t2.o
				; RUN: ld.lld %t2.o -o %t2.so -shared
				MaskRay [Done]: You can remove the relocation-model=static object file as there is no testable difference. Then, consider renaming `%t2_pic.o` to `%t2.o`

				; RUN: llvm-as %S/Inputs/devirt_validate_vtable_typeinfos_no_rtti.ll -o %t2_nortti.bc
				; RUN: llc -relocation-model=pic -filetype=obj %t2_nortti.bc -o %t2_nortti.o
				; RUN: ld.lld %t2_nortti.o -o %t2_nortti.so -shared

				; RUN: llvm-as %S/Inputs/devirt_validate_vtable_typeinfos_undef.ll -o %t2_undef.bc
				; RUN: llc -relocation-model=pic -filetype=obj %t2_undef.bc -o %t2_undef.o
				; RUN: ld.lld %t2_undef.o -o %t2_undef.so -shared

				;; With --lto-whole-program-visibility, we assume no native types can interfere
				;; and thus proceed with devirtualization even in the presence of native types

				;; Index based WPD
				; RUN: ld.lld %t1.o %t2.o -o %t3_index -save-temps --lto-whole-program-visibility \
				; RUN: -mllvm -pass-remarks=. 2>&1 | FileCheck %s --check-prefix=REMARK
				; RUN: llvm-dis %t1.o.4.opt.bc -o - | FileCheck %s --check-prefixes=CHECK-COMMON-IR-LABEL,CHECK-IR

				;; Hybrid WPD
				; RUN: ld.lld %t1_hybrid.o %t2.o -o %t3_hybrid -save-temps --lto-whole-program-visibility \
				; RUN: -mllvm -pass-remarks=. 2>&1 | FileCheck %s --check-prefix=REMARK
				; RUN: llvm-dis %t1_hybrid.o.4.opt.bc -o - | FileCheck %s --check-prefixes=CHECK-COMMON-IR-LABEL,CHECK-IR

				;; Regular LTO WPD
				; RUN: ld.lld %t1_regular.o %t2.o -o %t3_regular -save-temps --lto-whole-program-visibility \
				; RUN: -mllvm -pass-remarks=. 2>&1 | FileCheck %s --check-prefix=REMARK
				; RUN: llvm-dis %t3_regular.0.4.opt.bc -o - | FileCheck %s --check-prefixes=CHECK-COMMON-IR-LABEL,CHECK-IR

				; REMARK-DAG: single-impl: devirtualized a call to _ZN1A1nEi
				; REMARK-DAG: single-impl: devirtualized a call to _ZN1D1mEi

				;; With --lto-validate-all-vtables-have-type-infos, the linker checks for the presence of vtables
				;; and RTTI in native files and blocks devirtualization to be conservative on correctness
				;; for these types.

				;; Index based WPD
				; RUN: ld.lld %t1.o %t2.o -o %t4_index -save-temps --lto-whole-program-visibility --lto-validate-all-vtables-have-type-infos \
				; RUN: -mllvm -pass-remarks=. 2>&1 | FileCheck %s --check-prefix=VALIDATE
				; RUN: llvm-dis %t1.o.4.opt.bc -o - | FileCheck %s --check-prefixes=CHECK-COMMON-IR-LABEL,CHECK-VALIDATE-IR

				;; Hybrid WPD
				; RUN: ld.lld %t1_hybrid.o %t2.o -o %t4_hybrid -save-temps --lto-whole-program-visibility --lto-validate-all-vtables-have-type-infos \
				; RUN: -mllvm -pass-remarks=. 2>&1 | FileCheck %s --check-prefix=VALIDATE
				; RUN: llvm-dis %t1_hybrid.o.4.opt.bc -o - | FileCheck %s --check-prefixes=CHECK-COMMON-IR-LABEL,CHECK-VALIDATE-IR

				;; Regular LTO WPD
				; RUN: ld.lld %t1_regular.o %t2.o -o %t4_regular -save-temps --lto-whole-program-visibility --lto-validate-all-vtables-have-type-infos \
				; RUN: -mllvm -pass-remarks=. 2>&1 | FileCheck %s --check-prefix=VALIDATE
				; RUN: llvm-dis %t4_regular.0.4.opt.bc -o - | FileCheck %s --check-prefixes=CHECK-COMMON-IR-LABEL,CHECK-VALIDATE-IR

				;; DSOs behave similarly

				;; Index based WPD
				; RUN: ld.lld %t1.o %t2.so -o %t5_index -save-temps --lto-whole-program-visibility --lto-validate-all-vtables-have-type-infos \
				; RUN: -mllvm -pass-remarks=. 2>&1 | FileCheck %s --check-prefix=VALIDATE
				; RUN: llvm-dis %t1.o.4.opt.bc -o - | FileCheck %s --check-prefixes=CHECK-COMMON-IR-LABEL,CHECK-VALIDATE-IR

				;; Hybrid WPD
				; RUN: ld.lld %t1_hybrid.o %t2.so -o %t5_hybrid -save-temps --lto-whole-program-visibility --lto-validate-all-vtables-have-type-infos \
				; RUN: -mllvm -pass-remarks=. 2>&1 | FileCheck %s --check-prefix=VALIDATE
				; RUN: llvm-dis %t1_hybrid.o.4.opt.bc -o - | FileCheck %s --check-prefixes=CHECK-COMMON-IR-LABEL,CHECK-VALIDATE-IR

				;; Regular LTO WPD
				; RUN: ld.lld %t1_regular.o %t2.so -o %t5_regular -save-temps --lto-whole-program-visibility --lto-validate-all-vtables-have-type-infos \
				; RUN: -mllvm -pass-remarks=. 2>&1 | FileCheck %s --check-prefix=VALIDATE
				; RUN: llvm-dis %t5_regular.0.4.opt.bc -o - | FileCheck %s --check-prefixes=CHECK-COMMON-IR-LABEL,CHECK-VALIDATE-IR

				; VALIDATE-NOT: single-impl:
				; VALIDATE: single-impl: devirtualized a call to _ZN1D1mEi
				; VALIDATE-NOT: single-impl:

				;; When vtables without type infos are detected in native files, we have a hole in our knowledge so
				MaskRay [Done]:
				    ; VALIDATE-NOT: single-impl:
				    ; VALIDATE: single-impl: devirtualized a call to _ZN1D1mEi
				    ; VALIDATE-NOT: single-impl:
				;; --lto-validate-all-vtables-have-type-infos conservatively disables --lto-whole-program-visibility

				;; Index based WPD
				; RUN: ld.lld %t1.o %t2_nortti.o -o %t6_index -save-temps --lto-whole-program-visibility --lto-validate-all-vtables-have-type-infos \
				; RUN: -mllvm -pass-remarks=. 2>&1 | FileCheck %s --check-prefix=NO-RTTI
				; RUN: llvm-dis %t1.o.4.opt.bc -o - | FileCheck %s --check-prefixes=CHECK-COMMON-IR-LABEL,CHECK-NO-RTTI-IR

				;; Hybrid WPD
				; RUN: ld.lld %t1_hybrid.o %t2_nortti.o -o %t6_hybrid -save-temps --lto-whole-program-visibility --lto-validate-all-vtables-have-type-infos \
				; RUN: -mllvm -pass-remarks=. 2>&1 | FileCheck %s --check-prefix=NO-RTTI
				; RUN: llvm-dis %t1_hybrid.o.4.opt.bc -o - | FileCheck %s --check-prefixes=CHECK-COMMON-IR-LABEL,CHECK-NO-RTTI-IR

				;; Regular LTO WPD
				; RUN: ld.lld %t1_regular.o %t2_nortti.o -o %t6_regular -save-temps --lto-whole-program-visibility --lto-validate-all-vtables-have-type-infos \
				; RUN: -mllvm -pass-remarks=. 2>&1 | FileCheck %s --check-prefix=NO-RTTI
				; RUN: llvm-dis %t6_regular.0.4.opt.bc -o - | FileCheck %s --check-prefixes=CHECK-COMMON-IR-LABEL,CHECK-NO-RTTI-IR

				;; DSOs behave similarly

				;; Index based WPD
				; RUN: ld.lld %t1.o %t2_nortti.so -o %t7_index -save-temps --lto-whole-program-visibility --lto-validate-all-vtables-have-type-infos \
				; RUN: -mllvm -pass-remarks=. 2>&1 | FileCheck %s --check-prefix=NO-RTTI
				; RUN: llvm-dis %t1.o.4.opt.bc -o - | FileCheck %s --check-prefixes=CHECK-COMMON-IR-LABEL,CHECK-NO-RTTI-IR

				;; Hybrid WPD
				; RUN: ld.lld %t1_hybrid.o %t2_nortti.so -o %t7_hybrid -save-temps --lto-whole-program-visibility --lto-validate-all-vtables-have-type-infos \
				; RUN: -mllvm -pass-remarks=. 2>&1 | FileCheck %s --check-prefix=NO-RTTI
				; RUN: llvm-dis %t1_hybrid.o.4.opt.bc -o - | FileCheck %s --check-prefixes=CHECK-COMMON-IR-LABEL,CHECK-NO-RTTI-IR

				;; Regular LTO WPD
				; RUN: ld.lld %t1_regular.o %t2_nortti.so -o %t7_regular -save-temps --lto-whole-program-visibility --lto-validate-all-vtables-have-type-infos \
				; RUN: -mllvm -pass-remarks=. 2>&1 | FileCheck %s --check-prefix=NO-RTTI
				; RUN: llvm-dis %t7_regular.0.4.opt.bc -o - | FileCheck %s --check-prefixes=CHECK-COMMON-IR-LABEL,CHECK-NO-RTTI-IR

				; NO-RTTI-DAG: --lto-validate-all-vtables-have-type-infos: RTTI missing for vtable _ZTV6Native, --lto-whole-program-visibility disabled
				; NO-RTTI-DAG: single-impl: devirtualized a call to _ZN1D1mEi

				;; --lto-known-safe-vtables=* can be used to specifically allow types to participate in WPD
				;; even if they don't have corresponding RTTI

				;; Index based WPD
				; RUN: ld.lld %t1.o %t2_nortti.o -o %t8_index -save-temps --lto-whole-program-visibility --lto-validate-all-vtables-have-type-infos \
				; RUN: --lto-known-safe-vtables=_ZTV6Native -mllvm -pass-remarks=. 2>&1 | FileCheck %s --check-prefix=REMARK
				; RUN: llvm-dis %t1.o.4.opt.bc -o - | FileCheck %s --check-prefixes=CHECK-COMMON-IR-LABEL,CHECK-IR

				;; Hybrid WPD
				; RUN: ld.lld %t1_hybrid.o %t2_nortti.o -o %t8_hybrid -save-temps --lto-whole-program-visibility --lto-validate-all-vtables-have-type-infos \
				; RUN: --lto-known-safe-vtables=_ZTV6Native -mllvm -pass-remarks=. 2>&1 | FileCheck %s --check-prefix=REMARK
				; RUN: llvm-dis %t1_hybrid.o.4.opt.bc -o - | FileCheck %s --check-prefixes=CHECK-COMMON-IR-LABEL,CHECK-IR

				;; Regular LTO WPD
				; RUN: ld.lld %t1_regular.o %t2_nortti.o -o %t8_regular -save-temps --lto-whole-program-visibility --lto-validate-all-vtables-have-type-infos \
				; RUN: --lto-known-safe-vtables=_ZTV6Native -mllvm -pass-remarks=. 2>&1 | FileCheck %s --check-prefix=REMARK
				; RUN: llvm-dis %t8_regular.0.4.opt.bc -o - | FileCheck %s --check-prefixes=CHECK-COMMON-IR-LABEL,CHECK-IR

				;; Only check for definitions of vtables symbols, just having a reference does not allow a type to
				;; be derived from

				;; Index based WPD
				; RUN: ld.lld %t1.o %t2_undef.o -o %t9_index -save-temps --lto-whole-program-visibility --lto-validate-all-vtables-have-type-infos \
				; RUN: -mllvm -pass-remarks=. 2>&1 | FileCheck %s --check-prefix=REMARK
				; RUN: llvm-dis %t1.o.4.opt.bc -o - | FileCheck %s --check-prefixes=CHECK-COMMON-IR-LABEL,CHECK-IR

				;; Hybrid WPD
				; RUN: ld.lld %t1_hybrid.o %t2_undef.o -o %t9_hybrid -save-temps --lto-whole-program-visibility --lto-validate-all-vtables-have-type-infos \
				; RUN: -mllvm -pass-remarks=. 2>&1 | FileCheck %s --check-prefix=REMARK
				; RUN: llvm-dis %t1_hybrid.o.4.opt.bc -o - | FileCheck %s --check-prefixes=CHECK-COMMON-IR-LABEL,CHECK-IR

				;; Regular LTO WPD
				; RUN: ld.lld %t1_regular.o %t2_undef.o -o %t9_regular -save-temps --lto-whole-program-visibility --lto-validate-all-vtables-have-type-infos \
				; RUN: -mllvm -pass-remarks=. 2>&1 | FileCheck %s --check-prefix=REMARK
				; RUN: llvm-dis %t9_regular.0.4.opt.bc -o - | FileCheck %s --check-prefixes=CHECK-COMMON-IR-LABEL,CHECK-IR

				target datalayout = "e-m:e-p270:32:32-p271:32:32-p272:64:64-i64:64-f80:128-n8:16:32:64-S128"
				target triple = "x86_64-unknown-linux-gnu"

				%struct.A = type { ptr }
				%struct.B = type { %struct.A }
				%struct.C = type { %struct.A }
				MaskRay [Done]: Consider pasting the source code as well for readability and upgradability? I haven't carefully studied the tests yet...
				modimo (Author) [Done]: I pulled the base IR from lld/test/ELF/lto/devirt_vcall_vis_export_dynamic.ll and modified it so no direct source. The test case is effectively:
				    struct A { virtual void f(int) = 0; virtual void n(int) { return 0; } };
				    struct B { virtual void f(int) { return 0; } };
				    struct C { virtual void f(int) { return 0; } };
				    namespace { struct D { virtual void int m(int) { return 0; } }; }
				    int _start(A obj, D obj2, int a) {
				      call = obj->n(a); // single implementation in A, devirtualize unless a native type derives from A
				      call2 = obj->f(call); // multiple implementation in B and C, never devirtualize
				      call3 = obj2->m(call2); // local type, always devirtualize
				      return call3;
				    }
				%struct.D = type { ptr }

				@_ZTV1B = linkonce_odr constant { [4 x ptr] } { [4 x ptr] [ptr null, ptr @_ZTI1B, ptr @_ZN1B1fEi, ptr @_ZN1A1nEi] }, !type !0, !type !1, !type !2, !type !3, !type !4, !type !5
				@_ZTV1C = linkonce_odr constant { [4 x ptr] } { [4 x ptr] [ptr null, ptr @_ZTI1C, ptr @_ZN1C1fEi, ptr @_ZN1A1nEi] }, !type !0, !type !1, !type !2, !type !6, !type !7, !type !8
				@_ZTV1D = internal constant { [3 x ptr] } { [3 x ptr] [ptr null, ptr @_ZTI1D, ptr @_ZN1D1mEi] }, !type !9, !vcall_visibility !11

				@_ZTS1A = linkonce_odr constant [3 x i8] c"1A\00"
				@_ZTI1A = linkonce_odr constant { ptr, ptr } { ptr null, ptr @_ZTS1A }

				@_ZTS1B = linkonce_odr constant [3 x i8] c"1B\00"
				@_ZTI1B = linkonce_odr constant { ptr, ptr, ptr } { ptr null, ptr @_ZTS1B, ptr @_ZTI1A }

				@_ZTS1C = linkonce_odr constant [3 x i8] c"1C\00"
				@_ZTI1C = linkonce_odr constant { ptr, ptr, ptr } { ptr null, ptr @_ZTS1C, ptr @_ZTI1A }

				@_ZTS1D = internal constant [3 x i8] c"1D\00"
				@_ZTI1D = internal constant { ptr, ptr } { ptr null, ptr @_ZTS1D }

				;; Prevent the vtables from being dead code eliminated.
				@llvm.used = appending global [3 x ptr] [ ptr @_ZTV1B, ptr @_ZTV1C, ptr @_ZTV1D ]

				; CHECK-COMMON-IR-LABEL: define dso_local i32 @_start
				define i32 @_start(ptr %obj, ptr %obj2, i32 %a) {
				entry:
				%vtable = load ptr, ptr %obj
				%p = call i1 @llvm.type.test(ptr %vtable, metadata !"_ZTS1A")
				call void @llvm.assume(i1 %p)
				%fptrptr = getelementptr ptr, ptr %vtable, i32 1
				%fptr1 = load ptr, ptr %fptrptr, align 8

				;; Check that the call was devirtualized.
				; CHECK-IR: %call = tail call i32 @_ZN1A1nEi
				;; --lto-whole-program-visibility disabled so no devirtualization
				; CHECK-VALIDATE-IR: %call = tail call i32 %fptr1
				; CHECK-NO-RTTI-IR: %call = tail call i32 %fptr1
				%call = tail call i32 %fptr1(ptr nonnull %obj, i32 %a)

				%fptr22 = load ptr, ptr %vtable, align 8

				;; We still have to call it as virtual.
				; CHECK-IR: %call2 = tail call i32 %fptr22
				; CHECK-VALIDATE-IR: %call2 = tail call i32 %fptr22
				; CHECK-NO-RTTI-IR: %call2 = tail call i32 %fptr22
				%call2 = tail call i32 %fptr22(ptr nonnull %obj, i32 %call)

				%vtable2 = load ptr, ptr %obj2
				%p2 = call i1 @llvm.type.test(ptr %vtable2, metadata !10)
				call void @llvm.assume(i1 %p2)

				%fptr33 = load ptr, ptr %vtable2, align 8

				;; Check that the call was devirtualized.
				; CHECK-IR: %call3 = tail call i32 @_ZN1D1mEi
				;; Types not present in native files can still be devirtualized
				; CHECK-VALIDATE-IR: %call3 = tail call i32 @_ZN1D1mEi
				;; --lto-whole-program-visibility disabled but being local this
				;; has VCallVisibilityTranslationUnit visibility so it's still devirtualized
				; CHECK-NO-RTTI-IR: %call3 = tail call i32 @_ZN1D1mEi
				%call3 = tail call i32 %fptr33(ptr nonnull %obj2, i32 %call2)

				ret i32 %call3
				}
				; CHECK-COMMON-IR-LABEL: ret i32
				; CHECK-COMMON-IR-LABEL: }

				declare i1 @llvm.type.test(ptr, metadata)
				declare void @llvm.assume(i1)

				define linkonce_odr i32 @_ZN1B1fEi(ptr %this, i32 %a) #0 {
				ret i32 0;
				}

				define linkonce_odr i32 @_ZN1A1nEi(ptr %this, i32 %a) #0 {
				ret i32 0;
				}

				define linkonce_odr i32 @_ZN1C1fEi(ptr %this, i32 %a) #0 {
				ret i32 0;
				}

				define internal i32 @_ZN1D1mEi(ptr %this, i32 %a) #0 {
				ret i32 0;
				}

				;; Make sure we don't inline or otherwise optimize out the direct calls.
				attributes #0 = { noinline optnone }

				!0 = !{i64 16, !"_ZTS1A"}
				!1 = !{i64 16, !"_ZTSM1AFviE.virtual"}
				!2 = !{i64 24, !"_ZTSM1AFviE.virtual"}
				!3 = !{i64 16, !"_ZTS1B"}
				!4 = !{i64 16, !"_ZTSM1BFviE.virtual"}
				!5 = !{i64 24, !"_ZTSM1BFviE.virtual"}
				!6 = !{i64 16, !"_ZTS1C"}
				!7 = !{i64 16, !"_ZTSM1CFviE.virtual"}
				!8 = !{i64 24, !"_ZTSM1CFviE.virtual"}
				!9 = !{i64 16, !10}
				!10 = distinct !{}
				!11 = !{i64 2}
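
The pseudo-source quoted in the inline comment above does not compile as written. A compilable approximation of the hierarchy this test exercises might look like the sketch below. It is a reconstruction under stated assumptions, not the source the IR was generated from: the `int` return types follow the `i32` returns in the IR rather than the `void` in the comment, `B` and `C` are assumed to derive from `A` (as `%struct.B` and `%struct.C` embed `%struct.A`), and `start` stands in for `_start`. The anonymous namespace would also mangle `D::m` differently from the `_ZN1D1mEi` symbol in the IR, which uses internal linkage instead.

```cpp
// Hypothetical reconstruction of the test's class hierarchy.
struct A {
  virtual int f(int) = 0;           // vtable slot 0, overridden in B and C
  virtual int n(int) { return 0; }  // vtable slot 1, single implementation
};

struct B : A {
  int f(int) override { return 0; }
};

struct C : A {
  int f(int) override { return 0; }
};

namespace {
// Translation-unit-local type: no other file can derive from it.
struct D {
  virtual int m(int) { return 0; }
};
} // namespace

int start(A *obj, D *obj2, int a) {
  // Single implementation in A: devirtualized unless a native type derives from A.
  int call = obj->n(a);
  // Implementations in both B and C: never devirtualized.
  int call2 = obj->f(call);
  // Local type: always devirtualized.
  int call3 = obj2->m(call2);
  return call3;
}
```

The declaration order of `f` before `n` matches the vtable layout in `@_ZTV1B` and `@_ZTV1C`, where `f` occupies the first function slot and `n` the second.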

lld/test/ELF/lto/devirt_validate_vtable_typeinfos_mixed_lto.ll

This file was added.

				; REQUIRES: x86

				; RUN: rm -rf %t.dir
				; RUN: split-file %s %t.dir
				; RUN: cd %t.dir

				;; Common artifacts
				; RUN: opt --thinlto-bc --thinlto-split-lto-unit -o %t1.o ThinLTO.ll
				; RUN: opt -module-summary -o %t2.o RegularLTO.ll

				;; --lto-whole-program-visibility when there's split ThinLTO and a RegularLTO with summary optimizes
				;; using the combined index.
				; RUN: ld.lld %t1.o %t2.o -o %t3 -save-temps --lto-whole-program-visibility \
				; RUN: -mllvm -pass-remarks=. 2>&1 | FileCheck %s --check-prefix=REMARK
				; RUN: llvm-dis %t1.o.4.opt.bc -o - | FileCheck %s --check-prefixes=CHECK-IR,CHECK-COMMON-IR
				; RUN: llvm-dis %t3.0.4.opt.bc -o - | FileCheck %s --check-prefixes=CHECK-REGULAR-IR,CHECK-COMMON-REGULAR-IR

				;; --lto-validate-all-vtables-have-type-infos when there's split ThinLTO and a RegularLTO with summary behaves the same
				;; as everything is present in the combined index.
				; RUN: ld.lld %t1.o %t2.o -o %t3 -save-temps --lto-whole-program-visibility --lto-validate-all-vtables-have-type-infos \
				; RUN: -mllvm -pass-remarks=. 2>&1 | FileCheck %s --check-prefix=REMARK
				; RUN: llvm-dis %t1.o.4.opt.bc -o - | FileCheck %s --check-prefixes=CHECK-IR,CHECK-COMMON-IR
				; RUN: llvm-dis %t3.0.4.opt.bc -o - | FileCheck %s --check-prefixes=CHECK-REGULAR-IR,CHECK-COMMON-REGULAR-IR

				; REMARK-DAG: single-impl: devirtualized a call to _ZN1D1mEi
				; REMARK-DAG: single-impl: devirtualized a call to _ZN1A1nEi

				;--- ThinLTO.ll
				target datalayout = "e-m:e-p270:32:32-p271:32:32-p272:64:64-i64:64-f80:128-n8:16:32:64-S128"
				target triple = "x86_64-unknown-linux-gnu"

				%struct.A = type { ptr }
				%struct.B = type { %struct.A }
				%struct.C = type { %struct.A }
				%struct.D = type { ptr }

				@_ZTV1B = linkonce_odr constant { [4 x ptr] } { [4 x ptr] [ptr null, ptr @_ZTI1B, ptr @_ZN1A1fEi, ptr @_ZN1A1nEi] }, !type !0, !type !1, !type !2, !type !3, !type !4, !type !5
				@_ZTV1C = linkonce_odr constant { [4 x ptr] } { [4 x ptr] [ptr null, ptr @_ZTI1C, ptr @_ZN1A1fEi, ptr @_ZN1A1nEi] }, !type !0, !type !1, !type !2, !type !6, !type !7, !type !8
				@_ZTV1D = internal constant { [3 x ptr] } { [3 x ptr] [ptr null, ptr @_ZTI1D, ptr @_ZN1D1mEi] }, !type !9, !vcall_visibility !11

				@_ZTS1A = linkonce_odr constant [3 x i8] c"1A\00"
				@_ZTI1A = linkonce_odr constant { ptr, ptr } { ptr null, ptr @_ZTS1A }

				@_ZTS1B = linkonce_odr constant [3 x i8] c"1B\00"
				@_ZTI1B = linkonce_odr constant { ptr, ptr, ptr } { ptr null, ptr @_ZTS1B, ptr @_ZTI1A }

				@_ZTS1C = linkonce_odr constant [3 x i8] c"1C\00"
				@_ZTI1C = linkonce_odr constant { ptr, ptr, ptr } { ptr null, ptr @_ZTS1C, ptr @_ZTI1A }

				@_ZTS1D = internal constant [3 x i8] c"1D\00"
				@_ZTI1D = internal constant { ptr, ptr } { ptr null, ptr @_ZTS1D }

				;; Prevent the vtables from being dead code eliminated.
				@llvm.used = appending global [3 x ptr] [ ptr @_ZTV1B, ptr @_ZTV1C, ptr @_ZTV1D ], section "llvm.metadata"

				; CHECK-COMMON-IR-LABEL: define dso_local i32 @_start
				define i32 @_start(ptr %obj, ptr %obj2, i32 %a) {
				;; Call function built with RegularLTO
				%RegularLTOResult = call i32 @RegularLTO(ptr %obj, i32 %a)

				;; ThinLTO code starts here
				%vtable = load ptr, ptr %obj
				%p = call i1 @llvm.type.test(ptr %vtable, metadata !"_ZTS1A")
				call void @llvm.assume(i1 %p)
				%fptrptr = getelementptr ptr, ptr %vtable, i32 1
				%fptr1 = load ptr, ptr %fptrptr, align 8

				;; Check that the call was devirtualized.
				; CHECK-IR: %call = tail call i32 @_ZN1A1nEi
				%call = tail call i32 %fptr1(ptr nonnull %obj, i32 %a)

				%fptr22 = load ptr, ptr %vtable, align 8

				;; Check that the call was not devirtualized.
				; CHECK-IR: %call2 = tail call i32 %fptr22
				%call2 = tail call i32 %fptr22(ptr nonnull %obj, i32 %call)

				%vtable2 = load ptr, ptr %obj2
				%p2 = call i1 @llvm.type.test(ptr %vtable2, metadata !10)
				call void @llvm.assume(i1 %p2)

				%fptr33 = load ptr, ptr %vtable2, align 8

				;; Check that the call was devirtualized.
				; CHECK-IR: %call3 = tail call i32 @_ZN1D1mEi
				%call3 = tail call i32 %fptr33(ptr nonnull %obj2, i32 %call2)

				ret i32 %call3
				}
				; CHECK-COMMON-IR-LABEL: ret i32
				; CHECK-COMMON-IR-LABEL: }

				declare i32 @RegularLTO(ptr)
				declare i1 @llvm.type.test(ptr, metadata)
				declare void @llvm.assume(i1)

				define linkonce_odr i32 @_ZN1A1fEi(ptr %this, i32 %a) #0 {
				ret i32 0;
				}

				define linkonce_odr i32 @_ZN1A1nEi(ptr %this, i32 %a) #0 {
				ret i32 0;
				}

				define internal i32 @_ZN1D1mEi(ptr %this, i32 %a) #0 {
				ret i32 0;
				}

				;; Make sure we don't inline or otherwise optimize out the direct calls.
				attributes #0 = { noinline optnone }

				!0 = !{i64 16, !"_ZTS1A"}
				!1 = !{i64 16, !"_ZTSM1AFviE.virtual"}
				!2 = !{i64 24, !"_ZTSM1AFviE.virtual"}
				!3 = !{i64 16, !"_ZTS1B"}
				!4 = !{i64 16, !"_ZTSM1BFviE.virtual"}
				!5 = !{i64 24, !"_ZTSM1BFviE.virtual"}
				!6 = !{i64 16, !"_ZTS1C"}
				tejohnson [Not Done]: Both this and the CHECK-SUMMARY-IR case below are incorrect devirtualizations, right? Is this another case that we are not doing correctly without the validation options in this patch?
				!7 = !{i64 16, !"_ZTSM1CFviE.virtual"}
				!8 = !{i64 24, !"_ZTSM1CFviE.virtual"}
				!9 = !{i64 16, !10}
				!10 = distinct !{}
				!11 = !{i64 2}

				tejohnson [Not Done]: I think we only get the vtable summary from the regular LTO object because it doesn't have the EnableSplitLTOUnit module flag set in the IR here. Normally, this is added by clang when building -flto. And this currently prevents vtable summaries being added to the LTO summary (https://github.com/llvm/llvm-project/blob/8a15bdb5e637f81041591d97bea0267b5f053f16/llvm/lib/Analysis/ModuleSummaryAnalysis.cpp#L734-L736). When I added that guard, it was because I didn't think we needed these summaries when splitting was enabled, as I was thinking of either the everything-is-regular LTO case or the -fsplit-lto-unit case that you get by default with -flto=thin -fwhole-program-vtables, where all the vtables are placed in the regular LTO split modules. It's possible that we could remove that guard, but with it I think this case would do the wrong thing if the regular IR was built from clang with -flto.
				modimo (Author) [Not Done]: I see, so the `BASE` scenario being tested here is already guarded by EnableSplitLTOUnit since ThinLTO would have EnableSplitLTOUnit=0 and RegularLTO would have EnableSplitLTOUnit=1. Is this the scenario described in the previous comment? RegularLTO summaries are added to the combined index used by ThinLTO, but it looks like the vtable summaries aren't currently created for them. I think you are right in that there is a potential hole here for ThinLTO WPD if linked with a regular LTO object containing an override. Can you test this case to confirm? If that is an issue, then I guess we do need another GlobalRes field. Maybe VisibleOutsideLTOUnit or something like that? The mixed case then would be RegularLTO combined with ThinLTO + -split-lto-unit where neither generate `typeidCompatibleVTable` and all the analysis is done on the combined RegularLTO module.
				;--- RegularLTO.ll
				; REQUIRES: x86

				target datalayout = "e-m:e-p270:32:32-p271:32:32-p272:64:64-i64:64-f80:128-n8:16:32:64-S128"
				target triple = "x86_64-unknown-linux-gnu"

				%struct.A = type { ptr }
				%struct.Native = type { %struct.A }

				@_ZTV7Regular = linkonce_odr unnamed_addr constant { [4 x ptr] } { [4 x ptr] [ptr null, ptr @_ZTI7Regular, ptr @_ZN7Regular1fEi, ptr @_ZN1A1nEi] } , !type !0, !type !1, !type !2, !type !3, !type !4, !type !5
				@_ZTS7Regular = linkonce_odr constant [9 x i8] c"7Regular\00"
				@_ZTI7Regular = linkonce_odr constant { ptr, ptr, ptr } { ptr null, ptr @_ZTS7Regular, ptr @_ZTI1A }

				; Base type A does not need to emit a vtable if it's never instantiated. However, RTTI still gets generated
				@_ZTS1A = linkonce_odr constant [3 x i8] c"1A\00"
				@_ZTI1A = linkonce_odr constant { ptr, ptr } { ptr null, ptr @_ZTS1A }

				;; Prevent the vtables from being dead code eliminated.
				@llvm.used = appending global [1 x ptr] [ ptr @_ZTV7Regular ], section "llvm.metadata"

				; CHECK-COMMON-REGULAR-IR-LABEL: define dso_local i32 @RegularLTO
				define i32 @RegularLTO(ptr %obj, i32 %a) #0 {
				entry:
				%vtable = load ptr, ptr %obj
				%p = call i1 @llvm.type.test(ptr %vtable, metadata !"_ZTS1A")
				call void @llvm.assume(i1 %p)
				%fptr1 = load ptr, ptr %vtable, align 8

				;; Check that the call was not devirtualized.
				; CHECK-REGULAR-IR: %call = tail call i32 %fptr1
				%call = tail call i32 %fptr1(ptr nonnull %obj, i32 %a)

				ret i32 %call
				}
				; CHECK-COMMON-REGULAR-IR-LABEL: ret i32
				; CHECK-COMMON-REGULAR-IR-LABEL: }

				declare i1 @llvm.type.test(ptr, metadata)
				declare void @llvm.assume(i1)

				define linkonce_odr i32 @_ZN7Regular1fEi(ptr %this, i32 %a) #0 {
				ret i32 1;
				}

				define linkonce_odr i32 @_ZN1A1nEi(ptr %this, i32 %a) #0 {
				ret i32 0;
				}

				attributes #0 = { noinline optnone }
				!llvm.module.flags = !{!6, !7}

				!0 = !{i64 16, !"_ZTS1A"}
				!1 = !{i64 16, !"_ZTSM1AFviE.virtual"}
				!2 = !{i64 24, !"_ZTSM1AFviE.virtual"}
				!3 = !{i64 16, !"_ZTS7Regular"}
				!4 = !{i64 16, !"_ZTSM7RegularFviE.virtual"}
				!5 = !{i64 24, !"_ZTSM7RegularFviE.virtual"}
				!6 = !{i32 1, !"ThinLTO", i32 0}
				!7 = !{i32 1, !"EnableSplitLTOUnit", i32 1}

lld/test/ELF/lto/devirt_validate_vtable_typeinfos_no_rtti.ll

This file was added.

				; REQUIRES: x86

				;; Common artifacts
				; RUN: opt --thinlto-bc -o %t1.o %s
				; RUN: opt --thinlto-bc --thinlto-split-lto-unit -o %t1_hybrid.o %s
				; RUN: cp %s %t1_regular.ll
				; RUN: echo '!llvm.module.flags = !{!6, !7}' >> %t1_regular.ll
				; RUN: echo '!6 = !{i32 1, !"ThinLTO", i32 0}' >> %t1_regular.ll
				; RUN: echo '!7 = !{i32 1, !"EnableSplitLTOUnit", i32 1}' >> %t1_regular.ll
				tejohnson (Done): The earlier version didn't have this second input file - why is it needed now for this test?
				modimo (Author, Done): Good catch, I re-used the index/hybrid/full commands from `devirt_validate_vtable_typeinfos.ll` and that came along for the ride, removed.
				; RUN: opt -module-summary -o %t1_regular.o %t1_regular.ll

				;; With --lto-whole-program-visibility, we assume no native types can interfere
				;; and thus proceed with devirtualization even in the presence of native types

				;; Index based WPD
				; RUN: ld.lld %t1.o -o %t3_index -save-temps --lto-whole-program-visibility \
				; RUN: -mllvm -pass-remarks=. 2>&1 | FileCheck %s --check-prefix=REMARK
				; RUN: llvm-dis %t1.o.4.opt.bc -o - | FileCheck %s --check-prefixes=CHECK-COMMON-IR-LABEL,CHECK-IR

				;; Hybrid WPD
				; RUN: ld.lld %t1_hybrid.o -o %t3_hybrid -save-temps --lto-whole-program-visibility \
				; RUN: -mllvm -pass-remarks=. 2>&1 | FileCheck %s --check-prefix=REMARK
				; RUN: llvm-dis %t1_hybrid.o.4.opt.bc -o - | FileCheck %s --check-prefixes=CHECK-COMMON-IR-LABEL,CHECK-IR

				;; Regular LTO WPD
				; RUN: ld.lld %t1_regular.o -o %t3_regular -save-temps --lto-whole-program-visibility \
				; RUN: -mllvm -pass-remarks=. 2>&1 | FileCheck %s --check-prefix=REMARK
				; RUN: llvm-dis %t3_regular.0.4.opt.bc -o - | FileCheck %s --check-prefixes=CHECK-COMMON-IR-LABEL,CHECK-IR

				; REMARK-DAG: single-impl: devirtualized a call to _ZN1A1nEi
				; REMARK-DAG: single-impl: devirtualized a call to _ZN1D1mEi

				;; With --lto-whole-program-visibility and --lto-validate-all-vtables-have-type-infos
				;; we rely on resolutions on the typename symbol to inform us of what's outside the summary.
				;; Without the typename symbol in the LTO unit (e.g. RTTI disabled) this causes
				;; conservative disablement of WPD on these types unless it's local

				;; Index based WPD
				; RUN: ld.lld %t1.o -o %t3_index -save-temps --lto-whole-program-visibility --lto-validate-all-vtables-have-type-infos \
				; RUN: -mllvm -pass-remarks=. 2>&1 | FileCheck %s --check-prefix=VALIDATE
				; RUN: llvm-dis %t1.o.4.opt.bc -o - | FileCheck %s --check-prefixes=CHECK-COMMON-IR-LABEL,CHECK-VALIDATE-IR

				;; Hybrid WPD
				; RUN: ld.lld %t1_hybrid.o -o %t3_hybrid -save-temps --lto-whole-program-visibility --lto-validate-all-vtables-have-type-infos \
				; RUN: -mllvm -pass-remarks=. 2>&1 | FileCheck %s --check-prefix=VALIDATE
				; RUN: llvm-dis %t1_hybrid.o.4.opt.bc -o - | FileCheck %s --check-prefixes=CHECK-COMMON-IR-LABEL,CHECK-VALIDATE-IR

				;; Regular LTO WPD
				; RUN: ld.lld %t1_regular.o -o %t3_regular -save-temps --lto-whole-program-visibility --lto-validate-all-vtables-have-type-infos \
				; RUN: -mllvm -pass-remarks=. 2>&1 | FileCheck %s --check-prefix=VALIDATE
				; RUN: llvm-dis %t3_regular.0.4.opt.bc -o - | FileCheck %s --check-prefixes=CHECK-COMMON-IR-LABEL,CHECK-VALIDATE-IR

				; VALIDATE-DAG: single-impl: devirtualized a call to _ZN1D1mEi

				target datalayout = "e-m:e-p270:32:32-p271:32:32-p272:64:64-i64:64-f80:128-n8:16:32:64-S128"
				target triple = "x86_64-unknown-linux-gnu"

				%struct.A = type { ptr }
				%struct.B = type { %struct.A }
				%struct.C = type { %struct.A }
				%struct.D = type { ptr }

				@_ZTV1B = linkonce_odr constant { [4 x ptr] } { [4 x ptr] [ptr null, ptr null, ptr @_ZN1B1fEi, ptr @_ZN1A1nEi] }, !type !0, !type !1
				@_ZTV1C = linkonce_odr constant { [4 x ptr] } { [4 x ptr] [ptr null, ptr null, ptr @_ZN1C1fEi, ptr @_ZN1A1nEi] }, !type !0, !type !2
				@_ZTV1D = internal constant { [3 x ptr] } { [3 x ptr] [ptr null, ptr null, ptr @_ZN1D1mEi] }, !type !3, !vcall_visibility !5

				;; Prevent the vtables from being dead code eliminated.
				@llvm.used = appending global [3 x ptr] [ ptr @_ZTV1B, ptr @_ZTV1C, ptr @_ZTV1D ]

				; CHECK-COMMON-IR-LABEL: define dso_local i32 @_start
				define i32 @_start(ptr %obj, ptr %obj2, i32 %a) {
				entry:
				%vtable = load ptr, ptr %obj
				%p = call i1 @llvm.type.test(ptr %vtable, metadata !"_ZTS1A")
				call void @llvm.assume(i1 %p)
				%fptrptr = getelementptr ptr, ptr %vtable, i32 1
				%fptr1 = load ptr, ptr %fptrptr, align 8

				;; Check that the call was devirtualized.
				; CHECK-IR: %call = tail call i32 @_ZN1A1nEi
				;; No resolution for _ZTS1A means we don't devirtualize
				; CHECK-VALIDATE-IR: %call = tail call i32 %fptr1
				%call = tail call i32 %fptr1(ptr nonnull %obj, i32 %a)

				%fptr22 = load ptr, ptr %vtable, align 8

				;; We still have to call it as virtual.
				; CHECK-IR: %call3 = tail call i32 %fptr22
				; CHECK-VALIDATE-IR: %call3 = tail call i32 %fptr22
				%call3 = tail call i32 %fptr22(ptr nonnull %obj, i32 %call)

				%vtable2 = load ptr, ptr %obj2
				%p2 = call i1 @llvm.type.test(ptr %vtable2, metadata !4)
				call void @llvm.assume(i1 %p2)

				%fptr33 = load ptr, ptr %vtable2, align 8

				;; Check that the call was devirtualized.
				; CHECK-IR: %call4 = tail call i32 @_ZN1D1mEi
				;; Being local this has VCallVisibilityTranslationUnit
				;; visibility so it's still devirtualized
				; CHECK-VALIDATE-IR: %call4 = tail call i32 @_ZN1D1mEi
				%call4 = tail call i32 %fptr33(ptr nonnull %obj2, i32 %call3)
				ret i32 %call4
				}
				; CHECK-COMMON-IR-LABEL: ret i32
				; CHECK-COMMON-IR-LABEL: }

				declare i1 @llvm.type.test(ptr, metadata)
				declare void @llvm.assume(i1)

				define linkonce_odr i32 @_ZN1B1fEi(ptr %this, i32 %a) #0 {
				ret i32 0;
				}

				define linkonce_odr i32 @_ZN1A1nEi(ptr %this, i32 %a) #0 {
				ret i32 0;
				}

				define linkonce_odr i32 @_ZN1C1fEi(ptr %this, i32 %a) #0 {
				ret i32 0;
				}

				define internal i32 @_ZN1D1mEi(ptr %this, i32 %a) #0 {
				ret i32 0;
				}

				;; Make sure we don't inline or otherwise optimize out the direct calls.
				attributes #0 = { noinline optnone }

				!0 = !{i64 16, !"_ZTS1A"}
				!1 = !{i64 16, !"_ZTS1B"}
				!2 = !{i64 16, !"_ZTS1C"}
				!3 = !{i64 16, !4}
				!4 = distinct !{}
				!5 = !{i64 2}

lld/test/ELF/lto/devirt_validate_vtable_typeinfos_ref.ll

This file was added.

				; REQUIRES: x86

				;; Common artifacts
				; RUN: opt --thinlto-bc -o %t1.o %s
				; RUN: opt --thinlto-bc --thinlto-split-lto-unit -o %t1_hybrid.o %s
				; RUN: cp %s %t1_regular.ll
				; RUN: echo '!llvm.module.flags = !{!2, !3}' >> %t1_regular.ll
				; RUN: echo '!2 = !{i32 1, !"ThinLTO", i32 0}' >> %t1_regular.ll
				; RUN: echo '!3 = !{i32 1, !"EnableSplitLTOUnit", i32 1}' >> %t1_regular.ll
				; RUN: opt -module-summary -o %t1_regular.o %t1_regular.ll

				; RUN: llvm-as %S/Inputs/devirt_validate_vtable_typeinfos_ref.ll -o %t2.bc
				; RUN: llc -relocation-model=pic -filetype=obj %t2.bc -o %t2.o

				;; Native objects can contain only a reference to the base type infos if the base declaration has no key functions.
				;; Because of that, --lto-validate-all-vtables-have-type-infos needs to query for the type info symbol inside native files rather than the
				;; type name symbol that's used as the key in !type metadata to correctly stop devirtualization on the native type.

				;; Index based WPD
				; RUN: ld.lld %t1.o %t2.o -o %t3_index -save-temps --lto-whole-program-visibility --lto-validate-all-vtables-have-type-infos \
				; RUN: -mllvm -pass-remarks=. 2>&1 | FileCheck %s
				; RUN: llvm-dis %t1.o.4.opt.bc -o - | FileCheck %s --check-prefixes=CHECK-IR

				;; Hybrid WPD
				; RUN: ld.lld %t1_hybrid.o %t2.o -o %t3_hybrid -save-temps --lto-whole-program-visibility --lto-validate-all-vtables-have-type-infos \
				; RUN: -mllvm -pass-remarks=. 2>&1 | FileCheck %s
				; RUN: llvm-dis %t1_hybrid.o.4.opt.bc -o - | FileCheck %s --check-prefixes=CHECK-IR

				;; Regular LTO WPD
				; RUN: ld.lld %t1_regular.o %t2.o -o %t3_regular -save-temps --lto-whole-program-visibility --lto-validate-all-vtables-have-type-infos \
				; RUN: -mllvm -pass-remarks=. 2>&1 | FileCheck %s
				; RUN: llvm-dis %t3_regular.0.4.opt.bc -o - | FileCheck %s --check-prefixes=CHECK-IR

				; CHECK-NOT: single-impl: devirtualized a call to _ZN1A3fooEv

				;; Source code:
				;; cat > a.h <<'eof'
				;; struct A { virtual int foo(); };
				;; int bar(A *a);
				;; eof
				;; cat > main.cc <<'eof'
				;; #include "a.h"
				;;
				;; int A::foo() { return 1; }
				;; int bar(A *a) { return a->foo(); }
				;;
				;; extern int baz();
				;; int main() {
				;; A a;
				;; int i = bar(&a);
				;; int j = baz();
				;; return i + j;
				;; }
				;; eof
				;; clang++ -fwhole-program-vtables -fno-split-lto-unit -flto=thin main.cc -c

				target datalayout = "e-m:e-p270:32:32-p271:32:32-p272:64:64-i64:64-f80:128-n8:16:32:64-S128"
				target triple = "x86_64-unknown-linux-gnu"

				%struct.A = type { %struct.Abase }
				%struct.Abase = type { ptr }

				@_ZTV1A = dso_local unnamed_addr constant { [3 x ptr] } { [3 x ptr] [ptr null, ptr @_ZTI1A, ptr @_ZN1A3fooEv] }, align 8, !type !0, !type !1
				@_ZTS1A = dso_local constant [3 x i8] c"1A\00", align 1
				@_ZTI1A = dso_local constant { ptr, ptr } { ptr null, ptr @_ZTS1A }, align 8

				define dso_local noundef i32 @_ZN1A3fooEv(ptr noundef nonnull align 8 dereferenceable(8) %this) #0 align 2 {
				entry:
				%this.addr = alloca ptr
				store ptr %this, ptr %this.addr
				%this1 = load ptr, ptr %this.addr
				ret i32 1
				}

				; CHECK-IR: define dso_local noundef i32 @_Z3barP1A
				define dso_local noundef i32 @_Z3barP1A(ptr noundef %a) #0 {
				entry:
				%a.addr = alloca ptr
				store ptr %a, ptr %a.addr
				%0 = load ptr, ptr %a.addr
				%vtable = load ptr, ptr %0
				%1 = call i1 @llvm.public.type.test(ptr %vtable, metadata !"_ZTS1A")
				call void @llvm.assume(i1 %1)
				%vfn = getelementptr inbounds ptr, ptr %vtable, i64 0
				%fptr = load ptr, ptr %vfn
				;; Check that the call was not devirtualized.
				; CHECK-IR: %call = call noundef i32 %fptr
				%call = call noundef i32 %fptr(ptr noundef nonnull align 8 dereferenceable(8) %0)
				ret i32 %call
				}
				; CHECK-IR: ret i32
				; CHECK-IR: }

				declare i1 @llvm.public.type.test(ptr, metadata)
				declare void @llvm.assume(i1 noundef)

				define dso_local noundef i32 @main() #0 {
				entry:
				%retval = alloca i32, align 4
				%a = alloca %struct.A, align 8
				%i = alloca i32, align 4
				%j = alloca i32, align 4
				store i32 0, ptr %retval, align 4
				call void @_ZN1AC2Ev(ptr noundef nonnull align 8 dereferenceable(8) %a)
				%call = call noundef i32 @_Z3barP1A(ptr noundef %a)
				store i32 %call, ptr %i, align 4
				%call1 = call noundef i32 @_Z3bazv()
				store i32 %call1, ptr %j, align 4
				%0 = load i32, ptr %i, align 4
				%1 = load i32, ptr %j, align 4
				%add = add nsw i32 %0, %1
				ret i32 %add
				}

				define linkonce_odr dso_local void @_ZN1AC2Ev(ptr noundef nonnull align 8 dereferenceable(8) %this) #0 align 2 {
				entry:
				%this.addr = alloca ptr, align 8
				store ptr %this, ptr %this.addr, align 8
				%this1 = load ptr, ptr %this.addr, align 8
				store ptr getelementptr inbounds ({ [3 x ptr] }, ptr @_ZTV1A, i32 0, inrange i32 0, i32 2), ptr %this1, align 8
				ret void
				}

				declare noundef i32 @_Z3bazv()

				;; Make sure we don't inline or otherwise optimize out the direct calls.
				attributes #0 = { noinline optnone }

				!0 = !{i64 16, !"_ZTS1A"}
				!1 = !{i64 16, !"_ZTSM1AFivE.virtual"}
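
The test comments above explain why validation has to query the type info symbol (`_ZTI`) rather than the type name symbol (`_ZTS`) used as the `!type` key. The following is a minimal sketch of that mapping, assuming standard-library types in place of `llvm::StringRef` and `function_ref`. It mirrors the shape of `typeIDVisibleToRegularObj` in this diff but is not the committed code; the callback stands in for whatever symbol query the linker provides.

```cpp
#include <cassert>
#include <functional>
#include <string>
#include <string_view>

// Simplified stand-in for the check added to WholeProgramDevirt.cpp.
// Assumption: uses std::string_view / std::function instead of LLVM's ADT.
bool typeIDVisibleToRegularObj(
    std::string_view typeID,
    const std::function<bool(std::string_view)> &isVisibleToRegularObj) {
  constexpr std::string_view virtualSuffix = ".virtual";
  constexpr std::string_view typeNamePrefix = "_ZTS";

  // Type IDs for member function pointer types are internal constructs
  // and never appear as symbols in native objects.
  if (typeID.size() >= virtualSuffix.size() &&
      typeID.substr(typeID.size() - virtualSuffix.size()) == virtualSuffix)
    return false;

  // Type IDs without the Itanium type-name prefix are not externally visible.
  if (typeID.substr(0, typeNamePrefix.size()) != typeNamePrefix)
    return false;
  typeID.remove_prefix(typeNamePrefix.size());

  // Query with the type info symbol: a native object may only reference
  // _ZTI<name> and never define or reference _ZTS<name>.
  std::string typeInfo = "_ZTI" + std::string(typeID);
  return isVisibleToRegularObj(typeInfo);
}
```

With a callback that reports only `_ZTI1A` as visible, the type ID `_ZTS1A` is treated as visible to native objects (so devirtualization is blocked), while `_ZTSM1AFivE.virtual` is not.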

llvm/include/llvm/LTO/Config.h

Show First 20 Lines • Show All 74 Lines • ▼ Show 20 Lines	struct Config {

/// Turn on/off the warning about a hash mismatch in the PGO profile data.		/// Turn on/off the warning about a hash mismatch in the PGO profile data.
bool PGOWarnMismatch = true;		bool PGOWarnMismatch = true;

/// Asserts whether we can assume whole program visibility during the LTO		/// Asserts whether we can assume whole program visibility during the LTO
/// link.		/// link.
bool HasWholeProgramVisibility = false;		bool HasWholeProgramVisibility = false;

		/// We're validating that all native vtables have corresponding type infos.
		bool ValidateAllVtablesHaveTypeInfos = false;
		/// If all native vtables have corresponding type infos, allow
		tejohnson (Done): This would read better like "If all native vtables have corresponding type infos, allow usage..."
		/// usage of RTTI to block devirtualization on types used in native files.
		bool AllVtablesHaveTypeInfos = false;

/// Always emit a Regular LTO object even when it is empty because no Regular		/// Always emit a Regular LTO object even when it is empty because no Regular
/// LTO modules were linked. This option is useful for some build system which		/// LTO modules were linked. This option is useful for some build system which
/// want to know a priori all possible output files.		/// want to know a priori all possible output files.
bool AlwaysEmitRegularLTOObj = false;		bool AlwaysEmitRegularLTOObj = false;

/// Allows non-imported definitions to get the potentially more constraining		/// Allows non-imported definitions to get the potentially more constraining
/// visibility from the prevailing definition. FromPrevailing is the default		/// visibility from the prevailing definition. FromPrevailing is the default
/// because it works for many binary formats. ELF can use the more optimized		/// because it works for many binary formats. ELF can use the more optimized
▲ Show 20 Lines • Show All 215 Lines • Show Last 20 Lines
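
For illustration, the condition that `AllVtablesHaveTypeInfos` records could be computed along these lines. This is a hypothetical sketch, not lld's implementation (which is not shown in this section): it assumes the symbol names from native objects and DSOs have already been collected into a flat list, and it matches vtable (`_ZTV`) and type info (`_ZTI`) symbols purely by their mangled suffix.

```cpp
#include <cassert>
#include <string>
#include <string_view>
#include <unordered_set>
#include <vector>

// Hypothetical helper: returns true when every vtable symbol (_ZTV<name>)
// in `symbols` has a matching type info symbol (_ZTI<name>).
bool allVtablesHaveTypeInfos(const std::vector<std::string> &symbols) {
  std::unordered_set<std::string_view> typeInfos;
  std::vector<std::string_view> vtables;
  for (const std::string &s : symbols) {
    std::string_view sv(s);
    if (sv.substr(0, 4) == "_ZTI")
      typeInfos.insert(sv.substr(4));
    else if (sv.substr(0, 4) == "_ZTV")
      vtables.push_back(sv.substr(4));
  }
  // A single vtable without RTTI makes the whole validation fail.
  for (std::string_view v : vtables)
    if (!typeInfos.count(v))
      return false;
  return true;
}
```

When the result is false, the change described in this revision disables `--lto-whole-program-visibility` rather than trusting RTTI that is only partially available.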

llvm/include/llvm/Transforms/IPO/WholeProgramDevirt.h

Show First 20 Lines • Show All 237 Lines • ▼ Show 20 Lines	struct VTableSlotSummary {
StringRef TypeID;		StringRef TypeID;
uint64_t ByteOffset;		uint64_t ByteOffset;
};		};
bool hasWholeProgramVisibility(bool WholeProgramVisibilityEnabledInLTO);		bool hasWholeProgramVisibility(bool WholeProgramVisibilityEnabledInLTO);
void updatePublicTypeTestCalls(Module &M,		void updatePublicTypeTestCalls(Module &M,
bool WholeProgramVisibilityEnabledInLTO);		bool WholeProgramVisibilityEnabledInLTO);
void updateVCallVisibilityInModule(		void updateVCallVisibilityInModule(
Module &M, bool WholeProgramVisibilityEnabledInLTO,		Module &M, bool WholeProgramVisibilityEnabledInLTO,
const DenseSet<GlobalValue::GUID> &DynamicExportSymbols);		const DenseSet<GlobalValue::GUID> &DynamicExportSymbols,
		bool ValidateAllVtablesHaveTypeInfos,
		function_ref<bool(StringRef)> IsVisibleToRegularObj);
void updateVCallVisibilityInIndex(		void updateVCallVisibilityInIndex(
ModuleSummaryIndex &Index, bool WholeProgramVisibilityEnabledInLTO,		ModuleSummaryIndex &Index, bool WholeProgramVisibilityEnabledInLTO,
const DenseSet<GlobalValue::GUID> &DynamicExportSymbols);		const DenseSet<GlobalValue::GUID> &DynamicExportSymbols,
		const DenseSet<GlobalValue::GUID> &VisibleToRegularObjSymbols);

		void getVisibleToRegularObjVtableGUIDs(
		ModuleSummaryIndex &Index,
		DenseSet<GlobalValue::GUID> &VisibleToRegularObjSymbols,
		function_ref<bool(StringRef)> IsVisibleToRegularObj);

/// Perform index-based whole program devirtualization on the \p Summary		/// Perform index-based whole program devirtualization on the \p Summary
/// index. Any devirtualized targets used by a type test in another module		/// index. Any devirtualized targets used by a type test in another module
/// are added to the \p ExportedGUIDs set. For any local devirtualized targets		/// are added to the \p ExportedGUIDs set. For any local devirtualized targets
/// only used within the defining module, the information necessary for		/// only used within the defining module, the information necessary for
/// locating the corresponding WPD resolution is recorded for the ValueInfo		/// locating the corresponding WPD resolution is recorded for the ValueInfo
/// in case it is exported by cross module importing (in which case the		/// in case it is exported by cross module importing (in which case the
/// devirtualized target name will need adjustment).		/// devirtualized target name will need adjustment).
Show All 14 Lines

llvm/lib/LTO/LTO.cpp

Show First 20 Lines • Show All 1,264 Lines • ▼ Show 20 Lines	if (OldGV) {
OldGV->eraseFromParent();		OldGV->eraseFromParent();
} else {		} else {
GV->setName(I.first);		GV->setName(I.first);
}		}
}		}

updateMemProfAttributes(*RegularLTO.CombinedModule, ThinLTO.CombinedIndex);		updateMemProfAttributes(*RegularLTO.CombinedModule, ThinLTO.CombinedIndex);

		bool WholeProgramVisibilityEnabledInLTO =
		Conf.HasWholeProgramVisibility &&
		// If validation is enabled, upgrade visibility only when all vtables
		// have typeinfos.
		(!Conf.ValidateAllVtablesHaveTypeInfos || Conf.AllVtablesHaveTypeInfos);

		// This returns true when the name is local or not defined. Locals are
		// expected to be handled separately.
		auto IsVisibleToRegularObj = [&](StringRef name) {
		auto It = GlobalResolutions.find(name);
		MaskRay (Done): redundant hash table lookup here. Better to use `find(name)` with slightly more code
		modimo (Author, Done): Good call, other places here also use the find pattern.
		return (It == GlobalResolutions.end() || It->second.VisibleOutsideSummary);
		};

// If allowed, upgrade public vcall visibility metadata to linkage unit		// If allowed, upgrade public vcall visibility metadata to linkage unit
// visibility before whole program devirtualization in the optimizer.		// visibility before whole program devirtualization in the optimizer.
updateVCallVisibilityInModule(*RegularLTO.CombinedModule,		updateVCallVisibilityInModule(
Conf.HasWholeProgramVisibility,		*RegularLTO.CombinedModule, WholeProgramVisibilityEnabledInLTO,
DynamicExportSymbols);		DynamicExportSymbols, Conf.ValidateAllVtablesHaveTypeInfos,
		IsVisibleToRegularObj);
updatePublicTypeTestCalls(*RegularLTO.CombinedModule,		updatePublicTypeTestCalls(*RegularLTO.CombinedModule,
Conf.HasWholeProgramVisibility);		WholeProgramVisibilityEnabledInLTO);

if (Conf.PreOptModuleHook &&		if (Conf.PreOptModuleHook &&
!Conf.PreOptModuleHook(0, *RegularLTO.CombinedModule))		!Conf.PreOptModuleHook(0, *RegularLTO.CombinedModule))
return finalizeOptimizationRemarks(std::move(DiagnosticOutputFile));		return finalizeOptimizationRemarks(std::move(DiagnosticOutputFile));

if (!Conf.CodeGenOnly) {		if (!Conf.CodeGenOnly) {
for (const auto &R : GlobalResolutions) {		for (const auto &R : GlobalResolutions) {
GlobalValue *GV =		GlobalValue *GV =
▲ Show 20 Lines • Show All 390 Lines • ▼ Show 20 Lines	DenseMap<StringRef, FunctionImporter::ExportSetTy> ExportLists(
ThinLTO.ModuleMap.size());		ThinLTO.ModuleMap.size());
StringMap<std::map<GlobalValue::GUID, GlobalValue::LinkageTypes>> ResolvedODR;		StringMap<std::map<GlobalValue::GUID, GlobalValue::LinkageTypes>> ResolvedODR;

if (DumpThinCGSCCs)		if (DumpThinCGSCCs)
ThinLTO.CombinedIndex.dumpSCCs(outs());		ThinLTO.CombinedIndex.dumpSCCs(outs());

std::set<GlobalValue::GUID> ExportedGUIDs;		std::set<GlobalValue::GUID> ExportedGUIDs;

if (hasWholeProgramVisibility(Conf.HasWholeProgramVisibility))		bool WholeProgramVisibilityEnabledInLTO =
		Conf.HasWholeProgramVisibility &&
		// If validation is enabled, upgrade visibility only when all vtables
		// have typeinfos.
		(!Conf.ValidateAllVtablesHaveTypeInfos || Conf.AllVtablesHaveTypeInfos);
		if (hasWholeProgramVisibility(WholeProgramVisibilityEnabledInLTO))
ThinLTO.CombinedIndex.setWithWholeProgramVisibility();		ThinLTO.CombinedIndex.setWithWholeProgramVisibility();

		// If we're validating, get the vtable symbols that should not be
		// upgraded because they correspond to typeIDs outside of index-based
		// WPD info.
		DenseSet<GlobalValue::GUID> VisibleToRegularObjSymbols;
		if (WholeProgramVisibilityEnabledInLTO &&
		Conf.ValidateAllVtablesHaveTypeInfos) {
		// This returns true when the name is local or not defined. Locals are
		// expected to be handled separately.
		auto IsVisibleToRegularObj = [&](StringRef name) {
		auto It = GlobalResolutions.find(name);
		return (It == GlobalResolutions.end() ||
		It->second.VisibleOutsideSummary);
		};

		getVisibleToRegularObjVtableGUIDs(ThinLTO.CombinedIndex,
		VisibleToRegularObjSymbols,
		IsVisibleToRegularObj);
		}

// If allowed, upgrade public vcall visibility to linkage unit visibility in		// If allowed, upgrade public vcall visibility to linkage unit visibility in
// the summaries before whole program devirtualization below.		// the summaries before whole program devirtualization below.
updateVCallVisibilityInIndex(ThinLTO.CombinedIndex,		updateVCallVisibilityInIndex(
		tejohnson (Done): Can you do the same for updateVCallVisibilityInModule to get this fix to apply to regular LTO?
Conf.HasWholeProgramVisibility,		ThinLTO.CombinedIndex, WholeProgramVisibilityEnabledInLTO,
DynamicExportSymbols);		DynamicExportSymbols, VisibleToRegularObjSymbols);

// Perform index-based WPD. This will return immediately if there are		// Perform index-based WPD. This will return immediately if there are
		tejohnson (Done): Would be better to have a more specific name, since this is only queried with type names. I.e. local symbols are not visible outside the summary but don't have a GlobalResolution entry. But you aren't calling this lambda in that case (but that isn't clear, where the lambda is defined). Before I suggest a name, I have a question about the usage of this lambda down in the WPD code.
		tejohnson (Done): nit: lambda name should be upper camel case. Also, can you add a comment here that this will return true for either the case where name is a local or where it is not defined, and so the expectation is that it will not be queried for local symbols.
// no index entries in the typeIdMetadata map (e.g. if we are instead		// no index entries in the typeIdMetadata map (e.g. if we are instead
// performing IR-based WPD in hybrid regular/thin LTO mode).		// performing IR-based WPD in hybrid regular/thin LTO mode).
std::map<ValueInfo, std::vector<VTableSlotSummary>> LocalWPDTargetsMap;		std::map<ValueInfo, std::vector<VTableSlotSummary>> LocalWPDTargetsMap;
runWholeProgramDevirtOnIndex(ThinLTO.CombinedIndex, ExportedGUIDs,		runWholeProgramDevirtOnIndex(ThinLTO.CombinedIndex, ExportedGUIDs,
LocalWPDTargetsMap);		LocalWPDTargetsMap);

auto isPrevailing = [&](GlobalValue::GUID GUID, const GlobalValueSummary *S) {		auto isPrevailing = [&](GlobalValue::GUID GUID, const GlobalValueSummary *S) {
return ThinLTO.PrevailingModuleForGUID[GUID] == S->modulePath();		return ThinLTO.PrevailingModuleForGUID[GUID] == S->modulePath();
▲ Show 20 Lines • Show All 162 Lines • Show Last 20 Lines
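
The gating logic this file adds can be read in isolation as follows. This is a minimal sketch using hypothetical stand-in types (`ConfigSketch`, `ResolutionSketch`) rather than `lto::Config` and the real `GlobalResolutions` map; only the field names are taken from the diff above.

```cpp
#include <cassert>
#include <string>
#include <unordered_map>

// Hypothetical stand-ins for lto::Config and a GlobalResolutions entry.
struct ConfigSketch {
  bool HasWholeProgramVisibility = false;
  bool ValidateAllVtablesHaveTypeInfos = false;
  bool AllVtablesHaveTypeInfos = false;
};
struct ResolutionSketch {
  bool VisibleOutsideSummary = false;
};

// Whole program visibility is honored only if validation is off, or
// validation is on and every native vtable had a matching type info.
bool wholeProgramVisibilityEnabledInLTO(const ConfigSketch &conf) {
  return conf.HasWholeProgramVisibility &&
         (!conf.ValidateAllVtablesHaveTypeInfos ||
          conf.AllVtablesHaveTypeInfos);
}

// Single lookup via find(): a missing entry (local or undefined name) is
// conservatively treated as visible to regular objects.
bool isVisibleToRegularObj(
    const std::unordered_map<std::string, ResolutionSketch> &resolutions,
    const std::string &name) {
  auto it = resolutions.find(name);
  return it == resolutions.end() || it->second.VisibleOutsideSummary;
}
```

Treating a missing entry as visible is the conservative choice: it blocks devirtualization when nothing is known about the symbol, which is why callers are expected to handle local symbols separately.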

llvm/lib/LTO/LTOCodeGenerator.cpp

Show First 20 Lines • Show All 599 Lines • ▼ Show 20 Lines	bool LTOCodeGenerator::optimize() {
StatsFile = std::move(StatsFileOrErr.get());		StatsFile = std::move(StatsFileOrErr.get());

// Currently there is no support for enabling whole program visibility via a		// Currently there is no support for enabling whole program visibility via a
// linker option in the old LTO API, but this call allows it to be specified		// linker option in the old LTO API, but this call allows it to be specified
// via the internal option. Must be done before WPD invoked via the optimizer		// via the internal option. Must be done before WPD invoked via the optimizer
// pipeline run below.		// pipeline run below.
updatePublicTypeTestCalls(*MergedModule,		updatePublicTypeTestCalls(*MergedModule,
/* WholeProgramVisibilityEnabledInLTO */ false);		/* WholeProgramVisibilityEnabledInLTO */ false);
updateVCallVisibilityInModule(*MergedModule,		updateVCallVisibilityInModule(
		*MergedModule,
/* WholeProgramVisibilityEnabledInLTO */ false,		/* WholeProgramVisibilityEnabledInLTO */ false,
// FIXME: This needs linker information via a		// FIXME: These need linker information via a
// TBD new interface.		// TBD new interface.
/* DynamicExportSymbols */ {});		/*DynamicExportSymbols=*/{},
		/*ValidateAllVtablesHaveTypeInfos=*/false,
		/*IsVisibleToRegularObj=*/[](StringRef) { return true; });

// We always run the verifier once on the merged module, the `DisableVerify`		// We always run the verifier once on the merged module, the `DisableVerify`
// parameter only applies to subsequent verify.		// parameter only applies to subsequent verify.
verifyMergedModuleOnce();		verifyMergedModuleOnce();

// Mark which symbols can not be internalized		// Mark which symbols can not be internalized
this->applyScopeRestrictions();		this->applyScopeRestrictions();

▲ Show 20 Lines • Show All 158 Lines • Show Last 20 Lines

llvm/lib/LTO/ThinLTOCodeGenerator.cpp

Show First 20 Lines • Show All 1,051 Lines • ▼ Show 20 Lines	void ThinLTOCodeGenerator::run() {
// Synthesize entry counts for functions in the combined index.		// Synthesize entry counts for functions in the combined index.
computeSyntheticCounts(*Index);		computeSyntheticCounts(*Index);

// Currently there is no support for enabling whole program visibility via a		// Currently there is no support for enabling whole program visibility via a
// linker option in the old LTO API, but this call allows it to be specified		// linker option in the old LTO API, but this call allows it to be specified
// via the internal option. Must be done before WPD below.		// via the internal option. Must be done before WPD below.
if (hasWholeProgramVisibility(/* WholeProgramVisibilityEnabledInLTO */ false))		if (hasWholeProgramVisibility(/* WholeProgramVisibilityEnabledInLTO */ false))
Index->setWithWholeProgramVisibility();		Index->setWithWholeProgramVisibility();

		// FIXME: This needs linker information via a TBD new interface
updateVCallVisibilityInIndex(*Index,		updateVCallVisibilityInIndex(*Index,
/* WholeProgramVisibilityEnabledInLTO */ false,		/*WholeProgramVisibilityEnabledInLTO=*/false,
// FIXME: This needs linker information via a		// FIXME: These need linker information via a
// TBD new interface.		// TBD new interface.
/* DynamicExportSymbols */ {});		/*DynamicExportSymbols=*/{},
		/*VisibleToRegularObjSymbols=*/{});

// Perform index-based WPD. This will return immediately if there are		// Perform index-based WPD. This will return immediately if there are
// no index entries in the typeIdMetadata map (e.g. if we are instead		// no index entries in the typeIdMetadata map (e.g. if we are instead
// performing IR-based WPD in hybrid regular/thin LTO mode).		// performing IR-based WPD in hybrid regular/thin LTO mode).
std::map<ValueInfo, std::vector<VTableSlotSummary>> LocalWPDTargetsMap;		std::map<ValueInfo, std::vector<VTableSlotSummary>> LocalWPDTargetsMap;
std::set<GlobalValue::GUID> ExportedGUIDs;		std::set<GlobalValue::GUID> ExportedGUIDs;
runWholeProgramDevirtOnIndex(*Index, ExportedGUIDs, LocalWPDTargetsMap);		runWholeProgramDevirtOnIndex(*Index, ExportedGUIDs, LocalWPDTargetsMap);
for (auto GUID : ExportedGUIDs)		for (auto GUID : ExportedGUIDs)
▲ Show 20 Lines • Show All 160 Lines • Show Last 20 Lines

llvm/lib/Transforms/IPO/WholeProgramDevirt.cpp

Show First 20 Lines • Show All 776 Lines • ▼ Show 20 Lines

// Enable whole program visibility if enabled by client (e.g. linker) or // Enable whole program visibility if enabled by client (e.g. linker) or

// internal option, and not force disabled. // internal option, and not force disabled.

bool llvm::hasWholeProgramVisibility(bool WholeProgramVisibilityEnabledInLTO) { bool llvm::hasWholeProgramVisibility(bool WholeProgramVisibilityEnabledInLTO) {

return (WholeProgramVisibilityEnabledInLTO || WholeProgramVisibility) && return (WholeProgramVisibilityEnabledInLTO || WholeProgramVisibility) &&

!DisableWholeProgramVisibility; !DisableWholeProgramVisibility;

} }

static bool

MaskRay (Done): suggested edit:

!DisableWholeProgramVisibility;

}

- bool TypeIDVisibleToRegularObj(

+ bool typeIDVisibleToRegularObj(

StringRef TypeID, function_ref<bool(StringRef)> IsVisibleToRegularObj) {

Perhaps we should fix these functions to follow https://llvm.org/docs/CodingStandards.html#use-namespace-qualifiers-to-implement-previously-declared-functions . Pushed b4d4146db3b9a29773259c8b8a6cb7c98da90e73 and you'll need a rebase.

modimo (Author, Done): Sounds good, I'll follow up with the correct style on the rebase

typeIDVisibleToRegularObj(StringRef TypeID,

function_ref<bool(StringRef)> IsVisibleToRegularObj) {

// TypeID for member function pointer type is an internal construct

// and won't exist in IsVisibleToRegularObj. The full TypeID

// will be present and participate in invalidation.

if (TypeID.ends_with(".virtual"))

return false;

// TypeID that doesn't start with Itanium mangling (_ZTS) will be

// non-externally visible types which cannot interact with

MaskRay (Done): typo: `will will`. Add a period.

// external native files. See CodeGenModule::CreateMetadataIdentifierImpl.

MaskRay (Done): Use `consume_front` here to avoid `consume_front` below.

if (!TypeID.consume_front("_ZTS"))

return false;

tejohnson (Done): Just early return true here, and return false below the loop.

// TypeID is keyed off the type name symbol (_ZTS). However, the native

// object may not contain this symbol if it does not contain a key

// function for the base type and thus only contains a reference to the

// type info (_ZTI). To catch this case we query using the type info

// symbol corresponding to the TypeID.

std::string typeInfo = ("_ZTI" + TypeID).str();

MaskRay (Done): suggested edit:

TypeID.consume_front("_ZTS");

- std::string typeInfo = "_ZTI" + TypeID.str();

+ std::string typeInfo = ("_ZTI" + TypeID).str();

return IsVisibleToRegularObj(typeInfo);

to avoid constructing a possibly heap-allocated std::string twice.

return IsVisibleToRegularObj(typeInfo);

}
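The lookup above can be modelled outside LLVM. The following is a minimal sketch, assuming `std::string_view` and `std::function` as stand-ins for `StringRef` and `function_ref`; the name `typeIdVisibleToNative` is hypothetical and is not part of the patch.

```cpp
#include <cassert>
#include <functional>
#include <string>
#include <string_view>

// Standalone model of the type ID lookup. The function name and the standard
// library types are assumptions made for this sketch, not LLVM API.
bool typeIdVisibleToNative(
    std::string_view typeId,
    const std::function<bool(std::string_view)> &isVisibleToRegularObj) {
  assert(isVisibleToRegularObj && "callback must be callable");

  // Member function pointer type IDs are internal and never appear natively.
  constexpr std::string_view virtualSuffix = ".virtual";
  if (typeId.size() >= virtualSuffix.size() &&
      typeId.substr(typeId.size() - virtualSuffix.size()) == virtualSuffix)
    return false;

  // Only Itanium-mangled type names (_ZTS...) can interact with native files.
  constexpr std::string_view typeNamePrefix = "_ZTS";
  if (typeId.substr(0, typeNamePrefix.size()) != typeNamePrefix)
    return false;
  typeId.remove_prefix(typeNamePrefix.size());

  // Query with the type info symbol (_ZTI...), building the string once.
  std::string typeInfo = "_ZTI";
  typeInfo.append(typeId.data(), typeId.size());
  return isVisibleToRegularObj(typeInfo);
}
```

With a callback that only recognises `_ZTI6Native`, the type ID `_ZTS6Native` is reported as visible, while `_ZTS1A`, `_ZTS6Native.virtual` and any name without the `_ZTS` prefix are not.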

static bool

MaskRayUnsubmitted

Done

skipUpdateDueToValidation


skipUpdateDueToValidation(GlobalVariable &GV,

function_ref<bool(StringRef)> IsVisibleToRegularObj) {

SmallVector<MDNode *, 2> Types;

GV.getMetadata(LLVMContext::MD_type, Types);

for (auto Type : Types)

if (auto *TypeID = dyn_cast<MDString>(Type->getOperand(1).get()))

return typeIDVisibleToRegularObj(TypeID->getString(),

IsVisibleToRegularObj);

return false;

}

/// If whole program visibility asserted, then upgrade all public vcall
/// visibility metadata on vtable definitions to linkage unit visibility in
/// Module IR (for regular or hybrid LTO).
void llvm::updateVCallVisibilityInModule(
Module &M, bool WholeProgramVisibilityEnabledInLTO,
- const DenseSet<GlobalValue::GUID> &DynamicExportSymbols) {
+ const DenseSet<GlobalValue::GUID> &DynamicExportSymbols,
+ bool ValidateAllVtablesHaveTypeInfos,
+ function_ref<bool(StringRef)> IsVisibleToRegularObj) {
if (!hasWholeProgramVisibility(WholeProgramVisibilityEnabledInLTO))
return;
for (GlobalVariable &GV : M.globals()) {
// Add linkage unit visibility to any variable with type metadata, which are
// the vtable definitions. We won't have an existing vcall_visibility
// metadata on vtable definitions with public visibility.
if (GV.hasMetadata(LLVMContext::MD_type) &&
GV.getVCallVisibility() == GlobalObject::VCallVisibilityPublic &&
// Don't upgrade the visibility for symbols exported to the dynamic
// linker, as we have no information on their eventual use.
- !DynamicExportSymbols.count(GV.getGUID()))
+ !DynamicExportSymbols.count(GV.getGUID()) &&
+ // With validation enabled, we want to exclude symbols visible to
+ // regular objects. Local symbols will be in this group due to the
+ // current implementation but those with VCallVisibilityTranslationUnit
+ // will have already been marked in clang so are unaffected.
+ !(ValidateAllVtablesHaveTypeInfos &&
+ skipUpdateDueToValidation(GV, IsVisibleToRegularObj)))
GV.setVCallVisibilityMetadata(GlobalObject::VCallVisibilityLinkageUnit);
}
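The upgrade condition in `updateVCallVisibilityInModule` can be read as a four-part predicate. The sketch below models it with plain standard library types; `VTableModel`, `VCallVisibility` and `upgradeVisibility` are hypothetical names invented for illustration, not LLVM API.

```cpp
#include <cassert>
#include <functional>
#include <set>
#include <string>
#include <vector>

enum class VCallVisibility { Public, LinkageUnit, TranslationUnit };

// Hypothetical stand-in for a vtable global variable; not an LLVM type.
struct VTableModel {
  std::string name;
  std::vector<std::string> typeIds; // empty models "no !type metadata"
  VCallVisibility visibility = VCallVisibility::Public;
};

// Upgrades public vtables to linkage-unit visibility unless they are
// dynamically exported or, with validation on, visible to a native object.
void upgradeVisibility(
    std::vector<VTableModel> &vtables,
    const std::set<std::string> &dynamicExportSymbols,
    bool validateAllVtablesHaveTypeInfos,
    const std::function<bool(const std::string &)> &typeIdVisibleToRegularObj) {
  assert(typeIdVisibleToRegularObj && "callback must be callable");
  for (VTableModel &vt : vtables) {
    if (vt.typeIds.empty() || vt.visibility != VCallVisibility::Public)
      continue;
    if (dynamicExportSymbols.count(vt.name))
      continue;
    // Like the helper in the diff, only the first type ID is consulted.
    if (validateAllVtablesHaveTypeInfos &&
        typeIdVisibleToRegularObj(vt.typeIds.front()))
      continue;
    vt.visibility = VCallVisibility::LinkageUnit;
  }
}
```

Passing `false` for the validation flag makes the callback irrelevant, which matches how the `opt` call site in this revision supplies a constant callback while leaving validation off.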

void llvm::updatePublicTypeTestCalls(Module &M,
bool WholeProgramVisibilityEnabledInLTO) {
Function *PublicTypeTestFunc =
M.getFunction(Intrinsic::getName(Intrinsic::public_type_test));
[15 lines hidden] if (hasWholeProgramVisibility(WholeProgramVisibilityEnabledInLTO)) {
for (Use &U : make_early_inc_range(PublicTypeTestFunc->uses())) {
auto *CI = cast<CallInst>(U.getUser());
CI->replaceAllUsesWith(True);
CI->eraseFromParent();
}

/// Based on typeID string, get all associated vtable GUIDS that are

/// visible to regular objects.

void llvm::getVisibleToRegularObjVtableGUIDs(

ModuleSummaryIndex &Index,

DenseSet<GlobalValue::GUID> &VisibleToRegularObjSymbols,

function_ref<bool(StringRef)> IsVisibleToRegularObj) {

for (const auto &typeID : Index.typeIdCompatibleVtableMap()) {

if (typeIDVisibleToRegularObj(typeID.first, IsVisibleToRegularObj))

for (const TypeIdOffsetVtableInfo &P : typeID.second)

VisibleToRegularObjSymbols.insert(P.VTableVI.getGUID());

MaskRayUnsubmitted

Done

typo: will will

append a period.


}

/// If whole program visibility asserted, then upgrade all public vcall
/// visibility metadata on vtable definition summaries to linkage unit

tejohnsonUnsubmitted

Done

nit: the braces can be removed from the for loop


MaskRayUnsubmitted

Done

delete braces in this nested case when the only body has just one line.


/// visibility in Module summary index (for ThinLTO).
void llvm::updateVCallVisibilityInIndex(
ModuleSummaryIndex &Index, bool WholeProgramVisibilityEnabledInLTO,
- const DenseSet<GlobalValue::GUID> &DynamicExportSymbols) {
+ const DenseSet<GlobalValue::GUID> &DynamicExportSymbols,
+ const DenseSet<GlobalValue::GUID> &VisibleToRegularObjSymbols) {

tejohnsonUnsubmitted

Done

Suggest naming this VisibleToRegularObjVTables.


modimoAuthorUnsubmitted

Done

Another area that's unknown to ThinLTO is the set of types used in full LTO, where VisibleOutsideSummary is set to false; and vice versa for full LTO, where types used in ThinLTO get the GlobalResolution::External partition. I'm thinking then to categorize the vtables we don't want to upgrade as something like RefOutsideWPD instead of VisibleToRegularObj. WDYT?

tejohnsonUnsubmitted

Done

RegularLTO summaries are added to the combined index used by ThinLTO, but it looks like the vtable summaries aren't currently created for them. I think you are right in that there is a potential hole here for ThinLTO WPD if linked with a regular LTO object containing an override. Can you test this case to confirm? If that is an issue, then I guess we do need another GlobalRes field. Maybe VisibleOutsideLTOUnit or something like that?


modimoAuthorUnsubmitted

Done

Added devirt_validate_vtable_typeinfos_mixed_lto.ll to test mixing LTO modes:

1. RegularLTO without summary indeed does not export vtable summaries. With validation, because the type is present in a ThinLTO module, the partition is set to GlobalResolution::External, so this type does not get its visibility upgraded in RegularLTO. With VisibleOutsideSummary also set to true, this type does not get its visibility upgraded in ThinLTO either.
2. RegularLTO with summary: we do get all the vtable summaries in the combined index. With validation, GlobalResolution::External blocks the RegularLTO visibility upgrade, but since VisibleOutsideSummary is not set, everything is optimized in the combined index.

For the purposes of validation I think this is the behavior we want. It does seem like we may want to fix this hole even without validation enabled, although that seems more of a separate change since it alters baseline behavior.

if (!hasWholeProgramVisibility(WholeProgramVisibilityEnabledInLTO))
return;
for (auto &P : Index) {
// Don't upgrade the visibility for symbols exported to the dynamic
// linker, as we have no information on their eventual use.
if (DynamicExportSymbols.count(P.first))
continue;
for (auto &S : P.second.SummaryList) {
auto *GVar = dyn_cast<GlobalVarSummary>(S.get());
if (!GVar ||
GVar->getVCallVisibility() != GlobalObject::VCallVisibilityPublic)
continue;
+ // With validation enabled, we want to exclude symbols visible to regular
+ // objects. Local symbols will be in this group due to the current
+ // implementation but those with VCallVisibilityTranslationUnit will have
+ // already been marked in clang so are unaffected.
+ if (VisibleToRegularObjSymbols.count(P.first))

tejohnsonUnsubmitted

Done

Hmm, now that I think about it, shouldn't local symbols have gotten VCallVisibilityTranslationUnit from clang? Same question in the regularLTO handling.


modimoAuthorUnsubmitted

Done

Good point. Originally excluding based on type name had to explicitly take into account local vcall_visibility. Now, it's a test setup where VCallVisibilityTranslationUnit was only used for one of the local types. I'll remove this check and the type that doesn't have the proper vcall_visibility in the tests.


+ continue;
GVar->setVCallVisibility(GlobalObject::VCallVisibilityLinkageUnit);
}

void llvm::runWholeProgramDevirtOnIndex(
ModuleSummaryIndex &Summary, std::set<GlobalValue::GUID> &ExportedGUIDs,
std::map<ValueInfo, std::vector<VTableSlotSummary>> &LocalWPDTargetsMap) {
[177 lines hidden] for (const TypeMemberInfo &TM : TypeMemberInfos) {
TargetsForSlot.push_back({GV, &TM});
}
// Give up if we couldn't find any targets.
return !TargetsForSlot.empty();
}
bool DevirtIndex::tryFindVirtualCallTargets(
- std::vector<ValueInfo> &TargetsForSlot, const TypeIdCompatibleVtableInfo TIdInfo,
- uint64_t ByteOffset) {
+ std::vector<ValueInfo> &TargetsForSlot,
+ const TypeIdCompatibleVtableInfo TIdInfo, uint64_t ByteOffset) {
for (const TypeIdOffsetVtableInfo &P : TIdInfo) {
// Find a representative copy of the vtable initializer.
// We can have multiple available_externally, linkonce_odr and weak_odr
// vtable initializers. We can also have multiple external vtable
// initializers in the case of comdats, which we cannot check here.
// The linker should give an error in this case.
//
// Also, handle the case of same-named local Vtables with the same path
[1,327 lines hidden]
}
void DevirtIndex::run() {
if (ExportSummary.typeIdCompatibleVtableMap().empty())
return;
DenseMap<GlobalValue::GUID, std::vector<StringRef>> NameByGUID;
for (const auto &P : ExportSummary.typeIdCompatibleVtableMap()) {
NameByGUID[GlobalValue::getGUID(P.first)].push_back(P.first);

tejohnsonUnsubmitted

Done

Rather than doing this down here in LTO/WPD could the linker simply unset the HasWholeProgramVisibility config flag? That would also allow WPD to proceed on types with hidden LTO visibility. This early return would prevent any and all WPD which seems overly conservative in the case of hidden LTO visibility classes.


modimoAuthorUnsubmitted

Done

That makes sense although it does tie this flag's functionality to requiring --lto-whole-program-visibility.

Doing that though means we can instead pass the blocklist to updateVCallVisibilityInIndex/updateVCallVisibilityInModule similarly to how D91583 does it for dynamically exported symbols which would be cleaner. Thoughts on that approach?


tejohnsonUnsubmitted

Done

> That makes sense although it does tie this flag's functionality to requiring --lto-whole-program-visibility.

What would be the use case of the proposed handling without --lto-whole-program-visibility? Are you saying that there are cases where the normal LTO visibility is incorrect?

> Doing that though means we can instead pass the blocklist to updateVCallVisibilityInIndex/updateVCallVisibilityInModule similarly to how D91583 does it for dynamically exported symbols which would be cleaner. Thoughts on that approach?

Yep I think that would be cleaner.

modimoAuthorUnsubmitted

Done

> What would be the use case of the proposed handling without --lto-whole-program-visibility? Are you saying that there are cases where the normal LTO visibility is incorrect?

I don't have a known case so this is more theoretical. Currently there's an assertion that it's on the user to make sure LTO visibility is correct but in this case and in D91583 we can catch violations and prevent them from causing problems. How much this should also apply to normal LTO visibility is a question but thinking more about it is orthogonal to this change.

modimoAuthorUnsubmitted

Done

Ah right, the issue with doing this in updateVCallVisibilityInIndex/updateVCallVisibilityInModule is that vcall visibility is keyed off the vtable symbol. However, TypeID and RTTI are both keyed off of the typename symbol. There's not always a translation from typename to vtable since abstract base classes wouldn't have vtables and by the time we get the association we're in the same place the logic is right now.

Given that, I think I'll keep the logic as-is.


tejohnsonUnsubmitted

Done

> There's not always a translation from typename to vtable since abstract base classes wouldn't have vtables and by the time we get the association we're in the same place the logic is right now.

Can you clarify the case you are concerned about and how this is handled in the lld code that expects a translation from vtable to typename? I tried an example with an abstract base class and do get a vtable and typename.

modimoAuthorUnsubmitted

Done

~/llvm-project/lld/test/ELF/lto# ~/llvm-project/build-rel/bin/llc -filetype=obj Inputs/devirt_validate_vtable_typeinfos.ll -o devirt_validate_vtable_typeinfos.o
~/llvm-project/lld/test/ELF/lto# readelf -Ws devirt_validate_vtable_typeinfos.o | grep ZT
     5: 0000000000000000    16 OBJECT  WEAK   DEFAULT    3 _ZTVN10__cxxabiv117__class_type_infoE
     6: 0000000000000010    16 OBJECT  WEAK   DEFAULT    3 _ZTVN10__cxxabiv120__si_class_type_infoE
     7: 0000000000000020    32 OBJECT  WEAK   DEFAULT    3 _ZTV6Native
     8: 0000000000000050    24 OBJECT  WEAK   DEFAULT    3 _ZTI6Native
     9: 0000000000000040     8 OBJECT  WEAK   DEFAULT    3 _ZTS6Native
    10: 0000000000000070    16 OBJECT  WEAK   DEFAULT    3 _ZTI1A
    11: 0000000000000068     3 OBJECT  WEAK   DEFAULT    3 _ZTS1A

In LLD we're ensuring there's a map from every vtable symbol to its type information but we expect additional type information without vtables for these cases.

modimo: Inputs/devirt_validate_vtable_typeinfos.ll has `A` as an abstract base class and `Native`…
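The linker-side check described here (every vtable symbol needs a matching type info symbol, while extra type info symbols are acceptable) can be sketched as follows. This is a simplified model over plain symbol names, assuming they have already been read from `.symtab` or `.dynsym`; both function names are hypothetical and the real LLD code is not reproduced.

```cpp
#include <cassert>
#include <set>
#include <string>
#include <vector>

// Returns the vtable symbols (_ZTV...) that have no matching type info
// symbol (_ZTI...). Type infos without a vtable are fine and are ignored.
std::vector<std::string>
vtablesWithoutTypeInfo(const std::vector<std::string> &symbols) {
  std::set<std::string> typeInfos;
  std::vector<std::string> vtables;
  for (const std::string &s : symbols) {
    if (s.rfind("_ZTI", 0) == 0)
      typeInfos.insert(s.substr(4));
    else if (s.rfind("_ZTV", 0) == 0)
      vtables.push_back(s.substr(4));
  }
  std::vector<std::string> missing;
  for (const std::string &v : vtables)
    if (!typeInfos.count(v))
      missing.push_back("_ZTV" + v);
  assert(missing.size() <= vtables.size());
  return missing;
}

// Models the rule from the revision summary: with validation requested and
// any vtable lacking RTTI, whole program visibility is treated as disabled.
bool effectiveWholeProgramVisibility(bool requested, bool validate,
                                     const std::vector<std::string> &symbols) {
  if (validate && !vtablesWithoutTypeInfo(symbols).empty())
    return false;
  return requested;
}
```

Applied verbatim to the `readelf` listing above, this naive version would flag the two `__cxxabiv1` vtables because their `_ZTI` symbols are not in that object; how the actual linker treats such symbols is outside what this sketch covers.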

modimoAuthorUnsubmitted

Done

That being said, the information to map typeid->[associated vtables] is typeIdCompatibleVtableMap for thinLTO, and in full LTO we have the actual vtable global variables, which contain !type metadata that maps back to typeid.

For thinLTO, the current approach saves a scan of typeIdCompatibleVtableMap for updateVCallVisibilityInIndex by combining it with the scan in DevirtIndex::run; however, that's probably not a big deal.

For full LTO the information is better passed through updateVCallVisibilityInModule since we don't build the corresponding TypeIDMap until codegen and carrying this information around until then is too much.

I think I've come back around to doing it in updateVCallVisibilityInIndex/updateVCallVisibilityInModule. A little less efficient for thinLTO but keeps consistency with full LTO.
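For the ThinLTO side, the typeid to vtables mapping amounts to one pass over the map that collects the GUIDs of every vtable compatible with a visible type ID. A minimal sketch follows, assuming a plain `std::map` in place of `typeIdCompatibleVtableMap`; `TypeIdToVtables` and `collectVisibleVtableGuids` are hypothetical names.

```cpp
#include <cassert>
#include <cstdint>
#include <functional>
#include <map>
#include <set>
#include <string>
#include <vector>

using Guid = std::uint64_t;
// Hypothetical stand-in for typeIdCompatibleVtableMap: type ID -> vtable GUIDs.
using TypeIdToVtables = std::map<std::string, std::vector<Guid>>;

// Collects the GUIDs of all vtables compatible with a type ID that the
// callback reports as visible to regular (native) objects.
std::set<Guid> collectVisibleVtableGuids(
    const TypeIdToVtables &typeIdMap,
    const std::function<bool(const std::string &)> &typeIdVisible) {
  assert(typeIdVisible && "callback must be callable");
  std::set<Guid> visible;
  for (const auto &[typeId, vtables] : typeIdMap)
    if (typeIdVisible(typeId))
      visible.insert(vtables.begin(), vtables.end());
  return visible;
}
```

The resulting set plays the role of the extra parameter on `updateVCallVisibilityInIndex`: any summary whose GUID is in the set keeps its public visibility.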


tejohnsonUnsubmitted

Done

Ok, thanks. I think I have lost track of what the change will be - is it to replace passing down a global flag AllVtablesHaveTypeInfos, or is it to replace what is being done below in DevirtIndex::run()? For the former alone it doesn't seem worth it, but it would be nice to move the handling from DevirtIndex::run() into the vcall_visibility updates.

modimoAuthorUnsubmitted

Done

Ah sorry yeah there's 2 different things. For passing down AllVtablesHaveTypeInfos I like the approach of unsetting --lto-whole-program-visibility instead. For moving the safety logic out of DevirtIndex::run() making this work for full LTO wants the logic to be in vcall_visibility and it makes sense to be consistent even if slightly *less* efficient for thinLTO.


// Create the type id summary resolution regardlness of whether we can
// devirtualize, so that lower type tests knows the type id is used on
// a global and not Unsat. We do this here rather than in the loop over the
// CallSlots, since that handling will only see type tests that directly
// feed assumes, and we would miss any that aren't currently handled by WPD
// (such as type tests that feed assumes via phis).
ExportSummary.getOrInsertTypeIdSummary(P.first);
}
[49 lines hidden] WholeProgramDevirtResolution *Res =
&ExportSummary.getTypeIdSummary(S.first.TypeID)
->WPDRes[S.first.ByteOffset];
if (tryFindVirtualCallTargets(TargetsForSlot, *TidSummary,
S.first.ByteOffset)) {
if (!trySingleImplDevirt(TargetsForSlot, S.first, S.second, Res,
DevirtTargets))
continue;
}

tejohnsonUnsubmitted

Done

Can we ever get here if RTTI is not enabled? My understanding of the change to line 2413 is that we return early in that case. Given that early return, aren't we guaranteed that the typename symbol has a GlobalResolution if it is non-local?

Oh - I guess we are only early returning if RTTI is off in native objects, so you could get here if RTTI is only disabled in bitcode objects? And we need to be conservative for any typenames for vtables defined in bitcode objects with RTTI off? I didn't see a test for this case, can you add one (or did I miss it)?


modimoAuthorUnsubmitted

Done

> Oh - I guess we are only early returning if RTTI is off in native objects, so you could get here if RTTI is only disabled in bitcode objects?

Yep!

> And we need to be conservative for any typenames for vtables defined in bitcode objects with RTTI off?

This is primarily an implementation detail of how resolutions are only provided for IR symbols. If we instead pass more information from the linker (like resolutions for these summary symbols or the whole list of typenames) we can support bitcode files with RTTI off. There's not a correctness issue at play here since the summary information is a superset of RTTI.

> I didn't see a test for this case, can you add one (or did I miss it)?

Good catch, will add a test case.

}
// Optionally have the thin link print message for each devirtualized
// function.
if (PrintSummaryDevirt)
for (const auto &DT : DevirtTargets)
errs() << "Devirtualized call to " << DT << "\n";
NumDevirtTargets += DevirtTargets.size();
}

llvm/tools/opt/opt.cpp

[562 lines hidden] errs() << argv[0] << ": " << InputFilename
<< ": error: input module is broken!\n";
return 1;
}

// Enable testing of whole program devirtualization on this module by invoking
// the facility for updating public visibility to linkage unit visibility when
// specified by an internal option. This is normally done during LTO which is
// not performed via opt.
- updateVCallVisibilityInModule(*M,
- /* WholeProgramVisibilityEnabledInLTO */ false,
- /* DynamicExportSymbols */ {});
+ updateVCallVisibilityInModule(
+ *M,
+ /*WholeProgramVisibilityEnabledInLTO=*/false,

MaskRayUnsubmitted

Done

The prevailing and recommended style liked by clang-format and clang-tidy is `/*WholeProgramVisibilityEnabledInLTO=*/false`

+ // FIXME: These need linker information via a
+ // TBD new interface.
+ /*DynamicExportSymbols=*/{},
+ /*ValidateAllVtablesHaveTypeInfos=*/false,
+ /*IsVisibleToRegularObj=*/[](StringRef) { return true; });

// Figure out what stream we are supposed to write to...
std::unique_ptr<ToolOutputFile> Out;
std::unique_ptr<ToolOutputFile> ThinLinkOut;
if (NoOutput) {
if (!OutputFilename.empty())
errs() << "WARNING: The -o (output filename) option is ignored when\n"
"the --disable-output option is used.\n";
[309 lines hidden]


Revision Contents

Diff 556979

lld/ELF/Config.h

lld/ELF/Driver.cpp

lld/ELF/LTO.cpp

lld/ELF/Options.td

lld/test/ELF/lto/Inputs/devirt_validate_vtable_typeinfos.ll

lld/test/ELF/lto/Inputs/devirt_validate_vtable_typeinfos_no_rtti.ll

lld/test/ELF/lto/Inputs/devirt_validate_vtable_typeinfos_ref.ll

lld/test/ELF/lto/Inputs/devirt_validate_vtable_typeinfos_undef.ll

lld/test/ELF/lto/devirt_validate_vtable_typeinfos.ll

lld/test/ELF/lto/devirt_validate_vtable_typeinfos_mixed_lto.ll

lld/test/ELF/lto/devirt_validate_vtable_typeinfos_no_rtti.ll

lld/test/ELF/lto/devirt_validate_vtable_typeinfos_ref.ll

llvm/include/llvm/LTO/Config.h

llvm/include/llvm/Transforms/IPO/WholeProgramDevirt.h

llvm/lib/LTO/LTO.cpp

llvm/lib/LTO/LTOCodeGenerator.cpp

llvm/lib/LTO/ThinLTOCodeGenerator.cpp

llvm/lib/Transforms/IPO/WholeProgramDevirt.cpp

llvm/tools/opt/opt.cpp
