
[llvm] Preliminary fat-lto-objects support
Needs Review · Public

Authored by paulkirth on Mar 23 2023, 5:54 PM.

Details

Summary

Fat LTO objects contain both LTO-compatible IR and generated object code.
This allows users to defer the choice of whether or not to use LTO until
link time. The feature has been available in GCC for some time; this patch
makes the existing -ffat-lto-objects flag functional in the same way as
GCC's.

Within LLVM, we add a new EmbedBitcodePass that serializes the module into
the object file, and expose a new pass pipeline for compiling fat
objects. The new pipeline first clones the module and runs the
selected (Thin)LTO pre-link pipeline on the clone, after which it serializes
the clone into a .llvm.lto section of the ELF file. When compiling for
(Thin)LTO, this is normally the point at which the compiler would emit an
object file containing the bitcode and metadata.

After that point we compile the original module using the
PerModuleDefaultPipeline used for non-LTO compilation. We generate
standard object files at the end of this pipeline, which contain machine
code and the new .llvm.lto section containing bitcode.

Since the two pipelines operate on different copies of the module, we
can be sure that the bitcode in the .llvm.lto section and object code
in .text are congruent with the existing output produced by the
default and LTO pipelines.
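
For readers following along, here is a minimal, self-contained sketch of the flow described above, written against existing PassBuilder and BitcodeWriter APIs rather than the new FatLTO entry points this patch adds. The function name, the choice of ThinLTO pre-link at O2, and the buffer identifier are illustrative assumptions, not the patch itself.

  #include "llvm/ADT/SmallString.h"
  #include "llvm/Analysis/CGSCCPassManager.h"
  #include "llvm/Analysis/LoopAnalysisManager.h"
  #include "llvm/Bitcode/BitcodeWriter.h"
  #include "llvm/IR/Module.h"
  #include "llvm/IR/PassManager.h"
  #include "llvm/Passes/PassBuilder.h"
  #include "llvm/Support/MemoryBufferRef.h"
  #include "llvm/Support/raw_ostream.h"
  #include "llvm/Transforms/Utils/Cloning.h"

  using namespace llvm;

  // Sketch of a fat-LTO compile: optimize a clone with the (Thin)LTO pre-link
  // pipeline, embed the clone's bitcode into the original module, then run the
  // normal non-LTO pipeline on the original module ahead of codegen.
  static void compileFatLTOModule(Module &M) {
    LoopAnalysisManager LAM;
    FunctionAnalysisManager FAM;
    CGSCCAnalysisManager CGAM;
    ModuleAnalysisManager MAM;
    PassBuilder PB;
    PB.registerModuleAnalyses(MAM);
    PB.registerCGSCCAnalyses(CGAM);
    PB.registerFunctionAnalyses(FAM);
    PB.registerLoopAnalyses(LAM);
    PB.crossRegisterProxies(LAM, FAM, CGAM, MAM);

    // 1. Run the ThinLTO pre-link pipeline on a clone; in an LTO build this is
    //    where the compiler would normally stop and emit bitcode.
    std::unique_ptr<Module> Cloned = CloneModule(M);
    ModulePassManager PreLink =
        PB.buildThinLTOPreLinkDefaultPipeline(OptimizationLevel::O2);
    PreLink.run(*Cloned, MAM);

    // 2. Serialize the optimized clone and embed its bitcode into the original
    //    module, destined for the .llvm.lto section of the object file.
    SmallString<0> Bitcode;
    raw_svector_ostream OS(Bitcode);
    WriteBitcodeToFile(*Cloned, OS);
    embedBufferInModule(M, MemoryBufferRef(Bitcode, "fat-lto"), ".llvm.lto");

    // 3. Run the regular per-module pipeline on the original module; codegen of
    //    this module produces the machine code that sits next to .llvm.lto.
    ModulePassManager Default =
        PB.buildPerModuleDefaultPipeline(OptimizationLevel::O2);
    Default.run(M, MAM);
  }

The patch's pipeline also exposes choices such as thin vs. full LTO pre-link and whether to emit a summary (see the later embed-bitcode<emit-summary> discussion); those knobs are omitted here for brevity.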

Original RFC: https://discourse.llvm.org/t/rfc-ffat-lto-objects-support/63977


Event Timeline

paulkirth added inline comments.Apr 12 2023, 6:09 PM
llvm/lib/Passes/PassBuilderPipelines.cpp
1443

Noted. I'll take a look at refactoring this accordingly.

1481

Running the full default pipeline after the pre-link pipeline doesn't make a lot of sense. You should only run the module optimization pipeline here. For ThinLTO, the pre-link pipeline + module optimization will give you pretty much exactly the default pipeline.

This is a very helpful detail, so thanks for pointing that out.

For FatLTO, it will run module optimization twice, at least until D148010 lands.

Just for clarification, when you say FatLTO are you referring to full/normal LTO? In my experience FatLTO has typically referred to the feature being implemented in this patch, since it uses a fat object file format for LTO purposes, and has been common in GCC for a long time. The only place I've seen FatLTO used otherwise is in the Rust compiler, so I want to be sure I'm understanding you correctly.

This will still have some problems though, such as running the optimization pipeline extension points twice.

Do you have any recommendations about how we might address that, or at least suggestions on where to focus the investigation?

nikic added inline comments.Apr 13 2023, 12:25 AM
llvm/lib/Bitcode/Writer/EmbedBitcodePass.cpp
54

Does this get assigned the correct section flags? I'd expect it to need SHF_EXCLUDE so it doesn't end up in the final binary when linked without LTO.

llvm/lib/Passes/PassBuilderPipelines.cpp
1481

Just for clarification, when you say FatLTO are you referring to full/normal LTO? In my experience FatLTO has typically referred to the feature being implemented in this patch, since it uses a fat object file format for LTO purposes, and has been common in GCC for a long time. The only place I've seen FatLTO used otherwise is in the Rust compiler, so I want to be sure I'm understanding you correctly.

Yes, I was using fat as in non-thin here :) It's unfortunate that the term "fat LTO" can now refer to both "non-thin LTO" and "fat-object LTO", including the peculiar combination of "fat thin LTO"...

Do you have any recommendations about how we might address that, or at least suggestions on where to focus the investigation?

The way to observe this would be to try -ffat-lto-objects together with something like -fsanitize=address. You should see address sanitizer getting run twice.

As to addressing it, I'm not sure there's a super clean way to do that short of adding an extra parameter to buildThinLTOPreLinkDefaultPipeline to suppress running the relevant extension points.
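
For context on the terminology: the "extension points" are the callbacks users register on the PassBuilder, which fire each time a pipeline is built, so an instrumentation pass added that way would be scheduled once by the pre-link pipeline and again by the default pipeline. A minimal sketch, with a hypothetical pass standing in for something like AddressSanitizer and the two-argument callback signature in use at the time of this review:

  #include "llvm/IR/PassManager.h"
  #include "llvm/Passes/PassBuilder.h"

  using namespace llvm;

  // Hypothetical stand-in for an instrumentation pass such as AddressSanitizer.
  struct MyInstrumentationPass : PassInfoMixin<MyInstrumentationPass> {
    PreservedAnalyses run(Module &, ModuleAnalysisManager &) {
      return PreservedAnalyses::all(); // no-op placeholder
    }
  };

  static void registerCallbacks(PassBuilder &PB) {
    // Fires at the OptimizerLast extension point of every pipeline this
    // PassBuilder constructs; if a fat-LTO compile builds both a pre-link and
    // a default pipeline for the same module, the pass is added to both.
    PB.registerOptimizerLastEPCallback(
        [](ModulePassManager &MPM, OptimizationLevel Level) {
          MPM.addPass(MyInstrumentationPass());
        });
  }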

paulkirth updated this revision to Diff 513765.Apr 14 2023, 3:15 PM

Rebase and address some comments

  • Fix some wrong code introduced when splitting the patch up and rebasing.
  • Fix typo in comment
  • Avoid splitting function

I still need to investigate the SHF_EXCLUDE flag, avoiding the perModuleDefaultPipeline, and extension points.

paulkirth added inline comments.Apr 14 2023, 3:41 PM
llvm/lib/Bitcode/Writer/EmbedBitcodePass.cpp
54

It seems to work fine w/ the current implementation, and the .llvm.lto section doesn't end up in the final binary. We actually have several tests to that effect added later in the stack.

The caveat there is that this stack of patches uses some custom logic for the .llvm.lto section in LLD. This was originally something our intern worked on, which was based on similar patches in D130777. I'm now trying to get that work into a landable state, so if we can achieve something similar w/ the existing section flags, I'd prefer to use that mechanism over some bespoke logic.

It looks like we usually handle this type of thing with named metadata, and that the section flags are set in TargetLoweringObjectFileImpl.cpp.

I'll take a pass at adopting that approach instead, since it seems like a more idiomatic and less brittle solution.

llvm/lib/Passes/PassBuilderPipelines.cpp
1481

Yes, I was using fat as in non-thin here :) It's unfortunate that the term "fat LTO" can now refer to both "non-thin LTO" and "fat-object LTO", including the peculiar combination of "fat thin LTO"...

Even more concerning: it could also be ThinFatLTO ;)

so we're really in it now. Good thing English is so unambiguous and never leads to miscommunication.

Do you have any recommendations about how we might address that, or at least suggestions on where to focus the investigation?

The way to observe this would be to try -ffat-lto-objects together with something like -fsanitize=address. You should see address sanitizer getting run twice.

As to addressing it, I'm not sure there's a super clean way to do that short of adding an extra parameter to buildThinLTOPreLinkDefaultPipeline to suppress running the relevant extension points.

At first glance that doesn't look too bad, so that is probably what I'll try.

Also, thanks for the pointer on ASAN; that will be useful as I try to clean this up. :)

Adjust FatLTO pipeline based on comments.

  • Generated code seems to be very close to what we want. How we define the pipeline is rather subjective, so it isn't an issue if the FatLTO pipeline isn't strictly identical to the default -O3 pipeline when generating the object code.
  • Remove leftover function declaration missed in the last update.
nikic requested changes to this revision.Apr 17 2023, 12:15 PM

Some nits, but this generally looks okay as a starting point.

llvm/include/llvm/Bitcode/EmbedBitcodePass.h
32

USe -> Use here and above

34

Stray semicolon at the end.

llvm/include/llvm/Passes/PassBuilder.h
240

size -> side

246

You can drop this paragraph, it should support O0 fine.

248

ThinLTOPreLink -> ThinLTO as this is always a pre-link pipeline?

llvm/lib/Bitcode/Writer/EmbedBitcodePass.cpp
31

You can pass true to suppress the crash, as this is user error, not a compiler error.

44

TBH I'm not sure we need the use list option here at all, as it's debugging functionality. I don't think anyone needs use lists embedded in fat object files. Especially if this only does something with one LTO type (ThinLTOBitcodeWriter hardcodes this to false).

50

Drop llvm:: prefix.

This revision now requires changes to proceed.Apr 17 2023, 12:15 PM
paulkirth updated this revision to Diff 514428.Apr 17 2023, 2:58 PM
paulkirth marked 9 inline comments as done.

Address comments

  • Fix typos and nits
  • Clean up comments
  • Remove use of EmitLLVMUseLists
  • Use embedBufferInModule since it does exactly what we want: create a named metadata and section, and mark them as exclude

TODO: deal w/ extension points

it'd be good to highlight somewhere that the thinlto-pre-link pipeline + module optimization pipeline is not exactly equivalent to the default pipeline (all the Phase params everywhere), so with this, if you end up not going down the LTO route in the end, you'll get a different binary than a typical non-fat build, especially if people ever want to adjust the thinlto pipeline (e.g. less aggressive inlining in the pre-link pipeline). the potential fiddling with the thinlto pre-link pipeline in the future is what has me most worried about this patch.

(the commit description is out of date referencing PerModuleDefaultPipeline)

I'd definitely like @tejohnson's feedback on this if there are any potential pitfalls we're missing

it'd be good to highlight somewhere that the thinlto-pre-link pipeline + module optimization pipeline is not exactly equivalent to the default pipeline (all the Phase params everywhere), so with this, if you end up not going down the LTO route in the end, you'll get a different binary than a typical non-fat build, especially if people ever want to adjust the thinlto pipeline (e.g. less aggressive inlining in the pre-link pipeline). the potential fiddling with the thinlto pre-link pipeline in the future is what has me most worried about this patch.

Noted. I'll look at adding some patch notes to go along with this that discuss the tradeoffs. There should be some kind of documentation in the rst files too, which is currently missing.

For what it's worth, I don't know if it's a problem that they aren't identical. Ideally, they would be, but the only way to do that would be to compile twice, once for LTO and once for normal object code. @phosek and I discussed how support for that may look w/ @tejohnson and came to the conclusion that we probably don't want to add that type of complexity to clang. Plus, I don't think it's necessarily wrong to generate the IR as if we're doing LTO/ThinLTO and then complete the LTO/ThinLTO pipeline for the module. Ideally, that would be extremely close to running the perModuleDefaultPipeline, but I think it's probably fine if they skew somewhat, since I don't imagine them straying too far. That said, I'm often wrong, so I'm happy to defer to others on this point. :)

Is your worry here that we may move some important transforms from prelink to post-link (or vice versa)? Changing instrumentation or inlining are the ones that come to mind, but I'm sure there are probably lots of cases where altering the pipeline could be a problem. I'm just not sure they're any more important for this case than for other bitcode artifacts.

(the commit description is out of date referencing PerModuleDefaultPipeline)

Good catch, thank you.

I'd definitely like @tejohnson's feedback on this if there are any potential pitfalls we're missing

+100 to that

paulkirth edited the summary of this revision.

Update summary and add documentation

TODO: handle extension points.

paulkirth updated this revision to Diff 515909.Apr 21 2023, 2:05 PM

Check if we're compiling for FatLTO before running callbacks in the module optimization pipeline that would also have run in the pre-link pipeline.

  • Adds a boolean param to buildModuleOptimizationPipeline() that defaults to false
  • Guards the early and late callbacks that could run in the pre-link pipeline.

@tejohnson Do you have any thoughts here regarding the pipeline or other shortcomings this approach may have?

@tejohnson Do you have any thoughts here regarding the pipeline or other shortcomings this approach may have?

Sorry for missing the earlier ping. I have some concerns about invoking just the ThinLTO prelink pipeline + the module optimization pipeline. Specifically, afaict this only invokes the module simplification pipeline once, with Phase == ThinLTOPreLink. If you search through buildModuleSimplificationPipeline for Phase, you will see there are certain things we only do if we are in the post link phase, or if we are not in the pre-link phase. We will miss out on some optimizations if we never invoke this again with a post link phase. For example, with SamplePGO, we only do indirect call promotion in the post link phase. We will miss out on ICP completely by never setting up the module simplification pipeline again with a post link phase. There are other examples, and additional ones if you look down into the inliner pipeline setups invoked from here and passed the phase.
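
To make the phase-gating concrete, here is a small illustrative helper in the spirit of buildModuleSimplificationPipeline (not copied from it; the helper name is invented), using the SamplePGO indirect call promotion example from the comment above:

  #include "llvm/IR/PassManager.h"
  #include "llvm/Pass.h"
  #include "llvm/Transforms/Instrumentation/PGOInstrumentation.h"

  using namespace llvm;

  // Illustrative only: SamplePGO indirect-call promotion is skipped in the
  // (Thin)LTO pre-link phase (it runs post-link, or in a plain non-LTO build),
  // so a FatLTO flow that only ever builds the pipeline with
  // Phase == ThinLTOPreLink never schedules it at all.
  static void addSamplePGOICP(ModulePassManager &MPM, ThinOrFullLTOPhase Phase) {
    bool IsLTOPreLink = Phase == ThinOrFullLTOPhase::ThinLTOPreLink ||
                        Phase == ThinOrFullLTOPhase::FullLTOPreLink;
    if (!IsLTOPreLink)
      MPM.addPass(
          PGOIndirectCallPromotion(/*IsInLTO=*/true, /*SamplePGO=*/true));
  }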

paulkirth updated this revision to Diff 517415.Apr 26 2023, 6:08 PM

Rebase and try to more closely match the perModuleDefaultPipeline

Since the pre-link pipelines seem to run module simplification + module
optimization differently than for the non-lto default pipeline, we try to
approximate the default pipeline more closely by running those two pipelines
as they would be run for non-lto compilation.

The patch also updates the documentation to match the new approach.

@tejohnson Do you have any thoughts here regarding the pipeline or other shortcomings this approach may have?

Sorry for missing the earlier ping. I have some concerns about invoking just the ThinLTO prelink pipeline + the module optimization pipeline. Specifically, afaict this only invokes the module simplification pipeline once, with Phase == ThinLTOPreLink. If you search through buildModuleSimplificationPipeline for Phase, you will see there are certain things we only do if we are in the post link phase, or if we are not in the pre-link phase. We will miss out on some optimizations if we never invoke this again with a post link phase. For example, with SamplePGO, we only do indirect call promotion in the post link phase. We will miss out on ICP completely by never setting up the module simplification pipeline again with a post link phase. There are other examples, and additional ones if you look down into the inliner pipeline setups invoked from here and passed the phase.

Thanks for the feedback. After some trial and error, I think that running module simplification + module optimization should be a good first approximation of the default pipeline. Since the goal for FatLTO is to generate a section w/ LTO compatible bitcode + a text section roughly approximating the per-TU -O<n> output, I think that would get us there without introducing any obvious gotchas. It isn't ideal to run so many passes again, but from what I could tell it didn't seem like they would do anything wrong.

That said, I missed the differences in module simplification, so please let me know if that isn't accurate.

for your use case, is it ok for performance to not be 100% if you go down the non-LTO route? presumably if you're not using LTO you're not caring about squeezing out 100% performance. for example in Chrome-land we've talked about this sort of thing for tests where we don't really care if performance is optimal, so an approximation of -O2 (or whatever) is good enough. if we solidify this as a tradeoff and don't promise 100% compatibility with the default optimization pipeline, I'm much more comfortable with that as long as people don't complain if FatLTO non-LTO performance ever slightly degrades

Right, I think we basically have two options on how to do this:

  1. Make fat LTO exactly equivalent to LTO bitcode + default-optimized object code. This would require cloning the input module, running the LTO pipeline on one module, embedding the bitcode in the other and then running the default pipeline in that one. This will robustly match the LTO/default behavior exactly, at the cost of compile-time.
  2. Make fat LTO approximately equivalent to LTO bitcode + default-optimized object code, with the usual cost-benefit tradeoffs. In this case, I don't think that rerunning module simplification after embedding bitcode makes much sense, because the differences between the pre-link ThinLTO pipeline and the default pipeline are too minor.

I believe the differences are:

  • Various bits related to SamplePGO.
  • The (currently disabled) MemProfiler pass. This looks like a plain bug to me: What is an instrumentation pass doing in the simplification pipeline? I'm pretty sure this should be in the optimization pipeline instead. Even if not, we could easily explicitly run this one, because it's at the very end of the simplification pipeline.
  • A single GlobalDCE pass at the end of the pipeline. I'm pretty sure this is another bug, because the comment says "Remove dead code, except in the ThinLTO pre-link pipeline where we may want to keep available_externally functions", even though available_externally functions only become relevant post-link, not pre-link.
  • Loop rotation will not duplicate calls in headers pre-link.

Ignoring the bugs, the only substantial differences seem to be related to SamplePGO, which I'm not familiar with. I'm not clear on how SamplePGO relates to fat LTO usage scenarios.

for your use case, is it ok for performance to not be 100% if you go down the non-LTO route? presumably if you're not using LTO you're not caring about squeezing out 100% performance. for example in Chrome-land we've talked about this sort of thing for tests where we don't really care if performance is optimal, so an approximation of -O2 (or whatever) is good enough. if we solidify this as a tradeoff and don't promise 100% compatibility with the default optimization pipeline, I'm much more comfortable with that as long as people don't complain if FatLTO non-LTO performance ever slightly degrades

I agree ensuring full performance with FatLTO's non-LTO native object isn't terribly important. But my concern is whether this results in some unexpected behavior with certain passes not being run at all - e.g. ICP won't run at all if you are doing SamplePGO in this configuration.

Right, I think we basically have two options on how to do this:

  1. Make fat LTO exactly equivalent to LTO bitcode + default-optimized object code. This would require cloning the input module, running the LTO pipeline on one module, embedding the bitcode in the other and then running the default pipeline in that one. This will robustly match the LTO/default behavior exactly, at the cost of compile-time.
  2. Make fat LTO approximately equivalent to LTO bitcode + default-optimized object code, with the usual cost-benefit tradeoffs. In this case, I don't think that rerunning module simplification after embedding bitcode makes much sense, because the differences between the pre-link ThinLTO pipeline and the default pipeline are too minor.

I believe the differences are:

  • Various bits related to SamplePGO.
  • The (currently disabled) MemProfiler pass. This looks like a plain bug to me: What is an instrumentation pass doing in the simplification pipeline? I'm pretty sure this should be in the optimization pipeline instead. Even if not, we could easily explicitly run this one, because it's at the very end of the simplification pipeline.

I added this. As with other profile instrumentation passes it is off by default. I think I added it here because this is also where we add the PGO instrumentation passes, although I suppose there are specific reasons for doing those here (i.e. inlining is also invoked here). I can move it out so this becomes a non-issue.

  • A single GlobalDCE pass at the end of the pipeline. I'm pretty sure this is another bug, because the comment says "Remove dead code, except in the ThinLTO pre-link pipeline where we may want to keep available_externally functions", even though available_externally functions only become relevant post-link, not pre-link.

ThinLTO importing is not the only producer of available_externally functions. There can be available_externally functions coming out of clang, and we want them to survive until after importing and the post LTO link inlining is done.

  • Loop rotation will not duplicate calls in headers pre-link.

This seems minor I agree.

Ignoring the bugs, the only substantial differences seem to be related to SamplePGO, which I'm not familiar with. I'm not clear on how SamplePGO relates to fat LTO usage scenarios.

As mentioned earlier, I think the question is whether it will be a surprise when using SamplePGO that certain things simply don't get run when using the non-LTO native objects. I don't know enough about when these will get used to know if this is a configuration anyone will care about. If it is, I suppose one approach would be to invoke the longer pipelines only under SamplePGO. Either way, this probably needs some clear documentation.

llvm/lib/Passes/PassBuilderPipelines.cpp
1485

Why pass in phase None for these calls to module simplification and optimization passes, instead of the corresponding post-LTO link phase? I would think we would want the complement of the phase provided earlier. I'm pretty sure there are some passes that are configured to run in exactly one of these phases, and we might end up duplicating them if running a pre-LTO link phase + a default (non-LTO) phase optimization pipeline.

  • The (currently disabled) MemProfiler pass. This looks like a plain bug to me: What is an instrumentation pass doing in the simplification pipeline? I'm pretty sure this should be in the optimization pipeline instead. Even if not, we could easily explicitly run this one, because it's at the very end of the simplification pipeline.

I added this. As with other profile instrumentation passes it is off by default. I think I added it here because this is also where we add the PGO instrumentation passes, although I suppose there are specific reasons for doing those here (i.e. inlining is also invoked here). I can move it out so this becomes a non-issue.

Right. I'd expect the pipeline position here to be based on whether the instrumentation needs to be pre-inline or post-inline. The pre-inline (non-CS) PGO instrumentation runs during simplification, the post-inline CS PGO instrumentation is part of optimization.

  • A single GlobalDCE pass at the end of the pipeline. I'm pretty sure this is another bug, because the comment says "Remove dead code, except in the ThinLTO pre-link pipeline where we may want to keep available_externally functions", even though available_externally functions only become relevant post-link, not pre-link.

ThinLTO importing is not the only producer of available_externally functions. There can be available_externally functions coming out of clang, and we want them to survive until after importing and the post LTO link inlining is done.

Good point. The justification still seems somewhat dubious to me, because GlobalDCE will only remove available_externally functions if they aren't referenced. Referenced available_externally functions are only dropped by EliminateAvailableExternally, which we don't run pre-link. It's not obvious to me why we would want to keep around unreferenced available_externally functions pre-link even in a ThinLTO scenario.

paulkirth updated this revision to Diff 517705.Apr 27 2023, 1:51 PM

Use separate modules and pipelines to optimize the bitcode and assembly sections.

  • The EmbedBitcodePass (which maybe should be renamed now...) was updated to clone the current module and run a separate pipeline on the clone (see the sketch below this list).
  • The cloned module is then used to generate the .llvm.lto bitcode section and discarded.
  • The EmbedBitcodePass is initialized in the FatLTO pipeline with the correct pre-link LTO pipeline.
  • The FatLTO pipeline now only runs the EmbedBitcodePass and then the PerModuleDefaultPipeline.
  • The MPM used to optimize the cloned module is generated from the same pass builder as the default pipeline, so there shouldn't be any difference in configuration.
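
A rough sketch of that shape, under the assumption (not taken verbatim from the patch) that the pass owns the pre-link pipeline it was constructed with; the class name, buffer identifier, and the conservative PreservedAnalyses choice are illustrative:

  #include "llvm/ADT/SmallString.h"
  #include "llvm/Bitcode/BitcodeWriter.h"
  #include "llvm/IR/Module.h"
  #include "llvm/IR/PassManager.h"
  #include "llvm/Support/MemoryBufferRef.h"
  #include "llvm/Support/raw_ostream.h"
  #include "llvm/Transforms/Utils/Cloning.h"

  using namespace llvm;

  // Sketch: clone the module, run the pre-link LTO pipeline on the clone, embed
  // the clone's bitcode into the original module, and discard the clone. The
  // original module then continues through the normal per-module pipeline.
  class EmbedBitcodeSketchPass : public PassInfoMixin<EmbedBitcodeSketchPass> {
    ModulePassManager PreLinkMPM;

  public:
    explicit EmbedBitcodeSketchPass(ModulePassManager &&MPM)
        : PreLinkMPM(std::move(MPM)) {}

    PreservedAnalyses run(Module &M, ModuleAnalysisManager &MAM) {
      std::unique_ptr<Module> Cloned = CloneModule(M);
      PreLinkMPM.run(*Cloned, MAM);

      SmallString<0> Bitcode;
      raw_svector_ostream OS(Bitcode);
      WriteBitcodeToFile(*Cloned, OS);
      embedBufferInModule(M, MemoryBufferRef(Bitcode, "fat-lto"), ".llvm.lto");

      // Adding the embedded global mutates the module, so be conservative.
      return PreservedAnalyses::none();
    }
  };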
paulkirth added inline comments.Apr 27 2023, 2:07 PM
llvm/lib/Passes/PassBuilderPipelines.cpp
1485

The intent there was to closely mimic the non-lto pipeline. For now I've attempted to just split everything, but I'm not sure I love the approach. It's certainly simple, but I didn't like giving a pass its own passmanager... plus it does seem to be a bit wasteful to run two complete pipelines on the module.

@tejohnson @nikic Are there any unresolved issues w/ this approach, now that it uses two completely disjoint pipelines?

tejohnson added inline comments.Thu, May 25, 11:37 AM
llvm/lib/Passes/PassBuilderPipelines.cpp
1485

I don't think the prior code mimic'ed the non-LTO pipeline, however. My understanding is that the prior version did the following for the non embedded code on which it would generate a native object:

(Thin)LTOPreLink + ModuleSimplification(ThinOrFullLTOPhase::None) + ModuleOptimization(ThinOrFullLTOPhase::None)

So you would have been essentially duplicating ModuleSimplification (since it is invoked in both the *LTO pre-link and again with LTO Phase = None). That can cause some issues because there are passes that are only meant to be invoked once that you could duplicate. A good example is PGO instrumentation/annotation (see the conditions under which addPGOInstrPasses is called from buildModuleSimplificationPipeline).

With your new version, my understanding is that you are calling:

  1. buildPerModuleDefaultPipeline on original Module -> generate native object
  2. (Thin)LTOPreLink on Module clone -> embed IR

And buildPerModuleDefaultPipeline is largely just ModuleSimplification + ModuleOptimization. So I'm not sure about why it would be more wasteful (other than cloning the Module) - you are not running more pipelines total than in the prior code. Unless I am misunderstanding the former or current code, which is possible!

paulkirth marked 5 inline comments as done.Thu, May 25, 2:17 PM
paulkirth added inline comments.
llvm/lib/Passes/PassBuilderPipelines.cpp
1485

I don't think the prior code mimic'ed the non-LTO pipeline, however. My understanding is that the prior version did the following for the non embedded code on which it would generate a native object:

(Thin)LTOPreLink + ModuleSimplification(ThinOrFullLTOPhase::None) + ModuleOptimization(ThinOrFullLTOPhase::None)

So you would have been essentially duplicating ModuleSimplification (since it is invoked in both the *LTO pre-link and again with LTO Phase = None). That can cause some issues because there are passes that are only meant to be invoked once that you could duplicate. A good example is PGO instrumentation/annotation (see the conditions under which addPGOInstrPasses is called from buildModuleSimplificationPipeline).

With your new version, my understanding is that you are calling:

  1. buildPerModuleDefaultPipeline on original Module -> generate native object
  2. (Thin)LTOPreLink on Module clone -> embed IR

And buildPerModuleDefaultPipeline is largely just ModuleSimplification + ModuleOptimization. So I'm not sure about why it would be more wasteful (other than cloning the Module) - you are not running more pipelines total than in the prior code. Unless I am misunderstanding the former or current code, which is possible!

I think that's an accurate summary. While my intent originally was to closely mimic the default pipeline, I don't think it was very close in reality. Also, as you pointed out the new approach isn't nearly as wasteful as I felt initially.

I agree ensuring full performance with FatLTO's non-LTO native object isn't terribly important. But my concern is whether this results in some unexpected behavior with certain passes not being run at all - e.g. ICP won't run at all if you are doing SamplePGO in this configuration.

...

As mentioned earlier, I think the question is whether it will be a surprise when using SamplePGO that certain things simply don't get run when using the non-LTO native objects. I don't know enough about when these will get used to know if this is a configuration anyone will care about. If it is, I suppose one approach would be to invoke the longer pipelines only under SamplePGO. Either way, this probably needs some clear documentation.

These are all performance concerns for the non-LTO native objects, not correctness, right? I think if we clearly document that performance might be very meh for fat LTO non-LTO native objects this would be fine.

llvm/include/llvm/Bitcode/EmbedBitcodePass.h
33

is this default param necessary?

llvm/lib/Passes/PassBuilderPipelines.cpp
1265

do we still need IsFatLTO with the latest approach?

1485

IIUC the previous version was ThinLTOPreLink + ModuleOptimization(ThinOrFullLTOPhase::None), where we'd write out the module after pre-link and before module optimization to some global. So we'd avoid running the simplification pipeline twice with that. (the problem is that the simplification pipeline is slightly different between a normal build and a ThinLTO build)

the current version will run the simplification pipeline twice now, once for ThinLTO and once for the default pipeline

llvm/lib/Passes/PassRegistry.def
61

these should be parameterizable, see MODULE_PASS_WITH_PARAMS
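
For reference, parameterized entries in PassRegistry.def follow the pattern below; the option type and parser name here are hypothetical, shown only to illustrate what an embed-bitcode<...> registration could look like (it is not the patch's actual entry):

  // Hypothetical PassRegistry.def entry; EmbedBitcodeOptions and
  // parseEmbedBitcodePassOptions are illustrative names.
  MODULE_PASS_WITH_PARAMS(
      "embed-bitcode", "EmbedBitcodePass",
      [](EmbedBitcodeOptions Opts) { return EmbedBitcodePass(Opts); },
      parseEmbedBitcodePassOptions, "thinlto;emit-summary")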

paulkirth marked an inline comment as done.Thu, May 25, 2:37 PM

These are all performance concerns for the non-LTO native objects, not correctness, right? I think if we clearly document that performance might be very meh for fat LTO non-LTO native objects this would be fine.

In the prior version, I think it's too hard to say if running pipelines, like ModuleSimplification, multiple times is safe/correct without some fairly detailed analysis. I can easily imagine that a transform that is only expected to run once may introduce a subtle bug if run again on the same module. Nothing obvious here jumps out at me, but I'm also not confident enough to say that it can't happen.

That said, since FatLTO no longer uses a modified/custom pipeline, but uses the (Thin)LTO and Module pipelines on independent modules, there should no longer be any correctness or performance issues over a normal compilation for the non-LTO object code.

llvm/include/llvm/Bitcode/EmbedBitcodePass.h
33

IIRC it was required for some reason, but let me double check that.

llvm/lib/Passes/PassBuilderPipelines.cpp
1265

That's a good point. I don't think so, so I can simplify this further. Thanks for pointing this out.

llvm/lib/Passes/PassRegistry.def
61

That is a much better way to handle this. Thank you.

paulkirth updated this revision to Diff 525901.Thu, May 25, 6:29 PM
paulkirth edited the summary of this revision.

Update summary, documentation, and address comments.

aeubanks added inline comments.Tue, May 30, 10:59 AM
llvm/include/llvm/Bitcode/EmbedBitcodePass.h
27

extraneous ;

29

this is never used

44

if you really wanted to hook this up to the pipeline parsing (so you could specify a sub-passmanager) you'd have to thread through all the parsing code. probably not worth it

llvm/include/llvm/Passes/PassBuilder.h
242

comment out of date

llvm/lib/Passes/PassBuilderPipelines.cpp
1265

still not removed?

1471

this comment is obsolete now

paulkirth updated this revision to Diff 527074.Wed, May 31, 9:03 AM
paulkirth marked 7 inline comments as done.

Address remaining comments

  • Fix Doc string
  • Remove obsolete comment
  • Remove uses of IsThinkLTO
  • Remove some dead code
  • Fix some typos
  • Fix test looking for wrong pass name in the error string
  • git clang-format
llvm/lib/Passes/PassBuilderPipelines.cpp
1471

I was thinking that once that patch lands, we can take the ThinOrFullLTOPhase directly and avoid the branch, but maybe it still makes sense to leave this as is.

this basically lgtm

llvm/lib/Bitcode/Writer/EmbedBitcodePass.cpp
31–38

these should be unnecessary with the newly added pass params, e.g. -passes=embed-bitcode<emit-summary>

paulkirth updated this revision to Diff 529076.Tue, Jun 6, 4:07 PM

Rebase

  • Fix typo in docs
  • Remove unneeded cl::opts
paulkirth marked an inline comment as done.Tue, Jun 6, 4:08 PM

@tejohnson @nikic any lingering concerns?