This change allows lld to deduplicate bitcode file comdat groups, which will allow it to deduplicate .debug_types sections in thinlto.
For more context, see https://reviews.llvm.org/D62884
Differential D80765
[ELF] Handle bitcode comdat groups separately to deduplicate thinlto comdat sections christylee on May 28 2020, 3:29 PM. Authored by
Details This change allows lld to deduplicate bitcode file comdat groups, which will allow it to deduplicate .debug_types sections in thinlto. For more context, see https://reviews.llvm.org/D62884
Diff Detail
Unit Tests Event TimelineComment Actions Would you mind making the equivalent changes to lld/wasm? And a test?
This comment was removed by christylee. Comment Actions @sbc100 What would be a good way to go about writing a test for this? The reason why I noticed this bug was because .debug_types are not deduplicated in thinlto, but writing a lld test case using debug metadata seems clunky. I've also tried something like this but it doesn't repro: ; REQUIRES: x86 ; RUN: llvm-as %s -o %t.o ; RUN: llc %t.o -o %t2.o -filetype=obj ; RUN: ld.lld %t.o %t2.o %t.o %t2.o -o %t3.o --shared ; RUN: llvm-readobj --symbols %t3.o | FileCheck %s ; CHECK: Name: foo ; CHECK-NEXT: Value: ; CHECK-NEXT: Size: 1 ; CHECK-NEXT: Binding: Global ; CHECK-NEXT: Type: Function ; CHECK-NEXT: Other: 0 ; CHECK-NEXT: Section: .text target triple = "x86_64-unknown-linux-gnu" target datalayout = "e-m:e-p270:32:32-p271:32:32-p272:64:64-i64:64-f80:128-n8:16:32:64-S128" $foo = comdat any define void @foo() comdat { ret void } Comment Actions
I'm sorry I've not looked into this in much detail but shouldn't this same bug reproduce with all comdats? Maybe there is something different about debug metatdata? BTW, the test above doesn't use lto or thinglto does it? Comment Actions Addressed comments: 1). Changed isBitcodeFile to isLTOOutput. Change for COFF to follow. Comment Actions We don't need to make the same change to COFF. COFF does not ignore comdats, it attempts to add a comdat symbol as long as the symbol is external. It also tracks whether a symbol is from a regular object file or bitcode file so it won't have duplicates. Comment Actions The LLD change looks fine to me. But I am not well familiar with LTO stuff and hence do not feel myself is a right person to approve it. Comment Actions @tejohnson is a good one. I'll also get to this patch soon. I tried an earlier version of this patch and noticed issues testing internally. I'll retest since the patch has changed a lot. Comment Actions Since this is very specific to lld, and not in the LLVM LTO handling, I'm not sure I'm the best person to review. I looked at the change but don't have enough understanding of how lld is otherwise handling comdat groups to do a really informed review.
Sounds good.
Comment Actions Apologies for my slowness getting to this patch. symtab->ltoOutputComdatGroups does work: % readelf -g a3.o.lto.o a3.o1.lto.o File: a3.o.lto.o COMDAT group section [ 4] `.group' [4068369915778327548] contains 2 sections: [Index] Name [ 5] .debug_types [ 6] .rela.debug_types File: a3.o1.lto.o COMDAT group section [ 4] `.group' [4068369915778327548] contains 2 sections: [Index] Name [ 5] .debug_types [ 6] .rela.debug_types The .debug_types from a3.o1.lto.o can be discarded by the new COMDAT logic. However, I am concerned that making it general as this patch does can miss some codegen bugs. See @pcc's argument in
Comment Actions Thanks for taking time reviewing this change. It is an interesting argument whether LTO should introduce new COMDAT symbols. Do we have a conclusion on this? So far the .debug_types are the only COMDAT we've seen introduced by LTO, but in theory any LTO-exclusive pass may introduce a new COMDAT symbol and it's up to the pass how to set the symbol's linkage type.
Comment Actions In case we are concerned about correctness, we've been using this patch internally for a couple of months with no problems. Comment Actions
This still looks like a workaround done at an inappropriate layer to me. The .debug_types code generator should probably know that some .debug_types sections should not be emitted. Comment Actions
@MaskRay , one of our thinlto binaries went from a 6.9 GiB .debug_types section to 420 MiB, while another went from 7.8 GiB to 370 MiB. Comment Actions Yeah - one alternative here would be for ThinLTO To "home" debug type information, the same way I believe it "homes" inline functions. (original/incoming IR to the thinlink may have multiple definitions of inline functions - one in every module that has a call to the inline function - but something in the think link summary basically says "drop this definition" in all but one of those modules and in that one blessed module it says "always produce a weak definition of this function, even if you don't need it" - something like that should be done with type information, which would remove the redundancy in the object files) In the case of a full executable link (with the homing above implemented), I'd then suggest not enabling type units, since they just add overhead and wouldn't reduce any duplication - in the case of a static library built with ThinLTO (if that's even a thing), then you'd still potentially want to use type units so they could be deduplicated against other libraries. Comment Actions Enabling compile-time type deduplication is indeed an ideal solution, as you pointed out finding a prevailing symbol definition and dropping others for duplicate functions from different modules. This would require quite some work in parsing type metadata on the IR and might slow down thinLink. Also this may not completely address deduplication of new data introduced during LTO postLink code generation time. The type unit level deduplication could be the last resort to handle the duplicates, like what is done in non-LTO build. Comment Actions Fair point - though presumably the first stage compile would do the metadata walk through the IR, provide a list of types in the thin summary file and the thin link would then communicate to the backend compiles specifying which types each backend compile should emit into its respective object file.
Not sure I follow - what duplication would remain if the thin link homed type descriptions? When linking to non-LTO (normal machine code/object files) libraries? Yeah, fair point (tangential warning: LLVM's type unit hashes are non-DWARF-conforming and not compatible with GCC's type unit hashes... :/) It /sounds/ like, maybe the original fix in D62884 maybe was going in the wrong direction, and instead ThinLTO should be removing the comdat group from "homed" inline functions? Then there wouldn't be a need for a special case in how comdats are handled. Comment Actions I am with @dblaikie here. A bit of archaeology finds that rG4de44b7ef87bcef83798eee69fdcbfab9866d52e was probably going in the wrong direction but it was indeed the simplest/least intrusive approach. .section foo,"aG",@progbits,aaa,comdat .section foo,"aG",@progbits,bbb,comdat are two sections. Dropping the comdat key will make them one combined section. Unique linkage may work around it but there may be a fair amount of complexity in TargetLoweringObjectFileImpl/MC. Comment Actions Yeah, maybe something near/similar to https://github.com/llvm-mirror/llvm/blob/master/lib/Transforms/Utils/FunctionImportUtils.cpp#L275 - where the comdat is imported as available externally, and its comdat group is stripped. Theory is, maybe we should be stripping the comdat on the other side too. Hmm, maybe not, though? What about non-whole-program-ThinLTO? There could be a comdat copy of this function in a static library that it should be deduplicated against? I take it the underlying assumption lld is making is that a comdat group is only kept alive by references from within the object file that defines it? Is that formally specified/required? If not, then maybe we could/should remoev that assumption, so that ThinLTO can work naturally with non-(Thin)LTO archives and still do the suitable comdat deduplication or dropping?
Is that a problem? When linked they end up in the same section, right?
What additional arbitrary comdat objects are being generated or proposed to be generated? Comment Actions This is probably fine if the static library is a native input to lld , since lld itself can deduplicate comdat groups of its native input.
So far the Dwarf types are the only new artifacts generated by the LTO backend. What I was wondering was new LTO-exclusive codegen passes that may introduce new comdat groups, but sound like it is a design principle that new symbols/data should be only generated in preLink LTO phase. Comment Actions Oh, yeah, fair point.
Ah, OK, "new" in the sense that they weren't present as comdat groups in the ThinLTO input IR. What do we gain by having this restriction? I guess it removes a certain amount of work the linker would otherwise do when performing the final link step in ThinLTO? Anyone have a sense of the significance of that restriction compared to linking "as normal"? |
Perhaps this would better be names something like isLTOOutput? Otherwise it sounds like it might a bitcode object which is a different class here.