This patch adds support for debug_loclists.dwo section in llvm and llvm-dwarfdump.
Also Fixes PR43622, PR43623.
Details
Diff Detail
Event Timeline
llvm/lib/CodeGen/AsmPrinter/DwarfDebug.cpp | ||
---|---|---|
2489 | This function looks like it may be more readable if it were organized like this: if (getDwarfVersion() >= 5) { for (const auto &List : DebugLocs.getLists()) ... } else /* DWARF v2-4 */ { for (const auto &List : DebugLocs.getLists()) ... } | |
2506 | Can you please make sure all comments are complete sentences ending in a .? |
llvm/lib/CodeGen/AsmPrinter/DwarfDebug.cpp | ||
---|---|---|
1149–1154 | This is a bit of a surprising/uncommon piece of code (using a conditional operator without using its result, and using the comma operator). Probably best to use more common constructs for this sort of situation. I don't think there's any need to name the label differently (loclists_table_base/loclists_dwo_table_base) depending on dwo usage. Probably just keep it with a single/the same name always (loclists_table_base). | |
2480–2484 | This function handles the pre-standard loclists format (which can't use some of the new/more compact encoding options available in DWARFv5 - as the comment inside the loop mentions) It'd be better to reuse the emitDebugLoc() function (refactored to be able to use the appropriate .dwo section in some way (either add a function parameter, or choose the section based on whether Split DWARF is enabled, etc)) because it now has some advanced handling of choosing base address selection entries, etc. | |
llvm/lib/DebugInfo/DWARF/DWARFDie.cpp | ||
145–147 | I'm /guessing/ this is probably not compatible with DWP files (the DWARFUnit's "LocSectionData" is carefully populated from the DWP's cu_index - see DWARFUnit's ctor & try testing dumping of a DWP file (admittedly, there's no DWP tool that can produce a DWP file for DWARFv5 right now, so you'd have to hand-craft one/locally patch llvm-dwp). | |
llvm/test/DebugInfo/Inputs/dwarfdump-test-loclists-dwo.c | ||
2–19 | Probably easier/better to add extra RUN/check to test/DebugInfo/X86/sret.ll - see my changes in r374600: http://llvm.org/viewvc/llvm-project?view=revision&revision=374600 |
llvm/lib/CodeGen/AsmPrinter/DwarfDebug.cpp | ||
---|---|---|
1149–1154 | Yeah, I'll clean this up in next revision. basically I was only interested in creating symbol based on whether we are in split mode or not. So didn't bothered to check the result of ternary operator. | |
2480–2484 | Hi @dblaikie, Thanks for reviewing this. if (useSplitDwarf()) emitDebugLocDWO(); -- this handles both v4 and v5 loc and loclists format else // Emit info into a debug loc section. emitDebugLoc(); -- as you, mentioned this only handles pre-standard loclist format.. So I choose to add it here. Now your suggestion is to add the v5 loclist.dwo format to emitDebugLoc() function -- as it can handle v4 and v5 both ?? if (getDwarfVersion() == 4 && UseSplitDwarf()) emitDebugLocDWO() -- handles only v4 split mode else () emitDebugLoc() -- handles everything else -- v4 v5 and v5 + split | |
llvm/lib/DebugInfo/DWARF/DWARFDie.cpp | ||
145–147 | I haven't got a chance to look into details of llvm-dwp utility. | |
llvm/test/DebugInfo/Inputs/dwarfdump-test-loclists-dwo.c | ||
2–19 | Adding extra checks to test/DebugInfo/X86/sret.ll is fine. As I can see from my initial implementation based on your suggestion. test/DebugInfo/X86/sret.ll this suffice our needs here. |
llvm/lib/CodeGen/AsmPrinter/DwarfDebug.cpp | ||
---|---|---|
2480–2484 | Eh, yeah, looks like that'd be inconsistent with the other section support. looking at emitDebugRanges/emitDebugRangesDWO - I've refactored those in llvmorg-10-init-8906-g89b7f16204a - this should give a good basis on which to refactor emitDebugLoc/emitDebugLocDWO. With the exception that the existing emitDebugLocDWO will need to be a special case (the curretn/old code) that doesn't exist in the debugRanges/RangesDWO code. | |
llvm/lib/DebugInfo/DWARF/DWARFDie.cpp | ||
145–147 | Yeah, maybe leave a FIXIT here about DWP support? I think the whole thing needs a bit more deep surgery for more complete/general DWP support & such, out of scope for this change. | |
llvm/test/DebugInfo/Inputs/dwarfdump-test-loclists-dwo.c | ||
2–19 | Oh, sorry - a better test that's got source in there and everything, is llvm/test/CodeGen/X86/debug-loclists.ll Yes, you could add a test that tests that with split DWARF (& them dumping loclists.dwo from the dwo file). See llvm/test/DebugInfo/X86/cu-ranges.ll for the range list equivalent that tests both split and non-split forms. |
Sorry for the trouble, I should be able to land the other patches soon.
llvm/lib/DebugInfo/DWARF/DWARFDie.cpp | ||
---|---|---|
145–147 | Yep, lldb does not support loclists.dwo (and its support for the non-dwo variant is somewhat dodgy as well). I am planning to address all of that by making lldb use the llvm classes for parsing (hence my other patches). |
llvm/lib/CodeGen/AsmPrinter/DwarfDebug.cpp | ||
---|---|---|
1195 | In current implementation, emitDebugLocDWO() only handles -gdwarf-4 -gsplit-dwarf case. |
llvm/lib/CodeGen/AsmPrinter/DwarfDebug.cpp | ||
---|---|---|
1195 | I also have another implementation, which keeps above code uniform as in if (useSplitDwarf()) emitDebugLocDWO(); else emitDebugLoc(); But bloat's up a little emitDebugLocDWO() function. This is a bit cleaner but with one extra check in if. |
Since DW_FORM_loclistx form is supported now.
I'm planning to revise this to accommodate DW_FORM_loclistx form.
Thank you for your patience.
Hi @labath @dblaikie , could you guys please have a look at this revised update. So that we all are on same we page as we progress. Pavel has done lot changes thanks a lot, I just wanted my work to be in sync with his work.
+ This patch doesn't take account of dwp files{Dwarf-5}. dwp prior{Dwarf-5} is intact.
+ Though I haven't look at this D70394 but their might be small conflict with this. But I'm happy to resolve that and address comments on this.
Thanks!
llvm/lib/DebugInfo/DWARF/DWARFUnit.cpp | ||
---|---|---|
189 ↗ | (On Diff #230024) | I wrote this as conditional, just to avoid breaking of DWP{Prior Dwarf-5} stuff . |
I've only looked at the consumption side, and it seems reasonable to me. The main thing I noticed is that it would be good to abstract the logic for computing the actual final offset in the loclists table somehow (I'm talking about the code that's presently in DWARFDie.cpp:dumpLocation). That's because the this logic will need to be repeated in the code which fetches the final resolved addresses programmatically (DWARFDie::getLocations in D70394). It also probably means that the code in that patch does not handle all of these cases correctly.
However, currently I don't know what's the best way to do that (and I don't even think it's you who needs to make that happen). Though, if you have any thoughts on this, I'd like to hear them.
llvm/lib/DebugInfo/DWARF/DWARFUnit.cpp | ||
---|---|---|
190–199 ↗ | (On Diff #230024) | I'd probably factor this so that just the DWARFDataExtractor is created in the conditional, and the creation of the loclists object is a separate statement. |
llvm/lib/CodeGen/AsmPrinter/DwarfDebug.cpp | ||
---|---|---|
2435–2452 | I'm not sure I understand why this change is here - if I'm reading it right, the only difference between the two emitRangeList calls is "ShouldUseBaseAddress" - if that change is needed, perhaps it'd be better to have a single call that uses "!DD.useSplitDwarf()" as the "ShouldUseBaseAddress" parameter. But I don't think that's correct - base addresses should be used in split and non-split DWARF5, I think. I believe base addresses should not be used in pre-DWARF5 split DWARF, but that codepath isn't using this function right now. (I think maybe it should, though) | |
2515–2518 | What motivated this change? Perhaps this should be in a separate patch if there's a bug to fix here that's separate from the new loclist support being added? | |
llvm/lib/DebugInfo/DWARF/DWARFUnit.cpp | ||
514 ↗ | (On Diff #230024) | Can probably remove this FIXME now. |
529 ↗ | (On Diff #230024) | Maybe in the IsDWO case just set it to "HeaderSize"? Hmm, @aprantl @probinson - I can't find the wording in the DWARF5 spec about the use of loclists_base and rnglists_base in split units. It's unclear if these need to be specified in the split unit, or if they're assumed to be zero (or, technically, zero + sizeof(list header))? Currently LLVM doesn't generate a rnglists_base for split DWARFv5 units (I haven't checked the history on that change - but I might've had a hand in it) & this change is going to propagate that choice & do similarly for loclists_base. GCC 8.1 at least doesn't seem to give us any hint there - since they don't use rnglists.dwo so far as I can tell (I induced a function-local range list and it was put in the .o rnglists section, there was no rnglists.dwo section) In some ways it'd be tidier to emit it, of course - consistent between .o and .dwo, but also it'd likely be redundant/always have the same value in every .dwo file. |
llvm/test/CodeGen/X86/debug-loclists.ll | ||
5 | Does this check need a different prefix, or can it use the same/default prefix? At a glance it looks like all the CHECKs are the same (& I think they should be) so maybe they can be reused. | |
llvm/test/DebugInfo/X86/dwarfdump-debug-loclists-dwo.test | ||
2 | What does this test that's different from the debug-loclists.ll? This is a separate test for the parser only? We don't /always/ bother with that, but it's nice to have - if we're going to do that, the functionality should probably be committed in two separate patches - one with the parser functionality and its test, and then anotehr with the emission functionality and test. Is there no existing llm-dwarfdump/parser test for loclists? (existing source example that could be reused, possibly existing CHECKs that could be reused (they may need some adaptation to be flexible enough to handle a .dwo file, where the addresses won't be able to be resolved, etc), even if a new object file would have to be committed to go with it) |
llvm/lib/CodeGen/AsmPrinter/DwarfDebug.cpp | ||
---|---|---|
2435–2452 | Thanks for noting this. Yeah, I was previously of holding on opposite thought, then
This seems fine, after referring back to Spec. But bear with me for a moment, I'm noting one further glitch-- if we do pass DW_LLE_base_addressx (0x0000000000000000) -- coming in every pair with 0 DW_LLE_offset_pair (0x0000000000000000, 0x0000000000000004): DW_OP_reg5 RDI DW_LLE_offset_pair (0x0000000000000004, 0x0000000000000018): DW_OP_reg3 RBX DW_LLE_end_of_list () ` I'm guessing base is not their for split ?? Any thought on this ?? | |
2515–2518 | Sure, will do this separately. | |
llvm/lib/DebugInfo/DWARF/DWARFUnit.cpp | ||
190–199 ↗ | (On Diff #230024) | Thanks for suggestion, refactored code looks much cleaner. |
514 ↗ | (On Diff #230024) | Sure, Will do. |
llvm/test/CodeGen/X86/debug-loclists.ll | ||
5 | I'll try to remove redundancy, but I guess some are needed specially when checking dump of debug_loclists.dwo section. -- that contains / uses different form then non-split case. |
llvm/lib/CodeGen/AsmPrinter/DwarfDebug.cpp | ||
---|---|---|
2435–2452 | + Surprisingly this DW_LLE_base_addressx doesn't pops up in non-split case, Not sure if this is fine. |
llvm/lib/DebugInfo/DWARF/DWARFUnit.cpp | ||
---|---|---|
529 ↗ | (On Diff #230024) | Anyways it's getLocSectionBase() value is 0 . if (IsDWO) setLocSection(&Context.getDWARFObj().getLoclistsDWOSection(), 0); Added this here just for readibility / uniformity sake. |
llvm/lib/DebugInfo/DWARF/DWARFUnit.cpp | ||
---|---|---|
529 ↗ | (On Diff #230024) | Does LocSectionBase have to be zero? Could set the LocSectionBase to sizeof(header) instead? (If the consensus is that DW_AT_loclists_base is not needed in dwo files -- which seems reasonable to me.) That way, you wouldn't need to do the if(DWO) dance here and also in DWARFDie.cpp:dumpLocation. I think that would actually mostly address my earlier comment about having a "central" place to compute the final location section offset. |
llvm/lib/DebugInfo/DWARF/DWARFUnit.cpp | ||
---|---|---|
529 ↗ | (On Diff #230024) |
Yes, as per Spec also. I also tried to hack DW_AT_loclists_base to place in .dwo file. that ends up in a backend error
later, I re-read Spec to find the same, dwo section contain no relocation. this is also caused trouble, in another piece of my work, where I tried to insert
in a dwo file. |
llvm/lib/DebugInfo/DWARF/DWARFUnit.cpp | ||
---|---|---|
529 ↗ | (On Diff #230024) |
If that's the only problem then presumably the relocation could be dispensed with by emitting the attribute value as a constant, or a label difference (distance from the start of the section) instead of a plain label (though I don't think that is necessary). |
llvm/lib/CodeGen/AsmPrinter/DwarfDebug.cpp | ||
---|---|---|
2435–2452 |
Yes, it's expected that dumping loclists.dwo would print zero (Or some other small numbers) in a base_addressx or startx_length - those are address pool indexes, not addresses. The addresses themselves are in the debug_addr section in the .o file, so llvm-dwarfdump (dumping the .dwo file) doesn't have access to them - so it prints out the raw entry parameters (address indexes, offsets, etc) rather than the "processed" values you see when dumping non-split DWARF. It should do this in non-split DWARF too: $ cat loc.cpp int f1(int i, int j) { int x = 5; int y = 3; int r = i + j; int undef; x = undef; y = 4; return r; } void f2() { } blaikie@blaikie-linux2:~/dev/scratch$ clang++-tot -ffunction-sections -O3 loc.cpp -gdwarf-5 -c && llvm-dwarfdump-tot loc.o -debug-loclists -v loc.o: file format ELF64-x86-64 .debug_loclists contents: 0x00000000: locations list header: length = 0x00000035, version = 0x0005, addr_size = 0x08, seg_size = 0x00, offset_entry_count = 0x00000003 offsets: [ 0x0000000c => 0x00000018 0x0000001d => 0x00000029 0x00000025 => 0x00000031 ] 0x00000018: DW_LLE_base_addressx (0x0000000000000000) DW_LLE_offset_pair (0x0000000000000000, 0x0000000000000003): DW_OP_consts +3, DW_OP_stack_value DW_LLE_offset_pair (0x0000000000000003, 0x0000000000000004): DW_OP_consts +4, DW_OP_stack_value DW_LLE_end_of_list () 0x00000029: DW_LLE_startx_length (0x0000000000000000, 0x0000000000000003): DW_OP_consts +5, DW_OP_stack_value DW_LLE_end_of_list () 0x00000031: DW_LLE_base_addressx (0x0000000000000000) DW_LLE_offset_pair (0x0000000000000003, 0x0000000000000004): DW_OP_reg0 RAX DW_LLE_end_of_list () Though if you compile without -ffunction-sections, the CU will have a relocated base address, and the loclists will not need to use base_addressx - they will instead use offset_pair and rely on the CU's base address. In short: I would expect the loclists to be exactly the same in split and non-split cases. So I don't know why you're adding the "ShouldUseBaseAddress = false" part to the split case. | |
llvm/lib/DebugInfo/DWARF/DWARFUnit.cpp | ||
529 ↗ | (On Diff #230024) | @labath - yeah, I like that idea of changing: if (IsDWO) setLocSection(&Context.getDWARFObj().getLoclistsDWOSection(), 0); to if (IsDWO) setLocSection(&Context.getDWARFObj().getLoclistsDWOSection(), DWARFListTableHeader::getHeaderSize(Header.getFormat()); And then simplifying this: uint64_t Offset = IsDWO ? HeaderSize + getLocSectionBase() : getLocSectionBase(); down to: uint64_t Offset = getLocSectionBase(); @SouraVX - please make those changes. We'll continue here without loclists_base in the split unit & if that (& rnglists_base in split units) needs to be addressed, we'll do them in separate commits/discussions, so as to avoid conflating/complicating this review further. |
llvm/test/CodeGen/X86/debug-loclists.ll | ||
5 | I believe these should be the same in split and non-split case (as discussed in the other thread about the implementation and use of ShouldUseBaseAddress) at which point these tests should be/can be simplified to use the same CHECKs for both split and non-split. |
Addressed and incorporated suggestions by David, Pavel. Thanks for this.
- Cleaned up / refactored code.
- Removed following test /files - dwarfdump-test-loclists-dwo.c, dwarfdump-test-loclists-dwo.dwo, dwarfdump-debug-loclists-dwo.test
- Considering a separate patch for llvm-dwarfdump parser for loclists.dwo, using / based on above files.
The consumption side looks good to me. I'll leave it to David to figure out the production and testing story..
llvm/lib/DebugInfo/DWARF/DWARFDie.cpp | ||
---|---|---|
124 | revert the whitespace change too |
Hmm, @aprantl @probinson - I can't find the wording in the DWARF5 spec about the use of loclists_base and rnglists_base in split units. It's unclear if these need to be specified in the split unit, or if they're assumed to be zero (or, technically, zero + sizeof(list header))? Currently LLVM doesn't generate a rnglists_base for split DWARFv5 units (I haven't checked the history on that change - but I might've had a hand in it) & this change is going to propagate that choice & do similarly for loclists_base.
If you look at Table F.1, p.395 in the DWARF 5 spec, it shows DW_AT_loclists_base/rnglists_base as used in conventional units but not skeleton/split units. Given that there's an index, I think the attribute itself would be superfluous.
This is a bit of a surprising/uncommon piece of code (using a conditional operator without using its result, and using the comma operator). Probably best to use more common constructs for this sort of situation.
I don't think there's any need to name the label differently (loclists_table_base/loclists_dwo_table_base) depending on dwo usage. Probably just keep it with a single/the same name always (loclists_table_base).