This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/
-
include/llvm/MC/
-
llvm/
-
MC/
2/4
MCDwarf.h
-
lib/
-
CodeGen/AsmPrinter/
-
AsmPrinter/
1
DwarfCompileUnit.cpp
-
DwarfDebug.h
5/7
DwarfDebug.cpp
-
MC/
8/16
MCDwarf.cpp
-
test/DebugInfo/
-
DebugInfo/
-
XCOFF/
1/2
empty.ll
-
explicit-section.ll
-
function-sections.ll
-
debugline-endsequence.ll
-
debugline-endsequence.s

Differential D108261

[DebugInfo] Fix end_sequence of debug_line in LTO Object
ClosedPublic

Authored by kyulee on Aug 17 2021, 5:59 PM.

Download Raw Diff

Details

Reviewers

JDevlieghere
dblaikie
clayborg
vsk
jmorse
shchenz

Commits

rG6747d44bda8c: [DebugInfo] Fix end_sequence of debug_line in LTO Object

Summary

In a LTO build, the end_sequence in debug_line table for each compile unit (CU) points the end of text section which merged all CUs. The end_sequence needs to point to the end of each CU's range. This bug often causes invalid debug_line table in the final .dSYM binary for MachO after running dsymutil which tries to compensate an out-of-range address of end_sequence.
The fix is to sync the line table termination with the range operations that are already maintained in DwarfDebug. When CU or section changes, or nodebug functions appear or module is finished, the prior pending line table is terminated using the last range label. In the MC path where no range is tracked, the old logic is conservatively used to end the line table using the section end symbol.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

kyulee created this revision.Aug 17 2021, 5:59 PM

Herald added a reviewer: JDevlieghere. · View Herald TranscriptAug 17 2021, 5:59 PM

Herald added subscribers: ormris, steven_wu, hiraditya, inglorion. · View Herald Transcript

kyulee requested review of this revision.Aug 17 2021, 5:59 PM

Herald added a project: Restricted Project. · View Herald TranscriptAug 17 2021, 5:59 PM

Herald added a subscriber: llvm-commits. · View Herald Transcript

kyulee added reviewers: dblaikie, clayborg, vsk, jmorse.Aug 17 2021, 6:02 PM

ellis added a subscriber: ellis.Aug 17 2021, 6:23 PM

Would it be possible and easier to test this issue without any LTO concerns, with code like this:

void f1() { }
void __attribute__((nodebug)) void f2() { }

Looks like the bug manifests here by having the line table cover f2, when it should end at the end of f1 instead?

(admittedly there's still a problem, maybe this isn't sufficiently representative, if there was void f3() { } at the end - we don't end the sequence around f2 and restart it afterwards... )

Well, if you want to keep the existing LTO-related testing, you could link two IR files together with simple empty functions, rather than interesting functions like you have in your examples. Just void f1() { } and void f2() { } in two files, llvm-linkd together would suffice, I think?

llvm/lib/CodeGen/AsmPrinter/DwarfDebug.cpp
2167–2169	What's this refactoring for? Looks like it causes duplicate lookups of the DwarfCompileUnit, that might be nice to avoid/not introduce? (if it is necessary/preferred for some reason, might be worth doing separately from the rest of this patch?) (both callers already have the DwarfCompileUnit available - perhaps this function should take that instead of the MachineFunction? that way it could avoid the extra lookups)
2226–2230	Perhaps this could be done once outside the loop? // MBBSectionRanges is always non-empty at this point, right? (could assert just in case, but I think it's a reasonable invariant) Asm->OutStreamer->getContext() .getMCDwarfLineTable(getDwarfCompileUnitID(MF)) .getMCLineSections() .updateEndLabel(Asm->MBBSectionRanges.back().second.EndLabel);
llvm/lib/MC/MCObjectStreamer.cpp
517–518 ↗	(On Diff #367079)	When is this condition false? (is it tested?)
llvm/test/DebugInfo/lto-debugline-main.ll
1 ↗	(On Diff #367079)	This patch only applies to llvm's codegen - so the test probably shouldn't include opt or LTO tools - the post-LTO'd IR should be checked in directly & then fed into llc in the test to exercise the codegen codepath, most likely?

Harbormaster completed remote builds in B120025: Diff 367079.Aug 17 2021, 6:28 PM

clayborg added inline comments.Aug 17 2021, 8:43 PM

llvm/lib/CodeGen/AsmPrinter/DwarfDebug.cpp
2230	Are the "Asm->MBBSectionRanges" sorted? If not, do we need to find the highest EndLabel and only add that one?

kyulee mentioned this in D108271: [NFC][DebugInfo] getDwarfCompileUnitID.Aug 17 2021, 11:04 PM

Update the change based on the feedback

Refactor NFC change to a separate revision, https://reviews.llvm.org/D108271
Use a non-LTO unit test
Set EndLabel once after the loop
Simpliy the logic to directly use EndLabel for end_sequence address

kyulee added a parent revision: D108271: [NFC][DebugInfo] getDwarfCompileUnitID.Aug 17 2021, 11:10 PM

kyulee marked 4 inline comments as done.

kyulee edited the summary of this revision. (Show Details)Aug 17 2021, 11:13 PM

Harbormaster completed remote builds in B120054: Diff 367121.Aug 17 2021, 11:14 PM

Remove unnecessary MCObjectWriter.h inclusion.

Harbormaster completed remote builds in B120057: Diff 367123.Aug 18 2021, 12:05 AM

Restore symbol diff logic to fix Window failures

Harbormaster completed remote builds in B120155: Diff 367249.Aug 18 2021, 11:32 AM

kyulee mentioned this in rG829616c24119: [NFC][DebugInfo] getDwarfCompileUnitID.Aug 18 2021, 5:35 PM

Rebase. getDwarfCompileUnitID -> getDwarfCompileUnitIDForLineTable in https://reviews.llvm.org/D108271

Harbormaster completed remote builds in B120245: Diff 367372.Aug 18 2021, 6:20 PM

dblaikie added inline comments.Aug 18 2021, 10:35 PM

llvm/lib/MC/MCObjectStreamer.cpp
517–518 ↗	(On Diff #367079)	Still curious about this question ^

kyulee added inline comments.Aug 19 2021, 1:52 AM

llvm/lib/MC/MCObjectStreamer.cpp
517–518 ↗	(On Diff #367079)	I've attempted to delete this symbol diff check in https://reviews.llvm.org/D108261?id=367123 but https://buildkite.com/llvm-project/premerge-checks/builds/53063#d29a2537-c103-4035-86a0-4e01e0188dd4 showed many builds failed like Assertion failed: Abs && "We created a line delta with an invalid expression" So, I restored this logic back. As for `InSet = false`, I think it confirms the absolute symbol difference, or this check can be true optimistically by https://github.com/llvm/llvm-project/blob/d480f968ad8b56d3ee4a6b6df5532d485b0ad01e/llvm/lib/MC/MachObjectWriter.cpp#L680-L681. I tested the following test could fail if I set `InSet = true`. DebugInfo/X86/gmlt.test

dblaikie added inline comments.Aug 19 2021, 12:31 PM

llvm/lib/MC/MCObjectStreamer.cpp
517–518 ↗	(On Diff #367079)	This seems suspicious to me - "isSymbolRefDifferenceFullyResolvedImpl" (also a bit suspicious that this is calling an "Impl" function - usually those are only used as implementation details for some non-impl function, where the latter is intended as the general entry point) should only return false if the two symbols aren't in the same section. I wouldn't've thought that should happen - when/where/how/why does that happen?

Use isSymbolRefDifferenceFullyResolved instead of isSymbolRefDifferenceFullyResolvedImpl

kyulee added inline comments.Aug 19 2021, 2:15 PM

llvm/lib/MC/MCObjectStreamer.cpp
517–518 ↗	(On Diff #367079)	I've just replaced `isSymbolRefDifferenceFullyResolvedImpl` by `isSymbolRefDifferenceFullyResolved`. As shown https://github.com/llvm/llvm-project/blob/d480f968ad8b56d3ee4a6b6df5532d485b0ad01e/llvm/include/llvm/MC/MCObjectWriter.h#L67-L86, the arguments are very similar in these two except that one is with `MCSymbolRefExpr` and the other is with `MCSymbol`. Are you suggesting a new API `isSymbolRefDifferenceFullyResolved(constMCAssembler &Asm, constMCSymbolRefExpr A, constMCSymbolRefExpr B)` that defaults with `InSet = false`, and use it here?

dblaikie added inline comments.Aug 19 2021, 3:20 PM

llvm/lib/MC/MCObjectStreamer.cpp
517–518 ↗	(On Diff #367079)	Sorry, no I don't mean to rename or refactor this API. I'd like to understand why this function call ever produces a result of `false` - it seems to me that EndLabel and LastLabel should always be from the same section, and if they are, I think this function should never return false - so I think I'm misunderstanding something/don't understand where these labels are coming from such that they could end up being from different sections. I'd like to understand how that situation can arise before approving this patch - to understand better why this approach/test is suitable.

Harbormaster completed remote builds in B120423: Diff 367613.Aug 19 2021, 4:47 PM

kyulee added inline comments.Aug 19 2021, 9:03 PM

llvm/lib/MC/MCObjectStreamer.cpp
517–518 ↗	(On Diff #367079)	I'm not familiar with the whole logic, but I've traced a particular case below. DebugInfo/X86/cu-ranges.ll This test runs with `-function-sections` so it appears each function goes to each section (comdat?). I found `LastLabel` and `EndLabel` are in the same section for the last function only because `EndLabel` is set from the last function. This means all the prior functions go with the end section label because this symbol difference check fails. For the last function, `EndLabel` is actually the same offset to the end section label because each section has its own function only. So, I think this logic seems to cover this case correctly resulting in a valid line table. I'm not sure if there is a better way to guard this case.

Set EndLabel per Section. A compilation unit (CU) can have multiple sections (e.g., comdat) so track EndLabel per section.
This seems a way to avoid unnecessary symbol diff check because EndLabel and LastLabel would be in the same section.

Harbormaster completed remote builds in B120512: Diff 367733.Aug 20 2021, 2:02 AM

Handle DebugLineEntries and EndLabel per Section

kyulee added a child revision: D108531: [DebugInfo] Use EndLabel for AsmStreamer.Aug 22 2021, 7:27 PM

Harbormaster completed remote builds in B120724: Diff 368021.Aug 22 2021, 8:09 PM

@dblaikie Could you take another look? Thanks!

I'm a bit confused/unclear how this algorithm works/how this fixes things and whether it's the right direction. It'll be a while before I can find the time/brain bandwidth to really dig into this - so perhaps you can help me.

What I'm confused by is why multiple line tables would be unterminated & then end up needing to terminate them all/figure out which one to terminate? Instead I'd expect whenever another line table got an entry, the previously current line table entry would need to end - so there shouldn't be a need for the extra maps & maybe we can just keep track of the current "open" line table (whichever was the last one to emit an entry)?

In D108261#2966544, @dblaikie wrote:

I'm a bit confused/unclear how this algorithm works/how this fixes things and whether it's the right direction. It'll be a while before I can find the time/brain bandwidth to really dig into this - so perhaps you can help me.

What I'm confused by is why multiple line tables would be unterminated & then end up needing to terminate them all/figure out which one to terminate? Instead I'd expect whenever another line table got an entry, the previously current line table entry would need to end - so there shouldn't be a need for the extra maps & maybe we can just keep track of the current "open" line table (whichever was the last one to emit an entry)?

Here is my understanding. Basically a line table is emitted per compilation unit (CU). For simplicity, I consider two cases:

When multiple sections exist in a CU, we still want to emit them into a single line table (with multiple end_sequences per section).
In a LTO case, multiple CUs appear in a single (merged) object. In that case, we want to emit multiple line tables per CU.

So, this algorithm tries to track the end label of each function range per section in a CU. Likewise each line entries are already aggregated per section in a CU. This ensures emitting the end label per section per CU.
In case for asm parser path (instead of normal compilation path), those line entries/function range may not be emitted, so in that case (EndLabel = null), I fall back to the old logic that just uses the section end label conservatively.

This is two examples that this change tries to fix/improve.

Multiple sections in a CU case -- similar to Comdat (function per section).

void f2()
{
}
// The end of .text in CU line table

__attribute__((section(".s1")))
void f1()
{
}
// The end of .s1 in CU line table
__attribute__((nodebug,section(".s1")))
void f1nodebug()
{
}
// --> Before this change: The line table pointed to here because it's the end of the section .s1.

#if 0
// llvm-objdump -d t1.o
Disassembly of section .text:

0000000000000000 <f2>:
..
       5: c3                            retq

Disassembly of section .s1:

0000000000000000 <f1>:
..
       5: c3                            retq

0000000000000010 <f1nodebug>:
..
      15: c3                            retq


// llvm-dwarfdump --debugline t1.o
Address            Line   Column File   ISA Discriminator Flags
------------------ ------ ------ ------ --- ------------- -------------
0x0000000000000000      2      0      1   0             0  is_stmt
0x0000000000000004      3      1      1   0             0  is_stmt prologue_end
0x0000000000000006      3      1      1   0             0  is_stmt end_sequence
0x0000000000000000      8      0      1   0             0  is_stmt
0x0000000000000004      9      1      1   0             0  is_stmt prologue_end
0x0000000000000006      9      1      1   0             0  is_stmt end_sequence ---> Before this change, it pointed to offset 16, the end of the section .s1.
#endif

LTO case: Multiple CUs in a single section

// LTO object effectively has the view of the merged CUs.
// CU1 from t1.c
void t1()
{
}
// End of .text in CU1 line table

// CU2 from t2.c
void t2()
{
}
// End of .text in CU2 line table

#if 0
$ llvm-objdump -d lto.o
0000000000000000 <t1>:
...
       5: c3                            retq

0000000000000010 <t2>:
..
      15: c3                            retq

$ llvm-dwarfdump --debug-line lto.o
Address            Line   Column File   ISA Discriminator Flags
------------------ ------ ------ ------ --- ------------- -------------
0x0000000000000000      4      0      1   0             0  is_stmt
0x0000000000000004      5      1      1   0             0  is_stmt prologue_end
0x0000000000000006      5      1      1   0             0  is_stmt end_sequence --> Before this change, it pointed to offset 16 (the end of .text section).

Address            Line   Column File   ISA Discriminator Flags
------------------ ------ ------ ------ --- ------------- -------------
0x0000000000000010      4      0      1   0             0  is_stmt
0x0000000000000014      5      1      1   0             0  is_stmt prologue_end
0x0000000000000016      5      1      1   0             0  is_stmt end_sequence
#endif

BlakeLucchesi added a subscriber: BlakeLucchesi.Sep 4 2021, 12:41 PM

In D108261#2966644, @kyulee wrote:

In D108261#2966544, @dblaikie wrote:

I'm a bit confused/unclear how this algorithm works/how this fixes things and whether it's the right direction. It'll be a while before I can find the time/brain bandwidth to really dig into this - so perhaps you can help me.

What I'm confused by is why multiple line tables would be unterminated & then end up needing to terminate them all/figure out which one to terminate? Instead I'd expect whenever another line table got an entry, the previously current line table entry would need to end - so there shouldn't be a need for the extra maps & maybe we can just keep track of the current "open" line table (whichever was the last one to emit an entry)?

Here is my understanding. Basically a line table is emitted per compilation unit (CU). For simplicity, I consider two cases:

When multiple sections exist in a CU, we still want to emit them into a single line table (with multiple end_sequences per section).

In a LTO case, multiple CUs appear in a single (merged) object. In that case, we want to emit multiple line tables per CU.

^ Presumably multiple line tables per Module, one per CU.

So, this algorithm tries to track the end label of each function range per section in a CU. Likewise each line entries are already aggregated per section in a CU. This ensures emitting the end label per section per CU.
In case for asm parser path (instead of normal compilation path), those line entries/function range may not be emitted, so in that case (EndLabel = null), I fall back to the old logic that just uses the section end label conservatively.

Ah, I see, and you've in the "normal compilation path" the change here in DwarfDebug's endFunctionImpl to add an end label for the section. Wouldn't this have issues if you interleave functions from CUs in the same section - like CU1:func1:.text, then CU2:func2:.text, then CU1:func3:.text. (ah, right, I mentioned this in my first comment, but I'm feeling more like fixing the issue your seeing should involve the more general fix to that interleaved issue too)

Yeah, in that case you get CU1's line table covering the whole range, including func2, which isn't intended/desirable.

So I think this solution you have is incomplete & I'd say it's essentially the same bug - at least I think of it that way. But I guess it depends what sort of "invalid debug_line table in the final dsym" you're dealing with - they're all not great, but maybe there's a more severe invalidity in the end case?

I guess what I'm suggesting would probably still require that extra signal from DwarfDebug in the normal compilation path, but it could be more robust/address the interleaved issue which wouldn't be fixed by this approach.

What if whenever the section changes (in raw MC) or when a new line entry is emitted to another CU (or in DwarfDebug, if a nodebug function starts) - then emit an end_prologue to whatever the current open line table is? (have to keep track of that, I guess - "last emitted line table MCLineDivision")

That would miss some opportunities for shorter line table encodings (in the func1/2/3 scenario above, if func2 was in another section, then the line table for func1/3 wouldn't actually need to have two separate chunks - they could be contiguous) - so more advanced would be keeping "Last MCLineDivision per section". Terminate the last MCLineDivision whenever somethingr is emitted to that section that isn't the same division. DwarfDebug would have one extra bonus: If it starts a nodebug function, it could also pre-emptively terminate any current MCLineDivision.

I think that'd fix all these issues, probably?

In D108261#2995771, @dblaikie wrote:

In D108261#2966644, @kyulee wrote:

In D108261#2966544, @dblaikie wrote:

I'm a bit confused/unclear how this algorithm works/how this fixes things and whether it's the right direction. It'll be a while before I can find the time/brain bandwidth to really dig into this - so perhaps you can help me.

What I'm confused by is why multiple line tables would be unterminated & then end up needing to terminate them all/figure out which one to terminate? Instead I'd expect whenever another line table got an entry, the previously current line table entry would need to end - so there shouldn't be a need for the extra maps & maybe we can just keep track of the current "open" line table (whichever was the last one to emit an entry)?

Here is my understanding. Basically a line table is emitted per compilation unit (CU). For simplicity, I consider two cases:

When multiple sections exist in a CU, we still want to emit them into a single line table (with multiple end_sequences per section).

In a LTO case, multiple CUs appear in a single (merged) object. In that case, we want to emit multiple line tables per CU.

^ Presumably multiple line tables per Module, one per CU.

So, this algorithm tries to track the end label of each function range per section in a CU. Likewise each line entries are already aggregated per section in a CU. This ensures emitting the end label per section per CU.
In case for asm parser path (instead of normal compilation path), those line entries/function range may not be emitted, so in that case (EndLabel = null), I fall back to the old logic that just uses the section end label conservatively.

Ah, I see, and you've in the "normal compilation path" the change here in DwarfDebug's endFunctionImpl to add an end label for the section. Wouldn't this have issues if you interleave functions from CUs in the same section - like CU1:func1:.text, then CU2:func2:.text, then CU1:func3:.text. (ah, right, I mentioned this in my first comment, but I'm feeling more like fixing the issue your seeing should involve the more general fix to that interleaved issue too)

Yeah, in that case you get CU1's line table covering the whole range, including func2, which isn't intended/desirable.

So I think this solution you have is incomplete & I'd say it's essentially the same bug - at least I think of it that way. But I guess it depends what sort of "invalid debug_line table in the final dsym" you're dealing with - they're all not great, but maybe there's a more severe invalidity in the end case?

I guess what I'm suggesting would probably still require that extra signal from DwarfDebug in the normal compilation path, but it could be more robust/address the interleaved issue which wouldn't be fixed by this approach.

What if whenever the section changes (in raw MC) or when a new line entry is emitted to another CU (or in DwarfDebug, if a nodebug function starts) - then emit an end_prologue to whatever the current open line table is? (have to keep track of that, I guess - "last emitted line table MCLineDivision")

That would miss some opportunities for shorter line table encodings (in the func1/2/3 scenario above, if func2 was in another section, then the line table for func1/3 wouldn't actually need to have two separate chunks - they could be contiguous) - so more advanced would be keeping "Last MCLineDivision per section". Terminate the last MCLineDivision whenever somethingr is emitted to that section that isn't the same division. DwarfDebug would have one extra bonus: If it starts a nodebug function, it could also pre-emptively terminate any current MCLineDivision.

I think that'd fix all these issues, probably?

Thanks for comments! I roughly get your points but may need sometime to digest it.
I may misunderstand an important piece on your example -- CU1:func1:.text, then CU2:func2:.text, then CU1:func3:.text.
My understanding is CU is a compilation unit which seems to be processed sequentially (in LLVM). How could we interleave CUs like this? I thought we only interleave sections within a CU.
Does this interleaving come from a ThinLTO compilation? I would appreciate it if you can provide an example.

Thanks for comments! I roughly get your points but may need sometime to digest it.
I may misunderstand an important piece on your example -- CU1:func1:.text, then CU2:func2:.text, then CU1:func3:.text.
My understanding is CU is a compilation unit which seems to be processed sequentially (in LLVM).

Ah, that's not quite correct - a CU exists both by its existence in the cu list named module metadata, and by its reference from DISubprograms (which exist by reference from llvm::Functions). A CU is not processed atomically - some handling is done up front (at the start of the module, iterating the named module metadata list of CUs) and some is done at the end of the module - but between those, for all the DISubprograms, they're visited in the order of the llvm::Functions that reference them - so they can be arbitrarily interleaved.

How could we interleave CUs like this? I thought we only interleave sections within a CU.
Does this interleaving come from a ThinLTO compilation? I would appreciate it if you can provide an example.

I don't know that I have a concrete example that creates the interleaving I can reproduce by hand - but it's not an IR invariant that all Functions that reference CUs through DISubprograms must be contiguous for that CU.

To create such a situation by hand, create two files, let's say a.cpp and b.cpp and have two functions a1 and a2 in a.cpp and main in b.cpp, have main call a1 and a2. Compile to IR, link the IR, hand-modify the IR to reorder the functions so it goes a1, then main, then a2 - then you should be able to reproduce/see the strange situation where the line table for a.cpp continues/extends over main/b.cpp.

Any progress on this? This causes serious debugging issues.

Incorporated the feedback.

Bookkeep the current line table for CU so that when a line entry needs to be emitted for different CU, terminate the current line table by injecting an end sequence.
Didn't break this on section change, becuase the line table is already being tracked per section.
Still didn't handle nodebug function case -- no debug function may span debug functions in the line table. Terminating the line table was a bit tricky becuse no debug function basically does not have a reference line entry. But I think this impreciion is less concern in practice.

Harbormaster completed remote builds in B133393: Diff 386026.Nov 9 2021, 6:47 PM

Fix the test

Harbormaster completed remote builds in B133418: Diff 386061.Nov 9 2021, 10:49 PM

Rebase

Harbormaster completed remote builds in B133421: Diff 386064.Nov 9 2021, 11:28 PM

@dblaikie Can you take a look again when you're available? Thanks!

In D108261#3124854, @kyulee wrote:

@dblaikie Can you take a look again when you're available? Thanks!

Yep, it's on my list.

Rather than trying to do this at the MCDwarf/streamer level - what if we did this in the AsmPrinter/DwarfDebug level? It could call some kind of MC function to terminate the line table because it knows when to do so? (it'd terminate basically whenever DwarfDebug was about to change the value of its PrevCU member (so in DwarfCompileUnit::addRange/DwarfDebug::setPrevCU and in DwarfDebug::skippedNonDebugFunction))

Incorporate the feedback

Terminate the line table when switching CU in DwarfDebug.
Update the test since this now handles nodebug case correctly.

Harbormaster completed remote builds in B133865: Diff 386726.Nov 11 2021, 10:12 PM

dblaikie added inline comments.Nov 13 2021, 9:39 AM

llvm/include/llvm/MC/MCDwarf.h
221	When does this case come up? I think all of this would only happen when PrevCU was non-null/had been populated with some content first, right?
llvm/lib/CodeGen/AsmPrinter/DwarfDebug.cpp
2180	presumably this function shouldn't be called if PrevCU == NewCU, right? (maybe that could be asserted, rather than tested)
2184	Does is this empty? I /think/ that "PrevCU" is only set when PrevCU has already been populated with some content, right?
llvm/lib/MC/MCDwarf.cpp
245–250	Would be nice if we could eliminate the need for this case - since any time this gets used it risks being because a line table was allowed to "flow off the end" further than it should. What would it be like if we handled the line table similar to the way ranges are handled? Always terminated at the end of each function, but then extended if the next function happens to start immediately after the last entry ended?

kyulee added inline comments.Nov 13 2021, 10:24 AM

llvm/include/llvm/MC/MCDwarf.h
221	Correct. I changed it to an assert.
llvm/lib/CodeGen/AsmPrinter/DwarfDebug.cpp
2180	I checked this condition at the call-site and updated the comments.
2184	Yeah. I deleted the check.

Update changes per feedbacks.

kyulee added inline comments.Nov 13 2021, 10:43 AM

llvm/lib/MC/MCDwarf.cpp
245–250	Yeah. That sounds great, but it seems similar to what I initially tried, which tracked the end label of function as functions were added. But I wonder whether there is a case where the end label of the prior function is not matched with the start label of the next function due to some alignment, etc? In that case, do we want to terminate the line table for the prior function because of mismatch? I guess that seems an overkill.

Harbormaster completed remote builds in B134092: Diff 387035.Nov 13 2021, 10:58 AM

Fix the test failures by restoring PrevCU check instead of assertion.

Line table entrty insertion is independent of PrevCU creation.

kyulee added inline comments.Nov 13 2021, 11:55 AM

llvm/include/llvm/MC/MCDwarf.h
221	It turned out that there is the case where the line entry is empty in PrevCU. So I restored it with this check.

kyulee added inline comments.Nov 13 2021, 12:05 PM

llvm/lib/MC/MCDwarf.cpp
245–250	From the second thought, `addLineEntry` is used during function emission (which is independent of Range operation) while `Range` is added at the end of function. It doesn't seem to work to extend the line table using the end of function range.

Harbormaster completed remote builds in B134098: Diff 387041.Nov 13 2021, 12:20 PM

dblaikie added inline comments.Nov 13 2021, 3:23 PM

llvm/lib/MC/MCDwarf.cpp
196–198	When does this occur? I guess when the last function with debug info in this section is followed by a function in another section (with or without debug info) or a nodebug function? That does seem a bit subtle and like it'd be nicer if this API's invariant was just that all callers terminated their own lists. (this would involve fixing up the GenDwarfForAssembly codepath to do that too, I'd guess) For DwarfDebug/etc I guess this would be addressed by calling terminateLineTableForPrevCU (or perhaps resetPrevCU) in finishModule or whatever it's called?
245–250	Yeah, though I was thinking more driven by the DwarfDebug code so it's not tracked in two different places that might diverge/reduce the code/logic needed to handle these cases. I think alignment doesn't break this strategy (it doesn't seem to break it for the ranges data using this approach) - though the alignment does come between the start of one function and the beginning of the next function - the code in DwarfDebug/DwarfCompileUnit/etc extends the CU ranges to cover that & hopefully the line table could be powered by the same logic.

kyulee added inline comments.Nov 13 2021, 4:39 PM

llvm/lib/MC/MCDwarf.cpp
196–198	I can see we can terminate the line table for DwarfDebug in finishModule via `resetPrevCU` -- this will use an end label of range. However, for assembly path, I don't think we have explicit labels/symbols to terminate. I may introduce an api to patch the line table for all sections that do not have an end entry at the end. But in that case, I still need to synthesize/emit a label using section symbol like `MCStreamer::endSection`. It's just moving the place when we fill this gap -- either here or before coming here.

Ensure the line table is terminated in DwarfDebug (compiler) path.
Still preserve the old code to synthesize the end entry for the assembly path.

Harbormaster completed remote builds in B134108: Diff 387053.Nov 13 2021, 6:44 PM

Looks roughly right - one trailing question I wouldn't mind knowing the answer to (might merit a comment so it can be cleaned up later) - and also, maybe worth adding an assembly test case. Something like this:

.text
.file   1 "small.c"
.loc    1 1 0
nop
.section .text.2
.loc    1 2 0
nop
.text
nop

Showing that line table locations in assembly do flow on to the next chunk of a section even if there's an intermediate section switch. So unlike with the DwarfDebug case, which can end the line table after the function/whenever switching compilation units - the assembly mode might have any number of outstanding sections that need to be terminated at the end. (I guess in theory we could optimize the IR/DwarfDebug case slightly by keeping some of these open - but I don't think it likely in practice & seems cleaner with what this patch does now)

llvm/include/llvm/MC/MCDwarf.h
221	When does that case arise? (maybe when a function is zero-length/has no instructions? That's something I/we would like to fix at some point, FWIW)
llvm/lib/CodeGen/AsmPrinter/DwarfCompileUnit.cpp
497	Unrelated change - could commit this separately.

Terminate the line table aligning with Range creation:

When a new range is formed due to section or CU change
When nodebug function starts
When module is finished

From the above, I think the line table is well-formed in DwarfDebug (normal compiler path).
No need specialization for asm streamer or object stremer to end the line table.

However, the MC path seems to still terminate using the section end symbol conservatively,
because we do not track the range like the DwarfDebug path as above.
So, I still preserve emitDwarfLineEndEntry when no end entry is emitted in the loop.

kyulee added a reviewer: shchenz.Nov 14 2021, 9:46 AM

Remove a white-space diff

dblaikie added inline comments.Nov 14 2021, 10:08 AM

llvm/lib/MC/MCDwarf.cpp
148	When does this situation come up? (I think this question got lost from its previous place here: https://reviews.llvm.org/D108261#inline-1086075 during refactoring) (if it is necessary, it might be suitable to use `find` on the `MCLineDivisions` rather than `operator[]` (so this call doesn't create an entry when it's empty/unused))

kyulee added inline comments.Nov 14 2021, 10:13 AM

llvm/lib/MC/MCDwarf.cpp
148	Even though we have a range, we may not have any debug line entry in unit tests. Makes sense using `find`. Will update it with additional assembly test.

dblaikie added inline comments.Nov 14 2021, 10:17 AM

llvm/lib/MC/MCDwarf.cpp
148	Seems like we shouldn't be adding extra API surface area only for unit tests, though? But it's possible this comes up in real cases of empty functions - if that's the case (worth testing/validating), leaving a FIXME for when we remove/fix the existence of empty functions would be good.

Harbormaster completed remote builds in B134148: Diff 387103.Nov 14 2021, 11:03 AM

kyulee edited the summary of this revision. (Show Details)Nov 14 2021, 12:46 PM

Add an assembly test to cover the MC path that still emits the end entry using the section end symbol.
Add a check for addEndEntry before adding an end entry. The MCStreamer path may not populate the line entry.

Harbormaster completed remote builds in B134167: Diff 387127.Nov 14 2021, 1:30 PM

dblaikie added inline comments.Nov 14 2021, 1:34 PM

llvm/lib/MC/MCDwarf.cpp
148	find is preferred over count - so that the result can be used rather than a duplicate lookup: auto I = MCLineDivisions.find(Sec); if (I != MCLineDivisions.end()) { auto &Entries = I->second; ... } Though I'm still curious to better understand under what conditions this is needed.
llvm/test/DebugInfo/XCOFF/empty.ll
230	Hmm, what aspect of the change caused these labels to change name?

kyulee added inline comments.Nov 14 2021, 1:45 PM

llvm/lib/MC/MCDwarf.cpp
148	It turned out the MCAsmStreamer path (as opposed to MCObjectStreamer) typically doesn't populate the line entry. So, I keep this check to bail-out cloning/adding the end entry in that case. Or, hundreds of LIT tests failed.
148	Yea. `find` seems better since I need to use the map anyhow inside. Will update it. As commented above in the code, the assembly output path (MCAsmStreamer as opposed to MCObjectStreamer), the line entry is not added instead .loc directives are emitted during the function emission. So, the line entries are often irrelevant in the assembly output except a certain target -- I guess XCOFF shown below in the tests. I was thinking to check streamer and specialize this logic outside this, but not sure exactly how to do so.
llvm/test/DebugInfo/XCOFF/empty.ll
230	I think this XCOFF seems a unique path that generates the line table for the assembly output in DwarfDebug. So, the new logic is still kicked in, which adds an entry based on the range end label (instead of the section end label in the fall-back path). I think the range for function uses function labels, so that's why the change happens. Although this is the same in this unit tests, in theory, I think this new change is more precise.

Use find instead of count for a check in addEndEntry.

Harbormaster completed remote builds in B134169: Diff 387129.Nov 14 2021, 2:26 PM

dblaikie added inline comments.Nov 14 2021, 4:13 PM

llvm/lib/MC/MCDwarf.cpp
148	Ah, OK! Right - the solution to that would be to have a virtual function in MCStreamer that's differently implemented (a no-op in the asm streamer case - maybe someone'll eventually extend the assembly syntax to support terminating line contributions - so nodebug could be properly done in assembly) this would be side-by-side with the MCStreamer::emitDwarfLocDirective - or, perhaps it could use that specific function, with an extra parameter or flag value for "end entry"?

kyulee added inline comments.Nov 14 2021, 5:41 PM

llvm/lib/MC/MCDwarf.cpp
148	Yea. I tried to create `emitTerminateLineTable` in MCStreamer and differently extends it for MCAsmStreamer, which resolved the most cases. But I'm still seeing dozens of failures in unit tests which are probably because of incomplete debug data in the test cases that are manually synthesized. For instance, `DebugInfo/X86/multiple-at-const-val.ll`, it was like define i32 @main() !dbg !960 { entry: %call1.i = tail call %"class.std::basic_ostream"* @test(%"class.std::basic_ostream"* @_ZSt4cout, i8* getelementptr inbounds ([6 x i8], [6 x i8]* @.str, i64 0, i64 0), i64 5) ret i32 0 } This means the function appears to have a debug info, but there is no tag like `DILocation` for each line. I presume although the above is unlikely the real case from the compiler, but I think this seems a valid case (opt or any transformation may drop debug tag) so we should cope with. Having said that, I think this check seems worth being kept -- then I don't see the reason above to specialize streamer just for this.

Thanks for all the work/iteration/research here - sorry it was a bit fussy.

llvm/lib/MC/MCDwarf.cpp
148	Ah, hmm - yeah, there's certainly ways we could arrive at a function without any instructions having a location. Though I wonder what the line table should look like for that? Not having the line table cover those instructions seems a bit weird too. But if that's the state of things today, might as well leave ti as-is. Could you include a comment describing how that occurs with the MCObjectStreamer use case as well as the one outlined with MCAsmStreamer?

This revision is now accepted and ready to land.Nov 14 2021, 6:05 PM

In D108261#3130364, @dblaikie wrote:

Thanks for all the work/iteration/research here - sorry it was a bit fussy.

Thanks for the thorough review! I had a chance to dive a bit in this area.
Indeed, this debug world seems subtle to make it right.

llvm/lib/MC/MCDwarf.cpp
148	For the above case where instructions have no location at all, the line entries in the table didn't appear in dwarf although other contents appeared. Yeah. Will update the comments to include two cases (MCAsmStreamer and MCObjectStreamer).

Update the comments in addEndEntry for the skipping reasons.

Fix a typo in the comment

kyulee edited the summary of this revision. (Show Details)Nov 14 2021, 7:37 PM

Harbormaster completed remote builds in B134187: Diff 387150.Nov 14 2021, 8:14 PM

Closed by commit rG6747d44bda8c: [DebugInfo] Fix end_sequence of debug_line in LTO Object (authored by kyulee). · Explain WhyNov 14 2021, 8:25 PM

This revision was automatically updated to reflect the committed changes.

kyulee added a commit: rG6747d44bda8c: [DebugInfo] Fix end_sequence of debug_line in LTO Object.

kyulee mentioned this in D113870: [DebugInfo] Fix Test Targets in D108261.Nov 14 2021, 8:56 PM

kyulee mentioned this in rG0d1d05854444: [DebugInfo] Fix Test Targets in D108261.Nov 14 2021, 9:36 PM

Revision Contents

Path

Size

llvm/

include/

llvm/

MC/

MCDwarf.h

13 lines

lib/

CodeGen/

AsmPrinter/

DwarfCompileUnit.cpp

6 lines

DwarfDebug.h

3 lines

DwarfDebug.cpp

16 lines

MC/

MCDwarf.cpp

59 lines

test/

DebugInfo/

XCOFF/

empty.ll

8 lines

explicit-section.ll

8 lines

function-sections.ll

8 lines

debugline-endsequence.ll

61 lines

debugline-endsequence.s

19 lines

Diff 387151

llvm/include/llvm/MC/MCDwarf.h

	Show First 20 Lines • Show All 182 Lines • ▼ Show 20 Lines

	public:			public:
	// Constructor to create an MCDwarfLineEntry given a symbol and the dwarf loc.			// Constructor to create an MCDwarfLineEntry given a symbol and the dwarf loc.
	MCDwarfLineEntry(MCSymbol *label, const MCDwarfLoc loc)			MCDwarfLineEntry(MCSymbol *label, const MCDwarfLoc loc)
	: MCDwarfLoc(loc), Label(label) {}			: MCDwarfLoc(loc), Label(label) {}

	MCSymbol *getLabel() const { return Label; }			MCSymbol *getLabel() const { return Label; }

				// This indicates the line entry is synthesized for an end entry.
				bool IsEndEntry = false;

				// Override the label with the given EndLabel.
				void setEndLabel(MCSymbol *EndLabel) {
				Label = EndLabel;
				IsEndEntry = true;
				}

	// This is called when an instruction is assembled into the specified			// This is called when an instruction is assembled into the specified
	// section and if there is information from the last .loc directive that			// section and if there is information from the last .loc directive that
	// has yet to have a line entry made for it is made.			// has yet to have a line entry made for it is made.
	static void make(MCStreamer MCOS, MCSection Section);			static void make(MCStreamer MCOS, MCSection Section);
	};			};

	/// Instances of this class represent the line information for a compile			/// Instances of this class represent the line information for a compile
	/// unit where machine instructions have been assembled after seeing .loc			/// unit where machine instructions have been assembled after seeing .loc
	/// directives. This is the information used to build the dwarf line			/// directives. This is the information used to build the dwarf line
	/// table for a section.			/// table for a section.
	class MCLineSection {			class MCLineSection {
	public:			public:
	// Add an entry to this MCLineSection's line entries.			// Add an entry to this MCLineSection's line entries.
	void addLineEntry(const MCDwarfLineEntry &LineEntry, MCSection *Sec) {			void addLineEntry(const MCDwarfLineEntry &LineEntry, MCSection *Sec) {
	MCLineDivisions[Sec].push_back(LineEntry);			MCLineDivisions[Sec].push_back(LineEntry);
	}			}

				// Add an end entry by cloning the last entry, if exists, for the section
				// the given EndLabel belongs to. The label is replaced by the given EndLabel.
				void addEndEntry(MCSymbol *EndLabel);

	using MCDwarfLineEntryCollection = std::vector<MCDwarfLineEntry>;			using MCDwarfLineEntryCollection = std::vector<MCDwarfLineEntry>;
				dblaikieUnsubmitted Not Done Reply Inline Actions When does this case come up? I think all of this would only happen when PrevCU was non-null/had been populated with some content first, right? dblaikie: When does this case come up? I think all of this would only happen when PrevCU was non-null/had…
				kyuleeAuthorUnsubmitted Done Reply Inline Actions Correct. I changed it to an assert. kyulee: Correct. I changed it to an assert.
				kyuleeAuthorUnsubmitted Done Reply Inline Actions It turned out that there is the case where the line entry is empty in PrevCU. So I restored it with this check. kyulee: It turned out that there is the case where the line entry is empty in PrevCU. So I restored it…
				dblaikieUnsubmitted Not Done Reply Inline Actions When does that case arise? (maybe when a function is zero-length/has no instructions? That's something I/we would like to fix at some point, FWIW) dblaikie: When does that case arise? (maybe when a function is zero-length/has no instructions? That's…
	using iterator = MCDwarfLineEntryCollection::iterator;			using iterator = MCDwarfLineEntryCollection::iterator;
	using const_iterator = MCDwarfLineEntryCollection::const_iterator;			using const_iterator = MCDwarfLineEntryCollection::const_iterator;
	using MCLineDivisionMap = MapVector<MCSection *, MCDwarfLineEntryCollection>;			using MCLineDivisionMap = MapVector<MCSection *, MCDwarfLineEntryCollection>;

	private:			private:
	// A collection of MCDwarfLineEntry for each section.			// A collection of MCDwarfLineEntry for each section.
	MCLineDivisionMap MCLineDivisions;			MCLineDivisionMap MCLineDivisions;

	▲ Show 20 Lines • Show All 475 Lines • Show Last 20 Lines

llvm/lib/CodeGen/AsmPrinter/DwarfCompileUnit.cpp

Show First 20 Lines • Show All 361 Lines • ▼ Show 20 Lines	DIE *DwarfCompileUnit::getOrCreateCommonBlock(
if (DIGlobalVariable *V = CB->getDecl())		if (DIGlobalVariable *V = CB->getDecl())
getCU().addLocationAttribute(&NDie, V, GlobalExprs);		getCU().addLocationAttribute(&NDie, V, GlobalExprs);
return &NDie;		return &NDie;
}		}

void DwarfCompileUnit::addRange(RangeSpan Range) {		void DwarfCompileUnit::addRange(RangeSpan Range) {
DD->insertSectionLabel(Range.Begin);		DD->insertSectionLabel(Range.Begin);

bool SameAsPrevCU = this == DD->getPrevCU();		auto *PrevCU = DD->getPrevCU();
		bool SameAsPrevCU = this == PrevCU;
DD->setPrevCU(this);		DD->setPrevCU(this);
// If we have no current ranges just add the range and return, otherwise,		// If we have no current ranges just add the range and return, otherwise,
// check the current section and CU against the previous section and CU we		// check the current section and CU against the previous section and CU we
// emitted into and the subprogram was contained within. If these are the		// emitted into and the subprogram was contained within. If these are the
// same then extend our current range, otherwise add this as a new range.		// same then extend our current range, otherwise add this as a new range.
if (CURanges.empty() \|\| !SameAsPrevCU \|\|		if (CURanges.empty() \|\| !SameAsPrevCU \|\|
(&CURanges.back().End->getSection() !=		(&CURanges.back().End->getSection() !=
&Range.End->getSection())) {		&Range.End->getSection())) {
		// Before a new range is added, always terminate the prior line table.
		if (PrevCU)
		DD->terminateLineTable(PrevCU);
CURanges.push_back(Range);		CURanges.push_back(Range);
return;		return;
}		}

CURanges.back().End = Range.End;		CURanges.back().End = Range.End;
}		}

void DwarfCompileUnit::initStmtList() {		void DwarfCompileUnit::initStmtList() {
▲ Show 20 Lines • Show All 98 Lines • ▼ Show 20 Lines	case TargetFrameLowering::DwarfFrameBase::WasmFrameBase: {
addUInt(*Loc, dwarf::DW_FORM_data1, dwarf::DW_OP_WASM_location);		addUInt(*Loc, dwarf::DW_FORM_data1, dwarf::DW_OP_WASM_location);
addSInt(*Loc, dwarf::DW_FORM_sdata, TI_GLOBAL_RELOC);		addSInt(*Loc, dwarf::DW_FORM_sdata, TI_GLOBAL_RELOC);
if (!isDwoUnit()) {		if (!isDwoUnit()) {
addLabel(*Loc, dwarf::DW_FORM_data4, SPSym);		addLabel(*Loc, dwarf::DW_FORM_data4, SPSym);
} else {		} else {
// FIXME: when writing dwo, we need to avoid relocations. Probably		// FIXME: when writing dwo, we need to avoid relocations. Probably
// the "right" solution is to treat globals the way func and data		// the "right" solution is to treat globals the way func and data
// symbols are (with entries in .debug_addr).		// symbols are (with entries in .debug_addr).
// For now, since we only ever use index 0, this should work as-is.		// For now, since we only ever use index 0, this should work as-is.
		dblaikieUnsubmitted Not Done Reply Inline Actions Unrelated change - could commit this separately. dblaikie: Unrelated change - could commit this separately.
addUInt(*Loc, dwarf::DW_FORM_data4, FrameBase.Location.WasmLoc.Index);		addUInt(*Loc, dwarf::DW_FORM_data4, FrameBase.Location.WasmLoc.Index);
}		}
addUInt(*Loc, dwarf::DW_FORM_data1, dwarf::DW_OP_stack_value);		addUInt(*Loc, dwarf::DW_FORM_data1, dwarf::DW_OP_stack_value);
addBlock(*SPDie, dwarf::DW_AT_frame_base, Loc);		addBlock(*SPDie, dwarf::DW_AT_frame_base, Loc);
} else {		} else {
DIELoc *Loc = new (DIEValueAllocator) DIELoc;		DIELoc *Loc = new (DIEValueAllocator) DIELoc;
DIEDwarfExpression DwarfExpr(Asm, this, *Loc);		DIEDwarfExpression DwarfExpr(Asm, this, *Loc);
DIExpressionCursor Cursor({});		DIExpressionCursor Cursor({});
▲ Show 20 Lines • Show All 1,088 Lines • Show Last 20 Lines

llvm/lib/CodeGen/AsmPrinter/DwarfDebug.h

Show First 20 Lines • Show All 775 Lines • ▼ Show 20 Lines	public:
/// * DW_FORM_data8 for 64-bit DWARFv3;		/// * DW_FORM_data8 for 64-bit DWARFv3;
/// * DW_FORM_data4 for 32-bit DWARFv3 and DWARFv2.		/// * DW_FORM_data4 for 32-bit DWARFv3 and DWARFv2.
dwarf::Form getDwarfSectionOffsetForm() const;		dwarf::Form getDwarfSectionOffsetForm() const;

/// Returns the previous CU that was being updated		/// Returns the previous CU that was being updated
const DwarfCompileUnit *getPrevCU() const { return PrevCU; }		const DwarfCompileUnit *getPrevCU() const { return PrevCU; }
void setPrevCU(const DwarfCompileUnit *PrevCU) { this->PrevCU = PrevCU; }		void setPrevCU(const DwarfCompileUnit *PrevCU) { this->PrevCU = PrevCU; }

		/// Terminate the line table by adding the last range label.
		void terminateLineTable(const DwarfCompileUnit *CU);

/// Returns the entries for the .debug_loc section.		/// Returns the entries for the .debug_loc section.
const DebugLocStream &getDebugLocs() const { return DebugLocs; }		const DebugLocStream &getDebugLocs() const { return DebugLocs; }

/// Emit an entry for the debug loc section. This can be used to		/// Emit an entry for the debug loc section. This can be used to
/// handle an entry that's going to be emitted into the debug loc section.		/// handle an entry that's going to be emitted into the debug loc section.
void emitDebugLocEntry(ByteStreamer &Streamer,		void emitDebugLocEntry(ByteStreamer &Streamer,
const DebugLocStream::Entry &Entry,		const DebugLocStream::Entry &Entry,
const DwarfCompileUnit *CU);		const DwarfCompileUnit *CU);
▲ Show 20 Lines • Show All 67 Lines • Show Last 20 Lines

llvm/lib/CodeGen/AsmPrinter/DwarfDebug.cpp

Show First 20 Lines • Show All 1,401 Lines • ▼ Show 20 Lines	void DwarfDebug::finalizeModuleInfo() {
// Compute DIE offsets and sizes.		// Compute DIE offsets and sizes.
InfoHolder.computeSizeAndOffsets();		InfoHolder.computeSizeAndOffsets();
if (useSplitDwarf())		if (useSplitDwarf())
SkeletonHolder.computeSizeAndOffsets();		SkeletonHolder.computeSizeAndOffsets();
}		}

// Emit all Dwarf sections that should come after the content.		// Emit all Dwarf sections that should come after the content.
void DwarfDebug::endModule() {		void DwarfDebug::endModule() {
		// Terminate the pending line table.
		if (PrevCU)
		terminateLineTable(PrevCU);
		PrevCU = nullptr;
assert(CurFn == nullptr);		assert(CurFn == nullptr);
assert(CurMI == nullptr);		assert(CurMI == nullptr);

for (const auto &P : CUMap) {		for (const auto &P : CUMap) {
auto &CU = *P.second;		auto &CU = *P.second;
CU.createBaseTypeDIEs();		CU.createBaseTypeDIEs();
}		}

▲ Show 20 Lines • Show All 737 Lines • ▼ Show 20 Lines	void DwarfDebug::beginFunctionImpl(const MachineFunction *MF) {
Asm->OutStreamer->getContext().setDwarfCompileUnitID(		Asm->OutStreamer->getContext().setDwarfCompileUnitID(
getDwarfCompileUnitIDForLineTable(CU));		getDwarfCompileUnitIDForLineTable(CU));

// Record beginning of function.		// Record beginning of function.
PrologEndLoc = emitInitialLocDirective(		PrologEndLoc = emitInitialLocDirective(
*MF, Asm->OutStreamer->getContext().getDwarfCompileUnitID());		*MF, Asm->OutStreamer->getContext().getDwarfCompileUnitID());
}		}

unsigned		unsigned
DwarfDebug::getDwarfCompileUnitIDForLineTable(const DwarfCompileUnit &CU) {		DwarfDebug::getDwarfCompileUnitIDForLineTable(const DwarfCompileUnit &CU) {
// Set DwarfDwarfCompileUnitID in MCContext to the Compile Unit this function		// Set DwarfDwarfCompileUnitID in MCContext to the Compile Unit this function
		dblaikieUnsubmitted Done Reply Inline Actions What's this refactoring for? Looks like it causes duplicate lookups of the DwarfCompileUnit, that might be nice to avoid/not introduce? (if it is necessary/preferred for some reason, might be worth doing separately from the rest of this patch?) (both callers already have the DwarfCompileUnit available - perhaps this function should take that instead of the MachineFunction? that way it could avoid the extra lookups) dblaikie: What's this refactoring for? Looks like it causes duplicate lookups of the DwarfCompileUnit…
// belongs to so that we add to the correct per-cu line table in the		// belongs to so that we add to the correct per-cu line table in the
// non-asm case.		// non-asm case.
if (Asm->OutStreamer->hasRawTextSupport())		if (Asm->OutStreamer->hasRawTextSupport())
// Use a single line table if we are generating assembly.		// Use a single line table if we are generating assembly.
return 0;		return 0;
else		else
return CU.getUniqueID();		return CU.getUniqueID();
}		}

		void DwarfDebug::terminateLineTable(const DwarfCompileUnit *CU) {
		const auto &CURanges = CU->getRanges();
		dblaikieUnsubmitted Not Done Reply Inline Actions presumably this function shouldn't be called if PrevCU == NewCU, right? (maybe that could be asserted, rather than tested) dblaikie: presumably this function shouldn't be called if PrevCU == NewCU, right? (maybe that could be…
		kyuleeAuthorUnsubmitted Done Reply Inline Actions I checked this condition at the call-site and updated the comments. kyulee: I checked this condition at the call-site and updated the comments.
		auto &LineTable = Asm->OutStreamer->getContext().getMCDwarfLineTable(
		getDwarfCompileUnitIDForLineTable(*CU));
		// Add the last range label for the given CU.
		LineTable.getMCLineSections().addEndEntry(
		dblaikieUnsubmitted Not Done Reply Inline Actions Does is this empty? I /think/ that "PrevCU" is only set when PrevCU has already been populated with some content, right? dblaikie: Does is this empty? I /think/ that "PrevCU" is only set when PrevCU has already been populated…
		kyuleeAuthorUnsubmitted Done Reply Inline Actions Yeah. I deleted the check. kyulee: Yeah. I deleted the check.
		const_cast<MCSymbol *>(CURanges.back().End));
		}

void DwarfDebug::skippedNonDebugFunction() {		void DwarfDebug::skippedNonDebugFunction() {
// If we don't have a subprogram for this function then there will be a hole		// If we don't have a subprogram for this function then there will be a hole
// in the range information. Keep note of this by setting the previously used		// in the range information. Keep note of this by setting the previously used
// section to nullptr.		// section to nullptr.
		// Terminate the pending line table.
		if (PrevCU)
		terminateLineTable(PrevCU);
PrevCU = nullptr;		PrevCU = nullptr;
CurFn = nullptr;		CurFn = nullptr;
}		}

// Gather and emit post-function debug information.		// Gather and emit post-function debug information.
void DwarfDebug::endFunctionImpl(const MachineFunction *MF) {		void DwarfDebug::endFunctionImpl(const MachineFunction *MF) {
const DISubprogram *SP = MF->getFunction().getSubprogram();		const DISubprogram *SP = MF->getFunction().getSubprogram();

Show All 15 Lines	void DwarfDebug::endFunctionImpl(const MachineFunction *MF) {
DenseSet<InlinedEntity> Processed;		DenseSet<InlinedEntity> Processed;
collectEntityInfo(TheCU, SP, Processed);		collectEntityInfo(TheCU, SP, Processed);

// Add the range of this function to the list of ranges for the CU.		// Add the range of this function to the list of ranges for the CU.
// With basic block sections, add ranges for all basic block sections.		// With basic block sections, add ranges for all basic block sections.
for (const auto &R : Asm->MBBSectionRanges)		for (const auto &R : Asm->MBBSectionRanges)
TheCU.addRange({R.second.BeginLabel, R.second.EndLabel});		TheCU.addRange({R.second.BeginLabel, R.second.EndLabel});

// Under -gmlt, skip building the subprogram if there are no inlined		// Under -gmlt, skip building the subprogram if there are no inlined
// subroutines inside it. But with -fdebug-info-for-profiling, the subprogram		// subroutines inside it. But with -fdebug-info-for-profiling, the subprogram
// is still needed as we need its source location.		// is still needed as we need its source location.
if (!TheCU.getCUNode()->getDebugInfoForProfiling() &&		if (!TheCU.getCUNode()->getDebugInfoForProfiling() &&
TheCU.getCUNode()->getEmissionKind() == DICompileUnit::LineTablesOnly &&		TheCU.getCUNode()->getEmissionKind() == DICompileUnit::LineTablesOnly &&
		dblaikieUnsubmitted Done Reply Inline Actions Perhaps this could be done once outside the loop? // MBBSectionRanges is always non-empty at this point, right? (could assert just in case, but I think it's a reasonable invariant) Asm->OutStreamer->getContext() .getMCDwarfLineTable(getDwarfCompileUnitID(MF)) .getMCLineSections() .updateEndLabel(Asm->MBBSectionRanges.back().second.EndLabel); dblaikie: Perhaps this could be done once outside the loop? ``` // MBBSectionRanges is always non-empty…
		clayborgUnsubmitted Done Reply Inline Actions Are the "Asm->MBBSectionRanges" sorted? If not, do we need to find the highest EndLabel and only add that one? clayborg: Are the "Asm->MBBSectionRanges" sorted? If not, do we need to find the highest EndLabel and…
LScopes.getAbstractScopesList().empty() && !IsDarwin) {		LScopes.getAbstractScopesList().empty() && !IsDarwin) {
assert(InfoHolder.getScopeVariables().empty());		assert(InfoHolder.getScopeVariables().empty());
PrevLabel = nullptr;		PrevLabel = nullptr;
CurFn = nullptr;		CurFn = nullptr;
return;		return;
}		}

#ifndef NDEBUG		#ifndef NDEBUG
▲ Show 20 Lines • Show All 1,326 Lines • Show Last 20 Lines

llvm/lib/MC/MCDwarf.cpp

Show First 20 Lines • Show All 135 Lines • ▼ Show 20 Lines
makeStartPlusIntExpr(MCContext &Ctx, const MCSymbol &Start, int IntVal) {		makeStartPlusIntExpr(MCContext &Ctx, const MCSymbol &Start, int IntVal) {
MCSymbolRefExpr::VariantKind Variant = MCSymbolRefExpr::VK_None;		MCSymbolRefExpr::VariantKind Variant = MCSymbolRefExpr::VK_None;
const MCExpr *LHS = MCSymbolRefExpr::create(&Start, Variant, Ctx);		const MCExpr *LHS = MCSymbolRefExpr::create(&Start, Variant, Ctx);
const MCExpr *RHS = MCConstantExpr::create(IntVal, Ctx);		const MCExpr *RHS = MCConstantExpr::create(IntVal, Ctx);
const MCExpr *Res = MCBinaryExpr::create(MCBinaryExpr::Add, LHS, RHS, Ctx);		const MCExpr *Res = MCBinaryExpr::create(MCBinaryExpr::Add, LHS, RHS, Ctx);
return Res;		return Res;
}		}

		void MCLineSection::addEndEntry(MCSymbol *EndLabel) {
		auto *Sec = &EndLabel->getSection();
		// The line table may be empty, which we should skip adding an end entry.
		// There are two cases:
		// (1) MCAsmStreamer - emitDwarfLocDirective emits a location directive in
		dblaikieUnsubmitted Not Done Reply Inline Actions When does this situation come up? (I think this question got lost from its previous place here: https://reviews.llvm.org/D108261#inline-1086075 during refactoring) (if it is necessary, it might be suitable to use `find` on the `MCLineDivisions` rather than `operator[]` (so this call doesn't create an entry when it's empty/unused)) dblaikie: When does this situation come up? (I think this question got lost from its previous place here…
		kyuleeAuthorUnsubmitted Done Reply Inline Actions Even though we have a range, we may not have any debug line entry in unit tests. Makes sense using `find`. Will update it with additional assembly test. kyulee: Even though we have a range, we may not have any debug line entry in unit tests. Makes sense…
		dblaikieUnsubmitted Not Done Reply Inline Actions Seems like we shouldn't be adding extra API surface area only for unit tests, though? But it's possible this comes up in real cases of empty functions - if that's the case (worth testing/validating), leaving a FIXME for when we remove/fix the existence of empty functions would be good. dblaikie: Seems like we shouldn't be adding extra API surface area only for unit tests, though? But it's…
		kyuleeAuthorUnsubmitted Done Reply Inline Actions It turned out the MCAsmStreamer path (as opposed to MCObjectStreamer) typically doesn't populate the line entry. So, I keep this check to bail-out cloning/adding the end entry in that case. Or, hundreds of LIT tests failed. kyulee: It turned out the MCAsmStreamer path (as opposed to MCObjectStreamer) typically doesn't…
		dblaikieUnsubmitted Not Done Reply Inline Actions find is preferred over count - so that the result can be used rather than a duplicate lookup: auto I = MCLineDivisions.find(Sec); if (I != MCLineDivisions.end()) { auto &Entries = I->second; ... } Though I'm still curious to better understand under what conditions this is needed. dblaikie: find is preferred over count - so that the result can be used rather than a duplicate lookup…
		kyuleeAuthorUnsubmitted Done Reply Inline Actions Yea. `find` seems better since I need to use the map anyhow inside. Will update it. As commented above in the code, the assembly output path (MCAsmStreamer as opposed to MCObjectStreamer), the line entry is not added instead .loc directives are emitted during the function emission. So, the line entries are often irrelevant in the assembly output except a certain target -- I guess XCOFF shown below in the tests. I was thinking to check streamer and specialize this logic outside this, but not sure exactly how to do so. kyulee: Yea. `find` seems better since I need to use the map anyhow inside. Will update it. As…
		dblaikieUnsubmitted Not Done Reply Inline Actions Ah, OK! Right - the solution to that would be to have a virtual function in MCStreamer that's differently implemented (a no-op in the asm streamer case - maybe someone'll eventually extend the assembly syntax to support terminating line contributions - so nodebug could be properly done in assembly) this would be side-by-side with the MCStreamer::emitDwarfLocDirective - or, perhaps it could use that specific function, with an extra parameter or flag value for "end entry"? dblaikie: Ah, OK! Right - the solution to that would be to have a virtual function in MCStreamer that's…
		kyuleeAuthorUnsubmitted Done Reply Inline Actions Yea. I tried to create `emitTerminateLineTable` in MCStreamer and differently extends it for MCAsmStreamer, which resolved the most cases. But I'm still seeing dozens of failures in unit tests which are probably because of incomplete debug data in the test cases that are manually synthesized. For instance, `DebugInfo/X86/multiple-at-const-val.ll`, it was like define i32 @main() !dbg !960 { entry: %call1.i = tail call %"class.std::basic_ostream"* @test(%"class.std::basic_ostream"* @_ZSt4cout, i8* getelementptr inbounds ([6 x i8], [6 x i8]* @.str, i64 0, i64 0), i64 5) ret i32 0 } This means the function appears to have a debug info, but there is no tag like `DILocation` for each line. I presume although the above is unlikely the real case from the compiler, but I think this seems a valid case (opt or any transformation may drop debug tag) so we should cope with. Having said that, I think this check seems worth being kept -- then I don't see the reason above to specialize streamer just for this. kyulee: Yea. I tried to create `emitTerminateLineTable` in MCStreamer and differently extends it for…
		dblaikieUnsubmitted Not Done Reply Inline Actions Ah, hmm - yeah, there's certainly ways we could arrive at a function without any instructions having a location. Though I wonder what the line table should look like for that? Not having the line table cover those instructions seems a bit weird too. But if that's the state of things today, might as well leave ti as-is. Could you include a comment describing how that occurs with the MCObjectStreamer use case as well as the one outlined with MCAsmStreamer? dblaikie: Ah, hmm - yeah, there's certainly ways we could arrive at a function without any instructions…
		kyuleeAuthorUnsubmitted Done Reply Inline Actions For the above case where instructions have no location at all, the line entries in the table didn't appear in dwarf although other contents appeared. Yeah. Will update the comments to include two cases (MCAsmStreamer and MCObjectStreamer). kyulee: For the above case where instructions have no location at all, the line entries in the table…
		// place instead of adding a line entry if the target has
		// usesDwarfFileAndLocDirectives.
		// (2) MCObjectStreamer - if a function has incomplete debug info where
		// instructions don't have DILocations, the line entries are missing.
		auto I = MCLineDivisions.find(Sec);
		if (I != MCLineDivisions.end()) {
		auto &Entries = I->second;
		auto EndEntry = Entries.back();
		EndEntry.setEndLabel(EndLabel);
		Entries.push_back(EndEntry);
		}
		}

//		//
// This emits the Dwarf line table for the specified section from the entries		// This emits the Dwarf line table for the specified section from the entries
// in the LineSection.		// in the LineSection.
//		//
void MCDwarfLineTable::emitOne(		void MCDwarfLineTable::emitOne(
MCStreamer MCOS, MCSection Section,		MCStreamer MCOS, MCSection Section,
const MCLineSection::MCDwarfLineEntryCollection &LineEntries) {		const MCLineSection::MCDwarfLineEntryCollection &LineEntries) {
unsigned FileNum = 1;
unsigned LastLine = 1;		unsigned FileNum, LastLine, Column, Flags, Isa, Discriminator;
unsigned Column = 0;		MCSymbol *LastLabel;
unsigned Flags = DWARF2_LINE_DEFAULT_IS_STMT ? DWARF2_FLAG_IS_STMT : 0;		auto init = [&]() {
unsigned Isa = 0;		FileNum = 1;
unsigned Discriminator = 0;		LastLine = 1;
MCSymbol *LastLabel = nullptr;		Column = 0;
		Flags = DWARF2_LINE_DEFAULT_IS_STMT ? DWARF2_FLAG_IS_STMT : 0;
		Isa = 0;
		Discriminator = 0;
		LastLabel = nullptr;
		};
		init();

// Loop through each MCDwarfLineEntry and encode the dwarf line number table.		// Loop through each MCDwarfLineEntry and encode the dwarf line number table.
		bool EndEntryEmitted = false;
for (const MCDwarfLineEntry &LineEntry : LineEntries) {		for (const MCDwarfLineEntry &LineEntry : LineEntries) {
		MCSymbol *Label = LineEntry.getLabel();
		const MCAsmInfo *asmInfo = MCOS->getContext().getAsmInfo();
		if (LineEntry.IsEndEntry) {
		MCOS->emitDwarfAdvanceLineAddr(INT64_MAX, LastLabel, Label,
		asmInfo->getCodePointerSize());
		init();
		EndEntryEmitted = true;
		continue;
		}

int64_t LineDelta = static_cast<int64_t>(LineEntry.getLine()) - LastLine;		int64_t LineDelta = static_cast<int64_t>(LineEntry.getLine()) - LastLine;

if (FileNum != LineEntry.getFileNum()) {		if (FileNum != LineEntry.getFileNum()) {
		dblaikieUnsubmitted Not Done Reply Inline Actions When does this occur? I guess when the last function with debug info in this section is followed by a function in another section (with or without debug info) or a nodebug function? That does seem a bit subtle and like it'd be nicer if this API's invariant was just that all callers terminated their own lists. (this would involve fixing up the GenDwarfForAssembly codepath to do that too, I'd guess) For DwarfDebug/etc I guess this would be addressed by calling terminateLineTableForPrevCU (or perhaps resetPrevCU) in finishModule or whatever it's called? dblaikie: When does this occur? I guess when the last function with debug info in this section is…
		kyuleeAuthorUnsubmitted Done Reply Inline Actions I can see we can terminate the line table for DwarfDebug in finishModule via `resetPrevCU` -- this will use an end label of range. However, for assembly path, I don't think we have explicit labels/symbols to terminate. I may introduce an api to patch the line table for all sections that do not have an end entry at the end. But in that case, I still need to synthesize/emit a label using section symbol like `MCStreamer::endSection`. It's just moving the place when we fill this gap -- either here or before coming here. kyulee: I can see we can terminate the line table for DwarfDebug in finishModule via `resetPrevCU`…
FileNum = LineEntry.getFileNum();		FileNum = LineEntry.getFileNum();
MCOS->emitInt8(dwarf::DW_LNS_set_file);		MCOS->emitInt8(dwarf::DW_LNS_set_file);
MCOS->emitULEB128IntValue(FileNum);		MCOS->emitULEB128IntValue(FileNum);
}		}
if (Column != LineEntry.getColumn()) {		if (Column != LineEntry.getColumn()) {
Column = LineEntry.getColumn();		Column = LineEntry.getColumn();
MCOS->emitInt8(dwarf::DW_LNS_set_column);		MCOS->emitInt8(dwarf::DW_LNS_set_column);
MCOS->emitULEB128IntValue(Column);		MCOS->emitULEB128IntValue(Column);
Show All 18 Lines	for (const MCDwarfLineEntry &LineEntry : LineEntries) {
}		}
if (LineEntry.getFlags() & DWARF2_FLAG_BASIC_BLOCK)		if (LineEntry.getFlags() & DWARF2_FLAG_BASIC_BLOCK)
MCOS->emitInt8(dwarf::DW_LNS_set_basic_block);		MCOS->emitInt8(dwarf::DW_LNS_set_basic_block);
if (LineEntry.getFlags() & DWARF2_FLAG_PROLOGUE_END)		if (LineEntry.getFlags() & DWARF2_FLAG_PROLOGUE_END)
MCOS->emitInt8(dwarf::DW_LNS_set_prologue_end);		MCOS->emitInt8(dwarf::DW_LNS_set_prologue_end);
if (LineEntry.getFlags() & DWARF2_FLAG_EPILOGUE_BEGIN)		if (LineEntry.getFlags() & DWARF2_FLAG_EPILOGUE_BEGIN)
MCOS->emitInt8(dwarf::DW_LNS_set_epilogue_begin);		MCOS->emitInt8(dwarf::DW_LNS_set_epilogue_begin);

MCSymbol *Label = LineEntry.getLabel();

// At this point we want to emit/create the sequence to encode the delta in		// At this point we want to emit/create the sequence to encode the delta in
// line numbers and the increment of the address from the previous Label		// line numbers and the increment of the address from the previous Label
// and the current Label.		// and the current Label.
const MCAsmInfo *asmInfo = MCOS->getContext().getAsmInfo();
MCOS->emitDwarfAdvanceLineAddr(LineDelta, LastLabel, Label,		MCOS->emitDwarfAdvanceLineAddr(LineDelta, LastLabel, Label,
asmInfo->getCodePointerSize());		asmInfo->getCodePointerSize());

Discriminator = 0;		Discriminator = 0;
LastLine = LineEntry.getLine();		LastLine = LineEntry.getLine();
LastLabel = Label;		LastLabel = Label;
}		}

// Generate DWARF line end entry.		// Generate DWARF line end entry.
		// We do not need this for DwarfDebug that explicitly terminates the line
		// table using ranges whenever CU or section changes. However, the MC path
		// does not track ranges nor terminate the line table. In that case,
		// conservatively use the section end symbol to end the line table.
		if (!EndEntryEmitted)
MCOS->emitDwarfLineEndEntry(Section, LastLabel);		MCOS->emitDwarfLineEndEntry(Section, LastLabel);
		dblaikieUnsubmitted Not Done Reply Inline Actions Would be nice if we could eliminate the need for this case - since any time this gets used it risks being because a line table was allowed to "flow off the end" further than it should. What would it be like if we handled the line table similar to the way ranges are handled? Always terminated at the end of each function, but then extended if the next function happens to start immediately after the last entry ended? dblaikie: Would be nice if we could eliminate the need for this case - since any time this gets used it…
		kyuleeAuthorUnsubmitted Done Reply Inline Actions Yeah. That sounds great, but it seems similar to what I initially tried, which tracked the end label of function as functions were added. But I wonder whether there is a case where the end label of the prior function is not matched with the start label of the next function due to some alignment, etc? In that case, do we want to terminate the line table for the prior function because of mismatch? I guess that seems an overkill. kyulee: Yeah. That sounds great, but it seems similar to what I initially tried, which tracked the end…
		dblaikieUnsubmitted Not Done Reply Inline Actions Yeah, though I was thinking more driven by the DwarfDebug code so it's not tracked in two different places that might diverge/reduce the code/logic needed to handle these cases. I think alignment doesn't break this strategy (it doesn't seem to break it for the ranges data using this approach) - though the alignment does come between the start of one function and the beginning of the next function - the code in DwarfDebug/DwarfCompileUnit/etc extends the CU ranges to cover that & hopefully the line table could be powered by the same logic. dblaikie: Yeah, though I was thinking more driven by the DwarfDebug code so it's not tracked in two…
		kyuleeAuthorUnsubmitted Done Reply Inline Actions From the second thought, `addLineEntry` is used during function emission (which is independent of Range operation) while `Range` is added at the end of function. It doesn't seem to work to extend the line table using the end of function range. kyulee: From the second thought, `addLineEntry` is used during function emission (which is independent…
}		}

//		//
// This emits the Dwarf file and the line tables.		// This emits the Dwarf file and the line tables.
//		//
void MCDwarfLineTable::emit(MCStreamer *MCOS, MCDwarfLineTableParams Params) {		void MCDwarfLineTable::emit(MCStreamer *MCOS, MCDwarfLineTableParams Params) {
MCContext &context = MCOS->getContext();		MCContext &context = MCOS->getContext();

▲ Show 20 Lines • Show All 1,681 Lines • Show Last 20 Lines

llvm/test/DebugInfo/XCOFF/empty.ll

	Show First 20 Lines • Show All 221 Lines • ▼ Show 20 Lines
	; ASM32-NEXT: .byte 10			; ASM32-NEXT: .byte 10
	; ASM32-NEXT: .byte 0 # Set address to L..tmp2			; ASM32-NEXT: .byte 0 # Set address to L..tmp2
	; ASM32-NEXT: .byte 5			; ASM32-NEXT: .byte 5
	; ASM32-NEXT: .byte 2			; ASM32-NEXT: .byte 2
	; ASM32-NEXT: .vbyte 4, L..tmp2			; ASM32-NEXT: .vbyte 4, L..tmp2
	; ASM32-NEXT: .byte 3 # Advance line 1			; ASM32-NEXT: .byte 3 # Advance line 1
	; ASM32-NEXT: .byte 1			; ASM32-NEXT: .byte 1
	; ASM32-NEXT: .byte 1			; ASM32-NEXT: .byte 1
	; ASM32-NEXT: .byte 0 # Set address to L..sec_end0			; ASM32-NEXT: .byte 0 # Set address to L..func_end0
				dblaikieUnsubmitted Not Done Reply Inline Actions Hmm, what aspect of the change caused these labels to change name? dblaikie: Hmm, what aspect of the change caused these labels to change name?
				kyuleeAuthorUnsubmitted Done Reply Inline Actions I think this XCOFF seems a unique path that generates the line table for the assembly output in DwarfDebug. So, the new logic is still kicked in, which adds an entry based on the range end label (instead of the section end label in the fall-back path). I think the range for function uses function labels, so that's why the change happens. Although this is the same in this unit tests, in theory, I think this new change is more precise. kyulee: I think this XCOFF seems a unique path that generates the line table for the assembly output in…
	; ASM32-NEXT: .byte 5			; ASM32-NEXT: .byte 5
	; ASM32-NEXT: .byte 2			; ASM32-NEXT: .byte 2
	; ASM32-NEXT: .vbyte 4, L..sec_end0			; ASM32-NEXT: .vbyte 4, L..func_end0
	; ASM32-NEXT: .byte 0 # End sequence			; ASM32-NEXT: .byte 0 # End sequence
	; ASM32-NEXT: .byte 1			; ASM32-NEXT: .byte 1
	; ASM32-NEXT: .byte 1			; ASM32-NEXT: .byte 1
	; ASM32-NEXT: L..debug_line_end0:			; ASM32-NEXT: L..debug_line_end0:

	; ASM64: .csect .text[PR],2			; ASM64: .csect .text[PR],2
	; ASM64-NEXT: .file "1.c"			; ASM64-NEXT: .file "1.c"
	; ASM64-NEXT: .globl main[DS] # -- Begin function main			; ASM64-NEXT: .globl main[DS] # -- Begin function main
	▲ Show 20 Lines • Show All 181 Lines • ▼ Show 20 Lines
	; ASM64-NEXT: .byte 10			; ASM64-NEXT: .byte 10
	; ASM64-NEXT: .byte 0 # Set address to L..tmp2			; ASM64-NEXT: .byte 0 # Set address to L..tmp2
	; ASM64-NEXT: .byte 9			; ASM64-NEXT: .byte 9
	; ASM64-NEXT: .byte 2			; ASM64-NEXT: .byte 2
	; ASM64-NEXT: .vbyte 8, L..tmp2			; ASM64-NEXT: .vbyte 8, L..tmp2
	; ASM64-NEXT: .byte 3 # Advance line 1			; ASM64-NEXT: .byte 3 # Advance line 1
	; ASM64-NEXT: .byte 1			; ASM64-NEXT: .byte 1
	; ASM64-NEXT: .byte 1			; ASM64-NEXT: .byte 1
	; ASM64-NEXT: .byte 0 # Set address to L..sec_end0			; ASM64-NEXT: .byte 0 # Set address to L..func_end0
	; ASM64-NEXT: .byte 9			; ASM64-NEXT: .byte 9
	; ASM64-NEXT: .byte 2			; ASM64-NEXT: .byte 2
	; ASM64-NEXT: .vbyte 8, L..sec_end0			; ASM64-NEXT: .vbyte 8, L..func_end0
	; ASM64-NEXT: .byte 0 # End sequence			; ASM64-NEXT: .byte 0 # End sequence
	; ASM64-NEXT: .byte 1			; ASM64-NEXT: .byte 1
	; ASM64-NEXT: .byte 1			; ASM64-NEXT: .byte 1
	; ASM64-NEXT: L..debug_line_end0:			; ASM64-NEXT: L..debug_line_end0:

	; DWARF32: : file format aixcoff-rs6000			; DWARF32: : file format aixcoff-rs6000
	; DWARF32: .debug_abbrev contents:			; DWARF32: .debug_abbrev contents:
	; DWARF32-NEXT: Abbrev table for offset: 0x00000000			; DWARF32-NEXT: Abbrev table for offset: 0x00000000
	▲ Show 20 Lines • Show All 82 Lines • Show Last 20 Lines

llvm/test/DebugInfo/XCOFF/explicit-section.ll

	Show First 20 Lines • Show All 290 Lines • ▼ Show 20 Lines
	; CHECK-NEXT: .byte 10			; CHECK-NEXT: .byte 10
	; CHECK-NEXT: .byte 0 # Set address to L..tmp1			; CHECK-NEXT: .byte 0 # Set address to L..tmp1
	; CHECK-NEXT: .byte 5			; CHECK-NEXT: .byte 5
	; CHECK-NEXT: .byte 2			; CHECK-NEXT: .byte 2
	; CHECK-NEXT: .vbyte 4, L..tmp1			; CHECK-NEXT: .vbyte 4, L..tmp1
	; CHECK-NEXT: .byte 3 # Advance line 0			; CHECK-NEXT: .byte 3 # Advance line 0
	; CHECK-NEXT: .byte 0			; CHECK-NEXT: .byte 0
	; CHECK-NEXT: .byte 1			; CHECK-NEXT: .byte 1
	; CHECK-NEXT: .byte 0 # Set address to L..sec_end0			; CHECK-NEXT: .byte 0 # Set address to L..func_end0
	; CHECK-NEXT: .byte 5			; CHECK-NEXT: .byte 5
	; CHECK-NEXT: .byte 2			; CHECK-NEXT: .byte 2
	; CHECK-NEXT: .vbyte 4, L..sec_end0			; CHECK-NEXT: .vbyte 4, L..func_end0
	; CHECK-NEXT: .byte 0 # End sequence			; CHECK-NEXT: .byte 0 # End sequence
	; CHECK-NEXT: .byte 1			; CHECK-NEXT: .byte 1
	; CHECK-NEXT: .byte 1			; CHECK-NEXT: .byte 1
	; CHECK-NEXT: .byte 0 # Set address to L..tmp3			; CHECK-NEXT: .byte 0 # Set address to L..tmp3
	; CHECK-NEXT: .byte 5			; CHECK-NEXT: .byte 5
	; CHECK-NEXT: .byte 2			; CHECK-NEXT: .byte 2
	; CHECK-NEXT: .vbyte 4, L..tmp3			; CHECK-NEXT: .vbyte 4, L..tmp3
	; CHECK-NEXT: .byte 19 # Start sequence			; CHECK-NEXT: .byte 19 # Start sequence
	Show All 12 Lines
	; CHECK-NEXT: .byte 6			; CHECK-NEXT: .byte 6
	; CHECK-NEXT: .byte 0 # Set address to L..tmp6			; CHECK-NEXT: .byte 0 # Set address to L..tmp6
	; CHECK-NEXT: .byte 5			; CHECK-NEXT: .byte 5
	; CHECK-NEXT: .byte 2			; CHECK-NEXT: .byte 2
	; CHECK-NEXT: .vbyte 4, L..tmp6			; CHECK-NEXT: .vbyte 4, L..tmp6
	; CHECK-NEXT: .byte 3 # Advance line 0			; CHECK-NEXT: .byte 3 # Advance line 0
	; CHECK-NEXT: .byte 0			; CHECK-NEXT: .byte 0
	; CHECK-NEXT: .byte 1			; CHECK-NEXT: .byte 1
	; CHECK-NEXT: .byte 0 # Set address to L..sec_end0			; CHECK-NEXT: .byte 0 # Set address to L..func_end1
	; CHECK-NEXT: .byte 5			; CHECK-NEXT: .byte 5
	; CHECK-NEXT: .byte 2			; CHECK-NEXT: .byte 2
	; CHECK-NEXT: .vbyte 4, L..sec_end0			; CHECK-NEXT: .vbyte 4, L..func_end1
	; CHECK-NEXT: .byte 0 # End sequence			; CHECK-NEXT: .byte 0 # End sequence
	; CHECK-NEXT: .byte 1			; CHECK-NEXT: .byte 1
	; CHECK-NEXT: .byte 1			; CHECK-NEXT: .byte 1
	; CHECK-NEXT: L..debug_line_end0:			; CHECK-NEXT: L..debug_line_end0:

llvm/test/DebugInfo/XCOFF/function-sections.ll

	Show First 20 Lines • Show All 277 Lines • ▼ Show 20 Lines
	; CHECK-NEXT: .byte 10			; CHECK-NEXT: .byte 10
	; CHECK-NEXT: .byte 0 # Set address to L..tmp1			; CHECK-NEXT: .byte 0 # Set address to L..tmp1
	; CHECK-NEXT: .byte 5			; CHECK-NEXT: .byte 5
	; CHECK-NEXT: .byte 2			; CHECK-NEXT: .byte 2
	; CHECK-NEXT: .vbyte 4, L..tmp1			; CHECK-NEXT: .vbyte 4, L..tmp1
	; CHECK-NEXT: .byte 3 # Advance line 1			; CHECK-NEXT: .byte 3 # Advance line 1
	; CHECK-NEXT: .byte 1			; CHECK-NEXT: .byte 1
	; CHECK-NEXT: .byte 1			; CHECK-NEXT: .byte 1
	; CHECK-NEXT: .byte 0 # Set address to L..sec_end0			; CHECK-NEXT: .byte 0 # Set address to L..func_end0
	; CHECK-NEXT: .byte 5			; CHECK-NEXT: .byte 5
	; CHECK-NEXT: .byte 2			; CHECK-NEXT: .byte 2
	; CHECK-NEXT: .vbyte 4, L..sec_end0			; CHECK-NEXT: .vbyte 4, L..func_end0
	; CHECK-NEXT: .byte 0 # End sequence			; CHECK-NEXT: .byte 0 # End sequence
	; CHECK-NEXT: .byte 1			; CHECK-NEXT: .byte 1
	; CHECK-NEXT: .byte 1			; CHECK-NEXT: .byte 1
	; CHECK-NEXT: .byte 0 # Set address to L..tmp3			; CHECK-NEXT: .byte 0 # Set address to L..tmp3
	; CHECK-NEXT: .byte 5			; CHECK-NEXT: .byte 5
	; CHECK-NEXT: .byte 2			; CHECK-NEXT: .byte 2
	; CHECK-NEXT: .vbyte 4, L..tmp3			; CHECK-NEXT: .vbyte 4, L..tmp3
	; CHECK-NEXT: .byte 24 # Start sequence			; CHECK-NEXT: .byte 24 # Start sequence
	; CHECK-NEXT: .byte 5			; CHECK-NEXT: .byte 5
	; CHECK-NEXT: .byte 3			; CHECK-NEXT: .byte 3
	; CHECK-NEXT: .byte 10			; CHECK-NEXT: .byte 10
	; CHECK-NEXT: .byte 0 # Set address to L..tmp4			; CHECK-NEXT: .byte 0 # Set address to L..tmp4
	; CHECK-NEXT: .byte 5			; CHECK-NEXT: .byte 5
	; CHECK-NEXT: .byte 2			; CHECK-NEXT: .byte 2
	; CHECK-NEXT: .vbyte 4, L..tmp4			; CHECK-NEXT: .vbyte 4, L..tmp4
	; CHECK-NEXT: .byte 3 # Advance line 1			; CHECK-NEXT: .byte 3 # Advance line 1
	; CHECK-NEXT: .byte 1			; CHECK-NEXT: .byte 1
	; CHECK-NEXT: .byte 1			; CHECK-NEXT: .byte 1
	; CHECK-NEXT: .byte 0 # Set address to L..sec_end0			; CHECK-NEXT: .byte 0 # Set address to L..func_end1
	; CHECK-NEXT: .byte 5			; CHECK-NEXT: .byte 5
	; CHECK-NEXT: .byte 2			; CHECK-NEXT: .byte 2
	; CHECK-NEXT: .vbyte 4, L..sec_end0			; CHECK-NEXT: .vbyte 4, L..func_end1
	; CHECK-NEXT: .byte 0 # End sequence			; CHECK-NEXT: .byte 0 # End sequence
	; CHECK-NEXT: .byte 1			; CHECK-NEXT: .byte 1
	; CHECK-NEXT: .byte 1			; CHECK-NEXT: .byte 1
	; CHECK-NEXT: L..debug_line_end0:			; CHECK-NEXT: L..debug_line_end0:

llvm/test/DebugInfo/debugline-endsequence.ll

This file was added.

				; RUN: llc %s -filetype=obj -o - \| llvm-dwarfdump --debug-line - \| FileCheck %s

				target datalayout = "e-m:o-i64:64-i128:128-n32:64-S128"
				target triple = "arm64-apple-macosx12.0.0"

				; Check if the end_sequences are emitted for each debug range.

				; CU1 Line table
				; CHECK: 0x0000000000000004 [[T:.*]] end_sequence
				; CHECK: 0x0000000000000010 [[T:.*]] end_sequence
				;
				; CU2 Line table
				; CHECK: 0x0000000000000008 [[T:.*]] end_sequence

				; CU1 (0x0 ~ 0x4)
				define void @f1() !dbg !15 {
				ret void, !dbg !18
				}

				; CU2 (0x4 ~ 0x8)
				define void @f2() !dbg !21 {
				ret void, !dbg !22
				}

				; CU2 (nodebug) - (0x8 ~ 0xc)
				define void @f3() {
				ret void
				}

				; CU1 (0xc ~ 0x10)
				define void @f4() !dbg !19 {
				ret void, !dbg !20
				}

				!llvm.dbg.cu = !{!0, !3}
				!llvm.ident = !{!5, !5}
				!llvm.module.flags = !{!6, !7, !8, !9, !10, !11, !12, !13, !14}

				!0 = distinct !DICompileUnit(language: DW_LANG_C99, file: !1, producer: "LLVM", isOptimized: true, runtimeVersion: 0, emissionKind: FullDebug, enums: !2, nameTableKind: None, sysroot: "/")
				!1 = !DIFile(filename: "<stdin>", directory: "/")
				!2 = !{}
				!3 = distinct !DICompileUnit(language: DW_LANG_C99, file: !4, producer: "LLVM", isOptimized: true, runtimeVersion: 0, emissionKind: FullDebug, enums: !2, nameTableKind: None, sysroot: "/")
				!4 = !DIFile(filename: "<stdin>", directory: "/")
				!5 = !{!"Apple clang version 13.0.0 (clang-1300.0.29.3)"}
				!6 = !{i32 2, !"SDK Version", [2 x i32] [i32 11, i32 3]}
				!7 = !{i32 7, !"Dwarf Version", i32 4}
				!8 = !{i32 2, !"Debug Info Version", i32 3}
				!9 = !{i32 1, !"wchar_size", i32 4}
				!10 = !{i32 1, !"branch-target-enforcement", i32 0}
				!11 = !{i32 1, !"sign-return-address", i32 0}
				!12 = !{i32 1, !"sign-return-address-all", i32 0}
				!13 = !{i32 1, !"sign-return-address-with-bkey", i32 0}
				!14 = !{i32 7, !"PIC Level", i32 2}
				!15 = distinct !DISubprogram(name: "f1", scope: !1, file: !1, line: 1, type: !16, scopeLine: 1, spFlags: DISPFlagDefinition, unit: !0, retainedNodes: !2)
				!16 = !DISubroutineType(types: !17)
				!17 = !{null}
				!18 = !DILocation(line: 2, column: 1, scope: !15)
				!19 = distinct !DISubprogram(name: "f4", scope: !1, file: !1, line: 4, type: !16, scopeLine: 4, spFlags: DISPFlagDefinition, unit: !0, retainedNodes: !2)
				!20 = !DILocation(line: 5, column: 1, scope: !19)
				!21 = distinct !DISubprogram(name: "f2", scope: !4, file: !4, line: 1, type: !16, scopeLine: 1, spFlags: DISPFlagDefinition, unit: !3, retainedNodes: !2)
				!22 = !DILocation(line: 2, column: 1, scope: !21)

llvm/test/DebugInfo/debugline-endsequence.s

This file was added.

				# RUN: llvm-mc -filetype=obj -triple=x86_64 %s -o - \| llvm-dwarfdump --debug-line - \| FileCheck %s

				# The line table is open in the MC path.
				# The end sequence is emitted using the section end label.

				# CHECK: 0x0000000000000001 [[T:.*]] end_sequence
				# CHECK: 0x0000000000000001 [[T:.*]] end_sequence

				.text
				.section .text.f1
				f1:
				.file 1 "/" "t1.c"
				.loc 1 1 0
				nop

				.section .text.f2
				f2:
				.loc 1 2 0
				nop

This is an archive of the discontinued LLVM Phabricator instance.

[DebugInfo] Fix end_sequence of debug_line in LTO ObjectClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 387151

llvm/include/llvm/MC/MCDwarf.h

llvm/lib/CodeGen/AsmPrinter/DwarfCompileUnit.cpp

llvm/lib/CodeGen/AsmPrinter/DwarfDebug.h

llvm/lib/CodeGen/AsmPrinter/DwarfDebug.cpp

llvm/lib/MC/MCDwarf.cpp

llvm/test/DebugInfo/XCOFF/empty.ll

llvm/test/DebugInfo/XCOFF/explicit-section.ll

llvm/test/DebugInfo/XCOFF/function-sections.ll

llvm/test/DebugInfo/debugline-endsequence.ll

llvm/test/DebugInfo/debugline-endsequence.s

[DebugInfo] Fix end_sequence of debug_line in LTO Object
ClosedPublic