This is an archive of the discontinued LLVM Phabricator instance.


Differential D83495

[DebugInfo] Add DWARF emission for DBG_VALUE_LIST
ClosedPublic

Authored by StephenTozer on Jul 9 2020, 11:09 AM.


Details

Reviewers

aprantl
probinson
vsk
djtodoro
dblaikie

Commits

rG0da27ba56c9f: [DebugInfo] Add DWARF emission for DBG_VALUE_LIST

Summary

Continuing the work discussed in the RFC[0], this patch implements the actual emission of DWARF from a DBG_VALUE_LIST instruction.

The logic for handling the new instruction is simple in most places; DbgEntityHistoryCalculator has a more complex set of changes since it's more involved with register tracking, and the code for producing DW_AT_location in both DwarfDebug and DwarfExpression also required some heftier work.

Previously, the code in emitDebugLocEntry functioned along the lines of:

1. Emit any fragment info
2. Emit any entry value info
3. Emit the location specified in the DBG_VALUE, e.g. DW_OP_reg X or DW_OP_constu X
4. Finally call DwarfExpression::addExpression(), which handles the DIExpression (except fragments)

Since there may now be multiple locations scattered throughout the expression, rather than a single location at the front, addExpression has been modified to optionally take a lambda that is used to handle DW_OP_LLVM_arg N; the lambda is passed in from emitDebugLocEntry, and performs step 3 using the Nth debug operand. Non-list debug values follow the same behaviour as before. DwarfCompileUnit::constructVariableDIEImpl is similar, but simpler.
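The callback arrangement described above can be sketched roughly as follows. This is a toy model rather than the patch's code: `ToyOp`, `ToyExprOp`, `ArgEmitter`, `addToyExpression` and `emitToyLocation` are hypothetical names, the output is a list of strings instead of real DWARF bytes, and the register numbers 0 and 3 are only assumed to stand for two machine registers.

```cpp
#include <cassert>
#include <cstdint>
#include <functional>
#include <string>
#include <vector>

// Toy stand-ins for illustration only; these are not LLVM's real types.
enum class ToyOp { LLVMArg, Plus, StackValue };

struct ToyExprOp {
  ToyOp Kind;
  uint64_t Operand; // argument index for LLVMArg, unused otherwise
};

// Callback that emits the location of the Nth debug operand.
using ArgEmitter = std::function<bool(unsigned, std::vector<std::string> &)>;

// Walks the expression. Each LLVMArg is delegated to the callback; all other
// operators are emitted directly. Returns false if an argument cannot be emitted.
bool addToyExpression(const std::vector<ToyExprOp> &Expr,
                      std::vector<std::string> &Out,
                      const ArgEmitter &InsertArg) {
  for (const ToyExprOp &E : Expr) {
    switch (E.Kind) {
    case ToyOp::LLVMArg:
      if (!InsertArg || !InsertArg(static_cast<unsigned>(E.Operand), Out))
        return false;
      break;
    case ToyOp::Plus:
      Out.push_back("DW_OP_plus");
      break;
    case ToyOp::StackValue:
      Out.push_back("DW_OP_stack_value");
      break;
    }
  }
  return true;
}

// The caller owns the debug operands and passes in the lambda that performs
// "step 3" for whichever operand the expression asks for.
std::vector<std::string> emitToyLocation() {
  const std::vector<unsigned> DebugOperands = {0, 3}; // assumed register numbers
  const std::vector<ToyExprOp> Expr = {{ToyOp::LLVMArg, 0},
                                       {ToyOp::LLVMArg, 1},
                                       {ToyOp::Plus, 0},
                                       {ToyOp::StackValue, 0}};
  std::vector<std::string> Out;
  ArgEmitter InsertArg = [&DebugOperands](unsigned Idx,
                                          std::vector<std::string> &Ops) {
    if (Idx >= DebugOperands.size())
      return false;
    Ops.push_back("DW_OP_breg" + std::to_string(DebugOperands[Idx]) + " 0");
    return true;
  };
  if (!addToyExpression(Expr, Out, InsertArg))
    Out.clear();
  return Out;
}
```

In this sketch the expression walker never needs to know how a location operand is emitted, which is the property that lets two callers perform step 3 differently.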

The alternative to using the lambda would be to move some of the code in DwarfDebug::emitDebugLocEntry directly into DwarfExpr, and pass a list of locations to addExpression. The hard part with this is that DwarfDebug and DwarfCompileUnit perform step 3 differently, although it's possible their behaviour can be merged. The purpose of choosing the lambda was to minimize the amount of actual change made, but if the alternative option seems like an objectively good refactor then I'm happy to adjust.

[0] http://lists.llvm.org/pipermail/llvm-dev/2020-February/139376.html

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

StephenTozer created this revision. Jul 9 2020, 11:09 AM

Herald added a project: Restricted Project. Jul 9 2020, 11:09 AM

Herald added subscribers: llvm-commits, aheejin, hiraditya, dschuff.

Harbormaster failed remote builds in B63613: Diff 276774! Jul 9 2020, 11:10 AM

StephenTozer added a parent revision: D82363: [DebugInfo] Add new instruction and expression operator for variadic debug values. Jul 9 2020, 11:20 AM

Hey, just noticed a couple of comments to remove from the tests.

llvm/test/DebugInfo/X86/dbg_value_list_clobbers.mir
42	You can remove this XXX note.
68	I assume this no longer fails?
llvm/test/DebugInfo/X86/dbg_value_list_emission.mir
64	Can remove this XXX note. As you mentioned offline, having multiple references to the same arg (i.e. multiple `DW_OP_LLVM_arg, 0` in the expr) is never a problem. Though, slightly tangentially, I'm still a little unclear on what the final decision was on how to handle duplicate register arg operands. In D82363 you said 'always treat DBG_VALUE_LISTs as potentially having them'. Please could you explain a little further? (i.e. is it an error state, do we need to add extra checks when dealing with DBG_VALUE_LISTs etc).

StephenTozer marked an inline comment as done. Jul 10 2020, 4:28 AM

StephenTozer added inline comments.

llvm/test/DebugInfo/X86/dbg_value_list_emission.mir
64	It is not an error state, just a slightly more inconvenient form than one without duplicates. It requires some extra work in a few places (operating on a vector instead of a single pointer), but there is no reason for it to be invalid.

Remove old comments from tests.

Harbormaster failed remote builds in B63725: Diff 276989! Jul 10 2020, 4:31 AM

Could we reduce complexity by entirely replacing DBG_VALUE with DBG_VALUE_LIST, *, DW_OP_arg 0, *?

llvm/lib/CodeGen/AsmPrinter/AsmPrinter.cpp
15	Out of curiosity: Is there an operand kind that we could switch() over?
llvm/lib/CodeGen/AsmPrinter/DwarfExpression.cpp
25	By using a callback here the callee cannot use the advanced functionality of addMachineRegExpression for any but a leading DW_OP_LLVM_arg. Do you see a way of either generalizing addMachineRegExpression or otherwise reorganizing this so the addMachineRegExpression functionality becomes available to DBG_VALUE_LIST?

In D83495#2158601, @aprantl wrote:

Could we reduce complexity by entirely replacing DBG_VALUE with DBG_VALUE_LIST, *, DW_OP_arg 0, *?

Yes, and in the long run I think that should be the goal; it would also be nice to remove the "indirect" operand and all the code paths that use it. So far it has been easier to create this as a separate instruction, but if the debug info cabal as a whole is positive on the replacement then there's no harm in bringing the replacement into these patches (or more likely, adding an extra patch to do so on top of the ones that are already being reviewed).

llvm/lib/CodeGen/AsmPrinter/AsmPrinter.cpp
15	It would be useful to have one if there isn't; I didn't want to fold a change like that into this work, but if it exists I can use it (and if not it'd be nice to add in another patch).
llvm/lib/CodeGen/AsmPrinter/DwarfExpression.cpp
25	It actually can use that functionality - in this case, all of the functionality that would normally be applied to the location in the DBG_VALUE is applied by this callback. The callback in this case can advance the ExprCursor, so there are no issues with using addMachineRegExpression normally at any point in a DBG_VALUE_LIST.

Add test to confirm that addMachineRegExpression is being correctly used for DBG_VALUE_LIST regs; replace if-else chain with switch in AsmPrinter::emitDebugValueComment.

Harbormaster failed remote builds in B65595: Diff 280513! Jul 24 2020, 10:19 AM

StephenTozer added a child revision: D91722: [DebugInfo] Use variadic debug values to salvage BinOps and GEP instrs with non-const operands. Nov 18 2020, 8:56 AM

Rebased onto recent master; no functional changes, minor adjustments in DwarfDebug::emitDebugLocValue.

Harbormaster completed remote builds in B81440: Diff 310161. Dec 8 2020, 6:09 AM

scott.linder added a subscriber: scott.linder. Dec 22 2020, 3:27 PM

scott.linder added inline comments.

llvm/lib/CodeGen/AsmPrinter/AsmPrinter.cpp
20–41	If you are changing this anyway, I'd vote to just split up these two cases, they don't actually seem related and the existing code reads worse to me.
llvm/lib/CodeGen/AsmPrinter/DebugHandlerBase.cpp
11–1	Would it make sense to still try applying the heuristic if `size(Instruction.debug_values()) == 1 && Instruction.getDebugOperand(0).isReg() && DIExpr->expr_op_begin()->getOp == dwarf::DW_OP_LLVM_arg`? I think this only really makes a difference when we eliminate the old version, but I assume we will still want this thing to work for the cases it can?
llvm/lib/CodeGen/AsmPrinter/DebugLocEntry.h
83	I think you can wrap this constructor body in `#ifndef NDEBUG`
97	Same here, the whole constructor should be `#ifndef NDEBUG`
llvm/lib/CodeGen/AsmPrinter/DwarfDebug.cpp
67	Was changing this condition intentional? If so can it be in a separate patch?
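The `#ifndef NDEBUG` suggestion for the DebugLocEntry.h constructors can be illustrated with a small sketch. It assumes the constructor body consists only of validation asserts, which the comments imply but this page does not show; `ToyLocEntry` and `ToyValueLoc` are hypothetical names and not the classes in the patch.

```cpp
#include <cassert>
#include <cstddef>
#include <utility>
#include <vector>

// Hypothetical stand-in for a single location entry.
struct ToyLocEntry {
  bool IsValid;
};

class ToyValueLoc {
  std::vector<ToyLocEntry> Entries;

public:
  explicit ToyValueLoc(std::vector<ToyLocEntry> Locs)
      : Entries(std::move(Locs)) {
#ifndef NDEBUG
    // Validation only: the whole loop is compiled out when NDEBUG is defined,
    // instead of leaving behind a loop whose body is an empty assert.
    for (const ToyLocEntry &E : Entries)
      assert(E.IsValid && "every entry is expected to be valid");
#endif
  }

  std::size_t size() const { return Entries.size(); }
};
```

Guarding the whole body, rather than relying on `assert` expanding to nothing, keeps release builds from iterating over the entries for no effect.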

StephenTozer added inline comments. Jan 4 2021, 6:01 AM

llvm/lib/CodeGen/AsmPrinter/DwarfDebug.cpp
67	Unintentional, the change here got missed during a rebase (but is fixed in my local copy, which I'll push up shortly).

StephenTozer added inline comments. Jan 7 2021, 9:07 AM

llvm/lib/CodeGen/AsmPrinter/AsmPrinter.cpp
20–41	The similarity here is that even when `Op` is a register, it can still have an offset - it's just calculated above, outside of this loop, by the debug offset operand of `MI`. I do think this looks quite messy; it'd probably be better to move the offset calculation down into this loop, so that it can be understood for both of them as a local variable.

scott.linder added inline comments. Jan 13 2021, 9:51 AM

llvm/lib/CodeGen/AsmPrinter/AsmPrinter.cpp
20–41	+1 I think I completely missed the connection to the offset calculated above, thank you for clarifying.

Rebased onto recent master, fixed some test failures.

Harbormaster completed remote builds in B85064: Diff 316474. Jan 13 2021, 11:47 AM

scott.linder added inline comments. Jan 13 2021, 12:31 PM

llvm/test/DebugInfo/X86/dbg_value_list_emission.mir
52–56	Why does this behave differently than (what I understand to be) the equivalent `DBG_VALUE`?

    DBG_VALUE $eax, $noreg, !12, !DIExpression(DW_OP_stack_value), debug-location !15
    ; becomes: DW_AT_location (DW_OP_breg0 RAX+0, DW_OP_constu 0xffffffff, DW_OP_and, DW_OP_stack_value)

It seems like the `DW_OP_and` is there to select a subregister (I assume EAX), but oddly it comes after the value of the register is already read (i.e. after the DW_OP_breg). I'm lost on what the intended behavior is, and why it differs between `DBG_VALUE` and `DBG_VALUE_LIST`.

There is also the existing confusion around the "isIndirect" flag in `DBG_VALUE` which makes these two equivalent (and both seemingly wrong):

    DBG_VALUE $eax, $noreg, !25, !DIExpression(DW_OP_stack_value), debug-location !15
    ; becomes: DW_AT_location (DW_OP_breg0 RAX+0, DW_OP_constu 0xffffffff, DW_OP_and, DW_OP_stack_value)
    DBG_VALUE $eax, 0, !26, !DIExpression(DW_OP_stack_value), debug-location !15
    ; becomes: DW_AT_location (DW_OP_breg0 RAX+0, DW_OP_constu 0xffffffff, DW_OP_and, DW_OP_stack_value)

which makes it harder still to compare.

Would it be more straightforward to always be explicit about indirection in the new form? Why does `DW_OP_stack_value` imply a `DW_OP_deref` at all? I.e. why do we not get:

    DBG_VALUE_LIST !12, !DIExpression(DW_OP_LLVM_arg, 0, DW_OP_stack_value), $eax, debug-location !15
    ; CHECK: DW_TAG_variable
    ; CHECK-NEXT: (DW_OP_reg RAX, DW_OP_stack_value)
    ; CHECK-NEXT: DW_AT_name ("locala")

which in this case I imagine would just be an error. I would expect the correct expression to generate the `DW_OP_breg` would be something like:

    DBG_VALUE_LIST !12, !DIExpression(DW_OP_LLVM_arg, 0, DW_OP_deref, DW_OP_stack_value), $eax, debug-location !15
    ; CHECK: DW_TAG_variable
    ; CHECK-NEXT: (DW_OP_breg0 RAX+0, DW_OP_stack_value)
    ; CHECK-NEXT: DW_AT_name ("locala")

If we don't do this, we seem to retain some of the same ambiguity that makes the old "isIndirect" field so confusing.

StephenTozer added inline comments. Jan 15 2021, 9:30 AM

llvm/test/DebugInfo/X86/dbg_value_list_emission.mir
52–56	To the first point: I'm looking into it now; I noticed the `DW_OP_and` before, but I'm not sure where it's coming from myself yet - the `DBG_VALUE_LIST` handling should be following essentially the same code path as `DBG_VALUE`, so this is a bug one way or another.

To the second point, I think your examples are slightly incorrect: The `isIndirect` flag in `DBG_VALUE` is confusing and inconsistent, what it actually does is dependent on the DIExpression and not well explained. The `DBG_VALUE_LIST` implementation has no such inconsistencies however (I hope).

The problem with your examples is that I think you're using `DW_OP_reg` to mean a register's literal value, and `DW_OP_breg` to mean the address pointed to by a register. This isn't quite correct, although they do act like that for most variable locations. The actual meanings are slightly more complicated; the short answer is that `DW_OP_reg` is a Register location description: it refers to the register itself, not to the value of that register. `DW_OP_breg` on the other hand does refer to the literal value of a register; it's generally used with an offset as part of a Memory location expression, but if combined with `DW_OP_stack_value` then it gives the value in the register as the variable's value (albeit as an Implicit location rather than a Register location).

So with all of that said, the meaning of `(DW_OP_breg0 RAX+0, DW_OP_stack_value)` is that the variable's value can be found in `$rax`, but the variable should be read-only, which matches the meaning of the `DBG_VALUE_LIST`.
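The distinction drawn in this reply can be made concrete with a toy evaluator. This is a deliberately simplified sketch, not a DWARF consumer: it handles only the five operators that appear in this thread, and the names `LocKind`, `ToyResult`, `ToyDwarfOp`, `ToyDwarfExprOp` and `evaluateToy` are hypothetical.

```cpp
#include <cassert>
#include <cstdint>
#include <vector>

enum class LocKind { Empty, Register, Memory, Implicit };

struct ToyResult {
  LocKind Kind;
  uint64_t Value; // register number, memory address, or implicit value
};

enum class ToyDwarfOp { Reg, BReg, ConstU, And, StackValue };

struct ToyDwarfExprOp {
  ToyDwarfOp Kind;
  uint64_t A;  // register number or constant
  int64_t Off; // offset, used by BReg only
};

ToyResult evaluateToy(const std::vector<ToyDwarfExprOp> &Expr,
                      const std::vector<uint64_t> &Regs) {
  std::vector<uint64_t> Stack;
  for (const ToyDwarfExprOp &E : Expr) {
    switch (E.Kind) {
    case ToyDwarfOp::Reg:
      // Register location: names the register itself and reads nothing.
      return {LocKind::Register, E.A};
    case ToyDwarfOp::BReg:
      // Pushes the register's contents plus an offset onto the stack.
      Stack.push_back(Regs[E.A] + static_cast<uint64_t>(E.Off));
      break;
    case ToyDwarfOp::ConstU:
      Stack.push_back(E.A);
      break;
    case ToyDwarfOp::And: {
      if (Stack.size() < 2)
        return {LocKind::Empty, 0};
      uint64_t Rhs = Stack.back();
      Stack.pop_back();
      Stack.back() &= Rhs;
      break;
    }
    case ToyDwarfOp::StackValue:
      // The top of the stack is the variable's value, not its address.
      if (Stack.empty())
        return {LocKind::Empty, 0};
      return {LocKind::Implicit, Stack.back()};
    }
  }
  if (Stack.empty())
    return {LocKind::Empty, 0};
  // Without DW_OP_stack_value the top of the stack is read as an address.
  return {LocKind::Memory, Stack.back()};
}
```

Under these assumptions, `DW_OP_breg0 RAX+0, DW_OP_constu 0xffffffff, DW_OP_and, DW_OP_stack_value` evaluates to an implicit value holding the low 32 bits of register 0, a lone `DW_OP_reg0` is a register location, and a lone `DW_OP_breg0 RAX+0` is a memory location.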

scott.linder added inline comments. Jan 17 2021, 12:18 PM

llvm/test/DebugInfo/X86/dbg_value_list_emission.mir
52–56	> The isIndirect flag in DBG_VALUE is confusing and inconsistent, what it actually does is dependent on the DIExpression and not well explained.

Agreed, and my point is that a similar issue applies to the interpretation of `DW_OP_LLVM_arg` in your patch, even with `isIndirect` gone (modulo the `assert` mentioned below, which only side-steps the issue).

> The problem with your examples is that I think you're using DW_OP_reg to mean a register's literal value, and DW_OP_breg to mean the address pointed to by a register.

That wasn't my intention in the examples I gave; in fact in the unambiguous model we tried to extract from the DWARF spec (https://llvm.org/docs/AMDGPUDwarfExtensionsForHeterogeneousDebugging.html) we define the `DW_OP_reg` as pushing a register location description onto the stack, and `DW_OP_breg` as pushing a memory location description onto the stack. The interaction which makes the location description pushed onto the stack by `DW_OP_breg` behave like a value in some contexts (i.e. behave like the offset contents of the register) we begrudgingly capture with an implicit conversion to make our description backwards-compatible with DWARF 4 and 5, but even if you want to just define it as pushing a value onto the stack, at the very least the `breg` must represent reading the value of a `reg` register location.

My question is then: why does adding `DW_OP_stack_value` implicitly cause the value of the register to end up on the stack, with no intervening operation? I.e. why is this the case:

    ; Sure, seems reasonable: `DW_OP_LLVM_arg` when referring to a register describes the register itself, not the value of the register.
    DBG_VALUE_LIST !12, !DIExpression(DW_OP_LLVM_arg, 0), $eax, debug-location !11
    ; CHECK: DW_AT_location (DW_OP_reg0 RAX)

    ; This follows too: if you do want the value of the register, you can read it explicitly with e.g. `DW_OP_deref`. DWARF actually requires this be collapsed into something like `DW_OP_breg` or `DW_OP_regval_type`,
    ; as `DW_OP_reg RAX, DW_OP_deref` is not a valid location description of any kind. This is an artificial constraint of the standard, and in any consistent view of the spec the two forms would have to otherwise be equivalent.
    DBG_VALUE_LIST !13, !DIExpression(DW_OP_LLVM_arg, 0, DW_OP_deref), $eax, debug-location !11
    ; CHECK: DW_AT_location (DW_OP_breg0 RAX+0)

    ; Hmm, this doesn't seem right though: why are there now two indirections? Does `DW_OP_stack_value` imply one for some reason?
    DBG_VALUE_LIST !14, !DIExpression(DW_OP_LLVM_arg, 0, DW_OP_deref, DW_OP_stack_value), $eax, debug-location !11
    ; CHECK: DW_AT_location (DW_OP_breg0 RAX+0, DW_OP_deref, DW_OP_stack_value)

    ; If you actually want the singly-indirect output you have to omit the semantically consistent `DW_OP_deref`. I would have expected this to just be an invalid DIExpression:
    DBG_VALUE_LIST !15, !DIExpression(DW_OP_LLVM_arg, 0, DW_OP_stack_value), $eax, debug-location !11
    ; CHECK: DW_AT_location (DW_OP_breg0 RAX+0, DW_OP_stack_value)

Note that I removed the asserts in your patch which seem to artificially require `DW_OP_stack_value` for `DBG_VALUE_LIST`. I didn't understand the purpose of them before, but perhaps this issue is one reason for them to be present?

My fundamental argument is that this context-dependent interpretation of `DW_OP_LLVM_arg` is another source of confusion, just like `isIndirect`. I think this stems from the fact that DWARF as it is defined today is not general/composable enough to avoid this, but I don't think that should bleed into the internal representation used by LLVM: we can make a sensible choice up until we get into the DWARF backend, where certain expressions will have to be converted into a different form to be legal.

Instead, what we have now is a situation where adding operations to the expression changes fundamentally how you are supposed to interpret the `DW_OP_LLVM_arg`. This is a pre-existing shortcoming, just like `isIndirect`, so saddling you with the burden of correcting it doesn't seem reasonable, but I think it is important to discuss. This is also important in the context of replacing `DBG_VALUE` entirely, as the `assert` will obviously need to go away.

StephenTozer added inline comments. Jan 18 2021, 2:42 AM

llvm/test/DebugInfo/X86/dbg_value_list_emission.mir
52–56	You are correct - currently, `DBG_VALUE_LIST` is consistent when we only represent expressions with a `DW_OP_stack_value`; there will need to be an additional flag to correctly represent the full set of DWARF expressions.

Of the examples you've given however, I would say that it is the first two that are incorrect, and the latter two that are correct. The expression `!DIExpression(DW_OP_LLVM_arg, 0)` should, in the absence of a flag declaring it to be a direct/register location, mean that the variable is at the address given by the first argument, so the correct DWARF translation would be `DW_AT_location (DW_OP_breg0 RAX+0)`.

This topic was discussed on the mailing list a while back, starting around here, concluding that we need an extra flag (with a different semantic meaning to the current IsIndirect that avoids the inconsistencies) to accurately represent all DWARF expressions.

The reason why this hasn't been added as part of this patch is that this patch isn't replacing `DBG_VALUE` with `DBG_VALUE_LIST` yet; the only place where `DBG_VALUE_LIST` is used is in salvaging dbg.values, where it will necessarily use `DW_OP_stack_value`.

Fixed issue in which DBG_VALUE_LIST expressions were being emitted without subregister masking.

Harbormaster completed remote builds in B85601: Diff 317359. Jan 18 2021, 7:57 AM

Just chiming in with Scott here:

My fundamental argument is that this context-dependent interpretation of DW_OP_LLVM_arg is another source of confusion, just like isIndirect. I think this stems from the fact that DWARF as it is defined today is not general/composable enough to avoid this, but I don't think that should bleed into the internal representation used by LLVM: we can make a sensible choice up until we get into the DWARF backend, where certain expressions will have to be converted into a different form to be legal. Instead, what we have now is a situation where adding operations to the expression changes fundamentally how you are supposed to interpret the DW_OP_LLVM_arg. This is a pre-existing shortcoming, just like isIndirect, so saddling you with the burden of correcting it doesn't seem reasonable, but I think it is important to discuss. This is also important in the context of replacing DBG_VALUE entirely, as the assert will obviously need to go away.

I agree -- there's some existing misery here [0] where a pass has to be aware that modifying an expression might change the context it's interpreted in, which is an un-necessary complexity.

[0] https://github.com/llvm/llvm-project/blob/d06e94031bcdfa43512bf7b0cdfd4b4bad3ca4e1/llvm/lib/CodeGen/PrologEpilogInserter.cpp#L1238

In D83495#2542005, @jmorse wrote:

I agree -- there's some existing misery here [0] where a pass has to be aware that modifying an expression might change the context it's interpreted in, which is an un-necessary complexity.

This is true, and it is something that I intend to change in the next patch after this stack, which will be the replacement of the existing DBG_VALUE with the DBG_VALUE_LIST. The discussion about this topic on the mailing list was long, but it covers this issue reasonably well I think. Most of the existing ambiguity comes from the fact that DWARF has two ways to reference a register: DW_OP_reg, which always means "the variable lives in this register", and DW_OP_breg which pushes the value contained in the register onto the expression stack (usually to be interpreted as the address of the live variable), and our method for distinguishing the two is the Directness/Offset flag, which is context-dependent: it reads both the DIExpression and the location operand to see if it should really be indirect (and at different points in the program it does this check differently!). The replacement for that flag, that will be added in the aforementioned replacement patch, will not be context-dependent; if it is set to indirect, it will always mean that the result of the DIExpression is a memory address that the variable lives at, regardless of any location operand or the contents of the DIExpression.

Just to jump back to the example that Scott gave as well, the reason why DW_OP_LLVM_arg is currently context-dependent is because it follows the existing machine register pipeline. This is done for simplicity's sake, but it is worth noting that it is not currently intended to exist in all contexts. A DW_OP_LLVM_arg operator will only currently be produced as part of a DIExpression that ends with DW_OP_stack_value, which fixes it to always be emitted as DW_OP_breg regardless of any other context. The replacement patch will simultaneously enable LLVM_arg operators to exist in non-stack_value expressions, and remove the existing inconsistencies from these cases.

This tentatively LsGTM, modulo some comments. Most of the discussion has been about the context-sensitivity of how the expression stack has evolved, which is awkward but pre-existing. Just to check with @scott.linder, you're alright with the patch going in, with improving expression handling a problem for another day?

llvm/lib/CodeGen/AsmPrinter/AsmPrinter.cpp
31–32	This over-writes Offset from getFrameIndexReference if the DBG_VALUE is indirect -- which I don't believe is a behaviour in the old code. AFAIUI the "debug offset" should nowadays only be used as a flag indicating indirectness, not the actual numerical offset. There's at least one assertion out there that the offset is zero. (Obviously none of this is ideal).
llvm/lib/CodeGen/AsmPrinter/DebugLocEntry.h
61–1	It'd be good to elaborate on this -- a single machine location within a variable location description, one of possibly many?
llvm/lib/CodeGen/AsmPrinter/DwarfCompileUnit.cpp
84–85	To check my understanding: for non-variadic variables emission of integer operands might be signed or unsigned, depending on the type of the variable, we can see that earlier in this function and down in DwarfDebug::emitDebugLocValue. For a variadic expression, however, it's expected that the value gets "compiled in" unsigned, hence we add as unsigned here, yes?
llvm/lib/CodeGen/AsmPrinter/DwarfExpression.cpp
27–30	What about if the next opcode is stack_value, do we need to mask in that scenario?
llvm/test/DebugInfo/X86/dbg_value_list_clobbers.mir
2–3	Does not call FileCheck
llvm/test/DebugInfo/X86/dbg_value_list_emission.mir
6	Mega-nit: "good" suggests there's a subjective difference, could I suggest "correct"
95	Could I request a test that a DBG_VALUE_LIST with $noreg somewhere in it does not lead to a location-list entry -- I've been bitten by $noregs not terminating things in the past.

StephenTozer added inline comments. Feb 16 2021, 7:19 AM

llvm/lib/CodeGen/AsmPrinter/AsmPrinter.cpp
31–32	> This over-writes Offset from getFrameIndexReference if the DBG_VALUE is indirect -- which I don't believe is a behaviour in the old code.

That's because the old code is somewhat misleading; `Offset` is declared at a single point above as positive iff `MI` is an indirect debug value; otherwise it is 0, and gets incremented if the location operand is not a register. These two conditions are mutually exclusive, so ultimately it's just being optionally assigned one of two values; I've just made that more explicit here, but the behaviour should be unchanged.

> AFAIUI the "debug offset" should nowadays only be used as a flag indicating indirectness, not the actual numerical offset. There's at least one assertion out there that the offset is zero. (Obviously none of this is ideal).

That assertion does exist in some places, but I'm not sure if it's everywhere. I could be wrong, but I believe the assertions that the offset is zero are all at points prior to the stack layout being finalized; at the time we emit DWARF, I think we may still have non-zero offsets for some DBG_VALUEs.
llvm/lib/CodeGen/AsmPrinter/DwarfCompileUnit.cpp
84–85	This is really just a "dumb" carry-over of the logic above; when there is a non-trivial expression for a non-variadic debug value with an integer location in this function, it looks as though we just add the unsigned bytes and call it a day. I think the intent is that the signedness is ultimately interpreted according to the variable type? But in all existing cases I think the approach is flawed; unless you have an empty DIExpression, there's no guarantee that the signedness of the variable matches the signedness of the location operand.
llvm/lib/CodeGen/AsmPrinter/DwarfExpression.cpp
27–30	I think so? To be honest, it looks to me like the only time we can use a subregister and don't apply a mask is when describing either a Register location (where we cannot use subreg masking, we use DW_OP_piece instead), or when describing a simple memory location, i.e. a single DW_OP_breg with 0 offset. Although the latter case seems like a valid argument for "we don't always need to mask the subregister", I suspect that it's actually the case that we just never use a subregister for those locations. The full size of a register for a given architecture will also generally be the size of a memory address, so it may just be assumed that we don't need to check for subreg masking in that case.

This revision was not accepted when it landed; it landed in state Needs Review. Mar 10 2021, 5:47 AM

This revision was landed with ongoing or failed builds.

Closed by commit rG0da27ba56c9f: [DebugInfo] Add DWARF emission for DBG_VALUE_LIST (authored by StephenTozer).

This revision was automatically updated to reflect the committed changes.

StephenTozer added a commit: rG0da27ba56c9f: [DebugInfo] Add DWARF emission for DBG_VALUE_LIST.

StephenTozer added a reverting change: rG429c6ecbb302: Revert "[DebugInfo] Add DWARF emission for DBG_VALUE_LIST". Mar 10 2021, 6:35 AM

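Revision Contents

Path	Size
llvm/lib/CodeGen/AsmPrinter/AsmPrinter.cpp	104 lines
llvm/lib/CodeGen/AsmPrinter/DbgEntityHistoryCalculator.cpp	78 lines
llvm/lib/CodeGen/AsmPrinter/DebugHandlerBase.cpp	15 lines
llvm/lib/CodeGen/AsmPrinter/DebugLocEntry.h	129 lines
llvm/lib/CodeGen/AsmPrinter/DwarfCompileUnit.cpp	111 lines
llvm/lib/CodeGen/AsmPrinter/DwarfDebug.cpp	168 lines
llvm/lib/CodeGen/AsmPrinter/DwarfExpression.h	3 lines
llvm/lib/CodeGen/AsmPrinter/DwarfExpression.cpp	35 lines
llvm/test/DebugInfo/X86/dbg_value_list_clobbers.mir	84 lines
llvm/test/DebugInfo/X86/dbg_value_list_emission.mir	101 lines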

Diff 329633

llvm/lib/CodeGen/AsmPrinter/AsmPrinter.cpp

//===- AsmPrinter.cpp - Common AsmPrinter code ----------------------------===//
//
// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
// See https://llvm.org/LICENSE.txt for license information.
// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
//
//===----------------------------------------------------------------------===//
//
// This file implements the AsmPrinter class.
//
//===----------------------------------------------------------------------===//

#include "llvm/CodeGen/AsmPrinter.h"
#include "CodeViewDebug.h"
#include "DwarfDebug.h"

aprantl: Out of curiosity: Is there an operand kind that we could switch() over?

StephenTozer: It would be useful to have one if there isn't; I didn't want to fold a change like that into this work, but if it exists I can use it (and if not it'd be nice to add in another patch).

#include "DwarfException.h"
#include "PseudoProbePrinter.h"
#include "WasmException.h"
#include "WinCFGuard.h"
#include "WinException.h"
#include "llvm/ADT/APFloat.h"
#include "llvm/ADT/APInt.h"
#include "llvm/ADT/DenseMap.h"
#include "llvm/ADT/STLExtras.h"
#include "llvm/ADT/SmallPtrSet.h"
#include "llvm/ADT/SmallString.h"
#include "llvm/ADT/SmallVector.h"
#include "llvm/ADT/Statistic.h"
#include "llvm/ADT/StringRef.h"
#include "llvm/ADT/Triple.h"
#include "llvm/ADT/Twine.h"
#include "llvm/Analysis/ConstantFolding.h"

jmorse: This over-writes Offset from getFrameIndexReference if the DBG_VALUE is indirect -- which I don't believe is a behaviour in the old code.

AFAIUI the "debug offset" should nowadays only be used as a flag indicating indirectness, not the actual numerical offset. There's at least one assertion out there that the offset is zero. (Obviously none of this is ideal).

StephenTozer:

> This over-writes Offset from getFrameIndexReference if the DBG_VALUE is indirect -- which I don't believe is a behaviour in the old code.

That's because the old code is somewhat misleading; Offset is declared at a single point above as positive iff MI is an indirect debug value; otherwise it is 0, and gets incremented if the location operand is not a register. These two conditions are mutually exclusive, so ultimately it's just being optionally assigned one of two values; I've just made that more explicit here, but the behaviour should be unchanged.

> AFAIUI the "debug offset" should nowadays only be used as a flag indicating indirectness, not the actual numerical offset. There's at least one assertion out there that the offset is zero. (Obviously none of this is ideal).

That assertion does exist in some places, but I'm not sure if it's everywhere. I could be wrong, but I believe the assertions that the offset is zero are all at points prior to the stack layout being finalized; at the time we emit DWARF, I think we may still have non-zero offsets for some DBG_VALUEs.

#include "llvm/Analysis/EHPersonalities.h"
#include "llvm/Analysis/MemoryLocation.h"
#include "llvm/Analysis/OptimizationRemarkEmitter.h"
#include "llvm/BinaryFormat/COFF.h"
#include "llvm/BinaryFormat/Dwarf.h"
#include "llvm/BinaryFormat/ELF.h"
#include "llvm/CodeGen/GCMetadata.h"
#include "llvm/CodeGen/GCMetadataPrinter.h"
#include "llvm/CodeGen/GCStrategy.h"

scott.linder (Unsubmitted)

Not Done

break;

}

- case MachineOperand::MO_Register:

- case MachineOperand::MO_FrameIndex: {

- Register Reg;

- if (Op.isReg())

- Reg = Op.getReg();

- else {

- const TargetFrameLowering *TFI =

- AP.MF->getSubtarget().getFrameLowering();

- Offset += TFI->getFrameIndexReference(*AP.MF, Op.getIndex(), Reg);

- MemLoc = true;

- }

- if (Reg == 0) {

- // Suppress offset, it is not meaningful here.

- OS << "undef";

- break;

- }

- if (MemLoc)

- OS << '[';

- OS << printReg(Reg, AP.MF->getSubtarget().getRegisterInfo());

- if (MemLoc)

- OS << '+' << Offset.getFixed() << ']';

- break;

- }

+ case MachineOperand::MO_Register: {

+ if (Op.getReg())

+ OS << printReg(Op.getReg(), AP.MF->getSubtarget().getRegisterInfo());

+ else

+ OS << "undef";

+ break;

+ }

+ case MachineOperand::MO_FrameIndex: {

+ Register Reg;

+ const TargetFrameLowering *TFI =

+ AP.MF->getSubtarget().getFrameLowering();

+ Offset += TFI->getFrameIndexReference(*AP.MF, Op.getIndex(), Reg);

+ if (Reg)

+ OS << '[' << printReg(Reg, AP.MF->getSubtarget().getRegisterInfo()) << '+' << Offset.getFixed() << ']';

+ else

+ OS << "undef";

+ break;

+ } }

default:

If you are changing this anyway, I'd vote to just split up these two cases, they don't actually seem related and the existing code reads worse to me.

StephenTozer (Author, Unsubmitted)

Done

The similarity here is that even when `Op` is a register, it can still have an offset; it is just calculated above, outside of this loop, from the debug offset operand of MI. I do think this looks quite messy; it would probably be better to move the offset calculation down into this loop, so that it can be understood as a local variable for both cases.

scott.linder (Unsubmitted)

Not Done

I think I completely missed the connection to the offset calculated above, thank you for clarifying.

#include "llvm/CodeGen/MachineBasicBlock.h" #include "llvm/CodeGen/MachineBasicBlock.h"

#include "llvm/CodeGen/MachineConstantPool.h" #include "llvm/CodeGen/MachineConstantPool.h"

#include "llvm/CodeGen/MachineDominators.h" #include "llvm/CodeGen/MachineDominators.h"

#include "llvm/CodeGen/MachineFrameInfo.h" #include "llvm/CodeGen/MachineFrameInfo.h"

#include "llvm/CodeGen/MachineFunction.h" #include "llvm/CodeGen/MachineFunction.h"

#include "llvm/CodeGen/MachineFunctionPass.h" #include "llvm/CodeGen/MachineFunctionPass.h"

#include "llvm/CodeGen/MachineInstr.h" #include "llvm/CodeGen/MachineInstr.h"

#include "llvm/CodeGen/MachineInstrBundle.h" #include "llvm/CodeGen/MachineInstrBundle.h"

[...] static bool emitDebugValueComment(const MachineInstr *MI, AsmPrinter &AP) {

if (auto *SP = dyn_cast<DISubprogram>(V->getScope())) { if (auto *SP = dyn_cast<DISubprogram>(V->getScope())) {

StringRef Name = SP->getName(); StringRef Name = SP->getName();

if (!Name.empty()) if (!Name.empty())

OS << Name << ":"; OS << Name << ":";

} }

OS << V->getName(); OS << V->getName();

OS << " <- "; OS << " <- ";

// The second operand is only an offset if it's an immediate.

bool MemLoc = MI->isIndirectDebugValue();

auto Offset = StackOffset::getFixed(MemLoc ? MI->getOperand(1).getImm() : 0);

const DIExpression *Expr = MI->getDebugExpression(); const DIExpression *Expr = MI->getDebugExpression();

if (Expr->getNumElements()) { if (Expr->getNumElements()) {

OS << '['; OS << '[';

ListSeparator LS; ListSeparator LS;

for (auto Op : Expr->expr_ops()) { for (auto Op : Expr->expr_ops()) {

OS << LS << dwarf::OperationEncodingString(Op.getOp()); OS << LS << dwarf::OperationEncodingString(Op.getOp());

for (unsigned I = 0; I < Op.getNumArgs(); ++I) for (unsigned I = 0; I < Op.getNumArgs(); ++I)

OS << ' ' << Op.getArg(I); OS << ' ' << Op.getArg(I);

} }

OS << "] "; OS << "] ";

} }

// Register or immediate value. Register 0 means undef. // Register or immediate value. Register 0 means undef.

if (MI->getDebugOperand(0).isFPImm()) { for (const MachineOperand &Op : MI->debug_operands()) {

APFloat APF = APFloat(MI->getDebugOperand(0).getFPImm()->getValueAPF()); if (&Op != MI->debug_operands().begin())

if (MI->getDebugOperand(0).getFPImm()->getType()->isFloatTy()) { OS << ", ";

switch (Op.getType()) {

case MachineOperand::MO_FPImmediate: {

APFloat APF = APFloat(Op.getFPImm()->getValueAPF());

if (Op.getFPImm()->getType()->isFloatTy()) {

OS << (double)APF.convertToFloat(); OS << (double)APF.convertToFloat();

} else if (MI->getDebugOperand(0).getFPImm()->getType()->isDoubleTy()) { } else if (Op.getFPImm()->getType()->isDoubleTy()) {

OS << APF.convertToDouble(); OS << APF.convertToDouble();

} else { } else {

// There is no good way to print long double. Convert a copy to // There is no good way to print long double. Convert a copy to

// double. Ah well, it's only a comment. // double. Ah well, it's only a comment.

bool ignored; bool ignored;

APF.convert(APFloat::IEEEdouble(), APFloat::rmNearestTiesToEven, APF.convert(APFloat::IEEEdouble(), APFloat::rmNearestTiesToEven,

&ignored); &ignored);

OS << "(long double) " << APF.convertToDouble(); OS << "(long double) " << APF.convertToDouble();

} }

} else if (MI->getDebugOperand(0).isImm()) { break;

OS << MI->getDebugOperand(0).getImm(); }

} else if (MI->getDebugOperand(0).isCImm()) { case MachineOperand::MO_Immediate: {

MI->getDebugOperand(0).getCImm()->getValue().print(OS, false /*isSigned*/); OS << Op.getImm();

} else if (MI->getDebugOperand(0).isTargetIndex()) { break;

auto Op = MI->getDebugOperand(0); }

case MachineOperand::MO_CImmediate: {

Op.getCImm()->getValue().print(OS, false /*isSigned*/);

break;

}

case MachineOperand::MO_TargetIndex: {

OS << "!target-index(" << Op.getIndex() << "," << Op.getOffset() << ")"; OS << "!target-index(" << Op.getIndex() << "," << Op.getOffset() << ")";

// NOTE: Want this comment at start of line, don't emit with AddComment. // NOTE: Want this comment at start of line, don't emit with AddComment.

AP.OutStreamer->emitRawComment(OS.str()); AP.OutStreamer->emitRawComment(OS.str());

return true; break;

} else { }

case MachineOperand::MO_Register:

case MachineOperand::MO_FrameIndex: {

if (MI->getDebugOperand(0).isReg()) { Optional<StackOffset> Offset;

Reg = MI->getDebugOperand(0).getReg(); if (Op.isReg()) {

Reg = Op.getReg();

} else { } else {

assert(MI->getDebugOperand(0).isFI() && "Unknown operand type"); const TargetFrameLowering *TFI =

const TargetFrameLowering *TFI = AP.MF->getSubtarget().getFrameLowering(); AP.MF->getSubtarget().getFrameLowering();

Offset += TFI->getFrameIndexReference( Offset = TFI->getFrameIndexReference(*AP.MF, Op.getIndex(), Reg);

*AP.MF, MI->getDebugOperand(0).getIndex(), Reg);

MemLoc = true;

} }

if (Reg == 0) { if (!Reg) {

// Suppress offset, it is not meaningful here. // Suppress offset, it is not meaningful here.

OS << "undef"; OS << "undef";

// NOTE: Want this comment at start of line, don't emit with AddComment. break;

AP.OutStreamer->emitRawComment(OS.str());

return true;

} }

if (MemLoc) // The second operand is only an offset if it's an immediate.

if (MI->isIndirectDebugValue())

Offset = StackOffset::getFixed(MI->getDebugOffset().getImm());

if (Offset)

OS << '['; OS << '[';

OS << printReg(Reg, AP.MF->getSubtarget().getRegisterInfo()); OS << printReg(Reg, AP.MF->getSubtarget().getRegisterInfo());

if (Offset)

OS << '+' << Offset->getFixed() << ']';

break;

}

default:

llvm_unreachable("Unknown operand type");

}

} }

if (MemLoc)

OS << '+' << Offset.getFixed() << ']';

// NOTE: Want this comment at start of line, don't emit with AddComment. // NOTE: Want this comment at start of line, don't emit with AddComment.

AP.OutStreamer->emitRawComment(OS.str()); AP.OutStreamer->emitRawComment(OS.str());

return true; return true;

} }

/// This method handles the target-independent form of DBG_LABEL, returning /// This method handles the target-independent form of DBG_LABEL, returning

/// true if it was able to do so. A false return means the target will need /// true if it was able to do so. A false return means the target will need

[...]

llvm/lib/CodeGen/AsmPrinter/DbgEntityHistoryCalculator.cpp

[...]
using namespace llvm;		using namespace llvm;

#define DEBUG_TYPE "dwarfdebug"		#define DEBUG_TYPE "dwarfdebug"

namespace {		namespace {
using EntryIndex = DbgValueHistoryMap::EntryIndex;		using EntryIndex = DbgValueHistoryMap::EntryIndex;
}		}

// If @MI is a DBG_VALUE with debug value described by a
// defined register, returns the number of this register.
// In the other case, returns 0.
static Register isDescribedByReg(const MachineInstr &MI) {
assert(MI.isDebugValue());
assert(MI.getNumOperands() == 4);
// If the location of variable is an entry value (DW_OP_LLVM_entry_value)
// do not consider it as a register location.
if (MI.getDebugExpression()->isEntryValue())
return 0;
// If location of variable is described using a register (directly or
// indirectly), this register is always a first operand.
return MI.getDebugOperand(0).isReg() ? MI.getDebugOperand(0).getReg()
: Register();
}

void InstructionOrdering::initialize(const MachineFunction &MF) {		void InstructionOrdering::initialize(const MachineFunction &MF) {
// We give meta instructions the same ordinal as the preceding instruction		// We give meta instructions the same ordinal as the preceding instruction
// because this class is written for the task of comparing positions of		// because this class is written for the task of comparing positions of
// variable location ranges against scope ranges. To reflect what we'll see		// variable location ranges against scope ranges. To reflect what we'll see
// in the binary, when we look at location ranges we must consider all		// in the binary, when we look at location ranges we must consider all
// DBG_VALUEs between two real instructions at the same position. And a		// DBG_VALUEs between two real instructions at the same position. And a
// scope range which ends on a meta instruction should be considered to end		// scope range which ends on a meta instruction should be considered to end
// at the last seen real instruction. E.g.		// at the last seen real instruction. E.g.
[...]	static void addRegDescribedVar(RegDescribedVarsMap &RegVars, unsigned RegNo,
InlinedEntity Var) {		InlinedEntity Var) {
assert(RegNo != 0U);		assert(RegNo != 0U);
auto &VarSet = RegVars[RegNo];		auto &VarSet = RegVars[RegNo];
assert(!is_contained(VarSet, Var));		assert(!is_contained(VarSet, Var));
VarSet.push_back(Var);		VarSet.push_back(Var);
}		}

/// Create a clobbering entry and end all open debug value entries		/// Create a clobbering entry and end all open debug value entries
/// for \p Var that are described by \p RegNo using that entry.		/// for \p Var that are described by \p RegNo using that entry. Inserts into \p
		/// FellowRegisters the set of Registers that were also used to describe \p Var
		/// alongside \p RegNo.
static void clobberRegEntries(InlinedEntity Var, unsigned RegNo,		static void clobberRegEntries(InlinedEntity Var, unsigned RegNo,
const MachineInstr &ClobberingInstr,		const MachineInstr &ClobberingInstr,
DbgValueEntriesMap &LiveEntries,		DbgValueEntriesMap &LiveEntries,
DbgValueHistoryMap &HistMap) {		DbgValueHistoryMap &HistMap,
		SmallVectorImpl<Register> &FellowRegisters) {
EntryIndex ClobberIndex = HistMap.startClobber(Var, ClobberingInstr);		EntryIndex ClobberIndex = HistMap.startClobber(Var, ClobberingInstr);

// Close all entries whose values are described by the register.		// Close all entries whose values are described by the register.
SmallVector<EntryIndex, 4> IndicesToErase;		SmallVector<EntryIndex, 4> IndicesToErase;
		// If a given register appears in a live DBG_VALUE_LIST for Var alongside the
		// clobbered register, and never appears in a live DBG_VALUE* for Var without
		// the clobbered register, then it is no longer linked to the variable.
		SmallSet<Register, 4> MaybeRemovedRegisters;
		SmallSet<Register, 4> KeepRegisters;
for (auto Index : LiveEntries[Var]) {		for (auto Index : LiveEntries[Var]) {
auto &Entry = HistMap.getEntry(Var, Index);		auto &Entry = HistMap.getEntry(Var, Index);
assert(Entry.isDbgValue() && "Not a DBG_VALUE in LiveEntries");		assert(Entry.isDbgValue() && "Not a DBG_VALUE in LiveEntries");
if (isDescribedByReg(*Entry.getInstr()) == RegNo) {		if (Entry.getInstr()->isDebugEntryValue())
		continue;
		if (Entry.getInstr()->hasDebugOperandForReg(RegNo)) {
IndicesToErase.push_back(Index);		IndicesToErase.push_back(Index);
Entry.endEntry(ClobberIndex);		Entry.endEntry(ClobberIndex);
		for (auto &MO : Entry.getInstr()->debug_operands())
		if (MO.isReg() && MO.getReg() && MO.getReg() != RegNo)
		MaybeRemovedRegisters.insert(MO.getReg());
		} else {
		for (auto &MO : Entry.getInstr()->debug_operands())
		if (MO.isReg() && MO.getReg())
		KeepRegisters.insert(MO.getReg());
}		}
}		}

		for (Register Reg : MaybeRemovedRegisters)
		if (!KeepRegisters.contains(Reg))
		FellowRegisters.push_back(Reg);

// Drop all entries that have ended.		// Drop all entries that have ended.
for (auto Index : IndicesToErase)		for (auto Index : IndicesToErase)
LiveEntries[Var].erase(Index);		LiveEntries[Var].erase(Index);
}		}

/// Add a new debug value for \p Var. Closes all overlapping debug values.		/// Add a new debug value for \p Var. Closes all overlapping debug values.
static void handleNewDebugValue(InlinedEntity Var, const MachineInstr &DV,		static void handleNewDebugValue(InlinedEntity Var, const MachineInstr &DV,
RegDescribedVarsMap &RegVars,		RegDescribedVarsMap &RegVars,
[...]	for (auto Index : LiveEntries[Var]) {
auto &Entry = HistMap.getEntry(Var, Index);		auto &Entry = HistMap.getEntry(Var, Index);
assert(Entry.isDbgValue() && "Not a DBG_VALUE in LiveEntries");		assert(Entry.isDbgValue() && "Not a DBG_VALUE in LiveEntries");
const MachineInstr &DV = *Entry.getInstr();		const MachineInstr &DV = *Entry.getInstr();
bool Overlaps = DIExpr->fragmentsOverlap(DV.getDebugExpression());		bool Overlaps = DIExpr->fragmentsOverlap(DV.getDebugExpression());
if (Overlaps) {		if (Overlaps) {
IndicesToErase.push_back(Index);		IndicesToErase.push_back(Index);
Entry.endEntry(NewIndex);		Entry.endEntry(NewIndex);
}		}
if (Register Reg = isDescribedByReg(DV))		if (!DV.isDebugEntryValue())
TrackedRegs[Reg] |= !Overlaps;		for (const MachineOperand &Op : DV.debug_operands())
		if (Op.isReg() && Op.getReg())
		TrackedRegs[Op.getReg()] |= !Overlaps;
}		}

// If the new debug value is described by a register, add tracking of		// If the new debug value is described by a register, add tracking of
// that register if it is not already tracked.		// that register if it is not already tracked.
if (Register NewReg = isDescribedByReg(DV)) {		if (!DV.isDebugEntryValue()) {
		for (const MachineOperand &Op : DV.debug_operands()) {
		if (Op.isReg() && Op.getReg()) {
		Register NewReg = Op.getReg();
if (!TrackedRegs.count(NewReg))		if (!TrackedRegs.count(NewReg))
addRegDescribedVar(RegVars, NewReg, Var);		addRegDescribedVar(RegVars, NewReg, Var);
LiveEntries[Var].insert(NewIndex);		LiveEntries[Var].insert(NewIndex);
TrackedRegs[NewReg] = true;		TrackedRegs[NewReg] = true;
}		}
		}
		}

// Drop tracking of registers that are no longer used.		// Drop tracking of registers that are no longer used.
for (auto I : TrackedRegs)		for (auto I : TrackedRegs)
if (!I.second)		if (!I.second)
dropRegDescribedVar(RegVars, I.first, Var);		dropRegDescribedVar(RegVars, I.first, Var);

// Drop all entries that have ended, and mark the new entry as live.		// Drop all entries that have ended, and mark the new entry as live.
for (auto Index : IndicesToErase)		for (auto Index : IndicesToErase)
LiveEntries[Var].erase(Index);		LiveEntries[Var].erase(Index);
LiveEntries[Var].insert(NewIndex);		LiveEntries[Var].insert(NewIndex);
}		}
}		}

// Terminate the location range for variables described by register at		// Terminate the location range for variables described by register at
// @I by inserting @ClobberingInstr to their history.		// @I by inserting @ClobberingInstr to their history.
static void clobberRegisterUses(RegDescribedVarsMap &RegVars,		static void clobberRegisterUses(RegDescribedVarsMap &RegVars,
RegDescribedVarsMap::iterator I,		RegDescribedVarsMap::iterator I,
DbgValueHistoryMap &HistMap,		DbgValueHistoryMap &HistMap,
DbgValueEntriesMap &LiveEntries,		DbgValueEntriesMap &LiveEntries,
const MachineInstr &ClobberingInstr) {		const MachineInstr &ClobberingInstr) {
// Iterate over all variables described by this register and add this		// Iterate over all variables described by this register and add this
// instruction to their history, clobbering it.		// instruction to their history, clobbering it. All registers that also
for (const auto &Var : I->second)		// describe the clobbered variables (i.e. in variadic debug values) will have
clobberRegEntries(Var, I->first, ClobberingInstr, LiveEntries, HistMap);		// those Variables removed from their DescribedVars.
		for (const auto &Var : I->second) {
		SmallVector<Register, 4> FellowRegisters;
		clobberRegEntries(Var, I->first, ClobberingInstr, LiveEntries, HistMap,
		FellowRegisters);
		for (Register RegNo : FellowRegisters)
		dropRegDescribedVar(RegVars, RegNo, Var);
		}
RegVars.erase(I);		RegVars.erase(I);
}		}

// Terminate the location range for variables described by register		// Terminate the location range for variables described by register
// @RegNo by inserting @ClobberingInstr to their history.		// @RegNo by inserting @ClobberingInstr to their history.
static void clobberRegisterUses(RegDescribedVarsMap &RegVars, unsigned RegNo,		static void clobberRegisterUses(RegDescribedVarsMap &RegVars, unsigned RegNo,
DbgValueHistoryMap &HistMap,		DbgValueHistoryMap &HistMap,
DbgValueEntriesMap &LiveEntries,		DbgValueEntriesMap &LiveEntries,
[...]
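
The register bookkeeping added to clobberRegEntries above amounts to a set difference: registers that share a live entry with the clobbered register, minus registers still used by a live entry that does not mention it. A minimal standalone sketch of that computation follows. `Reg`, `LiveEntry` and `fellowRegisters` are hypothetical names, each live entry is modelled only by the registers of its debug operands, and entry-value handling is left out.

```cpp
#include <algorithm>
#include <set>
#include <vector>

using Reg = unsigned;              // hypothetical stand-in for llvm::Register
using LiveEntry = std::vector<Reg>; // registers used by one live debug value

// Returns the registers that stop describing the variable once Clobbered is
// clobbered: they appear alongside Clobbered in some live entry and in no
// live entry that survives the clobber.
std::vector<Reg> fellowRegisters(const std::vector<LiveEntry> &Live,
                                 Reg Clobbered) {
  std::set<Reg> MaybeRemoved, Keep;
  for (const LiveEntry &Entry : Live) {
    bool UsesClobbered =
        std::find(Entry.begin(), Entry.end(), Clobbered) != Entry.end();
    for (Reg R : Entry) {
      if (R == 0)
        continue; // register 0 means undef
      if (!UsesClobbered)
        Keep.insert(R); // this entry stays live, so R is still needed
      else if (R != Clobbered)
        MaybeRemoved.insert(R); // this entry ends with the clobber
    }
  }
  std::vector<Reg> Result;
  for (Reg R : MaybeRemoved)
    if (!Keep.count(R))
      Result.push_back(R);
  return Result;
}
```

For example, with live entries {1, 2} and {3}, clobbering register 1 releases register 2. With live entries {1, 2} and {2, 3}, nothing is released, because register 2 is still used by the surviving entry.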

llvm/lib/CodeGen/AsmPrinter/DebugHandlerBase.cpp

//===-- llvm/lib/CodeGen/AsmPrinter/DebugHandlerBase.cpp -------- C++ ---===//		//===-- llvm/lib/CodeGen/AsmPrinter/DebugHandlerBase.cpp -------- C++ ---===//
		scott.linder (Unsubmitted, Not Done): Would it make sense to still try applying the heuristic if `size(Instruction.debug_values()) == 1 && Instruction.getDebugOperand(0).isReg() && DIExpr->expr_op_begin()->getOp == dwarf::DW_OP_LLVM_arg`? I think this only really makes a difference when we eliminate the old version, but I assume we will still want this thing to work for the cases it can?
//		//
// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.		// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
// See https://llvm.org/LICENSE.txt for license information.		// See https://llvm.org/LICENSE.txt for license information.
// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception		// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
//		//
// Common functionality for different debug information format backends.		// Common functionality for different debug information format backends.
[...]
/// If true, we drop variable location ranges which exist entirely outside the		/// If true, we drop variable location ranges which exist entirely outside the
/// variable's lexical scope instruction ranges.		/// variable's lexical scope instruction ranges.
static cl::opt<bool> TrimVarLocs("trim-var-locs", cl::Hidden, cl::init(true));		static cl::opt<bool> TrimVarLocs("trim-var-locs", cl::Hidden, cl::init(true));

Optional<DbgVariableLocation>		Optional<DbgVariableLocation>
DbgVariableLocation::extractFromMachineInstruction(		DbgVariableLocation::extractFromMachineInstruction(
const MachineInstr &Instruction) {		const MachineInstr &Instruction) {
DbgVariableLocation Location;		DbgVariableLocation Location;
if (!Instruction.isDebugValue())		// Variables calculated from multiple locations can't be represented here.
		if (Instruction.getNumDebugOperands() != 1)
return None;		return None;
if (!Instruction.getDebugOperand(0).isReg())		if (!Instruction.getDebugOperand(0).isReg())
return None;		return None;
Location.Register = Instruction.getDebugOperand(0).getReg();		Location.Register = Instruction.getDebugOperand(0).getReg();
Location.FragmentInfo.reset();		Location.FragmentInfo.reset();
// We only handle expressions generated by DIExpression::appendOffset,		// We only handle expressions generated by DIExpression::appendOffset,
// which doesn't require a full stack machine.		// which doesn't require a full stack machine.
int64_t Offset = 0;		int64_t Offset = 0;
const DIExpression *DIExpr = Instruction.getDebugExpression();		const DIExpression *DIExpr = Instruction.getDebugExpression();
auto Op = DIExpr->expr_op_begin();		auto Op = DIExpr->expr_op_begin();
		// We can handle a DBG_VALUE_LIST iff it has exactly one location operand that
		// appears exactly once at the start of the expression.
		if (Instruction.isDebugValueList()) {
		if (Instruction.getNumDebugOperands() == 1 &&
		Op->getOp() == dwarf::DW_OP_LLVM_arg)
		++Op;
		else
		return None;
		}
while (Op != DIExpr->expr_op_end()) {		while (Op != DIExpr->expr_op_end()) {
switch (Op->getOp()) {		switch (Op->getOp()) {
case dwarf::DW_OP_constu: {		case dwarf::DW_OP_constu: {
int Value = Op->getArg(0);		int Value = Op->getArg(0);
++Op;		++Op;
if (Op != DIExpr->expr_op_end()) {		if (Op != DIExpr->expr_op_end()) {
switch (Op->getOp()) {		switch (Op->getOp()) {
case dwarf::DW_OP_minus:		case dwarf::DW_OP_minus:
[...]	void DebugHandlerBase::beginFunction(const MachineFunction *MF) {

// Request labels for the full history.		// Request labels for the full history.
for (const auto &I : DbgValues) {		for (const auto &I : DbgValues) {
const auto &Entries = I.second;		const auto &Entries = I.second;
if (Entries.empty())		if (Entries.empty())
continue;		continue;

auto IsDescribedByReg = [](const MachineInstr *MI) {		auto IsDescribedByReg = [](const MachineInstr *MI) {
return MI->getDebugOperand(0).isReg() && MI->getDebugOperand(0).getReg();		return any_of(MI->debug_operands(),
		[](auto &MO) { return MO.isReg() && MO.getReg(); });
};		};

// The first mention of a function argument gets the CurrentFnBegin label,		// The first mention of a function argument gets the CurrentFnBegin label,
// so arguments are visible when breaking at function entry.		// so arguments are visible when breaking at function entry.
//		//
// We do not change the label for values that are described by registers,		// We do not change the label for values that are described by registers,
// as that could place them above their defining instructions. We should		// as that could place them above their defining instructions. We should
// ideally not change the labels for constant debug values either, since		// ideally not change the labels for constant debug values either, since
[...]
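
The DBG_VALUE_LIST check added to extractFromMachineInstruction above can be modelled in isolation. The sketch below is an illustration under assumptions, not the patch's code: `Op`, `DebugInstrModel` and `parseStart` are hypothetical names, opcodes are plain enum tags rather than `dwarf::DW_OP_*` values, and the empty-expression guard is an addition made for the sketch.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical opcode tags standing in for dwarf::DW_OP_* constants.
enum class Op { LLVMArg, Constu, Plus, Minus, Deref };

struct DebugInstrModel {
  bool IsList;                  // models Instruction.isDebugValueList()
  std::size_t NumDebugOperands; // models Instruction.getNumDebugOperands()
  std::vector<Op> Expr;         // models the DIExpression operations
};

// Returns the index of the first expression op the offset parser should
// look at, or -1 if the instruction cannot be represented.
int parseStart(const DebugInstrModel &MI) {
  // Variables calculated from multiple locations can't be represented.
  if (MI.NumDebugOperands != 1)
    return -1;
  if (!MI.IsList)
    return 0; // plain DBG_VALUE: parse the whole expression
  // A list form is accepted only if its single operand is referenced at the
  // very start of the expression (the empty check is added for the sketch).
  if (MI.Expr.empty() || MI.Expr[0] != Op::LLVMArg)
    return -1;
  return 1; // skip the leading argument reference
}
```

A list-form instruction with one operand and an expression beginning with the argument reference yields 1, a plain DBG_VALUE yields 0, and anything with more than one debug operand yields -1.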

llvm/lib/CodeGen/AsmPrinter/DebugLocEntry.h

[...]	struct TargetIndexLocation {
TargetIndexLocation(unsigned Idx, int64_t Offset)		TargetIndexLocation(unsigned Idx, int64_t Offset)
: Index(Idx), Offset(Offset) {}		: Index(Idx), Offset(Offset) {}

bool operator==(const TargetIndexLocation &Other) const {		bool operator==(const TargetIndexLocation &Other) const {
return Index == Other.Index && Offset == Other.Offset;		return Index == Other.Index && Offset == Other.Offset;
}		}
};		};

/// A single location or constant.		/// A single location or constant within a variable location description, with
class DbgValueLoc {		/// either a single entry (with an optional DIExpression) used for a DBG_VALUE,
/// Any complex address location expression for this DbgValueLoc.		/// or a list of entries used for a DBG_VALUE_LIST.
const DIExpression *Expression;		class DbgValueLocEntry {

/// Type of entry that this represents.		/// Type of entry that this represents.
enum EntryType {		enum EntryType {
E_Location,		E_Location,
E_Integer,		E_Integer,
E_ConstantFP,		E_ConstantFP,
E_ConstantInt,		E_ConstantInt,
E_TargetIndexLocation		E_TargetIndexLocation
[...]	class DbgValueLocEntry {
union {		union {
/// Or a location in the machine frame.		/// Or a location in the machine frame.
MachineLocation Loc;		MachineLocation Loc;
/// Or a location from target specific location.		/// Or a location from target specific location.
TargetIndexLocation TIL;		TargetIndexLocation TIL;
};		};

public:		public:
DbgValueLoc(const DIExpression *Expr, int64_t i)		DbgValueLocEntry(int64_t i) : EntryKind(E_Integer) { Constant.Int = i; }
: Expression(Expr), EntryKind(E_Integer) {		DbgValueLocEntry(const ConstantFP *CFP) : EntryKind(E_ConstantFP) {
Constant.Int = i;
}
DbgValueLoc(const DIExpression *Expr, const ConstantFP *CFP)
: Expression(Expr), EntryKind(E_ConstantFP) {
Constant.CFP = CFP;		Constant.CFP = CFP;
}		}
DbgValueLoc(const DIExpression *Expr, const ConstantInt *CIP)		DbgValueLocEntry(const ConstantInt *CIP) : EntryKind(E_ConstantInt) {
: Expression(Expr), EntryKind(E_ConstantInt) {
Constant.CIP = CIP;		Constant.CIP = CIP;
}		}
DbgValueLoc(const DIExpression *Expr, MachineLocation Loc)		DbgValueLocEntry(MachineLocation Loc) : EntryKind(E_Location), Loc(Loc) {}
: Expression(Expr), EntryKind(E_Location), Loc(Loc) {		DbgValueLocEntry(TargetIndexLocation Loc)
assert(cast<DIExpression>(Expr)->isValid());		: EntryKind(E_TargetIndexLocation), TIL(Loc) {}
}
DbgValueLoc(const DIExpression *Expr, TargetIndexLocation Loc)
: Expression(Expr), EntryKind(E_TargetIndexLocation), TIL(Loc) {}

bool isLocation() const { return EntryKind == E_Location; }		bool isLocation() const { return EntryKind == E_Location; }
bool isTargetIndexLocation() const {		bool isTargetIndexLocation() const {
return EntryKind == E_TargetIndexLocation;		return EntryKind == E_TargetIndexLocation;
}		}
bool isInt() const { return EntryKind == E_Integer; }		bool isInt() const { return EntryKind == E_Integer; }
bool isConstantFP() const { return EntryKind == E_ConstantFP; }		bool isConstantFP() const { return EntryKind == E_ConstantFP; }
		scott.linder (Unsubmitted, Not Done): I think you can wrap this constructor body in `#ifndef NDEBUG`
bool isConstantInt() const { return EntryKind == E_ConstantInt; }		bool isConstantInt() const { return EntryKind == E_ConstantInt; }
int64_t getInt() const { return Constant.Int; }		int64_t getInt() const { return Constant.Int; }
const ConstantFP *getConstantFP() const { return Constant.CFP; }		const ConstantFP *getConstantFP() const { return Constant.CFP; }
const ConstantInt *getConstantInt() const { return Constant.CIP; }		const ConstantInt *getConstantInt() const { return Constant.CIP; }
MachineLocation getLoc() const { return Loc; }		MachineLocation getLoc() const { return Loc; }
TargetIndexLocation getTargetIndexLocation() const { return TIL; }		TargetIndexLocation getTargetIndexLocation() const { return TIL; }
bool isFragment() const { return getExpression()->isFragment(); }		friend bool operator==(const DbgValueLocEntry &, const DbgValueLocEntry &);
bool isEntryVal() const { return getExpression()->isEntryValue(); }
const DIExpression *getExpression() const { return Expression; }
friend bool operator==(const DbgValueLoc &, const DbgValueLoc &);
friend bool operator<(const DbgValueLoc &, const DbgValueLoc &);
#if !defined(NDEBUG) || defined(LLVM_ENABLE_DUMP)		#if !defined(NDEBUG) || defined(LLVM_ENABLE_DUMP)
LLVM_DUMP_METHOD void dump() const {		LLVM_DUMP_METHOD void dump() const {
if (isLocation()) {		if (isLocation()) {
llvm::dbgs() << "Loc = { reg=" << Loc.getReg() << " ";		llvm::dbgs() << "Loc = { reg=" << Loc.getReg() << " ";
if (Loc.isIndirect())		if (Loc.isIndirect())
llvm::dbgs() << "+0";		llvm::dbgs() << "+0";
llvm::dbgs() << "} ";		llvm::dbgs() << "} ";
		scott.linder (Unsubmitted, Not Done): Same here, the whole constructor should be `#ifndef NDEBUG`
} else if (isConstantInt())		} else if (isConstantInt())
Constant.CIP->dump();		Constant.CIP->dump();
else if (isConstantFP())		else if (isConstantFP())
Constant.CFP->dump();		Constant.CFP->dump();
		}
		#endif
		};

		/// The location of a single variable, composed of an expression and 0 or more
		/// DbgValueLocEntries.
		class DbgValueLoc {
		/// Any complex address location expression for this DbgValueLoc.
		const DIExpression *Expression;

		SmallVector<DbgValueLocEntry, 2> ValueLocEntries;

		bool IsVariadic;

		public:
		DbgValueLoc(const DIExpression *Expr, ArrayRef<DbgValueLocEntry> Locs)
		: Expression(Expr), ValueLocEntries(Locs.begin(), Locs.end()),
		IsVariadic(true) {
		#ifndef NDEBUG
		// Currently, DBG_VALUE_VAR expressions must use stack_value.
		assert(Expr && Expr->isValid() &&
		is_contained(Locs, dwarf::DW_OP_stack_value));
		for (DbgValueLocEntry &Entry : ValueLocEntries) {
		assert(!Entry.isConstantFP() && !Entry.isConstantInt() &&
		"Constant values should only be present in non-variadic "
		"DBG_VALUEs.");
		}
		#endif
		}

		DbgValueLoc(const DIExpression *Expr, ArrayRef<DbgValueLocEntry> Locs,
		bool IsVariadic)
		: Expression(Expr), ValueLocEntries(Locs.begin(), Locs.end()),
		IsVariadic(IsVariadic) {
		#ifndef NDEBUG
		assert(cast<DIExpression>(Expr)->isValid() ||
		!any_of(Locs, [](auto LE) { return LE.isLocation(); }));
		if (!IsVariadic) {
		assert(ValueLocEntries.size() == 1);
		} else {
		// Currently, DBG_VALUE_VAR expressions must use stack_value.
		assert(Expr && Expr->isValid() &&
		is_contained(Expr->getElements(), dwarf::DW_OP_stack_value));
		for (DbgValueLocEntry &Entry : ValueLocEntries) {
		assert(!Entry.isConstantFP() && !Entry.isConstantInt() &&
		"Constant values should only be present in non-variadic "
		"DBG_VALUEs.");
		}
		}
		#endif
		}

		DbgValueLoc(const DIExpression *Expr, DbgValueLocEntry Loc)
		: Expression(Expr), ValueLocEntries(1, Loc), IsVariadic(false) {
		assert(((Expr && Expr->isValid()) || !Loc.isLocation()) &&
		"DBG_VALUE with a machine location must have a valid expression.");
		}

		bool isFragment() const { return getExpression()->isFragment(); }
		bool isEntryVal() const { return getExpression()->isEntryValue(); }
		bool isVariadic() const { return IsVariadic; }
		const DIExpression *getExpression() const { return Expression; }
		const ArrayRef<DbgValueLocEntry> getLocEntries() const {
		return ValueLocEntries;
		}
		friend bool operator==(const DbgValueLoc &, const DbgValueLoc &);
		friend bool operator<(const DbgValueLoc &, const DbgValueLoc &);
		#if !defined(NDEBUG) || defined(LLVM_ENABLE_DUMP)
		LLVM_DUMP_METHOD void dump() const {
		for (DbgValueLocEntry DV : ValueLocEntries)
		DV.dump();
if (Expression)		if (Expression)
Expression->dump();		Expression->dump();
}		}
#endif		#endif
};		};

/// This struct describes location entries emitted in the .debug_loc		/// This struct describes location entries emitted in the .debug_loc
/// section.		/// section.
[53 lines not shown]	public:

/// Lower this entry into a DWARF expression.		/// Lower this entry into a DWARF expression.
void finalize(const AsmPrinter &AP,		void finalize(const AsmPrinter &AP,
DebugLocStream::ListBuilder &List,		DebugLocStream::ListBuilder &List,
const DIBasicType *BT,		const DIBasicType *BT,
DwarfCompileUnit &TheCU);		DwarfCompileUnit &TheCU);
};		};

/// Compare two DbgValueLocs for equality.		/// Compare two DbgValueLocEntries for equality.
inline bool operator==(const DbgValueLoc &A,		inline bool operator==(const DbgValueLocEntry &A, const DbgValueLocEntry &B) {
const DbgValueLoc &B) {
if (A.EntryKind != B.EntryKind)		if (A.EntryKind != B.EntryKind)
return false;		return false;

if (A.Expression != B.Expression)
return false;

switch (A.EntryKind) {		switch (A.EntryKind) {
case DbgValueLoc::E_Location:		case DbgValueLocEntry::E_Location:
return A.Loc == B.Loc;		return A.Loc == B.Loc;
case DbgValueLoc::E_TargetIndexLocation:		case DbgValueLocEntry::E_TargetIndexLocation:
return A.TIL == B.TIL;		return A.TIL == B.TIL;
case DbgValueLoc::E_Integer:		case DbgValueLocEntry::E_Integer:
return A.Constant.Int == B.Constant.Int;		return A.Constant.Int == B.Constant.Int;
case DbgValueLoc::E_ConstantFP:		case DbgValueLocEntry::E_ConstantFP:
return A.Constant.CFP == B.Constant.CFP;		return A.Constant.CFP == B.Constant.CFP;
case DbgValueLoc::E_ConstantInt:		case DbgValueLocEntry::E_ConstantInt:
return A.Constant.CIP == B.Constant.CIP;		return A.Constant.CIP == B.Constant.CIP;
}		}
llvm_unreachable("unhandled EntryKind");		llvm_unreachable("unhandled EntryKind");
}		}

		/// Compare two DbgValueLocs for equality.
		inline bool operator==(const DbgValueLoc &A, const DbgValueLoc &B) {
		return A.ValueLocEntries == B.ValueLocEntries &&
		A.Expression == B.Expression && A.IsVariadic == B.IsVariadic;
		}

/// Compare two fragments based on their offset.		/// Compare two fragments based on their offset.
inline bool operator<(const DbgValueLoc &A,		inline bool operator<(const DbgValueLoc &A,
const DbgValueLoc &B) {		const DbgValueLoc &B) {
return A.getExpression()->getFragmentInfo()->OffsetInBits <		return A.getExpression()->getFragmentInfo()->OffsetInBits <
B.getExpression()->getFragmentInfo()->OffsetInBits;		B.getExpression()->getFragmentInfo()->OffsetInBits;
}		}

}		}

#endif		#endif

llvm/lib/CodeGen/AsmPrinter/DwarfCompileUnit.cpp

[75 lines not shown]	void DwarfCompileUnit::addLabelAddress(DIE &Die, dwarf::Attribute Attribute,

bool UseAddrOffsetFormOrExpressions =		bool UseAddrOffsetFormOrExpressions =
DD->useAddrOffsetForm() || DD->useAddrOffsetExpressions();		DD->useAddrOffsetForm() || DD->useAddrOffsetExpressions();

const MCSymbol *Base = nullptr;		const MCSymbol *Base = nullptr;
if (Label->isInSection() && UseAddrOffsetFormOrExpressions)		if (Label->isInSection() && UseAddrOffsetFormOrExpressions)
Base = DD->getSectionLabel(&Label->getSection());		Base = DD->getSectionLabel(&Label->getSection());

if (!Base || Base == Label) {		if (!Base || Base == Label) {
unsigned idx = DD->getAddressPool().getIndex(Label);		unsigned idx = DD->getAddressPool().getIndex(Label);
		jmorse: To check my understanding: for non-variadic variables emission of integer operands might be signed or unsigned, depending on the type of the variable, we can see that earlier in this function and down in DwarfDebug::emitDebugLocValue. For a variadic expression, however, it's expected that the value gets "compiled in" unsigned, hence we add as unsigned here, yes?
		StephenTozer (author): This is really just a "dumb" carry-over of the logic above; when there is a non-trivial expression for a non-variadic debug value with an integer location in this function, it looks as though we just add the unsigned bytes and call it a day. I think the intent is that the signedness is ultimately interpreted according to the variable type? But in all existing cases I think the approach is flawed; unless you have an empty DIExpression, there's no guarantee that the signedness of the variable matches the signedness of the location operand.
Die.addValue(DIEValueAllocator, Attribute,		Die.addValue(DIEValueAllocator, Attribute,
DD->getDwarfVersion() >= 5 ? dwarf::DW_FORM_addrx		DD->getDwarfVersion() >= 5 ? dwarf::DW_FORM_addrx
: dwarf::DW_FORM_GNU_addr_index,		: dwarf::DW_FORM_GNU_addr_index,
DIEInteger(idx));		DIEInteger(idx));
return;		return;
}		}

// Could be extended to work with DWARFv4 Split DWARF if that's important for		// Could be extended to work with DWARFv4 Split DWARF if that's important for
[629 lines not shown]	if (Index != ~0U) {
if (TagOffset)		if (TagOffset)
addUInt(*VariableDie, dwarf::DW_AT_LLVM_tag_offset, dwarf::DW_FORM_data1,		addUInt(*VariableDie, dwarf::DW_AT_LLVM_tag_offset, dwarf::DW_FORM_data1,
*TagOffset);		*TagOffset);
return VariableDie;		return VariableDie;
}		}

// Check if variable has a single location description.		// Check if variable has a single location description.
if (auto *DVal = DV.getValueLoc()) {		if (auto *DVal = DV.getValueLoc()) {
if (DVal->isLocation())		if (!DVal->isVariadic()) {
addVariableAddress(DV, *VariableDie, DVal->getLoc());		const DbgValueLocEntry *Entry = DVal->getLocEntries().begin();
else if (DVal->isInt()) {		if (Entry->isLocation()) {
		addVariableAddress(DV, *VariableDie, Entry->getLoc());
		} else if (Entry->isInt()) {
auto *Expr = DV.getSingleExpression();		auto *Expr = DV.getSingleExpression();
if (Expr && Expr->getNumElements()) {		if (Expr && Expr->getNumElements()) {
DIELoc *Loc = new (DIEValueAllocator) DIELoc;		DIELoc *Loc = new (DIEValueAllocator) DIELoc;
DIEDwarfExpression DwarfExpr(Asm, this, *Loc);		DIEDwarfExpression DwarfExpr(Asm, this, *Loc);
// If there is an expression, emit raw unsigned bytes.		// If there is an expression, emit raw unsigned bytes.
DwarfExpr.addFragmentOffset(Expr);		DwarfExpr.addFragmentOffset(Expr);
DwarfExpr.addUnsignedConstant(DVal->getInt());		DwarfExpr.addUnsignedConstant(Entry->getInt());
DwarfExpr.addExpression(Expr);		DwarfExpr.addExpression(Expr);
addBlock(*VariableDie, dwarf::DW_AT_location, DwarfExpr.finalize());		addBlock(*VariableDie, dwarf::DW_AT_location, DwarfExpr.finalize());
if (DwarfExpr.TagOffset)		if (DwarfExpr.TagOffset)
addUInt(*VariableDie, dwarf::DW_AT_LLVM_tag_offset,		addUInt(*VariableDie, dwarf::DW_AT_LLVM_tag_offset,
dwarf::DW_FORM_data1, *DwarfExpr.TagOffset);		dwarf::DW_FORM_data1, *DwarfExpr.TagOffset);

} else		} else
addConstantValue(*VariableDie, DVal->getInt(), DV.getType());		addConstantValue(*VariableDie, Entry->getInt(), DV.getType());
} else if (DVal->isConstantFP()) {		} else if (Entry->isConstantFP()) {
addConstantFPValue(*VariableDie, DVal->getConstantFP());		addConstantFPValue(*VariableDie, Entry->getConstantFP());
} else if (DVal->isConstantInt()) {		} else if (Entry->isConstantInt()) {
addConstantValue(*VariableDie, DVal->getConstantInt(), DV.getType());		addConstantValue(*VariableDie, Entry->getConstantInt(), DV.getType());
} else if (DVal->isTargetIndexLocation()) {		} else if (Entry->isTargetIndexLocation()) {
DIELoc *Loc = new (DIEValueAllocator) DIELoc;		DIELoc *Loc = new (DIEValueAllocator) DIELoc;
DIEDwarfExpression DwarfExpr(Asm, this, *Loc);		DIEDwarfExpression DwarfExpr(Asm, this, *Loc);
const DIBasicType *BT = dyn_cast<DIBasicType>(		const DIBasicType *BT = dyn_cast<DIBasicType>(
static_cast<const Metadata *>(DV.getVariable()->getType()));		static_cast<const Metadata *>(DV.getVariable()->getType()));
DwarfDebug::emitDebugLocValue(Asm, BT, DVal, DwarfExpr);		DwarfDebug::emitDebugLocValue(Asm, BT, DVal, DwarfExpr);
addBlock(*VariableDie, dwarf::DW_AT_location, DwarfExpr.finalize());		addBlock(*VariableDie, dwarf::DW_AT_location, DwarfExpr.finalize());
}		}
return VariableDie;		return VariableDie;
}		}
		// If any of the location entries are registers with the value 0, then the
		// location is undefined.
		if (any_of(DVal->getLocEntries(), [](const DbgValueLocEntry &Entry) {
		return Entry.isLocation() && !Entry.getLoc().getReg();
		}))
		return VariableDie;
		const DIExpression *Expr = DV.getSingleExpression();
		assert(Expr && "Variadic Debug Value must have an Expression.");
		DIELoc *Loc = new (DIEValueAllocator) DIELoc;
		DIEDwarfExpression DwarfExpr(Asm, this, *Loc);
		DwarfExpr.addFragmentOffset(Expr);
		DIExpressionCursor Cursor(Expr);
		const TargetRegisterInfo &TRI = *Asm->MF->getSubtarget().getRegisterInfo();

		// Declare the TargetMachine locally so we don't need to capture `this` in
		// the lambda.
		TargetMachine &TM = Asm->TM;
		auto AddEntry = [&DwarfExpr, &TRI, &TM](const DbgValueLocEntry &Entry,
		DIExpressionCursor &Cursor) {
		if (Entry.isLocation()) {
		if (!DwarfExpr.addMachineRegExpression(TRI, Cursor,
		Entry.getLoc().getReg()))
		return false;
		} else if (Entry.isInt()) {
		// If there is an expression, emit raw unsigned bytes.
		DwarfExpr.addUnsignedConstant(Entry.getInt());
		} else if (Entry.isConstantFP()) {
		APInt RawBytes = Entry.getConstantFP()->getValueAPF().bitcastToAPInt();
		DwarfExpr.addUnsignedConstant(RawBytes);
		} else if (Entry.isConstantInt()) {
		APInt RawBytes = Entry.getConstantInt()->getValue();
		DwarfExpr.addUnsignedConstant(RawBytes);
		} else if (Entry.isTargetIndexLocation()) {
		TargetIndexLocation Loc = Entry.getTargetIndexLocation();
		// TODO TargetIndexLocation is a target-independent. Currently only the
		// WebAssembly-specific encoding is supported.
		assert(TM.getTargetTriple().isWasm());
		DwarfExpr.addWasmLocation(Loc.Index, static_cast<uint64_t>(Loc.Offset));
		} else {
		llvm_unreachable("Unsupported Entry type.");
		}
		return true;
		};

		DwarfExpr.addExpression(
		std::move(Cursor),
		[&AddEntry, &DVal](unsigned Idx, DIExpressionCursor &Cursor) -> bool {
		return AddEntry(DVal->getLocEntries()[Idx], Cursor);
		});

		// Now attach the location information to the DIE.
		addBlock(*VariableDie, dwarf::DW_AT_location, DwarfExpr.finalize());
		if (DwarfExpr.TagOffset)
		addUInt(*VariableDie, dwarf::DW_AT_LLVM_tag_offset, dwarf::DW_FORM_data1,
		*DwarfExpr.TagOffset);

		return VariableDie;
		}

// .. else use frame index.		// .. else use frame index.
if (!DV.hasFrameIndexExprs())		if (!DV.hasFrameIndexExprs())
return VariableDie;		return VariableDie;

Optional<unsigned> NVPTXAddressSpace;		Optional<unsigned> NVPTXAddressSpace;
DIELoc *Loc = new (DIEValueAllocator) DIELoc;		DIELoc *Loc = new (DIEValueAllocator) DIELoc;
DIEDwarfExpression DwarfExpr(Asm, this, *Loc);		DIEDwarfExpression DwarfExpr(Asm, this, *Loc);
[731 lines not shown]

llvm/lib/CodeGen/AsmPrinter/DwarfDebug.cpp

[58 lines not shown]
#include <algorithm>		#include <algorithm>
#include <cstddef>		#include <cstddef>
#include <iterator>		#include <iterator>
#include <string>		#include <string>

using namespace llvm;		using namespace llvm;

#define DEBUG_TYPE "dwarfdebug"		#define DEBUG_TYPE "dwarfdebug"

		scott.linder: Was changing this condition intentional? If so can it be in a separate patch?
		StephenTozer (author): Unintentional, the change here got missed during a rebase (but is fixed in my local copy, which I'll push up shortly).
STATISTIC(NumCSParams, "Number of dbg call site params created");		STATISTIC(NumCSParams, "Number of dbg call site params created");

static cl::opt<bool> UseDwarfRangesBaseAddressSpecifier(		static cl::opt<bool> UseDwarfRangesBaseAddressSpecifier(
"use-dwarf-ranges-base-address-specifier", cl::Hidden,		"use-dwarf-ranges-base-address-specifier", cl::Hidden,
cl::desc("Use base address specifiers in debug_ranges"), cl::init(false));		cl::desc("Use base address specifiers in debug_ranges"), cl::init(false));

static cl::opt<bool> GenerateARangeSection("generate-arange-section",		static cl::opt<bool> GenerateARangeSection("generate-arange-section",
cl::Hidden,		cl::Hidden,
[154 lines not shown]

const DIType *DbgVariable::getType() const {		const DIType *DbgVariable::getType() const {
return getVariable()->getType();		return getVariable()->getType();
}		}

/// Get .debug_loc entry for the instruction range starting at MI.		/// Get .debug_loc entry for the instruction range starting at MI.
static DbgValueLoc getDebugLocValue(const MachineInstr *MI) {		static DbgValueLoc getDebugLocValue(const MachineInstr *MI) {
const DIExpression *Expr = MI->getDebugExpression();		const DIExpression *Expr = MI->getDebugExpression();
assert(MI->getNumOperands() == 4);		const bool IsVariadic = MI->isDebugValueList();
if (MI->getDebugOperand(0).isReg()) {		assert(MI->getNumOperands() >= 3);
const auto &RegOp = MI->getDebugOperand(0);		SmallVector<DbgValueLocEntry, 4> DbgValueLocEntries;
const auto &Op1 = MI->getDebugOffset();		for (const MachineOperand &Op : MI->debug_operands()) {
// If the second operand is an immediate, this is a		if (Op.isReg()) {
// register-indirect address.		MachineLocation MLoc(Op.getReg(),
assert((!Op1.isImm() \|\| (Op1.getImm() == 0)) && "unexpected offset");		MI->isNonListDebugValue() && MI->isDebugOffsetImm());
MachineLocation MLoc(RegOp.getReg(), Op1.isImm());		DbgValueLocEntries.push_back(DbgValueLocEntry(MLoc));
return DbgValueLoc(Expr, MLoc);		} else if (Op.isTargetIndex()) {
}		DbgValueLocEntries.push_back(
if (MI->getDebugOperand(0).isTargetIndex()) {		DbgValueLocEntry(TargetIndexLocation(Op.getIndex(), Op.getOffset())));
const auto &Op = MI->getDebugOperand(0);		} else if (Op.isImm())
return DbgValueLoc(Expr,		DbgValueLocEntries.push_back(DbgValueLocEntry(Op.getImm()));
TargetIndexLocation(Op.getIndex(), Op.getOffset()));		else if (Op.isFPImm())
}		DbgValueLocEntries.push_back(DbgValueLocEntry(Op.getFPImm()));
if (MI->getDebugOperand(0).isImm())		else if (Op.isCImm())
return DbgValueLoc(Expr, MI->getDebugOperand(0).getImm());		DbgValueLocEntries.push_back(DbgValueLocEntry(Op.getCImm()));
if (MI->getDebugOperand(0).isFPImm())		else
return DbgValueLoc(Expr, MI->getDebugOperand(0).getFPImm());		llvm_unreachable("Unexpected debug operand in DBG_VALUE* instruction!");
if (MI->getDebugOperand(0).isCImm())		}
return DbgValueLoc(Expr, MI->getDebugOperand(0).getCImm());		return DbgValueLoc(Expr, DbgValueLocEntries, IsVariadic);

llvm_unreachable("Unexpected 4-operand DBG_VALUE instruction!");
}		}

void DbgVariable::initializeDbgValue(const MachineInstr *DbgValue) {		void DbgVariable::initializeDbgValue(const MachineInstr *DbgValue) {
assert(FrameIndexExprs.empty() && "Already initialized?");		assert(FrameIndexExprs.empty() && "Already initialized?");
assert(!ValueLoc.get() && "Already initialized?");		assert(!ValueLoc.get() && "Already initialized?");

assert(getVariable() == DbgValue->getDebugVariable() && "Wrong variable");		assert(getVariable() == DbgValue->getDebugVariable() && "Wrong variable");
assert(getInlinedAt() == DbgValue->getDebugLoc()->getInlinedAt() &&		assert(getInlinedAt() == DbgValue->getDebugLoc()->getInlinedAt() &&
[371 lines not shown]	for (auto Param : DescribedParams) {
// parameter when walking through the instructions. Append that to the		// parameter when walking through the instructions. Append that to the
// base expression.		// base expression.
const DIExpression *CombinedExpr =		const DIExpression *CombinedExpr =
ShouldCombineExpressions ? combineDIExpressions(Expr, Param.Expr)		ShouldCombineExpressions ? combineDIExpressions(Expr, Param.Expr)
: Expr;		: Expr;
assert((!CombinedExpr || CombinedExpr->isValid()) &&		assert((!CombinedExpr || CombinedExpr->isValid()) &&
"Combined debug expression is invalid");		"Combined debug expression is invalid");

DbgValueLoc DbgLocVal(CombinedExpr, Val);		DbgValueLoc DbgLocVal(CombinedExpr, DbgValueLocEntry(Val));
DbgCallSiteParam CSParm(Param.ParamReg, DbgLocVal);		DbgCallSiteParam CSParm(Param.ParamReg, DbgLocVal);
Params.push_back(CSParm);		Params.push_back(CSParm);
++NumCSParams;		++NumCSParams;
}		}
}		}

/// Add \p Reg to the worklist, if it's not already present, and mark that the		/// Add \p Reg to the worklist, if it's not already present, and mark that the
/// given parameter registers' values can (potentially) be described using		/// given parameter registers' values can (potentially) be described using
[951 lines not shown]	static bool validThroughout(LexicalScopes &LScopes,
// If the range of the DBG_VALUE is open-ended, report success.		// If the range of the DBG_VALUE is open-ended, report success.
if (!RangeEnd)		if (!RangeEnd)
return true;		return true;

// Single, constant DBG_VALUEs in the prologue are promoted to be live		// Single, constant DBG_VALUEs in the prologue are promoted to be live
// throughout the function. This is a hack, presumably for DWARF v2 and not		// throughout the function. This is a hack, presumably for DWARF v2 and not
// necessarily correct. It would be much better to use a dbg.declare instead		// necessarily correct. It would be much better to use a dbg.declare instead
// if we know the constant is live throughout the scope.		// if we know the constant is live throughout the scope.
if (DbgValue->getDebugOperand(0).isImm() && MBB->pred_empty())		if (MBB->pred_empty() &&
		all_of(DbgValue->debug_operands(),
		[](const MachineOperand &Op) { return Op.isImm(); }))
return true;		return true;

// Test if the location terminates before the end of the scope.		// Test if the location terminates before the end of the scope.
const MachineInstr *LScopeEnd = LSRange.back().second;		const MachineInstr *LScopeEnd = LSRange.back().second;
if (Ordering.isBefore(RangeEnd, LScopeEnd))		if (Ordering.isBefore(RangeEnd, LScopeEnd))
return false;		return false;

// There's a single location which starts at the scope start, and ends at or		// There's a single location which starts at the scope start, and ends at or
[856 lines not shown]
}		}

void DwarfDebug::emitDebugLocValue(const AsmPrinter &AP, const DIBasicType *BT,		void DwarfDebug::emitDebugLocValue(const AsmPrinter &AP, const DIBasicType *BT,
const DbgValueLoc &Value,		const DbgValueLoc &Value,
DwarfExpression &DwarfExpr) {		DwarfExpression &DwarfExpr) {
auto *DIExpr = Value.getExpression();		auto *DIExpr = Value.getExpression();
DIExpressionCursor ExprCursor(DIExpr);		DIExpressionCursor ExprCursor(DIExpr);
DwarfExpr.addFragmentOffset(DIExpr);		DwarfExpr.addFragmentOffset(DIExpr);

		// If the DIExpr is an Entry Value, we want to follow the same code path
		// regardless of whether the DBG_VALUE is variadic or not.
		if (DIExpr && DIExpr->isEntryValue()) {
		// Entry values can only be a single register with no additional DIExpr,
		// so just add it directly.
		assert(Value.getLocEntries().size() == 1);
		assert(Value.getLocEntries()[0].isLocation());
		MachineLocation Location = Value.getLocEntries()[0].getLoc();
		DwarfExpr.setLocation(Location, DIExpr);

		DwarfExpr.beginEntryValueExpression(ExprCursor);

		const TargetRegisterInfo &TRI = *AP.MF->getSubtarget().getRegisterInfo();
		if (!DwarfExpr.addMachineRegExpression(TRI, ExprCursor, Location.getReg()))
		return;
		return DwarfExpr.addExpression(std::move(ExprCursor));
		}

// Regular entry.		// Regular entry.
if (Value.isInt()) {		auto EmitValueLocEntry = [&DwarfExpr, &BT,
		&AP](const DbgValueLocEntry &Entry,
		DIExpressionCursor &Cursor) -> bool {
		if (Entry.isInt()) {
if (BT && (BT->getEncoding() == dwarf::DW_ATE_signed ||		if (BT && (BT->getEncoding() == dwarf::DW_ATE_signed ||
BT->getEncoding() == dwarf::DW_ATE_signed_char))		BT->getEncoding() == dwarf::DW_ATE_signed_char))
DwarfExpr.addSignedConstant(Value.getInt());		DwarfExpr.addSignedConstant(Entry.getInt());
else		else
DwarfExpr.addUnsignedConstant(Value.getInt());		DwarfExpr.addUnsignedConstant(Entry.getInt());
} else if (Value.isLocation()) {		} else if (Entry.isLocation()) {
MachineLocation Location = Value.getLoc();		MachineLocation Location = Entry.getLoc();
DwarfExpr.setLocation(Location, DIExpr);		if (Location.isIndirect())
DIExpressionCursor Cursor(DIExpr);		DwarfExpr.setMemoryLocationKind();

if (DIExpr->isEntryValue())
DwarfExpr.beginEntryValueExpression(Cursor);

const TargetRegisterInfo &TRI = *AP.MF->getSubtarget().getRegisterInfo();		const TargetRegisterInfo &TRI = *AP.MF->getSubtarget().getRegisterInfo();
if (!DwarfExpr.addMachineRegExpression(TRI, Cursor, Location.getReg()))		if (!DwarfExpr.addMachineRegExpression(TRI, Cursor, Location.getReg()))
return;		return false;
return DwarfExpr.addExpression(std::move(Cursor));		} else if (Entry.isTargetIndexLocation()) {
} else if (Value.isTargetIndexLocation()) {		TargetIndexLocation Loc = Entry.getTargetIndexLocation();
TargetIndexLocation Loc = Value.getTargetIndexLocation();		// TODO TargetIndexLocation is a target-independent. Currently only the
// TODO TargetIndexLocation is a target-independent. Currently only the WebAssembly-specific		// WebAssembly-specific encoding is supported.
// encoding is supported.
assert(AP.TM.getTargetTriple().isWasm());		assert(AP.TM.getTargetTriple().isWasm());
DwarfExpr.addWasmLocation(Loc.Index, static_cast<uint64_t>(Loc.Offset));		DwarfExpr.addWasmLocation(Loc.Index, static_cast<uint64_t>(Loc.Offset));
DwarfExpr.addExpression(std::move(ExprCursor));		} else if (Entry.isConstantFP()) {
return;
} else if (Value.isConstantFP()) {
if (AP.getDwarfVersion() >= 4 && !AP.getDwarfDebug()->tuneForSCE() &&		if (AP.getDwarfVersion() >= 4 && !AP.getDwarfDebug()->tuneForSCE() &&
!ExprCursor) {		!Cursor) {
DwarfExpr.addConstantFP(Value.getConstantFP()->getValueAPF(), AP);		DwarfExpr.addConstantFP(Entry.getConstantFP()->getValueAPF(), AP);
return;		} else if (Entry.getConstantFP()
}		->getValueAPF()
if (Value.getConstantFP()->getValueAPF().bitcastToAPInt().getBitWidth() <=		.bitcastToAPInt()
64 /*bits*/)		.getBitWidth() <= 64 /*bits*/) {
DwarfExpr.addUnsignedConstant(		DwarfExpr.addUnsignedConstant(
Value.getConstantFP()->getValueAPF().bitcastToAPInt());		Entry.getConstantFP()->getValueAPF().bitcastToAPInt());
else		} else {
LLVM_DEBUG(		LLVM_DEBUG(
dbgs()		dbgs() << "Skipped DwarfExpression creation for ConstantFP of size"
<< "Skipped DwarfExpression creation for ConstantFP of size"		<< Entry.getConstantFP()
<< Value.getConstantFP()->getValueAPF().bitcastToAPInt().getBitWidth()		->getValueAPF()
		.bitcastToAPInt()
		.getBitWidth()
<< " bits\n");		<< " bits\n");
		return false;
}		}
		} else {
		llvm_unreachable("Invalid Entry for a DW_AT_location expression.");
		}
		return true;
		};

		if (!Value.isVariadic()) {
		if (!EmitValueLocEntry(Value.getLocEntries()[0], ExprCursor))
		return;
DwarfExpr.addExpression(std::move(ExprCursor));		DwarfExpr.addExpression(std::move(ExprCursor));
		return;
		}

		// If any of the location entries are registers with the value 0, then the
		// location is undefined.
		if (any_of(Value.getLocEntries(), [](const DbgValueLocEntry &Entry) {
		return Entry.isLocation() && !Entry.getLoc().getReg();
		}))
		return;

		DwarfExpr.addExpression(
		std::move(ExprCursor),
		[EmitValueLocEntry, &Value](unsigned Idx,
		DIExpressionCursor &Cursor) -> bool {
		return EmitValueLocEntry(Value.getLocEntries()[Idx], Cursor);
		});
}		}
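
The signed/unsigned choice made for integer entries in emitDebugLocValue above matters because the two forms encode the same bit pattern very differently. The following is a minimal, self-contained sketch of that difference, not LLVM's implementation: `emitIntOperand`, `encodeULEB128` and `encodeSLEB128` are hypothetical helpers written for this illustration, and the sketch always uses `DW_OP_constu` / `DW_OP_consts` rather than any shorter literal forms.

```cpp
#include <cassert>
#include <cstdint>
#include <vector>

// Opcode values as defined by the DWARF specification.
constexpr uint8_t DW_OP_constu = 0x10;
constexpr uint8_t DW_OP_consts = 0x11;

// Append Value to Out as unsigned LEB128.
void encodeULEB128(uint64_t Value, std::vector<uint8_t> &Out) {
  do {
    uint8_t Byte = Value & 0x7f;
    Value >>= 7;
    if (Value != 0)
      Byte |= 0x80; // More bytes follow.
    Out.push_back(Byte);
  } while (Value != 0);
}

// Append Value to Out as signed LEB128.
void encodeSLEB128(int64_t Value, std::vector<uint8_t> &Out) {
  bool More = true;
  while (More) {
    uint8_t Byte = Value & 0x7f;
    Value >>= 7; // Assumes an arithmetic shift for negative values.
    bool SignBit = (Byte & 0x40) != 0;
    if ((Value == 0 && !SignBit) || (Value == -1 && SignBit))
      More = false;
    else
      Byte |= 0x80; // More bytes follow.
    Out.push_back(Byte);
  }
}

// Hypothetical helper: encode one integer operand either as a signed or as
// an unsigned DWARF constant.
std::vector<uint8_t> emitIntOperand(int64_t Value, bool TreatAsSigned) {
  std::vector<uint8_t> Out;
  if (TreatAsSigned) {
    Out.push_back(DW_OP_consts);
    encodeSLEB128(Value, Out);
  } else {
    Out.push_back(DW_OP_constu);
    encodeULEB128(static_cast<uint64_t>(Value), Out);
  }
  assert(Out.size() >= 2 && "opcode plus at least one LEB128 byte");
  return Out;
}
```

With these helpers, `emitIntOperand(-1, true)` yields two bytes, while `emitIntOperand(-1, false)` yields the opcode followed by a ten-byte ULEB128 value. A consumer that later interprets the stack entry according to the variable's type sees the same 64-bit pattern either way, which is the assumption the review thread on signedness questions for non-empty expressions.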

void DebugLocEntry::finalize(const AsmPrinter &AP,		void DebugLocEntry::finalize(const AsmPrinter &AP,
DebugLocStream::ListBuilder &List,		DebugLocStream::ListBuilder &List,
const DIBasicType *BT,		const DIBasicType *BT,
DwarfCompileUnit &TheCU) {		DwarfCompileUnit &TheCU) {
assert(!Values.empty() &&		assert(!Values.empty() &&
"location list entries without values are redundant");		"location list entries without values are redundant");
[902 lines not shown]

llvm/lib/CodeGen/AsmPrinter/DwarfExpression.h

[341 lines not shown]	public:

/// Emit all remaining operations in the DIExpressionCursor.		/// Emit all remaining operations in the DIExpressionCursor.
///		///
/// \param FragmentOffsetInBits If this is one fragment out of multiple		/// \param FragmentOffsetInBits If this is one fragment out of multiple
/// locations, this is the offset of the		/// locations, this is the offset of the
/// fragment inside the entire variable.		/// fragment inside the entire variable.
void addExpression(DIExpressionCursor &&Expr,		void addExpression(DIExpressionCursor &&Expr,
unsigned FragmentOffsetInBits = 0);		unsigned FragmentOffsetInBits = 0);
		void
		addExpression(DIExpressionCursor &&Expr,
		std::function<bool(unsigned, DIExpressionCursor &)> InsertArg);

/// If applicable, emit an empty DW_OP_piece / DW_OP_bit_piece to advance to		/// If applicable, emit an empty DW_OP_piece / DW_OP_bit_piece to advance to
/// the fragment described by \c Expr.		/// the fragment described by \c Expr.
void addFragmentOffset(const DIExpression *Expr);		void addFragmentOffset(const DIExpression *Expr);

void emitLegacySExt(unsigned FromBits);		void emitLegacySExt(unsigned FromBits);
void emitLegacyZExt(unsigned FromBits);		void emitLegacyZExt(unsigned FromBits);

[79 lines not shown]

llvm/lib/CodeGen/AsmPrinter/DwarfExpression.cpp

[16 lines not shown]
#include "llvm/BinaryFormat/Dwarf.h"		#include "llvm/BinaryFormat/Dwarf.h"
#include "llvm/CodeGen/Register.h"		#include "llvm/CodeGen/Register.h"
#include "llvm/CodeGen/TargetRegisterInfo.h"		#include "llvm/CodeGen/TargetRegisterInfo.h"
#include "llvm/IR/DataLayout.h"		#include "llvm/IR/DataLayout.h"
#include "llvm/Support/ErrorHandling.h"		#include "llvm/Support/ErrorHandling.h"
#include <algorithm>		#include <algorithm>

using namespace llvm;		using namespace llvm;

		aprantl: By using a callback here the callee cannot use the advanced functionality of addMachineRegExpression for any but a leading DW_OP_LLVM_arg. Do you see a way of either generalizing addMachineRegExpression or otherwise reorganizing this so the addMachineRegExpression functionality becomes available to DBG_VALUE_LIST?
		StephenTozer (author): It actually can use that functionality - in this case, all of the functionality that would normally be applied to the location in the DBG_VALUE is applied by this callback. The callback in this case can advance the ExprCursor, so there are no issues with using addMachineRegExpression normally at any point in a DBG_VALUE_LIST.
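
The point made in the reply, that a callback handed the cursor can consume the operations following its argument, can be sketched with a toy model. Everything below is hypothetical and written only for illustration (`Op`, `ExprOp`, `Cursor`, `lowerExpression`, `lowerWithRegs`); it is not the DwarfExpression API, and the "fold a following constant offset into the register operand" step merely stands in for the kind of work addMachineRegExpression does.

```cpp
#include <cstddef>
#include <cstdint>
#include <functional>
#include <string>
#include <vector>

// Toy stand-ins for expression operations; names are hypothetical.
enum class Op { Arg, PlusUConst, Plus, StackValue };
struct ExprOp {
  Op Kind;
  uint64_t Operand;
};

// Minimal cursor over a list of operations: peek at or take the next one.
class Cursor {
  const std::vector<ExprOp> &Ops;
  size_t Pos = 0;

public:
  explicit Cursor(const std::vector<ExprOp> &Ops) : Ops(Ops) {}
  explicit operator bool() const { return Pos < Ops.size(); }
  const ExprOp *peek() const { return Pos < Ops.size() ? &Ops[Pos] : nullptr; }
  ExprOp take() { return Ops[Pos++]; }
};

using InsertArgFn = std::function<bool(unsigned, Cursor &)>;

// Walk the expression; every Arg is delegated to InsertArg, which receives
// the live cursor and may therefore consume the operations that follow.
bool lowerExpression(const std::vector<ExprOp> &Ops,
                     const InsertArgFn &InsertArg,
                     std::vector<std::string> &Out) {
  Cursor C(Ops);
  while (C) {
    ExprOp O = C.take();
    switch (O.Kind) {
    case Op::Arg:
      if (!InsertArg(static_cast<unsigned>(O.Operand), C))
        return false;
      break;
    case Op::PlusUConst:
      Out.push_back("plus_uconst " + std::to_string(O.Operand));
      break;
    case Op::Plus:
      Out.push_back("plus");
      break;
    case Op::StackValue:
      Out.push_back("stack_value");
      break;
    }
  }
  return true;
}

// Example driver: each argument is a register; a directly following
// PlusUConst is folded into the register's offset by the callback.
std::vector<std::string> lowerWithRegs(const std::vector<ExprOp> &Ops,
                                       const std::vector<unsigned> &Regs) {
  std::vector<std::string> Out;
  auto InsertArg = [&](unsigned Idx, Cursor &C) {
    if (Idx >= Regs.size())
      return false;
    uint64_t Offset = 0;
    const ExprOp *Next = C.peek();
    if (Next && Next->Kind == Op::PlusUConst)
      Offset = C.take().Operand; // Consume the op so the walker skips it.
    Out.push_back("breg" + std::to_string(Regs[Idx]) + " " +
                  std::to_string(Offset));
    return true;
  };
  if (!lowerExpression(Ops, InsertArg, Out))
    Out.clear();
  return Out;
}
```

In this model the folding works for the second argument just as well as for the first, because the walker resumes from wherever the callback left the cursor.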
#define DEBUG_TYPE "dwarfdebug"		#define DEBUG_TYPE "dwarfdebug"

void DwarfExpression::emitConstu(uint64_t Value) {		void DwarfExpression::emitConstu(uint64_t Value) {
if (Value < 32)		if (Value < 32)
emitOp(dwarf::DW_OP_lit0 + Value);		emitOp(dwarf::DW_OP_lit0 + Value);
		jmorse: What about if the next opcode is stack_value, do we need to mask in that scenario?
		StephenTozer (author): I think so? To be honest, it looks to me like the only time we can use a subregister and don't apply a mask is when describing either a Register location (where we cannot use subreg masking, we use DW_OP_piece instead), or when describing a simple memory location, i.e. a single DW_OP_breg with 0 offset. Although the latter case seems like a valid argument for "we don't always need to mask the subregister", I suspect that it's actually the case that we just never use a subregister for those locations. The full size of a register for a given architecture will also generally be the size of a memory address, so it may just be assumed that we don't need to check for subreg masking in that case.
else if (Value == std::numeric_limits<uint64_t>::max()) {		else if (Value == std::numeric_limits<uint64_t>::max()) {
// Only do this for 64-bit values as the DWARF expression stack uses		// Only do this for 64-bit values as the DWARF expression stack uses
// target-address-size values.		// target-address-size values.
emitOp(dwarf::DW_OP_lit0);		emitOp(dwarf::DW_OP_lit0);
emitOp(dwarf::DW_OP_not);		emitOp(dwarf::DW_OP_not);
} else {		} else {
emitOp(dwarf::DW_OP_constu);		emitOp(dwarf::DW_OP_constu);
emitUnsigned(Value);		emitUnsigned(Value);
[258 lines not shown]	if (isEntryValue()) {
finalizeEntryValue();		finalizeEntryValue();

if (!isIndirect() && !isParameterValue() && !HasComplexExpression &&		if (!isIndirect() && !isParameterValue() && !HasComplexExpression &&
DwarfVersion >= 4)		DwarfVersion >= 4)
emitOp(dwarf::DW_OP_stack_value);		emitOp(dwarf::DW_OP_stack_value);
}		}

DwarfRegs.clear();		DwarfRegs.clear();
		// If we need to mask out a subregister, do it now, unless the next
		// operation would emit an OpPiece anyway.
		auto NextOp = ExprCursor.peek();
		if (SubRegisterSizeInBits && NextOp &&
		(NextOp->getOp() != dwarf::DW_OP_LLVM_fragment))
		maskSubRegister();
return true;		return true;
}		}

// Don't emit locations that cannot be expressed without DW_OP_stack_value.		// Don't emit locations that cannot be expressed without DW_OP_stack_value.
if (DwarfVersion < 4)		if (DwarfVersion < 4)
if (any_of(ExprCursor, [](DIExpression::ExprOperand Op) -> bool {		if (any_of(ExprCursor, [](DIExpression::ExprOperand Op) -> bool {
return Op.getOp() == dwarf::DW_OP_stack_value;		return Op.getOp() == dwarf::DW_OP_stack_value;
})) {		})) {
[36 lines not shown]	if (Op && Op->getOp() == dwarf::DW_OP_constu) {
}		}
}		}

if (FBReg)		if (FBReg)
addFBReg(SignedOffset);		addFBReg(SignedOffset);
else		else
addBReg(Reg.DwarfRegNo, SignedOffset);		addBReg(Reg.DwarfRegNo, SignedOffset);
DwarfRegs.clear();		DwarfRegs.clear();

		// If we need to mask out a subregister, do it now, unless the next
		// operation would emit an OpPiece anyway.
		auto NextOp = ExprCursor.peek();
		if (SubRegisterSizeInBits && NextOp &&
		(NextOp->getOp() != dwarf::DW_OP_LLVM_fragment))
		maskSubRegister();

return true;		return true;
}		}
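
As a rough illustration of what the sub-register masking in the hunk above is for: when a value lives in the low bits of a wider register, the expression has to discard the upper bits after reading the full register. The sketch below assumes the masking amounts to an AND with a low-bits mask (the body of maskSubRegister is not shown in this diff); `subRegisterMask` and `readSubRegister` are hypothetical names used only here.

```cpp
#include <cassert>
#include <cstdint>

// Mask that keeps only the low SubRegSizeInBits bits of a 64-bit value.
uint64_t subRegisterMask(unsigned SubRegSizeInBits) {
  assert(SubRegSizeInBits > 0 && SubRegSizeInBits < 64 &&
         "a full-width register needs no mask");
  return (uint64_t{1} << SubRegSizeInBits) - 1;
}

// What a consumer would compute after reading the full super-register and
// then applying the mask (read register; push mask; bitwise and).
uint64_t readSubRegister(uint64_t SuperRegValue, unsigned SubRegSizeInBits) {
  return SuperRegValue & subRegisterMask(SubRegSizeInBits);
}
```

For example, a 32-bit value held in a 64-bit register containing `0xDEADBEEFCAFEF00D` reads back as `0xCAFEF00D` once masked. Doing this immediately after each register is emitted, rather than once at the start of addExpression, is what lets every register operand of a list be masked independently.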

void DwarfExpression::setEntryValueFlags(const MachineLocation &Loc) {		void DwarfExpression::setEntryValueFlags(const MachineLocation &Loc) {
LocationFlags |= EntryValue;		LocationFlags |= EntryValue;
if (Loc.isIndirect())		if (Loc.isIndirect())
LocationFlags |= Indirect;		LocationFlags |= Indirect;
}		}
[81 lines not shown]	default:
return false;		return false;
}		}
}		}
return true;		return true;
}		}

void DwarfExpression::addExpression(DIExpressionCursor &&ExprCursor,		void DwarfExpression::addExpression(DIExpressionCursor &&ExprCursor,
unsigned FragmentOffsetInBits) {		unsigned FragmentOffsetInBits) {
		addExpression(std::move(ExprCursor),
		[](unsigned Idx, DIExpressionCursor &Cursor) -> bool {
		llvm_unreachable("unhandled opcode found in expression");
		});
		}

		void DwarfExpression::addExpression(
		DIExpressionCursor &&ExprCursor,
		std::function<bool(unsigned, DIExpressionCursor &)> InsertArg) {
// Entry values can currently only cover the initial register location,		// Entry values can currently only cover the initial register location,
// and not any other parts of the following DWARF expression.		// and not any other parts of the following DWARF expression.
assert(!IsEmittingEntryValue && "Can't emit entry value around expression");		assert(!IsEmittingEntryValue && "Can't emit entry value around expression");

// If we need to mask out a subregister, do it now, unless the next
// operation would emit an OpPiece anyway.
auto N = ExprCursor.peek();
if (SubRegisterSizeInBits && N && (N->getOp() != dwarf::DW_OP_LLVM_fragment))
maskSubRegister();

Optional<DIExpression::ExprOperand> PrevConvertOp = None;		Optional<DIExpression::ExprOperand> PrevConvertOp = None;

while (ExprCursor) {		while (ExprCursor) {
auto Op = ExprCursor.take();		auto Op = ExprCursor.take();
uint64_t OpNum = Op->getOp();		uint64_t OpNum = Op->getOp();

if (OpNum >= dwarf::DW_OP_reg0 && OpNum <= dwarf::DW_OP_reg31) {		if (OpNum >= dwarf::DW_OP_reg0 && OpNum <= dwarf::DW_OP_reg31) {
emitOp(OpNum);		emitOp(OpNum);
continue;		continue;
} else if (OpNum >= dwarf::DW_OP_breg0 && OpNum <= dwarf::DW_OP_breg31) {		} else if (OpNum >= dwarf::DW_OP_breg0 && OpNum <= dwarf::DW_OP_breg31) {
addBReg(OpNum - dwarf::DW_OP_breg0, Op->getArg(0));		addBReg(OpNum - dwarf::DW_OP_breg0, Op->getArg(0));
continue;		continue;
}		}

switch (OpNum) {		switch (OpNum) {
		case dwarf::DW_OP_LLVM_arg:
		if (!InsertArg(Op->getArg(0), ExprCursor)) {
		LocationKind = Unknown;
		return;
		}
		break;
case dwarf::DW_OP_LLVM_fragment: {		case dwarf::DW_OP_LLVM_fragment: {
unsigned SizeInBits = Op->getArg(1);		unsigned SizeInBits = Op->getArg(1);
unsigned FragmentOffset = Op->getArg(0);		unsigned FragmentOffset = Op->getArg(0);
// The fragment offset must have already been adjusted by emitting an		// The fragment offset must have already been adjusted by emitting an
// empty DW_OP_piece / DW_OP_bit_piece before we emitted the base		// empty DW_OP_piece / DW_OP_bit_piece before we emitted the base
// location.		// location.
assert(OffsetInBits >= FragmentOffset && "fragment offset not added?");		assert(OffsetInBits >= FragmentOffset && "fragment offset not added?");
assert(SizeInBits >= OffsetInBits - FragmentOffset && "size underflow");		assert(SizeInBits >= OffsetInBits - FragmentOffset && "size underflow");
▲ Show 20 Lines • Show All 191 Lines • Show Last 20 Lines
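
The callback mechanism in the hunk above can be illustrated with a small standalone sketch. It is not the patch's code: `ToyOp`, `ToyOperand`, `ToyInsertArg`, `emitToyExpression` and `makeToyInserter` are hypothetical names, the expression is a plain vector rather than a `DIExpressionCursor`, and output is collected as strings instead of DWARF bytes. The only behaviour it tries to mirror is that the walker delegates each argument reference to a caller-supplied callback and abandons the whole location when the callback returns false.

```cpp
#include <cassert>
#include <functional>
#include <string>
#include <vector>

// Toy stand-ins for DIExpression operations; these are not LLVM's types.
enum class ToyOp { Arg, Plus, StackValue };

struct ToyOperand {
  ToyOp Op;
  unsigned ArgIndex; // Only meaningful when Op == ToyOp::Arg.
};

// Hypothetical callback type: emit the location for debug operand N into Out.
using ToyInsertArg = std::function<bool(unsigned, std::vector<std::string> &)>;

// Appends a textual form of each operation in Expr to Out. For ToyOp::Arg the
// caller-supplied callback emits the location of the Nth debug operand. If the
// callback fails, the partial output is discarded and false is returned,
// loosely mirroring the early return in the DW_OP_LLVM_arg case.
bool emitToyExpression(const std::vector<ToyOperand> &Expr,
                       const ToyInsertArg &InsertArg,
                       std::vector<std::string> &Out) {
  assert(InsertArg && "callback must be callable");
  for (const ToyOperand &Operand : Expr) {
    switch (Operand.Op) {
    case ToyOp::Arg:
      if (!InsertArg(Operand.ArgIndex, Out)) {
        Out.clear();
        return false;
      }
      break;
    case ToyOp::Plus:
      Out.push_back("DW_OP_plus");
      break;
    case ToyOp::StackValue:
      Out.push_back("DW_OP_stack_value");
      break;
    }
  }
  return true;
}

// Builds a callback over a list of register names; an empty name models
// $noreg (an assumption of this sketch, not how LLVM represents it).
ToyInsertArg makeToyInserter(const std::vector<std::string> &Regs) {
  return [Regs](unsigned Idx, std::vector<std::string> &Out) -> bool {
    if (Idx >= Regs.size() || Regs[Idx].empty())
      return false;
    Out.push_back("DW_OP_breg(" + Regs[Idx] + ")");
    return true;
  };
}
```

With operands `{"RAX", "RDI"}` the sketch emits four operations for an `arg 0, arg 1, plus, stack_value` expression; with `{"RAX", ""}` it emits nothing, which is the shape of behaviour that test (8) in dbg_value_list_emission.mir checks for a `$noreg` operand.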

llvm/test/DebugInfo/X86/dbg_value_list_clobbers.mir

This file was added.

				# RUN: llc %s --start-after=livedebugvalues -filetype=obj -o - \
				# RUN: | llvm-dwarfdump - -name locala -o - | FileCheck %s
				#
Inline comment by jmorse: Does not call FileCheck
				# Test that clobbers between DBG_VALUE_LIST and DBG_VALUE instructions work as
				# expected. Comments and test directives inline.

				--- |
				target triple = "x86_64-unknown-linux-gnu"
				define dso_local i32 @fun() local_unnamed_addr !dbg !7 {
				entry:
				ret i32 0
				}

				!llvm.dbg.cu = !{!0}
				!llvm.module.flags = !{!3, !4, !5}
				!llvm.ident = !{!6}

				!0 = distinct !DICompileUnit(language: DW_LANG_C99, file: !1, producer: "clang version 11.0.0", isOptimized: true, runtimeVersion: 0, emissionKind: FullDebug, enums: !2, splitDebugInlining: false, nameTableKind: None)
				!1 = !DIFile(filename: "example.c", directory: "/")
				!2 = !{}
				!3 = !{i32 7, !"Dwarf Version", i32 4}
				!4 = !{i32 2, !"Debug Info Version", i32 3}
				!5 = !{i32 1, !"wchar_size", i32 4}
				!6 = !{!"clang version 11.0.0"}
				!8 = !DISubroutineType(types: !9)
				!9 = !{!10}
				!10 = !DIBasicType(name: "int", size: 32, encoding: DW_ATE_signed)
				!11 = !{!12}
				!22 = !DISubroutineType(types: !23)
				!23 = !{!10, !10}
				; --- Important metadata ---
				!7 = distinct !DISubprogram(name: "fun", scope: !1, file: !1, line: 2, type: !8, scopeLine: 2, flags: DIFlagAllCallsDescribed, spFlags: DISPFlagDefinition | DISPFlagOptimized, unit: !0, retainedNodes: !11)
				!15 = !DILocation(line: 1, column: 1, scope: !7)
				!12 = !DILocalVariable(name: "locala", scope: !7, file: !1, line: 1, type: !10)

				...
				---
				name: fun
				body: |
				bb.0.entry:
				; This test checks that we see expected location ranges for a single variable.
				; CHECK: {{.*}} DW_TAG_variable
Inline comment by Orlando: You can remove this XXX note.
				; CHECK-NEXT: DW_AT_location {{.*}}

				DBG_VALUE_LIST !12, !DIExpression(DW_OP_LLVM_arg, 0, DW_OP_stack_value), $eax, debug-location !15
				; CHECK-NEXT: [{{.*}}): DW_OP_breg0 RAX+0, DW_OP_constu 0xffffffff, DW_OP_and, DW_OP_stack_value

				$edi = MOV32ri 1
				DBG_VALUE_LIST !12, !DIExpression(DW_OP_LLVM_arg, 0, DW_OP_stack_value), $esi, debug-location !15
				; CHECK-NEXT: [{{.*}}): DW_OP_breg4 RSI+0, DW_OP_constu 0xffffffff, DW_OP_and, DW_OP_stack_value

				$eax = MOV32ri 2
				DBG_VALUE $eax, $noreg, !12, !DIExpression(), debug-location !15
				; CHECK-NEXT: [{{.*}}): DW_OP_reg0 RAX

				$ecx = MOV32ri 3
				DBG_VALUE_LIST !12, !DIExpression(DW_OP_LLVM_arg, 0, DW_OP_LLVM_arg, 1, DW_OP_plus, DW_OP_stack_value), $eax, $ecx, debug-location !15
				; CHECK-NEXT: [{{.*}}): DW_OP_breg0 RAX+0, DW_OP_constu 0xffffffff, DW_OP_and, DW_OP_breg2 RCX+0, DW_OP_constu 0xffffffff, DW_OP_and, DW_OP_plus, DW_OP_stack_value

				; Check that a reg clobber prevents identical locations merging.
				$ecx = MOV32ri 4
				$ecx = MOV32ri 5
				DBG_VALUE_LIST !12, !DIExpression(DW_OP_LLVM_arg, 0, DW_OP_LLVM_arg, 1, DW_OP_plus, DW_OP_stack_value), $eax, $ecx, debug-location !15
				; CHECK-NEXT: [{{.*}}): DW_OP_breg0 RAX+0, DW_OP_constu 0xffffffff, DW_OP_and, DW_OP_breg2 RCX+0, DW_OP_constu 0xffffffff, DW_OP_and, DW_OP_plus, DW_OP_stack_value

				; Check that fragments are composed correctly.
				$ecx = MOV32ri 6
				DBG_VALUE_LIST !12, !DIExpression(DW_OP_LLVM_arg, 0, DW_OP_stack_value, DW_OP_LLVM_fragment, 0, 16), $eax, debug-location !15
Inline comment by Orlando: I assume this no longer fails?
				DBG_VALUE_LIST !12, !DIExpression(DW_OP_LLVM_arg, 0, DW_OP_stack_value, DW_OP_LLVM_fragment, 16, 16), $ecx, debug-location !15
				; CHECK-NEXT: [{{.*}}): DW_OP_breg0 RAX+0, DW_OP_constu 0xffffffff, DW_OP_and, DW_OP_stack_value, DW_OP_piece 0x2, DW_OP_breg2 RCX+0, DW_OP_constu 0xffffffff, DW_OP_and, DW_OP_stack_value, DW_OP_piece 0x2

				; Check that fragments clobber preceding overlap.
				$edi = MOV32ri 7
				DBG_VALUE_LIST !12, !DIExpression(DW_OP_LLVM_arg, 0, DW_OP_stack_value, DW_OP_LLVM_fragment, 16, 16), $edi, debug-location !15
				; CHECK-NEXT: [{{.*}}): DW_OP_breg0 RAX+0, DW_OP_constu 0xffffffff, DW_OP_and, DW_OP_stack_value, DW_OP_piece 0x2, DW_OP_breg5 RDI+0, DW_OP_constu 0xffffffff, DW_OP_and, DW_OP_stack_value, DW_OP_piece 0x2

				; Check that a (non-zero-offset) fragment works.
				$ecx = MOV32ri 8
				$ecx = MOV32ri 9
				DBG_VALUE_LIST !12, !DIExpression(DW_OP_LLVM_arg, 0, DW_OP_LLVM_arg, 1, DW_OP_plus, DW_OP_stack_value, DW_OP_LLVM_fragment, 16, 16), $eax, $ecx, debug-location !15
				; CHECK-NEXT: [{{.*}}): DW_OP_breg0 RAX+0, DW_OP_constu 0xffffffff, DW_OP_and, DW_OP_stack_value, DW_OP_piece 0x2, DW_OP_breg0 RAX+0, DW_OP_constu 0xffffffff, DW_OP_and, DW_OP_breg2 RCX+0, DW_OP_constu 0xffffffff, DW_OP_and, DW_OP_plus, DW_OP_stack_value, DW_OP_piece 0x2

				RETQ debug-location !15
				...
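
Two pieces of arithmetic recur in the CHECK lines above: the `DW_OP_constu 0xffffffff, DW_OP_and` pair that follows each 32-bit sub-register read, and the `DW_OP_piece 0x2` that closes each 16-bit fragment. The sketch below shows one way those numbers can be derived. The helper names `toySubRegisterMask` and `toyPieceOp` are hypothetical, the output is decimal text rather than llvm-dwarfdump's hexadecimal form, and the code is an illustration under those assumptions rather than the emitter's actual implementation.

```cpp
#include <cassert>
#include <cstdint>
#include <string>

// Mask selecting the low SizeInBits bits of a 64-bit register value.
uint64_t toySubRegisterMask(unsigned SizeInBits) {
  assert(SizeInBits > 0 && SizeInBits <= 64 && "sub-register size out of range");
  if (SizeInBits == 64)
    return ~0ULL; // Avoid shifting by the full width, which is undefined.
  return (1ULL << SizeInBits) - 1ULL;
}

// Textual piece operation for a fragment of SizeInBits bits. Byte-sized
// fragments use DW_OP_piece with a size in bytes; anything else falls back to
// DW_OP_bit_piece with a size in bits and a zero bit offset.
std::string toyPieceOp(unsigned SizeInBits) {
  assert(SizeInBits > 0 && "empty fragment");
  if (SizeInBits % 8 != 0)
    return "DW_OP_bit_piece " + std::to_string(SizeInBits) + " 0";
  return "DW_OP_piece " + std::to_string(SizeInBits / 8);
}
```

For example, `toySubRegisterMask(32)` gives `0xffffffff`, matching the constant applied after reading `$eax` through RAX, and `toyPieceOp(16)` gives a two-byte piece, matching the `DW_OP_LLVM_fragment, 16, 16` cases.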

llvm/test/DebugInfo/X86/dbg_value_list_emission.mir

This file was added.

				# RUN: llc %s --start-after=livedebugvalues -filetype=obj -o - \
				# RUN: | llvm-dwarfdump - -name local* -regex \
				# RUN: | FileCheck %s
				#
				# Test that we produce correct DWARF from DBG_VALUE_LIST instructions.
				# Comments and test directives inline.
Inline comment by jmorse: Mega-nit: "good" suggests there's a subjective difference, could I suggest "correct"

				--- |
				target triple = "x86_64-unknown-linux-gnu"
				define dso_local i32 @fun() local_unnamed_addr !dbg !7 {
				entry:
				ret i32 0
				}

				!llvm.dbg.cu = !{!0}
				!llvm.module.flags = !{!3, !4, !5}
				!llvm.ident = !{!6}

				!0 = distinct !DICompileUnit(language: DW_LANG_C99, file: !1, producer: "clang version 11.0.0", isOptimized: true, runtimeVersion: 0, emissionKind: FullDebug, enums: !2, splitDebugInlining: false, nameTableKind: None)
				!1 = !DIFile(filename: "example.c", directory: "/")
				!2 = !{}
				!3 = !{i32 7, !"Dwarf Version", i32 4}
				!4 = !{i32 2, !"Debug Info Version", i32 3}
				!5 = !{i32 1, !"wchar_size", i32 4}
				!6 = !{!"clang version 11.0.0"}
				!8 = !DISubroutineType(types: !9)
				!9 = !{!10}
				!10 = !DIBasicType(name: "int", size: 32, encoding: DW_ATE_signed)
				!11 = !{!12, !13, !25}
				!22 = !DISubroutineType(types: !23)
				!23 = !{!10, !10}
				; --- Important metadata ---
				!7 = distinct !DISubprogram(name: "fun", scope: !1, file: !1, line: 2, type: !8, scopeLine: 2, flags: DIFlagAllCallsDescribed, spFlags: DISPFlagDefinition | DISPFlagOptimized, unit: !0, retainedNodes: !11)
				!15 = !DILocation(line: 1, column: 1, scope: !7)
				!12 = !DILocalVariable(name: "locala", scope: !7, file: !1, line: 1, type: !10)
				!13 = !DILocalVariable(name: "localb", scope: !7, file: !1, line: 2, type: !10)
				!25 = !DILocalVariable(name: "localc", scope: !7, file: !1, line: 3, type: !10)
				!26 = !DILocalVariable(name: "locald", scope: !7, file: !1, line: 4, type: !10)
				!27 = !DILocalVariable(name: "locale", scope: !7, file: !1, line: 5, type: !10)
				!28 = !DILocalVariable(name: "localf", scope: !7, file: !1, line: 6, type: !10)
				!29 = !DILocalVariable(name: "localg", scope: !7, file: !1, line: 6, type: !10)
				!30 = !DILocalVariable(name: "localh", scope: !7, file: !1, line: 6, type: !10)

				...
				---
				name: fun
				body: |
				bb.0.entry:
				; NOTE: By design, all DBG_VALUE_LIST instructions describe stack_value
				; locations, so they are always created with a DW_OP_stack_value op.
				;
				; (1) Check a single reg arg works.
				DBG_VALUE_LIST !12, !DIExpression(DW_OP_LLVM_arg, 0, DW_OP_stack_value), $eax, debug-location !15
				; CHECK: DW_TAG_variable
				; CHECK-NEXT: (DW_OP_breg0 RAX+0, DW_OP_constu 0xffffffff, DW_OP_and, DW_OP_stack_value)
				; CHECK-NEXT: DW_AT_name ("locala")
Inline comment by scott.linder:

Why does this behave differently than (what I understand to be) the equivalent `DBG_VALUE`?

    DBG_VALUE $eax, $noreg, !12, !DIExpression(DW_OP_stack_value), debug-location !15
    ; becomes: DW_AT_location (DW_OP_breg0 RAX+0, DW_OP_constu 0xffffffff, DW_OP_and, DW_OP_stack_value)

It seems like the `DW_OP_and` is there to select a subregister (I assume EAX), but oddly it comes after the value of the register is already read (i.e. after the DW_OP_breg). I'm lost on what the intended behavior is, and why it differs between `DBG_VALUE` and `DBG_VALUE_LIST`.

There is also the existing confusion around the "isIndirect" flag in `DBG_VALUE` which makes these two equivalent (and both seemingly wrong):

    DBG_VALUE $eax, $noreg, !25, !DIExpression(DW_OP_stack_value), debug-location !15
    ; becomes: DW_AT_location (DW_OP_breg0 RAX+0, DW_OP_constu 0xffffffff, DW_OP_and, DW_OP_stack_value)
    DBG_VALUE $eax, 0, !26, !DIExpression(DW_OP_stack_value), debug-location !15
    ; becomes: DW_AT_location (DW_OP_breg0 RAX+0, DW_OP_constu 0xffffffff, DW_OP_and, DW_OP_stack_value)

which makes it harder still to compare. Would it be more straightforward to always be explicit about indirection in the new form? Why does `DW_OP_stack_value` imply a `DW_OP_deref` at all? I.e. why do we not get:

    DBG_VALUE_LIST !12, !DIExpression(DW_OP_LLVM_arg, 0, DW_OP_stack_value), $eax, debug-location !15
    ; CHECK: DW_TAG_variable
    ; CHECK-NEXT: (DW_OP_reg RAX, DW_OP_stack_value)
    ; CHECK-NEXT: DW_AT_name ("locala")

which in this case I imagine would just be an error. I would expect the correct expression to generate the `DW_OP_breg` would be something like:

    DBG_VALUE_LIST !12, !DIExpression(DW_OP_LLVM_arg, 0, DW_OP_deref, DW_OP_stack_value), $eax, debug-location !15
    ; CHECK: DW_TAG_variable
    ; CHECK-NEXT: (DW_OP_breg0 RAX+0, DW_OP_stack_value)
    ; CHECK-NEXT: DW_AT_name ("locala")

If we don't do this, we seem to retain some of the same ambiguity that makes the old "isIndirect" field so confusing.
Inline comment by StephenTozer (author):

To the first point: I'm looking into it now; I noticed the `DW_OP_and` before, but I'm not sure where it's coming from myself yet - the `DBG_VALUE_LIST` handling should be following essentially the same code path as `DBG_VALUE`, so this is a bug one way or another.

To the second point, I think your examples are slightly incorrect: The `isIndirect` flag in `DBG_VALUE` is confusing and inconsistent, what it actually does is dependent on the DIExpression and not well explained. The `DBG_VALUE_LIST` implementation has no such inconsistencies however (I hope).

The problem with your examples is that I think you're using `DW_OP_reg` to mean a register's literal value, and `DW_OP_breg` to mean the address pointed to by a register. This isn't quite correct, although they do act like that for most variable locations. The actual meanings are slightly more complicated; the short answer is that `DW_OP_reg` is a Register location description: it refers to the register itself, not to the value of that register. `DW_OP_breg` on the other hand does refer to the literal value of a register; it's generally used with an offset as part of a Memory location expression, but if combined with `DW_OP_stack_value` then it gives the value in the register as the variable's value (albeit as an Implicit location rather than a Register location).

So with all of that said, the meaning of `(DW_OP_breg0 RAX+0, DW_OP_stack_value)` is that the variable's value can be found in `$rax`, but the variable should be read-only, which matches the meaning of the `DBG_VALUE_LIST`.
Inline comment by scott.linder:

> The isIndirect flag in DBG_VALUE is confusing and inconsistent, what it actually does is dependent on the DIExpression and not well explained.

Agreed, and my point is that a similar issue applies to the interpretation of `DW_OP_LLVM_arg` in your patch, even with `isIndirect` gone (modulo the `assert` mentioned below, which only side-steps the issue).

> The problem with your examples is that I think you're using DW_OP_reg to mean a register's literal value, and DW_OP_breg to mean the address pointed to by a register.

That wasn't my intention in the examples I gave; in fact in the unambiguous model we tried to extract from the DWARF spec (https://llvm.org/docs/AMDGPUDwarfExtensionsForHeterogeneousDebugging.html) we define the `DW_OP_reg` as pushing a register location description onto the stack, and `DW_OP_breg` as pushing a memory location description onto the stack. The interaction which makes the location description pushed onto the stack by `DW_OP_breg` behave like a value in some contexts (i.e. behave like the offset contents of the register) we begrudgingly capture with an implicit conversion to make our description backwards-compatible with DWARF 4 and 5, but even if you want to just define it as pushing a value onto the stack, at the very least the `breg` must represent reading the value of a `reg` register location.

My question is then: why does adding `DW_OP_stack_value` implicitly cause the value of the register to end up on the stack, with no intervening operation? I.e. why is this the case:

    ; Sure, seems reasonable: `DW_OP_LLVM_arg` when referring to a register describes the register itself, not the value of the register.
    DBG_VALUE_LIST !12, !DIExpression(DW_OP_LLVM_arg, 0), $eax, debug-location !11
    ; CHECK: DW_AT_location (DW_OP_reg0 RAX)

    ; This follows too: if you do want the value of the register, you can read it explicitly with e.g. `DW_OP_deref`. DWARF actually requires this be collapsed into something like `DW_OP_breg` or `DW_OP_regval_type`,
    ; as `DW_OP_reg RAX, DW_OP_deref` is not a valid location description of any kind. This is an artificial constraint of the standard, and in any consistent view of the spec the two forms would have to otherwise be equivalent.
    DBG_VALUE_LIST !13, !DIExpression(DW_OP_LLVM_arg, 0, DW_OP_deref), $eax, debug-location !11
    ; CHECK: DW_AT_location (DW_OP_breg0 RAX+0)

    ; Hmm, this doesn't seem right though: why are there now two indirections? Does `DW_OP_stack_value` imply one for some reason?
    DBG_VALUE_LIST !14, !DIExpression(DW_OP_LLVM_arg, 0, DW_OP_deref, DW_OP_stack_value), $eax, debug-location !11
    ; CHECK: DW_AT_location (DW_OP_breg0 RAX+0, DW_OP_deref, DW_OP_stack_value)

    ; If you actually want the singly-indirect output you have to omit the semantically consistent `DW_OP_deref`. I would have expected this to just be an invalid DIExpression:
    DBG_VALUE_LIST !15, !DIExpression(DW_OP_LLVM_arg, 0, DW_OP_stack_value), $eax, debug-location !11
    ; CHECK: DW_AT_location (DW_OP_breg0 RAX+0, DW_OP_stack_value)

Note that I removed the asserts in your patch which seem to artificially require `DW_OP_stack_value` for `DBG_VALUE_LIST`. I didn't understand the purpose of them before, but perhaps this issue is one reason for them to be present?

My fundamental argument is that this context-dependent interpretation of `DW_OP_LLVM_arg` is another source of confusion, just like `isIndirect`. I think this stems from the fact that DWARF as it is defined today is not general/composable enough to avoid this, but I don't think that should bleed into the internal representation used by LLVM: we can make a sensible choice up until we get into the DWARF backend, where certain expressions will have to be converted into a different form to be legal.

Instead, what we have now is a situation where adding operations to the expression changes fundamentally how you are supposed to interpret the `DW_OP_LLVM_arg`. This is a pre-existing shortcoming, just like `isIndirect`, so saddling you with the burden of correcting it doesn't seem reasonable, but I think it is important to discuss. This is also important in the context of replacing `DBG_VALUE` entirely, as the `assert` will obviously need to go away.
Inline comment by StephenTozer (author):

You are correct - currently, `DBG_VALUE_LIST` is consistent when we only represent expressions with a `DW_OP_stack_value`; there will need to be an additional flag to correctly represent the full set of DWARF expressions.

Of the examples you've given however, I would say that it is the first two that are incorrect, and the latter two that are correct. The expression `!DIExpression(DW_OP_LLVM_arg, 0)` should, in the absence of a flag declaring it to be a direct/register location, mean that the variable is at the address given by the first argument, so the correct DWARF translation would be `DW_AT_location (DW_OP_breg0 RAX+0)`. This topic was discussed on the mailing list a while back, starting around here, concluding that we need an extra flag (with a different semantic meaning to the current IsIndirect that avoids the inconsistencies) to accurately represent all DWARF expressions.

The reason why this hasn't been added as part of this patch is that this patch isn't replacing `DBG_VALUE` with `DBG_VALUE_LIST` yet; the only place where `DBG_VALUE_LIST` is used is in salvaging dbg.values, where it will necessarily use `DW_OP_stack_value`.

				; (2) Check multiple reg args work.
				DBG_VALUE_LIST !13, !DIExpression(DW_OP_LLVM_arg, 0, DW_OP_LLVM_arg, 1, DW_OP_plus, DW_OP_stack_value), $eax, $edi, debug-location !15
				; CHECK: DW_TAG_variable
				; CHECK-NEXT: (DW_OP_breg0 RAX+0, DW_OP_constu 0xffffffff, DW_OP_and, DW_OP_breg5 RDI+0, DW_OP_constu 0xffffffff, DW_OP_and, DW_OP_plus, DW_OP_stack_value)
				; CHECK-NEXT: DW_AT_name ("localb")

				; (3) Check that multiple references to one reg arg work.
Inline comment by Orlando:

Can remove this XXX note. As you mentioned offline, having multiple references to the same arg (i.e. multiple `DW_OP_LLVM_arg, 0` in the expr) is never a problem.

Though, slightly tangentially, I'm still a little unclear on what the final decision was on how to handle duplicate register arg operands. In D82363 you said 'always treat DBG_VALUE_LISTs as potentially having them'. Please could you explain a little further? (i.e. is it an error state, do we need to add extra checks when dealing with DBG_VALUE_LISTs etc).

Inline comment by StephenTozer (author):

It is not an error state, just a slightly more inconvenient form than one without duplicates. It requires some extra work in a few places (operating on a vector instead of a single pointer), but there is no reason for it to be invalid.
				DBG_VALUE_LIST !25, !DIExpression(DW_OP_LLVM_arg, 0, DW_OP_LLVM_arg, 0, DW_OP_minus, DW_OP_stack_value), $eax, debug-location !15
				; CHECK: DW_TAG_variable
				; CHECK-NEXT: (DW_OP_breg0 RAX+0, DW_OP_constu 0xffffffff, DW_OP_and, DW_OP_breg0 RAX+0, DW_OP_constu 0xffffffff, DW_OP_and, DW_OP_minus, DW_OP_stack_value)
				; CHECK-NEXT: DW_AT_name ("localc")

				; (4) Check constant and reg args work together.
				DBG_VALUE_LIST !26, !DIExpression(DW_OP_LLVM_arg, 0, DW_OP_LLVM_arg, 1, DW_OP_mul, DW_OP_stack_value), $eax, 5, debug-location !15
				; CHECK: DW_TAG_variable
				; CHECK-NEXT: (DW_OP_breg0 RAX+0, DW_OP_constu 0xffffffff, DW_OP_and, DW_OP_lit5, DW_OP_mul, DW_OP_stack_value)
				; CHECK-NEXT: DW_AT_name ("locald")

				; (5) Check that arg deref works.
				DBG_VALUE_LIST !27, !DIExpression(DW_OP_LLVM_arg, 0, DW_OP_deref, DW_OP_stack_value), $eax, debug-location !15
				; CHECK: DW_TAG_variable
				; CHECK-NEXT: (DW_OP_breg0 RAX+0, DW_OP_constu 0xffffffff, DW_OP_and, DW_OP_deref, DW_OP_stack_value)
				; CHECK-NEXT: DW_AT_name ("locale")

				; (6) Check that fragments work.
				DBG_VALUE_LIST !28, !DIExpression(DW_OP_LLVM_arg, 0, DW_OP_stack_value, DW_OP_LLVM_fragment, 0, 16), $eax, debug-location !15
				; CHECK: DW_TAG_variable
				; CHECK-NEXT: (DW_OP_breg0 RAX+0, DW_OP_constu 0xffffffff, DW_OP_and, DW_OP_stack_value, DW_OP_piece 0x2)
				; CHECK-NEXT: DW_AT_name ("localf")

				; (7) Check that constant register offsets are correctly folded.
				DBG_VALUE_LIST !29, !DIExpression(DW_OP_LLVM_arg, 0, DW_OP_plus_uconst, 5, DW_OP_LLVM_arg, 1, DW_OP_plus_uconst, 17, DW_OP_plus, DW_OP_stack_value), $eax, $edi, debug-location !15
				; CHECK: DW_TAG_variable
				; CHECK-NEXT: (DW_OP_breg0 RAX+5, DW_OP_constu 0xffffffff, DW_OP_and, DW_OP_breg5 RDI+17, DW_OP_constu 0xffffffff, DW_OP_and, DW_OP_plus, DW_OP_stack_value)
				; CHECK-NEXT: DW_AT_name ("localg")

				; (8) Check that a single $noreg location invalidates the entire entry.
				DBG_VALUE_LIST !30, !DIExpression(DW_OP_LLVM_arg, 0, DW_OP_LLVM_arg, 1, DW_OP_plus, DW_OP_stack_value), $eax, $noreg, debug-location !15
Inline comment by jmorse: Could I request a test that a DBG_VALUE_LIST with $noreg somewhere in it does not lead to a location-list entry -- I've been bitten by $noregs not terminating things in the past.
				; CHECK: DW_TAG_variable
				; CHECK-NEXT: DW_AT_name ("localh")
				; CHECK-NOT: DW_AT_location

				RETQ debug-location !15
				...
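
To make the expected expressions in the CHECK lines easier to read, here is a minimal stack-machine sketch that evaluates the handful of operations they use. It assumes the standard DWARF meanings: `DW_OP_breg<N>` pushes the contents of register N plus a signed offset, the binary operators pop two entries and push the result, and `DW_OP_stack_value` marks the top of the stack as the variable's value. `ToyKind`, `ToyDwarfOp` and `evalToyExpr` are hypothetical names, and the register file is a plain map keyed by DWARF register number.

```cpp
#include <cassert>
#include <cstdint>
#include <map>
#include <vector>

// The subset of operations that appears in the CHECK lines of this test.
enum class ToyKind { BReg, ConstU, And, Plus, Minus, Mul, StackValue };

struct ToyDwarfOp {
  ToyKind K;
  uint64_t Value; // Register number for BReg, constant for ConstU.
  int64_t Offset; // Signed offset for BReg; unused otherwise.
};

// Evaluates Ops against Regs and returns the value left on top of the stack.
uint64_t evalToyExpr(const std::vector<ToyDwarfOp> &Ops,
                     const std::map<unsigned, uint64_t> &Regs) {
  std::vector<uint64_t> Stack;
  for (const ToyDwarfOp &O : Ops) {
    switch (O.K) {
    case ToyKind::BReg:
      // Push the register's contents plus the offset (wrapping arithmetic).
      Stack.push_back(Regs.at(static_cast<unsigned>(O.Value)) +
                      static_cast<uint64_t>(O.Offset));
      break;
    case ToyKind::ConstU:
      Stack.push_back(O.Value);
      break;
    case ToyKind::StackValue:
      // The top of the stack is the variable's value, not an address.
      break;
    case ToyKind::And:
    case ToyKind::Plus:
    case ToyKind::Minus:
    case ToyKind::Mul: {
      assert(Stack.size() >= 2 && "binary operation needs two operands");
      uint64_t Rhs = Stack.back();
      Stack.pop_back();
      uint64_t Lhs = Stack.back();
      Stack.pop_back();
      uint64_t Result = 0;
      if (O.K == ToyKind::And)
        Result = Lhs & Rhs;
      else if (O.K == ToyKind::Plus)
        Result = Lhs + Rhs;
      else if (O.K == ToyKind::Minus)
        Result = Lhs - Rhs; // Second entry minus former top of stack.
      else
        Result = Lhs * Rhs;
      Stack.push_back(Result);
      break;
    }
    }
  }
  assert(!Stack.empty() && "expression produced no value");
  return Stack.back();
}
```

Feeding it the expression expected for `localg` with RAX holding `0x100000001` and RDI holding `3` yields `((RAX + 5) & 0xffffffff) + ((RDI + 17) & 0xffffffff)`, i.e. 26, which shows the offset being applied before the sub-register mask in that output.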


Diff 329633
