This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/
-
include/llvm/CodeGen/
-
llvm/
-
CodeGen/
1/2
MachineInstr.h
-
lib/CodeGen/
-
CodeGen/
2/3
CodeGenPrepare.cpp
-
test/
-
DebugInfo/
-
COFF/
-
register-variables.ll
-
NVPTX/
-
debug-info.ll
-
X86/
-
DW_AT_location-reference.ll
-
PR37234.ll
-
codegenprep-addrsink.ll
-
tools/llvm-locstats/
-
llvm-locstats/
-
locstats.ll

Differential D58453

[DebugInfo][CGP] Limit placeDbgValues movement of dbg.value intrinsics
ClosedPublic

Authored by jmorse on Feb 20 2019, 8:22 AM.

Download Raw Diff

Details

Reviewers

aprantl
bjope
vsk

Commits

rG00e238896cd8: [DebugInfo] Nerf placeDbgValues, with prejudice

Summary

This patch limits the activity of CodeGenPrepare::placeDbgValues, which originally helped preserve variable debug locations by placing them near Value definitions. Unfortunately, this now causes more problems than it solves -- see PRs 31878, 35194, 38754. One example problem is that conditional variable assignments can be hoisted and appear to debuggers unconditionally. This patch only allows dbg.value movement when the dbg.value would otherwise be a use-before-def: this is temporarily necessary because the rest of CodeGenPrepare makes no effort to put dbg.values in the right place. Eventually the whole placeDbgValues function should be deleted.

Here are some statistics from a clang-3.4 build with various options twiddled. Note that these were built with the other patches I've uploaded (which I'll add as dependencies) that soften the blow a bit. 'normal' means placeDbgValues is enabled, 'limited' is with this patch, 'deleted' is placeDbgValues completely removed.

normal placeDbgValues:  77.6% variable locations, 45.9% scope bytes
limited placeDbgValues: 76.1% variable locations, 46.4% scope bytes
deleted placeDbgValues: 75.8% variable locations, 45.8% scope bytes

Clearly there's a big decrease (1.8%) of variable coverage happening, and temporarily allowing non-dominated dbg.values to be moved doesn't make a large difference. I've now reached the point where I can't find other (CodeGen) bugs that make a serious impact on these statistics. My current belief, based on variable locations I've traced, is:

That coverage drop is almost entirely caused by DBG_VALUEs whose operands have gone out of liveness, and
LLVM previously hasn't paid this liveness penalty, because it's always moved dbg.values to be immediately after definitions.

I believe we *should* IMHO take this coverage hit. We only avoid it right now by cheating and forcefully bringing variables back into liveness, often with stupid effects.

An example of liveness misery is the "XRayThreshold" variable in this [0] function, ignoring the constant assignment on line 150. XRayThreshold gets a value assigned to it by the getAsInteger call on line 154. That function gets inlined and XRayThresholds use (to test early return) gets hoisted, the effect of which is that XRayThreshold has a location for two instructions, neither of which get a line number in the body of [0], and thus the variable is never visible. With placeDbgValues enabled we move the dbg.value for XRayThreshold so that it covers those two instructions -- with placeDbgValues disabled the calculation for XRayThreshold is dead by the time we reach its dbg.value. IMHO the location determined by placeDbgValues isn't useful to the developer, and skews statistics.

Another case I previously reported is [1] -- here the 'var' variable assignment has its dbg.value hoisted by placeDbgValues to above the switch, somewhere it hasn't been declared yet, and its location gets lost by the time it reaches the relevant portion of the switch statement in [1].

In summary, while the variable coverage statistic is going down here, what we don't measure (and possibly can't) is whether the variable locations are defined over instructions in the program that make sense. Which placeDbgValues actively damages IMO.

On to the three changing tests:

register-variables.ll: The first three deleted DEBUG_VALUE lines are caused by placeDbgValues hoisting locations to places they originally weren't, they're absent with this patch, and other DEBUG_VALUE lines covers the correct behaviour. The deleted 'DEBUG_VALUE' for 'b' drops due to the branch-folding pass (which it previously avoided through placeDbgValues hoisting). The DEBUG_VALUE line for 'c' has sunk back to its original (correct) location.
NVPTX test: a DBG_VALUE $noreg shifts one instruction down in this test, renumbering all the labels.
PR37234.ll: another case of hoisting over a block.

This patch effectively invalidates the behaviour of MachineInstr::collectDebugValues because DBG_VALUEs no longer come immediately after the vreg definition they refer to. The patches this one depends on kill off most of the users, but in the WebAssembly backend some target code calls collectDebugValues. I know nothing of WebAssembly -- I suspect this means mailing the backend owner, as it's not clear what their use case is.

I haven't added a test for the absence of this behaviour -- it doesn't strike me as something that anyone would think to re-introduce.

[0] https://github.com/llvm-mirror/llvm/blob/5926c6d6fe85f6623d2d14f473873bc72c72d482/lib/CodeGen/XRayInstrumentation.cpp#L154
[1] https://github.com/llvm-mirror/clang/blob/aa528ab4a083268edddd038d0f251afe792cfa36/lib/CodeGen/CGBlocks.cpp#L1858

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

jmorse created this revision.Feb 20 2019, 8:22 AM

Herald added a project: Restricted Project. · View Herald TranscriptFeb 20 2019, 8:22 AM

Herald added subscribers: llvm-commits, jdoerfert, sunfish and 3 others. · View Herald Transcript

jmorse added parent revisions: D58450: [DebugInfo][MachineCSE] Don't try to copy-propagate debuginfo for every COPY seen, D58403: [DebugInfo][CGP] Update dbg.values when updating memory address computations, D58238: [DebugInfo] MachineSink: Insert undef DBG_VALUEs when sinking instructions, try to forward copies, D58386: [DebugInfo] Pre-RA MachineSink: sink DBG_VALUEs that don't immediately follow the sunk instruction too, D56151: [DebugInfo] PR40010: Avoid register coalesing altering DBG_VALUE valuations.Feb 20 2019, 8:23 AM

gbedwell added a subscriber: gbedwell.Feb 20 2019, 8:40 AM

yurydelendik added a subscriber: yurydelendik.Feb 21 2019, 7:47 AM

aprantl added inline comments.Feb 21 2019, 8:24 AM

lib/CodeGen/CodeGenPrepare.cpp
6937 ↗	(On Diff #187587)	Please add a comment that explains why

There's a lot going on in the explanation, and partly I'm handicapped by lack of familiarity with the picky details of LLVM's location-list construction. Given all the attention to where dbg.value instructions are placed, it seems like placement is a significant factor in that construction. Intuitively, placing the dbg.value near the instruction that defines the value (assuming there is such an instruction) allows the location-list to better describe the instruction stream that is actually emitted. And, that's really what I want from a location list.
I would caution against focusing too much on the coverage statistics. As you say, what we really want to know is "whether the variable locations are defined over instructions in the program that make sense" and the statistics don't tell us that.
So, what instructions "make sense" in a location list? Well, the ones where the variable has an identifiable machine location or constant value. If optimization causes a value to be defined "before" the variable is declared, that does not really worry me. The goal here is to produce a correct description of the instructions-actually-emitted.
"One example problem is that conditional variable assignments can be hoisted and appear to debuggers unconditionally." Are the values actually defined unconditionally? Then it is correct to describe them that way to the debugger. I assume this is referring to PR38754. If the instructions to compute the value are hoisted, my assertion is that the location-list should describe the variable with that value when it actually occurs.
In my view, the real problem as stated in the PR appears to be that the value later gets *lost*, sadly before the instructions directly related to the switch-case where the variable is used; the real problem is *not* that the value's description starts too soon.

So given all that, how does this patch help address the real problem?

bjope added inline comments.Feb 21 2019, 3:30 PM

lib/CodeGen/CodeGenPrepare.cpp
6931 ↗	(On Diff #187587)	I think that the special handling of alloca can be removed now (either the alloca dominates the DVI and everyhing is fine, or is actually is possible that the DVI dominates the alloca, then I think we can move it). Although, no need to make such a change in this patch if you think it is risky/unrelated.
6937 ↗	(On Diff #187587)	I think this explanation can be added by updating the description of this function. Mentioning that the purpose is to resolve use-before-def problems (which really should be seen as bugs in this, as well as other, passes): "This function only moves (sinks) DbgValueInst:s when there is a use-before-def situation."

Update comments in placeDbgValues to reflect its new purpose; remove code calculating the previous non-debug-instr, as we now use a dominator tree check instead.

jmorse marked 5 inline comments as done.Feb 22 2019, 3:55 AM

jmorse added inline comments.

lib/CodeGen/CodeGenPrepare.cpp
6931 ↗	(On Diff #187587)	I agree and have removed it -- it's redundant and misleading to a future reader to be calculating the "previous non debug instruction" when there's a dominator tree check happening.
6937 ↗	(On Diff #187587)	Updated the function comment; I threw a "FIXME" in too indicating that placeDbgValues is a concession, rather than design feature.

In D58453#1406020, @probinson wrote:

In my view, the real problem as stated in the PR appears to be that the value later gets *lost*, sadly before the instructions directly related to the switch-case where the variable is used; the real problem is *not* that the value's description starts too soon.

So given all that, how does this patch help address the real problem?

Ah, to untangle the True (TM) problem I should illustrate some IR that corresponds to the problem in PR38754:

someblock:
  %foo = add i32 %bar, %bar
  br i1 %somecond, label %qux, label %quux
qux:
  %xyzzy = sub i32 %foo, 12
  call void @llvm.dbg.value(metadata i32 %xyzzy, [...])
  br label %quux

In this code, if %xyzzy is optimised out then the 'sub' instruction is deleted, and its operation folded into the dbg.value intrinsics DIExpression, something like:

call void @llvm.dbg.value(metadata i32 %foo, metadata !1, metadata !DIExpression(DW_OP_constu, 12, DW_OP_minus, DW_OP_stack_value))

Which is good. However the dbg.values operand is now %foo rather than %xyzzy, and placeDbgValues moves the dbg.value call to immediately after its operands definition, moving it up past a conditional branch to the 'someblock' block, giving us:

someblock:
  %foo = add i32 %bar, %bar
  call void @llvm.dbg.value(metadata i32 %foo, [...])
  br i1 %somecond, label %qux, label %quux

I'm not sure what the correct nomenclature is at this point, but the variable location that the dbg.value is defining now appears on paths through the program where it did not before, which in PR38754 makes a variable appear in the debugger to take on a value that isn't actually computed. Many different code hoisting / sinking / CSE passes can trigger similar behaviour.

So, what instructions "make sense" in a location list? Well, the ones where the variable has an identifiable machine location or constant value. If optimization causes a value to be defined "before" the variable is declared, that does not really worry me. The goal here is to produce a correct description of the instructions-actually-emitted.
"One example problem is that conditional variable assignments can be hoisted and appear to debuggers unconditionally." Are the values actually defined unconditionally? Then it is correct to describe them that way to the debugger. I assume this is referring to PR38754. If the instructions to compute the value are hoisted, my assertion is that the location-list should describe the variable with that value when it actually occurs.

I think the key phrase here is "when it actually occurs": if in a program:

Down two code paths the same value is computed but assigned to different variables, and
The computation is hoisted to a parent block,

Should both variables appear to hold that computed value once it's defined in the parent block?

(NB, I'm aware that many people approach this with a different language to me, i.e. in terms of variable locations rather than values, so it's entirely possible that there's some higher level concept I'm missing).

placeDbgValues has been bugging me (and probably others) for some time. So I'm happy with this patch.

Haven't done any recent tests myself, but hopefully we have solved most problems that has blocked this change by now. I know that @jmorse has done a splendid job in hunting down and fixing such problems.
As this patch is a compromising solution, we still sink the dbg.value when having use-before-def, the patch seem reasonable to me at this point in time. It might unveil a few not yet detected problems, but those would probably be hard to find unless we do this.
One idea is to have a command-line option to make it easier to revert to the old behavior (if we find some bigger problems), but I do not think it is worth the trouble.

So this LGTM now. But it looks like @probinson had some questions, so I'm not setting "ready to land" right now.

Hmmm. It appears that you are detaching the notion of here-the-value-is-computed from here-the-value-becomes-associated-with-this-variable. Usually these are concurrent in an "assignment" to the variable.
Separating these concerns means that if a value computation is hoisted, it might not be appropriate to hoist the association with the variable. That is, the "assignment" effect maybe should stay where it was, even if there's no specific instruction left in that position that implements that assignment.
I could see this situation arising in a variety of circumstances. CSE, various sorts of value-propagation (constant or not), passing a value to an inlined function parameter, yada yada.

So, this patch says, as long as the association (dbg.value) is dominated by the computation, assume the association is in the right place and don't move it. Dominance is certainly the correct prerequisite for dbg.value, so in that respect the patch seems reasonable, given the separation of concerns I mentioned previously. But...

I suspect there are a bunch of places in LLVM that encode knowledge of the "value computation is assignment" paradigm, which you are apparently trying to get rid of, and I suppose the other patches you have up for review are all along this same line. Rooting out all those places will take some doing. I am also somewhat concerned that you can leave a dbg.value lying around with no direct relationship to the rest of the block it's in, which makes the effect of other optimizations potentially vague and ill-defined.
Was there any llvm-dev discussion about this paradigm shift? I have been very distracted the past couple of months so it might have happened without my noticing or really paying attention to it. While I think it is probably a net positive in the evolution of debug info for optimized code, I do want to make sure the Usual Suspects are all attuned to it and supportive. It would also be well worthwhile putting together some words for the description of dbg.value in LangRef about this (which does not have to be part of this patch).

Hi Paul,

In D58453#1409640, @probinson wrote:

Hmmm. It appears that you are detaching the notion of here-the-value-is-computed from here-the-value-becomes-associated-with-this-variable. Usually these are concurrent in an "assignment" to the variable.
Separating these concerns means that if a value computation is hoisted, it might not be appropriate to hoist the association with the variable. That is, the "assignment" effect maybe should stay where it was, even if there's no specific instruction left in that position that implements that assignment.
I could see this situation arising in a variety of circumstances. CSE, various sorts of value-propagation (constant or not), passing a value to an inlined function parameter, yada yada.

So, this patch says, as long as the association (dbg.value) is dominated by the computation, assume the association is in the right place and don't move it. Dominance is certainly the correct prerequisite for dbg.value, so in that respect the patch seems reasonable, given the separation of concerns I mentioned previously. But...

I suspect there are a bunch of places in LLVM that encode knowledge of the "value computation is assignment" paradigm, which you are apparently trying to get rid of, and I suppose the other patches you have up for review are all along this same line. Rooting out all those places will take some doing. I am also somewhat concerned that you can leave a dbg.value lying around with no direct relationship to the rest of the block it's in, which makes the effect of other optimizations potentially vague and ill-defined.

Absolutely true -- and constant-valued dbg.values (including undef, technically) are frequently the source of bug reports (ex: PR40795) as their lifetime isn't associated with anything else in the IR (see also PR40628).

Was there any llvm-dev discussion about this paradigm shift? I have been very distracted the past couple of months so it might have happened without my noticing or really paying attention to it. While I think it is probably a net positive in the evolution of debug info for optimized code, I do want to make sure the Usual Suspects are all attuned to it and supportive. It would also be well worthwhile putting together some words for the description of dbg.value in LangRef about this (which does not have to be part of this patch).

Zero discussion sorry, I wasn't aware this is a change in interpretation -- however I'm also not aware of any Formal (TM) definition of the semantics of dbg.value intrinsics, particularly with respect to their location in the IR. I've assumed/inferred this is how it's supposed to work from existing code and review feedback.

Further discussion can't hurt (he said boldly) and getting some meaning written down in the langref would help everyone. I'll draft something and kick that off shortly.

In D58453#1409691, @jmorse wrote:

Absolutely true -- and constant-valued dbg.values (including undef, technically) are frequently the source of bug reports (ex: PR40795) as their lifetime isn't associated with anything else in the IR (see also PR40628).

That's disheartening. Reading the PR it's apparent that the location list construction is failing to implement the dataflow computations required to handle control flow correctly. On the bright side, if we *do* implement them, I suspect it will solve a bunch of issues all at once.

Was there any llvm-dev discussion about this paradigm shift? I have been very distracted the past couple of months so it might have happened without my noticing or really paying attention to it. While I think it is probably a net positive in the evolution of debug info for optimized code, I do want to make sure the Usual Suspects are all attuned to it and supportive. It would also be well worthwhile putting together some words for the description of dbg.value in LangRef about this (which does not have to be part of this patch).

Zero discussion sorry, I wasn't aware this is a change in interpretation -- however I'm also not aware of any Formal (TM) definition of the semantics of dbg.value intrinsics, particularly with respect to their location in the IR. I've assumed/inferred this is how it's supposed to work from existing code and review feedback.

Further discussion can't hurt (he said boldly) and getting some meaning written down in the langref would help everyone. I'll draft something and kick that off shortly.

https://llvm.org/docs/SourceLevelDebugging.html#format-common-intrinsics
"This intrinsic provides information when a user source variable is set to a new value." Naively I read this "when" as meaning, the position of the intrinsic in the instruction stream indicates the starting point where the variable takes on the new value. But making that clearer would be a Good Thing.
And as various reviews and whatnot fly by, I'm inclined to think it's at least a common misperception that the dbg.value has to be adjacent to the instruction that produces the value operand. Stating that it's not required would surely help.

It is definitely about time that we also refresh the documentation. One thing that is lacking is documentation of the backend parts (currently https://llvm.org/docs/SourceLevelDebugging.html talks about intrinsics like dbg.value etc, but shouldn't it also cover DBG_VALUE pseudo instruction). Afaict dbg.value and DBG_VALUE aren't exactly the same. And I think even DBG_VALUE might have some different properties depending on where in the llc pipeline we are (e.g. if the location only is valid up to the end of the BB or not).

In D58453#1409964, @probinson wrote:

In D58453#1409691, @jmorse wrote:

https://llvm.org/docs/SourceLevelDebugging.html#format-common-intrinsics
"This intrinsic provides information when a user source variable is set to a new value." Naively I read this "when" as meaning, the position of the intrinsic in the instruction stream indicates the starting point where the variable takes on the new value. But making that clearer would be a Good Thing.

That is indeed the intended reading of that sentence. Perhaps we should include an example here that uses a dbg.value with a constant to start a new range and an undef to end it to really drive the point home.

And as various reviews and whatnot fly by, I'm inclined to think it's at least a common misperception that the dbg.value has to be adjacent to the instruction that produces the value operand. Stating that it's not required would surely help.

Agreed!

jmorse mentioned this in D58726: [DebugInfo][Docs] Explicitly document how dbg.value intrinsics are interpreted in optimized code.Feb 27 2019, 9:45 AM

I've generated a short summary in https://reviews.llvm.org/D58726 which illustrates how I believe dbg.values are to be interpreted -- I'll chuck this at llvm-dev tomorrow if it's not obviously broken. As mentioned in the summary, it's likely that more could be said, but getting agreement on the location of a dbg.value being the location that an assignment ``happens'' would be good.

What is the status regarding this patch? Are we waiting for something?

In D58453#1439722, @bjope wrote:

What is the status regarding this patch? Are we waiting for something?

Good question -- I was going to gently start pinging the dependencies (one of them needs a revision, but the rest not) to get things in.

There's no technical reason why that's necessary though: given that we appear to agree on the meaning of the dbg.value intrinsics as per D58726, this could land right now. It's not like LLVM is branching any time soon.

In D58453#1441146, @jmorse wrote:

this could land right now.

(... assuming it's approved.)

jmorse mentioned this in D56151: [DebugInfo] PR40010: Avoid register coalesing altering DBG_VALUE valuations.Apr 26 2019, 3:58 AM

Freshen up patch to make it merge cleanly; check that DVI isn't null... not clear whether something has changed or if I uploaded a completely broken patch before.

The high-level plan (limit the amount of movement done in placeDbgValues, check for any fallout, and (hopefully) eventually eliminate all of it) sgtm.

lib/CodeGen/CodeGenPrepare.cpp
7126 ↗	(On Diff #205851)	Use CGP::getDT()?
7135 ↗	(On Diff #205851)	Early exit on !VI for clarity?

@yurydelendik Would this change be OK for WebAssembly debug value handling?

In D58453#1563748, @aheejin wrote:

@yurydelendik Would this change be OK for WebAssembly debug value handling?

The change will be OK: we rely on collectDebugValues in WebAssembly to locate instruction's debug values. And we worry only about these atm.

Hi,

In D58453#1564358, @yurydelendik wrote:

The change will be OK: we rely on collectDebugValues in WebAssembly to locate instruction's debug values. And we worry only about these atm.

Awkwardly collectDebugValues relies on a behaviour that this patch will mostly undo, specifically, that DBG_VALUE insts are placed after the defining instruction. collectDebugValues will work sometimes, but will "miss" DBG_VALUEs much more often that it does now.

We can tailor something to suit WebAssemblys needs: would it be OK if collectDebugValues returned _all_ DBG_VALUE uses of the virtual register it's called on/with? That would include, for example, debug uses from different basic blocks, which doesn't occur with the current collectDebugValues implementation.

In D58453#1564940, @jmorse wrote:

Hi,

Awkwardly collectDebugValues relies on a behaviour that this patch will mostly undo, specifically, that DBG_VALUE insts are placed after the defining instruction. collectDebugValues will work sometimes, but will "miss" DBG_VALUEs much more often that it does now.

I was counting on "collectDebugValues will work sometimes" part and still hoping it will perform with some degree to "finding" most of the related DBG_VALUEs in the current block.

We can tailor something to suit WebAssemblys needs: would it be OK if collectDebugValues returned _all_ DBG_VALUE uses of the virtual register it's called on/with? That would include, for example, debug uses from different basic blocks, which doesn't occur with the current collectDebugValues implementation.

Agree.

I've added some changes to the patch that makes collectDebugValues behave in the way I described, returning all debug users of a VReg, optionally limited to only the basic block of the source MachineInst. This should be useful for testing; and it's immediately caused the test/DebugInfo/WebAssembly/dbg-value-move-reg-stackify.mir [0] test to fail. The output is:

%0:i64 = ARGUMENT_i64 0, implicit $arguments
dead %1:i32 = I32_WRAP_I64 %0, implicit-def dead $arguments
dead %1:i32 = CALL_i32 @bar, implicit-def dead $arguments, implicit $sp32, implicit $sp64
%2:i32 = CALL_i32 @bar, implicit-def dead $arguments, implicit $sp32, implicit $sp64, implicit-def $value_stack, implicit $value_stack
DBG_VALUE %2, $noreg, <0x5490d00>, !DIExpression(), debug-location !DILocation(line: 359, column: 12, scope: <0x548f5f0>)
DBG_VALUE %2, $noreg, <0x5490cb0>, !DIExpression(), debug-location !DILocation(line: 358, column: 12, scope: <0x548f5f0>)
DBG_VALUE %2, $noreg, <0x5490a50>, !DIExpression(), debug-location !DILocation(line: 357, column: 12, scope: <0x548f5f0>)
CALL_VOID @foo, %2, implicit-def dead $arguments, implicit $sp32, implicit $sp64, implicit-def $value_stack, implicit $value_stack
RETURN_VOID implicit-def dead $arguments

While I don't know webassembly, it seems clear that a) the test post-SSA, and b) the test expects collectDebugValues to only return DBG_VALUEs that refer to the same vreg def that the source instruction does.

Is that a fair assessment, or some kind of test artefact? That behaviour is almost certainly achievable after this patch lands, although it might require more plumbing somewhere.

[0] https://github.com/llvm/llvm-project/blob/a1d97a960e622ee21550d92809512cb0870be499/llvm/test/DebugInfo/WebAssembly/dbg-value-move-reg-stackify.mir

In D58453#1582934, @jmorse wrote:
I've added some changes to the patch that makes collectDebugValues behave in the way I described, returning all debug users of a VReg, optionally limited to only the basic block of the source MachineInst. This should be useful for testing; and it's immediately caused the test/DebugInfo/WebAssembly/dbg-value-move-reg-stackify.mir [0] test to fail. The output is:
%0:i64 = ARGUMENT_i64 0, implicit $arguments
dead %1:i32 = I32_WRAP_I64 %0, implicit-def dead $arguments
dead %1:i32 = CALL_i32 @bar, implicit-def dead $arguments, implicit $sp32, implicit $sp64
%2:i32 = CALL_i32 @bar, implicit-def dead $arguments, implicit $sp32, implicit $sp64, implicit-def $value_stack, implicit $value_stack
DBG_VALUE %2, $noreg, <0x5490d00>, !DIExpression(), debug-location !DILocation(line: 359, column: 12, scope: <0x548f5f0>)
DBG_VALUE %2, $noreg, <0x5490cb0>, !DIExpression(), debug-location !DILocation(line: 358, column: 12, scope: <0x548f5f0>)
DBG_VALUE %2, $noreg, <0x5490a50>, !DIExpression(), debug-location !DILocation(line: 357, column: 12, scope: <0x548f5f0>)
CALL_VOID @foo, %2, implicit-def dead $arguments, implicit $sp32, implicit $sp64, implicit-def $value_stack, implicit $value_stack
RETURN_VOID implicit-def dead $arguments
While I don't know webassembly, it seems clear that a) the test post-SSA, and b) the test expects collectDebugValues to only return DBG_VALUEs that refer to the same vreg def that the source instruction does.

Is that a fair assessment, or some kind of test artefact? That behaviour is almost certainly achievable after this patch lands, although it might require more plumbing somewhere.

[0] https://github.com/llvm/llvm-project/blob/a1d97a960e622ee21550d92809512cb0870be499/llvm/test/DebugInfo/WebAssembly/dbg-value-move-reg-stackify.mir

Looks like I misunderstood meaning of *all*. I assumed that means DBG_VALUEs the refer the vreg def of the source instruction, excluding DBG_VALUEs that refer the same vreg def by the above instructions (in the same block). The test result above shows that collectDebugValues locates all DBG_VALUEs for the instruction even if it does not (re)define the value, but there are few instructions in the same block above that do. Is this the intended effect of this patch? If yes, we probably need to create more robust collectDebugValues-like method that will track lifetime of DBG_VALUEs.

jmorse mentioned this in D64630: [DebugInfo] Address performance regression with r364515.Jul 12 2019, 1:01 PM

! In D58453#1582991, @yurydelendik wrote:
Looks like I misunderstood meaning of *all*. I assumed that means DBG_VALUEs the refer the vreg def of the source instruction, excluding DBG_VALUEs that refer the same vreg def by the above instructions (in the same block). The test result above shows that collectDebugValues locates all DBG_VALUEs for the instruction even if it does not (re)define the value, but there are few instructions in the same block above that do. Is this the intended effect of this patch? If yes, we probably need to create more robust collectDebugValues-like method that will track lifetime of DBG_VALUEs.

Indeed, that's the intention. I guess a clearer way of saying this would have been that collectDebugValues will be much less useful after leaving SSA form, because from that point onwards one must consider register definitions, rather than just the register itself, to know what a DBG_VALUE refers to. While this is a pain, the behaviour the rest of this patch is trying to eliminate is destroying a lot of variable location information already.

I've had a quick look at the wasm-reg-stackify pass, and to me it looks like:

It's currently performed on a per-block basis
Domination and LiveInterval analyses are already available

Which means that the WebAssemblyDebugValueManager constructor could just apply an additional filter on collected DbgValues, that the LiveInterval/ValueNumber the DBG_VALUE refers to is the defining instruction.

One problem with that implementation would be that debug intrinsics outside the current basic block may still refer to a vreg that wasm-reg-stackify operates on, a situation that placeDbgValues currently masks. I don't know enough about wasm to know if this is a serious issue.

In D58453#1585436, @jmorse wrote:

the WebAssemblyDebugValueManager constructor could just apply an additional filter on collected DbgValues, that the LiveInterval/ValueNumber the DBG_VALUE refers to is the defining instruction.

Sounds good. That was the one reason for existence of WebAssemblyDebugValueManager. I would like not to change the test case, but fix the WebAssemblyDebugValueManager's logic here or in a different patch. Let me know if I need to follow up if the latter.

Great news everyone, it's placeDbgValues time again!

I've hopefully fixed a bunch of performance regressions with what were the dependencies of this fix, the remainder of which are based around D70676, after which this should be alright to land. Just to set expectations, as discussed above this patch reduces variable location coverage, but increases the quality by not artificially reordering assignments.

One significant new addition in this revision is the WebAssemblyDebugValueManager logic that was discussed a while back. Instead of only looking at DBG_VALUEs immediately after a vreg def, this examines DBG_VALUEs in the whole local block and collects those that refer to the same value number as the defining instruction. Happily this means no WASM test changes at all.

In the meantime, DW_AT_location-reference.ll has grown some modifications -- since last time it's because of LiveDebugValues improvements preserving locations for longer, which are currently masked by placeDbgValues reordering the assignments.

Herald added subscribers: hiraditya, jgravelle-google, sbc100. · View Herald TranscriptNov 25 2019, 11:53 AM

Orlando added a subscriber: Orlando.Nov 26 2019, 2:03 AM

Is it possible to do the WebAssembly updates in a separate patch, or is those changes dependent on also changing how CodeGenPrepare works?

llvm/include/llvm/CodeGen/MachineInstr.h
1654–1655	(Not something added in this patch, but since we are duplicate this, IMHO kind of weak, interface we might wanna do something about it.) The "matching" here is not that clear. I notice that you wrote "this instructions first vreg argument" below. But these functions those not even check that the first argument is a vreg. Nor that it is a def. And what it the MI has more than one def? One idea is to clearly specify that we only collect DBG_VALUE instructions using the register defined by operand zero (iff that operand is a def). And then the implementation of these functions should bail out if operand zero isn't a register def. We could also add a TODO that maybe we should look at all defs in the future. Or is it by design that we also collect DGB_VALUEs matching operand zero even if operand zero is a use? Then why aren't we doing this for all uses that this instruction got?
llvm/lib/CodeGen/MachineInstr.cpp
2124 ↗	(On Diff #230949)	Not sure what kind of idiom this is. Can't we just skip this MI that referes to this (it isn't used consistently below anyway as you are using "this->" in one place, and getParent() in another place. An alternative would be to make the function static, and provide MI as an argument.
2128 ↗	(On Diff #230949)	I think you definitely can skip "this->", but also "getParent()->getParent()->" since there is a getRegInfo() helper in MachineInstr.
2130 ↗	(On Diff #230949)	nit: If we assume that DBG_VALUE never is bundled (we should implement a verifier check for that if it does not exist), this could be use_bundles instead of use_instructions. I'm just saying that collectDebugValues is iterating on a bundle level, but right now this function also scan for DBG_VALUE inside bundles.

Bjorn wrote:

Is it possible to do the WebAssembly updates in a separate patch, or is those changes dependent on also changing how CodeGenPrepare works?

I guess they can, and they would get more visibility that way; they don't depend on placeDbgValues happily. I'll try splitting those into a separate patch.

llvm/include/llvm/CodeGen/MachineInstr.h
1654–1655	These are all great questions, and I don't have a great explanation for why collectDebugValues is the way it is. IMHO, the first operand being a vreg is an assumption that held in the past, largely hinging on placeDbgValues existing, and all the callers to collectDebugValues guaranteed it themselves. It's worth noting that I was originally going to completely delete this method, but the RISCV backend has grown a dependency on it, and would be the only user once this patch lands. That code (RISCVISelLower.cpp's emitSelectPseudo) seems to have a good use case, they're moving DBG_VALUEs out of a code sequence that's going to be completely rewritten. It doesn't need any consideration of what the DBG_VALUEs refer to, only that they come immediately after the current / "this" instruction. How about: Making collectDebugValues not consider operands at all: it becomes a helper to collect DBG_VALUEs that follow an instruction, nothing more. Making collectBlockDebugValues a helper for WebAssembly stuff, as that's the only real consumer. This means that there's no longer a helper method anywhere that "gets my debug insts". IMO, we should add these when we discover use cases, for example "collectDebugUsersOfDef", which could be defined a lot more precisely.
llvm/lib/CodeGen/MachineInstr.cpp
2124 ↗	(On Diff #230949)	I'll polish this and split it into the WebAssembly specific patch.
2130 ↗	(On Diff #230949)	Ah, I know approximately nothing about instruction bundling -- DBG_VALUEs never appear inside of a bundle?

This revision splits out the WebAssembly work... however it turns out that that code might not be necessary at all. I tried to put together a test demonstrating what would go wrong without placeDbgValues, but can't replicate any problem. For example, in the following block of wasm mir:

%0:i64 = ARGUMENT_i64 0, implicit $arguments
%1:i64 = ARGUMENT_i64 1, implicit $arguments
%2:i32 = ARGUMENT_i32 2, implicit $arguments
%7:i32 = I32_WRAP_I64 %0:i64, implicit-def dead $arguments
%4:i32 = ADD_I32 %2:i32, %7:i32, implicit-def dead $arguments
%5:i32 = I32_WRAP_I64 %1:i64, implicit-def dead $arguments
%6:i32 = ADD_I32 %4:i32, %5:i32, implicit-def dead $arguments
DBG_VALUE %4:i32, $noreg, !10, !DIExpression(), debug-location !13; <unknown>:357:12 line no:357
RETURN %6:i32, implicit-def dead $arguments

(Which, I think, adds some arguments together) the DBG_VALUE refers to a value several insts earlier, and wouldn't be picked up by collectDebugValues. Something due to the way the "stackifier" runs and rewrites instructions, means that the DBG_VALUE floats up to the nearest def of its vreg operand, and sticks there.

I can't say I really understand how wasm works, but it appears to be safe against DBG_VALUEs being placed over a wider region, at least for now. This does mean leaving collectDebugValues unchanged too, though.

aprantl added inline comments.Dec 4 2019, 4:11 PM

llvm/lib/CodeGen/CodeGenPrepare.cpp
7252	Is this expensive?
7255	continue, too?

Otherwise this looks far more manageable now. Ok with me if the other reviewers are fine.

Reshuffle the loop structure, early continue rather than an indented block. I've also moved the PHI/EHpad test ahead of the domination check for efficiency.

jmorse marked 2 inline comments as done.Dec 5 2019, 9:05 AM

jmorse added inline comments.

llvm/lib/CodeGen/CodeGenPrepare.cpp
7252	My understanding of dominator queries is that they're inexpensive, becoming just a hash table lookup of the instruction pointer. Building the dominator tree is what's expensive (at the start of placeDbgValues). It looks like CodeGenPrepare does keep a cached copy, but repeatedly invalidates and recomputes it as it changes things in the Function. IMO building the dominator tree at the start of the function shouldn't be "Too" (TM) expensive, and I have low confidence that we could re-use the one CodeGenPrepare keeps around: there might be a code change that invalidates it by the point placeDbgValues is called that we're not aware of.

LGTM pending Björn's review.

This revision is now accepted and ready to land.Dec 5 2019, 9:39 AM

LGTM!

Closed by commit rG00e238896cd8: [DebugInfo] Nerf placeDbgValues, with prejudice (authored by jmorse). · Explain WhyDec 9 2019, 4:55 AM

This revision was automatically updated to reflect the committed changes.

Revision Contents

Path

Size

llvm/

include/

llvm/

CodeGen/

MachineInstr.h

3 lines

lib/

CodeGen/

CodeGenPrepare.cpp

58 lines

test/

DebugInfo/

COFF/

42 lines

NVPTX/

debug-info.ll

4 lines

X86/

DW_AT_location-reference.ll

4 lines

PR37234.ll

3 lines

codegenprep-addrsink.ll

15 lines

tools/

llvm-locstats/

locstats.ll

4 lines

Diff 232811

llvm/include/llvm/CodeGen/MachineInstr.h

Show First 20 Lines • Show All 1,645 Lines • ▼ Show 20 Lines	if (MO.isReg() && MO.isTied()) {
getOperand(findTiedOperandIdx(OpIdx)).TiedTo = 0;		getOperand(findTiedOperandIdx(OpIdx)).TiedTo = 0;
MO.TiedTo = 0;		MO.TiedTo = 0;
}		}
}		}

/// Add all implicit def and use operands to this instruction.		/// Add all implicit def and use operands to this instruction.
void addImplicitDefUseOperands(MachineFunction &MF);		void addImplicitDefUseOperands(MachineFunction &MF);

/// Scan instructions following MI and collect any matching DBG_VALUEs.		/// Scan instructions immediately following MI and collect any matching
		/// DBG_VALUEs.
		bjopeUnsubmitted Not Done Reply Inline Actions (Not something added in this patch, but since we are duplicate this, IMHO kind of weak, interface we might wanna do something about it.) The "matching" here is not that clear. I notice that you wrote "this instructions first vreg argument" below. But these functions those not even check that the first argument is a vreg. Nor that it is a def. And what it the MI has more than one def? One idea is to clearly specify that we only collect DBG_VALUE instructions using the register defined by operand zero (iff that operand is a def). And then the implementation of these functions should bail out if operand zero isn't a register def. We could also add a TODO that maybe we should look at all defs in the future. Or is it by design that we also collect DGB_VALUEs matching operand zero even if operand zero is a use? Then why aren't we doing this for all uses that this instruction got? bjope: (Not something added in this patch, but since we are duplicate this, IMHO kind of weak…
		jmorseAuthorUnsubmitted Done Reply Inline Actions These are all great questions, and I don't have a great explanation for why collectDebugValues is the way it is. IMHO, the first operand being a vreg is an assumption that held in the past, largely hinging on placeDbgValues existing, and all the callers to collectDebugValues guaranteed it themselves. It's worth noting that I was originally going to completely delete this method, but the RISCV backend has grown a dependency on it, and would be the only user once this patch lands. That code (RISCVISelLower.cpp's emitSelectPseudo) seems to have a good use case, they're moving DBG_VALUEs out of a code sequence that's going to be completely rewritten. It doesn't need any consideration of what the DBG_VALUEs refer to, only that they come immediately after the current / "this" instruction. How about: Making collectDebugValues not consider operands at all: it becomes a helper to collect DBG_VALUEs that follow an instruction, nothing more. Making collectBlockDebugValues a helper for WebAssembly stuff, as that's the only real consumer. This means that there's no longer a helper method anywhere that "gets my debug insts". IMO, we should add these when we discover use cases, for example "collectDebugUsersOfDef", which could be defined a lot more precisely. jmorse: These are all great questions, and I don't have a great explanation for why collectDebugValues…
void collectDebugValues(SmallVectorImpl<MachineInstr *> &DbgValues);		void collectDebugValues(SmallVectorImpl<MachineInstr *> &DbgValues);

/// Find all DBG_VALUEs that point to the register def in this instruction		/// Find all DBG_VALUEs that point to the register def in this instruction
/// and point them to \p Reg instead.		/// and point them to \p Reg instead.
void changeDebugValuesDefReg(Register Reg);		void changeDebugValuesDefReg(Register Reg);

/// Returns the Intrinsic::ID for this instruction.		/// Returns the Intrinsic::ID for this instruction.
/// \pre Must have an intrinsic ID operand.		/// \pre Must have an intrinsic ID operand.
▲ Show 20 Lines • Show All 72 Lines • Show Last 20 Lines

llvm/lib/CodeGen/CodeGenPrepare.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 7,224 Lines • ▼ Show 20 Lines	if (SunkAddr) {
// information, and we can't generate bitcasts that may affect codegen.		// information, and we can't generate bitcasts that may affect codegen.
DVI.setOperand(0, MetadataAsValue::get(DVI.getContext(),		DVI.setOperand(0, MetadataAsValue::get(DVI.getContext(),
ValueAsMetadata::get(SunkAddr)));		ValueAsMetadata::get(SunkAddr)));
return true;		return true;
}		}
return false;		return false;
}		}

// llvm.dbg.value is far away from the value then iSel may not be able		// A llvm.dbg.value may be using a value before its definition, due to
// handle it properly. iSel will drop llvm.dbg.value if it can not		// optimizations in this pass and others. Scan for such dbg.values, and rescue
// find a node corresponding to the value.		// them by moving the dbg.value to immediately after the value definition.
		// FIXME: Ideally this should never be necessary, and this has the potential
		// to re-order dbg.value intrinsics.
bool CodeGenPrepare::placeDbgValues(Function &F) {		bool CodeGenPrepare::placeDbgValues(Function &F) {
bool MadeChange = false;		bool MadeChange = false;
		DominatorTree DT(F);

for (BasicBlock &BB : F) {		for (BasicBlock &BB : F) {
Instruction *PrevNonDbgInst = nullptr;
for (BasicBlock::iterator BI = BB.begin(), BE = BB.end(); BI != BE;) {		for (BasicBlock::iterator BI = BB.begin(), BE = BB.end(); BI != BE;) {
Instruction Insn = &BI++;		Instruction Insn = &BI++;
DbgValueInst *DVI = dyn_cast<DbgValueInst>(Insn);		DbgValueInst *DVI = dyn_cast<DbgValueInst>(Insn);
// Leave dbg.values that refer to an alloca alone. These		if (!DVI)
// intrinsics describe the address of a variable (= the alloca)
// being taken. They should not be moved next to the alloca
// (and to the beginning of the scope), but rather stay close to
// where said address is used.
if (!DVI \|\| (DVI->getValue() && isa<AllocaInst>(DVI->getValue()))) {
PrevNonDbgInst = Insn;
continue;		continue;
}

Instruction *VI = dyn_cast_or_null<Instruction>(DVI->getValue());		Instruction *VI = dyn_cast_or_null<Instruction>(DVI->getValue());
if (VI && VI != PrevNonDbgInst && !VI->isTerminator()) {
		if (!VI \|\| VI->isTerminator())
		continue;
		aprantlUnsubmitted Not Done Reply Inline Actions Is this expensive? aprantl: Is this expensive?
		jmorseAuthorUnsubmitted Done Reply Inline Actions My understanding of dominator queries is that they're inexpensive, becoming just a hash table lookup of the instruction pointer. Building the dominator tree is what's expensive (at the start of placeDbgValues). It looks like CodeGenPrepare does keep a cached copy, but repeatedly invalidates and recomputes it as it changes things in the Function. IMO building the dominator tree at the start of the function shouldn't be "Too" (TM) expensive, and I have low confidence that we could re-use the one CodeGenPrepare keeps around: there might be a code change that invalidates it by the point placeDbgValues is called that we're not aware of. jmorse: My understanding of dominator queries is that they're inexpensive, becoming just a hash table…

// If VI is a phi in a block with an EHPad terminator, we can't insert		// If VI is a phi in a block with an EHPad terminator, we can't insert
// after it.		// after it.
		aprantlUnsubmitted Done Reply Inline Actions continue, too? aprantl: continue, too?
if (isa<PHINode>(VI) && VI->getParent()->getTerminator()->isEHPad())		if (isa<PHINode>(VI) && VI->getParent()->getTerminator()->isEHPad())
continue;		continue;

		// If the defining instruction dominates the dbg.value, we do not need
		// to move the dbg.value.
		if (DT.dominates(VI, DVI))
		continue;

LLVM_DEBUG(dbgs() << "Moving Debug Value before :\n"		LLVM_DEBUG(dbgs() << "Moving Debug Value before :\n"
<< DVI << ' ' << VI);		<< DVI << ' ' << VI);
DVI->removeFromParent();		DVI->removeFromParent();
if (isa<PHINode>(VI))		if (isa<PHINode>(VI))
DVI->insertBefore(&*VI->getParent()->getFirstInsertionPt());		DVI->insertBefore(&*VI->getParent()->getFirstInsertionPt());
else		else
DVI->insertAfter(VI);		DVI->insertAfter(VI);
MadeChange = true;		MadeChange = true;
++NumDbgValueMoved;		++NumDbgValueMoved;
}		}
}		}
}
return MadeChange;		return MadeChange;
}		}

/// Scale down both weights to fit into uint32_t.		/// Scale down both weights to fit into uint32_t.
static void scaleWeights(uint64_t &NewTrue, uint64_t &NewFalse) {		static void scaleWeights(uint64_t &NewTrue, uint64_t &NewFalse) {
uint64_t NewMax = (NewTrue > NewFalse) ? NewTrue : NewFalse;		uint64_t NewMax = (NewTrue > NewFalse) ? NewTrue : NewFalse;
uint32_t Scale = (NewMax / std::numeric_limits<uint32_t>::max()) + 1;		uint32_t Scale = (NewMax / std::numeric_limits<uint32_t>::max()) + 1;
NewTrue = NewTrue / Scale;		NewTrue = NewTrue / Scale;
▲ Show 20 Lines • Show All 186 Lines • Show Last 20 Lines

llvm/test/DebugInfo/COFF/register-variables.ll

	Show All 26 Lines
	; ASM: #DEBUG_VALUE: f:p <- $ecx			; ASM: #DEBUG_VALUE: f:p <- $ecx
	; ASM: pushq %rsi			; ASM: pushq %rsi
	; ASM: subq $32, %rsp			; ASM: subq $32, %rsp
	; ASM: movl %ecx, %esi			; ASM: movl %ecx, %esi
	; ASM: [[p_ecx_esi:\.Ltmp.*]]:			; ASM: [[p_ecx_esi:\.Ltmp.*]]:
	; ASM: #DEBUG_VALUE: f:p <- $esi			; ASM: #DEBUG_VALUE: f:p <- $esi
	; ASM: callq getint			; ASM: callq getint
	; ASM: [[after_getint:\.Ltmp.*]]:			; ASM: [[after_getint:\.Ltmp.*]]:
	; ASM: #DEBUG_VALUE: a <- $eax
	; ASM: #DEBUG_VALUE: inlineinc:a <- $eax
	; ASM: #DEBUG_VALUE: c <- $eax
	; ASM: testl %esi, %esi			; ASM: testl %esi, %esi
	; ASM: je .LBB0_2			; ASM: je .LBB0_2
	; ASM: [[after_je:\.Ltmp.*]]:			; ASM: [[after_je:\.Ltmp.*]]:
	; ASM: # %bb.1: # %if.then			; ASM: # %bb.1: # %if.then
	; ASM-DAG: #DEBUG_VALUE: inlineinc:a <- $eax			; ASM-DAG: #DEBUG_VALUE: inlineinc:a <- $eax
	; ASM-DAG: #DEBUG_VALUE: a <- $eax			; ASM-DAG: #DEBUG_VALUE: a <- $eax
	; ASM-DAG: #DEBUG_VALUE: f:p <- $esi			; ASM-DAG: #DEBUG_VALUE: f:p <- $esi
	; ASM: addl $1, %eax			; ASM: addl $1, %eax
	; ASM: [[after_inc_eax:\.Ltmp.*]]:			; ASM: [[after_inc_eax:\.Ltmp.*]]:
	; ASM: #DEBUG_VALUE: inlineinc:b <- $eax			; ASM: #DEBUG_VALUE: inlineinc:b <- $eax
	; ASM: #DEBUG_VALUE: b <- $eax
	; ASM: addl $1, x(%rip)			; ASM: addl $1, x(%rip)
	; ASM: [[after_if:\.Ltmp.*]]:			; ASM: [[after_if:\.Ltmp.*]]:
	; ASM: .LBB0_2: # %if.else			; ASM: .LBB0_2: # %if.else
	; ASM: #DEBUG_VALUE: f:p <- $esi			; ASM: #DEBUG_VALUE: f:p <- $esi
				; ASM: #DEBUG_VALUE: c <- $eax
	; ASM: movl %eax, %ecx			; ASM: movl %eax, %ecx
	; ASM: addq $32, %rsp			; ASM: addq $32, %rsp
	; ASM: popq %rsi			; ASM: popq %rsi
	; ASM: [[func_end:\.Ltmp.*]]:			; ASM: [[func_end:\.Ltmp.*]]:
	; ASM: jmp putint # TAILCALL			; ASM: jmp putint # TAILCALL
				; ASM: [[func_finished:\.Ltmp.*]]:

	; ASM: .short 4414 # Record kind: S_LOCAL			; ASM: .short 4414 # Record kind: S_LOCAL
	; ASM: .asciz "p"			; ASM: .asciz "p"
	; ASM: .cv_def_range .Lfunc_begin0 [[p_ecx_esi]], reg, 18			; ASM: .cv_def_range .Lfunc_begin0 [[p_ecx_esi]], reg, 18
	; ASM: .cv_def_range [[p_ecx_esi]] [[func_end]], reg, 23			; ASM: .cv_def_range [[p_ecx_esi]] [[func_end]], reg, 23
	; ASM: .short 4414 # Record kind: S_LOCAL			; ASM: .short 4414 # Record kind: S_LOCAL
	; ASM: .asciz "c"			; ASM: .asciz "c"
	; ASM: .cv_def_range [[after_getint]] [[after_je]], reg, 17			; ASM: .cv_def_range [[after_if]] [[func_finished]], reg, 17
	; ASM: .short 4414 # Record kind: S_LOCAL			; ASM: .short 4414 # Record kind: S_LOCAL
	; ASM: .asciz "a"			; ASM: .asciz "a"
	; ASM: .cv_def_range [[after_getint]] [[after_inc_eax]], reg, 17			; ASM: .cv_def_range [[after_je]] [[after_inc_eax]], reg, 17
	; ASM: .short 4414 # Record kind: S_LOCAL			; ASM: .short 4414 # Record kind: S_LOCAL
	; ASM: .asciz "b"			; ASM: .asciz "b"
	; ASM: .cv_def_range [[after_inc_eax]] [[after_if]], reg, 17			; ASM: .cv_def_range [[after_if]] [[after_if]], reg, 17

				; Note: "b" is a victim of tail de-duplication / branch folding.

	; ASM: .short 4429 # Record kind: S_INLINESITE			; ASM: .short 4429 # Record kind: S_INLINESITE
	; ASM: .short 4414 # Record kind: S_LOCAL			; ASM: .short 4414 # Record kind: S_LOCAL
	; ASM: .asciz "a"			; ASM: .asciz "a"
	; ASM: .cv_def_range [[after_getint]] [[after_inc_eax]], reg, 17			; ASM: .cv_def_range [[after_je]] [[after_inc_eax]], reg, 17
	; ASM: .short 4414 # Record kind: S_LOCAL			; ASM: .short 4414 # Record kind: S_LOCAL
	; ASM: .asciz "b"			; ASM: .asciz "b"
	; ASM: .cv_def_range [[after_inc_eax]] [[after_if]], reg, 17			; ASM: .cv_def_range [[after_inc_eax]] [[after_if]], reg, 17
	; ASM: .short 4430 # Record kind: S_INLINESITE_END			; ASM: .short 4430 # Record kind: S_INLINESITE_END

	; OBJ: Subsection [			; OBJ: Subsection [
	; OBJ: SubSectionType: Symbols (0xF1)			; OBJ: SubSectionType: Symbols (0xF1)
	; OBJ: {{.}}Proc{{.}}Sym {			; OBJ: {{.}}Proc{{.}}Sym {
	Show All 26 Lines
	; OBJ: Type: int (0x74)			; OBJ: Type: int (0x74)
	; OBJ: Flags [ (0x0)			; OBJ: Flags [ (0x0)
	; OBJ: ]			; OBJ: ]
	; OBJ: VarName: c			; OBJ: VarName: c
	; OBJ: }			; OBJ: }
	; OBJ: DefRangeRegisterSym {			; OBJ: DefRangeRegisterSym {
	; OBJ: Register: EAX (0x11)			; OBJ: Register: EAX (0x11)
	; OBJ: LocalVariableAddrRange {			; OBJ: LocalVariableAddrRange {
	; OBJ: OffsetStart: .text+0xC			; OBJ: OffsetStart: .text+0x1A
	; OBJ: ISectStart: 0x0			; OBJ: ISectStart: 0x0
	; OBJ: Range: 0x4			; OBJ: Range: 0xC
	; OBJ: }			; OBJ: }
	; OBJ: }			; OBJ: }
	; OBJ: LocalSym {			; OBJ: LocalSym {
	; OBJ: Type: int (0x74)			; OBJ: Type: int (0x74)
	; OBJ: Flags [ (0x0)			; OBJ: Flags [ (0x0)
	; OBJ: ]			; OBJ: ]
	; OBJ: VarName: a			; OBJ: VarName: a
	; OBJ: }			; OBJ: }
	; OBJ: DefRangeRegisterSym {			; OBJ: DefRangeRegisterSym {
	; OBJ: Register: EAX (0x11)			; OBJ: Register: EAX (0x11)
	; OBJ: LocalVariableAddrRange {			; OBJ: LocalVariableAddrRange {
	; OBJ: OffsetStart: .text+0xC			; OBJ: OffsetStart: .text+0x10
	; OBJ: ISectStart: 0x0			; OBJ: ISectStart: 0x0
	; OBJ: Range: 0x7			; OBJ: Range: 0x3
	; OBJ: }
	; OBJ: }
	; OBJ: LocalSym {
	; OBJ: Type: int (0x74)
	; OBJ: Flags [ (0x0)
	; OBJ: ]
	; OBJ: VarName: b
	; OBJ: }
	; OBJ: DefRangeRegisterSym {
	; OBJ: Register: EAX (0x11)
	; OBJ: MayHaveNoName: 0
	; OBJ: OffsetStart: .text+0x13
	; OBJ: ISectStart: 0x0
	; OBJ: Range: 0x7
	; OBJ: }			; OBJ: }
	; OBJ: }			; OBJ: }
	; OBJ: InlineSiteSym {			; OBJ: InlineSiteSym {
	; OBJ: PtrParent: 0x0			; OBJ: PtrParent: 0x0
	; OBJ: PtrEnd: 0x0			; OBJ: PtrEnd: 0x0
	; OBJ: Inlinee: inlineinc (0x1002)			; OBJ: Inlinee: inlineinc (0x1002)
	; OBJ: }			; OBJ: }
	; OBJ: LocalSym {			; OBJ: LocalSym {
	; OBJ: Type: int (0x74)			; OBJ: Type: int (0x74)
	; OBJ: Flags [ (0x1)			; OBJ: Flags [ (0x1)
	; OBJ: IsParameter (0x1)			; OBJ: IsParameter (0x1)
	; OBJ: ]			; OBJ: ]
	; OBJ: VarName: a			; OBJ: VarName: a
	; OBJ: }			; OBJ: }
	; OBJ: DefRangeRegisterSym {			; OBJ: DefRangeRegisterSym {
	; OBJ: Register: EAX (0x11)			; OBJ: Register: EAX (0x11)
	; OBJ: LocalVariableAddrRange {			; OBJ: LocalVariableAddrRange {
	; OBJ: OffsetStart: .text+0xC			; OBJ: OffsetStart: .text+0x10
	; OBJ: ISectStart: 0x0			; OBJ: ISectStart: 0x0
	; OBJ: Range: 0x7			; OBJ: Range: 0x3
	; OBJ: }			; OBJ: }
	; OBJ: }			; OBJ: }
	; OBJ: LocalSym {			; OBJ: LocalSym {
	; OBJ: Type: int (0x74)			; OBJ: Type: int (0x74)
	; OBJ: Flags [ (0x0)			; OBJ: Flags [ (0x0)
	; OBJ: ]			; OBJ: ]
	; OBJ: VarName: b			; OBJ: VarName: b
	; OBJ: }			; OBJ: }
	▲ Show 20 Lines • Show All 116 Lines • Show Last 20 Lines

llvm/test/DebugInfo/NVPTX/debug-info.ll

This file is larger than 256 KB, so syntax highlighting is disabled by default.

	Show First 20 Lines • Show All 8,382 Lines • ▼ Show 20 Lines
	; CHECK-NEXT: .b32 9785 // DW_AT_abstract_origin			; CHECK-NEXT: .b32 9785 // DW_AT_abstract_origin
	; CHECK-NEXT: .b64 Ltmp2 // DW_AT_low_pc			; CHECK-NEXT: .b64 Ltmp2 // DW_AT_low_pc
	; CHECK-NEXT: .b64 Ltmp3 // DW_AT_high_pc			; CHECK-NEXT: .b64 Ltmp3 // DW_AT_high_pc
	; CHECK-NEXT: .b8 12 // DW_AT_call_file			; CHECK-NEXT: .b8 12 // DW_AT_call_file
	; CHECK-NEXT: .b8 6 // DW_AT_call_line			; CHECK-NEXT: .b8 6 // DW_AT_call_line
	; CHECK-NEXT: .b8 37 // DW_AT_call_column			; CHECK-NEXT: .b8 37 // DW_AT_call_column
	; CHECK-NEXT: .b8 43 // Abbrev [43] 0x2711:0x23 DW_TAG_inlined_subroutine			; CHECK-NEXT: .b8 43 // Abbrev [43] 0x2711:0x23 DW_TAG_inlined_subroutine
	; CHECK-NEXT: .b32 9791 // DW_AT_abstract_origin			; CHECK-NEXT: .b32 9791 // DW_AT_abstract_origin
	; CHECK-NEXT: .b64 Ltmp10 // DW_AT_low_pc			; CHECK-NEXT: .b64 Ltmp9 // DW_AT_low_pc
	; CHECK-NEXT: .b64 Ltmp11 // DW_AT_high_pc			; CHECK-NEXT: .b64 Ltmp10 // DW_AT_high_pc
	; CHECK-NEXT: .b8 12 // DW_AT_call_file			; CHECK-NEXT: .b8 12 // DW_AT_call_file
	; CHECK-NEXT: .b8 8 // DW_AT_call_line			; CHECK-NEXT: .b8 8 // DW_AT_call_line
	; CHECK-NEXT: .b8 5 // DW_AT_call_column			; CHECK-NEXT: .b8 5 // DW_AT_call_column
	; CHECK-NEXT: .b8 44 // Abbrev [44] 0x2729:0x5 DW_TAG_formal_parameter			; CHECK-NEXT: .b8 44 // Abbrev [44] 0x2729:0x5 DW_TAG_formal_parameter
	; CHECK-NEXT: .b32 9811 // DW_AT_abstract_origin			; CHECK-NEXT: .b32 9811 // DW_AT_abstract_origin
	; CHECK-NEXT: .b8 44 // Abbrev [44] 0x272e:0x5 DW_TAG_formal_parameter			; CHECK-NEXT: .b8 44 // Abbrev [44] 0x272e:0x5 DW_TAG_formal_parameter
	; CHECK-NEXT: .b32 9820 // DW_AT_abstract_origin			; CHECK-NEXT: .b32 9820 // DW_AT_abstract_origin
	; CHECK-NEXT: .b8 0 // End Of Children Mark			; CHECK-NEXT: .b8 0 // End Of Children Mark
	▲ Show 20 Lines • Show All 749 Lines • Show Last 20 Lines

llvm/test/DebugInfo/X86/DW_AT_location-reference.ll

	Show All 26 Lines
	; while (x & 2) { x *= a; }			; while (x & 2) { x *= a; }
	; a = g(0, x);			; a = g(0, x);
	; }			; }

	; // The 'x' variable and its symbol reference location			; // The 'x' variable and its symbol reference location
	; CHECK: .debug_info contents:			; CHECK: .debug_info contents:
	; CHECK: DW_TAG_variable			; CHECK: DW_TAG_variable
	; CHECK-NEXT: DW_AT_location [DW_FORM_sec_offset] (0x00000000			; CHECK-NEXT: DW_AT_location [DW_FORM_sec_offset] (0x00000000
	; Check that the location contains only 2 ranges.			; Check that the location contains only 4 ranges.
				; CHECK-NEXT: [0x{{[0-9a-f]}}, 0x{{[0-9a-f]}})
				; CHECK-NEXT: [0x{{[0-9a-f]}}, 0x{{[0-9a-f]}})
	; CHECK-NEXT: [0x{{[0-9a-f]}}, 0x{{[0-9a-f]}})			; CHECK-NEXT: [0x{{[0-9a-f]}}, 0x{{[0-9a-f]}})
	; CHECK-NEXT: [0x{{[0-9a-f]}}, 0x{{[0-9a-f]}}){{.*}})			; CHECK-NEXT: [0x{{[0-9a-f]}}, 0x{{[0-9a-f]}}){{.*}})
	; CHECK-NEXT: DW_AT_name {{.*}} "x"			; CHECK-NEXT: DW_AT_name {{.*}} "x"
	; CHECK-NEXT: DW_AT_decl_file			; CHECK-NEXT: DW_AT_decl_file
	; CHECK-NEXT: DW_AT_decl_line			; CHECK-NEXT: DW_AT_decl_line
	; CHECK-NEXT: DW_AT_type			; CHECK-NEXT: DW_AT_type

	; Check that we have no relocations in Darwin's output.			; Check that we have no relocations in Darwin's output.
	▲ Show 20 Lines • Show All 75 Lines • Show Last 20 Lines

llvm/test/DebugInfo/X86/PR37234.ll

	Show All 14 Lines
	; ++aa; // DexWatch('aa')			; ++aa; // DexWatch('aa')
	; }			; }
	; return aa;			; return aa;
	; }			; }


	; CHECK-LABEL: # %bb.{{.*}}:			; CHECK-LABEL: # %bb.{{.*}}:
	; CHECK: #DEBUG_VALUE: main:aa <- 0			; CHECK: #DEBUG_VALUE: main:aa <- 0
	; CHECK: #DEBUG_VALUE: main:aa <- $[[REG:[0-9a-z]+]]
	; CHECK: .LBB0_1:			; CHECK: .LBB0_1:
	; CHECK: #DEBUG_VALUE: main:aa <- $[[REG]]			; CHECK: #DEBUG_VALUE: main:aa <- $[[REG:[0-9a-z]+]]
	; CHECK: je .LBB0_4			; CHECK: je .LBB0_4
	; CHECK: # %bb.{{.*}}:			; CHECK: # %bb.{{.*}}:
	; CHECK: #DEBUG_VALUE: main:aa <- $[[REG]]			; CHECK: #DEBUG_VALUE: main:aa <- $[[REG]]
	; CHECK: jne .LBB0_1			; CHECK: jne .LBB0_1
	; CHECK: # %bb.{{.*}}:			; CHECK: # %bb.{{.*}}:
	; CHECK: #DEBUG_VALUE: main:aa <- $[[REG]]			; CHECK: #DEBUG_VALUE: main:aa <- $[[REG]]
	; CHECK: incl %[[REG]]			; CHECK: incl %[[REG]]
	; CHECK: #DEBUG_VALUE: main:aa <- $[[REG]]			; CHECK: #DEBUG_VALUE: main:aa <- $[[REG]]
	▲ Show 20 Lines • Show All 91 Lines • Show Last 20 Lines

llvm/test/DebugInfo/X86/codegenprep-addrsink.ll

	; RUN: llc -start-before=codegenprepare -stop-after=codegenprepare -mtriple=x86_64-unknown-unknown %s -o - \| FileCheck %s			; RUN: llc -start-before=codegenprepare -stop-after=codegenprepare -mtriple=x86_64-unknown-unknown %s -o - \| FileCheck %s
	;			;
	; CGP duplicates address calculation into each basic block that contains loads			; CGP duplicates address calculation into each basic block that contains loads
	; or stores, so that they can be folded into instruction memory operands for			; or stores, so that they can be folded into instruction memory operands for
	; example. dbg.value's should be redirected to identify such local address			; example. dbg.value's should be redirected to identify such local address
	; computations, to give the best opportunity for variable locations to be			; computations, to give the best opportunity for variable locations to be
	; preserved.			; preserved.
	; This test has two dbg.values in it, one before and one after the relevant			; This test has two dbg.values in it, one before and one after the relevant
	; memory instruction. Test that the one before does _not_ get updated (as that			; memory instruction. Test that the one before does _not_ get updated (as that
	; would either make it use-before-def or shift when the variable appears), and			; would either make it use-before-def or shift when the variable appears), and
	; that the dbg.value after the memory instruction does get updated.			; that the dbg.value after the memory instruction does get updated.
	;
	; Due to placeDbgValues, the dbg.values currently get shifted up a few
	; instructions.

	define dso_local i8 @foo(i32 *%p, i32 %cond) !dbg !7 {			define dso_local i8 @foo(i32 *%p, i32 %cond) !dbg !7 {
	entry:			entry:
	; The first dbg.value of %arith, in the 'next' block, will be moved up here			; There should be no dbg.values in this block.
	; by placeDbgValues,
	; CHECK-LABEL: entry:			; CHECK-LABEL: entry:
	; CHECK: dbg.value(metadata i8* %arith, metadata ![[DIVAR:[0-9]+]],			; CHECK-NOT: dbg.value
	; CHECK-SAME: metadata !DIExpression()
	%casted = bitcast i32 %p to i8			%casted = bitcast i32 %p to i8
	%arith = getelementptr i8, i8 *%casted, i32 3			%arith = getelementptr i8, i8 *%casted, i32 3
	%load1 = load i8, i8 *%arith			%load1 = load i8, i8 *%arith
	%cmpresult = icmp eq i32 %cond, 0			%cmpresult = icmp eq i32 %cond, 0
	br i1 %cmpresult, label %next, label %ret			br i1 %cmpresult, label %next, label %ret

	next:			next:
	; Address calcs should be duplicated into this block. One dbg.value should be			; Address calcs should be duplicated into this block. One dbg.value should be
	; updated, and the other should not.			; updated, and the other should not.
	; CHECK-LABEL: next:			; CHECK-LABEL: next:
	; CHECK: %[[CASTVAR:[0-9a-zA-Z]+]] = bitcast i32* %p to i8*			; CHECK: dbg.value(metadata i8* %arith, metadata ![[DIVAR:[0-9]+]],
				; CHECK-SAME: metadata !DIExpression()
				; CHECK-NEXT: %[[CASTVAR:[0-9a-zA-Z]+]] = bitcast i32* %p to i8*
	; CHECK-NEXT: %[[GEPVAR:[0-9a-zA-Z]+]] = getelementptr i8, i8* %[[CASTVAR]],			; CHECK-NEXT: %[[GEPVAR:[0-9a-zA-Z]+]] = getelementptr i8, i8* %[[CASTVAR]],
	; CHECK-SAME: i64 3			; CHECK-SAME: i64 3
				; CHECK-NEXT: %loaded = load i8, i8* %[[GEPVAR]]
	; CHECK-NEXT: call void @llvm.dbg.value(metadata i8* %[[GEPVAR]],			; CHECK-NEXT: call void @llvm.dbg.value(metadata i8* %[[GEPVAR]],
	; CHECK-SAME: metadata ![[DIVAR]],			; CHECK-SAME: metadata ![[DIVAR]],
	; CHECK-NEXT: %loaded = load i8, i8* %[[GEPVAR]]
	call void @llvm.dbg.value(metadata i8 *%arith, metadata !12, metadata !DIExpression()), !dbg !14			call void @llvm.dbg.value(metadata i8 *%arith, metadata !12, metadata !DIExpression()), !dbg !14
	%loaded = load i8, i8 *%arith			%loaded = load i8, i8 *%arith
	call void @llvm.dbg.value(metadata i8 *%arith, metadata !12, metadata !DIExpression()), !dbg !14			call void @llvm.dbg.value(metadata i8 *%arith, metadata !12, metadata !DIExpression()), !dbg !14
	ret i8 %loaded			ret i8 %loaded

	ret:			ret:
	ret i8 0			ret i8 0
	}			}
	Show All 24 Lines

llvm/test/tools/llvm-locstats/locstats.ll

	; UNSUPPORTED: system-windows			; UNSUPPORTED: system-windows
	; REQUIRES: x86-registered-target			; REQUIRES: x86-registered-target
	; RUN: llc %s -o %t0.o -filetype=obj			; RUN: llc %s -o %t0.o -filetype=obj
	; RUN: %llvm-locstats %t0.o \| FileCheck %s --check-prefix=LOCSTATS			; RUN: %llvm-locstats %t0.o \| FileCheck %s --check-prefix=LOCSTATS
	;			;
	; Test the llvm-locstats output.			; Test the llvm-locstats output.
	; LOCSTATS: 0% 0 0%			; LOCSTATS: 0% 0 0%
	; LOCSTATS: 1-9% 0 0%			; LOCSTATS: 1-9% 0 0%
	; LOCSTATS: 10-19% 0 0%			; LOCSTATS: 10-19% 0 0%
	; LOCSTATS: 20-29% 1 11%			; LOCSTATS: 20-29% 1 11%
	; LOCSTATS: 30-39% 0 0%			; LOCSTATS: 30-39% 0 0%
	; LOCSTATS: 40-49% 1 11%			; LOCSTATS: 40-49% 1 11%
	; LOCSTATS: 50-59% 1 11%			; LOCSTATS: 50-59% 1 11%
	; LOCSTATS: 60-69% 1 11%			; LOCSTATS: 60-69% 0 0%
	; LOCSTATS: 70-79% 0 0%			; LOCSTATS: 70-79% 0 0%
	; LOCSTATS: 80-89% 2 22%			; LOCSTATS: 80-89% 3 33%
	; LOCSTATS: 90-99% 1 11%			; LOCSTATS: 90-99% 1 11%
	; LOCSTATS: 100% 2 22%			; LOCSTATS: 100% 2 22%
	;			;
	; The source code of the test case:			; The source code of the test case:
	;extern int fn2 (int);			;extern int fn2 (int);
	;			;
	;__attribute__((noinline))			;__attribute__((noinline))
	;int			;int
	▲ Show 20 Lines • Show All 151 Lines • Show Last 20 Lines

This is an archive of the discontinued LLVM Phabricator instance.

[DebugInfo][CGP] Limit placeDbgValues movement of dbg.value intrinsicsClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 232811

llvm/include/llvm/CodeGen/MachineInstr.h

llvm/lib/CodeGen/CodeGenPrepare.cpp

llvm/test/DebugInfo/COFF/register-variables.ll

llvm/test/DebugInfo/NVPTX/debug-info.ll

llvm/test/DebugInfo/X86/DW_AT_location-reference.ll

llvm/test/DebugInfo/X86/PR37234.ll

llvm/test/DebugInfo/X86/codegenprep-addrsink.ll

llvm/test/tools/llvm-locstats/locstats.ll

[DebugInfo][CGP] Limit placeDbgValues movement of dbg.value intrinsics
ClosedPublic