This is an archive of the discontinued LLVM Phabricator instance.


Differential D81198

[docs] Specify rules for updating debug locations
Closed, Public

Authored by vsk on Jun 4 2020, 3:12 PM.

Details

Reviewers

jmorse
aprantl
dblaikie
echristo

Group Reviewers

debug-info

Commits

rGb4459b597a67: [docs] Specify rules for updating debug locations

Summary

Restructure HowToUpdateDebugInfo.rst to specify rules for when
transformations should preserve, merge, or drop debug locations.

The goal is to have clear, well-justified rules that come with a few
examples and counter-examples, so that pass authors can pick the best
strategy for managing debug locations depending on the specific task at
hand.

I've tried to set down sensible rules here that mostly align with what
we already do in llvm today, and that take a diverse set of use cases
into account (interactive debugging, crash triage, SamplePGO).

Please *do* try to pick these rules apart and suggest clarifications or
improvements :).

Side note: Prior to 24660ea1, this document was structured as a long
list of very specific code transformations -- the idea being that we
would fill in what to do in each specific case. I chose to reorganize
the document as a list of actions to take because it drastically cuts
down on the amount of redundant exposition/explanation needed. I hope
that's fine...

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

vsk created this revision. Jun 4 2020, 3:12 PM

Herald added a project: Restricted Project. Jun 4 2020, 3:12 PM

vsk added a reviewer: debug-info. Jun 4 2020, 3:13 PM

This is really awesome to write up! One inline request for some elaboration, but otherwise I'm happy with this. For the record I was contemplating asking for more specifics under when to drop or moving the bits for merging down... but I'm not entirely sure that's better, just different :)

-eric

llvm/docs/HowToUpdateDebugInfo.rst
86–87	Bit more clarification here?

This revision is now accepted and ready to land. Jun 4 2020, 3:47 PM

dblaikie added inline comments. Jun 4 2020, 4:05 PM

llvm/docs/HowToUpdateDebugInfo.rst
29	Is this kind of "must" language consistent with the rest of this document? In this case it's an aspirational, sort of "this is ideal/what we're striving for" rather than "if you don't do this it'll fail the verifier/break", etc?
62–64	Should this be "or" rather than "and"? If you merge two instructions even in the same BB, I think the goal is you should use a merged location, right?
121–123	Might be worth a separate header - not sure how consistently we do this or how problematic it is if this isn't done.

vsk marked an inline comment as done. Jun 4 2020, 4:52 PM

vsk added a subscriber: danielcdh.

vsk added inline comments.

llvm/docs/HowToUpdateDebugInfo.rst
29	This is aspirational -- would "should" be more appropriate?
62–64	I'm of two minds on this one. The applicable (hypothetical) example here is "(A * B) + C => llvm.fma.f32(A, B, C)". If the part about "being in a different block" is kept, then the location of the "fma" instruction should be determined by location cascade. That location has a realistic shot at mapping back to some real (if somewhat arbitrary) source line. If we get rid of the "being in a different block" language, the location of the "fma" instruction should be artificial (line 0). It seems like both options suit the interactive debugging and crash triage use cases. But maybe location cascade would be a better fit for SamplePGO, if that needs to find a "real" line number to increment the right basic block execution count? It'd be helpful for anyone familiar with SamplePGO to express some opinion (cc @danielcdh).
86–87	How about: "Converting an if-then-else CFG diamond into a select. Preserving the debug locations of speculated instructions can make it seem like a condition is true when it's not (or vice versa), which can lead to a confusing single-stepping experience. The rule for :ref:`dropping locations<WhenToDropLocation>` should apply here." ?
121–123	That's a good point, e.g. I don't think the ASan pass bothers with setting line 0 locations. I'll try to split this up.

Harbormaster completed remote builds in B59154: Diff 268602. Jun 4 2020, 5:09 PM

dblaikie added inline comments. Jun 5 2020, 9:35 AM

llvm/docs/HowToUpdateDebugInfo.rst
29	"should" sounds right to me - but open to other folks preferences/ideas about how this should be framed
62–64	Not sure I follow the fma example. Let me rephrase the words in the document to see if we're both understanding them the same way: "IF multiple instructions are merged and the merged location is in a different block than one of the inputs - you /should/ (must, whatever) merge" - the implication is then if you have multiple instructions that don't change blocks, the location should /not/ be merged? That doesn't sound like what I'd expect - in the fma example, if the * and - were from the same basic block, I'd still expect to merge the location rather than dropping or zeroing the location. So that scope information was preserved (eg: if * and - came from the same inline function, then the fma should still be attributed to that inline function).

vsk added inline comments. Jun 5 2020, 12:39 PM

llvm/docs/HowToUpdateDebugInfo.rst
62–64	Re: "the implication is then if you have multiple instructions that don't change blocks, the location should /not/ be merged" -- yep, we're on the same page about the proposed rule. Your point about expecting the right inline scope on merged instructions is compelling though. I'll fix this up. Part of my confusion here was over how exactly SamplePGO works: can it correctly attribute a sample of a line 0 location to the scope it points to? Maybe that's moot, though, since relying on location cascade to assign locations to merged instructions could be more misleading than helpful.

dblaikie added inline comments. Jun 8 2020, 12:18 PM

llvm/docs/HowToUpdateDebugInfo.rst
62–64	> Part of my confusion here was over how exactly SamplePGO works:
I don't actually know how it works.
> can it correctly attribute a sample of a line 0 location to the scope it points to?
In theory I think such samples could be used, but fairly likely they aren't used. (You could imagine a sample-based profiler could look at all source lines within a sampled basic block and mark all those source lines as sampled.)

Greatly enjoying the fact that the rationale behind each rule is explained, thanks for putting this together.

llvm/docs/HowToUpdateDebugInfo.rst
29–32	Do you see this rule as covering tail duplication? AFAIUI preserving the location when tail duplicating is correct, but involves multiple non-unique predecessors. (Or, I misunderstand unique here).

probinson added a subscriber: probinson. Jun 8 2020, 12:40 PM

probinson added inline comments.

llvm/docs/HowToUpdateDebugInfo.rst
62–64	I believe that (at least how we use it in Sony) SPGO doesn't use a line-0 location, but snoops around looking for a real source location in the same block. Jeremy will poke a couple people who actually worked on it to try to get a more definitive answer tomorrow (they're in the UK).

rob.lougher added a subscriber: rob.lougher. Jun 9 2020, 7:04 AM

rob.lougher added inline comments.

llvm/docs/HowToUpdateDebugInfo.rst
62–64	When sampling an executable, the "hits" are addresses which must be mapped back to source locations using debug line info (i.e. addr2line). Any transform that moves or merges instructions without updating the debug location can therefore affect the quality of the sample. The standard example is tail-merging (which in fact was the first optimisation we fixed). Imagine we have an if-then-else, where the "then" and "else" blocks end with the same instruction(s). Tail-merge will common the instructions in the successor block. For block-reordering, we want to know if one side of the if-then-else is executed much more than the other. But if tail-merge has retained the original debug locations in the merged tail we will get an inaccurate sample. If the "then" location is used, hits on the common instructions will be attributed to the "then" block which is wrong. Likewise if the "else" locations were used. The location of the tail is ambiguous. To fix this we introduced the "getMergedLocation" call. The initial version was very simple. If the locations were the same, either can be used. But if the locations are different, we can't give a location and an empty ("unknown") location is used. At some point, this "unknown" location ends up generating a DWARF line-0 record. Now this is the bit I didn't directly work on so I don't know where it occurs. My memory is that the work was done by Paul, and that an "unknown" location at the start of a basic block forced a line-0. This did exactly what we wanted, and we rewrote our initial patch to tail-merge that explicitly created a line-0 location.
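
The merge behaviour described in the comment above can be sketched with a toy location type. This is only an illustration under stated assumptions: `ToyLoc` and `mergeToyLocs` are hypothetical names, not LLVM classes or APIs, and the sketch models only the "initial version" mentioned above (identical locations are kept, differing locations become unknown). The toy also represents "unknown" directly as line 0 rather than modelling the later DWARF emission step.

```cpp
#include <cassert>
#include <string>

// Toy stand-in for a debug location. This is NOT LLVM's DILocation.
struct ToyLoc {
    std::string file;
    unsigned line; // 0 stands for "unknown" in this sketch
};

// True when both toy locations name the same file and line.
bool sameLoc(const ToyLoc &a, const ToyLoc &b) {
    return a.file == b.file && a.line == b.line;
}

// Hypothetical helper: identical locations survive the merge, while
// differing locations collapse to the "unknown" (line 0) location.
ToyLoc mergeToyLocs(const ToyLoc &a, const ToyLoc &b) {
    if (sameLoc(a, b))
        return a;
    return ToyLoc{"", 0};
}
```

With this model, merging a "then" location at line 10 with an "else" location at line 12 yields line 0, so a sample on the common tail is attributed to neither side.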

rob.lougher added inline comments. Jun 9 2020, 7:22 AM

llvm/docs/HowToUpdateDebugInfo.rst
62–64	P.S. We did a presentation of this work at EuroLLVM 2017 (https://www.youtube.com/watch?v=ceCEXnuWdmo) which you can watch for more examples.

probinson added inline comments. Jun 9 2020, 7:45 AM

llvm/docs/HowToUpdateDebugInfo.rst
62–64	Rob, I think the most relevant question is: If there's a sample hit on an address that maps to line 0, what does SPGO do with it? Throw it away? Rummage around for a non-zero location?

rob.lougher added inline comments. Jun 9 2020, 10:31 AM

llvm/docs/HowToUpdateDebugInfo.rst
62–64	This is handled by AutoFDO (which reads a "raw" sample and converts it into a sample with lines relative to the start of the function). It's 4 years since I hacked that code so I wanted to check to make sure. Anyway, the answer is that samples for addresses that map to line 0 are simply thrown away...
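
A rough sketch of what "thrown away" means for the resulting profile, assuming a toy sample record. `ToySample` and `countByOffset` are hypothetical names and do not reflect AutoFDO's actual data model or code; the sketch assumes a single function and only mirrors the two points made above: lines are stored relative to the start of the function, and hits that map to line 0 contribute nothing.

```cpp
#include <cassert>
#include <map>
#include <vector>

// One sampled address after it has been mapped through the line table.
// A toy record for illustration only.
struct ToySample {
    unsigned line;          // source line of the hit, 0 if it maps to line 0
    unsigned funcStartLine; // line on which the enclosing function starts
};

// Counts hits per line offset relative to the function start.
// Samples that map to line 0 are discarded.
std::map<unsigned, unsigned> countByOffset(const std::vector<ToySample> &samples) {
    std::map<unsigned, unsigned> counts;
    for (const ToySample &s : samples) {
        if (s.line == 0)
            continue; // no usable source line: the hit is dropped
        if (s.line < s.funcStartLine)
            continue; // malformed input for this sketch; skip it
        ++counts[s.line - s.funcStartLine];
    }
    return counts;
}
```

In this model a merged instruction with a line 0 location neither helps nor misleads the profile, whereas a stale "then" or "else" location would inflate the count for the wrong line.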

vsk marked 2 inline comments as done. Jun 10 2020, 11:37 AM

vsk added inline comments.

llvm/docs/HowToUpdateDebugInfo.rst
29–32	Thanks for catching this, I think it should cover tail duplication. I've updated the wording to: '... if its basic block is folded into a predecessor that branches unconditionally'.
62–64	@probinson @rob.lougher thanks for the context on SamplePGO. I'm going to propose switching the language here to recommend merging instruction locations, because the alternative of relying on location cascade doesn't seem like it's better for SamplePGO purposes.

Incorporate review feedback. Mainly:

* Recommend merging locations in block-local combines. Add a counter-example where this doesn't apply.
* Recommend preserving locations in tail duplication.

Harbormaster failed remote builds in B59854: Diff 269924! Jun 10 2020, 12:47 PM

Thanks! I added a few nits inline.

llvm/docs/HowToUpdateDebugInfo.rst
35	s/source lines/source locations/ That's also what they are called in Clang.
36	`Debugging, crash logs, and SamplePGO accuracy, ...`
41	s/line/source/
46	Source
48	Simple peephole optimizations that replace or expand an instruction, like
49	I would either reuse + and << or use add and shl in the previous line
54	s/breakpoint/source/
88	I wonder if we should drop the Simple/Complex adjectives altogether?
97	I would be curious about an explanation what makes this example different from the fma example above.

vsk marked 8 inline comments as done. Jun 11 2020, 6:25 PM

vsk added inline comments.

llvm/docs/HowToUpdateDebugInfo.rst
97	Thanks for pressing on this, it's not terribly clear as-written. How about: "Block-local peepholes which delete redundant instructions, like ``(sext (zext i8 %x to i16) to i32) => (zext i8 %x to i32)``. The inner ``zext`` is modified but remains in its block, so the rule for preserving locations should apply."

Address some review feedback.

Harbormaster failed remote builds in B60056: Diff 270276! Jun 11 2020, 7:15 PM

aprantl added inline comments. Jun 12 2020, 10:41 AM

llvm/docs/HowToUpdateDebugInfo.rst
97	sounds good!

I think I've addressed all the outstanding feedback. I'll plan to land this in 24h if there aren't any more comments. Thanks!

We recently ran into a case where it looks like the variable's location was correct but the containing scope was wrong. I guess we'll need a separate update for how to manage scopes.
Not an objection to this patch, just observing there are more cases to think about.

For the record, this all LGTM.

In D81198#2101298, @probinson wrote:

> We recently ran into a case where it looks like the variable's location was correct but the containing scope was wrong. I guess we'll need a separate update for how to manage scopes.
> Not an objection to this patch, just observing there are more cases to think about.

Thanks for flagging the issue, it's not something I've spent much time thinking about yet. My plan is to try and write some guidelines for updating debug values next. If you (or anyone else) would like to write up some guidance about dealing with scopes, that would be great!

Closed by commit rGb4459b597a67: [docs] Specify rules for updating debug locations (authored by vsk). Jun 18 2020, 2:17 PM

This revision was automatically updated to reflect the committed changes.

Revision Contents

Path: llvm/docs/HowToUpdateDebugInfo.rst
Size: 112 lines

Diff 268602

llvm/docs/HowToUpdateDebugInfo.rst


				This document specifies how to correctly update debug info in various kinds of
				code transformations, and offers suggestions for how to create targeted debug
				info tests for arbitrary transformations.

				For more on the philosophy behind LLVM debugging information, see
				:doc:`SourceLevelDebugging`.

	IR-level transformations			Rules for updating debug locations
	========================			==================================

	Deleting an Instruction			.. _WhenToPreserveLocation:
	-----------------------
				When to preserve an instruction location
				----------------------------------------

				A transformation must preserve the debug location of an instruction if the
				instruction either remains in its basic block, or if its basic block is folded
				into a unique predecessor. The APIs to use are ``IRBuilder``, or
				``Instruction::setDebugLoc``.

				The purpose of this rule is to ensure that common block-local optimizations
				preserve the ability to set breakpoints on source lines corresponding to the
				instructions they touch. Debugging, as well as SamplePGO accuracy, would be
				severely impacted if that ability were lost.

				Examples of transformations that must follow this rule include:

				* Instruction scheduling. Block-local instruction reordering must not drop line
				locations, even though this may lead to jumpy single-stepping behavior.

				* Simple jump threading. For example, if block ``B1`` unconditionally jumps to
				``B2``, and is its unique predecessor, instructions from ``B2`` can be
				hoisted into ``B1``. Line locations from ``B2`` should be preserved.

				* Simple peephole optimizations, like ``(X + X) => (X << 1)``. The location of
				the ``shl`` instruction must be the same as the location of the ``add``
				instruction.
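
The peephole case in the list above can be illustrated with a toy instruction type. This is a sketch under stated assumptions, not LLVM code: `ToyBinOp` and `foldAddToShl` are hypothetical names, and copying the `line` field only stands in for what a real pass would do through ``IRBuilder`` or ``Instruction::setDebugLoc``.

```cpp
#include <cassert>
#include <string>

// Toy binary instruction with an attached source line.
// This is NOT an llvm::Instruction.
struct ToyBinOp {
    std::string opcode;
    std::string lhs;
    std::string rhs;
    unsigned line;
};

// Hypothetical peephole: rewrites `add X, X` as `shl X, 1`.
// The replacement stays in the same block, so it takes over the location
// of the instruction it replaces.
ToyBinOp foldAddToShl(const ToyBinOp &add) {
    assert(add.opcode == "add" && add.lhs == add.rhs);
    ToyBinOp shl{"shl", add.lhs, "1", 0};
    shl.line = add.line; // preserve the original location
    return shl;
}
```

A breakpoint set on the source line of the original ``add`` would still resolve to the ``shl`` in this model.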

				Examples of transformations for which this rule does not apply include:

				* LICM. E.g., if an instruction is moved from the loop body to the preheader,
				the rule for :ref:`dropping locations<WhenToDropLocation>` applies.

				.. _WhenToMergeLocation:

				When to merge instruction locations
				-----------------------------------

				A transformation must merge instruction locations if it replaces multiple
				instructions with a single merged instruction, and the merged instruction is
				in a different block than at least one of the instructions-to-be-merged. The
				API to use is ``Instruction::applyMergedLocation``.

				The purpose of this rule is to a) ensure that the single merged instruction
				has a location with an accurate scope attached, and b) prevent misleading
				single-stepping (or breakpoint) behavior. Often, merged instructions are memory
				accesses which can trap: having an accurate scope attached greatly assists in
				crash triage by identifying the (possibly inlined) function where the bad
				memory access occurred. This rule is also meant to assist SamplePGO by banning
				scenarios in which a sample of a block containing a merged instruction is
				misattributed to a block containing one of the instructions-to-be-merged.

				Examples of transformations that must follow this rule include:

				* Merging identical loads/stores which occur on both sides of a CFG diamond
				(see the ``MergedLoadStoreMotion`` pass).

				* Merging identical loop-invariant stores (see the LICM utility
				``llvm::promoteLoopAccessesToScalars``).

				Examples of transformations for which this rule does not apply include:

				* Speculative execution of different instructions from both sides of a CFG
				diamond. The rule for :ref:`dropping locations<WhenToDropLocation>` applies.

				* Hoisting identical instructions which appear in several successor blocks into
				a predecessor block (see ``BranchFolder::HoistCommonCodeInSuccs``). In this
				case there is no single merged instruction. The rule for
				:ref:`dropping locations<WhenToDropLocation>` applies.

				* Complex peephole optimizations which combine multiple non-identical
				instructions together, like ``(A * B) + C => llvm.fma.f32(A, B, C)``. This is
				a block-local optimization. However, note that the location of the ``fma``
				doesn't exactly correspond to the locations of the ``mul`` or ``add``
				instructions. The rule for :ref:`dropping locations<WhenToDropLocation>`
				applies.

				.. _WhenToDropLocation:

				When to drop an instruction location
				------------------------------------

				A transformation must drop debug locations (or apply artificial debug
				locations) if the rules for :ref:`preserving<WhenToPreserveLocation>` and
				:ref:`merging<WhenToMergeLocation>` debug locations do not apply. The API to
				use is ``Instruction::setDebugLoc()``.

				The purpose of this rule is to minimize erratic single-stepping behavior and to
				prevent misleading breakpoint behavior. To handle an instruction without a
				location, the DWARF generator defaults to allowing the last-set location after
				a label to cascade forward, or to setting a line 0 location with viable scope
				information if no previous location is available.
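
The cascade behavior described above can be modelled roughly as follows. This is a simplified sketch, not the DWARF generator's implementation: `ToySlot` and `cascadeLines` are hypothetical names, the input is assumed to be the instructions of one block in order after its label, and scope information is ignored.

```cpp
#include <cassert>
#include <vector>

// Toy instruction slot: either carries a source line or has no location.
struct ToySlot {
    bool hasLoc;
    unsigned line; // meaningful only when hasLoc is true
};

// Returns the line a toy line-table emitter would record for each
// instruction, scanning forward from the block's label.
std::vector<unsigned> cascadeLines(const std::vector<ToySlot> &block) {
    std::vector<unsigned> emitted;
    bool haveLast = false; // no location has been set since the label
    unsigned lastLine = 0;
    for (const ToySlot &slot : block) {
        if (slot.hasLoc) {
            lastLine = slot.line;
            haveLast = true;
        }
        // Without a location, reuse the last-set line, or fall back to line 0.
        emitted.push_back(haveLast ? lastLine : 0);
    }
    return emitted;
}
```

For a block whose instructions carry (none, 7, none, 9, none), the model records lines 0, 7, 7, 9, 9: a dropped location inherits whatever precedes it, which is why it may map back to a real but somewhat arbitrary line.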

				See the discussion in the section about
				:ref:`merging locations<WhenToMergeLocation>` for examples of when the rule for
				dropping locations applies.
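
As a reading aid, the three rules as worded in this diff (Diff 268602) can be condensed into a small decision function. This is a sketch under stated assumptions: `ToyChange`, `LocAction`, and `chooseAction` are hypothetical names, and the order of the checks is this sketch's own choice. Note that the review discussion later moved toward recommending merged locations for block-local combines as well, which this sketch does not reflect.

```cpp
#include <cassert>

enum class LocAction { Preserve, Merge, Drop };

// Facts about how a transformation affects an instruction.
// Field names are hypothetical and exist only for this sketch.
struct ToyChange {
    bool staysInBlock;           // instruction remains in its basic block
    bool foldedIntoUniquePred;   // its block is folded into a unique predecessor
    bool replacesMultiple;       // one merged instruction replaces several
    bool mergedInDifferentBlock; // merged instruction leaves the block of
                                 // at least one of the instructions it replaces
};

// Picks an action following the wording of Diff 268602.
LocAction chooseAction(const ToyChange &c) {
    if (c.replacesMultiple)
        // Block-local combines fall through to Drop in this version of the rules.
        return c.mergedInDifferentBlock ? LocAction::Merge : LocAction::Drop;
    if (c.staysInBlock || c.foldedIntoUniquePred)
        return LocAction::Preserve;
    return LocAction::Drop; // neither of the other rules applies
}
```

Under this model, instruction scheduling maps to Preserve, merging identical loads from both sides of a CFG diamond maps to Merge, and both LICM hoisting and the ``fma`` combine map to Drop.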

				When creating a new instruction that doesn't map back to a source line, use an
				artificial line 0 location instead of setting no location at all. The API to
				use for this is ``DILocation::get()``.

				Rules for updating debug values
				===============================

				Deleting an IR-level Instruction
				--------------------------------

				When an ``Instruction`` is deleted, its debug uses change to ``undef``. This is
				a loss of debug info: the value of one or more source variables becomes
				unavailable, starting with the ``llvm.dbg.value(undef, ...)``. When there is no
				way to reconstitute the value of the lost instruction, this is the best
				possible outcome. However, it's often possible to do better:

				* If the dying instruction can be RAUW'd, do so. The
