This is an archive of the discontinued LLVM Phabricator instance.

docs/LangRef.rst
4613 ↗	(On Diff #156375)	What makes value operands to dbg.value implicit or concrete in LLVM IR? Are SSA values from local instructions concrete, and constants implicit? We could describe that here.

vsk added inline comments. Jul 19 2018, 4:26 PM

docs/LangRef.rst
4613 ↗	(On Diff #156375)	Sure. The way I read it, it depends on the DIType of the described variable. The value operand is concrete iff it is a pointer to an instance of that DIType. So, the value operand in dbg.value(const-ptr-null, "int *p") is implicit, but concrete in dbg.value(const-ptr-null, "int"). At least, that's the only consistent explanation I've thought of. I don't know how the backend actually determines this. IIUC D49454/D49520 is an example of the backend getting this wrong: it treats a pointer to a std::deque as the implicit location of the std::deque.

bjope added inline comments. Jul 20 2018, 6:55 AM

docs/LangRef.rst
4605 ↗	(On Diff #156375)	Is it true that a debugger must be able to modify the variable for an `llvm.dbg.addr`? Any specific reason, or are we just trying to put limitations on the DIExpression in a `llvm.dbg.addr` intrinsic?
4613 ↗	(On Diff #156375)	My interpretation (with very little experience of `llvm.dbg.addr`) has been that `llvm.dbg.addr` is the IR version of an indirect DBG_VALUE, and `llvm.dbg.value` is the IR version of a non-indirect DBG_VALUE. At least that seems to be the difference in SelectionDAG.

Afaict the first argument in a `dbg.value`, together with the DIExpression, describes the value of the variable. The first argument in `dbg.addr`, together with the DIExpression, describes the address of the variable. And I think the first argument in `dbg.value` should be treated as a value, and the first argument in `dbg.addr` should be treated as an indirect pointer.

A DIExpression might be used in dbg.declare, dbg.addr, dbg.value, direct DBG_VALUE and indirect DBG_VALUE, and it can be both tricky and confusing to know how to interpret it. Depending on which intrinsic is used, or whether the DBG_VALUE is direct or indirect, the DIExpression could have an implied DW_OP_stack_value or DW_OP_deref at the end (or even at the front?). As this is hard to understand, improving the documentation is a really nice initiative!

One question is whether we need to be able to indicate that there is an indirect value operand in a `dbg.value`. Or isn't it enough that if you, for example, want to describe variable !Y's value as (X[0] + 5), you include a DW_OP_deref, such as

dbg.value(X, !Y, DIExpression(DW_OP_deref, DW_OP_constu 5, DW_OP_add))

The above will become a direct DBG_VALUE since `dbg.value` is used. The DW_OP_deref is needed since by default the first argument in `dbg.value` is treated as a value and not a pointer. The variable will be described using an "implicit location" (DWARF terminology). Are you even saying that depending on !Y it might be wrong to have the DW_OP_deref here?

Btw, I think it is confusing to use "concrete" as terminology for the value operand. Isn't the question whether the value operand is direct or indirect (whether it is a value or a pointer)?
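The (X[0] + 5) example above can be traced by hand with a toy stack machine. This is only a sketch under stated assumptions: `evaluate`, the `memory` dictionary, and the address `0x1000` are hypothetical names made up for illustration, the comment's `DW_OP_add` is read as the documented `DW_OP_plus`, and nothing here reflects how LLVM or a debugger actually implements expression evaluation.

```python
# Toy model only: not LLVM's or any debugger's real implementation.
def evaluate(operand, ops, memory):
    """Push the intrinsic's first operand, then run the opcodes left to right."""
    stack = [operand]
    i = 0
    while i < len(ops):
        op = ops[i]
        if op == "DW_OP_deref":
            # Replace the address on top of the stack with the value stored there.
            stack.append(memory[stack.pop()])
        elif op == "DW_OP_constu":
            # The next element is an immediate operand, not an opcode.
            i += 1
            stack.append(ops[i])
        elif op == "DW_OP_plus":
            rhs, lhs = stack.pop(), stack.pop()
            stack.append(lhs + rhs)
        else:
            raise ValueError(f"unsupported opcode: {op}")
        i += 1
    return stack[-1]


# Hypothetical program state: X == 0x1000 and X[0] == 37.
memory = {0x1000: 37}
X = 0x1000

value_of_Y = evaluate(X, ["DW_OP_deref", "DW_OP_constu", 5, "DW_OP_plus"], memory)
print(value_of_Y)  # 42
```

Dropping the `DW_OP_deref` from the list would instead yield `X + 5`, the pointer plus five, which is why the dereference is needed when the first operand of a `dbg.value` is a pointer.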

vsk updated this revision to Diff 156914. Jul 23 2018, 4:19 PM

vsk marked an inline comment as done.

vsk edited the summary of this revision.

vsk added inline comments.

docs/LangRef.rst
4605 ↗	(On Diff #156375)	No, I'll walk this back. It's valid to describe a read-only memory location. After thinking about it some more, I don't think there's really an issue with DW_OP_stack_value inside of a llvm.dbg.addr either.
4613 ↗	(On Diff #156375)	My first response to @rnk here was incorrect: implicit vs. concrete is not the same distinction as direct vs. indirect. The latter is the relevant distinction and it has nothing to do with DIType. I consider @bjope's description here to be the "common sense" one we all thought was correct: interpreting a dbg.value should give a direct value, and interpreting a dbg.{addr,declare} should give an indirect value. I'll update this patch to make those definitions precise. Basically, there should be exactly one way to interpret a DIExpression, without any implicit DW_OP_stack_value or DW_OP_deref added based on the context of which intrinsic / what type of location you have. Once we land the fix in D49454 I think we'll either actually have that model or be really close. Right now there is some special magic with non-empty DIExpressions, but I hope to eliminate that.
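A small sketch of the direct versus indirect model described in this thread, assuming a toy `memory` dictionary and a hypothetical helper `read_variable` (neither comes from LLVM). The expression is evaluated the same way for every intrinsic; only the meaning of the final result differs: a value for `dbg.value`, an address for `dbg.addr` and `dbg.declare`.

```python
# Toy model only; just DW_OP_deref is handled, to keep the sketch short.
def read_variable(intrinsic, operand, expr, memory):
    """Return the variable's current value as a debugger would report it."""
    result = operand
    for op in expr:
        if op != "DW_OP_deref":
            raise ValueError(f"unsupported opcode: {op}")
        result = memory[result]
    if intrinsic == "dbg.value":
        # The evaluated result is the variable's value itself.
        return result
    if intrinsic in ("dbg.addr", "dbg.declare"):
        # The evaluated result is the variable's address; load through it.
        return memory[result]
    raise ValueError(f"unknown intrinsic: {intrinsic}")


memory = {0x2000: 7}  # hypothetical stack slot holding the variable
slot = 0x2000

via_addr = read_variable("dbg.addr", slot, [], memory)
via_value = read_variable("dbg.value", slot, ["DW_OP_deref"], memory)
print(via_addr, via_value)  # 7 7
```

In this model a `dbg.value` of the same pointer with an empty expression would report the pointer itself (`0x2000`) as the variable's value, which is the kind of misdescription the thread attributes to D49454.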

Minor wordsmithing.

lgtm

This revision is now accepted and ready to land. Jul 27 2018, 4:15 PM

Closed by commit rL338182: [docs] Clarify role of DIExpressions within debug intrinsics (authored by vedantk). Jul 27 2018, 5:34 PM

This revision was automatically updated to reflect the committed changes.

Revision Contents

Path	Size
llvm/trunk/docs/LangRef.rst	37 lines
llvm/trunk/docs/SourceLevelDebugging.rst	5 lines

Diff 157817

llvm/trunk/docs/LangRef.rst

.. code-block:: text
!2 = !DILocalVariable(name: "y", scope: !5, file: !2, line: 7, type: !3)		!2 = !DILocalVariable(name: "y", scope: !5, file: !2, line: 7, type: !3)

DIExpression		DIExpression
""""""""""""		""""""""""""

``DIExpression`` nodes represent expressions that are inspired by the DWARF		``DIExpression`` nodes represent expressions that are inspired by the DWARF
expression language. They are used in :ref:`debug intrinsics<dbg_intrinsics>`		expression language. They are used in :ref:`debug intrinsics<dbg_intrinsics>`
(such as ``llvm.dbg.declare`` and ``llvm.dbg.value``) to describe how the		(such as ``llvm.dbg.declare`` and ``llvm.dbg.value``) to describe how the
referenced LLVM variable relates to the source language variable.		referenced LLVM variable relates to the source language variable. Debug
		intrinsics are interpreted left-to-right: start by pushing the value/address
		operand of the intrinsic onto a stack, then repeatedly push and evaluate
		opcodes from the DIExpression until the final variable description is produced.

The current supported vocabulary is limited:		The current supported opcode vocabulary is limited:

- ``DW_OP_deref`` dereferences the top of the expression stack.		- ``DW_OP_deref`` dereferences the top of the expression stack.
- ``DW_OP_plus`` pops the last two entries from the expression stack, adds		- ``DW_OP_plus`` pops the last two entries from the expression stack, adds
them together and appends the result to the expression stack.		them together and appends the result to the expression stack.
- ``DW_OP_minus`` pops the last two entries from the expression stack, subtracts		- ``DW_OP_minus`` pops the last two entries from the expression stack, subtracts
the last entry from the second last entry and appends the result to the		the last entry from the second last entry and appends the result to the
expression stack.		expression stack.
- ``DW_OP_plus_uconst, 93`` adds ``93`` to the working expression.		- ``DW_OP_plus_uconst, 93`` adds ``93`` to the working expression.
- ``DW_OP_LLVM_fragment, 16, 8`` specifies the offset and size (``16`` and ``8``		- ``DW_OP_LLVM_fragment, 16, 8`` specifies the offset and size (``16`` and ``8``
here, respectively) of the variable fragment from the working expression. Note		here, respectively) of the variable fragment from the working expression. Note
that contrary to DW_OP_bit_piece, the offset is describing the location		that contrary to DW_OP_bit_piece, the offset is describing the location
within the described source variable.		within the described source variable.
- ``DW_OP_swap`` swaps top two stack entries.		- ``DW_OP_swap`` swaps top two stack entries.
- ``DW_OP_xderef`` provides extended dereference mechanism. The entry at the top		- ``DW_OP_xderef`` provides extended dereference mechanism. The entry at the top
of the stack is treated as an address. The second stack entry is treated as an		of the stack is treated as an address. The second stack entry is treated as an
address space identifier.		address space identifier.
- ``DW_OP_stack_value`` marks a constant value.		- ``DW_OP_stack_value`` marks a constant value.

DWARF specifies three kinds of simple location descriptions: Register, memory,		DWARF specifies three kinds of simple location descriptions: Register, memory,
and implicit location descriptions. Register and memory location descriptions		and implicit location descriptions. Note that a location description is
describe the location of a source variable (in the sense that a debugger might		defined over certain ranges of a program, i.e. the location of a variable may
modify its value), whereas implicit locations describe merely the value of a		change over the course of the program. Register and memory location
source variable. DIExpressions also follow this model: A DIExpression that		descriptions describe the concrete location of a source variable (in the
doesn't have a trailing ``DW_OP_stack_value`` will describe an address when		sense that a debugger might modify its value), whereas implicit locations
combined with a concrete location.		describe merely the actual value of a source variable which might not exist
		in registers or in memory (see ``DW_OP_stack_value``).

		A ``llvm.dbg.addr`` or ``llvm.dbg.declare`` intrinsic describes an indirect
		value (the address) of a source variable. The first operand of the intrinsic
		must be an address of some kind. A DIExpression attached to the intrinsic
		refines this address to produce a concrete location for the source variable.

		A ``llvm.dbg.value`` intrinsic describes the direct value of a source variable.
		The first operand of the intrinsic may be a direct or indirect value. A
		DIExpression attached to the intrinsic refines the first operand to produce a
		direct value. For example, if the first operand is an indirect value, it may be
		necessary to insert ``DW_OP_deref`` into the DIExpression in order to produce a
		valid debug intrinsic.

		.. note::

		A DIExpression is interpreted in the same way regardless of which kind of
		debug intrinsic it's attached to.

.. code-block:: text		.. code-block:: text

!0 = !DIExpression(DW_OP_deref)		!0 = !DIExpression(DW_OP_deref)
!1 = !DIExpression(DW_OP_plus_uconst, 3)		!1 = !DIExpression(DW_OP_plus_uconst, 3)
!1 = !DIExpression(DW_OP_constu, 3, DW_OP_plus)		!1 = !DIExpression(DW_OP_constu, 3, DW_OP_plus)
!2 = !DIExpression(DW_OP_bit_piece, 3, 7)		!2 = !DIExpression(DW_OP_bit_piece, 3, 7)
!3 = !DIExpression(DW_OP_deref, DW_OP_constu, 3, DW_OP_plus, DW_OP_LLVM_fragment, 3, 7)		!3 = !DIExpression(DW_OP_deref, DW_OP_constu, 3, DW_OP_plus, DW_OP_LLVM_fragment, 3, 7)
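As a rough illustration of the opcode descriptions in this diff, the sketch below models a few of them on a plain Python list used as the expression stack. The function name `run` and the sample inputs are assumptions made for this example; it follows only the wording of the bullet list and is not taken from LLVM's implementation. It shows that the two `!1` expressions are equivalent ways to add 3.

```python
# Toy model of a few opcodes, following the bullet list's wording only.
def run(stack, ops):
    """Apply the opcodes left to right to a copy of the stack and return it."""
    stack = list(stack)
    ops = list(ops)
    while ops:
        op = ops.pop(0)
        if op == "DW_OP_plus_uconst":
            # Adds the immediate operand to the entry on top of the stack.
            stack.append(stack.pop() + ops.pop(0))
        elif op == "DW_OP_constu":
            # Pushes the immediate operand.
            stack.append(ops.pop(0))
        elif op == "DW_OP_plus":
            top, second = stack.pop(), stack.pop()
            stack.append(second + top)
        elif op == "DW_OP_minus":
            # Subtracts the last entry from the second last entry.
            top, second = stack.pop(), stack.pop()
            stack.append(second - top)
        elif op == "DW_OP_swap":
            stack[-1], stack[-2] = stack[-2], stack[-1]
        else:
            raise ValueError(f"unsupported opcode: {op}")
    return stack


a = run([100], ["DW_OP_plus_uconst", 3])
b = run([100], ["DW_OP_constu", 3, "DW_OP_plus"])
print(a, b)  # [103] [103]
```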

llvm/trunk/docs/SourceLevelDebugging.rst

.. code-block:: llvm
void @llvm.dbg.value(metadata, metadata, metadata)		void @llvm.dbg.value(metadata, metadata, metadata)

This intrinsic provides information when a user source variable is set to a new		This intrinsic provides information when a user source variable is set to a new
value. The first argument is the new value (wrapped as metadata). The second		value. The first argument is the new value (wrapped as metadata). The second
argument is a `local variable <LangRef.html#dilocalvariable>`_ containing a		argument is a `local variable <LangRef.html#dilocalvariable>`_ containing a
description of the variable. The third argument is a `complex expression		description of the variable. The third argument is a `complex expression
<LangRef.html#diexpression>`_.		<LangRef.html#diexpression>`_.

		An `llvm.dbg.value` intrinsic describes the value of a source variable
		directly, not its address. Note that the value operand of this intrinsic may
		be indirect (i.e, a pointer to the source variable), provided that interpreting
		the complex expression derives the direct value.

Object lifetimes and scoping		Object lifetimes and scoping
============================		============================

In many languages, the local variables in functions can have their lifetimes or		In many languages, the local variables in functions can have their lifetimes or
scopes limited to a subset of a function. In the C family of languages, for		scopes limited to a subset of a function. In the C family of languages, for
example, variables are only live (readable and writable) within the source		example, variables are only live (readable and writable) within the source
block that they are defined in. In functional languages, values are only		block that they are defined in. In functional languages, values are only
readable after they have been defined. Though this is a very obvious concept,		readable after they have been defined. Though this is a very obvious concept,

[docs] Clarify role of DIExpressions within debug intrinsics (Closed, Public)