This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
docs/
1
LangRef.rst
-
include/llvm/
-
llvm/
-
BinaryFormat/
-
Dwarf.h
-
CodeGen/
-
AsmPrinter.h
1/2
DIE.h
-
DIEValue.def
-
MC/
-
MCStreamer.h
-
lib/
-
AsmParser/
1
LLParser.cpp
-
BinaryFormat/
-
Dwarf.cpp
-
CodeGen/AsmPrinter/
-
AsmPrinter/
-
AsmPrinterDwarf.cpp
-
ByteStreamer.h
6/14
DIE.cpp
-
DIEHash.cpp
-
DebugLocEntry.h
-
DwarfCompileUnit.h
2/2
DwarfCompileUnit.cpp
-
DwarfDebug.h
9/24
DwarfDebug.cpp
-
DwarfExpression.h
5/8
DwarfExpression.cpp
-
DwarfFile.h
-
DwarfUnit.cpp
-
IR/
-
AsmWriter.cpp
-
DebugInfoMetadata.cpp
-
MC/
-
MCStreamer.cpp
-
Target/BPF/MCTargetDesc/
-
BPF/
-
MCTargetDesc/
-
BPFAsmBackend.cpp
-
Transforms/Utils/
-
Utils/
3/11
Local.cpp
-
test/
-
Assembler/
-
diexpression.ll
-
Transforms/InstCombine/
-
InstCombine/
-
cast-set-preserve-signed-dbg-val.ll
-
unittests/Transforms/Utils/
-
Transforms/
-
Utils/
-
LocalTest.cpp

Differential D56587

Introduce DW_OP_LLVM_convert
ClosedPublic

Authored by markus on Jan 11 2019, 4:04 AM.

Download Raw Diff

Details

Reviewers

aprantl
vsk
dblaikie

Commits

rGb86ce219f4da: [DebugInfo] Introduce DW_OP_LLVM_convert
rL356451: [DebugInfo] Introduce DW_OP_LLVM_convert
rGcd8a940b37b2: [DebugInfo] Introduce DW_OP_LLVM_convert
rL356442: [DebugInfo] Introduce DW_OP_LLVM_convert

Summary

Introduce a DW_OP_LLVM_convert Dwarf expression pseudo op that allows for a convenient way to perform type conversions on the Dwarf expression stack. As an additional bonus it paves the way for using other Dwarf v5 ops that need to reference a base_type.

Use the new operation in llvm::replaceAllDbgUsesWith where it replaces a complex (and broken) shift and mask based expression to perform sign- and zero-extension.

Update the AsmPrinter parts to allow for referencing a base_type from Dwarf expressions.

Diff Detail

Event Timeline

markus created this revision.Jan 11 2019, 4:04 AM

bjope added a subscriber: bjope.Jan 11 2019, 4:58 AM

aprantl added inline comments.Jan 11 2019, 8:13 AM

lib/Transforms/Utils/Local.cpp
1863	nit: `.` at the end.
1864	I haven't had any coffee yet, but shouldn't that be `FromBits` and From ?: 00001110 >> 4-1 * ~0 << 4 \| 00001110 1 * ~0 << 4 \| 00001110 11111111 << 4 \| 00001110 11110000 \| 00001110 11111110

ii) It is probably not safe to assume that the consumer/debugger would set the high bits to zero in zero extension in e.g. the case that the variable has been spilled to memory.

Why? The consumer should have truncate the fragment to its DW_AT_size, right?

bjope added inline comments.Jan 11 2019, 8:30 AM

lib/Transforms/Utils/Local.cpp
1864	This method is replacing one dbg use with another. So "from" is the old value and "to" is the new value. Here we replace a old large value (e.g. i32) by a new smaller value (e.g. i16). So we sign extend from `ToBits` to `FromBits` to convert the new value back into something that represents the old value in the debugger. I think I needed both coffee and lunch to understand that we extend from `To` to `From` here.

Thanks for the patch.

t’s not obvious to me from skimming what the bug is with sign extension expressions. Could you describe what goes wrong, and maybe share a small program which shows the debugger behaving incorrectly?

In D56587#1354394, @vsk wrote:

Thanks for the patch.

t’s not obvious to me from skimming what the bug is with sign extension expressions. Could you describe what goes wrong, and maybe share a small program which shows the debugger behaving incorrectly?

For the signed case the old DIExpression calculated

(signbit * -1) | x

which always resulted in -1 if the sign bit was set and x if the sign bit was unset. So the problem was that we modified the low bits when doing the OR.

What we really want to do is to copy the sign bit into the extended bits, hence the addition of the DW_OP_shl to get

((signbit * -1)  << "number of bits to extend from") | x

aprantl added inline comments.Jan 11 2019, 10:21 AM

lib/Transforms/Utils/Local.cpp
1872	General question: Should we generate a DWARF 5 type conversion here and lower it to this sequence for DWARF 4 and lower in DwarfExpression.cpp to save memory?

dblaikie added a subscriber: dblaikie.Jan 13 2019, 6:17 PM

In D56587#1354394, @vsk wrote:

Thanks for the patch.

t’s not obvious to me from skimming what the bug is with sign extension expressions. Could you describe what goes wrong, and maybe share a small program which shows the debugger behaving incorrectly?

What @bjope said and as for an example consider the following on x86

#include <stdlib.h>
short foo(short x) {
  int y = x;
  return y;
}
int main(int argc, char **argv) {
  return foo(strtol(argv[1], NULL, 0));
}

Compiled in a somewhat contrived manner

clang dbg.c -O0 -g3 -S -emit-llvm
sed -i 's/optnone//' dbg.ll
opt -mem2reg -instcombine dbg.ll -S -o dbg.opt.ll
llc -O0 dbg.opt.ll -o dbg.s
gcc dbg.s -o dbg

gdb --args ./dbg 0xf000

(gdb) b foo
Breakpoint 1 at 0x400517: file dbg.c, line 4.
(gdb) r
Starting program: /repo/elavkje/llvm/my-dbg-test/dwarf-sext/dbg 0xf000

Breakpoint 1, foo (x=-4096) at dbg.c:4
4             return y;
(gdb) whatis x
type = short
(gdb) p y
$1 = -1
(gdb) whatis y
type = int
(gdb)

It is clear that y should not read as -1 at that point.

In D56587#1354312, @aprantl wrote:

ii) It is probably not safe to assume that the consumer/debugger would set the high bits to zero in zero extension in e.g. the case that the variable has been spilled to memory.

Why? The consumer should have truncate the fragment to its DW_AT_size, right?

But wouldn't the DW_AT_size at this point be for the wider type? If it was the case that the consumer could automatically handle the zext then I don't see why it would not also be able to handle the sext by itself.
I think the following example illustrates where it goes wrong. Again contrived as we have to do some tricks to get the consumer to read the value from memory.

typedef unsigned short T;
T foo(T d0, T d1, T d2, T d3, T d4, T d5, T d6, T d7, T d8, T x) {
    unsigned int y = x;
    return y;
}
int main(int argc, char **argv) {
    return foo(0, 1, 2, 3, 4, 5, 6, 7, 8, -1);
}

clang dbg.c -O0 -g3 -S -emit-llvm
sed -i 's/optnone//' dbg.ll
opt -mem2reg -instcombine dbg.ll -S -o dbg.opt.ll
llc -O3 dbg.opt.ll -o dbg.s

Now modify the assembly in the caller to make sure that there are non-zero bits on the stack next to the 'x' argument.
i.e. replace

pushq   $65535                  # imm = 0xFFFF

with

pushq   $-1                  # imm = 0xFFFF

and finally assemble and test

gcc dbg.s -o dbg
gdb ./dbg

(gdb) b foo
Breakpoint 1 at 0x4004c8: file dbg.c, line 4.
(gdb) run
Starting program: /repo/elavkje/llvm/my-dbg-test/dwarf-sext/dbg 

Breakpoint 1, foo (d0=0, d1=1, d2=2, d3=3, d4=4, d5=5, d6=6, d7=7, d8=8, x=65535) at dbg.c:4
4           return y;
(gdb) whatis x
type = T
(gdb) whatis T
type = unsigned short
(gdb) p y
$1 = 4294967295
(gdb) whatis y
type = unsigned int
(gdb)

and it should be clear that zero extension did not happen. Unless my thinking is flawed of course, and that would not be the first time :)

Unfortunately I think that there are additional complications in that our sext/zext computations will have to also depend on the endianness of the target and if the value is stored in memory or not (as having the value loaded as the 'wrong type' would require bytes to be swapped on a big endian target before it could be sext/zext by this expression).

Probably inserting a pseudo op here lowering it at a later stage when it is known if the value will reside in memory would be the right thing to do. Not sure if a DWARF 5 DW_OP_convert would be the easiest option as its argument references another DIE and it seems that would require larger infrastructure changes (but I really don't know anything about this).

In D56587#1355882, @markus wrote:

Probably inserting a pseudo op here lowering it at a later stage when it is known if the value will reside in memory would be the right thing to do. Not sure if a DWARF 5 DW_OP_convert would be the easiest option as its argument references another DIE and it seems that would require larger infrastructure changes (but I really don't know anything about this).

Referencing a DIE in DIExpression would need a bit of additional work. DIExpression stores its operands in an array of uint64_t, so in order to support MDNode operands we'd have to add them as actual MDNode operands. For example, expr_op_iterator could know which DIExpression operations take MDNode arguments and inject them in the right places. Slightly more complicated would be actually finding a matching DIType in the debug info that we want to point to. If we want to convert to exactly the type of the DIVariable, we can use that type, otherwise we might have to generate new DIBasicTypes on the fly.

Even though this sounds complicated, I still think that it is preferable to encode type conversions as such rather than generating really complicated expressions that happen to have the same effect in the underspecified DWARF 4 stack language.

In D56587#1356258, @aprantl wrote:

Referencing a DIE in DIExpression would need a bit of additional work. DIExpression stores its operands in an array of uint64_t, so in order to support MDNode operands we'd have to add them as actual MDNode operands. For example, expr_op_iterator could know which DIExpression operations take MDNode arguments and inject them in the right places. Slightly more complicated would be actually finding a matching DIType in the debug info that we want to point to. If we want to convert to exactly the type of the DIVariable, we can use that type, otherwise we might have to generate new DIBasicTypes on the fly.

Even though this sounds complicated, I still think that it is preferable to encode type conversions as such rather than generating really complicated expressions that happen to have the same effect in the underspecified DWARF 4 stack language.

Yes, long term it is likely the best solution.

I played a bit with trying to insert a DW_OP_convert before I came up with this patch but was clueless on how to retrieve the DIE offset of the DIType when the expression was emitted as that section (.debug_info?) hadn't been emitted yet. If I could get some useful advice on tackling that issue I can have another go at it when I get back to work tomorrow.

In D56587#1356420, @markus wrote:

In D56587#1356258, @aprantl wrote:

Referencing a DIE in DIExpression would need a bit of additional work. DIExpression stores its operands in an array of uint64_t, so in order to support MDNode operands we'd have to add them as actual MDNode operands. For example, expr_op_iterator could know which DIExpression operations take MDNode arguments and inject them in the right places. Slightly more complicated would be actually finding a matching DIType in the debug info that we want to point to. If we want to convert to exactly the type of the DIVariable, we can use that type, otherwise we might have to generate new DIBasicTypes on the fly.

Even though this sounds complicated, I still think that it is preferable to encode type conversions as such rather than generating really complicated expressions that happen to have the same effect in the underspecified DWARF 4 stack language.

Yes, long term it is likely the best solution.

I played a bit with trying to insert a DW_OP_convert before I came up with this patch but was clueless on how to retrieve the DIE offset of the DIType when the expression was emitted as that section (.debug_info?) hadn't been emitted yet. If I could get some useful advice on tackling that issue I can have another go at it when I get back to work tomorrow.

You wouldn't hardcode the offset in IR, you'd refer to the DIType as an MDNode reference and then teach the backend to resolve the reference, similar to how DIE references are resolved in the entire DIType class hierarchy.

In LLVM Assembly this could look like:

!1 = !DIBasicType(name: "short", ...)
!2 = !DIExpression(DW_OP_convert, !1)

but the actual implementation would be as I outlined in my previous reply.

In D56587#1356721, @aprantl wrote:
You wouldn't hardcode the offset in IR, you'd refer to the DIType as an MDNode reference and then teach the backend to resolve the reference, similar to how DIE references are resolved in the entire DIType class hierarchy.

In LLVM Assembly this could look like:
!1 = !DIBasicType(name: "short", ...)
!2 = !DIExpression(DW_OP_convert, !1)
but the actual implementation would be as I outlined in my previous reply.

Right, I got that part. I have a somewhat temporary solution in place for that which brings me to DwarfExpression::addExpression where I can pickup the corresponding DIBasicType when I see a dwarf::DW_OP_convert. Proceeding from that point seems much more difficult though since the .debug_info section has not been emitted yet and no DIE offsets are available. The code here expects me to emit directly into a ByteStreamer object.. Maybe it is possible to implement some parallel data structure inside DebugLocDwarfExpression to keep track of the convert ops and allow us to insert the DIE offset at a later point when it is available.

I didn't realize you were talking about DwarfExpression, sorry . I'm not really sure what the best solution is because I don't quite understand MCLabels that well. Naively I was assuming that we would emit an MCLabel that is defined as the difference between DIE offset and .debug_info start label. Since we need to emit DIE references all over .debug_info I would start by looking at how it's done there. The fact that it is in a different section may complicate things though, I'm not sure.

bjope added inline comments.Jan 15 2019, 2:22 PM

lib/Transforms/Utils/Local.cpp
1865	I guess this still is wrong, at least if we end up with a DWARF location description for a memory location. If for example the variable is 32-bits, and we want to describe it using a 16-bit value, then we will get something like this: call void @llvm.dbg.value(metadata i16 %value, metadata "!variable", metadata DIExpression(DW_OP_dup, DW_OP_constu, 15, DW_OP_shr, DW_OP_lit0, DW_OP_not, DW_OP_mul, DW_OP_constu, 16, DW_OP_shl, DW_OP_or, DW_OP_stack_value) In llc this will become a DBG_VALUE. The value could either end up referring to a 16-bit register, or it could refer to a 16-bit stack slot (e.g. if this is an input argument passed on the stack). In the latter case we typically end up prepending the DIExpression with DW_OP_fbreg and an offset. The memory location will point to the 16-bit value. But we do not really express that the debugger should read a 16-bit value here, right? The debugger will only see that the variable is 32-bits, so it will read 32-bits, right? For a little endian target we will get garbage in bits 16-31 (since we read outside the 16-bit stack slot). For a big endian target we will get the wanted value in bits 16-31 and garbage in bits 0-15. Either way, the result would be wrong. For little endian we would need to clear bit 16-31 before the OR with the sign-extension mask. For big endian we aren't even operating on the correct bits. I'm not really sure what happens if the debugger finds a 16-bit register location for the 32-bit variable. Do we know that it only us reading 16-bits to the value stack? One solution could be to use DW_OP_deref_size when reading from memory, to specify that we only want to read 16 bits. I'm not sure exactly how DwarfExpression could know when this is needed. I guess we can not add the DW_OP_deref_size already here, because it would be wrong in case of ending up with a register location. But maybe we still need to do something more also for the register location scenario when using this approach. An alternative solution is to describe the variable using two dbg.value intrinsics. One using a fragment for bits 0-15, and another one using a fragment expression for bits 16-31. I guess it would look something like this: call void @llvm.dbg.value(metadata i16 %value, metadata "!variable", metadata DIExpression(DW_OP_LLVM_fragment 0, 16) call void @llvm.dbg.value(metadata i16 %value, metadata "!variable", metadata DIExpression(DW_OP_constu, 15, DW_OP_shr, DW_OP_lit0, DW_OP_not, DW_OP_mul, DW_OP_LLVM_fragment 16, 16) I've seen the discussion about DW_OP_convert. Would DW_OP_convert help in telling the debugger that any derefs should be 16 bits in this case. Then I guess that still would be good for DWARF5. Similar problem as described above also exists for the zext case below. At least for big endian when dereferencing memory, since we get the wrong value in the least significant bits when reading 32 bits from a 16-bit stack slot.

Uploading a prototype implementation using the DW_OP_LLVM_fragment as @bjope described. There are still some issues that need to be worked out with this so it is not final but rather I would like some input on if going down this track would be considered an acceptable solution. If there are clear objections to this approach it would be good to hear them early.

aprantl added inline comments.Jan 17 2019, 8:58 AM

lib/Transforms/Utils/Local.cpp
1854	I think we should make a public helper function to emit z/sext operations for a DIExpression. Either as a member of DIExpression or as a freestanding function in Local.h. I'm sure this will come in handy elsewhere.

With the fragment approach we still have problems in e.g. lib/CodeGen/PrologEpilogInserter.cpp:1105 where the stack location of a function argument is prepended to the expression and a sized DW_OP_deref would need to be inserted.

While the above is likely solvable I think it is a fair question to ask if would not be cleaner to simply go for a pseudo op approach such as https://reviews.llvm.org/D57010 ?

Insert DW_OP_deref_size in Prologue Epilogue Inserter.

markus marked an inline comment as done.Jan 28 2019, 7:39 AM

markus added inline comments.

lib/Transforms/Utils/Local.cpp
1865	Unfortunately I run into problems when I try having arbitrarily sized fragments ( https://bugs.llvm.org/show_bug.cgi?id=40462 ) so I have to make them byte sized here which produces quite a lot of them ...

Time to switch the approach. This time try adding support for typed Dwarf5 stack ops such as DW_OP_convert.

As an example in IR a sext would be represented as (where the DW_OP_convert arguments are indices into dbg.dwarf5.type.tbl):

call void @llvm.dbg.value(metadata i8 %x, metadata !15, metadata !DIExpression(DW_OP_convert, 0, DW_OP_convert, 1, DW_OP_stack_value)), !dbg !17
!dbg.dwarf5.type.tbl = !{!7, !8}
!7 = !DIBasicType(name: "s8", size: 8, encoding: DW_ATE_signed)
!8 = !DIBasicType(name: "s32", size: 32, encoding: DW_ATE_signed)

In assembly (x86) we get a labeled base_type entries in .debug_info

.Ltype_tbl0:                                                                                                                                                                                                       
        .byte   2                       # Abbrev [2] 0x2a:0x7 DW_TAG_base_type                                                                                                                                     
        .long   .Linfo_string3          # DW_AT_name                                                                                                                                                               
        .byte   5                       # DW_AT_encoding                                                                                                                                                           
        .byte   1                       # DW_AT_byte_size                                                                                                                                                          
.Ltype_tbl1:                                                                                                                                                                                                       
        .byte   2                       # Abbrev [2] 0x31:0x7 DW_TAG_base_type                                                                                                                                     
        .long   .Linfo_string4          # DW_AT_name                                                                                                                                                               
        .byte   5                       # DW_AT_encoding                                                                                                                                                           
        .byte   4                       # DW_AT_byte_size                                                                                                                                                          
        .byte   3                       # Abbrev [3] 0x38:0x38 DW_TAG_subprogram

that are referenced from the debug expression as in

.byte   168                     # DW_OP_convert
.uleb128 .Ltype_tbl0-.Lcu_begin0

As I see it the most significant benefit of this approach is that it would open up for using other typed DW5 ops as well.

Feedback please :)

Generally, I much prefer using DW_OP_convert because it's more space efficient and semantically unambiguous.

I have a couple of unsorted ideas:

call void @llvm.dbg.value(metadata i8 %x, metadata !15, metadata !DIExpression(DW_OP_convert, 0, DW_OP_convert, 1, DW_OP_stack_value)),

the type reference should be a metadata operand. Otherwise you'll need to implement support in llvm-link etc, too.
I would just implement the type reference as normal metadata operands in DIExpression and bake the knowledge that DW_OP_convert consumes one of the metadata operands into the DIExpression operand iterator.

MDNodes are uniqued, so just creating a new DIBasicType using a throwaway DIBuilder is cheap. That said, we need to find the types references by DIExpressions so they can be emitted in the .debug_info section.

On the other extreme, the only thing we really need from the basic types used by DW_OP_convert is the size and signedness. We could just encode that directly in the expression as DW_OP_LLVM_convert, 1, 32 for a signed int32_t or something like that. That doesn't solve the problem for how to determine which types we need to emit into the debug info, but it would be a very straightforward self-contained encoding up until AsmPrinter.

the type reference should be a metadata operand. Otherwise you'll need to implement support in llvm-link etc, too.
I would just implement the type reference as normal metadata operands in DIExpression and bake the knowledge that DW_OP_convert consumes one of the metadata operands into the DIExpression operand iterator.

That makes sense. I am working on that right now.

On the other extreme, the only thing we really need from the basic types used by DW_OP_convert is the size and signedness. We could just encode that directly in the expression as DW_OP_LLVM_convert, 1, 32 for a signed int32_t or something like that. That doesn't solve the problem for how to determine which types we need to emit into the debug info, but it would be a very straightforward self-contained encoding up until AsmPrinter.

I'd much prefer a generic solution that makes it easy (at least wrt this) to use other typed DW5 ops as well down the road.

In D56587#1383048, @markus wrote:

the type reference should be a metadata operand. Otherwise you'll need to implement support in llvm-link etc, too.
I would just implement the type reference as normal metadata operands in DIExpression and bake the knowledge that DW_OP_convert consumes one of the metadata operands into the DIExpression operand iterator.

That makes sense. I am working on that right now.

If you choose this path, one way to solve the issue of what to do with the dangling types that are produced by the conversion operations would be to create them in a separate DICompileUnit with a compiler-generated name and location. This way the types should get uniqued automatically during LTO, and you don't need to worry about llvm-link & friends.

On the other extreme, the only thing we really need from the basic types used by DW_OP_convert is the size and signedness. We could just encode that directly in the expression as DW_OP_LLVM_convert, 1, 32 for a signed int32_t or something like that. That doesn't solve the problem for how to determine which types we need to emit into the debug info, but it would be a very straightforward self-contained encoding up until AsmPrinter.

I'd much prefer a generic solution that makes it easy (at least wrt this) to use other typed DW5 ops as well down the road.

What other DWARF5 operations do you have in mind that would need pointers into the debug type hierarchy?

If you choose this path, one way to solve the issue of what to do with the dangling types that are produced by the conversion operations would be to create them in a separate DICompileUnit with a compiler-generated name and location. This way the types should get uniqued automatically during LTO, and you don't need to worry about llvm-link & friends.

Not sure that I understand this comment right now. Perhaps I will run into the issue you describe later on but for now my mind is not there just yet :)

What other DWARF5 operations do you have in mind that would need pointers into the debug type hierarchy?

For example I imagine that DW_OP_regval_type, DW_OP_deref_type and DW_OP_const_type could come in handy to set the type of the expression stack. The "each element
of the stack is the size of an address on the target machine" of Dwarf4 is a rather annoying limitation for our target.

Dwarf5 DW_OP_convert support.

- Extend DIExpression to accept MDNode (really DIBasicType) operands.
Since we are keeping references to the MDNodes outside the "Metadata
machinery" we need to be careful and use TrackingMDNodeRef to support
any RAUW operations that might happen during e.g. IR import of a
forward-reference.

- Modify replaceAllDbgUsesWith to generate dwarf::DW_OP_convert
operations. These ops need to reference a DIBasicType and hence uses the
support from the previous bullet point.

- Update AsmPrinter to allow DIExpressions (emitted in .debug_loc) to
reference DIBasicTypes (emitted in .debug_info). Various MCSymbols are
emitted to realize the references.

Not at all ready but as it looks right now would these changes be considered to be heading in the right direction?

markus marked an inline comment as done.Feb 5 2019, 7:23 AM

markus added inline comments.

lib/Transforms/Utils/Local.cpp
1865	Why is this the case? I was expecting that since I use TrackingMDRef the nodes would not be considered to have zero uses and as a result be deleted until the trackers let go.

bjope added inline comments.Feb 5 2019, 8:27 AM

lib/Transforms/Utils/Local.cpp
1870	I still have trouble to understand how/if helps the debugger (or llc) to know how many bits to dereference. I assume that llc needs to prepend a DW_OP_deref_type in case of a memory location, or DW_OP_regval_type in case of a register location to get the correct type on the expression stack for the original value. And then we only need one DW_OP_convert to convert to the final type. In case we think that it is ok to dereference more bits than `ToBits` (i.e. the smaller size), then we need to adjust the address to take care of endianess somewhere. If we go down this road, then I think we need some hack to get DW_OP_deref_type/DW_OP_regval_type in place first. Or what do you think?

In D56587#1385106, @markus wrote:

If you choose this path, one way to solve the issue of what to do with the dangling types that are produced by the conversion operations would be to create them in a separate DICompileUnit with a compiler-generated name and location. This way the types should get uniqued automatically during LTO, and you don't need to worry about llvm-link & friends.

Not sure that I understand this comment right now. Perhaps I will run into the issue you describe later on but for now my mind is not there just yet :)

My mistake, I mistakenly thought that DIBasicTypes would be owned by a DICompileUnit, but that is not the case, they are freestanding.

What other DWARF5 operations do you have in mind that would need pointers into the debug type hierarchy?

For example I imagine that DW_OP_regval_type, DW_OP_deref_type and DW_OP_const_type could come in handy to set the type of the expression stack. The "each element
of the stack is the size of an address on the target machine" of Dwarf4 is a rather annoying limitation for our target.

All three of these operations expect a DW_TAG_basic_type as argument. So we could easily introduce a DW_OP_LLVM_basic_type <signedness> <bits> operation that gets expanded in AsmPrinter into DW_TAG_basic_types like you do now and thus avoid having to deal with TrackingMDRefs. This would also reduce the memory footprint of all DIExpressions that don't need a type argument.

markus marked an inline comment as done.Feb 6 2019, 1:06 AM

markus added inline comments.

lib/Transforms/Utils/Local.cpp
1870	I still have trouble to understand how/if helps the debugger (or llc) to know how many bits to dereference. Ok, lets try to clarify that then so that we all can agree here. I assume that llc needs to prepend a DW_OP_deref_type in case of a memory location, or DW_OP_regval_type in case of a register location to get the correct type on the expression stack for the original value. And then we only need one DW_OP_convert to convert to the final type. Yes, down the road that would be ideal as I see it. In the meantime we could do with using two DW_OP_convert ops in sequence as in this patch. In case we think that it is ok to dereference more bits than ToBits (i.e. the smaller size), then we need to adjust the address to take care of endianess somewhere. I think that we immediately need to start using DW_OP_deref_size instead of DW_OP_deref to cover the endianess effects, but this is really a separate bug/issue. Modifying the address seems a less desirable way to achieve this. If we go down this road, then I think we need some hack to get DW_OP_deref_type/DW_OP_regval_type in place first. Or what do you think? I think that we can start using two DW_OP_convert in sequence and then treat the DW_OP_deref_size as a separate issue and finally DW_OP_deref_type and DW_OP_regval_type as long term goals. Makes sense?

Dropping all the metadata stuff and specifying the bit size and type encoding directly in the expression vector. e.g.

!DIExpression(DW_OP_convert, 8, 5, DW_OP_convert, 32, 5, DW_OP_stack_value)

This approach certainly has its advantages and require far less changes to existing code.

All three of these operations expect a DW_TAG_basic_type as argument. So we could easily introduce a DW_OP_LLVM_basic_type <signedness> <bits> operation that gets expanded in AsmPrinter into DW_TAG_basic_types like you do now and thus avoid having to deal with TrackingMDRefs. This would also reduce the memory footprint of all DIExpressions that don't need a type argument.

Yep, lets do that but I actually think we can do without a DW_OP_LLVM_basic_type and encode it directly into the expression vector as I have done in the new patch.

Thanks, I appreciate your perseverance even though we make you go through so many revisions :-)

Since we are not using the DWARF semantics by having an extra bitsize and encoding parameter, I would recommend to use a new enumerator DW_OP_LLVM_convert to avoid confusion. This is also why we use DW_OP_LLVM_fragment instead of DW_OP_bit_piece. We should then also document the new operator in the LangRef.rst or SourceLevelDebugging.rst (wherever DW_OP_LLVM_fragment is documented). Bonus points (but really not required!) for pretty-printing the encoding as DW_ATE_encoding enumerator in LLVM assembly.

Updated to DW_OP_LLVM_convert as suggested, did the pretty printing and updated a bunch of llvm-lit tests to cope with the newly generated labels. Now the IR format looks like this:

!DIExpression(DW_OP_LLVM_convert, 16, DW_ATE_signed, DW_OP_LLVM_convert, 32, DW_ATE_signed, DW_OP_stack_value)

There are some open issues that I would like feedback on. I will add inline comments at those locations.

markus marked 4 inline comments as done.Feb 7 2019, 6:30 AM

markus added inline comments.

lib/CodeGen/AsmPrinter/DwarfDebug.cpp
948	If there were multiple Dwarf CUs in the same LLVM Module this would not work right. We need to emit base types for each DwarfCompileUnit but only those types that are used by DwarfExpressions in that unit. So appears to make sense to put the `MarkusNodes` inside the CU.
2248	For DWO how do we find the label into the corresponding `.debug_info` and how do we emit the base types? Would the code in DwarfDebug.cpp:946 work? I guess that I need to create some test cases for DWO.
lib/CodeGen/AsmPrinter/DwarfExpression.cpp
29	I think that we need one of these per DwarfCompileUnit.
394	A `DwarfExpression` should always know which CU it belongs to right?

aprantl added inline comments.Feb 7 2019, 1:37 PM

docs/LangRef.rst
4672	Thanks, this looks good!
lib/AsmParser/LLParser.cpp
4845	Can you also add a round-trip test to `test/Assembler/diexpression.ll`?
lib/CodeGen/AsmPrinter/DwarfDebug.cpp
948	Would creating a separate CU just for our basic types help in any way?

bjope added inline comments.Feb 7 2019, 2:52 PM

lib/Transforms/Utils/Local.cpp
1870	I think that we can start using two DW_OP_convert in sequence and then treat the DW_OP_deref_size as a separate issue and finally DW_OP_deref_type and DW_OP_regval_type as long term goals. Makes sense? Makes a little sense at least. I guess it all depends on if this is supposed to be "complete" or just a partial solution. If we go for the latter then you need to update the description (and probably also add some code comments here mentioning that this still gives faulty result in certain situations). Extra credits for adding test cases that show that we still do wrong in some situations. At least it seems like we agree that for a "complete" solution we need to express how many bits that should be dereferenced instead of the first DW_OP_convert. With this patch we still present wrong values in the debugger sometimes (at least for big endian platforms, right?).

aprantl added inline comments.Feb 7 2019, 3:21 PM

include/llvm/CodeGen/DIE.h
719	It seems excessive to burden every DIE object with an extra 8 bytes for something we'll only need on a handful of basic type DIEs. Would it be possible to store this in a DenseMap on the side instead?

How should DW_OP_convert be handled when targeting DWARF versions earlier than 5? There is the GNU extension DW_OP_GNU_convert, which GDB seems to have had support for since 2011. The operation seems to be the identical to the final version that got into DWARFv5, so LLDB should be able to handle the two variants transparently. Can we emit that GNU extension (under some limitations)?

In D56587#1390564, @dstenb wrote:

How should DW_OP_convert be handled when targeting DWARF versions earlier than 5? There is the GNU extension DW_OP_GNU_convert, which GDB seems to have had support for since 2011. The operation seems to be the identical to the final version that got into DWARFv5, so LLDB should be able to handle the two variants transparently. Can we emit that GNU extension (under some limitations)?

That still leaves the question of what to do when we're not going to emit a convert operator. I don't object to DW_OP_GNU_convert in principle, but I do object to requiring all debuggers to support it.

In D56587#1390564, @dstenb wrote:

How should DW_OP_convert be handled when targeting DWARF versions earlier than 5? There is the GNU extension DW_OP_GNU_convert, which GDB seems to have had support for since 2011. The operation seems to be the identical to the final version that got into DWARFv5, so LLDB should be able to handle the two variants transparently. Can we emit that GNU extension (under some limitations)?

That seems fine for debugger-tuning=gdb (and eventually lldb once we implemented support for it). Is emitting the byzantine shift/mask expression that this review started with in all other cases an option for a reasonably large subset of the uses?

In D56587#1391435, @aprantl wrote:

In D56587#1390564, @dstenb wrote:

How should DW_OP_convert be handled when targeting DWARF versions earlier than 5? There is the GNU extension DW_OP_GNU_convert, which GDB seems to have had support for since 2011. The operation seems to be the identical to the final version that got into DWARFv5, so LLDB should be able to handle the two variants transparently. Can we emit that GNU extension (under some limitations)?

That seems fine for debugger-tuning=gdb (and eventually lldb once we implemented support for it). Is emitting the byzantine shift/mask expression that this review started with in all other cases an option for a reasonably large subset of the uses?

If we have something nice for GDB and LLDB, and Paul's OK with one of those for Sony's stuff - then who would we be maintaining this extra code for? I'd be OK saying that backwards/sidewards/unspecified compatibility might not be worth it? If someone comes with a need, it could be resurrected/implemented - and until then, we could emit no location at all, potentially.

Addresses most of the open issues I had. The major question that remains is what to do when not targeting Dwarf v5.

Herald added a subscriber: jholewinski. · View Herald TranscriptFeb 11 2019, 3:33 AM

markus marked an inline comment as done.Feb 11 2019, 3:42 AM

markus added inline comments.

include/llvm/CodeGen/DIE.h
719	I agree and it does seem like the most logical place for such a DenseMap would be inside the CU. It is not clear however how to make that available to `AsmPrinter::emitDwarfDIE` without violating the current interfaces.

In D56587#1391461, @dblaikie wrote:

In D56587#1391435, @aprantl wrote:

In D56587#1390564, @dstenb wrote:

How should DW_OP_convert be handled when targeting DWARF versions earlier than 5? There is the GNU extension DW_OP_GNU_convert, which GDB seems to have had support for since 2011. The operation seems to be the identical to the final version that got into DWARFv5, so LLDB should be able to handle the two variants transparently. Can we emit that GNU extension (under some limitations)?

That seems fine for debugger-tuning=gdb (and eventually lldb once we implemented support for it). Is emitting the byzantine shift/mask expression that this review started with in all other cases an option for a reasonably large subset of the uses?

If we have something nice for GDB and LLDB, and Paul's OK with one of those for Sony's stuff - then who would we be maintaining this extra code for? I'd be OK saying that backwards/sidewards/unspecified compatibility might not be worth it? If someone comes with a need, it could be resurrected/implemented - and until then, we could emit no location at all, potentially.

IIUC what we have are:

v5: Everybody is okay with DW_OP_convert.
v4 GDB: clearly DW_OP_GNU_convert is the way to go.
v4 Sony: do the complicated thing.
v4 LLDB: either teach it about DW_OP_GNU_convert or do the complicated thing like Sony.

I didn't see any not-the-complicated-thing proposal for cases that don't/can't/won't know about DW_OP_GNU_convert?

In D56587#1393445, @probinson wrote:

In D56587#1391461, @dblaikie wrote:

In D56587#1391435, @aprantl wrote:

In D56587#1390564, @dstenb wrote:

How should DW_OP_convert be handled when targeting DWARF versions earlier than 5? There is the GNU extension DW_OP_GNU_convert, which GDB seems to have had support for since 2011. The operation seems to be the identical to the final version that got into DWARFv5, so LLDB should be able to handle the two variants transparently. Can we emit that GNU extension (under some limitations)?

That seems fine for debugger-tuning=gdb (and eventually lldb once we implemented support for it). Is emitting the byzantine shift/mask expression that this review started with in all other cases an option for a reasonably large subset of the uses?

If we have something nice for GDB and LLDB, and Paul's OK with one of those for Sony's stuff - then who would we be maintaining this extra code for? I'd be OK saying that backwards/sidewards/unspecified compatibility might not be worth it? If someone comes with a need, it could be resurrected/implemented - and until then, we could emit no location at all, potentially.

IIUC what we have are:

v5: Everybody is okay with DW_OP_convert.

v4 GDB: clearly DW_OP_GNU_convert is the way to go.

v4 Sony: do the complicated thing.

v4 LLDB: either teach it about DW_OP_GNU_convert or do the complicated thing like Sony.

I didn't see any not-the-complicated-thing proposal for cases that don't/can't/won't know about DW_OP_GNU_convert?

I don't think anyone mentioned it, that's why I was bringing it up - there's always an option to not render any location if it's not possible/worth the work. That's all I was asking - is it worth the complexity? (I wasn't sure anyone needed it - but sounds like Sony does, reckon it's worth the tradeoff in complexity in LLVM compared to the work required to support this in the Sony debugger?)

In D56587#1393556, @dblaikie wrote:

I don't think anyone mentioned it, that's why I was bringing it up - there's always an option to not render any location if it's not possible/worth the work. That's all I was asking - is it worth the complexity? (I wasn't sure anyone needed it - but sounds like Sony does, reckon it's worth the tradeoff in complexity in LLVM compared to the work required to support this in the Sony debugger?)

NVPTX also would need it, because they are stuck on DWARF v2.

In D56587#1393746, @probinson wrote:

In D56587#1393556, @dblaikie wrote:

I don't think anyone mentioned it, that's why I was bringing it up - there's always an option to not render any location if it's not possible/worth the work. That's all I was asking - is it worth the complexity? (I wasn't sure anyone needed it - but sounds like Sony does, reckon it's worth the tradeoff in complexity in LLVM compared to the work required to support this in the Sony debugger?)

NVPTX also would need it, because they are stuck on DWARF v2.

Any ideas if NVPTX hit this case? my understanding was that NVPTX has a fairly restrictive set of code or actions that can be used.

Removed Label from DIE (added DenseMap to CU and pass it around). Rebased to top of trunk.

Herald added a subscriber: jdoerfert. · View Herald TranscriptFeb 12 2019, 2:47 AM

aprantl added inline comments.Feb 12 2019, 9:21 AM

lib/CodeGen/AsmPrinter/DwarfDebug.cpp
1938	This is to inject the reference to the basic type die, right? This code should probably be factored out into a relocate/fixupTypeRefs() helper function. I also assume that you need to apply the same fixup for the case of a single, non-debug_lo, inline DW_AT_location, right? The fact that the placeholder is encoded as a LEB128 sounds really dangerous. If we ever support any branching operations, it will mess with the offsets. Can we assume that the finalized DIE ref will always be a DW_OP_ref_addr or something with a fixed size? Could we make the placeholder the same fixed size, too? If that doesn't work, the right solution is probably to defer the emission of DwarfExpressions until here, which we could do in a separate, preparatory commit.

markus marked an inline comment as done.Feb 12 2019, 11:49 AM

markus added inline comments.

lib/CodeGen/AsmPrinter/DwarfDebug.cpp
1938	This is to inject the reference to the basic type die, right? Yes This code should probably be factored out into a relocate/fixupTypeRefs() helper function. I also assume that you need to apply the same fixup for the case of a single, non-debug_lo, inline DW_AT_location, right? Sounds reasonable. Not sure I could easily find where in the code the inline expressions are inserted though. If you could point at a file and line number that would be helpful. I guess another option would be to force these expressions (the ones containing a base type reference) to always end up in .debug_loc right? The fact that the placeholder is encoded as a LEB128 sounds really dangerous. If we ever support any branching operations, it will mess with the offsets. Can we assume that the finalized DIE ref will always be a DW_OP_ref_addr or something with a fixed size? Could we make the placeholder the same fixed size, too? The spec states that the finalized base type DIE offset is encoded as a ULEB128 so not much choice about that but the value we pick up here (the one inserted in `DwarfExpression::addExpression`) is just a index so we could certainly encode that in a fixed size integer. If branches were to be introduced at a later point I imagine that the branch target in the emitted dwarf would be a label (`MCSymbol`) but in the intermediate expression vector a simple offset would probably suffice. If that doesn't work, the right solution is probably to defer the emission of DwarfExpressions until here, which we could do in a separate, preparatory commit. I think that would be a good thing to do but unfortunately it seems far from easy to get rid of the stuff that is in between.

aprantl added inline comments.Feb 12 2019, 12:46 PM

lib/CodeGen/AsmPrinter/DwarfDebug.cpp
1938	If you could point at a file and line number that would be helpful. git grep addBlock.DW_AT_location lib/CodeGen/AsmPrinter lib/CodeGen/AsmPrinter/DwarfCompileUnit.cpp: addBlock(VariableDIE, dwarf::DW_AT_location, DwarfExpr->finalize()); lib/CodeGen/AsmPrinter/DwarfCompileUnit.cpp: addBlock(VariableDie, dwarf::DW_AT_location, DwarfExpr.finalize()); lib/CodeGen/AsmPrinter/DwarfCompileUnit.cpp: addBlock(VariableDie, dwarf::DW_AT_location, DwarfExpr.finalize()); lib/CodeGen/AsmPrinter/DwarfUnit.cpp: addBlock(ParamDIE, dwarf::DW_AT_location, Loc); The spec states that the finalized base type DIE offset is encoded as a ULEB128 so not much choice about that but the value we pick up here (the one inserted in DwarfExpression::addExpression) is just a index so we could certainly encode that in a fixed size integer. I see. I didn't think about emitting branch targets a label differences, I thought we'd just hardcode the offsets. I guess we can defer this until it becomes an issue. Encoding the temporary die reference with a fixed size would probably still be a good idea, just to keep this code simpler.

probinson added inline comments.Feb 12 2019, 1:02 PM

lib/CodeGen/AsmPrinter/DwarfDebug.cpp
1938	Re the ULEB as how to find the base_type DIEs, the unstated assumption is that the base_type DIEs would be emitted unconditionally at the top of the CU, so everyone can just use them as needed. If you want to emit base_types lazily... don't do that. Re branches, there are already branch operators, if that's what you're talking about (DW_OP_skip and _bra).

dblaikie added inline comments.Feb 12 2019, 1:36 PM

lib/CodeGen/AsmPrinter/DwarfDebug.cpp
1938	I think maybe you don't need to make that assumption - you can produce a label difference as a uleb (though that may get complicated). Yeah, we use it in debug_rnglists, for instance: .uleb128 .Lfunc_end0-.Lfunc_begin0 # length & even if I change that to be a label difference between labels that bound the uleb itself (ie: where the difference would vary depending on the number of bytes required for the uleb) clang still assembles it at least... :)

markus marked 2 inline comments as done.Feb 13 2019, 12:29 AM

markus added inline comments.

lib/CodeGen/AsmPrinter/DwarfDebug.cpp
1938	Re the ULEB as how to find the base_type DIEs, the unstated assumption is that the base_type DIEs would be emitted unconditionally at the top of the CU, so everyone can just use them as needed. If you want to emit base_types lazily... don't do that. I played with emitting the base_type DIEs directly after the CU header but since the size of that header varies and due to the phase ordering of how the debug info is emitted I still need label differences to be able to locate the base_type DIEs in a robust manner. Right now I would not say that they are emitted lazily but rather we find out which ones we need and then emit them in table form. Still need the labels though. Re branches, there are already branch operators, if that's what you're talking about (DW_OP_skip and _bra). Where do you see these branch operators being used? I can't find them.
1938	I think maybe you don't need to make that assumption - you can produce a label difference as a uleb (though that may get complicated). Yes, that is what I currently do. It looks like this .byte 168 # DW_OP_convert .uleb128 .Lbase_type0-.Lcu_begin0 & even if I change that to be a label difference between labels that bound the uleb itself (ie: where the difference would vary depending on the number of bytes required for the uleb) clang still assembles it at least... :) Yep, I thought about that too when I realized that I need to add some prototype support for our downstream assembler. Came to the conclusion that I could treat the ULEB128s as fixed size by sign/zero extending them, so that should simplify things a lot even though it is not space efficient... Either way it is not a problem for this review how the assembler solves it :)

+ @ABataev re the question whether NVPTX runs into the situation described in this review.

The Sony debugger guys are okay with using the GCC operator in a pre-v5 expression. So, tentatively, for all debugger tunings, we can emit that instead of the more complicated expression. That way we are emitting compliant expressions, and the info doesn't just disappear sometimes (a much worse outcome IMO). The only remaining question is my hypothetical about NVPTX.

Re branch operators, I thought Adrian was throwing that out there as a general concern; yes branch operators exist, and yes we don't use them currently. As David says, the assembler knows how to convert a label difference into a ULEB and it will all Just Work. If/when we ever need it to.

I see now that the inlined places i.e. these:

git grep addBlock.*DW_AT_location lib/CodeGen/AsmPrinter
lib/CodeGen/AsmPrinter/DwarfCompileUnit.cpp:    addBlock(*VariableDIE, dwarf::DW_AT_location, DwarfExpr->finalize());
lib/CodeGen/AsmPrinter/DwarfCompileUnit.cpp:        addBlock(*VariableDie, dwarf::DW_AT_location, DwarfExpr.finalize());
lib/CodeGen/AsmPrinter/DwarfCompileUnit.cpp:  addBlock(*VariableDie, dwarf::DW_AT_location, DwarfExpr.finalize());
lib/CodeGen/AsmPrinter/DwarfUnit.cpp:        addBlock(ParamDIE, dwarf::DW_AT_location, Loc);

work quite differently and that will need some additional work.

The good news is that here it seems we might just have the infrastructure (DIEValue etc ) to realize the base_type reference in a much cleaner way.

In D56587#1396164, @probinson wrote:

+ @ABataev re the question whether NVPTX runs into the situation described in this review.

The Sony debugger guys are okay with using the GCC operator in a pre-v5 expression. So, tentatively, for all debugger tunings, we can emit that instead of the more complicated expression. That way we are emitting compliant expressions, and the info doesn't just disappear sometimes (a much worse outcome IMO). The only remaining question is my hypothetical about NVPTX.

Re branch operators, I thought Adrian was throwing that out there as a general concern; yes branch operators exist, and yes we don't use them currently. As David says, the assembler knows how to convert a label difference into a ULEB and it will all Just Work. If/when we ever need it to.

NVPTX supports only DWARF2 and does not know anything about DWARF5 operations. Also, it does not support any type of the expression in the DWARF sections, except for <section_name>+-<int_offset>.

NVPTX supports only DWARF2 and does not know anything about DWARF5 operations. Also, it does not support any type of the expression in the DWARF sections, except for <section_name>+-<int_offset>.

Isn't the latter a rather restrictive limitation that should be addressed in the NVPTX assembler?

In D56587#1396249, @markus wrote:

NVPTX supports only DWARF2 and does not know anything about DWARF5 operations. Also, it does not support any type of the expression in the DWARF sections, except for <section_name>+-<int_offset>.

Isn't the latter a rather restrictive limitation that should be addressed in the NVPTX assembler?

I agree, but this the limitation we have to live with, unfortunately. Either we support that limitation to have the debug info for NVPTX, or we don't have debug info for NVPTX. I cannot make NVidia to update their ptxas tool to support complex expressions in DWARF sections.

Addressed the inlined location expressions.

Got rid of all the label differences introduced in previous patches and instead use padded ULEB128s of a fixed size (4 bytes).

Using fixed size allows us to leverage the existing framework much better and avoid spraying symbols all over to place just to be able to cope with the variable sized .uleb128 directives. One drawback is that debug output is perhaps slightly less space efficient but that is only a matter of a few bytes each time a base_type is referenced from a DW_OP_convert.

Comments please (I know that I need to add lit tests and clean up comments).

Did some cleanup
Added tests
Emit DW_OP_convert for Dwarf5
Emit legacy shift and mask expression for Dwarf4 and lower
Added llc option -generate-typed-dwarf5-expr to force emission regardless of targeted Dwarf version

Herald added a subscriber: eraman. · View Herald TranscriptFeb 18 2019, 2:35 AM

aprantl added inline comments.Feb 18 2019, 10:12 AM

lib/CodeGen/AsmPrinter/DIE.cpp
512	under the current style you may drop the duplicate doxygen comments con the function implementations.
516	what happens when this assertion isn't met in a release compiler? Is that a purely hypothetical scenario?
lib/CodeGen/AsmPrinter/DwarfCompileUnit.cpp
1197	`for (auto &I : reverse(ExprRefedBaseTypes))`
lib/CodeGen/AsmPrinter/DwarfExpression.cpp
396	Why not use a SmallDenseSet for CU.ExprRefedBaseTypes?
test/DebugInfo/Generic/convert-inlined.ll
32 ↗	(On Diff #187203)	Sorry about the extra work this causes, but it would really pay off to also have a separate peraratory commit that adds dwarfdump support for DW_OP_convert so this is printed as `DW_OP_convert (0x000022 "DW_ATE_signed_32")` as well as dwarfdump --verify support that ensures that the offset actually points to a type DIE.

markus marked 3 inline comments as done.Feb 19 2019, 1:34 AM

markus added inline comments.

lib/CodeGen/AsmPrinter/DIE.cpp
516	what happens when this assertion isn't met in a release compiler? The debug info would be corrupted. Is that a purely hypothetical scenario? With `ULEB128PadSize = 4` we are fine as long as the types are placed within 256MB from the start of the `.debug_info` section. Since we take care to insert the types immediately after the CU this will not be an issue for the case of a single CU. However by using llvm-link it is possible to put several CUs in the same module and that could push us closer to the 256MB limit. If we set `ULEB128PadSize = 5` then the limit becomes 32GB and that should put us on the safe side for a considerable future (keeping in mind that this limit is for object files and not the final linked executable). (One could argue that symbols and label differences, effectively pushing the problem to the assembler, are the right way to go here and I would tend to agree with that but unfortunately that causes havoc in the existing DIE framework as it relies on being able to pre-compute sizes.)
lib/CodeGen/AsmPrinter/DwarfExpression.cpp
396	I don't think that would work as we rely on being able to index into it.
test/DebugInfo/Generic/convert-inlined.ll
32 ↗	(On Diff #187203)	I could look into that but before I spend too much time on dwarfdump I would like to get confirmation that would be the last remaining issue.

markus marked an inline comment as done.Feb 19 2019, 2:28 AM

markus added inline comments.

lib/CodeGen/AsmPrinter/DIE.cpp
516	However by using llvm-link it is possible to put several CUs in the same module and that could push us closer to the 256MB limit. Actually forget that part. I see now that is irrelevant since the offset is within CU and not from start of .debug_info. I got confused looking at the output of readelf as that tool prints .debug_info address even though it is encoded as a CU offset. So given this I think the 256MB limit is perfectly reasonable.

aprantl added a reviewer: dblaikie.Feb 19 2019, 8:45 AM

aprantl added inline comments.

lib/CodeGen/AsmPrinter/DIE.cpp
516	(One could argue that symbols and label differences, effectively pushing the problem to the assembler, are the right way to go here and I would tend to agree with that but unfortunately that causes havoc in the existing DIE framework as it relies on being able to pre-compute sizes.) @dblaikie has dealt with similar problems in the past while refactoring AsmPrinter to support DWOs, perhaps he has some ideas? So given this I think the 256MB limit is perfectly reasonable. A good measure to decide this is to look at the size of an LTO build of Clang with all targets, which is usually a good proxy for what we can expect from large programs.

aprantl added inline comments.Feb 19 2019, 8:51 AM

lib/CodeGen/AsmPrinter/DIE.cpp
516	When you say causes havoc in the existing DIE framework as it relies on being able to pre-compute sizes is the problem that we don't know the offsets of the location list entries ahead of time since they depend on the position of the referenced type DIEs and thus we don't know the size of the DW_AT_location attributes, or is the problem only within the location list section?

markus marked 2 inline comments as done.Feb 19 2019, 10:24 AM

markus added inline comments.

lib/CodeGen/AsmPrinter/DIE.cpp
516	(One could argue that symbols and label differences, effectively pushing the problem to the assembler, are the right way to go here and I would tend to agree with that but unfortunately that causes havoc in the existing DIE framework as it relies on being able to pre-compute sizes.) @dblaikie has dealt with similar problems in the past while refactoring AsmPrinter to support DWOs, perhaps he has some ideas? While I do think that solving this with label differences would be the right thing to do in general I do not think it is the right thing to do here as the changes become much larger than they need to be. So given this I think the 256MB limit is perfectly reasonable. A good measure to decide this is to look at the size of an LTO build of Clang with all targets, which is usually a good proxy for what we can expect from large programs. Yes, to clarify this. Since the base types that we generate are inserted immediately after the CU DIE we would need to insert a huge amount of these (>16 million I believe) to hit the 256MB limit. This should be quite impossible since we don't create entries for duplicate types and there simply aren't that many unique ones :)
516	is the problem that we don't know the offsets of the location list entries ahead of time since they depend on the position of the referenced type DIEs and thus we don't know the size of the DW_AT_location attributes, or is the problem only within the location list section? One of the problems is that as soon as we emit a `.uleb128` directive of a label difference we do not know that size of that and hence cannot compute the size of the block it is in without emitting more labels (begin and end for that block) and then do a label difference of that too. This is not how the DIE handling is currently designed (IIUC) as it relies on being able to compute the size of each DIE. This is especially a problem for the inlined DW_AT_location attributes. This is why having padded / fixed size ULEB128s, as we have in the current patch, makes things much easier.

aprantl added inline comments.Feb 19 2019, 10:34 AM

lib/CodeGen/AsmPrinter/DIE.cpp
516	So we only insert the special type DIEs into the very first DW_TAG_compile_unit? Then this is fine. It wasn't clear to me whether in the LTO case we'd inject the types into every CU. There's no need to do that, so if we don't then.we're good.

I'm a touch confused by the whole discussion here - so I'll write some things & perhaps folks can correct my understanding where necessary.

The issue is that a location expression (either in a direct location, or in a loclist) needs to reference a DIE.

Sounds like that DIE reference is necessarily CU-local (because we're talking about precomputing the offset - and we could only do that if it's CU-local).

We already emit other CU-local DIE references in attributes (eg: DW_AT_specification, etc) references with hardcoded 4 bytes in size- so why would it be problematic to emit this one in the same way (with a padded ULEB128 that we know will give us 4 bytes of offset to work with)?

& yeah, maybe if someone wants to save us some size (at the cost of whatever computational complexity this invokes in the assembler) it could be replaced with label differences (doing label differences for all DIE references would be nifty for manual editing/mucking about with DWARF, but not a big deal).

In D56587#1402682, @dblaikie wrote:

I'm a touch confused by the whole discussion here - so I'll write some things & perhaps folks can correct my understanding where necessary.

I think that we all are. Maybe it is because we tried some many different approaches in the same review that it is a bit convoluted now.

The issue is that a location expression (either in a direct location, or in a loclist) needs to reference a DIE.

Correct.

Sounds like that DIE reference is necessarily CU-local (because we're talking about precomputing the offset - and we could only do that if it's CU-local).

Also correct. The spec says that the reference is an offset into the current CU.

We already emit other CU-local DIE references in attributes (eg: DW_AT_specification, etc) references with hardcoded 4 bytes in size- so why would it be problematic to emit this one in the same way (with a padded ULEB128 that we know will give us 4 bytes of offset to work with)?

Agree. It should be no more problematic than what is done in other places already. The 4 byte ULEB128 gives us 28 bits to encode the offset in but as reasoned in other comments there is no way that limit can be reached.

& yeah, maybe if someone wants to save us some size (at the cost of whatever computational complexity this invokes in the assembler) it could be replaced with label differences (doing label differences for all DIE references would be nifty for manual editing/mucking about with DWARF, but not a big deal).

Agree.

lib/CodeGen/AsmPrinter/DIE.cpp
516	So we only insert the special type DIEs into the very first DW_TAG_compile_unit? Then this is fine. It wasn't clear to me whether in the LTO case we'd inject the types into every CU. There's no need to do that, so if we don't then.we're good. The special type DIEs are local to the CU that use them so they will not necessarily only be inserted into the first one but rather into the ones where they are used. This is not a problem though since according to the spec 'the operand is an unsigned LEB128 number that represents the offset of a debugging information entry in the current compilation unit' i.e. the offset is relative to the current CU so there is no chance that it will exceed the 256MB limit since they are inserted immediately after the DW_TAG_compile_unit DIE.

I have created a review for the requested 'llvm-dwarfdump' updates (the printing part) in https://reviews.llvm.org/D58442

+ Addressed review remark (use range-based loop instead of iterators).
+ Updated tests (probably forgot during last rebase. Verified the values used now with a clean master)

aprantl added inline comments.Feb 20 2019, 8:16 AM

lib/CodeGen/AsmPrinter/DIE.cpp
516	If they are CU-relative I agree that should be perfectly safe, thanks. What happens if I llvm-link two CUs that both contain the same DIExpression. Is it possible for two location list entries to be uniques across compile units? I think the answer is no, right?

markus marked an inline comment as done.Feb 20 2019, 11:21 AM

markus added inline comments.

lib/CodeGen/AsmPrinter/DIE.cpp
516	What happens if I llvm-link two CUs that both contain the same DIExpression. Is it possible for two location list entries to be uniques across compile units? I think the answer is no, right? Correct. The data structure is per CU so nothing happens across such units.

updated to top of trunk to have the llvm-dwarfdump updates committed earlier today
updated tests and made additional improvements to llvm-dwarfdump
fixed segfault bug where we crashed on release builds but not debug builds (the allocation of DIEBaseTypeRef)

Are we good to land this now or what is remaining?

There's a nonzero chance that this patch will break llvm/tools/dsymutil. I'll see if I can come up with a testcase for that (hopefully later today).

lib/CodeGen/AsmPrinter/DwarfCompileUnit.cpp
1195	... `so their offsets fit into the 5 bits reserved inside the location expressions.`
lib/CodeGen/AsmPrinter/DwarfDebug.cpp
1937	You might want to reword this comment to be more assertive :-) Is my understanding correct that we only need to do this here because the inline DW_AT_location (DW_FORM_block, ..) are emitted ahead of time and thus have the correct offsets injected from the get go? Could you please move this out into a `fixupLocEntryDIEReferences()` (or something) function?
1977	Didn't your dwarfdump patch have this info in an enum or am I confusing things?
lib/CodeGen/AsmPrinter/DwarfExpression.cpp
29	Why is this needed? Shouldn't we key off the debugger tuning flag?
395	Can you add a comment that explains what happens in the non-location-list cases?

probinson added inline comments.Feb 21 2019, 11:31 AM

lib/CodeGen/AsmPrinter/DwarfDebug.cpp
948	A separate CU for basic types would not be usable by DW_OP_convert? because it uses CU-relative offsets to find them.
lib/CodeGen/AsmPrinter/DwarfExpression.cpp
29	The debugger-tuning design is that we use it only in the DwarfDebug ctor to set other control flags, which can all be influenced independently. Tuning and its associated flags are not per-CU, although DWARF version is.
392	Pre-v5 needs to emit the GNU op, not the standard op.

Could you please add a third testcase with two DICompileUnits, both of which have a DIExpression with a DW_OP_LLVM_convert that will use the same type and then verify that we emit the type in both CUs? This will help guard against later modifications breaking this scenario.

Thanks!

test/DebugInfo/Generic/convert-debugloc.ll
144 ↗	(On Diff #187774)	I assume most of these attributes aren't needed?

aprantl added a parent revision: D58534: dsymutil support for DW_OP_convert.Feb 21 2019, 5:13 PM

Added test with two CUs.
Updated some comments.
Removed the -generate-typed-dwarf5-expr option, will only produce DW_OP_convert for Dwarf v5, for all prior versions the mask & shift expression is used. I suggest that we forget about DW_OP_GNU_convert and debugger tunings for this review. If that is really desired it can be added later in a separate patch.

markus marked 6 inline comments as done.Feb 22 2019, 5:07 AM

markus added inline comments.

lib/CodeGen/AsmPrinter/DwarfDebug.cpp
1937	Is my understanding correct that we only need to do this here because the inline DW_AT_location (DW_FORM_block, ..) are emitted ahead of time and thus have the correct offsets injected from the get go? Sort of. When these are inserted in `DwarfExpression::addExpression` (for both location-lists and inlined) the offset of the base_type DIE is not known so we need to insert a placeholder. For the the location-list case the data structure is unfortunately a plain byte stream so we need this elaborate state machine to extract the placeholder here. Could you please move this out into a fixupLocEntryDIEReferences() (or something) function? Since this is a state machine and hence keeps state I think that putting it in a separate function would only make it messier.
1977	`DWARFExpression` does contain similar information but it is private there. Not sure if refactoring would be worthwhile.

In D56587#1393750, @dblaikie wrote:

In D56587#1393746, @probinson wrote:

In D56587#1393556, @dblaikie wrote:

I don't think anyone mentioned it, that's why I was bringing it up - there's always an option to not render any location if it's not possible/worth the work. That's all I was asking - is it worth the complexity? (I wasn't sure anyone needed it - but sounds like Sony does, reckon it's worth the tradeoff in complexity in LLVM compared to the work required to support this in the Sony debugger?)

NVPTX also would need it, because they are stuck on DWARF v2.

Any ideas if NVPTX hit this case? my understanding was that NVPTX has a fairly restrictive set of code or actions that can be used.

Ping on this - still wondering if anyone needs the complicated code or if we could get away with the GNU extension + DWARFv5 standard form.

In D56587#1407634, @dblaikie wrote:

In D56587#1393750, @dblaikie wrote:

In D56587#1393746, @probinson wrote:

In D56587#1393556, @dblaikie wrote:

I don't think anyone mentioned it, that's why I was bringing it up - there's always an option to not render any location if it's not possible/worth the work. That's all I was asking - is it worth the complexity? (I wasn't sure anyone needed it - but sounds like Sony does, reckon it's worth the tradeoff in complexity in LLVM compared to the work required to support this in the Sony debugger?)

NVPTX also would need it, because they are stuck on DWARF v2.

Any ideas if NVPTX hit this case? my understanding was that NVPTX has a fairly restrictive set of code or actions that can be used.

Ping on this - still wondering if anyone needs the complicated code or if we could get away with the GNU extension + DWARFv5 standard

not sure, if NVPTX can hit this. It has no stack at all, so, maybe, this op won't be generated at all

Rewrote the expression parsing state machine in DwarfDebug::emitDebugLocEntry to use DWARFExpression as this avoids duplication of code describing ops and their arguments (got inspired by the dsymutil review).

aprantl added inline comments.Feb 27 2019, 9:14 AM

lib/CodeGen/AsmPrinter/DwarfDebug.cpp
1944	Nice. Should we do the same thing as in dsymutil here and check for Encoding::BaseTypeRef?

Removed explicit check for DW_OP_convert and replaced with a generic handling of operands that should work with all types (regardless if it is first, second or both that is BaseTypeRef).

We're almost there!

lib/CodeGen/AsmPrinter/DIE.cpp
512	please remove this comment from the implementation.
521	ditto.. it should be on the declaration in the header file.
lib/CodeGen/AsmPrinter/DwarfDebug.cpp
1931	Remove the first word.
1944	Could you please still factor this into a static fixupBaseTypeRefs(or something along those lines) function for better readability?
1947	Please add an assert that fails if the opcode is `DW_OP_const_type` as it is not supported by this loop.
1949	Where is this getting copied?

We're almost there!

Yay :)

lib/CodeGen/AsmPrinter/DwarfDebug.cpp
1944	Sure, but I don't understand where to place the cut to improve readability. I.e. what should go into `fixupBaseTypeRefs` and what should remain in `emitDebugLocEntry`?
1949	It is not getting copied at all, 'Encoding::SizeNA` indicates that the operation does not have an operand in this slot. Or maybe I am not understanding the question?

aprantl accepted this revision.Mar 1 2019, 8:12 AM

aprantl added inline comments.

lib/CodeGen/AsmPrinter/DwarfDebug.cpp
1949	My bad, I was thinking about a non-base-type operand, but it is actually handled in the else branch!
1949	On second thought, I think we can leave it as is, too.

This revision is now accepted and ready to land.Mar 1 2019, 8:12 AM

Closed by commit rL356442: [DebugInfo] Introduce DW_OP_LLVM_convert (authored by markus). · Explain WhyMar 19 2019, 1:48 AM

This revision was automatically updated to reflect the committed changes.

Herald added a project: Restricted Project. · View Herald TranscriptMar 19 2019, 1:48 AM

Herald added a subscriber: ormris. · View Herald Transcript

ormris removed a subscriber: ormris.Mar 19 2019, 8:54 AM

Revision Contents

Path

Size

docs/

LangRef.rst

4 lines

include/

llvm/

BinaryFormat/

Dwarf.h

3 lines

CodeGen/

AsmPrinter.h

2 lines

DIE.h

47 lines

DIEValue.def

1 line

MC/

MCStreamer.h

2 lines

lib/

AsmParser/

LLParser.cpp

9 lines

BinaryFormat/

Dwarf.cpp

3 lines

CodeGen/

AsmPrinter/

4 lines

10 lines

20 lines

3 lines

6 lines

14 lines

18 lines

6 lines

128 lines

18 lines

11 lines

2 lines

15 lines

IR/

AsmWriter.cpp

9 lines

DebugInfoMetadata.cpp

2 lines

MC/

MCStreamer.cpp

4 lines

Target/

BPF/

MCTargetDesc/

BPFAsmBackend.cpp

2 lines

Transforms/

Utils/

Local.cpp

19 lines

test/

Assembler/

diexpression.ll

6 lines

Transforms/

InstCombine/

cast-set-preserve-signed-dbg-val.ll

2 lines

unittests/

Transforms/

Utils/

LocalTest.cpp

40 lines

Diff 187015

docs/LangRef.rst

This file is larger than 256 KB, so syntax highlighting is disabled by default.

	Show First 20 Lines • Show All 4,660 Lines • ▼ Show 20 Lines
	- ``DW_OP_minus`` pops the last two entries from the expression stack, subtracts			- ``DW_OP_minus`` pops the last two entries from the expression stack, subtracts
	the last entry from the second last entry and appends the result to the			the last entry from the second last entry and appends the result to the
	expression stack.			expression stack.
	- ``DW_OP_plus_uconst, 93`` adds ``93`` to the working expression.			- ``DW_OP_plus_uconst, 93`` adds ``93`` to the working expression.
	- ``DW_OP_LLVM_fragment, 16, 8`` specifies the offset and size (``16`` and ``8``			- ``DW_OP_LLVM_fragment, 16, 8`` specifies the offset and size (``16`` and ``8``
	here, respectively) of the variable fragment from the working expression. Note			here, respectively) of the variable fragment from the working expression. Note
	that contrary to DW_OP_bit_piece, the offset is describing the location			that contrary to DW_OP_bit_piece, the offset is describing the location
	within the described source variable.			within the described source variable.
				- ``DW_OP_LLVM_convert, 16, DW_ATE_signed`` specifies a bit size and encoding
				(``16`` and ``DW_ATE_signed`` here, respectively) to which the top of the
				expression stack is to be converted. Maps into a ``DW_OP_convert`` operation
				that references a base type constructed from the supplied values.
				aprantlUnsubmitted Not Done Reply Inline Actions Thanks, this looks good! aprantl: Thanks, this looks good!
	- ``DW_OP_swap`` swaps top two stack entries.			- ``DW_OP_swap`` swaps top two stack entries.
	- ``DW_OP_xderef`` provides extended dereference mechanism. The entry at the top			- ``DW_OP_xderef`` provides extended dereference mechanism. The entry at the top
	of the stack is treated as an address. The second stack entry is treated as an			of the stack is treated as an address. The second stack entry is treated as an
	address space identifier.			address space identifier.
	- ``DW_OP_stack_value`` marks a constant value.			- ``DW_OP_stack_value`` marks a constant value.

	DWARF specifies three kinds of simple location descriptions: Register, memory,			DWARF specifies three kinds of simple location descriptions: Register, memory,
	and implicit location descriptions. Note that a location description is			and implicit location descriptions. Note that a location description is
	▲ Show 20 Lines • Show All 12,154 Lines • Show Last 20 Lines

include/llvm/BinaryFormat/Dwarf.h

Show First 20 Lines • Show All 123 Lines • ▼ Show 20 Lines	#include "llvm/BinaryFormat/Dwarf.def"
DW_FORM_lo_user = 0x1f00, ///< Not specified by DWARF.		DW_FORM_lo_user = 0x1f00, ///< Not specified by DWARF.
};		};

enum LocationAtom {		enum LocationAtom {
#define HANDLE_DW_OP(ID, NAME, VERSION, VENDOR) DW_OP_##NAME = ID,		#define HANDLE_DW_OP(ID, NAME, VERSION, VENDOR) DW_OP_##NAME = ID,
#include "llvm/BinaryFormat/Dwarf.def"		#include "llvm/BinaryFormat/Dwarf.def"
DW_OP_lo_user = 0xe0,		DW_OP_lo_user = 0xe0,
DW_OP_hi_user = 0xff,		DW_OP_hi_user = 0xff,
DW_OP_LLVM_fragment = 0x1000 ///< Only used in LLVM metadata.		DW_OP_LLVM_fragment = 0x1000, ///< Only used in LLVM metadata.
		DW_OP_LLVM_convert = 0x1001 ///< Only used in LLVM metadata.
};		};

enum TypeKind : uint8_t {		enum TypeKind : uint8_t {
#define HANDLE_DW_ATE(ID, NAME, VERSION, VENDOR) DW_ATE_##NAME = ID,		#define HANDLE_DW_ATE(ID, NAME, VERSION, VENDOR) DW_ATE_##NAME = ID,
#include "llvm/BinaryFormat/Dwarf.def"		#include "llvm/BinaryFormat/Dwarf.def"
DW_ATE_lo_user = 0x80,		DW_ATE_lo_user = 0x80,
DW_ATE_hi_user = 0xff		DW_ATE_hi_user = 0xff
};		};
▲ Show 20 Lines • Show All 487 Lines • Show Last 20 Lines

include/llvm/CodeGen/AsmPrinter.h

Show First 20 Lines • Show All 504 Lines • ▼ Show 20 Lines	public:
//===------------------------------------------------------------------===//		//===------------------------------------------------------------------===//
// Dwarf Emission Helper Routines		// Dwarf Emission Helper Routines
//===------------------------------------------------------------------===//		//===------------------------------------------------------------------===//

/// Emit the specified signed leb128 value.		/// Emit the specified signed leb128 value.
void EmitSLEB128(int64_t Value, const char *Desc = nullptr) const;		void EmitSLEB128(int64_t Value, const char *Desc = nullptr) const;

/// Emit the specified unsigned leb128 value.		/// Emit the specified unsigned leb128 value.
void EmitULEB128(uint64_t Value, const char *Desc = nullptr) const;		void EmitULEB128(uint64_t Value, const char *Desc = nullptr, unsigned PadTo = 0) const;

/// Emit a .byte 42 directive that corresponds to an encoding. If verbose		/// Emit a .byte 42 directive that corresponds to an encoding. If verbose
/// assembly output is enabled, we output comments describing the encoding.		/// assembly output is enabled, we output comments describing the encoding.
/// Desc is a string saying what the encoding is specifying (e.g. "LSDA").		/// Desc is a string saying what the encoding is specifying (e.g. "LSDA").
void EmitEncodingByte(unsigned Val, const char *Desc = nullptr) const;		void EmitEncodingByte(unsigned Val, const char *Desc = nullptr) const;

/// Return the size of the encoding in bytes.		/// Return the size of the encoding in bytes.
unsigned GetSizeOfEncodedValue(unsigned Encoding) const;		unsigned GetSizeOfEncodedValue(unsigned Encoding) const;
▲ Show 20 Lines • Show All 154 Lines • Show Last 20 Lines

include/llvm/CodeGen/DIE.h

Show All 29 Lines
#include <iterator>		#include <iterator>
#include <new>		#include <new>
#include <type_traits>		#include <type_traits>
#include <utility>		#include <utility>
#include <vector>		#include <vector>

namespace llvm {		namespace llvm {

		class DwarfCompileUnit;
class AsmPrinter;		class AsmPrinter;
class DIE;		class DIE;
class DIEUnit;		class DIEUnit;
class MCExpr;		class MCExpr;
class MCSection;		class MCSection;
class MCSymbol;		class MCSymbol;
class raw_ostream;		class raw_ostream;

▲ Show 20 Lines • Show All 179 Lines • ▼ Show 20 Lines	public:

void EmitValue(const AsmPrinter *AP, dwarf::Form Form) const;		void EmitValue(const AsmPrinter *AP, dwarf::Form Form) const;
unsigned SizeOf(const AsmPrinter *AP, dwarf::Form Form) const;		unsigned SizeOf(const AsmPrinter *AP, dwarf::Form Form) const;

void print(raw_ostream &O) const;		void print(raw_ostream &O) const;
};		};

//===--------------------------------------------------------------------===//		//===--------------------------------------------------------------------===//
		/// A BaseTypeRef DIE.
		class DIEBaseTypeRef {
		const uint64_t Index;
		const DwarfCompileUnit *CU;
		static constexpr unsigned ULEB128PadSize = 4;

		public:
		explicit DIEBaseTypeRef(const DwarfCompileUnit *TheCU, uint64_t Idx) : Index(Idx), CU(TheCU) {}

		void EmitValue(const AsmPrinter *AP, dwarf::Form Form) const;
		unsigned SizeOf(const AsmPrinter *AP, dwarf::Form Form) const;

		void print(raw_ostream &O) const;
		};

		//===--------------------------------------------------------------------===//
/// A simple label difference DIE.		/// A simple label difference DIE.
///		///
class DIEDelta {		class DIEDelta {
const MCSymbol *LabelHi;		const MCSymbol *LabelHi;
const MCSymbol *LabelLo;		const MCSymbol *LabelLo;

public:		public:
DIEDelta(const MCSymbol Hi, const MCSymbol Lo) : LabelHi(Hi), LabelLo(Lo) {}		DIEDelta(const MCSymbol Hi, const MCSymbol Lo) : LabelHi(Hi), LabelLo(Lo) {}
▲ Show 20 Lines • Show All 103 Lines • ▼ Show 20 Lines	private:
dwarf::Form Form = (dwarf::Form)0;		dwarf::Form Form = (dwarf::Form)0;

/// Storage for the value.		/// Storage for the value.
///		///
/// All values that aren't standard layout (or are larger than 8 bytes)		/// All values that aren't standard layout (or are larger than 8 bytes)
/// should be stored by reference instead of by value.		/// should be stored by reference instead of by value.
using ValTy = AlignedCharArrayUnion<DIEInteger, DIEString, DIEExpr, DIELabel,		using ValTy = AlignedCharArrayUnion<DIEInteger, DIEString, DIEExpr, DIELabel,
DIEDelta , DIEEntry, DIEBlock ,		DIEDelta , DIEEntry, DIEBlock ,
DIELoc *, DIELocList>;		DIELoc , DIELocList, DIEBaseTypeRef >;

static_assert(sizeof(ValTy) <= sizeof(uint64_t) \|\|		static_assert(sizeof(ValTy) <= sizeof(uint64_t) \|\|
sizeof(ValTy) <= sizeof(void *),		sizeof(ValTy) <= sizeof(void *),
"Expected all large types to be stored via pointer");		"Expected all large types to be stored via pointer");

/// Underlying stored value.		/// Underlying stored value.
ValTy Val;		ValTy Val;

▲ Show 20 Lines • Show All 135 Lines • ▼ Show 20 Lines	void push_back(Node &N) {
assert(N.Next.getInt() == true && "Expected unlinked node");		assert(N.Next.getInt() == true && "Expected unlinked node");

if (Last) {		if (Last) {
N.Next = Last->Next;		N.Next = Last->Next;
Last->Next.setPointerAndInt(&N, false);		Last->Next.setPointerAndInt(&N, false);
}		}
Last = &N;		Last = &N;
}		}

		void push_front(Node &N) {
		assert(N.Next.getPointer() == &N && "Expected unlinked node");
		assert(N.Next.getInt() == true && "Expected unlinked node");

		if (Last) {
		assert(Last->Next.getInt() == true && "Expected Last to have bit set");
		N.Next.setPointerAndInt(Last->Next.getPointer(), false);
		Last->Next.setPointerAndInt(&N, true);
		} else {
		Last = &N;
		}
		}
};		};

template <class T> class IntrusiveBackList : IntrusiveBackListBase {		template <class T> class IntrusiveBackList : IntrusiveBackListBase {
public:		public:
using IntrusiveBackListBase::empty;		using IntrusiveBackListBase::empty;

void push_back(T &N) { IntrusiveBackListBase::push_back(N); }		void push_back(T &N) { IntrusiveBackListBase::push_back(N); }
		void push_front(T &N) { IntrusiveBackListBase::push_front(N); }
T &back() { return static_cast<T >(Last); }		T &back() { return static_cast<T >(Last); }
const T &back() const { return static_cast<T >(Last); }		const T &back() const { return static_cast<T >(Last); }

		T &front() { return static_cast<T >(Last->Next.getPointer()); }
		const T &front() const { return static_cast<T >(Last->Next.getPointer()); }

class const_iterator;		class const_iterator;
class iterator		class iterator
: public iterator_facade_base<iterator, std::forward_iterator_tag, T> {		: public iterator_facade_base<iterator, std::forward_iterator_tag, T> {
friend class const_iterator;		friend class const_iterator;

Node *N = nullptr;		Node *N = nullptr;

public:		public:
▲ Show 20 Lines • Show All 155 Lines • ▼ Show 20 Lines	class DIE : IntrusiveBackListNode, public DIEValueList {

/// The owner is either the parent DIE for children of other DIEs, or a		/// The owner is either the parent DIE for children of other DIEs, or a
/// DIEUnit which contains this DIE as its unit DIE.		/// DIEUnit which contains this DIE as its unit DIE.
PointerUnion<DIE , DIEUnit > Owner;		PointerUnion<DIE , DIEUnit > Owner;

explicit DIE(dwarf::Tag Tag) : Tag(Tag) {}		explicit DIE(dwarf::Tag Tag) : Tag(Tag) {}

public:		public:
DIE() = delete;		DIE() = delete;
		aprantlUnsubmitted Not Done Reply Inline Actions It seems excessive to burden every DIE object with an extra 8 bytes for something we'll only need on a handful of basic type DIEs. Would it be possible to store this in a DenseMap on the side instead? aprantl: It seems excessive to burden every DIE object with an extra 8 bytes for something we'll only…
		markusAuthorUnsubmitted Done Reply Inline Actions I agree and it does seem like the most logical place for such a DenseMap would be inside the CU. It is not clear however how to make that available to `AsmPrinter::emitDwarfDIE` without violating the current interfaces. markus: I agree and it does seem like the most logical place for such a DenseMap would be inside the CU.
DIE(const DIE &RHS) = delete;		DIE(const DIE &RHS) = delete;
DIE(DIE &&RHS) = delete;		DIE(DIE &&RHS) = delete;
DIE &operator=(const DIE &RHS) = delete;		DIE &operator=(const DIE &RHS) = delete;
DIE &operator=(const DIE &&RHS) = delete;		DIE &operator=(const DIE &&RHS) = delete;

static DIE *get(BumpPtrAllocator &Alloc, dwarf::Tag Tag) {		static DIE *get(BumpPtrAllocator &Alloc, dwarf::Tag Tag) {
return new (Alloc) DIE(Tag);		return new (Alloc) DIE(Tag);
}		}
▲ Show 20 Lines • Show All 66 Lines • ▼ Show 20 Lines	public:
/// \returns the DIEUnit that represents the compile or type unit that owns		/// \returns the DIEUnit that represents the compile or type unit that owns
/// this DIE, or NULL if this DIE hasn't been added to a unit DIE.		/// this DIE, or NULL if this DIE hasn't been added to a unit DIE.
const DIEUnit *getUnit() const;		const DIEUnit *getUnit() const;

void setOffset(unsigned O) { Offset = O; }		void setOffset(unsigned O) { Offset = O; }
void setSize(unsigned S) { Size = S; }		void setSize(unsigned S) { Size = S; }

/// Add a child to the DIE.		/// Add a child to the DIE.
DIE &addChild(DIE *Child) {		DIE &addChild(DIE *Child, bool PushFront = false) {
assert(!Child->getParent() && "Child should be orphaned");		assert(!Child->getParent() && "Child should be orphaned");
Child->Owner = this;		Child->Owner = this;
		if (PushFront) {
		Children.push_front(*Child);
		return Children.front();
		} else {
Children.push_back(*Child);		Children.push_back(*Child);
return Children.back();		return Children.back();
}		}
		}

/// Find a value in the DIE with the attribute given.		/// Find a value in the DIE with the attribute given.
///		///
/// Returns a default-constructed DIEValue (where \a DIEValue::getType()		/// Returns a default-constructed DIEValue (where \a DIEValue::getType()
/// gives \a DIEValue::isNone) if no such attribute exists.		/// gives \a DIEValue::isNone) if no such attribute exists.
DIEValue findAttribute(dwarf::Attribute Attribute) const;		DIEValue findAttribute(dwarf::Attribute Attribute) const;

void print(raw_ostream &O, unsigned IndentCount = 0) const;		void print(raw_ostream &O, unsigned IndentCount = 0) const;
▲ Show 20 Lines • Show All 128 Lines • Show Last 20 Lines

include/llvm/CodeGen/DIEValue.def

	Show All 28 Lines
	#ifndef HANDLE_DIEVALUE_LARGE			#ifndef HANDLE_DIEVALUE_LARGE
	#define HANDLE_DIEVALUE_LARGE(T) HANDLE_DIEVALUE(T)			#define HANDLE_DIEVALUE_LARGE(T) HANDLE_DIEVALUE(T)
	#endif			#endif

	HANDLE_DIEVALUE_SMALL(Integer)			HANDLE_DIEVALUE_SMALL(Integer)
	HANDLE_DIEVALUE_SMALL(String)			HANDLE_DIEVALUE_SMALL(String)
	HANDLE_DIEVALUE_SMALL(Expr)			HANDLE_DIEVALUE_SMALL(Expr)
	HANDLE_DIEVALUE_SMALL(Label)			HANDLE_DIEVALUE_SMALL(Label)
				HANDLE_DIEVALUE_SMALL(BaseTypeRef)
	HANDLE_DIEVALUE_LARGE(Delta)			HANDLE_DIEVALUE_LARGE(Delta)
	HANDLE_DIEVALUE_SMALL(Entry)			HANDLE_DIEVALUE_SMALL(Entry)
	HANDLE_DIEVALUE_LARGE(Block)			HANDLE_DIEVALUE_LARGE(Block)
	HANDLE_DIEVALUE_LARGE(Loc)			HANDLE_DIEVALUE_LARGE(Loc)
	HANDLE_DIEVALUE_SMALL(LocList)			HANDLE_DIEVALUE_SMALL(LocList)
	HANDLE_DIEVALUE_LARGE(InlineString)			HANDLE_DIEVALUE_LARGE(InlineString)

	#undef HANDLE_DIEVALUE			#undef HANDLE_DIEVALUE
	#undef HANDLE_DIEVALUE_SMALL			#undef HANDLE_DIEVALUE_SMALL
	#undef HANDLE_DIEVALUE_LARGE			#undef HANDLE_DIEVALUE_LARGE

include/llvm/MC/MCStreamer.h

Show First 20 Lines • Show All 628 Lines • ▼ Show 20 Lines	public:
virtual void EmitIntValue(uint64_t Value, unsigned Size);		virtual void EmitIntValue(uint64_t Value, unsigned Size);

virtual void EmitULEB128Value(const MCExpr *Value);		virtual void EmitULEB128Value(const MCExpr *Value);

virtual void EmitSLEB128Value(const MCExpr *Value);		virtual void EmitSLEB128Value(const MCExpr *Value);

/// Special case of EmitULEB128Value that avoids the client having to		/// Special case of EmitULEB128Value that avoids the client having to
/// pass in a MCExpr for constant integers.		/// pass in a MCExpr for constant integers.
void EmitULEB128IntValue(uint64_t Value);		void EmitULEB128IntValue(uint64_t Value, unsigned PadTo = 0);

/// Special case of EmitSLEB128Value that avoids the client having to		/// Special case of EmitSLEB128Value that avoids the client having to
/// pass in a MCExpr for constant integers.		/// pass in a MCExpr for constant integers.
void EmitSLEB128IntValue(int64_t Value);		void EmitSLEB128IntValue(int64_t Value);

/// Special case of EmitValue that avoids the client having to pass in		/// Special case of EmitValue that avoids the client having to pass in
/// a MCExpr for MCSymbols.		/// a MCExpr for MCSymbols.
void EmitSymbolValue(const MCSymbol *Sym, unsigned Size,		void EmitSymbolValue(const MCSymbol *Sym, unsigned Size,
▲ Show 20 Lines • Show All 367 Lines • Show Last 20 Lines

lib/AsmParser/LLParser.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 4,829 Lines • ▼ Show 20 Lines	do {
if (unsigned Op = dwarf::getOperationEncoding(Lex.getStrVal())) {		if (unsigned Op = dwarf::getOperationEncoding(Lex.getStrVal())) {
Lex.Lex();		Lex.Lex();
Elements.push_back(Op);		Elements.push_back(Op);
continue;		continue;
}		}
return TokError(Twine("invalid DWARF op '") + Lex.getStrVal() + "'");		return TokError(Twine("invalid DWARF op '") + Lex.getStrVal() + "'");
}		}

		if (Lex.getKind() == lltok::DwarfAttEncoding) {
		if (unsigned Op = dwarf::getAttributeEncoding(Lex.getStrVal())) {
		Lex.Lex();
		Elements.push_back(Op);
		continue;
		}
		return TokError(Twine("invalid DWARF attribute encoding '") + Lex.getStrVal() + "'");
		}
		aprantlUnsubmitted Not Done Reply Inline Actions Can you also add a round-trip test to `test/Assembler/diexpression.ll`? aprantl: Can you also add a round-trip test to `test/Assembler/diexpression.ll`?

if (Lex.getKind() != lltok::APSInt \|\| Lex.getAPSIntVal().isSigned())		if (Lex.getKind() != lltok::APSInt \|\| Lex.getAPSIntVal().isSigned())
return TokError("expected unsigned integer");		return TokError("expected unsigned integer");

auto &U = Lex.getAPSIntVal();		auto &U = Lex.getAPSIntVal();
if (U.ugt(UINT64_MAX))		if (U.ugt(UINT64_MAX))
return TokError("element too large, limit is " + Twine(UINT64_MAX));		return TokError("element too large, limit is " + Twine(UINT64_MAX));
Elements.push_back(U.getZExtValue());		Elements.push_back(U.getZExtValue());
Lex.Lex();		Lex.Lex();
▲ Show 20 Lines • Show All 3,690 Lines • Show Last 20 Lines

lib/BinaryFormat/Dwarf.cpp

	Show First 20 Lines • Show All 137 Lines • ▼ Show 20 Lines
	StringRef llvm::dwarf::OperationEncodingString(unsigned Encoding) {			StringRef llvm::dwarf::OperationEncodingString(unsigned Encoding) {
	switch (Encoding) {			switch (Encoding) {
	default:			default:
	return StringRef();			return StringRef();
	#define HANDLE_DW_OP(ID, NAME, VERSION, VENDOR) \			#define HANDLE_DW_OP(ID, NAME, VERSION, VENDOR) \
	case DW_OP_##NAME: \			case DW_OP_##NAME: \
	return "DW_OP_" #NAME;			return "DW_OP_" #NAME;
	#include "llvm/BinaryFormat/Dwarf.def"			#include "llvm/BinaryFormat/Dwarf.def"
				case DW_OP_LLVM_convert:
				return "DW_OP_LLVM_convert";
	case DW_OP_LLVM_fragment:			case DW_OP_LLVM_fragment:
	return "DW_OP_LLVM_fragment";			return "DW_OP_LLVM_fragment";
	}			}
	}			}

	unsigned llvm::dwarf::getOperationEncoding(StringRef OperationEncodingString) {			unsigned llvm::dwarf::getOperationEncoding(StringRef OperationEncodingString) {
	return StringSwitch<unsigned>(OperationEncodingString)			return StringSwitch<unsigned>(OperationEncodingString)
	#define HANDLE_DW_OP(ID, NAME, VERSION, VENDOR) \			#define HANDLE_DW_OP(ID, NAME, VERSION, VENDOR) \
	.Case("DW_OP_" #NAME, DW_OP_##NAME)			.Case("DW_OP_" #NAME, DW_OP_##NAME)
	#include "llvm/BinaryFormat/Dwarf.def"			#include "llvm/BinaryFormat/Dwarf.def"
				.Case("DW_OP_LLVM_convert", DW_OP_LLVM_convert)
	.Case("DW_OP_LLVM_fragment", DW_OP_LLVM_fragment)			.Case("DW_OP_LLVM_fragment", DW_OP_LLVM_fragment)
	.Default(0);			.Default(0);
	}			}

	unsigned llvm::dwarf::OperationVersion(dwarf::LocationAtom Op) {			unsigned llvm::dwarf::OperationVersion(dwarf::LocationAtom Op) {
	switch (Op) {			switch (Op) {
	default:			default:
	return 0;			return 0;
	▲ Show 20 Lines • Show All 561 Lines • Show Last 20 Lines

lib/CodeGen/AsmPrinter/AsmPrinterDwarf.cpp

	Show All 36 Lines
	/// EmitSLEB128 - emit the specified signed leb128 value.			/// EmitSLEB128 - emit the specified signed leb128 value.
	void AsmPrinter::EmitSLEB128(int64_t Value, const char *Desc) const {			void AsmPrinter::EmitSLEB128(int64_t Value, const char *Desc) const {
	if (isVerbose() && Desc)			if (isVerbose() && Desc)
	OutStreamer->AddComment(Desc);			OutStreamer->AddComment(Desc);

	OutStreamer->EmitSLEB128IntValue(Value);			OutStreamer->EmitSLEB128IntValue(Value);
	}			}

	void AsmPrinter::EmitULEB128(uint64_t Value, const char *Desc) const {			void AsmPrinter::EmitULEB128(uint64_t Value, const char *Desc, unsigned PadTo) const {
	if (isVerbose() && Desc)			if (isVerbose() && Desc)
	OutStreamer->AddComment(Desc);			OutStreamer->AddComment(Desc);

	OutStreamer->EmitULEB128IntValue(Value);			OutStreamer->EmitULEB128IntValue(Value, PadTo);
	}			}

	/// Emit something like ".uleb128 Hi-Lo".			/// Emit something like ".uleb128 Hi-Lo".
	void AsmPrinter::EmitLabelDifferenceAsULEB128(const MCSymbol *Hi,			void AsmPrinter::EmitLabelDifferenceAsULEB128(const MCSymbol *Hi,
	const MCSymbol *Lo) const {			const MCSymbol *Lo) const {
	OutStreamer->emitAbsoluteSymbolDiffAsULEB128(Hi, Lo);			OutStreamer->emitAbsoluteSymbolDiffAsULEB128(Hi, Lo);
	}			}

	▲ Show 20 Lines • Show All 216 Lines • Show Last 20 Lines

lib/CodeGen/AsmPrinter/ByteStreamer.h

Show All 25 Lines	protected:
~ByteStreamer() = default;		~ByteStreamer() = default;
ByteStreamer(const ByteStreamer&) = default;		ByteStreamer(const ByteStreamer&) = default;
ByteStreamer() = default;		ByteStreamer() = default;

public:		public:
// For now we're just handling the calls we need for dwarf emission/hashing.		// For now we're just handling the calls we need for dwarf emission/hashing.
virtual void EmitInt8(uint8_t Byte, const Twine &Comment = "") = 0;		virtual void EmitInt8(uint8_t Byte, const Twine &Comment = "") = 0;
virtual void EmitSLEB128(uint64_t DWord, const Twine &Comment = "") = 0;		virtual void EmitSLEB128(uint64_t DWord, const Twine &Comment = "") = 0;
virtual void EmitULEB128(uint64_t DWord, const Twine &Comment = "") = 0;		virtual void EmitULEB128(uint64_t DWord, const Twine &Comment = "", unsigned PadTo = 0) = 0;
};		};

class APByteStreamer final : public ByteStreamer {		class APByteStreamer final : public ByteStreamer {
private:		private:
AsmPrinter &AP;		AsmPrinter &AP;

public:		public:
APByteStreamer(AsmPrinter &Asm) : AP(Asm) {}		APByteStreamer(AsmPrinter &Asm) : AP(Asm) {}
void EmitInt8(uint8_t Byte, const Twine &Comment) override {		void EmitInt8(uint8_t Byte, const Twine &Comment) override {
AP.OutStreamer->AddComment(Comment);		AP.OutStreamer->AddComment(Comment);
AP.emitInt8(Byte);		AP.emitInt8(Byte);
}		}
void EmitSLEB128(uint64_t DWord, const Twine &Comment) override {		void EmitSLEB128(uint64_t DWord, const Twine &Comment) override {
AP.OutStreamer->AddComment(Comment);		AP.OutStreamer->AddComment(Comment);
AP.EmitSLEB128(DWord);		AP.EmitSLEB128(DWord);
}		}
void EmitULEB128(uint64_t DWord, const Twine &Comment) override {		void EmitULEB128(uint64_t DWord, const Twine &Comment, unsigned PadTo) override {
AP.OutStreamer->AddComment(Comment);		AP.OutStreamer->AddComment(Comment);
AP.EmitULEB128(DWord);		AP.EmitULEB128(DWord);
}		}
};		};

class HashingByteStreamer final : public ByteStreamer {		class HashingByteStreamer final : public ByteStreamer {
private:		private:
DIEHash &Hash;		DIEHash &Hash;
public:		public:
HashingByteStreamer(DIEHash &H) : Hash(H) {}		HashingByteStreamer(DIEHash &H) : Hash(H) {}
void EmitInt8(uint8_t Byte, const Twine &Comment) override {		void EmitInt8(uint8_t Byte, const Twine &Comment) override {
Hash.update(Byte);		Hash.update(Byte);
}		}
void EmitSLEB128(uint64_t DWord, const Twine &Comment) override {		void EmitSLEB128(uint64_t DWord, const Twine &Comment) override {
Hash.addSLEB128(DWord);		Hash.addSLEB128(DWord);
}		}
void EmitULEB128(uint64_t DWord, const Twine &Comment) override {		void EmitULEB128(uint64_t DWord, const Twine &Comment, unsigned PadTo) override {
Hash.addULEB128(DWord);		Hash.addULEB128(DWord);
}		}
};		};

class BufferByteStreamer final : public ByteStreamer {		class BufferByteStreamer final : public ByteStreamer {
private:		private:
SmallVectorImpl<char> &Buffer;		SmallVectorImpl<char> &Buffer;
SmallVectorImpl<std::string> &Comments;		SmallVectorImpl<std::string> &Comments;
Show All 20 Lines	if (GenerateComments) {
Comments.push_back(Comment.str());		Comments.push_back(Comment.str());
// Add some empty comments to keep the Buffer and Comments vectors aligned		// Add some empty comments to keep the Buffer and Comments vectors aligned
// with each other.		// with each other.
for (size_t i = 1; i < Length; ++i)		for (size_t i = 1; i < Length; ++i)
Comments.push_back("");		Comments.push_back("");

}		}
}		}
void EmitULEB128(uint64_t DWord, const Twine &Comment) override {		void EmitULEB128(uint64_t DWord, const Twine &Comment, unsigned PadTo) override {
raw_svector_ostream OSE(Buffer);		raw_svector_ostream OSE(Buffer);
unsigned Length = encodeULEB128(DWord, OSE);		unsigned Length = encodeULEB128(DWord, OSE, PadTo);
if (GenerateComments) {		if (GenerateComments) {
Comments.push_back(Comment.str());		Comments.push_back(Comment.str());
// Add some empty comments to keep the Buffer and Comments vectors aligned		// Add some empty comments to keep the Buffer and Comments vectors aligned
// with each other.		// with each other.
for (size_t i = 1; i < Length; ++i)		for (size_t i = 1; i < Length; ++i)
Comments.push_back("");		Comments.push_back("");

}		}
}		}
};		};

}		}

#endif		#endif

lib/CodeGen/AsmPrinter/DIE.cpp

Show First 20 Lines • Show All 500 Lines • ▼ Show 20 Lines	unsigned DIELabel::SizeOf(const AsmPrinter *AP, dwarf::Form Form) const {
if (Form == dwarf::DW_FORM_strp) return 4;		if (Form == dwarf::DW_FORM_strp) return 4;
return AP->MAI->getCodePointerSize();		return AP->MAI->getCodePointerSize();
}		}

LLVM_DUMP_METHOD		LLVM_DUMP_METHOD
void DIELabel::print(raw_ostream &O) const { O << "Lbl: " << Label->getName(); }		void DIELabel::print(raw_ostream &O) const { O << "Lbl: " << Label->getName(); }

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
		// DIEBaseTypeRef Implementation
		//===----------------------------------------------------------------------===//

		/// EmitValue - Emit label value.
		aprantlUnsubmitted Not Done Reply Inline Actions under the current style you may drop the duplicate doxygen comments con the function implementations. aprantl: under the current style you may drop the duplicate doxygen comments con the function…
		aprantlUnsubmitted Not Done Reply Inline Actions please remove this comment from the implementation. aprantl: please remove this comment from the implementation.
		///
		void DIEBaseTypeRef::EmitValue(const AsmPrinter *AP, dwarf::Form Form) const {
		AP->EmitULEB128(CU->ExprRefedBaseTypes[Index].Die->getOffset(),
		nullptr, ULEB128PadSize);
		aprantlUnsubmitted Not Done Reply Inline Actions what happens when this assertion isn't met in a release compiler? Is that a purely hypothetical scenario? aprantl: what happens when this assertion isn't met in a release compiler? Is that a purely hypothetical…
		markusAuthorUnsubmitted Done Reply Inline Actions what happens when this assertion isn't met in a release compiler? The debug info would be corrupted. Is that a purely hypothetical scenario? With `ULEB128PadSize = 4` we are fine as long as the types are placed within 256MB from the start of the `.debug_info` section. Since we take care to insert the types immediately after the CU this will not be an issue for the case of a single CU. However by using llvm-link it is possible to put several CUs in the same module and that could push us closer to the 256MB limit. If we set `ULEB128PadSize = 5` then the limit becomes 32GB and that should put us on the safe side for a considerable future (keeping in mind that this limit is for object files and not the final linked executable). (One could argue that symbols and label differences, effectively pushing the problem to the assembler, are the right way to go here and I would tend to agree with that but unfortunately that causes havoc in the existing DIE framework as it relies on being able to pre-compute sizes.) markus: > what happens when this assertion isn't met in a release compiler? The debug info would be…
		markusAuthorUnsubmitted Done Reply Inline Actions However by using llvm-link it is possible to put several CUs in the same module and that could push us closer to the 256MB limit. Actually forget that part. I see now that is irrelevant since the offset is within CU and not from start of .debug_info. I got confused looking at the output of readelf as that tool prints .debug_info address even though it is encoded as a CU offset. So given this I think the 256MB limit is perfectly reasonable. markus: > However by using llvm-link it is possible to put several CUs in the same module and that…
		aprantlUnsubmitted Not Done Reply Inline Actions (One could argue that symbols and label differences, effectively pushing the problem to the assembler, are the right way to go here and I would tend to agree with that but unfortunately that causes havoc in the existing DIE framework as it relies on being able to pre-compute sizes.) @dblaikie has dealt with similar problems in the past while refactoring AsmPrinter to support DWOs, perhaps he has some ideas? So given this I think the 256MB limit is perfectly reasonable. A good measure to decide this is to look at the size of an LTO build of Clang with all targets, which is usually a good proxy for what we can expect from large programs. aprantl: > (One could argue that symbols and label differences, effectively pushing the problem to the…
		aprantlUnsubmitted Not Done Reply Inline Actions When you say causes havoc in the existing DIE framework as it relies on being able to pre-compute sizes is the problem that we don't know the offsets of the location list entries ahead of time since they depend on the position of the referenced type DIEs and thus we don't know the size of the DW_AT_location attributes, or is the problem only within the location list section? aprantl: When you say > causes havoc in the existing DIE framework as it relies on being able to pre…
		markusAuthorUnsubmitted Done Reply Inline Actions is the problem that we don't know the offsets of the location list entries ahead of time since they depend on the position of the referenced type DIEs and thus we don't know the size of the DW_AT_location attributes, or is the problem only within the location list section? One of the problems is that as soon as we emit a `.uleb128` directive of a label difference we do not know that size of that and hence cannot compute the size of the block it is in without emitting more labels (begin and end for that block) and then do a label difference of that too. This is not how the DIE handling is currently designed (IIUC) as it relies on being able to compute the size of each DIE. This is especially a problem for the inlined DW_AT_location attributes. This is why having padded / fixed size ULEB128s, as we have in the current patch, makes things much easier. markus: > is the problem that we don't know the offsets of the location list entries ahead of time…
		markusAuthorUnsubmitted Done Reply Inline Actions (One could argue that symbols and label differences, effectively pushing the problem to the assembler, are the right way to go here and I would tend to agree with that but unfortunately that causes havoc in the existing DIE framework as it relies on being able to pre-compute sizes.) @dblaikie has dealt with similar problems in the past while refactoring AsmPrinter to support DWOs, perhaps he has some ideas? While I do think that solving this with label differences would be the right thing to do in general I do not think it is the right thing to do here as the changes become much larger than they need to be. So given this I think the 256MB limit is perfectly reasonable. A good measure to decide this is to look at the size of an LTO build of Clang with all targets, which is usually a good proxy for what we can expect from large programs. Yes, to clarify this. Since the base types that we generate are inserted immediately after the CU DIE we would need to insert a huge amount of these (>16 million I believe) to hit the 256MB limit. This should be quite impossible since we don't create entries for duplicate types and there simply aren't that many unique ones :) markus: >> (One could argue that symbols and label differences, effectively pushing the problem to the…
		aprantlUnsubmitted Not Done Reply Inline Actions So we only insert the special type DIEs into the very first DW_TAG_compile_unit? Then this is fine. It wasn't clear to me whether in the LTO case we'd inject the types into every CU. There's no need to do that, so if we don't then.we're good. aprantl: So we only insert the special type DIEs into the very first DW_TAG_compile_unit? Then this is…
		markusAuthorUnsubmitted Done Reply Inline Actions So we only insert the special type DIEs into the very first DW_TAG_compile_unit? Then this is fine. It wasn't clear to me whether in the LTO case we'd inject the types into every CU. There's no need to do that, so if we don't then.we're good. The special type DIEs are local to the CU that use them so they will not necessarily only be inserted into the first one but rather into the ones where they are used. This is not a problem though since according to the spec 'the operand is an unsigned LEB128 number that represents the offset of a debugging information entry in the current compilation unit' i.e. the offset is relative to the current CU so there is no chance that it will exceed the 256MB limit since they are inserted immediately after the DW_TAG_compile_unit DIE. markus: > So we only insert the special type DIEs into the very first DW_TAG_compile_unit? Then this is…
		aprantlUnsubmitted Not Done Reply Inline Actions If they are CU-relative I agree that should be perfectly safe, thanks. What happens if I llvm-link two CUs that both contain the same DIExpression. Is it possible for two location list entries to be uniques across compile units? I think the answer is no, right? aprantl: If they are CU-relative I agree that should be perfectly safe, thanks. What happens if I llvm…
		markusAuthorUnsubmitted Done Reply Inline Actions What happens if I llvm-link two CUs that both contain the same DIExpression. Is it possible for two location list entries to be uniques across compile units? I think the answer is no, right? Correct. The data structure is per CU so nothing happens across such units. markus: > What happens if I llvm-link two CUs that both contain the same DIExpression. Is it possible…
		}

		/// SizeOf - Determine size of label value in bytes.
		///
		unsigned DIEBaseTypeRef::SizeOf(const AsmPrinter *AP, dwarf::Form Form) const {
		aprantlUnsubmitted Not Done Reply Inline Actions ditto.. it should be on the declaration in the header file. aprantl: ditto.. it should be on the declaration in the header file.
		return ULEB128PadSize;
		}

		LLVM_DUMP_METHOD
		void DIEBaseTypeRef::print(raw_ostream &O) const { O << "Idx: " << Index; }

		//===----------------------------------------------------------------------===//
// DIEDelta Implementation		// DIEDelta Implementation
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

/// EmitValue - Emit delta value.		/// EmitValue - Emit delta value.
///		///
void DIEDelta::EmitValue(const AsmPrinter *AP, dwarf::Form Form) const {		void DIEDelta::EmitValue(const AsmPrinter *AP, dwarf::Form Form) const {
AP->EmitLabelDifference(LabelHi, LabelLo, SizeOf(AP, Form));		AP->EmitLabelDifference(LabelHi, LabelLo, SizeOf(AP, Form));
}		}
▲ Show 20 Lines • Show All 284 Lines • Show Last 20 Lines

lib/CodeGen/AsmPrinter/DIEHash.cpp

Show First 20 Lines • Show All 219 Lines • ▼ Show 20 Lines
}		}

// Hash the contents of a loclistptr class.		// Hash the contents of a loclistptr class.
void DIEHash::hashLocList(const DIELocList &LocList) {		void DIEHash::hashLocList(const DIELocList &LocList) {
HashingByteStreamer Streamer(*this);		HashingByteStreamer Streamer(*this);
DwarfDebug &DD = *AP->getDwarfDebug();		DwarfDebug &DD = *AP->getDwarfDebug();
const DebugLocStream &Locs = DD.getDebugLocs();		const DebugLocStream &Locs = DD.getDebugLocs();
for (const auto &Entry : Locs.getEntries(Locs.getList(LocList.getValue())))		for (const auto &Entry : Locs.getEntries(Locs.getList(LocList.getValue())))
DD.emitDebugLocEntry(Streamer, Entry);		DD.emitDebugLocEntry(Streamer, Entry, nullptr);
}		}

// Hash an individual attribute \param Attr based on the type of attribute and		// Hash an individual attribute \param Attr based on the type of attribute and
// the form.		// the form.
void DIEHash::hashAttribute(const DIEValue &Value, dwarf::Tag Tag) {		void DIEHash::hashAttribute(const DIEValue &Value, dwarf::Tag Tag) {
dwarf::Attribute Attribute = Value.getAttribute();		dwarf::Attribute Attribute = Value.getAttribute();

// Other attribute values use the letter 'A' as the marker, and the value		// Other attribute values use the letter 'A' as the marker, and the value
▲ Show 20 Lines • Show All 67 Lines • ▼ Show 20 Lines	if (Value.getType() == DIEValue::isBlock) {
// a bit of work and not add a lot of uniqueness		// a bit of work and not add a lot of uniqueness
// to the hash in some way we could test.		// to the hash in some way we could test.
hashLocList(Value.getDIELocList());		hashLocList(Value.getDIELocList());
}		}
break;		break;
// FIXME: It's uncertain whether or not we should handle this at the moment.		// FIXME: It's uncertain whether or not we should handle this at the moment.
case DIEValue::isExpr:		case DIEValue::isExpr:
case DIEValue::isLabel:		case DIEValue::isLabel:
		case DIEValue::isBaseTypeRef:
case DIEValue::isDelta:		case DIEValue::isDelta:
llvm_unreachable("Add support for additional value types.");		llvm_unreachable("Add support for additional value types.");
}		}
}		}

// Go through the attributes from \param Attrs in the order specified in 7.27.4		// Go through the attributes from \param Attrs in the order specified in 7.27.4
// and hash them.		// and hash them.
void DIEHash::hashAttributes(const DIEAttrs &Attrs, dwarf::Tag Tag) {		void DIEHash::hashAttributes(const DIEAttrs &Attrs, dwarf::Tag Tag) {
▲ Show 20 Lines • Show All 104 Lines • Show Last 20 Lines

lib/CodeGen/AsmPrinter/DebugLocEntry.h

Show First 20 Lines • Show All 142 Lines • ▼ Show 20 Lines	Values.erase(
std::unique(		std::unique(
Values.begin(), Values.end(), [](const Value &A, const Value &B) {		Values.begin(), Values.end(), [](const Value &A, const Value &B) {
return A.getExpression() == B.getExpression();		return A.getExpression() == B.getExpression();
}),		}),
Values.end());		Values.end());
}		}

/// Lower this entry into a DWARF expression.		/// Lower this entry into a DWARF expression.
void finalize(const AsmPrinter &AP, DebugLocStream::ListBuilder &List,		void finalize(const AsmPrinter &AP,
const DIBasicType *BT);		DebugLocStream::ListBuilder &List,
		const DIBasicType *BT,
		DwarfCompileUnit &TheCU);
};		};

/// Compare two Values for equality.		/// Compare two Values for equality.
inline bool operator==(const DebugLocEntry::Value &A,		inline bool operator==(const DebugLocEntry::Value &A,
const DebugLocEntry::Value &B) {		const DebugLocEntry::Value &B) {
if (A.EntryKind != B.EntryKind)		if (A.EntryKind != B.EntryKind)
return false;		return false;

Show All 26 Lines

lib/CodeGen/AsmPrinter/DwarfCompileUnit.h

Show First 20 Lines • Show All 118 Lines • ▼ Show 20 Lines	public:
void applyStmtList(DIE &D);		void applyStmtList(DIE &D);

/// A pair of GlobalVariable and DIExpression.		/// A pair of GlobalVariable and DIExpression.
struct GlobalExpr {		struct GlobalExpr {
const GlobalVariable *Var;		const GlobalVariable *Var;
const DIExpression *Expr;		const DIExpression *Expr;
};		};

		struct BaseTypeRef {
		BaseTypeRef(unsigned BitSize, dwarf::TypeKind Encoding) :
		BitSize(BitSize), Encoding(Encoding) {}
		unsigned BitSize;
		dwarf::TypeKind Encoding;
		DIE *Die = nullptr;
		};

		std::vector<BaseTypeRef> ExprRefedBaseTypes;

/// Get or create global variable DIE.		/// Get or create global variable DIE.
DIE *		DIE *
getOrCreateGlobalVariableDIE(const DIGlobalVariable *GV,		getOrCreateGlobalVariableDIE(const DIGlobalVariable *GV,
ArrayRef<GlobalExpr> GlobalExprs);		ArrayRef<GlobalExpr> GlobalExprs);

/// addLabelAddress - Add a dwarf label attribute data and value using		/// addLabelAddress - Add a dwarf label attribute data and value using
/// either DW_FORM_addr or DW_FORM_GNU_addr_index.		/// either DW_FORM_addr or DW_FORM_GNU_addr_index.
void addLabelAddress(DIE &Die, dwarf::Attribute Attribute,		void addLabelAddress(DIE &Die, dwarf::Attribute Attribute,
▲ Show 20 Lines • Show All 59 Lines • ▼ Show 20 Lines	public:
/// Construct a DIE for the given DbgLabel.		/// Construct a DIE for the given DbgLabel.
DIE *constructLabelDIE(DbgLabel &DL, const LexicalScope &Scope);		DIE *constructLabelDIE(DbgLabel &DL, const LexicalScope &Scope);

/// A helper function to create children of a Scope DIE.		/// A helper function to create children of a Scope DIE.
DIE createScopeChildrenDIE(LexicalScope Scope,		DIE createScopeChildrenDIE(LexicalScope Scope,
SmallVectorImpl<DIE *> &Children,		SmallVectorImpl<DIE *> &Children,
bool *HasNonScopeChildren = nullptr);		bool *HasNonScopeChildren = nullptr);

		void createBaseTypeDIEs();

/// Construct a DIE for this subprogram scope.		/// Construct a DIE for this subprogram scope.
DIE &constructSubprogramScopeDIE(const DISubprogram *Sub,		DIE &constructSubprogramScopeDIE(const DISubprogram *Sub,
LexicalScope *Scope);		LexicalScope *Scope);

DIE createAndAddScopeChildren(LexicalScope Scope, DIE &ScopeDIE);		DIE createAndAddScopeChildren(LexicalScope Scope, DIE &ScopeDIE);

void constructAbstractSubprogramScopeDIE(LexicalScope *Scope);		void constructAbstractSubprogramScopeDIE(LexicalScope *Scope);

▲ Show 20 Lines • Show All 98 Lines • ▼ Show 20 Lines	public:

void setBaseAddress(const MCSymbol *Base) { BaseAddress = Base; }		void setBaseAddress(const MCSymbol *Base) { BaseAddress = Base; }
const MCSymbol *getBaseAddress() const { return BaseAddress; }		const MCSymbol *getBaseAddress() const { return BaseAddress; }

uint64_t getDWOId() const { return DWOId; }		uint64_t getDWOId() const { return DWOId; }
void setDWOId(uint64_t DwoId) { DWOId = DwoId; }		void setDWOId(uint64_t DwoId) { DWOId = DwoId; }

bool hasDwarfPubSections() const;		bool hasDwarfPubSections() const;

		void addBaseTypeRef(DIEValueList &Die, int64_t Idx);
};		};

} // end namespace llvm		} // end namespace llvm

#endif // LLVM_LIB_CODEGEN_ASMPRINTER_DWARFCOMPILEUNIT_H		#endif // LLVM_LIB_CODEGEN_ASMPRINTER_DWARFCOMPILEUNIT_H

lib/CodeGen/AsmPrinter/DwarfCompileUnit.cpp

	Show First 20 Lines • Show All 1,179 Lines • ▼ Show 20 Lines
	void DwarfCompileUnit::addAddrTableBase() {			void DwarfCompileUnit::addAddrTableBase() {
	const TargetLoweringObjectFile &TLOF = Asm->getObjFileLowering();			const TargetLoweringObjectFile &TLOF = Asm->getObjFileLowering();
	MCSymbol *Label = DD->getAddressPool().getLabel();			MCSymbol *Label = DD->getAddressPool().getLabel();
	addSectionLabel(getUnitDie(),			addSectionLabel(getUnitDie(),
	getDwarfVersion() >= 5 ? dwarf::DW_AT_addr_base			getDwarfVersion() >= 5 ? dwarf::DW_AT_addr_base
	: dwarf::DW_AT_GNU_addr_base,			: dwarf::DW_AT_GNU_addr_base,
	Label, TLOF.getDwarfAddrSection()->getBeginSymbol());			Label, TLOF.getDwarfAddrSection()->getBeginSymbol());
	}			}

				void DwarfCompileUnit::addBaseTypeRef(DIEValueList &Die, int64_t Idx) {
				Die.addValue(DIEValueAllocator, (dwarf::Attribute)0, dwarf::DW_FORM_udata, DIEBaseTypeRef(this, Idx));
				}

				void DwarfCompileUnit::createBaseTypeDIEs() {
				for (std::vector<BaseTypeRef>::reverse_iterator
				I = ExprRefedBaseTypes.rbegin(), E = ExprRefedBaseTypes.rend();
				aprantlUnsubmitted Done Reply Inline Actions ... `so their offsets fit into the 5 bits reserved inside the location expressions.` aprantl: ... `so their offsets fit into the 5 bits reserved inside the location expressions.`
				I != E; ++I ) {
				auto &Btr = *I;
				aprantlUnsubmitted Done Reply Inline Actions `for (auto &I : reverse(ExprRefedBaseTypes))` aprantl: `for (auto &I : reverse(ExprRefedBaseTypes))`
				DIE &Die = getUnitDie().addChild(DIE::get(DIEValueAllocator, dwarf::DW_TAG_base_type), true);
				addString(Die, dwarf::DW_AT_name, "<internal type>");
				addUInt(Die, dwarf::DW_AT_encoding, dwarf::DW_FORM_data1, Btr.Encoding);
				addUInt(Die, dwarf::DW_AT_byte_size, None, Btr.BitSize / 8);

				Btr.Die = &Die;
				}
				}

lib/CodeGen/AsmPrinter/DwarfDebug.h

Show First 20 Lines • Show All 676 Lines • ▼ Show 20 Lines	public:
void setPrevCU(const DwarfCompileUnit *PrevCU) { this->PrevCU = PrevCU; }		void setPrevCU(const DwarfCompileUnit *PrevCU) { this->PrevCU = PrevCU; }

/// Returns the entries for the .debug_loc section.		/// Returns the entries for the .debug_loc section.
const DebugLocStream &getDebugLocs() const { return DebugLocs; }		const DebugLocStream &getDebugLocs() const { return DebugLocs; }

/// Emit an entry for the debug loc section. This can be used to		/// Emit an entry for the debug loc section. This can be used to
/// handle an entry that's going to be emitted into the debug loc section.		/// handle an entry that's going to be emitted into the debug loc section.
void emitDebugLocEntry(ByteStreamer &Streamer,		void emitDebugLocEntry(ByteStreamer &Streamer,
const DebugLocStream::Entry &Entry);		const DebugLocStream::Entry &Entry,
		const DwarfCompileUnit *CU);

/// Emit the location for a debug loc entry, including the size header.		/// Emit the location for a debug loc entry, including the size header.
void emitDebugLocEntryLocation(const DebugLocStream::Entry &Entry);		void emitDebugLocEntryLocation(const DebugLocStream::Entry &Entry,
		const DwarfCompileUnit *CU);

/// Find the MDNode for the given reference.		/// Find the MDNode for the given reference.
template <typename T> T *resolve(TypedDINodeRef<T> Ref) const {		template <typename T> T *resolve(TypedDINodeRef<T> Ref) const {
return Ref.resolve();		return Ref.resolve();
}		}

void addSubprogramNames(const DICompileUnit &CU, const DISubprogram *SP,		void addSubprogramNames(const DICompileUnit &CU, const DISubprogram *SP,
DIE &Die);		DIE &Die);
▲ Show 20 Lines • Show All 41 Lines • Show Last 20 Lines

lib/CodeGen/AsmPrinter/DwarfDebug.cpp

Show First 20 Lines • Show All 155 Lines • ▼ Show 20 Lines	DwarfLinkageNames("dwarf-linkage-names", cl::Hidden,
clEnumValN(AbstractLinkageNames, "Abstract",		clEnumValN(AbstractLinkageNames, "Abstract",
"Abstract subprograms")),		"Abstract subprograms")),
cl::init(DefaultLinkageNames));		cl::init(DefaultLinkageNames));

static const char *const DWARFGroupName = "dwarf";		static const char *const DWARFGroupName = "dwarf";
static const char *const DWARFGroupDescription = "DWARF Emission";		static const char *const DWARFGroupDescription = "DWARF Emission";
static const char *const DbgTimerName = "writer";		static const char *const DbgTimerName = "writer";
static const char *const DbgTimerDescription = "DWARF Debug Writer";		static const char *const DbgTimerDescription = "DWARF Debug Writer";
		static constexpr unsigned ULEB128PadSize = 4;

void DebugLocDwarfExpression::emitOp(uint8_t Op, const char *Comment) {		void DebugLocDwarfExpression::emitOp(uint8_t Op, const char *Comment) {
BS.EmitInt8(		BS.EmitInt8(
Op, Comment ? Twine(Comment) + " " + dwarf::OperationEncodingString(Op)		Op, Comment ? Twine(Comment) + " " + dwarf::OperationEncodingString(Op)
: dwarf::OperationEncodingString(Op));		: dwarf::OperationEncodingString(Op));
}		}

void DebugLocDwarfExpression::emitSigned(int64_t Value) {		void DebugLocDwarfExpression::emitSigned(int64_t Value) {
BS.EmitSLEB128(Value, Twine(Value));		BS.EmitSLEB128(Value, Twine(Value));
}		}

void DebugLocDwarfExpression::emitUnsigned(uint64_t Value) {		void DebugLocDwarfExpression::emitUnsigned(uint64_t Value) {
BS.EmitULEB128(Value, Twine(Value));		BS.EmitULEB128(Value, Twine(Value));
}		}

		void DebugLocDwarfExpression::emitBaseTypeRef(uint64_t Idx) {
		BS.EmitULEB128(Idx, Twine(Idx), ULEB128PadSize);
		}

bool DebugLocDwarfExpression::isFrameRegister(const TargetRegisterInfo &TRI,		bool DebugLocDwarfExpression::isFrameRegister(const TargetRegisterInfo &TRI,
unsigned MachineReg) {		unsigned MachineReg) {
// This information is not available while emitting .debug_loc entries.		// This information is not available while emitting .debug_loc entries.
return false;		return false;
}		}

bool DbgVariable::isBlockByrefVariable() const {		bool DbgVariable::isBlockByrefVariable() const {
assert(getVariable() && "Invalid complex DbgVariable!");		assert(getVariable() && "Invalid complex DbgVariable!");
▲ Show 20 Lines • Show All 748 Lines • ▼ Show 20 Lines	if (useSplitDwarf())
SkeletonHolder.computeSizeAndOffsets();		SkeletonHolder.computeSizeAndOffsets();
}		}

// Emit all Dwarf sections that should come after the content.		// Emit all Dwarf sections that should come after the content.
void DwarfDebug::endModule() {		void DwarfDebug::endModule() {
assert(CurFn == nullptr);		assert(CurFn == nullptr);
assert(CurMI == nullptr);		assert(CurMI == nullptr);

		for (const auto &P : CUMap) {
		markusAuthorUnsubmitted Done Reply Inline Actions If there were multiple Dwarf CUs in the same LLVM Module this would not work right. We need to emit base types for each DwarfCompileUnit but only those types that are used by DwarfExpressions in that unit. So appears to make sense to put the `MarkusNodes` inside the CU. markus: If there were multiple Dwarf CUs in the same LLVM Module this would not work right. We need to…
		aprantlUnsubmitted Not Done Reply Inline Actions Would creating a separate CU just for our basic types help in any way? aprantl: Would creating a separate CU just for our basic types help in any way?
		probinsonUnsubmitted Not Done Reply Inline Actions A separate CU for basic types would not be usable by DW_OP_convert? because it uses CU-relative offsets to find them. probinson: A separate CU for basic types would not be usable by DW_OP_convert? because it uses CU-relative…
		auto &CU = *P.second;
		CU.createBaseTypeDIEs();
		}

// If we aren't actually generating debug info (check beginModule -		// If we aren't actually generating debug info (check beginModule -
// conditionalized on !DisableDebugInfoPrinting and the presence of the		// conditionalized on !DisableDebugInfoPrinting and the presence of the
// llvm.dbg.cu metadata node)		// llvm.dbg.cu metadata node)
if (!MMI->hasDebugInfo())		if (!MMI->hasDebugInfo())
return;		return;

// Finalize the debug info for the module.		// Finalize the debug info for the module.
finalizeModuleInfo();		finalizeModuleInfo();
▲ Show 20 Lines • Show All 413 Lines • ▼ Show 20 Lines	for (const auto &I : DbgValues) {
// If the variable has a DIBasicType, extract it. Basic types cannot have		// If the variable has a DIBasicType, extract it. Basic types cannot have
// unique identifiers, so don't bother resolving the type with the		// unique identifiers, so don't bother resolving the type with the
// identifier map.		// identifier map.
const DIBasicType *BT = dyn_cast<DIBasicType>(		const DIBasicType *BT = dyn_cast<DIBasicType>(
static_cast<const Metadata *>(LocalVar->getType()));		static_cast<const Metadata *>(LocalVar->getType()));

// Finalize the entry by lowering it into a DWARF bytestream.		// Finalize the entry by lowering it into a DWARF bytestream.
for (auto &Entry : Entries)		for (auto &Entry : Entries)
Entry.finalize(*Asm, List, BT);		Entry.finalize(*Asm, List, BT, TheCU);
}		}

// For each InlinedEntity collected from DBG_LABEL instructions, convert to		// For each InlinedEntity collected from DBG_LABEL instructions, convert to
// DWARF-related DbgLabel.		// DWARF-related DbgLabel.
for (const auto &I : DbgLabels) {		for (const auto &I : DbgLabels) {
InlinedEntity IL = I.first;		InlinedEntity IL = I.first;
const MachineInstr *MI = I.second;		const MachineInstr *MI = I.second;
if (MI == nullptr)		if (MI == nullptr)
▲ Show 20 Lines • Show All 525 Lines • ▼ Show 20 Lines	if (useSegmentedStringOffsetsTable()) {
emitStringOffsetsTableHeader();		emitStringOffsetsTableHeader();
StringOffsetsSection = Asm->getObjFileLowering().getDwarfStrOffSection();		StringOffsetsSection = Asm->getObjFileLowering().getDwarfStrOffSection();
}		}
DwarfFile &Holder = useSplitDwarf() ? SkeletonHolder : InfoHolder;		DwarfFile &Holder = useSplitDwarf() ? SkeletonHolder : InfoHolder;
Holder.emitStrings(Asm->getObjFileLowering().getDwarfStrSection(),		Holder.emitStrings(Asm->getObjFileLowering().getDwarfStrSection(),
StringOffsetsSection, /* UseRelativeOffsets = */ true);		StringOffsetsSection, /* UseRelativeOffsets = */ true);
}		}

void DwarfDebug::emitDebugLocEntry(ByteStreamer &Streamer,		void DwarfDebug::emitDebugLocEntry(ByteStreamer &Streamer, const
const DebugLocStream::Entry &Entry) {		DebugLocStream::Entry &Entry,
		const DwarfCompileUnit *CU) {
auto &&Comments = DebugLocs.getComments(Entry);		auto &&Comments = DebugLocs.getComments(Entry);
auto Comment = Comments.begin();		auto Comment = Comments.begin();
auto End = Comments.end();		auto End = Comments.end();
for (uint8_t Byte : DebugLocs.getBytes(Entry))
		enum {Idle, SkipLEB128, BaseTypeFromULEB128} State = Idle;
		aprantlUnsubmitted Not Done Reply Inline Actions Remove the first word. aprantl: Remove the first word.
		uint64_t ULEB128Value;
		uint64_t ULEB128Shift;
		unsigned NumSkipLEB128s;
		for (uint8_t Byte : DebugLocs.getBytes(Entry)) {
		bool EmitByte = true;
		// XXX: This is admittedly pretty stupid but sadly appears to be the
		aprantlUnsubmitted Not Done Reply Inline Actions You might want to reword this comment to be more assertive :-) Is my understanding correct that we only need to do this here because the inline DW_AT_location (DW_FORM_block, ..) are emitted ahead of time and thus have the correct offsets injected from the get go? Could you please move this out into a `fixupLocEntryDIEReferences()` (or something) function? aprantl: You might want to reword this comment to be more assertive :-) Is my understanding correct…
		markusAuthorUnsubmitted Done Reply Inline Actions Is my understanding correct that we only need to do this here because the inline DW_AT_location (DW_FORM_block, ..) are emitted ahead of time and thus have the correct offsets injected from the get go? Sort of. When these are inserted in `DwarfExpression::addExpression` (for both location-lists and inlined) the offset of the base_type DIE is not known so we need to insert a placeholder. For the the location-list case the data structure is unfortunately a plain byte stream so we need this elaborate state machine to extract the placeholder here. Could you please move this out into a fixupLocEntryDIEReferences() (or something) function? Since this is a state machine and hence keeps state I think that putting it in a separate function would only make it messier. markus: > Is my understanding correct that we only need to do this here because the inline…
		// easiest way to pass custom values as the expressions are inserted into a
		aprantlUnsubmitted Not Done Reply Inline Actions This is to inject the reference to the basic type die, right? This code should probably be factored out into a relocate/fixupTypeRefs() helper function. I also assume that you need to apply the same fixup for the case of a single, non-debug_lo, inline DW_AT_location, right? The fact that the placeholder is encoded as a LEB128 sounds really dangerous. If we ever support any branching operations, it will mess with the offsets. Can we assume that the finalized DIE ref will always be a DW_OP_ref_addr or something with a fixed size? Could we make the placeholder the same fixed size, too? If that doesn't work, the right solution is probably to defer the emission of DwarfExpressions until here, which we could do in a separate, preparatory commit. aprantl: This is to inject the reference to the basic type die, right? This code should probably be…
		markusAuthorUnsubmitted Done Reply Inline Actions This is to inject the reference to the basic type die, right? Yes This code should probably be factored out into a relocate/fixupTypeRefs() helper function. I also assume that you need to apply the same fixup for the case of a single, non-debug_lo, inline DW_AT_location, right? Sounds reasonable. Not sure I could easily find where in the code the inline expressions are inserted though. If you could point at a file and line number that would be helpful. I guess another option would be to force these expressions (the ones containing a base type reference) to always end up in .debug_loc right? The fact that the placeholder is encoded as a LEB128 sounds really dangerous. If we ever support any branching operations, it will mess with the offsets. Can we assume that the finalized DIE ref will always be a DW_OP_ref_addr or something with a fixed size? Could we make the placeholder the same fixed size, too? The spec states that the finalized base type DIE offset is encoded as a ULEB128 so not much choice about that but the value we pick up here (the one inserted in `DwarfExpression::addExpression`) is just a index so we could certainly encode that in a fixed size integer. If branches were to be introduced at a later point I imagine that the branch target in the emitted dwarf would be a label (`MCSymbol`) but in the intermediate expression vector a simple offset would probably suffice. If that doesn't work, the right solution is probably to defer the emission of DwarfExpressions until here, which we could do in a separate, preparatory commit. I think that would be a good thing to do but unfortunately it seems far from easy to get rid of the stuff that is in between. markus: > This is to inject the reference to the basic type die, right? Yes > This code should probably…
		aprantlUnsubmitted Not Done Reply Inline Actions If you could point at a file and line number that would be helpful. git grep addBlock.DW_AT_location lib/CodeGen/AsmPrinter lib/CodeGen/AsmPrinter/DwarfCompileUnit.cpp: addBlock(VariableDIE, dwarf::DW_AT_location, DwarfExpr->finalize()); lib/CodeGen/AsmPrinter/DwarfCompileUnit.cpp: addBlock(VariableDie, dwarf::DW_AT_location, DwarfExpr.finalize()); lib/CodeGen/AsmPrinter/DwarfCompileUnit.cpp: addBlock(VariableDie, dwarf::DW_AT_location, DwarfExpr.finalize()); lib/CodeGen/AsmPrinter/DwarfUnit.cpp: addBlock(ParamDIE, dwarf::DW_AT_location, Loc); The spec states that the finalized base type DIE offset is encoded as a ULEB128 so not much choice about that but the value we pick up here (the one inserted in DwarfExpression::addExpression) is just a index so we could certainly encode that in a fixed size integer. I see. I didn't think about emitting branch targets a label differences, I thought we'd just hardcode the offsets. I guess we can defer this until it becomes an issue. Encoding the temporary die reference with a fixed size would probably still be a good idea, just to keep this code simpler. aprantl: > If you could point at a file and line number that would be helpful. ``` git grep addBlock.
		probinsonUnsubmitted Not Done Reply Inline Actions Re the ULEB as how to find the base_type DIEs, the unstated assumption is that the base_type DIEs would be emitted unconditionally at the top of the CU, so everyone can just use them as needed. If you want to emit base_types lazily... don't do that. Re branches, there are already branch operators, if that's what you're talking about (DW_OP_skip and _bra). probinson: Re the ULEB as how to find the base_type DIEs, the unstated assumption is that the base_type…
		dblaikieUnsubmitted Not Done Reply Inline Actions I think maybe you don't need to make that assumption - you can produce a label difference as a uleb (though that may get complicated). Yeah, we use it in debug_rnglists, for instance: .uleb128 .Lfunc_end0-.Lfunc_begin0 # length & even if I change that to be a label difference between labels that bound the uleb itself (ie: where the difference would vary depending on the number of bytes required for the uleb) clang still assembles it at least... :) dblaikie: I think maybe you don't need to make that assumption - you can produce a label difference as a…
		markusAuthorUnsubmitted Done Reply Inline Actions I think maybe you don't need to make that assumption - you can produce a label difference as a uleb (though that may get complicated). Yes, that is what I currently do. It looks like this .byte 168 # DW_OP_convert .uleb128 .Lbase_type0-.Lcu_begin0 & even if I change that to be a label difference between labels that bound the uleb itself (ie: where the difference would vary depending on the number of bytes required for the uleb) clang still assembles it at least... :) Yep, I thought about that too when I realized that I need to add some prototype support for our downstream assembler. Came to the conclusion that I could treat the ULEB128s as fixed size by sign/zero extending them, so that should simplify things a lot even though it is not space efficient... Either way it is not a problem for this review how the assembler solves it :) markus: > I think maybe you don't need to make that assumption - you can produce a label difference as…
		markusAuthorUnsubmitted Done Reply Inline Actions Re the ULEB as how to find the base_type DIEs, the unstated assumption is that the base_type DIEs would be emitted unconditionally at the top of the CU, so everyone can just use them as needed. If you want to emit base_types lazily... don't do that. I played with emitting the base_type DIEs directly after the CU header but since the size of that header varies and due to the phase ordering of how the debug info is emitted I still need label differences to be able to locate the base_type DIEs in a robust manner. Right now I would not say that they are emitted lazily but rather we find out which ones we need and then emit them in table form. Still need the labels though. Re branches, there are already branch operators, if that's what you're talking about (DW_OP_skip and _bra). Where do you see these branch operators being used? I can't find them. markus: > Re the ULEB as how to find the base_type DIEs, the unstated assumption is that the base_type…
		// byte stream rather early (see DwarfExpression::addExpression).
		switch (State) {
		case Idle:
		switch (Byte) {
		default:
		if (dwarf::DW_OP_breg0 <= Byte && Byte <= dwarf::DW_OP_breg0 + 31) {
		aprantlUnsubmitted Not Done Reply Inline Actions Nice. Should we do the same thing as in dsymutil here and check for Encoding::BaseTypeRef? aprantl: Nice. Should we do the same thing as in dsymutil here and check for Encoding::BaseTypeRef?
		aprantlUnsubmitted Not Done Reply Inline Actions Could you please still factor this into a static fixupBaseTypeRefs(or something along those lines) function for better readability? aprantl: Could you please still factor this into a static fixupBaseTypeRefs(or something along those…
		markusAuthorUnsubmitted Done Reply Inline Actions Sure, but I don't understand where to place the cut to improve readability. I.e. what should go into `fixupBaseTypeRefs` and what should remain in `emitDebugLocEntry`? markus: Sure, but I don't understand where to place the cut to improve readability. I.e. what should go…
		NumSkipLEB128s = 1;
		State = SkipLEB128;
		} else if ((dwarf::DW_OP_lit0 <= Byte && Byte <= dwarf::DW_OP_lit0 + 31) \|\|
		aprantlUnsubmitted Not Done Reply Inline Actions Please add an assert that fails if the opcode is `DW_OP_const_type` as it is not supported by this loop. aprantl: Please add an assert that fails if the opcode is `DW_OP_const_type` as it is not supported by…
		(dwarf::DW_OP_reg0 <= Byte && Byte <= dwarf::DW_OP_reg0 + 31)) {
		// Do nothing.
		aprantlUnsubmitted Not Done Reply Inline Actions Where is this getting copied? aprantl: Where is this getting copied?
		markusAuthorUnsubmitted Done Reply Inline Actions It is not getting copied at all, 'Encoding::SizeNA` indicates that the operation does not have an operand in this slot. Or maybe I am not understanding the question? markus: It is not getting copied at all, 'Encoding::SizeNA` indicates that the operation does not have…
		aprantlUnsubmitted Not Done Reply Inline Actions My bad, I was thinking about a non-base-type operand, but it is actually handled in the else branch! aprantl: My bad, I was thinking about a non-base-type operand, but it is actually handled in the else…
		aprantlUnsubmitted Not Done Reply Inline Actions On second thought, I think we can leave it as is, too. aprantl: On second thought, I think we can leave it as is, too.
		} else {
		llvm_unreachable("unhandled opcode found in expression");
		}
		break;
		// Ops with 0 arguments.
		case dwarf::DW_OP_and:
		case dwarf::DW_OP_deref:
		case dwarf::DW_OP_div:
		case dwarf::DW_OP_dup:
		case dwarf::DW_OP_minus:
		case dwarf::DW_OP_mod:
		case dwarf::DW_OP_mul:
		case dwarf::DW_OP_not:
		case dwarf::DW_OP_or:
		case dwarf::DW_OP_plus:
		case dwarf::DW_OP_shl:
		case dwarf::DW_OP_shr:
		case dwarf::DW_OP_shra:
		case dwarf::DW_OP_stack_value:
		case dwarf::DW_OP_swap:
		case dwarf::DW_OP_xderef:
		case dwarf::DW_OP_xor:
		// Do nothing.
		break;
		// Ops with 2 [SU]LEB128 arguments.
		case dwarf::DW_OP_bit_piece:
		case dwarf::DW_OP_bregx:
		NumSkipLEB128s = 2;
		aprantlUnsubmitted Not Done Reply Inline Actions Didn't your dwarfdump patch have this info in an enum or am I confusing things? aprantl: Didn't your dwarfdump patch have this info in an enum or am I confusing things?
		markusAuthorUnsubmitted Done Reply Inline Actions `DWARFExpression` does contain similar information but it is private there. Not sure if refactoring would be worthwhile. markus: ` DWARFExpression` does contain similar information but it is private there. Not sure if…
		State = SkipLEB128;
		break;
		// Ops with 1 [SU]LEB128 arguments.
		case dwarf::DW_OP_consts:
		case dwarf::DW_OP_constu:
		case dwarf::DW_OP_fbreg:
		case dwarf::DW_OP_piece:
		case dwarf::DW_OP_plus_uconst:
		case dwarf::DW_OP_regx:
		NumSkipLEB128s = 1;
		State = SkipLEB128;
		break;
		case dwarf::DW_OP_convert:
		ULEB128Value = 0;
		ULEB128Shift = 0;
		State = BaseTypeFromULEB128;
		break;
		}
		break;

		case SkipLEB128:
		if (!(Byte & 0x80) && --NumSkipLEB128s == 0)
		State = Idle;
		break;
		case BaseTypeFromULEB128:
		EmitByte = false;
		ULEB128Value \|= (Byte & 0x7f) << ULEB128Shift;
		if (Byte & 0x80)
		ULEB128Shift += 7;
		else {
		if (CU)
		Asm->EmitULEB128(CU->ExprRefedBaseTypes[ULEB128Value].Die->getOffset(), nullptr, ULEB128PadSize);
		else
		// Emit a reference to the 'generic type'.
		Asm->EmitULEB128(0);
		State = Idle;
		}
		if (Comment != End)
		Comment++;
		break;
		}

		if (EmitByte)
Streamer.EmitInt8(Byte, Comment != End ? *(Comment++) : "");		Streamer.EmitInt8(Byte, Comment != End ? *(Comment++) : "");
}		}
		}

static void emitDebugLocValue(const AsmPrinter &AP, const DIBasicType *BT,		static void emitDebugLocValue(const AsmPrinter &AP, const DIBasicType *BT,
const DebugLocEntry::Value &Value,		const DebugLocEntry::Value &Value,
DwarfExpression &DwarfExpr) {		DwarfExpression &DwarfExpr) {
auto *DIExpr = Value.getExpression();		auto *DIExpr = Value.getExpression();
DIExpressionCursor ExprCursor(DIExpr);		DIExpressionCursor ExprCursor(DIExpr);
DwarfExpr.addFragmentOffset(DIExpr);		DwarfExpr.addFragmentOffset(DIExpr);
// Regular entry.		// Regular entry.
Show All 16 Lines	if (Value.isInt()) {
APInt RawBytes = Value.getConstantFP()->getValueAPF().bitcastToAPInt();		APInt RawBytes = Value.getConstantFP()->getValueAPF().bitcastToAPInt();
DwarfExpr.addUnsignedConstant(RawBytes);		DwarfExpr.addUnsignedConstant(RawBytes);
}		}
DwarfExpr.addExpression(std::move(ExprCursor));		DwarfExpr.addExpression(std::move(ExprCursor));
}		}

void DebugLocEntry::finalize(const AsmPrinter &AP,		void DebugLocEntry::finalize(const AsmPrinter &AP,
DebugLocStream::ListBuilder &List,		DebugLocStream::ListBuilder &List,
const DIBasicType *BT) {		const DIBasicType *BT,
		DwarfCompileUnit &TheCU) {
assert(Begin != End && "unexpected location list entry with empty range");		assert(Begin != End && "unexpected location list entry with empty range");
DebugLocStream::EntryBuilder Entry(List, Begin, End);		DebugLocStream::EntryBuilder Entry(List, Begin, End);
BufferByteStreamer Streamer = Entry.getStreamer();		BufferByteStreamer Streamer = Entry.getStreamer();
DebugLocDwarfExpression DwarfExpr(AP.getDwarfVersion(), Streamer);		DebugLocDwarfExpression DwarfExpr(AP.getDwarfVersion(), Streamer, TheCU);
const DebugLocEntry::Value &Value = Values[0];		const DebugLocEntry::Value &Value = Values[0];
if (Value.isFragment()) {		if (Value.isFragment()) {
// Emit all fragments that belong to the same variable and range.		// Emit all fragments that belong to the same variable and range.
assert(llvm::all_of(Values, [](DebugLocEntry::Value P) {		assert(llvm::all_of(Values, [](DebugLocEntry::Value P) {
return P.isFragment();		return P.isFragment();
}) && "all values are expected to be fragments");		}) && "all values are expected to be fragments");
assert(std::is_sorted(Values.begin(), Values.end()) &&		assert(std::is_sorted(Values.begin(), Values.end()) &&
"fragments are expected to be sorted");		"fragments are expected to be sorted");

for (auto Fragment : Values)		for (auto Fragment : Values)
emitDebugLocValue(AP, BT, Fragment, DwarfExpr);		emitDebugLocValue(AP, BT, Fragment, DwarfExpr);

} else {		} else {
assert(Values.size() == 1 && "only fragments may have >1 value");		assert(Values.size() == 1 && "only fragments may have >1 value");
emitDebugLocValue(AP, BT, Value, DwarfExpr);		emitDebugLocValue(AP, BT, Value, DwarfExpr);
}		}
DwarfExpr.finalize();		DwarfExpr.finalize();
}		}

void DwarfDebug::emitDebugLocEntryLocation(const DebugLocStream::Entry &Entry) {		void DwarfDebug::emitDebugLocEntryLocation(const DebugLocStream::Entry &Entry,
		const DwarfCompileUnit *CU) {
// Emit the size.		// Emit the size.
Asm->OutStreamer->AddComment("Loc expr size");		Asm->OutStreamer->AddComment("Loc expr size");
if (getDwarfVersion() >= 5)		if (getDwarfVersion() >= 5)
Asm->EmitULEB128(DebugLocs.getBytes(Entry).size());		Asm->EmitULEB128(DebugLocs.getBytes(Entry).size());
else		else
Asm->emitInt16(DebugLocs.getBytes(Entry).size());		Asm->emitInt16(DebugLocs.getBytes(Entry).size());
// Emit the entry.		// Emit the entry.
APByteStreamer Streamer(*Asm);		APByteStreamer Streamer(*Asm);
emitDebugLocEntry(Streamer, Entry);		emitDebugLocEntry(Streamer, Entry, CU);
}		}

// Emit the common part of the DWARF 5 range/locations list tables header.		// Emit the common part of the DWARF 5 range/locations list tables header.
static void emitListsTableHeaderStart(AsmPrinter *Asm, const DwarfFile &Holder,		static void emitListsTableHeaderStart(AsmPrinter *Asm, const DwarfFile &Holder,
MCSymbol *TableStart,		MCSymbol *TableStart,
MCSymbol *TableEnd) {		MCSymbol *TableEnd) {
// Build the table header, which starts with the length field.		// Build the table header, which starts with the length field.
Asm->OutStreamer->AddComment("Length");		Asm->OutStreamer->AddComment("Length");
▲ Show 20 Lines • Show All 83 Lines • ▼ Show 20 Lines	for (const auto &Entry : DebugLocs.getEntries(List)) {
Asm->EmitLabelDifferenceAsULEB128(Entry.BeginSym, Base);		Asm->EmitLabelDifferenceAsULEB128(Entry.BeginSym, Base);
Asm->OutStreamer->AddComment(" ending offset");		Asm->OutStreamer->AddComment(" ending offset");
Asm->EmitLabelDifferenceAsULEB128(Entry.EndSym, Base);		Asm->EmitLabelDifferenceAsULEB128(Entry.EndSym, Base);
} else {		} else {
Asm->EmitLabelDifference(Entry.BeginSym, Base, Size);		Asm->EmitLabelDifference(Entry.BeginSym, Base, Size);
Asm->EmitLabelDifference(Entry.EndSym, Base, Size);		Asm->EmitLabelDifference(Entry.EndSym, Base, Size);
}		}

emitDebugLocEntryLocation(Entry);		emitDebugLocEntryLocation(Entry, CU);
continue;		continue;
}		}

// We have no base address.		// We have no base address.
if (IsLocLists) {		if (IsLocLists) {
// TODO: Use DW_LLE_base_addressx + DW_LLE_offset_pair, or		// TODO: Use DW_LLE_base_addressx + DW_LLE_offset_pair, or
// DW_LLE_startx_length in case if there is only a single range.		// DW_LLE_startx_length in case if there is only a single range.
// That should reduce the size of the debug data emited.		// That should reduce the size of the debug data emited.
// For now just use the DW_LLE_startx_length for all cases.		// For now just use the DW_LLE_startx_length for all cases.
Asm->OutStreamer->AddComment("DW_LLE_startx_length");		Asm->OutStreamer->AddComment("DW_LLE_startx_length");
Asm->emitInt8(dwarf::DW_LLE_startx_length);		Asm->emitInt8(dwarf::DW_LLE_startx_length);
Asm->OutStreamer->AddComment(" start idx");		Asm->OutStreamer->AddComment(" start idx");
Asm->EmitULEB128(AddrPool.getIndex(Entry.BeginSym));		Asm->EmitULEB128(AddrPool.getIndex(Entry.BeginSym));
Asm->OutStreamer->AddComment(" length");		Asm->OutStreamer->AddComment(" length");
Asm->EmitLabelDifferenceAsULEB128(Entry.EndSym, Entry.BeginSym);		Asm->EmitLabelDifferenceAsULEB128(Entry.EndSym, Entry.BeginSym);
} else {		} else {
Asm->OutStreamer->EmitSymbolValue(Entry.BeginSym, Size);		Asm->OutStreamer->EmitSymbolValue(Entry.BeginSym, Size);
Asm->OutStreamer->EmitSymbolValue(Entry.EndSym, Size);		Asm->OutStreamer->EmitSymbolValue(Entry.EndSym, Size);
}		}

emitDebugLocEntryLocation(Entry);		emitDebugLocEntryLocation(Entry, CU);
}		}

if (IsLocLists) {		if (IsLocLists) {
// .debug_loclists section ends with DW_LLE_end_of_list.		// .debug_loclists section ends with DW_LLE_end_of_list.
Asm->OutStreamer->AddComment("DW_LLE_end_of_list");		Asm->OutStreamer->AddComment("DW_LLE_end_of_list");
Asm->OutStreamer->EmitIntValue(dwarf::DW_LLE_end_of_list, 1);		Asm->OutStreamer->EmitIntValue(dwarf::DW_LLE_end_of_list, 1);
} else {		} else {
// Terminate the .debug_loc list with two 0 values.		// Terminate the .debug_loc list with two 0 values.
Show All 19 Lines	for (const auto &Entry : DebugLocs.getEntries(List)) {
// * as of October 2018, at least		// * as of October 2018, at least
// Ideally/in v5, this could use SectionLabels to reuse existing addresses		// Ideally/in v5, this could use SectionLabels to reuse existing addresses
// in the address pool to minimize object size/relocations.		// in the address pool to minimize object size/relocations.
Asm->emitInt8(dwarf::DW_LLE_startx_length);		Asm->emitInt8(dwarf::DW_LLE_startx_length);
unsigned idx = AddrPool.getIndex(Entry.BeginSym);		unsigned idx = AddrPool.getIndex(Entry.BeginSym);
Asm->EmitULEB128(idx);		Asm->EmitULEB128(idx);
Asm->EmitLabelDifference(Entry.EndSym, Entry.BeginSym, 4);		Asm->EmitLabelDifference(Entry.EndSym, Entry.BeginSym, 4);

emitDebugLocEntryLocation(Entry);		emitDebugLocEntryLocation(Entry, List.CU);
		markusAuthorUnsubmitted Done Reply Inline Actions For DWO how do we find the label into the corresponding `.debug_info` and how do we emit the base types? Would the code in DwarfDebug.cpp:946 work? I guess that I need to create some test cases for DWO. markus: For DWO how do we find the label into the corresponding `.debug_info` and how do we emit the…
}		}
Asm->emitInt8(dwarf::DW_LLE_end_of_list);		Asm->emitInt8(dwarf::DW_LLE_end_of_list);
}		}
}		}

struct ArangeSpan {		struct ArangeSpan {
const MCSymbol Start, End;		const MCSymbol Start, End;
};		};
▲ Show 20 Lines • Show All 626 Lines • Show Last 20 Lines

lib/CodeGen/AsmPrinter/DwarfExpression.h

Show All 21 Lines
#include <cstdint>		#include <cstdint>
#include <iterator>		#include <iterator>

namespace llvm {		namespace llvm {

class AsmPrinter;		class AsmPrinter;
class APInt;		class APInt;
class ByteStreamer;		class ByteStreamer;
class DwarfUnit;		class DwarfCompileUnit;
class DIELoc;		class DIELoc;
class TargetRegisterInfo;		class TargetRegisterInfo;

/// Holds a DIExpression and keeps track of how many operands have been consumed		/// Holds a DIExpression and keeps track of how many operands have been consumed
/// so far.		/// so far.
class DIExpressionCursor {		class DIExpressionCursor {
DIExpression::expr_op_iterator Start, End;		DIExpression::expr_op_iterator Start, End;

▲ Show 20 Lines • Show All 60 Lines • ▼ Show 20 Lines
protected:		protected:
/// Holds information about all subregisters comprising a register location.		/// Holds information about all subregisters comprising a register location.
struct Register {		struct Register {
int DwarfRegNo;		int DwarfRegNo;
unsigned Size;		unsigned Size;
const char *Comment;		const char *Comment;
};		};

		DwarfCompileUnit &CU;

/// The register location, if any.		/// The register location, if any.
SmallVector<Register, 2> DwarfRegs;		SmallVector<Register, 2> DwarfRegs;

/// Current Fragment Offset in Bits.		/// Current Fragment Offset in Bits.
uint64_t OffsetInBits = 0;		uint64_t OffsetInBits = 0;
unsigned DwarfVersion;		unsigned DwarfVersion;

/// Sometimes we need to add a DW_OP_bit_piece to describe a subregister.		/// Sometimes we need to add a DW_OP_bit_piece to describe a subregister.
Show All 17 Lines	protected:
virtual void emitOp(uint8_t Op, const char *Comment = nullptr) = 0;		virtual void emitOp(uint8_t Op, const char *Comment = nullptr) = 0;

/// Emit a raw signed value.		/// Emit a raw signed value.
virtual void emitSigned(int64_t Value) = 0;		virtual void emitSigned(int64_t Value) = 0;

/// Emit a raw unsigned value.		/// Emit a raw unsigned value.
virtual void emitUnsigned(uint64_t Value) = 0;		virtual void emitUnsigned(uint64_t Value) = 0;

		virtual void emitBaseTypeRef(uint64_t Idx) = 0;

/// Emit a normalized unsigned constant.		/// Emit a normalized unsigned constant.
void emitConstu(uint64_t Value);		void emitConstu(uint64_t Value);

/// Return whether the given machine register is the frame register in the		/// Return whether the given machine register is the frame register in the
/// current function.		/// current function.
virtual bool isFrameRegister(const TargetRegisterInfo &TRI, unsigned MachineReg) = 0;		virtual bool isFrameRegister(const TargetRegisterInfo &TRI, unsigned MachineReg) = 0;

/// Emit a DW_OP_reg operation. Note that this is only legal inside a DWARF		/// Emit a DW_OP_reg operation. Note that this is only legal inside a DWARF
▲ Show 20 Lines • Show All 46 Lines • ▼ Show 20 Lines	protected:
/// constant value, so the producers and consumers started to rely on		/// constant value, so the producers and consumers started to rely on
/// heuristics to disambiguate the value vs. location status of the		/// heuristics to disambiguate the value vs. location status of the
/// expression. See PR21176 for more details.		/// expression. See PR21176 for more details.
void addStackValue();		void addStackValue();

~DwarfExpression() = default;		~DwarfExpression() = default;

public:		public:
DwarfExpression(unsigned DwarfVersion) : DwarfVersion(DwarfVersion) {}		DwarfExpression(unsigned DwarfVersion, DwarfCompileUnit &CU)
		: CU(CU), DwarfVersion(DwarfVersion) {}

/// This needs to be called last to commit any pending changes.		/// This needs to be called last to commit any pending changes.
void finalize();		void finalize();

/// Emit a signed constant.		/// Emit a signed constant.
void addSignedConstant(int64_t Value);		void addSignedConstant(int64_t Value);

/// Emit an unsigned constant.		/// Emit an unsigned constant.
Show All 40 Lines

/// DwarfExpression implementation for .debug_loc entries.		/// DwarfExpression implementation for .debug_loc entries.
class DebugLocDwarfExpression final : public DwarfExpression {		class DebugLocDwarfExpression final : public DwarfExpression {
ByteStreamer &BS;		ByteStreamer &BS;

void emitOp(uint8_t Op, const char *Comment = nullptr) override;		void emitOp(uint8_t Op, const char *Comment = nullptr) override;
void emitSigned(int64_t Value) override;		void emitSigned(int64_t Value) override;
void emitUnsigned(uint64_t Value) override;		void emitUnsigned(uint64_t Value) override;
		void emitBaseTypeRef(uint64_t Idx) override;
bool isFrameRegister(const TargetRegisterInfo &TRI,		bool isFrameRegister(const TargetRegisterInfo &TRI,
unsigned MachineReg) override;		unsigned MachineReg) override;

public:		public:
DebugLocDwarfExpression(unsigned DwarfVersion, ByteStreamer &BS)		DebugLocDwarfExpression(unsigned DwarfVersion, ByteStreamer &BS, DwarfCompileUnit &CU)
: DwarfExpression(DwarfVersion), BS(BS) {}		: DwarfExpression(DwarfVersion, CU), BS(BS) {}
};		};

/// DwarfExpression implementation for singular DW_AT_location.		/// DwarfExpression implementation for singular DW_AT_location.
class DIEDwarfExpression final : public DwarfExpression {		class DIEDwarfExpression final : public DwarfExpression {
const AsmPrinter &AP;		const AsmPrinter &AP;
DwarfUnit &DU;
DIELoc &DIE;		DIELoc &DIE;

void emitOp(uint8_t Op, const char *Comment = nullptr) override;		void emitOp(uint8_t Op, const char *Comment = nullptr) override;
void emitSigned(int64_t Value) override;		void emitSigned(int64_t Value) override;
void emitUnsigned(uint64_t Value) override;		void emitUnsigned(uint64_t Value) override;
		void emitBaseTypeRef(uint64_t Idx) override;
bool isFrameRegister(const TargetRegisterInfo &TRI,		bool isFrameRegister(const TargetRegisterInfo &TRI,
unsigned MachineReg) override;		unsigned MachineReg) override;
public:		public:
DIEDwarfExpression(const AsmPrinter &AP, DwarfUnit &DU, DIELoc &DIE);		DIEDwarfExpression(const AsmPrinter &AP, DwarfCompileUnit &CU, DIELoc &DIE);

DIELoc *finalize() {		DIELoc *finalize() {
DwarfExpression::finalize();		DwarfExpression::finalize();
return &DIE;		return &DIE;
}		}
};		};

} // end namespace llvm		} // end namespace llvm

#endif // LLVM_LIB_CODEGEN_ASMPRINTER_DWARFEXPRESSION_H		#endif // LLVM_LIB_CODEGEN_ASMPRINTER_DWARFEXPRESSION_H

lib/CodeGen/AsmPrinter/DwarfExpression.cpp

//===- llvm/CodeGen/DwarfExpression.cpp - Dwarf Debug Framework -----------===//		//===- llvm/CodeGen/DwarfExpression.cpp - Dwarf Debug Framework -----------===//
//		//
// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.		// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
// See https://llvm.org/LICENSE.txt for license information.		// See https://llvm.org/LICENSE.txt for license information.
// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception		// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
//		//
// This file contains support for writing dwarf debug info into asm files.		// This file contains support for writing dwarf debug info into asm files.
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#include "DwarfExpression.h"		#include "DwarfExpression.h"
		#include "DwarfCompileUnit.h"
#include "llvm/ADT/APInt.h"		#include "llvm/ADT/APInt.h"
#include "llvm/ADT/SmallBitVector.h"		#include "llvm/ADT/SmallBitVector.h"
#include "llvm/BinaryFormat/Dwarf.h"		#include "llvm/BinaryFormat/Dwarf.h"
#include "llvm/CodeGen/TargetRegisterInfo.h"		#include "llvm/CodeGen/TargetRegisterInfo.h"
#include "llvm/IR/DebugInfoMetadata.h"		#include "llvm/IR/DebugInfoMetadata.h"
#include "llvm/Support/ErrorHandling.h"		#include "llvm/Support/ErrorHandling.h"
#include <algorithm>		#include <algorithm>
#include <cassert>		#include <cassert>
#include <cstdint>		#include <cstdint>

using namespace llvm;		using namespace llvm;

void DwarfExpression::emitConstu(uint64_t Value) {		void DwarfExpression::emitConstu(uint64_t Value) {
if (Value < 32)		if (Value < 32)
emitOp(dwarf::DW_OP_lit0 + Value);		emitOp(dwarf::DW_OP_lit0 + Value);
		markusAuthorUnsubmitted Done Reply Inline Actions I think that we need one of these per DwarfCompileUnit. markus: I think that we need one of these per DwarfCompileUnit.
		aprantlUnsubmitted Done Reply Inline Actions Why is this needed? Shouldn't we key off the debugger tuning flag? aprantl: Why is this needed? Shouldn't we key off the debugger tuning flag?
		probinsonUnsubmitted Not Done Reply Inline Actions The debugger-tuning design is that we use it only in the DwarfDebug ctor to set other control flags, which can all be influenced independently. Tuning and its associated flags are not per-CU, although DWARF version is. probinson: The debugger-tuning design is that we use it only in the DwarfDebug ctor to set other control…
else if (Value == std::numeric_limits<uint64_t>::max()) {		else if (Value == std::numeric_limits<uint64_t>::max()) {
// Only do this for 64-bit values as the DWARF expression stack uses		// Only do this for 64-bit values as the DWARF expression stack uses
// target-address-size values.		// target-address-size values.
emitOp(dwarf::DW_OP_lit0);		emitOp(dwarf::DW_OP_lit0);
emitOp(dwarf::DW_OP_not);		emitOp(dwarf::DW_OP_not);
} else {		} else {
emitOp(dwarf::DW_OP_constu);		emitOp(dwarf::DW_OP_constu);
emitUnsigned(Value);		emitUnsigned(Value);
▲ Show 20 Lines • Show All 342 Lines • ▼ Show 20 Lines	case dwarf::DW_OP_deref:
LocationKind = Memory;		LocationKind = Memory;
else		else
emitOp(dwarf::DW_OP_deref);		emitOp(dwarf::DW_OP_deref);
break;		break;
case dwarf::DW_OP_constu:		case dwarf::DW_OP_constu:
assert(LocationKind != Register);		assert(LocationKind != Register);
emitConstu(Op->getArg(0));		emitConstu(Op->getArg(0));
break;		break;
		case dwarf::DW_OP_LLVM_convert: {
		emitOp(dwarf::DW_OP_convert);
		// XXX: Simply emit the index into the raw byte stream as ULEB128,
		// DwarfDebug::emitDebugLocEntry has been fitted with means to extract it
		// later.
		probinsonUnsubmitted Not Done Reply Inline Actions Pre-v5 needs to emit the GNU op, not the standard op. probinson: Pre-v5 needs to emit the GNU op, not the standard op.
		emitBaseTypeRef(CU.ExprRefedBaseTypes.size());
		CU.ExprRefedBaseTypes.emplace_back(Op->getArg(0),
		markusAuthorUnsubmitted Done Reply Inline Actions A `DwarfExpression` should always know which CU it belongs to right? markus: A `DwarfExpression` should always know which CU it belongs to right?
		static_cast<dwarf::TypeKind>(Op->getArg(1)));
		aprantlUnsubmitted Done Reply Inline Actions Can you add a comment that explains what happens in the non-location-list cases? aprantl: Can you add a comment that explains what happens in the non-location-list cases?
		break;
		aprantlUnsubmitted Not Done Reply Inline Actions Why not use a SmallDenseSet for CU.ExprRefedBaseTypes? aprantl: Why not use a SmallDenseSet for CU.ExprRefedBaseTypes?
		markusAuthorUnsubmitted Done Reply Inline Actions I don't think that would work as we rely on being able to index into it. markus: I don't think that would work as we rely on being able to index into it.
		}
case dwarf::DW_OP_stack_value:		case dwarf::DW_OP_stack_value:
LocationKind = Implicit;		LocationKind = Implicit;
break;		break;
case dwarf::DW_OP_swap:		case dwarf::DW_OP_swap:
assert(LocationKind != Register);		assert(LocationKind != Register);
emitOp(dwarf::DW_OP_swap);		emitOp(dwarf::DW_OP_swap);
break;		break;
case dwarf::DW_OP_xderef:		case dwarf::DW_OP_xderef:
▲ Show 20 Lines • Show All 44 Lines • Show Last 20 Lines

lib/CodeGen/AsmPrinter/DwarfFile.h

Show First 20 Lines • Show All 141 Lines • ▼ Show 20 Lines	public:
/// Add a unit to the list of CUs.		/// Add a unit to the list of CUs.
void addUnit(std::unique_ptr<DwarfCompileUnit> U);		void addUnit(std::unique_ptr<DwarfCompileUnit> U);

/// Emit all of the units to the section listed with the given		/// Emit all of the units to the section listed with the given
/// abbreviation section.		/// abbreviation section.
void emitUnits(bool UseOffsets);		void emitUnits(bool UseOffsets);

/// Emit the given unit to its section.		/// Emit the given unit to its section.
void emitUnit(DwarfUnit *U, bool UseOffsets);		void emitUnit(DwarfUnit *TheU, bool UseOffsets);

/// Emit a set of abbreviations to the specific section.		/// Emit a set of abbreviations to the specific section.
void emitAbbrevs(MCSection *);		void emitAbbrevs(MCSection *);

/// Emit all of the strings to the section given. If OffsetSection is		/// Emit all of the strings to the section given. If OffsetSection is
/// non-null, emit a table of string offsets to it. If UseRelativeOffsets		/// non-null, emit a table of string offsets to it. If UseRelativeOffsets
/// is false, emit absolute offsets to the strings. Otherwise, emit		/// is false, emit absolute offsets to the strings. Otherwise, emit
/// relocatable references to the strings if they are supported by the target.		/// relocatable references to the strings if they are supported by the target.
▲ Show 20 Lines • Show All 48 Lines • Show Last 20 Lines

lib/CodeGen/AsmPrinter/DwarfUnit.cpp

	Show All 40 Lines
	#include <cstdint>			#include <cstdint>
	#include <string>			#include <string>
	#include <utility>			#include <utility>

	using namespace llvm;			using namespace llvm;

	#define DEBUG_TYPE "dwarfdebug"			#define DEBUG_TYPE "dwarfdebug"

	DIEDwarfExpression::DIEDwarfExpression(const AsmPrinter &AP, DwarfUnit &DU,			DIEDwarfExpression::DIEDwarfExpression(const AsmPrinter &AP,
				DwarfCompileUnit &CU,
	DIELoc &DIE)			DIELoc &DIE)
	: DwarfExpression(AP.getDwarfVersion()), AP(AP), DU(DU),			: DwarfExpression(AP.getDwarfVersion(), CU), AP(AP),
	DIE(DIE) {}			DIE(DIE) {}

	void DIEDwarfExpression::emitOp(uint8_t Op, const char* Comment) {			void DIEDwarfExpression::emitOp(uint8_t Op, const char* Comment) {
	DU.addUInt(DIE, dwarf::DW_FORM_data1, Op);			CU.addUInt(DIE, dwarf::DW_FORM_data1, Op);
	}			}

	void DIEDwarfExpression::emitSigned(int64_t Value) {			void DIEDwarfExpression::emitSigned(int64_t Value) {
	DU.addSInt(DIE, dwarf::DW_FORM_sdata, Value);			CU.addSInt(DIE, dwarf::DW_FORM_sdata, Value);
	}			}

	void DIEDwarfExpression::emitUnsigned(uint64_t Value) {			void DIEDwarfExpression::emitUnsigned(uint64_t Value) {
	DU.addUInt(DIE, dwarf::DW_FORM_udata, Value);			CU.addUInt(DIE, dwarf::DW_FORM_udata, Value);
				}

				void DIEDwarfExpression::emitBaseTypeRef(uint64_t Idx) {
				CU.addBaseTypeRef(DIE, Idx);
	}			}

	bool DIEDwarfExpression::isFrameRegister(const TargetRegisterInfo &TRI,			bool DIEDwarfExpression::isFrameRegister(const TargetRegisterInfo &TRI,
	unsigned MachineReg) {			unsigned MachineReg) {
	return MachineReg == TRI.getFrameRegister(*AP.MF);			return MachineReg == TRI.getFrameRegister(*AP.MF);
	}			}

	DwarfUnit::DwarfUnit(dwarf::Tag UnitTag, const DICompileUnit *Node,			DwarfUnit::DwarfUnit(dwarf::Tag UnitTag, const DICompileUnit *Node,
	▲ Show 20 Lines • Show All 1,601 Lines • Show Last 20 Lines

lib/IR/AsmWriter.cpp

Show First 20 Lines • Show All 2,117 Lines • ▼ Show 20 Lines	static void writeDIExpression(raw_ostream &Out, const DIExpression *N,
Out << "!DIExpression(";		Out << "!DIExpression(";
FieldSeparator FS;		FieldSeparator FS;
if (N->isValid()) {		if (N->isValid()) {
for (auto I = N->expr_op_begin(), E = N->expr_op_end(); I != E; ++I) {		for (auto I = N->expr_op_begin(), E = N->expr_op_end(); I != E; ++I) {
auto OpStr = dwarf::OperationEncodingString(I->getOp());		auto OpStr = dwarf::OperationEncodingString(I->getOp());
assert(!OpStr.empty() && "Expected valid opcode");		assert(!OpStr.empty() && "Expected valid opcode");

Out << FS << OpStr;		Out << FS << OpStr;
		if (I->getOp() == dwarf::DW_OP_LLVM_convert) {
		Out << FS << I->getArg(0);
		Out << FS << dwarf::AttributeEncodingString(I->getArg(1));
		} else {
for (unsigned A = 0, AE = I->getNumArgs(); A != AE; ++A)		for (unsigned A = 0, AE = I->getNumArgs(); A != AE; ++A)
Out << FS << I->getArg(A);		Out << FS << I->getArg(A);
}		}
		}
} else {		} else {
for (const auto &I : N->getElements())		for (const auto &I : N->getElements())
Out << FS << I;		Out << FS << I;
}		}
Out << ")";		Out << ")";
}		}

static void writeDIGlobalVariableExpression(raw_ostream &Out,		static void writeDIGlobalVariableExpression(raw_ostream &Out,
▲ Show 20 Lines • Show All 2,239 Lines • Show Last 20 Lines

lib/IR/DebugInfoMetadata.cpp

Show First 20 Lines • Show All 807 Lines • ▼ Show 20 Lines	DIExpression *DIExpression::getImpl(LLVMContext &Context,
ArrayRef<uint64_t> Elements,		ArrayRef<uint64_t> Elements,
StorageType Storage, bool ShouldCreate) {		StorageType Storage, bool ShouldCreate) {
DEFINE_GETIMPL_LOOKUP(DIExpression, (Elements));		DEFINE_GETIMPL_LOOKUP(DIExpression, (Elements));
DEFINE_GETIMPL_STORE_NO_OPS(DIExpression, (Elements));		DEFINE_GETIMPL_STORE_NO_OPS(DIExpression, (Elements));
}		}

unsigned DIExpression::ExprOperand::getSize() const {		unsigned DIExpression::ExprOperand::getSize() const {
switch (getOp()) {		switch (getOp()) {
		case dwarf::DW_OP_LLVM_convert:
case dwarf::DW_OP_LLVM_fragment:		case dwarf::DW_OP_LLVM_fragment:
return 3;		return 3;
case dwarf::DW_OP_constu:		case dwarf::DW_OP_constu:
case dwarf::DW_OP_plus_uconst:		case dwarf::DW_OP_plus_uconst:
return 2;		return 2;
default:		default:
return 1;		return 1;
}		}
Show All 28 Lines	case dwarf::DW_OP_swap: {
// that keeps track of the stack depth and introduce something like a		// that keeps track of the stack depth and introduce something like a
// DW_LLVM_OP_implicit_location as a placeholder for the location this		// DW_LLVM_OP_implicit_location as a placeholder for the location this
// DIExpression is attached to, or else pass the number of implicit stack		// DIExpression is attached to, or else pass the number of implicit stack
// elements into isValid.		// elements into isValid.
if (getNumElements() == 1)		if (getNumElements() == 1)
return false;		return false;
break;		break;
}		}
		case dwarf::DW_OP_LLVM_convert:
case dwarf::DW_OP_constu:		case dwarf::DW_OP_constu:
case dwarf::DW_OP_plus_uconst:		case dwarf::DW_OP_plus_uconst:
case dwarf::DW_OP_plus:		case dwarf::DW_OP_plus:
case dwarf::DW_OP_minus:		case dwarf::DW_OP_minus:
case dwarf::DW_OP_mul:		case dwarf::DW_OP_mul:
case dwarf::DW_OP_div:		case dwarf::DW_OP_div:
case dwarf::DW_OP_mod:		case dwarf::DW_OP_mod:
case dwarf::DW_OP_or:		case dwarf::DW_OP_or:
▲ Show 20 Lines • Show All 275 Lines • Show Last 20 Lines

lib/MC/MCStreamer.cpp

Show First 20 Lines • Show All 129 Lines • ▼ Show 20 Lines	for (unsigned i = 0; i != Size; ++i) {
unsigned index = isLittleEndian ? i : (Size - i - 1);		unsigned index = isLittleEndian ? i : (Size - i - 1);
buf[i] = uint8_t(Value >> (index * 8));		buf[i] = uint8_t(Value >> (index * 8));
}		}
EmitBytes(StringRef(buf, Size));		EmitBytes(StringRef(buf, Size));
}		}

/// EmitULEB128IntValue - Special case of EmitULEB128Value that avoids the		/// EmitULEB128IntValue - Special case of EmitULEB128Value that avoids the
/// client having to pass in a MCExpr for constant integers.		/// client having to pass in a MCExpr for constant integers.
void MCStreamer::EmitULEB128IntValue(uint64_t Value) {		void MCStreamer::EmitULEB128IntValue(uint64_t Value, unsigned PadTo) {
SmallString<128> Tmp;		SmallString<128> Tmp;
raw_svector_ostream OSE(Tmp);		raw_svector_ostream OSE(Tmp);
encodeULEB128(Value, OSE);		encodeULEB128(Value, OSE, PadTo);
EmitBytes(OSE.str());		EmitBytes(OSE.str());
}		}

/// EmitSLEB128IntValue - Special case of EmitSLEB128Value that avoids the		/// EmitSLEB128IntValue - Special case of EmitSLEB128Value that avoids the
/// client having to pass in a MCExpr for constant integers.		/// client having to pass in a MCExpr for constant integers.
void MCStreamer::EmitSLEB128IntValue(int64_t Value) {		void MCStreamer::EmitSLEB128IntValue(int64_t Value) {
SmallString<128> Tmp;		SmallString<128> Tmp;
raw_svector_ostream OSE(Tmp);		raw_svector_ostream OSE(Tmp);
▲ Show 20 Lines • Show All 943 Lines • Show Last 20 Lines

lib/Target/BPF/MCTargetDesc/BPFAsmBackend.cpp

Show First 20 Lines • Show All 72 Lines • ▼ Show 20 Lines	void BPFAsmBackend::applyFixup(const MCAssembler &Asm, const MCFixup &Fixup,
const MCSubtargetInfo *STI) const {		const MCSubtargetInfo *STI) const {
if (Fixup.getKind() == FK_SecRel_4 \|\| Fixup.getKind() == FK_SecRel_8) {		if (Fixup.getKind() == FK_SecRel_4 \|\| Fixup.getKind() == FK_SecRel_8) {
if (Value) {		if (Value) {
MCContext &Ctx = Asm.getContext();		MCContext &Ctx = Asm.getContext();
Ctx.reportError(Fixup.getLoc(),		Ctx.reportError(Fixup.getLoc(),
"Unsupported relocation: try to compile with -O2 or above, "		"Unsupported relocation: try to compile with -O2 or above, "
"or check your static variable usage");		"or check your static variable usage");
}		}
		} else if (Fixup.getKind() == FK_Data_2) {
		support::endian::write<uint16_t>(&Data[Fixup.getOffset()], Value, Endian);
} else if (Fixup.getKind() == FK_Data_4) {		} else if (Fixup.getKind() == FK_Data_4) {
support::endian::write<uint32_t>(&Data[Fixup.getOffset()], Value, Endian);		support::endian::write<uint32_t>(&Data[Fixup.getOffset()], Value, Endian);
} else if (Fixup.getKind() == FK_Data_8) {		} else if (Fixup.getKind() == FK_Data_8) {
support::endian::write<uint64_t>(&Data[Fixup.getOffset()], Value, Endian);		support::endian::write<uint64_t>(&Data[Fixup.getOffset()], Value, Endian);
} else if (Fixup.getKind() == FK_PCRel_4) {		} else if (Fixup.getKind() == FK_PCRel_4) {
Value = (uint32_t)((Value - 8) / 8);		Value = (uint32_t)((Value - 8) / 8);
if (Endian == support::little) {		if (Endian == support::little) {
Data[Fixup.getOffset() + 1] = 0x10;		Data[Fixup.getOffset() + 1] = 0x10;
Show All 31 Lines

lib/Transforms/Utils/Local.cpp

Show First 20 Lines • Show All 1,845 Lines • ▼ Show 20 Lines	if (FromTy->isIntegerTy() && ToTy->isIntegerTy()) {

// When the width of the result grows, assume that a debugger will only		// When the width of the result grows, assume that a debugger will only
// access the low `FromBits` bits when inspecting the source variable.		// access the low `FromBits` bits when inspecting the source variable.
if (FromBits < ToBits)		if (FromBits < ToBits)
return rewriteDebugUsers(From, To, DomPoint, DT, Identity);		return rewriteDebugUsers(From, To, DomPoint, DT, Identity);

// The width of the result has shrunk. Use sign/zero extension to describe		// The width of the result has shrunk. Use sign/zero extension to describe
// the source variable's high bits.		// the source variable's high bits.
auto SignOrZeroExt = [&](DbgVariableIntrinsic &DII) -> DbgValReplacement {		auto SignOrZeroExt = [&](DbgVariableIntrinsic &DII) -> DbgValReplacement {
		aprantlUnsubmitted Not Done Reply Inline Actions I think we should make a public helper function to emit z/sext operations for a DIExpression. Either as a member of DIExpression or as a freestanding function in Local.h. I'm sure this will come in handy elsewhere. aprantl: I think we should make a public helper function to emit z/sext operations for a DIExpression.
DILocalVariable *Var = DII.getVariable();		DILocalVariable *Var = DII.getVariable();

// Without knowing signedness, sign/zero extension isn't possible.		// Without knowing signedness, sign/zero extension isn't possible.
auto Signedness = Var->getSignedness();		auto Signedness = Var->getSignedness();
if (!Signedness)		if (!Signedness)
return None;		return None;

bool Signed = *Signedness == DIBasicType::Signedness::Signed;		bool Signed = *Signedness == DIBasicType::Signedness::Signed;
		dwarf::TypeKind TK = Signed ? dwarf::DW_ATE_signed : dwarf::DW_ATE_unsigned;
		aprantlUnsubmitted Not Done Reply Inline Actions nit: `.` at the end. aprantl: nit: `.` at the end.
if (!Signed) {		SmallVector<uint64_t, 8> Ops({dwarf::DW_OP_LLVM_convert, ToBits, TK,
		aprantlUnsubmitted Not Done Reply Inline Actions I haven't had any coffee yet, but shouldn't that be `FromBits` and From ?: 00001110 >> 4-1 * ~0 << 4 \| 00001110 1 * ~0 << 4 \| 00001110 11111111 << 4 \| 00001110 11110000 \| 00001110 11111110 aprantl: I haven't had any coffee yet, but shouldn't that be `FromBits` and From ?: ``` 00001110 >> 4-1…
		bjopeUnsubmitted Not Done Reply Inline Actions This method is replacing one dbg use with another. So "from" is the old value and "to" is the new value. Here we replace a old large value (e.g. i32) by a new smaller value (e.g. i16). So we sign extend from `ToBits` to `FromBits` to convert the new value back into something that represents the old value in the debugger. I think I needed both coffee and lunch to understand that we extend from `To` to `From` here. bjope: This method is replacing one dbg use with another. So "from" is the old value and "to" is the…
// In the unsigned case, assume that a debugger will initialize the		dwarf::DW_OP_LLVM_convert, FromBits, TK});
		bjopeUnsubmitted Not Done Reply Inline Actions I guess this still is wrong, at least if we end up with a DWARF location description for a memory location. If for example the variable is 32-bits, and we want to describe it using a 16-bit value, then we will get something like this: call void @llvm.dbg.value(metadata i16 %value, metadata "!variable", metadata DIExpression(DW_OP_dup, DW_OP_constu, 15, DW_OP_shr, DW_OP_lit0, DW_OP_not, DW_OP_mul, DW_OP_constu, 16, DW_OP_shl, DW_OP_or, DW_OP_stack_value) In llc this will become a DBG_VALUE. The value could either end up referring to a 16-bit register, or it could refer to a 16-bit stack slot (e.g. if this is an input argument passed on the stack). In the latter case we typically end up prepending the DIExpression with DW_OP_fbreg and an offset. The memory location will point to the 16-bit value. But we do not really express that the debugger should read a 16-bit value here, right? The debugger will only see that the variable is 32-bits, so it will read 32-bits, right? For a little endian target we will get garbage in bits 16-31 (since we read outside the 16-bit stack slot). For a big endian target we will get the wanted value in bits 16-31 and garbage in bits 0-15. Either way, the result would be wrong. For little endian we would need to clear bit 16-31 before the OR with the sign-extension mask. For big endian we aren't even operating on the correct bits. I'm not really sure what happens if the debugger finds a 16-bit register location for the 32-bit variable. Do we know that it only us reading 16-bits to the value stack? One solution could be to use DW_OP_deref_size when reading from memory, to specify that we only want to read 16 bits. I'm not sure exactly how DwarfExpression could know when this is needed. I guess we can not add the DW_OP_deref_size already here, because it would be wrong in case of ending up with a register location. But maybe we still need to do something more also for the register location scenario when using this approach. An alternative solution is to describe the variable using two dbg.value intrinsics. One using a fragment for bits 0-15, and another one using a fragment expression for bits 16-31. I guess it would look something like this: call void @llvm.dbg.value(metadata i16 %value, metadata "!variable", metadata DIExpression(DW_OP_LLVM_fragment 0, 16) call void @llvm.dbg.value(metadata i16 %value, metadata "!variable", metadata DIExpression(DW_OP_constu, 15, DW_OP_shr, DW_OP_lit0, DW_OP_not, DW_OP_mul, DW_OP_LLVM_fragment 16, 16) I've seen the discussion about DW_OP_convert. Would DW_OP_convert help in telling the debugger that any derefs should be 16 bits in this case. Then I guess that still would be good for DWARF5. Similar problem as described above also exists for the zext case below. At least for big endian when dereferencing memory, since we get the wrong value in the least significant bits when reading 32 bits from a 16-bit stack slot. bjope: I guess this still is wrong, at least if we end up with a DWARF location description for a…
		markusAuthorUnsubmitted Done Reply Inline Actions Unfortunately I run into problems when I try having arbitrarily sized fragments ( https://bugs.llvm.org/show_bug.cgi?id=40462 ) so I have to make them byte sized here which produces quite a lot of them ... markus: Unfortunately I run into problems when I try having arbitrarily sized fragments ( https://bugs.
		markusAuthorUnsubmitted Done Reply Inline Actions Why is this the case? I was expecting that since I use TrackingMDRef the nodes would not be considered to have zero uses and as a result be deleted until the trackers let go. markus: Why is this the case? I was expecting that since I use TrackingMDRef the nodes would not be…
// high bits to 0 and do a no-op conversion.
return Identity(DII);
} else {
// In the signed case, the high bits are given by sign extension, i.e:
// (To >> (ToBits - 1)) * ((2 ^ FromBits) - 1)
// Calculate the high bits and OR them together with the low bits.
SmallVector<uint64_t, 8> Ops({dwarf::DW_OP_dup, dwarf::DW_OP_constu,
(ToBits - 1), dwarf::DW_OP_shr,
dwarf::DW_OP_lit0, dwarf::DW_OP_not,
dwarf::DW_OP_mul, dwarf::DW_OP_or});
return DIExpression::appendToStack(DII.getExpression(), Ops);		return DIExpression::appendToStack(DII.getExpression(), Ops);
}
};		};
return rewriteDebugUsers(From, To, DomPoint, DT, SignOrZeroExt);		return rewriteDebugUsers(From, To, DomPoint, DT, SignOrZeroExt);
}		}

		bjopeUnsubmitted Not Done Reply Inline Actions I still have trouble to understand how/if helps the debugger (or llc) to know how many bits to dereference. I assume that llc needs to prepend a DW_OP_deref_type in case of a memory location, or DW_OP_regval_type in case of a register location to get the correct type on the expression stack for the original value. And then we only need one DW_OP_convert to convert to the final type. In case we think that it is ok to dereference more bits than `ToBits` (i.e. the smaller size), then we need to adjust the address to take care of endianess somewhere. If we go down this road, then I think we need some hack to get DW_OP_deref_type/DW_OP_regval_type in place first. Or what do you think? bjope: I still have trouble to understand how/if helps the debugger (or llc) to know how many bits to…
		markusAuthorUnsubmitted Done Reply Inline Actions I still have trouble to understand how/if helps the debugger (or llc) to know how many bits to dereference. Ok, lets try to clarify that then so that we all can agree here. I assume that llc needs to prepend a DW_OP_deref_type in case of a memory location, or DW_OP_regval_type in case of a register location to get the correct type on the expression stack for the original value. And then we only need one DW_OP_convert to convert to the final type. Yes, down the road that would be ideal as I see it. In the meantime we could do with using two DW_OP_convert ops in sequence as in this patch. In case we think that it is ok to dereference more bits than ToBits (i.e. the smaller size), then we need to adjust the address to take care of endianess somewhere. I think that we immediately need to start using DW_OP_deref_size instead of DW_OP_deref to cover the endianess effects, but this is really a separate bug/issue. Modifying the address seems a less desirable way to achieve this. If we go down this road, then I think we need some hack to get DW_OP_deref_type/DW_OP_regval_type in place first. Or what do you think? I think that we can start using two DW_OP_convert in sequence and then treat the DW_OP_deref_size as a separate issue and finally DW_OP_deref_type and DW_OP_regval_type as long term goals. Makes sense? markus: > I still have trouble to understand how/if helps the debugger (or llc) to know how many bits…
		bjopeUnsubmitted Not Done Reply Inline Actions I think that we can start using two DW_OP_convert in sequence and then treat the DW_OP_deref_size as a separate issue and finally DW_OP_deref_type and DW_OP_regval_type as long term goals. Makes sense? Makes a little sense at least. I guess it all depends on if this is supposed to be "complete" or just a partial solution. If we go for the latter then you need to update the description (and probably also add some code comments here mentioning that this still gives faulty result in certain situations). Extra credits for adding test cases that show that we still do wrong in some situations. At least it seems like we agree that for a "complete" solution we need to express how many bits that should be dereferenced instead of the first DW_OP_convert. With this patch we still present wrong values in the debugger sometimes (at least for big endian platforms, right?). bjope: > I think that we can start using two DW_OP_convert in sequence and then treat the…
// TODO: Floating-point conversions, vectors.		// TODO: Floating-point conversions, vectors.
return false;		return false;
		aprantlUnsubmitted Not Done Reply Inline Actions General question: Should we generate a DWARF 5 type conversion here and lower it to this sequence for DWARF 4 and lower in DwarfExpression.cpp to save memory? aprantl: General question: Should we generate a DWARF 5 type conversion here and lower it to this…
}		}

unsigned llvm::removeAllNonTerminatorAndEHPadInstructions(BasicBlock *BB) {		unsigned llvm::removeAllNonTerminatorAndEHPadInstructions(BasicBlock *BB) {
unsigned NumDeadInst = 0;		unsigned NumDeadInst = 0;
// Delete the instructions backwards, as it has a reduced likelihood of		// Delete the instructions backwards, as it has a reduced likelihood of
// having to update as many def-use and use-def chains.		// having to update as many def-use and use-def chains.
Instruction *EndInst = BB->getTerminator(); // Last not to be deleted.		Instruction *EndInst = BB->getTerminator(); // Last not to be deleted.
while (EndInst != &BB->front()) {		while (EndInst != &BB->front()) {
▲ Show 20 Lines • Show All 1,006 Lines • Show Last 20 Lines

test/Assembler/diexpression.ll

	; RUN: llvm-as < %s \| llvm-dis \| llvm-as \| llvm-dis \| FileCheck %s			; RUN: llvm-as < %s \| llvm-dis \| llvm-as \| llvm-dis \| FileCheck %s
	; RUN: verify-uselistorder %s			; RUN: verify-uselistorder %s

	; CHECK: !named = !{			; CHECK: !named = !{
	; CHECK-SAME: !DIExpression(),			; CHECK-SAME: !DIExpression(),
	; CHECK-SAME: !DIExpression(DW_OP_deref),			; CHECK-SAME: !DIExpression(DW_OP_deref),
	; CHECK-SAME: !DIExpression(DW_OP_constu, 3, DW_OP_plus),			; CHECK-SAME: !DIExpression(DW_OP_constu, 3, DW_OP_plus),
	; CHECK-SAME: !DIExpression(DW_OP_LLVM_fragment, 3, 7),			; CHECK-SAME: !DIExpression(DW_OP_LLVM_fragment, 3, 7),
	; CHECK-SAME: !DIExpression(DW_OP_deref, DW_OP_plus_uconst, 3, DW_OP_LLVM_fragment, 3, 7),			; CHECK-SAME: !DIExpression(DW_OP_deref, DW_OP_plus_uconst, 3, DW_OP_LLVM_fragment, 3, 7),
	; CHECK-SAME: !DIExpression(DW_OP_constu, 2, DW_OP_swap, DW_OP_xderef),			; CHECK-SAME: !DIExpression(DW_OP_constu, 2, DW_OP_swap, DW_OP_xderef),
	; CHECK-SAME: !DIExpression(DW_OP_plus_uconst, 3)}			; CHECK-SAME: !DIExpression(DW_OP_plus_uconst, 3)
				; CHECK-SAME: !DIExpression(DW_OP_LLVM_convert, 16, DW_ATE_unsigned, DW_OP_LLVM_convert, 32, DW_ATE_signed)}

	!named = !{!0, !1, !2, !3, !4, !5, !6}			!named = !{!0, !1, !2, !3, !4, !5, !6, !7}

	!0 = !DIExpression()			!0 = !DIExpression()
	!1 = !DIExpression(DW_OP_deref)			!1 = !DIExpression(DW_OP_deref)
	!2 = !DIExpression(DW_OP_constu, 3, DW_OP_plus)			!2 = !DIExpression(DW_OP_constu, 3, DW_OP_plus)
	!3 = !DIExpression(DW_OP_LLVM_fragment, 3, 7)			!3 = !DIExpression(DW_OP_LLVM_fragment, 3, 7)
	!4 = !DIExpression(DW_OP_deref, DW_OP_plus_uconst, 3, DW_OP_LLVM_fragment, 3, 7)			!4 = !DIExpression(DW_OP_deref, DW_OP_plus_uconst, 3, DW_OP_LLVM_fragment, 3, 7)
	!5 = !DIExpression(DW_OP_constu, 2, DW_OP_swap, DW_OP_xderef)			!5 = !DIExpression(DW_OP_constu, 2, DW_OP_swap, DW_OP_xderef)
	!6 = !DIExpression(DW_OP_plus_uconst, 3)			!6 = !DIExpression(DW_OP_plus_uconst, 3)
				!7 = !DIExpression(DW_OP_LLVM_convert, 16, DW_ATE_unsigned, DW_OP_LLVM_convert, 32, DW_ATE_signed)

test/Transforms/InstCombine/cast-set-preserve-signed-dbg-val.ll

	; RUN: opt -instcombine -S < %s \| FileCheck %s			; RUN: opt -instcombine -S < %s \| FileCheck %s

	; CHECK-LABEL: define {{.*}} @test5			; CHECK-LABEL: define {{.*}} @test5
	define i16 @test5(i16 %A) !dbg !34 {			define i16 @test5(i16 %A) !dbg !34 {
	; CHECK: [[and:%.*]] = and i16 %A, 15			; CHECK: [[and:%.*]] = and i16 %A, 15

	%B = sext i16 %A to i32, !dbg !40			%B = sext i16 %A to i32, !dbg !40
	call void @llvm.dbg.value(metadata i32 %B, metadata !36, metadata !DIExpression()), !dbg !40			call void @llvm.dbg.value(metadata i32 %B, metadata !36, metadata !DIExpression()), !dbg !40

	%C = and i32 %B, 15, !dbg !41			%C = and i32 %B, 15, !dbg !41
	call void @llvm.dbg.value(metadata i32 %C, metadata !37, metadata !DIExpression()), !dbg !41			call void @llvm.dbg.value(metadata i32 %C, metadata !37, metadata !DIExpression()), !dbg !41

	; Preserve the dbg.value for the DCE'd 32-bit 'and'.			; Preserve the dbg.value for the DCE'd 32-bit 'and'.
	;			;
	; The high 16 bits of the original 'and' require sign-extending the new 16-bit and:			; The high 16 bits of the original 'and' require sign-extending the new 16-bit and:
	; CHECK-NEXT: call void @llvm.dbg.value(metadata i16 [[and]], metadata [[C:![0-9]+]],			; CHECK-NEXT: call void @llvm.dbg.value(metadata i16 [[and]], metadata [[C:![0-9]+]],
	; CHECK-SAME: metadata !DIExpression(DW_OP_dup, DW_OP_constu, 15, DW_OP_shr, DW_OP_lit0, DW_OP_not, DW_OP_mul, DW_OP_or, DW_OP_stack_value)			; CHECK-SAME: metadata !DIExpression(DW_OP_LLVM_convert, 16, DW_ATE_signed, DW_OP_LLVM_convert, 32, DW_ATE_signed, DW_OP_stack_value)

	%D = trunc i32 %C to i16, !dbg !42			%D = trunc i32 %C to i16, !dbg !42
	call void @llvm.dbg.value(metadata i16 %D, metadata !38, metadata !DIExpression()), !dbg !42			call void @llvm.dbg.value(metadata i16 %D, metadata !38, metadata !DIExpression()), !dbg !42

	; The dbg.value for a truncate should simply point to the result of the 16-bit 'and'.			; The dbg.value for a truncate should simply point to the result of the 16-bit 'and'.
	; CHECK-NEXT: call void @llvm.dbg.value(metadata i16 [[and]], metadata [[D:![0-9]+]], metadata !DIExpression())			; CHECK-NEXT: call void @llvm.dbg.value(metadata i16 [[and]], metadata [[D:![0-9]+]], metadata !DIExpression())

	ret i16 %D, !dbg !43			ret i16 %D, !dbg !43
	Show All 25 Lines

unittests/Transforms/Utils/LocalTest.cpp

Show First 20 Lines • Show All 782 Lines • ▼ Show 20 Lines	TEST(Local, ReplaceAllDbgUsesWith) {
auto hasADbgVal = [&](ArrayRef<uint64_t> Ops) {		auto hasADbgVal = [&](ArrayRef<uint64_t> Ops) {
return any_of(ADbgVals, [&](DbgValueInst *DVI) {		return any_of(ADbgVals, [&](DbgValueInst *DVI) {
assert(DVI->getVariable()->getName() == "2");		assert(DVI->getVariable()->getName() == "2");
return DVI->getExpression()->getElements() == Ops;		return DVI->getExpression()->getElements() == Ops;
});		});
};		};

// Case 1: The original expr is empty, so no deref is needed.		// Case 1: The original expr is empty, so no deref is needed.
EXPECT_TRUE(hasADbgVal({DW_OP_dup, DW_OP_constu, 31, DW_OP_shr, DW_OP_lit0,		EXPECT_TRUE(hasADbgVal({DW_OP_LLVM_convert, 32, DW_ATE_signed,
DW_OP_not, DW_OP_mul, DW_OP_or, DW_OP_stack_value}));		DW_OP_LLVM_convert, 64, DW_ATE_signed,
		DW_OP_stack_value}));

// Case 2: Perform an address calculation with the original expr, deref it,		// Case 2: Perform an address calculation with the original expr, deref it,
// then sign-extend the result.		// then sign-extend the result.
EXPECT_TRUE(hasADbgVal({DW_OP_lit0, DW_OP_mul, DW_OP_deref, DW_OP_dup,		EXPECT_TRUE(hasADbgVal({DW_OP_lit0, DW_OP_mul, DW_OP_deref,
DW_OP_constu, 31, DW_OP_shr, DW_OP_lit0, DW_OP_not,		DW_OP_LLVM_convert, 32, DW_ATE_signed,
DW_OP_mul, DW_OP_or, DW_OP_stack_value}));		DW_OP_LLVM_convert, 64, DW_ATE_signed,
		DW_OP_stack_value}));

// Case 3: Insert the sign-extension logic before the DW_OP_stack_value.		// Case 3: Insert the sign-extension logic before the DW_OP_stack_value.
EXPECT_TRUE(hasADbgVal({DW_OP_lit0, DW_OP_mul, DW_OP_dup, DW_OP_constu, 31,		EXPECT_TRUE(hasADbgVal({DW_OP_lit0, DW_OP_mul, DW_OP_LLVM_convert, 32,
DW_OP_shr, DW_OP_lit0, DW_OP_not, DW_OP_mul, DW_OP_or,		DW_ATE_signed, DW_OP_LLVM_convert, 64, DW_ATE_signed,
DW_OP_stack_value}));		DW_OP_stack_value}));

// Cases 4-6: Just like cases 1-3, but preserve the fragment at the end.		// Cases 4-6: Just like cases 1-3, but preserve the fragment at the end.
EXPECT_TRUE(hasADbgVal({DW_OP_dup, DW_OP_constu, 31, DW_OP_shr, DW_OP_lit0,		EXPECT_TRUE(hasADbgVal({DW_OP_LLVM_convert, 32, DW_ATE_signed,
DW_OP_not, DW_OP_mul, DW_OP_or, DW_OP_stack_value,		DW_OP_LLVM_convert, 64, DW_ATE_signed,
DW_OP_LLVM_fragment, 0, 8}));
EXPECT_TRUE(
hasADbgVal({DW_OP_lit0, DW_OP_mul, DW_OP_deref, DW_OP_dup, DW_OP_constu,
31, DW_OP_shr, DW_OP_lit0, DW_OP_not, DW_OP_mul, DW_OP_or,
DW_OP_stack_value, DW_OP_LLVM_fragment, 0, 8}));		DW_OP_stack_value, DW_OP_LLVM_fragment, 0, 8}));
EXPECT_TRUE(hasADbgVal({DW_OP_lit0, DW_OP_mul, DW_OP_dup, DW_OP_constu, 31,
DW_OP_shr, DW_OP_lit0, DW_OP_not, DW_OP_mul, DW_OP_or,		EXPECT_TRUE(hasADbgVal({DW_OP_lit0, DW_OP_mul, DW_OP_deref,
		DW_OP_LLVM_convert, 32, DW_ATE_signed,
		DW_OP_LLVM_convert, 64, DW_ATE_signed,
		DW_OP_stack_value, DW_OP_LLVM_fragment, 0, 8}));

		EXPECT_TRUE(hasADbgVal({DW_OP_lit0, DW_OP_mul, DW_OP_LLVM_convert, 32,
		DW_ATE_signed, DW_OP_LLVM_convert, 64, DW_ATE_signed,
DW_OP_stack_value, DW_OP_LLVM_fragment, 0, 8}));		DW_OP_stack_value, DW_OP_LLVM_fragment, 0, 8}));

verifyModule(*M, &errs(), &BrokenDebugInfo);		verifyModule(*M, &errs(), &BrokenDebugInfo);
ASSERT_FALSE(BrokenDebugInfo);		ASSERT_FALSE(BrokenDebugInfo);
}		}

TEST(Local, RemoveUnreachableBlocks) {		TEST(Local, RemoveUnreachableBlocks) {
LLVMContext C;		LLVMContext C;

▲ Show 20 Lines • Show All 92 Lines • Show Last 20 Lines

This is an archive of the discontinued LLVM Phabricator instance.

Introduce DW_OP_LLVM_convertClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 187015

docs/LangRef.rst

include/llvm/BinaryFormat/Dwarf.h

include/llvm/CodeGen/AsmPrinter.h

include/llvm/CodeGen/DIE.h

include/llvm/CodeGen/DIEValue.def

include/llvm/MC/MCStreamer.h

lib/AsmParser/LLParser.cpp

lib/BinaryFormat/Dwarf.cpp

lib/CodeGen/AsmPrinter/AsmPrinterDwarf.cpp

lib/CodeGen/AsmPrinter/ByteStreamer.h

lib/CodeGen/AsmPrinter/DIE.cpp

lib/CodeGen/AsmPrinter/DIEHash.cpp

lib/CodeGen/AsmPrinter/DebugLocEntry.h

lib/CodeGen/AsmPrinter/DwarfCompileUnit.h

lib/CodeGen/AsmPrinter/DwarfCompileUnit.cpp

lib/CodeGen/AsmPrinter/DwarfDebug.h

lib/CodeGen/AsmPrinter/DwarfDebug.cpp

lib/CodeGen/AsmPrinter/DwarfExpression.h

lib/CodeGen/AsmPrinter/DwarfExpression.cpp

lib/CodeGen/AsmPrinter/DwarfFile.h

lib/CodeGen/AsmPrinter/DwarfUnit.cpp

lib/IR/AsmWriter.cpp

lib/IR/DebugInfoMetadata.cpp

lib/MC/MCStreamer.cpp

lib/Target/BPF/MCTargetDesc/BPFAsmBackend.cpp

lib/Transforms/Utils/Local.cpp

test/Assembler/diexpression.ll

test/Transforms/InstCombine/cast-set-preserve-signed-dbg-val.ll

unittests/Transforms/Utils/LocalTest.cpp

Introduce DW_OP_LLVM_convert
ClosedPublic