This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/
-
include/llvm/DebugInfo/DWARF/
-
llvm/
-
DebugInfo/
-
DWARF/
-
DWARFContext.h
1
DWARFUnit.h
-
DWARFUnitIndex.h
-
lib/DebugInfo/DWARF/
-
DebugInfo/
-
DWARF/
19/24
DWARFContext.cpp
4/4
DWARFUnitIndex.cpp
-
test/tools/llvm-dwp/X86/
-
tools/
-
llvm-dwp/
-
X86/
-
debug_macro_v5.s
-
info-v5.s
-
loclists.s
-
rnglists.s
-
tu_units_v5.s
-
type_dedup.test
-
tools/llvm-dwarfdump/
-
llvm-dwarfdump/
3/3
llvm-dwarfdump.cpp

Differential D137882

[DWARFLibrary] Add support to re-construct cu-index
ClosedPublic

Authored by ayermolo on Nov 11 2022, 6:24 PM.

Download Raw Diff

Details

Reviewers

jhenderson
dblaikie

Commits

rGc0db06227721: [DWARFLibrary] Add support to re-construct cu-index
rG73712c8790a9: [DWARFLibrary] Add support to re-construct cu-index
rGa5bd76a6e311: [DWARFLibrary] Add support to re-construct cu-index

Summary

According to DWARF5 specification and gnu specification for DWARF4 the offset
entry in the CU/TU Index is 32 bits. This presents a problem when
.debug_info.dwo in DWP file grows beyond 4GB. The CU Index becomes partially
corrupted.

This diff adds manual parsing of .debug_info.dwo/.debug_abbrev.dwo to
reconstruct CU index in general, and TU index for DWARF5. This is a work around
until DWARF6 spec is finalized.

Next patch will change internal CU/TU struct to 64 bit, and change uses as
necessary. The plan is to land all the patches in one go after all are approved.

This patch originates from the discussion in: https://discourse.llvm.org/t/dwarf-dwp-4gb-limit/63902

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

ayermolo created this revision.Nov 11 2022, 6:24 PM

Herald added a reviewer: jhenderson. · View Herald TranscriptNov 11 2022, 6:24 PM

Herald added a project: Restricted Project. · View Herald Transcript

Herald added subscribers: hoy, modimo, wenlei and 3 others. · View Herald Transcript

ayermolo requested review of this revision.Nov 11 2022, 6:24 PM

Herald added a project: Restricted Project. · View Herald TranscriptNov 11 2022, 6:24 PM

Herald added subscribers: llvm-commits, MaskRay. · View Herald Transcript

Harbormaster completed remote builds in B197350: Diff 474912.Nov 11 2022, 6:25 PM

ayermolo mentioned this in D137657: [DWARFLibrary] Add support to re-construct cu-index.Nov 11 2022, 6:26 PM

nit

Harbormaster completed remote builds in B197352: Diff 474913.Nov 11 2022, 6:30 PM

dblaikie added a subscriber: dblaikie.Nov 12 2022, 4:30 PM

dblaikie added inline comments.

llvm/lib/DebugInfo/DWARF/DWARFContext.cpp
805	Is this how error handling's generally done here? I think maybe the DWARFContext has error handling callbacks that are meant to be used? (& should probably propagate up a failure result through all of this rather than continuing with corrupt data?)
812	Could you rely on the version of the index? (version 2, I think, for pre-standard index, version 5 for the DWARFv5 standard index) rather than having to wait to parse a unit to see what version it has. There's currently no way to mix pre-standard and standard indexes, I think (owing to the valid columns accepted in each)? So that should be adequate.
833	Rather than exposing mutability in the index interface, could this whole function (fixupIndex) be moved into the index & performed there as part of parsing?
843	any reason to believe the lengths would be incorrect? Perhaps we can limit the scope a bit by not touching those?
llvm/lib/DebugInfo/DWARF/DWARFUnitIndex.cpp
256–259	Should this return by reference?
llvm/tools/llvm-dwarfdump/llvm-dwarfdump.cpp
259	Maybe, to avoid the "CU/TU/etc" Could use "Unit" consistently in both flag name and opt variable, etc. (so there's no confusion that maybe it's specifically only for the CUIndex and not the TUIndex)
260–261	maybe ".debug_info" rather than "debug info"?
262	Maybe DW_SECT_INFO rather than "DEBUG_INFO"?

Addressing comments

Harbormaster completed remote builds in B197599: Diff 475244.Nov 14 2022, 12:33 PM

missed llvm-dwarfdump comments

ayermolo marked 3 inline comments as done.Nov 14 2022, 12:53 PM

ayermolo added inline comments.

llvm/lib/DebugInfo/DWARF/DWARFContext.cpp
805	Changed it to logAllUnhandledErrors(createError()) for now To propagate up can chagne DWARFContext::get(CU,TU}Index() to return Expectec<DWARFUnitIndex>? I was going with something more localized to minimize the impact. I guess it comes down to philosophical question of whether if CU/TU index is partially corrupted we want to consider whole thing corrupted, or keep the current behavior of at least being able to access debug info below 4GB.
833	Wasn't part of the feedback from other diff is to minimize impact and not modify cu/tu index parsing, or did I miss understand? We then will need to modify parse to pass in context, and if we are parsing CU or TU.
843	I didn't want to assume anything about the producer. If Offset is corrupt, depending how how length is calculated at least one might be corrupt also. Also we are overriding all the offsets, if we mess up on that, doesn't really matter if new length is correct or not.
llvm/lib/DebugInfo/DWARF/DWARFUnitIndex.cpp
256–259	I was trying to keep same return type as the const version. Changed to reference.

Harbormaster completed remote builds in B197602: Diff 475249.Nov 14 2022, 12:53 PM

rebase

Harbormaster completed remote builds in B197613: Diff 475262.Nov 14 2022, 2:20 PM

fixed tests

Harbormaster completed remote builds in B197814: Diff 475543.Nov 15 2022, 1:19 PM

Yeah, I'm mostly OK with this direction. It's pretty isolated, maybe easy enough to explain if needed, etc.

llvm/lib/DebugInfo/DWARF/DWARFContext.cpp
785–787	These could probably all sink into `fixupIndex` as locals?
787	Maybe DenseMap rather than unordered_map?
833	Fair enough - mixed feelings, but I'll rescind this piece at least.
843	I'd prefer to be a bit less permissive, really - to not end up creating more weird cases that systems might come to depend on. Maybe fail if the length doesn't match, until we know of any particular case with mismatched lengths that we understand enough to want to/figure out how to support?
2045	looks like this unrelated change snuck in?
llvm/lib/DebugInfo/DWARF/DWARFUnitIndex.cpp
256–259	oh, yeah, the other should probably change too... ifyou could do that in a separate patch, that'd be great

ayermolo marked 6 inline comments as done.Nov 23 2022, 1:47 PM

ayermolo added inline comments.

llvm/lib/DebugInfo/DWARF/DWARFContext.cpp
2045	Ah yeah clang-format change.
llvm/lib/DebugInfo/DWARF/DWARFUnitIndex.cpp
256–259	Sounds good.

addressed comments

ayermolo marked 4 inline comments as done.Nov 23 2022, 1:48 PM

Harbormaster completed remote builds in B199280: Diff 477596.Nov 23 2022, 2:41 PM

ayermolo mentioned this in D138618: [LLDB] Enable 64 bit debug/type offset.Nov 23 2022, 3:17 PM

LLDB changes to enable 64bit support there. https://reviews.llvm.org/D138618

I think test coverage might be more suitable if it were a single dedicated test that has a corrupted index, to demonstrate that rebuilding the index comes up with a different (& correct) answer - rather than adding what looks sort of like redundant testing to existing test cases?

In D137882#3948380, @dblaikie wrote:

I think test coverage might be more suitable if it were a single dedicated test that has a corrupted index, to demonstrate that rebuilding the index comes up with a different (& correct) answer - rather than adding what looks sort of like redundant testing to existing test cases?

I am not sure how to construct corrupted CU Index manually. Would Yaml2Obj be able to do it?
For other tests I think it's ads testing coverage to make sure we are parsing cu/tu indexes correctly in various DWARF versions and debug types enablements combinations.

*To add/clarify those tests do exercise common code paths. I think adding "corrupt" index test would make sense to test some of the error conditions.

In D137882#3949594, @ayermolo wrote:

In D137882#3948380, @dblaikie wrote:

I think test coverage might be more suitable if it were a single dedicated test that has a corrupted index, to demonstrate that rebuilding the index comes up with a different (& correct) answer - rather than adding what looks sort of like redundant testing to existing test cases?

I am not sure how to construct corrupted CU Index manually. Would Yaml2Obj be able to do it?

I'm not sure - I don't think so. Might just have to be raw assembly - check the other symbolizer dwp testing? I think it's probably hand-crafted assembly? (maybe llvm-dwp testing that merges existing dwp files has some hand crafted dwp files too you could be inspired by)

But maybe Yaml2Obj could help - I'm just not very familiar with this.

For other tests I think it's ads testing coverage to make sure we are parsing cu/tu indexes correctly in various DWARF versions and debug types enablements combinations.

I'd rather keep things a bit narrower - demonstrate that the index is computed correctly, and assume that everything else works correctly once the index is correct.

In D137882#3957811, @dblaikie wrote:

In D137882#3949594, @ayermolo wrote:

In D137882#3948380, @dblaikie wrote:

I think test coverage might be more suitable if it were a single dedicated test that has a corrupted index, to demonstrate that rebuilding the index comes up with a different (& correct) answer - rather than adding what looks sort of like redundant testing to existing test cases?

I am not sure how to construct corrupted CU Index manually. Would Yaml2Obj be able to do it?

I'm not sure - I don't think so. Might just have to be raw assembly - check the other symbolizer dwp testing? I think it's probably hand-crafted assembly? (maybe llvm-dwp testing that merges existing dwp files has some hand crafted dwp files too you could be inspired by)

But maybe Yaml2Obj could help - I'm just not very familiar with this.

For other tests I think it's ads testing coverage to make sure we are parsing cu/tu indexes correctly in various DWARF versions and debug types enablements combinations.

I'd rather keep things a bit narrower - demonstrate that the index is computed correctly, and assume that everything else works correctly once the index is correct.

So...
Good news is that I got inspired and created a hand written assembly with invalid index.
Bad news is that this version of the patch relies on overflow behavior. See uint32_t TruncOffset. So just having one incorrect offset doesn't do anything.
I tried to play with length in the header the Header.extract fails.

tschuett added a subscriber: tschuett.Dec 1 2022, 7:44 AM

tschuett added inline comments.

llvm/lib/DebugInfo/DWARF/DWARFContext.cpp
60	Is this include useless?

removed unordered_map

ayermolo marked an inline comment as done.Dec 1 2022, 10:22 AM

ayermolo added inline comments.

llvm/lib/DebugInfo/DWARF/DWARFContext.cpp
60	Thanks removed. Leftover from switch to DenseMap.

Harbormaster completed remote builds in B200567: Diff 479349.Dec 1 2022, 1:48 PM

In D137882#3961341, @ayermolo wrote:

In D137882#3957811, @dblaikie wrote:

In D137882#3949594, @ayermolo wrote:

In D137882#3948380, @dblaikie wrote:

I think test coverage might be more suitable if it were a single dedicated test that has a corrupted index, to demonstrate that rebuilding the index comes up with a different (& correct) answer - rather than adding what looks sort of like redundant testing to existing test cases?

I am not sure how to construct corrupted CU Index manually. Would Yaml2Obj be able to do it?

I'm not sure - I don't think so. Might just have to be raw assembly - check the other symbolizer dwp testing? I think it's probably hand-crafted assembly? (maybe llvm-dwp testing that merges existing dwp files has some hand crafted dwp files too you could be inspired by)

But maybe Yaml2Obj could help - I'm just not very familiar with this.

For other tests I think it's ads testing coverage to make sure we are parsing cu/tu indexes correctly in various DWARF versions and debug types enablements combinations.

I'd rather keep things a bit narrower - demonstrate that the index is computed correctly, and assume that everything else works correctly once the index is correct.

So...
Good news is that I got inspired and created a hand written assembly with invalid index.
Bad news is that this version of the patch relies on overflow behavior. See uint32_t TruncOffset.

UB when converting a uint64_t into a uint32_t? Hmm, I didn't think that was undefined - figured that'd wrap around with well defined behavior. Maybe I'm misunderstanding - what's the particular UB you're referring to?

So just having one incorrect offset doesn't do anything.

Yeah, I'm still missing a step here. But I guess since we're only walking the offsets and looking for wraparound, this only has an effect if there's a genuine overflow, which would require a genuinely very large dwp file, which is infeasible to checkin as a test?

In D137882#3965250, @dblaikie wrote:

In D137882#3961341, @ayermolo wrote:

In D137882#3957811, @dblaikie wrote:

In D137882#3949594, @ayermolo wrote:

In D137882#3948380, @dblaikie wrote:

I think test coverage might be more suitable if it were a single dedicated test that has a corrupted index, to demonstrate that rebuilding the index comes up with a different (& correct) answer - rather than adding what looks sort of like redundant testing to existing test cases?

I am not sure how to construct corrupted CU Index manually. Would Yaml2Obj be able to do it?

I'm not sure - I don't think so. Might just have to be raw assembly - check the other symbolizer dwp testing? I think it's probably hand-crafted assembly? (maybe llvm-dwp testing that merges existing dwp files has some hand crafted dwp files too you could be inspired by)

But maybe Yaml2Obj could help - I'm just not very familiar with this.

For other tests I think it's ads testing coverage to make sure we are parsing cu/tu indexes correctly in various DWARF versions and debug types enablements combinations.

I'd rather keep things a bit narrower - demonstrate that the index is computed correctly, and assume that everything else works correctly once the index is correct.

So...
Good news is that I got inspired and created a hand written assembly with invalid index.
Bad news is that this version of the patch relies on overflow behavior. See uint32_t TruncOffset.

UB when converting a uint64_t into a uint32_t? Hmm, I didn't think that was undefined - figured that'd wrap around with well defined behavior. Maybe I'm misunderstanding - what's the particular UB you're referring to?

Sorry was a bit vague with the terms. When I said "overflow behavior" I didn't mean it's UB, but was referring to "wrap around behavior" which is defined.

So just having one incorrect offset doesn't do anything.

Yeah, I'm still missing a step here. But I guess since we're only walking the offsets and looking for wraparound, this only has an effect if there's a genuine overflow, which would require a genuinely very large dwp file, which is infeasible to checkin as a test?

Right exactly. We are iterating over headers and computing TruncOffset and don't hit the bad case until we wrap around (.debug_info.dwo over 4GB). So I don't see a way to create a test to trigger this unless we have a very large dwp file.

In D137882#3965278, @ayermolo wrote:

In D137882#3965250, @dblaikie wrote:

In D137882#3961341, @ayermolo wrote:

In D137882#3957811, @dblaikie wrote:

In D137882#3949594, @ayermolo wrote:

In D137882#3948380, @dblaikie wrote:

I think test coverage might be more suitable if it were a single dedicated test that has a corrupted index, to demonstrate that rebuilding the index comes up with a different (& correct) answer - rather than adding what looks sort of like redundant testing to existing test cases?

I am not sure how to construct corrupted CU Index manually. Would Yaml2Obj be able to do it?

I'm not sure - I don't think so. Might just have to be raw assembly - check the other symbolizer dwp testing? I think it's probably hand-crafted assembly? (maybe llvm-dwp testing that merges existing dwp files has some hand crafted dwp files too you could be inspired by)

But maybe Yaml2Obj could help - I'm just not very familiar with this.

For other tests I think it's ads testing coverage to make sure we are parsing cu/tu indexes correctly in various DWARF versions and debug types enablements combinations.

I'd rather keep things a bit narrower - demonstrate that the index is computed correctly, and assume that everything else works correctly once the index is correct.

So...
Good news is that I got inspired and created a hand written assembly with invalid index.
Bad news is that this version of the patch relies on overflow behavior. See uint32_t TruncOffset.

UB when converting a uint64_t into a uint32_t? Hmm, I didn't think that was undefined - figured that'd wrap around with well defined behavior. Maybe I'm misunderstanding - what's the particular UB you're referring to?

Sorry was a bit vague with the terms. When I said "overflow behavior" I didn't mean it's UB, but was referring to "wrap around behavior" which is defined.

So just having one incorrect offset doesn't do anything.

Yeah, I'm still missing a step here. But I guess since we're only walking the offsets and looking for wraparound, this only has an effect if there's a genuine overflow, which would require a genuinely very large dwp file, which is infeasible to checkin as a test?

Right exactly. We are iterating over headers and computing TruncOffset and don't hit the bad case until we wrap around (.debug_info.dwo over 4GB). So I don't see a way to create a test to trigger this unless we have a very large dwp file.

*nod* fair enough. Any chance of hand writing something that uses small assembly, but produces large enough output that it hits the overflow (using .zero or the like) - we wouldn't want to run it (don't want to produce multi-GB output files in test execution) but maybe comment it as a manual test, or at least have mentioned it/shown it in this review.

If you could write up something like that, copy/paste manually running it to show that the patch makes a difference and include the assembly in this review, I guess that'll have to be adequate?

In D137882#3965320, @dblaikie wrote:

In D137882#3965278, @ayermolo wrote:

In D137882#3965250, @dblaikie wrote:

In D137882#3961341, @ayermolo wrote:

In D137882#3957811, @dblaikie wrote:

In D137882#3949594, @ayermolo wrote:

In D137882#3948380, @dblaikie wrote:

I think test coverage might be more suitable if it were a single dedicated test that has a corrupted index, to demonstrate that rebuilding the index comes up with a different (& correct) answer - rather than adding what looks sort of like redundant testing to existing test cases?

I am not sure how to construct corrupted CU Index manually. Would Yaml2Obj be able to do it?

I'm not sure - I don't think so. Might just have to be raw assembly - check the other symbolizer dwp testing? I think it's probably hand-crafted assembly? (maybe llvm-dwp testing that merges existing dwp files has some hand crafted dwp files too you could be inspired by)

But maybe Yaml2Obj could help - I'm just not very familiar with this.

For other tests I think it's ads testing coverage to make sure we are parsing cu/tu indexes correctly in various DWARF versions and debug types enablements combinations.

I'd rather keep things a bit narrower - demonstrate that the index is computed correctly, and assume that everything else works correctly once the index is correct.

So...
Good news is that I got inspired and created a hand written assembly with invalid index.
Bad news is that this version of the patch relies on overflow behavior. See uint32_t TruncOffset.

UB when converting a uint64_t into a uint32_t? Hmm, I didn't think that was undefined - figured that'd wrap around with well defined behavior. Maybe I'm misunderstanding - what's the particular UB you're referring to?

Sorry was a bit vague with the terms. When I said "overflow behavior" I didn't mean it's UB, but was referring to "wrap around behavior" which is defined.

So just having one incorrect offset doesn't do anything.

Yeah, I'm still missing a step here. But I guess since we're only walking the offsets and looking for wraparound, this only has an effect if there's a genuine overflow, which would require a genuinely very large dwp file, which is infeasible to checkin as a test?

Right exactly. We are iterating over headers and computing TruncOffset and don't hit the bad case until we wrap around (.debug_info.dwo over 4GB). So I don't see a way to create a test to trigger this unless we have a very large dwp file.

*nod* fair enough. Any chance of hand writing something that uses small assembly, but produces large enough output that it hits the overflow (using .zero or the like) - we wouldn't want to run it (don't want to produce multi-GB output files in test execution) but maybe comment it as a manual test, or at least have mentioned it/shown it in this review.

If you could write up something like that, copy/paste manually running it to show that the patch makes a difference and include the assembly in this review, I guess that'll have to be adequate?

Oh I think I see.
Let me look into it.

bin/llvm-lit -a /data/users/ayermolo/server-llvm/llvm-project/llvm/test/tools/llvm-dwp/X86/invalid-cu-index.s

bin/llvm-dwarfdump --manaully-generate-unit-index --debug-cu-index /data/users/ayermolo/llvm-build-release/test/tools/llvm-dwp/X86/Output/invalid-cu-index.s.tmp.dwp

bin/llvm-dwarfdump --debug-cu-index /data/users/ayermolo/llvm-build-release/test/tools/llvm-dwp/X86/Output/invalid-cu-index.s.tmp.dwp

# This test checks that with invalid offset in the cu index
# we can reconstruct it manually.

# RUN: llvm-mc --filetype=obj --triple x86_64 %s -o %t.dwp
# RUN: llvm-dwarfdump --manaully-generate-unit-index --debug-cu-index %t.dwp | FileCheck %s

# This test checks that we parse correctly cu-index that has entries over 4GB.
# It is setup to work with current llvm implementation where cu-index is 32bit.
# Once we move to 64bit internal representation, it will need to be modified.

# CHECK:        0x970c277d61e66cb3
# CHECK-SAME: [0x00000000, 0xfffffff0)
# CHECK:      0xd725a83371e7e913
# CHECK-SAME: [0xfffffff0, 0x0000001b)
# CHECK:      0x93f541184fb98d75
# CHECK-SAME: [0x0000001b, 0x00000046)

        .section        .debug_abbrev.dwo,"e",@progbits
.Labbrev1:
        .byte   1                       # Abbreviation Code
        .byte   17                      # DW_TAG_compile_unit
        .byte   0                       # DW_CHILDREN_no
        .byte   37                      # DW_AT_producer
        .byte   8                       # DW_FORM_string
        .byte   3                       # DW_AT_name
        .byte   8                       # DW_FORM_string
        .ascii  "\261B"                 # DW_AT_GNU_dwo_id
        .byte   7                       # DW_FORM_data8
        .byte   0                       # EOM(1)
        .byte   0                       # EOM(2)
        .byte   0                       # EOM(3)
.Labbrev_end1:

        .section        .debug_info.dwo,"e",@progbits
# DWO CU1
.Lcu_begin1:
        .long   .Ldebug_info_end1-.Ldebug_info_start1 # Length of Unit 0x2b
.Ldebug_info_start1:
        .short  4                       # DWARF version number
        .long   0                       # Offset Into Abbrev. Section
        .byte   8                       # Address Size (in bytes)
        .byte   1                       # Abbrev DW_TAG_compile_unit
        .asciz  "Hand-written DWARF"    # DW_AT_producer
        .byte   '1', '.', 'c', 0        # DW_AT_name
        .quad   0x970c277d61e66cb3      # DW_AT_GNU_dwo_id
        .zero   0xfffffff0 - 0x2b       # 0xfffffff0 is mimimum reserved length
.Ldebug_info_end1:

# DWO CU2
.Lcu_begin2:
        .long   .Ldebug_info_end2-.Ldebug_info_start2 # Length of Unit 0x2b
.Ldebug_info_start2:
        .short  4                       # DWARF version number
        .long   0                       # Offset Into Abbrev. Section
        .byte   8                       # Address Size (in bytes)
        .byte   1                       # Abbrev DW_TAG_compile_unit
        .asciz  "Hand-written DWARF"    # DW_AT_producer
        .byte   '2', '.', 'c', 0        # DW_AT_name
        .quad   0xd725a83371e7e913      # DW_AT_GNU_dwo_id
.Ldebug_info_end2:

# DWO CU3
.Lcu_begin3:
        .long   .Ldebug_info_end3-.Ldebug_info_start3 # Length of Unit 0x2b
.Ldebug_info_start3:
        .short  4                       # DWARF version number
        .long   0                       # Offset Into Abbrev. Section
        .byte   8                       # Address Size (in bytes)
        .byte   1                       # Abbrev DW_TAG_compile_unit
        .asciz  "Hand-written DWARF"    # DW_AT_producer
        .byte   '3', '.', 'c', 0        # DW_AT_name
        .quad   0x93f541184fb98d75      # DW_AT_GNU_dwo_id
.Ldebug_info_end3:

        .section        .debug_cu_index,"",@progbits
        .long   2                       # DWARF version number
        .long   2                       # Section count
        .long   3                       # Unit count
        .long   8                       # Slot count

        .quad   0x970c277d61e66cb3, 0, 0, 0xd725a83371e7e913, 0, 0x93f541184fb98d75, 0, 0  # Hash table
        .long   1, 0, 0, 2, 0, 3, 0, 0  # Index table

        .long   1                       # DW_SECT_INFO
        .long   3                       # DW_SECT_ABBREV
# Offsets
        # row 0
        .long  .Lcu_begin1-.debug_info.dwo
        .long  .Labbrev1-.debug_abbrev.dwo
        # row 1
        .long  .Lcu_begin2-.debug_info.dwo
        .long  .Labbrev1-.debug_abbrev.dwo
        # row 2
        .long  0x1b # setting this manually, otherwise llvm-mc crashes
        .long  .Labbrev1-.debug_abbrev.dwo
# Lengths
        # row 0
        .long .Ldebug_info_end1-.Lcu_begin1
        .long .Labbrev_end1-.Labbrev1
        # row 1
        .long .Ldebug_info_end2-.Lcu_begin2
        .long .Labbrev_end1-.Labbrev1
        # row 2
        .long .Ldebug_info_end3-.Lcu_begin3
        .long .Labbrev_end1-.Labbrev1

In D137882#3967704, @ayermolo wrote:

bin/llvm-lit -a /data/users/ayermolo/server-llvm/llvm-project/llvm/test/tools/llvm-dwp/X86/invalid-cu-index.s

bin/llvm-dwarfdump --manaully-generate-unit-index --debug-cu-index /data/users/ayermolo/llvm-build-release/test/tools/llvm-dwp/X86/Output/invalid-cu-index.s.tmp.dwp

bin/llvm-dwarfdump --debug-cu-index /data/users/ayermolo/llvm-build-release/test/tools/llvm-dwp/X86/Output/invalid-cu-index.s.tmp.dwp

These don't quite look like how I'd expect. I'd have thought the manually-generate-unit-index would be different/correct, showing the value in the manual generation over the pre-built but buggy/overflowed index?

# This test checks that with invalid offset in the cu index
# we can reconstruct it manually.

# RUN: llvm-mc --filetype=obj --triple x86_64 %s -o %t.dwp
# RUN: llvm-dwarfdump --manaully-generate-unit-index --debug-cu-index %t.dwp | FileCheck %s

# This test checks that we parse correctly cu-index that has entries over 4GB.
# It is setup to work with current llvm implementation where cu-index is 32bit.
# Once we move to 64bit internal representation, it will need to be modified.

Sorry, I'm not following here ^ could you explain in more detail/rephrase?

In D137882#3967890, @dblaikie wrote:
In D137882#3967704, @ayermolo wrote:

bin/llvm-lit -a /data/users/ayermolo/server-llvm/llvm-project/llvm/test/tools/llvm-dwp/X86/invalid-cu-index.s

bin/llvm-dwarfdump --manaully-generate-unit-index --debug-cu-index /data/users/ayermolo/llvm-build-release/test/tools/llvm-dwp/X86/Output/invalid-cu-index.s.tmp.dwp

bin/llvm-dwarfdump --debug-cu-index /data/users/ayermolo/llvm-build-release/test/tools/llvm-dwp/X86/Output/invalid-cu-index.s.tmp.dwp

These don't quite look like how I'd expect. I'd have thought the manually-generate-unit-index would be different/correct, showing the value in the manual generation over the pre-built but buggy/overflowed index?
# This test checks that with invalid offset in the cu index
# we can reconstruct it manually.

# RUN: llvm-mc --filetype=obj --triple x86_64 %s -o %t.dwp
# RUN: llvm-dwarfdump --manaully-generate-unit-index --debug-cu-index %t.dwp | FileCheck %s

# This test checks that we parse correctly cu-index that has entries over 4GB.
# It is setup to work with current llvm implementation where cu-index is 32bit.
# Once we move to 64bit internal representation, it will need to be modified.
Sorry, I'm not following here ^ could you explain in more detail/rephrase?

Well internal data structure for SectionContribution is still 32bit. So right now even with parsing we are still restricted by that. So output won't change.

struct SectionContribution {
      uint32_t Offset;
      uint32_t Length;
    };

My next patch will be changing it to 64bit and add accessors, but I am waiting for https://reviews.llvm.org/D138618 to go through review and land before posting it.
Maybe it's wrong order? Put up second patch changing data structure to 64bit, and in lldb code return 32 bit until that review goes through?

Does this clarify it for your second question, or should I try to rephrase?

In D137882#3967905, @ayermolo wrote:
In D137882#3967890, @dblaikie wrote:
In D137882#3967704, @ayermolo wrote:

bin/llvm-lit -a /data/users/ayermolo/server-llvm/llvm-project/llvm/test/tools/llvm-dwp/X86/invalid-cu-index.s

bin/llvm-dwarfdump --manaully-generate-unit-index --debug-cu-index /data/users/ayermolo/llvm-build-release/test/tools/llvm-dwp/X86/Output/invalid-cu-index.s.tmp.dwp

bin/llvm-dwarfdump --debug-cu-index /data/users/ayermolo/llvm-build-release/test/tools/llvm-dwp/X86/Output/invalid-cu-index.s.tmp.dwp

These don't quite look like how I'd expect. I'd have thought the manually-generate-unit-index would be different/correct, showing the value in the manual generation over the pre-built but buggy/overflowed index?
# This test checks that with invalid offset in the cu index
# we can reconstruct it manually.

# RUN: llvm-mc --filetype=obj --triple x86_64 %s -o %t.dwp
# RUN: llvm-dwarfdump --manaully-generate-unit-index --debug-cu-index %t.dwp | FileCheck %s

# This test checks that we parse correctly cu-index that has entries over 4GB.
# It is setup to work with current llvm implementation where cu-index is 32bit.
# Once we move to 64bit internal representation, it will need to be modified.
Sorry, I'm not following here ^ could you explain in more detail/rephrase?
Well internal data structure for SectionContribution is still 32bit. So right now even with parsing we are still restricted by that. So output won't change.
struct SectionContribution {
      uint32_t Offset;
      uint32_t Length;
    };
My next patch will be changing it to 64bit and add accessors, but I am waiting for https://reviews.llvm.org/D138618 to go through review and land before posting it.
Maybe it's wrong order? Put up second patch changing data structure to 64bit, and in lldb code return 32 bit until that review goes through?

Does this clarify it for your second question, or should I try to rephrase?

Ah, that makes this patch a bit untestable, though - perhaps this patch should wait for the 64bit underlying change, then this patch will have a real purpose/make a difference, where today it doesn't change anything. (committing things that don't change anything, then committing some otherwise-cleanup-ish refactoring that enables the previous change is tricky, because then it's hard to keep track of things being tested, since they couldn't be tested when they were committed owing to the missing underlying refactoring/changes - though it can be a chicke-and-egg situation, in this case I think it's probably clear enough that changing 32 bit offsets to 64 bit ones is pretty benign/mechanical & could be done/reviewed relatively easily (relatively, still lots of changes) & then this change would go on top making the behavior change/tested (albeit manually), etc)

In D137882#3968005, @dblaikie wrote:
In D137882#3967905, @ayermolo wrote:
In D137882#3967890, @dblaikie wrote:
In D137882#3967704, @ayermolo wrote:

bin/llvm-lit -a /data/users/ayermolo/server-llvm/llvm-project/llvm/test/tools/llvm-dwp/X86/invalid-cu-index.s

bin/llvm-dwarfdump --manaully-generate-unit-index --debug-cu-index /data/users/ayermolo/llvm-build-release/test/tools/llvm-dwp/X86/Output/invalid-cu-index.s.tmp.dwp

bin/llvm-dwarfdump --debug-cu-index /data/users/ayermolo/llvm-build-release/test/tools/llvm-dwp/X86/Output/invalid-cu-index.s.tmp.dwp

These don't quite look like how I'd expect. I'd have thought the manually-generate-unit-index would be different/correct, showing the value in the manual generation over the pre-built but buggy/overflowed index?
# This test checks that with invalid offset in the cu index
# we can reconstruct it manually.

# RUN: llvm-mc --filetype=obj --triple x86_64 %s -o %t.dwp
# RUN: llvm-dwarfdump --manaully-generate-unit-index --debug-cu-index %t.dwp | FileCheck %s

# This test checks that we parse correctly cu-index that has entries over 4GB.
# It is setup to work with current llvm implementation where cu-index is 32bit.
# Once we move to 64bit internal representation, it will need to be modified.
Sorry, I'm not following here ^ could you explain in more detail/rephrase?
Well internal data structure for SectionContribution is still 32bit. So right now even with parsing we are still restricted by that. So output won't change.
struct SectionContribution {
      uint32_t Offset;
      uint32_t Length;
    };
My next patch will be changing it to 64bit and add accessors, but I am waiting for https://reviews.llvm.org/D138618 to go through review and land before posting it.
Maybe it's wrong order? Put up second patch changing data structure to 64bit, and in lldb code return 32 bit until that review goes through?

Does this clarify it for your second question, or should I try to rephrase?
Ah, that makes this patch a bit untestable, though - perhaps this patch should wait for the 64bit underlying change, then this patch will have a real purpose/make a difference, where today it doesn't change anything. (committing things that don't change anything, then committing some otherwise-cleanup-ish refactoring that enables the previous change is tricky, because then it's hard to keep track of things being tested, since they couldn't be tested when they were committed owing to the missing underlying refactoring/changes - though it can be a chicke-and-egg situation, in this case I think it's probably clear enough that changing 32 bit offsets to 64 bit ones is pretty benign/mechanical & could be done/reviewed relatively easily (relatively, still lots of changes) & then this change would go on top making the behavior change/tested (albeit manually), etc)

I wasn't really planning on committing it without the other patch.
Let me do this. I'll remove dependency of the 32bit to 64bit patch on lldb being 64 bit, post it "on top" of this patch, and in that patch test it as a whole. Sounds good?

I wasn't really planning on committing it without the other patch.

Oh, OK - then if you could run the manual test/experiment with that patch applied, happy to review it on that basis and approve this so long as it goes after that work?

Let me do this. I'll remove dependency of the 32bit to 64bit patch on lldb being 64 bit, post it "on top" of this patch, and in that patch test it as a whole. Sounds good?

Not sure I follow - I think that's what I'd rather avoid committing this code (D137882) without a test. That it should wait for whatever refactoring's required before it's committed with some verification, even if it's manual. I don't want this patch to be committed in a state where it's a no-op/unobservable & rely on testing it in a subsequent patch.

Depends on D139379

Harbormaster completed remote builds in B201243: Diff 480276.Dec 5 2022, 4:24 PM

ayermolo added a parent revision: D139379: [llvm][dwwarf] Change CU/TU index to 64-bit.Dec 5 2022, 4:24 PM

Updated test with 64bit printout.

# RUN: llvm-mc --filetype=obj --triple x86_64 %s -o %t.dwp
# RUN: llvm-dwarfdump --manaully-generate-unit-index --debug-cu-index %t.dwp | FileCheck %s

# This test checks that we parse correctly cu-index that has entries over 4GB.
# It is setup to work with current llvm implementation where cu-index is 32bit.
# Once we move to 64bit internal representation, it will need to be modified.

# CHECK:        0x970c277d61e66cb3
# CHECK-SAME: [0x0000000000000000, 0x00000000fffffff0)
# CHECK:      0xd725a83371e7e913
# CHECK-SAME: [0x00000000fffffff0, 0x000000010000001b)
# CHECK:      0x93f541184fb98d75
# CHECK-SAME: [0x000000010000001b, 0x0000000100000046)

        .section        .debug_abbrev.dwo,"e",@progbits
.Labbrev1:
        .byte   1                       # Abbreviation Code
        .byte   17                      # DW_TAG_compile_unit
        .byte   0                       # DW_CHILDREN_no
        .byte   37                      # DW_AT_producer
        .byte   8                       # DW_FORM_string
        .byte   3                       # DW_AT_name
        .byte   8                       # DW_FORM_string
        .ascii  "\261B"                 # DW_AT_GNU_dwo_id
        .byte   7                       # DW_FORM_data8
        .byte   0                       # EOM(1)
        .byte   0                       # EOM(2)
        .byte   0                       # EOM(3)
.Labbrev_end1:

        .section        .debug_info.dwo,"e",@progbits
# DWO CU1
.Lcu_begin1:
        .long   .Ldebug_info_end1-.Ldebug_info_start1 # Length of Unit 0x2b
.Ldebug_info_start1:
        .short  4                       # DWARF version number
        .long   0                       # Offset Into Abbrev. Section
        .byte   8                       # Address Size (in bytes)
        .byte   1                       # Abbrev DW_TAG_compile_unit
        .asciz  "Hand-written DWARF"    # DW_AT_producer
        .byte   '1', '.', 'c', 0        # DW_AT_name
        .quad   0x970c277d61e66cb3      # DW_AT_GNU_dwo_id
        .zero   0xfffffff0 - 0x2b       # 0xfffffff0 is mimimum reserved length
.Ldebug_info_end1:

# DWO CU2
.Lcu_begin2:
        .long   .Ldebug_info_end2-.Ldebug_info_start2 # Length of Unit 0x2b
.Ldebug_info_start2:
        .short  4                       # DWARF version number
        .long   0                       # Offset Into Abbrev. Section
        .byte   8                       # Address Size (in bytes)
        .byte   1                       # Abbrev DW_TAG_compile_unit
        .asciz  "Hand-written DWARF"    # DW_AT_producer
        .byte   '2', '.', 'c', 0        # DW_AT_name
        .quad   0xd725a83371e7e913      # DW_AT_GNU_dwo_id
.Ldebug_info_end2:

# DWO CU3
.Lcu_begin3:
        .long   .Ldebug_info_end3-.Ldebug_info_start3 # Length of Unit 0x2b
.Ldebug_info_start3:
        .short  4                       # DWARF version number
        .long   0                       # Offset Into Abbrev. Section
        .byte   8                       # Address Size (in bytes)
        .byte   1                       # Abbrev DW_TAG_compile_unit
        .asciz  "Hand-written DWARF"    # DW_AT_producer
        .byte   '3', '.', 'c', 0        # DW_AT_name
        .quad   0x93f541184fb98d75      # DW_AT_GNU_dwo_id
.Ldebug_info_end3:

        .section        .debug_cu_index,"",@progbits
        .long   2                       # DWARF version number
        .long   2                       # Section count
        .long   3                       # Unit count
        .long   8                       # Slot count

        .quad   0x970c277d61e66cb3, 0, 0, 0xd725a83371e7e913, 0, 0x93f541184fb98d75, 0, 0  # Hash table
        .long   1, 0, 0, 2, 0, 3, 0, 0  # Index table

        .long   1                       # DW_SECT_INFO
        .long   3                       # DW_SECT_ABBREV
# Offsets
        # row 0
        .long  .Lcu_begin1-.debug_info.dwo
        .long  .Labbrev1-.debug_abbrev.dwo
        # row 1
        .long  .Lcu_begin2-.debug_info.dwo
        .long  .Labbrev1-.debug_abbrev.dwo
        # row 2
        .long  0x1b # setting this manually, otherwis llvm-mc crashes
        .long  .Labbrev1-.debug_abbrev.dwo
# Lengths
        # row 0
        .long .Ldebug_info_end1-.Lcu_begin1
        .long .Labbrev_end1-.Labbrev1
        # row 1
        .long .Ldebug_info_end2-.Lcu_begin2
        .long .Labbrev_end1-.Labbrev1
        # row 2
        .long .Ldebug_info_end3-.Lcu_begin3
        .long .Labbrev_end1-.Labbrev1

Sounds good to me. Thanks for your patience/please only commit once the 64 bit changes to LLVM's index data structures are committed first.

In D137882#3972721, @ayermolo wrote:

Updated test with 64bit printout.

# RUN: llvm-mc --filetype=obj --triple x86_64 %s -o %t.dwp
# RUN: llvm-dwarfdump --manaully-generate-unit-index --debug-cu-index %t.dwp | FileCheck %s

# This test checks that we parse correctly cu-index that has entries over 4GB.
# It is setup to work with current llvm implementation where cu-index is 32bit.
# Once we move to 64bit internal representation, it will need to be modified.

I guess this comment needs to be updated ^

# CHECK:        0x970c277d61e66cb3
# CHECK-SAME: [0x0000000000000000, 0x00000000fffffff0)
# CHECK:      0xd725a83371e7e913
# CHECK-SAME: [0x00000000fffffff0, 0x000000010000001b)
# CHECK:      0x93f541184fb98d75
# CHECK-SAME: [0x000000010000001b, 0x0000000100000046)

        .section        .debug_abbrev.dwo,"e",@progbits
.Labbrev1:
        .byte   1                       # Abbreviation Code
        .byte   17                      # DW_TAG_compile_unit
        .byte   0                       # DW_CHILDREN_no
        .byte   37                      # DW_AT_producer
        .byte   8                       # DW_FORM_string
        .byte   3                       # DW_AT_name
        .byte   8                       # DW_FORM_string
        .ascii  "\261B"                 # DW_AT_GNU_dwo_id
        .byte   7                       # DW_FORM_data8
        .byte   0                       # EOM(1)
        .byte   0                       # EOM(2)
        .byte   0                       # EOM(3)
.Labbrev_end1:

        .section        .debug_info.dwo,"e",@progbits
# DWO CU1
.Lcu_begin1:
        .long   .Ldebug_info_end1-.Ldebug_info_start1 # Length of Unit 0x2b
.Ldebug_info_start1:
        .short  4                       # DWARF version number
        .long   0                       # Offset Into Abbrev. Section
        .byte   8                       # Address Size (in bytes)
        .byte   1                       # Abbrev DW_TAG_compile_unit
        .asciz  "Hand-written DWARF"    # DW_AT_producer
        .byte   '1', '.', 'c', 0        # DW_AT_name
        .quad   0x970c277d61e66cb3      # DW_AT_GNU_dwo_id
        .zero   0xfffffff0 - 0x2b       # 0xfffffff0 is mimimum reserved length
.Ldebug_info_end1:

# DWO CU2
.Lcu_begin2:
        .long   .Ldebug_info_end2-.Ldebug_info_start2 # Length of Unit 0x2b
.Ldebug_info_start2:
        .short  4                       # DWARF version number
        .long   0                       # Offset Into Abbrev. Section
        .byte   8                       # Address Size (in bytes)
        .byte   1                       # Abbrev DW_TAG_compile_unit
        .asciz  "Hand-written DWARF"    # DW_AT_producer
        .byte   '2', '.', 'c', 0        # DW_AT_name
        .quad   0xd725a83371e7e913      # DW_AT_GNU_dwo_id
.Ldebug_info_end2:

# DWO CU3
.Lcu_begin3:
        .long   .Ldebug_info_end3-.Ldebug_info_start3 # Length of Unit 0x2b
.Ldebug_info_start3:
        .short  4                       # DWARF version number
        .long   0                       # Offset Into Abbrev. Section
        .byte   8                       # Address Size (in bytes)
        .byte   1                       # Abbrev DW_TAG_compile_unit
        .asciz  "Hand-written DWARF"    # DW_AT_producer
        .byte   '3', '.', 'c', 0        # DW_AT_name
        .quad   0x93f541184fb98d75      # DW_AT_GNU_dwo_id
.Ldebug_info_end3:

        .section        .debug_cu_index,"",@progbits
        .long   2                       # DWARF version number
        .long   2                       # Section count
        .long   3                       # Unit count
        .long   8                       # Slot count

        .quad   0x970c277d61e66cb3, 0, 0, 0xd725a83371e7e913, 0, 0x93f541184fb98d75, 0, 0  # Hash table
        .long   1, 0, 0, 2, 0, 3, 0, 0  # Index table

        .long   1                       # DW_SECT_INFO
        .long   3                       # DW_SECT_ABBREV
# Offsets
        # row 0
        .long  .Lcu_begin1-.debug_info.dwo
        .long  .Labbrev1-.debug_abbrev.dwo
        # row 1
        .long  .Lcu_begin2-.debug_info.dwo
        .long  .Labbrev1-.debug_abbrev.dwo
        # row 2
        .long  0x1b # setting this manually, otherwis llvm-mc crashes
        .long  .Labbrev1-.debug_abbrev.dwo
# Lengths
        # row 0
        .long .Ldebug_info_end1-.Lcu_begin1
        .long .Labbrev_end1-.Labbrev1
        # row 1
        .long .Ldebug_info_end2-.Lcu_begin2
        .long .Labbrev_end1-.Labbrev1
        # row 2
        .long .Ldebug_info_end3-.Lcu_begin3
        .long .Labbrev_end1-.Labbrev1

llvm/include/llvm/DebugInfo/DWARF/DWARFUnit.h
96	This change is unneeded/can be removed, yeah?

This revision is now accepted and ready to land.Dec 5 2022, 5:07 PM

rebase, addressed comment

Thanks for reviewing. :)

Harbormaster completed remote builds in B201257: Diff 480293.Dec 5 2022, 11:40 PM

fixed some tests

Reduced the scope a bit. Removed some v5 tests and added one new tests.

Harbormaster completed remote builds in B201539: Diff 480682.Dec 7 2022, 1:54 AM

This revision was landed with ongoing or failed builds.Dec 7 2022, 1:09 PM

Closed by commit rGa5bd76a6e311: [DWARFLibrary] Add support to re-construct cu-index (authored by ayermolo). · Explain Why

This revision was automatically updated to reflect the committed changes.

ayermolo added a commit: rGa5bd76a6e311: [DWARFLibrary] Add support to re-construct cu-index.

ayermolo added a reverting change: rGa77376479dc8: Revert "[DWARFLibrary] Add support to re-construct cu-index".Dec 7 2022, 1:15 PM

Accidentally pushed when pushing bolt changes. Reverted.

ayermolo mentioned this in D139578: [DWARFLibrary] Add support to re-construct cu-index.Dec 7 2022, 1:59 PM

ayermolo reopened this revision.Dec 7 2022, 3:37 PM

This revision is now accepted and ready to land.Dec 7 2022, 3:37 PM

ayermolo removed a parent revision: D139379: [llvm][dwwarf] Change CU/TU index to 64-bit.Jan 10 2023, 3:10 PM

rebase

This revision was landed with ongoing or failed builds.Jan 10 2023, 3:16 PM

Closed by commit rG73712c8790a9: [DWARFLibrary] Add support to re-construct cu-index (authored by ayermolo). · Explain Why

This revision was automatically updated to reflect the committed changes.

ayermolo added a commit: rG73712c8790a9: [DWARFLibrary] Add support to re-construct cu-index.

Harbormaster completed remote builds in B206949: Diff 488008.Jan 10 2023, 8:46 PM

vitalybuka mentioned this in rG9220c0c7afaa: [DWARFLibrary] Init field after D137882.Jan 10 2023, 11:55 PM

Sorry, this change causes a MemorySanitizer error in LLVM testsuite. I'm going to revert it. Please investigate, fix, and feel free to re-land.

Here's a sample error from llvm/test/CodeGen/X86/dwarf-split-line-1.ll:

==6051==WARNING: MemorySanitizer: use-of-uninitialized-value
    #0 0x55fcfe0c7a00 in llvm::DWARFContext::getTUIndex() llvm-project/llvm/lib/DebugInfo/DWARF/DWARFContext.cpp:870:7
    #1 0x55fcfe17365f in operator() llvm-project/llvm/lib/DebugInfo/DWARF/DWARFUnit.cpp:87:39
    #2 0x55fcfe17365f in __invoke<(lambda at llvm-project/llvm/lib/DebugInfo/DWARF/DWARFUnit.cpp:74:14) &, unsigned long, llvm::DWARFSectionKind, const llvm::DWARFSection *, const llvm::DWARFUnitIndex::Entry *> [...]/c++/v1/__functional/invoke.h:394:23
    #3 0x55fcfe17365f in __call<(lambda at llvm-project/llvm/lib/DebugInfo/DWARF/DWARFUnit.cpp:74:14) &, unsigned long, llvm::DWARFSectionKind, const llvm::DWARFSection *, const llvm::DWARFUnitIndex::Entry *> [...]/c++/v1/__functional/invoke.h:478:16
    #4 0x55fcfe17365f in operator() [...]/c++/v1/__functional/function.h:232:12
    #5 0x55fcfe17365f in std::__msan::unique_ptr<llvm::DWARFUnit, std::__msan::default_delete<llvm::DWARFUnit>> std::__msan::__function::__policy_invoker<std::__msan::unique_ptr<llvm::DWARFUnit, std::__msan::default_delete<llvm::DWARFUnit>> (unsigned long, llvm::DWARFSectionKind, llvm::DWARFSection const*, llvm::DWARFUnitIndex::Entry const*)>::__call_impl<std::__msan::__function::__default_alloc_func<llvm::DWARFUnitVector::addUnitsImpl(llvm::DWARFContext&, llvm::DWARFObject const&, llvm::DWARFSection const&, llvm::DWARFDebugAbbrev const*, llvm::DWARFSection const*, llvm::DWARFSection const*, llvm::StringRef, llvm::DWARFSection const&, llvm::DWARFSection const*, llvm::DWARFSection const&, bool, bool, bool, llvm::DWARFSectionKind)::$_0, std::__msan::unique_ptr<llvm::DWARFUnit, std::__msan::default_delete<llvm::DWARFUnit>> (unsigned long, llvm::DWARFSectionKind, llvm::DWARFSection const*, llvm::DWARFUnitIndex::Entry const*)>>(std::__msan::__function::__policy_storage const*, unsigned long, llvm::DWARFSectionKind, llvm::DWARFSection const*, llvm::DWARFUnitIndex::Entry const*) v1/__functional/function.h:711:16
    #6 0x55fcfe166ef6 in operator() [...]/c++/v1/__functional/function.h:842:16
    #7 0x55fcfe166ef6 in operator() [...]/c++/v1/__functional/function.h:1152:12
    #8 0x55fcfe166ef6 in llvm::DWARFUnitVector::addUnitsImpl(llvm::DWARFContext&, llvm::DWARFObject const&, llvm::DWARFSection const&, llvm::DWARFDebugAbbrev const*, llvm::DWARFSection const*, llvm::DWARFSection const*, llvm::StringRef, llvm::DWARFSection const&, llvm::DWARFSection const*, llvm::DWARFSection const&, bool, bool, bool, llvm::DWARFSectionKind) llvm-project/llvm/lib/DebugInfo/DWARF/DWARFUnit.cpp:127:14
    #9 0x55fcfe1671cd in llvm::DWARFUnitVector::addUnitsForDWOSection(llvm::DWARFContext&, llvm::DWARFSection const&, llvm::DWARFSectionKind, bool) llvm-project/llvm/lib/DebugInfo/DWARF/DWARFUnit.cpp:58:3
    #10 0x55fcfe0dee91 in operator() llvm-project/llvm/include/llvm/ADT/STLFunctionalExtras.h:68:12
    #11 0x55fcfe0dee91 in (anonymous namespace)::DWARFObjInMemory::forEachInfoDWOSections(llvm::function_ref<void (llvm::DWARFSection const&)>) const llvm-project/llvm/lib/DebugInfo/DWARF/DWARFContext.cpp:1999:7
    #12 0x55fcfe0cabdf in llvm::DWARFContext::parseDWOUnits(bool) llvm-project/llvm/lib/DebugInfo/DWARF/DWARFContext.cpp:1106:9
    #13 0x55fcfe0bc1f5 in getNumDWOCompileUnits llvm-project/llvm/include/llvm/DebugInfo/DWARF/DWARFContext.h:229:5
    #14 0x55fcfe0bc1f5 in llvm::DWARFContext::dump(llvm::raw_ostream&, llvm::DIDumpOptions, std::__msan::array<std::__msan::optional<unsigned long>, 28ul>) llvm-project/llvm/lib/DebugInfo/DWARF/DWARFContext.cpp:398:24
    #15 0x55fcfcdc085a in dumpObjectFile(llvm::object::ObjectFile&, llvm::DWARFContext&, llvm::Twine const&, llvm::raw_ostream&) llvm-project/llvm/tools/llvm-dwarfdump/llvm-dwarfdump.cpp:631:9
    #16 0x55fcfcdc298d in operator() [...]/c++/v1/__functional/function.h:842:16
    #17 0x55fcfcdc298d in operator() [...]/c++/v1/__functional/function.h:1152:12
    #18 0x55fcfcdc298d in handleBuffer(llvm::StringRef, llvm::MemoryBufferRef, std::__msan::function<bool (llvm::object::ObjectFile&, llvm::DWARFContext&, llvm::Twine const&, llvm::raw_ostream&)>, llvm::raw_ostream&) llvm-project/llvm/tools/llvm-dwarfdump/llvm-dwarfdump.cpp:686:12
    #19 0x55fcfcdbc35d in handleFile(llvm::StringRef, std::__msan::function<bool (llvm::object::ObjectFile&, llvm::DWARFContext&, llvm::Twine const&, llvm::raw_ostream&)>, llvm::raw_ostream&) llvm-project/llvm/tools/llvm-dwarfdump/llvm-dwarfdump.cpp:724:10
    #20 0x55fcfcdbb99a in main llvm-project/llvm/tools/llvm-dwarfdump/llvm-dwarfdump.cpp:820:18
    #21 0x7faa82510632 in __libc_start_main
    #22 0x55fcfcd15169 in _start

SUMMARY: MemorySanitizer: use-of-uninitialized-value llvm-project/llvm/lib/DebugInfo/DWARF/DWARFContext.cpp:870:7 in llvm::DWARFContext::getTUIndex()

Full list of affected tests, the immediate cause looks similar (use-of-uninitialized-value in llvm::DWARFContext::getTUIndex()):

llvm/test/CodeGen/X86/dwarf-headers.ll
llvm/test/CodeGen/X86/dwarf-split-line-1.ll
llvm/test/CodeGen/X86/dwarf-split-line-2.ll
llvm/test/DebugInfo/WebAssembly/dwarf-headers.ll
llvm/test/DebugInfo/X86/dwarfdump-header.s
llvm/test/DebugInfo/X86/dwarfdump-str-offsets-v4-dwarf64-dwo.s
llvm/test/DebugInfo/X86/dwarfdump-str-offsets.s
llvm/test/DebugInfo/X86/generate-odr-hash.ll
llvm/test/DebugInfo/X86/gnu-public-names-tu.ll
llvm/test/DebugInfo/X86/string-offsets-multiple-cus.ll
llvm/test/DebugInfo/X86/tu-to-non-named-type.ll
llvm/test/DebugInfo/X86/tu-to-non-tu.ll
llvm/test/DebugInfo/X86/type_units_with_addresses.ll

gribozavr added a reverting change: rG4cf83c474700: Revert "[DWARFLibrary] Add support to re-construct cu-index".Jan 11 2023, 2:31 AM

Sorry for break, let me take a look.

This revision is now accepted and ready to land.Jan 11 2023, 3:13 PM

fixed ub, mem santiziers

Harbormaster completed remote builds in B207262: Diff 488453.Jan 11 2023, 7:52 PM

fixed one more mem sanitizer bug

Looks OK, if it clears up the failures.

Harbormaster completed remote builds in B207398: Diff 488644.Jan 12 2023, 9:52 AM

In D137882#4048006, @dblaikie wrote:

Looks OK, if it clears up the failures.

thanks. Re-ran ninja check all locally with memsan build, and came back clean.

Closed by commit rGc0db06227721: [DWARFLibrary] Add support to re-construct cu-index (authored by ayermolo). · Explain WhyJan 12 2023, 11:00 AM

This revision was automatically updated to reflect the committed changes.

ayermolo added a commit: rGc0db06227721: [DWARFLibrary] Add support to re-construct cu-index.

@dblaikie looks like no more build bot breaks. :D Thank you again for reviewing.

ayermolo mentioned this in rGf36fe009c0fc: [LLDB] Enable 64 bit debug/type offset.Feb 13 2023, 1:10 PM

ayermolo mentioned this in rG2062e90aa531: [LLDB] Enable 64 bit debug/type offset.Feb 16 2023, 2:47 PM

ayermolo mentioned this in D144565: dwp check overflow.Feb 22 2023, 9:38 AM

ayermolo mentioned this in rG34a8e6eee666: [LLDB] Enable 64 bit debug/type offset.Feb 22 2023, 11:34 AM

dblaikie added inline comments.Mar 16 2023, 5:56 PM

llvm/lib/DebugInfo/DWARF/DWARFContext.cpp
865–868	Any idea what this comment/code were about checking the type index version? I don't see any reason this fixup wouldn't be relevant to DWARFv4 type unit indexes - though the fixup code would have to take the section to parse as a parameter, since it's currently hardcoded to the debug_info sections, not the debug_types sections.

ayermolo added inline comments.Mar 16 2023, 6:26 PM

llvm/lib/DebugInfo/DWARF/DWARFContext.cpp
865–868	I don't quite remember why. I think it was to narrow the scope. Also don't we also need to call forEachTypesDWOSections if we want to handle .debug_types section? Why do you see overflows in that section also?

dblaikie added inline comments.Mar 17 2023, 3:20 PM

llvm/lib/DebugInfo/DWARF/DWARFContext.cpp
865–868	We aren't having an issue here - since it's only DWARFv4+type units, and we're using DWARFv5 these days. I've been looking at adding DWARFv5 overflow recovery (which can be more robust than DWARFv4, since DWARFv5 can get the DWOID/type signature without needing the abbrev section) and just looking more closely at this code/seeing these quirks & wanted to understand it better. But, yeah, maybe just not worth supporting shrug no worries, thanks!

ayermolo added inline comments.Mar 19 2023, 3:39 PM

llvm/lib/DebugInfo/DWARF/DWARFContext.cpp
865–868	Hopefully we will be moving to DWARF5 his year also. :D The code should work with DWARF5 + debug types, but you are right it could be made more robust by specializing and taking advantage of DWO ID being in a header. Not sure it's worth it either. Also I am not sure how often debug-types are used with DWARF5 since one has to choose between them and .debug_names. Although looking at https://reviews.llvm.org/D49420 should be easy to turn on now that llvm generates DWARF5 type units.

dblaikie added inline comments.Mar 20 2023, 5:45 PM

llvm/lib/DebugInfo/DWARF/DWARFContext.cpp
865–868	Hopefully we will be moving to DWARF5 his year also. :D fingers crossed The code should work with DWARF5 + debug types, but you are right it could be made more robust by specializing and taking advantage of DWO ID being in a header. Not sure it's worth it either. You mean DWARF4+debug types? It does already work with DWARF5 and debug types. Also I am not sure how often debug-types are used with DWARF5 since one has to choose between them and .debug_names. Although looking at https://reviews.llvm.org/D49420 should be easy to turn on now that llvm generates DWARF5 type units. It's what we're doing at Google, at least - we're still using gdb_index, not lldb - and when we use lldb it's without any index solution, so, yeah, slow... Hadn't realized there was that outstanding patch for debug_names for type units. I'll have to take a look...

dblaikie added inline comments.Mar 20 2023, 5:53 PM

llvm/lib/DebugInfo/DWARF/DWARFContext.cpp
865–868	oh, looked at D49420, thought that was an outstanding patch for DWARFv5 debug_names support for type units, but it's the patch that enabled debug_names but not when type units are enabled. I think I looked at it recently and it's not quite as simple as removing the opt-out - there's some implementation complexity to address in building debug_names, also in terms of what we put in them... but yeah, needs some looking at/implementing for sure.

ayermolo added inline comments.Mar 20 2023, 6:01 PM

llvm/lib/DebugInfo/DWARF/DWARFContext.cpp
865–868	err sorry didn't phrase it correctly. Just affirming this will support DWARF5 + debug types. The patch is original one that went in a while back. Reading description acceleration table was disabled because at that time DWARF4 debug types were created by clang. Ah I see. How do you generate gdb_index with gdb-add-index?

dblaikie added inline comments.Mar 21 2023, 10:20 AM

llvm/lib/DebugInfo/DWARF/DWARFContext.cpp
865–868	Ah I see. How do you generate gdb_index with gdb-add-index? `-ggnu-pubnames` and `-Wl,--gdb-index` - so name lists in the object files then the linker generates the efficient index

ayermolo added inline comments.Mar 21 2023, 10:34 AM

llvm/lib/DebugInfo/DWARF/DWARFContext.cpp
865–868	ah gotcha version 7 of gdb-index that lld produces. thanks.

Revision Contents

Path

Size

llvm/

include/

llvm/

DebugInfo/

DWARF/

DWARFContext.h

12 lines

DWARFUnit.h

1 line

DWARFUnitIndex.h

6 lines

lib/

DebugInfo/

DWARF/

DWARFContext.cpp

74 lines

DWARFUnitIndex.cpp

5 lines

test/

tools/

llvm-dwp/

X86/

6 lines

11 lines

11 lines

11 lines

12 lines

27 lines

tools/

llvm-dwarfdump/

llvm-dwarfdump.cpp

8 lines

Diff 475249

llvm/include/llvm/DebugInfo/DWARF/DWARFContext.h

Show First 20 Lines • Show All 105 Lines • ▼ Show 20 Lines	class DWARFContext : public DIContext {
/// section.		/// section.
enum MacroSecType {		enum MacroSecType {
MacinfoSection,		MacinfoSection,
MacinfoDwoSection,		MacinfoDwoSection,
MacroSection,		MacroSection,
MacroDwoSection		MacroDwoSection
};		};

		// When set parses debug_info.dwo/debug_abbrev.dwo manually and populates CU
		// Index, and TU Index for DWARF5.
		bool ParseCUTUIndexManually;

public:		public:
DWARFContext(std::unique_ptr<const DWARFObject> DObj,		DWARFContext(std::unique_ptr<const DWARFObject> DObj,
std::string DWPName = "",		std::string DWPName = "",
std::function<void(Error)> RecoverableErrorHandler =		std::function<void(Error)> RecoverableErrorHandler =
WithColor::defaultErrorHandler,		WithColor::defaultErrorHandler,
std::function<void(Error)> WarningHandler =		std::function<void(Error)> WarningHandler =
WithColor::defaultWarningHandler);		WithColor::defaultWarningHandler);
~DWARFContext() override;		~DWARFContext() override;
▲ Show 20 Lines • Show All 327 Lines • ▼ Show 20 Lines	public:
}		}

/// Return the compile unit which contains instruction with provided		/// Return the compile unit which contains instruction with provided
/// address.		/// address.
/// TODO: change input parameter from "uint64_t Address"		/// TODO: change input parameter from "uint64_t Address"
/// into "SectionedAddress Address"		/// into "SectionedAddress Address"
DWARFCompileUnit *getCompileUnitForAddress(uint64_t Address);		DWARFCompileUnit *getCompileUnitForAddress(uint64_t Address);

		/// Returns whether CU/TU should be populated manually. TU Index populated
		/// manually only for DWARF5.
		bool getParseCUTUIndexManually() const { return ParseCUTUIndexManually; }

		/// Sets whether CU/TU should be populated manually. TU Index populated
		/// manually only for DWARF5.
		void setParseCUTUIndexManually(bool PCUTU) { ParseCUTUIndexManually = PCUTU; }

private:		private:
/// Parse a macro[.dwo] or macinfo[.dwo] section.		/// Parse a macro[.dwo] or macinfo[.dwo] section.
std::unique_ptr<DWARFDebugMacro>		std::unique_ptr<DWARFDebugMacro>
parseMacroOrMacinfo(MacroSecType SectionType);		parseMacroOrMacinfo(MacroSecType SectionType);

void addLocalsForDie(DWARFCompileUnit *CU, DWARFDie Subprogram, DWARFDie Die,		void addLocalsForDie(DWARFCompileUnit *CU, DWARFDie Subprogram, DWARFDie Die,
std::vector<DILocal> &Result);		std::vector<DILocal> &Result);
};		};

} // end namespace llvm		} // end namespace llvm

#endif // LLVM_DEBUGINFO_DWARF_DWARFCONTEXT_H		#endif // LLVM_DEBUGINFO_DWARF_DWARFCONTEXT_H

llvm/include/llvm/DebugInfo/DWARF/DWARFUnit.h

Show First 20 Lines • Show All 87 Lines • ▼ Show 20 Lines	public:
dwarf::DwarfFormat getFormat() const { return FormParams.Format; }		dwarf::DwarfFormat getFormat() const { return FormParams.Format; }
uint8_t getAddressByteSize() const { return FormParams.AddrSize; }		uint8_t getAddressByteSize() const { return FormParams.AddrSize; }
uint8_t getRefAddrByteSize() const { return FormParams.getRefAddrByteSize(); }		uint8_t getRefAddrByteSize() const { return FormParams.getRefAddrByteSize(); }
uint8_t getDwarfOffsetByteSize() const {		uint8_t getDwarfOffsetByteSize() const {
return FormParams.getDwarfOffsetByteSize();		return FormParams.getDwarfOffsetByteSize();
}		}
uint64_t getLength() const { return Length; }		uint64_t getLength() const { return Length; }
uint64_t getAbbrOffset() const { return AbbrOffset; }		uint64_t getAbbrOffset() const { return AbbrOffset; }
		void setAbbrOffset(uint64_t Offset) { AbbrOffset = Offset; }
		dblaikieUnsubmitted Not Done Reply Inline Actions This change is unneeded/can be removed, yeah? dblaikie: This change is unneeded/can be removed, yeah?
Optional<uint64_t> getDWOId() const { return DWOId; }		Optional<uint64_t> getDWOId() const { return DWOId; }
void setDWOId(uint64_t Id) {		void setDWOId(uint64_t Id) {
assert((!DWOId \|\| *DWOId == Id) && "setting DWOId to a different value");		assert((!DWOId \|\| *DWOId == Id) && "setting DWOId to a different value");
DWOId = Id;		DWOId = Id;
}		}
const DWARFUnitIndex::Entry *getIndexEntry() const { return IndexEntry; }		const DWARFUnitIndex::Entry *getIndexEntry() const { return IndexEntry; }
uint64_t getTypeHash() const { return TypeHash; }		uint64_t getTypeHash() const { return TypeHash; }
uint64_t getTypeOffset() const { return TypeOffset; }		uint64_t getTypeOffset() const { return TypeOffset; }
▲ Show 20 Lines • Show All 481 Lines • Show Last 20 Lines

llvm/include/llvm/DebugInfo/DWARF/DWARFUnitIndex.h

Show First 20 Lines • Show All 118 Lines • ▼ Show 20 Lines	private:
const DWARFUnitIndex *Index;		const DWARFUnitIndex *Index;
uint64_t Signature;		uint64_t Signature;
std::unique_ptr<SectionContribution[]> Contributions;		std::unique_ptr<SectionContribution[]> Contributions;
friend class DWARFUnitIndex;		friend class DWARFUnitIndex;

public:		public:
const SectionContribution *getContribution(DWARFSectionKind Sec) const;		const SectionContribution *getContribution(DWARFSectionKind Sec) const;
const SectionContribution *getContribution() const;		const SectionContribution *getContribution() const;
		SectionContribution &getContribution();

const SectionContribution *getContributions() const {		const SectionContribution *getContributions() const {
return Contributions.get();		return Contributions.get();
}		}

uint64_t getSignature() const { return Signature; }		uint64_t getSignature() const { return Signature; }
		bool isValid() { return Index; }
};		};

private:		private:
struct Header Header;		struct Header Header;

DWARFSectionKind InfoColumnKind;		DWARFSectionKind InfoColumnKind;
int InfoColumn = -1;		int InfoColumn = -1;
std::unique_ptr<DWARFSectionKind[]> ColumnKinds;		std::unique_ptr<DWARFSectionKind[]> ColumnKinds;
Show All 24 Lines	public:

ArrayRef<DWARFSectionKind> getColumnKinds() const {		ArrayRef<DWARFSectionKind> getColumnKinds() const {
return makeArrayRef(ColumnKinds.get(), Header.NumColumns);		return makeArrayRef(ColumnKinds.get(), Header.NumColumns);
}		}

ArrayRef<Entry> getRows() const {		ArrayRef<Entry> getRows() const {
return makeArrayRef(Rows.get(), Header.NumBuckets);		return makeArrayRef(Rows.get(), Header.NumBuckets);
}		}

		MutableArrayRef<Entry> getMutableRows() {
		return makeMutableArrayRef(Rows.get(), Header.NumBuckets);
		}
};		};

} // end namespace llvm		} // end namespace llvm

#endif // LLVM_DEBUGINFO_DWARF_DWARFUNITINDEX_H		#endif // LLVM_DEBUGINFO_DWARF_DWARFUNITINDEX_H

llvm/lib/DebugInfo/DWARF/DWARFContext.cpp

Show First 20 Lines • Show All 51 Lines • ▼ Show 20 Lines
#include "llvm/Support/MemoryBuffer.h"		#include "llvm/Support/MemoryBuffer.h"
#include "llvm/Support/Path.h"		#include "llvm/Support/Path.h"
#include "llvm/Support/raw_ostream.h"		#include "llvm/Support/raw_ostream.h"
#include <algorithm>		#include <algorithm>
#include <cstdint>		#include <cstdint>
#include <deque>		#include <deque>
#include <map>		#include <map>
#include <string>		#include <string>
		#include <unordered_map>
		tschuettUnsubmitted Done Reply Inline Actions Is this include useless? tschuett: Is this include useless?
		ayermoloAuthorUnsubmitted Done Reply Inline Actions Thanks removed. Leftover from switch to DenseMap. ayermolo: Thanks removed. Leftover from switch to DenseMap.
#include <utility>		#include <utility>
#include <vector>		#include <vector>

using namespace llvm;		using namespace llvm;
using namespace dwarf;		using namespace dwarf;
using namespace object;		using namespace object;

#define DEBUG_TYPE "dwarf"		#define DEBUG_TYPE "dwarf"
▲ Show 20 Lines • Show All 708 Lines • ▼ Show 20 Lines	bool DWARFContext::verify(raw_ostream &OS, DIDumpOptions DumpOpts) {
if (DumpOpts.DumpType & DIDT_DebugInfo)		if (DumpOpts.DumpType & DIDT_DebugInfo)
Success &= verifier.handleDebugInfo();		Success &= verifier.handleDebugInfo();
if (DumpOpts.DumpType & DIDT_DebugLine)		if (DumpOpts.DumpType & DIDT_DebugLine)
Success &= verifier.handleDebugLine();		Success &= verifier.handleDebugLine();
Success &= verifier.handleAccelTables();		Success &= verifier.handleAccelTables();
return Success;		return Success;
}		}

		enum class IndexType { CUIndex, TUIndex };
		using EntryType = DWARFUnitIndex::Entry::SectionContribution;
		using EntryMap = std::unordered_map<uint32_t, EntryType>;
		dblaikieUnsubmitted Done Reply Inline Actions These could probably all sink into `fixupIndex` as locals? dblaikie: These could probably all sink into `fixupIndex` as locals?
		dblaikieUnsubmitted Done Reply Inline Actions Maybe DenseMap rather than unordered_map? dblaikie: Maybe DenseMap rather than unordered_map?
		void fixupIndex(const DWARFObject &DObj, DWARFContext &C,
		DWARFUnitIndex &Index) {
		EntryMap Map;
		if (DObj.getCUIndexSection().empty())
		return;

		uint64_t Offset = 0;
		uint32_t TruncOffset = 0;
		DObj.forEachInfoDWOSections([&](const DWARFSection &S) {
		if (!(C.getParseCUTUIndexManually() \|\|
		S.Data.size() >= std::numeric_limits<uint32_t>::max()))
		return;

		DWARFDataExtractor Data(DObj, S, C.isLittleEndian(), 0);
		while (Data.isValidOffset(Offset)) {
		DWARFUnitHeader Header;
		if (!Header.extract(C, Data, &Offset, DWARFSectionKind::DW_SECT_INFO)) {
		logAllUnhandledErrors(
		dblaikieUnsubmitted Done Reply Inline Actions Is this how error handling's generally done here? I think maybe the DWARFContext has error handling callbacks that are meant to be used? (& should probably propagate up a failure result through all of this rather than continuing with corrupt data?) dblaikie: Is this how error handling's generally done here? I think maybe the DWARFContext has error…
		ayermoloAuthorUnsubmitted Done Reply Inline Actions Changed it to logAllUnhandledErrors(createError()) for now To propagate up can chagne DWARFContext::get(CU,TU}Index() to return Expectec<DWARFUnitIndex>? I was going with something more localized to minimize the impact. I guess it comes down to philosophical question of whether if CU/TU index is partially corrupted we want to consider whole thing corrupted, or keep the current behavior of at least being able to access debug info below 4GB. ayermolo: Changed it to logAllUnhandledErrors(createError()) for now To propagate up can chagne…
		createError("Failed to parse CU header in DWP file"), errs());
		Map.clear();
		break;
		}

		auto Iter = Map.insert(
		{TruncOffset,
		dblaikieUnsubmitted Done Reply Inline Actions Could you rely on the version of the index? (version 2, I think, for pre-standard index, version 5 for the DWARFv5 standard index) rather than having to wait to parse a unit to see what version it has. There's currently no way to mix pre-standard and standard indexes, I think (owing to the valid columns accepted in each)? So that should be adequate. dblaikie: Could you rely on the version of the index? (version 2, I think, for pre-standard index…
		{(uint32_t)Header.getOffset(),
		(uint32_t)(Header.getNextUnitOffset() - Header.getOffset())}});
		if (!Iter.second) {
		logAllUnhandledErrors(
		createError("Collision occured between two truncated offsets"),
		errs());
		Map.clear();
		return;
		}

		Offset = Header.getNextUnitOffset();
		TruncOffset = Offset;
		}
		});

		if (Map.empty())
		return;

		for (DWARFUnitIndex::Entry &E : Index.getMutableRows()) {
		if (!E.isValid())
		continue;
		dblaikieUnsubmitted Done Reply Inline Actions Rather than exposing mutability in the index interface, could this whole function (fixupIndex) be moved into the index & performed there as part of parsing? dblaikie: Rather than exposing mutability in the index interface, could this whole function (fixupIndex)…
		ayermoloAuthorUnsubmitted Done Reply Inline Actions Wasn't part of the feedback from other diff is to minimize impact and not modify cu/tu index parsing, or did I miss understand? We then will need to modify parse to pass in context, and if we are parsing CU or TU. ayermolo: Wasn't part of the feedback from other diff is to minimize impact and not modify cu/tu index…
		dblaikieUnsubmitted Done Reply Inline Actions Fair enough - mixed feelings, but I'll rescind this piece at least. dblaikie: Fair enough - mixed feelings, but I'll rescind this piece at least.
		DWARFUnitIndex::Entry::SectionContribution &CUOff = E.getContribution();
		auto Iter = Map.find(CUOff.Offset);
		if (Iter == Map.end()) {
		errs() << "Could not find CU Offset in the Map\n";
		break;
		}
		CUOff.Offset = Iter->second.Offset;
		CUOff.Length = Iter->second.Length;
		}

		dblaikieUnsubmitted Done Reply Inline Actions any reason to believe the lengths would be incorrect? Perhaps we can limit the scope a bit by not touching those? dblaikie: any reason to believe the lengths would be incorrect? Perhaps we can limit the scope a bit by…
		ayermoloAuthorUnsubmitted Done Reply Inline Actions I didn't want to assume anything about the producer. If Offset is corrupt, depending how how length is calculated at least one might be corrupt also. Also we are overriding all the offsets, if we mess up on that, doesn't really matter if new length is correct or not. ayermolo: I didn't want to assume anything about the producer. If Offset is corrupt, depending how how…
		dblaikieUnsubmitted Done Reply Inline Actions I'd prefer to be a bit less permissive, really - to not end up creating more weird cases that systems might come to depend on. Maybe fail if the length doesn't match, until we know of any particular case with mismatched lengths that we understand enough to want to/figure out how to support? dblaikie: I'd prefer to be a bit less permissive, really - to not end up creating more weird cases that…
		return;
		}

const DWARFUnitIndex &DWARFContext::getCUIndex() {		const DWARFUnitIndex &DWARFContext::getCUIndex() {
if (CUIndex)		if (CUIndex)
return *CUIndex;		return *CUIndex;

DataExtractor CUIndexData(DObj->getCUIndexSection(), isLittleEndian(), 0);		DataExtractor CUIndexData(DObj->getCUIndexSection(), isLittleEndian(), 0);

CUIndex = std::make_unique<DWARFUnitIndex>(DW_SECT_INFO);		CUIndex = std::make_unique<DWARFUnitIndex>(DW_SECT_INFO);
CUIndex->parse(CUIndexData);		CUIndex->parse(CUIndexData);
		fixupIndex(DObj, this, *CUIndex.get());
return *CUIndex;		return *CUIndex;
}		}

const DWARFUnitIndex &DWARFContext::getTUIndex() {		const DWARFUnitIndex &DWARFContext::getTUIndex() {
if (TUIndex)		if (TUIndex)
return *TUIndex;		return *TUIndex;

DataExtractor TUIndexData(DObj->getTUIndexSection(), isLittleEndian(), 0);		DataExtractor TUIndexData(DObj->getTUIndexSection(), isLittleEndian(), 0);

TUIndex = std::make_unique<DWARFUnitIndex>(DW_SECT_EXT_TYPES);		TUIndex = std::make_unique<DWARFUnitIndex>(DW_SECT_EXT_TYPES);
TUIndex->parse(TUIndexData);		TUIndex->parse(TUIndexData);
		// If we are parsing TU-index and for .debug_types section we don't need
		// to do anything.
		if (CUIndex->getVersion() != 2)
		fixupIndex(DObj, this, *TUIndex.get());
		dblaikieUnsubmitted Not Done Reply Inline Actions Any idea what this comment/code were about checking the type index version? I don't see any reason this fixup wouldn't be relevant to DWARFv4 type unit indexes - though the fixup code would have to take the section to parse as a parameter, since it's currently hardcoded to the debug_info sections, not the debug_types sections. dblaikie: Any idea what this comment/code were about checking the type index version? I don't see any…
		ayermoloAuthorUnsubmitted Done Reply Inline Actions I don't quite remember why. I think it was to narrow the scope. Also don't we also need to call forEachTypesDWOSections if we want to handle .debug_types section? Why do you see overflows in that section also? ayermolo: I don't quite remember why. I think it was to narrow the scope. Also don't we also need to call…
		dblaikieUnsubmitted Not Done Reply Inline Actions We aren't having an issue here - since it's only DWARFv4+type units, and we're using DWARFv5 these days. I've been looking at adding DWARFv5 overflow recovery (which can be more robust than DWARFv4, since DWARFv5 can get the DWOID/type signature without needing the abbrev section) and just looking more closely at this code/seeing these quirks & wanted to understand it better. But, yeah, maybe just not worth supporting shrug no worries, thanks! dblaikie: We aren't having an issue here - since it's only DWARFv4+type units, and we're using DWARFv5…
		ayermoloAuthorUnsubmitted Done Reply Inline Actions Hopefully we will be moving to DWARF5 his year also. :D The code should work with DWARF5 + debug types, but you are right it could be made more robust by specializing and taking advantage of DWO ID being in a header. Not sure it's worth it either. Also I am not sure how often debug-types are used with DWARF5 since one has to choose between them and .debug_names. Although looking at https://reviews.llvm.org/D49420 should be easy to turn on now that llvm generates DWARF5 type units. ayermolo: Hopefully we will be moving to DWARF5 his year also. :D The code should work with DWARF5 +…
		dblaikieUnsubmitted Not Done Reply Inline Actions Hopefully we will be moving to DWARF5 his year also. :D fingers crossed The code should work with DWARF5 + debug types, but you are right it could be made more robust by specializing and taking advantage of DWO ID being in a header. Not sure it's worth it either. You mean DWARF4+debug types? It does already work with DWARF5 and debug types. Also I am not sure how often debug-types are used with DWARF5 since one has to choose between them and .debug_names. Although looking at https://reviews.llvm.org/D49420 should be easy to turn on now that llvm generates DWARF5 type units. It's what we're doing at Google, at least - we're still using gdb_index, not lldb - and when we use lldb it's without any index solution, so, yeah, slow... Hadn't realized there was that outstanding patch for debug_names for type units. I'll have to take a look... dblaikie: > Hopefully we will be moving to DWARF5 his year also. :D fingers crossed > The code should…
		dblaikieUnsubmitted Not Done Reply Inline Actions oh, looked at D49420, thought that was an outstanding patch for DWARFv5 debug_names support for type units, but it's the patch that enabled debug_names but not when type units are enabled. I think I looked at it recently and it's not quite as simple as removing the opt-out - there's some implementation complexity to address in building debug_names, also in terms of what we put in them... but yeah, needs some looking at/implementing for sure. dblaikie: oh, looked at D49420, thought that was an outstanding patch for DWARFv5 debug_names support for…
		ayermoloAuthorUnsubmitted Done Reply Inline Actions err sorry didn't phrase it correctly. Just affirming this will support DWARF5 + debug types. The patch is original one that went in a while back. Reading description acceleration table was disabled because at that time DWARF4 debug types were created by clang. Ah I see. How do you generate gdb_index with gdb-add-index? ayermolo: err sorry didn't phrase it correctly. Just affirming this will support DWARF5 + debug types.
		dblaikieUnsubmitted Not Done Reply Inline Actions Ah I see. How do you generate gdb_index with gdb-add-index? `-ggnu-pubnames` and `-Wl,--gdb-index` - so name lists in the object files then the linker generates the efficient index dblaikie: > Ah I see. How do you generate gdb_index with gdb-add-index? `-ggnu-pubnames` and `-Wl,--gdb…
		ayermoloAuthorUnsubmitted Done Reply Inline Actions ah gotcha version 7 of gdb-index that lld produces. thanks. ayermolo: ah gotcha version 7 of gdb-index that lld produces. thanks.
return *TUIndex;		return *TUIndex;
}		}

DWARFGdbIndex &DWARFContext::getGdbIndex() {		DWARFGdbIndex &DWARFContext::getGdbIndex() {
if (GdbIndex)		if (GdbIndex)
return *GdbIndex;		return *GdbIndex;

DataExtractor GdbIndexData(DObj->getGdbIndexSection(), true /LE/, 0);		DataExtractor GdbIndexData(DObj->getGdbIndexSection(), true /LE/, 0);
▲ Show 20 Lines • Show All 1,160 Lines • ▼ Show 20 Lines	const DWARFSection &getAppleTypesSection() const override {
return AppleTypesSection;		return AppleTypesSection;
}		}
const DWARFSection &getAppleNamespacesSection() const override {		const DWARFSection &getAppleNamespacesSection() const override {
return AppleNamespacesSection;		return AppleNamespacesSection;
}		}
const DWARFSection &getAppleObjCSection() const override {		const DWARFSection &getAppleObjCSection() const override {
return AppleObjCSection;		return AppleObjCSection;
}		}
const DWARFSection &getNamesSection() const override {		const DWARFSection &getNamesSection() const override { return NamesSection; }
		dblaikieUnsubmitted Done Reply Inline Actions looks like this unrelated change snuck in? dblaikie: looks like this unrelated change snuck in?
		ayermoloAuthorUnsubmitted Done Reply Inline Actions Ah yeah clang-format change. ayermolo: Ah yeah clang-format change.
return NamesSection;
}

StringRef getFileName() const override { return FileName; }		StringRef getFileName() const override { return FileName; }
uint8_t getAddressSize() const override { return AddressSize; }		uint8_t getAddressSize() const override { return AddressSize; }
void forEachInfoSections(		void forEachInfoSections(
function_ref<void(const DWARFSection &)> F) const override {		function_ref<void(const DWARFSection &)> F) const override {
for (auto &P : InfoSections)		for (auto &P : InfoSections)
F(P.second);		F(P.second);
}		}
▲ Show 20 Lines • Show All 58 Lines • Show Last 20 Lines

llvm/lib/DebugInfo/DWARF/DWARFUnitIndex.cpp

	Show First 20 Lines • Show All 247 Lines • ▼ Show 20 Lines
	DWARFUnitIndex::Entry::getContribution(DWARFSectionKind Sec) const {			DWARFUnitIndex::Entry::getContribution(DWARFSectionKind Sec) const {
	uint32_t i = 0;			uint32_t i = 0;
	for (; i != Index->Header.NumColumns; ++i)			for (; i != Index->Header.NumColumns; ++i)
	if (Index->ColumnKinds[i] == Sec)			if (Index->ColumnKinds[i] == Sec)
	return &Contributions[i];			return &Contributions[i];
	return nullptr;			return nullptr;
	}			}

				DWARFUnitIndex::Entry::SectionContribution &
				DWARFUnitIndex::Entry::getContribution() {
				return Contributions[Index->InfoColumn];
				}
				dblaikieUnsubmitted Done Reply Inline Actions Should this return by reference? dblaikie: Should this return by reference?
				ayermoloAuthorUnsubmitted Done Reply Inline Actions I was trying to keep same return type as the const version. Changed to reference. ayermolo: I was trying to keep same return type as the const version. Changed to reference.
				dblaikieUnsubmitted Done Reply Inline Actions oh, yeah, the other should probably change too... ifyou could do that in a separate patch, that'd be great dblaikie: oh, yeah, the other should probably change too... ifyou could do that in a separate patch…
				ayermoloAuthorUnsubmitted Done Reply Inline Actions Sounds good. ayermolo: Sounds good.

	const DWARFUnitIndex::Entry::SectionContribution *			const DWARFUnitIndex::Entry::SectionContribution *
	DWARFUnitIndex::Entry::getContribution() const {			DWARFUnitIndex::Entry::getContribution() const {
	return &Contributions[Index->InfoColumn];			return &Contributions[Index->InfoColumn];
	}			}

	const DWARFUnitIndex::Entry *			const DWARFUnitIndex::Entry *
	DWARFUnitIndex::getFromOffset(uint32_t Offset) const {			DWARFUnitIndex::getFromOffset(uint32_t Offset) const {
	if (OffsetLookup.empty()) {			if (OffsetLookup.empty()) {
	Show All 38 Lines

llvm/test/tools/llvm-dwp/X86/debug_macro_v5.s

	# This test checks the support for writing macro sections and their index (v5).			# This test checks the support for writing macro sections and their index (v5).

	# RUN: llvm-mc -triple x86_64-unknown-linux --filetype=obj --split-dwarf-file=%t.dwo -dwarf-version=5 %s -o %t.o			# RUN: llvm-mc -triple x86_64-unknown-linux --filetype=obj --split-dwarf-file=%t.dwo -dwarf-version=5 %s -o %t.o
	# RUN: llvm-dwp %t.dwo -o %t.dwp 2>&1			# RUN: llvm-dwp %t.dwo -o %t.dwp 2>&1
	# RUN: llvm-dwarfdump -debug-macro -debug-cu-index %t.dwp \| FileCheck %s			# RUN: llvm-dwarfdump -debug-macro -debug-cu-index %t.dwp \| FileCheck -check-prefix=CHECK %s
				# RUN: llvm-dwarfdump -debug-macro -debug-cu-index -manaully-generate-cu-tu-index %t.dwp \| FileCheck -check-prefix=CHECK2 %s

	# CHECK-DAG: .debug_macro.dwo contents:			# CHECK-DAG: .debug_macro.dwo contents:
	# CHECK: macro header: version = 0x0005, flags = 0x00, format = DWARF32			# CHECK: macro header: version = 0x0005, flags = 0x00, format = DWARF32
	# CHECK-NEXT: DW_MACRO_start_file - lineno: 0 filenum: 0			# CHECK-NEXT: DW_MACRO_start_file - lineno: 0 filenum: 0
	# CHECK-NEXT: DW_MACRO_define_strx - lineno: 1 macro: x 5			# CHECK-NEXT: DW_MACRO_define_strx - lineno: 1 macro: x 5
	# CHECK-NEXT: DW_MACRO_end_file			# CHECK-NEXT: DW_MACRO_end_file

	# CHECK-DAG: .debug_cu_index contents:			# CHECK-DAG: .debug_cu_index contents:
	# CHECK-NEXT: version = 5, units = 1, slots = 2			# CHECK-NEXT: version = 5, units = 1, slots = 2
	# CHECK: Index Signature INFO ABBREV STR_OFFSETS MACRO			# CHECK: Index Signature INFO ABBREV STR_OFFSETS MACRO
	# CHECK: 1 0x0000000000000000 [0x00000000, 0x00000019) [0x00000000, 0x00000008) [0x00000000, 0x0000000c) [0x00000000, 0x0000000b)			# CHECK: 1 0x0000000000000000 [0x00000000, 0x00000019) [0x00000000, 0x00000008) [0x00000000, 0x0000000c) [0x00000000, 0x0000000b)

				# CHECK2: Index Signature INFO ABBREV STR_OFFSETS MACRO
				# CHECK2: 1 0x0000000000000000 [0x00000000, 0x00000019) [0x00000000, 0x00000008) [0x00000000, 0x0000000c) [0x00000000, 0x0000000b)

	.section .debug_info.dwo,"e",@progbits			.section .debug_info.dwo,"e",@progbits
	.long .Ldebug_info_dwo_end0-.Ldebug_info_dwo_start0 # Length of Unit			.long .Ldebug_info_dwo_end0-.Ldebug_info_dwo_start0 # Length of Unit
	.Ldebug_info_dwo_start0:			.Ldebug_info_dwo_start0:
	.short 5 # DWARF version number			.short 5 # DWARF version number
	.byte 5 # DWARF Unit Type (DW_UT_split_compile)			.byte 5 # DWARF Unit Type (DW_UT_split_compile)
	.byte 8 # Address Size (in bytes)			.byte 8 # Address Size (in bytes)
	.long 0 # Offset Into Abbrev. Section			.long 0 # Offset Into Abbrev. Section
	.quad 0			.quad 0
	Show All 30 Lines

llvm/test/tools/llvm-dwp/X86/info-v5.s

	# this checks llvm-dwp handling of DWARFv5 Info section header.			# this checks llvm-dwp handling of DWARFv5 Info section header.

	# RUN: llvm-mc --triple=x86_64-unknown-linux --filetype=obj --split-dwarf-file=%t.dwo -dwarf-version=5 %s -o %t.o			# RUN: llvm-mc --triple=x86_64-unknown-linux --filetype=obj --split-dwarf-file=%t.dwo -dwarf-version=5 %s -o %t.o

	# RUN: llvm-dwp %t.dwo -o %t.dwp			# RUN: llvm-dwp %t.dwo -o %t.dwp
	# RUN: llvm-dwarfdump -v %t.dwp \| FileCheck %s			# RUN: llvm-dwarfdump -v %t.dwp \| FileCheck -check-prefix=CHECK %s
				# RUN: llvm-dwarfdump -manaully-generate-cu-tu-index -v %t.dwp \| FileCheck -check-prefix=CHECK2 %s

	#CHECK-DAG: .debug_info.dwo contents:			#CHECK-DAG: .debug_info.dwo contents:
	#CHECK: 0x00000000: Compile Unit: length = 0x00000050, format = DWARF32, version = 0x0005, unit_type = DW_UT_split_compile, abbr_offset = 0x0000, addr_size = 0x08, DWO_id = [[DWOID:.*]] (next unit at 0x00000054)			#CHECK: 0x00000000: Compile Unit: length = 0x00000050, format = DWARF32, version = 0x0005, unit_type = DW_UT_split_compile, abbr_offset = 0x0000, addr_size = 0x08, DWO_id = [[DWOID:.*]] (next unit at 0x00000054)

	# CHECK-DAG: .debug_cu_index contents:			# CHECK-DAG: .debug_cu_index contents:
	# CHECK: version = 5, units = 1, slots = 2			# CHECK: version = 5, units = 1, slots = 2
	# CHECK: Index Signature INFO ABBREV			# CHECK: Index Signature INFO ABBREV
	# CHECK: 1 [[DWOID]] [0x00000000, 0x00000054) [0x00000000, 0x0000002a)			# CHECK: 1 [[DWOID]] [0x00000000, 0x00000054) [0x00000000, 0x0000002a)

				#CHECK2-DAG: .debug_info.dwo contents:
				#CHECK2: 0x00000000: Compile Unit: length = 0x00000050, format = DWARF32, version = 0x0005, unit_type = DW_UT_split_compile, abbr_offset = 0x0000, addr_size = 0x08, DWO_id = [[DWOID2:.*]] (next unit at 0x00000054)

				# CHECK2-DAG: .debug_cu_index contents:
				# CHECK2: version = 5, units = 1, slots = 2
				# CHECK2: Index Signature INFO
				# CHECK21: 1 [[DWOID2]] [0x00000000, 0x00000054) [0x00000000, 0x0000002a)

	.section .debug_info.dwo,"e",@progbits			.section .debug_info.dwo,"e",@progbits
	.long .Ldebug_info_dwo_end0-.Ldebug_info_dwo_start0 # Length of Unit			.long .Ldebug_info_dwo_end0-.Ldebug_info_dwo_start0 # Length of Unit
	.Ldebug_info_dwo_start0:			.Ldebug_info_dwo_start0:
	.short 5 # DWARF version number			.short 5 # DWARF version number
	.byte 5 # DWARF Unit Type			.byte 5 # DWARF Unit Type
	.byte 8 # Address Size (in bytes)			.byte 8 # Address Size (in bytes)
	.long 0 # Offset Into Abbrev. Section			.long 0 # Offset Into Abbrev. Section
	.quad -1173350285159172090			.quad -1173350285159172090
	▲ Show 20 Lines • Show All 63 Lines • Show Last 20 Lines

llvm/test/tools/llvm-dwp/X86/loclists.s

	# This test checks if llvm-dwp outputs .debug_loclists.			# This test checks if llvm-dwp outputs .debug_loclists.

	# RUN: llvm-mc -triple x86_64-unknown-linux %s -filetype=obj -o %t.o \			# RUN: llvm-mc -triple x86_64-unknown-linux %s -filetype=obj -o %t.o \
	# RUN: -split-dwarf-file=%t.dwo -dwarf-version=5			# RUN: -split-dwarf-file=%t.dwo -dwarf-version=5
	# RUN: llvm-dwp %t.dwo -o %t.dwp			# RUN: llvm-dwp %t.dwo -o %t.dwp
	# RUN: llvm-dwarfdump -debug-loclists -debug-cu-index -debug-tu-index %t.dwp \| FileCheck %s			# RUN: llvm-dwarfdump -debug-loclists -debug-cu-index -debug-tu-index %t.dwp \| FileCheck -check-prefix=CHECK %s
				# RUN: llvm-dwarfdump -debug-cu-index -debug-tu-index -manaully-generate-cu-tu-index %t.dwp \| FileCheck -check-prefix=CHECK2 %s

	# CHECK-DAG: .debug_loclists.dwo contents:			# CHECK-DAG: .debug_loclists.dwo contents:
	# CHECK: locations list header: length = 0x00000019, format = DWARF32, version = 0x0005, addr_size = 0x08, seg_size = 0x00, offset_entry_count = 0x00000001			# CHECK: locations list header: length = 0x00000019, format = DWARF32, version = 0x0005, addr_size = 0x08, seg_size = 0x00, offset_entry_count = 0x00000001
	# CHECK-NEXT: offsets: [			# CHECK-NEXT: offsets: [
	# CHECK-NEXT: 0x00000004			# CHECK-NEXT: 0x00000004
	# CHECK-NEXT: ]			# CHECK-NEXT: ]
	# CHECK: DW_LLE_base_addressx (0x0000000000000000)			# CHECK: DW_LLE_base_addressx (0x0000000000000000)
	# CHECK-NEXT: DW_LLE_offset_pair (0x0000000000000000, 0x0000000000000004): DW_OP_reg5 RDI			# CHECK-NEXT: DW_LLE_offset_pair (0x0000000000000000, 0x0000000000000004): DW_OP_reg5 RDI
	# CHECK-NEXT: DW_LLE_offset_pair (0x0000000000000004, 0x0000000000000008): DW_OP_reg3 RBX			# CHECK-NEXT: DW_LLE_offset_pair (0x0000000000000004, 0x0000000000000008): DW_OP_reg3 RBX

	# CHECK-DAG: .debug_cu_index contents:			# CHECK-DAG: .debug_cu_index contents:
	# CHECK: Index Signature INFO ABBREV LOCLISTS			# CHECK: Index Signature INFO ABBREV LOCLISTS
	# CHECK: 1 {{.*}} [0x00000018, 0x0000002d) [0x00000000, 0x00000004) [0x00000000, 0x0000001d)			# CHECK: 1 {{.*}} [0x00000018, 0x0000002d) [0x00000000, 0x00000004) [0x00000000, 0x0000001d)

	# CHECK-DAG: .debug_tu_index contents:			# CHECK-DAG: .debug_tu_index contents:
	# CHECK: Index Signature INFO ABBREV LOCLISTS			# CHECK: Index Signature INFO ABBREV LOCLISTS
	# CHECK: 2 {{.*}} [0x00000000, 0x00000018) [0x00000000, 0x00000004) [0x00000000, 0x0000001d)			# CHECK: 2 {{.*}} [0x00000000, 0x00000018) [0x00000000, 0x00000004) [0x00000000, 0x0000001d)

				# CHECK2-DAG: .debug_cu_index contents:
				# CHECK2: Index Signature INFO ABBREV LOCLISTS
				# CHECK2: 1 {{.*}} [0x00000018, 0x0000002d) [0x00000000, 0x00000004) [0x00000000, 0x0000001d)

				# CHECK2-DAG: .debug_tu_index contents:
				# CHECK2: Index Signature INFO ABBREV LOCLISTS
				# CHECK2: 2 {{.*}} [0x00000000, 0x00000018) [0x00000000, 0x00000004) [0x00000000, 0x0000001d)

	.section .debug_info.dwo,"e",@progbits			.section .debug_info.dwo,"e",@progbits
	.long .Ldebug_info_dwo_end0-.Ldebug_info_dwo_start0 # Length of Unit			.long .Ldebug_info_dwo_end0-.Ldebug_info_dwo_start0 # Length of Unit
	.Ldebug_info_dwo_start0:			.Ldebug_info_dwo_start0:
	.short 5 # DWARF version number			.short 5 # DWARF version number
	.byte 6 # DWARF Unit Type			.byte 6 # DWARF Unit Type
	.byte 8 # Address Size (in bytes)			.byte 8 # Address Size (in bytes)
	.long 0 # Offset Into Abbrev. Section			.long 0 # Offset Into Abbrev. Section
	.quad -4287463584810542331 # Type Signature			.quad -4287463584810542331 # Type Signature
	▲ Show 20 Lines • Show All 41 Lines • Show Last 20 Lines

llvm/test/tools/llvm-dwp/X86/rnglists.s

	# This test checks if llvm-dwp outputs .debug_rnglists.			# This test checks if llvm-dwp outputs .debug_rnglists.

	# RUN: llvm-mc -triple x86_64-unknown-linux %s -filetype=obj -o %t.o \			# RUN: llvm-mc -triple x86_64-unknown-linux %s -filetype=obj -o %t.o \
	# RUN: -split-dwarf-file=%t.dwo -dwarf-version=5			# RUN: -split-dwarf-file=%t.dwo -dwarf-version=5
	# RUN: llvm-dwp %t.dwo -o %t.dwp			# RUN: llvm-dwp %t.dwo -o %t.dwp
	# RUN: llvm-dwarfdump -debug-rnglists -debug-cu-index -debug-tu-index %t.dwp \| FileCheck %s			# RUN: llvm-dwarfdump -debug-rnglists -debug-cu-index -debug-tu-index %t.dwp \| FileCheck -check-prefix=CHECK %s
				# RUN: llvm-dwarfdump -debug-cu-index -debug-tu-index -manaully-generate-cu-tu-index %t.dwp \| FileCheck -check-prefix=CHECK2 %s

	# CHECK-DAG: .debug_cu_index contents:			# CHECK-DAG: .debug_cu_index contents:
	# CHECK: Index Signature INFO ABBREV RNGLISTS			# CHECK: Index Signature INFO ABBREV RNGLISTS
	# CHECK: 1 {{.*}} [0x00000018, 0x0000002d) [0x00000000, 0x00000004) [0x00000000, 0x00000017)			# CHECK: 1 {{.*}} [0x00000018, 0x0000002d) [0x00000000, 0x00000004) [0x00000000, 0x00000017)

	# CHECK-DAG: .debug_tu_index contents:			# CHECK-DAG: .debug_tu_index contents:
	# CHECK: Index Signature INFO ABBREV RNGLISTS			# CHECK: Index Signature INFO ABBREV RNGLISTS
	# CHECK: 2 {{.*}} [0x00000000, 0x00000018) [0x00000000, 0x00000004) [0x00000000, 0x00000017)			# CHECK: 2 {{.*}} [0x00000000, 0x00000018) [0x00000000, 0x00000004) [0x00000000, 0x00000017)

	# CHECK-DAG: .debug_rnglists.dwo contents:			# CHECK-DAG: .debug_rnglists.dwo contents:
	# range list header: length = 0x00000013, format = DWARF32, version = 0x0005, addr_size = 0x08, seg_size = 0x00, offset_entry_count = 0x00000001			# range list header: length = 0x00000013, format = DWARF32, version = 0x0005, addr_size = 0x08, seg_size = 0x00, offset_entry_count = 0x00000001
	# CHECK: offsets: [			# CHECK: offsets: [
	# CHECK-NEXT: 0x00000004			# CHECK-NEXT: 0x00000004
	# CHECK-NEXT: ]			# CHECK-NEXT: ]
	# CHECK-NEXT: ranges:			# CHECK-NEXT: ranges:
	# CHECK-NEXT: [0x0000000000000004, 0x0000000000000008)			# CHECK-NEXT: [0x0000000000000004, 0x0000000000000008)
	# CHECK-NEXT: [0x000000000000000c, 0x0000000000000010)			# CHECK-NEXT: [0x000000000000000c, 0x0000000000000010)

				# CHECK2-DAG: .debug_cu_index contents:
				# CHECK2: Index Signature INFO ABBREV RNGLISTS
				# CHECK2: 1 {{.*}} [0x00000018, 0x0000002d) [0x00000000, 0x00000004) [0x00000000, 0x00000017)

				# CHECK2-DAG: .debug_tu_index contents:
				# CHECK2: Index Signature INFO ABBREV RNGLISTS
				# CHECK2: 2 {{.*}} [0x00000000, 0x00000018) [0x00000000, 0x00000004) [0x00000000, 0x00000017)

	.section .debug_info.dwo,"e",@progbits			.section .debug_info.dwo,"e",@progbits
	.long .Ldebug_info_dwo_end0-.Ldebug_info_dwo_start0 # Length of Unit			.long .Ldebug_info_dwo_end0-.Ldebug_info_dwo_start0 # Length of Unit
	.Ldebug_info_dwo_start0:			.Ldebug_info_dwo_start0:
	.short 5 # DWARF version number			.short 5 # DWARF version number
	.byte 6 # DWARF Unit Type			.byte 6 # DWARF Unit Type
	.byte 8 # Address Size (in bytes)			.byte 8 # Address Size (in bytes)
	.long 0 # Offset Into Abbrev. Section			.long 0 # Offset Into Abbrev. Section
	.quad -4287463584810542331 # Type Signature			.quad -4287463584810542331 # Type Signature
	Show All 35 Lines

llvm/test/tools/llvm-dwp/X86/tu_units_v5.s

	# This test checks if llvm-dwp can correctly generate the tu index section (v5).			# This test checks if llvm-dwp can correctly generate the tu index section (v5).

	# RUN: llvm-mc -triple x86_64-unknown-linux %s -filetype=obj -o %t.o \			# RUN: llvm-mc -triple x86_64-unknown-linux %s -filetype=obj -o %t.o \
	# RUN: -split-dwarf-file=%t.dwo -dwarf-version=5			# RUN: -split-dwarf-file=%t.dwo -dwarf-version=5
	# RUN: llvm-dwp %t.dwo -o %t.dwp			# RUN: llvm-dwp %t.dwo -o %t.dwp
	# RUN: llvm-dwarfdump -debug-info -debug-tu-index %t.dwp \| FileCheck %s			# RUN: llvm-dwarfdump -debug-info -debug-tu-index %t.dwp \| FileCheck -check-prefix=CHECK %s
				# RUN2: llvm-dwarfdump -debug-info -debug-tu-index -manaully-generate-cu-tu-index %t.dwp \| FileCheck -check-prefix=CHECK2 %s

	## Note: In order to check whether the type unit index is generated			## Note: In order to check whether the type unit index is generated
	## there is no need to add the missing DIEs for the structure type of the type unit.			## there is no need to add the missing DIEs for the structure type of the type unit.

	# CHECK-DAG: .debug_info.dwo contents:			# CHECK-DAG: .debug_info.dwo contents:
	# CHECK: 0x00000000: Type Unit: length = 0x00000017, format = DWARF32, version = 0x0005, unit_type = DW_UT_split_type, abbr_offset = 0x0000, addr_size = 0x08, name = '', type_signature = [[TUID1:.*]], type_offset = 0x0019 (next unit at 0x0000001b)			# CHECK: 0x00000000: Type Unit: length = 0x00000017, format = DWARF32, version = 0x0005, unit_type = DW_UT_split_type, abbr_offset = 0x0000, addr_size = 0x08, name = '', type_signature = [[TUID1:.*]], type_offset = 0x0019 (next unit at 0x0000001b)
	# CHECK: 0x0000001b: Type Unit: length = 0x00000017, format = DWARF32, version = 0x0005, unit_type = DW_UT_split_type, abbr_offset = 0x0000, addr_size = 0x08, name = '', type_signature = [[TUID2:.*]], type_offset = 0x0019 (next unit at 0x00000036)			# CHECK: 0x0000001b: Type Unit: length = 0x00000017, format = DWARF32, version = 0x0005, unit_type = DW_UT_split_type, abbr_offset = 0x0000, addr_size = 0x08, name = '', type_signature = [[TUID2:.*]], type_offset = 0x0019 (next unit at 0x00000036)
	# CHECK-DAG: .debug_tu_index contents:			# CHECK-DAG: .debug_tu_index contents:
	# CHECK: version = 5, units = 2, slots = 4			# CHECK: version = 5, units = 2, slots = 4
	# CHECK: Index Signature INFO ABBREV			# CHECK: Index Signature INFO ABBREV
	# CHECK: 1 [[TUID1]] [0x00000000, 0x0000001b) [0x00000000, 0x00000010)			# CHECK: 1 [[TUID1]] [0x00000000, 0x0000001b) [0x00000000, 0x00000010)
	# CHECK: 4 [[TUID2]] [0x0000001b, 0x00000036) [0x00000000, 0x00000010)			# CHECK: 4 [[TUID2]] [0x0000001b, 0x00000036) [0x00000000, 0x00000010)

				# CHECK2-DAG: .debug_info.dwo contents:
				# CHECK2: 0x00000000: Type Unit: length = 0x00000017, format = DWARF32, version = 0x0005, unit_type = DW_UT_split_type, abbr_offset = 0x0000, addr_size = 0x08, name = '', type_signature = [[TUID1:.*]], type_offset = 0x0019 (next unit at 0x0000001b)
				# CHECK2: 0x0000001b: Type Unit: length = 0x00000017, format = DWARF32, version = 0x0005, unit_type = DW_UT_split_type, abbr_offset = 0x0000, addr_size = 0x08, name = '', type_signature = [[TUID2:.*]], type_offset = 0x0019 (next unit at 0x00000036)
				# CHECK2-DAG: .debug_tu_index contents:
				# CHECK2: version = 5, units = 2, slots = 4
				# CHECK2: Index Signature INFO ABBREV
				# CHECK2: 1 [[TUID1]] [0x00000000, 0x0000001b) [0x00000000, 0x00000010)
				# CHECK2: 4 [[TUID2]] [0x0000001b, 0x00000036) [0x00000000, 0x00000010)

	.section .debug_info.dwo,"e",@progbits			.section .debug_info.dwo,"e",@progbits
	.long .Ldebug_info_dwo_end0-.Ldebug_info_dwo_start0 # Length of Unit			.long .Ldebug_info_dwo_end0-.Ldebug_info_dwo_start0 # Length of Unit
	.Ldebug_info_dwo_start0:			.Ldebug_info_dwo_start0:
	.short 5 # DWARF version number			.short 5 # DWARF version number
	.byte 6 # DWARF Unit Type (DW_UT_split_type)			.byte 6 # DWARF Unit Type (DW_UT_split_type)
	.byte 8 # Address Size (in bytes)			.byte 8 # Address Size (in bytes)
	.long 0 # Offset Into Abbrev. Section			.long 0 # Offset Into Abbrev. Section
	.quad 5657452045627120676 # Type Signature			.quad 5657452045627120676 # Type Signature
	▲ Show 20 Lines • Show All 45 Lines • Show Last 20 Lines

llvm/test/tools/llvm-dwp/X86/type_dedup.test

	RUN: llvm-dwp %p/../Inputs/type_dedup/a.dwo %p/../Inputs/type_dedup/b.dwo -o %t			RUN: llvm-dwp %p/../Inputs/type_dedup/a.dwo %p/../Inputs/type_dedup/b.dwo -o %t
	RUN: llvm-dwarfdump -v %t \| FileCheck %s			RUN: llvm-dwarfdump -v %t \| FileCheck -check-prefix=CHECK %s
				RUN: llvm-dwarfdump -v -manaully-generate-cu-tu-index %t \| FileCheck -check-prefix=CHECK2 %s
	RUN: llvm-dwp %p/../Inputs/type_dedup/b.dwo -o %tb.dwp			RUN: llvm-dwp %p/../Inputs/type_dedup/b.dwo -o %tb.dwp
	RUN: llvm-dwp %p/../Inputs/type_dedup/a.dwo %tb.dwp -o %t			RUN: llvm-dwp %p/../Inputs/type_dedup/a.dwo %tb.dwp -o %t
	RUN: llvm-dwarfdump -v %t \| FileCheck %s			RUN: llvm-dwarfdump -v %t \| FileCheck -check-prefix=CHECK %s
				RUN: llvm-dwarfdump -v -manaully-generate-cu-tu-index %t \| FileCheck -check-prefix=CHECK2 %s

	a.cpp:			a.cpp:
	struct common { };			struct common { };
	common a1;			common a1;
	struct adistinct { };			struct adistinct { };
	adistinct a2;			adistinct a2;

	b.cpp:			b.cpp:
	Show All 17 Lines
	CHECK: DW_AT_name {{.*}} "adistinct"			CHECK: DW_AT_name {{.*}} "adistinct"
	CHECK: [[BUOFF]]:			CHECK: [[BUOFF]]:
	CHECK-LABEL: Type Unit: length = 0x00000020, format = DWARF32, version = 0x0004, abbr_offset =			CHECK-LABEL: Type Unit: length = 0x00000020, format = DWARF32, version = 0x0004, abbr_offset =
	CHECK: 0x{{.}}, addr_size = 0x08, name = 'bdistinct', type_signature = [[BSIG:0x[0-9a-f]]], type_offset = 0x[[BOFF:.]] (next unit at [[XUOFF:.]])			CHECK: 0x{{.}}, addr_size = 0x08, name = 'bdistinct', type_signature = [[BSIG:0x[0-9a-f]]], type_offset = 0x[[BOFF:.]] (next unit at [[XUOFF:.]])
	CHECK: DW_TAG_type_unit			CHECK: DW_TAG_type_unit
	CHECK: 0x00000066: DW_TAG_structure_type			CHECK: 0x00000066: DW_TAG_structure_type
	CHECK: DW_AT_name {{.*}} "bdistinct"			CHECK: DW_AT_name {{.*}} "bdistinct"
	CHECK-NOT: Type Unit			CHECK-NOT: Type Unit

				CHECK2-LABEL: .debug_types.dwo contents:
				CHECK2: [[COMMONUOFF:0x[0-9a-f]*]]:
				CHECK2-LABEL: Type Unit: length = 0x00000020, format = DWARF32, version = 0x0004, abbr_offset =
				CHECK2: 0x0000, addr_size = 0x08, name = 'common', type_signature = [[COMMONSIG:0x[0-9a-f]]], type_offset = 0x[[COMMONOFF:.]] (next unit at [[AUOFF:.*]])
				CHECK2: DW_TAG_type_unit
				CHECK2: [[COMMONOFF]]: DW_TAG_structure_type
				CHECK2: DW_AT_name {{.*}} "common"
				CHECK2: [[AUOFF]]:
				CHECK2-LABEL: Type Unit: length = 0x00000020, format = DWARF32, version = 0x0004, abbr_offset =
				CHECK2: 0x0000, addr_size = 0x08, name = 'adistinct', type_signature = [[ASIG:0x[0-9a-f]]], type_offset = 0x[[AOFF:.]] (next unit at [[BUOFF:.*]])
				CHECK2: DW_TAG_type_unit
				CHECK2: 0x00000042: DW_TAG_structure_type
				CHECK2: DW_AT_name {{.*}} "adistinct"
				CHECK2: [[BUOFF]]:
				CHECK2-LABEL: Type Unit: length = 0x00000020, format = DWARF32, version = 0x0004, abbr_offset =
				CHECK2: 0x{{.}}, addr_size = 0x08, name = 'bdistinct', type_signature = [[BSIG:0x[0-9a-f]]], type_offset = 0x[[BOFF:.]] (next unit at [[XUOFF:.]])
				CHECK2: DW_TAG_type_unit
				CHECK2: 0x00000066: DW_TAG_structure_type
				CHECK2: DW_AT_name {{.*}} "bdistinct"
				CHECK2-NOT: Type Unit

llvm/tools/llvm-dwarfdump/llvm-dwarfdump.cpp

Show First 20 Lines • Show All 249 Lines • ▼ Show 20 Lines	static cl::opt<bool>
ShowSectionSizes("show-section-sizes",		ShowSectionSizes("show-section-sizes",
cl::desc("Show the sizes of all debug sections, "		cl::desc("Show the sizes of all debug sections, "
"expressed in bytes."),		"expressed in bytes."),
cat(DwarfDumpCategory));		cat(DwarfDumpCategory));
static cl::opt<bool>		static cl::opt<bool>
ShowSources("show-sources",		ShowSources("show-sources",
cl::desc("Show the sources across all compilation units."),		cl::desc("Show the sources across all compilation units."),
cat(DwarfDumpCategory));		cat(DwarfDumpCategory));
		static cl::opt<bool> ManuallyGenerateUnitIndex(
		"manaully-generate-unit-index",
		dblaikieUnsubmitted Done Reply Inline Actions Maybe, to avoid the "CU/TU/etc" Could use "Unit" consistently in both flag name and opt variable, etc. (so there's no confusion that maybe it's specifically only for the CUIndex and not the TUIndex) dblaikie: Maybe, to avoid the "CU/TU/etc" Could use "Unit" consistently in both flag name and opt…
		cl::desc("if the input is dwp file, parse .debug_info "
		"section and use it to populate "
		dblaikieUnsubmitted Done Reply Inline Actions maybe ".debug_info" rather than "debug info"? dblaikie: maybe ".debug_info" rather than "debug info"?
		"DW_SECT_INFO contributions in cu-index. "
		dblaikieUnsubmitted Done Reply Inline Actions Maybe DW_SECT_INFO rather than "DEBUG_INFO"? dblaikie: Maybe DW_SECT_INFO rather than "DEBUG_INFO"?
		"For DWARF5 it also populated TU Index."),
		cl::init(false), cl::Hidden, cl::cat(DwarfDumpCategory));
// facebook begin D13311561		// facebook begin D13311561
static cl::opt<bool>		static cl::opt<bool>
Analyze("analyze",		Analyze("analyze",
cl::desc("Analyze the DWARF and report detailed information."),		cl::desc("Analyze the DWARF and report detailed information."),
cl::cat(AnalyzeCategory));		cl::cat(AnalyzeCategory));
// facebook end		// facebook end
static opt<bool> Verify("verify", desc("Verify the DWARF debug info."),		static opt<bool> Verify("verify", desc("Verify the DWARF debug info."),
cat(DwarfDumpCategory));		cat(DwarfDumpCategory));
▲ Show 20 Lines • Show All 398 Lines • ▼ Show 20 Lines	auto RecoverableErrorHandler = [&](Error E) {
Result = false;		Result = false;
WithColor::defaultErrorHandler(std::move(E));		WithColor::defaultErrorHandler(std::move(E));
};		};
if (auto *Obj = dyn_cast<ObjectFile>(BinOrErr->get())) {		if (auto *Obj = dyn_cast<ObjectFile>(BinOrErr->get())) {
if (filterArch(*Obj)) {		if (filterArch(*Obj)) {
std::unique_ptr<DWARFContext> DICtx = DWARFContext::create(		std::unique_ptr<DWARFContext> DICtx = DWARFContext::create(
*Obj, DWARFContext::ProcessDebugRelocations::Process, nullptr, "",		*Obj, DWARFContext::ProcessDebugRelocations::Process, nullptr, "",
RecoverableErrorHandler);		RecoverableErrorHandler);
		DICtx->setParseCUTUIndexManually(ManuallyGenerateUnitIndex);
if (!HandleObj(Obj, DICtx, Filename, OS))		if (!HandleObj(Obj, DICtx, Filename, OS))
Result = false;		Result = false;
}		}
} else if (auto *Fat = dyn_cast<MachOUniversalBinary>(BinOrErr->get()))		} else if (auto *Fat = dyn_cast<MachOUniversalBinary>(BinOrErr->get()))
for (auto &ObjForArch : Fat->objects()) {		for (auto &ObjForArch : Fat->objects()) {
std::string ObjName =		std::string ObjName =
(Filename + "(" + ObjForArch.getArchFlagName() + ")").str();		(Filename + "(" + ObjForArch.getArchFlagName() + ")").str();
if (auto MachOOrErr = ObjForArch.getAsObjectFile()) {		if (auto MachOOrErr = ObjForArch.getAsObjectFile()) {
▲ Show 20 Lines • Show All 137 Lines • Show Last 20 Lines

This is an archive of the discontinued LLVM Phabricator instance.

[DWARFLibrary] Add support to re-construct cu-indexClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 475249

llvm/include/llvm/DebugInfo/DWARF/DWARFContext.h

llvm/include/llvm/DebugInfo/DWARF/DWARFUnit.h

llvm/include/llvm/DebugInfo/DWARF/DWARFUnitIndex.h

llvm/lib/DebugInfo/DWARF/DWARFContext.cpp

llvm/lib/DebugInfo/DWARF/DWARFUnitIndex.cpp

llvm/test/tools/llvm-dwp/X86/debug_macro_v5.s

llvm/test/tools/llvm-dwp/X86/info-v5.s

llvm/test/tools/llvm-dwp/X86/loclists.s

llvm/test/tools/llvm-dwp/X86/rnglists.s

llvm/test/tools/llvm-dwp/X86/tu_units_v5.s

llvm/test/tools/llvm-dwp/X86/type_dedup.test

llvm/tools/llvm-dwarfdump/llvm-dwarfdump.cpp

[DWARFLibrary] Add support to re-construct cu-index
ClosedPublic