This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
include/llvm/
-
llvm/
-
BinaryFormat/
-
Dwarf.h
-
DebugInfo/DWARF/
-
DWARF/
3/5
DWARFDebugLoc.h
-
lib/
-
BinaryFormat/
1
Dwarf.cpp
-
DebugInfo/DWARF/
-
DWARF/
-
DWARFContext.cpp
8/13
DWARFDebugLoc.cpp
-
DWARFDie.cpp
-
test/
-
CodeGen/X86/
-
X86/
3/5
debug-loclists.ll
-
DebugInfo/X86/
-
X86/
-
dwarfdump-debug-loclists-error-cases2.s
1/3
dwarfdump-debug-loclists.test
1/2
fission-ranges.ll
-
loclists-dwp.ll
-
tools/llvm-dwarfdump/X86/
-
llvm-dwarfdump/
-
X86/
-
debug_loc_dwo.s
-
debug_loclists_startx_length.s

Differential D68270

DWARFDebugLoc: Add a function to get the address range of an entry
AbandonedPublic

Authored by labath on Oct 1 2019, 7:21 AM.

Download Raw Diff

Details

Reviewers

JDevlieghere
dblaikie
probinson

Summary

Interpreting a .debug_loclists entry is not completely trivial [citation
needed]. This patch creates a function which can be used by any
libDebugInfo user (thinking of LLDB mainly) to get the range of an
entry.

The debug_loclists parser already contained a partial implementation of
that in the dump function. This implementation is replaced by a call to
the new "getRange" function, and it falls back to printing of raw data
in case we fail to get the address range.

Because LLDB is not fully converted to llvm's debug info parser, I
provide two getRange signatures: one takes a DWARFUnit*, which is used
to resolve .debug_addr references; and one which delegates this job to a
user-supplied callback.

I add a more thorough test of debug_loclists dumping capabilities. In
writing this test, I discovered that we're not able to handle
relocations in the debug_loclists section. However, fixing this was not
completely straight-forward, so I left a TODO, and will address that in
a separate patch.

Diff Detail

Repository

rL LLVM

Build Status

Buildable 38993
Build 38992: arc lint + arc unit

Event Timeline

labath created this revision.Oct 1 2019, 7:21 AM

Herald added a project: Restricted Project. · View Herald TranscriptOct 1 2019, 7:21 AM

Herald added a subscriber: aprantl. · View Herald Transcript

Harbormaster completed remote builds in B38815: Diff 222607.Oct 1 2019, 7:22 AM

labath added a child revision: D68271: DWARFDebugLoclists: Make it possible to read relocated addresses.Oct 1 2019, 7:29 AM

dblaikie added inline comments.Oct 1 2019, 8:04 AM

include/llvm/DebugInfo/DWARF/DWARFDebugLoc.h
89–104	I think using an error to express base address selection entries probably is a bit much. I think this API would be more suited to a higher level abstraction over the whole list, rather than one entry in it. (same way we do with ranges - that API has been more fleshed out because of the presence of more users (symbolizers, etc) that want to abstract over the different representations (v4, split, v5, v5-split) & I think shows a fairly good direction this should go in too) One possible caveat: it might make more sense to provide either a lazy iterator or a callback for entries, rather than (as the ranges API does currently) a full in-memory vector of the computed/finalized ranges, for efficiency's sake. Also - would be super great if we could generalize both the range and loclist printing to use some common infrastructure for printing both the half-open range in non-verbose form and the verbose printing with underlying forms including RLE/LLE encodings, old-style base address selection entries (in v4), and also being able to print the section details (like we do for ranges when they're printed in the debug_info section, but we don't do it when they're printed in the actual debug_ranges/rnglists section... ). I realize this is a bit of a big feature request, but figured I'd mention it in case it helps inform your design direction/makes sense to address together, etc. (for instance, it probably means that a pair of uint64_t is insufficient, since we'll want to carry SectionIndex too - and maybe the start/end/section triple could be the same data structure (with the same printing support) as used for ranges, nested inside the data structure that includes the expression and can print that too, etc)
lib/DebugInfo/DWARF/DWARFDebugLoc.cpp
284–286	LLVM style suggests avoiding "else after return" ( https://llvm.org/docs/CodingStandards.html#don-t-use-else-after-a-return )

Thanks for the quick feedback. I didn't realize that the range list code handles some of this stuff already (I didn't look at it -- I guess I should've). I'll try to play around with this a bit, and then figure out what to do.

Regarding the iterator api: it has occurred to me that such an api would be more suitable (in fact, lldb's current api already has that), but since the location lists are generally small, I did not want to make a big deal out of it. But now that you mention it, I'll definitely consider it...

Upload a new version of the patch.

This isn't fully ready for submission, but I am putting it up anyway, to get
some feedback on the direction I am taking this, and ask some questions.

First I tried to do a complete rewrite of the loclists class in a manner similar
to the rnglists parser, but then I ran into the problem called .debug_loc.dwo
(v4 extension vaguely similar to DWARF5 loclists). Right now, it is possible to
share the parsing code between this format and .debug_loclists. That would be
pretty tricky to do with the rnglists approach.

So, instead I went for a bottom-up approach and tried to rewrite/reuse/make
similar the lower level classes, which can be shared more easily with the
rnglists stuff. This patch creates a DWARFLocation class, which is based on the
existing DWARFAddressRange class. The next step would be (or maybe I'll land it
before this patch) a LocationListEntry class akin to the existing
RangeListEntry.

Harbormaster completed remote builds in B38993: Diff 223193.Oct 4 2019, 5:22 AM

labath marked 8 inline comments as done.Oct 4 2019, 5:45 AM

labath added inline comments.

include/llvm/DebugInfo/DWARF/DWARFDebugLoc.h
91	I went for an iterator-like approach (instead of a callback or direct materialization) because it's easier to use. In particular it makes it possible to reuse this stuff in the dumping code, which would have been pretty hard with callbacks.
138	Using a callback is consistent with what rnglists code does. This uses a std::function instead of a function_ref because it's easier for iterators to escape out of scopes. However, I've wondered if we shouldn't define an AddrOffsetResolver interface that llvm, lldb DWARFUnits and anyone else who wishes so (unit tests ?) can implement.
lib/DebugInfo/DWARF/DWARFDebugLoc.cpp
164	A significant deviation from the rnglists code: Here I return an error if it is not possible to compute the address range correctly. The rnglists parser cannot compute the value, it will just return something, which can be very far from the correct range -- for instance, it's happy to use the base_addressx index as the offset, if the index cannot be resolved correctly. And it doesn't provide any indication that it has done so, which doesn't seem like a very useful behavior. If this is the behavior we want, I can also try to make the rnglists parser do something similar.
291–295	This parallel iteration is not completely nice, but I think it's worth being able to reuse the absolute range computation code. I'm open to ideas for improvement though.
test/CodeGen/X86/debug-loclists.ll
16	This tries to follow the RLE format as closely as possible, but I think something like [DW_LLE_offset_pair, 0x0000000000000000, 0x0000000000000004] => [0x0000000000000000, 0x0000000000000004): DW_OP_breg5 RDI+0 would make more sense (both here and for RLE).
test/DebugInfo/X86/fission-ranges.ll
48	This is somewhat annoying, because the entries printed through the loclists section will always have this error (as we don't have the DWARFUnit). I'll have to figure out a way to suppress those, while still keeping them around when printing from DWARFDie (as there a failure means a real error).

dblaikie added inline comments.Oct 4 2019, 1:57 PM

lib/DebugInfo/DWARF/DWARFDebugLoc.cpp
291–295	Ah, I see - this is what you meant about "In particular it makes it possible to reuse this stuff in the dumping code, which would have been pretty hard with callbacks.". I'm wondering if that might be worth revisiting somewhat. A full iterator abstraction for one user here (well, two once you include lldb - but I assume it's likely going to build its own data structure from the iteration anyway, right? (it's not going to keep the iterator around, do anything interesting like partial iterations, re-iterate/etc - such that a callback would suffice)) I could imagine two callback APIs for this - one that gets entries and locations and one that only gets locations by filtering on the entry version. eg: // for non-verbose output: LL.forEachEntry([&](const Entry &E, Expected<DWARFLocation> L) { if (Verbose && actually dumping debug_loc) print(E) // print any LLE_, raw parameters, etc if (L) print(L) // print the resulting address range, section name (if verbose), else print(error stuff) }); One question would be "when/where do we print the DWARF expression" - if there's an error computing the address range, we can still print the expression, so maybe that happens unconditionally at the end of the callback, using the expression in the Entry? (then, arguably, the expression doesn't need to be in the DWARFLocation - and I'd say make the DWARFLocation a sectioned range, exactly the same type as for ranges so that part of the dumping code, etc, can be maximally reused)
test/CodeGen/X86/debug-loclists.ll
16	Yep, that'd make more sense to me - are you planning to unify the codepaths for this? I think that'd be for the best. If I were picking a printing from scratch, I might go with: DW_LLE_offset_pair(0x0000, 0x0004) => [0x0000, 0x0004): DW_OP_breg5 RDI+0 Making it look a bit more like a function call and function arguments. Though the () might be confusing with the range notation. I'm also undecided on the " => " separator. Whether a ':' might be better/fine, etc. Totally open to ideas, but mostly I'd really love these to use loclist and ranges to use the same code as much as possible, so we can get consistency and any readability benefits, etc in both.
test/DebugInfo/X86/dwarfdump-debug-loclists.test
7	I don't think the inline dumping should print the encoding - I'd borrow a lot from/try to unify with the ranges printing, which doesn't. I think verbose ranges print the same as non-verbose except they also add the section name/number.
test/DebugInfo/X86/fission-ranges.ll
48	IMHO we may want to move to a model where we don't try to create/parse any content except by finding a reference from a CU (or the DWARFv5 stanfdalone line tables). In theory, it's perfectly find to have random garbage in debug sections other than debug_info (or the standalone line table) - because the only parts that should be parsed are those referenced from debug_info. This came up in the form of a bug in location list dumping when the binary is linked with bfd ld. It doesn't update any addresses to discarded sections, leaving them as zero (whereas gold and lld write the addend to the relocation - which generally makes sure any range pair doesn't end up as "zero zero" which marks the end of a list) which terminates a list early and leads to the following location expression to be parsed as the start of a new list... which is totally bogus. Now, granted, the resulting debug info from bfd ld is wrong (if you had a location list spanning multiple functions (eg: a global variable had been put in a register for the duration of a function, etc) then resolving any one of those location entries to zero-zero would terminate the list early even though there might be non-dropped functions in the list after that point) - but I still think there's something to be said for it. There's a fair counterargument too - that we might want to be able to make a best-effort to dump content that isn't complete (eg: if a section was emitted alone - or there was some hunk of unreferenced location list in the debug_loc section, it might be interesting to know what's in that hunk - might give you hints about where it /should/ have been referenced from) Apparently binutils objdump when printing debug info only dumps those referenced pieces and prints info about "holes" when there's unreferenced chunks. Ah, here's the bug context on that: https://bugs.llvm.org/show_bug.cgi?id=43290 But, yeah, all that aside - given the architecture of libDebugInfoDWARF/llvm-dwarfdump right now, yes, it'd be good to omit those error messages. Also note that address indexes wouldn't be resolvable when dumping .dwo files - since the debug_addr would be in the .o file instead. So it'd be good to not print lots of error messages there either.

labath marked 3 inline comments as done.Oct 7 2019, 11:27 AM

labath added inline comments.

lib/DebugInfo/DWARF/DWARFDebugLoc.cpp
291–295	Actually, what lldb currently does is that it does not build any data structures at all (except storing the pointer to the right place in the debug_loc section. Then, whenever it wants to do something to the loclist, it parses it afresh. I don't know why it does this exactly, but I assume it has something to do with most locations never being used, or being only a couple of times, and the actual parsing being fairly fast. What this means is that lldb is not really a single "user", but there are like four or five places where it iterates through the list, depending on what does it actually want to do with it. It also does partial iteration where it stops as soon as it find the entry it was interested in. Now, all of that is possible with a callback (though I am generally trying to avoid them), but it does resurface the issue of what should be the value of the second argument for DW_LLE_base_address entries (the thing which I originally used a error type for). Maybe this should be actually one callback API, taking two callback functions, with one of them being invoked for base_address entries, and one for others? However, if we stick to the current approaches in both LLE and RLE of making the address pool resolution function a parameter (which I'd like to keep, as it makes my job in lldb easier), then this would actually be three callbacks, which starts to get unwieldy. Though one of those callbacks could be removed with the "DWARFUnit implementing a AddrOffsetResolver interface" idea, which I really like. :)
test/CodeGen/X86/debug-loclists.ll
16	I like the function call format. I hoping to get some code reuse, though it's still not fully clear to me how to achieve that..
test/DebugInfo/X86/dwarfdump-debug-loclists.test
7	Sure, I can do that, though I think that means there won't be a single place where one can see both the raw encodings and their interpretation -- section-based dumping will not show the interpretation (would you want me to show still show them I they happen to be interpretable without the base address or the address pool?), and the debug_info dumping will not show the encoding. Is that bad? -- I don't know...

dblaikie added inline comments.Oct 7 2019, 6:31 PM

lib/DebugInfo/DWARF/DWARFDebugLoc.cpp
291–295	Ah, thanks for the details on LLDB's location parsing logic. That's interesting indeed! I can appreciate an iterator-based API if that's the sort of usage we've got, though I expect it doesn't have any interest in the low-level encoding & just wants the fully processed address ranges/locations - it doesn't want base_address or end_of_list entries? & I think the dual-iteration is a fairly awkward API design, trying to iterate them in lock-step, etc. I'd rather avoid that if reasonably possible. Either having an iterator API that gives only the fully processed data/semantic view & a completely different API if you want to access the low level primitives (LLE, etc) (this is how ranges works - there's an API that gives a collection of ranges & abstracts over v4/v5/rnglists/etc - though that's partly motivated by a strong multi-client need for that functionality for symbolizing, etc - but I think it's a good abstraction/model anyway (& one of the reasons the inline range list printing doesn't include encoding information, the API it uses is too high level to even have access to it)) Now, all of that is possible with a callback (though I am generally trying to avoid them), but it does resurface the issue of what should be the value of the second argument for DW_LLE_base_address entries (the thing which I originally used a error type for). Sorry, my intent in the above API was for the second argument to be Optional's "None" state when... oh, I see, I did use Expected there, rather than Optional, because there are legit error cases. I know it's sort of awkward, but I might be inclined to use Optional<Expected<AddressRange>> there. I realize two layers of wrapping is a bit weird, but I think it'd be nicer than having an error state for what, I think, isn't erroneous. Maybe this should be actually one callback API, taking two callback functions, with one of them being invoked for base_address entries, and one for others? However, if we stick to the current approaches in both LLE and RLE of making the address pool resolution function a parameter (which I'd like to keep, as it makes my job in lldb easier), then this would actually be three callbacks, which starts to get unwieldy. Don't mind three callbacks too much. Though one of those callbacks could be removed with the "DWARFUnit implementing a AddrOffsetResolver interface" idea, which I really like. :) Sorry, I haven't really looked at where the address resolver callback is registered and alternative designs being discussed - but yeah, going off just the one-sentence, it seems reasonable to have the DWARFUnit own an address resolver/be the thing you consult when you want to resolve an address (just through a normal function call in DWARFUnit, perhaps - which might, internally, use a callback registered when it was constructed).
test/CodeGen/X86/debug-loclists.ll
16	I've posted my unification of range/loc/v4/v5 emission here: https://reviews.llvm.org/D68620 - & I'd imagine something similar in the parsing side.
test/DebugInfo/X86/dwarfdump-debug-loclists.test
7	Fair - that comes back to the issue I mentioned in a previous comment about potentially limiting dumping of non-debug_info sections based on the presence of a CU that references it (& only dumping it that way, rather than trying to parse it without a CU). DWARF isn't really designed to be parsed without the CU anyway. (could leave it in as best-effort to parse things without a referencing CU for debugging, etc). Mostly I'm interested in unification perhaps more/primarily, than feature improvements - then we can make feature improvements to both ranges and locs without having to duplicate things.

labath marked 2 inline comments as done.Oct 8 2019, 8:24 AM

labath added inline comments.

lib/DebugInfo/DWARF/DWARFDebugLoc.cpp
291–295	I know it's sort of awkward, but I might be inclined to use Optional<Expected<AddressRange>> there. I realize two layers of wrapping is a bit weird, but I think it'd be nicer than having an error state for what, I think, isn't erroneous. Actually, my very first attempt at this patch used an `Expected<Optional<Whatever>>`, but then I scrapped it because I didn't think you'd like it. It's not the friendliest of APIs, but I think we can go with that. Sorry, I haven't really looked at where the address resolver callback is registered and alternative designs being discussed - but yeah, going off just the one-sentence, it seems reasonable to have the DWARFUnit own an address resolver/be the thing you consult when you want to resolve an address (just through a normal function call in DWARFUnit, perhaps - which might, internally, use a callback registered when it was constructed). I think you got that backwards. I don't want the DWARFUnit to be the source of truth for address pool resolutions, as that would make it hard to use from lldb (it's far from ready to start using the llvm version right now). What I wanted was to replace the lambda/function_ref with a single-method interface. Then both DWARFUnits could implement that interface so that passing a DWARFUnit& would "just work" (but you wouldn't be limited to DWARFUnits as anyone could implement that interface, just like anyone can write a lambda).
test/CodeGen/X86/debug-loclists.ll
16	cool. I'll see what I can do with that.

Do we care whether llvm-dwarfdump's output bears any similarities to the output from GNU readelf or objdump? There has been a push lately to get the LLVM "binutils" to behave more like GNU's, although AFAIK it hasn't gotten to the DWARF dumping part.

In D68270#1700108, @probinson wrote:

Do we care whether llvm-dwarfdump's output bears any similarities to the output from GNU readelf or objdump? There has been a push lately to get the LLVM "binutils" to behave more like GNU's, although AFAIK it hasn't gotten to the DWARF dumping part.

I am not too fond of the readelf output. At least for the .debug_info dumping. I like the see indentation. But I do see the appeal of consistent output.

In D68270#1700108, @probinson wrote:

Do we care whether llvm-dwarfdump's output bears any similarities to the output from GNU readelf or objdump? There has been a push lately to get the LLVM "binutils" to behave more like GNU's, although AFAIK it hasn't gotten to the DWARF dumping part.

Generally I hope not to deal with that until there's a user with a need for it who wants to do the work & has a specific use-case that can help motivate which similarities are desirable and which ones don't matter (& perhaps if there's enough that they start to tradeoff usability - maybe the "compatibility mode" is a separate tool or separate flag to the existing tool).

My broader hope is probably that llvm-dwarfdump is more for interactive uses than other dumpers, so fewer people might try to build automated things on top of it & thus expect specific output (this gives us both the freedom not to match the GNU tools, and the freedom not to match previous llvm-dwarfdump behavior (which we've done a fair bit in the past - which seems to support the theory that people don't seem to be building much on top of this))

lib/DebugInfo/DWARF/DWARFDebugLoc.cpp
291–295	As for Expected<Optional<Whatever>> (or Optional<Expected<>>) - yeah, I think this is a non-obvious API (both the general problem and this specific solution). I think it's probably worth discussing this design a bit more to save you time writing/rewriting things a bit. I guess there are a few layers of failure here. There's the possibility that the iteration itself could fail - even for debug_loc style lists (if we reached the end of the section before encountering a terminating {0,0}). That would suggest a fallible iterator idiom: http://llvm.org/docs/ProgrammersManual.html#building-fallible-iterators-and-iterator-ranges But then, yes, when looking at the "processed"/semantic view, that could fail too in the case of an invalid address index, etc. The generic/processed/abstracted-over-ranges-and-rnglists API for ranges produces a fully computer vector (& then returns Expected<vector> of that range) - is that reasonable? (this does mean manifesting a whole location in memory, which may not be needed so I could understand avoiding that even without fully implementing & demonstrating the vector solution is inadequate). But I /think/ maybe the we could/should have two APIs - one generic API that abstracts over loc/loclists and only provides the fully processed view, and another that is type specific for dumping the underlying representation (only used in dumping debug_loclists).

labath marked an inline comment as done.Oct 11 2019, 5:30 AM

labath added inline comments.

lib/DebugInfo/DWARF/DWARFDebugLoc.cpp
291–295	If we were computing the final address ranges from scratch (which would be the best match for the current lldb usage, but which I am not considering now for fear of changing too many things), then I agree that we would need the fallible_iterator iterator thingy. But in this case we are "interpreting" the already parsed ranges, so we can assume some level of correctness here, and the thing that can fail is only the computation of a single range, which does not affect our ability to process the next entry. This indicates to me that either each entry in the list should be an Expected<>, or that the invalid entries should be just dropped (possibly accompanied by some flag which would tell the caller that the result was not exhaustive). This is connected to one of the issues I have with the debug ranges API -- it tries _really_ hard to return something -- if resolving the indirect base address entry fails, it is perfectly happy to use the address _index_ as the base address. This makes sense for dumping, where you want to show something (though it would still be good to indicate that you're not showing a real address), but it definitely does not help consumers which then need to make decisions based on the returned data. Anyway, yes, I agree that we need to APIs, and probably callbacks are the easiest way to achieve that. We could have a "base" callback that is not particularly nice to use, but provides the full information via a combination of `UnparsedLL` and `Optional<Expected<ParsedLL>>` arguments. The dumper could use that to print out everything it needs. And then we could have a second API, built on top of the first one, which ignores base address entries and the raw data and returns just a bunch of `Expected<ParsedLL>`. This could be used by users like lldb, who just want to see the final data. The `ParsedLL` type would be independent of the location list type, so that the debug_loc parser could provide the same kind of API (but implemented on top of something else, as the `UnparsedLL` types would differ). Also, under the hood, the location list dumper for debug_loclists (but not debug_loc) could reuse some implementation details with the debug_rnglists dumper via a suitable combination of templates and callbacks. How does that sound?

dblaikie added inline comments.Oct 11 2019, 1:29 PM

lib/DebugInfo/DWARF/DWARFDebugLoc.cpp
291–295	What sort of things are you concerned about with deeper API changes here? I think it's probably worth building the "right" thing now - as good a time as any. LLVM's debug info APIs, as you've pointed out, aren't exactly "sturdy" (treating address indexes as offsets, etc, etc), so no time like the present to clean it up. I think if we had an abstraction over v4 and v5 location descriptions, parsing from scratch, fallible iterators, etc - that'd be the ideal thing to use in the inline dumping code (that dumps inside debug_info) - which currently uses "parseOneLocationList" - so it is parsing from scratch and dumping. But equally I understand not wanting to make you/me/anyone fix everything when just trying to get something actually done.

labath marked an inline comment as done.Oct 14 2019, 6:58 AM

labath added inline comments.

lib/DebugInfo/DWARF/DWARFDebugLoc.cpp
291–295	I think I am mainly concerned with the scope explosion of doing changes like that. I don't know how well founded is that concern, because I don't know all the use cases for this information right now (but that's a part of the problem). But anyway, let's continue the discussion. The main problem I see with "inline" dumping using some higher-level abstraction is what should one do in case of incomplete data (e.g. dwo files). If this abstraction returns "cooked" data, then the inline dump could not show anything for dwo files dumped standalone. This is perfectly fine for lldb and maybe other tools (which never look at dwo files separately, and mostly don't care about the reason why the "cooking" failed -- if they just need to get the data it can rely on), but for something like llvm-dwarfdump, we'd probably want to display _something_. I guess this is why the ranges api returns indexes as offsets, but I think we agree we don't want that. Then the question is what should that "something" be? If it's going to be a raw dump of the location list entry, then solution would not be fully generic, as they raw entry type will depend on the location list kind. Though, we could still arrange it so that the various location lists can be processed uniformly via a template if they e.g. have a dump() function with the same signature... Suppose we go down that path (this is the path I wanted to go down in my previous comment, modulo the parse-from-scratch part). The question then is what to do with the section-based dumping. Should it use the same mechanism? It probably should, because the first callback/iterator will provide all data it can possibly want. But then, should it also build the parsed representation like it does now? If yes, then what for? Should we also build some mechanism to "cook" that data too. If not, are we ok with having all users reparse from scratch always? (There are only two users I found right now, and they look like they'd be fine with it.) Or should the DWARFDebugLoclists object cache the cooked data instead? llvm-dwarfdump --statistics would actually prefer that, but that might make the DWARF verifier sad (though I guess it could always reparse from scratch if needed).

labath mentioned this in D68271: DWARFDebugLoclists: Make it possible to read relocated addresses.Oct 31 2019, 6:32 AM

labath removed a child revision: D68271: DWARFDebugLoclists: Make it possible to read relocated addresses.Oct 31 2019, 6:32 AM

JDevlieghere added inline comments.Oct 31 2019, 9:00 AM

include/llvm/DebugInfo/DWARF/DWARFDebugLoc.h
138	I like the idea, +1 from me.
147	Most dump methods give a default argument for DIDumpOptions (`DumpOpts = {}`). Should we do the same here?
lib/BinaryFormat/Dwarf.cpp
479	Maybe this should go in in `DWARF.h`?
lib/DebugInfo/DWARF/DWARFDebugLoc.cpp
291–295	I'm very late to the discussion and I'm not as familiar as both of you with the details and the API uses, so please ignore my suggestion if it doesn't make any sense... Could we have an API where we parse a list of Uncooked (to reuse Pavel's nomenclature) and then have the ability to resolve each Uncooked entry into a Cooked entry? Then both LLDB and dwarfdump could get the list of Uncooked entries and try to get the cooked variant. If that works, great, we dump/use that, if not we move on and have separate failure modes for errors originating from parsing and from cooking.

Abandoning. I'll create separate patches with the new implementation.

lib/DebugInfo/DWARF/DWARFDebugLoc.cpp
291–295	We've discussed this with David last week, and we have hopefully agreed on the rough direction forward (and I think it's going to roughly correspond to what you had in mind). I'm preparing a patch (first of many) to implement that and I'm hoping I'll be able to upload something today.

labath mentioned this in D69672: DWARFDebugLoclists: Move to a incremental parsing model.Oct 31 2019, 9:58 AM

labath mentioned this in rGe1f8c8a16f44: DWARFDebugLoclists: Move to a incremental parsing model.Nov 6 2019, 7:31 AM

Revision Contents

Path

Size

include/

llvm/

BinaryFormat/

Dwarf.h

1 line

DebugInfo/

DWARF/

DWARFDebugLoc.h

77 lines

lib/

BinaryFormat/

Dwarf.cpp

25 lines

DebugInfo/

DWARF/

DWARFContext.cpp

4 lines

DWARFDebugLoc.cpp

158 lines

DWARFDie.cpp

35 lines

test/

CodeGen/

X86/

debug-loclists.ll

36 lines

DebugInfo/

X86/

dwarfdump-debug-loclists-error-cases2.s

4 lines

dwarfdump-debug-loclists.test

15 lines

fission-ranges.ll

18 lines

loclists-dwp.ll

4 lines

tools/

llvm-dwarfdump/

X86/

debug_loc_dwo.s

2 lines

debug_loclists_startx_length.s

2 lines

Diff 223193

include/llvm/BinaryFormat/Dwarf.h

	Show First 20 Lines • Show All 469 Lines • ▼ Show 20 Lines
	StringRef CaseString(unsigned Case);			StringRef CaseString(unsigned Case);
	StringRef ConventionString(unsigned Convention);			StringRef ConventionString(unsigned Convention);
	StringRef InlineCodeString(unsigned Code);			StringRef InlineCodeString(unsigned Code);
	StringRef ArrayOrderString(unsigned Order);			StringRef ArrayOrderString(unsigned Order);
	StringRef LNStandardString(unsigned Standard);			StringRef LNStandardString(unsigned Standard);
	StringRef LNExtendedString(unsigned Encoding);			StringRef LNExtendedString(unsigned Encoding);
	StringRef MacinfoString(unsigned Encoding);			StringRef MacinfoString(unsigned Encoding);
	StringRef RangeListEncodingString(unsigned Encoding);			StringRef RangeListEncodingString(unsigned Encoding);
				StringRef LocationListEncodingString(unsigned Entry);
	StringRef CallFrameString(unsigned Encoding, Triple::ArchType Arch);			StringRef CallFrameString(unsigned Encoding, Triple::ArchType Arch);
	StringRef ApplePropertyString(unsigned);			StringRef ApplePropertyString(unsigned);
	StringRef UnitTypeString(unsigned);			StringRef UnitTypeString(unsigned);
	StringRef AtomTypeString(unsigned Atom);			StringRef AtomTypeString(unsigned Atom);
	StringRef GDBIndexEntryKindString(GDBIndexEntryKind Kind);			StringRef GDBIndexEntryKindString(GDBIndexEntryKind Kind);
	StringRef GDBIndexEntryLinkageString(GDBIndexEntryLinkage Linkage);			StringRef GDBIndexEntryLinkageString(GDBIndexEntryLinkage Linkage);
	StringRef IndexString(unsigned Idx);			StringRef IndexString(unsigned Idx);
	/// @}			/// @}
	▲ Show 20 Lines • Show All 196 Lines • Show Last 20 Lines

include/llvm/DebugInfo/DWARF/DWARFDebugLoc.h

	//===- DWARFDebugLoc.h ------------------------------------------- C++ --===//			//===- DWARFDebugLoc.h ------------------------------------------- C++ --===//
	//			//
	// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.			// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
	// See https://llvm.org/LICENSE.txt for license information.			// See https://llvm.org/LICENSE.txt for license information.
	// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception			// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	#ifndef LLVM_DEBUGINFO_DWARF_DWARFDEBUGLOC_H			#ifndef LLVM_DEBUGINFO_DWARF_DWARFDEBUGLOC_H
	#define LLVM_DEBUGINFO_DWARF_DWARFDEBUGLOC_H			#define LLVM_DEBUGINFO_DWARF_DWARFDEBUGLOC_H

	#include "llvm/ADT/Optional.h"			#include "llvm/ADT/Optional.h"
	#include "llvm/ADT/SmallVector.h"			#include "llvm/ADT/SmallVector.h"
				#include "llvm/ADT/iterator.h"
				#include "llvm/DebugInfo/DWARF/DWARFAddressRange.h"
	#include "llvm/DebugInfo/DWARF/DWARFDataExtractor.h"			#include "llvm/DebugInfo/DWARF/DWARFDataExtractor.h"
	#include "llvm/DebugInfo/DWARF/DWARFRelocMap.h"			#include "llvm/DebugInfo/DWARF/DWARFRelocMap.h"
	#include <cstdint>			#include <cstdint>

	namespace llvm {			namespace llvm {
	class DWARFUnit;			class DWARFUnit;
	class MCRegisterInfo;			class MCRegisterInfo;
	class raw_ostream;			class raw_ostream;

				struct DWARFLocation {
				DWARFAddressRange Range;
				ArrayRef<uint8_t> Location;
				};

	class DWARFDebugLoc {			class DWARFDebugLoc {
	public:			public:
	/// A single location within a location list.			/// A single location within a location list.
	struct Entry {			struct Entry {
	/// The beginning address of the instruction range.			/// The beginning address of the instruction range.
	uint64_t Begin;			uint64_t Begin;
	/// The ending address of the instruction range.			/// The ending address of the instruction range.
	uint64_t End;			uint64_t End;
	▲ Show 20 Lines • Show All 43 Lines • ▼ Show 20 Lines

	class DWARFDebugLoclists {			class DWARFDebugLoclists {
	public:			public:
	struct Entry {			struct Entry {
	uint8_t Kind;			uint8_t Kind;
	uint64_t Value0;			uint64_t Value0;
	uint64_t Value1;			uint64_t Value1;
	SmallVector<uint8_t, 4> Loc;			SmallVector<uint8_t, 4> Loc;
	};			};

				class EntryIterator {
				labathAuthorUnsubmitted Done Reply Inline Actions I went for an iterator-like approach (instead of a callback or direct materialization) because it's easier to use. In particular it makes it possible to reuse this stuff in the dumping code, which would have been pretty hard with callbacks. labath: I went for an iterator-like approach (instead of a callback or direct materialization) because…
				public:
				using iterator_category = std::input_iterator_tag;
				using value_type = Expected<DWARFLocation>;
				using difference_type = std::ptrdiff_t;
				using pointer = value_type *;
				using reference = value_type;

				EntryIterator(
				ArrayRef<Entry> Entries,
				llvm::Optional<object::SectionedAddress> BaseAddr,
				std::function<llvm::Optional<object::SectionedAddress>(uint32_t)>
				AddrOffsetResolver)
				: Entries(Entries), BaseAddr(BaseAddr),
				dblaikieUnsubmitted Done Reply Inline Actions I think using an error to express base address selection entries probably is a bit much. I think this API would be more suited to a higher level abstraction over the whole list, rather than one entry in it. (same way we do with ranges - that API has been more fleshed out because of the presence of more users (symbolizers, etc) that want to abstract over the different representations (v4, split, v5, v5-split) & I think shows a fairly good direction this should go in too) One possible caveat: it might make more sense to provide either a lazy iterator or a callback for entries, rather than (as the ranges API does currently) a full in-memory vector of the computed/finalized ranges, for efficiency's sake. Also - would be super great if we could generalize both the range and loclist printing to use some common infrastructure for printing both the half-open range in non-verbose form and the verbose printing with underlying forms including RLE/LLE encodings, old-style base address selection entries (in v4), and also being able to print the section details (like we do for ranges when they're printed in the debug_info section, but we don't do it when they're printed in the actual debug_ranges/rnglists section... ). I realize this is a bit of a big feature request, but figured I'd mention it in case it helps inform your design direction/makes sense to address together, etc. (for instance, it probably means that a pair of uint64_t is insufficient, since we'll want to carry SectionIndex too - and maybe the start/end/section triple could be the same data structure (with the same printing support) as used for ranges, nested inside the data structure that includes the expression and can print that too, etc) dblaikie: I think using an error to express base address selection entries probably is a bit much. I…
				AddrOffsetResolver(AddrOffsetResolver) {
				processBaseAddressEntries();
				}

				Expected<DWARFLocation> operator*() const;

				EntryIterator &operator++() {
				Entries = Entries.drop_front();
				processBaseAddressEntries();
				return *this;
				}

				EntryIterator operator++(int) {
				EntryIterator Save = *this;
				++*this;
				return Save;
				}

				friend bool operator==(const EntryIterator &L, const EntryIterator &R) {
				return L.Entries.begin() == R.Entries.begin();
				}

				friend bool operator!=(const EntryIterator &L, const EntryIterator &R) {
				return !(L == R);
				}

				const Entry *position() const { return Entries.begin(); }

				private:
				void processBaseAddressEntries();

				ArrayRef<Entry> Entries;
				llvm::Optional<object::SectionedAddress> BaseAddr;
				std::function<Optional<object::SectionedAddress>(uint32_t)>
				labathAuthorUnsubmitted Done Reply Inline Actions Using a callback is consistent with what rnglists code does. This uses a std::function instead of a function_ref because it's easier for iterators to escape out of scopes. However, I've wondered if we shouldn't define an AddrOffsetResolver interface that llvm, lldb DWARFUnits and anyone else who wishes so (unit tests ?) can implement. labath: Using a callback is consistent with what rnglists code does. This uses a std::function instead…
				JDevlieghereUnsubmitted Not Done Reply Inline Actions I like the idea, +1 from me. JDevlieghere: I like the idea, +1 from me.
				AddrOffsetResolver;
				};

	struct LocationList {			struct LocationList {
	uint64_t Offset;			uint64_t Offset;
	SmallVector<Entry, 2> Entries;			SmallVector<Entry, 2> Entries;
	void dump(raw_ostream &OS, uint64_t BaseAddr, bool IsLittleEndian,			void dump(raw_ostream &OS, uint64_t BaseAddr, bool IsLittleEndian,
	unsigned AddressSize, const MCRegisterInfo *RegInfo,			unsigned AddressSize, const MCRegisterInfo *RegInfo,
				DIDumpOptions DumpOpts,
				JDevlieghereUnsubmitted Not Done Reply Inline Actions Most dump methods give a default argument for DIDumpOptions (`DumpOpts = {}`). Should we do the same here? JDevlieghere: Most dump methods give a default argument for DIDumpOptions (`DumpOpts = {}`). Should we do the…
				function_ref<Optional<object::SectionedAddress>(uint32_t)>
				LookupPooledAddress,
	DWARFUnit *U, unsigned Indent) const;			DWARFUnit *U, unsigned Indent) const;

				iterator_range<EntryIterator> getAbsoluteLocations(
				Optional<object::SectionedAddress> BaseAddr,
				std::function<Optional<object::SectionedAddress>(uint32_t)>
				AddrOffsetResolver) const {
				return make_range(
				EntryIterator(Entries, BaseAddr, std::move(AddrOffsetResolver)),
				EntryIterator(makeArrayRef(Entries.end(), Entries.end()), llvm::None,
				{}));
				}

				iterator_range<EntryIterator>
				getAbsoluteLocations(Optional<object::SectionedAddress> BaseAddr,
				DWARFUnit &U) const;
	};			};

	private:			private:
	using LocationLists = SmallVector<LocationList, 4>;			using LocationLists = SmallVector<LocationList, 4>;

	LocationLists Locations;			LocationLists Locations;

	unsigned AddressSize;			unsigned AddressSize;

	bool IsLittleEndian;			bool IsLittleEndian;

	public:			public:
	void parse(DataExtractor data, unsigned Version);			void parse(DataExtractor data, unsigned Version);
	void dump(raw_ostream &OS, uint64_t BaseAddr, const MCRegisterInfo *RegInfo,			void dump(raw_ostream &OS, uint64_t BaseAddr, const MCRegisterInfo *RegInfo,
	Optional<uint64_t> Offset) const;			DIDumpOptions DumpOpts, Optional<uint64_t> Offset) const;

	/// Return the location list at the given offset or nullptr.			/// Return the location list at the given offset or nullptr.
	LocationList const *getLocationListAtOffset(uint64_t Offset) const;			LocationList const *getLocationListAtOffset(uint64_t Offset) const;

	static Expected<LocationList> parseOneLocationList(const DataExtractor &Data,			static Expected<LocationList> parseOneLocationList(const DataExtractor &Data,
	uint64_t *Offset,			uint64_t *Offset,
	unsigned Version);			unsigned Version);
	};			};

	} // end namespace llvm			} // end namespace llvm

	#endif // LLVM_DEBUGINFO_DWARF_DWARFDEBUGLOC_H			#endif // LLVM_DEBUGINFO_DWARF_DWARFDEBUGLOC_H

lib/BinaryFormat/Dwarf.cpp

Show First 20 Lines • Show All 466 Lines • ▼ Show 20 Lines	default:
return StringRef();		return StringRef();
#define HANDLE_DW_RLE(ID, NAME) \		#define HANDLE_DW_RLE(ID, NAME) \
case DW_RLE_##NAME: \		case DW_RLE_##NAME: \
return "DW_RLE_" #NAME;		return "DW_RLE_" #NAME;
#include "llvm/BinaryFormat/Dwarf.def"		#include "llvm/BinaryFormat/Dwarf.def"
}		}
}		}

		StringRef llvm::dwarf::LocationListEncodingString(unsigned Entry) {
		switch(Entry) {
		default:
		return StringRef();
		case DW_LLE_end_of_list:
		JDevlieghereUnsubmitted Not Done Reply Inline Actions Maybe this should go in in `DWARF.h`? JDevlieghere: Maybe this should go in in `DWARF.h`?
		return "DW_LLE_end_of_list";
		case DW_LLE_base_addressx:
		return "DW_LLE_base_addressx";
		case DW_LLE_startx_endx:
		return "DW_LLE_startx_endx";
		case DW_LLE_startx_length:
		return "DW_LLE_startx_length";
		case DW_LLE_offset_pair:
		return "DW_LLE_offset_pair";
		case DW_LLE_default_location:
		return "DW_LLE_default_location";
		case DW_LLE_base_address:
		return "DW_LLE_base_address";
		case DW_LLE_start_end:
		return "DW_LLE_start_end";
		case DW_LLE_start_length:
		return "DW_LLE_start_length";
		}
		}

StringRef llvm::dwarf::CallFrameString(unsigned Encoding,		StringRef llvm::dwarf::CallFrameString(unsigned Encoding,
Triple::ArchType Arch) {		Triple::ArchType Arch) {
assert(Arch != llvm::Triple::ArchType::UnknownArch);		assert(Arch != llvm::Triple::ArchType::UnknownArch);
#define SELECT_AARCH64 (Arch == llvm::Triple::aarch64_be \|\| Arch == llvm::Triple::aarch64)		#define SELECT_AARCH64 (Arch == llvm::Triple::aarch64_be \|\| Arch == llvm::Triple::aarch64)
#define SELECT_MIPS64 Arch == llvm::Triple::mips64		#define SELECT_MIPS64 Arch == llvm::Triple::mips64
#define SELECT_SPARC (Arch == llvm::Triple::sparc \|\| Arch == llvm::Triple::sparcv9)		#define SELECT_SPARC (Arch == llvm::Triple::sparc \|\| Arch == llvm::Triple::sparcv9)
#define SELECT_X86 (Arch == llvm::Triple::x86 \|\| Arch == llvm::Triple::x86_64)		#define SELECT_X86 (Arch == llvm::Triple::x86 \|\| Arch == llvm::Triple::x86_64)
#define HANDLE_DW_CFA(ID, NAME)		#define HANDLE_DW_CFA(ID, NAME)
▲ Show 20 Lines • Show All 248 Lines • Show Last 20 Lines

lib/DebugInfo/DWARF/DWARFContext.cpp

Show First 20 Lines • Show All 297 Lines • ▼ Show 20 Lines	if (Error E = Header.extract(Data, &Offset)) {
return;		return;
}		}

Header.dump(OS, DumpOpts);		Header.dump(OS, DumpOpts);
DataExtractor LocData(Data.getData().drop_front(Offset),		DataExtractor LocData(Data.getData().drop_front(Offset),
Data.isLittleEndian(), Header.getAddrSize());		Data.isLittleEndian(), Header.getAddrSize());

Loclists.parse(LocData, Header.getVersion());		Loclists.parse(LocData, Header.getVersion());
Loclists.dump(OS, 0, MRI, DumpOffset);		Loclists.dump(OS, 0, MRI, DumpOpts, DumpOffset);
}		}

void DWARFContext::dump(		void DWARFContext::dump(
raw_ostream &OS, DIDumpOptions DumpOpts,		raw_ostream &OS, DIDumpOptions DumpOpts,
std::array<Optional<uint64_t>, DIDT_ID_Count> DumpOffsets) {		std::array<Optional<uint64_t>, DIDT_ID_Count> DumpOffsets) {

uint64_t DumpType = DumpOpts.DumpType;		uint64_t DumpType = DumpOpts.DumpType;

▲ Show 20 Lines • Show All 70 Lines • ▼ Show 20 Lines	if (const auto *Off =
DObj->getLoclistsSection().Data)) {		DObj->getLoclistsSection().Data)) {
DWARFDataExtractor Data(*DObj, DObj->getLoclistsSection(), isLittleEndian(),		DWARFDataExtractor Data(*DObj, DObj->getLoclistsSection(), isLittleEndian(),
0);		0);
dumpLoclistsSection(OS, DumpOpts, Data, getRegisterInfo(), *Off);		dumpLoclistsSection(OS, DumpOpts, Data, getRegisterInfo(), *Off);
}		}
if (const auto *Off =		if (const auto *Off =
shouldDump(ExplicitDWO, ".debug_loc.dwo", DIDT_ID_DebugLoc,		shouldDump(ExplicitDWO, ".debug_loc.dwo", DIDT_ID_DebugLoc,
DObj->getLocDWOSection().Data)) {		DObj->getLocDWOSection().Data)) {
getDebugLocDWO()->dump(OS, 0, getRegisterInfo(), *Off);		getDebugLocDWO()->dump(OS, 0, getRegisterInfo(), DumpOpts, *Off);
}		}

if (const auto *Off = shouldDump(Explicit, ".debug_frame", DIDT_ID_DebugFrame,		if (const auto *Off = shouldDump(Explicit, ".debug_frame", DIDT_ID_DebugFrame,
DObj->getFrameSection().Data))		DObj->getFrameSection().Data))
getDebugFrame()->dump(OS, getRegisterInfo(), *Off);		getDebugFrame()->dump(OS, getRegisterInfo(), *Off);

if (const auto *Off = shouldDump(Explicit, ".eh_frame", DIDT_ID_DebugFrame,		if (const auto *Off = shouldDump(Explicit, ".eh_frame", DIDT_ID_DebugFrame,
DObj->getEHFrameSection().Data))		DObj->getEHFrameSection().Data))
▲ Show 20 Lines • Show All 1,452 Lines • Show Last 20 Lines

lib/DebugInfo/DWARF/DWARFDebugLoc.cpp

Show All 9 Lines
#include "llvm/ADT/StringRef.h"		#include "llvm/ADT/StringRef.h"
#include "llvm/BinaryFormat/Dwarf.h"		#include "llvm/BinaryFormat/Dwarf.h"
#include "llvm/DebugInfo/DWARF/DWARFContext.h"		#include "llvm/DebugInfo/DWARF/DWARFContext.h"
#include "llvm/DebugInfo/DWARF/DWARFExpression.h"		#include "llvm/DebugInfo/DWARF/DWARFExpression.h"
#include "llvm/DebugInfo/DWARF/DWARFRelocMap.h"		#include "llvm/DebugInfo/DWARF/DWARFRelocMap.h"
#include "llvm/DebugInfo/DWARF/DWARFUnit.h"		#include "llvm/DebugInfo/DWARF/DWARFUnit.h"
#include "llvm/Support/Compiler.h"		#include "llvm/Support/Compiler.h"
#include "llvm/Support/Format.h"		#include "llvm/Support/Format.h"
		#include "llvm/Support/FormatAdapters.h"
		#include "llvm/Support/FormatVariadic.h"
#include "llvm/Support/WithColor.h"		#include "llvm/Support/WithColor.h"
#include "llvm/Support/raw_ostream.h"		#include "llvm/Support/raw_ostream.h"
#include <algorithm>		#include <algorithm>
#include <cinttypes>		#include <cinttypes>
#include <cstdint>		#include <cstdint>

using namespace llvm;		using namespace llvm;
		using object::SectionedAddress;

// When directly dumping the .debug_loc without a compile unit, we have to guess		// When directly dumping the .debug_loc without a compile unit, we have to guess
// at the DWARF version. This only affects DW_OP_call_ref, which is a rare		// at the DWARF version. This only affects DW_OP_call_ref, which is a rare
// expression that LLVM doesn't produce. Guessing the wrong version means we		// expression that LLVM doesn't produce. Guessing the wrong version means we
// won't be able to pretty print expressions in DWARF2 binaries produced by		// won't be able to pretty print expressions in DWARF2 binaries produced by
// non-LLVM tools.		// non-LLVM tools.
static void dumpExpression(raw_ostream &OS, ArrayRef<uint8_t> Data,		static void dumpExpression(raw_ostream &OS, ArrayRef<uint8_t> Data,
bool IsLittleEndian, unsigned AddressSize,		bool IsLittleEndian, unsigned AddressSize,
▲ Show 20 Lines • Show All 97 Lines • ▼ Show 20 Lines	if (auto LL = parseOneLocationList(data, &Offset))
Locations.push_back(std::move(*LL));		Locations.push_back(std::move(*LL));
else {		else {
logAllUnhandledErrors(LL.takeError(), WithColor::error());		logAllUnhandledErrors(LL.takeError(), WithColor::error());
break;		break;
}		}
}		}
}		}

		Expected<DWARFLocation> DWARFDebugLoclists::EntryIterator::operator*() const {
		const Entry &E = Entries.front();
		DWARFLocation Result;
		switch (E.Kind) {
		case dwarf::DW_LLE_startx_length: {
		auto Start = AddrOffsetResolver(E.Value0);
		if (!Start)
		return createStringError(errc::invalid_argument,
		"Failed to read address offset %u",
		unsigned(E.Value0));
		Result.Range.LowPC = Start->Address;
		Result.Range.HighPC = Start->Address + E.Value1;
		Result.Range.SectionIndex = Start->SectionIndex;
		break;
		}
		case dwarf::DW_LLE_start_length:
		Result.Range.LowPC = E.Value0;
		Result.Range.HighPC = E.Value0 + E.Value1;
		// TODO: Store the SectionedAddress in the Entry class
		Result.Range.SectionIndex = SectionedAddress::UndefSection;
		break;
		case dwarf::DW_LLE_offset_pair:
		if (!BaseAddr)
		return createStringError(errc::invalid_argument,
		labathAuthorUnsubmitted Done Reply Inline Actions A significant deviation from the rnglists code: Here I return an error if it is not possible to compute the address range correctly. The rnglists parser cannot compute the value, it will just return something, which can be very far from the correct range -- for instance, it's happy to use the base_addressx index as the offset, if the index cannot be resolved correctly. And it doesn't provide any indication that it has done so, which doesn't seem like a very useful behavior. If this is the behavior we want, I can also try to make the rnglists parser do something similar. labath: A significant deviation from the rnglists code: Here I return an error if it is not possible to…
		"Cannot interpret DW_LLE_offset_pair entry due "
		"to missing base address");

		Result.Range.LowPC = BaseAddr->Address + E.Value0;
		Result.Range.HighPC = BaseAddr->Address + E.Value1;
		Result.Range.SectionIndex = BaseAddr->SectionIndex;
		break;
		case dwarf::DW_LLE_base_address:
		case dwarf::DW_LLE_base_addressx:
		llvm_unreachable("Base address selection entries handled elsewhere!");
		default:
		// Entries rejected by the parser.
		llvm_unreachable("Unsupported location list kind!");
		}
		Result.Location = E.Loc;
		return Result;
		}

		void DWARFDebugLoclists::EntryIterator::processBaseAddressEntries() {
		for (; !Entries.empty(); Entries = Entries.drop_front()) {
		const Entry &E = Entries.front();
		switch (E.Kind) {
		case dwarf::DW_LLE_base_address:
		// TODO: Store the SectionedAddress in the Entry class
		BaseAddr->Address = E.Value0;
		BaseAddr->SectionIndex = SectionedAddress::UndefSection;
		break;
		case dwarf::DW_LLE_base_addressx:
		// Entry rejected by the parser.
		llvm_unreachable("Unsupported location list kind!");
		default:
		return;
		}
		}
		}

Expected<DWARFDebugLoclists::LocationList>		Expected<DWARFDebugLoclists::LocationList>
DWARFDebugLoclists::parseOneLocationList(const DataExtractor &Data,		DWARFDebugLoclists::parseOneLocationList(const DataExtractor &Data,
uint64_t *Offset, unsigned Version) {		uint64_t *Offset, unsigned Version) {
LocationList LL;		LocationList LL;
LL.Offset = *Offset;		LL.Offset = *Offset;
DataExtractor::Cursor C(*Offset);		DataExtractor::Cursor C(*Offset);

// dwarf::DW_LLE_end_of_list_entry is 0 and indicates the end of the list.		// dwarf::DW_LLE_end_of_list_entry is 0 and indicates the end of the list.
▲ Show 20 Lines • Show All 60 Lines • ▼ Show 20 Lines
DWARFDebugLoclists::getLocationListAtOffset(uint64_t Offset) const {		DWARFDebugLoclists::getLocationListAtOffset(uint64_t Offset) const {
auto It = partition_point(		auto It = partition_point(
Locations, [=](const LocationList &L) { return L.Offset < Offset; });		Locations, [=](const LocationList &L) { return L.Offset < Offset; });
if (It != Locations.end() && It->Offset == Offset)		if (It != Locations.end() && It->Offset == Offset)
return &(*It);		return &(*It);
return nullptr;		return nullptr;
}		}

void DWARFDebugLoclists::LocationList::dump(raw_ostream &OS, uint64_t BaseAddr,		iterator_range<DWARFDebugLoclists::EntryIterator>
bool IsLittleEndian,		DWARFDebugLoclists::LocationList::getAbsoluteLocations(
unsigned AddressSize,		Optional<SectionedAddress> BaseAddr, DWARFUnit &U) const {
const MCRegisterInfo *MRI,		return getAbsoluteLocations(BaseAddr, [&U](uint32_t Index) {
DWARFUnit *U,		return U.getAddrOffsetSectionItem(Index);
unsigned Indent) const {		});
		}

		void DWARFDebugLoclists::LocationList::dump(
		raw_ostream &OS, uint64_t BaseAddr, bool IsLittleEndian,
		dblaikieUnsubmitted Done Reply Inline Actions LLVM style suggests avoiding "else after return" ( https://llvm.org/docs/CodingStandards.html#don-t-use-else-after-a-return ) dblaikie: LLVM style suggests avoiding "else after return" ( https://llvm.org/docs/CodingStandards.
		unsigned AddressSize, const MCRegisterInfo *MRI, DIDumpOptions DumpOpts,
		function_ref<Optional<SectionedAddress>(uint32_t)> LookupPooledAddress,
		DWARFUnit *U, unsigned Indent) const {
		uint8_t MaxEncodingStringLength = 20;
		EntryIterator Absolute =
		getAbsoluteLocations(
		SectionedAddress{BaseAddr, SectionedAddress::UndefSection},
		LookupPooledAddress)
		.begin();
		labathAuthorUnsubmitted Done Reply Inline Actions This parallel iteration is not completely nice, but I think it's worth being able to reuse the absolute range computation code. I'm open to ideas for improvement though. labath: This parallel iteration is not completely nice, but I think it's worth being able to reuse the…
		dblaikieUnsubmitted Not Done Reply Inline Actions Ah, I see - this is what you meant about "In particular it makes it possible to reuse this stuff in the dumping code, which would have been pretty hard with callbacks.". I'm wondering if that might be worth revisiting somewhat. A full iterator abstraction for one user here (well, two once you include lldb - but I assume it's likely going to build its own data structure from the iteration anyway, right? (it's not going to keep the iterator around, do anything interesting like partial iterations, re-iterate/etc - such that a callback would suffice)) I could imagine two callback APIs for this - one that gets entries and locations and one that only gets locations by filtering on the entry version. eg: // for non-verbose output: LL.forEachEntry([&](const Entry &E, Expected<DWARFLocation> L) { if (Verbose && actually dumping debug_loc) print(E) // print any LLE_, raw parameters, etc if (L) print(L) // print the resulting address range, section name (if verbose), else print(error stuff) }); One question would be "when/where do we print the DWARF expression" - if there's an error computing the address range, we can still print the expression, so maybe that happens unconditionally at the end of the callback, using the expression in the Entry? (then, arguably, the expression doesn't need to be in the DWARFLocation - and I'd say make the DWARFLocation a sectioned range, exactly the same type as for ranges so that part of the dumping code, etc, can be maximally reused) dblaikie: Ah, I see - this is what you meant about "In particular it makes it possible to reuse this…
		labathAuthorUnsubmitted Done Reply Inline Actions Actually, what lldb currently does is that it does not build any data structures at all (except storing the pointer to the right place in the debug_loc section. Then, whenever it wants to do something to the loclist, it parses it afresh. I don't know why it does this exactly, but I assume it has something to do with most locations never being used, or being only a couple of times, and the actual parsing being fairly fast. What this means is that lldb is not really a single "user", but there are like four or five places where it iterates through the list, depending on what does it actually want to do with it. It also does partial iteration where it stops as soon as it find the entry it was interested in. Now, all of that is possible with a callback (though I am generally trying to avoid them), but it does resurface the issue of what should be the value of the second argument for DW_LLE_base_address entries (the thing which I originally used a error type for). Maybe this should be actually one callback API, taking two callback functions, with one of them being invoked for base_address entries, and one for others? However, if we stick to the current approaches in both LLE and RLE of making the address pool resolution function a parameter (which I'd like to keep, as it makes my job in lldb easier), then this would actually be three callbacks, which starts to get unwieldy. Though one of those callbacks could be removed with the "DWARFUnit implementing a AddrOffsetResolver interface" idea, which I really like. :) labath: Actually, what lldb currently does is that it does not build any data structures at all (except…
		dblaikieUnsubmitted Not Done Reply Inline Actions Ah, thanks for the details on LLDB's location parsing logic. That's interesting indeed! I can appreciate an iterator-based API if that's the sort of usage we've got, though I expect it doesn't have any interest in the low-level encoding & just wants the fully processed address ranges/locations - it doesn't want base_address or end_of_list entries? & I think the dual-iteration is a fairly awkward API design, trying to iterate them in lock-step, etc. I'd rather avoid that if reasonably possible. Either having an iterator API that gives only the fully processed data/semantic view & a completely different API if you want to access the low level primitives (LLE, etc) (this is how ranges works - there's an API that gives a collection of ranges & abstracts over v4/v5/rnglists/etc - though that's partly motivated by a strong multi-client need for that functionality for symbolizing, etc - but I think it's a good abstraction/model anyway (& one of the reasons the inline range list printing doesn't include encoding information, the API it uses is too high level to even have access to it)) Now, all of that is possible with a callback (though I am generally trying to avoid them), but it does resurface the issue of what should be the value of the second argument for DW_LLE_base_address entries (the thing which I originally used a error type for). Sorry, my intent in the above API was for the second argument to be Optional's "None" state when... oh, I see, I did use Expected there, rather than Optional, because there are legit error cases. I know it's sort of awkward, but I might be inclined to use Optional<Expected<AddressRange>> there. I realize two layers of wrapping is a bit weird, but I think it'd be nicer than having an error state for what, I think, isn't erroneous. Maybe this should be actually one callback API, taking two callback functions, with one of them being invoked for base_address entries, and one for others? However, if we stick to the current approaches in both LLE and RLE of making the address pool resolution function a parameter (which I'd like to keep, as it makes my job in lldb easier), then this would actually be three callbacks, which starts to get unwieldy. Don't mind three callbacks too much. Though one of those callbacks could be removed with the "DWARFUnit implementing a AddrOffsetResolver interface" idea, which I really like. :) Sorry, I haven't really looked at where the address resolver callback is registered and alternative designs being discussed - but yeah, going off just the one-sentence, it seems reasonable to have the DWARFUnit own an address resolver/be the thing you consult when you want to resolve an address (just through a normal function call in DWARFUnit, perhaps - which might, internally, use a callback registered when it was constructed). dblaikie: Ah, thanks for the details on LLDB's location parsing logic. That's interesting indeed! I can…
		labathAuthorUnsubmitted Done Reply Inline Actions I know it's sort of awkward, but I might be inclined to use Optional<Expected<AddressRange>> there. I realize two layers of wrapping is a bit weird, but I think it'd be nicer than having an error state for what, I think, isn't erroneous. Actually, my very first attempt at this patch used an `Expected<Optional<Whatever>>`, but then I scrapped it because I didn't think you'd like it. It's not the friendliest of APIs, but I think we can go with that. Sorry, I haven't really looked at where the address resolver callback is registered and alternative designs being discussed - but yeah, going off just the one-sentence, it seems reasonable to have the DWARFUnit own an address resolver/be the thing you consult when you want to resolve an address (just through a normal function call in DWARFUnit, perhaps - which might, internally, use a callback registered when it was constructed). I think you got that backwards. I don't want the DWARFUnit to be the source of truth for address pool resolutions, as that would make it hard to use from lldb (it's far from ready to start using the llvm version right now). What I wanted was to replace the lambda/function_ref with a single-method interface. Then both DWARFUnits could implement that interface so that passing a DWARFUnit& would "just work" (but you wouldn't be limited to DWARFUnits as anyone could implement that interface, just like anyone can write a lambda). labath: > I know it's sort of awkward, but I might be inclined to use Optional<Expected<AddressRange>>…
		dblaikieUnsubmitted Not Done Reply Inline Actions As for Expected<Optional<Whatever>> (or Optional<Expected<>>) - yeah, I think this is a non-obvious API (both the general problem and this specific solution). I think it's probably worth discussing this design a bit more to save you time writing/rewriting things a bit. I guess there are a few layers of failure here. There's the possibility that the iteration itself could fail - even for debug_loc style lists (if we reached the end of the section before encountering a terminating {0,0}). That would suggest a fallible iterator idiom: http://llvm.org/docs/ProgrammersManual.html#building-fallible-iterators-and-iterator-ranges But then, yes, when looking at the "processed"/semantic view, that could fail too in the case of an invalid address index, etc. The generic/processed/abstracted-over-ranges-and-rnglists API for ranges produces a fully computer vector (& then returns Expected<vector> of that range) - is that reasonable? (this does mean manifesting a whole location in memory, which may not be needed so I could understand avoiding that even without fully implementing & demonstrating the vector solution is inadequate). But I /think/ maybe the we could/should have two APIs - one generic API that abstracts over loc/loclists and only provides the fully processed view, and another that is type specific for dumping the underlying representation (only used in dumping debug_loclists). dblaikie: As for Expected<Optional<Whatever>> (or Optional<Expected<>>) - yeah, I think this is a non…
		labathAuthorUnsubmitted Done Reply Inline Actions If we were computing the final address ranges from scratch (which would be the best match for the current lldb usage, but which I am not considering now for fear of changing too many things), then I agree that we would need the fallible_iterator iterator thingy. But in this case we are "interpreting" the already parsed ranges, so we can assume some level of correctness here, and the thing that can fail is only the computation of a single range, which does not affect our ability to process the next entry. This indicates to me that either each entry in the list should be an Expected<>, or that the invalid entries should be just dropped (possibly accompanied by some flag which would tell the caller that the result was not exhaustive). This is connected to one of the issues I have with the debug ranges API -- it tries _really_ hard to return something -- if resolving the indirect base address entry fails, it is perfectly happy to use the address _index_ as the base address. This makes sense for dumping, where you want to show something (though it would still be good to indicate that you're not showing a real address), but it definitely does not help consumers which then need to make decisions based on the returned data. Anyway, yes, I agree that we need to APIs, and probably callbacks are the easiest way to achieve that. We could have a "base" callback that is not particularly nice to use, but provides the full information via a combination of `UnparsedLL` and `Optional<Expected<ParsedLL>>` arguments. The dumper could use that to print out everything it needs. And then we could have a second API, built on top of the first one, which ignores base address entries and the raw data and returns just a bunch of `Expected<ParsedLL>`. This could be used by users like lldb, who just want to see the final data. The `ParsedLL` type would be independent of the location list type, so that the debug_loc parser could provide the same kind of API (but implemented on top of something else, as the `UnparsedLL` types would differ). Also, under the hood, the location list dumper for debug_loclists (but not debug_loc) could reuse some implementation details with the debug_rnglists dumper via a suitable combination of templates and callbacks. How does that sound? labath: If we were computing the final address ranges from scratch (which would be the best match for…
		dblaikieUnsubmitted Not Done Reply Inline Actions What sort of things are you concerned about with deeper API changes here? I think it's probably worth building the "right" thing now - as good a time as any. LLVM's debug info APIs, as you've pointed out, aren't exactly "sturdy" (treating address indexes as offsets, etc, etc), so no time like the present to clean it up. I think if we had an abstraction over v4 and v5 location descriptions, parsing from scratch, fallible iterators, etc - that'd be the ideal thing to use in the inline dumping code (that dumps inside debug_info) - which currently uses "parseOneLocationList" - so it is parsing from scratch and dumping. But equally I understand not wanting to make you/me/anyone fix everything when just trying to get something actually done. dblaikie: What sort of things are you concerned about with deeper API changes here? I think it's probably…
		labathAuthorUnsubmitted Done Reply Inline Actions I think I am mainly concerned with the scope explosion of doing changes like that. I don't know how well founded is that concern, because I don't know all the use cases for this information right now (but that's a part of the problem). But anyway, let's continue the discussion. The main problem I see with "inline" dumping using some higher-level abstraction is what should one do in case of incomplete data (e.g. dwo files). If this abstraction returns "cooked" data, then the inline dump could not show anything for dwo files dumped standalone. This is perfectly fine for lldb and maybe other tools (which never look at dwo files separately, and mostly don't care about the reason why the "cooking" failed -- if they just need to get the data it can rely on), but for something like llvm-dwarfdump, we'd probably want to display _something_. I guess this is why the ranges api returns indexes as offsets, but I think we agree we don't want that. Then the question is what should that "something" be? If it's going to be a raw dump of the location list entry, then solution would not be fully generic, as they raw entry type will depend on the location list kind. Though, we could still arrange it so that the various location lists can be processed uniformly via a template if they e.g. have a dump() function with the same signature... Suppose we go down that path (this is the path I wanted to go down in my previous comment, modulo the parse-from-scratch part). The question then is what to do with the section-based dumping. Should it use the same mechanism? It probably should, because the first callback/iterator will provide all data it can possibly want. But then, should it also build the parsed representation like it does now? If yes, then what for? Should we also build some mechanism to "cook" that data too. If not, are we ok with having all users reparse from scratch always? (There are only two users I found right now, and they look like they'd be fine with it.) Or should the DWARFDebugLoclists object cache the cooked data instead? llvm-dwarfdump --statistics would actually prefer that, but that might make the DWARF verifier sad (though I guess it could always reparse from scratch if needed). labath: I think I am mainly concerned with the scope explosion of doing changes like that. I don't know…
		JDevlieghereUnsubmitted Not Done Reply Inline Actions I'm very late to the discussion and I'm not as familiar as both of you with the details and the API uses, so please ignore my suggestion if it doesn't make any sense... Could we have an API where we parse a list of Uncooked (to reuse Pavel's nomenclature) and then have the ability to resolve each Uncooked entry into a Cooked entry? Then both LLDB and dwarfdump could get the list of Uncooked entries and try to get the cooked variant. If that works, great, we dump/use that, if not we move on and have separate failure modes for errors originating from parsing and from cooking. JDevlieghere: I'm very late to the discussion and I'm not as familiar as both of you with the details and the…
		labathAuthorUnsubmitted Done Reply Inline Actions We've discussed this with David last week, and we have hopefully agreed on the rough direction forward (and I think it's going to roughly correspond to what you had in mind). I'm preparing a patch (first of many) to implement that and I'm hoping I'll be able to upload something today. labath: We've discussed this with David last week, and we have hopefully agreed on the rough direction…
for (const Entry &E : Entries) {		for (const Entry &E : Entries) {
		// We dump the raw encoding if we're in verbose mode, or if we failed to
		// produce the absolute address range.
		const auto &DumpEncoding = [&] {
		OS << format("[%-*s]", MaxEncodingStringLength,
		dwarf::LocationListEncodingString(E.Kind).data());
		if (E.Kind != dwarf::DW_LLE_end_of_list)
		OS << ": ";
switch (E.Kind) {		switch (E.Kind) {
case dwarf::DW_LLE_startx_length:		case dwarf::DW_LLE_end_of_list:
OS << '\n';		// TODO: Generate these entries.
OS.indent(Indent);		llvm_unreachable("unreachable locations list kind");
OS << "Addr idx " << E.Value0 << " (w/ length " << E.Value1 << "): ";		case dwarf::DW_LLE_base_address:
		OS << format_hex(E.Value0, 2 + AddressSize * 2);
break;		break;
		case dwarf::DW_LLE_startx_length:
case dwarf::DW_LLE_start_length:		case dwarf::DW_LLE_start_length:
OS << '\n';
OS.indent(Indent);
OS << format("[0x%." PRIx64 ", 0x%." PRIx64 "): ", AddressSize * 2,
AddressSize * 2, E.Value0, AddressSize * 2, AddressSize * 2,
E.Value0 + E.Value1);
break;
case dwarf::DW_LLE_offset_pair:		case dwarf::DW_LLE_offset_pair:
OS << '\n';		OS << format_hex(E.Value0, 2 + AddressSize * 2) << ", "
		<< format_hex(E.Value1, 2 + AddressSize * 2);
		}
		};
		OS << "\n";
OS.indent(Indent);		OS.indent(Indent);
OS << format("[0x%." PRIx64 ", 0x%." PRIx64 "): ", AddressSize * 2,		if (DumpOpts.Verbose)
AddressSize * 2, BaseAddr + E.Value0, AddressSize * 2,		DumpEncoding();
AddressSize * 2, BaseAddr + E.Value1);		if (E.Kind == dwarf::DW_LLE_base_address)
break;		continue;
case dwarf::DW_LLE_base_address:		assert(Absolute.position() == &E);
BaseAddr = E.Value0;		if (auto ExpectedLocation = *Absolute) {
break;		if (DumpOpts.Verbose)
default:		OS << " => ";
llvm_unreachable("unreachable locations list kind");		ExpectedLocation->Range.dump(OS, AddressSize);
		} else {
		DumpEncoding();
		OS << " => ";
		OS << formatv("<{0}>", fmt_consume(ExpectedLocation.takeError()));
}		}
		++Absolute;
		OS << " ";
dumpExpression(OS, E.Loc, IsLittleEndian, AddressSize, MRI, U);		dumpExpression(OS, E.Loc, IsLittleEndian, AddressSize, MRI, U);
}		}
}		}

void DWARFDebugLoclists::dump(raw_ostream &OS, uint64_t BaseAddr,		void DWARFDebugLoclists::dump(raw_ostream &OS, uint64_t BaseAddr,
const MCRegisterInfo *MRI,		const MCRegisterInfo *MRI, DIDumpOptions DumpOpts,
Optional<uint64_t> Offset) const {		Optional<uint64_t> Offset) const {
auto DumpLocationList = [&](const LocationList &L) {		auto DumpLocationList = [&](const LocationList &L) {
OS << format("0x%8.8" PRIx64 ": ", L.Offset);		OS << format("0x%8.8" PRIx64 ": ", L.Offset);
L.dump(OS, BaseAddr, IsLittleEndian, AddressSize, MRI, nullptr, /Indent=/12);		L.dump(
		OS, BaseAddr, IsLittleEndian, AddressSize, MRI, DumpOpts,
		[](uint32_t Index) { return llvm::None; }, nullptr, /Indent=/12);
OS << "\n\n";		OS << "\n\n";
};		};

if (Offset) {		if (Offset) {
if (auto L = getLocationListAtOffset(Offset))		if (auto L = getLocationListAtOffset(Offset))
DumpLocationList(*L);		DumpLocationList(*L);
return;		return;
}		}

for (const LocationList &L : Locations) {		for (const LocationList &L : Locations) {
DumpLocationList(L);		DumpLocationList(L);
}		}
}		}

lib/DebugInfo/DWARF/DWARFDie.cpp

Show First 20 Lines • Show All 86 Lines • ▼ Show 20 Lines	if (FormValue.isFormClass(DWARFFormValue::FC_Block) \|\|
DataExtractor Data(StringRef((const char *)Expr.data(), Expr.size()),		DataExtractor Data(StringRef((const char *)Expr.data(), Expr.size()),
Ctx.isLittleEndian(), 0);		Ctx.isLittleEndian(), 0);
DWARFExpression(Data, U->getVersion(), U->getAddressByteSize())		DWARFExpression(Data, U->getVersion(), U->getAddressByteSize())
.print(OS, MRI, U);		.print(OS, MRI, U);
return;		return;
}		}

FormValue.dump(OS, DumpOpts);		FormValue.dump(OS, DumpOpts);
const auto &DumpLL = [&](auto ExpectedLL) {		const auto &DumpError = [&](Error E) {
if (ExpectedLL) {
uint64_t BaseAddr = 0;
if (Optional<object::SectionedAddress> BA = U->getBaseAddress())
BaseAddr = BA->Address;
ExpectedLL->dump(OS, BaseAddr, Ctx.isLittleEndian(), Obj.getAddressSize(),
MRI, U, Indent);
} else {
OS << '\n';		OS << '\n';
OS.indent(Indent);		OS.indent(Indent);
OS << formatv("error extracting location list: {0}",		OS << formatv("error extracting location list: {0}",
fmt_consume(ExpectedLL.takeError()));		fmt_consume(std::move(E)));
}
};		};
if (FormValue.isFormClass(DWARFFormValue::FC_SectionOffset)) {		if (FormValue.isFormClass(DWARFFormValue::FC_SectionOffset)) {
uint64_t Offset = *FormValue.getAsSectionOffset();		uint64_t Offset = *FormValue.getAsSectionOffset();

		uint64_t BaseAddr = 0;
		if (Optional<object::SectionedAddress> BA = U->getBaseAddress())
		BaseAddr = BA->Address;

if (!U->isDWOUnit() && !U->getLocSection()->Data.empty()) {		if (!U->isDWOUnit() && !U->getLocSection()->Data.empty()) {
DWARFDebugLoc DebugLoc;		DWARFDebugLoc DebugLoc;
DWARFDataExtractor Data(Obj, *U->getLocSection(), Ctx.isLittleEndian(),		DWARFDataExtractor Data(Obj, *U->getLocSection(), Ctx.isLittleEndian(),
Obj.getAddressSize());		Obj.getAddressSize());
DumpLL(DebugLoc.parseOneLocationList(Data, &Offset));		if (auto ExpectedLL = DebugLoc.parseOneLocationList(Data, &Offset))
		ExpectedLL->dump(OS, BaseAddr, Ctx.isLittleEndian(),
		Obj.getAddressSize(), MRI, U, Indent);
		else
		DumpError(ExpectedLL.takeError());
return;		return;
}		}

bool UseLocLists = !U->isDWOUnit();		bool UseLocLists = !U->isDWOUnit();
StringRef LoclistsSectionData =		StringRef LoclistsSectionData =
UseLocLists ? Obj.getLoclistsSection().Data : U->getLocSectionData();		UseLocLists ? Obj.getLoclistsSection().Data : U->getLocSectionData();

if (!LoclistsSectionData.empty()) {		if (!LoclistsSectionData.empty()) {
DataExtractor Data(LoclistsSectionData, Ctx.isLittleEndian(),		DataExtractor Data(LoclistsSectionData, Ctx.isLittleEndian(),
Obj.getAddressSize());		Obj.getAddressSize());

// Old-style location list were used in DWARF v4 (.debug_loc.dwo section).		// Old-style location list were used in DWARF v4 (.debug_loc.dwo section).
// Modern locations list (.debug_loclists) are used starting from v5.		// Modern locations list (.debug_loclists) are used starting from v5.
// Ideally we should take the version from the .debug_loclists section		// Ideally we should take the version from the .debug_loclists section
// header, but using CU's version for simplicity.		// header, but using CU's version for simplicity.
DumpLL(DWARFDebugLoclists::parseOneLocationList(		if (auto ExpectedLL = DWARFDebugLoclists::parseOneLocationList(
Data, &Offset, UseLocLists ? U->getVersion() : 4));		Data, &Offset, UseLocLists ? U->getVersion() : 4))
		ExpectedLL->dump(
		OS, BaseAddr, Ctx.isLittleEndian(), Obj.getAddressSize(), MRI,
		DumpOpts,
		[U](uint32_t Index) { return U->getAddrOffsetSectionItem(Index); },
		U, Indent);
		else
		DumpError(ExpectedLL.takeError());

}		}
}		}
}		}

/// Dump the name encoded in the type tag.		/// Dump the name encoded in the type tag.
static void dumpTypeTagName(raw_ostream &OS, dwarf::Tag T) {		static void dumpTypeTagName(raw_ostream &OS, dwarf::Tag T) {
StringRef TagStr = TagString(T);		StringRef TagStr = TagString(T);
if (!TagStr.startswith("DW_TAG_") \|\| !TagStr.endswith("_type"))		if (!TagStr.startswith("DW_TAG_") \|\| !TagStr.endswith("_type"))
▲ Show 20 Lines • Show All 598 Lines • Show Last 20 Lines

test/CodeGen/X86/debug-loclists.ll

	; RUN: llc -mtriple=x86_64-pc-linux -filetype=obj -o %t < %s			; RUN: llc -mtriple=x86_64-pc-linux -filetype=obj -o %t < %s
	; RUN: llvm-dwarfdump -v %t \| FileCheck %s			; RUN: llvm-dwarfdump -v %t \| FileCheck %s

	; CHECK: 0x00000033: DW_TAG_formal_parameter [3]			; CHECK: 0x00000033: DW_TAG_formal_parameter [3]
	; CHECK-NEXT: DW_AT_location [DW_FORM_sec_offset] (0x0000000c			; CHECK-NEXT: DW_AT_location [DW_FORM_sec_offset] (0x0000000c
	; CHECK-NEXT: [0x0000000000000000, 0x0000000000000004): DW_OP_breg5 RDI+0			; CHECK-NEXT: [0x0000000000000000, 0x0000000000000004) DW_OP_breg5 RDI+0
	; CHECK-NEXT: [0x0000000000000004, 0x0000000000000012): DW_OP_breg3 RBX+0)			; CHECK-NEXT: [0x0000000000000004, 0x0000000000000012) DW_OP_breg3 RBX+0)
	; CHECK-NEXT: DW_AT_name [DW_FORM_strx1] (indexed (0000000e) string = "a")			; CHECK-NEXT: DW_AT_name [DW_FORM_strx1] (indexed (0000000e) string = "a")
	; CHECK-NEXT: DW_AT_decl_file [DW_FORM_data1] ("/home/folder{{\\\|\/}}test.cc")			; CHECK-NEXT: DW_AT_decl_file [DW_FORM_data1] ("/home/folder{{\\\|\/}}test.cc")
	; CHECK-NEXT: DW_AT_decl_line [DW_FORM_data1] (6)			; CHECK-NEXT: DW_AT_decl_line [DW_FORM_data1] (6)
	; CHECK-NEXT: DW_AT_type [DW_FORM_ref4] (cu + 0x0040 => {0x00000040} "A")			; CHECK-NEXT: DW_AT_type [DW_FORM_ref4] (cu + 0x0040 => {0x00000040} "A")

	; CHECK: .debug_loclists contents:			; CHECK: .debug_loclists contents:
	; CHECK-NEXT: 0x00000000: locations list header: length = 0x00000015, version = 0x0005, addr_size = 0x08, seg_size = 0x00, offset_entry_count = 0x00000000			; CHECK-NEXT: 0x00000000: locations list header: length = 0x00000015, version = 0x0005, addr_size = 0x08, seg_size = 0x00, offset_entry_count = 0x00000000
	; CHECK-NEXT: 0x00000000:			; CHECK-NEXT: 0x00000000:
	; CHECK-NEXT: [0x0000000000000000, 0x0000000000000004): DW_OP_breg5 RDI+0			; CHECK-NEXT: [DW_LLE_offset_pair ]: 0x0000000000000000, 0x0000000000000004 => [0x0000000000000000, 0x0000000000000004) DW_OP_breg5 RDI+0
				labathAuthorUnsubmitted Done Reply Inline Actions This tries to follow the RLE format as closely as possible, but I think something like [DW_LLE_offset_pair, 0x0000000000000000, 0x0000000000000004] => [0x0000000000000000, 0x0000000000000004): DW_OP_breg5 RDI+0 would make more sense (both here and for RLE). labath: This tries to follow the RLE format as closely as possible, but I think something like ```…
				dblaikieUnsubmitted Not Done Reply Inline Actions Yep, that'd make more sense to me - are you planning to unify the codepaths for this? I think that'd be for the best. If I were picking a printing from scratch, I might go with: DW_LLE_offset_pair(0x0000, 0x0004) => [0x0000, 0x0004): DW_OP_breg5 RDI+0 Making it look a bit more like a function call and function arguments. Though the () might be confusing with the range notation. I'm also undecided on the " => " separator. Whether a ':' might be better/fine, etc. Totally open to ideas, but mostly I'd really love these to use loclist and ranges to use the same code as much as possible, so we can get consistency and any readability benefits, etc in both. dblaikie: Yep, that'd make more sense to me - are you planning to unify the codepaths for this? I think…
				labathAuthorUnsubmitted Done Reply Inline Actions I like the function call format. I hoping to get some code reuse, though it's still not fully clear to me how to achieve that.. labath: I like the function call format. I hoping to get some code reuse, though it's still not fully…
				dblaikieUnsubmitted Not Done Reply Inline Actions I've posted my unification of range/loc/v4/v5 emission here: https://reviews.llvm.org/D68620 - & I'd imagine something similar in the parsing side. dblaikie: I've posted my unification of range/loc/v4/v5 emission here: https://reviews.llvm.org/D68620…
				labathAuthorUnsubmitted Done Reply Inline Actions cool. I'll see what I can do with that. labath: cool. I'll see what I can do with that.
	; CHECK-NEXT: [0x0000000000000004, 0x0000000000000012): DW_OP_breg3 RBX+0			; CHECK-NEXT: [DW_LLE_offset_pair ]: 0x0000000000000004, 0x0000000000000012 => [0x0000000000000004, 0x0000000000000012) DW_OP_breg3 RBX+0

	; There is no way to use llvm-dwarfdump atm (2018, october) to verify the DW_LLE_* codes emited,
	; because dumper is not yet implements that. Use asm code to do this check instead.
	;
	; RUN: llc -mtriple=x86_64-pc-linux -filetype=asm < %s -o - \| FileCheck %s --check-prefix=ASM
	; ASM: .section .debug_loclists,"",@progbits
	; ASM-NEXT: .long .Ldebug_loclist_table_end0-.Ldebug_loclist_table_start0 # Length
	; ASM-NEXT: .Ldebug_loclist_table_start0:
	; ASM-NEXT: .short 5 # Version
	; ASM-NEXT: .byte 8 # Address size
	; ASM-NEXT: .byte 0 # Segment selector size
	; ASM-NEXT: .long 0 # Offset entry count
	; ASM-NEXT: .Lloclists_table_base0:
	; ASM-NEXT: .Ldebug_loc0:
	; ASM-NEXT: .byte 4 # DW_LLE_offset_pair
	; ASM-NEXT: .uleb128 .Lfunc_begin0-.Lfunc_begin0 # starting offset
	; ASM-NEXT: .uleb128 .Ltmp0-.Lfunc_begin0 # ending offset
	; ASM-NEXT: .byte 2 # Loc expr size
	; ASM-NEXT: .byte 117 # DW_OP_breg5
	; ASM-NEXT: .byte 0 # 0
	; ASM-NEXT: .byte 4 # DW_LLE_offset_pair
	; ASM-NEXT: .uleb128 .Ltmp0-.Lfunc_begin0 # starting offset
	; ASM-NEXT: .uleb128 .Ltmp1-.Lfunc_begin0 # ending offset
	; ASM-NEXT: .byte 2 # Loc expr size
	; ASM-NEXT: .byte 115 # DW_OP_breg3
	; ASM-NEXT: .byte 0 # 0
	; ASM-NEXT: .byte 0 # DW_LLE_end_of_list
	; ASM-NEXT: .Ldebug_loclist_table_end0:

	; ModuleID = 'test.cc'			; ModuleID = 'test.cc'
	source_filename = "test.cc"			source_filename = "test.cc"
	target datalayout = "e-m:e-i64:64-f80:128-n8:16:32:64-S128"			target datalayout = "e-m:e-i64:64-f80:128-n8:16:32:64-S128"
	target triple = "x86_64-unknown-linux-gnu"			target triple = "x86_64-unknown-linux-gnu"

	%struct.A = type { i32 (...)** }			%struct.A = type { i32 (...)** }

	▲ Show 20 Lines • Show All 89 Lines • Show Last 20 Lines

test/DebugInfo/X86/dwarfdump-debug-loclists-error-cases2.s

	# RUN: llvm-mc -triple=x86_64-pc-linux -filetype=obj %s > %t			# RUN: llvm-mc -triple=x86_64-pc-linux -filetype=obj %s > %t
	# RUN: llvm-dwarfdump %t \| FileCheck %s			# RUN: llvm-dwarfdump %t \| FileCheck %s

	# CHECK: DW_AT_name ("x0")			# CHECK: DW_AT_name ("x0")
	# CHECK-NEXT: DW_AT_location (0x0000000c			# CHECK-NEXT: DW_AT_location (0x0000000c
	# CHECK-NEXT: [0x0000000000000000, 0x0000000000000002): DW_OP_reg5 RDI			# CHECK-NEXT: [0x0000000000000000, 0x0000000000000002) DW_OP_reg5 RDI
	# CHECK-NEXT: [0x0000000000000002, 0x0000000000000003): DW_OP_reg0 RAX)			# CHECK-NEXT: [0x0000000000000002, 0x0000000000000003) DW_OP_reg0 RAX)

	# CHECK: DW_AT_name ("x1")			# CHECK: DW_AT_name ("x1")
	# CHECK-NEXT: DW_AT_location (0xdeadbeef			# CHECK-NEXT: DW_AT_location (0xdeadbeef
	# CHECK-NEXT: error extracting location list: unexpected end of data)			# CHECK-NEXT: error extracting location list: unexpected end of data)

	# CHECK: DW_AT_name ("x2")			# CHECK: DW_AT_name ("x2")
	# CHECK-NEXT: DW_AT_location (0x00000025			# CHECK-NEXT: DW_AT_location (0x00000025
	# CHECK-NEXT: error extracting location list: unexpected end of data)			# CHECK-NEXT: error extracting location list: unexpected end of data)
	▲ Show 20 Lines • Show All 117 Lines • Show Last 20 Lines

test/DebugInfo/X86/dwarfdump-debug-loclists.test

	# RUN: llvm-mc %s -filetype obj -triple x86_64-pc-linux -o %t.o			# RUN: llvm-mc %s -filetype obj -triple x86_64-pc-linux -o %t.o
	# RUN: llvm-dwarfdump -v %t.o \| FileCheck %s			# RUN: llvm-dwarfdump -v %t.o \| FileCheck %s

	# CHECK: .debug_info			# CHECK: .debug_info
	# CHECK: DW_AT_name{{.*}}"stub"			# CHECK: DW_AT_name{{.*}}"stub"
	# CHECK: DW_AT_location [DW_FORM_sec_offset] (0x0000000c			# CHECK: DW_AT_location [DW_FORM_sec_offset] (0x0000000c
	# CHECK-NEXT: [0x0000000000000010, 0x0000000000000020): DW_OP_breg5 RDI+0			# CHECK-NEXT: [DW_LLE_offset_pair ]: 0x0000000000000000, 0x0000000000000010 => [0x0000000000000010, 0x0000000000000020) DW_OP_breg5 RDI+0
				dblaikieUnsubmitted Not Done Reply Inline Actions I don't think the inline dumping should print the encoding - I'd borrow a lot from/try to unify with the ranges printing, which doesn't. I think verbose ranges print the same as non-verbose except they also add the section name/number. dblaikie: I don't think the inline dumping should print the encoding - I'd borrow a lot from/try to unify…
				labathAuthorUnsubmitted Done Reply Inline Actions Sure, I can do that, though I think that means there won't be a single place where one can see both the raw encodings and their interpretation -- section-based dumping will not show the interpretation (would you want me to show still show them I they happen to be interpretable without the base address or the address pool?), and the debug_info dumping will not show the encoding. Is that bad? -- I don't know... labath: Sure, I can do that, though I think that means there won't be a single place where one can see…
				dblaikieUnsubmitted Not Done Reply Inline Actions Fair - that comes back to the issue I mentioned in a previous comment about potentially limiting dumping of non-debug_info sections based on the presence of a CU that references it (& only dumping it that way, rather than trying to parse it without a CU). DWARF isn't really designed to be parsed without the CU anyway. (could leave it in as best-effort to parse things without a referencing CU for debugging, etc). Mostly I'm interested in unification perhaps more/primarily, than feature improvements - then we can make feature improvements to both ranges and locs without having to duplicate things. dblaikie: Fair - that comes back to the issue I mentioned in a previous comment about potentially…
	# CHECK-NEXT: [0x0000000000000530, 0x0000000000000540): DW_OP_breg6 RBP-8, DW_OP_deref			# CHECK-NEXT: [DW_LLE_base_address ]: 0x0000000000000500
	# CHECK-NEXT: [0x0000000000000700, 0x0000000000000710): DW_OP_breg5 RDI+0			#
				# CHECK-NEXT: [DW_LLE_offset_pair ]: 0x0000000000000030, 0x0000000000000040 => [0x0000000000000530, 0x0000000000000540) DW_OP_breg6 RBP-8, DW_OP_deref
				# CHECK-NEXT: [DW_LLE_start_length ]: 0x0000000000000700, 0x0000000000000010 => [0x0000000000000700, 0x0000000000000710) DW_OP_breg5 RDI+0

	# CHECK: .debug_loclists contents:			# CHECK: .debug_loclists contents:
	# CHECK-NEXT: 0x00000000: locations list header: length = 0x0000002c, version = 0x0005, addr_size = 0x08, seg_size = 0x00, offset_entry_count = 0x00000000			# CHECK-NEXT: 0x00000000: locations list header: length = 0x0000002c, version = 0x0005, addr_size = 0x08, seg_size = 0x00, offset_entry_count = 0x00000000
	# CHECK-NEXT: 0x00000000:			# CHECK-NEXT: 0x00000000:
	# CHECK-NEXT: [0x0000000000000000, 0x0000000000000010): DW_OP_breg5 RDI+0			# CHECK-NEXT: [DW_LLE_offset_pair ]: 0x0000000000000000, 0x0000000000000010 => [0x0000000000000000, 0x0000000000000010) DW_OP_breg5 RDI+0
	# CHECK-NEXT: [0x0000000000000530, 0x0000000000000540): DW_OP_breg6 RBP-8, DW_OP_deref			# CHECK-NEXT: [DW_LLE_base_address ]: 0x0000000000000500
	# CHECK-NEXT: [0x0000000000000700, 0x0000000000000710): DW_OP_breg5 RDI+0			# CHECK-NEXT: [DW_LLE_offset_pair ]: 0x0000000000000030, 0x0000000000000040 => [0x0000000000000530, 0x0000000000000540) DW_OP_breg6 RBP-8, DW_OP_deref
				# CHECK-NEXT: [DW_LLE_start_length ]: 0x0000000000000700, 0x0000000000000010 => [0x0000000000000700, 0x0000000000000710) DW_OP_breg5 RDI+0

	.section .debug_str,"MS",@progbits,1			.section .debug_str,"MS",@progbits,1
	.asciz "stub"			.asciz "stub"

	.section .debug_str_offsets,"",@progbits			.section .debug_str_offsets,"",@progbits
	.long 68			.long 68
	.short 5			.short 5
	.short 0			.short 0
	▲ Show 20 Lines • Show All 143 Lines • Show Last 20 Lines

test/DebugInfo/X86/fission-ranges.ll

	Show All 39 Lines
	; CHECK-NOT: .debug_loc contents:			; CHECK-NOT: .debug_loc contents:
	; CHECK-NOT: Beginning address offset			; CHECK-NOT: Beginning address offset
	; CHECK: .debug_loc.dwo contents:			; CHECK: .debug_loc.dwo contents:

	; Don't assume these locations are entirely correct - feel free to update them			; Don't assume these locations are entirely correct - feel free to update them
	; if they've changed due to a bugfix, change in register allocation, etc.			; if they've changed due to a bugfix, change in register allocation, etc.

	; CHECK: [[A]]:			; CHECK: [[A]]:
	; CHECK-NEXT: Addr idx 2 (w/ length 15): DW_OP_consts +0, DW_OP_stack_value			; CHECK-NEXT: [DW_LLE_startx_length]: 0x00000002, 0x0000000f => <Failed to read address offset 2> DW_OP_consts +0, DW_OP_stack_value
				labathAuthorUnsubmitted Done Reply Inline Actions This is somewhat annoying, because the entries printed through the loclists section will always have this error (as we don't have the DWARFUnit). I'll have to figure out a way to suppress those, while still keeping them around when printing from DWARFDie (as there a failure means a real error). labath: This is somewhat annoying, because the entries printed through the loclists section will always…
				dblaikieUnsubmitted Not Done Reply Inline Actions IMHO we may want to move to a model where we don't try to create/parse any content except by finding a reference from a CU (or the DWARFv5 stanfdalone line tables). In theory, it's perfectly find to have random garbage in debug sections other than debug_info (or the standalone line table) - because the only parts that should be parsed are those referenced from debug_info. This came up in the form of a bug in location list dumping when the binary is linked with bfd ld. It doesn't update any addresses to discarded sections, leaving them as zero (whereas gold and lld write the addend to the relocation - which generally makes sure any range pair doesn't end up as "zero zero" which marks the end of a list) which terminates a list early and leads to the following location expression to be parsed as the start of a new list... which is totally bogus. Now, granted, the resulting debug info from bfd ld is wrong (if you had a location list spanning multiple functions (eg: a global variable had been put in a register for the duration of a function, etc) then resolving any one of those location entries to zero-zero would terminate the list early even though there might be non-dropped functions in the list after that point) - but I still think there's something to be said for it. There's a fair counterargument too - that we might want to be able to make a best-effort to dump content that isn't complete (eg: if a section was emitted alone - or there was some hunk of unreferenced location list in the debug_loc section, it might be interesting to know what's in that hunk - might give you hints about where it /should/ have been referenced from) Apparently binutils objdump when printing debug info only dumps those referenced pieces and prints info about "holes" when there's unreferenced chunks. Ah, here's the bug context on that: https://bugs.llvm.org/show_bug.cgi?id=43290 But, yeah, all that aside - given the architecture of libDebugInfoDWARF/llvm-dwarfdump right now, yes, it'd be good to omit those error messages. Also note that address indexes wouldn't be resolvable when dumping .dwo files - since the debug_addr would be in the .o file instead. So it'd be good to not print lots of error messages there either. dblaikie: IMHO we may want to move to a model where we don't try to create/parse any content except by…
	; CHECK-NEXT: Addr idx 3 (w/ length 15): DW_OP_reg0 RAX			; CHECK-NEXT: [DW_LLE_startx_length]: 0x00000003, 0x0000000f => <Failed to read address offset 3> DW_OP_reg0 RAX
	; CHECK-NEXT: Addr idx 4 (w/ length 18): DW_OP_breg7 RSP-8			; CHECK-NEXT: [DW_LLE_startx_length]: 0x00000004, 0x00000012 => <Failed to read address offset 4> DW_OP_breg7 RSP-8
	; CHECK: [[E]]:			; CHECK: [[E]]:
	; CHECK-NEXT: Addr idx 5 (w/ length 9): DW_OP_reg0 RAX			; CHECK-NEXT: [DW_LLE_startx_length]: 0x00000005, 0x00000009 => <Failed to read address offset 5> DW_OP_reg0 RAX
	; CHECK-NEXT: Addr idx 6 (w/ length 98): DW_OP_breg7 RSP-44			; CHECK-NEXT: [DW_LLE_startx_length]: 0x00000006, 0x00000062 => <Failed to read address offset 6> DW_OP_breg7 RSP-44
	; CHECK: [[B]]:			; CHECK: [[B]]:
	; CHECK-NEXT: Addr idx 7 (w/ length 15): DW_OP_reg0 RAX			; CHECK-NEXT: [DW_LLE_startx_length]: 0x00000007, 0x0000000f => <Failed to read address offset 7> DW_OP_reg0 RAX
	; CHECK-NEXT: Addr idx 8 (w/ length 66): DW_OP_breg7 RSP-32			; CHECK-NEXT: [DW_LLE_startx_length]: 0x00000008, 0x00000042 => <Failed to read address offset 8> DW_OP_breg7 RSP-32
	; CHECK: [[D]]:			; CHECK: [[D]]:
	; CHECK-NEXT: Addr idx 9 (w/ length 15): DW_OP_reg0 RAX			; CHECK-NEXT: [DW_LLE_startx_length]: 0x00000009, 0x0000000f => <Failed to read address offset 9> DW_OP_reg0 RAX
	; CHECK-NEXT: Addr idx 10 (w/ length 42): DW_OP_breg7 RSP-20			; CHECK-NEXT: [DW_LLE_startx_length]: 0x0000000a, 0x0000002a => <Failed to read address offset 10> DW_OP_breg7 RSP-20

	; Make sure we don't produce any relocations in any .dwo section (though in particular, debug_info.dwo)			; Make sure we don't produce any relocations in any .dwo section (though in particular, debug_info.dwo)
	; HDR-NOT: .rela.{{.*}}.dwo			; HDR-NOT: .rela.{{.*}}.dwo

	; Make sure we have enough stuff in the debug_addr to cover the address indexes			; Make sure we have enough stuff in the debug_addr to cover the address indexes
	; (10 is the last index in debug_loc.dwo, making 11 entries of 8 bytes each,			; (10 is the last index in debug_loc.dwo, making 11 entries of 8 bytes each,
	; 11 * 8 == 88 base 10 == 58 base 16)			; 11 * 8 == 88 base 10 == 58 base 16)

	▲ Show 20 Lines • Show All 168 Lines • Show Last 20 Lines

test/DebugInfo/X86/loclists-dwp.ll

	Show All 13 Lines
	; y();			; y();
	; asm("" : : : "rdi");			; asm("" : : : "rdi");
	; }			; }
	;			;
	; b.cpp:			; b.cpp:
	; void b(int i) { asm("" : : : "rdi"); }			; void b(int i) { asm("" : : : "rdi"); }

	; CHECK: DW_AT_location [DW_FORM_sec_offset] (0x00000000			; CHECK: DW_AT_location [DW_FORM_sec_offset] (0x00000000
	; CHECK-NEXT: Addr idx 0 (w/ length 6): DW_OP_reg5 RDI)			; CHECK-NEXT: [DW_LLE_startx_length]: 0x0000000000000000, 0x0000000000000006 => <Failed to read address offset 0> DW_OP_reg5 RDI)

	; CHECK: DW_AT_location [DW_FORM_sec_offset] (0x00000000			; CHECK: DW_AT_location [DW_FORM_sec_offset] (0x00000000
	; CHECK-NEXT: Addr idx 0 (w/ length 0): DW_OP_reg5 RDI)			; CHECK-NEXT: [DW_LLE_startx_length]: 0x0000000000000000, 0x0000000000000000 => <Failed to read address offset 0> DW_OP_reg5 RDI)

	target triple = "x86_64-unknown-linux-gnu"			target triple = "x86_64-unknown-linux-gnu"

	define dso_local void @_Z1ai(i32 %i) local_unnamed_addr !dbg !7 {			define dso_local void @_Z1ai(i32 %i) local_unnamed_addr !dbg !7 {
	entry:			entry:
	call void @llvm.dbg.value(metadata i32 %i, metadata !12, metadata !DIExpression()), !dbg !13			call void @llvm.dbg.value(metadata i32 %i, metadata !12, metadata !DIExpression()), !dbg !13
	tail call void @_Z1yv(), !dbg !14			tail call void @_Z1yv(), !dbg !14
	tail call void asm sideeffect "", "~{rdi},~{dirflag},~{fpsr},~{flags}"(), !dbg !15, !srcloc !16			tail call void asm sideeffect "", "~{rdi},~{dirflag},~{fpsr},~{flags}"(), !dbg !15, !srcloc !16
	Show All 29 Lines

test/tools/llvm-dwarfdump/X86/debug_loc_dwo.s

	# RUN: llvm-mc %s -filetype obj -triple x86_64-pc-linux -o %t.o			# RUN: llvm-mc %s -filetype obj -triple x86_64-pc-linux -o %t.o
	# RUN: llvm-dwarfdump --debug-loc %t.o \| FileCheck %s			# RUN: llvm-dwarfdump --debug-loc %t.o \| FileCheck %s

	# We make sure that llvm-dwarfdump can dump the .debug_loc.dwo section			# We make sure that llvm-dwarfdump can dump the .debug_loc.dwo section
	# without requiring a compilation unit in the .debug_info.dwo section.			# without requiring a compilation unit in the .debug_info.dwo section.

	# CHECK: .debug_loc.dwo contents:			# CHECK: .debug_loc.dwo contents:
	# CHECK-NEXT: 0x00000000:			# CHECK-NEXT: 0x00000000:
	# CHECK-NEXT: Addr idx 1 (w/ length 16): DW_OP_reg5 RDI			# CHECK-NEXT: [DW_LLE_startx_length]: 0x00000001, 0x00000010 => <Failed to read address offset 1> DW_OP_reg5 RDI

	.section .debug_loc.dwo,"",@progbits			.section .debug_loc.dwo,"",@progbits
	# One location list. The pre-DWARF v5 implementation only recognizes			# One location list. The pre-DWARF v5 implementation only recognizes
	# DW_LLE_startx_length as an entry kind in .debug_loc.dwo (besides			# DW_LLE_startx_length as an entry kind in .debug_loc.dwo (besides
	# end_of_list), which is what llvm generates as well.			# end_of_list), which is what llvm generates as well.
	.byte 3 # DW_LLE_startx_length			.byte 3 # DW_LLE_startx_length
	.byte 0x01 # Index			.byte 0x01 # Index
	.long 0x10 # Length			.long 0x10 # Length
	.short 1 # Loc expr size			.short 1 # Loc expr size
	.byte 85 # DW_OP_reg5			.byte 85 # DW_OP_reg5
	.byte 0 # DW_LLE_end_of_list			.byte 0 # DW_LLE_end_of_list

test/tools/llvm-dwarfdump/X86/debug_loclists_startx_length.s

	# RUN: llvm-mc %s -filetype obj -triple x86_64-pc-linux -o %t.o			# RUN: llvm-mc %s -filetype obj -triple x86_64-pc-linux -o %t.o
	# RUN: llvm-dwarfdump -v %t.o \| FileCheck %s			# RUN: llvm-dwarfdump -v %t.o \| FileCheck %s

	# DW_LLE_startx_length has different `length` encoding in pre-DWARF 5			# DW_LLE_startx_length has different `length` encoding in pre-DWARF 5
	# and final DWARF 5 versions. This test checks we are able to parse			# and final DWARF 5 versions. This test checks we are able to parse
	# the final version which uses ULEB128 and not the U32.			# the final version which uses ULEB128 and not the U32.

	# CHECK: .debug_loclists contents:			# CHECK: .debug_loclists contents:
	# CHECK-NEXT: 0x00000000: locations list header: length = 0x0000000e, version = 0x0005, addr_size = 0x08, seg_size = 0x00, offset_entry_count = 0x00000000			# CHECK-NEXT: 0x00000000: locations list header: length = 0x0000000e, version = 0x0005, addr_size = 0x08, seg_size = 0x00, offset_entry_count = 0x00000000
	# CHECK-NEXT: 0x00000000:			# CHECK-NEXT: 0x00000000:
	# CHECK-NEXT: Addr idx 1 (w/ length 16): DW_OP_reg5 RDI			# CHECK-NEXT: [DW_LLE_startx_length]: 0x0000000000000001, 0x0000000000000010[DW_LLE_startx_length]: 0x0000000000000001, 0x0000000000000010 => <Failed to read address offset 1> DW_OP_reg5 RDI

	.section .debug_loclists,"",@progbits			.section .debug_loclists,"",@progbits
	.long .Ldebug_loclist_table_end0-.Ldebug_loclist_table_start0			.long .Ldebug_loclist_table_end0-.Ldebug_loclist_table_start0
	.Ldebug_loclist_table_start0:			.Ldebug_loclist_table_start0:
	.short 5 # Version.			.short 5 # Version.
	.byte 8 # Address size.			.byte 8 # Address size.
	.byte 0 # Segmen selector size.			.byte 0 # Segmen selector size.
	.long 0 # Offset entry count.			.long 0 # Offset entry count.

	.byte 3 # DW_LLE_startx_length			.byte 3 # DW_LLE_startx_length
	.byte 0x01 # Index			.byte 0x01 # Index
	.uleb128 0x10 # Length			.uleb128 0x10 # Length
	.byte 1 # Loc expr size			.byte 1 # Loc expr size
	.byte 85 # DW_OP_reg5			.byte 85 # DW_OP_reg5
	.byte 0 # DW_LLE_end_of_list			.byte 0 # DW_LLE_end_of_list
	.Ldebug_loclist_table_end0:			.Ldebug_loclist_table_end0:

This is an archive of the discontinued LLVM Phabricator instance.

DWARFDebugLoc: Add a function to get the address range of an entryAbandonedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 223193

include/llvm/BinaryFormat/Dwarf.h

include/llvm/DebugInfo/DWARF/DWARFDebugLoc.h

lib/BinaryFormat/Dwarf.cpp

lib/DebugInfo/DWARF/DWARFContext.cpp

lib/DebugInfo/DWARF/DWARFDebugLoc.cpp

lib/DebugInfo/DWARF/DWARFDie.cpp

test/CodeGen/X86/debug-loclists.ll

test/DebugInfo/X86/dwarfdump-debug-loclists-error-cases2.s

test/DebugInfo/X86/dwarfdump-debug-loclists.test

test/DebugInfo/X86/fission-ranges.ll

test/DebugInfo/X86/loclists-dwp.ll

test/tools/llvm-dwarfdump/X86/debug_loc_dwo.s

test/tools/llvm-dwarfdump/X86/debug_loclists_startx_length.s

DWARFDebugLoc: Add a function to get the address range of an entry
AbandonedPublic