
[LLDB] Add class WasmProcess for WebAssembly debugging
Needs ReviewPublic

Authored by paolosev on Apr 24 2020, 1:57 AM.

Details

Summary

This is the fourth in a series of patches to enable LLDB debugging of WebAssembly code that runs in a WebAssembly engine. Previous patches added ObjectFile, SymbolVendor and DynamicLoader plugin classes for Wasm, see: D71575, D72751, D72650.

The idea is to use the GDB-remote protocol to connect to a Wasm engine that implements a GDB-remote stub that offers the ability to access the engine's internal runtime state. This patch introduces a new Process plugin, wasm, with:

  • Class ProcessWasm that inherits from ProcessGDBRemote and that provides functions to access the Wasm engine state through a GDBRemote connection.
  • Class ThreadWasm that inherits from ThreadGDBRemote and that provides functions to access the Wasm call stack and create the Wasm stack unwinder.
  • Class UnwindWasm that manages stack unwinding for Wasm.

The code that represents a DWARF expression is now separated from the logic to evaluate that expression. The former is still in class DWARFExpression, while for the latter a new plugin type is introduced, DWARFEvaluator.

  • Class DWARFEvaluator contains the generic code for evaluating DWARF expressions, and it is possible to introduce platform-specific plugin classes to evaluate platform-specific DWARF opcodes.
  • Class DWARFEvaluatorFactory is the plugin base-class that is initialized, once per module, and cached in a Module instance. It creates DWARFEvaluator objects for a given DWARFExpression.
  • Class WasmDWARFEvaluatorFactory represents the plugin to create DWARFEvaluators specific for WebAssembly.
  • Class WasmDWARFEvaluator contains the logic to evaluate DW_OP_WASM_location, which requires accessing the Wasm engine through gdb-remote to query the state of the Wasm program (locals, globals, stack, memory and data sections), and to send requests through gdb-remote.

Note that the GDB-remote protocol needs to be extended with a few Wasm-specific custom query commands, implemented in ProcessWasm and used to access Wasm-specific constructs like the Wasm memory, Wasm locals and globals.

Wasm addresses are encoded as 64-bit values with this format:

63 61           32            0
+-+-------------+-------------+
|T|  module_id  |   offset    |
+-+-------------+-------------+

where T is 0:Code, 1:Data, 2:Memory.
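A minimal sketch of this encoding, assuming the 2-bit type field implied by the diagram (T needs two bits to hold the values 0, 1 and 2, leaving a 30-bit module_id and a 32-bit offset); the helper names are illustrative, not part of the patch:

```cpp
#include <cassert>
#include <cstdint>

// 64-bit Wasm address layout: type in bits 63-62, module_id in bits
// 61-32, offset in bits 31-0.
enum class WasmAddressType : uint64_t { Code = 0, Data = 1, Memory = 2 };

constexpr uint64_t MakeWasmAddress(WasmAddressType type, uint32_t module_id,
                                   uint32_t offset) {
  return (static_cast<uint64_t>(type) << 62) |
         ((static_cast<uint64_t>(module_id) & 0x3fffffff) << 32) | offset;
}

constexpr WasmAddressType GetType(uint64_t addr) {
  return static_cast<WasmAddressType>(addr >> 62);
}
constexpr uint32_t GetModuleId(uint64_t addr) {
  return static_cast<uint32_t>((addr >> 32) & 0x3fffffff);
}
constexpr uint32_t GetOffset(uint64_t addr) {
  return static_cast<uint32_t>(addr);
}
```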

Diff Detail

Event Timeline

paolosev created this revision. Apr 24 2020, 1:57 AM
paolosev marked 2 inline comments as done. Apr 24 2020, 2:03 AM

What is the best way to test classes WasmProcessGDBRemote and UnwindWasm?

lldb/source/Plugins/Process/wasm/ProcessWasm.cpp
93

This will be implemented as:

return GetGDBRemote().GetWasmLocal(frame_index, index, buf, buffer_size, size);

as soon as GetWasmLocal can be added to GDBRemoteCommunicationClient.

lldb/source/Plugins/Process/wasm/UnwindWasm.cpp
35–36

This cast works but it is ugly. Is there a better coding pattern I could use here?

Before we get into the details of this patch (with all the criss-cross friends and dependencies there's a lot to talk about there too), could you give an overview of how you imagine this working as a whole, and why it is necessary to create these new classes? Having spoken to some wasm folks, I think I know the answers to some of the "why"s. However, I don't think other developers do, and even I don't know the "how" story.

clayborg added a comment. Edited Apr 26 2020, 1:38 PM

So a few things here. It doesn't seem like it is necessary to create the WasmProcessGDBRemote and IWasmProcess. It would be fine to extend the current ProcessGDBRemote and ThreadGDBRemote classes. The whole reason seems to be that the variables (globals, locals, etc) are fetched through the GDB server API and that doesn't happen for other users of the protocol where this information is fetched via the debug info. Is this correct? You seem to have debug info and DWARF (since you mentioned a new DWARF expression opcode), so how do variables actually work? Do you use debug info? What info for variables do you need to fetch from the API?

It also seems that you fetch the stack backtrace via the GDB remote protocol as well. This would be easy to add in to the generic GDB remote protocol. This could also be built in at the lldb_private::Process/lldb_private::Thread API level where a process/thread specifies it fetches the variables and/or stack itself instead of letting the unwind engine do its thing. This can be really useful, for instance, for a core or minidump file that is able to store a backtrace, so that when you don't have all the system libraries you can still get a good backtrace from the core file. So the backtrace part should definitely be part of the core LLDB logic where it can ask a process or thread if it provides a backtrace or not, and we add new virtual APIs to the lldb_private::Process/lldb_private::Thread classes to detect and handle this. The ProcessGDBRemote and ThreadGDBRemote would then implement these functions and answer "yes" if the GDB server supports fetching these things.

So if you can elaborate in detail how variables work and how the stack trace works and exactly what needs to go through the GDB server API, we can work out how this should happen in LLDB. From what I understand right now I would:

  • modify lldb_private::Process/lldb_private::Thread to add new virtual (not pure virtual) APIs that answer "false" when asked if the process/thread provides variables and stacks
  • modify the GDB remote protocol to handle a new "qSupported" variant that asks if variables and stacks are supported via the API. Most GDB servers will answer with not supported. See https://sourceware.org/gdb/current/onlinedocs/gdb/General-Query-Packets.html#qSupported
  • modify ProcessGDBRemote and ThreadGDBRemote to override these APIs and answer "true" to handling variables and stack if the server supports this.
  • Modify the unwind code to ask the lldb_private::Thread if it provides a backtrace. If true, then skip the normal unwind and use the new APIs on lldb_private::Thread
  • Remove the ProcessWasm code and UnwindWasm code.
paolosev updated this revision to Diff 260216. Edited Apr 26 2020, 11:31 PM

I am adding all the pieces to this patch to make the whole picture clearer; I thought adding a piece at a time would simplify reviews, but it probably ended up making things more obscure. I can always split this patch later, and I need to refactor everything anyway.

So, the idea is to use DWARF as debug info for Wasm, as it is already supported by LLVM and Emscripten. For this we introduced some time ago the plugin classes ObjectFileWasm, SymbolVendorWasm and DynamicLoaderWasmDYLD. However, WebAssembly is peculiarly different from native targets. When source code is compiled to Wasm, Clang produces a module that contains Wasm bytecode (a bit like what happens with Java and C#), and the DWARF info refers to this bytecode.
The Wasm module then runs in a Wasm runtime. (It is also possible to AoT-compile Wasm to native, but this is outside the scope of this patch).

Therefore, LLDB cannot debug Wasm by just controlling the inferior process; it needs to talk with the Wasm engine to query the Wasm engine state. For example, for backtraces, only the runtime knows what the current call stack is. Hence the idea of using the gdb-remote protocol: if a Wasm engine has a GDB stub, LLDB can connect to it to start a debugging session and access its state.

Wasm execution is defined in terms of a stack machine. There are no registers (besides the implicit IP) and most Wasm instructions push/pop values into/from a virtual stack. Besides the stack the other possible stores are a set of parameters and locals defined in the function, a set of global variables defined in the module and the module memory, which is separated from the code address space.

The DWARF debug info to evaluate the value of variables is defined in terms of these constructs. For example, we can have something like this in DWARF:

0x00005a88:      DW_TAG_variable
                          DW_AT_location	(0x000006f3: 
                             [0x00000840, 0x00000850): DW_OP_WASM_location 0x0 +8, DW_OP_stack_value)
                          DW_AT_name	("xx")
                          DW_AT_type	(0x00002b17 "float")
                          […]

This says that in that address range the value of ‘xx’ can be evaluated as the content of the 8th local. Here DW_OP_WASM_location is a Wasm-specific opcode with two arguments: the first defines the store (0: Local, 1: Global, 2: the operand stack), the second the index in that store. In most cases the value of the variable could be retrieved from the Wasm memory instead.

So, when LLDB wants to evaluate this variable, in DWARFExpression::Evaluate(), it needs to know the current value of the Wasm locals, or to access the memory, and for this it needs to query the Wasm engine.

This is why there are changes to DWARFExpression::Evaluate(), to support the DW_OP_WASM_location case, and this is also why I created a class that derives from ProcessGDBRemote and overrides ReadMemory() in order to query the wasm engine. Also Value::GetValueAsData() needs to be modified when the value is retrieved from Wasm memory.

GDBRemoteCommunicationClient needs to be extended with a few Wasm-specific query packets:

  • qWasmGlobal: query the value of a Wasm global variable
  • qWasmLocal: query the value of a Wasm function argument or local
  • qWasmStackValue: query the value in the Wasm operand stack
  • qWasmMem: read from a Wasm memory
  • qWasmCallStack: retrieve the Wasm call stack.

These are all the changes we need to fully support Wasm debugging.

Why the IWasmProcess interface? I was not sure whether gdb-remote should be the only way to access the engine state. In the future LLDB could also use some other (and less chatty) mechanisms to communicate with a Wasm engine. I did not want to put a dependency on GDBRemote in a class like DWARFExpression or Value, which should not care about these details. Therefore, I thought that the new class WasmProcessGDBRemote could implement the IWasmProcess interface, forwarding requests through the base class ProcessGDBRemote, which then sends the new gdb-remote query packets. But I agree that this makes the code certainly more convoluted and quite ugly.

My initial idea was to keep all the Wasm-related code as much as possible isolated in plugin classes. Now, I guess that the next steps instead would be to refactor the code to eliminate the new classes WasmProcessGDBRemote and UnwindWasm and modify the existing ProcessGDBRemote and ThreadGDBRemote instead. However, I am not sure if this is possible without also touching the base classes Process and Thread. For example, let’s consider function DWARFExpression::Evaluate(). There, when the DWARF opcode is DW_OP_WASM_location, we need to access the Wasm state. We can get to the Process object with frame->CalculateProcess(); can we then assume the process must always be a ProcessGDBRemote if the target machine is llvm::Triple::wasm32, cast Process* to ProcessGDBRemote*, and then use Wasm-specific query functions added to that class? Would this pattern be acceptable, in your opinion?

PS, I am sorry for the late reply… this lockdown is making me a little unproductive… :-(

So a few things here. It doesn't seem like it is necessary to create the WasmProcessGDBRemote and IWasmProcess. It would be fine to extend the current ProcessGDBRemote and ThreadGDBRemote classes. The whole reason seems to be that the variables (globals, locals, etc) are fetched through the GDB server API and that doesn't happen for other users of the protocol where this information is fetched via the debug info. Is this correct? You seem to have debug info and DWARF (since you mentioned a new DWARF expression opcode), so how do variables actually work? Do you use debug info? What info for variables do you need to fetch from the API?

It also seems that you fetch the stack backtrace via the GDB remote protocol as well. This would be easy to add in to the generic GDB remote protocol. This could also be built in at the lldb_private::Process/lldb_private::Thread API level where a process/thread specifies it fetches the variables and/or stack itself instead of letting the unwind engine do its thing. This can be really useful, for instance, for a core or minidump file that is able to store a backtrace, so that when you don't have all the system libraries you can still get a good backtrace from the core file. So the backtrace part should definitely be part of the core LLDB logic where it can ask a process or thread if it provides a backtrace or not, and we add new virtual APIs to the lldb_private::Process/lldb_private::Thread classes to detect and handle this. The ProcessGDBRemote and ThreadGDBRemote would then implement these functions and answer "yes" if the GDB server supports fetching these things.

So if you can elaborate in detail how variables work and how the stack trace works and exactly what needs to go through the GDB server API, we can work out how this should happen in LLDB. From what I understand right now I would:

  • modify lldb_private::Process/lldb_private::Thread to add new virtual (not pure virtual) APIs that answer "false" when asked if the process/thread provides variables and stacks

The above idea is fairly interesting, but I don't see why a new API like that would be necessary to implement it. We already have an abstraction for a producer of stack frames -- the Unwind class. Reusing the existing abstraction (as this patch does) seems like a simpler/cleaner design than adding a new api, and then having users switch on its value.

My initial idea was to keep all the Wasm-related code as much as possible isolated in plugin classes.

While I would definitely like to see that happen, I don't think the current approach achieves that. The "IWasmProcess" is still in the wasm plugin so we still end up with a lot of "core" code depending on the wasm plugin. And if we put IWasmProcess into the "core", then it's not much different than putting the relevant APIs into the "Process" class directly (though it could introduce some grouping which might count for something). If we accept DW_OP_WASM_location as a first-class entity, then having some core interfaces to support it would not seem unreasonable. The thing which makes that blurry is that this is a vendor extension, not an official "first class" thing.

Now, I guess that the next steps instead would be to refactor the code to eliminate the new classes WasmProcessGDBRemote and UnwindWasm and modify existing ProcessGDBRemote and ThreadGDBRemote instead.

It may be interesting to see what that ends up looking like (maybe you could put that in a separate patch to compare), but I don't think that at this point we have chosen the right way to go forward, and we still need to discuss/think about things...

PS, I am sorry for the late reply… this lockdown is making me a little unproductive… :-(

You replied on the next business day. I don't think this is late by any standard.

Thanks for your comments!
I have refactored this code in a separate patch, https://reviews.llvm.org/D78978, removing WasmProcessGDBRemote, moving part of the logic into ProcessGDBRemote but still keeping class UnwindWasm.
Let me know what you think...

I am adding all the pieces to this patch to make the whole picture clearer; I thought adding a piece at a time would simplify reviews, but it probably ended up making things more obscure. I can always split this patch later, and I need to refactor everything anyway.

So, the idea is to use DWARF as debug info for Wasm, as it is already supported by LLVM and Emscripten. For this we introduced some time ago the plugin classes ObjectFileWasm, SymbolVendorWasm and DynamicLoaderWasmDYLD. However, WebAssembly is peculiarly different from native targets. When source code is compiled to Wasm, Clang produces a module that contains Wasm bytecode (a bit like what happens with Java and C#), and the DWARF info refers to this bytecode.
The Wasm module then runs in a Wasm runtime. (It is also possible to AoT-compile Wasm to native, but this is outside the scope of this patch).

Therefore, LLDB cannot debug Wasm by just controlling the inferior process; it needs to talk with the Wasm engine to query the Wasm engine state. For example, for backtraces, only the runtime knows what the current call stack is. Hence the idea of using the gdb-remote protocol: if a Wasm engine has a GDB stub, LLDB can connect to it to start a debugging session and access its state.

Wasm execution is defined in terms of a stack machine. There are no registers (besides the implicit IP) and most Wasm instructions push/pop values into/from a virtual stack. Besides the stack the other possible stores are a set of parameters and locals defined in the function, a set of global variables defined in the module and the module memory, which is separated from the code address space.

The DWARF debug info to evaluate the value of variables is defined in terms of these constructs. For example, we can have something like this in DWARF:

0x00005a88:      DW_TAG_variable
                          DW_AT_location	(0x000006f3: 
                             [0x00000840, 0x00000850): DW_OP_WASM_location 0x0 +8, DW_OP_stack_value)
                          DW_AT_name	("xx")
                          DW_AT_type	(0x00002b17 "float")
                          […]

This says that in that address range the value of ‘xx’ can be evaluated as the content of the 8th local. Here DW_OP_WASM_location is a Wasm-specific opcode with two arguments: the first defines the store (0: Local, 1: Global, 2: the operand stack), the second the index in that store. In most cases the value of the variable could be retrieved from the Wasm memory instead.

So is there memory to be read from the WASM runtime? Couldn't DW_OP_WASM_location 0x0 +8 be turned into an address that can be used to read the variable? It is also unclear what DW_OP_stack_value is used for here. The DWARF expression has no idea how many bytes to read for this value unless each virtual stack location knows how big it is? What happens if you have an array of a million items? That will not fit on the DWARF expression stack and each member would need to be read from memory?

It seems like the DW_OP_WASM_location + args should result in the address of the variable being pushed into the stack and the DW_OP_stack_value should be removed. This would mean at the end of the expression the address of the variable is on the stack and LLDB will just read it using the normal memory read? Am I missing something? Are there multiple memory regions? Are variables not considered to be in memory?

So, when LLDB wants to evaluate this variable, in DWARFExpression::Evaluate(), it needs to know the current value of the Wasm locals, or to access the memory, and for this it needs to query the Wasm engine.

This is why there are changes to DWARFExpression::Evaluate(), to support the DW_OP_WASM_location case, and this is also why I created a class that derives from ProcessGDBRemote and overrides ReadMemory() in order to query the wasm engine. Also Value::GetValueAsData() needs to be modified when the value is retrieved from Wasm memory.

It would be fine to ask the lldb_private::Process class to evaluate any unknown DWARF expression opcodes like DW_OP_WASM_location and return the result.

Why do we need to override read memory? Is there more than one address space? Can't the DWARF expression DW_OP_WASM_location + args turn into an address that normal read memory can access? Or are the virtual stacks separate and not actually in the address space? If the virtual stack slot for locals/globals and stack values always know their sizes and can provide the contents, the DW_OP_WASM_location opcode should end up creating a buffer just like DW_OP_piece does and the value will be contained in there in the DWARF expression and there is no need for the DW_OP_stack_value?

GDBRemoteCommunicationClient needs to be extended with a few Wasm-specific query packets:

  • qWasmGlobal: query the value of a Wasm global variable
  • qWasmLocal: query the value of a Wasm function argument or local
  • qWasmStackValue: query the value in the Wasm operand stack

These three could be boiled down to a "qEvaluateCustomDWARFExpressionOpcode" packet (shorter name please!) and the args like 0x0 and +8 can be sent. The result could provide the bytes for the value?

  • qWasmMem: read from a Wasm memory

How does normal memory reading differ from Wasm memory?

  • qWasmCallStack: retrieve the Wasm call stack.

Seems like this packet doesn't need to be Wasm specific. Are there any other GDB remote packets that fetch stack traces already that we would re-use?

These are all the changes we need to fully support Wasm debugging.

Why the IWasmProcess interface? I was not sure whether gdb-remote should be the only way to access the engine state. In the future LLDB could also use some other (and less chatty) mechanisms to communicate with a Wasm engine. I did not want to put a dependency on GDBRemote in a class like DWARFExpression or Value, which should not care about these details. Therefore, I thought that the new class WasmProcessGDBRemote could implement the IWasmProcess interface, forwarding requests through the base class ProcessGDBRemote, which then sends the new gdb-remote query packets. But I agree that this makes the code certainly more convoluted and quite ugly.

My initial idea was to keep all the Wasm-related code as much as possible isolated in plugin classes. Now, I guess that the next steps instead would be to refactor the code to eliminate the new classes WasmProcessGDBRemote and UnwindWasm and modify the existing ProcessGDBRemote and ThreadGDBRemote instead. However, I am not sure if this is possible without also touching the base classes Process and Thread. For example, let’s consider function DWARFExpression::Evaluate(). There, when the DWARF opcode is DW_OP_WASM_location, we need to access the Wasm state. We can get to the Process object with frame->CalculateProcess(); can we then assume the process must always be a ProcessGDBRemote if the target machine is llvm::Triple::wasm32, cast Process* to ProcessGDBRemote*, and then use Wasm-specific query functions added to that class? Would this pattern be acceptable, in your opinion?

A new virtual function in lldb_private::Process like:

class Process {
  virtual llvm::Error EvaluateCustomDWARFExpressionOpcode(uint16_t opcode,
                                                          uint64_t arg1,
                                                          uint64_t arg2) {
    return llvm::createStringError(std::errc::invalid_argument,
                                   "unhandled DWARF expression opcode");
  }
  // ...
};

could be added, and then the ProcessGDBRemote can pass this along to the GDB server. Anything in DWARFExpression needs to _only_ call virtual functions on lldb_private::Process/Thread/StackFrame and no deps should be added on custom plug-ins.

It would be fine to ask the lldb_private::Process class to evaluate any unknown DWARF expression opcodes like DW_OP_WASM_location and return the result.

While that idea has occurred to me too, I am not convinced it is a good one:

  • it replaces one odd dependency with another one. Why should a Process need to know how to evaluate a DWARF expression? Or even that DWARF exists for that matter? This seems totally unrelated to what other Process functions are doing currently...
  • I am not sure it even completely removes wasm knowledge from e.g. DWARFExpression -- the class would presumably still need to know how to parse this opcode.
  • the interface could get very complicated if we wanted to implement typed stacks present in DWARF5 -- presumably the API would need to return the type of the result, in addition to its value.

So is there memory to be read from the WASM runtime? Couldn't DW_OP_WASM_location 0x0 +8 be turned into an address that can be used to read the variable? It is also unclear what DW_OP_stack_value is used for here. The DWARF expression has no idea how many bytes to read for this value unless each virtual stack location knows how big it is? What happens if you have an array of a million items? That will not fit on the DWARF expression stack and each member would need to be read from memory?

It seems like the DW_OP_WASM_location + args should result in the address of the variable being pushed into the stack and the DW_OP_stack_value should be removed. This would mean at the end of the expression the address of the variable is on the stack and LLDB will just read it using the normal memory read? Am I missing something? Are there multiple memory regions? Are variables not considered to be in memory?

DW_OP_WASM_location 0x0 +8 is not really in memory, or more precisely, its runtime representation is an internal detail of the Wasm runtime.
WebAssembly code has a peculiar structure, see for example https://developer.mozilla.org/en-US/docs/WebAssembly/Understanding_the_text_format for more details.
Ignoring memory for a moment, there are no registers in Wasm and instead Wasm instructions read/write from/to function locals, module globals and stack operands, which can only have one of these types:

  • i32: 32-bit integer
  • i64: 64-bit integer
  • f32: 32-bit floating point
  • f64: 64-bit floating point

There is still ongoing work in LLVM (https://reviews.llvm.org/D77353/new/#change-OJue38RNV2Gz) to define the best representation of these Wasm constructs in DWARF, but currently what is generated by LLVM has this format:

DW_OP_WASM_location wasm-op index

Where:

DW_OP_WASM_location := 0xED
wasm-op := wasm-local | wasm-global | wasm-operand-stack

wasm-local := 0x00 i:uleb128            (The value is located in the currently executing function’s index-th local)
wasm-global := 0x01 i:uleb128           (The value is located in the index-th global)
wasm-operand-stack := 0x02 i:uleb128    (The value is located in the index-th entry on the operand stack)
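A hypothetical decoder for this encoding might look like the following (the struct and function names are illustrative, not LLDB API; LLDB's real parsing happens inside DWARFExpression):

```cpp
#include <cassert>
#include <cstdint>
#include <vector>

struct WasmLocation {
  uint8_t store;  // 0: local, 1: global, 2: operand stack
  uint64_t index; // index within that store
};

// Decode a standard ULEB128 value, advancing the cursor.
static uint64_t ReadULEB128(const uint8_t *&p) {
  uint64_t result = 0;
  unsigned shift = 0;
  uint8_t byte;
  do {
    byte = *p++;
    result |= uint64_t(byte & 0x7f) << shift;
    shift += 7;
  } while (byte & 0x80);
  return result;
}

// Parse "0xED wasm-op i:uleb128" as described in the grammar above.
static bool DecodeWasmLocation(const std::vector<uint8_t> &expr,
                               WasmLocation &loc) {
  const uint8_t *p = expr.data();
  if (expr.size() < 3 || *p++ != 0xED) // DW_OP_WASM_location
    return false;
  loc.store = *p++;
  loc.index = ReadULEB128(p);
  return true;
}
```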

https://yurydelendik.github.io/webassembly-dwarf/ describes the rationale behind the addition of DW_OP_WASM_location to DWARF.

For example a function like:

int add(int a, int b) { return a + b; }

Could be compiled to:

(func $add (param $lhs i32) (param $rhs i32) (result i32)
  local.get $lhs
  local.get $rhs
  i32.add)

and the corresponding DWARF would describe that:

  • the value of a can be retrieved as DW_OP_WASM_location 0 0 (first local in the function)
  • the value of b can be retrieved as DW_OP_WASM_location 0 1 (second local in the function)

Of course DW_OP_WASM_location cannot represent the values of complex types. For a complex type like a C++ array with 1M items:

uint8_t* p = new uint8_t[1000000];

DWARF would describe the location of the pointer p (for example, it could be in a local), and then the debugger would find the DWARF info that describes its type; it would then send a request like qWasmLocal to get the value from the Wasm runtime, and receive the value of p, let’s say 0x8000c000.
From there LLDB might query to read chunks of memory starting from 0x8000c000, if the user asks to explore the content of the array.

Note that not all Wasm code requires the new location description DW_OP_WASM_location. In many cases locations are encoded using preexisting codes. For example when compiling without optimizations, -O0, almost all variables are encoded as a delta from the frame pointer register. But the frame pointer register itself is often defined as a DW_OP_WASM_location:

0x00000112:   DW_TAG_subprogram
                DW_AT_low_pc	(0x0000000000000761)
                DW_AT_high_pc	(0x00000000000007db)
                DW_AT_frame_base	(DW_OP_WASM_location 0x0 +4, DW_OP_stack_value)
                DW_AT_linkage_name	("_Z10quick_sortI4NodeIyE4lessIS1_EEvPT_xT0_")
                DW_AT_name	("quick_sort<Node<unsigned long long>, less<Node<unsigned long long> > >")
                DW_AT_decl_file	("C:\dev\test\emscripten_tests\sort\.\sort.h")
                DW_AT_decl_line	(45)
                DW_AT_external	(true)

0x0000012a:     DW_TAG_formal_parameter
                  DW_AT_location	(DW_OP_fbreg +20)
                  DW_AT_name	("array")
                  DW_AT_type	(0x000003bb "Node<unsigned long long>*")
                  …

This would also work because LLDB would send a qWasmLocal to calculate the value of the frame register.
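Concretely, the resolution described above might go something like this sketch, where FetchLocal is a stand-in for the qWasmLocal round trip and the frame-pointer value is invented for illustration:

```cpp
#include <cassert>
#include <cstdint>

// DW_AT_frame_base is "DW_OP_WASM_location 0x0 +4, DW_OP_stack_value":
// the frame base is the value of local #4.
static uint64_t FetchLocal(uint32_t index) {
  return index == 4 ? 0x9000 : 0; // pretend local #4 holds the frame pointer
}

// Resolve a DW_OP_fbreg location: frame base plus a signed offset,
// e.g. DW_OP_fbreg +20 for the "array" parameter above.
static uint64_t ResolveFbregAddress(int64_t fbreg_offset) {
  uint64_t frame_base = FetchLocal(4); // DW_OP_WASM_location 0x0 +4
  return frame_base + fbreg_offset;    // DW_OP_fbreg offset
}
```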

Why do we need to override read memory? Is there more than one address space? Can't the DWARF expression DW_OP_WASM_location + args turn into an address that normal read memory can access? Or are the virtual stacks separate and not actually in the address space? If the virtual stack slot for locals/globals and stack values always know their sizes and can provide the contents, the DW_OP_WASM_location opcode should end up creating a buffer just like DW_OP_piece does and the value will be contained in there in the DWARF expression and there is no need for the DW_OP_stack_value?

How does normal memory reading differ from Wasm memory?

In WebAssembly the memory address space is separated from the code address space. Each Wasm module has a ‘Code’ section with the Wasm bytecode.
A Wasm module also has one (for the moment only one) Memory, which is a linear, byte-addressable range of bytes of a configured size.
So there are two separate address spaces for code and memory, and the DWARF info refers to both: address ranges are defined as offsets from the start of the Code section in the module, while location expressions imply reading from Wasm Memory instances.

This is why we need qWasmMem. When ProcessGDBRemote::ReadMemory is called during debugging, it sends "m" packets to the Wasm engine, which may be interpreted as reads from the module Code address space. But we also need a different way to express reads from the module Memory space.

For the code address space, the idea is to use a 64-bit virtual address space, where the code of each module is located at module_id << 32.

0x00000000`00000000 +------------------------------------+
                    |                                    |
                    |                                    |
                    |                                    |
0x00000001`00000000 +------------------------------------+
                    |  code module_id 1                  |
                    |                                    |
                    .                                    .
0x00000002`00000000 +------------------------------------+
                    |  code module_id 2                  |
                    .                                    .
0x00000003`00000000 +------------------------------------+
                    ~                                    ~

Classes ObjectFileWasm and DynamicLoaderWasmDYLD already support this, therefore LLDB emits requests to read memory at 64-bit addresses formed this way.

But to read from the memory instances, as said, we need a separate command, qWasmMem. This is the reason why Value::GetValueAsData is modified in this patch: to check whether we are debugging Wasm, in which case we want to use qWasmMem, because when evaluating a value we are reading from the Wasm Memory address space, not from the Code address space.

The GDB-remote query extensions are currently defined in the following way:

// Get a Wasm global value in the Wasm module specified.
// IN : $qWasmGlobal:frame_index;index
// OUT: $xx..xx

// Get a Wasm local value in the stack frame specified.
// IN : $qWasmLocal:frame_index;index
// OUT: $xx..xx

// Get a Wasm local from the operand stack at the index specified.
// IN : qWasmStackValue:frame_index;index
// OUT: $xx..xx

// Read Wasm memory.
// IN : $qWasmMem:frame_index;addr;len
// OUT: $xx..xx

// Get the current call stack.
// IN : $qWasmCallStack
// OUT: $xx..xxyy..yyzz..zz (A sequence of uint64_t values represented as consecutive 8-byte blocks).

All packets contain a frame_index, which the runtime can use to identify the Wasm module the query refers to.
The number of hex chars returned represents the size of the returned value. For qWasmGlobal, qWasmLocal and qWasmStackValue the size can currently only be 4 or 8 bytes, but for qWasmMem it should match the number of bytes requested in the query.
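For illustration, a minimal decoder for the $xx..xx payloads (two hex chars per byte; the byte order is whatever the stub sends) makes that size relationship explicit:

```cpp
#include <cstdint>
#include <string>
#include <vector>

// Hypothetical decoder for a $xx..xx reply body: every byte is encoded as
// two hex chars, so the value size is reply.size() / 2.
inline std::vector<uint8_t> DecodeHexReply(const std::string &reply) {
  std::vector<uint8_t> bytes;
  for (size_t i = 0; i + 1 < reply.size(); i += 2)
    bytes.push_back(
        static_cast<uint8_t>(std::stoul(reply.substr(i, 2), nullptr, 16)));
  return bytes;
}
```

For example, a qWasmLocal reply body of "0000002a" decodes to a 4-byte value.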

These three could be boiled down to a "qEvaluateCustomDWARFExpressionOpcode" packet (shorter name please!) and the args like 0x0 and +8 can be sent. The result could provide the bytes for the value?

It is absolutely true that the first three packets (qWasmGlobal, qWasmLocal, qWasmStackValue) could be condensed into a single packet with an additional argument that describes the type of store.

qWasmCallStack: retrieve the Wasm call stack.

Seems like this packet doesn't need to be Wasm specific. Are there any other GDB remote packets that fetch stack traces already that we would re-use?

For qWasmCallStack, I could not find in the GDBRemote protocol (https://sourceware.org/gdb/current/onlinedocs/gdb/General-Query-Packets.html#General-Query-Packets) an existing command to query a thread call stack.

A new virtual function in lldb_private::Process like:

class Process {
  virtual llvm::Error EvaluateCustomDWARFExpressionOpcode(uint16_t opcode,
                                                          uint64_t arg1,
                                                          uint64_t arg2) {
    return llvm::createStringError(std::errc::invalid_argument,
                                   "unhandled DWARF expression opcode");
  }
};

could be added, and then the ProcessGDBRemote can pass this along to the GDB server. Anything in DWARFExpression needs to _only_ call virtual functions on lldb_private::Process/Thread/StackFrame and no deps should be added on custom plug-ins.

Having EvaluateCustomDWARFExpressionOpcode could work for DWARFExpression::Evaluate, with the drawbacks mentioned by @labath; but it would not help with Value::GetValueAsData(), I am afraid.

It would be fine to ask the lldb_private::Process class to evaluate any unknown DWARF expression opcodes like DW_OP_WASM_location and return the result.

While that idea has occurred to me too, I am not convinced it is a good one:

  • it replaces one odd dependency with another one. Why should a Process need to know how to evaluate a DWARF expression? Or even that DWARF exists for that matter? This seems totally unrelated to what other Process functions are doing currently...

But it is what people do in reality. DW_OP_low_user and DW_OP_high_user are ranges that are made available to people to customize their DWARF opcodes. If you don't handle it, you are hosed and can't show a variable location. And to make things worse, two different compilers could both use the same value in that range. So they made DWARF expressions customizable with no real attempt to make them work across different architectures; that is, unless you standardize an opcode and get it accepted into DWARF. The kind of DWARF location opcode being used here could easily be generalized into a DW_OP_get_stack_variable with a bunch of args, but at some point you have to talk to someone that is in communication with the runtime of the thing you are debugging to get the answer. So I do believe asking the process for this is not out of scope.

  • I am not sure it even completely removes wasm knowledge from e.g. DWARFExpression -- the class would presumably still need to know how to parse this opcode.

It is true and this is another hole in the "people can extend DWARF easily" scenario. We need to know what the opcode arguments are, and that would need to be hard-coded for now. But it wouldn't have to rely on anything except virtual functions on the generic lldb_private::Process/Thread APIs. In this case, as soon as we get an unknown opcode we would need to pass the DataExtractor and the offset into it, so the process could extract the arguments. Not clean, but better than making DWARFExpression depend on process plug-ins IMHO.

  • the interface could get very complicated if we wanted to implement typed stacks present in DWARF5 -- presumably the API would need to return the type of the result, in addition to its value.

DWARF5 just further clarifies what each value on the opcode stack is (file address, load address, the value itself, etc.). Right now DWARF expressions just infer what a value is based on the opcodes. So I don't see a huge problem here, as anything we do will need to work with DWARF5.

Thanks for the explanations of everything WASM related. I now understand much better what you have.

Read Wasm memory.
IN : $qWasmMem:frame_index;addr;len
OUT: $xx..xx

frame index seems weird in a memory read packet. Seems like the module ID should be passed instead of the frame index.

Reading memory could be handled with memory identifiers, or segments. Currently the packets for m and M only have an address, but someone out there must support reading from CODE and DATA address segments in the DSP world. I have an email out to Ted Woodward to see what they do for Qualcomm's hexagon DSPs. I'll let you know what I find. Maybe each WASM module can identify N segments it needs and each module would have its own unique segments. Are module IDs just 1-based indexes?

Could we just always use memory reading and have the address contain more info? Right now you have the top 32 bits for the module ID. Could it be something like:

struct WasmAddress {
  uint64_t module_id:16;
  uint64_t space:4; // 0 == code, 1 == data, 2 == global, 3==local, 4 == stack
  uint64_t frame_id:??;
  uint64_t addr: ??;
}

This would be a bitfield that would all fit into a 64 bit value and could then be easily sent to the GDB server with the standard m and M packets.
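To make the arithmetic concrete, here is one possible packing of that proposal. Since the frame_id and addr widths are left as ?? above, this sketch assumes a 12-bit frame_id and a 32-bit addr purely for illustration (16 + 4 + 12 + 32 = 64):

```cpp
#include <cstdint>

// Assumed field widths (illustrative only): module_id:16, space:4,
// frame_id:12, addr:32.
constexpr uint64_t PackWasmAddress(uint16_t module_id, uint8_t space,
                                   uint16_t frame_id, uint32_t addr) {
  return (uint64_t(module_id) << 48) | (uint64_t(space & 0xf) << 44) |
         (uint64_t(frame_id & 0xfff) << 32) | addr;
}

constexpr uint8_t SpaceOf(uint64_t packed) {
  return uint8_t((packed >> 44) & 0xf);
}
```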

Thanks for the explanations of everything WASM related. I now understand much better what you have.

Read Wasm memory.
IN : $qWasmMem:frame_index;addr;len
OUT: $xx..xx

frame index seems weird in a memory read packet. Seems like the module ID should be passed instead of the frame index.

Reading memory could be handled with memory identifiers, or segments. Currently the packets for m and M only have an address, but someone out there must support reading from CODE and DATA address segments in the DSP world. I have an email out to Ted Woodward to see what they do for Qualcomm's hexagon DSPs. I'll let you know what I find. Maybe each WASM module can identify N segments it needs and each module would have its own unique segments. Are module IDs just 1-based indexes?

True, for qWasmMem (and actually also for qWasmGlobal) it would be sufficient to pass the moduleId, but qWasmLocal and qWasmStackValue really need a frame index, and it is very easy for the runtime to find the module from a frame index; that's why I was passing only frame indices, for uniformity. Easy to change :)

For reading memory, yes, I came up with ad hoc mechanism that works for Wasm, but if I could reuse existing mechanisms already used by other architectures and supported by Wasm, obviously it would be better.

Could we just always use memory reading and have the address contain more info? Right now you have the top 32 bits for the module ID. Could it be something like:

struct WasmAddress {
  uint64_t module_id:16;
  uint64_t space:4; // 0 == code, 1 == data, 2 == global, 3==local, 4 == stack
  uint64_t frame_id:??;
  uint64_t addr: ??;
}

This would be a bitfield that would all fit into a 64 bit value and could then be easily sent to the GDB server with the standard m and M packets.

This is interesting. We could certainly use a few bits to specify the space (0 == code, 1 == data, 2 == global, 3 == local, 4 == stack) and then have either the module_id or the frame_index, according to the space, and just send "m" packets.

struct WasmAddress {
  uint64_t scope:3;
  uint64_t module_id_or_frame_index:29;
  uint64_t address: 32;
}

But then these "m" packets would have to be interpreted according to these rules by a Wasm-aware GDB-remote stub, so I am not sure we would gain much besides avoiding the introduction of four new custom query commands.
In a function like Value::GetValueAsData we would still need Wasm-specific code to generate the memory address in this format, and there it is actually easier to pass the current frame_index, which is readily available, than to calculate the corresponding module_index.

What if the wasm engine actually made some effort to present a more "conventional" view of the wasm "process"? I.e., in addition to the "implied" PC, we could have an "implied" SP (and maybe an FP too). Then it could lay out the call stack and the function arguments in "memory", and point the "SP" at it so that lldb is able to reconstruct the frames & variables using the "normal" algorithm. For the sake of exposition, let's assume the following "encoding" of the 64-bit memory space:

unsigned:16 type; // 0x5555 = stack
unsigned:16 tid; // are there threads in wasm?
unsigned:16 frame; // grows down, 0x0000 = bottommost frame (main), 0x0001 = second from bottom, etc.
unsigned:16 address;

Then the engine could say that the "SP" of thread 0x1234 is 0x5555123400050000 and the "FP" is 0x5555123400048010, and then have the memory contents be:

0x5555123400048020: // third "local" (of frame 4)
0x5555123400048018: // second "local" (of frame 4)
0x5555123400048010: // first "local" (of frame 4)
0x5555123400048008: 0x5555123400038010 // previous FP (frame 3)
0x5555123400048000: ???? // previous PC (frame 3)
0x5555123400040010: // third "argument" (of frame 4)
0x5555123400040008: // second "argument" (of frame 4)
0x5555123400040000: // first "argument" (of frame 4)
0x5555123400038020: // third "local" (of frame 3)
0x5555123400038018: // second "local" (of frame 3)
0x5555123400038010: // first "local" (of frame 3)
0x5555123400038008: 0x5555123400028010 // previous FP (frame 2)
0x5555123400038000: ???? // previous PC (frame 2)
etc.

Then all that would be needed is to translate DW_OP_WASM_location into an appropriate FP+offset combo. Somehow...

I realize that this is basically throwing the problem "over the fence", and asking the other side to deal with things, but I am starting to get sceptical that we will be able to come up with a satisfactory solution within lldb.
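A quick sketch of the 16/16/16/16 encoding proposed above, with hypothetical helper names, reproduces the example "FP" value:

```cpp
#include <cstdint>

// Packs the type/tid/frame/address fields from the layout above into one
// 64-bit synthetic "address".
constexpr uint64_t MakeSyntheticAddress(uint16_t type, uint16_t tid,
                                        uint16_t frame, uint16_t address) {
  return (uint64_t(type) << 48) | (uint64_t(tid) << 32) |
         (uint64_t(frame) << 16) | address;
}
```

MakeSyntheticAddress(0x5555, 0x1234, 4, 0x8010) yields 0x5555123400048010, the "FP" of frame 4 in the example layout.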

While that idea has occurred to me too, I am not convinced it is a good one:

  • it replaces one odd dependency with another one. Why should a Process need to know how to evaluate a DWARF expression? Or even that DWARF exists for that matter? This seems totally unrelated to what other Process functions are doing currently...

But it is what people do in reality. DW_OP_low_user and DW_OP_high_user are ranges that are made available to people to customize their DWARF opcodes. If you don't handle it, you are hosed and can't show a variable location. And to make things worse, two different compilers could both use the same value in that range. So they made DWARF expressions customizable with no real attempt to make them work across different architectures; that is, unless you standardize an opcode and get it accepted into DWARF. The kind of DWARF location opcode being used here could easily be generalized into a DW_OP_get_stack_variable with a bunch of args, but at some point you have to talk to someone that is in communication with the runtime of the thing you are debugging to get the answer. So I do believe asking the process for this is not out of scope.

I think the "at some point" part is very important here. I get how dwarf expressions are meant to be extended, and that doing that is tricky, but I don't think that automatically means that we should delegate that to a different process. There are various ways that could be implemented and the delegation could be performed at different levels. For example the DWARFExpression class could be made into a class hierarchy and we could have a subclass of it for each architecture with funny operations. Then the subclass would have enough knowledge about wasm to properly parse the expression and evaluate it (possibly by making additional queries to someone else) -- this is sort of what the current patch does, without the "subclass" part.

The problem I have with a function like EvaluateCustomDWARFExpressionOpcode is that it is completely unlike anything else that our process class needs to deal with. The Process deals with threads (and how to control them), memory (read/write) and, to a limited degree, with modules. It knows nothing about "stack frames" or "variables" or "dwarf expressions" -- these are concepts built on top of that. This becomes even more true if we start to talk about the gdb-remote protocol instead of the lldb Process abstraction.

  • I am not sure it even completely removes wasm knowledge from e.g. DWARFExpression -- the class would presumably still need to know how to parse this opcode.

It is true and this is another hole in the "people can extend DWARF easily" scenario. We need to know what the opcode arguments are, and that would need to be hard-coded for now. But it wouldn't have to rely on anything except virtual functions on the generic lldb_private::Process/Thread APIs. In this case, as soon as we get an unknown opcode we would need to pass the DataExtractor and the offset into it, so the process could extract the arguments.

Not only that, we might need to pass in the entire DWARF stack, in case the opcode depends on some of the stack arguments.

Not clean, but better than making DWARFExpression depend on process plug-ins IMHO.

The dependence on a process plugin could be dealt with by making GetWasmGlobal/Local/etc a virtual function on the Process class. Also not clean, but it's not clear to me whether it's cleaner than having EvaluateCustomDWARFExpressionOpcode virtual function.

Anyway, just the fact that we can't come up with a "clean" solution doesn't mean that we should accept an "unclean" one. This wouldn't be the first feature that ends up sitting in a fork somewhere because it does not integrate cleanly with llvm (probably the largest example of that is swift-lldb).

And I believe the current problems are just the tip of the iceberg. I can't imagine what hoops we'll need to jump through once we start evaluating expressions...

  • the interface could get very complicated if we wanted to implement typed stacks present in DWARF5 -- presumably the API would need to return the type of the result, in addition to its value.

DWARF5 just further clarifies what each value on the opcode stack is (file address, load address, the value itself, etc.). Right now DWARF expressions just infer what a value is based on the opcodes. So I don't see a huge problem here, as anything we do will need to work with DWARF5.

That doesn't sound right. DWARF5 introduces opcodes like DW_OP_deref_type which takes a _die offset_ as an argument, so that you can specify the type of the dereferenced result value you are accessing. Fortunately that offset must refer to a DW_TAG_base_type, which means the most interesting aspects are byte size and signedness (and byte size is sort of implied by the result), but that still leaves the door open to more languages with more complicated "base" types.

What if the wasm engine actually made some effort to present a more "conventional" view of the wasm "process"? I.e., in addition to the "implied" PC, we could have an "implied" SP (and maybe an FP too). Then it could lay out the call stack and the function arguments in "memory", and point the "SP" at it so that lldb is able to reconstruct the frames & variables using the "normal" algorithm. For the sake of exposition, let's assume the following "encoding" of the 64-bit memory space:

unsigned:16 type; // 0x5555 = stack
unsigned:16 tid; // are there threads in wasm?
unsigned:16 frame; // grows down, 0x0000 = bottommost frame (main), 0x0001 = second from bottom, etc.
unsigned:16 address;

Then the engine could say that the "SP" of thread 0x1234 is 0x5555123400050000 and the "FP" is 0x5555123400048010, and then have the memory contents be:

0x5555123400048020: // third "local" (of frame 4)
0x5555123400048018: // second "local" (of frame 4)
0x5555123400048010: // first "local" (of frame 4)
0x5555123400048008: 0x5555123400038010 // previous FP (frame 3)
0x5555123400048000: ???? // previous PC (frame 3)
0x5555123400040010: // third "argument" (of frame 4)
0x5555123400040008: // second "argument" (of frame 4)
0x5555123400040000: // first "argument" (of frame 4)
0x5555123400038020: // third "local" (of frame 3)
0x5555123400038018: // second "local" (of frame 3)
0x5555123400038010: // first "local" (of frame 3)
0x5555123400038008: 0x5555123400028010 // previous FP (frame 2)
0x5555123400038000: ???? // previous PC (frame 2)
etc.

Then all that would be needed is to translate DW_OP_WASM_location into an appropriate FP+offset combo. Somehow...

I realize that this is basically throwing the problem "over the fence", and asking the other side to deal with things, but I am starting to get sceptical that we will be able to come up with a satisfactory solution within lldb.

When you say

translate DW_OP_WASM_location into an appropriate FP+offset combo.

do you mean that LLVM should generate these FP+offset combos rather than DW_OP_WASM_location or that LLDB should somehow do this translation?
I think the engine can do more to help here, but not a lot more, I am afraid. Yes, it could expose an implied “SP” and “FP”, and that should be sufficient to represent locals and arguments and make stack walking more orthodox. But DW_OP_WASM_location also describes locations in the set of Wasm globals and in the Wasm operand stack, so we would need at least a second, parallel stack to represent the operand stack.

Also, for C++ LLVM emits code to maintain a “shadow stack” in the linear memory of the module, and location expressions like DW_OP_fbreg +N are already used to describe the location of a parameter or a local variable in that shadow stack. The stack frame pointer for that function is described with DW_AT_frame_base, expressed as a DW_OP_WASM_location expression.

In the end walking the stack is not a big problem, its logic can already be encapsulated in a Unwind-derived plugin class. The issues are:

  • in DWARFExpression::Evaluate, where we need to handle DW_OP_WASM_location somehow, and
  • in Value::GetValueAsData, where we need to read from the memory of the current Wasm module, which is a space separated from the address space of code.

I understand that it is not easy to plug in this functionality in a very neat way, and maybe I am missing something else here, but if there are no other places involved maybe we can come up with a clean solution.

And I believe the current problems are just the tip of the iceberg. I can't imagine what hoops we'll need to jump through once we start evaluating expressions...

Expression evaluation works, in my prototype, for simple expressions. For complex expressions I see logged errors like this, in IRInterpreter::CanInterpret():

Unsupported instruction: %call = call float @_ZNK4Vec3IfE3dotERKS0_(%class.Vec3* %7, %class.Vec3* dereferenceable(12) %8)

It’s not clear to me if the problem is caused by the debug symbols or by the IR generated for Wasm… is there any doc where I could learn more about expression evaluation in LLDB? It’s a topic that really interests me, even outside the scope of this Wasm work.

ted added a subscriber: ted. Apr 29 2020, 10:25 AM

Hi Paolo,
@clayborg asked me to look at this, because I've worked with systems that have multiple address spaces. I was thinking, instead of a WebAssembly specific memory read, we should implement an optional generic memory read and write with memory space support.

So instead of qWasmMem:frame_index;addr;len, we have something like qMemSpaceRead:addr;space;len and qMemSpaceWrite:addr;space;len;data.

"space" is stub dependent - it's just a number, and can mean different things for different targets. It could be different modules, like you talk about here, or different physical memory areas like in a Motorola 56K DSP, or different ways to get at the same memory (physical/virtual/cacheable, or directly controlling MESI bits instead of relying on TLB entries) like I implemented in FSLDBG.

If we do this, other targets that want to work with memory spaces can use them instead of having to implement their own extensions.
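A sketch of what building these proposed packets might look like on the client side; the packet names and hex field formatting are assumptions that simply mirror the addr;space;len shape suggested above:

```cpp
#include <cstdint>
#include <cstdio>
#include <string>

// Hypothetical builders for the proposed memory-space packets.
inline std::string BuildMemSpaceRead(uint64_t addr, uint32_t space,
                                     uint64_t len) {
  char buf[64];
  std::snprintf(buf, sizeof buf, "qMemSpaceRead:%llx;%x;%llx",
                (unsigned long long)addr, space, (unsigned long long)len);
  return buf;
}

inline std::string BuildMemSpaceWrite(uint64_t addr, uint32_t space,
                                      const std::string &hex_data) {
  char buf[96];
  // len is derived from the data: two hex chars per byte.
  std::snprintf(buf, sizeof buf, "qMemSpaceWrite:%llx;%x;%llx;%s",
                (unsigned long long)addr, space,
                (unsigned long long)(hex_data.size() / 2), hex_data.c_str());
  return buf;
}
```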

About the expression problem - the IR Interpreter doesn't handle some complex expressions. It needs work, but most targets use JIT. On Hexagon, we only recently enabled JIT in the compiler, and haven't done it in the debugger yet, so we use the IR Interpreter for everything. Unfortunately I haven't had time to dive into it to make it better.

In D78801#2010536, @ted wrote:

Hi Paolo,
@clayborg asked me to look at this, because I've worked with systems that have multiple address spaces. I was thinking, instead of a WebAssembly specific memory read, we should implement an optional generic memory read and write with memory space support.

So instead of qWasmMem:frame_index;addr;len, we have something like qMemSpaceRead:addr;space;len and qMemSpaceWrite:addr;space;len;data.

"space" is stub dependent - it's just a number, and can mean different things for different targets. It could be different modules, like you talk about here, or different physical memory areas like in a Motorola 56K DSP, or different ways to get at the same memory (physical/virtual/cacheable, or directly controlling MESI bits instead of relying on TLB entries) like I implemented in FSLDBG.

If we do this, other targets that want to work with memory spaces can use them instead of having to implement their own extensions.

About the expression problem - the IR Interpreter doesn't handle some complex expressions. It needs work, but most targets use JIT. On Hexagon, we only recently enabled JIT in the compiler, and haven't done it in the debugger yet, so we use the IR Interpreter for everything. Unfortunately I haven't had time to dive into it to make it better.

Hi Ted,

I really like the idea of defining generic commands for reading and writing memory on systems with multiple address spaces! In fact there is no reason why that should be Wasm-specific.

Also, to eliminate qWasmCallStack we could maybe add the call stack (a list of PCs) to the stop packet. The format of a stop packet is:

T AA n1:r1;n2:r2;…

The program received signal number AA (a two-digit hexadecimal number). [...] Each ‘n:r’ pair is interpreted as follows:
If n is a hexadecimal number, it is a register number, and the corresponding r gives that register’s value. [...]
If n is ‘thread’, then r is the thread-id of the stopped thread, as specified in thread-id syntax.
If n is ‘core’, then r is the hexadecimal number of the core on which the stop event was detected.
Otherwise, GDB should ignore this ‘n:r’ pair and go on to the next; this allows us to extend the protocol in the future.

So adding a 'stack:xxxxx...xxx' pair should be possible. If we reuse m, M in place of qWasmLocal, qWasmGlobal and qWasmStackValue and add generic qMemSpaceRead/qMemSpaceWrite for the memory, then there would be no new Wasm-specific command to add to the protocol.
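If the call stack were delivered as such a 'stack:...' hex blob (consecutive 8-byte blocks, as qWasmCallStack already returns), parsing it could look like this sketch; the little-endian byte order within each block is an assumption, since the packet description does not pin it down:

```cpp
#include <cstdint>
#include <string>
#include <vector>

// Hypothetical parser: every 16 hex chars form one PC, assumed
// little-endian within each 8-byte block.
inline std::vector<uint64_t> ParseCallStackHex(const std::string &hex) {
  std::vector<uint64_t> pcs;
  for (size_t i = 0; i + 16 <= hex.size(); i += 16) {
    uint64_t pc = 0;
    for (int b = 0; b < 8; ++b)
      pc |= uint64_t(std::stoul(hex.substr(i + 2 * b, 2), nullptr, 16))
            << (8 * b);
    pcs.push_back(pc);
  }
  return pcs;
}
```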

Sounds like we have an approach to try! I would like to see solutions that are not WASM specific when possible.

One other way of doing the memory read/write with segments: maybe the "m" and "M" packets can be overloaded to include the memory segment identifier if the remote GDB server responds with "OK" to a QEnable packet. LLDB currently enables some features in the remote stub when a QEnable packet responds with "OK", like:

<  23> send packet: $QSetDetachOnError:1#f8
<   6> read packet: $OK#00

What if we added a new "QEnableMemorySegments" packet that then requires all memory reads/writes to include the memory segment in the packet? Without this or if the GDB server responds with "$#00" (unimplemented), the memory read packets look like:

m addr,length
M addr,length:XX…

If the GDB server responds with "OK" to the QEnableMemorySegments, then the packets become:

m addr,segment,length
M addr,segment,length:XX…

I am guessing we can't be the first to do segmented memory read/write via a GDB server, so it would be good to look around to see what others may have done.
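Sketched client-side, the handshake above would simply switch which form of the m packet gets emitted. This is a sketch of the proposal, not an existing LLDB API; QEnableMemorySegments is the hypothetical packet named above:

```cpp
#include <cstdint>
#include <cstdio>
#include <string>

// "m addr,length" normally; "m addr,segment,length" once the stub has
// answered "OK" to the hypothetical QEnableMemorySegments.
inline std::string BuildMemReadPacket(uint64_t addr, uint64_t len,
                                      bool segments_enabled,
                                      uint32_t segment) {
  char buf[64];
  if (segments_enabled)
    std::snprintf(buf, sizeof buf, "m%llx,%x,%llx", (unsigned long long)addr,
                  segment, (unsigned long long)len);
  else
    std::snprintf(buf, sizeof buf, "m%llx,%llx", (unsigned long long)addr,
                  (unsigned long long)len);
  return buf;
}
```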

While that idea has occurred to me too, I am not convinced it is a good one:

  • it replaces one odd dependency with another one. Why should a Process need to know how to evaluate a DWARF expression? Or even that DWARF exists for that matter? This seems totally unrelated to what other Process functions are doing currently...

But it is what people do in reality. DW_OP_low_user and DW_OP_high_user are ranges that are made available to people to customize their DWARF opcodes. If you don't handle it, you are hosed and can't show a variable location. And to make things worse, two different compilers could both use the same value in that range. So they made DWARF expressions customizable with no real attempt to make them work across different architectures; that is, unless you standardize an opcode and get it accepted into DWARF. The kind of DWARF location opcode being used here could easily be generalized into a DW_OP_get_stack_variable with a bunch of args, but at some point you have to talk to someone that is in communication with the runtime of the thing you are debugging to get the answer. So I do believe asking the process for this is not out of scope.

I think the "at some point" part is very important here. I get how dwarf expressions are meant to be extended, and that doing that is tricky, but I don't think that automatically means that we should delegate that to a different process. There are various ways that could be implemented and the delegation could be performed at different levels. For example the DWARFExpression class could be made into a class hierarchy and we could have a subclass of it for each architecture with funny operations. Then the subclass would have enough knowledge about wasm to properly parse the expression and evaluate it (possibly by making additional queries to someone else) -- this is sort of what the current patch does, without the "subclass" part.

The problem I have with a function like EvaluateCustomDWARFExpressionOpcode is that it is completely unlike anything else that our process class needs to deal with. The Process deals with threads (and how to control them), memory (read/write) and, to a limited degree, with modules. It knows nothing about "stack frames" or "variables" or "dwarf expressions" -- these are concepts built on top of that. This becomes even more true if we start to talk about the gdb-remote protocol instead of the lldb Process abstraction.

  • I am not sure it even completely removes wasm knowledge from e.g. DWARFExpression -- the class would presumably still need to know how to parse this opcode.

It is true and this is another hole in the "people can extend DWARF easily" scenario. We need to know what the opcode arguments are, and that would need to be hard-coded for now. But it wouldn't have to rely on anything except virtual functions on the generic lldb_private::Process/Thread APIs. In this case, as soon as we get an unknown opcode we would need to pass the DataExtractor and the offset into it, so the process could extract the arguments.

Not only that, we might need to pass in the entire DWARF stack, in case the opcode depends on some of the stack arguments.

Yes

Not clean, but better than making DWARFExpression depend on process plug-ins IMHO.

The dependence on a process plugin could be dealt with by making GetWasmGlobal/Local/etc a virtual function on the Process class. Also not clean, but it's not clear to me whether it's cleaner than having EvaluateCustomDWARFExpressionOpcode virtual function.

I would really like to see a solution that does not include "Wasm" in any process or thread virtual functions. I am fine with something like:

enum IndexedVariableType {
 Global,
 Local,
 Stack
};

Process::GetIndexedVariable(IndexedVariableType type, size_t index, ...)

I would rather see a DWARF function added to Process than a WASM-specific one.
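A standalone sketch of that shape, where the class and method names are hypothetical and only mirror lldb_private::Process in spirit: the base class knows nothing about Wasm, and a runtime-aware subclass overrides the hook.

```cpp
#include <cstddef>
#include <cstdint>

enum class IndexedVariableType { Global, Local, Stack };

class Process {
public:
  virtual ~Process() = default;
  // Default: the process has no notion of indexed runtime variables.
  virtual bool GetIndexedVariable(IndexedVariableType /*type*/,
                                  size_t /*index*/, uint64_t /*frame*/,
                                  uint64_t &/*value*/) {
    return false;
  }
};

class ProcessWasm : public Process {
public:
  bool GetIndexedVariable(IndexedVariableType type, size_t index,
                          uint64_t frame, uint64_t &value) override {
    // A real implementation would issue qWasmGlobal/qWasmLocal/
    // qWasmStackValue over gdb-remote; this stub fakes one local
    // purely for illustration.
    (void)frame;
    if (type == IndexedVariableType::Local && index == 0) {
      value = 42;
      return true;
    }
    return false;
  }
};
```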

Anyway, just the fact that we can't come up with a "clean" solution doesn't mean that we should accept an "unclean" one. This wouldn't be the first feature that ends up sitting in a fork somewhere because it does not integrate cleanly with llvm (probably the largest example of that is swift-lldb).

I think there is a clean way to do memory reading and writing with segment IDs and also to access variables from the process runtime.

Actually, that brings up an idea: a process runtime plug-in for languages that have a runtime that can be communicated with. Java has a runtime, Wasm has a runtime, most Javascript engines have runtimes. Maybe this wouldn't be too hard to abstract? Then we can add the GetGlobal(index), GetLocal(index), GetStack(index) to this plug-in?

And I believe the current problems are just the tip of the iceberg. I can't imagine what hoops we'll need to jump through once we start evaluating expressions...

  • the interface could get very complicated if we wanted to implement typed stacks present in DWARF5 -- presumably the API would need to return the type of the result, in addition to its value.

DWARF5 just further clarifies what each value on the opcode stack is (file address, load address, the value itself, etc.). Right now DWARF expressions just infer what a value is based on the opcodes. So I don't see a huge problem here, as anything we do will need to work with DWARF5.

That doesn't sound right. DWARF5 introduces opcodes like DW_OP_deref_type which takes a _die offset_ as an argument, so that you can specify the type of the dereferenced result value you are accessing. Fortunately that offset must refer to a DW_TAG_base_type, which means the most interesting aspects are byte size and signedness (and byte size is sort of implied by the result), but that still leaves the door open to more languages with more complicated "base" types.

Sorry for the overly long post. I tried to reply to all messages from last night. I don't claim that all of my comments are consistent with one another -- that's a reflection of the fact that I really don't know what is the right solution...

Also, to eliminate qWasmCallStack we could maybe add the call stack (a list of PCs) to the stop packet. The format of a stop packet is:

T AA n1:r1;n2:r2;…

The program received signal number AA (a two-digit hexadecimal number). [...] Each ‘n:r’ pair is interpreted as follows:
If n is a hexadecimal number, it is a register number, and the corresponding r gives that register’s value. [...]
If n is ‘thread’, then r is the thread-id of the stopped thread, as specified in thread-id syntax.
If n is ‘core’, then r is the hexadecimal number of the core on which the stop event was detected.
Otherwise, GDB should ignore this ‘n:r’ pair and go on to the next; this allows us to extend the protocol in the future.

So adding a 'stack:xxxxx...xxx' pair should be possible. If we reuse m, M in place of qWasmLocal, qWasmGlobal and qWasmStackValue and add generic qMemSpaceRead/qMemSpaceWrite for the memory, then there would be no new Wasm-specific command to add to the protocol.

I don't see a real difference between these two options. The only reason the stack: key is not wasm-specific is because you excluded wasm from the name. If we renamed qWasmCallStack to qCallStack, the effect would be the same. For me the question is more fundamental -- who/how/why should be computing the call stack, not the exact protocol details.

It may also be interesting to note that the stop-reply packet kind of also includes the call stack information via the memory field. The idea there is that the stub will walk the FP chain and send over the relevant bits of memory. However, there is a big difference -- all of this is just a hint to the client and an optimization to reduce packet count. It's still the client who determines the final call stack.

I really like the idea of defining generic commands for reading and writing memory on systems with multiple address spaces! In fact there is no reason why that should be Wasm-specific.

I know a lot of people are interested in adding address spaces to lldb. But I don't think the problem is adding a gdb-remote extension to the m packet -- that is the easy part. The same thing could be done to the "memory read" command in lldb. The interesting stuff begins when you want to go beyond that: If the address space is just an opaque number, then what does that mean? What can lldb do with such address spaces? How do they map to the address spaces in llvm? How do they relate to addresses in object files (no common format has support for them)? How about addresses in debug info? etc.

If dwarf had a notion of address spaces then there'd probably be no need for DW_OP_WASM_location. If the dwarf committee hasn't been able to come up with a unified way to represent address spaces, then I'm not sure we will do better. And we'll still be left with the job of translating dwarf (and other) entries into this other concept.

When you say

translate DW_OP_WASM_location into an appropriate FP+offset combo.

do you mean that LLVM should generate these FP+offset combos rather than DW_OP_WASM_location or that LLDB should somehow do this translation?

I meant the latter, though if we could somehow achieve the former, it would be even better, obviously. :)

I think the engine can do more to help here, but not a lot more, I am afraid. Yes, it could expose an implied “SP” and “FP”, and that should be sufficient to represent locals and arguments and make stack walking more orthodox. But DW_OP_WASM_location also describes locations in the set of wasm globals and in the Wasm operand stack, so we would need at least a second, parallel stack to represent the operand stack.

Also, for C++ LLVM emits code to maintain a “shadow stack” in the linear memory of the module, and location expressions like DW_OP_fbreg +N are already used to describe the location of a parameter or a local variable in that shadow stack. The stack frame pointer for that function is described with DW_AT_frame_base, expressed as a DW_OP_WASM_location expression.

So, the pointer to this "shadow stack" is one of the function arguments, represented as DW_OP_WASM_location 0x0 (local) + constant. And then we get to a variable on the shadow stack by "dereferencing" this value, and adding another constant. Is that right?

That sounds like it should be expressible in standard dwarf. If we set DW_AT_frame_base to DW_OP_regX FP, then "real" arguments could be expressed as DW_OP_fbreg +Y and "shadow" arguments as DW_OP_fbreg +index_of_shadow_argument, DW_OP_deref, DW_OP_const Y, DW_OP_add.
I guess the reason this hasn't been done is because that would give off the impression that all of these things are in memory, in the same address space, but the "real" arguments don't have a memory location at all? But if they don't have a memory location, is it right to represent them as address spaces?
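To make the arithmetic concrete, here is a toy model of that op sequence (DW_OP_fbreg, DW_OP_deref, DW_OP_const, DW_OP_add); the frame base, slot offset, and member offset are made-up numbers, and the map stands in for readable memory:

```cpp
#include <cassert>
#include <cstdint>
#include <map>

// Toy illustration of the DWARF sequence sketched above for a "shadow"
// argument: DW_OP_fbreg +slot, DW_OP_deref, DW_OP_const off, DW_OP_add.
// `memory` is a stand-in for whatever storage holds the frame's locals.
uint64_t ShadowArgAddress(const std::map<uint64_t, uint64_t> &memory,
                          uint64_t frame_base, int64_t shadow_slot_offset,
                          uint64_t member_offset) {
  // DW_OP_fbreg +shadow_slot_offset: address of the slot holding the
  // shadow-stack pointer.
  uint64_t slot_addr = frame_base + shadow_slot_offset;
  // DW_OP_deref: load the shadow-stack pointer from that slot.
  uint64_t shadow_sp = memory.at(slot_addr);
  // DW_OP_const member_offset; DW_OP_add: the variable's offset within the
  // shadow frame.
  return shadow_sp + member_offset;
}
```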

In the end walking the stack is not a big problem, its logic can already be encapsulated in a Unwind-derived plugin class.

That is true, and there are elements of the current solution there that I like a lot. The thing I am resisting is to put all of this stuff in the ProcessGDBRemote class. Instead of trying to generalize it so that it can handle everything (and generalize to the wrong thing), I very much like the idea of introducing a WasmProcess plugin class that handles all wasm stuff. If that class happens to use parts of gdb-remote, then so be it, but it means that it's not a problem for that class to use a dialect of the gdb-remote protocol, which includes as many wasm-specific packets as it needs. Then this class can create its own implementation of the "Unwind" interface, which will use some WasmProcess-specific apis to unwind, but that will also be ok, since both classes will be wasm-specific.

The question is whether something similar can be done for the other two cases. I believe it might be possible for the DWARFExpression. We just need to have a way to separate the creation of the dwarf expression data from the process of evaluating it. Right now, both of these things happen in SymbolFileDWARF -- it creates a DWARFExpression object which both holds the data and knows how to evaluate it. But I don't believe SymbolFileDWARF needs the second part. If we could make it so that something else is responsible for creating the evaluator for dwarf expressions, that something could create a WasmDWARFExpression which would know about DW_OP_WASM_location and WasmProcess, and could evaluate it.

The GetValueAsData problem is trickier, particularly as I'm not even sure the current implementation is correct. Are you sure that you really need the _current_ module there? What happens if I use "target variable" to display a variable from a different module? What if I then dereference that variable? If the dereference should happen in the context of the other module, then I guess the "module" should be a property of the value, not of the current execution context. And it sounds like some address space-y thing would help. But that might require teaching a lot of places in lldb about address spaces, in particular that dereferencing a value in one address space should produce a value in the same address space (at least for "near" pointers or something).
If we should really use the current frame as the context, then I guess we'd need some sort of interface to ask a stack frame to get the value of a "Value".

And I believe the current problems are just the tip of the iceberg. I can't imagine what hoops we'll need to jump through once we start evaluating expressions...

Expression evaluation works, in my prototype, for simple expressions. For complex expressions I see logged errors like this, in IRInterpreter::CanInterpret():

Unsupported instruction: %call = call float @_ZNK4Vec3IfE3dotERKS0_(%class.Vec3* %7, %class.Vec3* dereferenceable(12) %8)

It’s not clear to me if the problem is caused by the debug symbols or by the IR generated for Wasm… is there any doc where I could learn more about expression evaluation in LLDB? It’s a topic that really interests me, even outside the scope of this Wasm work.

Yes, this is because complex expressions require us to inject code into the inferior to run it. I expect that to be quite a tough nut to crack. Even so, I do find it pretty impressive that simple expressions work.

I am not aware of any good documentation for the expression evaluator. Probably the closest thing are some devmtg tutorials.

Not only that, we might need to pass in the entire DWARF stack, in case the opcode depends on some of the stack arguments.

Yes

In principle, I am fine with having a "Process" (or someone else -- we may want to do this for not-yet-started processes a'la "target variable" too) specifying the semantics of a dwarf expression. Though in that case, I think it would be cleaner to just have this entity provide a "DWARFEvaluator" object, which will handle the entirety of the evaluation. The fact that 99% of the different evaluators will be identical can be handled by putting that code in a common base class.

I would really like to see a solution that does not include "Wasm" in any process or thread virtual functions. I am fine with something like:

enum IndexedVariableType {
  Global,
  Local,
  Stack
};

Process::GetIndexedVariable(IndexedVariableType type, size_t index, ...)

The thing I fear with such a solution is that it will be wasm-specific in everything but the name. It's very hard to create a good generic interface with just a few (one?) data points. There are plenty of examples for that in lldb, where an API tries to look very generic, but in reality it only makes sense for darwin/macho/objc...

Actually, that brings up an idea: a process runtime plug-in for languages that have a runtime that can be communicated with. Java has a runtime, Wasm has a runtime, most Javascript engines have runtimes. Maybe this wouldn't be too hard to abstract? Then we can add the GetGlobal(index), GetLocal(index), GetStack(index) to this plug-in?

I think that is a very interesting idea to explore. We already have language runtime plugins, and they do have the ability to refine the type of a variable/value. Maybe they could also somehow help with computing the value of a variable? Then instead of a GetGlobal/Local(index), we could have GetValue(Variable), and they would be the ones responsible for making sense of the dwarf expressions and address spaces?

The thing I am resisting is to put all of this stuff in the ProcessGDBRemote class. Instead of trying to generalize it so that it can handle everything (and generalize to the wrong thing), I very much like the idea of introducing a WasmProcess plugin class that handles all wasm stuff. If that class happens to use parts of gdb-remote, then so be it, but it means that it's not a problem for that class to use a dialect of the gdb-remote protocol, which includes as many wasm-specific packets as it needs. Then this class can create its own implementation of the "Unwind" interface, which will use some WasmProcess-specific apis to unwind, but that will also be ok, since both classes will be wasm-specific.

I think that this would solve a lot of problems. WasmProcess could inherit from GDBRemoteProcess and could send itself Wasm-specific commands like qWasmMem just by calling GetGDBRemote().SendPacketAndWaitForResponse(). Then there would be no changes to ProcessGDBRemote at all.
For walking the call stack I would then keep the current design, with the Unwind-derived class that sends a command to get the call stack from the stub. It’s true that normally it is the client that calculates call stacks, but it does so partly because it cannot really ask the stub for them; here we have a runtime that can provide this information.

If dwarf had a notion of address spaces then there'd probably be no need for DW_OP_WASM_location. If the dwarf committee hasn't been able to come up with a unified way to represent address spaces, then I'm not sure we will do better. And we'll still be left with the job of translating dwarf (and other) entries into this other concept.

Yes, it is possible to come up with some representation of a unified address space for Wasm locals, globals and stack items (and also code and memory). But it's also true that locals, globals and stack items don’t really have a memory location. The reason for the introduction of DW_OP_WASM_location is indeed that there was nothing in DWARF that could represent these entities well. While there are certainly other architectures with multiple memory spaces, the execution model of WebAssembly is quite peculiar in this respect, I think.

The question is whether something similar can be done for the other two cases. I believe it might be possible for the DWARFExpression. We just need to have a way to separate the creation of the dwarf expression data from the process of evaluating it. Right now, both of these things happen in SymbolFileDWARF -- it creates a DWARFExpression object which both holds the data and knows how to evaluate it. But I don't believe SymbolFileDWARF needs the second part. If we could make it so that something else is responsible for creating the evaluator for dwarf expressions, that something could create a WasmDWARFExpression which would know about DW_OP_WASM_location and WasmProcess, and could evaluate it.

Separating the DWARFExpression data from the logic to evaluate it would certainly make sense. I think this must be a general problem: since DWARF provides a way to define custom location expressions, different architectures might define specific DWARF codes that they need to handle, so it would be great if we had a DWARFEvaluator that was also pluggable. I don’t know how complex it would be to refactor DWARFExpression in this way, but I can investigate.

The GetValueAsData problem is trickier, particularly as I'm not even sure the current implementation is correct. Are you sure that you really need the _current_ module there? What happens if I use "target variable" to display a variable from a different module? What if I then dereference that variable? If the dereference should happen in the context of the other module, then I guess the "module" should be a property of the value, not of the current execution context. And it sounds like some address space-y thing would help. But that might require teaching a lot of places in lldb about address spaces, in particular that dereferencing a value in one address space should produce a value in the same address space (at least for "near" pointers or something).
If we should really use the current frame as the context, then I guess we'd need some sort of interface to ask a stack frame to get the value of a "Value".

The fact that the code address space is separated from the memory address space is really what makes things complicated. However, we know for sure that all memory reads made while evaluating DWARF expressions or variables always target the module memory space, never the code space.

I must confess that I had not really tested target variable so far; I did it today and found that it already almost works. What happens is that the location of the global variable is calculated with DWARFExpression::Evaluate (in my tests I only see DW_OP_addr, but maybe there could be other ways?) and there it calls Value::ConvertToLoadAddress() passing the correct module, and this produces an address in the form module_id|offset, which is then used in Value::GetValueAsData(), which currently sends qWasmMem requests.

We don’t really need the frame_index in Value::GetValueAsData; the stub only uses frame_index to calculate the corresponding module_id, and we already pass the current Module* to this function.
So it seems possible to make this work for WebAssembly, but the tricky part is how to make Wasm-specific reads in a class like Value that should not have any knowledge about Wasm.

Actually, that brings up an idea: a process runtime plug-in for languages that have a runtime that can be communicated with. Java has a runtime, Wasm has a runtime, most Javascript engines have runtimes. Maybe this wouldn't be too hard to abstract? Then we can add the GetGlobal(index), GetLocal(index), GetStack(index) to this plug-in?

I think that is a very interesting idea to explore. We already have language runtime plugins, and they do have the ability to refine the type of a variable/value. Maybe they could also somehow help with computing the value of a variable? Then instead of a GetGlobal/Local(index), we could have GetValue(Variable), and they would be the ones responsible for making sense of the dwarf expressions and address spaces?

Summarizing, if we assume that we can create:

  • WasmProcess, derived from GDBRemoteProcess,
  • WasmUnwind, derived from Unwind, and
  • WasmDWARFEvaluator, derived from a new class DWARFEvaluator,

then we would not have to touch any existing GDBRemote- and DWARFExpression code.
At this point the way we query locals/globals/stack and the call stack from the Wasm engine would be just an implementation detail of these Wasm plugin classes (assuming that we can define new gdb-remote custom commands like qWasmLocal, qWasmGlobal, qWasmStackValue, qWasmCallStack). Then we would not really need to abstract this functionality with a generic interface, in my opinion.

What is left to decide would be “just” how to handle memory. In particular how to make Wasm-specific memory requests, targeting a particular Wasm module, from class Value.

paolosev updated this revision to Diff 261780. Edited May 4 2020, 5:11 AM
paolosev retitled this revision from [LLDB] Add class ProcessWasm for WebAssembly debugging to [LLDB] Add class WasmProcess for WebAssembly debugging.
paolosev edited the summary of this revision. (Show Details)

This patch is a work in progress where I am refactoring the code, as suggested, to remove almost all dependencies on Wasm from the LLDB core that were present in the previous patch. In particular:

  • The logic to evaluate a DWARF expression is now separated from the DWARF expression data. There is a new kind of plugin, DWARFEvaluator, that can be used to define platform-specific evaluators. Base class DWARFEvaluator contains all the evaluation code extracted from class DWARFExpression.
  • Plugin class WasmDWARFEvaluator takes care of evaluating WASM-specific codes like DW_OP_WASM_location.
  • Process-plugin class WasmProcess provides functions to access the Wasm engine state through a GDB-remote connection. Now WasmProcess contains all the logic to send Wasm-specific queries to the GDB stub; there are no more changes to existing classes like GDBRemoteClientConnection.
  • Like before, class UnwindWasm handles stack unwinding by requesting the call stack from the Wasm engine.

In this way, the patch does not impose (almost) any knowledge of WebAssembly on the LLDB core, besides the logic to register Wasm-specific plugin classes. The only exception still left is in class Value where in this patch there is still a dependency on WasmProcess that I need to remove.

Please, let me know if this is a step in the right direction, in your opinion.
(I realize that this patch has become annoyingly large, but of course once we understand what could be a good solution I will split it into more manageable pieces).

Interesting approach to DWARF expression evaluation, though it might be simpler to leave the DWARFExpression as it is, and have the plug-in part only handle unknown opcodes. Right now if you want to add just one opcode, you must subclass and make a plug-in instance where 99% of opcodes get forwarded to the original implementation and the one new opcode gets handled by the plug-in. What if the DWARFExpression class was passed to a plug-in that only handles unknown opcodes? Might be fewer code changes and be a bit cleaner. The plug-in would handle only the DW_OP_WASM_location and would still be found/detected if and only if one is encountered?

As for unwinding, I still don't think we need the UnwindWASM class. See my inline comment in "Unwind &Thread::GetUnwinder()" for a bit of reasoning. It is very common for language runtimes to support supplying the stack frames for a thread, so this should be built into the lldb_private::Process/lldb_private::Thread classes. For stack unwinding I would suggest adding functions to lldb_private::Thread:

class Thread {
  /// Check if the runtime supports unwinding call stacks and return true if so.
  ///
  /// If the language runs in a runtime that knows how to unwind the call stack
  /// for a thread, then this function should return true.
  ///
  /// If true is returned, unwinding will use a RuntimeUnwind class that will call
  /// into this class' Thread::GetRuntimeFrame* functions to do the unwinding.
  /// If false is returned, the standard UnwindLLDB will be used where unwinding
  /// will be done using registers, unwind info and other debug info.
  virtual bool HasRuntimeUnwindSupport() {
    return false; // Default to using UnwindLLDB()
  }
  virtual uint32_t GetRuntimeFrameCount() {
    return 0;
  }
  virtual bool GetRuntimeFrameInfoAtIndex(uint32_t frame_idx, lldb::addr_t &cfa,
                                          lldb::addr_t &pc,
                                          bool &behaves_like_zeroth_frame) {
    return false;
  }
  virtual lldb::RegisterContextSP GetRuntimeFrameRegisterContext(uint32_t frame_idx) {
    return lldb::RegisterContextSP();
  }
};

Then either ThreadGDBRemote, or possibly a subclass like ThreadWasm will implement these virtual functions.

For the GetWasmLocal, GetWasmGlobal, GetWasmStackValue, I still think abstracting this into the lldb_private::Process/lldb_private::Thread is the right way to do this. The way you have this right now, there is no way to tell how big the buffer needs to be to fetch any of these local/global/stack values. They all assume 16 bytes is enough.

If we have a runtime that knows about information in a stack frame, function or global, then we should have a way to fetch that from a process/thread. For example:

class Process {
  /// Fetch a global variable from the process runtime if this is supported.
  ///
  /// \param var_id A 64 bit value that needs to encode all of the data needed
  ///               to fetch a global variable and should uniquely identify a
  ///               global variable in a module.
  /// \param buf A buffer that will get the bytes from the global variable. If
  ///            buf is NULL, then the size will be returned so the client
  ///            knows how large the variable is.
  /// \param buffer_size The size of the buffer pointed to by "buf". If buf is
  ///                    NULL, this value can be zero to indicate the function
  ///                    should return the actual size of the global variable.
  ///
  /// \returns The actual size of the variable, which might be larger than the
  ///          "buffer_size" parameter.
  virtual size_t GetRuntimeGlobalValue(lldb::user_id_t var_id, void *buf,
                                       size_t buffer_size) {
    return 0;
  }
};
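A caller would then use the null-buffer call to discover the size before fetching. A sketch of that two-step pattern, with a mock process standing in for the proposed API (the 8-byte payload is made up):

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <cstring>
#include <vector>

// Mock of the size-query protocol proposed above: calling with buf == nullptr
// (or a too-small buffer) still returns the real variable size, so the caller
// can allocate and retry. MockProcess is a stand-in, not a real LLDB class.
class MockProcess {
public:
  size_t GetRuntimeGlobalValue(uint64_t var_id, void *buf, size_t size) {
    static const unsigned char value[8] = {1, 2, 3, 4, 5, 6, 7, 8};
    (void)var_id; // A real implementation would look the variable up.
    if (buf && size >= sizeof(value))
      std::memcpy(buf, value, sizeof(value));
    return sizeof(value); // Always report the actual size.
  }
};

std::vector<unsigned char> FetchGlobal(MockProcess &process, uint64_t var_id) {
  // First call: query the size only.
  size_t size = process.GetRuntimeGlobalValue(var_id, nullptr, 0);
  // Second call: fetch into a correctly sized buffer.
  std::vector<unsigned char> bytes(size);
  process.GetRuntimeGlobalValue(var_id, bytes.data(), bytes.size());
  return bytes;
}
```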

We would need to do something similar on the Thread class for locals and stack values:

class Thread {
  virtual size_t GetRuntimeLocalValue(uint32_t frame_idx, lldb::user_id_t var_id,
                                      void *buf, size_t buffer_size) {
    return 0;
  }
  virtual size_t GetRuntimeStackValue(uint32_t frame_idx, lldb::user_id_t var_id,
                                      void *buf, size_t buffer_size) {
    return 0;
  }
};
lldb/source/Expression/DWARFEvaluator.cpp
328–330

This shouldn't be in here. Remove DW_OP_WASM_location as it is custom.

lldb/source/Plugins/Process/wasm/WasmProcess.cpp
71–76 ↗(On Diff #261780)

This seems flaky to me. How are you ever going to get the frame index right unless we encode it into the "vm_addr" using bitfields like we spoke about before? And if we encode it all into the 64 bit address, then we don't need this special read. Seems like we need to figure out if we are going to encode everything into an uint64_t or not. That will be the easiest way to integrate this into LLDB as all memory reads take a "lldb::addr_t" right now (no memory space information). We would change ReadMemory and WriteMemory to start taking a more complex type like:

AddressSpecifier {
  lldb::addr_t addr;
  uint64_t segment;
};

But that will be a huge change to the LLDB source code that should be done in a separate patch before we do anything here.

185 ↗(On Diff #261780)

This API seems wrong to be on the process. It should be on the ThreadWasm class (if we end up with one). see my main comments for more details.

lldb/source/Plugins/Process/wasm/WasmProcess.h
20 ↗(On Diff #261780)

Should be named ProcessWasm to follow all other process classes (if we decide we need to specialize a Wasm process class and not just abstract it into ProcessGDBRemote).

lldb/source/Target/Thread.cpp
1857–1863 ↗(On Diff #261780)

If the call stack is available through the Process/Thread interface, I think we should be asking the thread for this information. So this code could be:

if (!m_unwinder_up) {
    if (HasRuntimeUnwindSupport())
      m_unwinder_up.reset(new RuntimeUnwind(*this));
    else
      m_unwinder_up.reset(new UnwindLLDB(*this));
  }

RuntimeUnwind would be a class that will fetch the stack frame information through the new virtual functions on the Thread class, but only if the virtual Thread::HasRuntimeUnwindSupport() returns true. As new languages and runtimes are added in the future I don't want to see this function look like:

if (!m_unwinder_up) {
    if (CalculateTarget()->GetArchitecture().GetMachine() ==
        llvm::Triple::wasm32)
      m_unwinder_up.reset(new wasm::UnwindWasm(*this));
    else if (CalculateTarget()->GetArchitecture().GetMachine() ==
        llvm::Triple::Rust)
      m_unwinder_up.reset(new rust::UnwindRust(*this));
    else if (CalculateTarget()->GetArchitecture().GetMachine() ==
        llvm::Triple::Go)
      m_unwinder_up.reset(new go::UnwindGo(*this));
    else
      m_unwinder_up.reset(new UnwindLLDB(*this));
  }

So we should find a way to generalize the stack frames being fetched from the process/thread classes using virtual functions.

I know this is the way GDB was built (many ifdefs and arch specific detecting code everywhere), but we have plug-ins in LLDB that are there to abstract us from this kind of code.

labath added a comment. May 5 2020, 4:41 AM

Interesting approach to DWARF expression evaluation, though it might be simpler to leave the DWARFExpression as it is, and have the plug-in part only handle unknown opcodes. Right now if you want to add just one opcode, you must subclass and make a plug-in instance where 99% of opcodes get forwarded to the original implementation and the one new opcode gets handled by the plug-in. What if the DWARFExpression class was passed to a plug-in that only handles unknown opcodes? Might be fewer code changes and be a bit cleaner. The plug-in would handle only the DW_OP_WASM_location and would still be found/detected if and only if one is encountered?

I think that the main reason this is awkward is that the evaluator is plugged in at the level of a single dwarf operation. That requires passing around a lot of state. If it were plugged in at the level of evaluating the entire expression, then the amount of state to pass would be much smaller. Obviously, the evaluation itself would then need to be factored into multiple functions so that one can override just the evaluation of a single expression, but that's pretty standard software engineering work, and something that we probably should do for code health anyway.

In fact, I believe that if we do this right then the result could be much cleaner than the current situation. Going off of the idea of caching the evaluator in the module, what if we don't "cache" the evaluator itself, but actually a factory (function) for it. The advantage of that (the factory function creating an evaluator instance for a specific expression) is that we could store a lot of the evaluation state in the evaluator object, instead of passing it all around through function arguments (I find the long function argument lists to be one of the main problems of the current DWARFExpression class).

As for unwinding, I still don't think we need the UnwindWASM class. See my inline comment in "Unwind &Thread::GetUnwinder()" for a bit of reasoning. It is very common for language runtimes to support supplying the stack frames for a thread, so this should be built into the lldb_private::Process/lldb_private::Thread classes. For stack unwinding I would suggest adding functions to lldb_private::Thread:

class Thread {
  /// Check if the runtime supports unwinding call stacks and return true if so.
  ///
  /// If the language runs in a runtime that knows how to unwind the call stack
  /// for a thread, then this function should return true.
  ///
  /// If true is returned, unwinding will use a RuntimeUnwind class that will call
  /// into this class' Thread::GetRuntimeFrame* functions to do the unwinding.
  /// If false is returned, the standard UnwindLLDB will be used where unwinding
  /// will be done using registers, unwind info and other debug info.
  virtual bool HasRuntimeUnwindSupport() {
    return false; // Default to using UnwindLLDB()
  }
  virtual uint32_t GetRuntimeFrameCount() {
    return 0;
  }
  virtual bool GetRuntimeFrameInfoAtIndex(uint32_t frame_idx, lldb::addr_t &cfa,
                                          lldb::addr_t &pc,
                                          bool &behaves_like_zeroth_frame) {
    return false;
  }
  virtual lldb::RegisterContextSP GetRuntimeFrameRegisterContext(uint32_t frame_idx) {
    return lldb::RegisterContextSP();
  }
};

Then either ThreadGDBRemote, or possibly a subclass like ThreadWasm will implement these virtual functions.

If we have ThreadWasm then I believe we don't need any of this as ThreadWasm can just override the appropriate function which returns the unwinder object.

For the GetWasmLocal, GetWasmGlobal, GetWasmStackValue, I still think abstracting this into the lldb_private::Process/lldb_private::Thread is the right way to do this. The way you have this right now, there is no way to tell how big the buffer needs to be to fetch any of these local/global/stack values. They all assume 16 bytes is enough.

For me, one of the main advantages of having a wasm-specific class is that it can make wasm-specific assumptions. That said, the APIs in question are definitely very C-like and could definitely be brought into the C++14 world.

lldb/source/Interpreter/CommandInterpreter.cpp
738–755

One way to improve this would be to have lldb detect the kind of plugin that it is talking to and create an appropriate instance based on that. However, that would require more refactorings, so this is good for a start anyway.

lldb/source/Plugins/Process/wasm/WasmProcess.cpp
185 ↗(On Diff #261780)

I guess that's because Paolo was avoiding (or not being able to) create ThreadWasm objects. If we make that possible (per my other comment) then this should be doable as well.

lldb/source/Target/Thread.cpp
1857–1863 ↗(On Diff #261780)

The way I would imagine this happening is that ProcessWasm overrides the appropriate method (if there isn't one we can create it) so that it creates ThreadWasms instead of plain ThreadGdbRemotes. Then ThreadWasm can override GetUnwinder to return an UnwindWasm instance.

labath added a comment. May 5 2020, 4:43 AM

Please, let me know if this is a step in the right direction, in your opinion.

I believe it is. I think that creating ThreadWasm objects would make it even better (and address a lot of Greg's issues). Unfortunately, I still don't know what to do about the whole Value business...

labath added a comment. May 6 2020, 7:44 AM

I forgot I wanted to respond to this part.

The fact that the code address space is separated from the memory address space is really what makes things complicated. However, we know for sure that all memory reads made while evaluating DWARF expressions or variables always target the module memory space, never the code space.

I must confess that I had not really tested target variable so far; I did it today and found that it already almost works. What happens is that the location of the global variable is calculated with DWARFExpression::Evaluate (in my tests I only see DW_OP_addr, but maybe there could be other ways?) and there it calls Value::ConvertToLoadAddress() passing the correct module, and this produces an address in the form module_id|offset, which is then used in Value::GetValueAsData(), which currently sends qWasmMem requests.

Ok, I can see how that would sort of work. But what about dereferencing global variables (target variable *global_ptr)? I have a feeling that will be trickier because that address doesn't go through file->load address conversion, as it's expected to already be a load address. I don't think that we will automatically infer the right "module" for it, and I'm not sure how to make lldb infer the right module/address space there without teaching it a lot more about non-flat address spaces.

paolosev updated this revision to Diff 262824.May 8 2020, 1:12 AM
paolosev marked 8 inline comments as done.
paolosev edited the summary of this revision. (Show Details)
  • For the DWARF expression evaluation, I am here following the suggestion of defining a Factory plugin (DWARFEvaluatorFactory) which is cached by Module. Is there a way to define a simple plugin function, avoiding all the overhead of a plugin class? Most of the state is in DWARFExpression, which is passed to the evaluator, so the amount of state to pass around is much smaller.

Note that WasmDWARFEvaluator::Evaluate() not only needs to handle Wasm-specific opcodes like DW_OP_WASM_location, but might also handle some "standard" opcodes in a Wasm-specific way. For example, static variables are generally encoded with DW_OP_addr, but for WebAssembly the corresponding offset is the offset in the module Data section, which is mapped into an area of the Memory section when the module is loaded. So, memory reads related to DW_OP_addr refer to a different space and should be dealt with differently by the runtime.
This makes target variable *global_ptr work when we have multiple modules, because we can retrieve the module_id from the Module associated with the DWARFExpression.

  • Wasm addresses may refer to different address spaces, and are internally encoded with 64 bits as:
63 62 61          32 31           0
+--+---------------+--------------+
|T |   module_id   |    offset    |
+--+---------------+--------------+

where T is 0:Code, 1:Data, 2:Memory.

  • I introduced a class ThreadWasm, as suggested, which overrides the function that creates the unwinder object. This required a small change to ProcessGDBRemote as well: a new virtual factory method to create a thread.
lldb/source/Plugins/Process/wasm/WasmProcess.cpp
71–76 ↗(On Diff #261780)

I look forward to implementing this more cleanly in the future with AddressSpecifiers in ReadMemory and WriteMemory; for the moment I have implemented it with bitfields as suggested:

enum WasmAddressType {
  Code = 0x00,
  Data = 0x01,
  Memory = 0x02,
  Invalid = 0x03
};

struct wasm_addr_t {
  uint64_t type : 2;
  uint64_t module_id : 30;
  uint64_t offset : 32;
};
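Because the in-memory layout of bit-fields is implementation-defined, the conversion to and from the flat lldb::addr_t value is best done explicitly. A minimal sketch of that packing, with hypothetical helper names ToUint64/FromUint64 (not part of the actual patch):

```cpp
#include <cstdint>

// Sketch: explicit pack/unpack for the wasm_addr_t layout described above
// (bits 63-62 = type, 61-32 = module_id, 31-0 = offset).
enum WasmAddressType : uint64_t { Code = 0, Data = 1, Memory = 2, Invalid = 3 };

struct wasm_addr_t {
  uint64_t type : 2;
  uint64_t module_id : 30;
  uint64_t offset : 32;

  // Pack into the flat uint64_t used as lldb::addr_t.
  uint64_t ToUint64() const {
    return (uint64_t(type) << 62) | (uint64_t(module_id) << 32) |
           uint64_t(offset);
  }

  // Unpack a flat address back into its components.
  static wasm_addr_t FromUint64(uint64_t addr) {
    wasm_addr_t wa;
    wa.type = (addr >> 62) & 0x3;
    wa.module_id = (addr >> 32) & 0x3fffffff;
    wa.offset = addr & 0xffffffff;
    return wa;
  }
};
```

The explicit shifts keep the wire format stable regardless of how the compiler lays out the bit-fields.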

I still need to override ReadMemory here because it is not always possible to generate a full wasm_addr_t without making too many changes. Sometimes, for example in ValueObjectChild::UpdateValue -> ValueObject::GetPointerValue, the existing code generates addresses without any knowledge of module_ids. But this is not a big problem, because all these reads always refer to the Wasm Memory, and the module_id can easily be retrieved from the current execution context. This is always valid because a Wasm module can never read or write the memory of another module in the same process; at most, Memories can be shared, but that is transparent to the debugger.

However, this requires a small change to ReadMemory: I would introduce ExecutionContext *exe_ctx as an optional final parameter, null by default, passed by Value and ValueObject, ignored by all other Process classes, and used by ProcessWasm to deduce the module.
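As a toy model of that idea (all names here are hypothetical stand-ins, not LLDB API): an address that arrives without module bits gets qualified using the execution context, while a fully qualified address passes through unchanged.

```cpp
#include <cstdint>

// Toy stand-in for the execution context passed down to ReadMemory; the real
// lldb_private::ExecutionContext would supply the current frame/module.
struct ExecutionContext {
  uint32_t current_module_id;
};

// If addr is a bare 32-bit offset, qualify it as a Memory-space address of the
// context's module (T == 2 in the encoding discussed above); otherwise return
// it as-is.
uint64_t QualifyAddress(uint64_t addr, const ExecutionContext *exe_ctx) {
  const uint64_t kMemorySpace = uint64_t(2) << 62;
  if (addr >> 32) // already carries type/module bits
    return addr;
  if (exe_ctx)    // bare offset: use the context's module id
    return kMemorySpace | (uint64_t(exe_ctx->current_module_id) << 32) | addr;
  return addr;    // no context available: leave the flat address untouched
}
```

This matches the claim above: reads without module information always target the Wasm Memory of the currently executing module, so the context alone is enough to disambiguate.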

Hello @clayborg , @labath,
Any thoughts on this latest patch? :-)

Hello @clayborg , @labath,
Any thoughts on this latest patch? :-)

Sorry about the delay. I think that in terms of the design we've come as far as we can without making substantial changes to a lot of lldb interfaces. The question on my mind is: is that enough?

Here, I am mainly thinking about the introduction of the ExecutionContext argument to ReadMemory. In a universe with static flat address spaces, the argument looks flat-out wrong. If one thinks of "memory" as something more dynamic, it does not seem so bad, as it can be viewed as the context in which to interpret memory addresses. However, I am having trouble assigning semantics to it beyond saying "it does what WebAssembly needs". Maybe it's just because I live in a flat universe and lack address-space intuition...

Anyway, I think it would be good to get more people's opinions on this. For a start, I nominate Jim. :)

The problem with "looking forward to implementing more cleanly in the future with AddressSpecifiers" is that _everyone_ is looking forward to having address spaces, but no one is actually working on implementing them. And I want to be careful about accumulating technical debt like this upstream, because it is technical debt that makes future implementations hard.

Note that WasmDWARFEvaluator::Evaluate() not only needs to handle Wasm-specific opcodes like DW_OP_WASM_location, but might also need to handle some "standard" opcodes in a Wasm-specific way. For example, static variables are generally encoded with DW_OP_addr, but for WebAssembly the corresponding offset is an offset into the module's Data section, which is mapped into an area of the Memory section when the module is loaded. So memory reads related to DW_OP_addr refer to a different address space and must be dealt with differently by the runtime.
This makes target variable *global_ptr work when we have multiple modules, because we can retrieve the module_id from the Module associated with the DWARFExpression.

I am confused by this line of reasoning. For a global variable, the DW_OP_addr describes the location of the variable itself (the value of &global_ptr). That is what makes it possible to display the value of the pointer (global_ptr). Displaying the value of the pointed-to object (*global_ptr) is a different thing, and AFAIK it completely bypasses any dwarf expressions. Are you saying that the value of the pointer somehow inherits the "address space" of the memory where the pointer itself is located?

jingham added a comment. Edited May 29 2020, 11:47 AM

Hello @clayborg , @labath,
Any thoughts on this latest patch? :-)

Sorry about the delay. I think that in terms of the design we've come as far as we can without making substantial changes to a lot of lldb interfaces. The question on my mind is: is that enough?

Here, I am mainly thinking about the introduction of the ExecutionContext argument to ReadMemory. In a universe with static flat address spaces, the argument looks flat-out wrong. If one thinks of "memory" as something more dynamic, it does not seem so bad, as it can be viewed as the context in which to interpret memory addresses. However, I am having trouble assigning semantics to it beyond saying "it does what WebAssembly needs". Maybe it's just because I live in a flat universe and lack address-space intuition...

Anyway, I think it would be good to get more people's opinions on this. For a start, I nominate Jim. :)

The problem with "looking forward to implementing more cleanly in the future with AddressSpecifiers" is that _everyone_ is looking forward to having address spaces, but no one is actually working on implementing them. And I want to be careful about accumulating technical debt like this upstream, because it is technical debt that makes future implementations hard.

I wasn't following very closely, sorry...

Before getting too far along, I'd just like to toss out something...

I always thought it was a shame that we ever introduced Process::ReadMemory(addr_t addr, ...). We already have a nice Address object. Seems to me the only reason we don't use it everywhere is so we can do math on addresses more simply. For instance, the Address class seems like the proper place to pass along address-space information. After all, when you form the address you are planning to fetch data from, you have to know what address space you were targeting, so you could naturally encode it at that point.

What about replacing ProcessReadMemory(addr_t addr, ...) with ProcessReadMemory(Address addr, ...), or even banning the use of lldb::addr_t everywhere except in the bowels of Process subclasses and as an interchange format for getting addresses as text from users? You might want to store lldb::addr_t's for Symbol values for space concerns, but presumably the SymbolFile would know the relevant address space, so it would fix that up and always hand out Addresses.

This would be a pretty big but mostly formal change. It would seem more natural: Address is our abstraction for addresses in a process, yet instead we have this odd mix of APIs, some taking lldb::addr_t and some taking Address.

Note that WasmDWARFEvaluator::Evaluate() not only needs to handle Wasm-specific opcodes like DW_OP_WASM_location, but might also need to handle some "standard" opcodes in a Wasm-specific way. For example, static variables are generally encoded with DW_OP_addr, but for WebAssembly the corresponding offset is an offset into the module's Data section, which is mapped into an area of the Memory section when the module is loaded. So memory reads related to DW_OP_addr refer to a different address space and must be dealt with differently by the runtime.
This makes target variable *global_ptr work when we have multiple modules, because we can retrieve the module_id from the Module associated with the DWARFExpression.

I am confused by this line of reasoning. For a global variable, the DW_OP_addr describes the location of the variable itself (the value of &global_ptr). That is what makes it possible to display the value of the pointer (global_ptr). Displaying the value of the pointed-to object (*global_ptr) is a different thing, and AFAIK it completely bypasses any dwarf expressions. Are you saying that the value of the pointer somehow inherits the "address space" of the memory where the pointer itself is located?

What about replacing ProcessReadMemory(addr_t addr, ...) with ProcessReadMemory(Address addr, ...), or even banning the use of lldb::addr_t everywhere except in the bowels of Process subclasses and as an interchange format for getting addresses as text from users? You might want to store lldb::addr_t's for Symbol values for space concerns, but presumably the SymbolFile would know the relevant address space, so it would fix that up and always hand out Addresses.

This would be a pretty big but mostly formal change. It would seem more natural: Address is our abstraction for addresses in a process, yet instead we have this odd mix of APIs, some taking lldb::addr_t and some taking Address.

First of all, I really apologize for the huge delay in replying to your last comments. I had to focus on different tasks in the last couple of weeks, and I needed to find some time to understand how Process::ReadMemory (and WriteMemory) could be modified to work with Address objects rather than addr_t.
It is certainly possible, but a problem, IMHO, is that an Address can represent several things:

  • An offset in a section in a file
  • An offset in a loaded section
  • An absolute address in memory

If we modify ReadMemory to be:

Process::ReadMemory(const Address& address, void *buf, size_t size, Status &error)

then we need to convert this Address into a concrete addr_t there, in order to actually read from the MemoryCache (MemoryCache::Read) or from the debuggee (ReadMemoryFromInferior).
But how do we do this conversion? If we are dealing with a file address we should use lldb::addr_t Address::GetFileAddress(); otherwise we should call lldb::addr_t Address::GetLoadAddress(Target *target). I don't know if we have enough information in Process::ReadMemory() to always correctly differentiate between the two cases, and in the latter case we would have to rely on Process::GetTarget() returning a valid target to pass to Address::GetLoadAddress().

The question then is: should Address know if it needs to be interpreted as a file address or load address? Should we have an AddressType field in Address?
We already have this enum that maybe we could reuse:

enum AddressType {
  eAddressTypeInvalid = 0,
  eAddressTypeFile, ///< An address as found in an object or symbol file
  eAddressTypeLoad, ///< An address in the current target inferior process
  eAddressTypeHost  ///< An address in the process that is running this code
};
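To illustrate the idea (a hypothetical sketch with toy types, not a proposed patch): if Address carried such an AddressType, ReadMemory could resolve it to a concrete addr_t without guessing which conversion applies.

```cpp
#include <cstdint>

enum AddressType {
  eAddressTypeInvalid = 0,
  eAddressTypeFile, // an address as found in an object or symbol file
  eAddressTypeLoad, // an address in the target inferior process
  eAddressTypeHost  // an address in the process running this code
};

// Toy stand-in for lldb_private::Address, carrying an explicit AddressType.
struct ToyAddress {
  AddressType type;
  uint64_t value;
};

// Resolve to the concrete address to read from. 'slide' models the load bias
// applied when a file address is mapped into the process (the role played by
// Address::GetLoadAddress(target) vs Address::GetFileAddress()).
bool ResolveForRead(const ToyAddress &a, uint64_t slide, uint64_t &out) {
  switch (a.type) {
  case eAddressTypeFile:
    out = a.value + slide; // file address: rebase by the load bias
    return true;
  case eAddressTypeLoad:
    out = a.value;         // already a load address
    return true;
  default:
    return false;          // host/invalid: not readable via Process
  }
}
```

The open question in the text above remains: whether Process::ReadMemory would always have enough context to supply the equivalent of 'slide' (i.e., a valid target).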

Process::ReadMemory is called from many places in the LLDB code. In some of them, like Value::GetValueAsData, DWARFExpression::Evaluate, and ValueObject::GetPointeeData, it should be fairly simple to construct the Address to read from. (And that should solve the problems for Wasm.)
But in other places it is not so clear what we should do. Some code works with absolute addresses and has no Sections available (see the table below). In these cases, we could rely on the Address constructor that accepts just an absolute addr_t as argument and leaves m_section_wp null:

Address(lldb::addr_t abs_addr);

The existence of this constructor already makes most of LLDB compile even after changing ReadMemory to take an Address argument, in class Process and in Process-derived classes. But then, in many cases, Address would be just a wrapper over an addr_t, which seems to go against the idea of using Address as our single abstraction for all addresses in a process.

This table describes where Process::ReadMemory is called and how the addresses we pass to ReadMemory originate:

| File | Function | Origin of the addr_t |
|---|---|---|
| ABI\AArch64\ABIMacOSX_arm64.cpp | LoadValueFromConsecutiveGPRRegisters | addr_t from reg_ctx |
| ABI\AArch64\ABISysV_arm64.cpp | LoadValueFromConsecutiveGPRRegisters | addr_t from RegisterContext |
| ABI\ARM\ABISysV_arm.cpp | GetReturnValuePassedInMemory | addr_t from RegisterContext |
| ABI\PowerPC\ABISysV_ppc64.cpp | GetStructValueObject | addr_t from Register |
| Commands\CommandObjectMemory.cpp | CommandObjectMemoryFind::ProcessMemoryIterator::operator[] | addr_t from m_base_data_address |
| DynamicLoader\MacOSX-DYLD\DynamicLoaderMacOSXDYLD.cpp | DoInitialImageFetch | addr_t from Process::GetImageInfoAddress |
| Expression\IRExecutionUnit.cpp | IRExecutionUnit::DisassembleFunction | addr_t from JittedFunction::m_remote_addr |
| Expression\IRMemoryMap.cpp | IRMemoryMap::ReadMemory(addr_t process_address) | addr_t passed as argument |
| JITLoader\GDB\JITLoaderGDB.cpp | ReadJITEntry | addr_t passed as argument |
| Language\CPlusPlus\LibCxx.cpp | formatters::LibCxxMapIteratorSyntheticFrontEnd::Update | addr_t from ValueObject::GetValueAsUnsigned() |
| Language\CPlusPlus\LibCxxVector.cpp | formatters::LibcxxVectorBoolSyntheticFrontEnd::GetChildAtIndex | addr_t from m_base_data_address |
| Language\ObjC\CF.cpp | formatters::CFBitVectorSummaryProvider | ValueObject::GetValueAsUnsigned() => Process::ReadPointerFromMemory() => addr_t |
| Language\ObjC\NSArray.cpp (and similar classes) | formatters::XXX::Update | addr_t from ValueObject::GetValueAsUnsigned() |
| LanguageRuntime\ObjC\AppleObjCRuntime\AppleObjCClassDescriptorV2.cpp | ClassDescriptorV2::objc_class_t::Read (and similar functions) | addr_t passed as argument |
| LanguageRuntime\ObjC\AppleObjCRuntime\AppleObjCRuntimeV1.cpp | AppleObjCRuntimeV1::UpdateISAToDescriptorMapIfNeeded | addr_t from GetISAHashTablePointer() or from DataExtractor |
| LanguageRuntime\ObjC\AppleObjCRuntime\AppleObjCRuntimeV2.cpp | AppleObjCRuntimeV2::UpdateISAToDescriptorMapDynamic | addr_t from Process::AllocateMemory |
| LanguageRuntime\ObjC\AppleObjCTrampolineHandler\AppleObjCRuntimeV1.cpp | AppleObjCTrampolineHandler::AppleObjCVTables::VTableRegion::SetUpRegion | addr_t from ValueObject::GetValueAsUnsigned |
| LanguageRuntime\RenderScript\RenderScriptRuntime\RenderScriptRuntime.cpp | GetArgsX86 (and similar) | addr_t from reg_ctx |
| ObjectFile\PECOFF\ObjectFilePECOFF.cpp | ObjectFilePECOFF::ReadImageData(offset) | addr_t from m_image_base, addr_t calculated from COFF header |
| Symbol\CompactUnwindInfo.cpp | CompactUnwindInfo::ScanIndex | addr_t from Section::GetLoadBaseAddress() |
| Symbol\ObjectFile.cpp | ObjectFile::ReadMemory | addr_t passed as argument |
| Symbol\ObjectFile.cpp | ObjectFile::ReadSectionData(Section, offset_t) | addr_t from Section::GetLoadBaseAddress |
| Symbol\Type.cpp | Type::ReadFromMemory(addr_t) | addr_t passed as argument |
| SystemRuntime\MacOSX\SystemRuntimeMacOSX.cpp | SystemRuntimeMacOSX::GetQueueNameFromThreadQAddress | addr_t from Process::ReadPointerFromMemory |
| SystemRuntime\MacOSX\SystemRuntimeMacOSX.cpp | SystemRuntimeMacOSX::GetExtendedBacktraceThread (and similar fns) | addr_t from GetThreadItemInfo |
| Target\RegisterContext.cpp | RegisterContext::ReadRegisterValueFromMemory(addr_t) | addr_t passed as argument |
| Target\Target.cpp | Target::CreateAddressInModuleBreakpoint(addr_t) | addr_t passed as argument |
| Target\ThreadPlanTracer.cpp | ThreadPlanAssemblyTracer::Log | addr_t from reg_ctx->GetPC() |
| TypeSystem\Clang\TypeSystemClang.cpp | TypeSystemClang::DumpSummary | addr_t from DataExtractor |
| Utility\RegisterContextMemory.cpp | RegisterContextMemory::ReadAllRegisterValues | addr_t from reg_data_addr passed to constructor |

A similar analysis could be done for WriteMemory, of course. I might have forgotten something, but the point is that in most places where we call Process::ReadMemory we compute the address as a uint64_t taken from a register, read from a DataExtractor, or passed in from other functions; there is no Section available from which to construct an Address. I am not sure whether this is a problem or not.
In other words... I will need some guidance on how best to make this refactoring :-)


Note that WasmDWARFEvaluator::Evaluate() not only needs to handle Wasm-specific opcodes like DW_OP_WASM_location, but might also need to handle some "standard" opcodes in a Wasm-specific way. For example, static variables are generally encoded with DW_OP_addr, but for WebAssembly the corresponding offset is an offset into the module's Data section, which is mapped into an area of the Memory section when the module is loaded. So memory reads related to DW_OP_addr refer to a different address space and must be dealt with differently by the runtime.
This makes target variable *global_ptr work when we have multiple modules, because we can retrieve the module_id from the Module associated with the DWARFExpression.

I am confused by this line of reasoning. For a global variable, the DW_OP_addr describes the location of the variable itself (the value of &global_ptr). That is what makes it possible to display the value of the pointer (global_ptr). Displaying the value of the pointed-to object (*global_ptr) is a different thing, and AFAIK it completely bypasses any dwarf expressions. Are you saying that the value of the pointer somehow inherits the "address space" of the memory where the pointer itself is located?

I wrote that badly. What I meant was that if I have different Wasm modules, and one of these modules has a static variable, like:

static uint8_t kBuff[3];

the corresponding DWARF data will be something like:

0x00000026:   DW_TAG_variable
                DW_AT_name	("kBuff")
                DW_AT_type	(0x0000003b "uint8_t[3]")
                ...
                DW_AT_location	(DW_OP_addr 0x0)
                DW_AT_linkage_name	("_ZL5kBuff")

For Wasm, this DW_OP_addr location represents an offset into the Data section, and the Data section is mapped into a region of that module's Memory. So we can use this DWARF data to display the value of the variable, but we need to handle the DW_OP_addr opcode in a Wasm-specific way. Then, once we have the memory location, the actual value can be found and displayed in the usual way, by reading memory.
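That rebasing step can be sketched as follows (the helper name and the idea of a single data-section load base are hypothetical; in practice the base would come from the engine via gdb-remote):

```cpp
#include <cstdint>

// For Wasm, DW_OP_addr's operand is an offset into the module's Data section,
// which the engine maps into linear Memory at load time. Before reading, the
// evaluator must rebase the offset by the Data section's load base and tag the
// result as a Memory-space address of the right module (T == 2 in the address
// encoding discussed earlier in this thread).
uint64_t RebaseDataAddress(uint32_t module_id, uint32_t dw_op_addr_offset,
                           uint32_t data_section_load_base) {
  const uint64_t kMemorySpace = uint64_t(2) << 62;
  return kMemorySpace | (uint64_t(module_id) << 32) |
         uint64_t(data_section_load_base + dw_op_addr_offset);
}
```

The resulting address can then be served by a qWasmMem-style read against the owning module's Memory, as described for the other Memory-space accesses above.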