We have been working on Dexter for the past few weeks, looking for ways to improve it. One of the ideas we had was to add a tool that can automatically generate Dexter tests; this is the result of that work.
This is a working implementation of a test generation tool. Here's a list of the currently supported features:
- Seamless integration with Dexter: the tool can simply be invoked like any other tool, using python3 dexter.py gen //...
- LLDB debugger support (note: this is currently the only debugger supported)
- Supports generation of DexExpectWatchValue and DexExpectStepKind commands
- Generation of DexExpectStepKind commands can be disabled using --no-expect-steps
- Generation of Dexter commands for each visible function argument, local variable or global variable in the top stack frame of each debugger step
- By default, it generates commands for everything, but you can also restrict the set of commands that'll be generated by using --expect-values-of (ALL | ARG | LOCAL | GLOBAL) // default is ALL
- e.g. if you use --expect-values-of ARG, the tool will only generate DexExpectWatchValue commands for function arguments visible in the top stack frame of each debugger step (a rough sketch of this filtering is shown after this list)
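To make the filtering behaviour concrete, here is a minimal sketch of the idea in Python. None of these names come from the actual implementation; ExpectValuesOf, filter_watches, and the (name, kind) representation of a frame's variables are invented for this example:

```python
from enum import Enum

class ExpectValuesOf(Enum):
    """Mirrors the --expect-values-of choices (hypothetical helper)."""
    ALL = "ALL"
    ARG = "ARG"
    LOCAL = "LOCAL"
    GLOBAL = "GLOBAL"

def filter_watches(frame_variables, mode):
    """Keep only the variables we would emit DexExpectWatchValue commands for.

    frame_variables is assumed to be a list of (name, kind) pairs, where kind
    is 'ARG', 'LOCAL' or 'GLOBAL' -- a simplification of whatever the real
    per-step debugger data looks like.
    """
    if mode is ExpectValuesOf.ALL:
        return list(frame_variables)
    return [(name, kind) for name, kind in frame_variables
            if kind == mode.value]

# e.g. --expect-values-of ARG keeps only function arguments:
print(filter_watches([("x", "LOCAL"), ("val", "ARG")], ExpectValuesOf.ARG))
# -> [('val', 'ARG')]
```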
Here is an example of the output from this tool:
```c++
void set(int &dest, int val) {
  for(int k = 0; k < 5; ++k)
    dest += val;
}

int main() {
  int x = 0;

  set(x, 5);
  return x;
}

//===--- AUTO-GENERATED DEXTER COMMANDS ---===//
//DexExpectWatchValue('x', '0', on_line=9, require_in_order=False)
//DexExpectWatchValue('dest', '0', '5', '10', '15', '20', '25', on_line=2, require_in_order=False)
//DexExpectWatchValue('val', '5', '5', '5', '5', '5', '5', on_line=2, require_in_order=False)
//DexExpectWatchValue('dest', '0', '5', '10', '15', '20', on_line=3, require_in_order=False)
//DexExpectWatchValue('val', '5', '5', '5', '5', '5', on_line=3, require_in_order=False)
//DexExpectWatchValue('k', '0', '1', '2', '3', '4', on_line=3, require_in_order=False)
//DexExpectWatchValue('dest', '25', on_line=4, require_in_order=False)
//DexExpectWatchValue('val', '5', on_line=4, require_in_order=False)
//DexExpectWatchValue('x', '25', on_line=10, require_in_order=False)
//DexExpectWatchValue('x', '25', on_line=11, require_in_order=False)
//DexExpectStepKind('FUNC', 2)
//DexExpectStepKind('VERTICAL_FORWARD', 9)
//DexExpectStepKind('VERTICAL_BACKWARD', 5)
//===--------------------------------------===//
```
Of course, this implementation is still at the "prototype" stage. Some things are not perfect and it still needs some work before being ready for upstream:
- For instance, the tool always generates commands using require_in_order=False, as there is a bug that prevents us from generating the values in the correct order.
- The current implementation derives from TestToolBase, which is probably not the best approach.
- The current implementation also collects the set of visible variables for every stack frame of every step, regardless of whether we are running in test or gen mode. This is not ideal performance-wise (a possible way to avoid that cost is sketched below).
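One possible direction for that last point, shown as a minimal sketch only: skip the expensive variable collection entirely when the tool is not generating commands. The names here (step.frames, visible_variables()) are hypothetical stand-ins, not Dexter's real internals:

```python
def collect_step_watches(step, generating_commands):
    """Gather (name, value) pairs for variables visible in a step's top frame.

    `step`, `frames` and `visible_variables()` are placeholders for whatever
    the real debugger-step data structure looks like; the point is only that
    the walk over the frame is skipped in plain test mode.
    """
    if not generating_commands:
        return []
    top_frame = step.frames[0]
    return [(var.name, var.value) for var in top_frame.visible_variables()]
```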
That said, this implementation is good enough to run some tests, so we ran some experiments using this tool, driven by a bash script; here are some graphs!
(We generated tests using a given configuration, and ran them against GCC/Clang at every optimization level; a rough sketch of the driver script appears after the graphs below.)
- Test generated using GCC -Og:
- Test generated using GCC -O2:
- Test generated using Clang -O0 that only checks the values of function arguments and does not expect debugger steps:
Here is the .tar containing every graph (70 total). Keep in mind that those were generated relatively early in the development of this tool so the actual results could be slightly different (but still within 10% I believe):
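For reference, the experiments were driven by something along these lines, rewritten here as a Python sketch rather than the actual bash script. The --builder/--debugger/--cflags options follow Dexter's usual test interface, but the builder names, the set of optimization levels, and the way the score is read from the output are assumptions:

```python
import subprocess

OPT_LEVELS = ["-O0", "-Og", "-O1", "-O2", "-O3"]

def run_experiment(test_dir, gen_builder="clang", gen_cflags="-O0 -g"):
    # Generate the annotated tests once, under a single reference configuration.
    subprocess.run(["python3", "dexter.py", "gen", "--builder", gen_builder,
                    "--debugger", "lldb", "--cflags", gen_cflags, test_dir],
                   check=True)
    # Then score the same generated tests under every compiler/optimization pair.
    for builder in ("clang", "gcc"):
        for opt in OPT_LEVELS:
            proc = subprocess.run(
                ["python3", "dexter.py", "test", "--builder", builder,
                 "--debugger", "lldb", "--cflags", f"{opt} -g", test_dir],
                capture_output=True, text=True)
            # Assumption: Dexter prints the score on the last line of its output.
            last = (proc.stdout.strip().splitlines() or ["<no output>"])[-1]
            print(f"{builder} {opt}: {last}")
```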
And here's what we found (our conclusions):
- This tool is great at generating very precise tests, but those tests can be too precise, which can negatively affect the score under some configurations (e.g. a test generated using GCC -Og will not get a perfect score under Clang -O0 due to some GCC-isms).
- We found that Dexter isn't good at detecting cases where the debug experience is actually better than what the test expects. The graph above (test generated using GCC -O2) is a perfect example of this.
- In short, tests should always be generated under the compiler configuration that provides the best debug experience (e.g. GCC -Og or Clang -O0), otherwise the results can't be trusted.
- The results of tests generated using this tool under one compiler (e.g. GCC) shouldn't be trusted if run under a different compiler (e.g. Clang).
- This is due to differences in how Clang and GCC generate debug info. For instance, GCC generates an extra step for the closing } of a function while Clang does not. In short, every compiler is different, and tests generated using this tool will inevitably be biased towards the compiler used to generate them.
We'd really like to get some feedback before investing more work into this tool. So the question is: what do you think? Is there upstream interest in this?
I understand this is at the prototype stage, but... 😉
This extra else step here, I believe, is to set up an empty dict that can be filled with commands generated by the test run, for the annotated test file generation (correct me if I'm wrong).
I'm not completely against having an else statement here but I feel there should at least be a comment describing why we'd want differing behaviour.
Also, whilst I'm not a huge fan of the 'big-ball-of-context' approach taken in the past, we could reduce the parameter list to get_debugger_steps by interrogating the context object.