Details

Reviewers

labath
davide
aprantl

Commits

rGd77c2e092663: [Reproducers] Capture and replay interpreter commands.
rLLDB355249: [Reproducers] Capture and replay interpreter commands.
rL355249: [Reproducers] Capture and replay interpreter commands.

Summary

Add a provider to the command interpreter. Essentially this writes all the commands to a file which is used during replay as input to the command interpreter.

Diff Detail

Event Timeline

JDevlieghere created this revision. Feb 22 2019, 5:29 PM

Herald added a subscriber: jdoerfert. Feb 22 2019, 5:29 PM

JDevlieghere added a child revision: D57475: [Reproducers] Add SBReproducer macros. Feb 22 2019, 5:29 PM

aprantl added inline comments. Feb 25 2019, 12:42 PM

lldb/include/lldb/Interpreter/CommandInterpreter.h
31	Doxygen comments?
617	`///`
lldb/source/Interpreter/CommandInterpreter.cpp
121	Any toplevel Doxygen comments that would be useful to add here?
132	At least from the outside it's not quite obvious what this function does. Could you perhaps add a high-level comment?

Add comments

aprantl accepted this revision. Feb 25 2019, 2:15 PM

This revision is now accepted and ready to land. Feb 25 2019, 2:15 PM

Add testcase.
Don't log sourced commands twice.

For some reason I thought that we no longer needed to differentiate between commands being sourced from a file. I remember an earlier discussion with Pavel on this topic where we considered doing this at a lower level, but there we face the same issue, without an easy way to differentiate.

I am sorry that I won't have much time to review this in the next couple of weeks, but I don't think this is a good direction here. I don't see how this will interact with the SB API recorder, specifically with things like SBCommandInterpreter::HandleCommand, and ::HandleCommandsFromFile. The thing I would expect to see is that SB recorder captures the input of those commands (for a somewhat broad interpretation of "capture") during recording, and then substitute this during replay. That way the CommandInterpreter class would not need (almost?) any modifications.

With this approach (shoving all commands into a single stream in the CommandInterpreter) it becomes impossible to replay the API calls above. If you want to proceed with this, then go ahead (it's your feature), but I believe you'll run into some problems down the line.

In D58564#1410213, @labath wrote:

I am sorry that I won't have much time to review this in the next couple of weeks, but I don't think this is a good direction here. I don't see how this will interact with the SB API recorder, specifically with things like SBCommandInterpreter::HandleCommand, and ::HandleCommandsFromFile. The thing I would expect to see is that SB recorder captures the input of those commands (for a somewhat broad interpretation of "capture") during recording, and then substitute this during replay. That way the CommandInterpreter class would not need (almost?) any modifications.

It’s been a while but wasn’t this exactly what you proposed in the other differential? How would you capture commands that are entered interactively (through RunCommandInterpreter)?

Anyway, I don’t believe this is a concern. The provider here only captures what’s entered interactively, hence the flag. Replaying the API call should work exactly as expected. I’ll double check later today.

With this approach (shoving all commands into a single stream in the CommandInterpreter) it becomes impossible to replay the API calls above. If you want to proceed with this, then go ahead (it's your feature), but I believe you'll run into some problems down the line.

In D58564#1410729, @JDevlieghere wrote:

In D58564#1410213, @labath wrote:

I am sorry that I won't have much time to review this in the next couple of weeks, but I don't think this is a good direction here. I don't see how this will interact with the SB API recorder, specifically with things like SBCommandInterpreter::HandleCommand, and ::HandleCommandsFromFile. The thing I would expect to see is that SB recorder captures the input of those commands (for a somewhat broad interpretation of "capture") during recording, and then substitute this during replay. That way the CommandInterpreter class would not need (almost?) any modifications.

It’s been a while but wasn’t this exactly what you proposed in the other differential? How would you capture commands that are entered interactively (through RunCommandInterpreter)?

Anyway, I don’t believe this is a concern. The provider here only captures what’s entered interactively, hence the flag. Replaying the API call should work exactly as expected. I’ll double check later today.

I have to admit I haven't looked at this in detail, but the thing I'm missing here is the connection between commands and API calls. If we take RunCommandInterpreter, for instance, you can see that the lldb driver invokes this function three times. How do you ensure the "right" commands get replayed as a part of the API call?

If you look at the driver more closely, you'll see that each call to RunCommandInterpreter is preceded by a call to SetInputFileHandle. I don't have this idea fully baked, but the way I'd try to approach this is to have each SetInputFileHandle create a new buffer where the commands will be stored. Then as the commands are being processed (in RunCommandInterpreter), they would be added into this buffer. Then, when replaying you would know that you should only replay the commands from the given buffer.

I'd also probably try to capture these commands at a slightly lower level, because I am hoping that this will allow us to get rid of the add_to_reproducer flag. Ideally, this should fall out naturally due to the different source the commands are coming from -- the commands executed through the HandleCommand API would be captured at the SB boundary, and the "interactive" commands would be captured by whoever invokes the command interpreter in interactive mode.

(I have no idea how easy it is to achieve this, but that's how I'd approach this.)
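
A minimal sketch of the per-buffer idea described above, using only the standard library. The class and its method names (CommandLog, StartNewBuffer, Record, NextBuffer) are hypothetical and are not LLDB API: each newly installed input handle opens a fresh buffer, commands are appended to the current buffer, and replay hands back one buffer per interpreter run.

```cpp
#include <cstddef>
#include <string>
#include <utility>
#include <vector>

// Hypothetical helper, not part of LLDB: groups recorded commands by the
// input handle that was active when they were read.
class CommandLog {
public:
  // Called whenever a new input handle is installed.
  void StartNewBuffer() { m_buffers.emplace_back(); }

  // Called for each command read from the current input handle.
  void Record(std::string command) {
    if (m_buffers.empty())
      StartNewBuffer();
    m_buffers.back().push_back(std::move(command));
  }

  // During replay, returns the commands for the next interpreter run, or an
  // empty list once all buffers have been consumed.
  std::vector<std::string> NextBuffer() {
    if (m_next >= m_buffers.size())
      return {};
    return m_buffers[m_next++];
  }

private:
  std::vector<std::vector<std::string>> m_buffers;
  std::size_t m_next = 0;
};
```

With this shape, the first interpreter run can only ever see the commands from the first buffer, which is the property the comment above is asking for.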

Pavel made a good point that with the previous implementation, the first call to RunCommandInterpreter would replay every recorded command. This is indeed incorrect, because it's possible and likely that the state of the debugger has changed between different runs of the command interpreter.

Now every call to RunCommandInterpreter gets its own buffer. During replay, we will change the input file handle before invocation of "RunCommandInterpreter". This works because this function is only called through the SB layer.

I'm not convinced doing the recording at a lower level has any benefits. I investigated this route before and the IOHandler seems basically the same thing as the command interpreter. We would still need to make the distinction between things that should and shouldn't be recorded (e.g. sourcing a file vs every command in the file). This would be a lot harder to do there, because we have less information.

In D58564#1412674, @JDevlieghere wrote:

Pavel made a good point that with the previous implementation, the first call to RunCommandInterpreter would replay every recorded command. This is indeed incorrect, because it's possible and likely that the state of the debugger has changed between different runs of the command interpreter.

Now every call to RunCommandInterpreter gets its own buffer. During replay, we will change the input file handle before invocation of "RunCommandInterpreter". This works because this function is only called through the SB layer.

I'm not convinced doing the recording at a lower level has any benefits. I investigated this route before and the IOHandler seems basically the same thing as the command interpreter. We would still need to make the distinction between things that should and shouldn't be recorded (e.g. sourcing a file vs every command in the file). This would be a lot harder to do there, because we have less information.

In my imagination, this "information" would come down from SBDebugger::SetInputFileHandle, together with the FILE* we actually read the commands from. So, a really crude prototype could be something like:

SBDebugger::SetInputFileHandle(FILE *in) {
  if (recording)
    m_debugger->SetInputFileHandle(in, /*new argument!!!*/ new FileShadowRecorder(...));
  else if (replaying)
    m_debugger->SetInputFileHandle(GetRecordedFile(...), nullptr);
}

The FILE* would trickle down into where you read the commands (this could be the IOHandler, or it could be the CommandInterpreter object), where you would do something like:

auto stuff = read_stuff(in);
if (shadow_recorder)
  shadow_recorder->record_stuff(stuff);

Now when you're sourcing an external file, you just pass in a nullptr for the recorder when you're setting the input handle for the command interpreter.

So, in essence the shadow recorder would kind of serve the same purpose as your add_to_reproducer flag, but IMO this would be better because it would come straight from the source (SetInputFileHandle) and you wouldn't need to rely on comments like "This works because this function is only called through the SB layer". Another benefit would be that this would work out-of-the-box in case you have multiple SBDebugger objects around, each with its own command interpreter (right now this wouldn't work because the StartNewBuffer thingies would step on each other's toes). I don't know when or if you plan to support that, but right now that tells me that this is a better design.
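
As an illustration only, the shadow-recorder pattern sketched in the prototype above can be written as a small self-contained program. The names ShadowRecorder and ReadCommands are hypothetical, and std::istream stands in for the FILE* used in LLDB; the point is that the recorder travels with the input source, and a null recorder turns recording off for that source.

```cpp
#include <istream>
#include <sstream>
#include <string>
#include <vector>

// Hypothetical stand-in for the "shadow recorder" discussed above.
class ShadowRecorder {
public:
  void Record(const std::string &line) { m_lines.push_back(line); }
  const std::vector<std::string> &Lines() const { return m_lines; }

private:
  std::vector<std::string> m_lines;
};

// Reads every line from `in`. If a recorder is attached, each line is also
// copied into it; passing nullptr disables recording for this input.
std::vector<std::string> ReadCommands(std::istream &in,
                                      ShadowRecorder *recorder) {
  std::vector<std::string> commands;
  std::string line;
  while (std::getline(in, line)) {
    if (recorder)
      recorder->Record(line);
    commands.push_back(line);
  }
  return commands;
}
```

Input that came from the outside world would be read with a recorder attached, while a sourced file would be read with nullptr, so the commands inside it are not logged a second time.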

lldb/source/Interpreter/CommandInterpreter.cpp
144	This is incorrect usage of the Twine class. The temporary Twine objects will be destroyed before you get a chance to stringify them.

JDevlieghere updated this revision to Diff 188809. Feb 28 2019, 4:19 PM

Thanks Pavel. I've updated the patch with your suggestion. I agree it's a lot better :-)

I implemented the logic in the Debugger rather than the SBDebugger because I think the latter should be a thin wrapper, but let me know if you had a particular reason for this.

In D58564#1414410, @JDevlieghere wrote:

Thanks Pavel. I've updated the patch with your suggestion. I agree it's a lot better :-)

I implemented the logic in the Debugger rather than the SBDebugger because I think the latter should be a thin wrapper, but let me know if you had a particular reason for this.

Thanks Jonas. This looks even simpler than I anticipated. :)

While I agree that the SB layer should be as thin as possible, I would say this deserves an exception. My reasoning behind that is that this command capture mechanism is kind of an extended arm of the SB recorder. Like, if we had an oracle, and were able to "instantly" capture the contents of the FILE* at the SB layer, we would just do that, and avoid this shadowing dance altogether. Unfortunately, we cannot see into the future, so we have to have this recorder "shadow" follow the input FILE* wherever it goes and capture the input as it is being read. The cleanest way to me seems to be to have that shadow start tracking the value as soon as it crosses the SB boundary.

Or, looking at it from a different angle, if somebody other than the SBDebugger calls Debugger::SetInputFileHandle, then we probably don't want to capture that input because it did not come from the outside world. (I have no idea why anybody would want to do that, but it does not seem completely out of the question.) This FILE* then most likely came from the filesystem, in which case it would be captured by the FileSystem recorder. Or if it did come through the SB API, but through a different function, then it should have been shadowed as soon as it entered that function.

Apart from this I have some inline comments about error handling, but overall, I am very happy with how this is turning out.

lldb/include/lldb/Utility/Reproducer.h
120 ↗	(On Diff #188813)	Remove `operator bool` and the embedded error code. Instead have a static factory function which checks the error and only returns a DataRecorder if the file was created correctly. I'm thinking of something like:
static std::unique_ptr<DataRecorder> /*or Expected<...> ?*/ Create(FileSpec f) {
  std::error_code ec;
  auto rec = make_unique<DataRecorder>(f, ec);
  if (!ec)
    return rec;
  return nullptr; /*or the error code*/
}
lldb/source/Core/Debugger.cpp
933 ↗	(On Diff #188813)	Why not store this as a field? This way you still can have problems if two debugger objects are active concurrently.
lldb/source/Utility/Reproducer.cpp
228–229 ↗	(On Diff #188813)	So what happens if creating the file fails here? We abort when the assert in `Debugger::GetInputRecorder` fails? If we're going to crash, it's probably better to do it here, as that's closer to where the actual failure happened (alternatively, we could log a message and exit slightly more gracefully; or log a message and continue without recording).
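
The factory function suggested for DataRecorder in the Reproducer.h comment above could look roughly like the following. This is a simplified, hypothetical version: it uses std::ofstream and a std::string path instead of the LLVM stream and FileSpec types used in LLDB, and it returns nullptr rather than an Expected<...>.

```cpp
#include <fstream>
#include <memory>
#include <string>
#include <system_error>

// Hypothetical, simplified DataRecorder; the real class lives in
// lldb/Utility/Reproducer.h and is built on LLVM types.
class DataRecorder {
public:
  // Returns a recorder only if the file could be opened; otherwise nullptr.
  static std::unique_ptr<DataRecorder> Create(const std::string &path) {
    std::error_code ec;
    // The constructor is private, so std::make_unique cannot be used here.
    std::unique_ptr<DataRecorder> recorder(new DataRecorder(path, ec));
    if (ec)
      return nullptr;
    return recorder;
  }

  // Appends one line to the recording and flushes it to disk.
  void Record(const std::string &line) {
    m_os << line << '\n';
    m_os.flush();
  }

private:
  // The error code is taken by reference so Create() can see the failure.
  DataRecorder(const std::string &path, std::error_code &ec) : m_os(path) {
    if (!m_os.is_open())
      ec = std::make_error_code(std::errc::io_error);
  }

  std::ofstream m_os;
};
```

Because a failed open never produces an object, callers cannot end up holding a half-initialized recorder, which removes the need for an `operator bool` check.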

Moved the logic into SBDebugger.
Created DataRecorder factory.
Stored DataRecorder as a field in Debugger.

I have a couple more comments, including some things I missed on the previous pass, but I don't want to hold this up any more. Feel free to commit after taking the last batch into consideration.

lldb/include/lldb/Utility/Reproducer.h
116 ↗	(On Diff #188932)	The error_code will need to be a reference argument for this to have any effect.
lldb/source/API/SBDebugger.cpp
63 ↗	(On Diff #188932)	Use a factory function here too?
74 ↗	(On Diff #188932)	It might be nice to log these errors even if we don't plan to do anything about them.
lldb/source/Core/Debugger.cpp
887 ↗	(On Diff #188932)	Sticking with the "shadow" idea, I think it would be better if the input recorder is set as an extra argument to this function (instead of as a separate call), because they should always be changed together. Setting one without the other is almost certainly a mistake.
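
The remark above about the error_code needing to be a reference argument can be shown with a small standard-library example. The two function names are hypothetical and only illustrate the parameter-passing difference.

```cpp
#include <system_error>

// Takes the error code by value: the assignment only changes a local copy,
// so the caller never learns about the failure.
inline void OpenByValue(std::error_code ec) {
  ec = std::make_error_code(std::errc::no_such_file_or_directory);
}

// Takes the error code by reference: the caller observes the failure.
inline void OpenByReference(std::error_code &ec) {
  ec = std::make_error_code(std::errc::no_such_file_or_directory);
}
```

A caller that checks `if (ec)` after OpenByValue will always see success, which is why the check in a factory function has no effect unless the constructor takes the error code by reference.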

Addressed in commit. Thank you Pavel!

Closed by commit rL355249: [Reproducers] Capture and replay interpreter commands. (authored by JDevlieghere). Mar 1 2019, 4:19 PM

This revision was automatically updated to reflect the committed changes.

Herald added a project: Restricted Project. Mar 1 2019, 4:19 PM

Herald added a subscriber: llvm-commits.

Diff 188244

lldb/include/lldb/Interpreter/CommandInterpreter.h

[21 lines not shown]
 #include "lldb/Utility/Log.h"
 #include "lldb/Utility/StringList.h"
 #include "lldb/lldb-forward.h"
 #include "lldb/lldb-private.h"
 #include <mutex>

 namespace lldb_private {

+/// Reproducer provider for the command interpreter. The info struct needs to
+/// be public because replay takes place at the SB API layer.
+class CommandProvider;
+struct CommandProviderInfo {
+  static const char *name;
+  static const char *file;
+};
+
 class CommandInterpreterRunOptions {
 public:
   //------------------------------------------------------------------
   /// Construct a CommandInterpreterRunOptions object. This class is used to
   /// control all the instances where we run multiple commands, e.g.
   /// HandleCommands, HandleCommandsFromFile, RunCommandInterpreter.
   ///
   /// The meanings of the options in this object are:
[563 lines not shown]
   ChildrenTruncatedWarningStatus m_truncation_warning; // Whether we truncated
                                                        // children and whether
                                                        // the user has been told
   uint32_t m_command_source_depth;
   std::vector<uint32_t> m_command_source_flags;
   uint32_t m_num_errors;
   bool m_quit_requested;
   bool m_stopped_for_crash;

+  /// Reproducer provider.
+  CommandProvider *m_provider = nullptr;
+
   // The exit code the user has requested when calling the 'quit' command.
   // No value means the user hasn't set a custom exit code so far.
   llvm::Optional<int> m_quit_exit_code;
   // If the driver is accepts custom exit codes for the 'quit' command.
   bool m_allow_exit_code = false;
 };

 } // namespace lldb_private

 #endif // liblldb_CommandInterpreter_h_

lldb/source/Commands/CommandObjectReproducer.cpp

[31 lines not shown]
     if (!command.empty()) {
       result.AppendErrorWithFormat("'%s' takes no arguments",
                                    m_cmd_name.c_str());
       return false;
     }

     auto &r = repro::Reproducer::Instance();
     if (auto generator = r.GetGenerator()) {
       generator->Keep();
+    } else if (r.GetLoader()) {
+      // Make this operation a NOP in replay mode.
+      result.SetStatus(eReturnStatusSuccessFinishNoResult);
+      return result.Succeeded();
     } else {
       result.AppendErrorWithFormat("Unable to get the reproducer generator");
+      result.SetStatus(eReturnStatusFailed);
       return false;
     }

     result.GetOutputStream()
         << "Reproducer written to '" << r.GetReproducerPath() << "'\n";

     result.SetStatus(eReturnStatusSuccessFinishResult);
     return result.Succeeded();
[46 lines not shown]

lldb/source/Interpreter/CommandInterpreter.cpp

[39 lines not shown]
 #include "Commands/CommandObjectType.h"
 #include "Commands/CommandObjectVersion.h"
 #include "Commands/CommandObjectWatchpoint.h"

 #include "lldb/Core/Debugger.h"
 #include "lldb/Core/PluginManager.h"
 #include "lldb/Core/StreamFile.h"
 #include "lldb/Utility/Log.h"
+#include "lldb/Utility/Reproducer.h"
 #include "lldb/Utility/State.h"
 #include "lldb/Utility/Stream.h"
 #include "lldb/Utility/Timer.h"

 #ifndef LLDB_DISABLE_LIBEDIT
 #include "lldb/Host/Editline.h"
 #endif
 #include "lldb/Host/Host.h"
[13 lines not shown]

 #include "llvm/ADT/STLExtras.h"
 #include "llvm/ADT/SmallString.h"
 #include "llvm/Support/Path.h"
 #include "llvm/Support/PrettyStackTrace.h"

 using namespace lldb;
 using namespace lldb_private;
+using namespace llvm;

 static const char *k_white_space = " \t\v";

 static constexpr bool NoGlobalSetting = true;
 static constexpr uintptr_t DefaultValueTrue = true;
 static constexpr uintptr_t DefaultValueFalse = false;
 static constexpr const char *NoCStrDefault = nullptr;

[26 lines not shown]
 enum {
   ePropertyExpandRegexAliases = 0,
   ePropertyPromptOnQuit = 1,
   ePropertyStopCmdSourceOnError = 2,
   eSpaceReplPrompts = 3,
   eEchoCommands = 4,
   eEchoCommentCommands = 5
 };

+/// Provider for the command interpreter. Every command is logged to file which
+/// is used as input during replay. The latter takes place at the SB API layer
+/// by changing the input file handle.
+class lldb_private::CommandProvider
+    : public repro::Provider<lldb_private::CommandProvider> {
+public:
+  typedef CommandProviderInfo info;
+
+  CommandProvider(const FileSpec &directory) : Provider(directory) {}
+
+  /// Capture a single command.
+  void CaptureCommand(std::string command) {
+    m_commands.push_back(std::move(command));
+  }
+
+  /// Commands are kept in memory and written to a file when the reproducer
+  /// needs to be kept.
+  void Keep() override {
+    FileSpec file =
+        GetRoot().CopyByAppendingPathComponent(CommandProviderInfo::file);
+
+    std::error_code ec;
+    llvm::raw_fd_ostream os(file.GetPath(), ec, llvm::sys::fs::F_Text);
+
+    if (ec)
+      return;
+
+    for (auto &command : m_commands)
+      os << command << '\n';
+  }
+
+  /// Commands are kept in memory and are cleared when the reproducer is
+  /// discarded.
+  void Discard() override { m_commands.clear(); }
+
+  static char ID;
+
+private:
+  std::vector<std::string> m_commands;
+};
+
+char CommandProvider::ID = 0;
+const char *CommandProviderInfo::name = "command-interpreter";
+const char *CommandProviderInfo::file = "command-interpreter.txt";
+
 ConstString &CommandInterpreter::GetStaticBroadcasterClass() {
   static ConstString class_name("lldb.commandInterpreter");
   return class_name;
 }

 CommandInterpreter::CommandInterpreter(Debugger &debugger,
                                        ScriptLanguage script_language,
                                        bool synchronous_execution)
[9 lines not shown; context: ": Broadcaster(debugger.GetBroadcasterManager(),"]
       m_command_source_depth(0), m_num_errors(0), m_quit_requested(false),
       m_stopped_for_crash(false) {
   debugger.SetScriptLanguage(script_language);
   SetEventName(eBroadcastBitThreadShouldExit, "thread-should-exit");
   SetEventName(eBroadcastBitResetPrompt, "reset-prompt");
   SetEventName(eBroadcastBitQuitCommandReceived, "quit");
   CheckInWithManager();
   m_collection_sp->Initialize(g_properties);
+
+  if (repro::Generator *g = repro::Reproducer::Instance().GetGenerator())
+    m_provider = &g->GetOrCreate<CommandProvider>();
 }

 bool CommandInterpreter::GetExpandRegexAliases() const {
   const uint32_t idx = ePropertyExpandRegexAliases;
   return m_collection_sp->GetPropertyAtIndexAsBoolean(
       nullptr, idx, g_properties[idx].default_uint_value != 0);
 }

[1,537 lines not shown; context: "if (empty_command) {"]
     }
   } else if (comment_command) {
     result.SetStatus(eReturnStatusSuccessFinishNoResult);
     return true;
   }

   Status error(PreprocessCommand(command_string));

+  if (m_provider)
+    m_provider->CaptureCommand(original_command_string);
+
   if (error.Fail()) {
     result.AppendError(error.AsCString());
     result.SetStatus(eReturnStatusFailed);
     return false;
   }

   // Phase 1.

[1,474 lines not shown]

This is an archive of the discontinued LLVM Phabricator instance.

[Reproducers] Add command provider
Closed · Public
