This is an archive of the discontinued LLVM Phabricator instance.


Differential D75877

[lldb/Reproducers] Fix replay for process attach workflows
Closed, Public

Authored by JDevlieghere on Mar 9 2020, 3:02 PM.


Details

Reviewers

labath
jasonmolenda

Commits

rGabd7ab559148: [lldb/Reproducers] Intercept the FindProcesses API
rG2451cbf07bbc: [lldb/Reproducers] Intercept the FindProcesses API

Summary

Support replaying debug sessions that attach to an existing process instead of lldb launching the inferior. Bypass the logic that looks for a process with the given name and use an arbitrary PID. The value of the PID doesn't matter as the gdb remote replay infrastructure intercepts the attach and pretends that we're connected to the original process.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

JDevlieghere created this revision. Mar 9 2020, 3:02 PM

Herald added a project: Restricted Project. Mar 9 2020, 3:02 PM

Herald added a subscriber: teemperor.

A more principled way to make this work would be to intercept (record) the Host::FindProcesses api. That way other functionality pertaining to running processes (e.g. the "platform process list" command) would also work. But if this is all you care about right now, then maybe this is fine...

The part that worries me more is the test. There are a lot of subtleties involved in making attach tests (and attach-by-name tests in particular) work reliably everywhere. I think this should be a dotest test, as there we already have some machinery to do these kinds of things (lldb_enable_attach, wait_for_file_on_target), and python is generally much better at complex control flow (I am very much against subshells and background processes in lit tests). Reproducers make this somewhat complicated because you cannot use the liblldb instance already loaded into the python process. But maybe you could run lldb in a subprocess similar to how the pexpect tests do it?

lldb/test/Shell/Reproducer/Inputs/sleep.c
5 ↗	(On Diff #249226)	For attach to work reliably on linux, you need to ensure the process declares itself willing to be attached to. This is what the `lldb_enable_attach` macro in dotest inferiors does. Then you also need to ensure that the process has executed that statement before you attempt to attach. This is usually done via some pid file synchronization.
lldb/test/Shell/Reproducer/TestAttach.test
7 ↗	(On Diff #249226)	How is this different from a plain `%t/attach.out &` ?
9 ↗	(On Diff #249226)	Though normally determinism is good, in this case I think it is actually better to generate an unpredictable name for the process to avoid having the test be impacted by parallel test suite runs or leftover zombies. Other attach-by-name tests usually embed a pid or a timestamp into the process name.
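
The synchronization described in these inline comments can be sketched in a few lines. This is a minimal illustration, not the dotest machinery: `wait_for_file`, `unique_process_name`, and `spawn_stand_in_inferior` are hypothetical names, and the child is a plain Python process standing in for an inferior that would call `lldb_enable_attach` before writing its token file.

```python
import os
import subprocess
import sys
import time
import uuid


def wait_for_file(path, timeout=10.0, poll_interval=0.05):
    """Poll until `path` exists and is non-empty, then return its contents."""
    deadline = time.monotonic() + timeout
    while time.monotonic() < deadline:
        if os.path.exists(path) and os.path.getsize(path) > 0:
            with open(path) as f:
                return f.read()
        time.sleep(poll_interval)
    raise TimeoutError(f"{path} did not appear within {timeout} seconds")


def unique_process_name(prefix="attach_test"):
    """Build a name unlikely to collide with parallel runs or stale processes."""
    return f"{prefix}_{os.getpid()}_{uuid.uuid4().hex[:8]}"


def spawn_stand_in_inferior(token_path):
    """Start a child that publishes its pid once it is ready, then idles.

    Assumption: a real inferior would declare itself attachable first and
    only then write the token file.
    """
    script = (
        "import os, sys, time\n"
        "tmp = sys.argv[1] + '.tmp'\n"
        "with open(tmp, 'w') as f:\n"
        "    f.write(str(os.getpid()))\n"
        "os.replace(tmp, sys.argv[1])\n"  # publish the finished file atomically
        "time.sleep(30)\n"
    )
    return subprocess.Popen([sys.executable, "-c", script, token_path])
```

The debugger side would attach only after `wait_for_file` returns, which guarantees the inferior has run past its setup code. Writing to a temporary name and renaming avoids reading a half-written token.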

In D75877#1913959, @labath wrote:

A more principled way to make this work would be to intercept (record) the Host::FindProcesses api. That way other functionality pertaining to running processes (e.g. the "platform process list" command) would also work. But if this is all you care about right now, then maybe this is fine...

We could totally add a provider for that. I didn't because it seemed like overkill but if you're on board I also prefer that over a random PID.

The part that worries me more is the test. There are a lot of subtleties involved in making attach tests (and attach-by-name tests in particular) work reliably everywhere. I think this should be a dotest test, as there we already have some machinery to do these kinds of things (lldb_enable_attach, wait_for_file_on_target), and python is generally much better at complex control flow (I am very much against subshells and background processes in lit tests). Reproducers make this somewhat complicated because you cannot use the liblldb instance already loaded into the python process. But maybe you could run lldb in a subprocess similar to how the pexpect tests do it?

The problem is the SBDebugger::Initialize() that's called from the SWIG bindings, as soon as you import lldb it's already too late for the reproducers. I'm working on being able to capture/replay the test suite and currently I have the following hack:

if 'DOTEST_CAPTURE_PATH' in os.environ:
   SBReproducer.Capture(os.environ['DOTEST_CAPTURE_PATH'])
SBDebugger.Initialize()

If you're fine with having that in python.swig unconditionally we could make a dotest-test work.

lldb/test/Shell/Reproducer/TestAttach.test
7 ↗	(On Diff #249226)	Should that work? That's the first thing I tried and lit complained about invalid syntax.

In D75877#1914755, @JDevlieghere wrote:

In D75877#1913959, @labath wrote:

A more principled way to make this work would be to intercept (record) the Host::FindProcesses api. That way other functionality pertaining to running processes (e.g. the "platform process list" command) would also work. But if this is all you care about right now, then maybe this is fine...

We could totally add a provider for that. I didn't because it seemed like overkill but if you're on board I also prefer that over a random PID.

In general, I am in favor of doing the capture at the lowest level possible. For this particular feature/bug, it is overkill, but OTOH, this will also make it possible to support other things without adding hacks into random pieces of code.

The part that worries me more is the test. There are a lot of subtleties involved in making attach tests (and attach-by-name tests in particular) work reliably everywhere. I think this should be a dotest test, as there we already have some machinery to do these kinds of things (lldb_enable_attach, wait_for_file_on_target), and python is generally much better at complex control flow (I am very much against subshells and background processes in lit tests). Reproducers make this somewhat complicated because you cannot use the liblldb instance already loaded into the python process. But maybe you could run lldb in a subprocess similar to how the pexpect tests do it?

The problem is the SBDebugger::Initialize() that's called from the SWIG bindings, as soon as you import lldb it's already too late for the reproducers. I'm working on being able to capture/replay the test suite and currently I have the following hack:
if 'DOTEST_CAPTURE_PATH' in os.environ:
   SBReproducer.Capture(os.environ['DOTEST_CAPTURE_PATH'])
SBDebugger.Initialize()
If you're fine with having that in python.swig unconditionally we could make a dotest-test work.

I'm not sure how that would help because for the test you need to run lldb both in capture and replay mode, and I don't think you can currently do that within a single process. It would be cool if that was possible, but even then we'd have the impedance mismatch because we'd need to run SBDebugger.Initialize inside a specific test method, whereas normally it gets run much earlier.

That's why I was talking about subprocesses in the previous patch. The test would only be responsible for building the inferior and driving the whole thing, while capture/replay would happen inside separate processes:

self.spawnSubprocess(randomized_inferior_name, [token_path])
lldbutil.wait_for_file_on_target(token_path)
self.spawnSubprocess(lldbtest_config.lldbExec, ['--capture', reproducer, '-n', randomized_inferior_name, ...])
...
self.spawnSubprocess(lldbtest_config.lldbExec, ['--replay', reproducer])
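
The flow outlined above, with capture and replay each running in its own driver process, might look roughly like the following. The argument lists are copied from the sketch in the comment and have not been checked against the driver's actual options. `capture_then_replay` and its injectable `run` parameter are hypothetical helpers, added so the sequencing can be exercised without an lldb binary.

```python
import subprocess


def capture_command(lldb_exec, reproducer_dir, process_name):
    """Argument vector for a driver run that captures an attach-by-name session."""
    # Assumption: flag spelling follows the review comment, not the driver's help.
    return [lldb_exec, "--capture", reproducer_dir, "-n", process_name]


def replay_command(lldb_exec, reproducer_dir):
    """Argument vector for a driver run that replays a captured session."""
    return [lldb_exec, "--replay", reproducer_dir]


def capture_then_replay(lldb_exec, reproducer_dir, process_name,
                        run=subprocess.run):
    """Run capture and replay as two separate processes, in that order.

    Returns the standard output of both runs so a test can compare them.
    """
    outputs = []
    for cmd in (capture_command(lldb_exec, reproducer_dir, process_name),
                replay_command(lldb_exec, reproducer_dir)):
        completed = run(cmd, capture_output=True, text=True, check=True)
        outputs.append(completed.stdout)
    return tuple(outputs)
```

Keeping both runs outside the Python test process sidesteps the problem that the liblldb instance loaded by the test has already been initialized.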

lldb/test/Shell/Reproducer/TestAttach.test
7 ↗	(On Diff #249226)	I've seen tests do that (`RUN: setsid %run %t/LFSIGUSR -merge=1 -merge_control_file=%t/MCF %t/C1 %t/C2 2>%t/log & export PID=$!` in `./compiler-rt/test/fuzzer/merge-sigusr.test`), but as I said, I don't think that is a good idea, so I don't really want to encourage it...

In D75877#1914900, @labath wrote:
In D75877#1914755, @JDevlieghere wrote:

In D75877#1913959, @labath wrote:

A more principled way to make this work would be to intercept (record) the Host::FindProcesses api. That way other functionality pertaining to running processes (e.g. the "platform process list" command) would also work. But if this is all you care about right now, then maybe this is fine...

We could totally add a provider for that. I didn't because it seemed like overkill but if you're on board I also prefer that over a random PID.

In general, I am in favor of doing the capture at the lowest level possible. For this particular feature/bug, it is overkill, but OTOH, this will also make it possible to support other things without adding hacks into random pieces of code.
The part that worries me more is the test. There are a lot of subtleties involved in making attach tests (and attach-by-name tests in particular) work reliably everywhere. I think this should be a dotest test, as there we already have some machinery to do these kinds of things (lldb_enable_attach, wait_for_file_on_target), and python is generally much better at complex control flow (I am very much against subshells and background processes in lit tests). Reproducers make this somewhat complicated because you cannot use the liblldb instance already loaded into the python process. But maybe you could run lldb in a subprocess similar to how the pexpect tests do it?

The problem is the SBDebugger::Initialize() that's called from the SWIG bindings, as soon as you import lldb it's already too late for the reproducers. I'm working on being able to capture/replay the test suite and currently I have the following hack:
if 'DOTEST_CAPTURE_PATH' in os.environ:
   SBReproducer.Capture(os.environ['DOTEST_CAPTURE_PATH'])
SBDebugger.Initialize()
If you're fine with having that in python.swig unconditionally we could make a dotest-test work.
I'm not sure how that would help because for the test you need to run lldb both in capture and replay mode, and I don't think you can currently do that within a single process. It would be cool if that was possible, but even then we'd have the impedance mismatch because we'd need to run SBDebugger.Initialize inside a specific test method, whereas normally it gets run much earlier.

That's why I was talking about subprocesses in the previous patch. The test would only be responsible for building the inferior and driving the whole thing, while capture/replay would happen inside separate processes:

Ah, I misunderstood subprocess as another Python process, yeah launching the driver should work. Thanks for the clarification.

Add ProcessInfo provider.
Rewrite test as a dotest-test.

A bunch of comments but nothing really major. Maybe it would be nice to put the code for yamlification of ProcessInfo into a separate patch?

lldb/source/Commands/CommandObjectReproducer.cpp
542–555	Maybe some kind of a utility function to convert a file to an object? `template<typename T> Expected<T> readAsYaml(StringRef filename)` ?
lldb/source/Host/macosx/objcxx/Host.mm
595	This means that every implementation of FindProcesses will need to introduce this boilerplate. We should put this into common code somehow. One way to do that would be to rename all the platform-specific implementations to something like DoFindProcesses, and then implement FindProcesses in `source/Host/common/Host.cpp` to handle the delegation & reproducer logic.
lldb/source/Utility/FileSpec.cpp
543 ↗	(On Diff #249555)	There's more to FileSpecs than just the path -- they also hold the path syntax and the "case-sensitive" bit. Kind of not needed for your current goal, but maybe we should add those too while we're here?
lldb/source/Utility/ProcessInfo.cpp
400–403	You don't actually have to provide these functions if they are not going to do anything.
417–430	random thought: Would any of this be simpler if this wasn't a "multi" provider but rather stored all of the responses as a sequence in a single file?
436	what's the type of this?
lldb/test/API/functionalities/reproducers/attach/TestReproducerAttach.py
31	You still need to do the `wait_for_file_on_target` dance here to ensure that `lldb_enable_attach` is executed before we actually attach. One example of that is in `test/API/python_api/hello_world/main.c`.
lldb/test/API/functionalities/reproducers/attach/main.cpp
11	You probably copied this from some existing test, but I'd say this is putting unnecessary load on the system. For this use case even a 1-second sleep would be perfectly fine.
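
The delegation suggested for FindProcesses in the comments above can be illustrated with a small language-neutral sketch. None of these names come from LLDB: `Host`, `RecordingLog`, and the `find_processes_impl` callable are hypothetical stand-ins for a common entry point, a recorder, and a platform-specific lookup.

```python
from collections import deque


class RecordingLog:
    """Keeps one recorded result list per find_processes call."""

    def __init__(self):
        self.recordings = []

    def record(self, process_infos):
        self.recordings.append(list(process_infos))


class Host:
    """Common entry point that wraps a platform-specific lookup."""

    def __init__(self, find_processes_impl, recorder=None, replay=None):
        self._impl = find_processes_impl  # platform-specific lookup
        self._recorder = recorder         # set only while capturing
        self._replay = deque(replay) if replay is not None else None

    def find_processes(self, match_name):
        # Replay: hand back the next recorded answer, never query the system.
        if self._replay is not None:
            if not self._replay:
                raise RuntimeError("no recorded process list left to replay")
            return self._replay.popleft()

        result = self._impl(match_name)

        # Capture: remember what the platform returned for this call.
        if self._recorder is not None:
            self._recorder.record(result)
        return result
```

With the record and replay logic in one place, each platform only supplies the lookup itself, and any caller of the common entry point gets replay support without extra code.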

Address code review feedback.

JDevlieghere marked 7 inline comments as done. Mar 11 2020, 2:49 PM

JDevlieghere added inline comments.

lldb/source/Utility/ProcessInfo.cpp
417–430	Maybe/Probably? I'm not sure. But even if it were a bit simpler, I think it's better to reuse the existing multi-provider for consistency.

labath accepted this revision. Mar 13 2020, 4:38 AM

labath added inline comments.

lldb/test/API/functionalities/reproducers/attach/TestReproducerAttach.py
36	s/patch/path
54–55	self.assertIn(needle, haystack)
67–76	Would it be possible to run this in the context of the current process (via self.expect?)
75–76	I guess you meant assertIn here too
78–83	The reproducer dir will still linger on if the test fails for any reason. If you just put this in the build directory (by (ab)using `self.getBuildArtifact`), then maybe you don't need to clean it up, as it will be automatically deleted the next time the test suite runs...
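
The `assertIn` suggestion above is easy to try with plain `unittest`. The sample output string is made up for illustration, and `OutputChecks`, `run_checks`, and `failure_message` are hypothetical helpers rather than part of the test under review.

```python
import io
import unittest


class OutputChecks(unittest.TestCase):
    def test_substring(self):
        # Made-up debugger output, used only to demonstrate the assertion.
        output = "Process 4242 stopped\nsession finished\n"
        # assertIn(needle, haystack) names both operands when it fails.
        self.assertIn("Process 4242 stopped", output)
        self.assertIn("session finished", output)


def run_checks():
    """Run the test case quietly and return the unittest result object."""
    suite = unittest.defaultTestLoader.loadTestsFromTestCase(OutputChecks)
    return unittest.TextTestRunner(stream=io.StringIO()).run(suite)


def failure_message(needle, haystack):
    """Return the message assertIn produces when the needle is missing."""
    case = OutputChecks("test_substring")
    try:
        case.assertIn(needle, haystack)
    except AssertionError as error:
        return str(error)
    return None
```

Compared with `assertTrue(needle in haystack)`, which only reports that `False` is not true, the failure text shows what was searched for and where.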

This revision is now accepted and ready to land. Mar 13 2020, 4:38 AM

Closed by commit rG2451cbf07bbc: [lldb/Reproducers] Intercept the FindProcesses API (authored by JDevlieghere). Mar 13 2020, 9:40 AM

This revision was automatically updated to reflect the committed changes.

JDevlieghere marked an inline comment as done.

Revision Contents

Path	Size
lldb/include/lldb/Host/Host.h	4 lines
lldb/include/lldb/Utility/ProcessInfo.h	37 lines
lldb/source/Commands/CommandObjectReproducer.cpp	73 lines
lldb/source/Host/common/Host.cpp	20 lines
lldb/source/Host/linux/Host.cpp	4 lines
lldb/source/Host/macosx/objcxx/Host.mm	4 lines
lldb/source/Host/netbsd/Host.cpp	4 lines
lldb/source/Host/openbsd/Host.cpp	4 lines
lldb/source/Utility/ProcessInfo.cpp	84 lines
lldb/test/API/functionalities/reproducers/attach/Makefile	2 lines
lldb/test/API/functionalities/reproducers/attach/TestReproducerAttach.py	71 lines
lldb/test/API/functionalities/reproducers/attach/main.cpp	24 lines

Diff 250236

lldb/include/lldb/Host/Host.h

(226 lines not shown)	public:

static bool OpenFileInExternalEditor(const FileSpec &file_spec,		static bool OpenFileInExternalEditor(const FileSpec &file_spec,
uint32_t line_no);		uint32_t line_no);

static Environment GetEnvironment();		static Environment GetEnvironment();

static std::unique_ptr<Connection>		static std::unique_ptr<Connection>
CreateDefaultConnection(llvm::StringRef url);		CreateDefaultConnection(llvm::StringRef url);

		protected:
		static uint32_t FindProcessesImpl(const ProcessInstanceInfoMatch &match_info,
		ProcessInstanceInfoList &proc_infos);
};		};

} // namespace lldb_private		} // namespace lldb_private

namespace llvm {		namespace llvm {
template <> struct format_provider<lldb_private::WaitStatus> {		template <> struct format_provider<lldb_private::WaitStatus> {
/// Options = "" gives a human readable description of the status Options =		/// Options = "" gives a human readable description of the status Options =
/// "g" gives a gdb-remote protocol status (e.g., X09)		/// "g" gives a gdb-remote protocol status (e.g., X09)
static void format(const lldb_private::WaitStatus &WS, raw_ostream &OS,		static void format(const lldb_private::WaitStatus &WS, raw_ostream &OS,
llvm::StringRef Options);		llvm::StringRef Options);
};		};
} // namespace llvm		} // namespace llvm

#endif // LLDB_HOST_HOST_H		#endif // LLDB_HOST_HOST_H

lldb/include/lldb/Utility/ProcessInfo.h

//===-- ProcessInfo.h -------------------------------------------- C++ --===//		//===-- ProcessInfo.h -------------------------------------------- C++ --===//
//		//
// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.		// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
// See https://llvm.org/LICENSE.txt for license information.		// See https://llvm.org/LICENSE.txt for license information.
// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception		// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#ifndef LLDB_UTILITY_PROCESSINFO_H		#ifndef LLDB_UTILITY_PROCESSINFO_H
#define LLDB_UTILITY_PROCESSINFO_H		#define LLDB_UTILITY_PROCESSINFO_H

#include "lldb/Utility/ArchSpec.h"		#include "lldb/Utility/ArchSpec.h"
#include "lldb/Utility/Args.h"		#include "lldb/Utility/Args.h"
#include "lldb/Utility/Environment.h"		#include "lldb/Utility/Environment.h"
#include "lldb/Utility/FileSpec.h"		#include "lldb/Utility/FileSpec.h"
#include "lldb/Utility/NameMatches.h"		#include "lldb/Utility/NameMatches.h"
		#include "lldb/Utility/Reproducer.h"
#include "llvm/Support/YAMLTraits.h"		#include "llvm/Support/YAMLTraits.h"
#include <vector>		#include <vector>

namespace lldb_private {		namespace lldb_private {

class UserIDResolver;		class UserIDResolver;

// ProcessInfo		// ProcessInfo
(185 lines not shown)	public:
void Clear();		void Clear();

protected:		protected:
ProcessInstanceInfo m_match_info;		ProcessInstanceInfo m_match_info;
NameMatch m_name_match_type;		NameMatch m_name_match_type;
bool m_match_all_users;		bool m_match_all_users;
};		};

		namespace repro {
		class ProcessInfoRecorder : public AbstractRecorder {
		public:
		ProcessInfoRecorder(const FileSpec &filename, std::error_code &ec)
		: AbstractRecorder(filename, ec) {}

		static llvm::Expected<std::unique_ptr<ProcessInfoRecorder>>
		Create(const FileSpec &filename);

		void Record(const ProcessInstanceInfoList &process_infos);
		};

		class ProcessInfoProvider : public repro::Provider<ProcessInfoProvider> {
		public:
		struct Info {
		static const char *name;
		static const char *file;
		};

		ProcessInfoProvider(const FileSpec &directory) : Provider(directory) {}

		ProcessInfoRecorder *GetNewProcessInfoRecorder();

		void Keep() override;
		void Discard() override;

		static char ID;

		private:
		std::unique_ptr<llvm::raw_fd_ostream> m_stream_up;
		std::vector<std::unique_ptr<ProcessInfoRecorder>> m_process_info_recorders;
		};

		llvm::Optional<ProcessInstanceInfoList> GetReplayProcessInstanceInfoList();

		} // namespace repro
} // namespace lldb_private		} // namespace lldb_private

LLVM_YAML_IS_SEQUENCE_VECTOR(lldb_private::ProcessInstanceInfo)		LLVM_YAML_IS_SEQUENCE_VECTOR(lldb_private::ProcessInstanceInfo)

namespace llvm {		namespace llvm {
namespace yaml {		namespace yaml {
template <> struct MappingTraits<lldb_private::ProcessInstanceInfo> {		template <> struct MappingTraits<lldb_private::ProcessInstanceInfo> {
static void mapping(IO &io, lldb_private::ProcessInstanceInfo &PII);		static void mapping(IO &io, lldb_private::ProcessInstanceInfo &PII);
};		};
} // namespace yaml		} // namespace yaml
} // namespace llvm		} // namespace llvm

#endif // LLDB_UTILITY_PROCESSINFO_H		#endif // LLDB_UTILITY_PROCESSINFO_H

lldb/source/Commands/CommandObjectReproducer.cpp

//===-- CommandObjectReproducer.cpp ---------------------------------------===//		//===-- CommandObjectReproducer.cpp ---------------------------------------===//
//		//
// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.		// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
// See https://llvm.org/LICENSE.txt for license information.		// See https://llvm.org/LICENSE.txt for license information.
// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception		// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#include "CommandObjectReproducer.h"		#include "CommandObjectReproducer.h"

		#include "lldb/Host/HostInfo.h"
#include "lldb/Host/OptionParser.h"		#include "lldb/Host/OptionParser.h"
#include "lldb/Utility/GDBRemote.h"
#include "lldb/Utility/Reproducer.h"

#include "lldb/Interpreter/CommandInterpreter.h"		#include "lldb/Interpreter/CommandInterpreter.h"
#include "lldb/Interpreter/CommandReturnObject.h"		#include "lldb/Interpreter/CommandReturnObject.h"
#include "lldb/Interpreter/OptionArgParser.h"		#include "lldb/Interpreter/OptionArgParser.h"
		#include "lldb/Utility/GDBRemote.h"
		#include "lldb/Utility/ProcessInfo.h"
		#include "lldb/Utility/Reproducer.h"

#include <csignal>		#include <csignal>

using namespace lldb;		using namespace lldb;
using namespace llvm;		using namespace llvm;
using namespace lldb_private;		using namespace lldb_private;
using namespace lldb_private::repro;		using namespace lldb_private::repro;

enum ReproducerProvider {		enum ReproducerProvider {
eReproducerProviderCommands,		eReproducerProviderCommands,
eReproducerProviderFiles,		eReproducerProviderFiles,
eReproducerProviderGDB,		eReproducerProviderGDB,
		eReproducerProviderProcessInfo,
eReproducerProviderVersion,		eReproducerProviderVersion,
eReproducerProviderWorkingDirectory,		eReproducerProviderWorkingDirectory,
eReproducerProviderNone		eReproducerProviderNone
};		};

static constexpr OptionEnumValueElement g_reproducer_provider_type[] = {		static constexpr OptionEnumValueElement g_reproducer_provider_type[] = {
{		{
eReproducerProviderCommands,		eReproducerProviderCommands,
"commands",		"commands",
"Command Interpreter Commands",		"Command Interpreter Commands",
},		},
{		{
eReproducerProviderFiles,		eReproducerProviderFiles,
"files",		"files",
"Files",		"Files",
},		},
{		{
eReproducerProviderGDB,		eReproducerProviderGDB,
"gdb",		"gdb",
"GDB Remote Packets",		"GDB Remote Packets",
},		},
{		{
		eReproducerProviderProcessInfo,
		"processes",
		"Process Info",
		},
		{
eReproducerProviderVersion,		eReproducerProviderVersion,
"version",		"version",
"Version",		"Version",
},		},
{		{
eReproducerProviderWorkingDirectory,		eReproducerProviderWorkingDirectory,
"cwd",		"cwd",
"Working Directory",		"Working Directory",
(32 lines not shown)

static constexpr OptionEnumValues ReproducerSignalType() {		static constexpr OptionEnumValues ReproducerSignalType() {
return OptionEnumValues(g_reproducer_signaltype);		return OptionEnumValues(g_reproducer_signaltype);
}		}

#define LLDB_OPTIONS_reproducer_xcrash		#define LLDB_OPTIONS_reproducer_xcrash
#include "CommandOptions.inc"		#include "CommandOptions.inc"

		template <typename T>
		llvm::Expected<T> static ReadFromYAML(StringRef filename) {
		auto error_or_file = MemoryBuffer::getFile(filename);
		if (auto err = error_or_file.getError()) {
		return errorCodeToError(err);
		}

		T t;
		yaml::Input yin((*error_or_file)->getBuffer());
		yin >> t;

		if (auto err = yin.error()) {
		return errorCodeToError(err);
		}

		return t;
		}

class CommandObjectReproducerGenerate : public CommandObjectParsed {		class CommandObjectReproducerGenerate : public CommandObjectParsed {
public:		public:
CommandObjectReproducerGenerate(CommandInterpreter &interpreter)		CommandObjectReproducerGenerate(CommandInterpreter &interpreter)
: CommandObjectParsed(		: CommandObjectParsed(
interpreter, "reproducer generate",		interpreter, "reproducer generate",
"Generate reproducer on disk. When the debugger is in capture "		"Generate reproducer on disk. When the debugger is in capture "
"mode, this command will output the reproducer to a directory on "		"mode, this command will output the reproducer to a directory on "
"disk and quit. In replay mode this command in a no-op.",		"disk and quit. In replay mode this command in a no-op.",
(345 lines not shown)	case eReproducerProviderGDB: {
SetError(result,		SetError(result,
make_error<StringError>("Unable to create GDB loader.",		make_error<StringError>("Unable to create GDB loader.",
llvm::inconvertibleErrorCode()));		llvm::inconvertibleErrorCode()));
return false;		return false;
}		}

Before:

  llvm::Optional<std::string> gdb_file;
  while ((gdb_file = multi_loader->GetNextFile())) {
    auto error_or_file = MemoryBuffer::getFile(*gdb_file);
    if (auto err = error_or_file.getError()) {
      SetError(result, errorCodeToError(err));
      return false;
    }

    std::vector<GDBRemotePacket> packets;
    yaml::Input yin((*error_or_file)->getBuffer());
    yin >> packets;

    if (auto err = yin.error()) {
      SetError(result, errorCodeToError(err));
      return false;
    }

    for (GDBRemotePacket &packet : packets) {
      packet.Dump(result.GetOutputStream());
    }
  }

  result.SetStatus(eReturnStatusSuccessFinishResult);
  return true;
}

After:

  llvm::Optional<std::string> gdb_file;
  while ((gdb_file = multi_loader->GetNextFile())) {
    if (llvm::Expected<std::vector<GDBRemotePacket>> packets =
            ReadFromYAML<std::vector<GDBRemotePacket>>(*gdb_file)) {
      for (GDBRemotePacket &packet : *packets) {
        packet.Dump(result.GetOutputStream());
      }
    } else {
      SetError(result, packets.takeError());
      return false;
    }
  }

  result.SetStatus(eReturnStatusSuccessFinishResult);
  return true;
}
case eReproducerProviderProcessInfo: {
  std::unique_ptr<repro::MultiLoader<repro::ProcessInfoProvider>>
      multi_loader =
          repro::MultiLoader<repro::ProcessInfoProvider>::Create(loader);

  if (!multi_loader) {
    SetError(result, make_error<StringError>(
                         llvm::inconvertibleErrorCode(),
                         "Unable to create process info loader."));
    return false;
  }

  llvm::Optional<std::string> process_file;
  while ((process_file = multi_loader->GetNextFile())) {
    if (llvm::Expected<ProcessInstanceInfoList> infos =
            ReadFromYAML<ProcessInstanceInfoList>(*process_file)) {
      for (ProcessInstanceInfo info : *infos)
        info.Dump(result.GetOutputStream(), HostInfo::GetUserIDResolver());
    } else {
      SetError(result, infos.takeError());
      return false;
    }
  }

  result.SetStatus(eReturnStatusSuccessFinishResult);
  return true;
}
case eReproducerProviderNone:		case eReproducerProviderNone:
result.SetError("No valid provider specified.");		result.SetError("No valid provider specified.");
return false;		return false;
}		}

result.SetStatus(eReturnStatusSuccessFinishNoResult);		result.SetStatus(eReturnStatusSuccessFinishNoResult);
return result.Succeeded();		return result.Succeeded();
}		}

private:		private:
CommandOptions m_options;		CommandOptions m_options;
};		};

CommandObjectReproducer::CommandObjectReproducer(		CommandObjectReproducer::CommandObjectReproducer(
CommandInterpreter &interpreter)		CommandInterpreter &interpreter)
: CommandObjectMultiword(		: CommandObjectMultiword(
interpreter, "reproducer",		interpreter, "reproducer",
"Commands for manipulating reproducers. Reproducers make it "		"Commands for manipulating reproducers. Reproducers make it "
"possible "		"possible "
"to capture full debug sessions with all its dependencies. The "		"to capture full debug sessions with all its dependencies. The "
"resulting reproducer is used to replay the debug session while "		"resulting reproducer is used to replay the debug session while "
"debugging the debugger.\n"		"debugging the debugger.\n"
"Because reproducers need the whole the debug session from "		"Because reproducers need the whole the debug session from "
"beginning to end, you need to launch the debugger in capture or "		"beginning to end, you need to launch the debugger in capture or "
"replay mode, commonly though the command line driver.\n"		"replay mode, commonly though the command line driver.\n"
"Reproducers are unrelated record-replay debugging, as you cannot "		"Reproducers are unrelated record-replay debugging, as you cannot "
"interact with the debugger during replay.\n",		"interact with the debugger during replay.\n",
"reproducer <subcommand> [<subcommand-options>]") {		"reproducer <subcommand> [<subcommand-options>]") {
LoadSubCommand(		LoadSubCommand(
"generate",		"generate",
CommandObjectSP(new CommandObjectReproducerGenerate(interpreter)));		CommandObjectSP(new CommandObjectReproducerGenerate(interpreter)));
LoadSubCommand("status", CommandObjectSP(		LoadSubCommand("status", CommandObjectSP(
new CommandObjectReproducerStatus(interpreter)));		new CommandObjectReproducerStatus(interpreter)));
LoadSubCommand("dump",		LoadSubCommand("dump",
CommandObjectSP(new CommandObjectReproducerDump(interpreter)));		CommandObjectSP(new CommandObjectReproducerDump(interpreter)));
LoadSubCommand("xcrash", CommandObjectSP(		LoadSubCommand("xcrash", CommandObjectSP(
new CommandObjectReproducerXCrash(interpreter)));		new CommandObjectReproducerXCrash(interpreter)));
}		}

CommandObjectReproducer::~CommandObjectReproducer() = default;		CommandObjectReproducer::~CommandObjectReproducer() = default;

lldb/source/Host/common/Host.cpp

(672 lines not shown)	case WaitStatus::Signal:
desc = "Killed by signal";		desc = "Killed by signal";
break;		break;
case WaitStatus::Stop:		case WaitStatus::Stop:
desc = "Stopped by signal";		desc = "Stopped by signal";
break;		break;
}		}
OS << desc << " " << int(WS.status);		OS << desc << " " << int(WS.status);
}		}

		uint32_t Host::FindProcesses(const ProcessInstanceInfoMatch &match_info,
		ProcessInstanceInfoList &process_infos) {

		if (llvm::Optional<ProcessInstanceInfoList> infos =
		repro::GetReplayProcessInstanceInfoList()) {
		process_infos = *infos;
		return process_infos.size();
		}

		uint32_t result = FindProcessesImpl(match_info, process_infos);

		if (repro::Generator *g = repro::Reproducer::Instance().GetGenerator()) {
		g->GetOrCreate<repro::ProcessInfoProvider>()
		.GetNewProcessInfoRecorder()
		->Record(process_infos);
		}

		return result;
		}

lldb/source/Host/linux/Host.cpp

(215 lines not shown)	static bool GetProcessAndStatInfo(::pid_t pid,

// Get User and Group IDs and get tracer pid.		// Get User and Group IDs and get tracer pid.
if (!GetStatusInfo(pid, process_info, State, tracerpid))		if (!GetStatusInfo(pid, process_info, State, tracerpid))
return false;		return false;

return true;		return true;
}		}

uint32_t Host::FindProcesses(const ProcessInstanceInfoMatch &match_info,		uint32_t Host::FindProcessesImpl(const ProcessInstanceInfoMatch &match_info,
ProcessInstanceInfoList &process_infos) {		ProcessInstanceInfoList &process_infos) {
static const char procdir[] = "/proc/";		static const char procdir[] = "/proc/";

DIR *dirproc = opendir(procdir);		DIR *dirproc = opendir(procdir);
if (dirproc) {		if (dirproc) {
struct dirent *direntry = nullptr;		struct dirent *direntry = nullptr;
const uid_t our_uid = getuid();		const uid_t our_uid = getuid();
const lldb::pid_t our_pid = getpid();		const lldb::pid_t our_pid = getpid();
bool all_users = match_info.GetMatchAllUsers();		bool all_users = match_info.GetMatchAllUsers();
(78 lines not shown)

lldb/source/Host/macosx/objcxx/Host.mm

(585 lines not shown)	static bool GetMacOSXProcessUserAndGroup(ProcessInstanceInfo &process_info) {
process_info.SetParentProcessID(LLDB_INVALID_PROCESS_ID);		process_info.SetParentProcessID(LLDB_INVALID_PROCESS_ID);
process_info.SetUserID(UINT32_MAX);		process_info.SetUserID(UINT32_MAX);
process_info.SetGroupID(UINT32_MAX);		process_info.SetGroupID(UINT32_MAX);
process_info.SetEffectiveUserID(UINT32_MAX);		process_info.SetEffectiveUserID(UINT32_MAX);
process_info.SetEffectiveGroupID(UINT32_MAX);		process_info.SetEffectiveGroupID(UINT32_MAX);
return false;		return false;
}		}

uint32_t Host::FindProcesses(const ProcessInstanceInfoMatch &match_info,		uint32_t Host::FindProcessesImpl(const ProcessInstanceInfoMatch &match_info,
ProcessInstanceInfoList &process_infos) {		ProcessInstanceInfoList &process_infos) {
		labath (Done): This means that every implementation of FindProcesses will need to introduce this boilerplate. We should put this into common code somehow. One way to do that would be to rename all the platform-specific implementations to something like DoFindProcesses, and then implement FindProcesses in `source/Host/common/Host.cpp` to handle the delegation and reproducer logic.
std::vector<struct kinfo_proc> kinfos;		std::vector<struct kinfo_proc> kinfos;

int mib[3] = {CTL_KERN, KERN_PROC, KERN_PROC_ALL};		int mib[3] = {CTL_KERN, KERN_PROC, KERN_PROC_ALL};

size_t pid_data_size = 0;		size_t pid_data_size = 0;
if (::sysctl(mib, 3, nullptr, &pid_data_size, nullptr, 0) != 0)		if (::sysctl(mib, 3, nullptr, &pid_data_size, nullptr, 0) != 0)
return 0;		return 0;

[... 903 lines not shown ...]
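
The delegation suggested in the inline comment above can be sketched roughly as follows. This is a minimal, self-contained model and not LLDB's code: `ProcessInfo`, `ReproducerState`, and the hard-coded process table are hypothetical stand-ins, and `DoFindProcesses` is only the name floated in the comment (the diff itself uses `FindProcessesImpl`). The point is that the replay and capture logic is written once in the common `FindProcesses`, while each platform supplies only the lookup.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <string>
#include <vector>

// Hypothetical stand-in for LLDB's ProcessInstanceInfo and its list type.
struct ProcessInfo {
  std::string name;
  uint64_t pid;
};
using ProcessInfoList = std::vector<ProcessInfo>;

// Hypothetical reproducer state: either capturing, replaying, or neither.
struct ReproducerState {
  bool capturing = false;
  bool replaying = false;
  std::vector<ProcessInfoList> recorded; // one entry per FindProcesses call
  std::size_t replay_index = 0;

  // Hands out recorded responses in the order they were captured.
  bool NextReplayed(ProcessInfoList &out) {
    if (!replaying || replay_index >= recorded.size())
      return false;
    out = recorded[replay_index++];
    return true;
  }
};

// Platform-specific part. A real host would walk /proc or call sysctl;
// a fixed table keeps the sketch runnable.
uint32_t DoFindProcesses(const std::string &match_name, ProcessInfoList &out) {
  const ProcessInfoList all = {{"a.out", 100}, {"lldb", 200}, {"a.out", 300}};
  for (const ProcessInfo &info : all)
    if (info.name == match_name)
      out.push_back(info);
  return static_cast<uint32_t>(out.size());
}

// Common part, shared by every platform.
uint32_t FindProcesses(ReproducerState &repro, const std::string &match_name,
                       ProcessInfoList &out) {
  // Replay: return the recorded answer without querying the host.
  if (repro.NextReplayed(out))
    return static_cast<uint32_t>(out.size());

  uint32_t result = DoFindProcesses(match_name, out);

  // Capture: remember the answer for a later replay.
  if (repro.capturing)
    repro.recorded.push_back(out);

  return result;
}
```

During replay the match criteria are effectively ignored, which mirrors the summary's point that the actual PID does not matter once the recorded response is served.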

lldb/source/Host/netbsd/Host.cpp

[... 170 lines not shown ...]	error:
process_info.SetParentProcessID(LLDB_INVALID_PROCESS_ID);		process_info.SetParentProcessID(LLDB_INVALID_PROCESS_ID);
process_info.SetUserID(UINT32_MAX);		process_info.SetUserID(UINT32_MAX);
process_info.SetGroupID(UINT32_MAX);		process_info.SetGroupID(UINT32_MAX);
process_info.SetEffectiveUserID(UINT32_MAX);		process_info.SetEffectiveUserID(UINT32_MAX);
process_info.SetEffectiveGroupID(UINT32_MAX);		process_info.SetEffectiveGroupID(UINT32_MAX);
return false;		return false;
}		}

uint32_t Host::FindProcesses(const ProcessInstanceInfoMatch &match_info,		uint32_t Host::FindProcessesImpl(const ProcessInstanceInfoMatch &match_info,
ProcessInstanceInfoList &process_infos) {		ProcessInstanceInfoList &process_infos) {
const ::pid_t our_pid = ::getpid();		const ::pid_t our_pid = ::getpid();
const ::uid_t our_uid = ::getuid();		const ::uid_t our_uid = ::getuid();

const bool all_users =		const bool all_users =
match_info.GetMatchAllUsers() ||		match_info.GetMatchAllUsers() ||
// Special case, if lldb is being run as root we can attach to anything		// Special case, if lldb is being run as root we can attach to anything
(our_uid == 0);		(our_uid == 0);

[... 80 lines not shown ...]

lldb/source/Host/openbsd/Host.cpp

[... 134 lines not shown ...]	static bool GetOpenBSDProcessUserAndGroup(ProcessInstanceInfo &process_info) {
process_info.SetParentProcessID(LLDB_INVALID_PROCESS_ID);		process_info.SetParentProcessID(LLDB_INVALID_PROCESS_ID);
process_info.SetUserID(UINT32_MAX);		process_info.SetUserID(UINT32_MAX);
process_info.SetGroupID(UINT32_MAX);		process_info.SetGroupID(UINT32_MAX);
process_info.SetEffectiveUserID(UINT32_MAX);		process_info.SetEffectiveUserID(UINT32_MAX);
process_info.SetEffectiveGroupID(UINT32_MAX);		process_info.SetEffectiveGroupID(UINT32_MAX);
return false;		return false;
}		}

uint32_t Host::FindProcesses(const ProcessInstanceInfoMatch &match_info,		uint32_t Host::FindProcessesImpl(const ProcessInstanceInfoMatch &match_info,
ProcessInstanceInfoList &process_infos) {		ProcessInstanceInfoList &process_infos) {
std::vector<struct kinfo_proc> kinfos;		std::vector<struct kinfo_proc> kinfos;

int mib[3] = {CTL_KERN, KERN_PROC, KERN_PROC_ALL};		int mib[3] = {CTL_KERN, KERN_PROC, KERN_PROC_ALL};

size_t pid_data_size = 0;		size_t pid_data_size = 0;
if (::sysctl(mib, 3, NULL, &pid_data_size, NULL, 0) != 0)		if (::sysctl(mib, 3, NULL, &pid_data_size, NULL, 0) != 0)
return 0;		return 0;

[... 67 lines not shown ...]

lldb/source/Utility/ProcessInfo.cpp

[... 12 lines not shown ...]
#include "lldb/Utility/StreamString.h"		#include "lldb/Utility/StreamString.h"
#include "lldb/Utility/UserIDResolver.h"		#include "lldb/Utility/UserIDResolver.h"
#include "llvm/ADT/SmallString.h"		#include "llvm/ADT/SmallString.h"

#include <climits>		#include <climits>

using namespace lldb;		using namespace lldb;
using namespace lldb_private;		using namespace lldb_private;
		using namespace lldb_private::repro;

ProcessInfo::ProcessInfo()		ProcessInfo::ProcessInfo()
: m_executable(), m_arguments(), m_environment(), m_uid(UINT32_MAX),		: m_executable(), m_arguments(), m_environment(), m_uid(UINT32_MAX),
m_gid(UINT32_MAX), m_arch(), m_pid(LLDB_INVALID_PROCESS_ID) {}		m_gid(UINT32_MAX), m_arch(), m_pid(LLDB_INVALID_PROCESS_ID) {}

ProcessInfo::ProcessInfo(const char *name, const ArchSpec &arch,		ProcessInfo::ProcessInfo(const char *name, const ArchSpec &arch,
lldb::pid_t pid)		lldb::pid_t pid)
: m_executable(name), m_arguments(), m_environment(), m_uid(UINT32_MAX),		: m_executable(name), m_arguments(), m_environment(), m_uid(UINT32_MAX),
[... 310 lines not shown ...]	void llvm::yaml::MappingTraits<ProcessInstanceInfo>::mapping(
io.mapRequired("arch", Info.m_arch);		io.mapRequired("arch", Info.m_arch);
io.mapRequired("uid", Info.m_uid);		io.mapRequired("uid", Info.m_uid);
io.mapRequired("gid", Info.m_gid);		io.mapRequired("gid", Info.m_gid);
io.mapRequired("pid", Info.m_pid);		io.mapRequired("pid", Info.m_pid);
io.mapRequired("effective-uid", Info.m_euid);		io.mapRequired("effective-uid", Info.m_euid);
io.mapRequired("effective-gid", Info.m_egid);		io.mapRequired("effective-gid", Info.m_egid);
io.mapRequired("parent-pid", Info.m_parent_pid);		io.mapRequired("parent-pid", Info.m_parent_pid);
}		}

		llvm::Expected<std::unique_ptr<ProcessInfoRecorder>>
		ProcessInfoRecorder::Create(const FileSpec &filename) {
		std::error_code ec;
		auto recorder =
		std::make_unique<ProcessInfoRecorder>(std::move(filename), ec);
		if (ec)
		return llvm::errorCodeToError(ec);
		return std::move(recorder);
		}

		void ProcessInfoProvider::Keep() {
		std::vector<std::string> files;
		for (auto &recorder : m_process_info_recorders) {
		recorder->Stop();
		files.push_back(recorder->GetFilename().GetPath());
		}

		FileSpec file = GetRoot().CopyByAppendingPathComponent(Info::file);
		std::error_code ec;
		llvm::raw_fd_ostream os(file.GetPath(), ec, llvm::sys::fs::OF_Text);
		if (ec)
		return;
		llvm::yaml::Output yout(os);
		yout << files;
		}

		void ProcessInfoProvider::Discard() { m_process_info_recorders.clear(); }

		ProcessInfoRecorder *ProcessInfoProvider::GetNewProcessInfoRecorder() {
		std::size_t i = m_process_info_recorders.size() + 1;
		std::string filename = (llvm::Twine(Info::name) + llvm::Twine("-") +
		llvm::Twine(i) + llvm::Twine(".yaml"))
		.str();
		auto recorder_or_error = ProcessInfoRecorder::Create(
		GetRoot().CopyByAppendingPathComponent(filename));
		if (!recorder_or_error) {
		llvm::consumeError(recorder_or_error.takeError());
		return nullptr;
		}

		m_process_info_recorders.push_back(std::move(*recorder_or_error));
		return m_process_info_recorders.back().get();
		}

		void ProcessInfoRecorder::Record(const ProcessInstanceInfoList &process_infos) {
		if (!m_record)
		return;
		llvm::yaml::Output yout(m_os);
		yout << const_cast<ProcessInstanceInfoList &>(process_infos);
		m_os.flush();
		}

		llvm::Optional<ProcessInstanceInfoList>
		repro::GetReplayProcessInstanceInfoList() {
		static std::unique_ptr<repro::MultiLoader<repro::ProcessInfoProvider>>
		labath (Not Done): You don't actually have to provide these functions if they are not going to do anything.
		loader = repro::MultiLoader<repro::ProcessInfoProvider>::Create(
		repro::Reproducer::Instance().GetLoader());

		if (!loader)
		return {};

		llvm::Optional<std::string> nextfile = loader->GetNextFile();
		if (!nextfile)
		return {};

		auto error_or_file = llvm::MemoryBuffer::getFile(*nextfile);
		if (std::error_code err = error_or_file.getError())
		return {};

		ProcessInstanceInfoList infos;
		llvm::yaml::Input yin((*error_or_file)->getBuffer());
		yin >> infos;

		if (auto err = yin.error())
		return {};

		return infos;
		}

		char ProcessInfoProvider::ID = 0;
		const char *ProcessInfoProvider::Info::file = "process-info.yaml";
		const char *ProcessInfoProvider::Info::name = "process-info";
		labath (Done): What's the type of this?
		labath (Done): Random thought: would any of this be simpler if this wasn't a "multi" provider but rather stored all of the responses as a sequence in a single file?
		JDevlieghere (author, Done): Maybe/Probably? I'm not sure. But even if it were a bit simpler, I think it's better to reuse the existing multi-provider for consistency.
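
For comparison, the single-file alternative raised in the comment above might look something like the sketch below. It is only an illustration of the idea and was not adopted in this revision: the line-based text format, `RecordResponse`, and `ReplayResponse` are hypothetical, and the format assumes process names contain no whitespace. Each call appends one response to the same stream, and replay reads responses back in the order they were written.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <istream>
#include <ostream>
#include <sstream>
#include <string>
#include <vector>

// Hypothetical stand-in for LLDB's ProcessInstanceInfo and its list type.
struct ProcessInfo {
  std::string name;
  uint64_t pid;
};
using ProcessInfoList = std::vector<ProcessInfo>;

// Append one response: a count line followed by one "pid name" line per entry.
void RecordResponse(std::ostream &os, const ProcessInfoList &infos) {
  os << infos.size() << '\n';
  for (const ProcessInfo &info : infos)
    os << info.pid << ' ' << info.name << '\n';
}

// Read the next response in sequence. Returns false once the stream is
// exhausted or if an entry is malformed.
bool ReplayResponse(std::istream &is, ProcessInfoList &infos) {
  std::size_t count = 0;
  if (!(is >> count))
    return false;
  infos.clear();
  for (std::size_t i = 0; i < count; ++i) {
    ProcessInfo info;
    if (!(is >> info.pid >> info.name))
      return false;
    infos.push_back(info);
  }
  return true;
}
```

The trade-off discussed in the thread still applies: a single sequence avoids the index file and per-call file names, while the multi-provider keeps this provider consistent with the existing ones.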

lldb/test/API/functionalities/reproducers/attach/Makefile

This file was added.

				CXX_SOURCES := main.cpp
				include Makefile.rules

lldb/test/API/functionalities/reproducers/attach/TestReproducerAttach.py

This file was added.

				"""
				Test reproducer attach.
				"""

				import lldb
				import tempfile
				from lldbsuite.test import lldbtest_config
				from lldbsuite.test.decorators import *
				from lldbsuite.test.lldbtest import *
				from lldbsuite.test import lldbutil


				class CreateAfterAttachTestCase(TestBase):

				mydir = TestBase.compute_mydir(__file__)
				NO_DEBUG_INFO_TESTCASE = True

				@skipIfFreeBSD
				@skipIfNetBSD
				@skipIfWindows
				@skipIfRemote
				@skipIfiOSSimulator
				def test_create_after_attach_with_fork(self):
				"""Test thread creation after process attach."""
				exe = '%s_%d' % (self.testMethodName, os.getpid())

				token = self.getBuildArtifact(exe + '.token')
				if os.path.exists(token):
				os.remove(token)

				reproducer = self.getBuildArtifact(exe + '.reproducer')
				labath (Done): You still need to do the `wait_for_file_on_target` dance here to ensure that `lldb_enable_attach` is executed before we actually attach. One example of that is in `test/API/python_api/hello_world/main.c`.
				if os.path.exists(reproducer):
				try:
				shutil.rmtree(reproducer)
				except OSError:
				pass
				labath (Not Done): s/patch/path

				self.build(dictionary={'EXE': exe})
				self.addTearDownHook(self.cleanupSubprocesses)

				inferior = self.spawnSubprocess(self.getBuildArtifact(exe), [token])
				pid = inferior.pid

				lldbutil.wait_for_file_on_target(self, token)

				# Use Popen because pexpect is overkill and spawnSubprocess is
				# asynchronous.
				capture = subprocess.Popen([
				lldbtest_config.lldbExec, '-b', '--capture', '--capture-path',
				reproducer, '-o', 'proc att -n {}'.format(exe), '-o',
				'reproducer generate'
				],
				stdin=subprocess.PIPE,
				stdout=subprocess.PIPE,
				stderr=subprocess.PIPE)
				labath (Not Done): self.assertIn(needle, haystack)
				outs, errs = capture.communicate()
				self.assertIn('Process {} stopped'.format(pid), outs)
				self.assertIn('Reproducer written', outs)

				# Check that replay works.
				replay = subprocess.Popen(
				[lldbtest_config.lldbExec, '-replay', reproducer],
				stdin=subprocess.PIPE,
				stdout=subprocess.PIPE,
				stderr=subprocess.PIPE)
				outs, errs = replay.communicate()
				self.assertIn('Process {} stopped'.format(pid), outs)

				# We can dump the reproducer in the current context.
				self.expect('reproducer dump -f {} -p process'.format(reproducer),
				substrs=['pid = {}'.format(pid), 'name = {}'.format(exe)])
				labath (Not Done): I guess you meant assertIn here too
				labath (Not Done): The reproducer dir will still linger on if the test fails for any reason. If you just put this in the build directory (by (ab)using `self.getBuildArtifact`), then maybe you don't need to clean it up, as it will be automatically deleted the next time the test suite runs...

lldb/test/API/functionalities/reproducers/attach/main.cpp

This file was added.

				#include <chrono>
				#include <stdio.h>
				#include <thread>

				using std::chrono::seconds;

				int main(int argc, char const *argv[]) {
				lldb_enable_attach();

				// Create the synchronization token.
				FILE *f;
				labath (Done): You probably copied this from some existing test, but I'd say this is putting unnecessary load on the system. For this use case even a 1-second sleep would be perfectly fine.
				if (f = fopen(argv[1], "wx")) {
				fputs("\n", f);
				fflush(f);
				fclose(f);
				} else
				return 1;

				while (true) {
				std::this_thread::sleep_for(seconds(1));
				}

				return 0;
				}
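
The ordering that the token file enforces (the inferior enables attach first, then publishes the token, and only then does the test attach) can be modelled with the sketch below. It is not the implementation of `lldbutil.wait_for_file_on_target`; `CreateToken` and `WaitForFile` are hypothetical helpers, and the timeout and polling interval are arbitrary parameters chosen for illustration.

```cpp
#include <cassert>
#include <chrono>
#include <cstdio>
#include <string>
#include <thread>

// Inferior side: publish the token only after attach has been enabled.
// Mode "wx" fails if the file already exists, so a stale token from an
// earlier run is never mistaken for a fresh one.
bool CreateToken(const std::string &path) {
  std::FILE *f = std::fopen(path.c_str(), "wx");
  if (!f)
    return false;
  std::fputs("\n", f);
  return std::fclose(f) == 0;
}

// Test side: poll until the token appears or the timeout expires.
bool WaitForFile(const std::string &path, std::chrono::milliseconds timeout,
                 std::chrono::milliseconds interval) {
  const auto deadline = std::chrono::steady_clock::now() + timeout;
  do {
    if (std::FILE *f = std::fopen(path.c_str(), "r")) {
      std::fclose(f);
      return true;
    }
    // Sleep between checks rather than spinning, to keep system load low.
    std::this_thread::sleep_for(interval);
  } while (std::chrono::steady_clock::now() < deadline);
  return false;
}
```

This is also why the test removes any existing token before spawning the inferior: with exclusive creation, a leftover file would make the inferior exit with an error instead of signalling readiness.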

Diff 250236
