The lldb-server code is currently set up so that each NativeProcess
instance does its own waitpid handling. This works fine for the BSDs,
where the code can do a waitpid(process_id) and get information for
all threads in that process.
The situation is trickier on Linux, because waitpid(pid) will only
return information for the main thread of the process (the one whose
tid == pid). For this reason, the Linux code does a waitpid(-1) to get
information for all threads. This was fine while we supported just a
single process, but it becomes a problem once we have multiple
processes, as they end up stealing each other's events.
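To make the stealing concrete, here is a small standalone sketch (not
lldb code): two watcher threads are each responsible for one child
process, but both wait with -1, so either watcher can end up reaping
the other's event.

  // Illustration of the event-stealing problem: the reaped pid does not
  // necessarily match the pid the watcher is responsible for.
  // Compile with -pthread.
  #include <sys/wait.h>
  #include <unistd.h>
  #include <cstdio>
  #include <thread>

  int main() {
    pid_t child_a = fork();
    if (child_a == 0)
      _exit(0); // first child exits immediately
    pid_t child_b = fork();
    if (child_b == 0)
      _exit(0); // second child exits immediately

    auto watcher = [](const char *name, pid_t expected) {
      int status = 0;
      // Like the old lldb-server behaviour: wait for *any* child.
      pid_t got = waitpid(-1, &status, 0);
      std::printf("%s expected %d, reaped %d\n", name, (int)expected,
                  (int)got);
    };

    std::thread t1(watcher, "watcher A", child_a);
    std::thread t2(watcher, "watcher B", child_b);
    t1.join();
    t2.join();
  }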
There are two possible solutions to this problem:
- call waitpid(-1) centrally, and then dispatch the events to the appropriate process
- have each process call waitpid(tid) for all the threads it manages
This patch implements the second approach. Besides fitting better into
the existing design, it also has the added benefit of ensuring
predictable ordering for thread/process creation events (which come in
pairs -- one for the parent and one for the child). The first
approach, OTOH, would make this ordering even more complicated, since
we would have to keep the half-threads hanging in mid-air until we
find the process we should attach them to.
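For illustration, here is a minimal sketch of what the per-thread
polling could look like under this approach (hypothetical helper name,
not the actual NativeProcessLinux code):

  // Hypothetical helper: scan every tid owned by one process object and
  // collect (tid, status) pairs for the threads that reported an event.
  #include <sys/types.h>
  #include <sys/wait.h>
  #include <utility>
  #include <vector>

  std::vector<std::pair<pid_t, int>>
  WaitForProcessEvents(const std::vector<pid_t> &tids) {
    std::vector<std::pair<pid_t, int>> events;
    for (pid_t tid : tids) {
      int status = 0;
      // __WALL is needed to wait on clone children (non-leader threads);
      // WNOHANG keeps the scan non-blocking.
      pid_t ret = ::waitpid(tid, &status, __WALL | WNOHANG);
      if (ret == tid)
        events.emplace_back(tid, status);
    }
    return events;
  }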
The downside of this approach is an increased number of syscalls (one
waitpid for each thread), but I think we're a long way from needing to
optimize at that level, and the cleaner design is worth it.
The included test reproduces the circumstances that should trigger the
bug (which manifests as a hung test), but I have not been able to get
it to fail. The only place I've seen this failure mode is in very rare
hangs in the thread sanitizer tests (tsan forks an addr2line process
to produce its error messages).
Hmm, I'm a bit confused here. Why would the key already exist? I mean, you start with empty tid_events and catch only one event for every thread, correct?