This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
lldb/trunk/
-
trunk/
-
include/lldb/Target/
-
lldb/
-
Target/
-
Process.h
-
source/Target/
-
Target/
-
Process.cpp

Differential D21296

[lldb] Fixed race condition on private state thread exit, take 2
ClosedPublic

Authored by cameron314 on Jun 13 2016, 9:24 AM.

Download Raw Diff

Details

Reviewers

zturner
clayborg
jingham
labath

Commits

rG2c00fc41b494: [lldb] Fixed race conditions on private state thread exit
rLLDB272682: [lldb] Fixed race conditions on private state thread exit
rL272682: [lldb] Fixed race conditions on private state thread exit

Summary

This is the follow-up to D19122, which was accepted but subsequently reverted due to a bug it introduced (that I didn't see during local testing on Windows but which manifested quite often on Linux). That bug (a race between the Process object was being destroyed and the thread terminating, caused by the join not being done under certain conditions) is fixed in this version of the patch.

This patch fixes various races between the time the private state thread is signaled to exit and the time it actually exits (during which it no longer responds to events). Previously, this was consistently causing 2-second timeout delays on process detach/stop for us.

This also prevents crashes that were caused by the thread controlling its own owning pointer while the controller was using it (copying the thread wrapper is not enough to mitigate this, since the internal thread object was getting reset anyway). Again, we were seeing this consistently.

For what it's worth, I've run the test suite with this change (on Linux) without any regressions, and the number of reruns dropped from 15 to 0 for me (though that last part may be coincidence).

Diff Detail

Repository: rL LLVM

Event Timeline

cameron314 updated this revision to Diff 60536.Jun 13 2016, 9:24 AM

cameron314 retitled this revision from to [lldb] Fixed race condition on private state thread exit, take 2.

cameron314 updated this object.

cameron314 added reviewers: clayborg, labath, zturner.

cameron314 added a subscriber: lldb-commits.

This looks like it would work for normal operation. I am not sure it will work when an extra private state thread is spun up. In certain circumstances we need to create more private state threads. This happens when you have an expression that is run from the private state thread. Since the private state thread is what controls the actual process, we can't run an expression from the current private state thread, so we spin up new ones. The current code is doing some tricky things to deal with this, and that was part of the reason Process::ControlPrivateStateThread() was making a copy of the current value of "m_private_state_thread" into a local variable named "private_state_thread":

HostThread private_state_thread(m_private_state_thread);

See the code in "Process::RunThreadPlan()" around the code:

if (m_private_state_thread.EqualsThread(Host::GetCurrentThread()))

The new loop as written in Process::ControlPrivateStateThread() could end up using m_private_state_thread with differing contents in the "if (m_private_state_thread.IsJoinable())" if statement. Jim Ingham made these changes, so we should add him to the reviewer list. I am going to mark as "Request Changes" so we can address any such issues, but Jim should chime in on this before we proceed.

The way this normally happens is an expression is being run and while handling the expression on the private state thread, we need to run another expression (like a call to "mmap" in the debugged process so we can allocate memory), so we need to spin up another private state thread to handle the needed starts and stops. Only one of these threads will be actively grabbing events at a single time, so your patch might just work, but I want to get some more eyes on this to be sure.

Please add Jim Ingham as a reviewer.

This revision now requires changes to proceed.Jun 13 2016, 1:40 PM

cameron314 added a reviewer: jingham.Jun 13 2016, 2:01 PM

@clayborg: Thanks for having a look! I've added Jim Ingham as a reviewer. @jingham, I'd appreciate if you could take a few minutes to look this over.

Right, I'd seen the backup/restore of the thread. As far as I can tell it should still work; the code in ControlPrivateStateThread has no idea it's working with a temporary thread, just as it didn't know before (in fact, if you look carefully at the code in the present tip of the trunk, a recent change seems to have introduced a mix of using both private_state_thread and m_private_state_thread, probably by accident). m_private_state_thread cannot be reset to the backup during a control event, since the first thing that's done before restoring the backup thread is to stop the temporary thread.

Ok, as long as Jim agrees, then I will give it the go ahead.

This looks okay to me.

Looks good if Jim is happy.

This revision is now accepted and ready to land.Jun 13 2016, 3:37 PM

Seems to run fine on linux now. Thanks for investigating this. We'll monitor the buildbots and let you know if anything bad happens. ;)

BTW. your comment in ControlPrivateStateThread seems to indicate that the linux behavior is inconsistent/unexpected in some way. Do you think it would be worth filing a bug about that?

Thanks everyone :-)

Ah, yeah, sorry if I gave the wrong impression, but that comment is not specific to Linux (in fact, I've only seen it once, on Windows). At one point the debugger had entered ControlPrivateStateThread on one thread to stop it, seen that the thread was already in an invalid state (it was), and assumed that meant that the thread was already exiting and did a join without sending the stop. But the state thread somehow wasn't on its way out yet, it was stuck waiting for a control event first (this is the part that I'm not sure should be possible, but empirically is). This caused a deadlock. So I changed my patch to always send the event if the thread is joinable, not just if its state is valid, and left that comment to explain why this must remain so.

Closed by commit rL272682: [lldb] Fixed race conditions on private state thread exit (authored by cameron314). · Explain WhyJun 14 2016, 9:29 AM

This revision was automatically updated to reflect the committed changes.

OK, i see. Thanks for the explanation. This may actually be some
windows specific thing then, as I remember zachary mentioning they
have some flakyness issues there.

BTW, this has sped up the LLDB test suite nearly 2x, so thanks a lot
for that. :)

Revision Contents

Path

Size

lldb/

trunk/

include/

lldb/

Target/

Process.h

8 lines

source/

Target/

Process.cpp

54 lines

Diff 60693

lldb/trunk/include/lldb/Target/Process.h

Show First 20 Lines • Show All 3,303 Lines • ▼ Show 20 Lines	protected:
private:		private:
uint32_t m_exec_count;		uint32_t m_exec_count;
std::string m_exit_string;		std::string m_exit_string;
};		};

bool		bool
PrivateStateThreadIsValid () const		PrivateStateThreadIsValid () const
{		{
return m_private_state_thread.IsJoinable();		lldb::StateType state = m_private_state.GetValue();
		return state != lldb::eStateInvalid &&
		state != lldb::eStateDetached &&
		state != lldb::eStateExited &&
		m_private_state_thread.IsJoinable();
}		}

void		void
ForceNextEventDelivery()		ForceNextEventDelivery()
{		{
m_force_next_event_delivery = true;		m_force_next_event_delivery = true;
}		}

//------------------------------------------------------------------		//------------------------------------------------------------------
// Type definitions		// Type definitions
▲ Show 20 Lines • Show All 226 Lines • Show Last 20 Lines

lldb/trunk/source/Target/Process.cpp

Show First 20 Lines • Show All 4,082 Lines • ▼ Show 20 Lines
Process::ResumePrivateStateThread ()		Process::ResumePrivateStateThread ()
{		{
ControlPrivateStateThread (eBroadcastInternalStateControlResume);		ControlPrivateStateThread (eBroadcastInternalStateControlResume);
}		}

void		void
Process::StopPrivateStateThread ()		Process::StopPrivateStateThread ()
{		{
if (PrivateStateThreadIsValid ())		if (m_private_state_thread.IsJoinable ())
ControlPrivateStateThread (eBroadcastInternalStateControlStop);		ControlPrivateStateThread (eBroadcastInternalStateControlStop);
else		else
{		{
Log *log(lldb_private::GetLogIfAllCategoriesSet(LIBLLDB_LOG_PROCESS));		Log *log(lldb_private::GetLogIfAllCategoriesSet(LIBLLDB_LOG_PROCESS));
if (log)		if (log)
log->Printf ("Went to stop the private state thread, but it was already invalid.");		log->Printf ("Went to stop the private state thread, but it was already invalid.");
}		}
}		}

void		void
Process::ControlPrivateStateThread (uint32_t signal)		Process::ControlPrivateStateThread (uint32_t signal)
{		{
Log *log(lldb_private::GetLogIfAllCategoriesSet(LIBLLDB_LOG_PROCESS));		Log *log(lldb_private::GetLogIfAllCategoriesSet(LIBLLDB_LOG_PROCESS));

assert (signal == eBroadcastInternalStateControlStop \|\|		assert (signal == eBroadcastInternalStateControlStop \|\|
signal == eBroadcastInternalStateControlPause \|\|		signal == eBroadcastInternalStateControlPause \|\|
signal == eBroadcastInternalStateControlResume);		signal == eBroadcastInternalStateControlResume);

if (log)		if (log)
log->Printf ("Process::%s (signal = %d)", __FUNCTION__, signal);		log->Printf ("Process::%s (signal = %d)", __FUNCTION__, signal);

// Signal the private state thread. First we should copy this is case the		// Signal the private state thread
// thread starts exiting since the private state thread will NULL this out		if (m_private_state_thread.IsJoinable())
// when it exits
{
HostThread private_state_thread(m_private_state_thread);
if (private_state_thread.IsJoinable())
{		{
		// Broadcast the event.
		// It is important to do this outside of the if below, because
		// it's possible that the thread state is invalid but that the
		// thread is waiting on a control event instead of simply being
		// on its way out (this should not happen, but it apparently can).
if (log)		if (log)
log->Printf ("Sending control event of type: %d.", signal);		log->Printf ("Sending control event of type: %d.", signal);
// Send the control event and wait for the receipt or for the private state
// thread to exit
std::shared_ptr<EventDataReceipt> event_receipt_sp(new EventDataReceipt());		std::shared_ptr<EventDataReceipt> event_receipt_sp(new EventDataReceipt());
m_private_state_control_broadcaster.BroadcastEvent(signal, event_receipt_sp);		m_private_state_control_broadcaster.BroadcastEvent(signal, event_receipt_sp);

		// Wait for the event receipt or for the private state thread to exit
bool receipt_received = false;		bool receipt_received = false;
		if (PrivateStateThreadIsValid())
		{
while (!receipt_received)		while (!receipt_received)
{		{
bool timed_out = false;		bool timed_out = false;
TimeValue timeout_time;		TimeValue timeout_time;
timeout_time = TimeValue::Now();		timeout_time = TimeValue::Now();
timeout_time.OffsetWithSeconds(2);		timeout_time.OffsetWithSeconds(2);
// Check for a receipt for 2 seconds and then check if the private state		// Check for a receipt for 2 seconds and then check if the private state
// thread is still around.		// thread is still around.
receipt_received = event_receipt_sp->WaitForEventReceived (&timeout_time, &timed_out);		receipt_received = event_receipt_sp->WaitForEventReceived (&timeout_time, &timed_out);
if (!receipt_received)		if (!receipt_received)
{		{
// Check if the private state thread is still around. If it isn't then we are done waiting		// Check if the private state thread is still around. If it isn't then we are done waiting
if (!m_private_state_thread.IsJoinable())		if (!PrivateStateThreadIsValid())
break; // Private state thread exited, we are done		break; // Private state thread exited or is exiting, we are done
		}
}		}
}		}

if (signal == eBroadcastInternalStateControlStop)		if (signal == eBroadcastInternalStateControlStop)
{		{
thread_result_t result = NULL;		thread_result_t result = NULL;
private_state_thread.Join(&result);		m_private_state_thread.Join(&result);
		m_private_state_thread.Reset();
}		}
}		}
else		else
{		{
if (log)		if (log)
log->Printf ("Private state thread already dead, no need to signal it to stop.");		log->Printf("Private state thread already dead, no need to signal it to stop.");
}		}
}		}
}

void		void
Process::SendAsyncInterrupt ()		Process::SendAsyncInterrupt ()
{		{
if (PrivateStateThreadIsValid())		if (PrivateStateThreadIsValid())
m_private_state_broadcaster.BroadcastEvent(Process::eBroadcastBitInterrupt, nullptr);		m_private_state_broadcaster.BroadcastEvent(Process::eBroadcastBitInterrupt, nullptr);
else		else
BroadcastEvent(Process::eBroadcastBitInterrupt, nullptr);		BroadcastEvent(Process::eBroadcastBitInterrupt, nullptr);
▲ Show 20 Lines • Show All 275 Lines • ▼ Show 20 Lines	if (log)
log->Printf ("Process::%s (arg = %p, pid = %" PRIu64 ") thread exiting...",		log->Printf ("Process::%s (arg = %p, pid = %" PRIu64 ") thread exiting...",
__FUNCTION__, static_cast<void*>(this), GetID());		__FUNCTION__, static_cast<void*>(this), GetID());

// If we are a secondary thread, then the primary thread we are working for will have already		// If we are a secondary thread, then the primary thread we are working for will have already
// acquired the public_run_lock, and isn't done with what it was doing yet, so don't		// acquired the public_run_lock, and isn't done with what it was doing yet, so don't
// try to change it on the way out.		// try to change it on the way out.
if (!is_secondary_thread)		if (!is_secondary_thread)
m_public_run_lock.SetStopped();		m_public_run_lock.SetStopped();
m_private_state_thread.Reset();
return NULL;		return NULL;
}		}

//------------------------------------------------------------------		//------------------------------------------------------------------
// Process Event Data		// Process Event Data
//------------------------------------------------------------------		//------------------------------------------------------------------

Process::ProcessEventData::ProcessEventData () :		Process::ProcessEventData::ProcessEventData () :
▲ Show 20 Lines • Show All 2,152 Lines • Show Last 20 Lines