This is an archive of the discontinued LLVM Phabricator instance.

Differential D101517

tsan: refactor fork handling
ClosedPublic

Authored by dvyukov on Apr 29 2021, 3:10 AM.

Download Raw Diff

Details

Reviewers

vitalybuka

Commits

rGed7bf7d73fa2: tsan: refactor fork handling

Summary

Commit efd254b6362 ("tsan: fix deadlock in pthread_atfork callbacks")
fixed another deadlock related to atfork handling.
But builders with DCHECKs enabled reported failures of
pthread_atfork_deadlock2.c and pthread_atfork_deadlock3.c tests
related to the fact that we hold runtime locks on interceptor exit:
https://lab.llvm.org/buildbot/#/builders/70/builds/6727
This issue is somewhat inherent to the current approach,
we indeed execute user code (atfork callbacks) with runtime lock held.

Refactor fork handling to not run user code (atfork callbacks)
with runtime locks held. This change does this by installing
own atfork callbacks during runtime initialization.
Atfork callbacks run in LIFO order, so the expectation is that
our callbacks run last, right before the actual fork.
This way we lock runtime mutexes around fork, but not around
user callbacks.

Extend tests to also install after fork callbacks just to cover
more scenarios. Some tests also started reporting real races
that we previously suppressed.

Also extend tests to cover fork syscall support.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

dvyukov created this revision.Apr 29 2021, 3:10 AM

Herald added a subscriber: jfb. · View Herald TranscriptApr 29 2021, 3:10 AM

dvyukov requested review of this revision.Apr 29 2021, 3:10 AM

Herald added a project: Restricted Project. · View Herald TranscriptApr 29 2021, 3:10 AM

Herald added a subscriber: Restricted Project. · View Herald Transcript

Third attempt...
Problem with the previous version manifested on an internal codebase and was related to fork syscall hooks. I've fixed it and added the test (test/tsan/Linux/fork_syscall.cpp).

Harbormaster completed remote builds in B101596: Diff 341454.Apr 29 2021, 3:58 AM

In D101517#2725127, @dvyukov wrote:

Third attempt...
Problem with the previous version manifested on an internal codebase and was related to fork syscall hooks. I've fixed it and added the test (test/tsan/Linux/fork_syscall.cpp).

I usually reopen previous review after revert and upload new patch there. This way easier to see incremental changes.
However I applied the patch and it looks very different from the reverted one.

compiler-rt/lib/tsan/rtl/tsan_rtl.cpp
528	why now you don't need to update ignore_reads_and_writes as before?
compiler-rt/lib/tsan/rtl/tsan_rtl_report.cpp
145	I don't see this function being used anywhere

This revision is now accepted and ready to land.Apr 29 2021, 5:42 PM

In D101517#2727557, @vitalybuka wrote:

In D101517#2725127, @dvyukov wrote:

Third attempt...
Problem with the previous version manifested on an internal codebase and was related to fork syscall hooks. I've fixed it and added the test (test/tsan/Linux/fork_syscall.cpp).

I usually reopen previous review after revert and upload new patch there. This way easier to see incremental changes.
However I applied the patch and it looks very different from the reverted one.

Thanks for the tip, will try next time.
It was reverted twice, maybe you diffed with the first one.
The last one is https://reviews.llvm.org/D101385 it should be close.

dvyukov added inline comments.Apr 29 2021, 11:28 PM

compiler-rt/lib/tsan/rtl/tsan_rtl_report.cpp
145	You are right. I messed something re-applying the patch...

Re-upload original diff of https://reviews.llvm.org/D101385

upload diff on top of https://reviews.llvm.org/D101385

I've uploaded the reverted patch version as Diff 2, and then new changes as Diff 3.

dvyukov added inline comments.Apr 29 2021, 11:47 PM

compiler-rt/lib/tsan/rtl/tsan_rtl.cpp
528	The problem was when memory accesses actually tried to report a race, that deadlocked, and that's why we enabled ignore_reads_and_writes. However, the same deadlock can happen on any other reports (mutex misuses, signal spoiling errno, etc), so instead we now disable all reports with thr->suppress_reports++. Thus disabling memory accesses separately becomes unnecessary. At least all existing and new tests pass.

This revision was landed with ongoing or failed builds.Apr 29 2021, 11:48 PM

Closed by commit rGed7bf7d73fa2: tsan: refactor fork handling (authored by dvyukov). · Explain Why

This revision was automatically updated to reflect the committed changes.

dvyukov added a commit: rGed7bf7d73fa2: tsan: refactor fork handling.

Harbormaster completed remote builds in B101835: Diff 341777.Apr 30 2021, 12:03 AM

Harbormaster completed remote builds in B101837: Diff 341779.Apr 30 2021, 12:21 AM

Revision Contents

Path

Size

compiler-rt/

lib/

tsan/

rtl/

tsan_interceptors_posix.cpp

60 lines

2 lines

1 line

18 lines

6 lines

36 lines

2 lines

test/

tsan/

Linux/

fork_syscall.cpp

47 lines

pthread_atfork_deadlock.c

2 lines

pthread_atfork_deadlock2.c

10 lines

pthread_atfork_deadlock3.c

98 lines

Diff 341782

compiler-rt/lib/tsan/rtl/tsan_interceptors_posix.cpp

Show First 20 Lines • Show All 75 Lines • ▼ Show 20 Lines
#elif defined(__aarch64__) \|\| SANITIZER_PPC64V2		#elif defined(__aarch64__) \|\| SANITIZER_PPC64V2
#define PTHREAD_ABI_BASE "GLIBC_2.17"		#define PTHREAD_ABI_BASE "GLIBC_2.17"
#endif		#endif

extern "C" int pthread_attr_init(void *attr);		extern "C" int pthread_attr_init(void *attr);
extern "C" int pthread_attr_destroy(void *attr);		extern "C" int pthread_attr_destroy(void *attr);
DECLARE_REAL(int, pthread_attr_getdetachstate, void , void )		DECLARE_REAL(int, pthread_attr_getdetachstate, void , void )
extern "C" int pthread_attr_setstacksize(void *attr, uptr stacksize);		extern "C" int pthread_attr_setstacksize(void *attr, uptr stacksize);
		extern "C" int pthread_atfork(void (prepare)(void), void (parent)(void),
		void (*child)(void));
extern "C" int pthread_key_create(unsigned key, void (destructor)(void* v));		extern "C" int pthread_key_create(unsigned key, void (destructor)(void* v));
extern "C" int pthread_setspecific(unsigned key, const void *v);		extern "C" int pthread_setspecific(unsigned key, const void *v);
DECLARE_REAL(int, pthread_mutexattr_gettype, void , void )		DECLARE_REAL(int, pthread_mutexattr_gettype, void , void )
DECLARE_REAL(int, fflush, __sanitizer_FILE *fp)		DECLARE_REAL(int, fflush, __sanitizer_FILE *fp)
DECLARE_REAL_AND_INTERCEPTOR(void *, malloc, uptr size)		DECLARE_REAL_AND_INTERCEPTOR(void *, malloc, uptr size)
DECLARE_REAL_AND_INTERCEPTOR(void, free, void *ptr)		DECLARE_REAL_AND_INTERCEPTOR(void, free, void *ptr)
extern "C" void *pthread_self();		extern "C" void *pthread_self();
extern "C" void _exit(int status);		extern "C" void _exit(int status);
▲ Show 20 Lines • Show All 1,879 Lines • ▼ Show 20 Lines	static void CallUserSignalHandler(ThreadState *thr, bool sync, bool acquire,
}		}
// We do not detect errno spoiling for SIGTERM,		// We do not detect errno spoiling for SIGTERM,
// because some SIGTERM handlers do spoil errno but reraise SIGTERM,		// because some SIGTERM handlers do spoil errno but reraise SIGTERM,
// tsan reports false positive in such case.		// tsan reports false positive in such case.
// It's difficult to properly detect this situation (reraise),		// It's difficult to properly detect this situation (reraise),
// because in async signal processing case (when handler is called directly		// because in async signal processing case (when handler is called directly
// from rtl_generic_sighandler) we have not yet received the reraised		// from rtl_generic_sighandler) we have not yet received the reraised
// signal; and it looks too fragile to intercept all ways to reraise a signal.		// signal; and it looks too fragile to intercept all ways to reraise a signal.
if (flags()->report_bugs && !sync && sig != SIGTERM && errno != 99) {		if (ShouldReport(thr, ReportTypeErrnoInSignal) && !sync && sig != SIGTERM &&
		errno != 99) {
VarSizeStackTrace stack;		VarSizeStackTrace stack;
// StackTrace::GetNestInstructionPc(pc) is used because return address is		// StackTrace::GetNestInstructionPc(pc) is used because return address is
// expected, OutputReport() will undo this.		// expected, OutputReport() will undo this.
ObtainCurrentStack(thr, StackTrace::GetNextInstructionPc(pc), &stack);		ObtainCurrentStack(thr, StackTrace::GetNextInstructionPc(pc), &stack);
ThreadRegistryLock l(ctx->thread_registry);		ThreadRegistryLock l(ctx->thread_registry);
ScopedReport rep(ReportTypeErrnoInSignal);		ScopedReport rep(ReportTypeErrnoInSignal);
if (!IsFiredSuppression(ctx, ReportTypeErrnoInSignal, stack)) {		if (!IsFiredSuppression(ctx, ReportTypeErrnoInSignal, stack)) {
rep.AddStack(stack, true);		rep.AddStack(stack, true);
▲ Show 20 Lines • Show All 153 Lines • ▼ Show 20 Lines	TSAN_INTERCEPTOR(int, getaddrinfo, void node, void service,
ThreadIgnoreEnd(thr, pc);		ThreadIgnoreEnd(thr, pc);
return res;		return res;
}		}

TSAN_INTERCEPTOR(int, fork, int fake) {		TSAN_INTERCEPTOR(int, fork, int fake) {
if (in_symbolizer())		if (in_symbolizer())
return REAL(fork)(fake);		return REAL(fork)(fake);
SCOPED_INTERCEPTOR_RAW(fork, fake);		SCOPED_INTERCEPTOR_RAW(fork, fake);
		return REAL(fork)(fake);
		}

		void atfork_prepare() {
		if (in_symbolizer())
		return;
		ThreadState *thr = cur_thread();
		const uptr pc = StackTrace::GetCurrentPc();
ForkBefore(thr, pc);		ForkBefore(thr, pc);
int pid;
{
// On OS X, REAL(fork) can call intercepted functions (OSSpinLockLock), and
// we'll assert in CheckNoLocks() unless we ignore interceptors.
ScopedIgnoreInterceptors ignore;
pid = REAL(fork)(fake);
}		}
if (pid == 0) {
// child		void atfork_parent() {
ForkChildAfter(thr, pc);		if (in_symbolizer())
FdOnFork(thr, pc);		return;
} else if (pid > 0) {		ThreadState *thr = cur_thread();
// parent		const uptr pc = StackTrace::GetCurrentPc();
ForkParentAfter(thr, pc);
} else {
// error
ForkParentAfter(thr, pc);		ForkParentAfter(thr, pc);
}		}
return pid;
		void atfork_child() {
		if (in_symbolizer())
		return;
		ThreadState *thr = cur_thread();
		const uptr pc = StackTrace::GetCurrentPc();
		ForkChildAfter(thr, pc);
		FdOnFork(thr, pc);
}		}

TSAN_INTERCEPTOR(int, vfork, int fake) {		TSAN_INTERCEPTOR(int, vfork, int fake) {
// Some programs (e.g. openjdk) call close for all file descriptors		// Some programs (e.g. openjdk) call close for all file descriptors
// in the child process. Under tsan it leads to false positives, because		// in the child process. Under tsan it leads to false positives, because
// address space is shared, so the parent process also thinks that		// address space is shared, so the parent process also thinks that
// the descriptors are closed (while they are actually not).		// the descriptors are closed (while they are actually not).
// This leads to false positives due to missed synchronization.		// This leads to false positives due to missed synchronization.
▲ Show 20 Lines • Show All 336 Lines • ▼ Show 20 Lines
}		}

static USED void syscall_fd_release(uptr pc, int fd) {		static USED void syscall_fd_release(uptr pc, int fd) {
TSAN_SYSCALL();		TSAN_SYSCALL();
DPrintf("syscall_fd_release(%p)\n", fd);		DPrintf("syscall_fd_release(%p)\n", fd);
FdRelease(thr, pc, fd);		FdRelease(thr, pc, fd);
}		}

static void syscall_pre_fork(uptr pc) {		static void syscall_pre_fork(uptr pc) { ForkBefore(cur_thread(), pc); }
TSAN_SYSCALL();
ForkBefore(thr, pc);
}

static void syscall_post_fork(uptr pc, int pid) {		static void syscall_post_fork(uptr pc, int pid) {
TSAN_SYSCALL();		ThreadState *thr = cur_thread();
if (pid == 0) {		if (pid == 0) {
// child		// child
ForkChildAfter(thr, pc);		ForkChildAfter(thr, pc);
FdOnFork(thr, pc);		FdOnFork(thr, pc);
} else if (pid > 0) {		} else if (pid > 0) {
// parent		// parent
ForkParentAfter(thr, pc);		ForkParentAfter(thr, pc);
} else {		} else {
▲ Show 20 Lines • Show All 299 Lines • ▼ Show 20 Lines	#if !SANITIZER_MAC && !SANITIZER_ANDROID
// But atexit is emitted directly into the module, so can't be resolved.		// But atexit is emitted directly into the module, so can't be resolved.
REAL(atexit) = (int()(void()()))unreachable;		REAL(atexit) = (int()(void()()))unreachable;
#endif		#endif

if (REAL(__cxa_atexit)(&finalize, 0, 0)) {		if (REAL(__cxa_atexit)(&finalize, 0, 0)) {
Printf("ThreadSanitizer: failed to setup atexit callback\n");		Printf("ThreadSanitizer: failed to setup atexit callback\n");
Die();		Die();
}		}
		if (pthread_atfork(atfork_prepare, atfork_parent, atfork_child)) {
		Printf("ThreadSanitizer: failed to setup atfork callbacks\n");
		Die();
		}

#if !SANITIZER_MAC && !SANITIZER_NETBSD && !SANITIZER_FREEBSD		#if !SANITIZER_MAC && !SANITIZER_NETBSD && !SANITIZER_FREEBSD
if (pthread_key_create(&interceptor_ctx()->finalize_key, &thread_finalize)) {		if (pthread_key_create(&interceptor_ctx()->finalize_key, &thread_finalize)) {
Printf("ThreadSanitizer: failed to create thread key\n");		Printf("ThreadSanitizer: failed to create thread key\n");
Die();		Die();
}		}
#endif		#endif

▲ Show 20 Lines • Show All 58 Lines • Show Last 20 Lines

compiler-rt/lib/tsan/rtl/tsan_mman.cpp

	Show First 20 Lines • Show All 139 Lines • ▼ Show 20 Lines
	}			}

	void AllocatorPrintStats() {			void AllocatorPrintStats() {
	allocator()->PrintStats();			allocator()->PrintStats();
	}			}

	static void SignalUnsafeCall(ThreadState *thr, uptr pc) {			static void SignalUnsafeCall(ThreadState *thr, uptr pc) {
	if (atomic_load_relaxed(&thr->in_signal_handler) == 0 \|\|			if (atomic_load_relaxed(&thr->in_signal_handler) == 0 \|\|
	!flags()->report_signal_unsafe)			!ShouldReport(thr, ReportTypeSignalUnsafe))
	return;			return;
	VarSizeStackTrace stack;			VarSizeStackTrace stack;
	ObtainCurrentStack(thr, pc, &stack);			ObtainCurrentStack(thr, pc, &stack);
	if (IsFiredSuppression(ctx, ReportTypeSignalUnsafe, stack))			if (IsFiredSuppression(ctx, ReportTypeSignalUnsafe, stack))
	return;			return;
	ThreadRegistryLock l(ctx->thread_registry);			ThreadRegistryLock l(ctx->thread_registry);
	ScopedReport rep(ReportTypeSignalUnsafe);			ScopedReport rep(ReportTypeSignalUnsafe);
	rep.AddStack(stack, true);			rep.AddStack(stack, true);
	▲ Show 20 Lines • Show All 249 Lines • Show Last 20 Lines

compiler-rt/lib/tsan/rtl/tsan_rtl.h

	Show First 20 Lines • Show All 618 Lines • ▼ Show 20 Lines
	public:			public:
	explicit ScopedReport(ReportType typ, uptr tag = kExternalTagNone);			explicit ScopedReport(ReportType typ, uptr tag = kExternalTagNone);
	~ScopedReport();			~ScopedReport();

	private:			private:
	ScopedErrorReportLock lock_;			ScopedErrorReportLock lock_;
	};			};

				bool ShouldReport(ThreadState *thr, ReportType typ);
	ThreadContext IsThreadStackOrTls(uptr addr, bool is_stack);			ThreadContext IsThreadStackOrTls(uptr addr, bool is_stack);
	void RestoreStack(int tid, const u64 epoch, VarSizeStackTrace *stk,			void RestoreStack(int tid, const u64 epoch, VarSizeStackTrace *stk,
	MutexSet mset, uptr tag = nullptr);			MutexSet mset, uptr tag = nullptr);

	// The stack could look like:			// The stack could look like:
	// <start> \| <main> \| <foo> \| tag \| <bar>			// <start> \| <main> \| <foo> \| tag \| <bar>
	// This will extract the tag and keep:			// This will extract the tag and keep:
	// <start> \| <main> \| <foo> \| <bar>			// <start> \| <main> \| <foo> \| <bar>
	▲ Show 20 Lines • Show All 259 Lines • Show Last 20 Lines

compiler-rt/lib/tsan/rtl/tsan_rtl.cpp

Show First 20 Lines • Show All 513 Lines • ▼ Show 20 Lines	#endif

return failed ? common_flags()->exitcode : 0;		return failed ? common_flags()->exitcode : 0;
}		}

#if !SANITIZER_GO		#if !SANITIZER_GO
void ForkBefore(ThreadState *thr, uptr pc) {		void ForkBefore(ThreadState *thr, uptr pc) {
ctx->thread_registry->Lock();		ctx->thread_registry->Lock();
ctx->report_mtx.Lock();		ctx->report_mtx.Lock();
// Ignore memory accesses in the pthread_atfork callbacks.		// Suppress all reports in the pthread_atfork callbacks.
// If any of them triggers a data race we will deadlock		// Reports will deadlock on the report_mtx.
// on the report_mtx.		// We could ignore sync operations as well,
// We could ignore interceptors and sync operations as well,
// but so far it's unclear if it will do more good or harm.		// but so far it's unclear if it will do more good or harm.
// Unnecessarily ignoring things can lead to false positives later.		// Unnecessarily ignoring things can lead to false positives later.
ThreadIgnoreBegin(thr, pc);		thr->suppress_reports++;
		// On OS X, REAL(fork) can call intercepted functions (OSSpinLockLock), and
		vitalybukaUnsubmitted Not Done Reply Inline Actions why now you don't need to update ignore_reads_and_writes as before? vitalybuka: why now you don't need to update ignore_reads_and_writes as before?
		dvyukovAuthorUnsubmitted Done Reply Inline Actions The problem was when memory accesses actually tried to report a race, that deadlocked, and that's why we enabled ignore_reads_and_writes. However, the same deadlock can happen on any other reports (mutex misuses, signal spoiling errno, etc), so instead we now disable all reports with thr->suppress_reports++. Thus disabling memory accesses separately becomes unnecessary. At least all existing and new tests pass. dvyukov: The problem was when memory accesses actually tried to report a race, that deadlocked, and…
		// we'll assert in CheckNoLocks() unless we ignore interceptors.
		thr->ignore_interceptors++;
}		}

void ForkParentAfter(ThreadState *thr, uptr pc) {		void ForkParentAfter(ThreadState *thr, uptr pc) {
ThreadIgnoreEnd(thr, pc); // Begin is in ForkBefore.		thr->suppress_reports--; // Enabled in ForkBefore.
		thr->ignore_interceptors--;
ctx->report_mtx.Unlock();		ctx->report_mtx.Unlock();
ctx->thread_registry->Unlock();		ctx->thread_registry->Unlock();
}		}

void ForkChildAfter(ThreadState *thr, uptr pc) {		void ForkChildAfter(ThreadState *thr, uptr pc) {
ThreadIgnoreEnd(thr, pc); // Begin is in ForkBefore.		thr->suppress_reports--; // Enabled in ForkBefore.
		thr->ignore_interceptors--;
ctx->report_mtx.Unlock();		ctx->report_mtx.Unlock();
ctx->thread_registry->Unlock();		ctx->thread_registry->Unlock();

uptr nthread = 0;		uptr nthread = 0;
ctx->thread_registry->GetNumberOfThreads(0, 0, &nthread /* alive threads */);		ctx->thread_registry->GetNumberOfThreads(0, 0, &nthread /* alive threads */);
VPrintf(1, "ThreadSanitizer: forked new process with pid %d,"		VPrintf(1, "ThreadSanitizer: forked new process with pid %d,"
" parent had %d threads\n", (int)internal_getpid(), (int)nthread);		" parent had %d threads\n", (int)internal_getpid(), (int)nthread);
if (nthread == 1) {		if (nthread == 1) {
▲ Show 20 Lines • Show All 605 Lines • Show Last 20 Lines

compiler-rt/lib/tsan/rtl/tsan_rtl_mutex.cpp

Show First 20 Lines • Show All 45 Lines • ▼ Show 20 Lines
}		}

static void ReportMutexMisuse(ThreadState *thr, uptr pc, ReportType typ,		static void ReportMutexMisuse(ThreadState *thr, uptr pc, ReportType typ,
uptr addr, u64 mid) {		uptr addr, u64 mid) {
// In Go, these misuses are either impossible, or detected by std lib,		// In Go, these misuses are either impossible, or detected by std lib,
// or false positives (e.g. unlock in a different thread).		// or false positives (e.g. unlock in a different thread).
if (SANITIZER_GO)		if (SANITIZER_GO)
return;		return;
		if (!ShouldReport(thr, typ))
		return;
ThreadRegistryLock l(ctx->thread_registry);		ThreadRegistryLock l(ctx->thread_registry);
ScopedReport rep(typ);		ScopedReport rep(typ);
rep.AddMutex(mid);		rep.AddMutex(mid);
VarSizeStackTrace trace;		VarSizeStackTrace trace;
ObtainCurrentStack(thr, pc, &trace);		ObtainCurrentStack(thr, pc, &trace);
rep.AddStack(trace, true);		rep.AddStack(trace, true);
rep.AddLocation(addr, 1);		rep.AddLocation(addr, 1);
OutputReport(thr, rep);		OutputReport(thr, rep);
Show All 40 Lines	if (flags()->report_destroy_locked
s->SetFlags(MutexFlagBroken);		s->SetFlags(MutexFlagBroken);
unlock_locked = true;		unlock_locked = true;
}		}
u64 mid = s->GetId();		u64 mid = s->GetId();
u64 last_lock = s->last_lock;		u64 last_lock = s->last_lock;
if (!unlock_locked)		if (!unlock_locked)
s->Reset(thr->proc()); // must not reset it before the report is printed		s->Reset(thr->proc()); // must not reset it before the report is printed
s->mtx.Unlock();		s->mtx.Unlock();
if (unlock_locked) {		if (unlock_locked && ShouldReport(thr, ReportTypeMutexDestroyLocked)) {
ThreadRegistryLock l(ctx->thread_registry);		ThreadRegistryLock l(ctx->thread_registry);
ScopedReport rep(ReportTypeMutexDestroyLocked);		ScopedReport rep(ReportTypeMutexDestroyLocked);
rep.AddMutex(mid);		rep.AddMutex(mid);
VarSizeStackTrace trace;		VarSizeStackTrace trace;
ObtainCurrentStack(thr, pc, &trace);		ObtainCurrentStack(thr, pc, &trace);
rep.AddStack(trace, true);		rep.AddStack(trace, true);
FastState last(last_lock);		FastState last(last_lock);
RestoreStack(last.tid(), last.epoch(), &trace, 0);		RestoreStack(last.tid(), last.epoch(), &trace, 0);
▲ Show 20 Lines • Show All 410 Lines • ▼ Show 20 Lines	void AcquireReleaseImpl(ThreadState thr, uptr pc, SyncClock c) {
thr->clock.set(thr->fast_state.epoch());		thr->clock.set(thr->fast_state.epoch());
thr->fast_synch_epoch = thr->fast_state.epoch();		thr->fast_synch_epoch = thr->fast_state.epoch();
thr->clock.acq_rel(&thr->proc()->clock_cache, c);		thr->clock.acq_rel(&thr->proc()->clock_cache, c);
StatInc(thr, StatSyncAcquire);		StatInc(thr, StatSyncAcquire);
StatInc(thr, StatSyncRelease);		StatInc(thr, StatSyncRelease);
}		}

void ReportDeadlock(ThreadState thr, uptr pc, DDReport r) {		void ReportDeadlock(ThreadState thr, uptr pc, DDReport r) {
if (r == 0)		if (r == 0 \|\| !ShouldReport(thr, ReportTypeDeadlock))
return;		return;
ThreadRegistryLock l(ctx->thread_registry);		ThreadRegistryLock l(ctx->thread_registry);
ScopedReport rep(ReportTypeDeadlock);		ScopedReport rep(ReportTypeDeadlock);
for (int i = 0; i < r->n; i++) {		for (int i = 0; i < r->n; i++) {
rep.AddMutex(r->loop[i].mtx_ctx0);		rep.AddMutex(r->loop[i].mtx_ctx0);
rep.AddUniqueTid((int)r->loop[i].thr_ctx);		rep.AddUniqueTid((int)r->loop[i].thr_ctx);
rep.AddThread((int)r->loop[i].thr_ctx);		rep.AddThread((int)r->loop[i].thr_ctx);
}		}
Show All 17 Lines

compiler-rt/lib/tsan/rtl/tsan_rtl_report.cpp

Show First 20 Lines • Show All 136 Lines • ▼ Show 20 Lines	static ReportStack *SymbolizeStack(StackTrace trace) {
}		}
StackStripMain(top);		StackStripMain(top);

ReportStack *stack = ReportStack::New();		ReportStack *stack = ReportStack::New();
stack->frames = top;		stack->frames = top;
return stack;		return stack;
}		}

		bool ShouldReport(ThreadState *thr, ReportType typ) {
		vitalybukaUnsubmitted Not Done Reply Inline Actions I don't see this function being used anywhere vitalybuka: I don't see this function being used anywhere
		dvyukovAuthorUnsubmitted Done Reply Inline Actions You are right. I messed something re-applying the patch... dvyukov: You are right. I messed something re-applying the patch...
		// We set thr->suppress_reports in the fork context.
		// Taking any locking in the fork context can lead to deadlocks.
		// If any locks are already taken, it's too late to do this check.
		CheckNoLocks(thr);
		// For the same reason check we didn't lock thread_registry yet.
		if (SANITIZER_DEBUG)
		ThreadRegistryLock l(ctx->thread_registry);
		if (!flags()->report_bugs \|\| thr->suppress_reports)
		return false;
		switch (typ) {
		case ReportTypeSignalUnsafe:
		return flags()->report_signal_unsafe;
		case ReportTypeThreadLeak:
		#if !SANITIZER_GO
		// It's impossible to join phantom threads
		// in the child after fork.
		if (ctx->after_multithreaded_fork)
		return false;
		#endif
		return flags()->report_thread_leaks;
		case ReportTypeMutexDestroyLocked:
		return flags()->report_destroy_locked;
		default:
		return true;
		}
		}

ScopedReportBase::ScopedReportBase(ReportType typ, uptr tag) {		ScopedReportBase::ScopedReportBase(ReportType typ, uptr tag) {
ctx->thread_registry->CheckLocked();		ctx->thread_registry->CheckLocked();
void *mem = internal_alloc(MBlockReport, sizeof(ReportDesc));		void *mem = internal_alloc(MBlockReport, sizeof(ReportDesc));
rep_ = new(mem) ReportDesc;		rep_ = new(mem) ReportDesc;
rep_->typ = typ;		rep_->typ = typ;
rep_->tag = tag;		rep_->tag = tag;
ctx->report_mtx.Lock();		ctx->report_mtx.Lock();
}		}
▲ Show 20 Lines • Show All 339 Lines • ▼ Show 20 Lines	static bool HandleRacyAddress(ThreadState *thr, uptr addr_min, uptr addr_max) {
Lock lock(&ctx->racy_mtx);		Lock lock(&ctx->racy_mtx);
if (FindRacyAddress(ra0))		if (FindRacyAddress(ra0))
return true;		return true;
ctx->racy_addresses.PushBack(ra0);		ctx->racy_addresses.PushBack(ra0);
return false;		return false;
}		}

bool OutputReport(ThreadState *thr, const ScopedReport &srep) {		bool OutputReport(ThreadState *thr, const ScopedReport &srep) {
if (!flags()->report_bugs \|\| thr->suppress_reports)		// These should have been checked in ShouldReport.
return false;		// It's too late to check them here, we have already taken locks.
		CHECK(flags()->report_bugs);
		CHECK(!thr->suppress_reports);
atomic_store_relaxed(&ctx->last_symbolize_time_ns, NanoTime());		atomic_store_relaxed(&ctx->last_symbolize_time_ns, NanoTime());
const ReportDesc *rep = srep.GetReport();		const ReportDesc *rep = srep.GetReport();
CHECK_EQ(thr->current_report, nullptr);		CHECK_EQ(thr->current_report, nullptr);
thr->current_report = rep;		thr->current_report = rep;
Suppression *supp = 0;		Suppression *supp = 0;
uptr pc_or_addr = 0;		uptr pc_or_addr = 0;
for (uptr i = 0; pc_or_addr == 0 && i < rep->mops.Size(); i++)		for (uptr i = 0; pc_or_addr == 0 && i < rep->mops.Size(); i++)
pc_or_addr = IsSuppressed(rep->typ, rep->mops[i]->stack, &supp);		pc_or_addr = IsSuppressed(rep->typ, rep->mops[i]->stack, &supp);
▲ Show 20 Lines • Show All 74 Lines • ▼ Show 20 Lines

void ReportRace(ThreadState *thr) {		void ReportRace(ThreadState *thr) {
CheckNoLocks(thr);		CheckNoLocks(thr);

// Symbolizer makes lots of intercepted calls. If we try to process them,		// Symbolizer makes lots of intercepted calls. If we try to process them,
// at best it will cause deadlocks on internal mutexes.		// at best it will cause deadlocks on internal mutexes.
ScopedIgnoreInterceptors ignore;		ScopedIgnoreInterceptors ignore;

if (!flags()->report_bugs)		if (!ShouldReport(thr, ReportTypeRace))
return;		return;
if (!flags()->report_atomic_races && !RaceBetweenAtomicAndFree(thr))		if (!flags()->report_atomic_races && !RaceBetweenAtomicAndFree(thr))
return;		return;

bool freed = false;		bool freed = false;
{		{
Shadow s(thr->racy_state[1]);		Shadow s(thr->racy_state[1]);
freed = s.GetFreedAndReset();		freed = s.GetFreedAndReset();
▲ Show 20 Lines • Show All 152 Lines • Show Last 20 Lines

compiler-rt/lib/tsan/rtl/tsan_rtl_thread.cpp

	Show First 20 Lines • Show All 204 Lines • ▼ Show 20 Lines
	}			}
	#else			#else
	static void ThreadCheckIgnore(ThreadState *thr) {}			static void ThreadCheckIgnore(ThreadState *thr) {}
	#endif			#endif

	void ThreadFinalize(ThreadState *thr) {			void ThreadFinalize(ThreadState *thr) {
	ThreadCheckIgnore(thr);			ThreadCheckIgnore(thr);
	#if !SANITIZER_GO			#if !SANITIZER_GO
	if (!flags()->report_thread_leaks)			if (!ShouldReport(thr, ReportTypeThreadLeak))
	return;			return;
	ThreadRegistryLock l(ctx->thread_registry);			ThreadRegistryLock l(ctx->thread_registry);
	Vector<ThreadLeak> leaks;			Vector<ThreadLeak> leaks;
	ctx->thread_registry->RunCallbackForEachThreadLocked(			ctx->thread_registry->RunCallbackForEachThreadLocked(
	MaybeReportThreadLeak, &leaks);			MaybeReportThreadLeak, &leaks);
	for (uptr i = 0; i < leaks.Size(); i++) {			for (uptr i = 0; i < leaks.Size(); i++) {
	ScopedReport rep(ReportTypeThreadLeak);			ScopedReport rep(ReportTypeThreadLeak);
	rep.AddThread(leaks[i].tctx, true);			rep.AddThread(leaks[i].tctx, true);
	▲ Show 20 Lines • Show All 241 Lines • Show Last 20 Lines

compiler-rt/test/tsan/Linux/fork_syscall.cpp

This file was added.

				// RUN: %clangxx_tsan -O1 %s -o %t && %env_tsan_opts=atexit_sleep_ms=50 %run %t 2>&1 \| FileCheck %s
				#include "../test.h"
				#include <errno.h>
				#include <sanitizer/linux_syscall_hooks.h>
				#include <sys/syscall.h>
				#include <sys/types.h>
				#include <sys/wait.h>

				int counter;

				static void incrementer(void p) {
				for (;;)
				__sync_fetch_and_add(&counter, 1);
				return 0;
				}

				int myfork() {
				__sanitizer_syscall_pre_fork();
				int res = syscall(SYS_fork);
				__sanitizer_syscall_post_fork(res);
				return res;
				}

				int main() {
				barrier_init(&barrier, 2);
				pthread_t th1;
				pthread_create(&th1, 0, incrementer, 0);
				for (int i = 0; i < 10; i++) {
				switch (myfork()) {
				default: // parent
				while (wait(0) < 0) {
				}
				fprintf(stderr, ".");
				break;
				case 0: // child
				__sync_fetch_and_add(&counter, 1);
				exit(0);
				break;
				case -1: // error
				fprintf(stderr, "failed to fork (%d)\n", errno);
				exit(1);
				}
				}
				fprintf(stderr, "OK\n");
				}

				// CHECK: OK

compiler-rt/test/tsan/pthread_atfork_deadlock.c

	Show All 15 Lines

	void atfork() {			void atfork() {
	fprintf(stderr, "ATFORK\n");			fprintf(stderr, "ATFORK\n");
	glob++;			glob++;
	}			}

	int main() {			int main() {
	barrier_init(&barrier, 2);			barrier_init(&barrier, 2);
	pthread_atfork(atfork, NULL, NULL);			pthread_atfork(atfork, atfork, atfork);
	pthread_t t;			pthread_t t;
	pthread_create(&t, NULL, worker, NULL);			pthread_create(&t, NULL, worker, NULL);
	glob++;			glob++;
	barrier_wait(&barrier);			barrier_wait(&barrier);
	pthread_join(t, NULL);			pthread_join(t, NULL);
	// CHECK: ThreadSanitizer: data race			// CHECK: ThreadSanitizer: data race
	// CHECK-NOT: ATFORK			// CHECK-NOT: ATFORK
	return 0;			return 0;
	}			}

compiler-rt/test/tsan/pthread_atfork_deadlock2.c

	// RUN: %clang_tsan -O1 %s -o %t && %run %t 2>&1 \| FileCheck %s			// RUN: %clang_tsan -O1 %s -o %t && %deflake %run %t \| FileCheck %s
	// Regression test for			// Regression test for
	// https://groups.google.com/d/msg/thread-sanitizer/e_zB9gYqFHM/DmAiTsrLAwAJ			// https://groups.google.com/d/msg/thread-sanitizer/e_zB9gYqFHM/DmAiTsrLAwAJ
	// pthread_atfork() callback triggers a data race and we deadlocked			// pthread_atfork() callback triggers a data race and we deadlocked
	// on the report_mtx as we lock it around fork.			// on the report_mtx as we lock it around fork.
	#include "test.h"			#include "test.h"
	#include <sys/types.h>			#include <sys/types.h>
	#include <sys/wait.h>			#include <sys/wait.h>
	#include <errno.h>			#include <errno.h>

	int glob = 0;			int glob = 0;

	void worker(void unused) {			void worker(void unused) {
	glob++;			glob++;
	barrier_wait(&barrier);			barrier_wait(&barrier);
	return NULL;			return NULL;
	}			}

	void atfork() {			void atfork() {
	glob++;			glob++;
	}			}

	int main() {			int main() {
	barrier_init(&barrier, 2);			barrier_init(&barrier, 2);
	pthread_atfork(atfork, NULL, NULL);			pthread_atfork(atfork, atfork, atfork);
	pthread_t t;			pthread_t t;
	pthread_create(&t, NULL, worker, NULL);			pthread_create(&t, NULL, worker, NULL);
	barrier_wait(&barrier);			barrier_wait(&barrier);
	pid_t pid = fork();			pid_t pid = fork();
	if (pid < 0) {			if (pid < 0) {
	fprintf(stderr, "fork failed: %d\n", errno);			fprintf(stderr, "fork failed: %d\n", errno);
	return 1;			return 1;
	}			}
	if (pid == 0) {			if (pid == 0) {
	fprintf(stderr, "CHILD\n");			fprintf(stderr, "CHILD\n");
	return 0;			return 0;
	}			}
	if (pid != waitpid(pid, NULL, 0)) {			if (pid != waitpid(pid, NULL, 0)) {
	fprintf(stderr, "waitpid failed: %d\n", errno);			fprintf(stderr, "waitpid failed: %d\n", errno);
	return 1;			return 1;
	}			}
	pthread_join(t, NULL);			pthread_join(t, NULL);
	fprintf(stderr, "PARENT\n");			fprintf(stderr, "PARENT\n");
	return 0;			return 0;
	}			}

	// CHECK-NOT: ThreadSanitizer: data race			// CHECK: ThreadSanitizer: data race
				// CHECK: Write of size 4
				// CHECK: #0 atfork
				// CHECK: Previous write of size 4
				// CHECK: #0 worker
	// CHECK: CHILD			// CHECK: CHILD
	// CHECK: PARENT			// CHECK: PARENT

compiler-rt/test/tsan/pthread_atfork_deadlock3.c

This file was added.

				// RUN: %clang_tsan -O1 %s -o %t && %deflake %run %t \| FileCheck %s
				// Regression test for
				// https://groups.google.com/g/thread-sanitizer/c/TQrr4-9PRYo/m/HFR4FMi6AQAJ
				#include "test.h"
				#include <sys/types.h>
				#include <sys/wait.h>
				#include <errno.h>
				#include <string.h>
				#include <signal.h>

				long glob = 0;

				void worker(void main) {
				glob++;
				// synchronize with main
				barrier_wait(&barrier);
				// synchronize with atfork
				barrier_wait(&barrier);
				pthread_kill((pthread_t)main, SIGPROF);
				barrier_wait(&barrier);
				// synchronize with afterfork
				barrier_wait(&barrier);
				pthread_kill((pthread_t)main, SIGPROF);
				barrier_wait(&barrier);
				return NULL;
				}

				void atfork() {
				barrier_wait(&barrier);
				barrier_wait(&barrier);
				write(2, "in atfork\n", strlen("in atfork\n"));
				static volatile long a;
				__atomic_fetch_add(&a, 1, __ATOMIC_RELEASE);
				}

				void afterfork() {
				barrier_wait(&barrier);
				barrier_wait(&barrier);
				write(2, "in afterfork\n", strlen("in afterfork\n"));
				static volatile long a;
				__atomic_fetch_add(&a, 1, __ATOMIC_RELEASE);
				}

				void afterfork_child() {
				// Can't synchronize with barriers because we are
				// in the new process, but want consistent output.
				sleep(1);
				write(2, "in afterfork_child\n", strlen("in afterfork_child\n"));
				glob++;
				}

				void handler(int sig) {
				write(2, "in handler\n", strlen("in handler\n"));
				glob++;
				}

				int main() {
				barrier_init(&barrier, 2);
				struct sigaction act = {};
				act.sa_handler = &handler;
				if (sigaction(SIGPROF, &act, 0)) {
				perror("sigaction");
				exit(1);
				}
				pthread_atfork(atfork, afterfork, afterfork_child);
				pthread_t t;
				pthread_create(&t, NULL, worker, (void*)pthread_self());
				barrier_wait(&barrier);
				pid_t pid = fork();
				if (pid < 0) {
				fprintf(stderr, "fork failed: %d\n", errno);
				return 1;
				}
				if (pid == 0) {
				fprintf(stderr, "CHILD\n");
				return 0;
				}
				if (pid != waitpid(pid, NULL, 0)) {
				fprintf(stderr, "waitpid failed: %d\n", errno);
				return 1;
				}
				pthread_join(t, NULL);
				fprintf(stderr, "PARENT\n");
				return 0;
				}

				// CHECK: in atfork
				// CHECK: in handler
				// CHECK: ThreadSanitizer: data race
				// CHECK: Write of size 8
				// CHECK: #0 handler
				// CHECK: Previous write of size 8
				// CHECK: #0 worker
				// CHECK: afterfork
				// CHECK: in handler
				// CHECK: afterfork_child
				// CHECK: CHILD
				// CHECK: PARENT

This is an archive of the discontinued LLVM Phabricator instance.

tsan: refactor fork handlingClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 341782

compiler-rt/lib/tsan/rtl/tsan_interceptors_posix.cpp

compiler-rt/lib/tsan/rtl/tsan_mman.cpp

compiler-rt/lib/tsan/rtl/tsan_rtl.h

compiler-rt/lib/tsan/rtl/tsan_rtl.cpp

compiler-rt/lib/tsan/rtl/tsan_rtl_mutex.cpp

compiler-rt/lib/tsan/rtl/tsan_rtl_report.cpp

compiler-rt/lib/tsan/rtl/tsan_rtl_thread.cpp

compiler-rt/test/tsan/Linux/fork_syscall.cpp

compiler-rt/test/tsan/pthread_atfork_deadlock.c

compiler-rt/test/tsan/pthread_atfork_deadlock2.c

compiler-rt/test/tsan/pthread_atfork_deadlock3.c

tsan: refactor fork handling
ClosedPublic