This is an archive of the discontinued LLVM Phabricator instance.

[TSAN] Honor failure memory orders in AtomicCAS
ClosedPublic

Authored by bruno on Mar 26 2021, 12:12 PM.

Details

Summary

LLVM has lifted strong requirements for CAS failure memory orders in 431e3138a and 819e0d105e84.

Add support for honoring them in AtomicCAS.

https://github.com/google/sanitizers/issues/970
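
For readers outside the review, a minimal sketch (not taken from the patch) of the two memory orders a compare-exchange carries in C++; the failure order is the one this change teaches the runtime to honor:

// Illustration only: success and failure orders are specified separately.
// If the CAS fails, the operation is just a load and the failure order
// (memory_order_acquire here) governs its synchronization.
#include <atomic>

bool try_claim(std::atomic<int> &flag) {
  int expected = 0;
  return flag.compare_exchange_strong(expected, 1,
                                      std::memory_order_release,   // mo: success
                                      std::memory_order_acquire);  // fmo: failure
}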

Diff Detail

Event Timeline

bruno created this revision.Mar 26 2021, 12:12 PM
bruno requested review of this revision.Mar 26 2021, 12:12 PM
Herald added a project: Restricted Project. · View Herald TranscriptMar 26 2021, 12:12 PM
bruno added a reviewer: yln.
delcypher added a reviewer: kubamracek.

@bruno Thanks for the patch. TSan's runtime isn't my specialty so I've added other reviewers.

compiler-rt/lib/tsan/rtl/tsan_interface_atomic.cpp
444

Is this something user-facing code can set? If yes, then we might want to emit a warning rather than crashing the process.

yln accepted this revision.Apr 6 2021, 9:38 AM

I am happy with the mechanics and quality of the patch. Ideally @dvyukov could give a final sign-off.

compiler-rt/lib/tsan/rtl/tsan_interface_atomic.cpp
444

I think the calls are generated by the compiler in ThreadSanitizer::instrumentAtomic():

Value *Args[] = {IRB.CreatePointerCast(Addr, PtrTy),
                 CmpOperand,
                 NewOperand,
                 createOrdering(&IRB, CASI->getSuccessOrdering()),
                 createOrdering(&IRB, CASI->getFailureOrdering())};
CallInst *C = IRB.CreateCall(TsanAtomicCAS[Idx], Args);
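
For context, a hedged sketch of what such an instrumented call boils down to at the source level; the real declaration lives in tsan_interface_atomic.h and differs slightly (typed a32 arguments and a __tsan_memory_order enum), so the shape below is only illustrative:

// Rough sketch (not the real declaration): the 32-bit entry point that the
// pass above calls receives both the success and the failure order.
extern "C" int __tsan_atomic32_compare_exchange_strong(
    volatile int *a, int *c, int v, int mo, int fmo);

void lowered_example(volatile int *addr) {
  int expected = 0;
  // Orders assumed to follow the C++ numbering: 3 == release (mo), 2 == acquire (fmo).
  __tsan_atomic32_compare_exchange_strong(addr, &expected, 1, /*mo=*/3, /*fmo=*/2);
}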
This revision is now accepted and ready to land.Apr 6 2021, 9:38 AM
bruno added a comment.Apr 6 2021, 5:33 PM

Thanks to both of you for the review, @delcypher and @yln; I'll wait for @dvyukov to sign off!

compiler-rt/lib/tsan/rtl/tsan_interface_atomic.cpp
444

Right, but before things are expanded for instrumentation, there's some sanitization of the failure memory order in emitAtomicCmpXchgFailureSet, so an invalid combination shouldn't get to this point.
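
A hedged sketch of the effect being described (this is not the actual clang/LLVM code, just the mapping an invalid failure order is clamped to under the usual rules):

// Sketch only: a failed CAS is a load, so release-type failure orders get
// stripped down before reaching the runtime; the rest pass through unchanged.
enum Order { Relaxed, Consume, Acquire, Release, AcqRel, SeqCst };

static Order sanitizedFailureOrder(Order fmo) {
  switch (fmo) {
  case Release: return Relaxed;  // nothing to release on the failure path
  case AcqRel:  return Acquire;  // keep only the acquire half
  default:      return fmo;      // relaxed/consume/acquire/seq_cst stay as-is
  }
}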

dvyukov added inline comments.Apr 14 2021, 1:40 AM
compiler-rt/lib/tsan/rtl/tsan_interface_atomic.cpp
445

Can't fmo be consume/acq_rel/seq_cst? If yes, please add a test.

447

I think we should not search for the object and re-acquire the mutex a second time, both for performance reasons and for complexity (we won't need to re-read the value).
Note that we may have already acquired due to mo. We should not acquire a second time in that case.

454

I don't think this is correct. Can't this lead to false positives?
Consider that a thread does a CAS-release to hand off the object to another thread, and that thread frees the object. I think this memory access can race with the free. It's generally not OK to touch memory after atomic operations.
If it leads to a false positive, please add a test that catches it.
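
A rough sketch of the hand-off scenario described above (illustrative only, not the test that was eventually added to the patch): if the runtime touches the atomic's memory after the releasing CAS, that access can race with the consumer's free.

#include <atomic>
#include <pthread.h>

struct Obj { std::atomic<long> state{0}; };
static Obj *obj;

void *producer(void *) {
  long expected = 0;
  // The releasing CAS is the hand-off point: once it succeeds, ownership
  // moves to the consumer and neither this thread nor the instrumentation
  // should touch *obj again.
  obj->state.compare_exchange_strong(expected, 1, std::memory_order_release,
                                     std::memory_order_relaxed);
  return nullptr;
}

void *consumer(void *) {
  while (obj->state.load(std::memory_order_acquire) != 1) {
  }
  delete obj;  // new owner frees the object
  return nullptr;
}

int main() {
  obj = new Obj;
  pthread_t t1, t2;
  pthread_create(&t1, nullptr, producer, nullptr);
  pthread_create(&t2, nullptr, consumer, nullptr);
  pthread_join(t1, nullptr);
  pthread_join(t2, nullptr);
  return 0;
}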

compiler-rt/test/tsan/compare_exchange_release_relaxed.cpp
18 (On Diff #333603)

If this test differs only by memory order, it can make sense to use parametrized tests. It would also be good to test other mo combinations (e.g. that the CHECK(IsLoadOrder(fmo)) does not fail).
See e.g. test/tsan/ignore_lib0.cpp for an example.

Also I wonder if we could first evaluate CAS and then use either mo or fmo accordingly, so that we don't over-synchronize on failure if mo is stronger than fmo.

Not directly related to the change but we could also episodically fail weak CASes to test fmo case better.
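
For the last point, a hedged sketch of what "episodically failing" a weak CAS could look like (the helper and its hook point are hypothetical; nothing like this is part of the patch). A weak CAS is allowed to fail spuriously, so forcing the failure path now and then would exercise fmo handling:

#include <atomic>
#include <cstdlib>

// Hypothetical knob: fail roughly 1 in 256 weak CASes on purpose.
static bool FlakyFailureRoll() { return (std::rand() & 0xff) == 0; }

template <typename T>
bool WeakCasMaybeForceFail(std::atomic<T> *a, T *expected, T desired,
                           std::memory_order mo, std::memory_order fmo) {
  if (FlakyFailureRoll()) {
    // Spurious failure, permitted for the weak form: report the current
    // value and synchronize according to fmo only.
    *expected = a->load(fmo);
    return false;
  }
  return a->compare_exchange_weak(*expected, desired, mo, fmo);
}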

yln requested changes to this revision.Apr 20 2021, 6:26 PM

Unapprove until Dmitry's comments are addressed.

This revision now requires changes to proceed.Apr 20 2021, 6:26 PM
bruno updated this revision to Diff 341362.Apr 28 2021, 5:11 PM
bruno marked an inline comment as done.
bruno edited the summary of this revision. (Show Details)

Address reviewer comments. To prevent a false positive on release/consume, this now depends on https://reviews.llvm.org/D101501.

bruno marked 2 inline comments as done.Apr 28 2021, 5:16 PM
bruno added inline comments.
compiler-rt/lib/tsan/rtl/tsan_interface_atomic.cpp
445

Added several variations. The consume failure order currently falls back to monotonic in LLVM; I just opened https://reviews.llvm.org/D101501 to be consistent with success and fall back to acquire instead. Without this change it leads to a false positive. Thanks for bringing this up.

454

Looks like I overthought the approach, thanks for pointing it out.

dvyukov added inline comments.Apr 28 2021, 11:38 PM
compiler-rt/lib/tsan/rtl/tsan_interface_atomic.cpp
429–430

I think it's better to avoid doing this if the CAS will fail and fmo == mo_relaxed. It will provide more precise race detection.
I think we could do something along the following lines:

  • if either mo or fmo != relaxed, do GetOrCreateAndLock
  • if either mo or fmo involves release, do a write lock
  • evaluate cas
  • respect mo or fmo based on the cas result
447

We did not create/obtain the sync object if mo == mo_relaxed. Is mo == mo_relaxed and fmo == mo_acquire possible? If yes, then we will fail to respect fmo.
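
For reference, the combination is legal at the source level since C++17's relaxation of the failure-order rule (whether the frontend forwards it unmodified is a separate question, picked up later in this thread); a minimal sketch:

// Illustration only: relaxed on success, acquire on failure.
#include <atomic>

bool example(std::atomic<int> &x, int &expected) {
  return x.compare_exchange_strong(expected, 1,
                                   std::memory_order_relaxed,   // mo
                                   std::memory_order_acquire);  // fmo
}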

449

We used a different condition to decide if we need to do a read lock. Can't this deadlock? I think we need to check write_lock.

bruno updated this revision to Diff 342557.May 3 2021, 3:08 PM
bruno marked an inline comment as done and an inline comment as not done.
bruno edited the summary of this revision. (Show Details)

Apply review comments.

Note that we cannot test mo == relaxed and fmo == anything yet because LLVM falls back to relaxed/relaxed.

Remove the dependency on https://reviews.llvm.org/D101501 for now, since there's even more to improve in LLVM before we get this 100% right; at least we've got release/acquire working as of this patch. I'll come back and fix the other tests once LLVM is fixed.

compiler-rt/lib/tsan/rtl/tsan_interface_atomic.cpp
429–430

Sounds good; one minor correction: fmo cannot be release.

447

Right, will fix.

449

Yea, somehow it slipped, thanks!

bruno updated this revision to Diff 342565.May 3 2021, 3:15 PM

Update broken comment in the testcase.

dvyukov added inline comments.May 4 2021, 12:42 AM
compiler-rt/lib/tsan/rtl/tsan_interface_atomic.cpp
429–430

Should I wait for this in this change? I see we still acquire/release before evaluating the CAS. Or do you want to do it later with the other improvements you mentioned?

442

Better to do this at the beginning of the function.

449

With the current code it makes sense to do "IsAcquireOrder(fmo) && !IsAcquireOrder(mo)", because otherwise we have already acquired.
But even better is what's discussed above: first evaluate the CAS, then decide what order to use.

bruno updated this revision to Diff 343581.May 6 2021, 10:38 PM
bruno retitled this revision from [TSAN] Honor acquire failure mode on AtomicCAS to [TSAN] Honor failure memory orders in AtomicCAS.
bruno edited the summary of this revision. (Show Details)

Apply more reviewer-suggested changes: first evaluate the CAS, then decide what order to use.

Now that the LLVM side of this is fixed, also cover all allowed fmo's and add more tests.

bruno updated this revision to Diff 343584.May 6 2021, 10:50 PM

Cover more cases in atomic.ll.

dvyukov added inline comments.May 6 2021, 11:48 PM
compiler-rt/lib/tsan/rtl/tsan_interface_atomic.cpp
406

Can it be seq_cst?
If yes, then I think write_lock below is incorrect and checking only IsAcquireOrder(fmo) is incorrect as well.

435–436

The CAS needs to be evaluated under the sync object mutex, otherwise we can get inconsistent value/memory visibility. If we are going to lock the mutex, we need to lock it before CAS.

bruno updated this revision to Diff 344174.May 10 2021, 1:20 PM

Thank you @dvyukov for another round of reviews! I have more questions for you.

Yes, fmo can be seq_cst. Take mo == mo_release and fmo == mo_seq_cst: we'd be acquiring the mutex with write_lock, which is fine on success. However, on failure, I don't see another solution besides calling s->mtx.Unlock() before honoring fmo and calling GetOrCreateAndLock again to obtain the lock with write_lock = false. Since you said in a previous review that it would be too expensive to lock again, what do you suggest as an alternative? I wonder whether my concern is even legitimate, given that the mo == mo_release and fmo == mo_seq_cst testcase works regardless of this lock/unlock dance.

It's also the case that the currently added testcase takes ~20s on a fast Linux machine, which seems way over the bar. I haven't tried to improve it yet, but (at least) reducing the number of mo/fmo combinations should help. Other ideas?

In the meantime, I've updated the patch with that approach.

> Thank you @dvyukov for another round of reviews! I have more questions for you.
>
> Yes, fmo can be seq_cst. Take mo == mo_release and fmo == mo_seq_cst: we'd be acquiring the mutex with write_lock, which is fine on success. However, on failure, I don't see another solution besides calling s->mtx.Unlock() before honoring fmo and calling GetOrCreateAndLock again to obtain the lock with write_lock = false. Since you said in a previous review that it would be too expensive to lock again, what do you suggest as an alternative? I wonder whether my concern is even legitimate, given that the mo == mo_release and fmo == mo_seq_cst testcase works regardless of this lock/unlock dance.

We don't need to re-lock in read mode. We can lock in the strongest mode: a write lock if either mo or fmo requires a release, and then do the acquire under the write lock if it turns out we only need an acquire. It's fine to read a data structure under a write lock.

> Thank you @dvyukov for another round of reviews! I have more questions for you.
>
> Yes, fmo can be seq_cst. Take mo == mo_release and fmo == mo_seq_cst: we'd be acquiring the mutex with write_lock, which is fine on success. However, on failure, I don't see another solution besides calling s->mtx.Unlock() before honoring fmo and calling GetOrCreateAndLock again to obtain the lock with write_lock = false. Since you said in a previous review that it would be too expensive to lock again, what do you suggest as an alternative? I wonder whether my concern is even legitimate, given that the mo == mo_release and fmo == mo_seq_cst testcase works regardless of this lock/unlock dance.
>
> It's also the case that the currently added testcase takes ~20s on a fast Linux machine, which seems way over the bar. I haven't tried to improve it yet, but (at least) reducing the number of mo/fmo combinations should help. Other ideas?

This is too long. But I think it's just due to the "atexit sleep" in tsan. You can do something like this, and maybe even set atexit_sleep_ms to 0:

test/tsan/fork_atexit.cpp:// RUN: %clangxx_tsan -O1 %s -o %t && %env_tsan_opts=atexit_sleep_ms=50 %run %t 2>&1 | FileCheck %s
bruno updated this revision to Diff 344614.May 11 2021, 5:43 PM

Cool, thanks for the input!

  • Update the patch to prevent the unnecessary lock/unlock dance.
  • Changing atexit_sleep_ms didn't really change the ballpark; rewrote the test to include all checks in one run, which brings it down to 1.5-2s total time.
dvyukov added inline comments.May 11 2021, 10:08 PM
compiler-rt/lib/tsan/rtl/tsan_interface_atomic.cpp
406

If I am not missing something, I think this comment still holds:

> Can it be seq_cst?
> If yes, then I think write_lock below is incorrect and checking only IsAcquireOrder(fmo) is incorrect as well.

452

There is some duplication here between the handling of the success case and the failure case: adding to the trace, acquire, unlock. And taking into account that fmo can be seq_cst, we will need even more duplication to fix this, e.g. we may need to do Release and WriteUnlock here.
I would structure it along the following lines (it should both handle all cases and avoid duplication at the same time):

SyncVar *s = 0;
bool write_lock = IsReleaseOrder(mo) || IsReleaseOrder(fmo);
if (mo != mo_relaxed || fmo != mo_relaxed)
  s = ctx->metamap.GetOrCreateAndLock(thr, pc, (uptr)a, write_lock);

T cc = *c;
T pr = func_cas(a, cc, v);
bool success = pr == cc;
if (!success) {
  *c = pr;
  mo = fmo;
}
if (s) {
  thr->fast_state.IncrementEpoch();
  // Can't increment epoch w/o writing to the trace as well.
  TraceAddEvent(thr, thr->fast_state, EventTypeMop, 0);
  if (IsAcqRelOrder(mo))
    AcquireReleaseImpl(thr, pc, &s->clock);
  else if (IsReleaseOrder(mo))
    ReleaseImpl(thr, pc, &s->clock);
  else if (IsAcquireOrder(mo))
    AcquireImpl(thr, pc, &s->clock);

  if (write_lock)
    s->mtx.Unlock();
  else
    s->mtx.ReadUnlock();
}
return success;
bruno added inline comments.May 12 2021, 2:23 PM
compiler-rt/lib/tsan/rtl/tsan_interface_atomic.cpp
406

> If I am not missing something, I think this comment still holds:
>
> Can it be seq_cst?

Yes, I also added tests to cover that.

> If yes, then I think write_lock below is incorrect and checking only IsAcquireOrder(fmo) is incorrect as well.

See my next comment.

452

fmo cannot be mo_release or mo_acq_rel, and when it's mo_seq_cst my understanding (which could be wrong) is that it has load semantics. @rjmccall @jfb does my understanding make sense?

With that in mind, write_lock only needs to track IsReleaseOrder(mo), which is what the current patch does.
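
Restated as a hedged sketch with standalone definitions (the runtime has its own morder enum and IsReleaseOrder helper, so this only spells out the reasoning, it is not the patch code):

enum morder { mo_relaxed, mo_consume, mo_acquire, mo_release, mo_acq_rel, mo_seq_cst };

// seq_cst is treated as carrying release semantics for a store/RMW.
static bool IsReleaseOrder(morder mo) {
  return mo == mo_release || mo == mo_acq_rel || mo == mo_seq_cst;
}

// fmo can never be mo_release/mo_acq_rel, and a failed CAS is only a load,
// so only the success order decides whether the sync object needs a write lock.
static bool NeedsWriteLock(morder mo, morder /*fmo*/) { return IsReleaseOrder(mo); }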

bruno updated this revision to Diff 344961.May 12 2021, 2:25 PM

Update the patch to reduce code duplication as suggested by @dvyukov.

dvyukov accepted this revision.May 12 2021, 11:48 PM

I don't see any issues now. Thanks for bearing with me.

compiler-rt/lib/tsan/rtl/tsan_interface_atomic.cpp
448

"success &&" is unnecessary here right?
I see it's not harmful, but it also makes me a but nervous because we used just the "write_lock" condition when locking, but we use a different one when unlocking. This is not symmetric and have some potential to get out of sync, so I would drop "success &&" part here.

452

I think you are right. This makes sense.

bruno added inline comments.May 13 2021, 12:20 AM
compiler-rt/lib/tsan/rtl/tsan_interface_atomic.cpp
448

No problemo, will do. Thanks for the careful review!

This revision was not accepted when it landed; it landed in state Needs Review.May 13 2021, 1:07 AM
This revision was automatically updated to reflect the committed changes.
Herald added a project: Restricted Project. · View Herald TranscriptMay 13 2021, 1:07 AM
Herald added a subscriber: Restricted Project. · View Herald Transcript