This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
libcxx/
-
include/
-
CMakeLists.txt
6/12
__threading_support
23/29
atomic
2/2
barrier
1/2
chrono
-
cstddef
2
latch
3
module.modulemap
7/8
semaphore
-
stdexcept
1/2
type_traits
-
src/
-
CMakeLists.txt
17/25
atomic.cpp
2/3
barrier.cpp
6/8
semaphore.cpp
-
test/
-
libcxx/
-
double_include.sh.cpp
-
std/
-
atomics/atomics.types.operations/atomics.types.operations.wait/
-
atomics.types.operations/
-
atomics.types.operations.wait/
-
atomic_wait.pass.cpp
-
thread/
-
thread.barrier/
1/1
arrive.pass.cpp
-
arrive_and_drop.pass.cpp
-
arrive_and_wait.pass.cpp
-
completion.pass.cpp
-
version.pass.cpp
-
thread.latch/
-
arrive_and_wait.pass.cpp
-
count_down.pass.cpp
-
try_wait.pass.cpp
-
version.pass.cpp
-
thread.semaphore/
-
acquire.pass.cpp
-
binary.pass.cpp
-
max.pass.cpp
-
release.pass.cpp
-
timed.pass.cpp
-
try_acquire.pass.cpp
-
version.pass.cpp

Differential D68480

Implementation of C++20's P1135R6 for libcxx
ClosedPublic

Authored by __simt__ on Oct 4 2019, 1:11 PM.

Download Raw Diff

Details

Reviewers

jfb
ldionne
EricWF
mclow.lists

Group Reviewers

Restricted Project

Commits

rG54fa9ecd3088: [libc++] Implementation of C++20's P1135R6 for libcxx

Summary

This is the first review of this code so there's a lot to look at, and I'm fully expecting a lot of changes will be requested.

This patch contains...

A) Changes to __threading_support that introduce:

Low-level semaphores on POSIX and Apple GCD, and futex on Linux.
Declarations for a sharded table of contention state in the dylib, used by atomic::wait.
Declarations for a thread_local variable in the dylib, used by barriers.

B) The <atomic> changes from P1135:

High QoI: multi-layered back-off, using either/both the state from 1b and futexes from 1a.
Low QoI: exponential time back-off, using chrono only.

C) Barrier:

High QoI: a tree barrier, using the acceleration state in 1c to amortize the extra round.
Low QoI: a central barrier, with a specialization for the empty completion function.

D) Semaphore:

All QoI: a general template semaphore for very large ptrdiff_t values.
High QoI: a specialization for “reasonable” ptrdiff_t values, using semaphores in 1a and acceleration atomics.
Low QoI: a specialization for unit count.

E) Latch:

All QoI (low): a central latch. (If there's a desire to see it, I could borrow the same QoI knobs from barrier, but sizeof() would grow a lot.)

F) The first basic tests for each facility from P1135.

G) Miscellaneous tweaks I needed to get this to build as libcu++ (the CUDA variant). We can drop these or you can take them as improvements. One of them is a legit macro bug.

Diff Detail

Repository: rCXX libc++

Event Timeline

There are a very large number of changes, so older changes are hidden. Show Older Changes

__simt__ added inline comments.Oct 10 2019, 2:50 PM

include/atomic
1557 ↗	(On Diff #223282)	This one is fun. The model for a condition variable is that there is a lock that guards all reads and writes of the variable that has a condition evaluated on it. In this case here, that variable is an atomic, and we both read and write it outside the lock - we evaluate the condition outside the lock with much greater concurrency. However we're still exposed to a race here, because threads may be transitioning into the sleep state as we come to notify. Taking the lock here resolves this race condition, even if we do nothing with the critical section.
include/barrier
80 ↗	(On Diff #223282)	It can be. How do we do that?
199 ↗	(On Diff #223282)	This one doesn't need to be. Removing.
216 ↗	(On Diff #223282)	That's not how it works in <atomic> and I patterned on that. Also it helps the CUDA port be less tedious. If you feel strongly about this one I can obviously take the extra steps, but do you?
include/chrono
1065 ↗	(On Diff #223282)	No.
1571 ↗	(On Diff #223282)	Yes. CUDA can't make OS calls but we do have another way of implementing this. So I made it external-izable.

Are the lock free algorithms used by this implementation published somewhere? That would give me a lot more confidence in their correctness.

In D68480#1705203, @EricWF wrote:

Are the lock free algorithms used by this implementation published somewhere? That would give me a lot more confidence in their correctness.

Now that's getting to the meat of the matter! The short answer is no, and the long answer is that it depends on what we're going to call an algorithm.

The <latch> code is more-or-less the spec's exposition converted into code, but I think it needs to become more complex later, for QoI reasons. In <barrier> we have well-known barrier algorithms like linear/central and tree/hierarchical barriers, but the pseudocode you'll find in literature needs a lot of adaption to fit into C++ in general (which doesn't do thread-local non-static members for classes), and into the C++20 std::barrier interface in particular (which doesn't assign a dense index space of IDs to thread that could be used to access O(P)-sized internal structures) -- and all that adaptation is 90% of the code in that file, not the algorithm itself. Finally, the gnarly code in <semaphore> is not the semaphore algorithm proper either (because that's been left to the OS except for the fallback cases) but the acceleration layer that one would need to put on top of it in order to make it fast (avoid calling the OS when it's not strictly necessary).

To be clear, I am reasonably concerned that there are bugs in this code, as well. The simpler / lower-QoI paths which CUDA will eventually ship are decently solid, but the complex / high-QoI paths that a performant CPU implementation needs (read: existentially needs, even in ~6 core laptop) are not as well tested relative to their complexity.

On our end, I think our intentions are to write more fuzzer-like tests that apply stress on them. I also have in the back of my mind the idea of proposing a C++Now talk in the vein of "How-to P1135 @ High QoI", to capture more eyeballs / document the post-adaptation versions of the algorithms better.

Where would you like to take this chat next?

griwes added a subscriber: griwes.Oct 16 2019, 8:14 PM

griwes added inline comments.

libcxx/include/semaphore
68	Use a reserved identifier for the template parameter.
410	Use a reserved identifier for the template parameter.
421	Use a reserved identifier for the template parameter.
libcxx/test/std/thread/thread.barrier/arrive.pass.cpp
20	This depends on CTAD, which makes it only work in C++17 and above. Instead it should probably say `std::barrier<> b;`. Same comment for all the other barrier and semaphore tests.

__simt__ marked 4 inline comments as done.Oct 17 2019, 7:29 PM

Refactored the semaphores into layers that each do one clear thing, and dispensed with a lot of #ifdefs. Moved as much semaphore code as possible out of the header and into the dylib.

Refactored the atomic code which previously necessitated type punning so it no longer does that, and narrowed the declarations of the contention state type. Made use of private contention state in the internal libcxx types that make use of wait/notify so they don't create contention with the user's uses of wait/notify.

Refactored some of the classes which previously used public inheritance of interface members, to use containment of the base type and forwarding member calls instead. This also let me simplify the bases to only define the core functionality and put the sugars on the one user interface type.

Removed asserts and alignas, and fixed several names which were not reserved before.

Fixed some issues Michał pointed out.

Fixed bugs where semaphores would randomly assert on APPLE.

Fixed bugs where semaphores would randomly fail timed waits on linux.

Simplified how the contention state is conditionally-used.

Fixed several breaks in the CUDA version from the last patch.

Fixed an incompatibility between the _Atomic(T) backend of <atomic> and the Futex function SFINAE in <__threading_support>.

Added some documentation for the semaphore and barrier algorithms as comments.

Fixed a few issues hit on Linux when overriding to disable optimized paths.

Oof, didn't attach the patch.

zoecarver added inline comments.Oct 20 2019, 9:43 AM

libcxx/include/atomic
1477	I think this can be a `static_cast`.
1533	Might be wrong but, can't this be a CAS (because you are both comparing and swapping `0`)?
1535	Why do you lock, then immediately unlock the mutex here?
1563	Is this another "magic" number? If so, can it be a macro too?
1696–1709	This should only be defined after C++17.
1697	You can use `memory_order::seq_cst` here.
1821	Is this available in C++03? If so, use `typedef`. Actually, do we support any compilers that don't support `using` type aliases?
libcxx/include/semaphore
106	Does `__cxx_atomic_notify_one` always call out to `__cxx_atomic_notify_all`? If so, can we get rid of `__cxx_atomic_notify_one`?

__simt__ marked 7 inline comments as done.Oct 20 2019, 10:05 AM

__simt__ added inline comments.

libcxx/include/atomic
1477	Actually it can and should go away now. I'll remove the cast.
1533	It's not conditionally exchanging, it's exchanging unconditionally and then conditionally taking a computation step.
1535	I answered that one here: https://reviews.llvm.org/D68480?id=223282#inline-618652
1563	Sure.
1696–1709	That would be unfortunate, I think, <atomic> in general tries to offer its functionality in back versions. Also, the CUDA port definitely doesn't want to be tied to '17 for quite some time. Can we leave it on in all dialects that <atomic> supports, like the rest?
1697	Pending other comment's resolution.
1821	Will do.

• dhollman added a subscriber: • dhollman.Oct 22 2019, 6:49 AM

In this update I repaired the usefulness of <atomic> in C++03 mode. Other headers won't be supported there, but I'm avoiding regression to <atomic>.

In this version I moved the tree barrier's core algorithm into the dylib so that any functional or performance issue found in it later could still be fixed without breaking ABI. By the same fact it narrows the visibility of the definition of the thread_local symbol it uses.

This revision enables cross-ABI compatibility for building the dylib and the user application with different combinations of these options:

_LIBCPP_HAS_NO_PLATFORM_WAIT
_LIBCPP_HAS_NO_PLATFORM_WAIT_TABLE
_LIBCPP_HAS_NO_PLATFORM_WAIT_STATE

When the ABIs are mixed, the total application still gets the best performance available for the facilities actually available in common. This is somewhat gnarly (apologies for the several void*...) but was requested offline, to allow for a platform to progress from not having Futex support (say, today) to having Futex (eventually, later).

What this ABI hardening does not include is support for changing the type of "__libcpp_platform_contention_t" from a uint32_t to a uint64_t, later. Once a platform chooses to expose Futex, it's an ABI break to change the size of that, so it's worth pondering whether the platform should be held back until it can support uint64_t Futex.

This revision also introduces an internal-only abstraction layer called "__atomic_positive_ptrdiff_t" which centralizes a lot of turning concerns for both latches and semaphores, when layered on top of atomics, for different configurations.

In this revision the APPLE configuration is a bit quirky: 1) the headers build as if Futex is enabled and is a uint64_t, but 2) the dylib builds as if Futex is disabled and leans on the cross-ABI compatibility to fall-back to mutexes and condvars. Also, this is not quirky but my new recommendation, on APPLE it no longer uses dispatch semaphores.

Closing out some obsolete comments after changes and/or testing.

Hi guys. Could I get some action items or other movement on this review? Would you like me to make it possible to merge it as disabled or as experimental, and send me DRs to fix afterwards?

_LIBCPP_HAS_NO_PLATFORM_WAIT

_LIBCPP_HAS_NO_PLATFORM_WAIT_TABLE

_LIBCPP_HAS_NO_PLATFORM_WAIT_STATE

It would be great to document these macros somewhere in the code or in a .rst document.

Also, this paper didn't add any feature-test macro?

libcxx/include/atomic
1821	Can we add a test for this?
1842	This too!
libcxx/include/chrono
1571	What's that?
libcxx/include/semaphore
78	Thanks for the comments in this file, I wish we did that more often.
libcxx/include/type_traits
4028	What are these drive-by fixes?

This revision now requires changes to proceed.Nov 18 2019, 1:15 PM

__simt__ marked 6 inline comments as done.Nov 18 2019, 8:20 PM

__simt__ added inline comments.

libcxx/include/atomic
1821	Yes, I can do that.
1842	Ack.
libcxx/include/chrono
1571	Timed wait functions need <chrono>. That meant that I had to port <chrono> to CUDA as part of this effort. In CUDA you can't ask the OS for the time, you need to ask the silicon for the time. I followed the precedent of external threading to implement this as external clocks. Makes sense?
libcxx/include/semaphore
78	NP, Eric had requested something here. Feel free to ask for comments elsewhere.
libcxx/include/type_traits
4028	It's important for the CUDA port that there not be naked "namespace std { ... }" that don't use the macros, because we need to inject our namespace name. I tried to keep the drive-by to a minimum.

Thank you for all the work you've put into this patch. Here are a few more comments. Still working through all this code :)

I'm becoming increasingly worried about the amount of inline configuration and platform-specific code. I want to make sure we have tests for all of these (and ideally CI that can run all the tests).

libcxx/src/atomic.cpp
29	For what platforms is `_LIBCPP_HAS_PLATFORM_WAIT_STATE` false, and have you tested on those platforms? I'm worried that there might be compiler errors.
33	What's the point of this macro, `ATOMIC_VAR_INIT` (I realize you didn't add it, but I'm still curious)?
47	This is different if `_LIBCPP_HAS_PLATFORM_WAIT_STATE` is false, right?
130	Will this ever be true?
134	same as below
140	Is there a test for this case? What will be the effect of `__s` not getting updated?
149	Instead of having void pointers that are casted, I don't see any reason these couldn't be defined as their actual types (`__cxx_atomic_impl` and `__libcpp_platform_contention_t`).
163	Is `__libcpp_platform_wait` defined on non-linux machines?
libcxx/src/barrier.cpp
45	Will this ever not only happen on the first iteration (if so, move it out of the loop maybe)?
libcxx/src/semaphore.cpp
28	Why is this only needed for apple?
124	Where does `50` come from? Maybe make this a macro.
153	Is this the same as `while (__old == 0)`?

No problem. I just want to know what I need to do in the next patch in order to move forward.

libcxx/src/atomic.cpp
29	It's false on CUDA. It would be false on platforms that can't rely on OS support for efficient waiting and have to fall back to polling with backoff.
33	Prior to P0883 merging into C++20, atomics are constructed in an uninitialized state. You're supposed to use this macro to give it a static initialization. This macro is deprecated after C++20.
47	If you have a platform wait state, then it's used by this facility. If you don't have a platform wait state, then a condvar is used instead.
130	Oh yes. Every time that there is a contending waiter concurrent with this notifier. This condition is true whenever the facility's use is not trivial.
134	Answered below.
140	Undefined behavior. Usually a segfault.
149	This is the uglier part of the patch. We have this situation where one platform (Apple) is about to go from not having platform wait states, to having them. This API is trying to be able to take either kind of efficient wait structure in the application OR in the dylib, and provide efficient waiting either way. I think it would not be unreasonable if you guys asked me to cut this part out in order to get a first commit of the facility, and then work in the background with Louis on recovering the capability in some way, with or without a public patch.
163	Yes.
libcxx/src/barrier.cpp
45	This needs to be inside the loop unfortunately. During the first round, we need to record our effective start (leaf) location in the tree. I could express it differently - record it only at the end of the first round - but it would remain in the loop nest. What we could potentially do is dumb down this favorite barrier index concept, or remove it entirely. It's worth a relatively small amount of performance by comparison to using the tree barrier in the first place. I would not be offended if you asked me to give you an ordered list of things I could delete and tell you what it costs you to delete them, approximately. Then we could stop where we are more comfortable.
libcxx/src/semaphore.cpp
28	Because of how GCD semaphores work, unfortunately. We could delete this by sending all Apple semaphores to the generic template based on atomics. Last I spoke with Louis, we thought that would be acceptable.
124	Yeah. Aren't there enough macros though? I think I might come back later with a different patch to propose a set of macros to configure back-offs.
153	It's not. This is masking the lower 32-bits of a 64-bit value, and then comparing that with 0.

zoecarver added inline comments.Jan 10 2020, 2:54 PM

libcxx/src/atomic.cpp
33	Heh. I didn't realize that was part of the standard (I was wondering why it wasn't mangled). Good to know.
47	I see. I got confused for a second while trying to follow the `#if`s.
140	Isn't that a problem if `_LIBCPP_HAS_NO_PLATFORM_WAIT_TABLE` isn't defined?
149	I'll defer to others on this but, (assuming it wouldn't be much more work for you) it might be better to remove that from this patch and add it as a follow-up patch. In general, the less code in this patch, the faster it can get committed.
libcxx/src/barrier.cpp
45	What we could potentially do is dumb down this favorite barrier index concept, or remove it entirely. It's worth a relatively small amount of performance by comparison to using the tree barrier in the first place. I don't know enough about this piece of code or potential methods of implementation to comment on what we should do. I'll defer to you/others on this.
libcxx/src/semaphore.cpp
124	Yes, there are an unfortunate number of macros in this bit of code. I don't feel strongly about adding a macro now or later, maybe add a comment, though (that the number isn't magic or referenced elsewhere).
153	I'm pretty sure they are the same. Look at this example. The first and last functions generate the same optimized assembly.

__simt__ updated this revision to Diff 240389.Jan 25 2020, 10:34 AM

A lot of changes in this patch, mostly simplifications based on the areas that got feedback about complexity.

For the most part, I have deleted code:

I deleted the condvar-based implementation of atomic_wait because no current platform would make use of it. That also allowed me to delete much of the complicated ABI tricks added in the last revision. That removed several configuration macros.
After discussing it with Louis, I switched the Apple config to use atomics-based semaphores. Not only did that delete a lot of code on its own, but it made some other optimizations in semaphores no longer needed, so I deleted them too. That removed several more configuration macros as well.
I deleted code dealing with the thread_local optimization in barrier. Using this_thread::get::id to inject a thread-stable search seed for the algorithm is about just as good, and everything for that already exists.
Latches, barriers and semaphores make more direct use of the wait/notify functionality now. Where they had "exceptions" with customized waiting, now there's a unified interface there, which deletes code.

In addition to this, the internal abstractions are much better now. I particularly like how natively-supported Futex types pass-through the intermediate layer now, it just works.

I hope you find this version is close enough that we might soon switch to delta patches on top, in trunk.

Cheers,

Olivier

jfb added a subscriber: MadCoder.Jan 27 2020, 10:42 AM

Not finished looking through this but, here are some comments.

Overall, I really like this change. It looks fantastic. It seems like a much cleaner/simpler implementation. Thanks for all the time you've spent on this.

libcxx/include/__threading_support
36	Are all the platforms we support guaranteed to have this header (they very well might, I just don't know)?
38	Is this macro still needed?
40	This could be an else block, no?
79	Are there any platforms where `SEM_VALUE_MAX` doesn't exist? Maybe some BSD platform? Could you check that it is defined (or maybe that we're on a certain platform)?
libcxx/include/atomic
2530	I might just be missing it but, is this in the standard? Otherwise, could you mangle it?
2531	nit: space between the equals.
libcxx/include/barrier
275	Will `__barrier_base` be defined if `_LIBCPP_HAS_NO_TREE_BARRIER` isn't?
libcxx/include/latch
54	Can we make it so that these headers just don't exist before C++11 via cmake? That might be a nicer way to fail.

Thanks Zoe!

Some responses ->

libcxx/include/__threading_support
36	This header is part of Pthreads and this #include is in the Pthreads section of the header, so I think the answer is yes. The Apple case is special in that they have deprecated this header.
38	Yes. We need a way to disable the platform native semaphores. Both Apple (in the current design) and CUDA need it.
40	It would have to be an #elif !defined(...) so that CUDA could trigger it too.
79	Like the header, it's mandated by Pthreads. It's conceivable that some BSD is non-conforming and we still want to work on it, but I don't have a BSD system handy to try.
libcxx/include/atomic
2530	It is, in C++20.
2531	OK.
libcxx/include/barrier
275	Yes. The macro ends up selecting between two different base classes -- one is the scalable tree barrier, the other is the simpler central barrier.
libcxx/include/latch
54	I don't have this skill. :^/ But I do think that we want to support them in C++11/14/17. This author's production team is going to present it to users in these dialects, for instance.

This is looking pretty good, only a few nitpicks. Once the nitpicks are addressed we can commit this and then go incrementally.

libcxx/include/__threading_support
41	Any reason to introduce this macro instead of just use `!defined(_LIBCPP_NO_NATIVE_SEMAPHORES)` when you need it?
555	`static constexpr`?
libcxx/include/atomic
550	Just wondering: will this in any way make it harder to support `<atomic>` in freestanding?
1465	I'm not seeing this used anywhere -- am I missing something?
libcxx/include/semaphore
62	Olivier and I spoke offline, and it's reasonable to request C++14 at least here. Please do this in all the headers you're adding. The idea is to avoid having headers that are supported in older dialects than they need to be (within reason), which is often a source of technical debt.
libcxx/src/atomic.cpp
48	In `apple_availability.h`, add: #if defined(__ENVIRONMENT_MAC_OS_X_VERSION_MIN_REQUIRED__) #if __ENVIRONMENT_MAC_OS_X_VERSION_MIN_REQUIRED__ >= 101500 #define _LIBCPP_USE_ULOCK #endif #elif defined(__ENVIRONMENT_IPHONE_OS_VERSION_MIN_REQUIRED__) #if __ENVIRONMENT_IPHONE_OS_VERSION_MIN_REQUIRED__ >= 130000 #define _LIBCPP_USE_ULOCK #endif #elif defined(__ENVIRONMENT_TV_OS_VERSION_MIN_REQUIRED__) #if __ENVIRONMENT_TV_OS_VERSION_MIN_REQUIRED__ >= 130000 #define _LIBCPP_USE_ULOCK #endif #elif defined(__ENVIRONMENT_WATCH_OS_VERSION_MIN_REQUIRED__) #if __ENVIRONMENT_WATCH_OS_VERSION_MIN_REQUIRED__ >= 60000 #define _LIBCPP_USE_ULOCK #endif #endif // __ENVIRONMENT_.*_VERSION_MIN_REQUIRED__ That should do it for all platforms aligned to Mac OS 10.15. Feel free to use whatever name for `_LIBCPP_USE_ULOCK` -- `_LIBCPP_USE_APPLE_ULOCK` probably makes the most sense since it's an Apple-specific API. Then, your `#elif` becomes `#elif defined(__APPLE__) && defined(_LIBCPP_USE_APPLE_ULOCK)`.

Replies. Also noting that I will include an update to the implementation status HTML page.

libcxx/include/__threading_support
41	I was going to say it’s more convenient for me, but I’m not even convinced. I can streamline this.
555	Yes.
libcxx/include/atomic
550	A bit, yeah. There are a few ways to proceed.
1465	I intend to use that to implement the normative encouragement that atomic_signed/unsigned[...]_t should be the efficient ones for waiting. I added those types below, but they don’t follow that encouragement.
libcxx/include/semaphore
62	Yep
libcxx/src/atomic.cpp
48	Thanks!

This revision addresses the outstanding comments, and incorporates Louis' recommended macros for ulock detection.

I also added missing max() members to barrier/latch that were added by NB comment resolution, and I implemented the normative encouragement for the lock-free types to match the ideal contention type, if there is one. I added tests for these things.

Finally, I further improved the ABI resilience of the barrier type so it's almost a full pimpl now.

Thanks a lot for all the work and patience, Olivier. I think we're good to go now. Do you have commit access?

Also, can you please confirm that you meant to introduce all the following new symbols in the dylib, and nothing else:

std::__1::__libcpp_atomic_wait(std::__1::__cxx_atomic_impl<long long, std::__1::__cxx_atomic_base_impl<long long> > const volatile*, long long)
std::__1::__libcpp_atomic_wait(void const volatile*, long long)
std::__1::__cxx_atomic_notify_all(std::__1::__cxx_atomic_impl<long long, std::__1::__cxx_atomic_base_impl<long long> > const volatile*)
std::__1::__cxx_atomic_notify_all(void const volatile*)
std::__1::__cxx_atomic_notify_one(std::__1::__cxx_atomic_impl<long long, std::__1::__cxx_atomic_base_impl<long long> > const volatile*)
std::__1::__cxx_atomic_notify_one(void const volatile*)
std::__1::__libcpp_atomic_monitor(std::__1::__cxx_atomic_impl<long long, std::__1::__cxx_atomic_base_impl<long long> > const volatile*)
std::__1::__libcpp_atomic_monitor(void const volatile*)
std::__1::__arrive_barrier_algorithm_base(std::__1::__barrier_algorithm_base*, unsigned char)
std::__1::__destroy_barrier_algorithm_base(std::__1::__barrier_algorithm_base*)
std::__1::__construct_barrier_algorithm_base(long&)

Those will have to be added to the ABI list file (I can do that).

This revision is now accepted and ready to land.Feb 18 2020, 7:11 AM

Closed by commit rG54fa9ecd3088: [libc++] Implementation of C++20's P1135R6 for libcxx (authored by __simt__, committed by ldionne). · Explain WhyFeb 24 2020, 8:03 AM

This revision was automatically updated to reflect the committed changes.

Herald added a project: Restricted Project. · View Herald TranscriptFeb 24 2020, 8:03 AM

ldionne added inline comments.Feb 24 2020, 8:50 AM

libcxx/include/atomic
443	We forgot to update the synopsis for this header with the C++20 Synchronization Library. @__simt__ would you be willing to do that in a followup change?

Herald added a reviewer: mclow.lists. · View Herald TranscriptFeb 24 2020, 8:50 AM

ldionne mentioned this in rGb21405d1cd08: [libc++] Fix CI and Linux failures after landing D68480.Feb 24 2020, 9:08 AM

teemperor added a subscriber: teemperor.Feb 24 2020, 10:46 AM

teemperor added inline comments.

libcxx/include/module.modulemap
234	Maybe I'm missing something here, but due to this submodule we are always parsing the barrier header even when building the module with a language standard < C++14. This means that everyone using C++11 is no longer able to use the 'std' Clang module after this commit. Is this intentional?

ldionne added inline comments.Feb 24 2020, 10:50 AM

libcxx/include/module.modulemap
234	No, this is not intentional. Sorry, there were several failures that needed fixing after committing this (failures that were impossible to notice without throwing the change at all the build bots) -- we're getting there.

This is the second time something like this happened. When you make such large scale changes -- make sure to run the lldb testsuite, or be ready to revert. We already had this discussion in the past, but clearly it didn't prevent the problem from happening again.

teemperor added inline comments.Feb 24 2020, 11:22 AM

libcxx/include/module.modulemap
234	No worries, pushed a fix in b61e83eb0e31c1e6006569b43bb98a61ff44ca4c

In D68480#1889799, @davide wrote:

This is the second time something like this happened. When you make such large scale changes -- make sure to run the lldb testsuite, or be ready to revert. We already had this discussion in the past, but clearly it didn't prevent the problem from happening again.

Last time's discussion didn't get anywhere, as running the LLDB test suite on each commit we make to libc++ isn't a viable option.

Libc++'s test matrix is insanely large, and actually we can't even see all of it cause some uses are behind closed doors. Whenever we make a non-trivial change (and BTW this is a purely additive change), it breaks someone somewhere. And you can rest assured that we do run a lot of testing locally to make sure we don't break people before committing, but it can't catch everything. That's just the way it is, and we try to fix it as quickly as possible -- I've spent all day so far trying to fix the consequences of applying this patch. This is not a lack of diligence, it's just that the nature of libc++ makes it difficult to test comprehensively.

I don't know what this commit broke in LLDB, if anything, but instead it's more useful to comment here with a link to the failure so we can help fix it. Being combative is no help, as we're all in the same boat.

In D68480#1889935, @ldionne wrote:

In D68480#1889799, @davide wrote:

This is the second time something like this happened. When you make such large scale changes -- make sure to run the lldb testsuite, or be ready to revert. We already had this discussion in the past, but clearly it didn't prevent the problem from happening again.

Last time's discussion didn't get anywhere, as running the LLDB test suite on each commit we make to libc++ isn't a viable option.

Libc++'s test matrix is insanely large, and actually we can't even see all of it cause some uses are behind closed doors. Whenever we make a non-trivial change (and BTW this is a purely additive change), it breaks someone somewhere. And you can rest assured that we do run a lot of testing locally to make sure we don't break people before committing, but it can't catch everything. That's just the way it is, and we try to fix it as quickly as possible -- I've spent all day so far trying to fix the consequences of applying this patch. This is not a lack of diligence, it's just that the nature of libc++ makes it difficult to test comprehensively.

I don't know what this commit broke in LLDB, if anything, but instead it's more useful to comment here with a link to the failure so we can help fix it. Being combative is no help, as we're all in the same boat.

I would like to reiterate that the policy in LLVM is that commit that break projects can be reverted willy-nilly.
I would also like to stress that my use cases are not behind closed doors -- LLDB is part of the LLVM umbrella.
I thought we reached an agreement last time, but looks like there was a miscommunication or misunderstanding on your side, so let me reiterate that you have two options:

Run the lldb testsuite for changes that impact layout of structures -- or in any case you consider non-trivial, or if you don't think you can do this, at least ping somebody from lldb to take a look at the changes before they're committed. This of course requires some judgement on your side. When in doubt ask, as sending an e-mail is cheap.
You commit without pre-commit checking lldb because you consider the additive cost of running ninja check-lldb is prohibitive. This is your choice, but don't be surprised if people will revert your commit if it breaks things.

Hopefully this clarifies my position.

FWIW, this change broke building for windows. It seemed fairly straightforward to fix though, see D75102.

ldionne mentioned this in rGab41129b1ee1: [libc++] Proper fix for libc++'s modulemap after D68480.Feb 25 2020, 8:44 AM

Hi, a bisect shows that this patch seems to cause time.h to not be found for us:

[1958/45287] CXX kernel-x64-clang/obj/kernel/lib/libc/libc.snprintf.cc.o
FAILED: kernel-x64-clang/obj/kernel/lib/libc/libc.snprintf.cc.o
../../../recipe_cleanup/clangEPCywe/bin/clang++ -MD -MF kernel-x64-clang/obj/kernel/lib/libc/libc.snprintf.cc.o.d -o kernel-x64-clang/obj/kernel/lib/libc/libc.snprintf.cc.o -DTOOLCHAIN_VERSION=/b/s/w/ir/k/recipe_cleanup/clangEPCywe/bin -DZX_ASSERT_LEVEL=2 -DWITH_FRAME_POINTERS=1 -D_LIBCPP_ENABLE_THREAD_SAFETY_ANNOTATIONS=1 -D_LIBCPP_DISABLE_VISIBILITY_ANNOTATIONS -DKERNEL_BASE=0xffffffff80100000 -DSMP_MAX_CPUS=32 -D_KERNEL -DLK -DENABLE_PANIC_SHELL -DWITH_DEBUG_LINEBUFFER -DZIRCON_TOOLCHAIN -DLK_DEBUGLEVEL=2 -DWITH_KERNEL_PCIE -DKERNEL_RETPOLINE=1 -DWITH_UNIFIED_SCHEDULER=1 -DSCHEDULER_TRACING_LEVEL=0 -DARCH_X86 -DKERNEL_LOAD_OFFSET=0x00100000 -D_LIBCPP_DISABLE_EXTERN_TEMPLATE -I../../zircon/kernel/include -I../../zircon/kernel/lib/libc/include -I../../zircon/kernel/lib/ktl/include -I../../zircon/kernel/lib/io/include -I../../zircon/kernel/lib/heap/include -I../../zircon/system/ulib/lazy_init/include -I../../zircon/system/ulib/lockdep/include -I../../zircon/system/ulib/ffl/include -I../../zircon/kernel/vm/include -I../../zircon/kernel/lib/user_copy/include -I../../zircon/system/ulib/zircon-internal/include -I../../zircon/kernel/lib/ktrace/include -I../../zircon/system/ulib/fbl/include -I../../zircon/kernel/lib/fbl/include -I../../zircon/system/public -I../../zircon/kernel/arch/x86/include -I../../zircon/system/ulib/bitmap/include -I../../zircon/kernel/arch/x86/page_tables/include -I../../zircon/system/ulib/hwreg/include -I../../zircon/kernel/lib/heap/include -I../../zircon/kernel/lib/io/include -I../../zircon/kernel/lib/ktl/include -idirafter ../../zircon/kernel/lib/libc/limits-dummy -fno-common --target=x86_64-fuchsia -mcx16 -march=x86-64 -fcrash-diagnostics-dir=clang-crashreports -fcolor-diagnostics -ffile-prefix-map=/b/s/w/ir/k/fuchsia/out/default.zircon=. -ffile-prefix-map=/b/s/w/ir/k/fuchsia/out=.. -ffile-prefix-map=/b/s/w/ir/k/fuchsia=../.. -no-canonical-prefixes -O2 -g3 -Wall -Wextra -Wno-unused-parameter -Wno-address-of-packed-member -Wnewline-eof -Wno-unknown-warning-option -Wno-c99-designator -Wno-int-in-bool-context -Wno-range-loop-analysis -fno-omit-frame-pointer -ffunction-sections -fdata-sections -Wthread-safety -Wimplicit-fallthrough -fvisibility=hidden -ftrivial-auto-var-init=pattern -Werror -Wno-error=deprecated-declarations -fpie -mretpoline -mretpoline-external-thunk -ffreestanding -include ../../zircon/kernel/include/hidden.h -fno-unwind-tables -mno-red-zone -Wformat=2 -Wvla -mno-red-zone -msoft-float -mno-mmx -mno-sse -mno-sse2 -mno-3dnow -mno-avx -mno-avx2 -mcmodel=kernel -fdata-sections -Wno-gnu-string-literal-operator-template -std=c++17 -Wconversion -Wno-sign-conversion -Wextra-semi -Wno-deprecated-copy -Wno-non-c-typedef-for-linkage -ftemplate-backtrace-limit=0 -fno-exceptions -fno-rtti -fno-threadsafe-statics -fvisibility-inlines-hidden -faligned-new=8 -fno-exceptions -c ../../zircon/kernel/lib/libc/snprintf.cc
In file included from ../../zircon/kernel/lib/libc/snprintf.cc:10:
In file included from ../../zircon/kernel/lib/ktl/include/ktl/algorithm.h:10:
In file included from ../../../recipe_cleanup/clangEPCywe/bin/../include/c++/v1/algorithm:643:
In file included from ../../../recipe_cleanup/clangEPCywe/bin/../include/c++/v1/memory:666:
In file included from ../../../recipe_cleanup/clangEPCywe/bin/../include/c++/v1/atomic:550:
In file included from ../../../recipe_cleanup/clangEPCywe/bin/../include/c++/v1/__threading_support:14:
In file included from ../../../recipe_cleanup/clangEPCywe/bin/../include/c++/v1/chrono:827:
../../../recipe_cleanup/clangEPCywe/bin/../include/c++/v1/ctime:49:10: fatal error: 'time.h' file not found
#include <time.h>

Do you know if this is a known or unintended side effect of this patch? I don't think we changed anything on our side that would've caused this.

Builder: https://ci.chromium.org/p/fuchsia/builders/ci/clang_toolchain.fuchsia-x64-debug-subbuild/b8887404211928925456?

In D68480#1893953, @leonardchan wrote:

Hi, a bisect shows that this patch seems to cause time.h to not be found for us:
[...]

Do you know if this is a known or unintended side effect of this patch? I don't think we changed anything on our side that would've caused this.

This must be cause we're now including <chrono> in <atomic>. This is not really something we can (or want to) workaround, since that's in the spec. Are you folks using some fancy C library that doesn't provide <time.h>? If so, I would argue that the fact it worked before this patch is just a coincidence.

In D68480#1893953, @leonardchan wrote:

Hi, a bisect shows that this patch seems to cause time.h to not be found for us:

[1958/45287] CXX kernel-x64-clang/obj/kernel/lib/libc/libc.snprintf.cc.o
FAILED: kernel-x64-clang/obj/kernel/lib/libc/libc.snprintf.cc.o
../../../recipe_cleanup/clangEPCywe/bin/clang++ -MD -MF kernel-x64-clang/obj/kernel/lib/libc/libc.snprintf.cc.o.d -o kernel-x64-clang/obj/kernel/lib/libc/libc.snprintf.cc.o -DTOOLCHAIN_VERSION=/b/s/w/ir/k/recipe_cleanup/clangEPCywe/bin -DZX_ASSERT_LEVEL=2 -DWITH_FRAME_POINTERS=1 -D_LIBCPP_ENABLE_THREAD_SAFETY_ANNOTATIONS=1 -D_LIBCPP_DISABLE_VISIBILITY_ANNOTATIONS -DKERNEL_BASE=0xffffffff80100000 -DSMP_MAX_CPUS=32 -D_KERNEL -DLK -DENABLE_PANIC_SHELL -DWITH_DEBUG_LINEBUFFER -DZIRCON_TOOLCHAIN -DLK_DEBUGLEVEL=2 -DWITH_KERNEL_PCIE -DKERNEL_RETPOLINE=1 -DWITH_UNIFIED_SCHEDULER=1 -DSCHEDULER_TRACING_LEVEL=0 -DARCH_X86 -DKERNEL_LOAD_OFFSET=0x00100000 -D_LIBCPP_DISABLE_EXTERN_TEMPLATE -I../../zircon/kernel/include -I../../zircon/kernel/lib/libc/include -I../../zircon/kernel/lib/ktl/include -I../../zircon/kernel/lib/io/include -I../../zircon/kernel/lib/heap/include -I../../zircon/system/ulib/lazy_init/include -I../../zircon/system/ulib/lockdep/include -I../../zircon/system/ulib/ffl/include -I../../zircon/kernel/vm/include -I../../zircon/kernel/lib/user_copy/include -I../../zircon/system/ulib/zircon-internal/include -I../../zircon/kernel/lib/ktrace/include -I../../zircon/system/ulib/fbl/include -I../../zircon/kernel/lib/fbl/include -I../../zircon/system/public -I../../zircon/kernel/arch/x86/include -I../../zircon/system/ulib/bitmap/include -I../../zircon/kernel/arch/x86/page_tables/include -I../../zircon/system/ulib/hwreg/include -I../../zircon/kernel/lib/heap/include -I../../zircon/kernel/lib/io/include -I../../zircon/kernel/lib/ktl/include -idirafter ../../zircon/kernel/lib/libc/limits-dummy -fno-common --target=x86_64-fuchsia -mcx16 -march=x86-64 -fcrash-diagnostics-dir=clang-crashreports -fcolor-diagnostics -ffile-prefix-map=/b/s/w/ir/k/fuchsia/out/default.zircon=. -ffile-prefix-map=/b/s/w/ir/k/fuchsia/out=.. -ffile-prefix-map=/b/s/w/ir/k/fuchsia=../.. -no-canonical-prefixes -O2 -g3 -Wall -Wextra -Wno-unused-parameter -Wno-address-of-packed-member -Wnewline-eof -Wno-unknown-warning-option -Wno-c99-designator -Wno-int-in-bool-context -Wno-range-loop-analysis -fno-omit-frame-pointer -ffunction-sections -fdata-sections -Wthread-safety -Wimplicit-fallthrough -fvisibility=hidden -ftrivial-auto-var-init=pattern -Werror -Wno-error=deprecated-declarations -fpie -mretpoline -mretpoline-external-thunk -ffreestanding -include ../../zircon/kernel/include/hidden.h -fno-unwind-tables -mno-red-zone -Wformat=2 -Wvla -mno-red-zone -msoft-float -mno-mmx -mno-sse -mno-sse2 -mno-3dnow -mno-avx -mno-avx2 -mcmodel=kernel -fdata-sections -Wno-gnu-string-literal-operator-template -std=c++17 -Wconversion -Wno-sign-conversion -Wextra-semi -Wno-deprecated-copy -Wno-non-c-typedef-for-linkage -ftemplate-backtrace-limit=0 -fno-exceptions -fno-rtti -fno-threadsafe-statics -fvisibility-inlines-hidden -faligned-new=8 -fno-exceptions -c ../../zircon/kernel/lib/libc/snprintf.cc
In file included from ../../zircon/kernel/lib/libc/snprintf.cc:10:
In file included from ../../zircon/kernel/lib/ktl/include/ktl/algorithm.h:10:
In file included from ../../../recipe_cleanup/clangEPCywe/bin/../include/c++/v1/algorithm:643:
In file included from ../../../recipe_cleanup/clangEPCywe/bin/../include/c++/v1/memory:666:
In file included from ../../../recipe_cleanup/clangEPCywe/bin/../include/c++/v1/atomic:550:
In file included from ../../../recipe_cleanup/clangEPCywe/bin/../include/c++/v1/__threading_support:14:
In file included from ../../../recipe_cleanup/clangEPCywe/bin/../include/c++/v1/chrono:827:
../../../recipe_cleanup/clangEPCywe/bin/../include/c++/v1/ctime:49:10: fatal error: 'time.h' file not found
#include <time.h>

Do you know if this is a known or unintended side effect of this patch? I don't think we changed anything on our side that would've caused this.

Builder: https://ci.chromium.org/p/fuchsia/builders/ci/clang_toolchain.fuchsia-x64-debug-subbuild/b8887404211928925456?

Is zircon freestanding, and not providing chrono (which then includes time.h)? That would explain the issue, we'd need to fix freestanding atomic by not exposing time-related stuff if time.h ins't available.

In D68480#1893987, @jfb wrote:

In D68480#1893953, @leonardchan wrote:

Hi, a bisect shows that this patch seems to cause time.h to not be found for us:

[...]

Do you know if this is a known or unintended side effect of this patch? I don't think we changed anything on our side that would've caused this.

Builder: https://ci.chromium.org/p/fuchsia/builders/ci/clang_toolchain.fuchsia-x64-debug-subbuild/b8887404211928925456?

Is zircon freestanding, and not providing chrono (which then includes time.h)? That would explain the issue, we'd need to fix freestanding atomic by not exposing time-related stuff if time.h ins't available.

So, actually, I think we need to have a discussion about the support of Freestanding in libc++. We should either implement it properly and have testers for it, or stop pretending that we do. Right now all I see us doing is adding complexity to our code to fix stuff that breaks on user configurations we don't know about, understand or test. As someone who has to maintain the code with that complexity, this concerns me.

And I do care about Freestanding -- I think it's important and we should support it. But I also think that we should not do it halfway like right now. In the current state of things, I literally have no way to ensure that we won't break that configuration again tomorrow. @leonardchan @phosek Is there a way that you could add a libc++ builder that runs freestanding?

In D68480#1894037, @ldionne wrote:

In D68480#1893987, @jfb wrote:

In D68480#1893953, @leonardchan wrote:

Hi, a bisect shows that this patch seems to cause time.h to not be found for us:

[...]

Do you know if this is a known or unintended side effect of this patch? I don't think we changed anything on our side that would've caused this.

Builder: https://ci.chromium.org/p/fuchsia/builders/ci/clang_toolchain.fuchsia-x64-debug-subbuild/b8887404211928925456?

Is zircon freestanding, and not providing chrono (which then includes time.h)? That would explain the issue, we'd need to fix freestanding atomic by not exposing time-related stuff if time.h ins't available.

So, actually, I think we need to have a discussion about the support of Freestanding in libc++. We should either implement it properly and have testers for it, or stop pretending that we do. Right now all I see us doing is adding complexity to our code to fix stuff that breaks on user configurations we don't know about, understand or test. As someone who has to maintain the code with that complexity, this concerns me.

And I do care about Freestanding -- I think it's important and we should support it. But I also think that we should not do it halfway like right now. In the current state of things, I literally have no way to ensure that we won't break that configuration again tomorrow. @leonardchan @phosek Is there a way that you could add a libc++ builder that runs freestanding?

Agreed, it seems like we want to figure out what individual freestanding subsets can "carve out" of regular C++, and how to do so cleanly. "time" seems like an easy thing to carve out, but it shouldn't just be done here, it should be part of the configuration header, applied consistently through libc++, and extensively tested. Ideally it wouldn't just be a builder: you'd have a directory which tests freestanding configuration options (i.e. "can I include atomic without time support?").

@leonardchan @phosek Is there a way that you could add a libc++ builder that runs freestanding?

We have it as a TODO but can bump it in priority.

Agreed, it seems like we want to figure out what individual freestanding subsets can "carve out" of regular C++, and how to do so cleanly. "time" seems like an easy thing to carve out, but it shouldn't just be done here, it should be part of the configuration header, applied consistently through libc++, and extensively tested. Ideally it wouldn't just be a builder: you'd have a directory which tests freestanding configuration options (i.e. "can I include atomic without time support?").

How simple would it be to carve specifically time.h out? I'm unfamiliar with libcxx internals + configurations. I'm also not insisting that progress on this patch be halted, but I'd like to know if there's anything we could do at least now to provide a workaround for freestanding targets,

In D68480#1894192, @leonardchan wrote:

@leonardchan @phosek Is there a way that you could add a libc++ builder that runs freestanding?

We have it as a TODO but can bump it in priority.

Agreed, it seems like we want to figure out what individual freestanding subsets can "carve out" of regular C++, and how to do so cleanly. "time" seems like an easy thing to carve out, but it shouldn't just be done here, it should be part of the configuration header, applied consistently through libc++, and extensively tested. Ideally it wouldn't just be a builder: you'd have a directory which tests freestanding configuration options (i.e. "can I include atomic without time support?").

How simple would it be to carve specifically time.h out? I'm unfamiliar with libcxx internals + configurations. I'm also not insisting that progress on this patch be halted, but I'd like to know if there's anything we could do at least now to provide a workaround for freestanding targets,

As a non-libc++ maintainer (so take what I say with some doubt), I'd imagine things being detected in libcxx/include/__config, say through __has_include(time.h), and then you'd have a macro which says "no time". I'd then figure out the transitive inclusions of time.h in all libc++ headers, figure out want pulls them it, and use the macro to not expose the corresponding functions / types on those platforms.

You'd then need a decent amount of testing to make sure this works well.

Concretely here, there are a few atomic functions which require chrono, so you'd want to conditionally include chrono and conditionally define those functions. I think the patch which does this should also handle time in other libc++ headers, so we're not in a situation of "we kinda support having no time.h, but not consistently".

ldionne added inline comments.Mar 18 2020, 2:14 PM

libcxx/include/atomic
443	Ping @__simt__

Herald added a reviewer: Restricted Project. · View Herald TranscriptMar 18 2020, 2:14 PM

__simt__ marked an inline comment as done.Mar 18 2020, 3:09 PM

__simt__ added inline comments.

libcxx/include/atomic
443	Yep. Thanks for the ping.

• Quuxplusone mentioned this in D92240: [libc++] Consistently unparenthesize `numeric_limits<T>::max`. NFCI..Nov 27 2020, 12:22 PM

curdeius mentioned this in D93025: [libc++] Remove invalid use of `#if _LIBCPP_STD_VER >= 11`, as `_LIBCPP_STD_VER` can never be less than 11..Dec 10 2020, 7:07 AM

cjdb mentioned this in D103551: [libcxx][module-map] creates submodules for private headers.Jun 2 2021, 3:08 PM

ldionne mentioned this in rGc0efe8f26635: [libc++][NFC] Reformat comment about D68480 support.Nov 22 2021, 10:34 AM

This appears to have baked in some ABI details that don't permit efficient implementation on some of our supported platforms. In the future, please can you post an RFC from things that need to integrate with platform-specific code? We're now stuck with this interface until we are willing to do an ABI-breaking change. In particular:

OpenBSD provides a Linux-compatible futex that requires a 32-bit integer to be used as the key. This can't be used with the existing ABI, which has baked in 64-bit integers as the size for OpenBSD.
FreeBSD's _umtx_op uses long and so is able to support the ABI on 64-bit platforms (LP64) but not on 32-bit ones.
Windows' WaitOnAddress supports 1, 2, 4, and 8-byte objects, yet can be used only with 8-byte objects in this interface.
The semaphore interface does not require exposing the underlying semaphore type, so could have been implemented directly with _umtx_op on FreeBSD, rather than with the wrapper. A binary semaphore is trivial to implement with a futex with the fast path in the uncontended case a single inline atomic op. This is not possible without breaking the ABI in the current design.

The atomic wait / wake problems could have been addressed by making the size an explicit argument, rather than relying on overload resolution. The semaphore issue could have been addressed by asking for contributions to the __threading_support bits before a release was cut.

In D68480#3179355, @theraven wrote:

This appears to have baked in some ABI details that don't permit efficient implementation on some of our supported platforms.

This is unfortunate. Have you folks shipped this yet? Perhaps it's not too late to fix things now if you've only released it very recently.

In the future, please can you post an RFC from things that need to integrate with platform-specific code? We're now stuck with this interface until we are willing to do an ABI-breaking change.

With all due respect, I consider that the responsibility is yours. In May 2019, Olivier started a thread where he was gathering feedback on what was to become the synchronization library: https://lists.llvm.org/pipermail/libcxx-dev/2019-May/000396.html. The thread got very little traction (see the June 2019 archive for the only few replies). Then, this patch was created and it was under review for roughly 6 months. Anyone paying attention to libc++ development could have seen this go by.

The reality is that if you ship libc++ on your platform, you should have someone caring about libc++ development to ensure it plays well with your platform. There are so many people using libc++ in various ways (vendors and others) that it's difficult for the core contributors to make sure that everybody "gets the memo" whenever we do something. Don't get me wrong, I very often go out of my way to ensure that vendors get notified when we make potentially contentious changes (I often ping a bunch of people that I know vend the library). However, all of that is based on basically me knowing (or thinking I know) who vends the library for what platform. None of it is really formalized, and misses can and do happen.

Furthermore, I would like to point out that libc++ goes out of its way to ensure that these sorts of things don't happen. We have availability annotations, for example, that allow vendors to control when users are able to start relying on features. In this case, I used those annotations for solving exactly the problem you encountered: I ensured that users were not going to start relying on the synchronization library on Apple platforms until we could confirm it was ABI stable, and I only flipped the switch one year after it had been implemented (see D96790). I've set things up so that any vendor could take advantage of those (they were previously Apple-specific), and I've always been surprised that no other vendors cared enough about QoI on their platforms to start using that infrastructure. If you're not using the tools provided to control what you ship on your platform, you can't expect someone who might not even know your existence to do it for you.

I would also like to point out that more than one year after pre-commit CI has been set up, we still don't have a FreeBSD and OpenBSD bots in BuildKite. Technically, that makes those two platforms unsupported. They are important platforms and we definitely want to support them, but we need a bit more energy from the vendor side here. Not very much, just a bit.

So, concretely, here are some action items for OpenBSD and FreeBSD vendors to become a well-behaved citizen and make sure such issues don't happen again in the future:

Please set up BuildKite bots for OpenBSD and FreeBSD. I'll help you do it, it's easy. Then, we'll officially support those platforms and document it in our documentation.
If you vend libc++, please add yourself to this Herald group: https://reviews.llvm.org/project/view/109/. I created that group recently so I could formalize the notion of "begin a libc++ vendor" and make it easier to ping everybody.
Please have someone subscribe to libcxx-commits or use some Herald rules to be notified about reviews. This should ensure that, even if we don't ping the vendor group, there's some chance that you folks will see it. You don't have to engage in reviews or anything, but even just reviewing the subject lines once per week should be enough to ensure things like this don't happen again. That seems like a reasonable investment for an organization that ships libc++ on their platform.
Setup availability annotations for your platform. See libcxx/include/__availability for details.

Now, concretely, please let me know if you think it's feasible to fix this now that you've shipped it. If so, we can work together on that. Otherwise, I'm sorry this happened, and the steps above should ensure it doesn't again.

In D68480#3179526, @ldionne wrote:

In D68480#3179355, @theraven wrote:

This appears to have baked in some ABI details that don't permit efficient implementation on some of our supported platforms.

This is unfortunate. Have you folks shipped this yet? Perhaps it's not too late to fix things now if you've only released it very recently.

I am not sure who you mean when you say 'you' here. I am writing this as:

A member of the LLVM project and libc++ contributor
A libc++ consumer.

Since Howard's initial commit, libc++ has had strong ABI backwards compatibility guarantees for anything outside of the experimental namespace. Any time we define an interface between the library and the headers, that is an ABI that we need to support in the long term.

I came to this particular patch because one of my colleagues pointed me at atomic_wait as a possible replacement for platform-specific wrappers around futex-like abstractions in our code. I then tried to add FreeBSD and Windows support (the two platforms I care about for our code that do not have platform-specific code paths in libc++) and discovered that the ABI that we have committed to supporting cannot be implemented *at all* on 32-bit architectures with FreeBSD and cannot expose all of the functionality on Windows.

This is because this patch set decided to add a new pattern, rather than copying the same approach that every other atomic op uses: provide _1, _2, _4, _8, _16 and _n implementations. This also means that if Linux adds a futex64 system call (which I consider fairly probable, given that it is painful on Linux that you can't use a futex with pointers currently) then we can't take advantage of it without an ABI break.

In the future, please can you post an RFC from things that need to integrate with platform-specific code? We're now stuck with this interface until we are willing to do an ABI-breaking change.

With all due respect, I consider that the responsibility is yours. In May 2019, Olivier started a thread where he was gathering feedback on what was to become the synchronization library: https://lists.llvm.org/pipermail/libcxx-dev/2019-May/000396.html. The thread got very little traction (see the June 2019 archive for the only few replies). Then, this patch was created and it was under review for roughly 6 months. Anyone paying attention to libc++ development could have seen this go by.

The subject of that thread does not in any way mention that this requires platform-specific logic. Putting 'platform-specific futex-like' or similar in the subject would have made people pay attention. No one was tagged as a reviewer here to request perspectives from other platforms (dim or joerg, for example). It is the responsibility of the committer to ensure that a patch has adequate reviews. This did not happen here and we are now stuck with this ABI. I don't know how we can fix this, without adding a new version in the __2 namespace and eventually having a SONAME bump.

In D68480#3216178, @theraven wrote:

In D68480#3179526, @ldionne wrote:

In D68480#3179355, @theraven wrote:

This appears to have baked in some ABI details that don't permit efficient implementation on some of our supported platforms.

This is unfortunate. Have you folks shipped this yet? Perhaps it's not too late to fix things now if you've only released it very recently.

I am not sure who you mean when you say 'you' here. I am writing this as:

A member of the LLVM project and libc++ contributor

A libc++ consumer.

Right, "you" might not be the appropriate target here, I was thinking about whoever vends libc++ on FreeBSD (I wrongly assumed you were working for that vendor). My point is that nobody cares about ABI breaks for a platform unless the library is vended on that platform, in which case that vendor is the one who cares about ABI stability. In this case, that vendor is FreeBSD, and I consider that whoever vends libc++ on FreeBSD unfortunately missed the boat on this review. In FreeBSD's defence, it's true that we were not as well organized as we are now when we shipped this patch -- we now have a more detailed support policy and ties between various vendors and developers are somewhat clearer.

I will argue that breaking this ABI on Windows is not a big deal since nobody's vending libc++ on Windows. Or at least if they do, they are completely invisible to us ("us" being the people who develop libc++).

Since Howard's initial commit, libc++ has had strong ABI backwards compatibility guarantees for anything outside of the experimental namespace. Any time we define an interface between the library and the headers, that is an ABI that we need to support in the long term.

While I agree to some extent, I think it is important to highlight that ABI stability is not a property of the library itself, but a property of a specific vended instance of the library. For example, Chrome uses libc++, but they don't care about ABI stability because they link it statically. So we provide them with knobs to change libc++ behavior in various ways that are not ABI stable. On the other hand, some vendors like Apple do care about ABI stability for various reasons, and those vendors ensure that the ABI isn't broken on their platform, with the flavor of libc++ that they ship. And similarly, libc++ provides various knobs to help these vendors NOT break the ABI on their platforms.

In other words, over time, libc++ has grown from "ABI stability at all costs" to "providing tools to tweak it the way you want and be ABI stable if you want, but ABI unstable if you don't". That was the response we came up with because different consumers have different use cases for the library.

Where I'm going with this is that when Olivier created this review, I saw it had ABI implications and did what I needed to do for the vendor I represent. In the presence of a vast number of consumers of libc++, it's impossible for Olivier, or I, or any individual, to know about all the consumers that might not like the specific ABI that was selected. It might be obvious to you cause you've looked into it, but in the general case, the responsibility is on the vendors of the library to be at least a bit aware of what's going on and make sure they know what they are shipping on their platform. Cause that's the root cause of the issue -- this sub-optimal ABI was shipped on FreeBSD (hence locking them into it) without FreeBSD even knowing about it. That's really unfortunate, but like I said, libc++ provides ways to help vendors control what they ship on their platform, and they need only minimal involvement for this to happen. For instance, I would really welcome FreeBSD (and other vendors) taking advantage of the availability annotations I maintain. If FreeBSD had used those, it might have given some indication that FreeBSD might not want to ship those new symbols immediately.

I came to this particular patch because one of my colleagues pointed me at atomic_wait as a possible replacement for platform-specific wrappers around futex-like abstractions in our code. I then tried to add FreeBSD and Windows support (the two platforms I care about for our code that do not have platform-specific code paths in libc++) and discovered that the ABI that we have committed to supporting cannot be implemented *at all* on 32-bit architectures with FreeBSD and cannot expose all of the functionality on Windows.

Would you be willing to open a patch showing what changes are necessary for each of those? I think we can totally break the ABI on Windows since nobody's shipping it. And on 32-bit FreeBSD, if things don't work at all right now, then we can also "break the ABI" on that configuration since nobody can be depending on something that doesn't work. Perhaps things are not as bad as they seem?

Also, on a different note: I'm pretty serious about FreeBSD needing to implement pre-commit CI if it wants to be supported officially. It's a small investment, but it needs to happen for libc++ to keep FreeBSD as part of its list of supported platforms. @dim can you help with this?

In D68480#3218277, @ldionne wrote:

I will argue that breaking this ABI on Windows is not a big deal since nobody's vending libc++ on Windows. Or at least if they do, they are completely invisible to us ("us" being the people who develop libc++).

Would you be willing to open a patch showing what changes are necessary for each of those? I think we can totally break the ABI on Windows since nobody's shipping it.

I would like to add some nuance to these statements here.

I do ship shared linked libc++ on Windows (as part of the llvm-mingw toolchain distribution), but without any strong ABI guarantees. Ideally ABI wouldn't break between releases, but if it does (especially in a fringe area) it's probably tolerable.

MSYS2 also ships an environment where libc++ is the standard system C++ library, linked shared. That environment is considered somewhat experimental afaik (CC @mati865), so if it breaks some package (which can be fixed by rebuilding), I think they'd consider it tolerable. If none of their packages actually have ended up using and depending on the aspects that may change (I haven't followed closely exactly how widespread it is), the ABI break should be totally transparent though.

In D68480#3218277, @ldionne wrote:

Also, on a different note: I'm pretty serious about FreeBSD needing to implement pre-commit CI if it wants to be supported officially. It's a small investment, but it needs to happen for libc++ to keep FreeBSD as part of its list of supported platforms. @dim can you help with this?

I'm a FreeBSD committer but I don't have the access to setup e.g. bots on FreeBSD infrastructure. I hope @emaste can assist with this, as a FreeBSD Foundation member; I have discussed it with him before, and he seemed to be willing but probably runs out of time all the time... :)

In D68480#3218341, @mstorsjo wrote:

In D68480#3218277, @ldionne wrote:

I will argue that breaking this ABI on Windows is not a big deal since nobody's vending libc++ on Windows. Or at least if they do, they are completely invisible to us ("us" being the people who develop libc++).

Would you be willing to open a patch showing what changes are necessary for each of those? I think we can totally break the ABI on Windows since nobody's shipping it.

I would like to add some nuance to these statements here.

I do ship shared linked libc++ on Windows (as part of the llvm-mingw toolchain distribution), but without any strong ABI guarantees. Ideally ABI wouldn't break between releases, but if it does (especially in a fringe area) it's probably tolerable.

MSYS2 also ships an environment where libc++ is the standard system C++ library, linked shared. That environment is considered somewhat experimental afaik (CC @mati865), so if it breaks some package (which can be fixed by rebuilding), I think they'd consider it tolerable. If none of their packages actually have ended up using and depending on the aspects that may change (I haven't followed closely exactly how widespread it is), the ABI break should be totally transparent though.

Thanks for pinging me.
Actually I'd not consider it that much experimental now, it's available by default using pacman, it's mentioned on the website and people are interested in it.
Personally I won't hold back small breakage when it's justified (like here). Ping about such change welcome though so we can better prepare for it.

In D68480#3218341, @mstorsjo wrote:

In D68480#3218277, @ldionne wrote:

I will argue that breaking this ABI on Windows is not a big deal since nobody's vending libc++ on Windows. Or at least if they do, they are completely invisible to us ("us" being the people who develop libc++).

Would you be willing to open a patch showing what changes are necessary for each of those? I think we can totally break the ABI on Windows since nobody's shipping it.

I would like to add some nuance to these statements here.

I do ship shared linked libc++ on Windows (as part of the llvm-mingw toolchain distribution), but without any strong ABI guarantees. Ideally ABI wouldn't break between releases, but if it does (especially in a fringe area) it's probably tolerable.

MSYS2 also ships an environment where libc++ is the standard system C++ library, linked shared. That environment is considered somewhat experimental afaik (CC @mati865), so if it breaks some package (which can be fixed by rebuilding), I think they'd consider it tolerable. If none of their packages actually have ended up using and depending on the aspects that may change (I haven't followed closely exactly how widespread it is), the ABI break should be totally transparent though.

Thanks for clarifying, I wasn't aware of that. Like I said, it's incredibly difficult to keep track of everyone who ships libc++, and especially who provides what guarantees to their users.

I added @emaste and @mati865 to the libcxx-vendors "team" here so we can ping you in these situations.

mstorsjo mentioned this in D124519: [libcxx] Switch __cxx_contention_t to int32_t on 32 bit AIX.Apr 27 2022, 3:54 AM

mstorsjo mentioned this in rG39328a658181: [libcxx] Switch __cxx_contention_t to int32_t on 32 bit AIX.May 12 2022, 9:01 AM

MBkkt added a subscriber: MBkkt.Oct 14 2022, 12:02 PM

MBkkt added inline comments.

libcxx/src/atomic.cpp
39	Why do you use timed wait here? It's strange, and also different with macos(ulock) behavior

Herald added a project: Restricted Project. · View Herald TranscriptOct 14 2022, 12:02 PM

__simt__ added inline comments.Oct 14 2022, 12:44 PM

libcxx/src/atomic.cpp
39	Because it would be incorrect otherwise. There's an infinitesimal but not zero probability that the serial number being waited on can roll over, incremented by precisely the right value, and then you might think it didn't change when it did. There are a lot of weird discussions we could have from here. Does this negate Futex? I don't think it should. Is it even worth worrying about (like, the computer might be hit by a neutron flying in from space much more often than this)? I think it's cheap to mitigate. There's a judgement-call here about how long to wait. It's arbitrary. We wouldn't need to do this if Linux supported 64-bit Futex. Partly because we would rarely use serial numbers like this, and also partly because 64-bit numbers don't roll over inside of the useful life span of computer hardware.

MBkkt added inline comments.Oct 14 2022, 12:55 PM

libcxx/src/atomic.cpp
39	First of thanks for answer. But if I understand correctly it's only about not atomic<int> behavior (any atomic which use waiter pool) So for atomic<int> it's not needed (also why unsigned int not used in wait without waiter pool?) Another thought, for ulock behavior should be same?

Revision Contents

Path

Size

libcxx/

include/

3 lines

221 lines

386 lines

314 lines

8 lines

8 lines

122 lines

12 lines

282 lines

12 lines

8 lines

src/

3 lines

30 lines

26 lines

191 lines

test/

libcxx/

double_include.sh.cpp

3 lines

std/

atomics/

atomics.types.operations/

atomics.types.operations.wait/

atomic_wait.pass.cpp

56 lines

thread/

thread.barrier/

arrive.pass.cpp

32 lines

arrive_and_drop.pass.cpp

30 lines

arrive_and_wait.pass.cpp

31 lines

completion.pass.cpp

36 lines

version.pass.cpp

24 lines

thread.latch/

arrive_and_wait.pass.cpp

29 lines

count_down.pass.cpp

30 lines

try_wait.pass.cpp

27 lines

version.pass.cpp

24 lines

thread.semaphore/

30 lines

37 lines

27 lines

33 lines

43 lines

33 lines

24 lines

Diff 225615

libcxx/include/CMakeLists.txt

Show All 19 Lines	set(files
__threading_support		__threading_support
__tree		__tree
__tuple		__tuple
__undef_macros		__undef_macros
algorithm		algorithm
any		any
array		array
atomic		atomic
		barrier
bit		bit
bitset		bitset
cassert		cassert
ccomplex		ccomplex
cctype		cctype
cerrno		cerrno
cfenv		cfenv
cfloat		cfloat
▲ Show 20 Lines • Show All 62 Lines • ▼ Show 20 Lines	set(files
initializer_list		initializer_list
inttypes.h		inttypes.h
iomanip		iomanip
ios		ios
iosfwd		iosfwd
iostream		iostream
istream		istream
iterator		iterator
		latch
limits		limits
limits.h		limits.h
list		list
locale		locale
locale.h		locale.h
map		map
math.h		math.h
memory		memory
module.modulemap		module.modulemap
mutex		mutex
new		new
numeric		numeric
optional		optional
ostream		ostream
queue		queue
random		random
ratio		ratio
regex		regex
scoped_allocator		scoped_allocator
		semaphore
set		set
setjmp.h		setjmp.h
shared_mutex		shared_mutex
span		span
sstream		sstream
stack		stack
stdbool.h		stdbool.h
stddef.h		stddef.h
▲ Show 20 Lines • Show All 142 Lines • Show Last 20 Lines

libcxx/include/__threading_support

// -- C++ --		// -- C++ --
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
//		//
// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.		// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
// See https://llvm.org/LICENSE.txt for license information.		// See https://llvm.org/LICENSE.txt for license information.
// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception		// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#ifndef _LIBCPP_THREADING_SUPPORT		#ifndef _LIBCPP_THREADING_SUPPORT
#define _LIBCPP_THREADING_SUPPORT		#define _LIBCPP_THREADING_SUPPORT

#include <__config>		#include <__config>
#include <chrono>		#include <chrono>
#include <iosfwd>		#include <iosfwd>
#include <errno.h>		#include <errno.h>
		#include <climits>

		#if defined(__linux__)
		# include <unistd.h>
		# include <linux/futex.h>
		# include <sys/syscall.h>
		#endif

#ifndef _LIBCPP_HAS_NO_PRAGMA_SYSTEM_HEADER		#ifndef _LIBCPP_HAS_NO_PRAGMA_SYSTEM_HEADER
#pragma GCC system_header		#pragma GCC system_header
#endif		#endif

#if defined(_LIBCPP_HAS_THREAD_API_EXTERNAL)		#if defined(_LIBCPP_HAS_THREAD_API_EXTERNAL)
# include <__external_threading>		# include <__external_threading>
#elif !defined(_LIBCPP_HAS_NO_THREADS)		#elif !defined(_LIBCPP_HAS_NO_THREADS)

#if defined(_LIBCPP_HAS_THREAD_API_PTHREAD)		#if defined(_LIBCPP_HAS_THREAD_API_PTHREAD)
# include <pthread.h>		# include <pthread.h>
# include <sched.h>		# include <sched.h>
		# include <semaphore.h>
		zoecarverUnsubmitted Done Reply Inline Actions Are all the platforms we support guaranteed to have this header (they very well might, I just don't know)? zoecarver: Are all the platforms we support guaranteed to have this header (they very well might, I just…
		__simt__AuthorUnsubmitted Done Reply Inline Actions This header is part of Pthreads and this #include is in the Pthreads section of the header, so I think the answer is yes. The Apple case is special in that they have deprecated this header. __simt__: This header is part of Pthreads and this #include is in the Pthreads section of the header, so…
		# if defined(__APPLE__)
		# include <dispatch/dispatch.h>
		zoecarverUnsubmitted Done Reply Inline Actions Is this macro still needed? zoecarver: Is this macro still needed?
		__simt__AuthorUnsubmitted Done Reply Inline Actions Yes. We need a way to disable the platform native semaphores. Both Apple (in the current design) and CUDA need it. __simt__: Yes. We need a way to disable the platform native semaphores. Both Apple (in the current…
		# endif
#endif		#endif
		zoecarverUnsubmitted Not Done Reply Inline Actions This could be an else block, no? zoecarver: This could be an else block, no?
		__simt__AuthorUnsubmitted Not Done Reply Inline Actions It would have to be an #elif !defined(...) so that CUDA could trigger it too. __simt__: It would have to be an #elif !defined(...) so that CUDA could trigger it too.

		ldionneUnsubmitted Not Done Reply Inline Actions Any reason to introduce this macro instead of just use `!defined(_LIBCPP_NO_NATIVE_SEMAPHORES)` when you need it? ldionne: Any reason to introduce this macro instead of just use `!defined(_LIBCPP_NO_NATIVE_SEMAPHORES)`…
		__simt__AuthorUnsubmitted Done Reply Inline Actions I was going to say it’s more convenient for me, but I’m not even convinced. I can streamline this. __simt__: I was going to say it’s more convenient for me, but I’m not even convinced. I can streamline…
#if defined(_LIBCPP_HAS_THREAD_LIBRARY_EXTERNAL) \|\| \		#if defined(_LIBCPP_HAS_THREAD_LIBRARY_EXTERNAL) \|\| \
defined(_LIBCPP_BUILDING_THREAD_LIBRARY_EXTERNAL) \|\| \		defined(_LIBCPP_BUILDING_THREAD_LIBRARY_EXTERNAL) \|\| \
defined(_LIBCPP_HAS_THREAD_API_WIN32)		defined(_LIBCPP_HAS_THREAD_API_WIN32)
#define _LIBCPP_THREAD_ABI_VISIBILITY _LIBCPP_FUNC_VIS		#define _LIBCPP_THREAD_ABI_VISIBILITY _LIBCPP_FUNC_VIS
#else		#else
#define _LIBCPP_THREAD_ABI_VISIBILITY inline _LIBCPP_INLINE_VISIBILITY		#define _LIBCPP_THREAD_ABI_VISIBILITY inline _LIBCPP_INLINE_VISIBILITY
#endif		#endif

Show All 19 Lines
#define _LIBCPP_MUTEX_INITIALIZER PTHREAD_MUTEX_INITIALIZER		#define _LIBCPP_MUTEX_INITIALIZER PTHREAD_MUTEX_INITIALIZER

typedef pthread_mutex_t __libcpp_recursive_mutex_t;		typedef pthread_mutex_t __libcpp_recursive_mutex_t;

// Condition Variable		// Condition Variable
typedef pthread_cond_t __libcpp_condvar_t;		typedef pthread_cond_t __libcpp_condvar_t;
#define _LIBCPP_CONDVAR_INITIALIZER PTHREAD_COND_INITIALIZER		#define _LIBCPP_CONDVAR_INITIALIZER PTHREAD_COND_INITIALIZER

		// Semaphore
		#if defined(__APPLE__)
		typedef dispatch_semaphore_t __libcpp_semaphore_t;
		zoecarverUnsubmitted Not Done Reply Inline Actions Are there any platforms where `SEM_VALUE_MAX` doesn't exist? Maybe some BSD platform? Could you check that it is defined (or maybe that we're on a certain platform)? zoecarver: Are there any platforms where `SEM_VALUE_MAX` doesn't exist? Maybe some BSD platform? Could you…
		__simt__AuthorUnsubmitted Not Done Reply Inline Actions Like the header, it's mandated by Pthreads. It's conceivable that some BSD is non-conforming and we still want to work on it, but I don't have a BSD system handy to try. __simt__: Like the header, it's mandated by Pthreads. It's conceivable that some BSD is non-conforming…
		# define _LIBCPP_SEMAPHORE_MAX numeric_limits<long>::max()
		#else
		typedef sem_t __libcpp_semaphore_t;
		# define _LIBCPP_SEMAPHORE_MAX SEM_VALUE_MAX
		#endif

// Execute once		// Execute once
typedef pthread_once_t __libcpp_exec_once_flag;		typedef pthread_once_t __libcpp_exec_once_flag;
#define _LIBCPP_EXEC_ONCE_INITIALIZER PTHREAD_ONCE_INIT		#define _LIBCPP_EXEC_ONCE_INITIALIZER PTHREAD_ONCE_INIT

// Thread id		// Thread id
typedef pthread_t __libcpp_thread_id;		typedef pthread_t __libcpp_thread_id;

// Thread		// Thread
Show All 36 Lines

// Thread Local Storage		// Thread Local Storage
typedef long __libcpp_tls_key;		typedef long __libcpp_tls_key;

#define _LIBCPP_TLS_DESTRUCTOR_CC __stdcall		#define _LIBCPP_TLS_DESTRUCTOR_CC __stdcall
#endif // !defined(_LIBCPP_HAS_THREAD_API_PTHREAD) && !defined(_LIBCPP_HAS_THREAD_API_EXTERNAL)		#endif // !defined(_LIBCPP_HAS_THREAD_API_PTHREAD) && !defined(_LIBCPP_HAS_THREAD_API_EXTERNAL)

#if !defined(_LIBCPP_HAS_THREAD_API_EXTERNAL)		#if !defined(_LIBCPP_HAS_THREAD_API_EXTERNAL)

		_LIBCPP_THREAD_ABI_VISIBILITY
		__libcpp_timespec_t __libcpp_to_timespec(chrono::nanoseconds __ns, bool __absolute);

// Mutex		// Mutex
_LIBCPP_THREAD_ABI_VISIBILITY		_LIBCPP_THREAD_ABI_VISIBILITY
int __libcpp_recursive_mutex_init(__libcpp_recursive_mutex_t *__m);		int __libcpp_recursive_mutex_init(__libcpp_recursive_mutex_t *__m);

_LIBCPP_THREAD_ABI_VISIBILITY _LIBCPP_NO_THREAD_SAFETY_ANALYSIS		_LIBCPP_THREAD_ABI_VISIBILITY _LIBCPP_NO_THREAD_SAFETY_ANALYSIS
int __libcpp_recursive_mutex_lock(__libcpp_recursive_mutex_t *__m);		int __libcpp_recursive_mutex_lock(__libcpp_recursive_mutex_t *__m);

_LIBCPP_THREAD_ABI_VISIBILITY _LIBCPP_NO_THREAD_SAFETY_ANALYSIS		_LIBCPP_THREAD_ABI_VISIBILITY _LIBCPP_NO_THREAD_SAFETY_ANALYSIS
Show All 29 Lines

_LIBCPP_THREAD_ABI_VISIBILITY _LIBCPP_NO_THREAD_SAFETY_ANALYSIS		_LIBCPP_THREAD_ABI_VISIBILITY _LIBCPP_NO_THREAD_SAFETY_ANALYSIS
int __libcpp_condvar_timedwait(__libcpp_condvar_t __cv, __libcpp_mutex_t __m,		int __libcpp_condvar_timedwait(__libcpp_condvar_t __cv, __libcpp_mutex_t __m,
__libcpp_timespec_t *__ts);		__libcpp_timespec_t *__ts);

_LIBCPP_THREAD_ABI_VISIBILITY		_LIBCPP_THREAD_ABI_VISIBILITY
int __libcpp_condvar_destroy(__libcpp_condvar_t* __cv);		int __libcpp_condvar_destroy(__libcpp_condvar_t* __cv);

		// Semaphore
		_LIBCPP_THREAD_ABI_VISIBILITY
		bool __libcpp_semaphore_init(__libcpp_semaphore_t* __sem, int __init);

		_LIBCPP_THREAD_ABI_VISIBILITY
		bool __libcpp_semaphore_destroy(__libcpp_semaphore_t* __sem);

		_LIBCPP_THREAD_ABI_VISIBILITY
		bool __libcpp_semaphore_post(__libcpp_semaphore_t* __sem);

		_LIBCPP_THREAD_ABI_VISIBILITY
		bool __libcpp_semaphore_wait(__libcpp_semaphore_t* __sem);

		_LIBCPP_THREAD_ABI_VISIBILITY
		bool __libcpp_semaphore_wait_timed(__libcpp_semaphore_t* __sem, chrono::nanoseconds __ns);

// Execute once		// Execute once
_LIBCPP_THREAD_ABI_VISIBILITY		_LIBCPP_THREAD_ABI_VISIBILITY
int __libcpp_execute_once(__libcpp_exec_once_flag *flag,		int __libcpp_execute_once(__libcpp_exec_once_flag *flag,
void (*init_routine)());		void (*init_routine)());

// Thread id		// Thread id
_LIBCPP_THREAD_ABI_VISIBILITY		_LIBCPP_THREAD_ABI_VISIBILITY
bool __libcpp_thread_id_equal(__libcpp_thread_id t1, __libcpp_thread_id t2);		bool __libcpp_thread_id_equal(__libcpp_thread_id t1, __libcpp_thread_id t2);
Show All 20 Lines

_LIBCPP_THREAD_ABI_VISIBILITY		_LIBCPP_THREAD_ABI_VISIBILITY
int __libcpp_thread_detach(__libcpp_thread_t *__t);		int __libcpp_thread_detach(__libcpp_thread_t *__t);

_LIBCPP_THREAD_ABI_VISIBILITY		_LIBCPP_THREAD_ABI_VISIBILITY
void __libcpp_thread_yield();		void __libcpp_thread_yield();

_LIBCPP_THREAD_ABI_VISIBILITY		_LIBCPP_THREAD_ABI_VISIBILITY
		void __libcpp_thread_yield_processor();

		_LIBCPP_THREAD_ABI_VISIBILITY
void __libcpp_thread_sleep_for(const chrono::nanoseconds& __ns);		void __libcpp_thread_sleep_for(const chrono::nanoseconds& __ns);

		template<class _Fn>
		_LIBCPP_THREAD_ABI_VISIBILITY
		bool __libcpp_thread_poll_with_backoff(_Fn && __f, chrono::nanoseconds __max = chrono::nanoseconds::zero());

// Thread local storage		// Thread local storage
_LIBCPP_THREAD_ABI_VISIBILITY		_LIBCPP_THREAD_ABI_VISIBILITY
int __libcpp_tls_create(__libcpp_tls_key* __key,		int __libcpp_tls_create(__libcpp_tls_key* __key,
void(_LIBCPP_TLS_DESTRUCTOR_CC* __at_exit)(void*));		void(_LIBCPP_TLS_DESTRUCTOR_CC* __at_exit)(void*));

_LIBCPP_THREAD_ABI_VISIBILITY		_LIBCPP_THREAD_ABI_VISIBILITY
void *__libcpp_tls_get(__libcpp_tls_key __key);		void *__libcpp_tls_get(__libcpp_tls_key __key);

_LIBCPP_THREAD_ABI_VISIBILITY		_LIBCPP_THREAD_ABI_VISIBILITY
int __libcpp_tls_set(__libcpp_tls_key __key, void *__p);		int __libcpp_tls_set(__libcpp_tls_key __key, void *__p);

#endif // !defined(_LIBCPP_HAS_THREAD_API_EXTERNAL)		#endif // !defined(_LIBCPP_HAS_THREAD_API_EXTERNAL)

#if (!defined(_LIBCPP_HAS_THREAD_LIBRARY_EXTERNAL) \|\| \		#if (!defined(_LIBCPP_HAS_THREAD_LIBRARY_EXTERNAL) \|\| \
defined(_LIBCPP_BUILDING_THREAD_LIBRARY_EXTERNAL)) && \		defined(_LIBCPP_BUILDING_THREAD_LIBRARY_EXTERNAL)) && \
defined(_LIBCPP_HAS_THREAD_API_PTHREAD)		defined(_LIBCPP_HAS_THREAD_API_PTHREAD)

		__libcpp_timespec_t __libcpp_to_timespec(chrono::nanoseconds __ns, bool __absolute) {

		using namespace chrono;
		if(__absolute)
		__ns += chrono::system_clock::now().time_since_epoch();
		seconds __s = duration_cast<seconds>(__ns);
		__libcpp_timespec_t __ts;
		typedef decltype(__ts.tv_sec) ts_sec;
		_LIBCPP_CONSTEXPR ts_sec __ts_sec_max = numeric_limits<ts_sec>::max();

		if (__s.count() < __ts_sec_max)
		{
		__ts.tv_sec = static_cast<ts_sec>(__s.count());
		__ts.tv_nsec = static_cast<decltype(__ts.tv_nsec)>((__ns - __s).count());
		}
		else
		{
		__ts.tv_sec = __ts_sec_max;
		__ts.tv_nsec = 999999999; // (10^9 - 1)
		}
		return __ts;
		}

int __libcpp_recursive_mutex_init(__libcpp_recursive_mutex_t *__m)		int __libcpp_recursive_mutex_init(__libcpp_recursive_mutex_t *__m)
{		{
pthread_mutexattr_t attr;		pthread_mutexattr_t attr;
int __ec = pthread_mutexattr_init(&attr);		int __ec = pthread_mutexattr_init(&attr);
if (__ec)		if (__ec)
return __ec;		return __ec;
__ec = pthread_mutexattr_settype(&attr, PTHREAD_MUTEX_RECURSIVE);		__ec = pthread_mutexattr_settype(&attr, PTHREAD_MUTEX_RECURSIVE);
if (__ec) {		if (__ec) {
▲ Show 20 Lines • Show All 75 Lines • ▼ Show 20 Lines	int __libcpp_condvar_timedwait(__libcpp_condvar_t __cv, __libcpp_mutex_t __m,
return pthread_cond_timedwait(__cv, __m, __ts);		return pthread_cond_timedwait(__cv, __m, __ts);
}		}

int __libcpp_condvar_destroy(__libcpp_condvar_t *__cv)		int __libcpp_condvar_destroy(__libcpp_condvar_t *__cv)
{		{
return pthread_cond_destroy(__cv);		return pthread_cond_destroy(__cv);
}		}

		// Semaphore
		#if defined(__APPLE__)

		bool __libcpp_semaphore_init(__libcpp_semaphore_t* __sem, int __init)
		{
		return (*__sem = dispatch_semaphore_create(__init)) != NULL;
		}

		bool __libcpp_semaphore_destroy(__libcpp_semaphore_t* __sem)
		{
		dispatch_release(*__sem);
		return true;
		}

		bool __libcpp_semaphore_post(__libcpp_semaphore_t* __sem)
		{
		dispatch_semaphore_signal(*__sem);
		return true;
		}

		bool __libcpp_semaphore_wait(__libcpp_semaphore_t* __sem)
		{
		return dispatch_semaphore_wait(*__sem, DISPATCH_TIME_FOREVER) == 0;
		}

		bool __libcpp_semaphore_wait_timed(__libcpp_semaphore_t* __sem, chrono::nanoseconds __ns)
		{
		return dispatch_semaphore_wait(*__sem, dispatch_time(DISPATCH_TIME_NOW, __ns.count())) == 0;
		}

		#else

		bool __libcpp_semaphore_init(__libcpp_semaphore_t* __sem, int __init)
		{
		return sem_init(__sem, 0, __init) == 0;
		}

		bool __libcpp_semaphore_destroy(__libcpp_semaphore_t* __sem)
		{
		return sem_destroy(__sem) == 0;
		}

		bool __libcpp_semaphore_post(__libcpp_semaphore_t* __sem)
		{
		return sem_post(__sem) == 0;
		}

		bool __libcpp_semaphore_wait(__libcpp_semaphore_t* __sem)
		{
		return sem_wait(__sem) == 0;
		}

		bool __libcpp_semaphore_wait_timed(__libcpp_semaphore_t* __sem, chrono::nanoseconds __ns)
		{
		__libcpp_timespec_t __ts = __libcpp_to_timespec(__ns, true);
		return sem_timedwait(__sem, &__ts) == 0;
		}

		#endif //__APPLE__

// Execute once		// Execute once
int __libcpp_execute_once(__libcpp_exec_once_flag *flag,		int __libcpp_execute_once(__libcpp_exec_once_flag *flag,
void (*init_routine)()) {		void (*init_routine)()) {
return pthread_once(flag, init_routine);		return pthread_once(flag, init_routine);
}		}

// Thread id		// Thread id
// Returns non-zero if the thread ids are equal, otherwise 0		// Returns non-zero if the thread ids are equal, otherwise 0
Show All 39 Lines	int __libcpp_thread_detach(__libcpp_thread_t *__t)
return pthread_detach(*__t);		return pthread_detach(*__t);
}		}

void __libcpp_thread_yield()		void __libcpp_thread_yield()
{		{
sched_yield();		sched_yield();
}		}

void __libcpp_thread_sleep_for(const chrono::nanoseconds& __ns)		void __libcpp_thread_yield_processor()
{		{
using namespace chrono;		#if defined(__aarch64__)
seconds __s = duration_cast<seconds>(__ns);		asm volatile ("yield" :::);
__libcpp_timespec_t __ts;		#elif defined(__x86_64__)
typedef decltype(__ts.tv_sec) ts_sec;		asm volatile ("pause" :::);
_LIBCPP_CONSTEXPR ts_sec __ts_sec_max = numeric_limits<ts_sec>::max();		#elif defined (__powerpc__)
		asm volatile ("or 27,27,27":::);
		#else
		;
		#endif
		}

if (__s.count() < __ts_sec_max)		void __libcpp_thread_sleep_for(const chrono::nanoseconds& __ns)
{		{
__ts.tv_sec = static_cast<ts_sec>(__s.count());		__libcpp_timespec_t __ts = __libcpp_to_timespec(__ns, false);
__ts.tv_nsec = static_cast<decltype(__ts.tv_nsec)>((__ns - __s).count());		while (nanosleep(&__ts, &__ts) == -1 && errno == EINTR);
}		}
else
		#define _LIBCPP_POLLING_COUNT 16

		template<class _Fn>
		bool __libcpp_thread_poll_with_backoff(_Fn && __f, chrono::nanoseconds __max)
{		{
__ts.tv_sec = __ts_sec_max;		chrono::high_resolution_clock::time_point const __start = chrono::high_resolution_clock::now();
__ts.tv_nsec = 999999999; // (10^9 - 1)		for(int __count = 0;;) {
		if(__f())
		return true;
		if(__count < _LIBCPP_POLLING_COUNT) {
		if(__count > (_LIBCPP_POLLING_COUNT >> 1))
		__libcpp_thread_yield_processor();
		__count += 1;
		continue;
		}
		chrono::high_resolution_clock::duration const __elapsed = chrono::high_resolution_clock::now() - __start;
		if(__max != chrono::nanoseconds::zero() &&
		__max < __elapsed)
		return false;
		chrono::nanoseconds const __step = __elapsed / 4;
		if(__step >= chrono::milliseconds(1))
		__libcpp_thread_sleep_for(chrono::milliseconds(1));
		else if(__step >= chrono::microseconds(10))
		__libcpp_thread_sleep_for(__step);
		else
		__libcpp_thread_yield();
}		}

while (nanosleep(&__ts, &__ts) == -1 && errno == EINTR);
}		}

// Thread local storage		// Thread local storage
int __libcpp_tls_create(__libcpp_tls_key __key, void (__at_exit)(void *))		int __libcpp_tls_create(__libcpp_tls_key __key, void (__at_exit)(void *))
{		{
return pthread_key_create(__key, __at_exit);		return pthread_key_create(__key, __at_exit);
}		}

void *__libcpp_tls_get(__libcpp_tls_key __key)		void *__libcpp_tls_get(__libcpp_tls_key __key)
{		{
return pthread_getspecific(__key);		return pthread_getspecific(__key);
}		}

int __libcpp_tls_set(__libcpp_tls_key __key, void *__p)		int __libcpp_tls_set(__libcpp_tls_key __key, void *__p)
{		{
return pthread_setspecific(__key, __p);		return pthread_setspecific(__key, __p);
		ldionneUnsubmitted Not Done Reply Inline Actions `static constexpr`? ldionne: `static constexpr`?
		__simt__AuthorUnsubmitted Done Reply Inline Actions Yes. __simt__: Yes.
}		}

#endif // !_LIBCPP_HAS_THREAD_LIBRARY_EXTERNAL \|\| _LIBCPP_BUILDING_THREAD_LIBRARY_EXTERNAL		#endif // !_LIBCPP_HAS_THREAD_LIBRARY_EXTERNAL \|\| _LIBCPP_BUILDING_THREAD_LIBRARY_EXTERNAL

		#if defined(__linux__) && !defined(_LIBCPP_HAS_NO_PLATFORM_WAIT)

		#define _LIBCPP_HAS_PLATFORM_WAIT

		typedef int __libcpp_platform_wait_t;

		template<typename _Tp>
		struct __libcpp_platform_wait_uses_type
		{
		enum { __value = is_same<typename remove_cv<_Tp>::type, __libcpp_platform_wait_t>::value };
		};

		template <class _Tp, typename enable_if<__libcpp_platform_wait_uses_type<_Tp>::__value, int>::type = 1>
		void __libcpp_platform_wait(_Tp const* ptr, _Tp val, void const* timeout)
		{
		syscall(SYS_futex, ptr, FUTEX_WAIT_PRIVATE, val, timeout, 0, 0);
		}

		template <class _Tp, typename enable_if<__libcpp_platform_wait_uses_type<_Tp>::__value, int>::type = 1>
		void __libcpp_platform_wake(_Tp const* ptr, bool all)
		{
		syscall(SYS_futex, ptr, FUTEX_WAKE_PRIVATE, all ? INT_MAX : 1, 0, 0, 0);
		}

		#endif // __linux__

		#if !defined(_LIBCPP_HAS_NO_TREE_BARRIER) && !defined(_LIBCPP_HAS_NO_THREAD_FAVORITE_BARRIER_INDEX)

		_LIBCPP_EXPORTED_FROM_ABI
		extern thread_local ptrdiff_t __libcpp_thread_favorite_barrier_index;

		#endif

class _LIBCPP_TYPE_VIS thread;		class _LIBCPP_TYPE_VIS thread;
class _LIBCPP_TYPE_VIS __thread_id;		class _LIBCPP_TYPE_VIS __thread_id;

namespace this_thread		namespace this_thread
{		{

_LIBCPP_INLINE_VISIBILITY __thread_id get_id() _NOEXCEPT;		_LIBCPP_INLINE_VISIBILITY __thread_id get_id() _NOEXCEPT;

▲ Show 20 Lines • Show All 79 Lines • Show Last 20 Lines

libcxx/include/atomic

Show First 20 Lines • Show All 434 Lines • ▼ Show 20 Lines
template <class Integral>		template <class Integral>
Integral		Integral
atomic_fetch_or_explicit(atomic<Integral>* obj, Integral op,		atomic_fetch_or_explicit(atomic<Integral>* obj, Integral op,
memory_order m) noexcept;		memory_order m) noexcept;
template <class Integral>		template <class Integral>
Integral		Integral
atomic_fetch_xor(volatile atomic<Integral>* obj, Integral op) noexcept;		atomic_fetch_xor(volatile atomic<Integral>* obj, Integral op) noexcept;

template <class Integral>		template <class Integral>
		ldionneUnsubmitted Not Done Reply Inline Actions We forgot to update the synopsis for this header with the C++20 Synchronization Library. @__simt__ would you be willing to do that in a followup change? ldionne: We forgot to update the synopsis for this header with the C++20 Synchronization Library.
		ldionneUnsubmitted Not Done Reply Inline Actions Ping @__simt__ ldionne: Ping @__simt__
		__simt__AuthorUnsubmitted Done Reply Inline Actions Yep. Thanks for the ping. __simt__: Yep. Thanks for the ping.
Integral		Integral
atomic_fetch_xor(atomic<Integral>* obj, Integral op) noexcept;		atomic_fetch_xor(atomic<Integral>* obj, Integral op) noexcept;

template <class Integral>		template <class Integral>
Integral		Integral
atomic_fetch_xor_explicit(volatile atomic<Integral>* obj, Integral op,		atomic_fetch_xor_explicit(volatile atomic<Integral>* obj, Integral op,
memory_order m) noexcept;		memory_order m) noexcept;
template <class Integral>		template <class Integral>
▲ Show 20 Lines • Show All 90 Lines • ▼ Show 20 Lines
void atomic_thread_fence(memory_order m) noexcept;		void atomic_thread_fence(memory_order m) noexcept;
void atomic_signal_fence(memory_order m) noexcept;		void atomic_signal_fence(memory_order m) noexcept;

} // std		} // std

*/		*/

#include <__config>		#include <__config>
		#include <__threading_support>
		ldionneUnsubmitted Not Done Reply Inline Actions Just wondering: will this in any way make it harder to support `<atomic>` in freestanding? ldionne: Just wondering: will this in any way make it harder to support `<atomic>` in freestanding?
		__simt__AuthorUnsubmitted Done Reply Inline Actions A bit, yeah. There are a few ways to proceed. __simt__: A bit, yeah. There are a few ways to proceed.
#include <cstddef>		#include <cstddef>
#include <cstdint>		#include <cstdint>
		#include <cstring>
#include <type_traits>		#include <type_traits>
#include <version>		#include <version>

#if !defined(_LIBCPP_HAS_NO_PRAGMA_SYSTEM_HEADER)		#if !defined(_LIBCPP_HAS_NO_PRAGMA_SYSTEM_HEADER)
#pragma GCC system_header		#pragma GCC system_header
#endif		#endif

#ifdef _LIBCPP_HAS_NO_THREADS		#ifdef _LIBCPP_HAS_NO_THREADS
▲ Show 20 Lines • Show All 64 Lines • ▼ Show 20 Lines	typedef enum memory_order {
memory_order_acquire = __mo_acquire,		memory_order_acquire = __mo_acquire,
memory_order_release = __mo_release,		memory_order_release = __mo_release,
memory_order_acq_rel = __mo_acq_rel,		memory_order_acq_rel = __mo_acq_rel,
memory_order_seq_cst = __mo_seq_cst,		memory_order_seq_cst = __mo_seq_cst,
} memory_order;		} memory_order;

#endif // _LIBCPP_STD_VER > 17		#endif // _LIBCPP_STD_VER > 17

		template <typename _Tp> _LIBCPP_INLINE_VISIBILITY
		bool __cxx_nonatomic_compare_equal(_Tp const& __lhs, _Tp const& __rhs) {
		return memcmp(&__lhs, &__rhs, sizeof(_Tp)) == 0;
		}

static_assert((is_same<underlying_type<memory_order>::type, __memory_order_underlying_t>::value),		static_assert((is_same<underlying_type<memory_order>::type, __memory_order_underlying_t>::value),
"unexpected underlying type for std::memory_order");		"unexpected underlying type for std::memory_order");

#if defined(_LIBCPP_HAS_GCC_ATOMIC_IMP) \|\| \		#if defined(_LIBCPP_HAS_GCC_ATOMIC_IMP) \|\| \
defined(_LIBCPP_ATOMIC_ONLY_USE_BUILTINS)		defined(_LIBCPP_ATOMIC_ONLY_USE_BUILTINS)

// [atomics.types.generic]p1 guarantees _Tp is trivially copyable. Because		// [atomics.types.generic]p1 guarantees _Tp is trivially copyable. Because
// the default operator= in an object is not volatile, a byte-by-byte copy		// the default operator= in an object is not volatile, a byte-by-byte copy
// is required.		// is required.
template <typename _Tp, typename _Tv> _LIBCPP_INLINE_VISIBILITY		template <typename _Tp, typename _Tv> _LIBCPP_INLINE_VISIBILITY
typename enable_if<is_assignable<_Tp&, _Tv>::value>::type		typename enable_if<is_assignable<_Tp&, _Tv>::value>::type
__cxx_atomic_assign_volatile(_Tp& __a_value, _Tv const& __val) {		__cxx_atomic_assign_volatile(_Tp& __a_value, _Tv const& __val) {
__a_value = __val;		__a_value = __val;
▲ Show 20 Lines • Show All 429 Lines • ▼ Show 20 Lines	_Tp __cxx_atomic_fetch_xor(__cxx_atomic_base_impl<_Tp> volatile* __a, _Tp __pattern, memory_order __order) _NOEXCEPT {
return __c11_atomic_fetch_xor(&__a->__a_value, __pattern, static_cast<__memory_order_underlying_t>(__order));		return __c11_atomic_fetch_xor(&__a->__a_value, __pattern, static_cast<__memory_order_underlying_t>(__order));
}		}
template<class _Tp>		template<class _Tp>
_LIBCPP_INLINE_VISIBILITY		_LIBCPP_INLINE_VISIBILITY
_Tp __cxx_atomic_fetch_xor(__cxx_atomic_base_impl<_Tp> * __a, _Tp __pattern, memory_order __order) _NOEXCEPT {		_Tp __cxx_atomic_fetch_xor(__cxx_atomic_base_impl<_Tp> * __a, _Tp __pattern, memory_order __order) _NOEXCEPT {
return __c11_atomic_fetch_xor(&__a->__a_value, __pattern, static_cast<__memory_order_underlying_t>(__order));		return __c11_atomic_fetch_xor(&__a->__a_value, __pattern, static_cast<__memory_order_underlying_t>(__order));
}		}

#endif // _LIBCPP_HAS_GCC_ATOMIC_IMP, _LIBCPP_HAS_C_ATOMIC_IMP		#endif // _LIBCPP_HAS_GCC_ATOMIC_IMP, _LIBCPP_HAS_C_ATOMIC_IMP

template <class _Tp>		template <class _Tp>
_LIBCPP_INLINE_VISIBILITY		_LIBCPP_INLINE_VISIBILITY
_Tp kill_dependency(_Tp __y) _NOEXCEPT		_Tp kill_dependency(_Tp __y) _NOEXCEPT
{		{
return __y;		return __y;
}		}

▲ Show 20 Lines • Show All 122 Lines • ▼ Show 20 Lines	_Tp __cxx_atomic_exchange(__cxx_atomic_lock_impl<_Tp>* __a, _Tp __value, memory_order) {
return __old;		return __old;
}		}

template <typename _Tp>		template <typename _Tp>
_LIBCPP_INLINE_VISIBILITY		_LIBCPP_INLINE_VISIBILITY
bool __cxx_atomic_compare_exchange_strong(volatile __cxx_atomic_lock_impl<_Tp>* __a,		bool __cxx_atomic_compare_exchange_strong(volatile __cxx_atomic_lock_impl<_Tp>* __a,
_Tp* __expected, _Tp __value, memory_order, memory_order) {		_Tp* __expected, _Tp __value, memory_order, memory_order) {
__a->__lock();		__a->__lock();
_Tp temp;		_Tp __temp;
__cxx_atomic_assign_volatile(temp, __a->__a_value);		__cxx_atomic_assign_volatile(__temp, __a->__a_value);
bool __ret = temp == *__expected;		bool __ret = __temp == *__expected;
if(__ret)		if(__ret)
__cxx_atomic_assign_volatile(__a->__a_value, __value);		__cxx_atomic_assign_volatile(__a->__a_value, __value);
else		else
__cxx_atomic_assign_volatile(*__expected, __a->__a_value);		__cxx_atomic_assign_volatile(*__expected, __a->__a_value);
__a->__unlock();		__a->__unlock();
return __ret;		return __ret;
}		}
template <typename _Tp>		template <typename _Tp>
Show All 10 Lines	bool __cxx_atomic_compare_exchange_strong(__cxx_atomic_lock_impl<_Tp>* __a,
return __ret;		return __ret;
}		}

template <typename _Tp>		template <typename _Tp>
_LIBCPP_INLINE_VISIBILITY		_LIBCPP_INLINE_VISIBILITY
bool __cxx_atomic_compare_exchange_weak(volatile __cxx_atomic_lock_impl<_Tp>* __a,		bool __cxx_atomic_compare_exchange_weak(volatile __cxx_atomic_lock_impl<_Tp>* __a,
_Tp* __expected, _Tp __value, memory_order, memory_order) {		_Tp* __expected, _Tp __value, memory_order, memory_order) {
__a->__lock();		__a->__lock();
_Tp temp;		_Tp __temp;
__cxx_atomic_assign_volatile(temp, __a->__a_value);		__cxx_atomic_assign_volatile(__temp, __a->__a_value);
bool __ret = temp == *__expected;		bool __ret = __temp == *__expected;
if(__ret)		if(__ret)
__cxx_atomic_assign_volatile(__a->__a_value, __value);		__cxx_atomic_assign_volatile(__a->__a_value, __value);
else		else
__cxx_atomic_assign_volatile(*__expected, __a->__a_value);		__cxx_atomic_assign_volatile(*__expected, __a->__a_value);
__a->__unlock();		__a->__unlock();
return __ret;		return __ret;
}		}
template <typename _Tp>		template <typename _Tp>
▲ Show 20 Lines • Show All 41 Lines • ▼ Show 20 Lines	_Tp* __cxx_atomic_fetch_add(volatile __cxx_atomic_lock_impl<_Tp> __a,
__cxx_atomic_assign_volatile(__old, __a->__a_value);		__cxx_atomic_assign_volatile(__old, __a->__a_value);
__cxx_atomic_assign_volatile(__a->__a_value, __old + __delta);		__cxx_atomic_assign_volatile(__a->__a_value, __old + __delta);
__a->__unlock();		__a->__unlock();
return __old;		return __old;
}		}
template <typename _Tp, typename _Td>		template <typename _Tp, typename _Td>
_LIBCPP_INLINE_VISIBILITY		_LIBCPP_INLINE_VISIBILITY
_Tp* __cxx_atomic_fetch_add(__cxx_atomic_lock_impl<_Tp> __a,		_Tp* __cxx_atomic_fetch_add(__cxx_atomic_lock_impl<_Tp> __a,
ptrdiff_t __delta, memory_order) {		ptrdiff_t __delta, memory_order) {
__a->__lock();		__a->__lock();
_Tp* __old = __a->__a_value;		_Tp* __old = __a->__a_value;
__a->__a_value += __delta;		__a->__a_value += __delta;
__a->__unlock();		__a->__unlock();
return __old;		return __old;
}		}

template <typename _Tp, typename _Td>		template <typename _Tp, typename _Td>
▲ Show 20 Lines • Show All 124 Lines • ▼ Show 20 Lines
struct __cxx_atomic_impl : public _Base {		struct __cxx_atomic_impl : public _Base {

#if _GNUC_VER >= 501		#if _GNUC_VER >= 501
static_assert(is_trivially_copyable<_Tp>::value,		static_assert(is_trivially_copyable<_Tp>::value,
"std::atomic<Tp> requires that 'Tp' be a trivially copyable type");		"std::atomic<Tp> requires that 'Tp' be a trivially copyable type");
#endif		#endif

_LIBCPP_INLINE_VISIBILITY __cxx_atomic_impl() _NOEXCEPT _LIBCPP_DEFAULT		_LIBCPP_INLINE_VISIBILITY __cxx_atomic_impl() _NOEXCEPT _LIBCPP_DEFAULT
_LIBCPP_INLINE_VISIBILITY _LIBCPP_CONSTEXPR explicit __cxx_atomic_impl(_Tp value) _NOEXCEPT		_LIBCPP_INLINE_VISIBILITY _LIBCPP_CONSTEXPR __cxx_atomic_impl(_Tp value) _NOEXCEPT
: _Base(value) {}		: _Base(value) {}
};		};

		#ifndef _LIBCPP_HAS_NO_THREAD_CONTENTION_STATE

		#ifdef _LIBCPP_HAS_PLATFORM_WAIT

		ldionneUnsubmitted Not Done Reply Inline Actions I'm not seeing this used anywhere -- am I missing something? ldionne: I'm not seeing this used anywhere -- am I missing something?
		__simt__AuthorUnsubmitted Done Reply Inline Actions I intend to use that to implement the normative encouragement that atomic_signed/unsigned[...]_t should be the efficient ones for waiting. I added those types below, but they don’t follow that encouragement. __simt__: I intend to use that to implement the normative encouragement that atomic_signed/unsigned[...
		struct __libcpp_contention_t
		{
		__cxx_atomic_impl<ptrdiff_t> __waiters = {0};
		__cxx_atomic_impl<__libcpp_platform_wait_t> __version = {0};
		};

		template <class _Tp, typename enable_if<__libcpp_platform_wait_uses_type<_Tp>::__value, int>::type = 1>
		_LIBCPP_INLINE_VISIBILITY void __cxx_atomic_notify_all(__cxx_atomic_impl<_Tp> const volatile* __a,
		__libcpp_contention_t* __c) {
		__cxx_atomic_thread_fence(memory_order_seq_cst);
		if (0 != __cxx_atomic_exchange(&__c->__waiters, (ptrdiff_t)0, memory_order_relaxed))
		__libcpp_platform_wake((_Tp*)__a, true);
		zoecarverUnsubmitted Done Reply Inline Actions I think this can be a `static_cast`. zoecarver: I think this can be a `static_cast`.
		__simt__AuthorUnsubmitted Done Reply Inline Actions Actually it can and should go away now. I'll remove the cast. __simt__: Actually it can and should go away now. I'll remove the cast.
		}
		template <class _Tp, typename enable_if<!__libcpp_platform_wait_uses_type<_Tp>::__value, int>::type = 1>
		_LIBCPP_INLINE_VISIBILITY void __cxx_atomic_notify_all(__cxx_atomic_impl<_Tp> const volatile*,
		__libcpp_contention_t* __c) {
		__cxx_atomic_fetch_add(&__c->__version, (__libcpp_platform_wait_t)1, memory_order_release);
		__cxx_atomic_thread_fence(memory_order_seq_cst);
		if (0 != __cxx_atomic_exchange(&__c->__waiters, (ptrdiff_t)0, memory_order_relaxed))
		__libcpp_platform_wake(&__c->__version.__a_value, true);
		}
		template <class _Tp, typename enable_if<__libcpp_platform_wait_uses_type<_Tp>::__value, int>::type = 1>
		_LIBCPP_INLINE_VISIBILITY void __cxx_atomic_notify_one(__cxx_atomic_impl<_Tp> const volatile* __a,
		__libcpp_contention_t* __c) {
		__cxx_atomic_thread_fence(memory_order_seq_cst);
		if (0 != __cxx_atomic_load(&__c->__waiters, memory_order_relaxed))
		__libcpp_platform_wake((_Tp*)__a, false);
		}
		template <class _Tp, typename enable_if<!__libcpp_platform_wait_uses_type<_Tp>::__value, int>::type = 1>
		_LIBCPP_INLINE_VISIBILITY void __cxx_atomic_notify_one(__cxx_atomic_impl<_Tp> const volatile* __a,
		__libcpp_contention_t* __c) {
		__cxx_atomic_notify_all(__a, __c);
		}
		template <class _Tp, typename enable_if<__libcpp_platform_wait_uses_type<_Tp>::__value, int>::type = 1>
		_LIBCPP_INLINE_VISIBILITY void __cxx_atomic_try_wait_slow(__cxx_atomic_impl<_Tp> const volatile* __a, _Tp __val, memory_order __order,
		__libcpp_contention_t* __c) {
		__cxx_atomic_store(&__c->__waiters, (ptrdiff_t)1, memory_order_relaxed);
		__cxx_atomic_thread_fence(memory_order_seq_cst);
		__libcpp_platform_wait((_Tp*)__a, __val, nullptr);
		}
		template <class _Tp, typename enable_if<!__libcpp_platform_wait_uses_type<_Tp>::__value, int>::type = 1>
		_LIBCPP_INLINE_VISIBILITY void __cxx_atomic_try_wait_slow(__cxx_atomic_impl<_Tp> const volatile* __a, _Tp const __val, memory_order __order,
		__libcpp_contention_t* __c) {
		__cxx_atomic_store(&__c->__waiters, (ptrdiff_t)1, memory_order_relaxed);
		__cxx_atomic_thread_fence(memory_order_seq_cst);
		auto const __version = __cxx_atomic_load(&__c->__version, memory_order_acquire);
		if (!__cxx_nonatomic_compare_equal(__cxx_atomic_load(__a, __order), __val))
		return;
		constexpr timespec __timeout = { 2, 0 }; // Hedge on rare 'int version' aliasing.
		__libcpp_platform_wait(&__c->__version.__a_value, __version, &__timeout);
		}

		#else // _LIBCPP_HAS_PLATFORM_WAIT

		struct __libcpp_contention_t
		{
		__cxx_atomic_impl<ptrdiff_t> __credit = {0};
		__libcpp_mutex_t __mutex = _LIBCPP_MUTEX_INITIALIZER;
		__libcpp_condvar_t __condvar = _LIBCPP_CONDVAR_INITIALIZER;
		};

		template <class _Tp>
		_LIBCPP_INLINE_VISIBILITY void __cxx_atomic_notify_all(__cxx_atomic_impl<_Tp> const volatile*,
		__libcpp_contention_t* __c) {
		__cxx_atomic_thread_fence(memory_order_seq_cst);
		if(0 == __cxx_atomic_load(&__c->__credit, memory_order_relaxed))
		return;
		if(0 != __cxx_atomic_exchange(&__c->__credit, (ptrdiff_t)0, memory_order_relaxed)) {
		zoecarverUnsubmitted Done Reply Inline Actions Might be wrong but, can't this be a CAS (because you are both comparing and swapping `0`)? zoecarver: Might be wrong but, can't this be a CAS (because you are both comparing and swapping `0`)?
		__simt__AuthorUnsubmitted Done Reply Inline Actions It's not conditionally exchanging, it's exchanging unconditionally and then conditionally taking a computation step. __simt__: It's not conditionally exchanging, it's exchanging unconditionally and then conditionally…
		__libcpp_mutex_lock(&__c->__mutex);
		__libcpp_mutex_unlock(&__c->__mutex);
		zoecarverUnsubmitted Done Reply Inline Actions Why do you lock, then immediately unlock the mutex here? zoecarver: Why do you lock, then immediately unlock the mutex here?
		__simt__AuthorUnsubmitted Done Reply Inline Actions I answered that one here: https://reviews.llvm.org/D68480?id=223282#inline-618652 __simt__: I answered that one here: https://reviews.llvm.org/D68480?id=223282#inline-618652
		__libcpp_condvar_broadcast(&__c->__condvar);
		}
		}
		template <class _Tp>
		_LIBCPP_INLINE_VISIBILITY void __cxx_atomic_notify_one(__cxx_atomic_impl<_Tp> const volatile* __a,
		__libcpp_contention_t* __c) {
		__cxx_atomic_notify_all(__a, __c);
		}
		template <class _Tp>
		_LIBCPP_INLINE_VISIBILITY void __cxx_atomic_try_wait_slow(__cxx_atomic_impl<_Tp> const volatile* __a, _Tp const __val, memory_order __order,
		__libcpp_contention_t* __c) {
		__libcpp_mutex_lock(&__c->__mutex);
		__cxx_atomic_store(&__c->__credit, (ptrdiff_t)1, memory_order_relaxed);
		__cxx_atomic_thread_fence(memory_order_seq_cst);
		if (__cxx_nonatomic_compare_equal(__cxx_atomic_load(__a, __order), __val))
		__libcpp_condvar_wait(&__c->__condvar, &__c->__mutex);
		__libcpp_mutex_unlock(&__c->__mutex);
		}

		#endif // _LIBCPP_HAS_PLATFORM_WAIT

		template <class _Tp>
		_LIBCPP_INLINE_VISIBILITY void __cxx_atomic_wait(__cxx_atomic_impl<_Tp> const volatile* __a, _Tp const __val, memory_order __order,
		__libcpp_contention_t* __c) {
		for(int __i = 0; __i < _LIBCPP_POLLING_COUNT; ++__i) {
		if(!__cxx_nonatomic_compare_equal(__cxx_atomic_load(__a, __order), __val))
		return;
		if(__i < 12)
		zoecarverUnsubmitted Done Reply Inline Actions Is this another "magic" number? If so, can it be a macro too? zoecarver: Is this another "magic" number? If so, can it be a macro too?
		__simt__AuthorUnsubmitted Done Reply Inline Actions Sure. __simt__: Sure.
		__libcpp_thread_yield_processor();
		else
		__libcpp_thread_yield();
		}
		while(__cxx_nonatomic_compare_equal(__cxx_atomic_load(__a, __order), __val))
		__cxx_atomic_try_wait_slow(__a, __val, __order, __c);
		}

		#ifndef _LIBCPP_HAS_NO_THREAD_CONTENTION_TABLE

		_LIBCPP_FUNC_VIS
		__libcpp_contention_t * __libcpp_contention_state(void const volatile * __p) _NOEXCEPT;

		template <class _Tp>
		_LIBCPP_INLINE_VISIBILITY void __cxx_atomic_notify_all(__cxx_atomic_impl<_Tp> const volatile* __a)
		{
		__cxx_atomic_notify_all(__a, __libcpp_contention_state(__a));
		}
		template <class _Tp>
		_LIBCPP_INLINE_VISIBILITY void __cxx_atomic_notify_one(__cxx_atomic_impl<_Tp> const volatile* __a)
		{
		__cxx_atomic_notify_one(__a, __libcpp_contention_state(__a));
		}
		template <class _Tp>
		_LIBCPP_INLINE_VISIBILITY void __cxx_atomic_wait(__cxx_atomic_impl<_Tp> const volatile* __a, _Tp const __val, memory_order __order)
		{
		__cxx_atomic_wait(__a, __val, __order, __libcpp_contention_state(__a));
		}

		#endif // _LIBCPP_HAS_NO_THREAD_CONTENTION_TABLE

		#endif // _LIBCPP_HAS_NO_THREAD_CONTENTION_STATE

		#if defined(_LIBCPP_HAS_NO_THREAD_CONTENTION_STATE) \|\| defined(_LIBCPP_HAS_NO_THREAD_CONTENTION_TABLE)

		template <class _Tp>
		_LIBCPP_INLINE_VISIBILITY void __cxx_atomic_notify_all(__cxx_atomic_impl<_Tp> const volatile* __a)
		{
		}
		template <class _Tp>
		_LIBCPP_INLINE_VISIBILITY void __cxx_atomic_notify_one(__cxx_atomic_impl<_Tp> const volatile* __a)
		{
		}
		template <class _Tp>
		_LIBCPP_INLINE_VISIBILITY void __cxx_atomic_wait(__cxx_atomic_impl<_Tp> const volatile* __a, _Tp const __val, memory_order __order)
		{
		__libcpp_thread_poll_with_backoff([=] _LIBCPP_INLINE_VISIBILITY () -> bool {
		return !__cxx_nonatomic_compare_equal(__cxx_atomic_load(__a, __order), __val);
		});
		}

		#endif

// general atomic<T>		// general atomic<T>

template <class _Tp, bool = is_integral<_Tp>::value && !is_same<_Tp, bool>::value>		template <class _Tp, bool = is_integral<_Tp>::value && !is_same<_Tp, bool>::value>
struct __atomic_base // false		struct __atomic_base // false
{		{
mutable __cxx_atomic_impl<_Tp> __a_;		mutable __cxx_atomic_impl<_Tp> __a_;

#if defined(__cpp_lib_atomic_is_always_lock_free)		#if defined(__cpp_lib_atomic_is_always_lock_free)
▲ Show 20 Lines • Show All 63 Lines • ▼ Show 20 Lines	#endif
_LIBCPP_INLINE_VISIBILITY		_LIBCPP_INLINE_VISIBILITY
bool compare_exchange_strong(_Tp& __e, _Tp __d,		bool compare_exchange_strong(_Tp& __e, _Tp __d,
memory_order __m = memory_order_seq_cst) volatile _NOEXCEPT		memory_order __m = memory_order_seq_cst) volatile _NOEXCEPT
{return __cxx_atomic_compare_exchange_strong(&__a_, &__e, __d, __m, __m);}		{return __cxx_atomic_compare_exchange_strong(&__a_, &__e, __d, __m, __m);}
_LIBCPP_INLINE_VISIBILITY		_LIBCPP_INLINE_VISIBILITY
bool compare_exchange_strong(_Tp& __e, _Tp __d,		bool compare_exchange_strong(_Tp& __e, _Tp __d,
memory_order __m = memory_order_seq_cst) _NOEXCEPT		memory_order __m = memory_order_seq_cst) _NOEXCEPT
{return __cxx_atomic_compare_exchange_strong(&__a_, &__e, __d, __m, __m);}		{return __cxx_atomic_compare_exchange_strong(&__a_, &__e, __d, __m, __m);}

		_LIBCPP_INLINE_VISIBILITY void wait(_Tp __v, memory_order __m = memory_order_seq_cst) const volatile _NOEXCEPT
		zoecarverUnsubmitted Done Reply Inline Actions You can use `memory_order::seq_cst` here. zoecarver: You can use `memory_order::seq_cst` here.
		__simt__AuthorUnsubmitted Done Reply Inline Actions Pending other comment's resolution. __simt__: Pending other comment's resolution.
		{__cxx_atomic_wait(&__a_, __v, __m);}
		_LIBCPP_INLINE_VISIBILITY void wait(_Tp __v, memory_order __m = memory_order_seq_cst) const _NOEXCEPT
		{__cxx_atomic_wait(&__a_, __v, __m);}
		_LIBCPP_INLINE_VISIBILITY void notify_one() volatile _NOEXCEPT
		{__cxx_atomic_notify_one(&__a_);}
		_LIBCPP_INLINE_VISIBILITY void notify_one() _NOEXCEPT
		{__cxx_atomic_notify_one(&__a_);}
		_LIBCPP_INLINE_VISIBILITY void notify_all() volatile _NOEXCEPT
		{__cxx_atomic_notify_all(&__a_);}
		_LIBCPP_INLINE_VISIBILITY void notify_all() _NOEXCEPT
		{__cxx_atomic_notify_all(&__a_);}

		zoecarverUnsubmitted Done Reply Inline Actions This should only be defined after C++17. zoecarver: This should only be defined after C++17.
		__simt__AuthorUnsubmitted Done Reply Inline Actions That would be unfortunate, I think, <atomic> in general tries to offer its functionality in back versions. Also, the CUDA port definitely doesn't want to be tied to '17 for quite some time. Can we leave it on in all dialects that <atomic> supports, like the rest? __simt__: That would be unfortunate, I think, <atomic> in general tries to offer its functionality in…
_LIBCPP_INLINE_VISIBILITY		_LIBCPP_INLINE_VISIBILITY
__atomic_base() _NOEXCEPT _LIBCPP_DEFAULT		__atomic_base() _NOEXCEPT _LIBCPP_DEFAULT

_LIBCPP_INLINE_VISIBILITY _LIBCPP_CONSTEXPR		_LIBCPP_INLINE_VISIBILITY _LIBCPP_CONSTEXPR
__atomic_base(_Tp __d) _NOEXCEPT : __a_(__d) {}		__atomic_base(_Tp __d) _NOEXCEPT : __a_(__d) {}

#ifndef _LIBCPP_CXX03_LANG		#ifndef _LIBCPP_CXX03_LANG
__atomic_base(const __atomic_base&) = delete;		__atomic_base(const __atomic_base&) = delete;
▲ Show 20 Lines • Show All 95 Lines • ▼ Show 20 Lines

// atomic<T>		// atomic<T>

template <class _Tp>		template <class _Tp>
struct atomic		struct atomic
: public __atomic_base<_Tp>		: public __atomic_base<_Tp>
{		{
typedef __atomic_base<_Tp> __base;		typedef __atomic_base<_Tp> __base;
		using value_type = _Tp;
		zoecarverUnsubmitted Done Reply Inline Actions Is this available in C++03? If so, use `typedef`. Actually, do we support any compilers that don't support `using` type aliases? zoecarver: Is this available in C++03? If so, use `typedef`. Actually, do we support any compilers that…
		__simt__AuthorUnsubmitted Done Reply Inline Actions Will do. __simt__: Will do.
		ldionneUnsubmitted Not Done Reply Inline Actions Can we add a test for this? ldionne: Can we add a test for this?
		__simt__AuthorUnsubmitted Done Reply Inline Actions Yes, I can do that. __simt__: Yes, I can do that.
_LIBCPP_INLINE_VISIBILITY		_LIBCPP_INLINE_VISIBILITY
atomic() _NOEXCEPT _LIBCPP_DEFAULT		atomic() _NOEXCEPT _LIBCPP_DEFAULT
_LIBCPP_INLINE_VISIBILITY		_LIBCPP_INLINE_VISIBILITY
_LIBCPP_CONSTEXPR atomic(_Tp __d) _NOEXCEPT : __base(__d) {}		_LIBCPP_CONSTEXPR atomic(_Tp __d) _NOEXCEPT : __base(__d) {}

_LIBCPP_INLINE_VISIBILITY		_LIBCPP_INLINE_VISIBILITY
_Tp operator=(_Tp __d) volatile _NOEXCEPT		_Tp operator=(_Tp __d) volatile _NOEXCEPT
{__base::store(__d); return __d;}		{__base::store(__d); return __d;}
_LIBCPP_INLINE_VISIBILITY		_LIBCPP_INLINE_VISIBILITY
_Tp operator=(_Tp __d) _NOEXCEPT		_Tp operator=(_Tp __d) _NOEXCEPT
{__base::store(__d); return __d;}		{__base::store(__d); return __d;}
};		};

// atomic<T*>		// atomic<T*>

template <class _Tp>		template <class _Tp>
struct atomic<_Tp*>		struct atomic<_Tp*>
: public __atomic_base<_Tp*>		: public __atomic_base<_Tp*>
{		{
typedef __atomic_base<_Tp*> __base;		typedef __atomic_base<_Tp*> __base;
		using value_type = _Tp*;
		ldionneUnsubmitted Not Done Reply Inline Actions This too! ldionne: This too!
		__simt__AuthorUnsubmitted Done Reply Inline Actions Ack. __simt__: Ack.
_LIBCPP_INLINE_VISIBILITY		_LIBCPP_INLINE_VISIBILITY
atomic() _NOEXCEPT _LIBCPP_DEFAULT		atomic() _NOEXCEPT _LIBCPP_DEFAULT
_LIBCPP_INLINE_VISIBILITY		_LIBCPP_INLINE_VISIBILITY
_LIBCPP_CONSTEXPR atomic(_Tp* __d) _NOEXCEPT : __base(__d) {}		_LIBCPP_CONSTEXPR atomic(_Tp* __d) _NOEXCEPT : __base(__d) {}

_LIBCPP_INLINE_VISIBILITY		_LIBCPP_INLINE_VISIBILITY
_Tp* operator=(_Tp* __d) volatile _NOEXCEPT		_Tp* operator=(_Tp* __d) volatile _NOEXCEPT
{__base::store(__d); return __d;}		{__base::store(__d); return __d;}
▲ Show 20 Lines • Show All 268 Lines • ▼ Show 20 Lines
atomic_compare_exchange_strong_explicit(atomic<_Tp>* __o, _Tp* __e,		atomic_compare_exchange_strong_explicit(atomic<_Tp>* __o, _Tp* __e,
_Tp __d,		_Tp __d,
memory_order __s, memory_order __f) _NOEXCEPT		memory_order __s, memory_order __f) _NOEXCEPT
_LIBCPP_CHECK_EXCHANGE_MEMORY_ORDER(__s, __f)		_LIBCPP_CHECK_EXCHANGE_MEMORY_ORDER(__s, __f)
{		{
return __o->compare_exchange_strong(*__e, __d, __s, __f);		return __o->compare_exchange_strong(*__e, __d, __s, __f);
}		}

		// atomic_wait

		template <class _Tp>
		_LIBCPP_INLINE_VISIBILITY
		void atomic_wait(const volatile atomic<_Tp>* __o,
		typename atomic<_Tp>::value_type __v) _NOEXCEPT
		{
		return __o->wait(__v);
		}

		template <class _Tp>
		_LIBCPP_INLINE_VISIBILITY
		void atomic_wait(const atomic<_Tp>* __o,
		typename atomic<_Tp>::value_type __v) _NOEXCEPT
		{
		return __o->wait(__v);
		}

		// atomic_wait_explicit

		template <class _Tp>
		_LIBCPP_INLINE_VISIBILITY
		void atomic_wait_explicit(const volatile atomic<_Tp>* __o,
		typename atomic<_Tp>::value_type __v,
		memory_order __m) _NOEXCEPT
		_LIBCPP_CHECK_LOAD_MEMORY_ORDER(__m)
		{
		return __o->wait(__v, __m);
		}

		template <class _Tp>
		_LIBCPP_INLINE_VISIBILITY
		void atomic_wait_explicit(const atomic<_Tp>* __o,
		typename atomic<_Tp>::value_type __v,
		memory_order __m) _NOEXCEPT
		_LIBCPP_CHECK_LOAD_MEMORY_ORDER(__m)
		{
		return __o->wait(__v, __m);
		}

		// atomic_notify_one

		template <class _Tp>
		_LIBCPP_INLINE_VISIBILITY
		void atomic_notify_one(volatile atomic<_Tp>* __o) _NOEXCEPT
		{
		__o->notify_one();
		}
		template <class _Tp>
		_LIBCPP_INLINE_VISIBILITY
		void atomic_notify_one(atomic<_Tp>* __o) _NOEXCEPT
		{
		__o->notify_one();
		}

		// atomic_notify_one

		template <class _Tp>
		_LIBCPP_INLINE_VISIBILITY
		void atomic_notify_all(volatile atomic<_Tp>* __o) _NOEXCEPT
		{
		__o->notify_all();
		}
		template <class _Tp>
		_LIBCPP_INLINE_VISIBILITY
		void atomic_notify_all(atomic<_Tp>* __o) _NOEXCEPT
		{
		__o->notify_all();
		}

// atomic_fetch_add		// atomic_fetch_add

template <class _Tp>		template <class _Tp>
_LIBCPP_INLINE_VISIBILITY		_LIBCPP_INLINE_VISIBILITY
typename enable_if		typename enable_if
<		<
is_integral<_Tp>::value && !is_same<_Tp, bool>::value,		is_integral<_Tp>::value && !is_same<_Tp, bool>::value,
_Tp		_Tp
▲ Show 20 Lines • Show All 317 Lines • ▼ Show 20 Lines

// flag type and operations		// flag type and operations

typedef struct atomic_flag		typedef struct atomic_flag
{		{
__cxx_atomic_impl<_LIBCPP_ATOMIC_FLAG_TYPE> __a_;		__cxx_atomic_impl<_LIBCPP_ATOMIC_FLAG_TYPE> __a_;

_LIBCPP_INLINE_VISIBILITY		_LIBCPP_INLINE_VISIBILITY
		bool test(memory_order __m = memory_order_seq_cst) const volatile _NOEXCEPT
		zoecarverUnsubmitted Done Reply Inline Actions I might just be missing it but, is this in the standard? Otherwise, could you mangle it? zoecarver: I might just be missing it but, is this in the standard? Otherwise, could you mangle it?
		__simt__AuthorUnsubmitted Done Reply Inline Actions It is, in C++20. __simt__: It is, in C++20.
		{return _LIBCPP_ATOMIC_FLAG_TYPE(true)==__cxx_atomic_load(&__a_, __m);}
		zoecarverUnsubmitted Done Reply Inline Actions nit: space between the equals. zoecarver: nit: space between the equals.
		__simt__AuthorUnsubmitted Done Reply Inline Actions OK. __simt__: OK.
		_LIBCPP_INLINE_VISIBILITY
		bool test(memory_order __m = memory_order_seq_cst) const _NOEXCEPT
		{return _LIBCPP_ATOMIC_FLAG_TYPE(true)==__cxx_atomic_load(&__a_, __m);}

		_LIBCPP_INLINE_VISIBILITY
bool test_and_set(memory_order __m = memory_order_seq_cst) volatile _NOEXCEPT		bool test_and_set(memory_order __m = memory_order_seq_cst) volatile _NOEXCEPT
{return __cxx_atomic_exchange(&__a_, _LIBCPP_ATOMIC_FLAG_TYPE(true), __m);}		{return __cxx_atomic_exchange(&__a_, _LIBCPP_ATOMIC_FLAG_TYPE(true), __m);}
_LIBCPP_INLINE_VISIBILITY		_LIBCPP_INLINE_VISIBILITY
bool test_and_set(memory_order __m = memory_order_seq_cst) _NOEXCEPT		bool test_and_set(memory_order __m = memory_order_seq_cst) _NOEXCEPT
{return __cxx_atomic_exchange(&__a_, _LIBCPP_ATOMIC_FLAG_TYPE(true), __m);}		{return __cxx_atomic_exchange(&__a_, _LIBCPP_ATOMIC_FLAG_TYPE(true), __m);}
_LIBCPP_INLINE_VISIBILITY		_LIBCPP_INLINE_VISIBILITY
void clear(memory_order __m = memory_order_seq_cst) volatile _NOEXCEPT		void clear(memory_order __m = memory_order_seq_cst) volatile _NOEXCEPT
{__cxx_atomic_store(&__a_, _LIBCPP_ATOMIC_FLAG_TYPE(false), __m);}		{__cxx_atomic_store(&__a_, _LIBCPP_ATOMIC_FLAG_TYPE(false), __m);}
_LIBCPP_INLINE_VISIBILITY		_LIBCPP_INLINE_VISIBILITY
void clear(memory_order __m = memory_order_seq_cst) _NOEXCEPT		void clear(memory_order __m = memory_order_seq_cst) _NOEXCEPT
{__cxx_atomic_store(&__a_, _LIBCPP_ATOMIC_FLAG_TYPE(false), __m);}		{__cxx_atomic_store(&__a_, _LIBCPP_ATOMIC_FLAG_TYPE(false), __m);}

_LIBCPP_INLINE_VISIBILITY		_LIBCPP_INLINE_VISIBILITY
		void wait(bool __v, memory_order __m = memory_order_seq_cst) const volatile _NOEXCEPT
		{__cxx_atomic_wait(&__a_, _LIBCPP_ATOMIC_FLAG_TYPE(__v), __m);}
		_LIBCPP_INLINE_VISIBILITY
		void wait(bool __v, memory_order __m = memory_order_seq_cst) const _NOEXCEPT
		{__cxx_atomic_wait(&__a_, _LIBCPP_ATOMIC_FLAG_TYPE(__v), __m);}
		_LIBCPP_INLINE_VISIBILITY
		void notify_one() volatile _NOEXCEPT
		{__cxx_atomic_notify_one(&__a_);}
		_LIBCPP_INLINE_VISIBILITY
		void notify_one() _NOEXCEPT
		{__cxx_atomic_notify_one(&__a_);}
		_LIBCPP_INLINE_VISIBILITY
		void notify_all() volatile _NOEXCEPT
		{__cxx_atomic_notify_all(&__a_);}
		_LIBCPP_INLINE_VISIBILITY
		void notify_all() _NOEXCEPT
		{__cxx_atomic_notify_all(&__a_);}

		_LIBCPP_INLINE_VISIBILITY
atomic_flag() _NOEXCEPT _LIBCPP_DEFAULT		atomic_flag() _NOEXCEPT _LIBCPP_DEFAULT

_LIBCPP_INLINE_VISIBILITY _LIBCPP_CONSTEXPR		_LIBCPP_INLINE_VISIBILITY _LIBCPP_CONSTEXPR
atomic_flag(bool __b) _NOEXCEPT : __a_(__b) {} // EXTENSION		atomic_flag(bool __b) _NOEXCEPT : __a_(__b) {} // EXTENSION

#ifndef _LIBCPP_CXX03_LANG		#ifndef _LIBCPP_CXX03_LANG
atomic_flag(const atomic_flag&) = delete;		atomic_flag(const atomic_flag&) = delete;
atomic_flag& operator=(const atomic_flag&) = delete;		atomic_flag& operator=(const atomic_flag&) = delete;
atomic_flag& operator=(const atomic_flag&) volatile = delete;		atomic_flag& operator=(const atomic_flag&) volatile = delete;
#else		#else
private:		private:
atomic_flag(const atomic_flag&);		atomic_flag(const atomic_flag&);
atomic_flag& operator=(const atomic_flag&);		atomic_flag& operator=(const atomic_flag&);
atomic_flag& operator=(const atomic_flag&) volatile;		atomic_flag& operator=(const atomic_flag&) volatile;
#endif		#endif
} atomic_flag;		} atomic_flag;


		inline _LIBCPP_INLINE_VISIBILITY
		bool
		atomic_flag_test(const volatile atomic_flag* __o) _NOEXCEPT
		{
		return __o->test();
		}

		inline _LIBCPP_INLINE_VISIBILITY
		bool
		atomic_flag_test(const atomic_flag* __o) _NOEXCEPT
		{
		return __o->test();
		}

		inline _LIBCPP_INLINE_VISIBILITY
		bool
		atomic_flag_test_explicit(const volatile atomic_flag* __o, memory_order __m) _NOEXCEPT
		{
		return __o->test(__m);
		}

		inline _LIBCPP_INLINE_VISIBILITY
		bool
		atomic_flag_test_explicit(const atomic_flag* __o, memory_order __m) _NOEXCEPT
		{
		return __o->test(__m);
		}

inline _LIBCPP_INLINE_VISIBILITY		inline _LIBCPP_INLINE_VISIBILITY
bool		bool
atomic_flag_test_and_set(volatile atomic_flag* __o) _NOEXCEPT		atomic_flag_test_and_set(volatile atomic_flag* __o) _NOEXCEPT
{		{
return __o->test_and_set();		return __o->test_and_set();
}		}

inline _LIBCPP_INLINE_VISIBILITY		inline _LIBCPP_INLINE_VISIBILITY
Show All 40 Lines

inline _LIBCPP_INLINE_VISIBILITY		inline _LIBCPP_INLINE_VISIBILITY
void		void
atomic_flag_clear_explicit(atomic_flag* __o, memory_order __m) _NOEXCEPT		atomic_flag_clear_explicit(atomic_flag* __o, memory_order __m) _NOEXCEPT
{		{
__o->clear(__m);		__o->clear(__m);
}		}


		inline _LIBCPP_INLINE_VISIBILITY
		void
		atomic_flag_wait(const volatile atomic_flag* __o, bool __v) _NOEXCEPT
		{
		__o->wait(__v);
		}

		inline _LIBCPP_INLINE_VISIBILITY
		void
		atomic_flag_wait(const atomic_flag* __o, bool __v) _NOEXCEPT
		{
		__o->wait(__v);
		}

		inline _LIBCPP_INLINE_VISIBILITY
		void
		atomic_flag_wait_explicit(const volatile atomic_flag* __o,
		bool __v, memory_order __m) _NOEXCEPT
		{
		__o->wait(__v, __m);
		}

		inline _LIBCPP_INLINE_VISIBILITY
		void
		atomic_flag_wait_explicit(const atomic_flag* __o,
		bool __v, memory_order __m) _NOEXCEPT
		{
		__o->wait(__v, __m);
		}

		inline _LIBCPP_INLINE_VISIBILITY
		void
		atomic_flag_notify_one(volatile atomic_flag* __o) _NOEXCEPT
		{
		__o->notify_one();
		}

		inline _LIBCPP_INLINE_VISIBILITY
		void
		atomic_flag_notify_one(atomic_flag* __o) _NOEXCEPT
		{
		__o->notify_one();
		}

		inline _LIBCPP_INLINE_VISIBILITY
		void
		atomic_flag_notify_all(volatile atomic_flag* __o) _NOEXCEPT
		{
		__o->notify_all();
		}

		inline _LIBCPP_INLINE_VISIBILITY
		void
		atomic_flag_notify_all(atomic_flag* __o) _NOEXCEPT
		{
		__o->notify_all();
		}

// fences		// fences

inline _LIBCPP_INLINE_VISIBILITY		inline _LIBCPP_INLINE_VISIBILITY
void		void
atomic_thread_fence(memory_order __m) _NOEXCEPT		atomic_thread_fence(memory_order __m) _NOEXCEPT
{		{
__cxx_atomic_thread_fence(__m);		__cxx_atomic_thread_fence(__m);
}		}
▲ Show 20 Lines • Show All 52 Lines • ▼ Show 20 Lines

typedef atomic<intptr_t> atomic_intptr_t;		typedef atomic<intptr_t> atomic_intptr_t;
typedef atomic<uintptr_t> atomic_uintptr_t;		typedef atomic<uintptr_t> atomic_uintptr_t;
typedef atomic<size_t> atomic_size_t;		typedef atomic<size_t> atomic_size_t;
typedef atomic<ptrdiff_t> atomic_ptrdiff_t;		typedef atomic<ptrdiff_t> atomic_ptrdiff_t;
typedef atomic<intmax_t> atomic_intmax_t;		typedef atomic<intmax_t> atomic_intmax_t;
typedef atomic<uintmax_t> atomic_uintmax_t;		typedef atomic<uintmax_t> atomic_uintmax_t;

		static_assert(ATOMIC_INT_LOCK_FREE, "This library assumes atomic<int> is lock-free.");

		typedef atomic<int> atomic_signed_lock_free;
		typedef atomic<unsigned> atomic_unsigned_lock_free;

#define ATOMIC_FLAG_INIT {false}		#define ATOMIC_FLAG_INIT {false}
#define ATOMIC_VAR_INIT(__v) {__v}		#define ATOMIC_VAR_INIT(__v) {__v}

_LIBCPP_END_NAMESPACE_STD		_LIBCPP_END_NAMESPACE_STD

#endif // _LIBCPP_ATOMIC		#endif // _LIBCPP_ATOMIC

libcxx/include/barrier

This file was added.

				// -- C++ --
				//===--------------------------- barrier ----------------------------------===//
				//
				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
				//
				//===----------------------------------------------------------------------===//

				#ifndef _LIBCPP_BARRIER
				#define _LIBCPP_BARRIER

				/*
				barrier synopsis

				namespace std
				{

				template<class CompletionFunction = see below>
				class barrier
				{
				public:
				using arrival_token = see below;

				constexpr explicit barrier(ptrdiff_t phase_count,
				CompletionFunction f = CompletionFunction());
				~barrier();

				barrier(const barrier&) = delete;
				barrier& operator=(const barrier&) = delete;

				[[nodiscard]] arrival_token arrive(ptrdiff_t update = 1);
				void wait(arrival_token&& arrival) const;

				void arrive_and_wait();
				void arrive_and_drop();

				private:
				CompletionFunction completion; // exposition only
				};

				}

				*/

				#include <__config>
				#include <__threading_support>
				#include <atomic>
				# ifndef _LIBCPP_HAS_NO_TREE_BARRIER
				# include <memory>
				# endif

				#if !defined(_LIBCPP_HAS_NO_PRAGMA_SYSTEM_HEADER)
				#pragma GCC system_header
				#endif

				#ifdef _LIBCPP_HAS_NO_THREADS
				# error <barrier> is not supported on this single threaded system
				#endif

				#if _LIBCPP_STD_VER < 11
				# error <barrier> is requires C++11 or later
				#endif

				_LIBCPP_BEGIN_NAMESPACE_STD

				struct __empty_completion
				{
				inline _LIBCPP_INLINE_VISIBILITY
				void operator()() noexcept
				{
				}
				};

				#ifndef _LIBCPP_HAS_NO_TREE_BARRIER

				template<class _CompletionF = __empty_completion>
				class __barrier_base {

				ptrdiff_t __expected;
				__atomic_base<ptrdiff_t> __expected_adjustment;
				_CompletionF __completion;

				using __phase_t = uint8_t;
				__atomic_base<__phase_t> __phase;

				struct __state_t
				{
				struct {
				__atomic_base<__phase_t> __phase = ATOMIC_VAR_INIT(0);
				} __tickets[64];
				};
				::std::unique_ptr<__state_t[]> __state;

				_LIBCPP_INLINE_VISIBILITY
				bool __arrive(__phase_t const __old_phase)
				{
				__phase_t const __half_step = __old_phase + 1, __full_step = __old_phase + 2;
				#ifndef _LIBCPP_HAS_NO_THREAD_FAVORITE_BARRIER_INDEX
				ptrdiff_t __current = __libcpp_thread_favorite_barrier_index,
				#else
				ptrdiff_t __current = 0,
				#endif
				__current_expected = __expected,
				__last_node = (__current_expected >> 1);
				for(size_t __round = 0;; ++__round) {
				if(__current_expected == 1)
				return true;
				for(;;++__current) {
				#ifndef _LIBCPP_HAS_NO_THREAD_FAVORITE_BARRIER_INDEX
				if(0 == __round) {
				if(__current >= __current_expected)
				__current = 0;
				__libcpp_thread_favorite_barrier_index = __current;
				}
				#endif
				__phase_t expect = __old_phase;
				if(__current == __last_node && (__current_expected & 1))
				{
				if(__state[__current].__tickets[__round].__phase.compare_exchange_strong(expect, __full_step, memory_order_acq_rel))
				break; // I'm 1 in 1, go to next __round
				}
				else if(__state[__current].__tickets[__round].__phase.compare_exchange_strong(expect, __half_step, memory_order_acq_rel))
				{
				return false; // I'm 1 in 2, done with arrival
				}
				else if(expect == __half_step)
				{
				if(__state[__current].__tickets[__round].__phase.compare_exchange_strong(expect, __full_step, memory_order_acq_rel))
				break; // I'm 2 in 2, go to next __round
				}
				}
				__current_expected = (__current_expected >> 1) + (__current_expected & 1);
				__current &= ~( 1 << __round );
				__last_node &= ~( 1 << __round );
				}
				}

				public:
				using arrival_token = __phase_t;

				_LIBCPP_INLINE_VISIBILITY
				__barrier_base(ptrdiff_t __expected, _CompletionF __completion = _CompletionF())
				: __expected(__expected), __expected_adjustment(0), __completion(std::move(__completion)),
				__phase(0), __state(new __state_t[(__expected+1) >> 1])
				{
				}
				[[nodiscard]] _LIBCPP_INLINE_VISIBILITY
				arrival_token arrive(ptrdiff_t update)
				{
				auto __old_phase = __phase.load(memory_order_relaxed);
				for(; update; --update)
				if(__arrive(__old_phase)) {
				__completion();
				__expected += __expected_adjustment.load(memory_order_relaxed);
				__expected_adjustment.store(0, memory_order_relaxed);
				__phase.store(__old_phase + 2, memory_order_release);
				}
				return __old_phase;
				}
				_LIBCPP_INLINE_VISIBILITY
				void wait(arrival_token&& __old_phase) const
				{
				__libcpp_thread_poll_with_backoff([=]() -> bool {
				return __phase.load(memory_order_acquire) != __old_phase;
				});
				}
				_LIBCPP_INLINE_VISIBILITY
				void arrive_and_drop()
				{
				__expected_adjustment.fetch_sub(1, memory_order_relaxed);
				(void)arrive(1);
				}
				};

				#else

				template<class _CompletionF>
				class __barrier_base {

				__atomic_base<ptrdiff_t> __expected;
				__atomic_base<ptrdiff_t> __arrived;
				_CompletionF __completion;
				__atomic_base<bool> __phase;
				public:
				using arrival_token = bool;

				_LIBCPP_INLINE_VISIBILITY
				__barrier_base(ptrdiff_t __expected, _CompletionF __completion = _CompletionF())
				: __expected(__expected), __arrived(__expected), __completion(__completion), __phase(false)
				{
				}
				[[nodiscard]] _LIBCPP_INLINE_VISIBILITY
				arrival_token arrive(ptrdiff_t update)
				{
				auto const __old_phase = __phase.load(memory_order_relaxed);
				auto const __result = __arrived.fetch_sub(update, memory_order_acq_rel) - update;
				auto const new_expected = __expected.load(memory_order_relaxed);
				if(0 == __result) {
				__completion();
				__arrived.store(new_expected, memory_order_relaxed);
				__phase.store(!__old_phase, memory_order_release);
				atomic_notify_all(&__phase);
				}
				return __old_phase;
				}
				_LIBCPP_INLINE_VISIBILITY
				void wait(arrival_token&& __old_phase) const
				{
				__phase.wait(__old_phase, memory_order_acquire);
				}
				_LIBCPP_INLINE_VISIBILITY
				void arrive_and_drop()
				{
				__expected.fetch_sub(1, memory_order_relaxed);
				(void)arrive(1);
				}
				};

				class __barrier_base<__empty_completion> {

				static constexpr uint64_t __expected_unit = 1ull;
				static constexpr uint64_t __arrived_unit = 1ull << 32;
				static constexpr uint64_t __expected_mask = __arrived_unit - 1;
				static constexpr uint64_t __phase_bit = 1ull << 63;
				static constexpr uint64_t __arrived_mask = (__phase_bit - 1) & ~__expected_mask;

				__atomic_base<uint64_t> __phase_arrived_expected;

				static _LIBCPP_INLINE_VISIBILITY
				constexpr uint64_t __init(ptrdiff_t __count) _NOEXCEPT
				{
				return ((uint64_t(1u << 31) - __count) << 32)
				\| (uint64_t(1u << 31) - __count);
				}

				public:
				using arrival_token = uint64_t;

				_LIBCPP_INLINE_VISIBILITY
				explicit inline __barrier_base(ptrdiff_t __count, __empty_completion = __empty_completion())
				: __phase_arrived_expected(__init(__count))
				{
				}
				[[nodiscard]] inline _LIBCPP_INLINE_VISIBILITY
				arrival_token arrive(ptrdiff_t update)
				{
				auto const __inc = __arrived_unit * update;
				auto const __old = __phase_arrived_expected.fetch_add(__inc, memory_order_acq_rel);
				if((__old ^ (__old + __inc)) & __phase_bit) {
				__phase_arrived_expected.fetch_add((__old & __expected_mask) << 32, memory_order_relaxed);
				__phase_arrived_expected.notify_all();
				}
				return __old & __phase_bit;
				}
				inline _LIBCPP_INLINE_VISIBILITY
				void wait(arrival_token&& __phase) const
				{
				__libcpp_thread_poll_with_backoff([=]() -> bool
				{
				uint64_t const __current = __phase_arrived_expected.load(memory_order_acquire);
				return ((__current & __phase_bit) != __phase);
				});
				}
				inline _LIBCPP_INLINE_VISIBILITY
				void arrive_and_drop()
				{
				__phase_arrived_expected.fetch_add(__expected_unit, memory_order_relaxed);
				(void)arrive(1);
				}
				};

				#endif //_LIBCPP_HAS_NO_TREE_BARRIER

				template<class _CompletionF = __empty_completion>
				zoecarverUnsubmitted Done Reply Inline Actions Will `__barrier_base` be defined if `_LIBCPP_HAS_NO_TREE_BARRIER` isn't? zoecarver: Will `__barrier_base` be defined if `_LIBCPP_HAS_NO_TREE_BARRIER` isn't?
				__simt__AuthorUnsubmitted Done Reply Inline Actions Yes. The macro ends up selecting between two different base classes -- one is the scalable tree barrier, the other is the simpler central barrier. __simt__: Yes. The macro ends up selecting between two different base classes -- one is the scalable tree…
				class barrier {

				__barrier_base<_CompletionF> __b;
				public:
				using arrival_token = typename __barrier_base<_CompletionF>::arrival_token;

				_LIBCPP_INLINE_VISIBILITY
				barrier(ptrdiff_t __count, _CompletionF __completion = _CompletionF())
				: __b(__count, std::move(__completion)) {
				}

				barrier(barrier const&) = delete;
				barrier& operator=(barrier const&) = delete;

				[[nodiscard]] _LIBCPP_INLINE_VISIBILITY
				arrival_token arrive(ptrdiff_t update = 1)
				{
				return __b.arrive(update);
				}
				_LIBCPP_INLINE_VISIBILITY
				void wait(arrival_token&& __phase) const
				{
				__b.wait(std::move(__phase));
				}
				_LIBCPP_INLINE_VISIBILITY
				void arrive_and_wait()
				{
				wait(arrive());
				}
				_LIBCPP_INLINE_VISIBILITY
				void arrive_and_drop()
				{
				__b.arrive_and_drop();
				}
				};

				_LIBCPP_END_NAMESPACE_STD

				#endif //_LIBCPP_BARRIER

libcxx/include/chrono

Show First 20 Lines • Show All 1,562 Lines • ▼ Show 20 Lines
{		{
return __lhs.time_since_epoch() - __rhs.time_since_epoch();		return __lhs.time_since_epoch() - __rhs.time_since_epoch();
}		}

//////////////////////////////////////////////////////////		//////////////////////////////////////////////////////////
/////////////////////// clocks ///////////////////////////		/////////////////////// clocks ///////////////////////////
//////////////////////////////////////////////////////////		//////////////////////////////////////////////////////////

		#ifndef _LIBCPP_HAS_CLOCK_API_EXTERNAL
		ldionneUnsubmitted Not Done Reply Inline Actions What's that? ldionne: What's that?
		__simt__AuthorUnsubmitted Done Reply Inline Actions Timed wait functions need <chrono>. That meant that I had to port <chrono> to CUDA as part of this effort. In CUDA you can't ask the OS for the time, you need to ask the silicon for the time. I followed the precedent of external threading to implement this as external clocks. Makes sense? __simt__: Timed wait functions need <chrono>. That meant that I had to port <chrono> to CUDA as part of…

class _LIBCPP_TYPE_VIS system_clock		class _LIBCPP_TYPE_VIS system_clock
{		{
public:		public:
typedef microseconds duration;		typedef microseconds duration;
typedef duration::rep rep;		typedef duration::rep rep;
typedef duration::period period;		typedef duration::period period;
typedef chrono::time_point<system_clock> time_point;		typedef chrono::time_point<system_clock> time_point;
static _LIBCPP_CONSTEXPR_AFTER_CXX11 const bool is_steady = false;		static _LIBCPP_CONSTEXPR_AFTER_CXX11 const bool is_steady = false;
Show All 16 Lines	public:
static time_point now() _NOEXCEPT;		static time_point now() _NOEXCEPT;
};		};

typedef steady_clock high_resolution_clock;		typedef steady_clock high_resolution_clock;
#else		#else
typedef system_clock high_resolution_clock;		typedef system_clock high_resolution_clock;
#endif		#endif

		#endif // _LIBCPP_HAS_CLOCK_API_EXTERNAL

#if _LIBCPP_STD_VER > 17		#if _LIBCPP_STD_VER > 17

		#ifndef _LIBCPP_HAS_CLOCK_API_EXTERNAL

// [time.clock.file], type file_clock		// [time.clock.file], type file_clock
using file_clock = _VSTD_FS::_FilesystemClock;		using file_clock = _VSTD_FS::_FilesystemClock;

template<class _Duration>		template<class _Duration>
using file_time = time_point<file_clock, _Duration>;		using file_time = time_point<file_clock, _Duration>;


template <class _Duration>		template <class _Duration>
using sys_time = time_point<system_clock, _Duration>;		using sys_time = time_point<system_clock, _Duration>;
using sys_seconds = sys_time<seconds>;		using sys_seconds = sys_time<seconds>;
using sys_days = sys_time<days>;		using sys_days = sys_time<days>;

struct local_t {};		struct local_t {};
template<class Duration>		template<class Duration>
using local_time = time_point<local_t, Duration>;		using local_time = time_point<local_t, Duration>;
using local_seconds = local_time<seconds>;		using local_seconds = local_time<seconds>;
using local_days = local_time<days>;		using local_days = local_time<days>;

		#endif // _LIBCPP_HAS_CLOCK_API_EXTERNAL

struct last_spec { explicit last_spec() = default; };		struct last_spec { explicit last_spec() = default; };

class day {		class day {
private:		private:
unsigned char __d;		unsigned char __d;
public:		public:
day() = default;		day() = default;
▲ Show 20 Lines • Show All 1,331 Lines • Show Last 20 Lines

libcxx/include/cstddef

	Show First 20 Lines • Show All 54 Lines • ▼ Show 20 Lines
	using ::max_align_t;			using ::max_align_t;
	#else			#else
	typedef long double max_align_t;			typedef long double max_align_t;
	#endif			#endif

	_LIBCPP_END_NAMESPACE_STD			_LIBCPP_END_NAMESPACE_STD

	#if _LIBCPP_STD_VER > 14			#if _LIBCPP_STD_VER > 14
				#ifdef _LIBCPP_BEGIN_NAMESPACE_STD_NOVERSION
				_LIBCPP_BEGIN_NAMESPACE_STD_NOVERSION
				#else
	namespace std // purposefully not versioned			namespace std // purposefully not versioned
	{			{
				#endif //_LIBCPP_BEGIN_NAMESPACE_STD_NOVERSION
	enum class byte : unsigned char {};			enum class byte : unsigned char {};

	constexpr byte operator\| (byte __lhs, byte __rhs) noexcept			constexpr byte operator\| (byte __lhs, byte __rhs) noexcept
	{			{
	return static_cast<byte>(			return static_cast<byte>(
	static_cast<unsigned char>(			static_cast<unsigned char>(
	static_cast<unsigned int>(__lhs) \| static_cast<unsigned int>(__rhs)			static_cast<unsigned int>(__lhs) \| static_cast<unsigned int>(__rhs)
	));			));
	Show All 27 Lines
	constexpr byte operator~ (byte __b) noexcept			constexpr byte operator~ (byte __b) noexcept
	{			{
	return static_cast<byte>(			return static_cast<byte>(
	static_cast<unsigned char>(			static_cast<unsigned char>(
	~static_cast<unsigned int>(__b)			~static_cast<unsigned int>(__b)
	));			));
	}			}

				#ifdef _LIBCPP_END_NAMESPACE_STD_NOVERSION
				_LIBCPP_END_NAMESPACE_STD_NOVERSION
				#else
	}			}
				#endif //_LIBCPP_END_NAMESPACE_STD_NOVERSION

	#include <type_traits> // rest of byte			#include <type_traits> // rest of byte
	#endif			#endif

	#endif // _LIBCPP_CSTDDEF			#endif // _LIBCPP_CSTDDEF

libcxx/include/latch

This file was added.

				// -- C++ --
				//===--------------------------- latch -----------------------------------===//
				//
				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
				//
				//===----------------------------------------------------------------------===//

				#ifndef _LIBCPP_LATCH
				#define _LIBCPP_LATCH

				/*
				latch synopsis

				namespace std
				{

				class latch
				{
				public:
				constexpr explicit latch(ptrdiff_t expected);
				~latch();

				latch(const latch&) = delete;
				latch& operator=(const latch&) = delete;

				void count_down(ptrdiff_t update = 1);
				bool try_wait() const noexcept;
				void wait() const;
				void arrive_and_wait(ptrdiff_t update = 1);

				private:
				ptrdiff_t counter; // exposition only
				};

				}

				*/

				#include <__config>
				#include <__threading_support>
				#include <atomic>

				#if !defined(_LIBCPP_HAS_NO_PRAGMA_SYSTEM_HEADER)
				#pragma GCC system_header
				#endif

				#ifdef _LIBCPP_HAS_NO_THREADS
				# error <latch> is not supported on this single threaded system
				#endif

				#if _LIBCPP_STD_VER < 11
				# error <latch> is requires C++11 or later
				zoecarverUnsubmitted Not Done Reply Inline Actions Can we make it so that these headers just don't exist before C++11 via cmake? That might be a nicer way to fail. zoecarver: Can we make it so that these headers just don't exist before C++11 via cmake? That might be a…
				__simt__AuthorUnsubmitted Not Done Reply Inline Actions I don't have this skill. :^/ But I do think that we want to support them in C++11/14/17. This author's production team is going to present it to users in these dialects, for instance. __simt__: I don't have this skill. :^/ But I do think that we want to support them in C++11/14/17. This…
				#endif

				_LIBCPP_BEGIN_NAMESPACE_STD

				class __latch_base
				{
				__atomic_base<ptrdiff_t> __counter;
				#ifndef _LIBCPP_HAS_NO_THREAD_CONTENTION_STATE
				mutable __libcpp_contention_t __contention;
				#endif
				public:
				_LIBCPP_INLINE_VISIBILITY
				constexpr explicit __latch_base(ptrdiff_t __expected)
				: __counter(__expected) { }

				~__latch_base() = default;
				__latch_base(const __latch_base&) = delete;
				__latch_base& operator=(const __latch_base&) = delete;

				inline _LIBCPP_INLINE_VISIBILITY
				void count_down(ptrdiff_t __update = 1)
				{
				auto const __old = __counter.fetch_sub(__update, memory_order_release);
				if(__old == __update)
				#ifdef _LIBCPP_HAS_NO_THREAD_CONTENTION_STATE
				__cxx_atomic_notify_all(&__counter.__a_);
				#else
				__cxx_atomic_notify_all(&__counter.__a_, &__contention);
				#endif
				}
				inline _LIBCPP_INLINE_VISIBILITY
				bool try_wait() const noexcept
				{
				return __counter.load(memory_order_acquire) == 0;
				}
				inline _LIBCPP_INLINE_VISIBILITY
				void wait() const
				{
				while(1) {
				auto const __current = __counter.load(memory_order_acquire);
				if(__current == 0)
				return;
				#ifdef _LIBCPP_HAS_NO_THREAD_CONTENTION_STATE
				__cxx_atomic_wait(&__counter.__a_, __current, memory_order_relaxed);
				#else
				__cxx_atomic_wait(&__counter.__a_, __current, memory_order_relaxed, &__contention);
				#endif
				}
				}
				inline _LIBCPP_INLINE_VISIBILITY
				void arrive_and_wait(ptrdiff_t __update = 1)
				{
				count_down(__update);
				wait();
				}
				};

				class latch : public __latch_base {
				public:
				_LIBCPP_INLINE_VISIBILITY
				constexpr explicit latch(ptrdiff_t __expected)
				: __latch_base(__expected) { }

				};

				_LIBCPP_END_NAMESPACE_STD

				#endif //_LIBCPP_LATCH

libcxx/include/module.modulemap

Show First 20 Lines • Show All 225 Lines • ▼ Show 20 Lines	module array {
header "array"		header "array"
export initializer_list		export initializer_list
export *		export *
}		}
module atomic {		module atomic {
header "atomic"		header "atomic"
export *		export *
}		}
		module barrier {
		teemperorUnsubmitted Not Done Reply Inline Actions Maybe I'm missing something here, but due to this submodule we are always parsing the barrier header even when building the module with a language standard < C++14. This means that everyone using C++11 is no longer able to use the 'std' Clang module after this commit. Is this intentional? teemperor: Maybe I'm missing something here, but due to this submodule we are always parsing the barrier…
		ldionneUnsubmitted Not Done Reply Inline Actions No, this is not intentional. Sorry, there were several failures that needed fixing after committing this (failures that were impossible to notice without throwing the change at all the build bots) -- we're getting there. ldionne: No, this is not intentional. Sorry, there were several failures that needed fixing after…
		teemperorUnsubmitted Not Done Reply Inline Actions No worries, pushed a fix in b61e83eb0e31c1e6006569b43bb98a61ff44ca4c teemperor: No worries, pushed a fix in b61e83eb0e31c1e6006569b43bb98a61ff44ca4c
		header "barrier"
		export *
		}
module bit {		module bit {
header "bit"		header "bit"
export *		export *
}		}
module bitset {		module bitset {
header "bitset"		header "bitset"
export string		export string
export iosfwd		export iosfwd
▲ Show 20 Lines • Show All 87 Lines • ▼ Show 20 Lines	module istream {
header "istream"		header "istream"
// FIXME: should re-export ios, streambuf?		// FIXME: should re-export ios, streambuf?
export *		export *
}		}
module iterator {		module iterator {
header "iterator"		header "iterator"
export *		export *
}		}
		module latch {
		header "latch"
		export *
		}
module limits {		module limits {
header "limits"		header "limits"
export *		export *
}		}
module list {		module list {
header "list"		header "list"
export initializer_list		export initializer_list
export *		export *
▲ Show 20 Lines • Show All 50 Lines • ▼ Show 20 Lines	module regex {
header "regex"		header "regex"
export initializer_list		export initializer_list
export *		export *
}		}
module scoped_allocator {		module scoped_allocator {
header "scoped_allocator"		header "scoped_allocator"
export *		export *
}		}
		module semaphore {
		header "semaphore"
		export *
		}
module set {		module set {
header "set"		header "set"
export initializer_list		export initializer_list
export *		export *
}		}
module sstream {		module sstream {
header "sstream"		header "sstream"
// FIXME: should re-export istream, ostream, ios, streambuf, string?		// FIXME: should re-export istream, ostream, ios, streambuf, string?
▲ Show 20 Lines • Show All 202 Lines • Show Last 20 Lines

libcxx/include/semaphore

This file was added.

				// -- C++ --
				//===--------------------------- semaphore --------------------------------===//
				//
				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
				//
				//===----------------------------------------------------------------------===//

				#ifndef _LIBCPP_SEMAPHORE
				#define _LIBCPP_SEMAPHORE

				/*
				semaphore synopsis

				namespace std {

				template<ptrdiff_t least_max_value = implementation-defined>
				class counting_semaphore
				{
				public:
				static constexpr ptrdiff_t max() noexcept;

				constexpr explicit counting_semaphore(ptrdiff_t desired);
				~counting_semaphore();

				counting_semaphore(const counting_semaphore&) = delete;
				counting_semaphore& operator=(const counting_semaphore&) = delete;

				void release(ptrdiff_t update = 1);
				void acquire();
				bool try_acquire() noexcept;
				template<class Rep, class Period>
				bool try_acquire_for(const chrono::duration<Rep, Period>& rel_time);
				template<class Clock, class Duration>
				bool try_acquire_until(const chrono::time_point<Clock, Duration>& abs_time);

				private:
				ptrdiff_t counter; // exposition only
				};

				using binary_semaphore = counting_semaphore<1>;

				}

				*/

				#include <__config>
				#include <__threading_support>
				#include <atomic>
				#include <cassert>

				#if !defined(_LIBCPP_HAS_NO_PRAGMA_SYSTEM_HEADER)
				#pragma GCC system_header
				#endif

				#ifdef _LIBCPP_HAS_NO_THREADS
				# error <semaphore> is not supported on this single threaded system
				#endif

				#if _LIBCPP_STD_VER < 11
				# error <semaphore> is requires C++11 or later
				ldionneUnsubmitted Not Done Reply Inline Actions Olivier and I spoke offline, and it's reasonable to request C++14 at least here. Please do this in all the headers you're adding. The idea is to avoid having headers that are supported in older dialects than they need to be (within reason), which is often a source of technical debt. ldionne: Olivier and I spoke offline, and it's reasonable to request C++14 at least here. Please do this…
				__simt__AuthorUnsubmitted Done Reply Inline Actions Yep __simt__: Yep
				#endif

				_LIBCPP_BEGIN_NAMESPACE_STD

				class __atomic_semaphore_base
				{
				griwesUnsubmitted Done Reply Inline Actions Use a reserved identifier for the template parameter. griwes: Use a reserved identifier for the template parameter.
				__atomic_base<ptrdiff_t> __count;
				#ifndef _LIBCPP_HAS_NO_THREAD_CONTENTION_STATE
				__libcpp_contention_t __contention;
				#endif
				public:
				_LIBCPP_INLINE_VISIBILITY
				__atomic_semaphore_base(ptrdiff_t __count) : __count(__count) { }
				~__atomic_semaphore_base() = default;
				__atomic_semaphore_base(__atomic_semaphore_base const&) = delete;
				__atomic_semaphore_base& operator=(__atomic_semaphore_base const&) = delete;
				ldionneUnsubmitted Done Reply Inline Actions Thanks for the comments in this file, I wish we did that more often. ldionne: Thanks for the comments in this file, I wish we did that more often.
				__simt__AuthorUnsubmitted Done Reply Inline Actions NP, Eric had requested something here. Feel free to ask for comments elsewhere. __simt__: NP, Eric had requested something here. Feel free to ask for comments elsewhere.

				_LIBCPP_INLINE_VISIBILITY
				void release(ptrdiff_t __update = 1)
				{
				if(0 < __count.fetch_add(__update, memory_order_release))
				;
				#ifdef _LIBCPP_HAS_NO_THREAD_CONTENTION_STATE
				else if(__update > 1)
				__cxx_atomic_notify_all(&__count.__a_);
				else
				__cxx_atomic_notify_one(&__count.__a_);
				#else
				else if(__update > 1)
				__cxx_atomic_notify_all(&__count.__a_, &__contention);
				else
				__cxx_atomic_notify_one(&__count.__a_, &__contention);
				#endif
				}
				_LIBCPP_INLINE_VISIBILITY
				void acquire()
				{
				ptrdiff_t __old = __count.load(memory_order_relaxed);
				while (1) {
				if(__old == 0) {
				#ifdef _LIBCPP_HAS_NO_THREAD_CONTENTION_STATE
				__cxx_atomic_wait(&__count.__a_, __old, memory_order_relaxed);
				#else
				__cxx_atomic_wait(&__count.__a_, __old, memory_order_relaxed, &__contention);
				zoecarverUnsubmitted Done Reply Inline Actions Does `__cxx_atomic_notify_one` always call out to `__cxx_atomic_notify_all`? If so, can we get rid of `__cxx_atomic_notify_one`? zoecarver: Does `__cxx_atomic_notify_one` always call out to `__cxx_atomic_notify_all`? If so, can we get…
				#endif
				__old = __count.load(memory_order_relaxed);
				continue;
				}
				if(__count.compare_exchange_weak(__old, __old - 1,
				memory_order_acquire, memory_order_relaxed))
				break;
				}
				}
				template <class Rep, class Period>
				_LIBCPP_INLINE_VISIBILITY
				bool try_acquire_for(chrono::duration<Rep, Period> const& __rel_time)
				{
				return __libcpp_thread_poll_with_backoff([=]() {
				ptrdiff_t __old = __count.load(memory_order_acquire);
				if (__old == 0)
				return false;
				return __count.compare_exchange_weak(__old, __old - 1,
				memory_order_acquire, memory_order_relaxed);
				}, __rel_time);
				}
				};

				#ifndef _LIBCPP_HAS_NO_SEMAPHORES

				class __sem_semaphore_basic_base {

				#ifdef __APPLE__
				atomic<ptrdiff_t> __balance = {0};
				#endif
				__libcpp_semaphore_t __semaphore;

				public:

				_LIBCPP_EXPORTED_FROM_ABI
				__sem_semaphore_basic_base(ptrdiff_t __count);
				_LIBCPP_EXPORTED_FROM_ABI
				~__sem_semaphore_basic_base();
				_LIBCPP_EXPORTED_FROM_ABI
				void release(ptrdiff_t __update);
				_LIBCPP_EXPORTED_FROM_ABI
				void acquire();
				_LIBCPP_EXPORTED_FROM_ABI
				bool try_acquire_for(chrono::nanoseconds __rel_time);
				};

				#ifndef _LIBCPP_HAS_NO_SEMAPHORE_BACK_BUFFER

				class __sem_semaphore_back_buffered_base {

				_LIBCPP_INLINE_VISIBILITY
				void __backfill();

				__sem_semaphore_basic_base __semaphore;
				atomic<ptrdiff_t> __backbuffer;

				public:
				_LIBCPP_EXPORTED_FROM_ABI
				__sem_semaphore_back_buffered_base(ptrdiff_t __count);
				_LIBCPP_EXPORTED_FROM_ABI
				~__sem_semaphore_back_buffered_base();
				_LIBCPP_EXPORTED_FROM_ABI
				void release(ptrdiff_t __update);
				_LIBCPP_EXPORTED_FROM_ABI
				void acquire();
				_LIBCPP_EXPORTED_FROM_ABI
				bool try_acquire_for(chrono::nanoseconds __rel_time);
				};

				#endif //_LIBCPP_HAS_NO_SEMAPHORE_BACK_BUFFER

				#ifndef _LIBCPP_HAS_NO_SEMAPHORE_FRONT_BUFFER

				class __sem_semaphore_front_buffered_base {

				_LIBCPP_INLINE_VISIBILITY
				bool __try_acquire_fast();
				_LIBCPP_INLINE_VISIBILITY
				void __try_done();

				#ifndef _LIBCPP_HAS_NO_SEMAPHORE_BACK_BUFFER
				__sem_semaphore_back_buffered_base __semaphore;
				#else
				__sem_semaphore_basic_base __semaphore;
				#endif
				atomic<ptrdiff_t> __frontbuffer;

				public:
				_LIBCPP_EXPORTED_FROM_ABI
				__sem_semaphore_front_buffered_base(ptrdiff_t __count);
				_LIBCPP_EXPORTED_FROM_ABI
				~__sem_semaphore_front_buffered_base();
				_LIBCPP_EXPORTED_FROM_ABI
				void release(ptrdiff_t __update);
				_LIBCPP_EXPORTED_FROM_ABI
				void acquire();
				_LIBCPP_EXPORTED_FROM_ABI
				bool try_acquire_for(chrono::nanoseconds __rel_time);
				};

				#endif //_LIBCPP_HAS_NO_SEMAPHORE_FRONT_BUFFER

				#endif //_LIBCPP_HAS_NO_SEMAPHORES

				#if defined(_LIBCPP_HAS_NO_SEMAPHORES)
				template<ptrdiff_t>
				using __semaphore_base = __atomic_semaphore_base;
				#else
				# if !defined(_LIBCPP_HAS_NO_SEMAPHORE_FRONT_BUFFER)
				using __sem_semaphore_base = __sem_semaphore_front_buffered_base;
				# elif !defined(_LIBCPP_HAS_NO_SEMAPHORE_BACK_BUFFER)
				using __sem_semaphore_base = __sem_semaphore_back_buffered_base;
				# else
				using __sem_semaphore_base = __sem_semaphore_basic_base;
				# endif
				template<ptrdiff_t __least_max_value>
				using __semaphore_base =
				typename conditional<(__least_max_value > 1 && __least_max_value <= _LIBCPP_SEMAPHORE_MAX),
				__sem_semaphore_base,
				__atomic_semaphore_base>::type;
				#endif

				template<ptrdiff_t __least_max_value = _LIBCPP_SEMAPHORE_MAX>
				class counting_semaphore
				{
				__semaphore_base<__least_max_value> __semaphore;
				public:
				static constexpr ptrdiff_t max() noexcept {
				return __least_max_value;
				}

				_LIBCPP_INLINE_VISIBILITY
				counting_semaphore(ptrdiff_t __count = 0) : __semaphore(__count) { }
				~counting_semaphore() = default;

				counting_semaphore(const counting_semaphore&) = delete;
				counting_semaphore& operator=(const counting_semaphore&) = delete;

				_LIBCPP_INLINE_VISIBILITY
				void release(ptrdiff_t __update = 1)
				{
				__semaphore.release(__update);
				}
				_LIBCPP_INLINE_VISIBILITY
				void acquire()
				{
				__semaphore.acquire();
				}
				template<class Rep, class Period>
				_LIBCPP_INLINE_VISIBILITY
				bool try_acquire_for(chrono::duration<Rep, Period> const& __rel_time)
				{
				return __semaphore.try_acquire_for(chrono::duration_cast<chrono::nanoseconds>(__rel_time));
				}
				_LIBCPP_INLINE_VISIBILITY
				bool try_acquire()
				{
				return try_acquire_for(chrono::nanoseconds::zero());
				}
				template <class Clock, class Duration>
				_LIBCPP_INLINE_VISIBILITY
				bool try_acquire_until(chrono::time_point<Clock, Duration> const& __abs_time)
				{
				auto const current = Clock::now();
				if(current >= __abs_time)
				return try_acquire();
				else
				return try_acquire_for(__abs_time - current);
				}
				};

				using binary_semaphore = counting_semaphore<1>;

				_LIBCPP_END_NAMESPACE_STD

				#endif //_LIBCPP_SEMAPHORE
				griwesUnsubmitted Done Reply Inline Actions Use a reserved identifier for the template parameter. griwes: Use a reserved identifier for the template parameter.
				griwesUnsubmitted Done Reply Inline Actions Use a reserved identifier for the template parameter. griwes: Use a reserved identifier for the template parameter.

libcxx/include/stdexcept

Show First 20 Lines • Show All 66 Lines • ▼ Show 20 Lines	public:
~__libcpp_refstring();		~__libcpp_refstring();

const char* c_str() const _NOEXCEPT {return __imp_;}		const char* c_str() const _NOEXCEPT {return __imp_;}
};		};
#endif // !_LIBCPP_ABI_VCRUNTIME		#endif // !_LIBCPP_ABI_VCRUNTIME

_LIBCPP_END_NAMESPACE_STD		_LIBCPP_END_NAMESPACE_STD

namespace std // purposefully not using versioning namespace		#ifdef _LIBCPP_BEGIN_NAMESPACE_STD_NOVERSION
		_LIBCPP_BEGIN_NAMESPACE_STD_NOVERSION
		#else
		namespace std // purposefully not versioned
{		{
		#endif //_LIBCPP_BEGIN_NAMESPACE_STD_NOVERSION

class _LIBCPP_EXCEPTION_ABI logic_error		class _LIBCPP_EXCEPTION_ABI logic_error
: public exception		: public exception
{		{
#ifndef _LIBCPP_ABI_VCRUNTIME		#ifndef _LIBCPP_ABI_VCRUNTIME
private:		private:
_VSTD::__libcpp_refstring __imp_;		_VSTD::__libcpp_refstring __imp_;
public:		public:
▲ Show 20 Lines • Show All 114 Lines • ▼ Show 20 Lines	public:
_LIBCPP_INLINE_VISIBILITY explicit underflow_error(const string& __s) : runtime_error(__s) {}		_LIBCPP_INLINE_VISIBILITY explicit underflow_error(const string& __s) : runtime_error(__s) {}
_LIBCPP_INLINE_VISIBILITY explicit underflow_error(const char* __s) : runtime_error(__s) {}		_LIBCPP_INLINE_VISIBILITY explicit underflow_error(const char* __s) : runtime_error(__s) {}

#ifndef _LIBCPP_ABI_VCRUNTIME		#ifndef _LIBCPP_ABI_VCRUNTIME
virtual ~underflow_error() _NOEXCEPT;		virtual ~underflow_error() _NOEXCEPT;
#endif		#endif
};		};

} // std		#ifdef _LIBCPP_END_NAMESPACE_STD_NOVERSION
		_LIBCPP_END_NAMESPACE_STD_NOVERSION
		#else
		}
		#endif //_LIBCPP_END_NAMESPACE_STD_NOVERSION

_LIBCPP_BEGIN_NAMESPACE_STD		_LIBCPP_BEGIN_NAMESPACE_STD

// in the dylib		// in the dylib
_LIBCPP_NORETURN _LIBCPP_FUNC_VIS void __throw_runtime_error(const char*);		_LIBCPP_NORETURN _LIBCPP_FUNC_VIS void __throw_runtime_error(const char*);

_LIBCPP_NORETURN inline _LIBCPP_INLINE_VISIBILITY		_LIBCPP_NORETURN inline _LIBCPP_INLINE_VISIBILITY
void __throw_logic_error(const char*__msg)		void __throw_logic_error(const char*__msg)
▲ Show 20 Lines • Show All 89 Lines • Show Last 20 Lines

libcxx/include/type_traits

	Show First 20 Lines • Show All 4,018 Lines • ▼ Show 20 Lines

	template <class _CharT>			template <class _CharT>
	using _IsCharLikeType = _And<is_standard_layout<_CharT>, is_trivial<_CharT> >;			using _IsCharLikeType = _And<is_standard_layout<_CharT>, is_trivial<_CharT> >;

	_LIBCPP_END_NAMESPACE_STD			_LIBCPP_END_NAMESPACE_STD

	#if _LIBCPP_STD_VER > 14			#if _LIBCPP_STD_VER > 14
	// std::byte			// std::byte
				#ifdef _LIBCPP_BEGIN_NAMESPACE_STD_NOVERSION
				_LIBCPP_BEGIN_NAMESPACE_STD_NOVERSION
				ldionneUnsubmitted Not Done Reply Inline Actions What are these drive-by fixes? ldionne: What are these drive-by fixes?
				__simt__AuthorUnsubmitted Done Reply Inline Actions It's important for the CUDA port that there not be naked "namespace std { ... }" that don't use the macros, because we need to inject our namespace name. I tried to keep the drive-by to a minimum. __simt__: It's important for the CUDA port that there not be naked "namespace std { ... }" that don't use…
				#else
	namespace std // purposefully not versioned			namespace std // purposefully not versioned
	{			{
				#endif //_LIBCPP_BEGIN_NAMESPACE_STD_NOVERSION
	template <class _Integer>			template <class _Integer>
	constexpr typename enable_if<is_integral_v<_Integer>, byte>::type &			constexpr typename enable_if<is_integral_v<_Integer>, byte>::type &
	operator<<=(byte& __lhs, _Integer __shift) noexcept			operator<<=(byte& __lhs, _Integer __shift) noexcept
	{ return __lhs = __lhs << __shift; }			{ return __lhs = __lhs << __shift; }

	template <class _Integer>			template <class _Integer>
	constexpr typename enable_if<is_integral_v<_Integer>, byte>::type			constexpr typename enable_if<is_integral_v<_Integer>, byte>::type
	operator<< (byte __lhs, _Integer __shift) noexcept			operator<< (byte __lhs, _Integer __shift) noexcept
	{ return static_cast<byte>(static_cast<unsigned char>(static_cast<unsigned int>(__lhs) << __shift)); }			{ return static_cast<byte>(static_cast<unsigned char>(static_cast<unsigned int>(__lhs) << __shift)); }

	template <class _Integer>			template <class _Integer>
	constexpr typename enable_if<is_integral_v<_Integer>, byte>::type &			constexpr typename enable_if<is_integral_v<_Integer>, byte>::type &
	operator>>=(byte& __lhs, _Integer __shift) noexcept			operator>>=(byte& __lhs, _Integer __shift) noexcept
	{ return __lhs = __lhs >> __shift; }			{ return __lhs = __lhs >> __shift; }

	template <class _Integer>			template <class _Integer>
	constexpr typename enable_if<is_integral_v<_Integer>, byte>::type			constexpr typename enable_if<is_integral_v<_Integer>, byte>::type
	operator>> (byte __lhs, _Integer __shift) noexcept			operator>> (byte __lhs, _Integer __shift) noexcept
	{ return static_cast<byte>(static_cast<unsigned char>(static_cast<unsigned int>(__lhs) >> __shift)); }			{ return static_cast<byte>(static_cast<unsigned char>(static_cast<unsigned int>(__lhs) >> __shift)); }

	template <class _Integer>			template <class _Integer>
	constexpr typename enable_if<is_integral_v<_Integer>, _Integer>::type			constexpr typename enable_if<is_integral_v<_Integer>, _Integer>::type
	to_integer(byte __b) noexcept { return static_cast<_Integer>(__b); }			to_integer(byte __b) noexcept { return static_cast<_Integer>(__b); }

				#ifdef _LIBCPP_END_NAMESPACE_STD_NOVERSION
				_LIBCPP_END_NAMESPACE_STD_NOVERSION
				#else
	}			}
				#endif //_LIBCPP_END_NAMESPACE_STD_NOVERSION
	#endif			#endif

	#endif // _LIBCPP_TYPE_TRAITS			#endif // _LIBCPP_TYPE_TRAITS

libcxx/src/CMakeLists.txt

	set(LIBCXX_LIB_CMAKEFILES_DIR "${CMAKE_CURRENT_BINARY_DIR}${CMAKE_FILES_DIRECTORY}" PARENT_SCOPE)			set(LIBCXX_LIB_CMAKEFILES_DIR "${CMAKE_CURRENT_BINARY_DIR}${CMAKE_FILES_DIRECTORY}" PARENT_SCOPE)

	# Get sources			# Get sources
	set(LIBCXX_SOURCES			set(LIBCXX_SOURCES
	algorithm.cpp			algorithm.cpp
	any.cpp			any.cpp
				atomic.cpp
				barrier.cpp
	bind.cpp			bind.cpp
	charconv.cpp			charconv.cpp
	chrono.cpp			chrono.cpp
	condition_variable.cpp			condition_variable.cpp
	condition_variable_destructor.cpp			condition_variable_destructor.cpp
	debug.cpp			debug.cpp
	exception.cpp			exception.cpp
	functional.cpp			functional.cpp
	future.cpp			future.cpp
	hash.cpp			hash.cpp
	include/apple_availability.h			include/apple_availability.h
	include/atomic_support.h			include/atomic_support.h
	include/config_elast.h			include/config_elast.h
	include/refstring.h			include/refstring.h
	ios.cpp			ios.cpp
	iostream.cpp			iostream.cpp
	locale.cpp			locale.cpp
	memory.cpp			memory.cpp
	mutex.cpp			mutex.cpp
	mutex_destructor.cpp			mutex_destructor.cpp
	new.cpp			new.cpp
	optional.cpp			optional.cpp
	random.cpp			random.cpp
	regex.cpp			regex.cpp
				semaphore.cpp
	shared_mutex.cpp			shared_mutex.cpp
	stdexcept.cpp			stdexcept.cpp
	string.cpp			string.cpp
	strstream.cpp			strstream.cpp
	support/runtime/exception_fallback.ipp			support/runtime/exception_fallback.ipp
	support/runtime/exception_glibcxx.ipp			support/runtime/exception_glibcxx.ipp
	support/runtime/exception_libcxxabi.ipp			support/runtime/exception_libcxxabi.ipp
	support/runtime/exception_libcxxrt.ipp			support/runtime/exception_libcxxrt.ipp
	▲ Show 20 Lines • Show All 361 Lines • Show Last 20 Lines

libcxx/src/atomic.cpp

This file was added.

				//===------------------------- atomic.cpp ---------------------------------===//
				//
				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
				//
				//===----------------------------------------------------------------------===//

				#include "__config"

				#ifndef _LIBCPP_HAS_NO_THREADS

				#include "atomic"

				_LIBCPP_BEGIN_NAMESPACE_STD

				#if !defined(_LIBCPP_HAS_NO_THREAD_CONTENTION_TABLE)

				__libcpp_contention_t __libcpp_contention_state_[ 256 /* < there's no magic in this number */ ];

				_LIBCPP_FUNC_VIS
				__libcpp_contention_t * __libcpp_contention_state(void const volatile * p) _NOEXCEPT {
				return __libcpp_contention_state_ + ((std::uintptr_t)p & 255);
				}

				#endif //_LIBCPP_HAS_NO_THREAD_CONTENTION_TABLE

				_LIBCPP_END_NAMESPACE_STD

				zoecarverUnsubmitted Done Reply Inline Actions For what platforms is `_LIBCPP_HAS_PLATFORM_WAIT_STATE` false, and have you tested on those platforms? I'm worried that there might be compiler errors. zoecarver: For what platforms is `_LIBCPP_HAS_PLATFORM_WAIT_STATE` false, and have you tested on those…
				__simt__AuthorUnsubmitted Done Reply Inline Actions It's false on CUDA. It would be false on platforms that can't rely on OS support for efficient waiting and have to fall back to polling with backoff. __simt__: It's false on CUDA. It would be false on platforms that can't rely on OS support for efficient…
				#endif //_LIBCPP_HAS_NO_THREADS
				zoecarverUnsubmitted Done Reply Inline Actions Is `__libcpp_platform_wait` defined on non-linux machines? zoecarver: Is `__libcpp_platform_wait` defined on non-linux machines?
				__simt__AuthorUnsubmitted Done Reply Inline Actions Yes. __simt__: Yes.
				zoecarverUnsubmitted Not Done Reply Inline Actions Instead of having void pointers that are casted, I don't see any reason these couldn't be defined as their actual types (`__cxx_atomic_impl` and `__libcpp_platform_contention_t`). zoecarver: Instead of having void pointers that are casted, I don't see any reason these couldn't be…
				__simt__AuthorUnsubmitted Done Reply Inline Actions This is the uglier part of the patch. We have this situation where one platform (Apple) is about to go from not having platform wait states, to having them. This API is trying to be able to take either kind of efficient wait structure in the application OR in the dylib, and provide efficient waiting either way. I think it would not be unreasonable if you guys asked me to cut this part out in order to get a first commit of the facility, and then work in the background with Louis on recovering the capability in some way, with or without a public patch. __simt__: This is the uglier part of the patch. We have this situation where one platform (Apple) is…
				zoecarverUnsubmitted Not Done Reply Inline Actions I'll defer to others on this but, (assuming it wouldn't be much more work for you) it might be better to remove that from this patch and add it as a follow-up patch. In general, the less code in this patch, the faster it can get committed. zoecarver: I'll defer to others on this but, (assuming it wouldn't be much more work for you) it might be…
				zoecarverUnsubmitted Done Reply Inline Actions same as below zoecarver: same as below
				__simt__AuthorUnsubmitted Done Reply Inline Actions Answered below. __simt__: Answered below.
				zoecarverUnsubmitted Done Reply Inline Actions Is there a test for this case? What will be the effect of `__s` not getting updated? zoecarver: Is there a test for this case? What will be the effect of `__s` not getting updated?
				__simt__AuthorUnsubmitted Done Reply Inline Actions Undefined behavior. Usually a segfault. __simt__: Undefined behavior. Usually a segfault.
				zoecarverUnsubmitted Not Done Reply Inline Actions Isn't that a problem if `_LIBCPP_HAS_NO_PLATFORM_WAIT_TABLE` isn't defined? zoecarver: Isn't that a problem if `_LIBCPP_HAS_NO_PLATFORM_WAIT_TABLE` isn't defined?
				zoecarverUnsubmitted Done Reply Inline Actions Will this ever be true? zoecarver: Will this ever be true?
				__simt__AuthorUnsubmitted Done Reply Inline Actions Oh yes. Every time that there is a contending waiter concurrent with this notifier. This condition is true whenever the facility's use is not trivial. __simt__: Oh yes. Every time that there is a contending waiter concurrent with this notifier. This…
				zoecarverUnsubmitted Done Reply Inline Actions What's the point of this macro, `ATOMIC_VAR_INIT` (I realize you didn't add it, but I'm still curious)? zoecarver: What's the point of this macro, `ATOMIC_VAR_INIT` (I realize you didn't add it, but I'm still…
				__simt__AuthorUnsubmitted Done Reply Inline Actions Prior to P0883 merging into C++20, atomics are constructed in an uninitialized state. You're supposed to use this macro to give it a static initialization. This macro is deprecated after C++20. __simt__: Prior to P0883 merging into C++20, atomics are constructed in an uninitialized state. You're…
				zoecarverUnsubmitted Not Done Reply Inline Actions Heh. I didn't realize that was part of the standard (I was wondering why it wasn't mangled). Good to know. zoecarver: Heh. I didn't realize that was part of the standard (I was wondering why it wasn't mangled).
				zoecarverUnsubmitted Done Reply Inline Actions This is different if `_LIBCPP_HAS_PLATFORM_WAIT_STATE` is false, right? zoecarver: This is different if `_LIBCPP_HAS_PLATFORM_WAIT_STATE` is false, right?
				__simt__AuthorUnsubmitted Done Reply Inline Actions If you have a platform wait state, then it's used by this facility. If you don't have a platform wait state, then a condvar is used instead. __simt__: If you have a platform wait state, then it's used by this facility. If you don't have a…
				zoecarverUnsubmitted Not Done Reply Inline Actions I see. I got confused for a second while trying to follow the `#if`s. zoecarver: I see. I got confused for a second while trying to follow the `#if`s.
				ldionneUnsubmitted Not Done Reply Inline Actions In `apple_availability.h`, add: #if defined(__ENVIRONMENT_MAC_OS_X_VERSION_MIN_REQUIRED__) #if __ENVIRONMENT_MAC_OS_X_VERSION_MIN_REQUIRED__ >= 101500 #define _LIBCPP_USE_ULOCK #endif #elif defined(__ENVIRONMENT_IPHONE_OS_VERSION_MIN_REQUIRED__) #if __ENVIRONMENT_IPHONE_OS_VERSION_MIN_REQUIRED__ >= 130000 #define _LIBCPP_USE_ULOCK #endif #elif defined(__ENVIRONMENT_TV_OS_VERSION_MIN_REQUIRED__) #if __ENVIRONMENT_TV_OS_VERSION_MIN_REQUIRED__ >= 130000 #define _LIBCPP_USE_ULOCK #endif #elif defined(__ENVIRONMENT_WATCH_OS_VERSION_MIN_REQUIRED__) #if __ENVIRONMENT_WATCH_OS_VERSION_MIN_REQUIRED__ >= 60000 #define _LIBCPP_USE_ULOCK #endif #endif // __ENVIRONMENT_._VERSION_MIN_REQUIRED__ That should do it for all platforms aligned to Mac OS 10.15. Feel free to use whatever name for `_LIBCPP_USE_ULOCK` -- `_LIBCPP_USE_APPLE_ULOCK` probably makes the most sense since it's an Apple-specific API. Then, your `#elif` becomes `#elif defined(__APPLE__) && defined(_LIBCPP_USE_APPLE_ULOCK)`. ldionne:* In `apple_availability.h`, add: ``` #if defined(__ENVIRONMENT_MAC_OS_X_VERSION_MIN_REQUIRED__)…
				__simt__AuthorUnsubmitted Done Reply Inline Actions Thanks! __simt__: Thanks!
				MBkktUnsubmitted Not Done Reply Inline Actions Why do you use timed wait here? It's strange, and also different with macos(ulock) behavior MBkkt: Why do you use timed wait here? It's strange, and also different with macos(ulock) behavior
				__simt__AuthorUnsubmitted Done Reply Inline Actions Because it would be incorrect otherwise. There's an infinitesimal but not zero probability that the serial number being waited on can roll over, incremented by precisely the right value, and then you might think it didn't change when it did. There are a lot of weird discussions we could have from here. Does this negate Futex? I don't think it should. Is it even worth worrying about (like, the computer might be hit by a neutron flying in from space much more often than this)? I think it's cheap to mitigate. There's a judgement-call here about how long to wait. It's arbitrary. We wouldn't need to do this if Linux supported 64-bit Futex. Partly because we would rarely use serial numbers like this, and also partly because 64-bit numbers don't roll over inside of the useful life span of computer hardware. __simt__: Because it would be incorrect otherwise. There's an infinitesimal but not zero probability…
				MBkktUnsubmitted Not Done Reply Inline Actions First of thanks for answer. But if I understand correctly it's only about not atomic<int> behavior (any atomic which use waiter pool) So for atomic<int> it's not needed (also why unsigned int not used in wait without waiter pool?) Another thought, for ulock behavior should be same? MBkkt: First of thanks for answer. But if I understand correctly it's only about not atomic<int>…

libcxx/src/barrier.cpp

This file was added.

				//===------------------------- barrier.cpp ---------------------------------===//
				//
				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
				//
				//===----------------------------------------------------------------------===//

				#include "__config"

				#ifndef _LIBCPP_HAS_NO_THREADS

				#include "barrier"

				_LIBCPP_BEGIN_NAMESPACE_STD

				#if !defined(_LIBCPP_HAS_NO_TREE_BARRIER) && !defined(_LIBCPP_HAS_NO_THREAD_FAVORITE_BARRIER_INDEX) && (_LIBCPP_STD_VER >= 11)

				_LIBCPP_EXPORTED_FROM_ABI
				thread_local ptrdiff_t __libcpp_thread_favorite_barrier_index = 0;

				#endif

				_LIBCPP_END_NAMESPACE_STD

				#endif //_LIBCPP_HAS_NO_THREADS
				zoecarverUnsubmitted Done Reply Inline Actions Will this ever not only happen on the first iteration (if so, move it out of the loop maybe)? zoecarver: Will this ever not //only// happen on the first iteration (if so, move it out of the loop…
				__simt__AuthorUnsubmitted Done Reply Inline Actions This needs to be inside the loop unfortunately. During the first round, we need to record our effective start (leaf) location in the tree. I could express it differently - record it only at the end of the first round - but it would remain in the loop nest. What we could potentially do is dumb down this favorite barrier index concept, or remove it entirely. It's worth a relatively small amount of performance by comparison to using the tree barrier in the first place. I would not be offended if you asked me to give you an ordered list of things I could delete and tell you what it costs you to delete them, approximately. Then we could stop where we are more comfortable. __simt__: This needs to be inside the loop unfortunately. During the first round, we need to record our…
				zoecarverUnsubmitted Not Done Reply Inline Actions What we could potentially do is dumb down this favorite barrier index concept, or remove it entirely. It's worth a relatively small amount of performance by comparison to using the tree barrier in the first place. I don't know enough about this piece of code or potential methods of implementation to comment on what we should do. I'll defer to you/others on this. zoecarver: > What we could potentially do is dumb down this favorite barrier index concept, or remove it…

libcxx/src/semaphore.cpp

This file was added.

				//===------------------------ semaphore.cpp -------------------------------===//
				//
				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
				//
				//===----------------------------------------------------------------------===//

				#include "__config"

				#ifndef _LIBCPP_HAS_NO_THREADS

				#include "semaphore"

				_LIBCPP_BEGIN_NAMESPACE_STD

				#if !defined(_LIBCPP_HAS_NO_SEMAPHORES)

				_LIBCPP_EXPORTED_FROM_ABI
				__sem_semaphore_basic_base::__sem_semaphore_basic_base(ptrdiff_t __count) :
				__semaphore()
				{
				__libcpp_semaphore_init(&__semaphore, __count);
				}
				_LIBCPP_EXPORTED_FROM_ABI
				__sem_semaphore_basic_base::~__sem_semaphore_basic_base() {
				#ifdef __APPLE__
				auto __b = __balance.load(memory_order_relaxed);
				zoecarverUnsubmitted Done Reply Inline Actions Why is this only needed for apple? zoecarver: Why is this only needed for apple?
				__simt__AuthorUnsubmitted Done Reply Inline Actions Because of how GCD semaphores work, unfortunately. We could delete this by sending all Apple semaphores to the generic template based on atomics. Last I spoke with Louis, we thought that would be acceptable. __simt__: Because of how GCD semaphores work, unfortunately. We could delete this by sending all Apple…
				for(; __b > 0; --__b) __libcpp_semaphore_wait(&__semaphore);
				for(; __b < 0; ++__b) __libcpp_semaphore_post(&__semaphore);
				#endif
				__libcpp_semaphore_destroy(&__semaphore);
				}
				_LIBCPP_EXPORTED_FROM_ABI
				void __sem_semaphore_basic_base::release(ptrdiff_t __update)
				{
				#ifdef __APPLE__
				__balance.fetch_add(__update, memory_order_relaxed);
				#endif
				for(; __update; --__update)
				__libcpp_semaphore_post(&__semaphore);
				}
				_LIBCPP_EXPORTED_FROM_ABI
				void __sem_semaphore_basic_base::acquire()
				{
				__libcpp_semaphore_wait(&__semaphore);
				#ifdef __APPLE__
				__balance.fetch_sub(1, memory_order_relaxed);
				#endif
				}
				_LIBCPP_EXPORTED_FROM_ABI
				bool __sem_semaphore_basic_base::try_acquire_for(chrono::nanoseconds __rel_time)
				{
				auto const __success = __libcpp_semaphore_wait_timed(&__semaphore, __rel_time);
				#ifdef __APPLE__
				__balance.fetch_sub(1, memory_order_relaxed);
				#endif
				return __success;
				}

				#ifndef _LIBCPP_HAS_NO_SEMAPHORE_BACK_BUFFER

				_LIBCPP_INLINE_VISIBILITY
				void __sem_semaphore_back_buffered_base::__backfill()
				{
				ptrdiff_t __expect = 2;
				while(__expect != 0)
				{
				ptrdiff_t const __sub = __expect > 1 ? 2 : 1;
				if(!__backbuffer.compare_exchange_weak(__expect, __expect - __sub, memory_order_acquire, memory_order_relaxed))
				continue;
				if(__sub > 1)
				__semaphore.release(1);
				__semaphore.release(1);
				break;
				}
				}
				_LIBCPP_EXPORTED_FROM_ABI
				__sem_semaphore_back_buffered_base::__sem_semaphore_back_buffered_base(ptrdiff_t __count) :
				__semaphore(__count), __backbuffer(0)
				{
				}
				_LIBCPP_EXPORTED_FROM_ABI
				__sem_semaphore_back_buffered_base::~__sem_semaphore_back_buffered_base()
				{
				}
				_LIBCPP_EXPORTED_FROM_ABI
				void __sem_semaphore_back_buffered_base::release(ptrdiff_t __update)
				{
				if(__update > 2)
				__backbuffer.fetch_add(__update - 2, memory_order_acq_rel);
				if(__update > 1)
				__semaphore.release(1);
				__semaphore.release(1);
				}
				_LIBCPP_EXPORTED_FROM_ABI
				void __sem_semaphore_back_buffered_base::acquire()
				{
				__semaphore.acquire();
				__backfill();
				}
				_LIBCPP_EXPORTED_FROM_ABI
				bool __sem_semaphore_back_buffered_base::try_acquire_for(chrono::nanoseconds __rel_time)
				{
				if(!__semaphore.try_acquire_for(__rel_time))
				return false;
				__backfill();
				return true;
				}

				#endif //_LIBCPP_HAS_NO_SEMAPHORE_BACK_BUFFER

				#ifndef _LIBCPP_HAS_NO_SEMAPHORE_FRONT_BUFFER

				_LIBCPP_INLINE_VISIBILITY
				bool __sem_semaphore_front_buffered_base::__try_acquire_fast()
				{
				ptrdiff_t __old;
				__libcpp_thread_poll_with_backoff([&]() {
				__old = __frontbuffer.load(memory_order_relaxed);
				return 0 != (__old >> 32);
				}, chrono::microseconds(5));
				// always steal if you can
				while(__old >> 32)
				zoecarverUnsubmitted Done Reply Inline Actions Where does `50` come from? Maybe make this a macro. zoecarver: Where does `50` come from? Maybe make this a macro.
				__simt__AuthorUnsubmitted Done Reply Inline Actions Yeah. Aren't there enough macros though? I think I might come back later with a different patch to propose a set of macros to configure back-offs. __simt__: Yeah. Aren't there enough macros though? I think I might come back later with a different patch…
				zoecarverUnsubmitted Not Done Reply Inline Actions Yes, there are an unfortunate number of macros in this bit of code. I don't feel strongly about adding a macro now or later, maybe add a comment, though (that the number isn't magic or referenced elsewhere). zoecarver: Yes, there are an unfortunate number of macros in this bit of code. I don't feel strongly…
				if(__frontbuffer.compare_exchange_weak(__old, __old - (1ll << 32), memory_order_acquire))
				return true;
				// record we're waiting
				__old = __frontbuffer.fetch_add(1ll, memory_order_release);
				// ALWAYS steal if you can!
				while(__old >> 32)
				if(__frontbuffer.compare_exchange_weak(__old, __old - (1ll << 32), memory_order_acquire))
				break;
				// not going to wait after all
				if(__old >> 32) {
				__try_done();
				return true;
				}
				// the wait has begun...
				return false;
				}
				_LIBCPP_INLINE_VISIBILITY
				void __sem_semaphore_front_buffered_base::__try_done()
				{
				// record we're NOT waiting
				__frontbuffer.fetch_sub(1ll, memory_order_release);
				}
				_LIBCPP_EXPORTED_FROM_ABI
				__sem_semaphore_front_buffered_base::__sem_semaphore_front_buffered_base(ptrdiff_t __count) :
				__semaphore(0), __frontbuffer(__count << 32)
				{
				}
				_LIBCPP_EXPORTED_FROM_ABI
				__sem_semaphore_front_buffered_base::~__sem_semaphore_front_buffered_base()
				zoecarverUnsubmitted Done Reply Inline Actions Is this the same as `while (__old == 0)`? zoecarver: Is this the same as `while (__old == 0)`?
				__simt__AuthorUnsubmitted Done Reply Inline Actions It's not. This is masking the lower 32-bits of a 64-bit value, and then comparing that with 0. __simt__: It's not. This is masking the lower 32-bits of a 64-bit value, and then comparing that with 0.
				zoecarverUnsubmitted Not Done Reply Inline Actions I'm pretty sure they are the same. Look at this example. The first and last functions generate the same optimized assembly. zoecarver: I'm pretty sure they are the same. [[ https://godbolt.org/z/AtWZCY \| Look at this example ]].
				{
				}
				_LIBCPP_EXPORTED_FROM_ABI
				void __sem_semaphore_front_buffered_base::release(ptrdiff_t __update)
				{
				// boldly assume the semaphore is taken but uncontended
				ptrdiff_t __old = 0;
				// try to fast-release as long as it's uncontended
				while(0 == (__old & ~0ul))
				if(__frontbuffer.compare_exchange_weak(__old, __old + (__update << 32), memory_order_acq_rel))
				return;
				__semaphore.release(__update);
				}
				_LIBCPP_EXPORTED_FROM_ABI
				void __sem_semaphore_front_buffered_base::acquire()
				{
				if(__try_acquire_fast())
				return;
				__semaphore.acquire();
				__try_done();
				}
				_LIBCPP_EXPORTED_FROM_ABI
				bool __sem_semaphore_front_buffered_base::try_acquire_for(chrono::nanoseconds __rel_time)
				{
				if(__try_acquire_fast())
				return true;
				auto const __success = __semaphore.try_acquire_for(__rel_time);
				__try_done();
				return __success;
				}

				#endif //_LIBCPP_HAS_NO_SEMAPHORE_FRONT_BUFFER

				#endif

				_LIBCPP_END_NAMESPACE_STD

				#endif //_LIBCPP_HAS_NO_THREADS

libcxx/test/libcxx/double_include.sh.cpp

	Show All 19 Lines
	#endif			#endif

	// Top level headers			// Top level headers
	#include <algorithm>			#include <algorithm>
	#include <any>			#include <any>
	#include <array>			#include <array>
	#ifndef _LIBCPP_HAS_NO_THREADS			#ifndef _LIBCPP_HAS_NO_THREADS
	#include <atomic>			#include <atomic>
				#include <latch>
				#include <barrier>
				#include <semaphore>
	#endif			#endif
	#include <bit>			#include <bit>
	#include <bitset>			#include <bitset>
	#include <cassert>			#include <cassert>
	#include <ccomplex>			#include <ccomplex>
	#include <cctype>			#include <cctype>
	#include <cerrno>			#include <cerrno>
	#include <cfenv>			#include <cfenv>
	▲ Show 20 Lines • Show All 139 Lines • Show Last 20 Lines

libcxx/test/std/atomics/atomics.types.operations/atomics.types.operations.wait/atomic_wait.pass.cpp

This file was added.

				//===----------------------------------------------------------------------===//
				//
				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
				//
				//===----------------------------------------------------------------------===//
				//
				// UNSUPPORTED: libcpp-has-no-threads
				// XFAIL: c++98, c++03

				// <atomic>

				#include <atomic>
				#include <type_traits>
				#include <cassert>
				#include <thread>

				#include "test_macros.h"
				#include "../atomics.types.operations.req/atomic_helpers.h"

				template <class T>
				struct TestFn {
				void operator()() const {
				typedef std::atomic<T> A;

				A t;
				std::atomic_init(&t, T(1));
				assert(std::atomic_load(&t) == T(1));
				std::atomic_wait(&t, T(0));
				std::thread t_([&](){
				std::atomic_store(&t, T(3));
				std::atomic_notify_one(&t);
				});
				std::atomic_wait(&t, T(1));
				t_.join();

				volatile A vt;
				std::atomic_init(&vt, T(2));
				assert(std::atomic_load(&vt) == T(2));
				std::atomic_wait(&vt, T(1));
				std::thread t2_([&](){
				std::atomic_store(&vt, T(4));
				std::atomic_notify_one(&vt);
				});
				std::atomic_wait(&vt, T(2));
				t2_.join();
				}
				};

				int main(int, char**)
				{
				TestEachAtomicType<TestFn>()();

				return 0;
				}

libcxx/test/std/thread/thread.barrier/arrive.pass.cpp

This file was added.

				//===----------------------------------------------------------------------===//
				//
				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
				//
				//===----------------------------------------------------------------------===//
				//
				// UNSUPPORTED: libcpp-has-no-threads

				// <barrier>

				#include <barrier>
				#include <thread>

				#include "test_macros.h"

				int main(int, char**)
				{
				std::barrier<> b(2);
				griwesUnsubmitted Done Reply Inline Actions This depends on CTAD, which makes it only work in C++17 and above. Instead it should probably say `std::barrier<> b;`. Same comment for all the other barrier and semaphore tests. griwes: This depends on CTAD, which makes it only work in C++17 and above. Instead it should probably…

				auto tok = b.arrive();
				std::thread t([&](){
				(void)b.arrive();
				});
				b.wait(std::move(tok));
				t.join();

				auto tok2 = b.arrive(2);
				b.wait(std::move(tok2));
				return 0;
				}

libcxx/test/std/thread/thread.barrier/arrive_and_drop.pass.cpp

This file was added.

				//===----------------------------------------------------------------------===//
				//
				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
				//
				//===----------------------------------------------------------------------===//
				//
				// UNSUPPORTED: libcpp-has-no-threads

				// <barrier>

				#include <barrier>
				#include <thread>

				#include "test_macros.h"

				int main(int, char**)
				{
				std::barrier<> b(2);

				std::thread t([&](){
				b.arrive_and_drop();
				});

				b.arrive_and_wait();
				b.arrive_and_wait();
				t.join();
				return 0;
				}

libcxx/test/std/thread/thread.barrier/arrive_and_wait.pass.cpp

This file was added.

				//===----------------------------------------------------------------------===//
				//
				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
				//
				//===----------------------------------------------------------------------===//
				//
				// UNSUPPORTED: libcpp-has-no-threads

				// <barrier>

				#include <barrier>
				#include <thread>

				#include "test_macros.h"

				int main(int, char**)
				{
				std::barrier<> b(2);

				std::thread t([&](){
				for(int i = 0; i < 10; ++i)
				b.arrive_and_wait();
				});
				for(int i = 0; i < 10; ++i)
				b.arrive_and_wait();
				t.join();

				return 0;
				}

libcxx/test/std/thread/thread.barrier/completion.pass.cpp

This file was added.

				//===----------------------------------------------------------------------===//
				//
				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
				//
				//===----------------------------------------------------------------------===//
				//
				// UNSUPPORTED: libcpp-has-no-threads

				// <barrier>

				#include <barrier>
				#include <cassert>
				#include <thread>

				#include "test_macros.h"

				int main(int, char**)
				{
				int x = 0;
				auto comp = [&]() { x += 1; };
				std::barrier<decltype(comp)> b(2, comp);

				std::thread t([&](){
				for(int i = 0; i < 10; ++i)
				b.arrive_and_wait();
				});

				for(int i = 0; i < 10; ++i)
				b.arrive_and_wait();

				assert(x == 10);
				t.join();
				return 0;
				}

libcxx/test/std/thread/thread.barrier/version.pass.cpp

This file was added.

				//===----------------------------------------------------------------------===//
				//
				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
				//
				//===----------------------------------------------------------------------===//
				//
				// UNSUPPORTED: libcpp-has-no-threads

				// <barrier>

				#include <barrier>

				#include "test_macros.h"

				#ifndef _LIBCPP_VERSION
				#error _LIBCPP_VERSION not defined
				#endif

				int main(int, char**)
				{
				return 0;
				}

libcxx/test/std/thread/thread.latch/arrive_and_wait.pass.cpp

This file was added.

				//===----------------------------------------------------------------------===//
				//
				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
				//
				//===----------------------------------------------------------------------===//
				//
				// UNSUPPORTED: libcpp-has-no-threads

				// <latch>

				#include <latch>
				#include <thread>

				#include "test_macros.h"

				int main(int, char**)
				{
				std::latch l(2);

				std::thread t([&](){
				l.arrive_and_wait();
				});
				l.arrive_and_wait();
				t.join();

				return 0;
				}

libcxx/test/std/thread/thread.latch/count_down.pass.cpp

This file was added.

				//===----------------------------------------------------------------------===//
				//
				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
				//
				//===----------------------------------------------------------------------===//
				//
				// UNSUPPORTED: libcpp-has-no-threads

				// <latch>

				#include <latch>
				#include <thread>

				#include "test_macros.h"

				int main(int, char**)
				{
				std::latch l(2);

				l.count_down();
				std::thread t([&](){
				l.count_down();
				});
				l.wait();
				t.join();

				return 0;
				}

libcxx/test/std/thread/thread.latch/try_wait.pass.cpp

This file was added.

				//===----------------------------------------------------------------------===//
				//
				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
				//
				//===----------------------------------------------------------------------===//
				//
				// UNSUPPORTED: libcpp-has-no-threads

				// <latch>

				#include <latch>
				#include <cassert>

				#include "test_macros.h"

				int main(int, char**)
				{
				std::latch l(1);

				l.count_down();
				bool const b = l.try_wait();
				assert(b);

				return 0;
				}

libcxx/test/std/thread/thread.latch/version.pass.cpp

This file was added.

				//===----------------------------------------------------------------------===//
				//
				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
				//
				//===----------------------------------------------------------------------===//
				//
				// UNSUPPORTED: libcpp-has-no-threads

				// <latch>

				#include <latch>

				#include "test_macros.h"

				#ifndef _LIBCPP_VERSION
				#error _LIBCPP_VERSION not defined
				#endif

				int main(int, char**)
				{
				return 0;
				}

libcxx/test/std/thread/thread.semaphore/acquire.pass.cpp

This file was added.

				//===----------------------------------------------------------------------===//
				//
				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
				//
				//===----------------------------------------------------------------------===//
				//
				// UNSUPPORTED: libcpp-has-no-threads

				// <semaphore>

				#include <semaphore>
				#include <thread>

				#include "test_macros.h"

				int main(int, char**)
				{
				std::counting_semaphore<> s(2);

				std::thread t([&](){
				s.acquire();
				});
				t.join();

				s.acquire();

				return 0;
				}

libcxx/test/std/thread/thread.semaphore/binary.pass.cpp

This file was added.

				//===----------------------------------------------------------------------===//
				//
				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
				//
				//===----------------------------------------------------------------------===//
				//
				// UNSUPPORTED: libcpp-has-no-threads

				// <semaphore>

				#include <semaphore>
				#include <chrono>
				#include <thread>

				#include "test_macros.h"

				int main(int, char**)
				{
				std::binary_semaphore s(1);

				auto l = [&](){
				for(int i = 0; i < 1024; ++i) {
				s.acquire();
				std::this_thread::sleep_for(std::chrono::microseconds(1));
				s.release();
				}
				};

				std::thread t(l);
				l();

				t.join();

				return 0;
				}

libcxx/test/std/thread/thread.semaphore/max.pass.cpp

This file was added.

				//===----------------------------------------------------------------------===//
				//
				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
				//
				//===----------------------------------------------------------------------===//
				//
				// UNSUPPORTED: libcpp-has-no-threads

				// <semaphore>

				#include <semaphore>
				#include <thread>

				#include "test_macros.h"

				int main(int, char**)
				{
				static_assert(std::counting_semaphore<>::max() > 0);
				static_assert(std::counting_semaphore<1>::max() >= 1);
				static_assert(std::counting_semaphore<std::numeric_limits<int>::max()>::max() >= 1);
				static_assert(std::counting_semaphore<std::numeric_limits<unsigned>::max()>::max() >= 1);
				static_assert(std::counting_semaphore<std::numeric_limits<ptrdiff_t>::max()>::max() >= 1);
				static_assert(std::counting_semaphore<1>::max() == std::binary_semaphore::max());
				return 0;
				}

libcxx/test/std/thread/thread.semaphore/release.pass.cpp

This file was added.

				//===----------------------------------------------------------------------===//
				//
				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
				//
				//===----------------------------------------------------------------------===//
				//
				// UNSUPPORTED: libcpp-has-no-threads

				// <semaphore>

				#include <semaphore>
				#include <thread>

				#include "test_macros.h"

				int main(int, char**)
				{
				std::counting_semaphore<> s(0);

				s.release();
				s.acquire();

				std::thread t([&](){
				s.acquire();
				});
				s.release(2);
				t.join();
				s.acquire();

				return 0;
				}

libcxx/test/std/thread/thread.semaphore/timed.pass.cpp

This file was added.

				//===----------------------------------------------------------------------===//
				//
				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
				//
				//===----------------------------------------------------------------------===//
				//
				// UNSUPPORTED: libcpp-has-no-threads

				// <semaphore>

				#include <semaphore>
				#include <thread>
				#include <chrono>

				#include "test_macros.h"

				int main(int, char**)
				{
				auto const start = std::chrono::steady_clock::now();

				std::counting_semaphore<> s(0);

				assert(!s.try_acquire_until(start + std::chrono::milliseconds(250)));
				assert(!s.try_acquire_for(std::chrono::milliseconds(250)));

				std::thread t([&](){
				std::this_thread::sleep_for(std::chrono::milliseconds(250));
				s.release();
				std::this_thread::sleep_for(std::chrono::milliseconds(250));
				s.release();
				});

				assert(s.try_acquire_until(start + std::chrono::seconds(2)));
				assert(s.try_acquire_for(std::chrono::seconds(2)));
				t.join();

				auto const end = std::chrono::steady_clock::now();
				assert(end - start < std::chrono::seconds(10));

				return 0;
				}

libcxx/test/std/thread/thread.semaphore/try_acquire.pass.cpp

This file was added.

				//===----------------------------------------------------------------------===//
				//
				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
				//
				//===----------------------------------------------------------------------===//
				//
				// UNSUPPORTED: libcpp-has-no-threads

				// <semaphore>

				#include <semaphore>
				#include <thread>

				#include "test_macros.h"

				int main(int, char**)
				{
				std::counting_semaphore<> s(1);

				assert(s.try_acquire());
				s.release();
				assert(s.try_acquire());
				s.release(2);
				std::thread t([&](){
				assert(s.try_acquire());
				});
				t.join();
				assert(s.try_acquire());

				return 0;
				}

libcxx/test/std/thread/thread.semaphore/version.pass.cpp

This file was added.

				//===----------------------------------------------------------------------===//
				//
				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
				//
				//===----------------------------------------------------------------------===//
				//
				// UNSUPPORTED: libcpp-has-no-threads

				// <semaphore>

				#include <semaphore>

				#include "test_macros.h"

				#ifndef _LIBCPP_VERSION
				#error _LIBCPP_VERSION not defined
				#endif

				int main(int, char**)
				{
				return 0;
				}

This is an archive of the discontinued LLVM Phabricator instance.

Implementation of C++20's P1135R6 for libcxxClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 225615

libcxx/include/CMakeLists.txt

libcxx/include/__threading_support

libcxx/include/atomic

libcxx/include/barrier

libcxx/include/chrono

libcxx/include/cstddef

libcxx/include/latch

libcxx/include/module.modulemap

libcxx/include/semaphore

libcxx/include/stdexcept

libcxx/include/type_traits

libcxx/src/CMakeLists.txt

libcxx/src/atomic.cpp

libcxx/src/barrier.cpp

libcxx/src/semaphore.cpp

libcxx/test/libcxx/double_include.sh.cpp

libcxx/test/std/atomics/atomics.types.operations/atomics.types.operations.wait/atomic_wait.pass.cpp

libcxx/test/std/thread/thread.barrier/arrive.pass.cpp

libcxx/test/std/thread/thread.barrier/arrive_and_drop.pass.cpp

libcxx/test/std/thread/thread.barrier/arrive_and_wait.pass.cpp

libcxx/test/std/thread/thread.barrier/completion.pass.cpp

libcxx/test/std/thread/thread.barrier/version.pass.cpp

libcxx/test/std/thread/thread.latch/arrive_and_wait.pass.cpp

libcxx/test/std/thread/thread.latch/count_down.pass.cpp

libcxx/test/std/thread/thread.latch/try_wait.pass.cpp

libcxx/test/std/thread/thread.latch/version.pass.cpp

libcxx/test/std/thread/thread.semaphore/acquire.pass.cpp

libcxx/test/std/thread/thread.semaphore/binary.pass.cpp

libcxx/test/std/thread/thread.semaphore/max.pass.cpp

libcxx/test/std/thread/thread.semaphore/release.pass.cpp

libcxx/test/std/thread/thread.semaphore/timed.pass.cpp

libcxx/test/std/thread/thread.semaphore/try_acquire.pass.cpp

libcxx/test/std/thread/thread.semaphore/version.pass.cpp

Implementation of C++20's P1135R6 for libcxx
ClosedPublic