This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
include/experimental/
-
experimental/
72/118
task

Differential D46140

[coroutines] Add std::experimental::task<T> type
Needs ReviewPublic

Authored by lewissbaker on Apr 26 2018, 11:30 AM.

Download Raw Diff

Details

Reviewers

EricWF
CaseyCarter
GorNishanov

Summary

Adds the coroutine std::experimental::task<T> type described in proposal P1056R0.
See https://wg21.link/P1056R0.

This implementation allows customization of the allocator used to allocate the
coroutine frame by passing std::allocator_arg as the first argument, followed by
the allocator to use.

This supports co_awaiting the same task multiple times. The second and
subsequent times it returns a reference to the already-computed value.

This diff also adds some implementations of other utilities that have potential for
standardization as helpers within the test/... area:

sync_wait(awaitable) - See P1171R0
manual_reset_event

Move the definition of the aligned_allocation_size helper function
from <experimental/memory_resource> to <experimental/memory>
so it can be more widely used without pulling in memory_resource.

Outstanding work:

Use C++14 keywords directly rather than macro versions eg. use noexcept instead of _NOEXCEPT).
Add support for overaligned coroutine frames. This may need wording in the Coroutines TS to support passing the extra std::align_val_t.
Eliminate use of if constexpr if we want it to compile under C++14.

Diff Detail

Event Timeline

There are a very large number of changes, so older changes are hidden. Show Older Changes

CaseyCarter added inline comments.Apr 26 2018, 8:17 PM

include/experimental/task
56	Why qualify `size_t` here when you're in a subnamespace of `std`? (It's only necessary to qualify functions to avoid unintended ADL, not types.) Why not `_NOEXCEPT`?
68	Ditto extraneous `_VSTD`. (I'll stop commenting these now.)
82	Is it an option to store `struct foo : _Alloc { _DeallocFunc* bar; };` here so space isn't wasted for empty allocators? This would let you coalesce the allocator and non-allocator cases into a single case with a defaulted (empty) allocator.
94	`#include<type_traits>`
95	You probably want `noexcept` in the message instead of `_NOEXCEPT`.
97	This comment seems unnecessary in addition to the `static_assert`.
98	`#include<utility>`? `_Alloc __allocatorOnStack(_VSTD::move(__heapAllocator));` to avoid ADL on `move`, and because the Allocator requirements don't require implicit move construction. (I know, mega-nitpick.)
108	`#include<new>` `::new (static_cast<void>(static_cast<char>(__mem) + __paddedSize3)) _Alloc(__allocator);` (to ensure True Placement New)
113	Allocator special member functions are forbidden to throw (http://eel.is/c++draft/allocator.requirements#tab:utilities.allocator.requirements)
117	`deallocate` is also forbidden to throw (same citation).
133	Ditto "`static` is implied but it's nice to be explicit" `_NOEXCEPT` wouldn't hurt.
137	There's some "bonus" indentation on this line.
168	Style: `reinterpret_cast<_Tp&>(__storage_).~_Tp();` saves a couple of valuable keystrokes. (And 172, 206, ... if you care)
185	`::new (static_cast<void*>(&__storage_)) // ...`
195	The parameters to `is_convertible` are the source expression's type-and-value-category, and the destination type - your arguments are reversed. Also, direct and copy-init can vary for class types in C++ so you should require `is_constructible` (I don't really care if it's "require instead" or "require as well"): is_constructible<_Tp, _Value> && is_convertible<_Value, _Tp> (The first argument of `is_convertible` and the non-first arguments to `is_constructible` go through `declval`, so `X&&` and `X` are equivalent.)
197	`is_nothrow_constructible<_Tp, _Value>` is equivalent and more concise.
198	Ditto `::new (static_cast<void*>(&__storage_)` (I'll stop commenting these now.)
225	Too pretty; needs to be more `__ugly` (all three enumerators)
238	Weird indent.
251	Should be `unsigned char` so it can "provide storage" (http://eel.is/c++draft/intro.object#3) `alignas(_Tp) alignas(exception_ptr)` would let you skip the `__storageAlignment` rigamarole This class would be greatly simplified if this was an anonymous union: `union { char __empty; _Tp __value; exception_ptr __exception; };` so the `reinterpret_cast`ing could go away.
262	Pure style comment: I'd use a default member initializer for `__has_exception` and not declare this constructor at all.
270	Ditto not a pointer to `exception_ptr`.
327	Ditto comments from the other specialization.
378	Why is this not simply: template <> class __task_promise<void> final : public __task_promise_base { using _Handle = coroutine_handle<__task_promise>; public: _Handle get_return_object() _NOEXCEPT { return _Handle::from_promise(*this); } void return_void() _NOEXCEPT {} void unhandled_exception() _NOEXCEPT { #ifndef _LIBCPP_NO_EXCEPTIONS __exception_ = current_exception(); #endif } void __lvalue_result() { __throw_if_exception(); } void __rvalue_result() { __throw_if_exception(); } private: void __throw_if_exception() { #ifndef _LIBCPP_NO_EXCEPTIONS if (__exception_) { rethrow_exception(__exception_); } #endif } #ifndef _LIBCPP_NO_EXCEPTIONS exception_ptr __exception_; #endif };
385	excess indent
414	`#include<utility>`?
430	Pure style comment: Let your destructor do the work: task __old = _VSTD::exchange(__coro_, _VSTD::exchange(__other.__coro_, {}));
530	Style: `auto&& x` is more concise. Doesn't `x` dangle in this case? Be really explicit in the proposal about why it's a good idea if so.
558	@GorNishanov: Why isn't `co_return foo;` equivalent to `co_return move(foo);` for an automatic variable `foo`?
611	I suggest adding a test case with a stateful allocator.

GorNishanov added inline comments.Apr 26 2018, 9:17 PM

include/experimental/task
558	Not implemented yet in Clang and MSVC. I filed a bug: https://bugs.llvm.org/show_bug.cgi?id=37265

lewissbaker added inline comments.May 1 2018, 4:35 AM

include/experimental/task
56	I meant to remove this _VSTD but missed it. I went a bit overboard with _VSTD usage as I wasn't used to writing code within the std:: namespace ;) I had missed that C++17 added the ability to add noexcept to the function type. Nice! I'll add _NOEXCEPT in.
78	Should we also be adding overloads that take `align_val_t` for cases where the promise and/or coroutine frame is overaligned? N4736 11.4.4(11) currently only mentions operator new overload with `size_t` argument. @GorNishanov Are there plans to extend the allocator support for coroutines to use `align_val_t` overloads of `operator new` for over-aligned allocations?
80	I will run it through clang-format to fix up the formatting issues.
82	I don't think we can do that as we need to be able to lookup the `_DeallocFunc` for a given coroutine frame regardless of the type of `_Alloc`, which is unknown in `operator delete()`. Placing the `_DeallocFunc` member after the `_Alloc` would prevent that. I could do something with `if constexpr (is_empty_v<_Alloc>)` to specialise `operator new()` for a stateless allocator. Or maybe we can do something with the new-fangled `[[no_unique_address]]` attribute in C++20?
118	Do we need to use `typename _Allocator::template rebind<char>::other` as the allocator type rather than just using `_Alloc` directly or is it preferable to require the allocator to have a `value_type` of `char` (or should that be `unsigned char`)?
234	This approach will result in a different ABI for the task promise when compiled without exceptions. @EricWF, @CaseyCarter: Does libc++ require standard library types to have the same ABI regardless of compiler flags?
378	This was done to avoid calling exception_ptr runtime functions in the common case where there is no exception created. I thought I recalled this being done by someone from MS (yourself or perhaps Kenny Kerr?) in a generator<T> implementation to avoid the overhead of calls to runtime library functions for the exception_ptr default constructor, operator bool() and destructor as these are generally not inlinable. However, I currently can't find the reference code for this and it's possible I may be misremembering here. A std::optional<exception_ptr> would achieve the same ends but I was conscious of incurring the extra dependency on the <optional> header. I can simplify the implementation to what you suggest and re-apply the optimisation later if profiling determines it to be a performance problem.
383	This macro would seem to only enable nodiscard when building for c++2a or later. Does libc++ not use `[[nodiscard]]` on any APIs when built against C++17?
395	The design I went for with `cppcoro::task` was to allow awaiting the value multiple times so that `task<T>` would act a bit like an asynchronous version of a `lazy<T>`. If we only allowed co_await'ing the task once then this should: allow the coroutine frame to be destroyed earlier (eg. we could pass ownership of the coroutine frame to awaiter object which could destroy coroutine frame at end of statement rather than when ~task is run). eliminate the need for the conditional on `await_ready()`. allow placing storage for the result in the awaiter object rather than the promise object - would this potentially provide more opportunity for copy elision?
430	I was considering implementing this by passing the parameter by value. task& operator=(task __other) _NOEXCEPT { _VSTD::swap(__coro_, __other.__coro_); }
435	I'm not sure either. The only place I've used `ready()` thus far has been in tests. I'll remove them for now - we can always add them in later if we find a need.
450	Ok, I'll update this to an ASSERT.
530	This particular piece of code was just to check the expression's type was `int&&`. The value would indeed dangle if this code was executed. I did it this way as a workaround for the fact that `co_await` is not currently allowed inside a `decltype()` expression. Also, `auto&&` has slightly different semantics to that of `decltype(auto)` when the initialising expression is a prvalue.
611	Will do.

I have posted an updated version of the code here: https://gist.github.com/lewissbaker/38ba1d8a13e4fb0906559b7aa1c413d3

I'm still trying to figure out how to get the updated revision into this review.
I'm not familiar with Phabricator and it seems to want to create a new revision (review?) when I click the 'Update diff' button.
Does the updated diff need to be relative to the previous diff?
Any tips as to what I'm doing wrong are appreciated :)

In D46140#1083915, @lewissbaker wrote:

I have posted an updated version of the code here: https://gist.github.com/lewissbaker/38ba1d8a13e4fb0906559b7aa1c413d3

I'm still trying to figure out how to get the updated revision into this review.
I'm not familiar with Phabricator and it seems to want to create a new revision (review?) when I click the 'Update diff' button.
Does the updated diff need to be relative to the previous diff?
Any tips as to what I'm doing wrong are appreciated :)

Phabricator wants you to replace the entire differential when you update. Each diff should be relative to the original code, not to the previous diff.

lewissbaker added a child revision: D46312: Updated <experimental/task> WIP.May 1 2018, 7:03 AM

lewissbaker commandeered this revision.May 1 2018, 7:27 AM

lewissbaker edited reviewers, added: GorNishanov; removed: lewissbaker.

Addressed some feedback from initial version.

factored out allocator padding calculations
use anonymous unions instead of char-buffers and reinterpret_cast
make co_await'ing an invalid task have undefined behaviour
run clang-format over the file to fix some formatting

lewissbaker removed a child revision: D46312: Updated <experimental/task> WIP.May 1 2018, 4:04 PM

GorNishanov added inline comments.May 1 2018, 9:24 PM

include/experimental/task
373	Why I am concerned about await_ready() => __coro.done() is that check is inherently racy. If the rule is, coroutine starts suspended and then you are allowed await on it once and that will start it. (Or you can use library helpers, spawn, when_all, etc. that will do similar thing. They will subscribe and start the coroutine). Once the coroutine is started, you cannot touch it. I struggle to see the use case. Once coroutine is started, it has already has continuation hooked up and once finished it will resume the waiter that will lead to the destruction of the coroutine. So, nobody can check for done-ness. If it has already been started, but has not finished, are we going to subscribe again? How would we distinguish the case that coroutine has not started, vs started but has not completed? I see this `await_ready()` returning anything but `false` to be fraught with danger.
395	I played around some time ago and I think I was getting the best results when the result was in the promise. We may get in the future a return_slot<T> type, that would allow to place result directly in the caller site (like RVO), if we get that, we would need the result in the promise, but, for now, it is fine to keep it there.

lewissbaker marked 26 inline comments as done.May 2 2018, 7:16 AM

lewissbaker added inline comments.

include/experimental/task
132	Still need to qualify `move()` call with `_VSTD::move()` as per @CaseyCarter's earlier comment.
158	Add _NOEXCEPT.
373	I do not see this as being racy. A `task` is not designed to be safe to access concurrently from multiple threads. I have tried to follow the general guideline of non-const methods cannot be called concurrently with any other access to the object, while const methods can generally be safely called concurrently with other const methods. The latest revision of `task<T>` does not have any const methods. When you first call a `task`-returning coroutine it creates the coroutine initially suspended. When the `task` is first `co_await`ed this will call `__coro.done()` inside `await_ready()` (this is safe since the coroutine is suspended). The awaiting coroutine is then suspended and the `task`'s coroutine is resumed. When the `task`'s coroutine completes (ie. suspends at `final_suspend` point) it then becomes `.done()` and the awaiting coroutine is resumed. If we treat the `task` as a normal non-thread-safe object then it is up to the caller to ensure the `task` is not accessed concurrently from another thread or coroutine. In this case, the awaiting coroutine should be the only coroutine that is accessing the `task` object for the duration of the `co_await` operation. As the awaiting coroutine will be suspended while the task's coroutine is executing, it will not have an opportunity to `co_await` the task again until the task completes, at which point the task's coroutine will be suspended at final_suspend-point and thus will be `.done()`. If a coroutine then subsequently `co_await`s the task again after the first `co_await` operation has completed then it will see that the coroutine is alread `.done()` and so will continue without suspending. How would we distinguish the case that coroutine has not started, vs started but has not completed? A `task` object should only ever be able to observe the underlying coroutine being either suspended at initial_suspend (before the task is first co_awaited) or at final_suspend (after the first co_await has completed). The coroutine that has access to the `task` will be suspended while the task's coroutine is executing and so cannot observe the started-but-not-completed state. Once coroutine is started, it has already has continuation hooked up and once finished it will resume the waiter that will lead to the destruction of the coroutine. So, nobody can check for done-ness. The task's coroutine is not destroyed until `~task` is executed, which may not be immediately after `co_await` expression resumes if the `task` object has been assigned to a local variable. In this case, the task's coroutine's `.done()` state would be observable. I struggle to see the use case. For one use-case, consider: task<record> get_record(int id); task<> example() { // Want to only get the record once regardless of how many times // it's used below, but don't get the record if we don't need it. task<record> t = get_record(123); if (someCond) { record& r = co_await t; //... } //... later if (someOtherCond) { record& r = co_await t; //... } }
373	As an aside, if we did want to allow multiple coroutines to concurrently `co_await` an asynchronous result then something like `cppcoro::shared_task<T>` would be more appropriate as it correctly handles synchronisation required to avoid the data races.

CaseyCarter added inline comments.May 2 2018, 7:17 AM

include/experimental/task
82	It occurred to me in the shower after making this comment that it would break because of the type erasure. I suspect the best choice would be to avoid storing an allocator when `allocator_traits<_Alloc>::is_always_empty::value && is_default_constructible_v<_Alloc>` is `true`, and just construct a new one in the deallocation function.
117	Is it intentional that this overload only takes lvalue parameters?
118	You need to go through `allocator_traits` for anything other than `allocate`, `deallocate`, and `value_type`. Your rebound allocator type would be `typename allocator_traits<_Alloc>::template rebind_alloc<char>`, for example. I think I prefer always rebinding to the proper allocator type: it seems silly to make users specify exact allocator types when we can rebind them so easily.
132	`_VSTD::move` to avoid ADL hijacking (`_Alloc` is a program-defined type).
147	The allocator requirements mandate that the constructor does not throw; they don't mandate that it is declared `noexcept`.
151	`::new` to avoid broken garbage `operator new` overloads in `_Alloc`.
220	Ditto `::new`.
378	That was Kenny's WinRT generator which I have since changed to use `exception_ptr` directly, and Microsoft's crappy implementation of `exception_ptr`. I have no idea about libc++'s `exception_ptr`. (I expect to fix ours when we can next break ABI: it's really terrible that everything isn't inlinable in the empty/fast case.)
393	Looking at these on my second readthrough I've realized you don't have a consistent SMF story for the promise hierarchy. Those classes are sometimes copy/movable depending on the parameter type and whether or not exceptions are enabled. Should they have explicitly deleted SMFs?

GorNishanov added inline comments.May 2 2018, 8:41 AM

include/experimental/task
373	With regard to the use case. Let's imagine that we will get the return_slot<T> type in C++20. Now, we don't have to store the result in the promise, return_value will directly constructed the result in the call site of the co_await. Or, even without the return_slot<T> type, we can construct the result directly in the awaiter. (That is not what we are doing and it is fine, but, if it were the case, you would not be able to extract the result the second time. It is similar to the std::future where you can only get the result once.

GorNishanov added inline comments.May 2 2018, 2:18 PM

include/experimental/task
403	Should this be task& operator=(task &&__other) _NOEXCEPT {

GorNishanov added inline comments.May 2 2018, 2:20 PM

include/experimental/task
387	Do we really have a use case for implicit public constructor from coroutine_handle? I think this should be a private constructor.

GorNishanov added inline comments.May 2 2018, 2:34 PM

include/experimental/task
412	since we have both operator co_await () & operator co_await() && is it the same as simply having a operator co_await () You allow mutable l-values, R-values, but not const values. Also, why `operator co_await` at all? as opposed to three await_xxx functions? I am thinking specifying the wording that allow implementators flexibility to pick either approach. Is there a reason to mandate one over the other?

lewissbaker marked 8 inline comments as done.May 2 2018, 7:47 PM

lewissbaker added inline comments.

include/experimental/task
82	I assume you mean `allocator_traits<_Alloc>::is_always_equal`?
117	Yes, the wording from coroutines TS states that operator new is passed lvalue-references to each of the coroutine parameters. 11.4.4(7) ... If the lookup ﬁnds an allocation function in the scope of P, overload resolution is performed on a function call created by assembling an argument list. The ﬁrst argument is the amount of space requested, and has type std::size_t. The lvalues p1 ... pn are the succeeding arguments. ...
373	It is similar to the std::future where you can only get the result once. Yes, I guess that single-await or multi-await is really the major choice here. If we limit it so that you can only co_await the task once then it does give more flexibility on implementation that could potentially lead to a more efficient implementation in some cases. This does prevent the use-case I mentioned above, however. I'm open to putting this restriction in for now - it matches the behaviour of std::future::get(). If we do this then we might want to consider having a single `operator co_await` that returns a prvalue - although this would potentially incur an extra copy/move in some cases compared with returning a reference to the value stored in the promise. Let's imagine that we will get the return_slot<T> type in C++20. Can P0878R0 (sub-object copy elision) help here if we put the result object in the awaiter and try to return the sub-object from await_resume()? If this can be made to work we may not need return_slot<T> to get RVO for tasks.
387	Yes, it probably should be a private constructor. It was added to allow the coroutine to implicitly construct the task from the coroutine_handle returned from get_return_object() - this simplifies the implementation but isn't required. I'll change get_return_object() to return a task<T> instead and add some appropriate friend-declarations.
393	The promise types are not really part of the public API so I wasn't so concerned with locking down the interface. I can add the explicit declaration of deleted SMFs if you like.
403	Yes, the parameter could be made `task&&`. I'd just need to move-construct to a temporary inside the body instead.
412	The reason `operator co_await()` are added is to allow overloading so that when a `task<T>&` is awaited it returns a T& `and when a` task<T>&& `is awaited it returns a` T&&`. We can't overload `await_resume() &` and `await_resume() &&` since the `co_await` mechanics states that it always calls `await_resume()` on the awaiter lvalue. If we change `task<T>` to only support awaiting once then we could get rid of `operator co_await()` and just add `await_xxx()` methods directly on `task<T>`. Having said that, it does feel a little dirty having the `await_xxx()` methods popup in intellisense completion when auto-completing methods on a `task<T>` instance - these methods are not intended to be called directly by users of the type.

Explicitly delete __task_promise_base special member functions.
Declare _task_promise_base::operator delete() as _NOEXCEPT.
Explicitly scope _VSTD::move() calls for user-defined types.
Explicitly call global-scope placement operator new for user-defined types.
Tweak task::operator=() to remove ambiguity with deleted operator=(const task&).

CaseyCarter added inline comments.May 3 2018, 7:01 AM

include/experimental/task
82	Yes, I did. I was thinking "use `is_always_equal` instead of `is_empty`" and apparently my brain decided to type them both.

GorNishanov added inline comments.May 3 2018, 10:32 AM

include/experimental/task
403	Ah Ok. No need to change it then. I am starting to work on wording and trying to figure out which things are the artifacts of the implementation and which should be carried over into the wording.
412	I will try to specify the wording so that both `await_xxx` and a single `operator co_await` are acceptable implementations. I do strongly feel that task<T> should have await once semantics. It has a potential to unlock RVO. It is (potentially) less prone to misuse (my subjective feeling). await_resume() for task<T> should return T. It allows RVO and eliminates dangling references. No need to change anything at the moment in the implementation. Let's see how specification works out and then alter implementation (if needed) to match. Sometimes specifying something precisely gives clarity to a problem :-).

Jeez, this is weird. Having other STL maintainers review additions to libc++ :-)

In D46140#1086987, @EricWF wrote:

Jeez, this is weird. Having other STL maintainers review additions to libc++ :-)

Speaking of which, I don't know *your* policy but we std`-qualify pretty calls even when they can't be ADL hijacked like _VSTD::terminate(); or _VSTD::move(reference_to_internal_non_template_type). There are a few such in here that I didn't comment.

My apologies to the reviewers with knowledge of libc++'s existing style, but I would like to modernize the style with new headers like this, which already use C++14 features, like auto return types. That is, I would rather use constexpr over _LIBCPP_CONSTEXPR, and noexcept over _NOEXCEPT, ect. At this point I think it's safe to no longer a need for C++11, and in this header C++14, features to be guarded using macros.

Does everyone agree it's reasonable to limit this header to C++14 and beyond? If so, let's #ifdef out the header contents in older dialects.
If we bump this up to C++17 we can use variant to handle the non-trivially destructible union.

For whitespace, feel free to just clang-format the header.

Also, tests! Let me know if you need any help building and setting up a testing environment. Take a look at the Building Libc++ and Testing Libc++ docs to get started.

include/experimental/task
58	Let's reuse `__aligned_allocation_size` from `memory_resource`. I would move it into `__memory`, drop the assert, and make it constexpr
326	We'll want a constant ABI here. The `exception_ptr` type should be complete even when exceptions are disabled, so lets just always enable this.
include/module.modulemap
507 ↗	(On Diff #144974)	By convention this should be it's own module.

EricWF added inline comments.May 4 2018, 4:30 AM

include/experimental/task
193	`= default`?
280	`= default` ?

Many improvements:

Remove move-assignment operator
task<T>::operator co_await() now returns by value when awaiting task<T> rvalue.
Explicitly scope calls to std-library functions that live in _VSTD.
Make __task_promise ABI stable under exception/no-exception compilation modes.
Moved __aligned_allocation_size to <experimental/__memory>
Make task<T> have a default template parameter of void, allowing task<> to mean task<void>
Allow co_return { ... } by adding overload of return_value() taking T&& and another taking initializer_list<U>.
Implement optimisation that avoids storing the allocator in the coroutine frame if allocator_traits<A>::is_always_equal is true and the allocator is default constructible.
Added unit-tests (currently failing some custom allocator tests under debug builds)

Herald added subscribers: cfe-commits, ldionne, mgorny. · View Herald TranscriptJul 20 2018, 12:26 PM

lewissbaker marked 24 inline comments as done.Jul 20 2018, 12:39 PM

lewissbaker updated this revision to Diff 156641.Jul 20 2018, 5:46 PM

lewissbaker edited the summary of this revision. (Show Details)

Make task<T> constructor from coroutine_handle private.
Run clang-format over <experimental/task>.

LGTM

With a few suggestions.

Remove swap. It is not part of the proposal at the moment.
If you feel like, add a lazy<T> alias to task<T>, that way we can taste both names and see how it feels.
Also, there is still an outstanding work to add handling of implicit 'this'. Probably would be good to add before committing,

My apologies for taking so long to review.

This revision is now accepted and ready to land.Aug 29 2018, 1:02 PM

Herald added a subscriber: jfb. · View Herald TranscriptAug 29 2018, 1:02 PM

lewissbaker added inline comments.Sep 20 2018, 10:32 AM

test/std/experimental/task/sync_wait.hpp
36–37 ↗	(On Diff #156641)	The call to `notify_all()` needs to be inside the lock. Otherwise, it's possible the waiting thread may see the write to `__isSet_` inside `wait()` below and return, destroying the `condition_variable` before `__cv_.notify_all()` call completes.

Don't use __allocator as variable name (it is #defined to NASTY_MACRO).
Remove workaround in test for https://bugs.llvm.org/show_bug.cgi?id=37265
Add support for custom allocators on member functions.

Herald added subscribers: libcxx-commits, jdoerfert. · View Herald TranscriptMar 25 2019, 2:27 PM

Fix race condition in __oneshot_event::set() method used by sync_wait().

lewissbaker retitled this revision from [coroutines] std::task type (WIP) to [coroutines] Add std::experimental::task<T> type.Mar 26 2019, 10:16 AM

lewissbaker edited the summary of this revision. (Show Details)

Committed in rL357010. Apologies, I forgot to include the differential revision in the commit message so this diff wasn't closed automatically as a result. I'll comment on rL357010 with the missing information.

Reopening as the commit for this diff was reverted due to it breaking the buildbot module builds.

This revision is now accepted and ready to land.Mar 26 2019, 4:02 PM

Added missing 'require coroutines' for experimental/task entry in module.modulemap
Added missing #include in experimental/task header that was failing module builds

@EricWF This implementation will currently not work with MSVC as MSVC does not yet support the symmetric-transfer capability added to the coroutines specification in P0913R0.

Is MSVC a target that libc++ needs to support coroutines with?
Or can we just say the task<T> class is unsupported for compilers that don't support P0913?

I have added the 'requires coroutines' line to the task modulemap entry which should hopefully fix the module build failures this diff was seeing.
Are there any other potential gotchas that I should make sure I test?

test/std/experimental/task/task.basic/task_of_value.pass.cpp
25 ↗	(On Diff #192384)	@EricWF This line may fail to compile on older versions of clang without the coroutines bugfix for https://bugs.llvm.org/show_bug.cgi?id=37265. Should I just change this back to `std::move(p)` here, since the intention is to test the library, not the compiler? Alternatively I can conditionally compile this line as either `co_return p;` or `co_return std::move(p);` depending on the compiler version.

CaseyCarter requested changes to this revision.Mar 27 2019, 11:01 AM

CaseyCarter added a subscriber: mclow.lists.

CaseyCarter added inline comments.

include/experimental/__memory
80 ↗	(On Diff #192208)	This is missing preconditions that `__a` is a power of 2, and that `__s <= -__a`.
include/experimental/task
29	"requires a compiler with support for coroutines" would be more informative.
142	The return value of `allocate` isn't necessarily convertible to `void`, it could be a fancy pointer. We should either `static_assert(is_same_v<typename allocator_traits<_CharAlloc>::void_pointer, void>, "Piss off with your fancy pointers");` or use `pointer_traits` here and in `__deallocFunc` to unfancy and re-fancy the pointer.
211	Pure style comment: I recommend using the non-elaborated `friend __task_promise_final_awaitable;` whenever possible. `friend struct foo` declares or redeclares `struct foo` in the enclosing namespace, whereas `friend foo` uses name lookup to find `foo` and makes it a friend. The latter form makes it far easier to analyze compiler errors when you screw something up in maintenance. (And on 480)
221	This mem-initializer for `__state_` is redundant with the default member initializer on 306.
228	I suggest moving the `case` label and `break` outside the `#ifndef` here so the compiler won't warn about this case being unhandled when `_LIBCPP_NO_EXCEPTIONS`.
245	Are you certain that `unhandled_exception` can't possibly be called after storing a value? If so, this would leak the value.
255	Style: you've been using the `_v` variable templates for traits elsewhere.
262	Unnecessary `std::` qualifier. (Occurs repeatedly.)
264	Style: the use of trailing-return-type SFINAE here is inconsistent with the use of template parameter SFINAE on 254.
309	These `__empty_` members seem extraneous.
310	Should we `static_assert` that `_Tp` is a destructible object type?
374	Do we care about `task<cv-void>`?
390	Should these `__foovalue_result` members be `foo`-qualified to make them harder to misuse?
408	Similar to above, should we `static_assert` that `_Tp` is `void`, an lvalue reference type, or a destructible non-array object type?
454	`this->` is extraneous in these classes that derive from the concrete `_AwaiterBase`. (And on 470)
459	Is missing a subject. How about "co_await on an invalid task<T> has undefined behavior"? (And on 475)
test/std/experimental/task/awaitable_traits.hpp
17 ↗	(On Diff #192384)	It's super sketchy for test code to inject things into the production namespace and to use reserved names. It's also inappropriate to use libc++ internals in a test in the `std` tree. (Occurs repeatedly.)
test/std/experimental/task/counted.hpp
81 ↗	(On Diff #192384)	IIUC that the tests all require C++17, you could declare these `inline` and avoid the external definitions / `DEFINE_COUNTED_VARIABLES` macro.
90 ↗	(On Diff #192384)	Should `nextId_` be initialized to `1`, as it is in `reset` on 40?
test/std/experimental/task/manual_reset_event.hpp
18 ↗	(On Diff #192384)	"currently". Also, "was not currently" is weird.
23 ↗	(On Diff #192384)	Here's a great example of why elaborated `friend` can be weird. This doesn't make the `_Awaiter` class defined on 25 a friend - which needn't be a friend, it's nested so it has full access to `manual_reset_event` - it declares a class named `_Awaiter` in the enclosing namespace that is a friend.
25 ↗	(On Diff #192384)	Ditto "using reserved names in test code". (Occurs repeatedly.)
40 ↗	(On Diff #192384)	`_LIBCPP_ASSERT` is inappropriate in `std` test code; it's undefined when the library under test is not libc++. Use `LIBCPP_ASSERT` (with no leading underscore) from `test/support/test_macros.h` to mean "this assertion must hold only if the library under test is libc++" and `assert` if the assertion must hold regardless of the tested library. (Occurs repeatedly.) (Ignore the misuses of `_LIBCPP_ASSERT` that @mclow.lists put in the `span` tests; I'll fix them when we implement `span` and enable the tests.) EDIT: Actually, I'll fix them now to save myself the trouble of figuring out why the tests are broken later.
47 ↗	(On Diff #192384)	Why are we concerned about data races on `__event_->__state_`, but not `__event_->__awaitingCoroutine_`?
49 ↗	(On Diff #192384)	s/this means that//; s/and so we/so we/; s/suspend - this/suspend. This/ (Ok, I'm nitpicking, but I actually did have issues parsing this comment.)
86 ↗	(On Diff #192384)	thread
88 ↗	(On Diff #192384)	"awaits the" what? The suspense is killing me!
92 ↗	(On Diff #192384)	`std::exchange`
102 ↗	(On Diff #192384)	What's the antecedent of "it"?
test/std/experimental/task/sync_wait.hpp
14 ↗	(On Diff #192384)	There's no such header in the working draft or the Coroutines TS.
70 ↗	(On Diff #192384)	Ditto "doesn't do what you think it does and is unnecessary."
115 ↗	(On Diff #192384)	Ditto "not in test code". You should include `"test_macros.h"` and use `TEST_NO_EXCEPTIONS`. (Occurs repeatedly.)
135 ↗	(On Diff #192384)	`#include<new>` for the definition of True Placement New. (This may be missing above as well, I don't recall or have time to scroll back.)
139 ↗	(On Diff #192384)	Here's another `_VSTD` that should be `std`. Please audit all test code.
36–37 ↗	(On Diff #156641)	The write to `__isSet_` must be under the lock, the call to `notify_all` need not be. (Not that I care either way, just clarifying.)
test/std/experimental/task/task.basic/task_custom_allocator.pass.cpp
18 ↗	(On Diff #192384)	This is not a Standard or Coroutines TS header.
26 ↗	(On Diff #192384)	`static` is extraneous in an anonymous namespace.
37 ↗	(On Diff #192384)	These are the defaults for `size_type`, `difference_type`, and `is_always_equal`; you should omit them to verify that the tested code properly uses `allocator_traits`.
54 ↗	(On Diff #192384)	This is almost certainly going to cause problems, since it leaves the moved-from allocator invalid. (And on 68.)
75 ↗	(On Diff #192384)	I suggest `assert(allocator_instance_count > 0)` before decrementing. We're testing code that explicitly destroys things, we should make sure it doesn't destroy more things than it constructs.
78 ↗	(On Diff #192384)	`allocate` needs to return `T*`. (Also, `std::size_t`.) (And on 88)
90 ↗	(On Diff #192384)	Here again, I suggest an assertion that we're not freeing more bytes than have been allocated.
221 ↗	(On Diff #192384)	In libc++ tests `main` is always declared `int main(int, char**)` (and explicitly returns `0`) to support the freestanding work. (Occurs repeatedly.)
test/std/experimental/task/task.lifetime/task_parameter_lifetime.pass.cpp
46 ↗	(On Diff #192384)	Again, leaving me in suspense.

This revision now requires changes to proceed.Mar 27 2019, 11:01 AM

Thanks very much for the detailed review @CaseyCarter! Very much appreciated :)

include/experimental/__memory
80 ↗	(On Diff #192208)	Is there a recommended way of documenting/implementing these preconditions on a constexpr function in libc++? The previous version that lived in `<experimental/memory_resource>` was not marked constexpr and so was able to use `_LIBCPP_ASSERT`.
include/experimental/task
29	This messaging was copied from <experimental/coroutine>. I'll update the message there as well.
142	I'm not quite sure how to go about using `pointer_traits` here when interacting with coroutines and allocators. Under C++20 I guess I would do something like: typename allocator_traits<_CharAlloc>::void_pointer __pointer = __charAllocator.allocate(...); ... return _VSTD::to_address(__pointer); I assume that `operator new()` still needs to return `void*` rather than the fancy pointer so we need the `to_address` call in there. Is there some fallback for `to_address()` for unfancying the pointer in earlier standard versions? Maybe we should just go with the `static_assert()` for now?
221	I'll remove the default member initializer on 306 then. We can't `= default` the constructor here due to the union member containing types with non-trivial constructors/destructors.
245	Yes, I think there is a case where this could happen. If we execute a `co_return someValue` which calls `promise.return_value()` and constructs `__value_`. Then while executing the `goto final_suspend;` if any of the destructors of in-scope variables throw an exception which then escapes the coroutine body we could end up calling `unhandled_exception()` with `__value_` already having been constructed. Similarly, the exception thrown from a destructor could be caught and then a new `co_return someOtherValue;` executed. So we should probably be guarding against `__value_` containing a value inside `return_value()` as well.
309	The `__empty_` members were added at your request ;) See https://reviews.llvm.org/D46140?id=144168#inline-403822 I can remove them if you think they're not necessary.
310	Sure, will do. I wonder if we should make this a requirement in the wording of P1056? What do you think @GorNishanov?
374	I don't feel strongly either way. It's not something I've ever had a use for, but maybe it should be done for completeness?
390	I'm not sure if it makes it much safer. We already have 'rvalue' in the method name which gives a hint to its move-ness. It's currently only called in one place: return this->__coro_.promise().__rvalue_result(); would become: return std::move(this->__coro_.promise()).__rvalue_result();
408	Sure, can do. Further, to support `task<T>::operator co_await() &&` T needs to both be destructible and move-constructible. For `task<T>::operator co_await() &` T only needs to be destructible.
454	I get compile-errors if I remove the `this->`. libcxx/include/experimental/task:502:16: error: 'std::experimental::coroutines_v1::task<counted>::__coro_' is not a member of class '_Awaiter' return __coro_.promise().__rvalue_result(); Compiler bug?
test/std/experimental/task/awaitable_traits.hpp
17 ↗	(On Diff #192384)	These were intended to be eventually added to the <experimental/coroutine> header and under the std::experimental namespace which is why they were using these macros. See P1288R0. Similarly for `sync_wait()` which is proposed in P1171R0. Is there some common namespace that test utilities should be declared within? If not I'll just move them to global scope and de-uglify them for now.
test/std/experimental/task/counted.hpp
90 ↗	(On Diff #192384)	The intent was that any test would always initialise these values by calling reset() before running. I hadn't intended the static-initialization to "do the right thing" here but I can if you think that's preferable.
test/std/experimental/task/manual_reset_event.hpp
47 ↗	(On Diff #192384)	Only the awaiting thread writes to `__awaitingCoroutine_` and publishes this value when it writes to `__state_` by setting the state to `__not_set_waiting_coroutine`. A thread that calls `.set()` will only read from `__awaitingCoroutine_` if, when it updates the `__state_` variable, sees that the previous state was `__not_set_waiting_coroutine`. So the write the `__awaitingCoroutine_` should strictly happens-before any read from `__awaitingCoroutine_`.
test/std/experimental/task/sync_wait.hpp
36–37 ↗	(On Diff #156641)	The problem here is that once we write `true` to `__isSet_` and release the lock that the thread calling `wait()` can potentially return and the on to destroy the `__oneshot_event` object, leaving the current thread with a dangling reference to a destroyed `std::condition_variable`. I ran into a similar problem in cppcoro. See https://github.com/lewissbaker/cppcoro/issues/98#issuecomment-450695759
test/std/experimental/task/task.basic/task_custom_allocator.pass.cpp
18 ↗	(On Diff #192384)	I can't `#include <memory_resource>` as libc++ doesn't currently provide that header. How should I go about writing a test that makes sure that std::experimental::task works with `std::experimental::pmr::polymorphic_allocator` (or `std::pmr::polymorphic_allocator` if available)?
54 ↗	(On Diff #192384)	Isn't a moved-from allocator left in a valid but unspecified state? ie. you can't assume it's ok to use. but should be ok to destruct or assign to.

This updated diff should address most of @CaseyCarter's review comments.

For detailed changelog you can find individual changes here https://github.com/lewissbaker/libcxx/commits/coro_task

There are a few outstanding issues that still need some clarification on how to proceed:

Should the task tests be taking a dependency on <experimental/memory_resource>?
How should I implement a precondition check for __aligned_allocation_size when it is declared constexpr?
I've not yet implemented support for task<cv-void>. Is this something we should support?
I've not yet handling allocators with fancy pointer types. If we are to support them is there a standard way to convert a fancy pointer to a void* to return from promise_type::operator new()? Does it even make sense to support them if the compiler is potentially going to be storing raw pointers to things inside the coroutine frame?

lewissbaker marked 3 inline comments as done.Apr 3 2019, 12:45 PM

lewissbaker added inline comments.

include/experimental/task
161	This should use explicit move construction rather than implicit move construction.

Gentle ping.

Is there anything else people would like to see changed?

Revision Contents

Path

Size

include/

experimental/

task

622 lines

Diff 144168

include/experimental/task

This file was added.

				// -- C++ --
				//===----------------------------- coroutine -----------------------------===//
				// NOTE: at the moment it is in its own header, we intend to eventually
				// move it into experimental/coroutine header.
				//
				//
				// The LLVM Compiler Infrastructure
				//
				// This file is distributed under the University of Illinois Open Source
				// License. See LICENSE.TXT for details.
				//
				//===----------------------------------------------------------------------===//

				#ifndef _LIBCPP_EXPERIMENTAL_TASK
				#define _LIBCPP_EXPERIMENTAL_TASK

				#include <experimental/__config>
				#include <experimental/coroutine>

				#include <exception>

				#if !defined(_LIBCPP_HAS_NO_PRAGMA_SYSTEM_HEADER)
				#pragma GCC system_header
				#endif

				#ifdef _LIBCPP_HAS_NO_COROUTINES
				# if defined(_LIBCPP_WARNING)
				_LIBCPP_WARNING("<experimental/task> cannot be used with this compiler")
				# else
				CaseyCarterUnsubmitted Done Reply Inline Actions "requires a compiler with support for coroutines" would be more informative. CaseyCarter: "requires a compiler with support for coroutines" would be more informative.
				lewissbakerAuthorUnsubmitted Done Reply Inline Actions This messaging was copied from <experimental/coroutine>. I'll update the message there as well. lewissbaker: This messaging was copied from <experimental/coroutine>. I'll update the message there as well.
				# warning <experimental/task> cannot be used with this compiler
				# endif
				#endif

				_LIBCPP_BEGIN_NAMESPACE_EXPERIMENTAL_COROUTINES

				////// task<T>

				struct __task_promise_final_awaitable {
				_LIBCPP_INLINE_VISIBILITY
				_LIBCPP_CONSTEXPR bool await_ready() const _NOEXCEPT { return false; }

				template <typename _TaskPromise>
				_LIBCPP_INLINE_VISIBILITY coroutine_handle<>
				await_suspend(coroutine_handle<_TaskPromise> __coro) const _NOEXCEPT {
				_LIBCPP_ASSERT(
				__coro.promise().__continuation_,
				"Coroutine completed without a valid continuation attached.");
				return __coro.promise().__continuation_;
				}

				_LIBCPP_INLINE_VISIBILITY
				void await_resume() const _NOEXCEPT {}
				};

				class _LIBCPP_TYPE_VIS __task_promise_base {
				using _DeallocFunc = void(void* __ptr, _VSTD::size_t __size);
				CaseyCarterUnsubmitted Done Reply Inline Actions Why qualify `size_t` here when you're in a subnamespace of `std`? (It's only necessary to qualify functions to avoid unintended ADL, not types.) Why not `_NOEXCEPT`? CaseyCarter: Why qualify `size_t` here when you're in a subnamespace of `std`? (It's only necessary to…
				lewissbakerAuthorUnsubmitted Done Reply Inline Actions I meant to remove this _VSTD but missed it. I went a bit overboard with _VSTD usage as I wasn't used to writing code within the std:: namespace ;) I had missed that C++17 added the ability to add noexcept to the function type. Nice! I'll add _NOEXCEPT in. lewissbaker: I meant to remove this _VSTD but missed it. I went a bit overboard with _VSTD usage as I wasn't…

				public:
				EricWFUnsubmitted Done Reply Inline Actions Let's reuse `__aligned_allocation_size` from `memory_resource`. I would move it into `__memory`, drop the assert, and make it constexpr EricWF: Let's reuse `__aligned_allocation_size` from `memory_resource`. I would move it into `__memory`…

				void* operator new (size_t __size)
				CaseyCarterUnsubmitted Done Reply Inline Actions It would be nice to declare these `new` overloads explicitly `static`. CaseyCarter: It would be nice to declare these `new` overloads explicitly `static`.
				{
				// Allocate space for an extra pointer immediately after __size that holds
				// the type-erased deallocation function.
				auto __paddedSize1 = (__size + alignof(_DeallocFunc) - 1) & ~(alignof(_DeallocFunc) - 1);
				auto __paddedSize2 = __paddedSize1 + sizeof(_DeallocFunc*);
				void* __mem = ::operator new (__paddedSize2);
				_DeallocFunc& __deallocFunc = reinterpret_cast<_DeallocFunc*>(static_cast<char>(__mem) + __paddedSize1);
				__deallocFunc = [](void* __pointer, _VSTD::size_t __size) _NOEXCEPT
				CaseyCarterUnsubmitted Done Reply Inline Actions Ditto extraneous `_VSTD`. (I'll stop commenting these now.) CaseyCarter: Ditto extraneous `_VSTD`. (I'll stop commenting these now.)
				{
				auto __paddedSize1 = (__size + alignof(_DeallocFunc)-1) & ~(alignof(_DeallocFunc)-1);
				CaseyCarterUnsubmitted Done Reply Inline Actions The rest of this function has spaces around binary operators. CaseyCarter: The rest of this function has spaces around binary operators.
				auto __paddedSize2 = __paddedSize1 + sizeof(_DeallocFunc*);
				CaseyCarterUnsubmitted Done Reply Inline Actions Maybe DRY extract padding computation? CaseyCarter: Maybe DRY extract padding computation?
				::operator delete(__pointer, __paddedSize2);
				};
				return __mem;
				}

				template<typename _Alloc, typename... _Args>
				void* operator new(size_t __size, allocator_arg_t, _Alloc& __allocator, _Args&...)
				CaseyCarterUnsubmitted Not Done Reply Inline Actions These is inconsistent with 60 and 66, which have a space between `new` and `(`. CaseyCarter: These is inconsistent with 60 and 66, which have a space between `new` and `(`.
				lewissbakerAuthorUnsubmitted Not Done Reply Inline Actions Should we also be adding overloads that take `align_val_t` for cases where the promise and/or coroutine frame is overaligned? N4736 11.4.4(11) currently only mentions operator new overload with `size_t` argument. @GorNishanov Are there plans to extend the allocator support for coroutines to use `align_val_t` overloads of `operator new` for over-aligned allocations? lewissbaker: Should we also be adding overloads that take `align_val_t` for cases where the promise and/or…
				{
				auto __paddedSize1 = (__size + alignof(_DeallocFunc)-1) & ~(alignof(_DeallocFunc)-1);
				GorNishanovUnsubmitted Done Reply Inline Actions Looks like this block of code computing padding is duplicated later. Consider extracting it into its own function GorNishanov: Looks like this block of code computing padding is duplicated later. Consider extracting it…
				CaseyCarterUnsubmitted Not Done Reply Inline Actions I'm now completely confused about spaces (or the lack thereof) around binary operators. CaseyCarter: I'm now completely confused about spaces (or the lack thereof) around binary operators.
				lewissbakerAuthorUnsubmitted Done Reply Inline Actions I will run it through clang-format to fix up the formatting issues. lewissbaker: I will run it through clang-format to fix up the formatting issues.
				auto __paddedSize2 = __paddedSize1 + sizeof(_DeallocFunc*);
				auto __paddedSize3 = (__paddedSize2 + alignof(_Alloc)-1) & ~(alignof(_Alloc)-1);
				CaseyCarterUnsubmitted Done Reply Inline Actions Is it an option to store `struct foo : _Alloc { _DeallocFunc* bar; };` here so space isn't wasted for empty allocators? This would let you coalesce the allocator and non-allocator cases into a single case with a defaulted (empty) allocator. CaseyCarter: Is it an option to store `struct foo : _Alloc { _DeallocFunc* bar; };` here so space isn't…
				lewissbakerAuthorUnsubmitted Done Reply Inline Actions I don't think we can do that as we need to be able to lookup the `_DeallocFunc` for a given coroutine frame regardless of the type of `_Alloc`, which is unknown in `operator delete()`. Placing the `_DeallocFunc` member after the `_Alloc` would prevent that. I could do something with `if constexpr (is_empty_v<_Alloc>)` to specialise `operator new()` for a stateless allocator. Or maybe we can do something with the new-fangled `[[no_unique_address]]` attribute in C++20? lewissbaker: I don't think we can do that as we need to be able to lookup the `_DeallocFunc*` for a given…
				CaseyCarterUnsubmitted Done Reply Inline Actions It occurred to me in the shower after making this comment that it would break because of the type erasure. I suspect the best choice would be to avoid storing an allocator when `allocator_traits<_Alloc>::is_always_empty::value && is_default_constructible_v<_Alloc>` is `true`, and just construct a new one in the deallocation function. CaseyCarter: It occurred to me in the shower after making this comment that it would break because of the…
				lewissbakerAuthorUnsubmitted Done Reply Inline Actions I assume you mean `allocator_traits<_Alloc>::is_always_equal`? lewissbaker: I assume you mean `allocator_traits<_Alloc>::is_always_equal`?
				CaseyCarterUnsubmitted Done Reply Inline Actions Yes, I did. I was thinking "use `is_always_equal` instead of `is_empty`" and apparently my brain decided to type them both. CaseyCarter: Yes, I did. I was thinking "use `is_always_equal` instead of `is_empty`" and apparently my…
				auto __paddedSize4 = __paddedSize3 + sizeof(_Alloc);
				void* __mem = __allocator.allocate(__paddedSize4);
				_DeallocFunc& __deallocFunc = reinterpret_cast<_DeallocFunc*>(static_cast<char>(__mem) + __paddedSize1);
				__deallocFunc = [](void* __pointer, _VSTD::size_t __size) _NOEXCEPT
				{
				auto __paddedSize1 = (__size + alignof(_DeallocFunc)-1) & ~(alignof(_DeallocFunc)-1);
				auto __paddedSize2 = __paddedSize1 + sizeof(_DeallocFunc*);
				auto __paddedSize3 = (__paddedSize2 + alignof(_Alloc)-1) & ~(alignof(_Alloc)-1);
				auto __paddedSize4 = __paddedSize3 + sizeof(_Alloc);
				_Alloc& __heapAllocator = reinterpret_cast<_Alloc>(static_cast<char*>(__pointer) + __paddedSize3);
				static_assert(
				is_nothrow_move_constructible_v<_Alloc>,
				CaseyCarterUnsubmitted Done Reply Inline Actions `#include<type_traits>` CaseyCarter: `#include<type_traits>`
				"task<T> coroutine custom alloctor requires a _NOEXCEPT move constructor");
				CaseyCarterUnsubmitted Done Reply Inline Actions You probably want `noexcept` in the message instead of `_NOEXCEPT`. CaseyCarter: You probably want `noexcept` in the message instead of `_NOEXCEPT`.
				// We require the _Alloc move constructor to be noexcept here.
				// If it were to throw then we'd have problems
				CaseyCarterUnsubmitted Done Reply Inline Actions This comment seems unnecessary in addition to the `static_assert`. CaseyCarter: This comment seems unnecessary in addition to the `static_assert`.
				_Alloc __allocatorOnStack = move(__heapAllocator);
				CaseyCarterUnsubmitted Done Reply Inline Actions `#include<utility>`? `_Alloc __allocatorOnStack(_VSTD::move(__heapAllocator));` to avoid ADL on `move`, and because the Allocator requirements don't require implicit move construction. (I know, mega-nitpick.) CaseyCarter: `#include<utility>`? `_Alloc __allocatorOnStack(_VSTD::move(__heapAllocator));` to avoid ADL…
				__allocatorOnStack.deallocate(
				static_cast<char*>(__pointer), __paddedSize4);
				};

				// Copy the allocator into the heap frame.
				#ifndef _LIBCPP_NO_EXCEPTIONS
				try
				{
				#endif
				new (static_cast<char*>(__mem) + __paddedSize3) _Alloc(__allocator);
				CaseyCarterUnsubmitted Done Reply Inline Actions `#include<new>` `::new (static_cast<void>(static_cast<char>(__mem) + __paddedSize3)) _Alloc(__allocator);` (to ensure True Placement New) CaseyCarter: `#include<new>` `::new (static_cast<void>(static_cast<char>(__mem) + __paddedSize3)) _Alloc…
				#ifndef _LIBCPP_NO_EXCEPTIONS
				}
				catch (...)
				{
				// Allocator copy constructor has thrown
				CaseyCarterUnsubmitted Done Reply Inline Actions Allocator special member functions are forbidden to throw (http://eel.is/c++draft/allocator.requirements#tab:utilities.allocator.requirements) CaseyCarter: Allocator special member functions are forbidden to throw (http://eel.is/c++draft/allocator.
				// Free the memory and let exceptions propagate out.
				try
				{
				__allocator.deallocate(
				CaseyCarterUnsubmitted Done Reply Inline Actions `deallocate` is also forbidden to throw (same citation). CaseyCarter: `deallocate` is also forbidden to throw (same citation).
				CaseyCarterUnsubmitted Done Reply Inline Actions Is it intentional that this overload only takes lvalue parameters? CaseyCarter: Is it intentional that this overload only takes lvalue parameters?
				lewissbakerAuthorUnsubmitted Done Reply Inline Actions Yes, the wording from coroutines TS states that operator new is passed lvalue-references to each of the coroutine parameters. 11.4.4(7) ... If the lookup ﬁnds an allocation function in the scope of P, overload resolution is performed on a function call created by assembling an argument list. The ﬁrst argument is the amount of space requested, and has type std::size_t. The lvalues p1 ... pn are the succeeding arguments. ... lewissbaker: Yes, the wording from coroutines TS states that operator new is passed lvalue-references to…
				static_cast<char*>(__mem),
				lewissbakerAuthorUnsubmitted Done Reply Inline Actions Do we need to use `typename _Allocator::template rebind<char>::other` as the allocator type rather than just using `_Alloc` directly or is it preferable to require the allocator to have a `value_type` of `char` (or should that be `unsigned char`)? lewissbaker: Do we need to use `typename _Allocator::template rebind<char>::other` as the allocator type…
				CaseyCarterUnsubmitted Done Reply Inline Actions You need to go through `allocator_traits` for anything other than `allocate`, `deallocate`, and `value_type`. Your rebound allocator type would be `typename allocator_traits<_Alloc>::template rebind_alloc<char>`, for example. I think I prefer always rebinding to the proper allocator type: it seems silly to make users specify exact allocator types when we can rebind them so easily. CaseyCarter: You need to go through `allocator_traits` for anything other than `allocate`, `deallocate`, and…
				__paddedSize4);
				}
				catch (...)
				{
				terminate();
				}

				throw;
				}
				#endif

				return __mem;
				}

				CaseyCarterUnsubmitted Done Reply Inline Actions `_VSTD::move` to avoid ADL hijacking (`_Alloc` is a program-defined type). CaseyCarter: `_VSTD::move` to avoid ADL hijacking (`_Alloc` is a program-defined type).
				lewissbakerAuthorUnsubmitted Done Reply Inline Actions Still need to qualify `move()` call with `_VSTD::move()` as per @CaseyCarter's earlier comment. lewissbaker: Still need to qualify `move()` call with `_VSTD::move()` as per @CaseyCarter's earlier comment.
				void operator delete(void* __pointer, size_t __size)
				CaseyCarterUnsubmitted Done Reply Inline Actions Ditto "`static` is implied but it's nice to be explicit" `_NOEXCEPT` wouldn't hurt. CaseyCarter: Ditto "`static` is implied but it's nice to be explicit" `_NOEXCEPT` wouldn't hurt.
				{
				auto __paddedSize1 = (__size + alignof(_DeallocFunc) - 1) & ~(alignof(_DeallocFunc) - 1);
				_DeallocFunc& __deallocFunc = reinterpret_cast<_DeallocFunc*>(static_cast<char>(__pointer) + __paddedSize1);
				__deallocFunc(__pointer, __size);
				CaseyCarterUnsubmitted Done Reply Inline Actions There's some "bonus" indentation on this line. CaseyCarter: There's some "bonus" indentation on this line.
				}

				_LIBCPP_INLINE_VISIBILITY
				suspend_always initial_suspend() const _NOEXCEPT { return {}; }

				CaseyCarterUnsubmitted Not Done Reply Inline Actions The return value of `allocate` isn't necessarily convertible to `void`, it could be a fancy pointer. We should either `static_assert(is_same_v<typename allocator_traits<_CharAlloc>::void_pointer, void>, "Piss off with your fancy pointers");` or use `pointer_traits` here and in `__deallocFunc` to unfancy and re-fancy the pointer. CaseyCarter: The return value of `allocate` isn't necessarily convertible to `void*`, it could be a fancy…
				lewissbakerAuthorUnsubmitted Not Done Reply Inline Actions I'm not quite sure how to go about using `pointer_traits` here when interacting with coroutines and allocators. Under C++20 I guess I would do something like: typename allocator_traits<_CharAlloc>::void_pointer __pointer = __charAllocator.allocate(...); ... return _VSTD::to_address(__pointer); I assume that `operator new()` still needs to return `void` rather than the fancy pointer so we need the `to_address` call in there. Is there some fallback for `to_address()` for unfancying the pointer in earlier standard versions? Maybe we should just go with the `static_assert()` for now? lewissbaker:* I'm not quite sure how to go about using `pointer_traits` here when interacting with coroutines…
				_LIBCPP_INLINE_VISIBILITY
				__task_promise_final_awaitable final_suspend() _NOEXCEPT { return {}; }

				_LIBCPP_INLINE_VISIBILITY
				void __set_continuation(coroutine_handle<> __continuation) _NOEXCEPT {
				CaseyCarterUnsubmitted Done Reply Inline Actions The allocator requirements mandate that the constructor does not throw; they don't mandate that it is declared `noexcept`. CaseyCarter: The allocator requirements mandate that the constructor does not throw; they don't mandate that…
				__continuation_ = __continuation;
				GorNishanovUnsubmitted Done Reply Inline Actions assert that this->__continuation_ is zero before assignment GorNishanov: assert that this->__continuation_ is zero before assignment
				}

				private:
				CaseyCarterUnsubmitted Done Reply Inline Actions `::new` to avoid broken garbage `operator new` overloads in `_Alloc`. CaseyCarter: `::new` to avoid broken garbage `operator new` overloads in `_Alloc`.
				friend class __task_promise_final_awaitable;

				coroutine_handle<> __continuation_;
				};

				template <typename _Tp>
				class _LIBCPP_TEMPLATE_VIS __task_promise final : public __task_promise_base {
				lewissbakerAuthorUnsubmitted Done Reply Inline Actions Add _NOEXCEPT. lewissbaker: Add _NOEXCEPT.
				using _Handle = coroutine_handle<__task_promise>;

				public:
				lewissbakerAuthorUnsubmitted Done Reply Inline Actions This should use explicit move construction rather than implicit move construction. lewissbaker: This should use explicit move construction rather than implicit move construction.
				_LIBCPP_INLINE_VISIBILITY
				__task_promise() _NOEXCEPT : __state_(_State::_no_value) {}

				~__task_promise() {
				switch (__state_) {
				case _State::_value:
				reinterpret_cast<_Tp*>(&__storage_)->~_Tp();
				CaseyCarterUnsubmitted Done Reply Inline Actions Style: `reinterpret_cast<_Tp&>(__storage_).~_Tp();` saves a couple of valuable keystrokes. (And 172, 206, ... if you care) CaseyCarter: Style: `reinterpret_cast<_Tp&>(__storage_).~_Tp();` saves a couple of valuable keystrokes. (And…
				break;
				#ifndef _LIBCPP_NO_EXCEPTIONS
				case _State::_exception:
				reinterpret_cast<exception_ptr*>(&__storage_)->~exception_ptr();
				break;
				#endif
				case _State::_no_value:
				break;
				};
				}

				_LIBCPP_INLINE_VISIBILITY
				_Handle get_return_object() _NOEXCEPT { return _Handle::from_promise(*this); }

				void unhandled_exception() _NOEXCEPT {
				#ifndef _LIBCPP_NO_EXCEPTIONS
				new (&__storage_) exception_ptr(current_exception());
				CaseyCarterUnsubmitted Done Reply Inline Actions `::new (static_cast<void>(&__storage_)) // ...` CaseyCarter:* `::new (static_cast<void*>(&__storage_)) // ...`
				__state_ = _State::_exception;
				#else
				terminate();
				#endif
				}

				// Only enable return_value() overload if _Tp is implicitly constructible from _Value
				template <
				EricWFUnsubmitted Done Reply Inline Actions `= default`? EricWF: `= default`?
				typename _Value,
				enable_if_t<is_convertible<_Tp, _Value&&>::value, int> = 0>
				CaseyCarterUnsubmitted Done Reply Inline Actions The parameters to `is_convertible` are the source expression's type-and-value-category, and the destination type - your arguments are reversed. Also, direct and copy-init can vary for class types in C++ so you should require `is_constructible` (I don't really care if it's "require instead" or "require as well"): is_constructible<_Tp, _Value> && is_convertible<_Value, _Tp> (The first argument of `is_convertible` and the non-first arguments to `is_constructible` go through `declval`, so `X&&` and `X` are equivalent.) CaseyCarter: The parameters to `is_convertible` are the source expression's type-and-value-category, and the…
				void return_value(_Value&& __value)
				_NOEXCEPT_((is_nothrow_constructible<_Tp, _Value&&>::value)) {
				CaseyCarterUnsubmitted Done Reply Inline Actions `is_nothrow_constructible<_Tp, _Value>` is equivalent and more concise. CaseyCarter: `is_nothrow_constructible<_Tp, _Value>` is equivalent and more concise.
				new (&__storage_) _Tp(static_cast<_Value&&>(__value));
				CaseyCarterUnsubmitted Done Reply Inline Actions Ditto `::new (static_cast<void>(&__storage_)` (I'll stop commenting these now.) CaseyCarter:* Ditto `::new (static_cast<void*>(&__storage_)` (I'll stop commenting these now.)
				// Only set __state_ after successfully constructing the value.
				// If constructor throws then state will be updated by unhandled_exception().
				__state_ = _State::_value;
				}

				_Tp& __lvalue_result() {
				__throw_if_exception();
				return reinterpret_cast<_Tp>(&__storage_);
				}

				_Tp&& __rvalue_result() {
				__throw_if_exception();
				return static_cast<_Tp&&>(reinterpret_cast<_Tp>(&__storage_));
				CaseyCarterUnsubmitted Not Done Reply Inline Actions Pure style comment: I recommend using the non-elaborated `friend __task_promise_final_awaitable;` whenever possible. `friend struct foo` declares or redeclares `struct foo` in the enclosing namespace, whereas `friend foo` uses name lookup to find `foo` and makes it a friend. The latter form makes it far easier to analyze compiler errors when you screw something up in maintenance. (And on 480) CaseyCarter: Pure style comment: I recommend using the non-elaborated `friend __task_promise_final_awaitable…
				}

				private:
				void __throw_if_exception() {
				#ifndef _LIBCPP_NO_EXCEPTIONS
				if (__state_ == _State::_exception) {
				rethrow_exception(
				reinterpret_cast<exception_ptr>(__storage_));
				}
				CaseyCarterUnsubmitted Done Reply Inline Actions Ditto `::new`. CaseyCarter: Ditto `::new`.
				#endif
				CaseyCarterUnsubmitted Done Reply Inline Actions This mem-initializer for `__state_` is redundant with the default member initializer on 306. CaseyCarter: This mem-initializer for `__state_` is redundant with the default member initializer on 306.
				lewissbakerAuthorUnsubmitted Done Reply Inline Actions I'll remove the default member initializer on 306 then. We can't `= default` the constructor here due to the union member containing types with non-trivial constructors/destructors. lewissbaker: I'll remove the default member initializer on 306 then. We can't `= default` the constructor…
				}

				enum class _State {
				_no_value,
				CaseyCarterUnsubmitted Done Reply Inline Actions Too pretty; needs to be more `__ugly` (all three enumerators) CaseyCarter: Too pretty; needs to be more `__ugly` (all three enumerators)
				_value
				#ifndef _LIBCPP_NO_EXCEPTIONS
				,
				CaseyCarterUnsubmitted Not Done Reply Inline Actions I suggest moving the `case` label and `break` outside the `#ifndef` here so the compiler won't warn about this case being unhandled when `_LIBCPP_NO_EXCEPTIONS`. CaseyCarter: I suggest moving the `case` label and `break` outside the `#ifndef` here so the compiler won't…
				_exception
				#endif
				};

				static constexpr size_t __storageSize =
				#ifdef _LIBCPP_NO_EXCEPTIONS
				lewissbakerAuthorUnsubmitted Done Reply Inline Actions This approach will result in a different ABI for the task promise when compiled without exceptions. @EricWF, @CaseyCarter: Does libc++ require standard library types to have the same ABI regardless of compiler flags? lewissbaker: This approach will result in a different ABI for the task promise when compiled without…
				sizeof(_Tp);
				#else
				sizeof(_Tp) > sizeof(exception_ptr) ? sizeof(_Tp)
				: sizeof(exception_ptr);
				CaseyCarterUnsubmitted Done Reply Inline Actions Weird indent. CaseyCarter: Weird indent.
				#endif

				static constexpr size_t __storageAlignment =
				#ifdef _LIBCPP_NO_EXCEPTIONS
				alignof(_Tp);
				#else
				alignof(_Tp) > alignof(exception_ptr)
				CaseyCarterUnsubmitted Not Done Reply Inline Actions Are you certain that `unhandled_exception` can't possibly be called after storing a value? If so, this would leak the value. CaseyCarter: Are you certain that `unhandled_exception` can't possibly be called after storing a value? If…
				lewissbakerAuthorUnsubmitted Done Reply Inline Actions Yes, I think there is a case where this could happen. If we execute a `co_return someValue` which calls `promise.return_value()` and constructs `__value_`. Then while executing the `goto final_suspend;` if any of the destructors of in-scope variables throw an exception which then escapes the coroutine body we could end up calling `unhandled_exception()` with `__value_` already having been constructed. Similarly, the exception thrown from a destructor could be caught and then a new `co_return someOtherValue;` executed. So we should probably be guarding against `__value_` containing a value inside `return_value()` as well. lewissbaker: Yes, I think there is a case where this could happen. If we execute a `co_return someValue`…
				? alignof(_Tp)
				: alignof(exception_ptr);
				#endif

				_State __state_;
				alignas(__storageAlignment) char __storage_[__storageSize];
				CaseyCarterUnsubmitted Done Reply Inline Actions Should be `unsigned char` so it can "provide storage" (http://eel.is/c++draft/intro.object#3) `alignas(_Tp) alignas(exception_ptr)` would let you skip the `__storageAlignment` rigamarole This class would be greatly simplified if this was an anonymous union: `union { char __empty; _Tp __value; exception_ptr __exception; };` so the `reinterpret_cast`ing could go away. CaseyCarter: Should be `unsigned char` so it can "provide storage" (http://eel.is/c++draft/intro.object#3)…
				};

				template <typename _Tp>
				class __task_promise<_Tp&> final : public __task_promise_base {
				CaseyCarterUnsubmitted Not Done Reply Inline Actions Style: you've been using the `_v` variable templates for traits elsewhere. CaseyCarter: Style: you've been using the `_v` variable templates for traits elsewhere.
				using _Ptr = _Tp*;
				using _Handle = coroutine_handle<__task_promise>;

				public:
				__task_promise() _NOEXCEPT
				#ifndef _LIBCPP_NO_EXCEPTIONS
				: __has_exception_(false)
				CaseyCarterUnsubmitted Done Reply Inline Actions Pure style comment: I'd use a default member initializer for `__has_exception` and not declare this constructor at all. CaseyCarter: Pure style comment: I'd use a default member initializer for `__has_exception` and not declare…
				CaseyCarterUnsubmitted Not Done Reply Inline Actions Unnecessary `std::` qualifier. (Occurs repeatedly.) CaseyCarter: Unnecessary `std::` qualifier. (Occurs repeatedly.)
				#endif
				{
				CaseyCarterUnsubmitted Not Done Reply Inline Actions Style: the use of trailing-return-type SFINAE here is inconsistent with the use of template parameter SFINAE on 254. CaseyCarter: Style: the use of trailing-return-type SFINAE here is inconsistent with the use of template…
				}

				~__task_promise() {
				#ifndef _LIBCPP_NO_EXCEPTIONS
				if (__has_exception_) {
				reinterpret_cast<exception_ptr*>(&__storage_)->~exception_ptr();
				CaseyCarterUnsubmitted Not Done Reply Inline Actions Ditto not a pointer to `exception_ptr`. CaseyCarter: Ditto not a pointer to `exception_ptr`.
				}
				#endif
				}

				_LIBCPP_INLINE_VISIBILITY
				_Handle get_return_object() _NOEXCEPT { return _Handle::from_promise(*this); }

				void unhandled_exception() _NOEXCEPT {
				#ifndef _LIBCPP_NO_EXCEPTIONS
				new (&__storage_) exception_ptr(current_exception());
				EricWFUnsubmitted Not Done Reply Inline Actions `= default` ? EricWF: `= default` ?
				__has_exception_ = true;
				#else
				// FIXME: Should this be _LIBCPP_ASSERT(false)?
				terminate();
				GorNishanovUnsubmitted Done Reply Inline Actions Probably. Compiler should not call it if we have no exceptions GorNishanov: Probably. Compiler should not call it if we have no exceptions
				#endif
				}

				void return_value(_Tp& __value) _NOEXCEPT {
				new (&__storage_) _Ptr(_VSTD::addressof(__value));
				}

				_Tp& __lvalue_result() {
				__throw_if_exception();
				return *reinterpret_cast<_Ptr>(&__storage_);
				}

				_Tp& __rvalue_result() { return __lvalue_result(); }

				private:
				void __throw_if_exception() {
				#ifndef _LIBCPP_NO_EXCEPTIONS
				if (__has_exception_) {
				rethrow_exception(
				reinterpret_cast<exception_ptr>(&__storage_));
				}
				#endif
				}

				static constexpr auto __storageSize =
				CaseyCarterUnsubmitted Not Done Reply Inline Actions These `__empty_` members seem extraneous. CaseyCarter: These `__empty_` members seem extraneous.
				lewissbakerAuthorUnsubmitted Not Done Reply Inline Actions The `__empty_` members were added at your request ;) See https://reviews.llvm.org/D46140?id=144168#inline-403822 I can remove them if you think they're not necessary. lewissbaker: The `__empty_` members were added at your request ;) See https://reviews.llvm.org/D46140?
				#ifdef _LIBCPP_NO_EXCEPTIONS
				CaseyCarterUnsubmitted Done Reply Inline Actions Should we `static_assert` that `_Tp` is a destructible object type? CaseyCarter: Should we `static_assert` that `_Tp` is a destructible object type?
				lewissbakerAuthorUnsubmitted Done Reply Inline Actions Sure, will do. I wonder if we should make this a requirement in the wording of P1056? What do you think @GorNishanov? lewissbaker: Sure, will do. I wonder if we should make this a requirement in the wording of [P1056](https…
				sizeof(_Ptr);
				#else
				sizeof(_Ptr) > sizeof(exception_ptr)
				? sizeof(_Ptr)
				: sizeof(exception_ptr);
				#endif

				static constexpr auto __storageAlignment =
				#ifdef _LIBCPP_NO_EXCEPTIONS
				alignof(_Ptr);
				#else
				alignof(_Ptr) > alignof(exception_ptr)
				? alignof(_Ptr)
				: alignof(exception_ptr);
				#endif

				EricWFUnsubmitted Done Reply Inline Actions We'll want a constant ABI here. The `exception_ptr` type should be complete even when exceptions are disabled, so lets just always enable this. EricWF: We'll want a constant ABI here. The `exception_ptr` type should be complete even when…
				alignas(__storageAlignment) char __storage_[__storageSize];
				CaseyCarterUnsubmitted Done Reply Inline Actions Ditto comments from the other specialization. CaseyCarter: Ditto comments from the other specialization.
				bool __has_exception_;
				};

				template <>
				class __task_promise<void> final : public __task_promise_base {
				using _Handle = coroutine_handle<__task_promise>;

				public:
				__task_promise() _NOEXCEPT
				#ifndef _LIBCPP_NO_EXCEPTIONS
				: __has_exception_(false)
				#endif
				{
				}

				~__task_promise() {
				#ifndef _LIBCPP_NO_EXCEPTIONS
				if (__has_exception_) {
				reinterpret_cast<exception_ptr*>(&__storage_)->~exception_ptr();
				}
				#endif
				}

				_Handle get_return_object() _NOEXCEPT { return _Handle::from_promise(*this); }

				void return_void() _NOEXCEPT {}

				void unhandled_exception() _NOEXCEPT {
				#ifndef _LIBCPP_NO_EXCEPTIONS
				new (&__storage_) exception_ptr(current_exception());
				__has_exception_ = true;
				#endif
				}

				void __lvalue_result() { __throw_if_exception(); }

				void __rvalue_result() { __throw_if_exception(); }

				private:
				void __throw_if_exception() {
				#ifndef _LIBCPP_NO_EXCEPTIONS
				if (__has_exception_) {
				rethrow_exception(
				reinterpret_cast<exception_ptr>(&__storage_));
				}
				#endif
				GorNishanovUnsubmitted Not Done Reply Inline Actions Why I am concerned about await_ready() => __coro.done() is that check is inherently racy. If the rule is, coroutine starts suspended and then you are allowed await on it once and that will start it. (Or you can use library helpers, spawn, when_all, etc. that will do similar thing. They will subscribe and start the coroutine). Once the coroutine is started, you cannot touch it. I struggle to see the use case. Once coroutine is started, it has already has continuation hooked up and once finished it will resume the waiter that will lead to the destruction of the coroutine. So, nobody can check for done-ness. If it has already been started, but has not finished, are we going to subscribe again? How would we distinguish the case that coroutine has not started, vs started but has not completed? I see this `await_ready()` returning anything but `false` to be fraught with danger. GorNishanov: Why I am concerned about ``` await_ready() => __coro.done() ``` is that check is inherently…
				lewissbakerAuthorUnsubmitted Not Done Reply Inline Actions I do not see this as being racy. A `task` is not designed to be safe to access concurrently from multiple threads. I have tried to follow the general guideline of non-const methods cannot be called concurrently with any other access to the object, while const methods can generally be safely called concurrently with other const methods. The latest revision of `task<T>` does not have any const methods. When you first call a `task`-returning coroutine it creates the coroutine initially suspended. When the `task` is first `co_await`ed this will call `__coro.done()` inside `await_ready()` (this is safe since the coroutine is suspended). The awaiting coroutine is then suspended and the `task`'s coroutine is resumed. When the `task`'s coroutine completes (ie. suspends at `final_suspend` point) it then becomes `.done()` and the awaiting coroutine is resumed. If we treat the `task` as a normal non-thread-safe object then it is up to the caller to ensure the `task` is not accessed concurrently from another thread or coroutine. In this case, the awaiting coroutine should be the only coroutine that is accessing the `task` object for the duration of the `co_await` operation. As the awaiting coroutine will be suspended while the task's coroutine is executing, it will not have an opportunity to `co_await` the task again until the task completes, at which point the task's coroutine will be suspended at final_suspend-point and thus will be `.done()`. If a coroutine then subsequently `co_await`s the task again after the first `co_await` operation has completed then it will see that the coroutine is alread `.done()` and so will continue without suspending. How would we distinguish the case that coroutine has not started, vs started but has not completed? A `task` object should only ever be able to observe the underlying coroutine being either suspended at initial_suspend (before the task is first co_awaited) or at final_suspend (after the first co_await has completed). The coroutine that has access to the `task` will be suspended while the task's coroutine is executing and so cannot observe the started-but-not-completed state. Once coroutine is started, it has already has continuation hooked up and once finished it will resume the waiter that will lead to the destruction of the coroutine. So, nobody can check for done-ness. The task's coroutine is not destroyed until `~task` is executed, which may not be immediately after `co_await` expression resumes if the `task` object has been assigned to a local variable. In this case, the task's coroutine's `.done()` state would be observable. I struggle to see the use case. For one use-case, consider: task<record> get_record(int id); task<> example() { // Want to only get the record once regardless of how many times // it's used below, but don't get the record if we don't need it. task<record> t = get_record(123); if (someCond) { record& r = co_await t; //... } //... later if (someOtherCond) { record& r = co_await t; //... } } lewissbaker: I do not see this as being racy. A `task` is not designed to be safe to access concurrently…
				GorNishanovUnsubmitted Not Done Reply Inline Actions With regard to the use case. Let's imagine that we will get the return_slot<T> type in C++20. Now, we don't have to store the result in the promise, return_value will directly constructed the result in the call site of the co_await. Or, even without the return_slot<T> type, we can construct the result directly in the awaiter. (That is not what we are doing and it is fine, but, if it were the case, you would not be able to extract the result the second time. It is similar to the std::future where you can only get the result once. GorNishanov: With regard to the use case. Let's imagine that we will get the return_slot<T> type in C++20.
				lewissbakerAuthorUnsubmitted Not Done Reply Inline Actions It is similar to the std::future where you can only get the result once. Yes, I guess that single-await or multi-await is really the major choice here. If we limit it so that you can only co_await the task once then it does give more flexibility on implementation that could potentially lead to a more efficient implementation in some cases. This does prevent the use-case I mentioned above, however. I'm open to putting this restriction in for now - it matches the behaviour of std::future::get(). If we do this then we might want to consider having a single `operator co_await` that returns a prvalue - although this would potentially incur an extra copy/move in some cases compared with returning a reference to the value stored in the promise. Let's imagine that we will get the return_slot<T> type in C++20. Can P0878R0 (sub-object copy elision) help here if we put the result object in the awaiter and try to return the sub-object from await_resume()? If this can be made to work we may not need return_slot<T> to get RVO for tasks. lewissbaker: > It is similar to the std::future where you can only get the result once. Yes, I guess that…
				lewissbakerAuthorUnsubmitted Not Done Reply Inline Actions As an aside, if we did want to allow multiple coroutines to concurrently `co_await` an asynchronous result then something like `cppcoro::shared_task<T>` would be more appropriate as it correctly handles synchronisation required to avoid the data races. lewissbaker: As an aside, if we did want to allow multiple coroutines to concurrently `co_await` an…
				}
				CaseyCarterUnsubmitted Not Done Reply Inline Actions Do we care about `task<cv-void>`? CaseyCarter: Do we care about `task<cv-void>`?
				lewissbakerAuthorUnsubmitted Done Reply Inline Actions I don't feel strongly either way. It's not something I've ever had a use for, but maybe it should be done for completeness? lewissbaker: I don't feel strongly either way. It's not something I've ever had a use for, but maybe it…

				#ifndef _LIBCPP_NO_EXCEPTIONS
				alignas(exception_ptr) char __storage_[sizeof(exception_ptr)];
				bool __has_exception_;
				CaseyCarterUnsubmitted Not Done Reply Inline Actions Why is this not simply: template <> class __task_promise<void> final : public __task_promise_base { using _Handle = coroutine_handle<__task_promise>; public: _Handle get_return_object() _NOEXCEPT { return _Handle::from_promise(this); } void return_void() _NOEXCEPT {} void unhandled_exception() _NOEXCEPT { #ifndef _LIBCPP_NO_EXCEPTIONS __exception_ = current_exception(); #endif } void __lvalue_result() { __throw_if_exception(); } void __rvalue_result() { __throw_if_exception(); } private: void __throw_if_exception() { #ifndef _LIBCPP_NO_EXCEPTIONS if (__exception_) { rethrow_exception(__exception_); } #endif } #ifndef _LIBCPP_NO_EXCEPTIONS exception_ptr __exception_; #endif }; CaseyCarter:* Why is this not simply: ``` template <> class __task_promise<void> final : public…
				lewissbakerAuthorUnsubmitted Not Done Reply Inline Actions This was done to avoid calling exception_ptr runtime functions in the common case where there is no exception created. I thought I recalled this being done by someone from MS (yourself or perhaps Kenny Kerr?) in a generator<T> implementation to avoid the overhead of calls to runtime library functions for the exception_ptr default constructor, operator bool() and destructor as these are generally not inlinable. However, I currently can't find the reference code for this and it's possible I may be misremembering here. A std::optional<exception_ptr> would achieve the same ends but I was conscious of incurring the extra dependency on the <optional> header. I can simplify the implementation to what you suggest and re-apply the optimisation later if profiling determines it to be a performance problem. lewissbaker: This was done to avoid calling exception_ptr runtime functions in the common case where there…
				CaseyCarterUnsubmitted Not Done Reply Inline Actions That was Kenny's WinRT generator which I have since changed to use `exception_ptr` directly, and Microsoft's crappy implementation of `exception_ptr`. I have no idea about libc++'s `exception_ptr`. (I expect to fix ours when we can next break ABI: it's really terrible that everything isn't inlinable in the empty/fast case.) CaseyCarter: That was Kenny's WinRT generator which I have since changed to use `exception_ptr` directly…
				#endif
				};

				template <typename _Tp>
				class _LIBCPP_TEMPLATE_VIS task {
				GorNishanovUnsubmitted Not Done Reply Inline Actions class [[nodiscard]] task Probably need to use `_LIBCPP_NODISCARD_AFTER_CXX17` macro to put nodiscard in. Same comment for others specializations of task GorNishanov: ```class [[nodiscard]] task``` Probably need to use `_LIBCPP_NODISCARD_AFTER_CXX17` macro to…
				lewissbakerAuthorUnsubmitted Not Done Reply Inline Actions This macro would seem to only enable nodiscard when building for c++2a or later. Does libc++ not use `[[nodiscard]]` on any APIs when built against C++17? lewissbaker: This macro would seem to only enable nodiscard when building for c++2a or later. Does libc++…
				public:
				using promise_type = __task_promise<_Tp>;
				CaseyCarterUnsubmitted Not Done Reply Inline Actions excess indent CaseyCarter: excess indent

				private:
				GorNishanovUnsubmitted Not Done Reply Inline Actions Do we really have a use case for implicit public constructor from coroutine_handle? I think this should be a private constructor. GorNishanov: Do we really have a use case for implicit public constructor from coroutine_handle? I think…
				lewissbakerAuthorUnsubmitted Not Done Reply Inline Actions Yes, it probably should be a private constructor. It was added to allow the coroutine to implicitly construct the task from the coroutine_handle returned from get_return_object() - this simplifies the implementation but isn't required. I'll change get_return_object() to return a task<T> instead and add some appropriate friend-declarations. lewissbaker: Yes, it probably should be a private constructor. It was added to allow the coroutine to…
				using _Handle = coroutine_handle<__task_promise<_Tp> >;

				class _AwaiterBase {
				CaseyCarterUnsubmitted Not Done Reply Inline Actions Should these `__foovalue_result` members be `foo`-qualified to make them harder to misuse? CaseyCarter: Should these `__foovalue_result` members be `foo`-qualified to make them harder to misuse?
				lewissbakerAuthorUnsubmitted Done Reply Inline Actions I'm not sure if it makes it much safer. We already have 'rvalue' in the method name which gives a hint to its move-ness. It's currently only called in one place: return this->__coro_.promise().__rvalue_result(); would become: return std::move(this->__coro_.promise()).__rvalue_result(); lewissbaker: I'm not sure if it makes it much safer. We already have 'rvalue' in the method name which gives…
				public:
				_AwaiterBase(_Handle __coro) _NOEXCEPT : __coro_(__coro) {}

				CaseyCarterUnsubmitted Not Done Reply Inline Actions Looking at these on my second readthrough I've realized you don't have a consistent SMF story for the promise hierarchy. Those classes are sometimes copy/movable depending on the parameter type and whether or not exceptions are enabled. Should they have explicitly deleted SMFs? CaseyCarter: Looking at these on my second readthrough I've realized you don't have a consistent SMF story…
				lewissbakerAuthorUnsubmitted Not Done Reply Inline Actions The promise types are not really part of the public API so I wasn't so concerned with locking down the interface. I can add the explicit declaration of deleted SMFs if you like. lewissbaker: The promise types are not really part of the public API so I wasn't so concerned with locking…
				_LIBCPP_INLINE_VISIBILITY
				bool await_ready() const _NOEXCEPT { return !__coro_ \|\| __coro_.done(); }
				GorNishanovUnsubmitted Not Done Reply Inline Actions Can it be just: return false; Coroutine starts suspended, await starts it up. awaiting on the coroutine that does not exist looks like a programmer bug, awaiting on the coroutine the second time it likely to be a bug too GorNishanov: Can it be just: ``` return false; ``` Coroutine starts suspended, await starts it up.
				lewissbakerAuthorUnsubmitted Not Done Reply Inline Actions The design I went for with `cppcoro::task` was to allow awaiting the value multiple times so that `task<T>` would act a bit like an asynchronous version of a `lazy<T>`. If we only allowed co_await'ing the task once then this should: allow the coroutine frame to be destroyed earlier (eg. we could pass ownership of the coroutine frame to awaiter object which could destroy coroutine frame at end of statement rather than when ~task is run). eliminate the need for the conditional on `await_ready()`. allow placing storage for the result in the awaiter object rather than the promise object - would this potentially provide more opportunity for copy elision? lewissbaker: The design I went for with `cppcoro::task` was to allow awaiting the value multiple times so…
				GorNishanovUnsubmitted Not Done Reply Inline Actions I played around some time ago and I think I was getting the best results when the result was in the promise. We may get in the future a return_slot<T> type, that would allow to place result directly in the caller site (like RVO), if we get that, we would need the result in the promise, but, for now, it is fine to keep it there. GorNishanov: I played around some time ago and I think I was getting the best results when the result was in…

				_LIBCPP_INLINE_VISIBILITY
				_Handle await_suspend(coroutine_handle<> __continuation) const _NOEXCEPT {
				__coro_.promise().__set_continuation(__continuation);
				return __coro_;
				}

				protected:
				GorNishanovUnsubmitted Not Done Reply Inline Actions Should this be task& operator=(task &&__other) _NOEXCEPT { GorNishanov: Should this be ``` task& operator=(task &&__other) _NOEXCEPT { ```
				lewissbakerAuthorUnsubmitted Not Done Reply Inline Actions Yes, the parameter could be made `task&&`. I'd just need to move-construct to a temporary inside the body instead. lewissbaker: Yes, the parameter could be made `task&&`. I'd just need to move-construct to a temporary…
				GorNishanovUnsubmitted Not Done Reply Inline Actions Ah Ok. No need to change it then. I am starting to work on wording and trying to figure out which things are the artifacts of the implementation and which should be carried over into the wording. GorNishanov: Ah Ok. No need to change it then. I am starting to work on wording and trying to figure out…
				_Handle __coro_;
				};

				public:

				CaseyCarterUnsubmitted Done Reply Inline Actions Similar to above, should we `static_assert` that `_Tp` is `void`, an lvalue reference type, or a destructible non-array object type? CaseyCarter: Similar to above, should we `static_assert` that `_Tp` is `void`, an lvalue reference type, or…
				lewissbakerAuthorUnsubmitted Done Reply Inline Actions Sure, can do. Further, to support `task<T>::operator co_await() &&` T needs to both be destructible and move-constructible. For `task<T>::operator co_await() &` T only needs to be destructible. lewissbaker: Sure, can do. Further, to support `task<T>::operator co_await() &&` T needs to both be…
				_LIBCPP_INLINE_VISIBILITY
				task(_Handle __coro) _NOEXCEPT : __coro_(__coro) {}

				_LIBCPP_INLINE_VISIBILITY
				GorNishanovUnsubmitted Not Done Reply Inline Actions since we have both operator co_await () & operator co_await() && is it the same as simply having a operator co_await () You allow mutable l-values, R-values, but not const values. Also, why `operator co_await` at all? as opposed to three await_xxx functions? I am thinking specifying the wording that allow implementators flexibility to pick either approach. Is there a reason to mandate one over the other? GorNishanov: since we have both ``` operator co_await () & operator co_await() && ``` is it the same as…
				lewissbakerAuthorUnsubmitted Not Done Reply Inline Actions The reason `operator co_await()` are added is to allow overloading so that when a `task<T>&` is awaited it returns a T& `and when a` task<T>&& `is awaited it returns a` T&&`. We can't overload `await_resume() &` and `await_resume() &&` since the `co_await` mechanics states that it always calls `await_resume()` on the awaiter lvalue. If we change `task<T>` to only support awaiting once then we could get rid of `operator co_await()` and just add `await_xxx()` methods directly on `task<T>`. Having said that, it does feel a little dirty having the `await_xxx()` methods popup in intellisense completion when auto-completing methods on a `task<T>` instance - these methods are not intended to be called directly by users of the type. lewissbaker: The reason `operator co_await()` are added is to allow overloading so that when a `task<T>&` is…
				GorNishanovUnsubmitted Not Done Reply Inline Actions I will try to specify the wording so that both `await_xxx` and a single `operator co_await` are acceptable implementations. I do strongly feel that task<T> should have await once semantics. It has a potential to unlock RVO. It is (potentially) less prone to misuse (my subjective feeling). await_resume() for task<T> should return T. It allows RVO and eliminates dangling references. No need to change anything at the moment in the implementation. Let's see how specification works out and then alter implementation (if needed) to match. Sometimes specifying something precisely gives clarity to a problem :-). GorNishanov: I will try to specify the wording so that both `await_xxx` and a single `operator co_await` are…
				task(task&& __other) _NOEXCEPT
				: __coro_(_VSTD::exchange(__other.__coro_, {})) {}
				CaseyCarterUnsubmitted Done Reply Inline Actions `#include<utility>`? CaseyCarter: `#include<utility>`?

				task(const task&) = delete;
				task& operator=(const task&) = delete;

				_LIBCPP_INLINE_VISIBILITY
				~task() {
				if (__coro_)
				__coro_.destroy();
				}

				_LIBCPP_INLINE_VISIBILITY
				task& operator=(task&& __other) _NOEXCEPT {
				_Handle __oldHandle =
				_VSTD::exchange(__coro_, _VSTD::exchange(__other.__coro_, {}));
				if (__oldHandle)
				__oldHandle.destroy();
				CaseyCarterUnsubmitted Done Reply Inline Actions Pure style comment: Let your destructor do the work: task __old = _VSTD::exchange(__coro_, _VSTD::exchange(__other.__coro_, {})); CaseyCarter: Pure style comment: Let your destructor do the work: ``` task __old = _VSTD::exchange(__coro_…
				lewissbakerAuthorUnsubmitted Done Reply Inline Actions I was considering implementing this by passing the parameter by value. task& operator=(task __other) _NOEXCEPT { _VSTD::swap(__coro_, __other.__coro_); } lewissbaker: I was considering implementing this by passing the parameter by value. ``` task& operator=…
				return *this;
				}

				_LIBCPP_INLINE_VISIBILITY
				bool valid() const _NOEXCEPT { return __coro_; }
				GorNishanovUnsubmitted Done Reply Inline Actions Not sure if we need valid() an read() accessors GorNishanov: Not sure if we need valid() an read() accessors
				lewissbakerAuthorUnsubmitted Done Reply Inline Actions I'm not sure either. The only place I've used `ready()` thus far has been in tests. I'll remove them for now - we can always add them in later if we find a need. lewissbaker: I'm not sure either. The only place I've used `ready()` thus far has been in tests. I'll…

				_LIBCPP_INLINE_VISIBILITY
				bool ready() const _NOEXCEPT { return !__coro_ \|\| __coro_.done(); }

				_LIBCPP_INLINE_VISIBILITY
				auto operator co_await() & _NOEXCEPT {
				class _Awaiter : public _AwaiterBase {
				public:
				using _AwaiterBase::_AwaiterBase;
				_LIBCPP_INLINE_VISIBILITY
				decltype(auto) await_resume() {
				// FIXME: Should it be undefined-behaviour to co_await a task
				// that is not valid() or should we throw std::future_error(broken_promise)
				// or a std::logic_error{} of some sort?
				if (!this->__coro_)
				GorNishanovUnsubmitted Done Reply Inline Actions ASSERT (later we can teach sanitizer to deal with it). Whatever is the requirement is on await_resume, we can put it as a precondition in the wording or something. That would allow implementation to catch in debug mode or with sanitizers GorNishanov: ASSERT (later we can teach sanitizer to deal with it). Whatever is the requirement is on…
				lewissbakerAuthorUnsubmitted Done Reply Inline Actions Ok, I'll update this to an ASSERT. lewissbaker: Ok, I'll update this to an ASSERT.
				#ifndef _LIBCPP_NO_EXCEPTIONS
				throw logic_error{"task<T> is not valid()"};
				#else
				terminate();
				CaseyCarterUnsubmitted Not Done Reply Inline Actions `this->` is extraneous in these classes that derive from the concrete `_AwaiterBase`. (And on 470) CaseyCarter: `this->` is extraneous in these classes that derive from the concrete `_AwaiterBase`. (And on…
				lewissbakerAuthorUnsubmitted Done Reply Inline Actions I get compile-errors if I remove the `this->`. libcxx/include/experimental/task:502:16: error: 'std::experimental::coroutines_v1::task<counted>::__coro_' is not a member of class '_Awaiter' return __coro_.promise().__rvalue_result(); Compiler bug? lewissbaker: I get compile-errors if I remove the `this->`. ``` libcxx/include/experimental/task:502:16…
				#endif
				return this->__coro_.promise().__lvalue_result();
				}
				};
				return _Awaiter{__coro_};
				CaseyCarterUnsubmitted Done Reply Inline Actions Is missing a subject. How about "co_await on an invalid task<T> has undefined behavior"? (And on 475) CaseyCarter: Is missing a subject. How about "co_await on an invalid task<T> has undefined behavior"? (And…
				}

				_LIBCPP_INLINE_VISIBILITY
				auto operator co_await() && _NOEXCEPT {
				class _Awaiter : public _AwaiterBase {
				public:
				using _AwaiterBase::_AwaiterBase;
				_LIBCPP_INLINE_VISIBILITY
				decltype(auto) await_resume() {
				// FIXME: Should it be undefined-behaviour to co_await a task
				GorNishanovUnsubmitted Done Reply Inline Actions Same comment. Programming error => UB. Not an exception GorNishanov: Same comment. Programming error => UB. Not an exception
				// that is not valid() or should we throw std::future_error(broken_promise)
				// or a std::logic_error{} of some sort?
				if (!this->__coro_)
				#ifndef _LIBCPP_NO_EXCEPTIONS
				throw logic_error{"task<T> is not valid()"};
				#else
				terminate();
				#endif
				return this->__coro_.promise().__rvalue_result();
				}
				};
				return _Awaiter{__coro_};
				}

				void swap(task& __other) _NOEXCEPT { _VSTD::swap(__coro_, __other.__coro_); }

				private:
				_Handle __coro_;
				};

				_LIBCPP_END_NAMESPACE_EXPERIMENTAL_COROUTINES


				#endif

				////////////////////////
				// Tests for task<T>

				using namespace std::experimental;

				task<void> f1()
				{
				co_return;
				}

				task<void> f2()
				{
				co_await f1();
				}

				task<void> f3()
				{
				task<void> t = f1();
				co_await t;
				}

				task<int> g1()
				{
				co_return 123;
				}

				task<int> g2()
				{
				co_return co_await g1() + 100;
				}

				task<int> g3()
				{
				// co_await rvalue returns rvalue reference to result.
				{
				decltype(auto) x = co_await g1();
				CaseyCarterUnsubmitted Not Done Reply Inline Actions Style: `auto&& x` is more concise. Doesn't `x` dangle in this case? Be really explicit in the proposal about why it's a good idea if so. CaseyCarter: Style: `auto&& x` is more concise. Doesn't `x` dangle in this case? Be really explicit in…
				lewissbakerAuthorUnsubmitted Not Done Reply Inline Actions This particular piece of code was just to check the expression's type was `int&&`. The value would indeed dangle if this code was executed. I did it this way as a workaround for the fact that `co_await` is not currently allowed inside a `decltype()` expression. Also, `auto&&` has slightly different semantics to that of `decltype(auto)` when the initialising expression is a prvalue. lewissbaker: This particular piece of code was just to check the expression's type was `int&&`. The value…
				static_assert(std::is_same_v<decltype(x), int&&>);
				}

				// co_await lvalue returns lvalue reference to result
				task<int> t = g1();
				decltype(auto) y = co_await t;
				static_assert(std::is_same_v<decltype(y), int&>);

				co_return y * 2;
				}

				class MoveOnly
				{
				public:
				MoveOnly();
				MoveOnly(const MoveOnly&) = delete;
				MoveOnly(MoveOnly&&) noexcept;
				~MoveOnly();
				int get() const;
				private:
				void* _data;
				};

				task<MoveOnly> h1(bool x)
				{
				if (x) {
				MoveOnly value;
				co_return std::move(value);
				CaseyCarterUnsubmitted Not Done Reply Inline Actions @GorNishanov: Why isn't `co_return foo;` equivalent to `co_return move(foo);` for an automatic variable `foo`? CaseyCarter: @GorNishanov: Why isn't `co_return foo;` equivalent to `co_return move(foo);` for an automatic…
				GorNishanovUnsubmitted Not Done Reply Inline Actions Not implemented yet in Clang and MSVC. I filed a bug: https://bugs.llvm.org/show_bug.cgi?id=37265 GorNishanov: Not implemented yet in Clang and MSVC. I filed a bug: https://bugs.llvm.org/show_bug.cgi?
				}
				co_return MoveOnly{};
				}

				task<int> h2()
				{
				auto x = co_await h1(true);
				co_return x.get();
				}

				static int x;
				static int y;

				task<int&> h3(bool cond)
				{
				co_return cond ? x : y;
				}

				task<void> h3consumer()
				{
				int& result = co_await h3(true);
				result = 32;
				}

				// custom allocator tests

				template<typename Allocator>
				task<void> a1(std::allocator_arg_t, Allocator allocator, bool x)
				{
				co_return;
				}

				class my_allocator
				{
				public:
				my_allocator();
				my_allocator(const my_allocator&) noexcept;
				my_allocator(my_allocator&&) noexcept;
				char* allocate(size_t n);
				void deallocate(char* p, size_t n);
				};

				task<void> a2()
				{
				my_allocator alloc;
				co_await a1(std::allocator_arg, alloc, true);
				}

				task<void> a2a()
				{
				my_allocator alloc;
				return a1(std::allocator_arg, alloc, true);
				}
				CaseyCarterUnsubmitted Done Reply Inline Actions I suggest adding a test case with a stateful allocator. CaseyCarter: I suggest adding a test case with a stateful allocator.
				lewissbakerAuthorUnsubmitted Done Reply Inline Actions Will do. lewissbaker: Will do.


				task<void> a3()
				{
				co_await a1(std::allocator_arg, std::allocator<char>{}, false);
				}

				task<void> a3a()
				{
				return a1(std::allocator_arg, std::allocator<char>{}, false);
				}

This is an archive of the discontinued LLVM Phabricator instance.

[coroutines] Add std::experimental::task<T> typeNeeds ReviewPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 144168

include/experimental/task

[coroutines] Add std::experimental::task<T> type
Needs ReviewPublic