This is an archive of the discontinued LLVM Phabricator instance.

libcxx/include/__memory/uninitialized_algorithms.h
501	Is this moved too? For reviewing I prefer a stack of 2 (or more) commits where the moving of code is separated from the real changes.
libcxx/include/__utility/move.h
31 ↗	(On Diff #438215)	Is this needed? I assume the original code also resulted in this type when exceptions were disabled.
libcxx/test/libcxx/containers/sequences/vector/asan_throw.pass.cpp
102	Is this a behaviour change or a bug fix? I don't understand this change. Maybe add a description to the patch that explains this behaviour change. That way when we look at the history we know why this was done.

philnik marked an inline comment as done.Jun 20 2022, 9:38 AM

philnik added inline comments.

libcxx/include/__memory/uninitialized_algorithms.h
501	No, this isn't just moved. Here the names and behaviour changed. The algorithms now destroy the elements again if an exception has been thrown.
libcxx/include/__utility/move.h
31 ↗	(On Diff #438215)	https://godbolt.org/z/vsPqYa99h I also found it weird, but `noexcept` is still correctly checked with `-fno-exceptions`.
libcxx/test/libcxx/containers/sequences/vector/asan_throw.pass.cpp
102	https://eel.is/c++draft/vector#modifiers-2 is the relevant paragraph I think. I'm not entirely sure if this is just a behaviour change or if it's a bugfix. I think it's a bugfix.
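
To make the `move.h` point concrete, here is a minimal sketch of how `std::move_if_noexcept` chooses its result type from the `noexcept` specification of the move constructor. The two element types are hypothetical and not part of the patch. According to the comment above, the same selection happens when compiling with `-fno-exceptions`, because the `noexcept` operator is still evaluated there.

```cpp
#include <cassert>
#include <type_traits>
#include <utility>

// Hypothetical type whose move constructor is declared as potentially throwing.
struct ThrowingMove {
  ThrowingMove() = default;
  ThrowingMove(const ThrowingMove&) {}
  ThrowingMove(ThrowingMove&&) noexcept(false) {}
};

// Hypothetical type whose move constructor promises not to throw.
struct NothrowMove {
  NothrowMove() = default;
  NothrowMove(const NothrowMove&) {}
  NothrowMove(NothrowMove&&) noexcept {}
};

// A copyable type with a potentially throwing move is handed out as const T&,
// so callers copy it instead of moving it.
constexpr bool throwing_move_is_copied =
    std::is_same<decltype(std::move_if_noexcept(std::declval<ThrowingMove&>())),
                 const ThrowingMove&>::value;

// A type with a noexcept move constructor is handed out as T&&.
constexpr bool nothrow_move_is_moved =
    std::is_same<decltype(std::move_if_noexcept(std::declval<NothrowMove&>())),
                 NothrowMove&&>::value;
```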

Rebased

Harbormaster completed remote builds in B171526: Diff 439283.Jun 23 2022, 2:38 AM

ldionne requested changes to this revision.Jun 23 2022, 11:17 AM

ldionne added inline comments.

libcxx/include/__memory/uninitialized_algorithms.h
503	Can you please add a comment explaining what `__uninitialized_allocator_copy` & friends do similar to what we do in `__uninitialized_allocator_fill_n` (and others)? In particular, I think that explaining the exception safety guarantee offered by each algorithm added here is important.
506	Should we have a `static_assert(__is_cpp17_copy_insertable<...>)` here?
525	Let's introduce `RawType2` for symmetry?
568	If you used `std::copy` (or `std::copy_n`), I think you could simplify this and you wouldn't have to handle `reverse_iterator` specially below.
libcxx/include/__utility/move.h
30 ↗	(On Diff #439283)	We shouldn't do this. It makes us non-conforming under `-fno-exceptions`. We shouldn't try to do as-if `noexcept(<anything>) == true` in the library when `-fno-exceptions` is used. If we wanted to have that behavior, it should be achieved through the compiler. Also relevant as background: D62228
libcxx/include/vector
903	The code didn't destroy the new elements before in case of a failure, but now it does. I wonder whether the original code was written that way on purpose?
libcxx/test/libcxx/containers/sequences/vector/asan_throw.pass.cpp
102	My reading is that we are fixing a bug, since this line of the spec should apply: If an exception is thrown other than by the copy constructor, move constructor, assignment operator, or move assignment operator of `T` or by any `InputIterator` operation there are no effects. Indeed, the exception is thrown by `X(char)`, which is none-of-the-above, and so there should be no effects. Previously, the size of the vector would have been modified, and that's wrong. This mandates a test in `libcxx/test/std` -- it's a pretty serious bug since exception guarantees in `push_back` & friends are a big deal.
106	We should also have a test that ensures that we destroy the newly created elements if an exception is thrown. I think it was wrong to skip that in the code previously, since that means that we'd have been potentially leaking stuff if an exception was thrown.
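
The two test requests above could be sketched roughly as follows. This is only an illustration, not the test that was added to `libcxx/test/std`: the element type `X`, its `live` counter and the helper function are hypothetical names. It assumes a standard library that implements the quoted "no effects" wording for range `insert`.

```cpp
#include <cassert>
#include <cstddef>
#include <stdexcept>
#include <vector>

// Hypothetical element type: converting from char throws for '!'.
struct X {
  static int live;  // number of X objects currently alive
  char value;
  X(char c) : value(c) {
    if (c == '!') throw std::runtime_error("bad char");
    ++live;  // only counted once construction has succeeded
  }
  X(const X& other) : value(other.value) { ++live; }
  ~X() { --live; }
};
int X::live = 0;

// Returns true if a range insert that throws from X(char) leaves the vector
// unchanged and destroys every element it had already constructed.
bool failed_range_insert_has_no_effects() {
  X::live = 0;
  std::vector<X> v;
  v.reserve(8);  // enough capacity, so the insert does not reallocate
  v.push_back(X('a'));
  v.push_back(X('b'));
  const char input[] = {'c', 'd', '!'};
  bool threw = false;
  try {
    v.insert(v.end(), input, input + 3);  // X('!') throws on the third element
  } catch (const std::runtime_error&) {
    threw = true;
  }
  return threw && v.size() == 2 && v[0].value == 'a' && v[1].value == 'b' &&
         X::live == 2;
}
```

The exception comes from `X(char)`, which is not one of the operations the standard wording exempts, so both the size and the number of live objects should be unchanged after the failed call.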

This revision now requires changes to proceed.Jun 23 2022, 11:17 AM

Address comments

libcxx/include/__memory/uninitialized_algorithms.h
506	I'm not sure. I'll investigate it later, since it breaks a lot of tests. I think it's either not applicable or the trait is currently broken. It's not used anywhere right now.

Harbormaster completed remote builds in B172109: Diff 440094.Jun 26 2022, 3:27 PM

Try to fix CI

philnik mentioned this in D68365: [libc++] Implement P1004R2 (constexpr std::vector).Jun 27 2022, 3:18 AM

Harbormaster completed remote builds in B172149: Diff 440149.Jun 27 2022, 3:37 AM

ldionne requested changes to this revision.Jul 6 2022, 2:32 PM

ldionne added inline comments.

libcxx/include/__memory/uninitialized_algorithms.h
362–365	Let's use the same pattern for `__enable_if_t` here, i.e. use a non-type template parameter like you do below.
390–391	We should figure out what clang-format does wrong here, but in the meantime I would rather use the same formatting as line 364-366.
503	Please also add a note that array elements are NOT treated specially by this function. We may also want to differentiate between functions in this file that handle arrays vs those that don't, since it's really not obvious from their current names.
512	I think you either need to construct array elements recursively and destroy them recursively, or not. But construction and destruction have to be consistent w.r.t. how they handle array elements. Otherwise, you'll get a mismatching number of calls to `allocator_traits::construct` and `allocator_traits::destroy`. Concretely, I think for `std::vector` you don't want to treat array elements specially. So I would add a `std::__allocator_destroy(_Alloc&, _Iter, _Sent)` function and call that in the `catch (...)` instead. This should be tested by ensuring that we have a matching number of calls to construct and destroy when we use this algorithm with array elements.
537	Here, I suggest this instead:

template <class _Alloc,
          class _Type,
          class _RawType = typename remove_const<_Type>::type,
          __enable_if_t<
            // using _RawType because of the allocator<T const> extension
            is_trivially_copy_constructible<_RawType>::value &&
            is_trivially_copy_assignable<_RawType>::value &&
            (__is_default_allocator<_Alloc>::value || !__has_construct<_Alloc, _RawType, _Type const&>::value)
          >* = nullptr>
_LIBCPP_HIDE_FROM_ABI _LIBCPP_CONSTEXPR_AFTER_CXX17 _Type*
__uninitialized_allocator_copy(_Alloc&, _Type const* __first1, _Type const* __last1, _Type* __first2) {
  // TODO: Remove the const_cast once we drop support for std::allocator<T const>
  return std::copy(__first1, __last1, const_cast<_RawType*>(__first2));
}

In particular, note the tweaked `enable_if` conditions -- I think this is what we need to be safe here. If the type is not trivially copy constructible, we MUST call its copy constructor (instead of the assignment in `std::copy`), otherwise the optimization is not transparent. If the type is not trivially assignable, then we can also notice that we are doing an assignment instead of a copy-construction here, and so it's not transparent.
543
564	Same comment, we shouldn't treat array types specially here. Treating array types specially was only meaningful for the `std::make_shared<T[N]>(...)` functions because they were specified that way. Otherwise, I would have never bothered to handle array types specially :-).
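
The comments above ask for two properties: a rollback that issues exactly one `destroy` per successful `construct`, and a plain-copy fast path that is only taken when the substitution cannot be observed. A minimal sketch of both, assuming pointer ranges only, might look like this. All names in namespace `sketch` and the test helpers are hypothetical. The fast-path condition is simplified to "the allocator is `std::allocator<T>`" rather than the `__is_default_allocator` / `__has_construct` check suggested above.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <memory>
#include <new>
#include <stdexcept>
#include <type_traits>
#include <utility>

namespace sketch {

// Destroys [first, last) through the allocator: one destroy() per element.
template <class Alloc, class T>
void allocator_destroy(Alloc& alloc, T* first, T* last) {
  for (; first != last; ++first)
    std::allocator_traits<Alloc>::destroy(alloc, first);
}

// Simplified stand-in for the enable_if condition discussed above.
template <class Alloc, class T>
struct can_use_plain_copy
    : std::integral_constant<bool,
                             std::is_trivially_copy_constructible<T>::value &&
                                 std::is_trivially_copy_assignable<T>::value &&
                                 std::is_same<Alloc, std::allocator<T>>::value> {};

// Generic path: construct through the allocator, roll back on failure.
template <class Alloc, class T,
          typename std::enable_if<!can_use_plain_copy<Alloc, T>::value, int>::type = 0>
T* uninitialized_allocator_copy(Alloc& alloc, const T* first, const T* last, T* dest) {
  T* const start = dest;
  try {
    for (; first != last; ++first, (void)++dest)
      std::allocator_traits<Alloc>::construct(alloc, dest, *first);
  } catch (...) {
    sketch::allocator_destroy(alloc, start, dest);
    throw;
  }
  return dest;
}

// Fast path: nothing can observe assignment replacing copy construction.
template <class Alloc, class T,
          typename std::enable_if<can_use_plain_copy<Alloc, T>::value, int>::type = 0>
T* uninitialized_allocator_copy(Alloc&, const T* first, const T* last, T* dest) {
  return std::copy(first, last, dest);
}

}  // namespace sketch

// Hypothetical allocator that counts successful construct() and destroy() calls.
template <class T>
struct CountingAllocator {
  using value_type = T;
  static int constructs;
  static int destroys;
  T* allocate(std::size_t n) { return static_cast<T*>(::operator new(n * sizeof(T))); }
  void deallocate(T* p, std::size_t) { ::operator delete(p); }
  template <class U, class... Args>
  void construct(U* p, Args&&... args) {
    ::new (static_cast<void*>(p)) U(std::forward<Args>(args)...);
    ++constructs;  // not reached if the constructor throws
  }
  template <class U>
  void destroy(U* p) { p->~U(); ++destroys; }
};
template <class T> int CountingAllocator<T>::constructs = 0;
template <class T> int CountingAllocator<T>::destroys = 0;

// Hypothetical element type whose third copy fails.
struct ThrowsOnThirdCopy {
  static int copies;
  ThrowsOnThirdCopy() {}
  ThrowsOnThirdCopy(const ThrowsOnThirdCopy&) {
    if (++copies == 3) throw std::runtime_error("copy failed");
  }
};
int ThrowsOnThirdCopy::copies = 0;

// Two elements are constructed before the failure, so two must be destroyed.
bool rollback_matches_constructs() {
  typedef CountingAllocator<ThrowsOnThirdCopy> Alloc;
  ThrowsOnThirdCopy::copies = 0;
  Alloc::constructs = 0;
  Alloc::destroys = 0;
  Alloc alloc;
  ThrowsOnThirdCopy source[4];
  ThrowsOnThirdCopy* buffer = alloc.allocate(4);
  bool threw = false;
  try {
    sketch::uninitialized_allocator_copy(alloc, source, source + 4, buffer);
  } catch (const std::runtime_error&) {
    threw = true;
  }
  alloc.deallocate(buffer, 4);
  return threw && Alloc::constructs == 2 && Alloc::destroys == 2;
}

// int with std::allocator<int> satisfies the fast-path condition.
bool trivial_types_are_copied() {
  std::allocator<int> alloc;
  const int source[3] = {1, 2, 3};
  int dest[3] = {0, 0, 0};
  int* end = sketch::uninitialized_allocator_copy(alloc, source, source + 3, dest);
  return end == dest + 3 && dest[0] == 1 && dest[1] == 2 && dest[2] == 3;
}
```

The sketch never treats array element types specially, which is the consistency the review asks for.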

This revision now requires changes to proceed.Jul 6 2022, 2:32 PM

Address comments

Remove unrelated changes

Harbormaster completed remote builds in B174127: Diff 442873.Jul 7 2022, 5:56 AM

Fix stuff

Harbormaster completed remote builds in B174163: Diff 442920.Jul 7 2022, 8:26 AM

ldionne added inline comments.Jul 7 2022, 8:37 AM

libcxx/include/__memory/uninitialized_algorithms.h
356	Can you please add a TODO to switch this to a normal left-to-right algorithm and to use `reverse_iterator` from callers?
502–503	Please comment on what this function does.
510	We should destroy in reverse order of construction, it's usually what's expected. Applies everywhere.
515–516	Instead of creating a separate `__transaction` class for C++03, I would do this:

template <class _Alloc, class _Iter>
struct _AllocatorDestroyRange {
  _LIBCPP_CONSTEXPR_AFTER_CXX11 void operator()() const {
    std::__allocator_destroy(__alloc_, __first, __last);
  }
  _Alloc& __alloc_;
  _Iter& __first;
  _Iter& __last;
};

And then I'd use this from the `__uninitialized_FOO` functions as:

__transaction<_AllocatorDestroyRange<_Alloc, _Iter1> > __guard(_AllocatorDestroyRange<_Alloc, _Iter1>(__alloc, __destruct_first, __first2));

Basically, I don't like that we are creating our own local emulation of `std::bind`/`std::bind_back` just for `__transaction`. Another option would be something like

auto __guard = std::__make_transaction(std::bind_back(&__allocator_destroy<_Alloc, _Iter2, _Iter2>, __alloc, __destruct_first, __first2));

However, `bind_back` is not available in C++03 and I'm not 100% sure it would be a good idea to drag in that dependency.
526–530	Perhaps this should be called `__allocator_has_trivial_copy_construct` instead?
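
The guard-based rollback and the reverse destruction order requested above can be sketched outside libc++ like this. `Transaction`, `AllocatorDestroyRangeReverse`, `Tracked` and the helper function are hypothetical stand-ins, not the library's `__transaction` or `_AllocatorDestroyRange`, and the sketch uses C++11 features that the C++03-compatible version in the review cannot rely on.

```cpp
#include <cassert>
#include <cstddef>
#include <memory>
#include <stdexcept>
#include <utility>
#include <vector>

namespace sketch {

// Runs the rollback callable on destruction unless complete() was called.
template <class Rollback>
class Transaction {
public:
  explicit Transaction(Rollback rollback)
      : rollback_(std::move(rollback)), completed_(false) {}
  Transaction(const Transaction&) = delete;
  Transaction& operator=(const Transaction&) = delete;
  void complete() { completed_ = true; }
  ~Transaction() {
    if (!completed_) rollback_();
  }

private:
  Rollback rollback_;
  bool completed_;
};

// Destroys [first, last) back to front. The iterators are held by reference,
// so the functor sees how far construction got when it finally runs.
template <class Alloc, class Iter>
struct AllocatorDestroyRangeReverse {
  Alloc& alloc;
  Iter& first;
  Iter& last;
  void operator()() const {
    for (Iter it = last; it != first;) {
      --it;
      std::allocator_traits<Alloc>::destroy(alloc, std::addressof(*it));
    }
  }
};

template <class Alloc, class In, class Out>
Out uninitialized_allocator_copy(Alloc& alloc, In first, In last, Out dest) {
  Out constructed_begin = dest;
  typedef AllocatorDestroyRangeReverse<Alloc, Out> Rollback;
  Transaction<Rollback> guard(Rollback{alloc, constructed_begin, dest});
  for (; first != last; ++first, (void)++dest)
    std::allocator_traits<Alloc>::construct(alloc, std::addressof(*dest), *first);
  guard.complete();  // success: keep the constructed elements
  return dest;
}

}  // namespace sketch

// Hypothetical element type that records the order in which copies are destroyed.
struct Tracked {
  static std::vector<int>* log;
  static int throw_on_id;
  int id;
  explicit Tracked(int i) : id(i) {}
  Tracked(const Tracked& other) : id(other.id) {
    if (id == throw_on_id) throw std::runtime_error("copy failed");
  }
  ~Tracked() {
    if (log) log->push_back(id);
  }
};
std::vector<int>* Tracked::log = nullptr;
int Tracked::throw_on_id = -1;

// Copies five elements, fails on the one with id 3, and reports which ids
// were destroyed during the rollback, in order.
std::vector<int> destruction_order_after_failed_copy() {
  Tracked source[] = {Tracked(0), Tracked(1), Tracked(2), Tracked(3), Tracked(4)};
  std::allocator<Tracked> alloc;
  Tracked* buffer = alloc.allocate(5);
  std::vector<int> order;
  Tracked::log = &order;
  Tracked::throw_on_id = 3;
  try {
    sketch::uninitialized_allocator_copy(alloc, source, source + 5, buffer);
  } catch (const std::runtime_error&) {
    // expected: the guard has already destroyed the copies made so far
  }
  Tracked::log = nullptr;
  Tracked::throw_on_id = -1;
  alloc.deallocate(buffer, 5);
  return order;
}
```

Holding the iterators by reference is what lets a single guard object cover the whole loop without being updated on every iteration.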

philnik added a child revision: D68365: [libc++] Implement P1004R2 (constexpr std::vector).Jul 7 2022, 8:37 AM

Address comments

Harbormaster completed remote builds in B174553: Diff 443493.Jul 10 2022, 5:25 AM

LGTM with comments applied.

libcxx/include/__memory/uninitialized_algorithms.h
33–34	I don't think we need to add this. We don't use `min()` or `max()` in this file.
501	Nitpick.
525–527
571–573
641	Same, can be removed.
libcxx/include/__utility/transaction.h
89–90	`_LIBCPP_HIDE_FROM_ABI _LIBCPP_CONSTEXPR` ?
91

This revision is now accepted and ready to land.Jul 14 2022, 8:21 AM

Rebased
Address comments

Harbormaster completed remote builds in B175466: Diff 444745.Jul 14 2022, 11:56 AM

Try to fix CI

Harbormaster completed remote builds in B176457: Diff 446086.Jul 20 2022, 4:30 AM

Fix no-exceptions

Harbormaster completed remote builds in B176531: Diff 446189.Jul 20 2022, 10:30 AM

Fix diff

Harbormaster completed remote builds in B176559: Diff 446220.Jul 20 2022, 12:57 PM

Closed by commit rG23cf42e706fb: [libc++] Use uninitialized algorithms for vector (authored by philnik).Jul 20 2022, 1:02 PM

This revision was automatically updated to reflect the committed changes.

philnik added a commit: rG23cf42e706fb: [libc++] Use uninitialized algorithms for vector.

@ldionne looks like this broke the lldb test suite (https://green.lab.llvm.org/green/view/LLDB/job/lldb-cmake/). Could you revert this/fix the test?

Could you maybe give me some more context? Right now I have no idea what the breakage even is, let alone how to fix it.

From the expanded "All failed tests" section on https://green.lab.llvm.org/green/view/LLDB/job/lldb-cmake/45513/testReport/

Assertion failed: (isa<InjectedClassNameType>(Decl->TypeForDecl)), function getInjectedClassNameType, file /Users/buildslave/jenkins/workspace/lldb-cmake/llvm-project/clang/lib/AST/ASTContext.cpp, line 4588.
PLEASE submit a bug report to https://github.com/llvm/llvm-project/issues/ and include the crash backtrace.
Stack dump:
0.	HandleCommand(command = "expr s_vector.push({4})")
1.	<eof> parser at end of file
2.	/Users/buildslave/jenkins/workspace/lldb-cmake/lldb-build/include/c++/v1/stack:236:10: instantiating function definition 'std::stack<C, std::vector<C>>::push'
3.	/Users/buildslave/jenkins/workspace/lldb-cmake/lldb-build/include/c++/v1/vector:577:36: instantiating function definition 'std::vector<C>::push_back'
4.	/Users/buildslave/jenkins/workspace/lldb-cmake/lldb-build/include/c++/v1/vector:718:17: instantiating function definition 'std::vector<C>::__push_back_slow_path<C>'
5.	/Users/buildslave/jenkins/workspace/lldb-cmake/lldb-build/include/c++/v1/vector:700:10: instantiating function definition 'std::vector<C>::__swap_out_circular_buffer'
6.	/Users/buildslave/jenkins/workspace/lldb-cmake/lldb-build/include/c++/v1/__memory/uninitialized_algorithms.h:614:1: instantiating function definition 'std::__uninitialized_allocator_move_if_noexcept<std::allocator<C>, std::reverse_iterator<C *>, std::reverse_iterator<C *>, C, void>'

It doesn't look like libc++ is at fault here. If the goal is to get the test green again ASAP, I would XFAIL the test with a TODO and try to get someone from Clang to take a look, since it seems to be crashing inside Clang.

Reverting the libc++ patch will only hide the issue (like a XFAIL), but it will also be pretty disruptive for us.

Is there any way to convert TestStackFromStdModule.py to a normal reproducer? Or is this specific to LLDB in some way? Sorry, I don't understand what the test is trying to achieve and what it does to do that. I guess it somehow creates a std::stack<C, std::vector<C>> and then does some operations on them? If the problem isn't the code itself I guess I have no way to fix it in this patch.

I tried

#include <cassert>
#include <stack>
#include <vector>

struct C { C(int i_) : i{i_} {} int i; };

int main() {
  std::stack<C, std::vector<C>> s_vector;
  s_vector.push({4});
  s_vector.pop();
  s_vector.size();
  s_vector.top();
  s_vector.emplace(5);
  assert(s_vector.top().i == 5);
}

but that is happy.

@ldionne just by looking at the patch, I can't tell if it's clang or libcxx who is doing the wrong thing (I do see that the stack trace includes this new __uninitialized_allocator_move_if_noexcept function, which is hidden from the ABI, maybe that's part of the problem?). It shouldn't be LLDB's responsibility to track down who is responsible after and it's not healthy for the project to xfail tests that are broken by one of its dependencies and investigate later on, which would just lead to more and more xfails. The LLVM policy states that we should revert patches: "If you break a buildbot in a way which can’t be quickly fixed, please revert." So I think we should do that.

augusto2112 added a reverting change: rG1d057a6d4306: Revert "[libc++] Use uninitialized algorithms for vector".Jul 21 2022, 2:27 PM

It also states that there should be a reproducer (ideally) and more generally a way for the author to debug the issue. It should be simple to obtain a reproducer if it's a libc++ problem. Otherwise the patch is reverted and I still don't know what the bug is, assuming there even is one in this code. If you don't give me that I'll re-land the patch without any meaningful progress, which is useless.

The test reproduces the issue, no?

I have literally no idea what the test is even doing (as I said before). I also don't know how to run it. You have given me literally 0 information regarding the test other than "it fails". If it's a libc++ issue it should be possible to reproduce it without having to run LLDB tests.

The test compiles this code:

#include <list>
#include <stack>
#include <vector>

struct C {
  // Constructor for testing emplace.
  C(int i) : i(i) {};
  int i;
};

int main(int argc, char **argv) {
  // std::deque is the default container.
  std::stack<C> s_deque({{1}, {2}, {3}});
  std::stack<C, std::vector<C>> s_vector({{1}, {2}, {3}});
  std::stack<C, std::list<C>> s_list({{1}, {2}, {3}});
  return 0; // Set break point at this line.
}

Runs lldb and attaches at the line with the " // Set break point at this line." comment. From the lldb console it (among other things):

Turns on importing the std module (settings set target.import-std-module true) - "Import the 'std' C++ module to improve expression parsing involving C++ standard library types."

Evaluates the following expressions: "expr s_vector.push({4})" which constructs a new vector to add to the stack.

You can find the cpp file in the same directory as the test, there's also a Makefile there that specifies how the test is compiled.

If you'd like to experiment manually, you can:

Delete the lldb-test-build.noindex folder in your build directory.

Run "bin/lldb-dotest ../llvm-project/lldb -p TestStackFromStdModule.py"

Navigate to "build/lldb-test-build.noindex/commands/expression/import-std-module/stack/TestStackFromStdModule.test_dsym"

There should be an a.out file there that you can attach with your built lldb and test with.

@augusto2112 Thank you for the information. I've run lldb locally with my patch and it seems to be fine. I ran everything in the test and AFAICT everything ran and produced the correct results. My lldb is compiled from 36c9e9968affac543952e81637a0584a4b708597 without assertions enabled. So the bug either doesn't show in the output (maybe the assertion is just wrong?), or the bug has been introduced within the last 3 weeks. Either way it's pretty clear to me that my code is fine. I'd like to re-land this patch with the LLDB test disabled, since D68365 depends on it and I'd like to get that into LLVM 15. Do you have any objections to this?

Ok, I tested your change with the assert removed and it seems to work fine. It's not great that the assertion is failing though. The ASTContext::getInjectedClassNameType code hasn't been touched since 2010, so I doubt that it's broken (the assert checks that the type of the decl is an InjectedClassNameType. Before your change it's an InjectedClassName; after your change it somehow becomes a Record. Given the function is called getInjectedClassNameType, that assert seems correct to me). If this is urgent you can xfail the test, and if you could include a bug report number with your xfail that'd be great.
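
For readers unfamiliar with the term, the injected-class-name is the class's own name as seen from inside its definition. The sketch below shows the C++ feature only, using a hypothetical `Box` template; it says nothing about how Clang's `ASTContext` stores the corresponding type, beyond what the assertion text above suggests.

```cpp
#include <cassert>
#include <type_traits>
#include <utility>

template <class T>
struct Box {
  T value;
  // Inside the template, the bare name `Box` is the injected-class-name.
  // It refers to the current specialization, i.e. Box<T>.
  Box* self() { return this; }
};

// For Box<int>, the return type written as `Box*` is Box<int>*.
constexpr bool injected_name_means_current_specialization =
    std::is_same<decltype(std::declval<Box<int>&>().self()), Box<int>*>::value;
```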

In Chromium we noticed that this adds 65 MB of debug info to one of our binaries (we noticed because that pushed it over the 4GB limit, so we'll need to do something about that anyway).

I noticed that the commit message has the "what" but not the "why" -- is using "uninitialized algorithms for vector" something mandated by the standard, some sort of optimization, or something else? Any chance vector could be made more lean instead? :-)

ldionne mentioned this in D129048: Rewording the "static_assert" to static assertion.Jul 22 2022, 6:45 AM

In D128146#3669977, @augusto2112 wrote:

It shouldn't be LLDB's responsibility to track down who is responsible after and it's not healthy for the project to xfail tests that are broken by one of its dependencies and investigate later on, which would just lead to more and more xfails.

That is only true if those XFAILs are not investigated and fixed in a timely fashion. And in that case, I would say there is a larger issue that nobody's responsible for fixing those. If we have bugs in Clang/LLDB, we should have folks ready to jump in and investigate them -- otherwise, that is what's unhealthy for the project.

The point we are trying to make is that the libc++ code itself is correct. The tests are passing, it's valid C++ and it does what it should. The fact that it happens to start crashing Clang is an issue, but it doesn't mean that we should prevent ourselves from using this construct. For instance, this is something that users in the wild could write themselves and it would crash Clang just the same. So instead of bending backwards or reverting the patch in libc++, the correct thing to do is to acknowledge that there's a Clang (or LLDB) bug and try to fix it as soon as possible. And in the meantime, to ensure that the CI stays meaningful, mark the test as XFAILing due to that bug.

If this were a minor unimportant patch, I wouldn't be spending so much time arguing -- we all have more important stuff to do. However, this happens to be an incredibly important patch if we want to add support for constexpr std::vector, which we are aiming to land in time for LLVM 15.

Now, that is an actual issue that mandates our attention. Thanks for the heads up. I suspect it has to do with the fact that we instantiate more templates before we get to the memmove optimization in std::copy. @hans Do you have any way to tell what's in the 65 MB of debug information? Or a reproducer so we can try a few things out and see what impact it has on the size of the debug info? I assume this problem wouldn't be visible on a simple reproducer hand-written from scratch.

@philnik Let's re-land this with an XFAIL for the LLDB test and work with the Chromium folks to decrease the amount of debug information this creates. We can do this after the release and cherry-pick back the improvements.

It's not just Chromium, btw. We're seeing more fallout from this commit. My teammates will post specific findings a bit later today.

Hi folks,

We have some code which compiles fine with the version previous to this patch and fails after.
The code compiles fine with godbolt: https://gcc.godbolt.org/z/ToPGG5cMb

But fails when built with clang containing this revision.
Repro compilation command:

clang -stdlib=libc++ -std=gnu++17 \
  -c /tmp/test.cc \
  -o /tmp/test.o

Compiler output:

/tmp/test.cc:5:17: error: assigning to 'float' from incompatible type 'Vx<float, 2>'
  data[Index] = std::forward<LastArg>(last_arg);
                ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
/tmp/test.cc:22:5: note: in instantiation of function template specialization 'SetData<0, 2, float, Vx<float, 2> &>' requested here
    SetData<0, Length, Element>(data_, std::forward<Args>(args)...);
    ^
[redacted]/include/c++/v1/__memory/allocator.h:165:28: note: in instantiation of function template specialization 'Vx<float, 2>::Vx<Vx<float, 2> &>' requested here
        ::new ((void*)__p) _Up(_VSTD::forward<_Args>(__args)...);
                           ^
[redacted]/include/c++/v1/__memory/allocator_traits.h:290:13: note: in instantiation of function template specialization 'std::allocator<Vx<float, 2>>::construct<Vx<float, 2>, Vx<float, 2> &>' requested here
        __a.construct(__p, _VSTD::forward<_Args>(__args)...);
            ^
[redacted]/include/c++/v1/__memory/uninitialized_algorithms.h:536:31: note: in instantiation of function template specialization 'std::allocator_traits<std::allocator<Vx<float, 2>>>::construct<Vx<float, 2>, Vx<float, 2> &, void>' requested here
    allocator_traits<_Alloc>::construct(__alloc, std::__to_address(__first2), *__first1);
                              ^
[redacted]/include/c++/v1/vector:1012:22: note: in instantiation of function template specialization 'std::__uninitialized_allocator_copy<std::allocator<Vx<float, 2>>, Vx<float, 2> *, Vx<float, 2> *, Vx<float, 2> *>' requested here
  __tx.__pos_ = std::__uninitialized_allocator_copy(__alloc(), __first, __last, __tx.__pos_);
                     ^
[redacted]/include/c++/v1/vector:1162:9: note: in instantiation of function template specialization 'std::vector<Vx<float, 2>>::__construct_at_end<Vx<float, 2> *>' requested here
        __construct_at_end(__x.__begin_, __x.__end_, __n);
        ^
/tmp/test.cc:35:16: note: in instantiation of member function 'std::vector<Vx<float, 2>>::vector' requested here
  do_something(vertices);
               ^
1 error generated.

In D128146#3671966, @alexfh wrote:

In D128146#3671687, @ldionne wrote:

In D128146#3671442, @hans wrote:

In Chromium we noticed that this adds 65 MB of debug info to one of our binaries (we noticed because that pushed it over the 4GB limit, so we'll need to do something about that anyway).

Now, that is an actual issue that mandates our attention. Thanks for the heads up. I suspect it has to do with the fact that we instantiate more templates before we get to the memmove optimization in std::copy. @hans Do you have any way to tell what's in the 65 MB of debug information? Or a reproducer so we can try a few things out and see what impact it has on the size of the debug info? I assume this problem wouldn't be visible on a simple reproducer hand-written from scratch.

@philnik Let's re-land this with an XFAIL for the LLDB test and work with the Chromium folks to decrease the amount of debug information this creates. We can do this after the release and cherry-pick back the improvements.

It's not just Chromium, btw. We're seeing more fallout from this commit. My teammates will post specific findings a bit later today.

Hi,

We are seeing a binary size increase of 2-3% because of this commit. The increase comes from both the debug information sections and the text section.
The text section alone grows by 1-2%.

In D128146#3672050, @bgraur wrote:

Hi folks,

We have some code which compiles fine with the version previous to this patch and fails after.
The code compiles fine with godbolt: https://gcc.godbolt.org/z/ToPGG5cMb

But fails when built with clang containing this revision.

I think this is a bug in your code. https://godbolt.org/z/sv8YehKhY fails the same way, but without std::vector. BTW the simplest fix would be to add Vx(Vx& v) : Vx(std::as_const(v)) {}.
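A minimal sketch of the suggested workaround follows. The `Vx` below is a hypothetical stand-in for the reporter's class (only the constructor shapes are taken from the thread), and `second_of_copy` is an illustrative helper. The added `Vx(Vx&)` overload is an exact match for non-const lvalues and, being a non-template, wins the tie against the variadic constructor.

```cpp
#include <cstddef>
#include <utility>

// Hypothetical stand-in for the reporter's Vx; only the constructor shapes matter.
template <typename T, std::size_t N>
struct Vx {
  T data[N];

  Vx(const Vx&) = default;

  // Workaround from the comment above: exact match for non-const lvalues.
  // It beats the template below (non-templates win ties) and delegates to
  // the defaulted copy constructor.
  Vx(Vx& v) : Vx(std::as_const(v)) {}

  // Greedy forwarding constructor: expects one scalar per element.
  template <typename... Args>
  explicit Vx(Args&&... args)
      : data{static_cast<T>(std::forward<Args>(args))...} {}
};

// Copies from a non-const lvalue, which is the case that failed to compile.
inline float second_of_copy() {
  Vx<float, 2> a(1.0f, 2.0f);
  Vx<float, 2> b(a);  // selects Vx(Vx&), not the variadic template
  return b.data[1];
}
```

Note that rvalue sources would still select the template in this sketch, so it addresses only the lvalue copy seen in the error trace.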

@augusto2112 Could you help me with the XFAIL? I guess we don't want to # XFAIL: *, but only something like # XFAIL: clang-assertions. Do you know what exactly I should check in the XFAIL for?

philnik reopened this revision.Jul 22 2022, 1:44 PM

This revision is now accepted and ready to land.Jul 22 2022, 1:44 PM

Try to minimize code size

Fix diff

@joanahalili @hans @alexfh Could you check whether the current patch fixes the binary size problems?

Harbormaster completed remote builds in B177107: Diff 446971.Jul 22 2022, 3:33 PM

In D128146#3672364, @philnik wrote:

@augusto2112 Could you help me with the XFAIL? I guess we don't want to # XFAIL: *, but only something like # XFAIL: clang-assertions. Do you know what exactly I should check in the XFAIL for?

Unfortunately I'm not aware of any way of XFAILing a test only when assertions are enabled.

In D128146#3672212, @philnik wrote:

I think this is a bug in your code. https://godbolt.org/z/sv8YehKhY fails the same way, but without std::vector. BTW the simplest fix would be to add Vx(Vx& v) : Vx(std::as_const(v)) {}.

@philnik , can you please help me understand what is going on with the original example posted by @bgraur ?

The fact is that the example compiled before this patch and failed with this patch. Even if the example is buggy, I still need to understand what the semantics were when it worked. Was it using the wrong ctor? Why did it change with this patch?

I'm seeing a number of tests failing with this patch, so understanding the example might give me a clue where to look.

In the example the constructor

template <typename... Args>
explicit Vx(Args&&... args)

is preferred over the defaulted copy constructor Vx(const Vx&) = default when the right-hand side is not const.
See https://gcc.godbolt.org/z/3GYe9444P
If you have

Vx<int, 5> v1{};
Vx<int, 5> v2{v1};

The reason is that v1 is not const: on line 2, your constructor taking Args&& is a better match than const Vx&, because Args is deduced as Vx& and binding to it requires no added const.
And obviously your constructor cannot construct a Vx<int, 5> from a Vx<int, 5>.

However, if the caller uses a different syntax to trigger copy constructor

Vx<int, 5> v1{};
Vx<int, 5> v2 = v1;

https://gcc.godbolt.org/z/xffnrfWao
This works. Unlike Vx<int, 5> v2{v1};, which is direct-initialization and considers every constructor, Vx<int, 5> v2 = v1; is copy-initialization and considers only non-explicit constructors.
Your constructor is marked explicit, so in this case it is not a candidate and the compiler uses the defaulted copy constructor Vx(const Vx&) = default.
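The difference between the two initialization forms can be observed with a small sketch. `Tracker` and the three helper functions are hypothetical names introduced only for illustration; the type records which constructor ran instead of failing to compile.

```cpp
// Hypothetical type that records which constructor was selected.
struct Tracker {
  enum class Ctor { Default, Copy, Forwarding };
  Ctor used = Ctor::Default;

  Tracker() = default;
  Tracker(const Tracker&) : used(Ctor::Copy) {}

  // Same shape as the constructor discussed above: explicit and variadic.
  template <typename... Args>
  explicit Tracker(Args&&...) : used(Ctor::Forwarding) {}
};

// Direct-initialization from a non-const lvalue: Args deduces to Tracker&,
// which is a better match than const Tracker&.
inline Tracker::Ctor direct_init_from_lvalue() {
  Tracker a;
  Tracker b{a};
  return b.used;
}

// Copy-initialization: explicit constructors are not candidates.
inline Tracker::Ctor copy_init_from_lvalue() {
  Tracker a;
  Tracker b = a;
  return b.used;
}

// Direct-initialization from a const object: both candidates are exact
// matches, and the non-template copy constructor wins the tie.
inline Tracker::Ctor direct_init_from_const() {
  const Tracker a{};
  Tracker b{a};
  return b.used;
}
```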

So it might be that @philnik's patch somehow changes how the copy constructor is invoked. In theory he could stick to the original way, but relying on that sounds very fragile.
Usually, if you need to write a constructor that takes Args&&..., it is best practice to SFINAE it out when sizeof...(Args) == 1 and remove_cvref_t<Arg> is the class itself.
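One way to express that constraint in C++17 is sketched below. The trait `is_self_copy` and the class layout are assumptions made for illustration, and `std::remove_cv_t<std::remove_reference_t<Arg>>` stands in for C++20's `std::remove_cvref_t`.

```cpp
#include <cstddef>
#include <type_traits>
#include <utility>

// Hypothetical trait: true when the pack is exactly one argument whose
// cv/ref-stripped type is Self.
template <typename Self, typename... Args>
struct is_self_copy : std::false_type {};

template <typename Self, typename Arg>
struct is_self_copy<Self, Arg>
    : std::is_same<Self, std::remove_cv_t<std::remove_reference_t<Arg>>> {};

template <typename T, std::size_t N>
struct Vx {
  T data[N];

  Vx(const Vx&) = default;

  // The variadic constructor drops out of overload resolution whenever it
  // would act as a copy or move constructor.
  template <typename... Args,
            std::enable_if_t<!is_self_copy<Vx, Args...>::value, int> = 0>
  explicit Vx(Args&&... args)
      : data{static_cast<T>(std::forward<Args>(args))...} {}
};

// Direct-initialization from a non-const lvalue now uses the copy constructor.
inline float copy_second_element() {
  Vx<float, 2> a(1.0f, 2.0f);
  Vx<float, 2> b{a};
  return b.data[1];
}
```

With this constraint both lvalue and rvalue sources fall back to the copy constructor, so callers do not depend on how the library happens to spell the copy.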

Thanks a lot, @huixie90!

Then it might be that the failing tests I'm seeing were calling one ctor before this patch and another ctor with this patch.

Maybe now this ctor resolution flakiness happens where memcpy versions were called before? Thinking about how to check this...

In D128146#3674260, @eaeltsin wrote:

Thanks a lot, @huixie90!

Then it might be, that the tests I'm seeing failing were calling one ctor before this patch and another ctor with this patch.

Maybe now this ctor resolution flakiness happens where memcpy versions were called before? Thinking about how to check this...

The problem is exactly as @huixie90 described. The reason this issue became apparent with this patch is that we call the constructor during constant evaluation. During runtime the calls still get forwarded to memmove. The interesting part for you is uninitialized_algorithms.h:566-578.

In D128146#3673017, @philnik wrote:

@joanahalili @hans @alexfh Could you check whether the current patch fixes the binary size problems?

It helps a little. With the original version of this (23cf42e706fb) we got:

ld.lld: error: output file too large: 4306465056 bytes

With the new version:

ld.lld: error: output file too large: 4305167360 bytes

Without any of those (at b4722cc4c96e) the binary is 4244939036 bytes

which is very close to the 4 GB limit, so we have to address that anyway, I just wanted to flag this so the growth didn't go unnoticed.
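For reference, the growth implied by the three sizes above can be computed directly. This is only arithmetic on the reported numbers; the constant names are illustrative, and treating the "4 GB limit" as 2^32 bytes is an assumption.

```cpp
#include <cstdint>

// Sizes reported in the comment above, in bytes.
constexpr std::uint64_t kBaseline = 4244939036ULL;  // at b4722cc4c96e
constexpr std::uint64_t kOriginal = 4306465056ULL;  // with 23cf42e706fb
constexpr std::uint64_t kReworked = 4305167360ULL;  // with the new version

// Assumption: the limit mentioned in the thread is 2^32 bytes.
constexpr std::uint64_t kLimit = 1ULL << 32;

constexpr std::uint64_t growth_original = kOriginal - kBaseline;  // 61526020
constexpr std::uint64_t growth_reworked = kReworked - kBaseline;  // 60228324
constexpr std::uint64_t saved_by_rework = kOriginal - kReworked;  // 1297696
constexpr std::uint64_t headroom_before = kLimit - kBaseline;     // 50028260
constexpr std::uint64_t overshoot_after = kReworked - kLimit;     // 10200064
```

Under that assumption the rework recovers roughly 1.3 MB of about 61.5 MB of growth, which is consistent with "it helps a little".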

In D128146#3675833, @hans wrote:

which is very close to the 4 GB limit, so we have to address that anyway, I just wanted to flag this so the growth didn't go unnoticed.

@philnik made some experiments, and the current theory is that the bulk of the increase in debug information is caused by the fact that we now destroy the elements when an exception is thrown. That was previously a bug and it is being fixed by this patch.

Given that the 4 GB limit will have to be addressed regardless, we would like to land this patch (which will unblock constexpr std::vector) and tackle improvements to debug information after the LLVM 15 branch has been created. Depending on the nature of the fix, we may be able to cherry-pick back to LLVM 15.

@hans @joanahalili @alexfh Does that sound reasonable? By the way, thanks for the heads up, this sort of input is super useful in finding things that we would not be able to easily see otherwise.

In D128146#3676615, @ldionne wrote:

@hans @joanahalili @alexfh Does that sound reasonable? By the way, thanks for the heads up, this sort of input is super useful in finding things that we would not be able to easily see otherwise.

Ping -- we'd like to merge this today in time for LLVM 15.

EDIT: Sorry, got into a race condition with @hans's comment.

In D128146#3676615, @ldionne wrote:

In D128146#3675833, @hans wrote:

which is very close to the 4 GB limit, so we have to address that anyway, I just wanted to flag this so the growth didn't go unnoticed.

@philnik made some experiments, and the current theory is that the bulk of the increase in debug information is caused by the fact that we now destroy the elements when an exception is thrown. That was previously a bug and it is being fixed by this patch.

We build with -fno-exceptions, though I'm not sure if that affects this?

Given that the 4 GB limit will have to be addressed regardless, we would like to land this patch (which will unblock constexpr std::vector) and tackle improvements to debug information after the LLVM 15 branch has been created. Depending on the nature of the fix, we may be able to cherry-pick back to LLVM 15.

@hans @joanahalili @alexfh Does that sound reasonable? By the way, thanks for the heads up, this sort of input is super useful in finding things that we would not be able to easily see otherwise.

Sounds reasonable to me, though I can't speak for @joanahalili and @alexfh.

ldionne added inline comments.Jul 26 2022, 6:29 AM

libcxx/include/__memory/uninitialized_algorithms.h
533–534	@philnik Since they say they use `-fno-exceptions`, I assume the issue is that `std::__transaction` doesn't get optimized away when exceptions are disabled (which would make sense because they are probably compiling with lower optimization levels too). Instead, we could use

#ifndef _LIBCPP_NO_EXCEPTIONS
  try {
#endif
    while (__first1 != __last1) {
      allocator_traits<_Alloc>::construct(__alloc, std::__to_address(__first2), *__first1);
      ++__first1;
      ++__first2;
    }
#ifndef _LIBCPP_NO_EXCEPTIONS
  } catch (...) {
    std::__allocator_destroy(__alloc_, std::reverse_iterator<_Iter>(__first2), std::reverse_iterator<_Iter>(__destruct_first));
    throw;
  }
#endif

And ditch `__transaction` altogether. We can then look into improving codegen with `__transaction` separately. Under `-fno-exceptions`, the code above should be really close to what we had before.
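The rollback pattern in that suggestion can be sketched with public APIs only. This is not libc++'s internal code: `uninitialized_alloc_copy`, `Flaky`, and `live_after_failed_copy` are hypothetical names, and the `_LIBCPP_NO_EXCEPTIONS` guards are left out because the demonstration itself needs exceptions.

```cpp
#include <memory>
#include <stdexcept>

// Hypothetical helper: copy-constructs [first, last) into raw storage at out.
// If a constructor throws, the elements built so far are destroyed in reverse
// order and the exception is rethrown.
template <typename Alloc, typename In, typename T>
T* uninitialized_alloc_copy(Alloc& alloc, In first, In last, T* out) {
  T* const start = out;
  try {
    for (; first != last; ++first, (void)++out)
      std::allocator_traits<Alloc>::construct(alloc, out, *first);
  } catch (...) {
    while (out != start)
      std::allocator_traits<Alloc>::destroy(alloc, --out);
    throw;
  }
  return out;
}

// Hypothetical element type that counts live objects and throws on demand.
struct Flaky {
  inline static int live = 0;         // objects currently alive
  inline static int copies_left = 0;  // successful copies before a throw
  Flaky() { ++live; }
  Flaky(const Flaky&) {
    if (copies_left == 0) throw std::runtime_error("copy failed");
    --copies_left;
    ++live;
  }
  ~Flaky() { --live; }
};

// Returns how many destination objects are still alive after a failed copy,
// or -1 if the copy unexpectedly succeeded.
inline int live_after_failed_copy() {
  std::allocator<Flaky> alloc;
  Flaky src[4];            // four live source objects
  Flaky::copies_left = 2;  // the third copy throws
  Flaky* raw = alloc.allocate(4);
  bool threw = false;
  try {
    uninitialized_alloc_copy(alloc, src, src + 4, raw);
  } catch (const std::runtime_error&) {
    threw = true;
  }
  alloc.deallocate(raw, 4);
  return threw ? Flaky::live - 4 : -1;
}
```

Writing the try/catch inline keeps the cleanup path out of a separate guard object, which is the property the comment relies on when exceptions are compiled out.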

Use try/catch instead of __transaction

Fix CI

This LGTM, this should address the debug information issues at least when compiling with -fno-exceptions.

@hans Are you able to confirm this by applying the patch locally?

Harbormaster completed remote builds in B177598: Diff 447680.Jul 26 2022, 8:36 AM

The CI is green -- ignore the failure on AIX, it's unrelated and it has been fixed on main.

This revision was landed with ongoing or failed builds.Jul 26 2022, 8:44 AM

Closed by commit rGf4fb72e6d4ce: [libc++] Use uninitialized algorithms for vector (authored by philnik).

This revision was automatically updated to reflect the committed changes.

philnik added a commit: rGf4fb72e6d4ce: [libc++] Use uninitialized algorithms for vector.

I applied the patch locally and checked again the binary sizes. There are only mild increases which is fine on our end.

In D128146#3680106, @joanahalili wrote:

I applied the patch locally and checked again the binary sizes. There are only mild increases which is fine on our end.

Awesome, thanks. We'll look into __transaction and how we can solve these codegen problems.

augusto2112 mentioned this in rG5ee910fef524: [lldb] Disable TestStackFromStdModule.py.Jul 26 2022, 1:04 PM

In D128146#3679829, @ldionne wrote:

This LGTM, this should address the debug information issues at least when compiling with -fno-exceptions.

@hans Are you able to confirm this by applying the patch locally?

Yes, this works for us. Thanks!

Hello, this changes behavior for the following program. Is that intentional? (It has an easy workaround -- just remove that defaulted ctor, and we only hit it in a single place as far as I know, so it's not a big problem for us -- but I thought I'd check if the change in behavior is intentional)

$ cat test.cc
#include <vector>

class Instruction {
 public:
  int i;
};
class StructuredControlState {
 public:
  StructuredControlState(Instruction* break_merge, Instruction* merge)
      : break_merge_(break_merge), current_merge_(merge) {}
  StructuredControlState(const StructuredControlState&) = default;
  bool InBreakable() const { return break_merge_; }
  bool InStructuredFlow() const { return CurrentMergeId() != 0; }
  uint32_t CurrentMergeId() const;
  uint32_t CurrentMergeHeader() const;
  uint32_t BreakMergeId() const;
  Instruction* BreakMergeInst() const { return break_merge_; }
 private:
  Instruction* break_merge_;
  Instruction* current_merge_;
};
class MergeReturnPass {
  void GenerateState() {
    Instruction inst;
    v.emplace_back(&inst, &inst);
  }
  std::vector<StructuredControlState> v;
};

Before this change:

$ third_party/llvm-build/Release+Asserts/bin/clang -I buildtools/third_party/libc++/trunk/include/ -c test.cc -I buildtools/third_party/libc++ -nostdinc++ -std=c++17 -Wall  -Wdeprecated-copy
# fine

After this change:

$ third_party/llvm-build/Release+Asserts/bin/clang -I buildtools/third_party/libc++/trunk/include/ -c test.cc -I buildtools/third_party/libc++ -nostdinc++ -std=c++17 -Wall  -Wdeprecated-copy
test.cc:11:3: warning: definition of implicit copy assignment operator for 'StructuredControlState' is deprecated because it has a user-declared copy constructor [-Wdeprecated-copy]
  StructuredControlState(const StructuredControlState&) = default;
  ^
buildtools/third_party/libc++/trunk/include/__algorithm/move.h:33:15: note: in implicit copy assignment operator for 'StructuredControlState' first required here
    *__result = std::move(*__first);
              ^
buildtools/third_party/libc++/trunk/include/__algorithm/move.h:52:17: note: in instantiation of function template specialization 'std::__move_impl<StructuredControlState *, StructuredControlState *, StructuredControlState *>' requested here
    return std::__move_impl<_InType*, _InType*, _OutType*>(__first, __last, __result);
                ^
buildtools/third_party/libc++/trunk/include/__algorithm/move.h:84:8: note: in instantiation of function template specialization 'std::__move_impl<StructuredControlState, StructuredControlState, void>' requested here
  std::__move_impl(__last_base, __first_base, __result_first);
       ^
buildtools/third_party/libc++/trunk/include/__algorithm/move.h:94:21: note: in instantiation of function template specialization 'std::__move_impl<StructuredControlState *, StructuredControlState *, 0>' requested here
  auto __ret = std::__move_impl(std::__unwrap_iter(__first), std::__unwrap_iter(__last), std::__unwrap_iter(__result));
                    ^
buildtools/third_party/libc++/trunk/include/__algorithm/move.h:110:15: note: in instantiation of function template specialization 'std::__move<std::reverse_iterator<StructuredControlState *>, std::reverse_iterator<StructuredControlState *>, std::reverse_iterator<StructuredControlState *>>' requested here
  return std::__move(__first, __last, __result).second;
              ^
buildtools/third_party/libc++/trunk/include/__memory/uninitialized_algorithms.h:635:17: note: in instantiation of function template specialization 'std::move<std::reverse_iterator<StructuredControlState *>, std::reverse_iterator<StructuredControlState *>>' requested here
    return std::move(__first1, __last1, __first2);
                ^
buildtools/third_party/libc++/trunk/include/vector:914:27: note: in instantiation of function template specialization 'std::__uninitialized_allocator_move_if_noexcept<std::allocator<StructuredControlState>, std::reverse_iterator<StructuredControlState *>, std::reverse_iterator<StructuredControlState *>, StructuredControlState, void>' requested here
    __v.__begin_   = std::__uninitialized_allocator_move_if_noexcept(
                          ^
buildtools/third_party/libc++/trunk/include/vector:1581:5: note: in instantiation of member function 'std::vector<StructuredControlState>::__swap_out_circular_buffer' requested here
    __swap_out_circular_buffer(__v);
    ^
buildtools/third_party/libc++/trunk/include/vector:1600:9: note: in instantiation of function template specialization 'std::vector<StructuredControlState>::__emplace_back_slow_path<Instruction *, Instruction *>' requested here
        __emplace_back_slow_path(_VSTD::forward<_Args>(__args)...);
        ^
test.cc:25:7: note: in instantiation of function template specialization 'std::vector<StructuredControlState>::emplace_back<Instruction *, Instruction *>' requested here
    v.emplace_back(&inst, &inst);
      ^

y-novikov added a subscriber: y-novikov.Jul 29 2022, 10:36 AM

@thakis I wouldn't say the change is intentional, but the change is definitely expected. Especially, since it looks like -Wdeprecated-copy is basically a rule-of-{0, 3, 5} warning, which you violated by having the copy constructor explicitly defaulted. Although I'm not sure why clang only warns when the constructor is used instead of when the class is declared. If you want you can use clang-tidy with cppcoreguidelines-special-member-functions to ensure that all your classes follow the rule-of-0-or-5.
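As an illustration of the rule-of-0-or-5 point, the class from the report can keep its defaulted copy constructor if the copy assignment operator is declared next to it. The sketch below trims the class to the members needed here; `emplace_many` is a hypothetical helper.

```cpp
#include <cstddef>
#include <type_traits>
#include <vector>

struct Instruction {
  int i;
};

class StructuredControlState {
 public:
  StructuredControlState(Instruction* break_merge, Instruction* merge)
      : break_merge_(break_merge), current_merge_(merge) {}

  StructuredControlState(const StructuredControlState&) = default;
  // Declared alongside the copy constructor, so the library never needs the
  // deprecated implicitly-declared copy assignment operator.
  StructuredControlState& operator=(const StructuredControlState&) = default;

  Instruction* BreakMergeInst() const { return break_merge_; }
  Instruction* CurrentMergeInst() const { return current_merge_; }

 private:
  Instruction* break_merge_;
  Instruction* current_merge_;
};

// Forces several reallocations, which is the path shown in the warning trace.
inline std::size_t emplace_many(std::size_t n) {
  Instruction inst{0};
  std::vector<StructuredControlState> v;
  for (std::size_t i = 0; i < n; ++i) v.emplace_back(&inst, &inst);
  return v.size();
}
```

Removing both declarations (rule of zero) works equally well, which is the workaround mentioned in the report.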

In D128146#3687957, @philnik wrote:

@thakis I wouldn't say the change is intentional, but the change is definitely expected. Especially, since it looks like -Wdeprecated-copy is basically a rule-of-{0, 3, 5} warning, which you violated by having the copy constructor explicitly defaulted. Although I'm not sure why clang only warns when the constructor is used instead of when the class is declared.

Thanks for the reply!

As for the question, I'm guessing it's due to the ratio of false positives to useful instances of the warning. Clang often warns on use instead of on declaration for that reason.

Here is a sample that compiled OK before the change and is now broken: https://godbolt.org/z/djPG94f69

#include <vector>

template <typename B>
struct REAL_TYPEDEF {
  typedef B base_type;
  B v;
  REAL_TYPEDEF() : v(0){}
  explicit REAL_TYPEDEF(B v) : v(v){}

  inline bool operator==(const REAL_TYPEDEF<B>& rhs) const {
    return v == rhs.v;
  }
};

template <typename T>
inline bool operator!=(const T& lhs, const T& rhs) {
  return !(lhs == rhs);
}

namespace zim {

typedef int offset_type;

#define TYPEDEF(NAME, TYPE)                              \
  struct NAME : public REAL_TYPEDEF<TYPE> {              \
    explicit NAME(TYPE v = 0) : REAL_TYPEDEF<TYPE>(v){}  \
  };                                                     \
  static_assert(sizeof(NAME) == sizeof(TYPE), "");

TYPEDEF(offset_t, offset_type)

int foo(int n) {
  std::vector<offset_t> b;

  b.reserve(n);

  return b.size();
}

};  // namespace zim

New output:

In file included from <source>:1:
In file included from /opt/compiler-explorer/clang-trunk-20220802/bin/../include/c++/v1/vector:296:
In file included from /opt/compiler-explorer/clang-trunk-20220802/bin/../include/c++/v1/__split_buffer:24:
In file included from /opt/compiler-explorer/clang-trunk-20220802/bin/../include/c++/v1/memory:858:
In file included from /opt/compiler-explorer/clang-trunk-20220802/bin/../include/c++/v1/__memory/ranges_uninitialized_algorithms.h:22:
/opt/compiler-explorer/clang-trunk-20220802/bin/../include/c++/v1/__memory/uninitialized_algorithms.h:628:21: error: use of overloaded operator '!=' is ambiguous (with operand types 'std::reverse_iterator<zim::offset_t *>' and 'std::reverse_iterator<zim::offset_t *>')
    while (__first1 != __last1) {
           ~~~~~~~~ ^  ~~~~~~~
/opt/compiler-explorer/clang-trunk-20220802/bin/../include/c++/v1/vector:914:27: note: in instantiation of function template specialization 'std::__uninitialized_allocator_move_if_noexcept<std::allocator<zim::offset_t>, std::reverse_iterator<zim::offset_t *>, std::reverse_iterator<zim::offset_t *>, zim::offset_t, void>' requested here
    __v.__begin_   = std::__uninitialized_allocator_move_if_noexcept(
                          ^
/opt/compiler-explorer/clang-trunk-20220802/bin/../include/c++/v1/vector:1501:9: note: in instantiation of member function 'std::vector<zim::offset_t>::__swap_out_circular_buffer' requested here
        __swap_out_circular_buffer(__v);
        ^
<source>:35:5: note: in instantiation of member function 'std::vector<zim::offset_t>::reserve' requested here
  b.reserve(n);
    ^
/opt/compiler-explorer/clang-trunk-20220802/bin/../include/c++/v1/__iterator/reverse_iterator.h:233:1: note: candidate function [with _Iter1 = zim::offset_t *, _Iter2 = zim::offset_t *]
operator!=(const reverse_iterator<_Iter1>& __x, const reverse_iterator<_Iter2>& __y)
^
<source>:16:13: note: candidate function [with T = std::reverse_iterator<zim::offset_t *>]
inline bool operator!=(const T& lhs, const T& rhs) {
            ^
1 error generated.

If I run the following with -O2 -fno-inline, the code after the patch is ~50% slower.
It's even worse with ASan, MSan, and TSan (no -fno-inline needed): the new code calls memcpy for each P, and every call is intercepted.

Somehow, if compiled without -fno-inline and without sanitizers, performance is unaffected.

Is this expected?

#include <vector>

using NodeIndex = size_t;
class N {
 public:
  N(NodeIndex n) : v(n) {}

 private:
  struct P {
    size_t a;
    size_t b;
    size_t c;
  };
  std::vector<P> v;
};

int main() {
  for (int i = 0; i < 1000; ++i) {
    N b(i*10);
    std::vector<N> v(i * 2, b);
  }
}

@eaeltsin I'm not sure your code is supposed to work. It breaks every iterator wrapper AFAICT. The correct way to add an operator!= is to do one of the following:

change lines 15-18 to

template <typename T>
inline bool operator!=(const REAL_TYPEDEF<T>& lhs, const REAL_TYPEDEF<T>& rhs) {
  return !(lhs == rhs);
}

add

inline bool operator!=(const REAL_TYPEDEF<B>& rhs) const {
  return !(*this == rhs);
}

to REAL_TYPEDEF,

use some library that adds it through CRTP, or
use C++20.
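The first option can be checked with a reduced version of the sample. The names below (`RealTypedef`, `zim_sketch`, `reserve_then_size`, `differ`) are hypothetical renamings of the reported code; the point is that the operator is restricted to the wrapper type, so it never competes with the `operator!=` of `std::reverse_iterator`.

```cpp
#include <cstddef>
#include <vector>

template <typename B>
struct RealTypedef {
  B v;
  RealTypedef() : v(0) {}
  explicit RealTypedef(B value) : v(value) {}
  bool operator==(const RealTypedef& rhs) const { return v == rhs.v; }
};

// Constrained to RealTypedef<B> (and classes derived from it), unlike the
// original template that accepted every type T.
template <typename B>
bool operator!=(const RealTypedef<B>& lhs, const RealTypedef<B>& rhs) {
  return !(lhs == rhs);
}

namespace zim_sketch {

struct offset_t : RealTypedef<int> {
  explicit offset_t(int value = 0) : RealTypedef<int>(value) {}
};

// The call that failed in the report: reserve() compares reverse iterators.
inline std::size_t reserve_then_size(std::size_t n) {
  std::vector<offset_t> b;
  b.reserve(n);
  return b.size();
}

// The wrapper's own operator!= still works through derived-to-base deduction.
inline bool differ(int x, int y) { return offset_t(x) != offset_t(y); }

}  // namespace zim_sketch
```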

@vitalybuka This isn't unexpected. -fno-inline disables inlining, which is essential for a lot of other optimizations. Using -fno-inline pretty much defeats the optimizer: https://godbolt.org/z/zrE5o1WK1.

Thanks @philnik!

The code is from libzim - https://github.com/openzim/libzim/blob/966f7b217e9bc36dc30be6d9e46d51a2bfb7091c/src/zim_types.h#L36 . It doesn't look nice to me, and there definitely are multiple ways to make it work.

My question is more about the change in the compiler behavior when selecting the correct overloaded operator. Was the code clearly non standard-compliant before?

In D128146#3699228, @eaeltsin wrote:

Thanks @philnik!

The code is from libzim - https://github.com/openzim/libzim/blob/966f7b217e9bc36dc30be6d9e46d51a2bfb7091c/src/zim_types.h#L36 . It doesn't look nice to me, and there definitely are multiple ways to make it work.

My question is more about the change in the compiler behavior when selecting the correct overloaded operator. Was the code clearly non standard-compliant before?

IIUC the code is not standards-compliant. For example it breaks https://godbolt.org/z/8jbqaY45b. I think http://eel.is/c++draft/constraints#namespace.std-7 is the interesting paragraph here, specifically (a) the overload's declaration depends on at least one user-defined type.

@vitalybuka This isn't unexpected. -fno-inline disables inlining, which is essential for a lot of other optimizations. Using -fno-inline pretty much defeats the optimizer: https://godbolt.org/z/zrE5o1WK1.

I am more concerned about sanitizers

https://godbolt.org/z/1x9qjGG19 Near LBB11_5 we now have one __asan_memcpy call per "P"; before, there was a single call for the entire vector.
I assume some additional improvements in instrumentation are possible, maybe replacing a short fixed-size __asan_memcpy with check/load/store, or even optimizing __asan_memcpy itself.

But are there any ideas for solving this at the libc++ level, so we rely less on optimizations?

dblaikie added a subscriber: dblaikie.Aug 4 2022, 12:35 PM

dblaikie added inline comments.

libcxx/include/__memory/uninitialized_algorithms.h
634	Ah, this is another source of debug info growth - the `std::move(iter, iter, iter)` implementation instantiates `std::pair` (so this change added an extra 2,000 instantiations of `std::pair` to a clang dbg build - and `std::pair` isn't especially light weight with the compressed pair types, all the members, etc.). Any chance the implementation of `std::move` could be changed to avoid using `std::pair` - while I realize that might involve some code duplication if the underlying helpers are used in a few places that want both parts of the paired result, it might still be worthwhile. I'll look into making a prototype.

Oh, I didn't send this, and did the prototype... so results:

It looks like the `__move_impl` doesn't actually use the pair result at all (the ranges-based move does use it, but the code isn't shared - maybe it was at some point) so I removed it, and that got the total `.dwp` change for this uninitialized patch + that move(iter, iter) patch be slightly negative:

    FILE SIZE        VM SIZE
 --------------  --------------
  +0.2%  +571Ki  [ = ]       0    .debug_str.dwo
  +0.2% +26.3Ki  [ = ]       0    .debug_rnglists.dwo
  +0.0% +15.5Ki  [ = ]       0    .debug_str_offsets.dwo
  +0.3% +1.67Ki  [ = ]       0    .debug_loclists.dwo
  -0.2% -17.0Ki  [ = ]       0    .debug_abbrev.dwo
  -0.2%  -697Ki  [ = ]       0    .debug_info.dwo
  -0.0% -99.5Ki  [ = ]       0    TOTAL

Net reduction of about 2k std::pair instantiations, rather than a net increase of about 2k with only the uninitialized patch. though `__compressed_pair`/`__compressed_pair_elem` instances didn't change by much - shrug. 25% (3k down from 4k) fewer `make_pair` instantiations.

Few other minor things of note. the increases in `construct` (5933 -> 6229) and `reverse_iterator` (7827 -> 8786) are probably somewhat unavoidable/expected. (well, I understand the construct ones - oh, the reverse iterator ones might be for the destruction codepath that this patch mentions is a bugfix/intentionally added)

I'll send out the __move pair removal cleanup shortly.

philnik added inline comments.Aug 4 2022, 12:50 PM

libcxx/include/__memory/uninitialized_algorithms.h
634	What do you mean with "the code isn't shared"? `__move` is called from `ranges::move` here. Maybe you looked at some in-between commit or missed the call?

dblaikie added inline comments.Aug 4 2022, 12:53 PM

libcxx/include/__memory/uninitialized_algorithms.h
634	Yeah, I didn't look closely enough. So this'd require some code duplication or conditionality - I'm open to ideas - I'll throw up a straw-man patch at least.

philnik added inline comments.Aug 4 2022, 1:42 PM

libcxx/include/__memory/uninitialized_algorithms.h
634	Could you check whether applying D131198 helps?

In D128146#3699909, @vitalybuka wrote:

@vitalybuka This isn't unexpected. -fno-inline disables inlining, which is essential for a lot of other optimizations. Using -fno-inline pretty much defeats the optimizer: https://godbolt.org/z/zrE5o1WK1.

I am more concerned about sanitizers

https://godbolt.org/z/1x9qjGG19 Near LBB11_5 we have now __asan_memcpy per every "P", before it was for entire vector.
I assume some additional improvement in instrumentation are possible, maybe replacing fixed short asan_memcpy with check/load/store. Or even optimizing asan_memcpy itself.

But still maybe some ideas if it's solvable on libc++ level so we rely less on optimizations?

I suspect this might go away if we manually lowered std::uninitialized_foo to memcpy like we do for std::copy and std::move.

In D128146#3703008, @ldionne wrote:

In D128146#3699909, @vitalybuka wrote:

@vitalybuka This isn't unexpected. -fno-inline disables inlining, which is essential for a lot of other optimizations. Using -fno-inline pretty much defeats the optimizer: https://godbolt.org/z/zrE5o1WK1.

I am more concerned about sanitizers

https://godbolt.org/z/1x9qjGG19 Near LBB11_5 we have now __asan_memcpy per every "P", before it was for entire vector.
I assume some additional improvement in instrumentation are possible, maybe replacing fixed short asan_memcpy with check/load/store. Or even optimizing asan_memcpy itself.

But still maybe some ideas if it's solvable on libc++ level so we rely less on optimizations?

I suspect this might go away if we manually lowered std::uninitialized_foo to memcpy like we do for std::copy and std::move.

(Just in case, I had a patch to do that a while ago: https://reviews.llvm.org/D118329)

dblaikie added inline comments.Aug 5 2022, 5:33 PM

libcxx/include/__memory/uninitialized_algorithms.h
634	After fixing up a few things (there were still a few mentions of `std::pair` in `move.h`, uses of `first` and `second` instead of `__first_` and `__second_`, and `make_pair` replaced with `{}`, but I got the general idea), I ran that and got this comparison:

    FILE SIZE        VM SIZE
 --------------  --------------
  +0.6% +1.48Mi  [ = ]       0    .debug_str.dwo
  +0.1%  +513Ki  [ = ]       0    .debug_info.dwo
  +0.2%  +126Ki  [ = ]       0    .debug_str_offsets.dwo
  +0.3% +34.5Ki  [ = ]       0    .debug_rnglists.dwo
  +0.3% +1.67Ki  [ = ]       0    .debug_loclists.dwo
  -0.2% -16.0Ki  [ = ]       0    .debug_abbrev.dwo
  +0.3% +2.13Mi  [ = ]       0    TOTAL

(This is the diff/growth in `.dwp` between building clang with two different compilers: one that includes the uninitialized patch + the __libcxx_pair_ (along with a bunch of other changes, whatever's changed between our release and testing-the-next-release compiler; I can get specific versions if it's useful, but it looks like most of the growth is due to this uninitialized patch) and one that includes neither.)

Looking at the string count diffs: 2473 strings starting with "__libcpp_pair", and 4133 -> 3309 instances of "make_pair".

So it seems it does certainly lower the cost, but not as low as not using a pair here at all, possibly duplicating the code so there's a separate copy for range-based move.
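The trade-off discussed in this thread can be sketched roughly as follows. This is only an illustration under assumptions: `light_pair`, `copy_impl` and `copy_out` are hypothetical names, not the helpers used in libc++ or in D131198. The idea is that a shared helper returns a plain aggregate instead of `std::pair`, so a ranges-style caller can still read both positions while the classic wrapper keeps only the output iterator.

```cpp
#include <cassert>

// Hypothetical stand-in for a lightweight pair: a plain aggregate with no
// constructors, comparison operators or tuple interface to instantiate.
template <class T1, class T2>
struct light_pair {
  T1 first;
  T2 second;
};

// Shared helper that reports both end positions, as a ranges-style caller needs.
template <class InIter, class OutIter>
light_pair<InIter, OutIter> copy_impl(InIter first, InIter last, OutIter out) {
  for (; first != last; ++first, ++out)
    *out = *first;
  return {first, out};
}

// Classic-style wrapper that only needs the output position.
template <class InIter, class OutIter>
OutIter copy_out(InIter first, InIter last, OutIter out) {
  return copy_impl(first, last, out).second;
}
```

Whether such an aggregate actually reduces debug info depends on the toolchain; the measurements quoted above are the only data points given in this review.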

In D128146#3703292, @var-const wrote:

In D128146#3703008, @ldionne wrote:

In D128146#3699909, @vitalybuka wrote:

@vitalybuka This isn't unexpected. -fno-inline disables inlining, which is essential for a lot of other optimizations. Using -fno-inline pretty much defeats the optimizer: https://godbolt.org/z/zrE5o1WK1.

I am more concerned about sanitizers

https://godbolt.org/z/1x9qjGG19 Near LBB11_5 we have now __asan_memcpy per every "P", before it was for entire vector.
I assume some additional improvement in instrumentation are possible, maybe replacing fixed short asan_memcpy with check/load/store. Or even optimizing asan_memcpy itself.

But still maybe some ideas if it's solvable on libc++ level so we rely less on optimizations?

I suspect this might go away if we manually lowered std::uninitialized_foo to memcpy like we do for std::copy and std::move.

(Just in case, I had a patch to do that a while ago: https://reviews.llvm.org/D118329)

I'm curious to check if this resolves some of the problems we are having now... @var-const, is your patch still applicable at head?

In D128146#3703883, @eaeltsin wrote:

In D128146#3703292, @var-const wrote:

In D128146#3703008, @ldionne wrote:

In D128146#3699909, @vitalybuka wrote:

@vitalybuka This isn't unexpected. -fno-inline disables inlining, which is essential for a lot of other optimizations. Using -fno-inline pretty much defeats the optimizer: https://godbolt.org/z/zrE5o1WK1.

I am more concerned about sanitizers

https://godbolt.org/z/1x9qjGG19 Near LBB11_5 we have now __asan_memcpy per every "P", before it was for entire vector.
I assume some additional improvement in instrumentation are possible, maybe replacing fixed short asan_memcpy with check/load/store. Or even optimizing asan_memcpy itself.

But still maybe some ideas if it's solvable on libc++ level so we rely less on optimizations?

I suspect this might go away if we manually lowered std::uninitialized_foo to memcpy like we do for std::copy and std::move.

(Just in case, I had a patch to do that a while ago: https://reviews.llvm.org/D118329)

I'm curious to check if this resolves some of the problems we are having now... @var-const, is your patch still applicable at head?

Unfortunately, the patch hasn't been rebased for a few months. That part of the code base hasn't changed much in the meantime, but it's likely the patch won't apply cleanly.
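The suggestion of manually lowering the uninitialized algorithms to `memcpy` could look roughly like the sketch below. It is an assumption-laden illustration, not the code from D118329: `uninit_copy_n` and `uninit_copy_n_impl` are hypothetical names, and the element-wise path omits the exception handling discussed elsewhere in this review.

```cpp
#include <cassert>
#include <cstddef>
#include <cstring>
#include <new>
#include <type_traits>

// Trivially copyable types: one memcpy for the whole range, so an
// instrumented build sees a single call instead of one per element.
template <class T>
T* uninit_copy_n_impl(const T* src, std::size_t n, T* dst, std::true_type) {
  if (n != 0)
    std::memcpy(static_cast<void*>(dst), src, n * sizeof(T));
  return dst + n;
}

// Everything else: construct element by element with placement new.
// (No rollback on exceptions here; this sketch keeps only the dispatch.)
template <class T>
T* uninit_copy_n_impl(const T* src, std::size_t n, T* dst, std::false_type) {
  for (std::size_t i = 0; i != n; ++i)
    ::new (static_cast<void*>(dst + i)) T(src[i]);
  return dst + n;
}

// Hypothetical entry point: picks the path at compile time.
template <class T>
T* uninit_copy_n(const T* src, std::size_t n, T* dst) {
  return uninit_copy_n_impl(src, n, dst, std::is_trivially_copyable<T>());
}
```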

vitalybuka mentioned this in rGea42515dadfa: [asan] Faster version of QuickCheckForUnpoisonedRegion.Aug 8 2022, 10:07 PM

dblaikie added inline comments.Aug 9 2022, 9:29 AM

libcxx/include/__memory/uninitialized_algorithms.h
634	I also posted D131082 that helps a bit - gets us back under our 1% growth threshold we track, if that's of interest to upstream?

philnik mentioned this in D133661: [libc++] Improve binary size when using __transaction.Sep 11 2022, 2:36 AM

Revision Contents

Path

Size

libcxx/

include/

CMakeLists.txt

1 line

__algorithm/

equal_range.h

1 line

__hash_table

1 line

__memory/

swap_allocator.h

53 lines

uninitialized_algorithms.h

144 lines

__split_buffer

1 line

__tree

1 line

__utility/

5 lines

1 line

1 line

118 lines

1 line

1 line

19 lines

test/

libcxx/

containers/

sequences/

vector/

asan_throw.pass.cpp

4 lines

memory/

uninitialized_allocator_copy.pass.cpp

67 lines

private_headers.verify.cpp

1 line

std/

containers/

sequences/

vector/

vector.modifiers/

insert_iter_initializer_list.pass.cpp

49 lines

Diff 447726

libcxx/include/CMakeLists.txt

  set(files
  [...]
  __memory/compressed_pair.h
  __memory/concepts.h
  __memory/construct_at.h
  __memory/pointer_traits.h
  __memory/ranges_construct_at.h
  __memory/ranges_uninitialized_algorithms.h
  __memory/raw_storage_iterator.h
  __memory/shared_ptr.h
+ __memory/swap_allocator.h
  __memory/temporary_buffer.h
  __memory/uninitialized_algorithms.h
  __memory/unique_ptr.h
  __memory/uses_allocator.h
  __memory/voidify.h
  __mutex_base
  __node_handle
  __numeric/accumulate.h
  [...]

libcxx/include/__algorithm/equal_range.h

  [...]
  #include <__algorithm/lower_bound.h>
  #include <__algorithm/upper_bound.h>
  #include <__config>
  #include <__functional/identity.h>
  #include <__functional/invoke.h>
  #include <__iterator/advance.h>
  #include <__iterator/distance.h>
  #include <__iterator/iterator_traits.h>
+ #include <__iterator/next.h>
  #include <__type_traits/is_callable.h>
  #include <__type_traits/is_copy_constructible.h>
  #include <__utility/move.h>
  #include <__utility/pair.h>

  #if !defined(_LIBCPP_HAS_NO_PRAGMA_SYSTEM_HEADER)
  # pragma GCC system_header
  #endif
  [...]

libcxx/include/__hash_table

  [...]
  #include <__algorithm/max.h>
  #include <__algorithm/min.h>
  #include <__assert>
  #include <__bits> // __libcpp_clz
  #include <__config>
  #include <__debug>
  #include <__functional/hash.h>
  #include <__iterator/iterator_traits.h>
+ #include <__memory/swap_allocator.h>
  #include <__utility/swap.h>
  #include <cmath>
  #include <initializer_list>
  #include <memory>
  #include <type_traits>

  #if !defined(_LIBCPP_HAS_NO_PRAGMA_SYSTEM_HEADER)
  # pragma GCC system_header
  [...]

libcxx/include/__memory/swap_allocator.h

This file was added.

//===----------------------------------------------------------------------===//
//
// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
// See https://llvm.org/LICENSE.txt for license information.
// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
//
//===----------------------------------------------------------------------===//

#ifndef _LIBCPP___MEMORY_SWAP_ALLOCATOR_H
#define _LIBCPP___MEMORY_SWAP_ALLOCATOR_H

#include <__config>
#include <__memory/allocator_traits.h>
#include <__type_traits/integral_constant.h>
#include <__utility/swap.h>

#if !defined(_LIBCPP_HAS_NO_PRAGMA_SYSTEM_HEADER)
#  pragma GCC system_header
#endif

_LIBCPP_BEGIN_NAMESPACE_STD

philnik (Author, Unsubmitted, Done): This is just copied and re-formatted.

template <typename _Alloc>
_LIBCPP_HIDE_FROM_ABI _LIBCPP_CONSTEXPR_AFTER_CXX11 void __swap_allocator(_Alloc& __a1, _Alloc& __a2, true_type)
#if _LIBCPP_STD_VER > 11
    _NOEXCEPT
#else
    _NOEXCEPT_(__is_nothrow_swappable<_Alloc>::value)
#endif
{
  using _VSTD::swap;
  swap(__a1, __a2);
}

template <typename _Alloc>
inline _LIBCPP_HIDE_FROM_ABI _LIBCPP_CONSTEXPR_AFTER_CXX11 void
__swap_allocator(_Alloc&, _Alloc&, false_type) _NOEXCEPT {}

template <typename _Alloc>
inline _LIBCPP_HIDE_FROM_ABI _LIBCPP_CONSTEXPR_AFTER_CXX11 void __swap_allocator(_Alloc& __a1, _Alloc& __a2)
#if _LIBCPP_STD_VER > 11
    _NOEXCEPT
#else
    _NOEXCEPT_(__is_nothrow_swappable<_Alloc>::value)
#endif
{
  _VSTD::__swap_allocator(
      __a1, __a2, integral_constant<bool, allocator_traits<_Alloc>::propagate_on_container_swap::value>());
}

_LIBCPP_END_NAMESPACE_STD

#endif // _LIBCPP___MEMORY_SWAP_ALLOCATOR_H
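The tag dispatch used by `__swap_allocator` can be tried outside the library with a sketch like the following. All names here (`tagged_alloc`, `swap_alloc`, `swap_alloc_impl`) are hypothetical and exist only for illustration; the point is that the allocators are swapped only when `propagate_on_container_swap` is true.

```cpp
#include <cassert>
#include <cstddef>
#include <memory>
#include <type_traits>
#include <utility>

// Hypothetical allocator whose propagation behaviour is chosen by a template flag.
template <class T, bool Propagate>
struct tagged_alloc {
  using value_type = T;
  using propagate_on_container_swap = std::integral_constant<bool, Propagate>;

  int id; // lets a caller observe whether a swap happened

  explicit tagged_alloc(int i) : id(i) {}
  T* allocate(std::size_t n) { return std::allocator<T>().allocate(n); }
  void deallocate(T* p, std::size_t n) { std::allocator<T>().deallocate(p, n); }
};

// Selected when the allocator propagates on swap.
template <class Alloc>
void swap_alloc_impl(Alloc& a, Alloc& b, std::true_type) {
  using std::swap;
  swap(a, b);
}

// Selected otherwise: the allocators are left untouched.
template <class Alloc>
void swap_alloc_impl(Alloc&, Alloc&, std::false_type) {}

template <class Alloc>
void swap_alloc(Alloc& a, Alloc& b) {
  swap_alloc_impl(a, b, typename std::allocator_traits<Alloc>::propagate_on_container_swap());
}
```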

libcxx/include/__memory/uninitialized_algorithms.h

  // -*- C++ -*-
  //===----------------------------------------------------------------------===//
  //
  // Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
  // See https://llvm.org/LICENSE.txt for license information.
  // SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
  //
  //===----------------------------------------------------------------------===//

  #ifndef _LIBCPP___MEMORY_UNINITIALIZED_ALGORITHMS_H
  #define _LIBCPP___MEMORY_UNINITIALIZED_ALGORITHMS_H

+ #include <__algorithm/copy.h>
+ #include <__algorithm/move.h>
  #include <__config>
  #include <__iterator/iterator_traits.h>
+ #include <__iterator/reverse_iterator.h>
  #include <__memory/addressof.h>
  #include <__memory/allocator_traits.h>
  #include <__memory/construct_at.h>
+ #include <__memory/pointer_traits.h>
  #include <__memory/voidify.h>
+ #include <__type_traits/is_constant_evaluated.h>
  #include <__utility/move.h>
  #include <__utility/pair.h>
  #include <__utility/transaction.h>
  #include <type_traits>

  #if !defined(_LIBCPP_HAS_NO_PRAGMA_SYSTEM_HEADER)
  # pragma GCC system_header
  #endif

  _LIBCPP_BEGIN_NAMESPACE_STD

ldionne (Unsubmitted, Done): I don't think we need to add this. We don't use `min()` or `max()` in this file.

  // This is a simplified version of C++20 `unreachable_sentinel` that doesn't use concepts and thus can be used in any
  // language mode.
  struct __unreachable_sentinel {
    template <class _Iter>
    _LIBCPP_HIDE_FROM_ABI friend _LIBCPP_CONSTEXPR bool operator!=(const _Iter&, __unreachable_sentinel) _NOEXCEPT {
      return true;
    }
  };
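To see why such a sentinel is useful, here is a small self-contained sketch. The names `never_sentinel` and `copy_n_until` are hypothetical; the sketch only assumes that an algorithm takes both a count and an end marker, so passing the sentinel turns the end check into a constant `true` and leaves the count as the only bound.

```cpp
#include <cassert>

// Hypothetical C++11-compatible sentinel: comparing any iterator against it
// always reports "not at the end".
struct never_sentinel {
  template <class Iter>
  friend constexpr bool operator!=(const Iter&, never_sentinel) noexcept {
    return true;
  }
};

// Copies at most n elements, stopping early if `last` is reached first.
template <class InIter, class Sent, class OutIter>
OutIter copy_n_until(InIter first, Sent last, int n, OutIter out) {
  for (; n > 0 && first != last; --n, ++first, ++out)
    *out = *first;
  return out;
}
```

With a real end iterator the loop stops at whichever bound comes first; with `never_sentinel` the compiler can drop the second comparison entirely.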

  [...]

  uninitialized_move_n(_InputIterator __ifirst, _Size __n, _ForwardIterator __ofirst) {
    using _ValueType = typename iterator_traits<_ForwardIterator>::value_type;
    auto __iter_move = [](auto&& __iter) -> decltype(auto) { return _VSTD::move(*__iter); };

    return _VSTD::__uninitialized_move_n<_ValueType>(_VSTD::move(__ifirst), __n, _VSTD::move(__ofirst),
                                                     __unreachable_sentinel(), __iter_move);
  }

+ // TODO: Rewrite this to iterate left to right and use reverse_iterators when calling
  // Destroys every element in the range [first, last) FROM RIGHT TO LEFT using allocator

ldionne (Unsubmitted, Done): Can you please add a TODO to switch this to a normal left-to-right algorithm and to use `reverse_iterator` from callers?

  // destruction. If elements are themselves C-style arrays, they are recursively destroyed
  // in the same manner.
  //
  // This function assumes that destructors do not throw, and that the allocator is bound to
  // the correct type.
  template<class _Alloc, class _BidirIter, class = __enable_if_t<
      __is_cpp17_bidirectional_iterator<_BidirIter>::value
  >>
  _LIBCPP_HIDE_FROM_ABI

ldionne (Unsubmitted, Done): Let's use the same pattern for `__enable_if_t` here, i.e. use a non-type template parameter like you do below.

  constexpr void __allocator_destroy_multidimensional(_Alloc& __alloc, _BidirIter __first, _BidirIter __last) noexcept {
      using _ValueType = typename iterator_traits<_BidirIter>::value_type;
      static_assert(is_same_v<typename allocator_traits<_Alloc>::value_type, _ValueType>,
          "The allocator should already be rebound to the correct type");

      if (__first == __last)
          return;

      if constexpr (is_array_v<_ValueType>) {
          static_assert(!__libcpp_is_unbounded_array<_ValueType>::value,
              "arrays of unbounded arrays don't exist, but if they did we would mess up here");

          using _Element = remove_extent_t<_ValueType>;
          __allocator_traits_rebind_t<_Alloc, _Element> __elem_alloc(__alloc);
          do {
              --__last;
              decltype(auto) __array = *__last;
              std::__allocator_destroy_multidimensional(__elem_alloc, __array, __array + extent_v<_ValueType>);
          } while (__last != __first);
      } else {
          do {
              --__last;
              allocator_traits<_Alloc>::destroy(__alloc, std::addressof(*__last));
          } while (__last != __first);
      }

ldionne (Unsubmitted, Done): We should figure out what clang-format does wrong here, but in the meantime I would rather use the same formatting as lines 364-366.
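The right-to-left, recursive destruction order described for `__allocator_destroy_multidimensional` can be observed with a simplified sketch. This is an assumption-heavy illustration: `tracked`, `destroy_one` and `destroy_backward` are hypothetical names, the sketch uses raw pointers only, and it calls destructors directly instead of going through `allocator_traits::destroy`.

```cpp
#include <cassert>
#include <new>
#include <type_traits>
#include <vector>

// Records the order in which objects are destroyed.
struct tracked {
  int id;
  std::vector<int>* log;
  ~tracked() { log->push_back(id); }
};

template <class T>
void destroy_backward(T* first, T* last);

// Non-array element: run its destructor.
template <class T>
void destroy_one(T* p, std::false_type) {
  p->~T();
}

// Array element: recurse into the array, again from right to left.
template <class T>
void destroy_one(T* p, std::true_type) {
  destroy_backward(*p, *p + std::extent<T>::value);
}

// Destroys [first, last) starting with the last element.
template <class T>
void destroy_backward(T* first, T* last) {
  while (last != first) {
    --last;
    destroy_one(last, std::is_array<T>());
  }
}
```

For a 2x2 array constructed in row-major order, the elements are expected to be destroyed in exactly the reverse of their construction order.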

  // Constructs the object at the given location using the allocator's construct method.
  //
  // If the object being constructed is an array, each element of the array is allocator-constructed,
  // recursively. If an exception is thrown during the construction of an array, the initialized
  // elements are destroyed in reverse order of initialization using allocator destruction.
  //
  // This function assumes that the allocator is bound to the correct type.

  constexpr void __uninitialized_allocator_value_construct_n(_Alloc& __alloc, _BidirIter __it, _Size __n) {
  [...]
      for (; __n != 0; --__n, ++__it) {
          std::__allocator_construct_at(__value_alloc, std::addressof(*__it));
      }
      __guard.__complete();
  }

  #endif // _LIBCPP_STD_VER > 14

+ // Destroy all elements in [__first, __last) from left to right using allocator destruction.

Mordante (Unsubmitted, Not Done): Is this moved too? For reviewing I prefer a stack of 2 (or more) commits where the moving of code is separated from the real changes.

philnik (Author, Unsubmitted, Done): No, this isn't just moved. Here the names and behaviour changed. The algorithms now destroy the elements again if an exception has been thrown.

ldionne (Unsubmitted, Done):

#endif // _LIBCPP_STD_VER > 14

- // Destroy all elements in [__first, __last) from left to right.
+ // Destroy all elements in [__first, __last) from left to right using allocator destruction.
template <class _Alloc, class _Iter, class _Sent>

Nitpick.

template <class _Alloc, class _Iter, class _Sent>

_LIBCPP_HIDE_FROM_ABI _LIBCPP_CONSTEXPR_AFTER_CXX17 void

ldionne (Unsubmitted, Done): Can you please add a comment explaining what `__uninitialized_allocator_copy` & friends do similar to what we do in `__uninitialized_allocator_fill_n` (and others)?

In particular, I think that explaining the exception safety guarantee offered by each algorithm added here is important.

ldionne (Unsubmitted, Done):

// that __first2 can hold at least distance(__first1, __last1) uninitialized elements. If an exception is thrown the
- // already copied elements are destroyed again.
+ // already copied elements are destroyed in reverse order of their construction.
template <class _Alloc, class _Iter1, class _Sent1, class _Iter2>

Please also add a note that array elements are NOT treated specially by this function. We may also want to differentiate between functions in this file that handle arrays vs those that don't, since it's really not obvious from their current names.

ldionne (Unsubmitted, Done): Please comment on what this function does.

__allocator_destroy(_Alloc& __alloc, _Iter __first, _Sent __last) {
  for (; __first != __last; ++__first)
    allocator_traits<_Alloc>::destroy(__alloc, std::__to_address(__first));

ldionne (Unsubmitted, Done): Should we have a `static_assert(__is_cpp17_copy_insertable<...>)` here?

philnik (Author, Unsubmitted, Done): I'm not sure. I'll investigate it later, since it breaks a lot of tests. I think it's either not applicable or the trait is currently broken. It's not used anywhere right now.

}

template <class _Alloc, class _Iter>

class _AllocatorDestroyRangeReverse {

ldionne (Unsubmitted, Done): We should destroy in reverse order of construction, it's usually what's expected. Applies everywhere.

public:

_LIBCPP_HIDE_FROM_ABI _AllocatorDestroyRangeReverse(_Alloc& __alloc, _Iter& __first, _Iter& __last)

ldionne (Unsubmitted, Done): I think you either need to construct array elements recursively *and* destroy them recursively, or not. But construction and destruction has to be consistent w.r.t. how it handles array elements. Otherwise, you'll get a mismatching number of calls to `allocator_traits::construct` and `allocator_traits::destroy`.

Concretely, I think for `std::vector` you don't want to treat array elements specially. So I would add a `std::__allocator_destroy(_Alloc&, _Iter, _Sent)` function and call that in the `catch (...)` instead.

This should be tested by ensuring that we have a matching number of calls to construct and destroy when we use this algorithm with array elements.

: __alloc_(__alloc), __first_(__first), __last_(__last) {}

_LIBCPP_CONSTEXPR_AFTER_CXX11 void operator()() const {

std::__allocator_destroy(__alloc_, std::reverse_iterator<_Iter>(__last_), std::reverse_iterator<_Iter>(__first_));

ldionne (Unsubmitted, Done): Instead of creating a separate `__transaction` class for C++03, I would do this:

template <class _Alloc, class _Iter>
struct _AllocatorDestroyRange {
  _LIBCPP_CONSTEXPR_AFTER_CXX11 void operator()() const {
    std::__allocator_destroy(__alloc_, __first, __last);
  }
  _Alloc& __alloc_;
  _Iter& __first;
  _Iter& __last;
};

And then I'd use this from the `__uninitialized_FOO` functions as:

__transaction<_AllocatorDestroyRange<_Alloc, _Iter1> > __guard(_AllocatorDestroyRange<_Alloc, _Iter1>(__alloc, __destruct_first, __first2));

Basically, I don't like that we are creating our own local emulation of `std::bind`/`std::bind_back` just for `__transaction`.

Another option would be something like

auto __guard = std::__make_transaction(std::bind_back(&__allocator_destroy<_Alloc, _Iter2, _Iter2>, __alloc, __destruct_first, __first2));

However, `bind_back` is not available in C++03 and I'm not 100% sure it would be a good idea to drag in that dependency.

  }

private:
  _Alloc& __alloc_;
  _Iter& __first_;
  _Iter& __last_;
};
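The guard pattern discussed in these comments (a rollback functor holding references, run on scope exit unless the operation completed) can be sketched in isolation. The names `transaction`, `reset_range` and `fill_or_rollback` are hypothetical, and the sketch zeroes `int`s instead of destroying objects so that the effect is easy to check; it is not libc++'s `__transaction`.

```cpp
#include <cassert>
#include <utility>

// Hypothetical scope guard: runs `rollback` on destruction unless complete()
// was called first.
template <class Rollback>
class transaction {
public:
  explicit transaction(Rollback rollback) : rollback_(std::move(rollback)), completed_(false) {}
  transaction(const transaction&) = delete;
  transaction& operator=(const transaction&) = delete;
  ~transaction() {
    if (!completed_)
      rollback_();
  }
  void complete() noexcept { completed_ = true; }

private:
  Rollback rollback_;
  bool completed_;
};

// Rollback functor holding references, so it sees the final iterator positions.
// It undoes the written elements from right to left.
struct reset_range {
  int*& first;
  int*& last;
  void operator()() const {
    for (int* p = last; p != first;)
      *--p = 0;
  }
};

// Writes 1..n into [out, out + n); leaves early without completing at fail_at.
inline bool fill_or_rollback(int* out, int n, int fail_at) {
  int* begin = out;
  transaction<reset_range> guard(reset_range{begin, out});
  for (int i = 0; i != n; ++i, ++out) {
    if (i == fail_at)
      return false; // the guard's destructor undoes the elements written so far
    *out = i + 1;
  }
  guard.complete();
  return true;
}
```

Because the functor stores references rather than copies, it does not need to be updated as the output position advances, which is the same reason `_AllocatorDestroyRangeReverse` takes `_Iter&`.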

// Copy-construct [__first1, __last1) in [__first2, __first2 + N), where N is distance(__first1, __last1).

ldionne (Unsubmitted, Done): Let's introduce `RawType2` for symmetry?

// The caller has to ensure that __first2 can hold at least N uninitialized elements. If an exception is thrown the

ldionne (Unsubmitted, Done):

  _Iter& __last_;
};

- // Copy-construct [__first1, __last1) in [__first2, __first2 + distance(__first1, __last1)). The caller has to ensure
- // that __first2 can hold at least distance(__first1, __last1) uninitialized elements. If an exception is thrown the
+ // Copy-construct [__first1, __last1) in [__first2, __first2 + N), where N is distance(__first1, __last1).
+ //
+ // The caller has to ensure that __first2 can hold at least N uninitialized elements. If an exception is thrown the
// already copied elements are destroyed in reverse order of their construction.
template <class _Alloc, class _Iter1, class _Sent1, class _Iter2>

// already copied elements are destroyed in reverse order of their construction.

template <class _Alloc, class _Iter1, class _Sent1, class _Iter2>

_LIBCPP_HIDE_FROM_ABI _LIBCPP_CONSTEXPR_AFTER_CXX17 _Iter2

ldionne (Unsubmitted, Done): Perhaps this should be called `__allocator_has_trivial_copy_construct` instead?

__uninitialized_allocator_copy(_Alloc& __alloc, _Iter1 __first1, _Sent1 __last1, _Iter2 __first2) {
#ifndef _LIBCPP_NO_EXCEPTIONS
  auto __destruct_first = __first2;
  try {

ldionne (Unsubmitted, Not Done): @philnik Since they say they use `-fno-exceptions`, I assume the issue is that `std::__transaction` doesn't get optimized away when exceptions are disabled (which would make sense because they are probably compiling with lower optimization levels too).

Instead, we could use

#ifndef _LIBCPP_NO_EXCEPTIONS
  try {
#endif
    while (__first1 != __last1) {
        allocator_traits<_Alloc>::construct(__alloc, std::__to_address(__first2), *__first1);
        ++__first1;
        ++__first2;
    }
#ifndef _LIBCPP_NO_EXCEPTIONS
  } catch (...) {
    std::__allocator_destroy(__alloc, std::reverse_iterator<_Iter2>(__first2), std::reverse_iterator<_Iter2>(__destruct_first));
    throw;
  }
#endif

And ditch `__transaction` altogether. We can then look into improving codegen with `__transaction` separately. Under `-fno-exceptions`, the code above should be really close to what we had before.

#endif

while (__first1 != __last1) {

allocator_traits<_Alloc>::construct(__alloc, std::__to_address(__first2), *__first1);

ldionne (Unsubmitted, Done): Here, I suggest this instead:

template <class _Alloc,
          class _Type,
          class _RawType = typename remove_const<_Type>::type,
          __enable_if_t<
            // using _RawType because of the allocator<T const> extension
            is_trivially_copy_constructible<_RawType>::value &&
            is_trivially_copy_assignable<_RawType>::value &&
            (__is_default_allocator<_Alloc>::value || !__has_construct<_Alloc, _RawType*, _Type const&>::value)
          >* = nullptr>
_LIBCPP_HIDE_FROM_ABI _LIBCPP_CONSTEXPR_AFTER_CXX17 _Type*
__uninitialized_allocator_copy(_Alloc&, _Type const* __first1, _Type const* __last1, _Type* __first2) {
  // TODO: Remove the const_cast once we drop support for std::allocator<T const>
  return std::copy(__first1, __last1, const_cast<_RawType*>(__first2));
}

In particular, note the tweaked `enable_if` conditions. I think this is what we need to be safe here. If the type is not trivially copy constructible, we MUST call its copy constructor (instead of the assignment in `std::copy`), otherwise the optimization is not transparent.

If the type is not trivially assignable, then we can also notice that we are doing an assignment instead of a copy-construction here, and so it's not transparent.

      ++__first1;
      ++__first2;
    }
#ifndef _LIBCPP_NO_EXCEPTIONS
  } catch (...) {
    _AllocatorDestroyRangeReverse<_Alloc, _Iter2>(__alloc, __destruct_first, __first2)();

ldionne (Unsubmitted, Done):

// move constructor is noexcept. Otherwise try to copy all elements. If an exception is thrown the already copied
- // elements are destroyed again.
+ // elements are destroyed in reverse order of their construction.
template <class _Alloc, class _Iter1, class _Sent1, class _Iter2>

    throw;
  }
#endif
  return __first2;
}
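The exception-safety guarantee described for `__uninitialized_allocator_copy` (on failure, destroy the already constructed elements in reverse order, so construct and destroy calls match) can be checked with a sketch like this one. `counting_alloc`, `fragile` and `uninit_alloc_copy` are hypothetical names for illustration, and the sketch works on raw pointers only.

```cpp
#include <cassert>
#include <cstddef>
#include <memory>
#include <new>
#include <stdexcept>
#include <utility>

// Hypothetical allocator that counts construct/destroy calls.
template <class T>
struct counting_alloc {
  using value_type = T;
  int* constructed;
  int* destroyed;

  T* allocate(std::size_t n) { return std::allocator<T>().allocate(n); }
  void deallocate(T* p, std::size_t n) { std::allocator<T>().deallocate(p, n); }

  template <class U, class... Args>
  void construct(U* p, Args&&... args) {
    ::new (static_cast<void*>(p)) U(std::forward<Args>(args)...);
    ++*constructed; // only counted if the constructor did not throw
  }
  template <class U>
  void destroy(U* p) {
    p->~U();
    ++*destroyed;
  }
};

// Copy constructor throws once the copy budget is used up.
struct fragile {
  static int copies_left;
  int value;
  explicit fragile(int v) : value(v) {}
  fragile(const fragile& o) : value(o.value) {
    if (copies_left == 0)
      throw std::runtime_error("copy failed");
    --copies_left;
  }
};
int fragile::copies_left = 0;

// Copy-constructs [first, last) into raw storage at out; on failure destroys
// the already constructed elements in reverse order and rethrows.
template <class Alloc, class T>
T* uninit_alloc_copy(Alloc& alloc, const T* first, const T* last, T* out) {
  T* begin = out;
  try {
    for (; first != last; ++first, ++out)
      std::allocator_traits<Alloc>::construct(alloc, out, *first);
  } catch (...) {
    while (out != begin)
      std::allocator_traits<Alloc>::destroy(alloc, --out);
    throw;
  }
  return out;
}
```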

template <class _Alloc, class _Type>
struct __allocator_has_trivial_copy_construct : _Not<__has_construct<_Alloc, _Type*, const _Type&> > {};

template <class _Type>
struct __allocator_has_trivial_copy_construct<allocator<_Type>, _Type> : true_type {};

template <class _Alloc,
          class _Type,
          class _RawType = typename remove_const<_Type>::type,
          __enable_if_t<
              // using _RawType because of the allocator<T const> extension
              is_trivially_copy_constructible<_RawType>::value && is_trivially_copy_assignable<_RawType>::value &&
              __allocator_has_trivial_copy_construct<_Alloc, _RawType>::value>* = nullptr>
_LIBCPP_HIDE_FROM_ABI _LIBCPP_CONSTEXPR_AFTER_CXX17 _Type*
__uninitialized_allocator_copy(_Alloc&, const _Type* __first1, const _Type* __last1, _Type* __first2) {

ldionne (Unsubmitted, Done): Same comment, we shouldn't treat array types specially here.

Treating array types specially was only meaningful for the `std::make_shared<T[N]>(...)` functions because they were specified that way. Otherwise, I would have never bothered to handle array types specially :-).

// TODO: Remove the const_cast once we drop support for std::allocator<T const>

if (__libcpp_is_constant_evaluated()) {

while (__first1 != __last1) {

std::__construct_at(std::__to_address(__first2), *__first1);

ldionneUnsubmitted

Done

If you used std::copy (or std::copy_n), I think you could simplify this and you wouldn't have to handle reverse_iterator specially below.


++__first1;

++__first2;

}

return __first2;

} else {

ldionneUnsubmitted

Done

}

- // Move-construct the elements [__first1, __last1) into [__first2, __first2 + distance(__first1, __last1)) if the

- // move constructor is noexcept. Otherwise try to copy all elements. If an exception is thrown the already copied

+ // Move-construct the elements [__first1, __last1) into [__first2, __first2 + N) if the

+ // move constructor is noexcept, where N is distance(__first1, __last1).

+ //

+ // Otherwise try to copy all elements. If an exception is thrown the already copied

// elements are destroyed in reverse order of their construction.

template <class _Alloc, class _Iter1, class _Sent1, class _Iter2>

ldionne:

return std::copy(__first1, __last1, const_cast<_RawType*>(__first2));

}

// Move-construct the elements [__first1, __last1) into [__first2, __first2 + N)

// if the move constructor is noexcept, where N is distance(__first1, __last1).

// Otherwise try to copy all elements. If an exception is thrown the already copied

// elements are destroyed in reverse order of their construction.

template <class _Alloc, class _Iter1, class _Sent1, class _Iter2>

_LIBCPP_HIDE_FROM_ABI _LIBCPP_CONSTEXPR_AFTER_CXX17 _Iter2 __uninitialized_allocator_move_if_noexcept(

_Alloc& __alloc, _Iter1 __first1, _Sent1 __last1, _Iter2 __first2) {

static_assert(__is_cpp17_move_insertable<_Alloc>::value,

"The specified type does not meet the requirements of Cpp17MoveInsertable");

#ifndef _LIBCPP_NO_EXCEPTIONS

auto __destruct_first = __first2;

try {

#endif

while (__first1 != __last1) {

#ifndef _LIBCPP_NO_EXCEPTIONS

allocator_traits<_Alloc>::construct(__alloc, std::__to_address(__first2), std::move_if_noexcept(*__first1));

#else

allocator_traits<_Alloc>::construct(__alloc, std::__to_address(__first2), std::move(*__first1));

#endif

++__first1;

++__first2;

}

#ifndef _LIBCPP_NO_EXCEPTIONS

} catch (...) {

_AllocatorDestroyRangeReverse<_Alloc, _Iter2>(__alloc, __destruct_first, __first2)();

throw;

}

#endif

return __first2;

}
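The exception guarantee described in the comment above can be sketched with public APIs only. This is a simplified stand-in, not the libc++ implementation: `move_if_noexcept_with_rollback`, `Tracked`, and `run_rollback_demo` are hypothetical names, and the sketch only handles raw pointers. `Tracked` has a potentially throwing copy constructor and no move constructor, so `std::move_if_noexcept` falls back to copying.

```cpp
#include <cassert>
#include <memory>
#include <utility>
#include <vector>

// Construct [first, last) into raw storage at out. If a constructor throws,
// destroy the already constructed elements, newest first, then rethrow.
template <class Alloc, class T>
T* move_if_noexcept_with_rollback(Alloc& alloc, T* first, T* last, T* out) {
  using Traits = std::allocator_traits<Alloc>;
  T* const constructed_begin = out;
  try {
    for (; first != last; ++first, (void)++out)
      Traits::construct(alloc, out, std::move_if_noexcept(*first));
  } catch (...) {
    while (out != constructed_begin)
      Traits::destroy(alloc, --out);
    throw;
  }
  return out;
}

// Hypothetical test type: logs destruction order and, once armed, throws when
// the element with id 2 is copied.
struct Tracked {
  static bool& armed() { static bool flag = false; return flag; }
  static std::vector<int>& destroyed() { static std::vector<int> log; return log; }
  int id;
  explicit Tracked(int i) : id(i) {}
  Tracked(const Tracked& other) : id(other.id) {
    if (armed() && id == 2) throw 1;
  }
  ~Tracked() { destroyed().push_back(id); }
};

// Returns the ids destroyed during the rollback, in destruction order.
std::vector<int> run_rollback_demo() {
  Tracked in[3] = {Tracked(0), Tracked(1), Tracked(2)};
  alignas(Tracked) unsigned char raw[3 * sizeof(Tracked)];
  std::allocator<Tracked> alloc;
  std::vector<int> order;
  Tracked::destroyed().clear();
  Tracked::armed() = true;
  try {
    move_if_noexcept_with_rollback(alloc, in, in + 3, reinterpret_cast<Tracked*>(raw));
  } catch (int) {
    order = Tracked::destroyed();
  }
  Tracked::armed() = false;
  return order;
}
```

In this sketch the third copy throws, so the two elements built before it are destroyed in the order 1, 0, and the output storage is left without live objects.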

template <class _Alloc, class _Type>

struct __allocator_has_trivial_move_construct : _Not<__has_construct<_Alloc, _Type*, _Type&&> > {};

template <class _Type>

struct __allocator_has_trivial_move_construct<allocator<_Type>, _Type> : true_type {};

#ifndef _LIBCPP_COMPILER_GCC

template <

class _Alloc,

class _Iter1,

class _Iter2,

class _Type = typename iterator_traits<_Iter1>::value_type,

class = __enable_if_t<is_trivially_move_constructible<_Type>::value && is_trivially_move_assignable<_Type>::value &&

__allocator_has_trivial_move_construct<_Alloc, _Type>::value> >

_LIBCPP_HIDE_FROM_ABI _LIBCPP_CONSTEXPR_AFTER_CXX17 _Iter1

__uninitialized_allocator_move_if_noexcept(_Alloc&, _Iter1 __first1, _Iter1 __last1, _Iter2 __first2) {

if (__libcpp_is_constant_evaluated()) {

while (__first1 != __last1) {

std::__construct_at(std::__to_address(__first2), std::move(*__first1));

++__first1;

++__first2;

}

return __first2;

} else {

return std::move(__first1, __last1, __first2);

dblaikieUnsubmitted

Not Done

Ah, this is another source of debug info growth: the std::move(iter, iter, iter) implementation instantiates std::pair, so this change added an extra 2,000 instantiations of std::pair to a clang dbg build, and std::pair isn't especially lightweight with the compressed pair types, all the members, etc.

Any chance the implementation of std::move could be changed to avoid using std::pair - while I realize that might involve some code duplication if the underlying helpers are used in a few places that want both parts of the paired result, it might still be worthwhile.

I'll look into making a prototype.

Oh, I didn't send this, and did the prototype... so results: It looks like the __move_impl doesn't actually use the pair result at all (the ranges-based move does use it, but the code isn't shared - maybe it was at some point) so I removed it, and that got the total .dwp change for this uninitialized patch + that move(iter, iter) patch be slightly negative:

   FILE SIZE        VM SIZE
--------------  --------------
 +0.2%  +571Ki  [ = ]       0    .debug_str.dwo
 +0.2% +26.3Ki  [ = ]       0    .debug_rnglists.dwo
 +0.0% +15.5Ki  [ = ]       0    .debug_str_offsets.dwo
 +0.3% +1.67Ki  [ = ]       0    .debug_loclists.dwo
 -0.2% -17.0Ki  [ = ]       0    .debug_abbrev.dwo
 -0.2%  -697Ki  [ = ]       0    .debug_info.dwo
 -0.0% -99.5Ki  [ = ]       0    TOTAL

Net reduction of about 2k std::pair instantiations, rather than a net increase of about 2k with only the uninitialized patch.

That said, __compressed_pair/__compressed_pair_elem instances didn't change by much - *shrug*. There are 25% fewer make_pair instantiations (3k, down from 4k), plus a few other minor things of note.

The increases in construct (5933 -> 6229) and reverse_iterator (7827 -> 8786) are probably somewhat unavoidable/expected. I understand the construct ones; the reverse_iterator ones might come from the destruction codepath that this patch mentions was intentionally added as a bugfix.

I'll send out the __move pair removal cleanup shortly.


philnikAuthorUnsubmitted

Done

What do you mean by "the code isn't shared"? __move is called from ranges::move here. Maybe you looked at some in-between commit or missed the call?


dblaikieUnsubmitted

Not Done

Yeah, I didn't look closely enough. So this'd require some code duplication or conditionality - I'm open to ideas - I'll throw up a straw-man patch at least.


philnikAuthorUnsubmitted

Done

Could you check whether applying D131198 helps?


dblaikieUnsubmitted

Not Done

After fixing up a few things (there were still a few mentions of std::pair in move.h and uses of first and second instead of __first_ and __second_, and I replaced make_pair with {}, but I got the general idea) I ran that and got this comparison:

   FILE SIZE        VM SIZE    
--------------  -------------- 
 +0.6% +1.48Mi  [ = ]       0    .debug_str.dwo
 +0.1%  +513Ki  [ = ]       0    .debug_info.dwo
 +0.2%  +126Ki  [ = ]       0    .debug_str_offsets.dwo
 +0.3% +34.5Ki  [ = ]       0    .debug_rnglists.dwo
 +0.3% +1.67Ki  [ = ]       0    .debug_loclists.dwo
 -0.2% -16.0Ki  [ = ]       0    .debug_abbrev.dwo
 +0.3% +2.13Mi  [ = ]       0    TOTAL

(this is the diff/growth in .dwp between building clang with two different compilers, one that includes the uninitialized patch + the __libcxx_pair_ (along with a bunch of other changes - whatever's changed between our release and testing-the-next-release compiler - I can get specific versions if it's useful, but it looks like most of the growth is due to this uninitialized patch) and one that includes neither)

Looking at the string count diffs - 2473 strings starting with "__libcpp_pair", 4133 -> 3309 instances of "make_pair"

So, it seems it does certainly lower the cost, but not as low as not using a pair here at all/possibly duplicating the code so there's a separate copy for range-based-move.


dblaikieUnsubmitted

Not Done

I also posted D131082 that helps a bit - gets us back under our 1% growth threshold we track, if that's of interest to upstream?


}

#endif // _LIBCPP_COMPILER_GCC

_LIBCPP_END_NAMESPACE_STD _LIBCPP_END_NAMESPACE_STD

#endif // _LIBCPP___MEMORY_UNINITIALIZED_ALGORITHMS_H #endif // _LIBCPP___MEMORY_UNINITIALIZED_ALGORITHMS_H

ldionneUnsubmitted

Done

Same, can be removed.


libcxx/include/__split_buffer

	Show All 13 Lines
	#include <__algorithm/move.h>			#include <__algorithm/move.h>
	#include <__algorithm/move_backward.h>			#include <__algorithm/move_backward.h>
	#include <__config>			#include <__config>
	#include <__iterator/distance.h>			#include <__iterator/distance.h>
	#include <__iterator/iterator_traits.h>			#include <__iterator/iterator_traits.h>
	#include <__iterator/move_iterator.h>			#include <__iterator/move_iterator.h>
	#include <__memory/allocator.h>			#include <__memory/allocator.h>
	#include <__memory/compressed_pair.h>			#include <__memory/compressed_pair.h>
				#include <__memory/swap_allocator.h>
	#include <__utility/forward.h>			#include <__utility/forward.h>
	#include <memory>			#include <memory>
	#include <type_traits>			#include <type_traits>

	#if !defined(_LIBCPP_HAS_NO_PRAGMA_SYSTEM_HEADER)			#if !defined(_LIBCPP_HAS_NO_PRAGMA_SYSTEM_HEADER)
	# pragma GCC system_header			# pragma GCC system_header
	#endif			#endif

	▲ Show 20 Lines • Show All 595 Lines • Show Last 20 Lines

libcxx/include/__tree

	Show All 11 Lines

	#include <__algorithm/min.h>			#include <__algorithm/min.h>
	#include <__assert>			#include <__assert>
	#include <__config>			#include <__config>
	#include <__debug>			#include <__debug>
	#include <__iterator/distance.h>			#include <__iterator/distance.h>
	#include <__iterator/iterator_traits.h>			#include <__iterator/iterator_traits.h>
	#include <__iterator/next.h>			#include <__iterator/next.h>
				#include <__memory/swap_allocator.h>
	#include <__utility/forward.h>			#include <__utility/forward.h>
	#include <__utility/swap.h>			#include <__utility/swap.h>
	#include <limits>			#include <limits>
	#include <memory>			#include <memory>
	#include <stdexcept>			#include <stdexcept>

	#if !defined(_LIBCPP_HAS_NO_PRAGMA_SYSTEM_HEADER)			#if !defined(_LIBCPP_HAS_NO_PRAGMA_SYSTEM_HEADER)
	# pragma GCC system_header			# pragma GCC system_header
	▲ Show 20 Lines • Show All 2,717 Lines • Show Last 20 Lines

libcxx/include/__utility/transaction.h

Show First 20 Lines • Show All 80 Lines • ▼ Show 20 Lines _LIBCPP_CONSTEXPR_AFTER_CXX17 ~__transaction() {

__rollback_(); __rollback_();

} }

private: private:

_Rollback __rollback_; _Rollback __rollback_;

bool __completed_; bool __completed_;

}; };

template <class _Rollback>

_LIBCPP_HIDE_FROM_ABI _LIBCPP_CONSTEXPR __transaction<_Rollback> __make_transaction(_Rollback __rollback) {

ldionneUnsubmitted

Done

_LIBCPP_HIDE_FROM_ABI _LIBCPP_CONSTEXPR ?


return __transaction<_Rollback>(std::move(__rollback));

ldionneUnsubmitted

Done

__transaction<_Rollback> __make_transaction(_Rollback __rollback) {

- return __transaction<_Rollback>(__rollback);

+ return __transaction<_Rollback>(std::move(__rollback));

}

_LIBCPP_END_NAMESPACE_STD

ldionne:

}

_LIBCPP_END_NAMESPACE_STD _LIBCPP_END_NAMESPACE_STD

#endif // _LIBCPP___UTILITY_TRANSACTION_H #endif // _LIBCPP___UTILITY_TRANSACTION_H
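For readers unfamiliar with the pattern, here is a simplified sketch of a rollback guard in the spirit of `__transaction` and `__make_transaction`. The names `transaction`, `make_transaction`, `complete`, and `run_transaction_demo` are hypothetical, and the sketch assumes the rollback runs in the destructor only when the transaction was not marked complete; the excerpt above does not show that check. The factory moves the callable into the guard, matching the `std::move(__rollback)` change suggested in the review.

```cpp
#include <cassert>
#include <utility>

// Runs the stored rollback on destruction unless complete() was called.
template <class Rollback>
class transaction {
public:
  explicit transaction(Rollback rollback)
      : rollback_(std::move(rollback)), completed_(false) {}
  // Moving transfers responsibility; the moved-from guard becomes inert.
  transaction(transaction&& other)
      : rollback_(std::move(other.rollback_)), completed_(other.completed_) {
    other.completed_ = true;
  }
  transaction(const transaction&) = delete;
  transaction& operator=(const transaction&) = delete;

  void complete() { completed_ = true; }

  ~transaction() {
    if (!completed_)
      rollback_();
  }

private:
  Rollback rollback_;
  bool completed_;
};

// Moves the callable into the guard instead of copying it a second time.
template <class Rollback>
transaction<Rollback> make_transaction(Rollback rollback) {
  return transaction<Rollback>(std::move(rollback));
}

// Returns the final value: 42 if committed, 0 if rolled back.
int run_transaction_demo(bool commit) {
  int value = 0;
  {
    auto guard = make_transaction([&value] { value = 0; });
    value = 42;
    if (commit)
      guard.complete();
  }
  return value;
}
```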

libcxx/include/forward_list

	Show First 20 Lines • Show All 182 Lines • ▼ Show 20 Lines
	#include <__algorithm/lexicographical_compare.h>			#include <__algorithm/lexicographical_compare.h>
	#include <__algorithm/min.h>			#include <__algorithm/min.h>
	#include <__assert> // all public C++ headers provide the assertion handler			#include <__assert> // all public C++ headers provide the assertion handler
	#include <__config>			#include <__config>
	#include <__iterator/distance.h>			#include <__iterator/distance.h>
	#include <__iterator/iterator_traits.h>			#include <__iterator/iterator_traits.h>
	#include <__iterator/move_iterator.h>			#include <__iterator/move_iterator.h>
	#include <__iterator/next.h>			#include <__iterator/next.h>
				#include <__memory/swap_allocator.h>
	#include <__utility/forward.h>			#include <__utility/forward.h>
	#include <limits>			#include <limits>
	#include <memory>			#include <memory>
	#include <type_traits>			#include <type_traits>
	#include <version>			#include <version>

	#ifndef _LIBCPP_REMOVE_TRANSITIVE_INCLUDES			#ifndef _LIBCPP_REMOVE_TRANSITIVE_INCLUDES
	# include <algorithm>			# include <algorithm>
	▲ Show 20 Lines • Show All 1,591 Lines • Show Last 20 Lines

libcxx/include/list

	Show First 20 Lines • Show All 188 Lines • ▼ Show 20 Lines
	#include <__debug>			#include <__debug>
	#include <__format/enable_insertable.h>			#include <__format/enable_insertable.h>
	#include <__iterator/distance.h>			#include <__iterator/distance.h>
	#include <__iterator/iterator_traits.h>			#include <__iterator/iterator_traits.h>
	#include <__iterator/move_iterator.h>			#include <__iterator/move_iterator.h>
	#include <__iterator/next.h>			#include <__iterator/next.h>
	#include <__iterator/prev.h>			#include <__iterator/prev.h>
	#include <__iterator/reverse_iterator.h>			#include <__iterator/reverse_iterator.h>
				#include <__memory/swap_allocator.h>
	#include <__utility/forward.h>			#include <__utility/forward.h>
	#include <__utility/move.h>			#include <__utility/move.h>
	#include <__utility/swap.h>			#include <__utility/swap.h>
	#include <limits>			#include <limits>
	#include <memory>			#include <memory>
	#include <type_traits>			#include <type_traits>
	#include <version>			#include <version>

	▲ Show 20 Lines • Show All 2,162 Lines • Show Last 20 Lines

libcxx/include/memory

Show First 20 Lines • Show All 879 Lines • ▼ Show 20 Lines
#include <compare>		#include <compare>

#if !defined(_LIBCPP_HAS_NO_PRAGMA_SYSTEM_HEADER)		#if !defined(_LIBCPP_HAS_NO_PRAGMA_SYSTEM_HEADER)
# pragma GCC system_header		# pragma GCC system_header
#endif		#endif

_LIBCPP_BEGIN_NAMESPACE_STD		_LIBCPP_BEGIN_NAMESPACE_STD

template <class _Alloc, class _Ptr>
_LIBCPP_INLINE_VISIBILITY
void __construct_forward_with_exception_guarantees(_Alloc& __a, _Ptr __begin1, _Ptr __end1, _Ptr& __begin2) {
static_assert(__is_cpp17_move_insertable<_Alloc>::value,
"The specified type does not meet the requirements of Cpp17MoveInsertable");
typedef allocator_traits<_Alloc> _Traits;
for (; __begin1 != __end1; ++__begin1, (void)++__begin2) {
_Traits::construct(__a, _VSTD::__to_address(__begin2),
#ifdef _LIBCPP_NO_EXCEPTIONS
_VSTD::move(*__begin1)
#else
_VSTD::move_if_noexcept(*__begin1)
#endif
);
}
}

template <class _Alloc, class _Tp, typename enable_if<
(__is_default_allocator<_Alloc>::value || !__has_construct<_Alloc, _Tp*, _Tp>::value) &&
is_trivially_move_constructible<_Tp>::value
>::type>
_LIBCPP_INLINE_VISIBILITY
void __construct_forward_with_exception_guarantees(_Alloc&, _Tp* __begin1, _Tp* __end1, _Tp*& __begin2) {
ptrdiff_t _Np = __end1 - __begin1;
if (_Np > 0) {
_VSTD::memcpy(__begin2, __begin1, _Np * sizeof(_Tp));
__begin2 += _Np;
}
}

template <class _Alloc, class _Iter, class _Ptr>
_LIBCPP_INLINE_VISIBILITY
void __construct_range_forward(_Alloc& __a, _Iter __begin1, _Iter __end1, _Ptr& __begin2) {
typedef allocator_traits<_Alloc> _Traits;
for (; __begin1 != __end1; ++__begin1, (void) ++__begin2) {
_Traits::construct(__a, _VSTD::__to_address(__begin2), *__begin1);
}
}

template <class _Alloc, class _Source, class _Dest,
class _RawSource = typename remove_const<_Source>::type,
class _RawDest = typename remove_const<_Dest>::type,
class =
typename enable_if<
is_trivially_copy_constructible<_Dest>::value &&
is_same<_RawSource, _RawDest>::value &&
(__is_default_allocator<_Alloc>::value || !__has_construct<_Alloc, _Dest*, _Source&>::value)
>::type>
_LIBCPP_INLINE_VISIBILITY
void __construct_range_forward(_Alloc&, _Source* __begin1, _Source* __end1, _Dest*& __begin2) {
ptrdiff_t _Np = __end1 - __begin1;
if (_Np > 0) {
_VSTD::memcpy(const_cast<_RawDest*>(__begin2), __begin1, _Np * sizeof(_Dest));
__begin2 += _Np;
}
}

template <class _Alloc, class _Ptr>
_LIBCPP_INLINE_VISIBILITY
void __construct_backward_with_exception_guarantees(_Alloc& __a, _Ptr __begin1, _Ptr __end1, _Ptr& __end2) {
static_assert(__is_cpp17_move_insertable<_Alloc>::value,
"The specified type does not meet the requirements of Cpp17MoveInsertable");
typedef allocator_traits<_Alloc> _Traits;
while (__end1 != __begin1) {
_Traits::construct(__a, _VSTD::__to_address(__end2 - 1),
#ifdef _LIBCPP_NO_EXCEPTIONS
_VSTD::move(*--__end1)
#else
_VSTD::move_if_noexcept(*--__end1)
#endif
);
--__end2;
}
}

template <class _Alloc, class _Tp, class = typename enable_if<
(__is_default_allocator<_Alloc>::value || !__has_construct<_Alloc, _Tp*, _Tp>::value) &&
is_trivially_move_constructible<_Tp>::value
>::type>
_LIBCPP_INLINE_VISIBILITY
void __construct_backward_with_exception_guarantees(_Alloc&, _Tp* __begin1, _Tp* __end1, _Tp*& __end2) {
ptrdiff_t _Np = __end1 - __begin1;
__end2 -= _Np;
if (_Np > 0)
_VSTD::memcpy(static_cast<void*>(__end2), static_cast<void const*>(__begin1), _Np * sizeof(_Tp));
}

struct __destruct_n		struct __destruct_n
{		{
private:		private:
size_t __size_;		size_t __size_;

template <class _Tp>		template <class _Tp>
_LIBCPP_INLINE_VISIBILITY void __process(_Tp* __p, false_type) _NOEXCEPT		_LIBCPP_INLINE_VISIBILITY void __process(_Tp* __p, false_type) _NOEXCEPT
{for (size_t __i = 0; __i < __size_; ++__i, ++__p) __p->~_Tp();}		{for (size_t __i = 0; __i < __size_; ++__i, ++__p) __p->~_Tp();}
Show All 25 Lines	public:

template <class _Tp>		template <class _Tp>
_LIBCPP_INLINE_VISIBILITY void operator()(_Tp* __p) _NOEXCEPT		_LIBCPP_INLINE_VISIBILITY void operator()(_Tp* __p) _NOEXCEPT
{__process(__p, integral_constant<bool, is_trivially_destructible<_Tp>::value>());}		{__process(__p, integral_constant<bool, is_trivially_destructible<_Tp>::value>());}
};		};

_LIBCPP_FUNC_VIS void* align(size_t __align, size_t __sz, void*& __ptr, size_t& __space);		_LIBCPP_FUNC_VIS void* align(size_t __align, size_t __sz, void*& __ptr, size_t& __space);

// --- Helper for container swap --
template <typename _Alloc>
_LIBCPP_HIDE_FROM_ABI _LIBCPP_CONSTEXPR_AFTER_CXX11
void __swap_allocator(_Alloc & __a1, _Alloc & __a2, true_type)
#if _LIBCPP_STD_VER > 11
_NOEXCEPT
#else
_NOEXCEPT_(__is_nothrow_swappable<_Alloc>::value)
#endif
{
using _VSTD::swap;
swap(__a1, __a2);
}

template <typename _Alloc>
inline _LIBCPP_HIDE_FROM_ABI _LIBCPP_CONSTEXPR_AFTER_CXX11
void __swap_allocator(_Alloc &, _Alloc &, false_type) _NOEXCEPT {}

template <typename _Alloc>
inline _LIBCPP_HIDE_FROM_ABI _LIBCPP_CONSTEXPR_AFTER_CXX11
void __swap_allocator(_Alloc & __a1, _Alloc & __a2)
#if _LIBCPP_STD_VER > 11
_NOEXCEPT
#else
_NOEXCEPT_(__is_nothrow_swappable<_Alloc>::value)
#endif
{
_VSTD::__swap_allocator(__a1, __a2,
integral_constant<bool, allocator_traits<_Alloc>::propagate_on_container_swap::value>());
}
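The tag dispatch used by `__swap_allocator` can be reproduced with public APIs. The following is a sketch under stated assumptions: `swap_allocator`, `swap_allocator_impl`, `TaggedAlloc`, and `ids_after_swap` are hypothetical names, and `TaggedAlloc` is a minimal allocator carrying an id so the effect of the swap is observable.

```cpp
#include <cassert>
#include <cstddef>
#include <memory>
#include <type_traits>
#include <utility>

// Selected when propagate_on_container_swap is true: really swap.
template <class Alloc>
void swap_allocator_impl(Alloc& a, Alloc& b, std::true_type) {
  using std::swap;
  swap(a, b);
}

// Selected when propagate_on_container_swap is false: do nothing.
template <class Alloc>
void swap_allocator_impl(Alloc&, Alloc&, std::false_type) {}

template <class Alloc>
void swap_allocator(Alloc& a, Alloc& b) {
  swap_allocator_impl(
      a, b, typename std::allocator_traits<Alloc>::propagate_on_container_swap());
}

// Hypothetical minimal allocator whose propagation trait is a parameter.
template <class T, class Propagate>
struct TaggedAlloc {
  using value_type = T;
  using propagate_on_container_swap = Propagate;
  int id;
  explicit TaggedAlloc(int i) : id(i) {}
  T* allocate(std::size_t n) { return std::allocator<T>().allocate(n); }
  void deallocate(T* p, std::size_t n) { std::allocator<T>().deallocate(p, n); }
};

// Returns the ids of the two allocators after calling swap_allocator.
template <class Propagate>
std::pair<int, int> ids_after_swap() {
  TaggedAlloc<int, Propagate> a(1), b(2);
  swap_allocator(a, b);
  return std::make_pair(a.id, b.id);
}
```

With `std::true_type` the ids are exchanged; with `std::false_type` both allocators keep their original id.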

template <typename _Alloc, typename _Traits=allocator_traits<_Alloc> >		template <typename _Alloc, typename _Traits=allocator_traits<_Alloc> >
struct __noexcept_move_assign_container : public integral_constant<bool,		struct __noexcept_move_assign_container : public integral_constant<bool,
_Traits::propagate_on_container_move_assignment::value		_Traits::propagate_on_container_move_assignment::value
#if _LIBCPP_STD_VER > 14		#if _LIBCPP_STD_VER > 14
|| _Traits::is_always_equal::value		|| _Traits::is_always_equal::value
#else		#else
&& is_nothrow_move_assignable<_Alloc>::value		&& is_nothrow_move_assignable<_Alloc>::value
#endif		#endif
▲ Show 20 Lines • Show All 85 Lines • Show Last 20 Lines

libcxx/include/module.modulemap.in

Show First 20 Lines • Show All 826 Lines • ▼ Show 20 Lines	module __memory {
module compressed_pair { private header "__memory/compressed_pair.h" }		module compressed_pair { private header "__memory/compressed_pair.h" }
module concepts { private header "__memory/concepts.h" }		module concepts { private header "__memory/concepts.h" }
module construct_at { private header "__memory/construct_at.h" }		module construct_at { private header "__memory/construct_at.h" }
module pointer_traits { private header "__memory/pointer_traits.h" }		module pointer_traits { private header "__memory/pointer_traits.h" }
module ranges_construct_at { private header "__memory/ranges_construct_at.h" }		module ranges_construct_at { private header "__memory/ranges_construct_at.h" }
module ranges_uninitialized_algorithms { private header "__memory/ranges_uninitialized_algorithms.h" }		module ranges_uninitialized_algorithms { private header "__memory/ranges_uninitialized_algorithms.h" }
module raw_storage_iterator { private header "__memory/raw_storage_iterator.h" }		module raw_storage_iterator { private header "__memory/raw_storage_iterator.h" }
module shared_ptr { private header "__memory/shared_ptr.h" }		module shared_ptr { private header "__memory/shared_ptr.h" }
		module swap_allocator { private header "__memory/swap_allocator.h" }
module temporary_buffer { private header "__memory/temporary_buffer.h" }		module temporary_buffer { private header "__memory/temporary_buffer.h" }
module uninitialized_algorithms { private header "__memory/uninitialized_algorithms.h" }		module uninitialized_algorithms { private header "__memory/uninitialized_algorithms.h" }
module unique_ptr { private header "__memory/unique_ptr.h" }		module unique_ptr { private header "__memory/unique_ptr.h" }
module uses_allocator { private header "__memory/uses_allocator.h" }		module uses_allocator { private header "__memory/uses_allocator.h" }
module voidify { private header "__memory/voidify.h" }		module voidify { private header "__memory/voidify.h" }
}		}
}		}
module mutex {		module mutex {
▲ Show 20 Lines • Show All 544 Lines • Show Last 20 Lines

libcxx/include/string

	Show First 20 Lines • Show All 526 Lines • ▼ Show 20 Lines
	#include <__functional/hash.h>			#include <__functional/hash.h>
	#include <__functional/unary_function.h>			#include <__functional/unary_function.h>
	#include <__ios/fpos.h>			#include <__ios/fpos.h>
	#include <__iterator/distance.h>			#include <__iterator/distance.h>
	#include <__iterator/iterator_traits.h>			#include <__iterator/iterator_traits.h>
	#include <__iterator/reverse_iterator.h>			#include <__iterator/reverse_iterator.h>
	#include <__iterator/wrap_iter.h>			#include <__iterator/wrap_iter.h>
	#include <__memory/allocate_at_least.h>			#include <__memory/allocate_at_least.h>
				#include <__memory/swap_allocator.h>
	#include <__string/char_traits.h>			#include <__string/char_traits.h>
	#include <__string/extern_template_lists.h>			#include <__string/extern_template_lists.h>
	#include <__utility/auto_cast.h>			#include <__utility/auto_cast.h>
	#include <__utility/move.h>			#include <__utility/move.h>
	#include <__utility/swap.h>			#include <__utility/swap.h>
	#include <__utility/unreachable.h>			#include <__utility/unreachable.h>
	#include <climits>			#include <climits>
	#include <cstdint>			#include <cstdint>
	▲ Show 20 Lines • Show All 4,173 Lines • Show Last 20 Lines

libcxx/include/vector

	Show First 20 Lines • Show All 285 Lines • ▼ Show 20 Lines
	#include <__format/enable_insertable.h>			#include <__format/enable_insertable.h>
	#include <__functional/hash.h>			#include <__functional/hash.h>
	#include <__functional/unary_function.h>			#include <__functional/unary_function.h>
	#include <__iterator/advance.h>			#include <__iterator/advance.h>
	#include <__iterator/iterator_traits.h>			#include <__iterator/iterator_traits.h>
	#include <__iterator/reverse_iterator.h>			#include <__iterator/reverse_iterator.h>
	#include <__iterator/wrap_iter.h>			#include <__iterator/wrap_iter.h>
	#include <__memory/allocate_at_least.h>			#include <__memory/allocate_at_least.h>
				#include <__memory/pointer_traits.h>
				#include <__memory/swap_allocator.h>
	#include <__split_buffer>			#include <__split_buffer>
	#include <__utility/forward.h>			#include <__utility/forward.h>
	#include <__utility/move.h>			#include <__utility/move.h>
	#include <__utility/swap.h>			#include <__utility/swap.h>
	#include <climits>			#include <climits>
	#include <cstdlib>			#include <cstdlib>
	#include <cstring>			#include <cstring>
	#include <iosfwd> // for forward declaration of vector			#include <iosfwd> // for forward declaration of vector
	▲ Show 20 Lines • Show All 588 Lines • ▼ Show 20 Lines
	vector(_InputIterator, _InputIterator, _Alloc)			vector(_InputIterator, _InputIterator, _Alloc)
	-> vector<__iter_value_type<_InputIterator>, _Alloc>;			-> vector<__iter_value_type<_InputIterator>, _Alloc>;
	#endif			#endif

	template <class _Tp, class _Allocator>			template <class _Tp, class _Allocator>
	void			void
	vector<_Tp, _Allocator>::__swap_out_circular_buffer(__split_buffer<value_type, allocator_type&>& __v)			vector<_Tp, _Allocator>::__swap_out_circular_buffer(__split_buffer<value_type, allocator_type&>& __v)
	{			{

	__annotate_delete();			__annotate_delete();
	_VSTD::__construct_backward_with_exception_guarantees(this->__alloc(), this->__begin_, this->__end_, __v.__begin_);			using _RevIter = std::reverse_iterator<pointer>;
				__v.__begin_ = std::__uninitialized_allocator_move_if_noexcept(
				__alloc(), _RevIter(__end_), _RevIter(__begin_), _RevIter(__v.__begin_))
				ldionne (Done): The code didn't destroy the new elements before in case of a failure, but now it does. I wonder whether the original code was written that way on purpose?
				.base();
	_VSTD::swap(this->__begin_, __v.__begin_);			_VSTD::swap(this->__begin_, __v.__begin_);
	_VSTD::swap(this->__end_, __v.__end_);			_VSTD::swap(this->__end_, __v.__end_);
	_VSTD::swap(this->__end_cap(), __v.__end_cap());			_VSTD::swap(this->__end_cap(), __v.__end_cap());
	__v.__first_ = __v.__begin_;			__v.__first_ = __v.__begin_;
	__annotate_new(size());			__annotate_new(size());
	std::__debug_db_invalidate_all(this);			std::__debug_db_invalidate_all(this);
	}			}

	template <class _Tp, class _Allocator>			template <class _Tp, class _Allocator>
	typename vector<_Tp, _Allocator>::pointer			typename vector<_Tp, _Allocator>::pointer
	vector<_Tp, _Allocator>::__swap_out_circular_buffer(__split_buffer<value_type, allocator_type&>& __v, pointer __p)			vector<_Tp, _Allocator>::__swap_out_circular_buffer(__split_buffer<value_type, allocator_type&>& __v, pointer __p)
	{			{
	__annotate_delete();			__annotate_delete();
	pointer __r = __v.__begin_;			pointer __r = __v.__begin_;
	_VSTD::__construct_backward_with_exception_guarantees(this->__alloc(), this->__begin_, __p, __v.__begin_);			using _RevIter = std::reverse_iterator<pointer>;
	_VSTD::__construct_forward_with_exception_guarantees(this->__alloc(), __p, this->__end_, __v.__end_);			__v.__begin_ = std::__uninitialized_allocator_move_if_noexcept(
				__alloc(), _RevIter(__p), _RevIter(__begin_), _RevIter(__v.__begin_))
				.base();
				__v.__end_ = std::__uninitialized_allocator_move_if_noexcept(__alloc(), __p, __end_, __v.__end_);
	_VSTD::swap(this->__begin_, __v.__begin_);			_VSTD::swap(this->__begin_, __v.__begin_);
	_VSTD::swap(this->__end_, __v.__end_);			_VSTD::swap(this->__end_, __v.__end_);
	_VSTD::swap(this->__end_cap(), __v.__end_cap());			_VSTD::swap(this->__end_cap(), __v.__end_cap());
	__v.__first_ = __v.__begin_;			__v.__first_ = __v.__begin_;
	__annotate_new(size());			__annotate_new(size());
	std::__debug_db_invalidate_all(this);			std::__debug_db_invalidate_all(this);
	return __r;			return __r;
	}			}
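The reverse-iterator trick in the new `__swap_out_circular_buffer` may be easier to follow in isolation. The sketch below uses `std::uninitialized_copy` in place of the internal `__uninitialized_allocator_move_if_noexcept`, so it copies rather than moves; `uninitialized_copy_backward`, `destroy_range`, and `backward_copy_demo` are hypothetical names. Wrapping all three pointers in `std::reverse_iterator` fills the destination from its end towards its beginning, and `.base()` on the result yields the pointer to the first constructed element.

```cpp
#include <cassert>
#include <iterator>
#include <memory>
#include <string>

// Copy-construct [first, last) so that the copies end right before dest_end.
// Returns a pointer to the first constructed element.
template <class T>
T* uninitialized_copy_backward(T* first, T* last, T* dest_end) {
  using RevIter = std::reverse_iterator<T*>;
  return std::uninitialized_copy(RevIter(last), RevIter(first), RevIter(dest_end)).base();
}

// Destroy the objects in [first, last).
template <class T>
void destroy_range(T* first, T* last) {
  for (; first != last; ++first)
    first->~T();
}

// Returns the constructed elements joined in storage order, or an empty string
// if the returned begin pointer is not where it is expected to be.
std::string backward_copy_demo() {
  std::string src[3] = {"a", "b", "c"};
  alignas(std::string) unsigned char raw[5 * sizeof(std::string)];
  std::string* storage = reinterpret_cast<std::string*>(raw);
  std::string* storage_end = storage + 5;

  std::string* new_begin = uninitialized_copy_backward(src, src + 3, storage_end);

  std::string joined;
  for (std::string* p = new_begin; p != storage_end; ++p)
    joined += *p;
  bool position_ok = (new_begin == storage + 2);
  destroy_range(new_begin, storage_end);
  return position_ok ? joined : std::string();
}
```

The element order is preserved even though construction proceeds from the back, which is why the new begin pointer can be assigned directly to `__v.__begin_` in the patch.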
	▲ Show 20 Lines • Show All 71 Lines • ▼ Show 20 Lines
	template <class _ForwardIterator>			template <class _ForwardIterator>
	typename enable_if			typename enable_if
	<			<
	__is_cpp17_forward_iterator<_ForwardIterator>::value,			__is_cpp17_forward_iterator<_ForwardIterator>::value,
	void			void
	>::type			>::type
	vector<_Tp, _Allocator>::__construct_at_end(_ForwardIterator __first, _ForwardIterator __last, size_type __n)			vector<_Tp, _Allocator>::__construct_at_end(_ForwardIterator __first, _ForwardIterator __last, size_type __n)
	{			{
	_ConstructTransaction __tx(*this, __n);			_ConstructTransaction __tx(*this, __n);
	_VSTD::__construct_range_forward(this->__alloc(), __first, __last, __tx.__pos_);			__tx.__pos_ = std::__uninitialized_allocator_copy(__alloc(), __first, __last, __tx.__pos_);
	}			}

	// Default constructs __n objects starting at __end_			// Default constructs __n objects starting at __end_
	// throws if construction throws			// throws if construction throws
	// Postcondition: size() == size() + __n			// Postcondition: size() == size() + __n
	// Exception safety: strong.			// Exception safety: strong.
	template <class _Tp, class _Allocator>			template <class _Tp, class _Allocator>
	void			void
	▲ Show 20 Lines • Show All 2,198 Lines • Show Last 20 Lines

libcxx/test/libcxx/containers/sequences/vector/asan_throw.pass.cpp

Show First 20 Lines • Show All 93 Lines • ▼ Show 20 Lines	void test_insert_range() {
v.push_back(X(2));		v.push_back(X(2));
assert(v.size() == 2);		assert(v.size() == 2);
assert(v.capacity() >= 4);		assert(v.capacity() >= 4);
try {		try {
char a[2] = {21, 42};		char a[2] = {21, 42};
v.insert(v.end(), a, a + 2);		v.insert(v.end(), a, a + 2);
assert(0);		assert(0);
} catch (int e) {		} catch (int e) {
assert(v.size() == 3);		assert(v.size() == 2);
		Mordante (Done): Is this a behaviour change or a bug fix? I don't understand this change. Maybe add a description to the patch that explains this behaviour change. That way when we look at the history we know why this was done.
		philnik (Author, Done): https://eel.is/c++draft/vector#modifiers-2 is the relevant paragraph I think. I'm not entirely sure if this is just a behaviour change or if it's a bugfix. I think it's a bugfix.
		ldionne (Done): My reading is that we are fixing a bug, since this line of the spec should apply: "If an exception is thrown other than by the copy constructor, move constructor, assignment operator, or move assignment operator of `T` or by any `InputIterator` operation there are no effects." Indeed, the exception is thrown by `X(char)`, which is none of the above, and so there should be no effects. Previously, the size of the vector would have been modified, and that's wrong. This mandates a test in `libcxx/test/std`; it's a pretty serious bug since exception guarantees in `push_back` & friends are a big deal.
}		}
assert(v.size() == 3);		assert(v.size() == 2);
assert(is_contiguous_container_asan_correct(v));		assert(is_contiguous_container_asan_correct(v));
}		}
		ldionne (Done): We should also have a test that ensures that we destroy the newly created elements if an exception is thrown. I think it was wrong to skip that in the code previously, since that means that we'd have been potentially leaking stuff if an exception was thrown.
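A standalone sketch of the kind of test requested above might look as follows. It is not the test that was added to the patch: the type `X` here is a hypothetical re-creation whose `X(char)` constructor throws for the value 42, and `InsertResult` and `run_insert_demo` are invented names. On an implementation that follows the quoted paragraph, the failed range insert should leave both the size and the number of live elements unchanged.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical element type: counts live objects and throws when converted
// from the char value 42.
struct X {
  static int& live() { static int n = 0; return n; }
  char value;
  X(char c) : value(c) {
    if (c == 42) throw 0;
    ++live();
  }
  X(const X& other) : value(other.value) { ++live(); }
  ~X() { --live(); }
};

struct InsertResult {
  std::size_t size_before;
  std::size_t size_after;
  int live_before;
  int live_after;
  bool threw;
};

// Inserts {21, 42} at the end of a vector with spare capacity; the second
// conversion throws from X(char), which is not a copy or move operation.
InsertResult run_insert_demo() {
  InsertResult r = {};
  {
    std::vector<X> v;
    v.reserve(4);
    v.push_back(X(1));
    v.push_back(X(2));
    r.size_before = v.size();
    r.live_before = X::live();
    char a[2] = {21, 42};
    try {
      v.insert(v.end(), a, a + 2);
    } catch (int) {
      r.threw = true;
    }
    r.size_after = v.size();
    r.live_after = X::live();
  }
  return r;
}
```

Comparing `live_after` with `live_before` covers the destruction requirement: the element built from 21 must be destroyed again before the exception leaves `insert`.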

void test_insert() {		void test_insert() {
std::vector<X> v;		std::vector<X> v;
v.reserve(3);		v.reserve(3);
v.insert(v.end(), X(1));		v.insert(v.end(), X(1));
v.insert(v.begin(), X(2));		v.insert(v.begin(), X(2));
assert(v.size() == 2);		assert(v.size() == 2);
try {		try {
▲ Show 20 Lines • Show All 122 Lines • Show Last 20 Lines

libcxx/test/libcxx/memory/uninitialized_allocator_copy.pass.cpp

This file was added.

				//===----------------------------------------------------------------------===//
				//
				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
				//
				//===----------------------------------------------------------------------===//

				// UNSUPPORTED: no-exceptions

				// ensure that __uninitialized_allocator_copy calls the proper construct and destruct functions

				#include <algorithm>
				#include <iterator>
				#include <memory>

				#include "test_allocator.h"

				template <class T>
				class construct_counting_allocator {
				public:
				using value_type = T;

				int* constructed_count_;
				int* max_constructed_count_;

				construct_counting_allocator(int* constructed_count, int* max_constructed_count)
				: constructed_count_(constructed_count), max_constructed_count_(max_constructed_count) {}

				template <class... Args>
				void construct(T* ptr, Args&&... args) {
				::new (static_cast<void*>(ptr)) T(args...);
				++*constructed_count_;
				*max_constructed_count_ = std::max(*max_constructed_count_, *constructed_count_);
				}

				void destroy(T* ptr) {
				--*constructed_count_;
				ptr->~T();
				}
				};

				int throw_if_zero = 15;

				struct ThrowSometimes {
				ThrowSometimes() = default;
				ThrowSometimes(const ThrowSometimes&) {
				if (--throw_if_zero == 0)
				throw 1;
				}
				};

				int main(int, char**) {
				int constructed_count = 0;
				int max_constructed_count = 0;
				construct_counting_allocator<ThrowSometimes> alloc(&constructed_count, &max_constructed_count);
				ThrowSometimes in[20];
				TEST_ALIGNAS_TYPE(ThrowSometimes) char out[sizeof(ThrowSometimes) * 20];
				try {
				std::__uninitialized_allocator_copy(
				alloc, std::begin(in), std::end(in), reinterpret_cast<ThrowSometimes*>(std::begin(out)));
				} catch (...) {
				}

				assert(constructed_count == 0);
				assert(max_constructed_count == 14);
				}
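
The behaviour this test checks (construct through the allocator, and on an exception destroy everything built so far before rethrowing) can be sketched as follows. This is not the libc++ implementation: `uninitialized_allocator_copy_sketch`, `ThrowOnThirdCopy`, and `live_outputs_after_failed_copy` are hypothetical names introduced only for this illustration, and the sketch assumes a raw-pointer destination.

```cpp
#include <cassert>
#include <iterator>
#include <memory>

// Hypothetical stand-in for the internal helper; not the real libc++ code.
// Copies [first, last) into raw storage at dest via allocator_traits.
template <class Alloc, class InputIt, class T>
T* uninitialized_allocator_copy_sketch(Alloc& alloc, InputIt first, InputIt last, T* dest) {
  using Traits = std::allocator_traits<Alloc>;
  T* cur = dest;
  try {
    for (; first != last; ++first, (void)++cur)
      Traits::construct(alloc, cur, *first);
  } catch (...) {
    // Roll back: destroy what was constructed, newest first, then rethrow.
    while (cur != dest)
      Traits::destroy(alloc, --cur);
    throw;
  }
  return cur;
}

// Hypothetical element type whose third copy construction throws.
struct ThrowOnThirdCopy {
  static int copies;
  static int alive;
  ThrowOnThirdCopy() { ++alive; }
  ThrowOnThirdCopy(const ThrowOnThirdCopy&) {
    if (++copies == 3)
      throw 1;
    ++alive;
  }
  ~ThrowOnThirdCopy() { --alive; }
};
int ThrowOnThirdCopy::copies = 0;
int ThrowOnThirdCopy::alive = 0;

// Returns how many destination objects are still alive after the failed copy,
// or -1 if the copy unexpectedly did not throw.
int live_outputs_after_failed_copy() {
  ThrowOnThirdCopy::copies = 0;
  std::allocator<ThrowOnThirdCopy> alloc;
  ThrowOnThirdCopy in[5];
  ThrowOnThirdCopy* out = alloc.allocate(5);
  const int before = ThrowOnThirdCopy::alive; // counts only the inputs
  bool threw = false;
  try {
    uninitialized_allocator_copy_sketch(alloc, std::begin(in), std::end(in), out);
  } catch (int) {
    threw = true;
  }
  const int leaked = ThrowOnThirdCopy::alive - before;
  alloc.deallocate(out, 5);
  return threw ? leaked : -1;
}
```

Destroying in reverse order of construction is a choice made in this sketch; the test above only requires that every constructed element is destroyed through the allocator.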

libcxx/test/libcxx/private_headers.verify.cpp

	#include <__memory/compressed_pair.h> // expected-error@: {{use of private header from outside its module: '__memory/compressed_pair.h'}}			#include <__memory/compressed_pair.h> // expected-error@: {{use of private header from outside its module: '__memory/compressed_pair.h'}}
	#include <__memory/concepts.h> // expected-error@: {{use of private header from outside its module: '__memory/concepts.h'}}			#include <__memory/concepts.h> // expected-error@: {{use of private header from outside its module: '__memory/concepts.h'}}
	#include <__memory/construct_at.h> // expected-error@: {{use of private header from outside its module: '__memory/construct_at.h'}}			#include <__memory/construct_at.h> // expected-error@: {{use of private header from outside its module: '__memory/construct_at.h'}}
	#include <__memory/pointer_traits.h> // expected-error@: {{use of private header from outside its module: '__memory/pointer_traits.h'}}			#include <__memory/pointer_traits.h> // expected-error@: {{use of private header from outside its module: '__memory/pointer_traits.h'}}
	#include <__memory/ranges_construct_at.h> // expected-error@: {{use of private header from outside its module: '__memory/ranges_construct_at.h'}}			#include <__memory/ranges_construct_at.h> // expected-error@: {{use of private header from outside its module: '__memory/ranges_construct_at.h'}}
	#include <__memory/ranges_uninitialized_algorithms.h> // expected-error@: {{use of private header from outside its module: '__memory/ranges_uninitialized_algorithms.h'}}			#include <__memory/ranges_uninitialized_algorithms.h> // expected-error@: {{use of private header from outside its module: '__memory/ranges_uninitialized_algorithms.h'}}
	#include <__memory/raw_storage_iterator.h> // expected-error@: {{use of private header from outside its module: '__memory/raw_storage_iterator.h'}}			#include <__memory/raw_storage_iterator.h> // expected-error@: {{use of private header from outside its module: '__memory/raw_storage_iterator.h'}}
	#include <__memory/shared_ptr.h> // expected-error@: {{use of private header from outside its module: '__memory/shared_ptr.h'}}			#include <__memory/shared_ptr.h> // expected-error@: {{use of private header from outside its module: '__memory/shared_ptr.h'}}
				#include <__memory/swap_allocator.h> // expected-error@: {{use of private header from outside its module: '__memory/swap_allocator.h'}}
	#include <__memory/temporary_buffer.h> // expected-error@: {{use of private header from outside its module: '__memory/temporary_buffer.h'}}			#include <__memory/temporary_buffer.h> // expected-error@: {{use of private header from outside its module: '__memory/temporary_buffer.h'}}
	#include <__memory/uninitialized_algorithms.h> // expected-error@: {{use of private header from outside its module: '__memory/uninitialized_algorithms.h'}}			#include <__memory/uninitialized_algorithms.h> // expected-error@: {{use of private header from outside its module: '__memory/uninitialized_algorithms.h'}}
	#include <__memory/unique_ptr.h> // expected-error@: {{use of private header from outside its module: '__memory/unique_ptr.h'}}			#include <__memory/unique_ptr.h> // expected-error@: {{use of private header from outside its module: '__memory/unique_ptr.h'}}
	#include <__memory/uses_allocator.h> // expected-error@: {{use of private header from outside its module: '__memory/uses_allocator.h'}}			#include <__memory/uses_allocator.h> // expected-error@: {{use of private header from outside its module: '__memory/uses_allocator.h'}}
	#include <__memory/voidify.h> // expected-error@: {{use of private header from outside its module: '__memory/voidify.h'}}			#include <__memory/voidify.h> // expected-error@: {{use of private header from outside its module: '__memory/voidify.h'}}
	#include <__mutex_base> // expected-error@: {{use of private header from outside its module: '__mutex_base'}}			#include <__mutex_base> // expected-error@: {{use of private header from outside its module: '__mutex_base'}}
	#include <__node_handle> // expected-error@: {{use of private header from outside its module: '__node_handle'}}			#include <__node_handle> // expected-error@: {{use of private header from outside its module: '__node_handle'}}
	#include <__numeric/accumulate.h> // expected-error@: {{use of private header from outside its module: '__numeric/accumulate.h'}}			#include <__numeric/accumulate.h> // expected-error@: {{use of private header from outside its module: '__numeric/accumulate.h'}}

libcxx/test/std/containers/sequences/vector/vector.modifiers/insert_iter_initializer_list.pass.cpp


	#include <vector>			#include <vector>
	#include <cassert>			#include <cassert>

	#include "test_macros.h"			#include "test_macros.h"
	#include "min_allocator.h"			#include "min_allocator.h"
	#include "asan_testing.h"			#include "asan_testing.h"

	int main(int, char**)			#ifndef TEST_HAS_NO_EXCEPTIONS
	{			int throw_if_zero = 2;
				int constructed_count = 0;

				struct ThrowSometimes {
				ThrowSometimes() { ++constructed_count; }
				ThrowSometimes(const ThrowSometimes&) {
				if (--throw_if_zero == 0)
				throw 1;
				++constructed_count;
				}
				ThrowSometimes& operator=(const ThrowSometimes&) {
				if (--throw_if_zero == 0)
				throw 1;
				++constructed_count;
				return *this;
				}
				~ThrowSometimes() { --constructed_count; }
				};

				void test_throwing() {
				std::vector<ThrowSometimes> v;
				v.reserve(4);
				v.emplace_back();
				v.emplace_back();
				try {
				v.insert(v.end(), {ThrowSometimes{}, ThrowSometimes{}});
				assert(false);
				} catch (int) {
				assert(v.size() == 2);
				assert(constructed_count == 2);
				}
				}
				#endif // TEST_HAS_NO_EXCEPTIONS

				int main(int, char**) {
				#ifndef TEST_HAS_NO_EXCEPTIONS
				test_throwing();
				#endif
	{			{
	std::vector<int> d(10, 1);			std::vector<int> d(10, 1);
	std::vector<int>::iterator i = d.insert(d.cbegin() + 2, {3, 4, 5, 6});			std::vector<int>::iterator i = d.insert(d.cbegin() + 2, {3, 4, 5, 6});
	assert(d.size() == 14);			assert(d.size() == 14);
	assert(is_contiguous_container_asan_correct(d));			assert(is_contiguous_container_asan_correct(d));
	assert(i == d.begin() + 2);			assert(i == d.begin() + 2);
	assert(d[0] == 1);			assert(d[0] == 1);
	assert(d[1] == 1);			assert(d[1] == 1);
	assert(d[2] == 3);			assert(d[2] == 3);
	assert(d[3] == 4);			assert(d[3] == 4);
	assert(d[4] == 5);			assert(d[4] == 5);
	assert(d[5] == 6);			assert(d[5] == 6);
	assert(d[6] == 1);			assert(d[6] == 1);
	assert(d[7] == 1);			assert(d[7] == 1);
	assert(d[8] == 1);			assert(d[8] == 1);
	assert(d[9] == 1);			assert(d[9] == 1);
	assert(d[10] == 1);			assert(d[10] == 1);
	assert(d[11] == 1);			assert(d[11] == 1);
	assert(d[12] == 1);			assert(d[12] == 1);
	assert(d[13] == 1);			assert(d[13] == 1);
	}			}
	{			{
	std::vector<int, min_allocator<int>> d(10, 1);			std::vector<int, min_allocator<int>> d(10, 1);
	std::vector<int, min_allocator<int>>::iterator i = d.insert(d.cbegin() + 2, {3, 4, 5, 6});			std::vector<int, min_allocator<int>>::iterator i = d.insert(d.cbegin() + 2, {3, 4, 5, 6});
	assert(d.size() == 14);			assert(d.size() == 14);
	assert(is_contiguous_container_asan_correct(d));			assert(is_contiguous_container_asan_correct(d));
	assert(i == d.begin() + 2);			assert(i == d.begin() + 2);
	assert(d[0] == 1);			assert(d[0] == 1);
	assert(d[1] == 1);			assert(d[1] == 1);
	assert(d[2] == 3);			assert(d[2] == 3);
	assert(d[3] == 4);			assert(d[3] == 4);
	assert(d[4] == 5);			assert(d[4] == 5);
	assert(d[5] == 6);			assert(d[5] == 6);
	assert(d[6] == 1);			assert(d[6] == 1);
	assert(d[7] == 1);			assert(d[7] == 1);
	assert(d[8] == 1);			assert(d[8] == 1);
	assert(d[9] == 1);			assert(d[9] == 1);
	assert(d[10] == 1);			assert(d[10] == 1);
	assert(d[11] == 1);			assert(d[11] == 1);
	assert(d[12] == 1);			assert(d[12] == 1);
	assert(d[13] == 1);			assert(d[13] == 1);
	}			}

	return 0;			return 0;
	}			}

[libc++] Use uninitialized algorithms for vector (Closed, Public)

Diff 447726

libcxx/include/CMakeLists.txt

libcxx/include/__algorithm/equal_range.h

libcxx/include/__hash_table

libcxx/include/__memory/swap_allocator.h

libcxx/include/__memory/uninitialized_algorithms.h

libcxx/include/__split_buffer

libcxx/include/__tree

libcxx/include/__utility/transaction.h

libcxx/include/forward_list

libcxx/include/list

libcxx/include/memory

libcxx/include/module.modulemap.in

libcxx/include/string

libcxx/include/vector

libcxx/test/libcxx/containers/sequences/vector/asan_throw.pass.cpp

libcxx/test/libcxx/memory/uninitialized_allocator_copy.pass.cpp

libcxx/test/libcxx/private_headers.verify.cpp

libcxx/test/std/containers/sequences/vector/vector.modifiers/insert_iter_initializer_list.pass.cpp