This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
libcxx/
-
docs/Status/
-
Status/
3/3
Cxx2bIssues.csv
-
SpaceshipProjects.csv
-
include/
-
CMakeLists.txt
-
__algorithm/
62/62
lexicographical_compare_three_way.h
1/1
algorithm
2/2
module.modulemap.in
-
test/
-
libcxx/
-
private_headers.verify.cpp
-
std/algorithms/alg.sorting/alg.three.way/
-
algorithms/
-
alg.sorting/
-
alg.three.way/
37/37
lexicographical_compare_three_way.pass.cpp
13/13
lexicographical_compare_three_way_comp.pass.cpp
5/5
lexicographical_compare_three_way_comp.verify.cpp

Differential D131395

[libc++] Implement `lexicographical_compare_three_way`
ClosedPublic

Authored by avogelsgesang on Aug 8 2022, 6:17 AM.

Download Raw Diff

Details

Reviewers

ldionne
Mordante
philnik
mumbleskates
huixie90
var-const
jdoerfert

Group Reviewers

Restricted Project

Commits

rG2a06757a200c: [libc++][spaceship] Implement `lexicographical_compare_three_way`

Summary

The implementation makes use of the freedom added by LWG 3410. We have
two variants of this algorithm:

a fast path for random access iterators: This fast path computes the maximum number of loop iterations up-front and does not compare the iterators against their limits on every loop iteration.
A basic implementation for all other iterators: This implementation compares the iterators against their limits in every loop iteration. However, it still takes advantage of the freedom added by LWG 3410 to avoid unnecessary additional iterator comparisons, as originally specified by P1614R2.

https://godbolt.org/z/7xbMEen5e shows the benefit of the fast path:
The hot loop generated of lexicographical_compare_three_way3 is
more tight than for lexicographical_compare_three_way1. The added
benchmark illustrates how this leads to a 30% performance improvement on
integer vectors.

Implements part of P1614R2 "The Mothership has Landed"

Fixes LWG 3410 and LWG 3350

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

There are a very large number of changes, so older changes are hidden. Show Older Changes

fix the easy comments by @Mordante. Leaving the more involved ones for another day

libcxx/include/module.modulemap.in
291	I just realized that there is already a pattern how to deal with overly long names, see `uniform_random_bit_generator_adaptor`. Did the same here now...
libcxx/test/std/algorithms/alg.sorting/alg.three.way/lexicographical_compare_three_way.pass.cpp
28	any particular reason? Would it be fine if I used a function-local `using std::array`?
libcxx/test/std/algorithms/alg.sorting/alg.three.way/lexicographical_compare_three_way_comp.verify.cpp
19	do we already have such a test utility somewhere? I couldn't find anything useful in `test_iterators.h`
libcxx/test/support/test_comparisons.h
253 ↗	(On Diff #454246)	because for `strong_ordering` you can simply use a plain `int`. Or should I still add it for completeness?

mumbleskates added inline comments.Aug 21 2022, 12:52 PM

libcxx/include/__algorithm/lexicographical_compare_three_way.h
49	The point is that, while we are still relying on the exact same semantic under the hood, we are making an effort to spell out what we expect to happen, making it explicit that we thought of this case and fully intend this outcome. We can also ensure that future readers might also understand this happens when they might not have thought of it, whether in understanding what our code is doing or in making their own implementation some time in the future. I like @Mordante's most recent version with the `using` declarations.
libcxx/test/support/test_comparisons.h
253 ↗	(On Diff #454246)	honestly you could probably just write `using StrongComp = int;`. in the tests i've written so far i have used integral types for strong, floating points for partial, and only used structs for weak orderings. to that end, it would be useful if PartialComp had an avenue to actually return `partial_ordering::unordered`. you could keep its member typed as an `int` and use `INT_MIN` as a sentinel for the unordered value, which could even allow us to test heterogenous orderable/unorderable values `constexpr` in gcc (which currently(?) does not allow comparing infinities and NaNs against different values in constant evaluation). For additional completeness here we would add a `UserComp` struct whose `operator<=>` returns a `UserOrdering` typed value that implements the appropriate operators against literal zero; such types are useful for SFINAE testing and types that utilize `synth-three-way`.

LGTM with nits. I will leave the final approval to others who also commented

libcxx/benchmarks/lexicographical_compare_three_way.bench.cpp
35 ↗	(On Diff #454330)	nit: you can use random_access_iterator<int> from the test_iterators.h so that both tests are using some iterator wrappers (to be more fair). It also saves you to explain that `int` is a random access iterator
libcxx/test/std/algorithms/alg.sorting/alg.three.way/lexicographical_compare_three_way.pass.cpp
122	in other tests, we usually do template <class It1, class Int2> void test(); template <class It2> void testAllIterator1(){ test<InputIteartor, It2>(); test<ForwardIteartor, It2>(); // ... test<ContiguousIteartor, It2>(); } void testAllIt1It2(){ testAllIterator1<InputIteartor>(); testAllIterator1<ForwardIteartor>(); // ... testAllIterator1<ContiguousIteartor>(); } This saves you manually writing down the all the cartesian products. Some people also argue that we only need to test the weakest iterator and the strongest iterator to save the combinatorial code bloat. It also has a point but I am fine with testing all of the combinations.

address a couple more comments

libcxx/include/__algorithm/lexicographical_compare_three_way.h
45–46	Can you test whether that's true here too, by using integral instead of is_integral_v? https://godbolt.org/z/rKdPq9joo Given that we implemented the `integral` as concept integral = is_integral_v<_Tp>; using a concept here doesn't improve the error message. It only adds a more complicated backtrace to the error message which does not really provide much value.
45–46	[...] maybe we want to add the exposition only concepts for Cpp17Iterators from the standard somewhere and static_assert that here? You mean static_assert(__cpp17_iterator<_InputIterator1>, "calling lexicographical_compare_three_way with a non-standard-compliant iterators is undefined behavior"); static_assert(__cpp17_iterator<_InputIterator2>, "calling lexicographical_compare_three_way with a non-standard-compliant iterators is undefined behavior"); ?
49	I personally don't have an opinion here. Please come to an agreement here (or agree to disagree, and nevertheless agree with merging this code). I am happy to update this patch in whichever way you agree on, but I want to avoid changing it forth and back (an earlier version of this review was actually written in terms of the difference type and changed based on feedback in this review)
libcxx/test/std/algorithms/alg.sorting/alg.three.way/lexicographical_compare_three_way.pass.cpp
34	not sure how much that would actually give us. Instead of test_lexicographical_compare<Iter1, Iter2>(array{0, 1}, array{0, 2}, std::strong_ordering::less); I could write test_lexicographical_compare<Iter1, Iter2>(array{0, 1}, array{0, 2}); but I would still have to enumerate all the different input arrays, iterator types etc. Given this would only save few lines, I prefer to keep the test cases as they currently are.

Harbormaster completed remote builds in B182976: Diff 455013.Aug 23 2022, 4:40 PM

In D131395#3736227, @avogelsgesang wrote:

Added a benchmark

Based on the below results, we can see that the fast path for random access iterators gives us around 30% peformance compared to the basic implementation

Run on (40 X 3000 MHz CPU s)
CPU Caches:
  L1 Data 32 KiB (x20)
  L1 Instruction 32 KiB (x20)
  L2 Unified 1024 KiB (x20)
  L3 Unified 14080 KiB (x2)
Load Average: 8.64, 6.10, 3.51
***WARNING*** CPU scaling is enabled, the benchmark real time measurements may be noisy and will incur extra overhead.
-------------------------------------------------------------------------------------------------------------------
Benchmark                                                                         Time             CPU   Iterations
-------------------------------------------------------------------------------------------------------------------
BM_lexicographical_compare_three_way<int*>/1                                   1.38 ns         1.38 ns    507820129
BM_lexicographical_compare_three_way<int*>/2                                   2.06 ns         2.06 ns    339714887
BM_lexicographical_compare_three_way<int*>/4                                   3.44 ns         3.44 ns    203747061
BM_lexicographical_compare_three_way<int*>/8                                   6.20 ns         6.20 ns    111056521
BM_lexicographical_compare_three_way<int*>/16                                  12.0 ns         12.0 ns     58183199
BM_lexicographical_compare_three_way<int*>/32                                  22.9 ns         22.9 ns     30549045
BM_lexicographical_compare_three_way<int*>/64                                  44.8 ns         44.8 ns     15615457
BM_lexicographical_compare_three_way<int*>/128                                 99.0 ns         99.0 ns      7056236
BM_lexicographical_compare_three_way<int*>/256                                  187 ns          187 ns      3743238
BM_lexicographical_compare_three_way<int*>/512                                  363 ns          363 ns      1927025
BM_lexicographical_compare_three_way<int*>/1024                                 715 ns          715 ns       969796
BM_lexicographical_compare_three_way<int*>/2048                                1418 ns         1418 ns       493273
BM_lexicographical_compare_three_way<int*>/4096                                2838 ns         2838 ns       246123
BM_lexicographical_compare_three_way<int*>/8192                                5647 ns         5647 ns       123272
BM_lexicographical_compare_three_way<int*>/16384                              11318 ns        11318 ns        61772
BM_lexicographical_compare_three_way<int*>/32768                              22565 ns        22563 ns        31000
BM_lexicographical_compare_three_way<int*>/65536                              45231 ns        45229 ns        15513
BM_lexicographical_compare_three_way<int*>/131072                             91047 ns        91042 ns         7678
BM_lexicographical_compare_three_way<int*>/262144                            181619 ns       181601 ns         3867
BM_lexicographical_compare_three_way<int*>/524288                            361487 ns       361452 ns         1936
BM_lexicographical_compare_three_way<int*>/1048576                           736236 ns       736225 ns          953
BM_lexicographical_compare_three_way<cpp17_input_iterator<int*>>/1             1.03 ns         1.03 ns    675808965
BM_lexicographical_compare_three_way<cpp17_input_iterator<int*>>/2             2.41 ns         2.41 ns    290934998
BM_lexicographical_compare_three_way<cpp17_input_iterator<int*>>/4             4.48 ns         4.48 ns    156648871
BM_lexicographical_compare_three_way<cpp17_input_iterator<int*>>/8             8.60 ns         8.60 ns     81462544
BM_lexicographical_compare_three_way<cpp17_input_iterator<int*>>/16            16.9 ns         16.9 ns     41431608
BM_lexicographical_compare_three_way<cpp17_input_iterator<int*>>/32            33.3 ns         33.3 ns     20989390
BM_lexicographical_compare_three_way<cpp17_input_iterator<int*>>/64            66.3 ns         66.3 ns     10542387
BM_lexicographical_compare_three_way<cpp17_input_iterator<int*>>/128            142 ns          142 ns      4949055
BM_lexicographical_compare_three_way<cpp17_input_iterator<int*>>/256            274 ns          274 ns      2555910
BM_lexicographical_compare_three_way<cpp17_input_iterator<int*>>/512            537 ns          537 ns      1296173
BM_lexicographical_compare_three_way<cpp17_input_iterator<int*>>/1024          1066 ns         1066 ns       656525
BM_lexicographical_compare_three_way<cpp17_input_iterator<int*>>/2048          2121 ns         2121 ns       329789
BM_lexicographical_compare_three_way<cpp17_input_iterator<int*>>/4096          4237 ns         4237 ns       165380
BM_lexicographical_compare_three_way<cpp17_input_iterator<int*>>/8192          8452 ns         8452 ns        82559
BM_lexicographical_compare_three_way<cpp17_input_iterator<int*>>/16384        16897 ns        16897 ns        41435
BM_lexicographical_compare_three_way<cpp17_input_iterator<int*>>/32768        33800 ns        33798 ns        20716
BM_lexicographical_compare_three_way<cpp17_input_iterator<int*>>/65536        67609 ns        67608 ns        10340
BM_lexicographical_compare_three_way<cpp17_input_iterator<int*>>/131072      135777 ns       135770 ns         5156
BM_lexicographical_compare_three_way<cpp17_input_iterator<int*>>/262144      270691 ns       270686 ns         2588
BM_lexicographical_compare_three_way<cpp17_input_iterator<int*>>/524288      542588 ns       542550 ns         1290
BM_lexicographical_compare_three_way<cpp17_input_iterator<int*>>/1048576    1088975 ns      1088880 ns          642

Can you please redo the benchmark in a different way by doing some local (not to be committed) modifications? Rather than comparing the results for int* versus cpp17_input_iterator<int*>, remove the input iterator tests completely, run the int* benchmark against the current implementation, then remove the optimization and rerun the benchmark. That way, it would be a proper "apples-to-apples" comparison: int* vs. int* with or without the optimization. (I don't expect results to change much, but better be sure)

libcxx/include/__algorithm/lexicographical_compare_three_way.h
24	Is this header still used?
37	Optional: consider refactoring both branches into helper functions.
44	Nit: I think this would read a little better with a verb, e.g. `Using a non-integral difference_type is undefined behavior`. Also, this seems a little overkill -- I'm pretty sure there are many places in algorithms where we subtract random access iterators and expect to get an integral type, without checking. I don't object to the `static_assert`s, but the comment (starting from `We rely on the fact...`) seems a little unnecessary to me (the part about undefined behavior is already captured in the static assertion).
50	Nit: some empty lines will help separate this algorithm into logical blocks and make it more readable. I'd suggest adding blank lines before and after the `for` loop and before the `else` branch.
55	Nit: increment `__i` in the iteration expression? Otherwise, it seems more like a `while` loop.

var-const requested changes to this revision.Aug 25 2022, 1:48 PM

This revision now requires changes to proceed.Aug 25 2022, 1:48 PM

address more comments, but now gcc11 is crashing.
updating so that CI can hopefully give me some insight into whether it's fixed in gcc12.

avogelsgesang added inline comments.Aug 26 2022, 2:19 PM

libcxx/test/support/test_comparisons.h
253 ↗	(On Diff #454246)	complete the set with `std::strong_ordering` done `PartialComp` had an avenue to actually return `partial_ordering::unordered` done `UserComp` struct whose `operator<=>` returns a `UserOrdering` Added. But now gcc-11 crashes

Harbormaster completed remote builds in B183672: Diff 456018.Aug 26 2022, 2:26 PM

var-const added inline comments.Aug 26 2022, 6:05 PM

libcxx/include/__algorithm/lexicographical_compare_three_way.h
46–48	As noted in another comment, can you please rerun the benchmark so that it compares `int` with optimizations vs. `int` without optimizations?

avogelsgesang added inline comments.Aug 28 2022, 2:00 PM

libcxx/include/__algorithm/lexicographical_compare_three_way.h
46–48	yes, rerunning the benchmark is definitely still on my todo list. It takes longer than expected, because it seems that one of the refactorings during this review destroyed the optimization. I can still reproduce the numbers on the old commit, but on the current review the fast path is no longer faster than the default path. I will need some time to figure out what exactly lead to the regression here...

Mordante added inline comments.Aug 29 2022, 9:32 AM

libcxx/include/__algorithm/lexicographical_compare_three_way.h
49	It helps since the rules of the type of the conditional expression are not simple. I had to verify with the standard it does the right thing. So instead of spending a few seconds to validate these 3 lines I had to spend several minutes. I don't see how `common_type` helps with the type of the conditional not being simple. That's exactly what `common_type` uses to get the type in this case AFAICT (https://eel.is/c++draft/type.traits#meta.trans.other-3). Plus is has a heap of other conditionals that are hard to get through. It's at least clear that it's intended to use `common_type` and not one of the other rules of the conditional expression. IMO auto should never be used with the conditional expression. Code should be optimized for understanding by humans, `auto` quite often saves the writer from typing a few characters. (The compiler doesn't care either way it does its thing.) I agree that code should be written to make it easier to read. IMO littering the code with types I don't really care about makes it harder to read. i.e. I don't really care that `a - b` returns a value of type `__iter_diff_t<_InputIterator1>`, but now I have to check that you actually named the correct type. Thinking about it, `integral auto __len1 = __last1 - __first1` would be great. Not sure how much compile time overhead that would incur though. WDYT? I care about these types when I try to understand the code and validate whether the author wrote the code correctly. It takes me as reader a lot longer to validate the code. Auto has its uses but it shouldn't be used everywhere just to make it easy for the writer. I still like the explicit type better either directly or by using a typedef. For the `operator-` I don't dislike this too strongly; but as said above for the conditional expression I do. The verbose code helps to communicate what the author of the code intended to happen. [Relying] on some (not always well understood) language rules means it's less clear for the reader to understand what the writer intended. Both may have a different understanding of these rules. But you are still relying on these rules through `common_type` AFAICT. Yes but as said above it makes it clear that `common_type` is intended to be used. (I agree `common_type` has it's complexity too.)
libcxx/test/std/algorithms/alg.sorting/alg.three.way/lexicographical_compare_three_way.pass.cpp
28	To make it clear which array is used. Sometimes we use similar names as standard names. In general we don't use using globally, sometimes in a function. There usually the nested namespace being tested like `std::chrono` or the `literal`s namespaces.
libcxx/test/std/algorithms/alg.sorting/alg.three.way/lexicographical_compare_three_way_comp.verify.cpp
19	I think we don't. `almost_satisfies_types.h` seems to be the better place for such an iterator.
libcxx/test/support/test_comparisons.h
253 ↗	(On Diff #454246)	GCC-12 too? We don't officially support GCC-11 anymore.

@avogelsgesang Do you need anything from us to make progress on this patch? I'd like to make sure you're unblocked.

@avogelsgesang Do you need anything from us to make progress on this patch? I'd like to make sure you're unblocked.

Thanks for checking in! No, I currently don't need any support. I was on PTO for the last couple of weeks, I am currently ramping back up, and hope to find some time to work on this next week.

The main work item currently is: I am a bit confused by my own benchmark, and I have a suspicion that clang had a regression in the meantime so my benchmark numbers look different now. I want to figure out why my benchmark changed, but in the end I will likely just remove the optimization and the benchmark, and open a request on clang/LLVM to improve their optimizations

@var-const As requested, I updated the benchmark to benchmark int* on both the fast and the slow path.

Previously, you proposed

Can you please redo the benchmark in a different way by doing some local (not to be committed) modifications? Rather than comparing the results for int* versus cpp17_input_iterator<int*>, remove the input iterator tests completely, run the int* benchmark against the current implementation, then remove the optimization and rerun the benchmark. That way, it would be a proper "apples-to-apples" comparison: int* vs. int* with or without the optimization. (I don't expect results to change much, but better be sure)

Instead, I now factored out the slow and fast path into two separate functions (as proposed in another comment) and call both of them directly from the benchmark. That way, we can also rerun the benchmark in the future, and e.g. remove the optimization from libc++ again, as soon as clang's optimizations are smart enough to figure this out by themselves. Do you agree with the updated benchmark?

The results on my local machine (compiled using clang commit aae08b1d372a45b8bef95e86e5fe9110045eb78d):

Running ./libcxx/benchmarks/lexicographical_compare_three_way.libcxx.out
Run on (40 X 800.102 MHz CPU s)
CPU Caches:
  L1 Data 32 KiB (x20)
  L1 Instruction 32 KiB (x20)
  L2 Unified 1024 KiB (x20)
  L3 Unified 14080 KiB (x2)
Load Average: 0.00, 0.48, 0.67
---------------------------------------------------------------------------------------------------------------------
Benchmark                                                                           Time             CPU   Iterations
---------------------------------------------------------------------------------------------------------------------
BM_lexicographical_compare_three_way_slow_path/1                                 1.67 ns         1.67 ns    415947081
BM_lexicographical_compare_three_way_slow_path/4                                 4.68 ns         4.68 ns    149519461
BM_lexicographical_compare_three_way_slow_path/16                                16.7 ns         16.7 ns     41824440
BM_lexicographical_compare_three_way_slow_path/64                                64.9 ns         64.9 ns     10711142
BM_lexicographical_compare_three_way_slow_path/256                                266 ns          266 ns      2630928
BM_lexicographical_compare_three_way_slow_path/1024                              1036 ns         1036 ns       676167
BM_lexicographical_compare_three_way_slow_path/4096                              4122 ns         4121 ns       169817
BM_lexicographical_compare_three_way_slow_path/16384                            16453 ns        16453 ns        42559
BM_lexicographical_compare_three_way_slow_path/65536                            65827 ns        65826 ns        10619
BM_lexicographical_compare_three_way_slow_path/262144                          263147 ns       263140 ns         2658
BM_lexicographical_compare_three_way_slow_path/1048576                        1060909 ns      1060813 ns          659
BM_lexicographical_compare_three_way_fast_path/1                                 2.34 ns         2.34 ns    298968436
BM_lexicographical_compare_three_way_fast_path/4                                 4.76 ns         4.76 ns    148425440
BM_lexicographical_compare_three_way_fast_path/16                                10.1 ns         10.1 ns     68735401
BM_lexicographical_compare_three_way_fast_path/64                                28.8 ns         28.8 ns     24262243
BM_lexicographical_compare_three_way_fast_path/256                                123 ns          123 ns      5685144
BM_lexicographical_compare_three_way_fast_path/1024                               444 ns          444 ns      1575722
BM_lexicographical_compare_three_way_fast_path/4096                              1749 ns         1749 ns       400100
BM_lexicographical_compare_three_way_fast_path/16384                             6898 ns         6897 ns       101189
BM_lexicographical_compare_three_way_fast_path/65536                            28537 ns        28536 ns        24520
BM_lexicographical_compare_three_way_fast_path/262144                          125793 ns       125791 ns         5503
BM_lexicographical_compare_three_way_fast_path/1048576                         506020 ns       505976 ns         1379
BM_lexicographical_compare_three_way<int*>/1                                     1.67 ns         1.67 ns    418741799
BM_lexicographical_compare_three_way<int*>/4                                     3.68 ns         3.68 ns    190328759
BM_lexicographical_compare_three_way<int*>/16                                    8.70 ns         8.70 ns     80464881
BM_lexicographical_compare_three_way<int*>/64                                    28.9 ns         28.9 ns     24261824
BM_lexicographical_compare_three_way<int*>/256                                    125 ns          125 ns      5593129
BM_lexicographical_compare_three_way<int*>/1024                                   448 ns          448 ns      1562365
BM_lexicographical_compare_three_way<int*>/4096                                  1751 ns         1750 ns       399745
BM_lexicographical_compare_three_way<int*>/16384                                 6895 ns         6895 ns       101193
BM_lexicographical_compare_three_way<int*>/65536                                28741 ns        28740 ns        24319
BM_lexicographical_compare_three_way<int*>/262144                              125823 ns       125822 ns         5523
BM_lexicographical_compare_three_way<int*>/1048576                             508282 ns       508215 ns         1385
BM_lexicographical_compare_three_way<random_access_iterator<int*>>/1             2.68 ns         2.68 ns    261609292
BM_lexicographical_compare_three_way<random_access_iterator<int*>>/4             5.37 ns         5.37 ns    132805153
BM_lexicographical_compare_three_way<random_access_iterator<int*>>/16            12.4 ns         12.4 ns     56251434
BM_lexicographical_compare_three_way<random_access_iterator<int*>>/64            44.5 ns         44.5 ns     15732328
BM_lexicographical_compare_three_way<random_access_iterator<int*>>/256            182 ns          182 ns      3848123
BM_lexicographical_compare_three_way<random_access_iterator<int*>>/1024           696 ns          696 ns      1003714
BM_lexicographical_compare_three_way<random_access_iterator<int*>>/4096          2753 ns         2753 ns       254392
BM_lexicographical_compare_three_way<random_access_iterator<int*>>/16384        10966 ns        10966 ns        63844
BM_lexicographical_compare_three_way<random_access_iterator<int*>>/65536        43846 ns        43844 ns        15953
BM_lexicographical_compare_three_way<random_access_iterator<int*>>/262144      175599 ns       175590 ns         3989
BM_lexicographical_compare_three_way<random_access_iterator<int*>>/1048576     705291 ns       705245 ns          991
BM_lexicographical_compare_three_way<cpp17_input_iterator<int*>>/1               1.67 ns         1.67 ns    418471062
BM_lexicographical_compare_three_way<cpp17_input_iterator<int*>>/4               4.68 ns         4.68 ns    149512136
BM_lexicographical_compare_three_way<cpp17_input_iterator<int*>>/16              16.7 ns         16.7 ns     41727011
BM_lexicographical_compare_three_way<cpp17_input_iterator<int*>>/64              65.0 ns         65.0 ns     10776363
BM_lexicographical_compare_three_way<cpp17_input_iterator<int*>>/256              266 ns          266 ns      2635392
BM_lexicographical_compare_three_way<cpp17_input_iterator<int*>>/1024            1035 ns         1035 ns       674817
BM_lexicographical_compare_three_way<cpp17_input_iterator<int*>>/4096            4121 ns         4121 ns       169881
BM_lexicographical_compare_three_way<cpp17_input_iterator<int*>>/16384          16461 ns        16461 ns        42545
BM_lexicographical_compare_three_way<cpp17_input_iterator<int*>>/65536          65776 ns        65774 ns        10630
BM_lexicographical_compare_three_way<cpp17_input_iterator<int*>>/262144        263308 ns       263295 ns         2659
BM_lexicographical_compare_three_way<cpp17_input_iterator<int*>>/1048576      1060793 ns      1060722 ns          660

A couple of observations:

The results for {fast,slow}_path/1 vary from run to run. The slow_path is not faster than the fast_path consistently across runs. I think this is primarily variance
fast_path/1048576 is twice as fast as slow_path/1048576
The results for three_way<int*>/1048576 and three_way<int*>/1048576 are inline with the results for {fast,slow}_path/1048576. The dispatching based on whether we are passing in a random access iterator or not works as expected
three_way<random_access_iterator<int*>>/1048576 is less efficient than three_way<int*>/1048576. As shown below, the assembly code is exactly identical between the two, though. So this seems to be due to some micro-architectural shenanigans (maybe code alignment?)
Compared to the previously posted results, three_way<int*> is now faster. I think compared to last time, I was just more lucky this time, and this improvement is thanks to the same micro-architectural shenanigans as the previous point

I recorded the benchmark with perf record -e cycles:pp ./libcxx/benchmarks/lexicographical_compare_three_way.libcxx.out.
Below the hot loops of the individual benchmarks as reported by perf report:

Hot loop for BM_lexicographical_compare_three_way_fast_path and BM_lexicographical_compare_three_way<int*>. Both have the exact same assembly and the reported cycles:pp perf numbers are pretty identical.

15.10 │210:┌─→cmp    %rdi,%r10 
      │    │↓ je     25a
17.76 │    │  mov    0x4(%rsi,%rdi,4),%r8d
16.45 │    │  mov    0x4(%r15,%rdi,4),%r9d
14.85 │    │  inc    %rdi
 0.30 │    ├──cmp    %r9d,%r8d
15.98 │    └──je     210

Hot loop for BM_lexicographical_compare_three_way_slow_path and BM_lexicographical_compare_three_way<cpp17_input_iterator<int*, int*>>. Both have the exact same assembly and the reported perf numbers are pretty identical.

 0.03 │210:┌─→mov    (%r15,%r10,1),%edi
20.82 │    │  cmp    %edi,(%rsi,%r10,1) 
 2.75 │    │↓ jne    240 
 7.61 │    │  cmp    %r10,%rax
      │    │  sete   %r8b
 0.01 │    │  cmp    %r10,%r9
20.71 │    │  sete   %dil
 7.54 │    │↑ je     1d0
      │    │  lea    0x4(%r10),%r11
 0.01 │    │  cmp    %r10,%rax
21.13 │    │  mov    %r11,%r10
 7.63 │    └──jne    210

Hot loop for BM_lexicographical_compare_three_way<random_access_iterator<int*> >. Interestingly, the generated assembly is identitcal with

 2.10 │210:┌─→cmp    %rdi,%r10 
      │    │↓ je     25a
35.84 │    │  mov    0x4(%rsi,%rdi,4),%r8d
 3.58 │    │  mov    0x4(%r15,%rdi,4),%r9d
 4.02 │    │  inc    %rdi
 0.29 │    ├──cmp    %r9d,%r8d
33.22 │    └──je     210

As you can see, the assembly is identical with BM_lexicographical_compare_three_way_fast_path. However, the codes executes 40%, and the perf profile has different hot instructions.
My guess is that this is due to some micro-architectural shenanigans.

I am not sure how to further pinpoint this difference between int* and random_access_iterator<int*>.
Either way, I think we can say for sure: The assembly for the fast path is more efficient than the assembly for the slow path.
There seems to be some performance variability, though, due to some aspects which I don't understand fully.

Are those benchmark results as well as the updated benchmark code satisfactory to you, @var-const?

libcxx/include/__algorithm/lexicographical_compare_three_way.h
37	Done. In particular, because this allows me to address your other benchmarking request more easily: I can now directly call the slow and fast paths from the benchmark

Updated benchmark code to directly compare the slow and the fast path

Harbormaster completed remote builds in B198648: Diff 476712.Nov 19 2022, 5:58 PM

Address mordante's comments about test cases:

remove using std::array and qualify each usage explicitly
add test case for non-integer difference_type

libcxx/test/support/test_comparisons.h
253 ↗	(On Diff #454246)	Didn't try gcc-12, yet. But I am currently struggling to even get it working for clang. https://godbolt.org/z/TKj18bqr8 shows my current progress. The problem is that I can't get it to satisfy the `std::three_way_comparable`. `three_way_comparable` requires `same_as<common_comparison_category_t<user_ordering, partial_ordering>, partial_ordering>`, but looking at the implementation of `common_comparison_category_t`, it seems to be hardcoded to `{partial,weak,strong}_ordering` and I don't see a way how to extend it. As such, I would say: Any type of user-defined ordering is currently not implementable.

Harbormaster completed remote builds in B198655: Diff 476720.Nov 19 2022, 8:21 PM

I think I addressed all actionable review comments now.

There is one unresolved, controversial comments remaining, though: auto vs explicit types:

@Mordante prefers explicit types
@philnik prefers auto
@mumbleskates first preferred auto but now seems to prefer explicit types

Given that there are good arguments in both directions and I don't see one side as objectively better, I would propose to merge the commit as currently in review, in the interest of making progress.
Can you live with that, @mumbleskates, @philnik, @Mordante?

libcxx/include/__algorithm/lexicographical_compare_three_way.h

45–46

To me, the goal of the

static_assert(is_integral_v<typename iterator_traits<_InputIterator1>::difference_type>,
              "Using a non-integral difference_type is undefined behavior");
static_assert(is_integral_v<typename iterator_traits<_InputIterator2>::difference_type>,
              "Using a non-integral difference_type is undefined behavior");

was to remind the reader: the difference_type is an integer, keep that in mind while reading the following code. Replacing this by a __cpp17_input_iterator assert would no longer fulfil this purpose

fix CI

Harbormaster completed remote builds in B198661: Diff 476726.Nov 19 2022, 9:26 PM

fix CI

Harbormaster completed remote builds in B198673: Diff 476741.Nov 20 2022, 5:41 AM

fix CI

Harbormaster completed remote builds in B198674: Diff 476742.Nov 20 2022, 6:14 AM

fix CI... maybe...

avogelsgesang added a child revision: D132312: [libc++][spaceship] Implement `operator<=>` for `list`.Nov 20 2022, 6:10 PM

avogelsgesang removed a child revision: D132268: [libc++][spaceship] Implement `operator<=>` for `vector`.Nov 20 2022, 6:49 PM

Harbormaster completed remote builds in B198698: Diff 476779.Nov 21 2022, 12:35 PM

Another try at fixing the CI; this time the benchmarks

Harbormaster completed remote builds in B198847: Diff 476985.Nov 22 2022, 1:35 AM

use explicit types instead of auto
make benchmark runnable on non-libcxx

Harbormaster completed remote builds in B199461: Diff 477842.Nov 24 2022, 2:31 PM

sort includes correctly
work around NASTY_MACRO

Harbormaster completed remote builds in B199469: Diff 477852.Nov 24 2022, 6:15 PM

ldionne added inline comments.Nov 25 2022, 6:57 AM

libcxx/benchmarks/lexicographical_compare_three_way.bench.cpp
13–14 ↗	(On Diff #477852)	I wouldn't bother with that. We test internal stuff as well in other benchmarks. If we want to make them usable with other stdlibs, I think we'd need to spend some time making that work.

LGTM, I leave the final approval to @var-const

libcxx/include/__algorithm/lexicographical_compare_three_way.h
101	Is `forward` intended? `__comp` is passed by value.

Removing my "request for changes". Please update the benchmarks as suggested and ship it once @var-const gives you the LGTM.

Thanks!

libcxx/benchmarks/lexicographical_compare_three_way.bench.cpp
73–79 ↗	(On Diff #477852)	You seem to be benchmarking not only the call to `std::lexicographical_compare_three_way`, but all of: auto b1 = IteratorT{v1.data()}; auto e1 = IteratorT{v1.data() + v1.size()}; auto b2 = IteratorT{v2.data()}; auto e2 = IteratorT{v2.data() + v2.size()}; benchmark::DoNotOptimize(std::lexicographical_compare_three_way(b1, e1, b2, e2, std::compare_three_way())); I suspect this is where the difference for `random_access_iterator<int>` vs `int` comes from. I think it would make sense to instead create the various iterators outside of the timer. And then perhaps regenerate the benchmarks and hopefully the ones for `random_access_iterator<int>` and `int` would be within noise. Also, we did confirm that the assembly was the same for both: https://godbolt.org/z/nTfGcKrzM.

forward -> move
move iterator initialization out of benchmarking loop
remove #if _LIB_CPP_VERSION check from benchmark

libcxx/benchmarks/lexicographical_compare_three_way.bench.cpp

73–79 ↗

(On Diff #477852)

Good catch! I moved the initialization of the iterators out of the benchmark loop. In particular for the large vector sizes, I don't think that this influenced the benchmark results, but still good to make the benchmarks as focused as possible.

I suspect this is where the difference for random_access_iterator<int*> vs int* comes from.

The change did not help, unfortunately (see my local results below). This time, the performance of BM_lexicographical_compare_three_way_fast_path/1048576 also dropped to 700ms compared to last time. The assembly code for the different ways to trigger the fast path (direct call to the fast path; int*, random_access_iterator) is still identical.

-----------------------------------------------------------------------------------------------------------------------
Benchmark                                                                             Time             CPU   Iterations
-----------------------------------------------------------------------------------------------------------------------
BM_lexicographical_compare_three_way_slow_path/1                                   2.01 ns         2.01 ns    347897440
BM_lexicographical_compare_three_way_slow_path/4                                   5.00 ns         5.00 ns    137864992
BM_lexicographical_compare_three_way_slow_path/16                                  16.3 ns         16.3 ns     42347282
BM_lexicographical_compare_three_way_slow_path/64                                  64.8 ns         64.8 ns     10882499
BM_lexicographical_compare_three_way_slow_path/256                                  269 ns          269 ns      2602744
BM_lexicographical_compare_three_way_slow_path/1024                                1039 ns         1039 ns       672699
BM_lexicographical_compare_three_way_slow_path/4096                                4131 ns         4131 ns       169469
BM_lexicographical_compare_three_way_slow_path/16384                              16563 ns        16562 ns        42373
BM_lexicographical_compare_three_way_slow_path/65536                              65853 ns        65850 ns        10631
BM_lexicographical_compare_three_way_slow_path/262144                            263375 ns       263356 ns         2656
BM_lexicographical_compare_three_way_slow_path/1048576                          1060461 ns      1060399 ns          658
BM_lexicographical_compare_three_way_fast_path/1                                   1.35 ns         1.35 ns    523207011
BM_lexicographical_compare_three_way_fast_path/4                                   3.38 ns         3.38 ns    207207415
BM_lexicographical_compare_three_way_fast_path/16                                  11.4 ns         11.4 ns     60817686
BM_lexicographical_compare_three_way_fast_path/64                                  43.6 ns         43.6 ns     16014340
BM_lexicographical_compare_three_way_fast_path/256                                  181 ns          181 ns      3877235
BM_lexicographical_compare_three_way_fast_path/1024                                 700 ns          700 ns      1008340
BM_lexicographical_compare_three_way_fast_path/4096                                2758 ns         2758 ns       253557
BM_lexicographical_compare_three_way_fast_path/16384                              11054 ns        11054 ns        62963
BM_lexicographical_compare_three_way_fast_path/65536                              44180 ns        44176 ns        15921
BM_lexicographical_compare_three_way_fast_path/262144                            175889 ns       175876 ns         3986
BM_lexicographical_compare_three_way_fast_path/1048576                           707635 ns       707576 ns          990
BM_lexicographical_compare_three_way<IntPtr>/1                                     1.01 ns         1.01 ns    690305420
BM_lexicographical_compare_three_way<IntPtr>/4                                     3.70 ns         3.70 ns    189157419
BM_lexicographical_compare_three_way<IntPtr>/16                                    7.70 ns         7.70 ns     90905923
BM_lexicographical_compare_three_way<IntPtr>/64                                    29.2 ns         29.2 ns     23966259
BM_lexicographical_compare_three_way<IntPtr>/256                                    124 ns          124 ns      5635669
BM_lexicographical_compare_three_way<IntPtr>/1024                                   467 ns          467 ns      1497982
BM_lexicographical_compare_three_way<IntPtr>/4096                                  1848 ns         1848 ns       378813
BM_lexicographical_compare_three_way<IntPtr>/16384                                 7710 ns         7710 ns        89978
BM_lexicographical_compare_three_way<IntPtr>/65536                                29735 ns        29734 ns        23509
BM_lexicographical_compare_three_way<IntPtr>/262144                              122694 ns       122680 ns         5585
BM_lexicographical_compare_three_way<IntPtr>/1048576                             500608 ns       500572 ns         1000
BM_lexicographical_compare_three_way<random_access_iterator<IntPtr>>/1             1.34 ns         1.34 ns    515760180
BM_lexicographical_compare_three_way<random_access_iterator<IntPtr>>/4             3.42 ns         3.42 ns    205376850
BM_lexicographical_compare_three_way<random_access_iterator<IntPtr>>/16            11.8 ns         11.8 ns     59125669
BM_lexicographical_compare_three_way<random_access_iterator<IntPtr>>/64            44.0 ns         44.0 ns     15869035
BM_lexicographical_compare_three_way<random_access_iterator<IntPtr>>/256            181 ns          180 ns      3880819
BM_lexicographical_compare_three_way<random_access_iterator<IntPtr>>/1024           695 ns          695 ns      1003722
BM_lexicographical_compare_three_way<random_access_iterator<IntPtr>>/4096          2754 ns         2754 ns       254042
BM_lexicographical_compare_three_way<random_access_iterator<IntPtr>>/16384        11040 ns        11040 ns        63761
BM_lexicographical_compare_three_way<random_access_iterator<IntPtr>>/65536        43912 ns        43911 ns        15908
BM_lexicographical_compare_three_way<random_access_iterator<IntPtr>>/262144      177597 ns       177587 ns         3965
BM_lexicographical_compare_three_way<random_access_iterator<IntPtr>>/1048576     706671 ns       706623 ns          993
BM_lexicographical_compare_three_way<cpp17_input_iterator<IntPtr>>/1               1.12 ns         1.12 ns    627628655
BM_lexicographical_compare_three_way<cpp17_input_iterator<IntPtr>>/4               4.13 ns         4.13 ns    169518923
BM_lexicographical_compare_three_way<cpp17_input_iterator<IntPtr>>/16              16.3 ns         16.3 ns     43030558
BM_lexicographical_compare_three_way<cpp17_input_iterator<IntPtr>>/64              64.4 ns         64.4 ns     10875232
BM_lexicographical_compare_three_way<cpp17_input_iterator<IntPtr>>/256              268 ns          268 ns      2611255
BM_lexicographical_compare_three_way<cpp17_input_iterator<IntPtr>>/1024            1040 ns         1040 ns       674434
BM_lexicographical_compare_three_way<cpp17_input_iterator<IntPtr>>/4096            4133 ns         4133 ns       169644
BM_lexicographical_compare_three_way<cpp17_input_iterator<IntPtr>>/16384          16523 ns        16522 ns        42524
BM_lexicographical_compare_three_way<cpp17_input_iterator<IntPtr>>/65536          65819 ns        65814 ns        10645
BM_lexicographical_compare_three_way<cpp17_input_iterator<IntPtr>>/262144        263771 ns       263765 ns         2656
BM_lexicographical_compare_three_way<cpp17_input_iterator<IntPtr>>/1048576      1059997 ns      1059932 ns          660

libcxx/include/__algorithm/lexicographical_compare_three_way.h

101

good catch! Changed to move

JohelEGP added inline comments.Nov 29 2022, 4:14 PM

libcxx/include/__algorithm/lexicographical_compare_three_way.h
101	I think `ref(__comp)` is the preferred way.

Harbormaster completed remote builds in B200130: Diff 478741.Nov 29 2022, 7:03 PM

Thanks a lot for the updated benchmarks! I really appreciate the effort you put into this.

All in all, the results look pretty good, but I really hope we can get to the bottom of the difference between int* and random_access_iterator<int*>.

The results for {fast,slow}_path/1 vary from run to run. The slow_path is not faster than the fast_path consistently across runs. I think this is primarily variance

I suspect that this might be more than just variance. I haven't tried plotting this, but from looking at the numbers, it seems that the fast path starts _slower_ for the smallest input of 1, becomes roughly equal in time for a slightly larger input of 4, and becomes faster with larger inputs, with the difference becoming larger and larger as the input grows. So if we were to plot this (with the x axis being input size and y axis being time), it seems like we would get two curves where the fast path curve starts out slightly above the slow path curve, but then they intersect almost immediately and from that point the slow curve goes up at a sharper angle, so to say, than the fast curve. I could also imagine how the implementation for the fast path could end up doing slightly more work for tiny inputs. So in short, I think we might be seeing a real issue where the optimization is actually slightly less efficient for very small inputs.

I doubt, however, that this is an issue worth fixing. Adding a runtime check for small inputs would introduce branching that is likely to do way more harm than good, and optimizing for large inputs is a lot more important than for small ones. In short, this seems like a good (and probably unavoidable) tradeoff.

three_way<random_access_iterator<int*>>/1048576 is less efficient than three_way<int*>/1048576. As shown below, the assembly code is exactly identical between the two, though. So this seems to be due to some micro-architectural shenanigans (maybe code alignment?)
[...]
I am not sure how to further pinpoint this difference between int* and random_access_iterator<int*>.
Either way, I think we can say for sure: The assembly for the fast path is more efficient than the assembly for the slow path.
There seems to be some performance variability, though, due to some aspects which I don't understand fully.

Here, too, I suspect we're seeing a real thing and not just variance. I say that because the difference tends to stay pretty consistent, with random_access_iterator being ~40% (30-50%) slower. This seems to point to the issue being real.

I don't immediately see how the same assembly could produce consistently different timings, so I'd start with the hypothesis that the difference happens somewhere else. On the other hand, as the input grows, any factors such as inlining or not inlining the function or copying the iterators should be completely drowned out by the body of the algorithm. But perhaps looking at a larger piece of generated assembly would help pinpoint the issue. I'll try to find the time this week to see if I can reproduce the numbers on my side.

I would expect the results for int* and random_access_iterator<int*> to be within the margin of error in an optimized build. If they differ, it might imply we accidentally do something inefficient with iterators, and it would be great to get to the bottom of this because it's such a common use case. I certainly don't want to block this patch forever on this, though. Basically, I think we should dig more into the issue, but if we run out of ideas, I'd be okay to ship it as is and deal with it later.

libcxx/benchmarks/lexicographical_compare_three_way.bench.cpp
73 ↗	(On Diff #478741)	Nit: passing this is unnecessary because it's the default value, right?
73 ↗	(On Diff #478741)	Nit: move?
libcxx/include/__algorithm/lexicographical_compare_three_way.h
49	FWIW, I don't have a strong preference here, but for me, one of the most important and informative aspects of `auto` is "guaranteed no conversion". This is as relevant for the author as it is for the reader. While it's true that `auto` can be misused by a writer to avoid thinking about which types are returned (IMO that's a bigger time "saver" than the literal typing which can often be autocompleted), I think it has legitimate uses where it makes the intention clearer.
67	Would it make sense to pass `__comp` by reference? It could, in theory, be e.g. a lambda that captures a lot of state. In fact, we generally commit to avoid copying user-provided functors. However, see also the other comment about `__comp_ref_type` (which would make this a reference but also have an additional effect in the debug mode).
92	As a libc++-specific extension, we mark the return value of `lexicographical_compare` with `_LIBCPP_NODISCARD_EXT`. We should do the same here (and add a check to `test/libcxx/diagnostics/nodiscard_extensions.verify.cpp`).
101	@JohenEGP I presume you referring to the `__comp_ref_type`, right? It's a great suggestion. @avogelsgesang For context, there is an existing pattern in algorithms where the internal implementation of the function wraps the comparator in a typedef that is defined differently in debug and non-debug modes (see `include/__algorithm/comp_ref_type.h`): #ifdef _LIBCPP_ENABLE_DEBUG_MODE template <class _Comp> using __comp_ref_type = __debug_less<_Comp>; #else template <class _Comp> using __comp_ref_type = _Comp&; #endif So in non-debug modes, this resolves to simply a reference. In debug mode, however, it creates a temporary of a helper type `__debug_less` that additionally checks that the comparator in fact does induce a strict weak order. I think we want to continue using this pattern going forward (e.g. `lexicographical_compare` uses it). The only issue is that the existing `__debug_less` only supports comparators returning a boolean. You would probably need to create a separate `__three_way_comp_ref_type` typedef and a separate `__debug_three_way_comp` helper struct (names are subject to change). Please let me know if you'd like any additional context on this (this can be kinda confusing). Many existing algorithms can be used as examples of this pattern; a caveat is that more recent code uses a (very) slightly different approach: older code tends to call have `_Compare` as a template parameter of the internal function, declare the parameter as `_Compare __comp`, and have the caller specify the template argument explicitly, like `std::__foo<_comp_ref_type<_Comp>>(__first, __last, __comp)`. See e.g. `lexicographical_compare`; in newer code, the pattern was to declare the parameter as a forwarding reference `_Compare&& __comp` and have the caller do a `static_cast`, like `std::__foo(__first, __last, static_cast<__comp_ref_type(__comp)>` (see e.g. `inplace_merge`). If you prefer to, I'm fine with doing this in a follow up since this patch has been open for a while now.
110	Move the iterators? (in the other function as well) You might additionally `static_assert` that the given iterators are copyable, to prevent users from accidentally passing move-only iterators (that our implementation would happen to accept due to the optimization but which isn't guaranteed by the Standard). A few of the existing algorithms do that, see e.g. `upper_bound`.
libcxx/test/std/algorithms/alg.sorting/alg.three.way/lexicographical_compare_three_way.pass.cpp
4	We have a few tests that check for a certain behavior across a wide range of algorithms; those have to be updated to include `lexicographical_compare_three_way` now: `test/libcxx/diagnostics/nodiscard_extensions.verify.cpp` (potentially); `test/libcxx/algorithms/robust_against_copying_comparators.pass.cpp`; `test/libcxx/algorithms/robust_against_cpp20_hostile_iterators.compile.pass.cpp`; `test/std/algorithms/robust_re_difference_type.compile.pass.cpp`; `test/std/algorithms/robust_against_adl.compile.pass.cpp`. Please let me know if you need any help with those.
30	Nit: `s/auto/decltype(auto)/`. That way we always check the _exact_ returned type, without potentially stripping away references. While it's very unlikely that this algorithm could plausibly return a reference, I think the main advantage of using `decltype(auto)` is just not having to think about it and have it guaranteed to always catch the exact type.
libcxx/test/std/algorithms/alg.sorting/alg.three.way/lexicographical_compare_three_way_comp.pass.cpp
32	Question: what is the purpose of comparing just the last digit instead of just the two given numbers? It makes the implementation slightly more complicated (and the variable names longer), but all the inputs are single-digit anyway.
35	Is this branch ever taken?
58	Nit: I'd put `expected` last so that all the algorithm inputs are next to each other.
68	Optional: create a local constant with a shorter name for the comparator to cut down on the boilerplate a little?
154	Nit: `s/both/the/`.
libcxx/test/std/algorithms/alg.sorting/alg.three.way/lexicographical_compare_three_way_comp.verify.cpp
41	Nit: please also check that passing `RandomAccessIteratorBadDifferenceType` iterators for the second range fails as well (i.e., calling `std::lexicographical_compare_three_way(c, d, a, b, std::compare_three_way())`).
libcxx/test/support/almost_satisfies_types.h
425 ↗	(On Diff #478741)	Ultranit: there's one extraneous blank line.
libcxx/test/support/test_comparisons.h
263 ↗	(On Diff #478741)	Is this branch ever taken?

This revision now requires changes to proceed.Nov 30 2022, 1:43 AM

philnik added inline comments.Nov 30 2022, 4:13 AM

libcxx/include/__algorithm/lexicographical_compare_three_way.h
101	IMO we should make the debug stuff a separate effort. AFAIK we don't test it anywhere and because of that I'm pretty sure it regressed in some places. Instead of adding another untested branch, I'd suggest creating a patch that adds tests to all algorithms and, depending on scope, update the algorithms in follow-up patches or as part of the test-patch.

var-const added inline comments.Nov 30 2022, 11:34 AM

libcxx/include/__algorithm/lexicographical_compare_three_way.h
101	I would agree to that if someone volunteers to do that follow-up in the near future -- in fact, it would be great. Would you or @avogelsgesang be willing to take this on?

address varconst's commments

Herald added a reviewer: jdoerfert. · View Herald TranscriptNov 30 2022, 7:05 PM

Herald added a subscriber: sstefan1. · View Herald Transcript

avogelsgesang added inline comments.Nov 30 2022, 7:05 PM

libcxx/benchmarks/lexicographical_compare_three_way.bench.cpp
73 ↗	(On Diff #478741)	Nit: passing this is unnecessary because it's the default value, right? correct. removed Nit: move? move what? the input iterators? I don't think that is possible without a "use-after-move". I need them for each iteration of the `for` loop.
libcxx/include/__algorithm/lexicographical_compare_three_way.h
67	passing as `_Cmp&` now
101	added the `__three_way_comp_ref_type` as requested
110	Is adding the additional calls to `std::move` standard compliant? https://eel.is/c++draft/alg.three.way defines `lexicographical_compare_three_way` as not moving the iterators
libcxx/test/std/algorithms/alg.sorting/alg.three.way/lexicographical_compare_three_way.pass.cpp
4	I added `lexicographical_compare_three_way` to all of them, except for `robust_re_difference_type.compile.pass.cpp`. Afaict this test case relies on undefined behavior. The `difference_type` must be an integer type (https://eel.is/c++draft/iterator.iterators#2.2), but this test case violates that requirement. As requested by @philnik I added a `static_assert` to `lexicographical_compare_three_way` which guards against non-integer different_types. Hence, adding `lexicographical_compare_three_way` triggers this static_assert. I see two ways forward: Not adding `lexicographical_compare_three_way` to `robust_re_difference_type.compile.pass.cpp` or removing the `static_assert`. Which one do you prefer?
libcxx/test/std/algorithms/alg.sorting/alg.three.way/lexicographical_compare_three_way_comp.pass.cpp
32	the idea is to test a "non-default" comparator. If I just used normal integer comparisons here, the test cases wouldn't catch it if `lexicographical_compare_three_way` just ignored the comparator and used the standard comparator instead
35	see "Check for a `partial_ordering::unordered` result" inside `test_comparison_categories`
libcxx/test/support/test_comparisons.h
263 ↗	(On Diff #478741)	See "Check for a `partial_ordering::unordered` result" in `lexicographical_compare_three_way.pass.cpp`

Harbormaster completed remote builds in B200404: Diff 479137.Nov 30 2022, 7:12 PM

huixie90 added inline comments.Dec 3 2022, 9:55 AM

libcxx/include/__algorithm/lexicographical_compare_three_way.h
46	I would Not use `common_type_t` as it might not exist a `common_type` as `common_type` does not work for types that need implicit conversions. The relevant spec here: https://eel.is/c++draft/iterator.concept.winc#6.sentence-1 In short, integral types are only "explicitly" convertible to each other, not implicitly. There are related discussion here: https://github.com/ericniebler/range-v3/issues/1745 I think we want to find which `difference_type` is wider (or we need another type to cover both) and then we need to `static_cast` to that type

avogelsgesang added inline comments.Dec 5 2022, 4:47 AM

libcxx/include/__algorithm/lexicographical_compare_three_way.h
46	I cannot quite follow. Which part of `common_type_t` does not work for this use case? I think we want to find which difference_type is wider (or we need another type to cover both) My understanding is that `common_type_t` does exactly that. Note that https://eel.is/c++draft/iterator.iterators#2.2 guarantees that `difference_type` is a signed integral. As such, there can be no mismatches on signedness, and `common_type_t` should always give the wider `difference_type then we need to static_cast to that type Why do we need static_casts instead of relying on implicit conversions? https://eel.is/c++draft/iterator.concept.winc#6.sentence-1 states that casting to a wider integer type is an implicit conversion.

@avogelsgesang I tried out the benchmarks, and it looks like you were probably accidentally running the benchmarks on the debug version of the build (unfortunately, it's a very easy mistake to make since it's the default). Using the debug build, I get timings very similar to what you saw earlier:

-----------------------------------------------------------------------------------------------------------------------
Benchmark                                                                             Time             CPU   Iterations
-----------------------------------------------------------------------------------------------------------------------
BM_lexicographical_compare_three_way<IntPtr>/1                                     25.7 ns         25.7 ns     27179505
BM_lexicographical_compare_three_way<IntPtr>/4                                     64.1 ns         64.0 ns     10934595
BM_lexicographical_compare_three_way<IntPtr>/16                                     316 ns          249 ns      2924673
BM_lexicographical_compare_three_way<IntPtr>/64                                     880 ns          864 ns       820595
BM_lexicographical_compare_three_way<IntPtr>/256                                   3328 ns         3324 ns       211011
BM_lexicographical_compare_three_way<IntPtr>/1024                                 13151 ns        13145 ns        53256
BM_lexicographical_compare_three_way<IntPtr>/4096                                 52541 ns        52531 ns        13321
BM_lexicographical_compare_three_way<IntPtr>/16384                               210420 ns       210398 ns         3342
BM_lexicographical_compare_three_way<IntPtr>/65536                               847864 ns       847372 ns          825
BM_lexicographical_compare_three_way<IntPtr>/262144                             3394918 ns      3393657 ns          207
BM_lexicographical_compare_three_way<IntPtr>/1048576                           13569716 ns     13563941 ns           51
BM_lexicographical_compare_three_way<random_access_iterator<IntPtr>>/1             37.7 ns         37.7 ns     18632581
BM_lexicographical_compare_three_way<random_access_iterator<IntPtr>>/4             85.6 ns         85.6 ns      8158698
BM_lexicographical_compare_three_way<random_access_iterator<IntPtr>>/16             286 ns          285 ns      2474206
BM_lexicographical_compare_three_way<random_access_iterator<IntPtr>>/64            1036 ns         1035 ns       676002
BM_lexicographical_compare_three_way<random_access_iterator<IntPtr>>/256           4018 ns         4017 ns       167247
BM_lexicographical_compare_three_way<random_access_iterator<IntPtr>>/1024         16138 ns        15993 ns        43945
BM_lexicographical_compare_three_way<random_access_iterator<IntPtr>>/4096         63604 ns        63602 ns        10994
BM_lexicographical_compare_three_way<random_access_iterator<IntPtr>>/16384       255664 ns       255574 ns         2749
BM_lexicographical_compare_three_way<random_access_iterator<IntPtr>>/65536      1025497 ns      1025498 ns          683
BM_lexicographical_compare_three_way<random_access_iterator<IntPtr>>/262144     4113879 ns      4111688 ns          170
BM_lexicographical_compare_three_way<random_access_iterator<IntPtr>>/1048576   16455950 ns     16454558 ns           43
BM_lexicographical_compare_three_way<cpp17_input_iterator<IntPtr>>/1               29.5 ns         29.5 ns     23748940
BM_lexicographical_compare_three_way<cpp17_input_iterator<IntPtr>>/4               99.7 ns         99.7 ns      6798031
BM_lexicographical_compare_three_way<cpp17_input_iterator<IntPtr>>/16               300 ns          300 ns      2339940
BM_lexicographical_compare_three_way<cpp17_input_iterator<IntPtr>>/64              1116 ns         1115 ns       632037
BM_lexicographical_compare_three_way<cpp17_input_iterator<IntPtr>>/256             4347 ns         4344 ns       161197
BM_lexicographical_compare_three_way<cpp17_input_iterator<IntPtr>>/1024           17271 ns        17270 ns        40467
BM_lexicographical_compare_three_way<cpp17_input_iterator<IntPtr>>/4096           68959 ns        68958 ns        10135
BM_lexicographical_compare_three_way<cpp17_input_iterator<IntPtr>>/16384         276437 ns       276348 ns         2533
BM_lexicographical_compare_three_way<cpp17_input_iterator<IntPtr>>/65536        1111640 ns      1111475 ns          629
BM_lexicographical_compare_three_way<cpp17_input_iterator<IntPtr>>/262144       4451482 ns      4450650 ns          157
BM_lexicographical_compare_three_way<cpp17_input_iterator<IntPtr>>/1048576     17807151 ns     17807154 ns           39

With a similar observation where random_access_iterator<int*> is consistently slower than int* (which makes sense in an unoptimized build).

Rerunning the build with -DCMAKE_BUILD_TYPE=Release, however, makes the difference go away -- now the timings are within the margin of error (not to mention many times faster):

BM_lexicographical_compare_three_way<IntPtr>/1                                    0.447 ns        0.444 ns   1000000000
BM_lexicographical_compare_three_way<IntPtr>/4                                     1.99 ns         1.96 ns    361161702
BM_lexicographical_compare_three_way<IntPtr>/16                                    5.86 ns         5.81 ns    121777252
BM_lexicographical_compare_three_way<IntPtr>/64                                    21.6 ns         21.4 ns     32736439
BM_lexicographical_compare_three_way<IntPtr>/256                                   97.6 ns         96.2 ns      7379296
BM_lexicographical_compare_three_way<IntPtr>/1024                                   346 ns          343 ns      2028815
BM_lexicographical_compare_three_way<IntPtr>/4096                                  1352 ns         1339 ns       515878
BM_lexicographical_compare_three_way<IntPtr>/16384                                 5356 ns         5312 ns       132878
BM_lexicographical_compare_three_way<IntPtr>/65536                                21379 ns        21179 ns        32955
BM_lexicographical_compare_three_way<IntPtr>/262144                               85883 ns        84892 ns         8327
BM_lexicographical_compare_three_way<IntPtr>/1048576                             359011 ns       355722 ns         1956
BM_lexicographical_compare_three_way<random_access_iterator<IntPtr>>/1            0.448 ns        0.444 ns   1000000000
BM_lexicographical_compare_three_way<random_access_iterator<IntPtr>>/4             1.96 ns         1.94 ns    359261768
BM_lexicographical_compare_three_way<random_access_iterator<IntPtr>>/16            5.89 ns         5.83 ns    121641817
BM_lexicographical_compare_three_way<random_access_iterator<IntPtr>>/64            21.5 ns         21.3 ns     32731847
BM_lexicographical_compare_three_way<random_access_iterator<IntPtr>>/256           97.2 ns         96.2 ns      7337449
BM_lexicographical_compare_three_way<random_access_iterator<IntPtr>>/1024           346 ns          344 ns      2040876
BM_lexicographical_compare_three_way<random_access_iterator<IntPtr>>/4096          1347 ns         1336 ns       516457
BM_lexicographical_compare_three_way<random_access_iterator<IntPtr>>/16384         5378 ns         5329 ns       131757
BM_lexicographical_compare_three_way<random_access_iterator<IntPtr>>/65536        21296 ns        21168 ns        32939
BM_lexicographical_compare_three_way<random_access_iterator<IntPtr>>/262144       85516 ns        84770 ns         8164
BM_lexicographical_compare_three_way<random_access_iterator<IntPtr>>/1048576     362302 ns       358839 ns         1979
BM_lexicographical_compare_three_way<cpp17_input_iterator<IntPtr>>/1              0.576 ns        0.568 ns   1000000000
BM_lexicographical_compare_three_way<cpp17_input_iterator<IntPtr>>/4               2.27 ns         2.26 ns    309139488
BM_lexicographical_compare_three_way<cpp17_input_iterator<IntPtr>>/16              7.85 ns         7.80 ns     87860227
BM_lexicographical_compare_three_way<cpp17_input_iterator<IntPtr>>/64              37.9 ns         37.7 ns     18627871
BM_lexicographical_compare_three_way<cpp17_input_iterator<IntPtr>>/256              131 ns          130 ns      5375353
BM_lexicographical_compare_three_way<cpp17_input_iterator<IntPtr>>/1024             503 ns          501 ns      1386276
BM_lexicographical_compare_three_way<cpp17_input_iterator<IntPtr>>/4096            1995 ns         1986 ns       352332
BM_lexicographical_compare_three_way<cpp17_input_iterator<IntPtr>>/16384           7947 ns         7919 ns        88986
BM_lexicographical_compare_three_way<cpp17_input_iterator<IntPtr>>/65536          31716 ns        31642 ns        22141
BM_lexicographical_compare_three_way<cpp17_input_iterator<IntPtr>>/262144        127179 ns       126551 ns         5454
BM_lexicographical_compare_three_way<cpp17_input_iterator<IntPtr>>/1048576       517207 ns       514653 ns         1360

I think that explains it -- we were assuming we're seeing optimized results which wasn't actually the case. It also means the code is doing the right thing, so there's no actual issue, which is great!

Edit: never mind my original comment, I got incorrect results. Update below:

@avogelsgesang Thanks for pointing out you were using RelWithDebInfo. Running in that mode, I get similar results to Release, with no divergence between int* and random_access_iterator<int*>:

-----------------------------------------------------------------------------------------------------------------------
Benchmark                                                                             Time             CPU   Iterations
-----------------------------------------------------------------------------------------------------------------------
BM_lexicographical_compare_three_way<IntPtr>/1                                    0.590 ns        0.568 ns   1000000000
BM_lexicographical_compare_three_way<IntPtr>/4                                     2.24 ns         2.24 ns    312197559
BM_lexicographical_compare_three_way<IntPtr>/16                                    6.05 ns         6.05 ns    116194144
BM_lexicographical_compare_three_way<IntPtr>/64                                    21.4 ns         21.4 ns     32794105
BM_lexicographical_compare_three_way<IntPtr>/256                                   94.8 ns         94.8 ns      7386226
BM_lexicographical_compare_three_way<IntPtr>/1024                                   340 ns          340 ns      2051468
BM_lexicographical_compare_three_way<IntPtr>/4096                                  1319 ns         1319 ns       530854
BM_lexicographical_compare_three_way<IntPtr>/16384                                 5332 ns         5284 ns       132817
BM_lexicographical_compare_three_way<IntPtr>/65536                                20946 ns        20945 ns        33471
BM_lexicographical_compare_three_way<IntPtr>/262144                               83706 ns        83701 ns         8370
BM_lexicographical_compare_three_way<IntPtr>/1048576                             348663 ns       348648 ns         1981
BM_lexicographical_compare_three_way<random_access_iterator<IntPtr>>/1            0.557 ns        0.557 ns   1000000000
BM_lexicographical_compare_three_way<random_access_iterator<IntPtr>>/4             2.20 ns         2.20 ns    313624287
BM_lexicographical_compare_three_way<random_access_iterator<IntPtr>>/16            6.04 ns         6.04 ns    117445723
BM_lexicographical_compare_three_way<random_access_iterator<IntPtr>>/64            27.8 ns         27.8 ns     25139524
BM_lexicographical_compare_three_way<random_access_iterator<IntPtr>>/256           95.5 ns         94.7 ns      7396059
BM_lexicographical_compare_three_way<random_access_iterator<IntPtr>>/1024           339 ns          339 ns      2072201
BM_lexicographical_compare_three_way<random_access_iterator<IntPtr>>/4096          1319 ns         1319 ns       530504
BM_lexicographical_compare_three_way<random_access_iterator<IntPtr>>/16384         5245 ns         5245 ns       133212
BM_lexicographical_compare_three_way<random_access_iterator<IntPtr>>/65536        20917 ns        20917 ns        33457
BM_lexicographical_compare_three_way<random_access_iterator<IntPtr>>/262144       83482 ns        83481 ns         8363
BM_lexicographical_compare_three_way<random_access_iterator<IntPtr>>/1048576     346843 ns       346843 ns         2018
BM_lexicographical_compare_three_way<cpp17_input_iterator<IntPtr>>/1              0.627 ns        0.627 ns   1000000000
BM_lexicographical_compare_three_way<cpp17_input_iterator<IntPtr>>/4               2.92 ns         2.92 ns    239622629
BM_lexicographical_compare_three_way<cpp17_input_iterator<IntPtr>>/16              12.4 ns         12.4 ns     55910543
BM_lexicographical_compare_three_way<cpp17_input_iterator<IntPtr>>/64              50.1 ns         50.1 ns     13777900
BM_lexicographical_compare_three_way<cpp17_input_iterator<IntPtr>>/256              206 ns          206 ns      3390586
BM_lexicographical_compare_three_way<cpp17_input_iterator<IntPtr>>/1024             807 ns          807 ns       861390
BM_lexicographical_compare_three_way<cpp17_input_iterator<IntPtr>>/4096            3286 ns         3286 ns       214230
BM_lexicographical_compare_three_way<cpp17_input_iterator<IntPtr>>/16384          12841 ns        12841 ns        54509
BM_lexicographical_compare_three_way<cpp17_input_iterator<IntPtr>>/65536          52191 ns        51713 ns        13388
BM_lexicographical_compare_three_way<cpp17_input_iterator<IntPtr>>/262144        204980 ns       204978 ns         3411
BM_lexicographical_compare_three_way<cpp17_input_iterator<IntPtr>>/1048576       824384 ns       824375 ns          847

So my original guess about Debug was probably incorrect (though it did reproduce the relative difference in numbers pretty well). I'm not sure what happened, but since I'm seeing the expected results locally, I think everything's good with the patch, and it's good to proceed.

var-const added inline comments.Dec 8 2022, 10:46 PM

libcxx/benchmarks/lexicographical_compare_three_way.bench.cpp
73 ↗	(On Diff #478741)	Right, please disregard (I might have been looking at a previous iteration of this code where iterators were initialized inside the loop).
28 ↗	(On Diff #479137)	Hmm, I think the `slow/fast_path` functions still need the comparator passed explicitly.
libcxx/include/__algorithm/lexicographical_compare_three_way.h
110	I think we are allowed to do that (as long as the difference is not observable), and it is done in a few places. I can't think of a valid way a user could observe the difference -- since we are allowed to copy or move the iterators in the implementation, they can't rely on move constructor not being called or a copy constructor only being called once.
libcxx/include/__algorithm/three_way_comp_ref_type.h
9 ↗	(On Diff #479137)	You might need to run something like `ninja -C build libcxx-generate-files` to take the new header into account.
27 ↗	(On Diff #479137)	Functions in this file should be marked `_LIBCPP_HIDE_FROM_ABI`.
30 ↗	(On Diff #479137)	Hmm, looks like the return type can be just `auto`? (if I'm reading this correctly, it can never return a reference)
37 ↗	(On Diff #479137)	Note: `_LIBCPP_INLINE_VISIBILITY` was used in old code, in new code we use `_LIBCPP_HIDE_FROM_ABI` for this (they do the same thing, though).
39 ↗	(On Diff #479137)	Question: do we need this `requires` clause? We only use the class in our internal code, so it seems like we shouldn't need this. If it's to validate the comparator given by the user, then it shouldn't only be done in the debug mode.
46 ↗	(On Diff #479137)	Hmm, shouldn't this be `== __expected`? IIUC, `__expected` is already "reversed".
53 ↗	(On Diff #479137)	Since this class's definition is only available in C++20 and above, this can be just `constexpr` (throughout the file).
libcxx/test/std/algorithms/alg.sorting/alg.three.way/lexicographical_compare_three_way.pass.cpp
4	Hmm, it's a great question. I'm not sure why this wasn't just `static_assert`ed before. I'll check -- in the meantime, I think it's best to leave the `static_assert` in place and add a comment to the test file to explain that `lexicographical_compare_three_way` is omitted intentionally.
libcxx/test/std/algorithms/robust_against_adl.compile.pass.cpp
109 ↗	(On Diff #479137)	This looks like it was commented out accidentally?
112 ↗	(On Diff #479137)	Can this be uncommented?

Adress a couple of varconst's comments. Not completely done, yet. Want to get feedback from the CI

libcxx/include/__algorithm/three_way_comp_ref_type.h
9 ↗	(On Diff #479137)	That seems like a no-op to me. At least, `git` does not mark any files as changed after running this

Harbormaster completed remote builds in B202430: Diff 481905.Dec 11 2022, 7:21 AM

ldionne added inline comments.Dec 14 2022, 11:20 AM

libcxx/include/__algorithm/lexicographical_compare_three_way.h
44	@avogelsgesang @philnik @var-const Is there a reason why we're testing for this here specifically? Isn't this a general property that we require from iterators? Also, why don't we check that the difference type is signed? That's what https://eel.is/c++draft/iterator.iterators#2.2 says. FWIW, I believe it would be preferable to add this assertion in a more principled way across the library, and also (especially) to do it in a separate patch. We probably need to add a temporary escape hatch for this one cause I would assume that it may break a lot of users.

philnik added inline comments.Dec 14 2022, 11:28 AM

libcxx/include/__algorithm/lexicographical_compare_three_way.h
44	@avogelsgesang @philnik @var-const Is there a reason why we're testing for this here specifically? Isn't this a general property that we require from iterators? We're adding it here because it's a new algorithm, so we can't break people. Also, why don't we check that the difference type is signed? That's what https://eel.is/c++draft/iterator.iterators#2.2 says. I don't remember why we only check that it's integral; I assume it was just an oversight. FWIW, I believe it would be preferable to add this assertion in a more principled way across the library, and also (especially) to do it in a separate patch. We probably need to add a temporary escape hatch for this one cause I would assume that it may break a lot of users. I agree that it would make sense to add it to all the algorithms, but I don't think it makes a lot of sense to add an escape hatch to a new algorithm.

ldionne added inline comments.Dec 14 2022, 4:09 PM

libcxx/include/__algorithm/lexicographical_compare_three_way.h
44	Good point about the new algorithm. Let's add it here unconditionally. And then we can pursue lifting this into something like `static_assert(__iterator_requirements_whatever<_It>)` that we can perhaps sprinkle in a few places (with a escape hatch for existing algorithms). Basically I'd like to avoid this being just a one-off. I think this assert is a good idea and would like to expand its use. Let's address the `signed integral` issue before we ship, though.

static_assert for signed integral
add missing includes

Harbormaster completed remote builds in B203774: Diff 483771.Dec 17 2022, 12:20 PM

LGTM w/ green CI and current review comments addressed. Some of us will be out for the holidays and I don't want this patch to be blocked from making progress artificially since it seems like everything has been addressed or at least discussed.

Is this blocked on anything? It would be awesome to finish it up and merge it in time for LLVM 17. I think this had pretty much all the approvals it needed to go ahead and only a few review comments had to be addressed.

libcxx/docs/Status/Cxx20Issues.csv
265 ↗	(On Diff #483771)
libcxx/docs/Status/Cxx2bIssues.csv
66

Is this blocked on anything?

I am unfortunately still running into linking issues with libc++ (latest update here) which means I am unable to run any tests locally. So far, my strategy was to give at a bit of time since maybe someone else with more experience than me might be running into the same problem and fix it for me. I just rebased on latest main, but the problems still persist. I guess, I will have to dig deeper on what is broken there.

Rebase on main; update status pages: 16.0 -> 17.0

Add test for __debug_three_way_comp

Harbormaster completed remote builds in B213297: Diff 496785.Feb 12 2023, 12:46 PM

avogelsgesang added inline comments.Feb 12 2023, 1:02 PM

libcxx/include/__algorithm/three_way_comp_ref_type.h
39 ↗	(On Diff #479137)	The idea is: For comparators which can be called as `cmp(Type1, Type2)` but not as `cmp(Type2, Type1)`, we want to skip this check, and fallback to the other `__do_compare_assert` further down. I copied this from `__algorithm/comp_ref_type.h`, which had decltype((void)std::declval<_Compare&>()( std::declval<_LHS &>(), std::declval<_RHS &>())) to check whether the parameters can be swapped

fix clang-tidy by replacing declval with std::declval

Harbormaster completed remote builds in B213299: Diff 496788.Feb 12 2023, 2:32 PM

Landing despite red CI, because:

clang-format is red because it is complaining about incorrectly formatting in a pre-existing file
"Apple back-deployment macosx-10.15" is failing due to some unrelated infrastructure error
clang-cl failed due to std/thread/thread.mutex/thread.mutex.requirements/thread.timedmutex.requirements/thread.timedmutex.recursive/try_lock_for.pass.cpp which is unrelated to this commit

Build error from Apple back-deployment test:

+ tar -xz --strip-components=1 -C /Users/libcxx-buildkite-agent/libcxx.buildkite-agent/builds/y10-8-macminivault-com/llvm-project/libcxx-ci/build/apple-system-backdeployment-10.15/macos-roots
  % Total    % Received % Xferd  Average Speed   Time    Time     Time  Current
                                 Dload  Upload   Total   Spent    Left  Speed
100   934  100   934    0     0     31      0  0:00:30  0:00:30 --:--:--   246
tar: Error opening archive: Unrecognized archive format

This revision was not accepted when it landed; it landed in state Needs Review.Feb 12 2023, 2:51 PM

This revision was landed with ongoing or failed builds.

Closed by commit rG2a06757a200c: [libc++][spaceship] Implement `lexicographical_compare_three_way` (authored by avogelsgesang). · Explain Why

This revision was automatically updated to reflect the committed changes.

avogelsgesang added a commit: rG2a06757a200c: [libc++][spaceship] Implement `lexicographical_compare_three_way`.

h-vetinari added a subscriber: h-vetinari.Feb 12 2023, 5:23 PM

h-vetinari added inline comments.

libcxx/docs/Status/Cxx20Issues.csv
264–265 ↗	(On Diff #496806)	This line is now duplicated.
libcxx/test/std/algorithms/robust_re_difference_type.compile.pass.cpp
148–153 ↗	(On Diff #496806)	Is it intentional that these tests are commented out? If so, the comment doesn't really elucidate why, or what would be necessary to enable them.

avogelsgesang added inline comments.Feb 13 2023, 12:48 AM

libcxx/test/std/algorithms/robust_re_difference_type.compile.pass.cpp
148–153 ↗	(On Diff #496806)	Yes, commenting them out was intentional, see discussion in https://reviews.llvm.org/D131395#inline-1342123 I hoped that the comment `lexicographical_compare_three_way` static_asserts that the difference type is an integer, as required by https://eel.is/c++draft/iterator.iterators#2.2 would explain why `lexicographical_compare_three_way` would reject the difference_type used in this test here, but now I realize that the comment is missing the signed integer requirement... On 2nd thought: maybe it would have been sufficient to use `PickyIterator<void, unsigned long>(a);` instead of `PickyIterator<void, long>(a);` ...

avogelsgesang marked an inline comment as done.Feb 13 2023, 4:34 PM

avogelsgesang added inline comments.

libcxx/docs/Status/Cxx20Issues.csv
264–265 ↗	(On Diff #496806)	Thanks! Fixed :)

H-G-Hristov mentioned this in D150188: [libc++][spaceship] Fixed `__debug_three_way_comp`'s `operator()` for `vector<bool>'s `operator<=>`.May 9 2023, 3:43 AM

Zingam mentioned this in rG8fe609cb3a70: [libc++][spaceship] Fixed `__debug_three_way_comp`'s `operator()` for….Jun 9 2023, 8:48 PM

Revision Contents

Path

Size

libcxx/

docs/

Status/

Cxx2bIssues.csv

2 lines

SpaceshipProjects.csv

2 lines

include/

CMakeLists.txt

1 line

__algorithm/

lexicographical_compare_three_way.h

61 lines

algorithm

13 lines

module.modulemap.in

1 line

test/

libcxx/

private_headers.verify.cpp

1 line

std/

algorithms/

alg.sorting/

alg.three.way/

lexicographical_compare_three_way.pass.cpp

98 lines

lexicographical_compare_three_way_comp.pass.cpp

123 lines

lexicographical_compare_three_way_comp.verify.cpp

40 lines

Diff 452929

libcxx/docs/Status/Cxx2bIssues.csv

Show First 20 Lines • Show All 57 Lines • ▼ Show 20 Lines

"`3495 <https://wg21.link/LWG3495>`__","``constexpr launder`` makes pointers to inactive members of unions usable","February 2021","|Nothing To Do|",""

"`3500 <https://wg21.link/LWG3500>`__","``join_view::iterator::operator->()`` is bogus","February 2021","|Complete|","14.0","|ranges|"

"`3502 <https://wg21.link/LWG3502>`__","``elements_view`` should not be allowed to return dangling reference","February 2021","","","|ranges|"

"`3505 <https://wg21.link/LWG3505>`__","``split_view::outer-iterator::operator++`` misspecified","February 2021","","","|ranges|"

"","","","",""

`2774 <https://wg21.link/LWG2774>`__,"``std::function`` construction vs assignment","June 2021","",""

`2818 <https://wg21.link/LWG2818>`__,"``::std::`` everywhere rule needs tweaking","June 2021","|Nothing To Do|",""

`2997 <https://wg21.link/LWG2997>`__,"LWG 491 and the specification of ``{forward_,}list::unique``","June 2021","",""

`3410 <https://wg21.link/LWG3410>`__,"``lexicographical_compare_three_way`` is overspecified","June 2021","","","|spaceship|"

`3410 <https://wg21.link/LWG3410>`__,"``lexicographical_compare_three_way`` is overspecified","June 2021","|Complete|","16.0","|spaceship|"

philnikUnsubmitted

Done

You also implement LWG3350, right?

philnik: You also implement LWG3350, right?

avogelsgesangAuthorUnsubmitted

Done

indeed. Good catch!

avogelsgesang: indeed. Good catch!

ldionneUnsubmitted

Done

`2997 <https://wg21.link/LWG2997>`__,"LWG 491 and the specification of ``{forward_,}list::unique``","June 2021","",""

- `3410 <https://wg21.link/LWG3410>`__,"``lexicographical_compare_three_way`` is overspecified","June 2021","|Complete|","16.0","|spaceship|"

+ `3410 <https://wg21.link/LWG3410>`__,"``lexicographical_compare_three_way`` is overspecified","June 2021","|Complete|","17.0","|spaceship|"

`3430 <https://wg21.link/LWG3430>`__,"``std::fstream`` & co. should be constructible from string_view","June 2021","",""

ldionne:

`3430 <https://wg21.link/LWG3430>`__,"``std::fstream`` & co. should be constructible from string_view","June 2021","",""

`3462 <https://wg21.link/LWG3462>`__,"§[formatter.requirements]: Formatter requirements forbid use of ``fc.arg()``","June 2021","","","|format|"

`3481 <https://wg21.link/LWG3481>`__,"``viewable_range`` mishandles lvalue move-only views","June 2021","Superseded by `P2415R2 <https://wg21.link/P2415R2>`__","","|ranges|"

`3506 <https://wg21.link/LWG3506>`__,"Missing allocator-extended constructors for ``priority_queue``","June 2021","|Complete|","14.0"

`3517 <https://wg21.link/LWG3517>`__,"``join_view::iterator``'s ``iter_swap`` is underconstrained","June 2021","|Complete|","14.0","|ranges|"

`3518 <https://wg21.link/LWG3518>`__,"Exception requirements on char trait operations unclear","June 2021","|Nothing To Do|",""

`3519 <https://wg21.link/LWG3519>`__,"Incomplete synopses for ``<random>`` classes","June 2021","",""

`3520 <https://wg21.link/LWG3520>`__,"``iter_move`` and ``iter_swap`` are inconsistent for ``transform_view::iterator``","June 2021","|Complete|","14.0","|ranges|"

▲ Show 20 Lines • Show All 117 Lines • Show Last 20 Lines

libcxx/docs/Status/SpaceshipProjects.csv

	Section,Description,Dependencies,Assignee,Complete			Section,Description,Dependencies,Assignee,Complete
	\| `[cmp.concept] <https://wg21.link/cmp.concept>`_,"\| `three_way_comparable <https://reviews.llvm.org/D103478>`_			\| `[cmp.concept] <https://wg21.link/cmp.concept>`_,"\| `three_way_comparable <https://reviews.llvm.org/D103478>`_
	\| `three_way_comparable_with <https://reviews.llvm.org/D103478>`_",None,Ruslan Arutyunyan,\|Complete\|			\| `three_way_comparable_with <https://reviews.llvm.org/D103478>`_",None,Ruslan Arutyunyan,\|Complete\|
	\| `[cmp.result] <https://wg21.link/cmp.result>`_,\| `compare_three_way_result <https://reviews.llvm.org/D103581>`_,None,Arthur O'Dwyer,\|Complete\|			\| `[cmp.result] <https://wg21.link/cmp.result>`_,\| `compare_three_way_result <https://reviews.llvm.org/D103581>`_,None,Arthur O'Dwyer,\|Complete\|
	\| `[expos.only.func] <https://wg21.link/expos.only.func>`_,"\| `synth-three-way <https://reviews.llvm.org/D107721>`_			\| `[expos.only.func] <https://wg21.link/expos.only.func>`_,"\| `synth-three-way <https://reviews.llvm.org/D107721>`_
	\| `synth-three-way-result <https://reviews.llvm.org/D107721>`_",[cmp.concept],Kent Ross,\|Complete\|			\| `synth-three-way-result <https://reviews.llvm.org/D107721>`_",[cmp.concept],Kent Ross,\|Complete\|
	\| `[comparisons.three.way] <https://wg21.link/comparisons.three.way>`_,\| `compare_three_way <https://reviews.llvm.org/D80899>`_,[cmp.concept],Arthur O'Dwyer,\|Complete\|			\| `[comparisons.three.way] <https://wg21.link/comparisons.three.way>`_,\| `compare_three_way <https://reviews.llvm.org/D80899>`_,[cmp.concept],Arthur O'Dwyer,\|Complete\|
	\| `[cmp.alg] <https://wg21.link/cmp.alg>`_,"\| `strong_order <https://reviews.llvm.org/D110738>`_			\| `[cmp.alg] <https://wg21.link/cmp.alg>`_,"\| `strong_order <https://reviews.llvm.org/D110738>`_
	\| `weak_order <https://reviews.llvm.org/D110738>`_			\| `weak_order <https://reviews.llvm.org/D110738>`_
	\| `partial_order <https://reviews.llvm.org/D110738>`_			\| `partial_order <https://reviews.llvm.org/D110738>`_
	\| `strong_order_fallback <https://reviews.llvm.org/D111514>`_			\| `strong_order_fallback <https://reviews.llvm.org/D111514>`_
	\| `weak_order_fallback <https://reviews.llvm.org/D111514>`_			\| `weak_order_fallback <https://reviews.llvm.org/D111514>`_
	\| `partial_order_fallback <https://reviews.llvm.org/D111514>`_",None,Arthur O'Dwyer,\|Complete\| [#note-strongorder]_			\| `partial_order_fallback <https://reviews.llvm.org/D111514>`_",None,Arthur O'Dwyer,\|Complete\| [#note-strongorder]_
	\| `[alg.three.way] <https://wg21.link/alg.three.way>`_,\| `lexicographical_compare_three_way <https://reviews.llvm.org/D80902>`_,[comparisons.three.way],Christopher Di Bella,\|In Progress\|			\| `[alg.three.way] <https://wg21.link/alg.three.way>`_,\| `lexicographical_compare_three_way <https://reviews.llvm.org/D131395>`_,[comparisons.three.way],Adrian Vogelsgesang,\|Complete\|
	\| `[type.info] <https://wg21.link/type.info>`_,\| `typeinfo <https://reviews.llvm.org/D130853>`_,None,Adrian Vogelsgesang,\|Complete\|			\| `[type.info] <https://wg21.link/type.info>`_,\| `typeinfo <https://reviews.llvm.org/D130853>`_,None,Adrian Vogelsgesang,\|Complete\|
	\| `[coroutine.handle.compare] <https://wg21.link/coroutine.handle.compare>`_,\| `coroutine_handle <https://reviews.llvm.org/D109433>`_,[comparisons.three.way],Chuanqi Xu,\|Complete\|			\| `[coroutine.handle.compare] <https://wg21.link/coroutine.handle.compare>`_,\| `coroutine_handle <https://reviews.llvm.org/D109433>`_,[comparisons.three.way],Chuanqi Xu,\|Complete\|
	\| `[pairs.spec] <https://wg21.link/pairs.spec>`_,\| `pair <https://reviews.llvm.org/D107721>`_,[expos.only.func],Kent Ross,\|Complete\|			\| `[pairs.spec] <https://wg21.link/pairs.spec>`_,\| `pair <https://reviews.llvm.org/D107721>`_,[expos.only.func],Kent Ross,\|Complete\|
	\| `[syserr.errcat.nonvirtuals] <https://wg21.link/syserr.errcat.nonvirtuals>`_,\| `error_category <https://reviews.llvm.org/D131363>`_,[comparisons.three.way],Adrian Vogelsgesang,\|Complete\|			\| `[syserr.errcat.nonvirtuals] <https://wg21.link/syserr.errcat.nonvirtuals>`_,\| `error_category <https://reviews.llvm.org/D131363>`_,[comparisons.three.way],Adrian Vogelsgesang,\|Complete\|
	\| `[syserr.compare] <https://wg21.link/syserr.compare>`_,"\| `error_code <https://reviews.llvm.org/D131371>`_			\| `[syserr.compare] <https://wg21.link/syserr.compare>`_,"\| `error_code <https://reviews.llvm.org/D131371>`_
	\| `error_condition <https://reviews.llvm.org/D131371>`_",None,Adrian Vogelsgesang,\|Complete\|			\| `error_condition <https://reviews.llvm.org/D131371>`_",None,Adrian Vogelsgesang,\|Complete\|
	\| `[tuple.rel] <https://wg21.link/tuple.rel>`_,\| `tuple <https://reviews.llvm.org/D108250>`_,[expos.only.func],Kent Ross,\|Complete\|			\| `[tuple.rel] <https://wg21.link/tuple.rel>`_,\| `tuple <https://reviews.llvm.org/D108250>`_,[expos.only.func],Kent Ross,\|Complete\|
	"\| `[optional.relops] <https://wg21.link/optional.relops>`_			"\| `[optional.relops] <https://wg21.link/optional.relops>`_
	▲ Show 20 Lines • Show All 59 Lines • Show Last 20 Lines

libcxx/include/CMakeLists.txt

Show All 37 Lines	set(files
__algorithm/is_heap_until.h		__algorithm/is_heap_until.h
__algorithm/is_partitioned.h		__algorithm/is_partitioned.h
__algorithm/is_permutation.h		__algorithm/is_permutation.h
__algorithm/is_sorted.h		__algorithm/is_sorted.h
__algorithm/is_sorted_until.h		__algorithm/is_sorted_until.h
__algorithm/iter_swap.h		__algorithm/iter_swap.h
__algorithm/iterator_operations.h		__algorithm/iterator_operations.h
__algorithm/lexicographical_compare.h		__algorithm/lexicographical_compare.h
		__algorithm/lexicographical_compare_three_way.h
__algorithm/lower_bound.h		__algorithm/lower_bound.h
__algorithm/make_heap.h		__algorithm/make_heap.h
__algorithm/make_projected.h		__algorithm/make_projected.h
__algorithm/max.h		__algorithm/max.h
__algorithm/max_element.h		__algorithm/max_element.h
__algorithm/merge.h		__algorithm/merge.h
__algorithm/min.h		__algorithm/min.h
__algorithm/min_element.h		__algorithm/min_element.h
▲ Show 20 Lines • Show All 806 Lines • Show Last 20 Lines

libcxx/include/__algorithm/lexicographical_compare_three_way.h

This file was added.

//===----------------------------------------------------------------------===//

// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.

// See https://llvm.org/LICENSE.txt for license information.

// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception

//===----------------------------------------------------------------------===//

#ifndef _LIBCPP___ALGORITHM_LEXICOGRAPHICAL_COMPARE_THREE_WAY_H

#define _LIBCPP___ALGORITHM_LEXICOGRAPHICAL_COMPARE_THREE_WAY_H

#include <__compare/compare_three_way.h>

#include <__compare/ordering.h>

#include <__config>

#if !defined(_LIBCPP_HAS_NO_PRAGMA_SYSTEM_HEADER)

# pragma GCC system_header

#endif

_LIBCPP_BEGIN_NAMESPACE_STD

#if _LIBCPP_STD_VER > 17

template <class _InputIterator1, class _InputIterator2, class _Cmp>

var-constUnsubmitted

Done

Is this header still used?

var-const: Is this header still used?

_LIBCPP_HIDE_FROM_ABI constexpr auto lexicographical_compare_three_way(

huixie90Unsubmitted

Done

#if _LIBCPP_STD_VER > 17

- template <class InputIterator1, class InputIterator2, class Cmp>

+ template <class _InputIterator1, class _InputIterator2, class _Cmp>

_LIBCPP_HIDE_FROM_ABI constexpr auto lexicographical_compare_three_way(

huixie90:

huixie90Unsubmitted

Done

Thanks for updating these names but I think the last one Cmp has not been unglified

huixie90: Thanks for updating these names but I think the last one `Cmp` has not been unglified

_InputIterator1 __first1, _InputIterator1 __last1, _InputIterator2 __first2, _InputIterator2 __last2, _Cmp __comp)

-> decltype(__comp(*__first1, *__first2)) {

static_assert(__comparison_category<decltype(__comp(*__first1, *__first2))>,

"The comparator passed to lexicographical_compare_three_way must return a comparison category type");

while (true) {

bool __exhausted1 = __first1 == __last1;

bool __exhausted2 = __first2 == __last2;

if (__exhausted1 || __exhausted2) {

if (!__exhausted1)

philnikUnsubmitted

Done

"The comparator passed to lexicographical_compare_three_way must return a comparison category type");

- if constexpr (is_base_of_v<random_access_iterator_tag,

- typename iterator_traits<_InputIterator1>::iterator_category> &&

- is_base_of_v<random_access_iterator_tag,

- typename iterator_traits<_InputIterator2>::iterator_category>) {

+ if constexpr (__is_cpp17_random_access_iterator<_InputIterator1> && __is_cpp17_random_access_iterator<_InputIterator2>) {

// Fast path for random access iterators which computes the number of loop iterations up-front and

philnik:

return strong_ordering::greater;

if (!__exhausted2)

var-constUnsubmitted

Done

Optional: consider refactoring both branches into helper functions.

var-const: Optional: consider refactoring both branches into helper functions.

avogelsgesangAuthorUnsubmitted

Done

Done. In particular, because this allows me to address your other benchmarking request more easily: I can now directly call the slow and fast paths from the benchmark

avogelsgesang: Done. In particular, because this allows me to address your other benchmarking request more…

return strong_ordering::less;

return strong_ordering::equal;

}

huixie90Unsubmitted

Done

You can search for the usage of the header <__undef_macros> to see how to undef min/max macros

huixie90: You can search for the usage of the header `<__undef_macros>` to see how to undef min/max macros

auto __c = __comp(*__first1, *__first2);

if (__c != 0) {

return __c;

var-constUnsubmitted

Done

Nit: I think this would read a little better with a verb, e.g. Using a non-integral difference_type is undefined behavior.

Also, this seems a little overkill -- I'm pretty sure there are many places in algorithms where we subtract random access iterators and expect to get an integral type, without checking. I don't object to the static_asserts, but the comment (starting from We rely on the fact...) seems a little unnecessary to me (the part about undefined behavior is already captured in the static assertion).

var-const: Nit: I think this would read a little better with a verb, e.g. `Using a non-integral…

ldionneUnsubmitted

Done

@avogelsgesang @philnik @var-const

Is there a reason why we're testing for this *here* specifically? Isn't this a general property that we require from iterators?

Also, why don't we check that the difference type is signed? That's what https://eel.is/c++draft/iterator.iterators#2.2 says.

FWIW, I believe it would be preferable to add this assertion in a more principled way across the library, and also (especially) to do it in a separate patch. We probably need to add a temporary escape hatch for this one cause I would assume that it may break a lot of users.

ldionne: @avogelsgesang @philnik @var-const Is there a reason why we're testing for this *here*…

philnikUnsubmitted

Done

@avogelsgesang @philnik @var-const

Is there a reason why we're testing for this *here* specifically? Isn't this a general property that we require from iterators?

We're adding it here because it's a new algorithm, so we can't break people.

Also, why don't we check that the difference type is signed? That's what https://eel.is/c++draft/iterator.iterators#2.2 says.

I don't remember why we only check that it's integral; I assume it was just an oversight.

FWIW, I believe it would be preferable to add this assertion in a more principled way across the library, and also (especially) to do it in a separate patch. We probably need to add a temporary escape hatch for this one cause I would assume that it may break a lot of users.

I agree that it would make sense to add it to all the algorithms, but I don't think it makes a lot of sense to add an escape hatch to a new algorithm.

philnik: > @avogelsgesang @philnik @var-const > > Is there a reason why we're testing for this *here*…

ldionneUnsubmitted

Done

Good point about the new algorithm. Let's add it here unconditionally. And then we can pursue lifting this into something like static_assert(__iterator_requirements_whatever<_It>) that we can perhaps sprinkle in a few places (with a escape hatch for existing algorithms).

Basically I'd like to avoid this being just a one-off. I think this assert is a good idea and would like to expand its use.

Let's address the signed integral issue before we ship, though.

ldionne: Good point about the new algorithm. Let's add it here unconditionally. And then we can pursue…

}

++__first1;

MordanteUnsubmitted

Done

"A non-integral difference_type is undefined behavior");

- static_assert(is_integral_v<typename iterator_traits<_InputIterator2>::difference_type>,

- "A non-integral difference_type is undefined behavior");

+ static_assert(integral<__iter_diff_t<_InputIterator2>>, "A non-integral difference_type for _InputIterator2 is undefined behavior");

auto __len1 = __last1 - __first1;

At Discord we discovered concepts give nicer error messages. Can you test whether that's true here too, by using integral instead of is_integral_v?

Since you use two asserts, lets give additional information; alternatively merge them in one assert.

Mordante: At Discord we discovered concepts give nicer error messages. Can you test whether that's true…

philnikUnsubmitted

Done

Instead of having a lot of static_asserts here, maybe we want to add the exposition only concepts for Cpp17Iterators from the standard somewhere and static_assert that here?

philnik: Instead of having a lot of `static_assert`s here, maybe we want to add the exposition only…

avogelsgesangAuthorUnsubmitted

Done

[...] maybe we want to add the exposition only concepts for Cpp17Iterators from the standard somewhere and static_assert that here?

You mean

static_assert(__cpp17_iterator<_InputIterator1>, "calling lexicographical_compare_three_way with a non-standard-compliant iterators is undefined behavior");
static_assert(__cpp17_iterator<_InputIterator2>, "calling lexicographical_compare_three_way with a non-standard-compliant iterators is undefined behavior");

avogelsgesang: > [...] maybe we want to add the exposition only concepts for Cpp17Iterators from the standard…

avogelsgesangAuthorUnsubmitted

Done

To me, the goal of the

static_assert(is_integral_v<typename iterator_traits<_InputIterator1>::difference_type>,
              "Using a non-integral difference_type is undefined behavior");
static_assert(is_integral_v<typename iterator_traits<_InputIterator2>::difference_type>,
              "Using a non-integral difference_type is undefined behavior");

avogelsgesang: To me, the goal of the ``` static_assert(is_integral_v<typename…

avogelsgesangAuthorUnsubmitted

Done

Can you test whether that's true here too, by using integral instead of is_integral_v?

https://godbolt.org/z/rKdPq9joo

Given that we implemented the integral as

concept integral = is_integral_v<_Tp>;

using a concept here doesn't improve the error message. It only adds a more complicated backtrace to the error message which does not really provide much value.

avogelsgesang: > Can you test whether that's true here too, by using integral instead of is_integral_v? https…

huixie90Unsubmitted

Done

I would Not use common_type_t as it might not exist a common_type as common_type does not work for types that need implicit conversions.
The relevant spec here:
https://eel.is/c++draft/iterator.concept.winc#6.sentence-1
In short, integral types are only "explicitly" convertible to each other, not implicitly.
There are related discussion here: https://github.com/ericniebler/range-v3/issues/1745

I think we want to find which difference_type is wider (or we need another type to cover both) and then we need to static_cast to that type

huixie90: I would Not use `common_type_t` as it might not exist a `common_type` as `common_type` does not…

avogelsgesangAuthorUnsubmitted

Done

I cannot quite follow. Which part of common_type_t does not work for this use case?

I think we want to find which difference_type is wider (or we need another type to cover both)

My understanding is that common_type_t does exactly that. Note that https://eel.is/c++draft/iterator.iterators#2.2 guarantees that difference_type is a signed integral. As such, there can be no mismatches on signedness, and common_type_t should always give the wider `difference_type

then we need to static_cast to that type

Why do we need static_casts instead of relying on implicit conversions? https://eel.is/c++draft/iterator.concept.winc#6.sentence-1 states that casting to a wider integer type is an implicit conversion.

avogelsgesang: I cannot quite follow. Which part of `common_type_t` does not work for this use case? > I…

++__first2;

huixie90Unsubmitted

Done

agree with @philnik that providing benchmark to show the difference.

question: the standard specifies return (last1 - first1) <=> (last2 - first2); but here you are using size_t instead of iterator's difference_type. so effectively
return static_cast<size_t>(last1 - first1) <=> static_cast<size_t>(last2 - first2);. Does this have an observable difference?

huixie90: agree with @philnik that providing benchmark to show the difference. question: the standard…

JohelEGPUnsubmitted

Done

It should if the difference type is an integer-class type.

JohelEGP: It should if the difference type is an integer-class type.

philnikUnsubmitted

Done

I don't think it has any observable effects, but we shouldn't cast to different types, since that can prohibit some optimizations.

philnik: I don't think it has any observable effects, but we shouldn't cast to different types, since…

mumbleskatesUnsubmitted

Done

i'm not even 100% sure what is allowed to be assumed about difference_type; i would definitely avoid using size_t here and stick entirely to the iterator's difference type.

mumbleskates: i'm not even 100% sure what is allowed to be assumed about `difference_type`; i would…

avogelsgesangAuthorUnsubmitted

Done

Using difference_type now, and also only using this optimization if both iterator types have the same, integral difference_type

avogelsgesang: Using `difference_type` now, and also only using this optimization if both iterator types have…

philnikUnsubmitted

Done

difference_type is guaranteed to be a signed integral (http://eel.is/c++draft/iterator.traits#2). Otherwise your program if ill-formed. I'd just use ptrdiff_t as the type here. No need to not optimize just because the difference_types are different.

philnik: `difference_type` is guaranteed to be a signed integral (http://eel.is/c++draft/iterator.

JohelEGPUnsubmitted

Done

difference_type is guaranteed to be a signed integral (http://eel.is/c++draft/iterator.traits#2).

Not guaranteed, but it's a requirement:

(4.1) If an algorithm's template parameter is named InputIterator, InputIterator1, or InputIterator2, the template argument shall meet the Cpp17InputIterator requirements ([input.iterators]).

https://eel.is/c++draft/algorithms.requirements#4.1

1# A class or pointer type X meets the requirements of an input iterator for the value type T if X meets the Cpp17Iterator ([iterator.iterators]) and Cpp17EqualityComparable (Table 29) requirements and the expressions in Table 85 are valid and have the indicated semantics.

https://eel.is/c++draft/input.iterators#1

(2.2) iterator_traits<X>::difference_type is a signed integer type or void, and

https://eel.is/c++draft/iterator.iterators#2.2

Otherwise your program if ill-formed.

Not really. It's this:

1# In certain cases (replacement functions, handler functions, operations on types used to instantiate standard library template components), the C++ standard library depends on components supplied by a C++ program. If these components do not meet their requirements, this document places no requirements on the implementation.
2# In particular, the behavior is undefined in the following cases:
[...]
(2.3) For types used as template arguments when instantiating a template component, if the operations on the type do not implement the semantics of the applicable Requirements subclause ([allocator.requirements], [container.requirements], [iterator.requirements], [ algorithms.requirements], [numeric.requirements]). Operations on such types can report a failure by throwing an exception unless otherwise specified.
[...]

https://eel.is/c++draft/res.on.functions

An example for std::lexicographical_compare_three_way would be the iterators of an object of type std::ranges::iota_view<std::intmax_t, std::intmax_t> (https://eel.is/c++draft/ranges#range.iota.view-1.3).

JohelEGP: > difference_type is guaranteed to be a signed integral (http://eel.is/c++draft/iterator.

philnikUnsubmitted

Done

difference_type is guaranteed to be a signed integral (http://eel.is/c++draft/iterator.traits#2).

Not guaranteed, but it's a requirement:

What exactly is the difference between "guaranteed" and "requirement"?

Otherwise your program if ill-formed.

Not really. It's this:

1# In certain cases (replacement functions, handler functions, operations on types used to instantiate standard library template components), the C++ standard library depends on components supplied by a C++ program. If these components do not meet their requirements, this document places no requirements on the implementation.

In other words IFNDR.

An example for std::lexicographical_compare_three_way would be the iterators of an object of type std::ranges::iota_view<std::intmax_t, std::intmax_t> (https://eel.is/c++draft/ranges#range.iota.view-1.3).

I'm not sure what you are trying to say here.

philnik: > > difference_type is guaranteed to be a signed integral (http://eel.is/c++draft/iterator.

JohelEGPUnsubmitted

Done

In other words IFNDR.

Undefined, by p2.

What exactly is the difference between "guaranteed" and "requirement"?

The latter is more formal. The former does not imply that it's definitely going to be an integral type. The library gives you license to treat the difference type as an integral type by means of it being anything else being UB. However, I didn't want to confuse the PR submitter by being hand-wavy.

I'm not sure what you are trying to say here.

Just illustrating an unfortunate example for which you can't call std::lexicographical_compare_three_way without it being UB.

JohelEGP: > In other words IFNDR. Undefined, by p2. > What exactly is the difference between…

avogelsgesangAuthorUnsubmitted

Done

thanks for those additional details!

In the updated version, I now use ptrdiff_t as suggested by @philnik. Furthermore, I changed the is_integral check into a static_assert. Instead of exploiting undefined behavior and (potentially silently) generating, I think we should rather diagnose it

avogelsgesang: thanks for those additional details! In the updated version, I now use `ptrdiff_t` as…

mumbleskatesUnsubmitted

Done

(the below comments accidentally left un-sent in draft on 8/17)

sorry if my comments have been incompletely informed! i didn't remember that the function can operate on heterogeneous iterator types.

if we're confident that ptrdiff_t is always the widest signed int we can see in any situation I'm good with that; otherwise it might be needed to choose a type that's the widest of ptrdiff_t and the two difference types, as a safeguard. it might de-optimize when a different type is needed but it seems like a less than optimal situation in the first place, and the alternative may simply be wrong.

(end late comments)

great! glad we're using auto here now, i do believe that natural promotion should be the most reliable way to get the widest type.

mumbleskates: (the below comments accidentally left un-sent in draft on 8/17) sorry if my comments have been…

JohelEGPUnsubmitted

Done

Is that really the right thing to do, though? You used to be able to call an unconstrained algorithm with non-conforming iterators so long as all operations actually used by the algorithm were well-behaved. Has that changed? In this case, you'd also get the correct answer with iterators with a difference type that is an integer-class type so long as the actual difference doesn't exceed the max. value of ptrdiff_t.

JohelEGP: Is that really the right thing to do, though? You used to be able to call an unconstrained…

philnikUnsubmitted

Done

Calling algorithms with non-conforming iterators was always and will always be UB. Just fix your iterators, it's not that hard.

philnik: Calling algorithms with non-conforming iterators was always and will always be UB. Just fix…

mumbleskatesUnsubmitted

Done

i actually had a thought about testing here: would it be possible to create a regression test case that is able to demonstrate this capability?

For example, if it is possible to specify that the type of ptrdiff_t is int32_t, ranges::iota_view<int64_t>(0, 1) should lexically compare as less than ranges::iota_view<int64_t>(0, 0x100000000). If the a narrower type were used for the lengths it would determine that the latter view was the shorter one.

if i'm actually looking at it though i'm not sure there could be any good way to actually swap the definition of ptrdiff_t. however, we could at least safeguard against this on architectures where ptrdiff_t is narrower than intmax_t, by comparing ranges::iota_view(0, 1) against ranges::iota_view<intmax_t>(0, ((intmax_t)numeric_limits<ptrdiff_t>::max) + 1).

mumbleskates: i actually had a thought about testing here: would it be possible to create a regression test…

}

philnikUnsubmitted

Done

It would be great if you could provide a benchmark to show the performance difference.

philnik: It would be great if you could provide a benchmark to show the performance difference.

philnikUnsubmitted

Done

(Thanks to @Quuxplusone for pointing it out!) Using ptrdiff_t is actually a bug because it might be smaller than the largest signed integer. Probably the easiest thing to do is to use auto instead and replace the std::min() with a ternary. For __i you could maybe use decltype(__min_len). There might be a better way to do all this, I'm not sure.

philnik: (Thanks to @Quuxplusone for pointing it out!) Using `ptrdiff_t` is actually a bug because it…

avogelsgesangAuthorUnsubmitted

Done

Added a benchmark now. It shows a 30% improvement with the fast path

avogelsgesang: Added a benchmark now. It shows a 30% improvement with the fast path

var-constUnsubmitted

Done

As noted in another comment, can you please rerun the benchmark so that it compares int* with optimizations vs. int* without optimizations?

var-const: As noted in another comment, can you please rerun the benchmark so that it compares `int*` with…

avogelsgesangAuthorUnsubmitted

Done

yes, rerunning the benchmark is definitely still on my todo list. It takes longer than expected, because it seems that one of the refactorings during this review destroyed the optimization. I can still reproduce the numbers on the old commit, but on the current review the fast path is no longer faster than the default path. I will need some time to figure out what exactly lead to the regression here...

avogelsgesang: yes, rerunning the benchmark is definitely still on my todo list. It takes longer than expected…

}

MordanteUnsubmitted

Done

Why not using std::min here? I assume it is because the types of __len1 and __len2 can differ.

I'm not happy with all the autos in this code; it's hard to understand what the intended types are and whether these are the correct intended types. Please remove the autos so it's clear what the types are.

I really would like something along the lines of

__iter_diff_t<_InputIterator1> __len1 = __last1 - __first1;
__iter_diff_t<_InputIterator2> __len2 = __last2 - __first2;
auto __min = std::min<common_type_t<decltype(__len1), decltype(__len2)>(__len1, __len2));

This is more verbose, but at least I can understand how the types are defined and whether this is as intended.

Mordante: Why not using `std::min` here? I assume it is because the types of `__len1` and `__len2` can…

philnikUnsubmitted

Done

I very much disagree with you here. IMO the __iter_diff_t<_InputIterator1> and so on just make the code more verbose and give the option to get it wrong. __last1 - __first1 will always return a difference type. That's not exactly surprising. Also, you want to use decltype() for the min for some reason. I guess because otherwise it's really long. How does that help with understanding the code?

philnik: I very much disagree with you here. IMO the `__iter_diff_t<_InputIterator1>` and so on just…

MordanteUnsubmitted

Done

It helps since the rules of the type of the conditional expression are not simple. I had to verify with the standard it does the right thing. So instead of spending a few seconds to validate these 3 lines I had to spend several minutes.

Code should be optimized for understanding by humans, auto quite often saves the writer from typing a few characters. (The compiler doesn't care either way it does its thing.)

The verbose code helps to communicate what the author of the code intended to happen. Relaying on some (not always well understood) language rules means it's less clear for the reader to understand what the writer intended. Both may have a different understanding of these rules.

I also considered to use 3 types

using _Len1 = __iter_diff_t<_InputIterator1>;
using _Len2 = __iter_diff_t<_InputIterator2>;
using _Min = common_type_t<_Len1, _Len2>;

_Len1 __len1 = __last1 - __first1;
_Len2 __len2 = __last2 - __first2;

_Min __min = std::min<_Min>(__len1, __len2);

I don't mind that solution either.

Mordante: It helps since the rules of the type of the conditional expression are not simple. I had to…

philnikUnsubmitted

Done

It helps since the rules of the type of the conditional expression are not simple. I had to verify with the standard it does the right thing. So instead of spending a few seconds to validate these 3 lines I had to spend several minutes.

I don't see how common_type helps with the type of the conditional not being simple. That's exactly what common_type uses to get the type in this case AFAICT (https://eel.is/c++draft/type.traits#meta.trans.other-3). Plus is has a heap of other conditionals that are hard to get through.

Code should be optimized for understanding by humans, auto quite often saves the writer from typing a few characters. (The compiler doesn't care either way it does its thing.)

I agree that code should be written to make it easier to read. IMO littering the code with types I don't really care about makes it harder to read. i.e. I don't really care that a - b returns a value of type __iter_diff_t<_InputIterator1>, but now I have to check that you actually named the correct type. Thinking about it, integral auto __len1 = __last1 - __first1 would be great. Not sure how much compile time overhead that would incur though. WDYT?

The verbose code helps to communicate what the author of the code intended to happen. [Relying] on some (not always well understood) language rules means it's less clear for the reader to understand what the writer intended. Both may have a different understanding of these rules.

But you are still relying on these rules through common_type AFAICT.

philnik: > It helps since the rules of the type of the conditional expression are not simple. I had to…

mumbleskatesUnsubmitted

Done

The point is that, while we are still relying on the exact same semantic under the hood, we are making an effort to spell out what we expect to happen, making it explicit that we thought of this case and fully intend this outcome. We can also ensure that future readers might also understand this happens when they might not have thought of it, whether in understanding what our code is doing or in making their own implementation some time in the future.

I like @Mordante's most recent version with the using declarations.

mumbleskates: The point is that, while we are still relying on the exact same semantic under the hood, we are…

avogelsgesangAuthorUnsubmitted

Done

I personally don't have an opinion here. Please come to an agreement here (or agree to disagree, and nevertheless agree with merging this code).

I am happy to update this patch in whichever way you agree on, but I want to avoid changing it forth and back (an earlier version of this review was actually written in terms of the difference type and changed based on feedback in this review)

avogelsgesang: I personally don't have an opinion here. Please come to an agreement here (or agree to disagree…

MordanteUnsubmitted

Done

It helps since the rules of the type of the conditional expression are not simple. I had to verify with the standard it does the right thing. So instead of spending a few seconds to validate these 3 lines I had to spend several minutes.

I don't see how common_type helps with the type of the conditional not being simple. That's exactly what common_type uses to get the type in this case AFAICT (https://eel.is/c++draft/type.traits#meta.trans.other-3). Plus is has a heap of other conditionals that are hard to get through.

It's at least clear that it's intended to use common_type and not one of the other rules of the conditional expression. IMO auto should never be used with the conditional expression.

Code should be optimized for understanding by humans, auto quite often saves the writer from typing a few characters. (The compiler doesn't care either way it does its thing.)

I agree that code should be written to make it easier to read. IMO littering the code with types I don't really care about makes it harder to read. i.e. I don't really care that a - b returns a value of type __iter_diff_t<_InputIterator1>, but now I have to check that you actually named the correct type. Thinking about it, integral auto __len1 = __last1 - __first1 would be great. Not sure how much compile time overhead that would incur though. WDYT?

I care about these types when I try to understand the code and validate whether the author wrote the code correctly.
It takes me as reader a lot longer to validate the code. Auto has its uses but it shouldn't be used everywhere just to make it easy for the writer. I still like the explicit type better either directly or by using a typedef.
For the operator- I don't dislike this too strongly; but as said above for the conditional expression I do.

The verbose code helps to communicate what the author of the code intended to happen. [Relying] on some (not always well understood) language rules means it's less clear for the reader to understand what the writer intended. Both may have a different understanding of these rules.

But you are still relying on these rules through common_type AFAICT.

Yes but as said above it makes it clear that common_type is intended to be used. (I agree common_type has it's complexity too.)

Mordante: > > It helps since the rules of the type of the conditional expression are not simple. I had to…

var-constUnsubmitted

Done

FWIW, I don't have a strong preference here, but for me, one of the most important and informative aspects of auto is "guaranteed no conversion". This is as relevant for the author as it is for the reader. While it's true that auto can be misused by a writer to avoid thinking about which types are returned (IMO that's a bigger time "saver" than the literal typing which can often be autocompleted), I think it has legitimate uses where it makes the intention clearer.

var-const: FWIW, I don't have a strong preference here, but for me, one of the most important and…

var-constUnsubmitted

Done

Nit: some empty lines will help separate this algorithm into logical blocks and make it more readable. I'd suggest adding blank lines before and after the for loop and before the else branch.

var-const: Nit: some empty lines will help separate this algorithm into logical blocks and make it more…

template <class _InputIterator1, class _InputIterator2>

_LIBCPP_HIDE_FROM_ABI constexpr auto lexicographical_compare_three_way(

huixie90Unsubmitted

Done

}

- template <class InputIterator1, class InputIterator2>

+ template <class _InputIterator1, class _InputIterator2>

_LIBCPP_HIDE_FROM_ABI constexpr auto lexicographical_compare_three_way(

We need to uglify these type names

huixie90: We need to uglify these type names

_InputIterator1 __first1, _InputIterator1 __last1, _InputIterator2 __first2, _InputIterator2 __last2) {

return std::lexicographical_compare_three_way(__first1, __last1, __first2, __last2, compare_three_way());

}

huixie90Unsubmitted

Done

_InputIterator1 __first1, _InputIterator1 __last1, _InputIterator2 __first2, _InputIterator2 __last2) {

- return lexicographical_compare_three_way(__first1, __last1, __first2, __last2, compare_three_way());

+ return std::lexicographical_compare_three_way(__first1, __last1, __first2, __last2, compare_three_way());

}

#endif // _LIBCPP_STD_VER > 17

I believe the spec does not want us to trigger ADL

huixie90: I believe the spec does not want us to trigger ADL

var-constUnsubmitted

Done

Nit: increment __i in the iteration expression? Otherwise, it seems more like a while loop.

var-const: Nit: increment `__i` in the iteration expression? Otherwise, it seems more like a `while` loop.

#endif // _LIBCPP_STD_VER > 17

_LIBCPP_END_NAMESPACE_STD

#endif // _LIBCPP___ALGORITHM_LEXICOGRAPHICAL_COMPARE_THREE_WAY_H

MordanteUnsubmitted

Done

Is forward intended? __comp is passed by value.

Mordante: Is `forward` intended? `__comp` is passed by value.

avogelsgesangAuthorUnsubmitted

Done

good catch! Changed to move

avogelsgesang: good catch! Changed to `move`

JohelEGPUnsubmitted

Done

I think ref(__comp) is the preferred way.

JohelEGP: I think `ref(__comp)` is the preferred way.

var-constUnsubmitted

Done

@JohenEGP I presume you referring to the __comp_ref_type, right? It's a great suggestion.

@avogelsgesang For context, there is an existing pattern in algorithms where the internal implementation of the function wraps the comparator in a typedef that is defined differently in debug and non-debug modes (see include/__algorithm/comp_ref_type.h):

#ifdef _LIBCPP_ENABLE_DEBUG_MODE
template <class _Comp>
using __comp_ref_type = __debug_less<_Comp>;
#else
template <class _Comp>
using __comp_ref_type = _Comp&;
#endif

So in non-debug modes, this resolves to simply a reference. In debug mode, however, it creates a temporary of a helper type __debug_less that additionally checks that the comparator in fact does induce a strict weak order.

I think we want to continue using this pattern going forward (e.g. lexicographical_compare uses it). The only issue is that the existing __debug_less only supports comparators returning a boolean. You would probably need to create a separate __three_way_comp_ref_type typedef and a separate __debug_three_way_comp helper struct (names are subject to change).

Please let me know if you'd like any additional context on this (this can be kinda confusing). Many existing algorithms can be used as examples of this pattern; a caveat is that more recent code uses a (very) slightly different approach:

older code tends to call have _Compare as a template parameter of the internal function, declare the parameter as _Compare __comp, and have the caller specify the template argument explicitly, like std::__foo<_comp_ref_type<_Comp>>(__first, __last, __comp). See e.g. lexicographical_compare;
in newer code, the pattern was to declare the parameter as a forwarding reference _Compare&& __comp and have the caller do a static_cast, like std::__foo(__first, __last, static_cast<__comp_ref_type(__comp)> (see e.g. inplace_merge).

If you prefer to, I'm fine with doing this in a follow up since this patch has been open for a while now.

var-const: @JohenEGP I presume you referring to the `__comp_ref_type`, right? It's a great suggestion.

philnikUnsubmitted

Done

IMO we should make the debug stuff a separate effort. AFAIK we don't test it anywhere and because of that I'm pretty sure it regressed in some places. Instead of adding another untested branch, I'd suggest creating a patch that adds tests to all algorithms and, depending on scope, update the algorithms in follow-up patches or as part of the test-patch.

philnik: IMO we should make the debug stuff a separate effort. AFAIK we don't test it anywhere and…

var-constUnsubmitted

Done

I would agree to that if someone volunteers to do that follow-up in the near future -- in fact, it would be great. Would you or @avogelsgesang be willing to take this on?

var-const: I would agree to that if someone volunteers to do that follow-up in the near future -- in fact…

avogelsgesangAuthorUnsubmitted

Done

added the __three_way_comp_ref_type as requested

avogelsgesang: added the `__three_way_comp_ref_type` as requested

var-constUnsubmitted

Done

Move the iterators? (in the other function as well)

You might additionally static_assert that the given iterators are copyable, to prevent users from accidentally passing move-only iterators (that our implementation would happen to accept due to the optimization but which isn't guaranteed by the Standard). A few of the existing algorithms do that, see e.g. upper_bound.

var-const: Move the iterators? (in the other function as well) You might additionally `static_assert`…

avogelsgesangAuthorUnsubmitted

Done

Is adding the additional calls to std::move standard compliant? https://eel.is/c++draft/alg.three.way defines lexicographical_compare_three_way as not moving the iterators

avogelsgesang: Is adding the additional calls to `std::move` standard compliant? https://eel.is/c++draft/alg.

var-constUnsubmitted

Done

I think we are allowed to do that (as long as the difference is not observable), and it is done in a few places. I can't think of a valid way a user could observe the difference -- since we are allowed to copy or move the iterators in the implementation, they can't rely on move constructor not being called or a copy constructor only being called once.

var-const: I think we are allowed to do that (as long as the difference is not observable), and it is done…

var-constUnsubmitted

Done

Would it make sense to pass __comp by reference? It could, in theory, be e.g. a lambda that captures a lot of state. In fact, we generally commit to avoid copying user-provided functors. However, see also the other comment about __comp_ref_type (which would make this a reference but also have an additional effect in the debug mode).

var-const: Would it make sense to pass `__comp` by reference? It could, in theory, be e.g. a lambda that…

avogelsgesangAuthorUnsubmitted

Done

passing as _Cmp& now

avogelsgesang: passing as `_Cmp&` now

var-constUnsubmitted

Done

As a libc++-specific extension, we mark the return value of lexicographical_compare with _LIBCPP_NODISCARD_EXT. We should do the same here (and add a check to test/libcxx/diagnostics/nodiscard_extensions.verify.cpp).

var-const: As a libc++-specific extension, we mark the return value of `lexicographical_compare` with…

libcxx/include/algorithm

Show First 20 Lines • Show All 1,678 Lines • ▼ Show 20 Lines	template <class InputIterator1, class InputIterator2>
constexpr bool // constexpr in C++20		constexpr bool // constexpr in C++20
lexicographical_compare(InputIterator1 first1, InputIterator1 last1, InputIterator2 first2, InputIterator2 last2);		lexicographical_compare(InputIterator1 first1, InputIterator1 last1, InputIterator2 first2, InputIterator2 last2);

template <class InputIterator1, class InputIterator2, class Compare>		template <class InputIterator1, class InputIterator2, class Compare>
constexpr bool // constexpr in C++20		constexpr bool // constexpr in C++20
lexicographical_compare(InputIterator1 first1, InputIterator1 last1,		lexicographical_compare(InputIterator1 first1, InputIterator1 last1,
InputIterator2 first2, InputIterator2 last2, Compare comp);		InputIterator2 first2, InputIterator2 last2, Compare comp);

		template<class InputIterator1, class InputIterator2, class Cmp>
		constexpr auto
		lexicographical_compare_three_way(InputIterator1 first1, InputIterator1 last1,
		InputIterator2 first2, InputIterator2 last2,
		Cmp comp)
		-> decltype(comp(b1, b2)); // since C++20

		template<class InputIterator1, class InputIterator2>
		constexpr auto
		lexicographical_compare_three_way(InputIterator1 first1, InputIterator1 last1,
		InputIterator2 first2, InputIterator2 last2); // since C++20
		MordanteUnsubmitted Done Reply Inline Actions Can you align the version number comments? Mordante: Can you align the version number comments?

template <class BidirectionalIterator>		template <class BidirectionalIterator>
constexpr bool // constexpr in C++20		constexpr bool // constexpr in C++20
next_permutation(BidirectionalIterator first, BidirectionalIterator last);		next_permutation(BidirectionalIterator first, BidirectionalIterator last);

template <class BidirectionalIterator, class Compare>		template <class BidirectionalIterator, class Compare>
constexpr bool // constexpr in C++20		constexpr bool // constexpr in C++20
next_permutation(BidirectionalIterator first, BidirectionalIterator last, Compare comp);		next_permutation(BidirectionalIterator first, BidirectionalIterator last, Compare comp);

▲ Show 20 Lines • Show All 56 Lines • ▼ Show 20 Lines
#include <__algorithm/is_heap.h>		#include <__algorithm/is_heap.h>
#include <__algorithm/is_heap_until.h>		#include <__algorithm/is_heap_until.h>
#include <__algorithm/is_partitioned.h>		#include <__algorithm/is_partitioned.h>
#include <__algorithm/is_permutation.h>		#include <__algorithm/is_permutation.h>
#include <__algorithm/is_sorted.h>		#include <__algorithm/is_sorted.h>
#include <__algorithm/is_sorted_until.h>		#include <__algorithm/is_sorted_until.h>
#include <__algorithm/iter_swap.h>		#include <__algorithm/iter_swap.h>
#include <__algorithm/lexicographical_compare.h>		#include <__algorithm/lexicographical_compare.h>
		#include <__algorithm/lexicographical_compare_three_way.h>
#include <__algorithm/lower_bound.h>		#include <__algorithm/lower_bound.h>
#include <__algorithm/make_heap.h>		#include <__algorithm/make_heap.h>
#include <__algorithm/max.h>		#include <__algorithm/max.h>
#include <__algorithm/max_element.h>		#include <__algorithm/max_element.h>
#include <__algorithm/merge.h>		#include <__algorithm/merge.h>
#include <__algorithm/min.h>		#include <__algorithm/min.h>
#include <__algorithm/min_element.h>		#include <__algorithm/min_element.h>
#include <__algorithm/min_max_result.h>		#include <__algorithm/min_max_result.h>
▲ Show 20 Lines • Show All 153 Lines • Show Last 20 Lines

libcxx/include/module.modulemap.in

Show First 20 Lines • Show All 282 Lines • ▼ Show 20 Lines	module __algorithm {
module is_sorted { private header "__algorithm/is_sorted.h" }		module is_sorted { private header "__algorithm/is_sorted.h" }
module is_sorted_until { private header "__algorithm/is_sorted_until.h" }		module is_sorted_until { private header "__algorithm/is_sorted_until.h" }
module iter_swap { private header "__algorithm/iter_swap.h" }		module iter_swap { private header "__algorithm/iter_swap.h" }
module iterator_operations {		module iterator_operations {
private header "__algorithm/iterator_operations.h"		private header "__algorithm/iterator_operations.h"
export *		export *
}		}
module lexicographical_compare { private header "__algorithm/lexicographical_compare.h" }		module lexicographical_compare { private header "__algorithm/lexicographical_compare.h" }
		module lexicographical_compare_three_way { private header "__algorithm/lexicographical_compare_three_way.h" }
		MordanteUnsubmitted Done Reply Inline Actions Can you indent the other `{`s to the same level. Mordante: Can you indent the other `{`s to the same level.
		avogelsgesangAuthorUnsubmitted Done Reply Inline Actions I just realized that there is already a pattern how to deal with overly long names, see `uniform_random_bit_generator_adaptor`. Did the same here now... avogelsgesang: I just realized that there is already a pattern how to deal with overly long names, see…
module lower_bound { private header "__algorithm/lower_bound.h" }		module lower_bound { private header "__algorithm/lower_bound.h" }
module make_heap { private header "__algorithm/make_heap.h" }		module make_heap { private header "__algorithm/make_heap.h" }
module make_projected { private header "__algorithm/make_projected.h" }		module make_projected { private header "__algorithm/make_projected.h" }
module max { private header "__algorithm/max.h" }		module max { private header "__algorithm/max.h" }
module max_element { private header "__algorithm/max_element.h" }		module max_element { private header "__algorithm/max_element.h" }
module merge { private header "__algorithm/merge.h" }		module merge { private header "__algorithm/merge.h" }
module min { private header "__algorithm/min.h" }		module min { private header "__algorithm/min.h" }
module min_element { private header "__algorithm/min_element.h" }		module min_element { private header "__algorithm/min_element.h" }
▲ Show 20 Lines • Show All 1,118 Lines • Show Last 20 Lines

libcxx/test/libcxx/private_headers.verify.cpp

	Show First 20 Lines • Show All 74 Lines • ▼ Show 20 Lines
	#include <__algorithm/is_heap_until.h> // expected-error@: {{use of private header from outside its module: '__algorithm/is_heap_until.h'}}			#include <__algorithm/is_heap_until.h> // expected-error@: {{use of private header from outside its module: '__algorithm/is_heap_until.h'}}
	#include <__algorithm/is_partitioned.h> // expected-error@: {{use of private header from outside its module: '__algorithm/is_partitioned.h'}}			#include <__algorithm/is_partitioned.h> // expected-error@: {{use of private header from outside its module: '__algorithm/is_partitioned.h'}}
	#include <__algorithm/is_permutation.h> // expected-error@: {{use of private header from outside its module: '__algorithm/is_permutation.h'}}			#include <__algorithm/is_permutation.h> // expected-error@: {{use of private header from outside its module: '__algorithm/is_permutation.h'}}
	#include <__algorithm/is_sorted.h> // expected-error@: {{use of private header from outside its module: '__algorithm/is_sorted.h'}}			#include <__algorithm/is_sorted.h> // expected-error@: {{use of private header from outside its module: '__algorithm/is_sorted.h'}}
	#include <__algorithm/is_sorted_until.h> // expected-error@: {{use of private header from outside its module: '__algorithm/is_sorted_until.h'}}			#include <__algorithm/is_sorted_until.h> // expected-error@: {{use of private header from outside its module: '__algorithm/is_sorted_until.h'}}
	#include <__algorithm/iter_swap.h> // expected-error@: {{use of private header from outside its module: '__algorithm/iter_swap.h'}}			#include <__algorithm/iter_swap.h> // expected-error@: {{use of private header from outside its module: '__algorithm/iter_swap.h'}}
	#include <__algorithm/iterator_operations.h> // expected-error@: {{use of private header from outside its module: '__algorithm/iterator_operations.h'}}			#include <__algorithm/iterator_operations.h> // expected-error@: {{use of private header from outside its module: '__algorithm/iterator_operations.h'}}
	#include <__algorithm/lexicographical_compare.h> // expected-error@: {{use of private header from outside its module: '__algorithm/lexicographical_compare.h'}}			#include <__algorithm/lexicographical_compare.h> // expected-error@: {{use of private header from outside its module: '__algorithm/lexicographical_compare.h'}}
				#include <__algorithm/lexicographical_compare_three_way.h> // expected-error@: {{use of private header from outside its module: '__algorithm/lexicographical_compare_three_way.h'}}
	#include <__algorithm/lower_bound.h> // expected-error@: {{use of private header from outside its module: '__algorithm/lower_bound.h'}}			#include <__algorithm/lower_bound.h> // expected-error@: {{use of private header from outside its module: '__algorithm/lower_bound.h'}}
	#include <__algorithm/make_heap.h> // expected-error@: {{use of private header from outside its module: '__algorithm/make_heap.h'}}			#include <__algorithm/make_heap.h> // expected-error@: {{use of private header from outside its module: '__algorithm/make_heap.h'}}
	#include <__algorithm/make_projected.h> // expected-error@: {{use of private header from outside its module: '__algorithm/make_projected.h'}}			#include <__algorithm/make_projected.h> // expected-error@: {{use of private header from outside its module: '__algorithm/make_projected.h'}}
	#include <__algorithm/max.h> // expected-error@: {{use of private header from outside its module: '__algorithm/max.h'}}			#include <__algorithm/max.h> // expected-error@: {{use of private header from outside its module: '__algorithm/max.h'}}
	#include <__algorithm/max_element.h> // expected-error@: {{use of private header from outside its module: '__algorithm/max_element.h'}}			#include <__algorithm/max_element.h> // expected-error@: {{use of private header from outside its module: '__algorithm/max_element.h'}}
	#include <__algorithm/merge.h> // expected-error@: {{use of private header from outside its module: '__algorithm/merge.h'}}			#include <__algorithm/merge.h> // expected-error@: {{use of private header from outside its module: '__algorithm/merge.h'}}
	#include <__algorithm/min.h> // expected-error@: {{use of private header from outside its module: '__algorithm/min.h'}}			#include <__algorithm/min.h> // expected-error@: {{use of private header from outside its module: '__algorithm/min.h'}}
	#include <__algorithm/min_element.h> // expected-error@: {{use of private header from outside its module: '__algorithm/min_element.h'}}			#include <__algorithm/min_element.h> // expected-error@: {{use of private header from outside its module: '__algorithm/min_element.h'}}
	▲ Show 20 Lines • Show All 579 Lines • Show Last 20 Lines

libcxx/test/std/algorithms/alg.sorting/alg.three.way/lexicographical_compare_three_way.pass.cpp

This file was added.

				//===----------------------------------------------------------------------===//
				//
				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				var-constUnsubmitted Done Reply Inline Actions We have a few tests that check for a certain behavior across a wide range of algorithms; those have to be updated to include `lexicographical_compare_three_way` now: `test/libcxx/diagnostics/nodiscard_extensions.verify.cpp` (potentially); `test/libcxx/algorithms/robust_against_copying_comparators.pass.cpp`; `test/libcxx/algorithms/robust_against_cpp20_hostile_iterators.compile.pass.cpp`; `test/std/algorithms/robust_re_difference_type.compile.pass.cpp`; `test/std/algorithms/robust_against_adl.compile.pass.cpp`. Please let me know if you need any help with those. var-const: We have a few tests that check for a certain behavior across a wide range of algorithms; those…
				avogelsgesangAuthorUnsubmitted Done Reply Inline Actions I added `lexicographical_compare_three_way` to all of them, except for `robust_re_difference_type.compile.pass.cpp`. Afaict this test case relies on undefined behavior. The `difference_type` must be an integer type (https://eel.is/c++draft/iterator.iterators#2.2), but this test case violates that requirement. As requested by @philnik I added a `static_assert` to `lexicographical_compare_three_way` which guards against non-integer different_types. Hence, adding `lexicographical_compare_three_way` triggers this static_assert. I see two ways forward: Not adding `lexicographical_compare_three_way` to `robust_re_difference_type.compile.pass.cpp` or removing the `static_assert`. Which one do you prefer? avogelsgesang: I added `lexicographical_compare_three_way` to all of them, except for…
				var-constUnsubmitted Done Reply Inline Actions Hmm, it's a great question. I'm not sure why this wasn't just `static_assert`ed before. I'll check -- in the meantime, I think it's best to leave the `static_assert` in place and add a comment to the test file to explain that `lexicographical_compare_three_way` is omitted intentionally. var-const: Hmm, it's a great question. I'm not sure why this wasn't just `static_assert`ed before. I'll…
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
				//
				//===----------------------------------------------------------------------===//
				// UNSUPPORTED: c++03, c++11, c++14, c++17

				// <algorithm>

				// template<class InputIterator1, class InputIterator2, class Cmp>
				// constexpr auto
				// lexicographical_compare_three_way(InputIterator1 first1, InputIterator1 last1,
				// InputIterator2 first2, InputIterator2 last2)

				#include <array>
				#include <algorithm>
				#include <cassert>
				#include <compare>
				#include <concepts>

				#include "test_macros.h"
				#include "test_iterators.h"

				using std::array;

				// A struct for which only weak comparisons are defined
				MordanteUnsubmitted Done Reply Inline Actions Please use the qualified name in the tests. Mordante: Please use the qualified name in the tests.
				avogelsgesangAuthorUnsubmitted Done Reply Inline Actions any particular reason? Would it be fine if I used a function-local `using std::array`? avogelsgesang: any particular reason? Would it be fine if I used a function-local `using std::array`?
				MordanteUnsubmitted Done Reply Inline Actions To make it clear which array is used. Sometimes we use similar names as standard names. In general we don't use using globally, sometimes in a function. There usually the nested namespace being tested like `std::chrono` or the `literal`s namespaces. Mordante: To make it clear which array is used. Sometimes we use similar names as standard names. In…
				struct WeakInt {
				int value;
				var-constUnsubmitted Done Reply Inline Actions Nit: `s/auto/decltype(auto)/`. That way we always check the _exact_ returned type, without potentially stripping away references. While it's very unlikely that this algorithm could plausibly return a reference, I think the main advantage of using `decltype(auto)` is just not having to think about it and have it guaranteed to always catch the exact type. var-const: Nit: `s/auto/decltype(auto)/`. That way we always check the _exact_ returned type, without…
				friend std::weak_ordering operator<=>(WeakInt, WeakInt) = default;
				};

				// A struct for which only partial comparisons are defined
				huixie90Unsubmitted Done Reply Inline Actions wrong comments? huixie90: wrong comments?
				MordanteUnsubmitted Done Reply Inline Actions I don't mind this way of testing, but since the function is expressed in terms of the version with a `compare_three_way` it would have been possible to test this version against that version. Something along the lines of: auto expected = std::lexicographical_compare_three_way(Iter1{a.begin()}, Iter1{a.end()}, Iter2{b.begin()}, Iter2{b.end()}, std::compare_three_way()); std::same_as<decltype(expected)> auto result = std::lexicographical_compare_three_way(Iter1{a.begin()}, Iter1{a.end()}, Iter2{b.begin()}, Iter2{b.end()}); assert(expected == result); That way we might even remove some tests here and relay on the other test to provide the coverage. Mordante: I don't mind this way of testing, but since the function is expressed in terms of the version…
				avogelsgesangAuthorUnsubmitted Done Reply Inline Actions not sure how much that would actually give us. Instead of test_lexicographical_compare<Iter1, Iter2>(array{0, 1}, array{0, 2}, std::strong_ordering::less); I could write test_lexicographical_compare<Iter1, Iter2>(array{0, 1}, array{0, 2}); but I would still have to enumerate all the different input arrays, iterator types etc. Given this would only save few lines, I prefer to keep the test cases as they currently are. avogelsgesang: not sure how much that would actually give us. Instead of ```…
				struct PartialInt {
				int value;
				friend std::partial_ordering operator<=>(PartialInt, PartialInt) = default;
				};

				template <typename Iter1, typename Iter2, typename C1, typename C2, typename Order>
				[[nodiscard]] constexpr bool test_lexicographical_compare(const C1& a, const C2& b, Order expected) {
				std::same_as<Order> auto result =
				std::lexicographical_compare_three_way(Iter1{a.begin()}, Iter1{a.end()}, Iter2{b.begin()}, Iter2{b.end()});
				return expected == result;
				huixie90Unsubmitted Done Reply Inline Actions optional: In other C++20 algorithm tests, we usually do std::same_as<ExpectedType> result = std::foo_bar_algorithm(...); But feel free to ignore this. huixie90: optional: In other C++20 algorithm tests, we usually do ``` std::same_as<ExpectedType> result =…
				avogelsgesangAuthorUnsubmitted Done Reply Inline Actions Nice technique! I didn't know this application of `same_as` yet avogelsgesang: Nice technique! I didn't know this application of `same_as` yet
				}
				huixie90Unsubmitted Done Reply Inline Actions extra nit: In other tests, we usually just `assert(expected == result)` here to save writing all the `assert`s on the caller site. huixie90: extra nit: In other tests, we usually just `assert(expected == result)` here to save writing…
				avogelsgesangAuthorUnsubmitted Done Reply Inline Actions I modelled this helper function here after the `test` functions from `test_comparisons.h`. Do you want me to also update the helper functions from `test_comparisons.h` (in a separate commit), deviate from the pattern from `test_comparisons.h` here, or keep everything as currently proposed in this review? avogelsgesang:* I modelled this helper function here after the `test*` functions from `test_comparisons.h`. Do…

				template <typename Iter1, typename Iter2>
				constexpr void test_iterator_types() {
				// Both inputs empty
				huixie90Unsubmitted Done Reply Inline Actions please also add tests for cases where either range is empty or both of them are empty huixie90: please also add tests for cases where either range is empty or both of them are empty
				huixie90Unsubmitted Done Reply Inline Actions Could you please also add a test to test "Complexity: At most N applications of comp." huixie90: Could you please also add a test to test "Complexity: At most N applications of comp."
				assert((test_lexicographical_compare<Iter1, Iter2>(array<int, 0>{}, array<int, 0>{}, std::strong_ordering::equal)));
				// Left input empty
				assert((test_lexicographical_compare<Iter1, Iter2>(array<int, 0>{}, array{0, 1}, std::strong_ordering::less)));
				// Right input empty
				huixie90Unsubmitted Done Reply Inline Actions the comment is a bit ambiguous. which range's 2nd element is "greater"?. Since the result is "less". the comment "greater" is slightly counter-intuitive huixie90: the comment is a bit ambiguous. which range's 2nd element is "greater"?. Since the result is…
				assert((test_lexicographical_compare<Iter1, Iter2>(array{0, 1}, array<int, 0>{}, std::strong_ordering::greater)));

				// Identical arrays
				assert((test_lexicographical_compare<Iter1, Iter2>(array{0, 1}, array{0, 1}, std::strong_ordering::equal)));
				// "Less" on 2nd element
				assert((test_lexicographical_compare<Iter1, Iter2>(array{0, 1}, array{0, 2}, std::strong_ordering::less)));
				// "Greater" on 2nd element
				assert((test_lexicographical_compare<Iter1, Iter2>(array{0, 2}, array{0, 1}, std::strong_ordering::greater)));
				// "Greater" on 2nd element, but "less" on first entry
				assert((test_lexicographical_compare<Iter1, Iter2>(array{0, 2}, array{1, 1}, std::strong_ordering::less)));
				huixie90Unsubmitted Done Reply Inline Actions The spec has explicitly specifies the return type `-> decltype(__comp(__first1, __first2))` and this has a SFINAE effect. It would be good to test the SFINAE effect as well (if __comp is not callbale then the function should be SFINAEed out) huixie90: The spec has explicitly specifies the return type ` -> decltype(__comp(__first1, __first2))`…
				avogelsgesangAuthorUnsubmitted Done Reply Inline Actions Sorry, but I don't quite understand this point. How should I check for SFINAE here? avogelsgesang: Sorry, but I don't quite understand this point. How should I check for SFINAE here?
				mumbleskatesUnsubmitted Done Reply Inline Actions in C++20 (which you have the luxury of relying upon since this function is new in C++20) you can just make a concept and then `static_check` it. mumbleskates: in C++20 (which you have the luxury of relying upon since this function is new in C++20) you…
				avogelsgesangAuthorUnsubmitted Done Reply Inline Actions still can't follow. Did you mean `static_assert`? Or is `static_check` something different? How would I use a concept to `static_assert` that `std::lexicographical_compare_three_way` is excluded from a function overload set due to SFINAE? avogelsgesang: still can't follow. Did you mean `static_assert`? Or is `static_check` something different? How…
				philnikUnsubmitted Done Reply Inline Actions You can do something like template <class T> concept HasLexicographicalCompare = requires (T whatever) { std::lexicographical_compare_three_way(whatever); }; static_assert(hasLexicographicalCompare<CorrectType>) static_assert(!HasLexicographicalCompare<IncorrectType>); philnik: You can do something like ``` template <class T> concept HasLexicographicalCompare = requires…
				avogelsgesangAuthorUnsubmitted Done Reply Inline Actions Thanks for that example! I was able to use this for `lexicographical_compare_three_way_comp.pass.cpp`. However, I am still struggling to understand what you want to me to change in this file, i.e. `lexicographical_compare_three_way.pass.cpp` where we don't pass a comparator. What I could come up with so far is: template <class T> concept has_lexicographical_compare = requires (T t) { std::lexicographical_compare_three_way(std::declval<T>(), std::declval<T>(), std::declval<T>(), std::declval<T>()); }; // Test that `std::lexicographical_compare_three_way` accepts valid types static_assert(has_lexicographical_compare<int>); static_assert(has_lexicographical_compare<WeakInt>); static_assert(has_lexicographical_compare<PartialInt>); // Test that `std::lexicographical_compare_three_way` rejects invalid types static_assert(!has_lexicographical_compare<UnexpectedlyComparableInt>); static_assert(!has_lexicographical_compare<UncomparableInt>); However that led to the error In file included from /home/tsi/avogelsgesang/Documents/llvm-project/libcxx/test/std/algorithms/alg.sorting/alg.three.way/lexicographical_compare_three_way.pass.cpp:18: In file included from /home/tsi/avogelsgesang/Documents/llvm-project/build/include/c++/v1/algorithm:1771: /home/tsi/avogelsgesang/Documents/llvm-project/build/include/c++/v1/__algorithm/lexicographical_compare_three_way.h:71:10: error: no matching function for call to 'lexicographical_compare_three_way' return std::lexicographical_compare_three_way(__first1, __last1, __first2, __last2, compare_three_way()); ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ avogelsgesang: Thanks for that example! I was able to use this for `lexicographical_compare_three_way_comp.
				philnikUnsubmitted Done Reply Inline Actions Yes, that's a bug in your code :P. You have to add `-> decltype(std::lexicographical_compare_three_way(__first1, __last1, __first2, __last2, compare_three_way()))` to the overload. Also, the `T t` isn't just there for good looks ;-). In your case you can rewrite it to template <class T, class U> concept has_lexicographical_compare = requires (T first1, T last1, U first2, U last2) { std::lexicographical_compare_three_way(first1, last1, first2, last2); }; and then also check with different iterator types. philnik: Yes, that's a bug in your code :P. You have to add `-> decltype(std…
				avogelsgesangAuthorUnsubmitted Done Reply Inline Actions You have to add `-> decltype(std::lexicographical_compare_three_way(__first1, __last1, __first2, __last2, compare_three_way()))` to the overload Afaict, this explicit return type is not present in the standard (https://eel.is/c++draft/alg.sorting#alg.three.way). Or am I looking in the wrong place? avogelsgesang: > You have to add `-> decltype(std::lexicographical_compare_three_way(__first1, __last1…
				huixie90Unsubmitted Done Reply Inline Actions @philnik The spec only explicitly specifies the return type for the comparator overload and not the other overload. So only the comparator overload has SFINAE and for the other overload implementations are free to hard error in the function body. So re the testing, we only need to test the SFINAE for comparator overload. for the none comparator overload, it is expected that the test won't work. huixie90: @philnik The spec only explicitly specifies the return type for the comparator overload and not…
				philnikUnsubmitted Done Reply Inline Actions You are right that it doesn't literally say that it should be SFINAEd away, but it has these two sneaky words `Equivalent to`, which has a lot of very subtle implications. See http://eel.is/c++draft/description#structure.specifications-4 for details. philnik: You are right that it doesn't literally say that it should be SFINAEd away, but it has these…
				avogelsgesangAuthorUnsubmitted Done Reply Inline Actions it has these two sneaky words Equivalent to, which has a lot of very subtle implications I am not sure how to interpret that paragraph, afaict it does not directly talk about SFINAE. Did both Microsoft (https://github.com/microsoft/STL/blob/ebbc9908ad24dc5dbbdbce820432e79f166ec547/stl/inc/xutility#L4982) and libstdc++ (https://github.com/gcc-mirror/gcc/blob/16e2427f50c208dfe07d07f18009969502c25dc8/libstdc%2B%2B-v3/include/bits/stl_algobase.h#L1866) get this wrong? avogelsgesang: > it has these two sneaky words Equivalent to, which has a lot of very subtle implications I…
				philnikUnsubmitted Done Reply Inline Actions Ah no, Ok. I misremembered. It's only when it part of a `Constraints:` clause that you have to bring over the SFINAE fun. As I said, sneaky words :P. philnik: Ah no, Ok. I misremembered. It's only when it part of a `Constraints:` clause that you have to…
				mumbleskatesUnsubmitted Done Reply Inline Actions i'm sorry yes, `static_assert`. my bad :) mumbleskates: i'm sorry yes, `static_assert`. my bad :)
				// Identical elements, but longer
				assert((test_lexicographical_compare<Iter1, Iter2>(array{0, 1}, array{0, 1, 2}, std::strong_ordering::less)));
				// Identical elements, but shorter
				assert((test_lexicographical_compare<Iter1, Iter2>(array{0, 1, 2}, array{0, 1}, std::strong_ordering::greater)));
				}

				constexpr bool test() {
				// Check with various iterator types
				test_iterator_types<const int, const int>();
				huixie90Unsubmitted Done Reply Inline Actions Could you please at least add a test for the weak_ordering where one range is shorter than the other and all the elements are equal. IIUC, in this case, the result is going to convert from a strong_ordering to a weak_ordering. huixie90: Could you please at least add a test for the weak_ordering where one range is shorter than the…
				test_iterator_types<const int, forward_iterator<const int>>();
				test_iterator_types<cpp17_input_iterator<const int>, three_way_contiguous_iterator<const int>>();
				test_iterator_types<bidirectional_iterator<const int>, random_access_iterator<const int>>();
				test_iterator_types<contiguous_iterator<const int>, cpp20_random_access_iterator<const int>>();
				philnikUnsubmitted Done Reply Inline Actions We normally assert inside the test function. That lets us see the checks directly, avoids forgetting an `assert()` elsewhere and removes a lot of noise. philnik: We normally assert inside the test function. That lets us see the checks directly, avoids…

				// Check for other comparison categories
				assert((test_lexicographical_compare<const WeakInt, const WeakInt>(
				array<WeakInt, 2>{{{0}, {1}}}, array<WeakInt, 2>{{{1}, {1}}}, std::weak_ordering::less)));
				assert((test_lexicographical_compare<const PartialInt, const PartialInt>(
				array<PartialInt, 2>{{{0}, {1}}}, array<PartialInt, 2>{{{1}, {1}}}, std::partial_ordering::less)));

				// Check for other comparison categories with arrays of different sizes
				assert((test_lexicographical_compare<const WeakInt, const WeakInt>(
				array<WeakInt, 2>{{{0}}}, array<WeakInt, 2>{{{0}, {1}}}, std::weak_ordering::less)));
				assert((test_lexicographical_compare<const PartialInt, const PartialInt>(
				array<PartialInt, 2>{{{0}}}, array<PartialInt, 2>{{{0}, {1}}}, std::partial_ordering::less)));

				return true;
				}

				int main(int, char**) {
				test();
				static_assert(test());

				return 0;
				}
				philnikUnsubmitted Done Reply Inline Actions Please check the complete cartesian product of `cpp17_input_iterator`, `forward_iterator`, `bidirectional_iterator`, `random_access_iterator`, `contiguous_iterator`, `const int` and `int`. Also, `cpp20_random_access_iterator` is just a C++17 input iterator. philnik: Please check the complete cartesian product of `cpp17_input_iterator`, `forward_iterator`…
				avogelsgesangAuthorUnsubmitted Done Reply Inline Actions done, except for `int`. `int` does not work, because the currently used `array`s are const. Is it fine not to test `int`? Which additional test coverage would `int` give us over `const int`? avogelsgesang:* done, except for `int`. `int` does not work, because the currently used `array`s are const.
				philnikUnsubmitted Done Reply Inline Actions You can just take the arrays by value instead of const reference. `int` checks that the algorithms doesn't fail for mutable types. It's not expected that it will happen, but it's also a very common scenario and quite easy to add tests for. philnik:* You can just take the arrays by value instead of const reference. `int*` checks that the…
				avogelsgesangAuthorUnsubmitted Done Reply Inline Actions ok, also added tests for `int` avogelsgesang:* ok, also added tests for `int*`
				huixie90Unsubmitted Done Reply Inline Actions in other tests, we usually do template <class It1, class Int2> void test(); template <class It2> void testAllIterator1(){ test<InputIteartor, It2>(); test<ForwardIteartor, It2>(); // ... test<ContiguousIteartor, It2>(); } void testAllIt1It2(){ testAllIterator1<InputIteartor>(); testAllIterator1<ForwardIteartor>(); // ... testAllIterator1<ContiguousIteartor>(); } This saves you manually writing down the all the cartesian products. Some people also argue that we only need to test the weakest iterator and the strongest iterator to save the combinatorial code bloat. It also has a point but I am fine with testing all of the combinations. huixie90: in other tests, we usually do ``` template <class It1, class Int2> void test(); template…

libcxx/test/std/algorithms/alg.sorting/alg.three.way/lexicographical_compare_three_way_comp.pass.cpp

This file was added.

				//===----------------------------------------------------------------------===//
				//
				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
				//
				//===----------------------------------------------------------------------===//
				// UNSUPPORTED: c++03, c++11, c++14, c++17

				// <algorithm>

				// template<class InputIterator1, class InputIterator2, class Cmp>
				// constexpr auto
				// lexicographical_compare_three_way(InputIterator1 first1, InputIterator1 last1,
				// InputIterator2 first2, InputIterator2 last2,
				// Cmp comp)
				// -> decltype(comp(b1, b2));

				#include <array>
				#include <algorithm>
				#include <cassert>
				#include <compare>
				#include <concepts>

				#include "test_macros.h"
				#include "test_iterators.h"

				using std::array;

				constexpr std::strong_ordering compare_last_digit(int a, int b) { return (a % 10) <=> (b % 10); }
				constexpr std::weak_ordering compare_last_digit_weak(int a, int b) { return (a % 10) <=> (b % 10); }
				constexpr std::partial_ordering compare_last_digit_partial(int a, int b) { return (a % 10) <=> (b % 10); }
				var-constUnsubmitted Done Reply Inline Actions Question: what is the purpose of comparing just the last digit instead of just the two given numbers? It makes the implementation slightly more complicated (and the variable names longer), but all the inputs are single-digit anyway. var-const: Question: what is the purpose of comparing just the last digit instead of just the two given…
				avogelsgesangAuthorUnsubmitted Done Reply Inline Actions the idea is to test a "non-default" comparator. If I just used normal integer comparisons here, the test cases wouldn't catch it if `lexicographical_compare_three_way` just ignored the comparator and used the standard comparator instead avogelsgesang: the idea is to test a "non-default" comparator. If I just used normal integer comparisons here…

				template <typename Iter1, typename Iter2, typename C1, typename C2, typename Order, typename Comparator>
				[[nodiscard]] constexpr bool test_lexicographical_compare(const C1& a, const C2& b, Order expected, Comparator comp) {
				var-constUnsubmitted Done Reply Inline Actions Is this branch ever taken? var-const: Is this branch ever taken?
				avogelsgesangAuthorUnsubmitted Done Reply Inline Actions see "Check for a `partial_ordering::unordered` result" inside `test_comparison_categories` avogelsgesang: see "Check for a `partial_ordering::unordered` result" inside `test_comparison_categories`
				std::same_as<Order> auto result =
				std::lexicographical_compare_three_way(Iter1{a.begin()}, Iter1{a.end()}, Iter2{b.begin()}, Iter2{b.end()}, comp);
				return expected == result;
				}

				[[nodiscard]] constexpr bool test_lexicographical_compare(const auto& a, const auto& b, auto expected) {
				ldionneUnsubmitted Done Reply Inline Actions We should also be testing with iterator archetypes like you've done for the non-`comp` test. Is there a reason why it doesn't apply here? ldionne: We should also be testing with iterator archetypes like you've done for the non-`comp` test. Is…
				avogelsgesangAuthorUnsubmitted Done Reply Inline Actions No real reason. I considered this duplicated test coverage given that `lexicographical_compare_three_way` with a comparator just delegates to `lexicographical_compare_three_way` without a comparator. Added the test coverage now, though avogelsgesang: No real reason. I considered this duplicated test coverage given that…
				mumbleskatesUnsubmitted Done Reply Inline Actions i believe we strive to test such that we cover the surface of the apis, not merely our own implementation. the test suite is meant to be usable against other standard libraries as well, which may work differently. (this also includes the future of our own implementation.) mumbleskates: i believe we strive to test such that we cover the surface of the apis, not merely our own…
				auto result = std::lexicographical_compare_three_way(a.begin(), a.end(), b.begin(), b.end(), compare_last_digit);
				ASSERT_SAME_TYPE(decltype(result), decltype(expected));
				ldionneUnsubmitted Done Reply Inline Actions We should also be testing with more than just `strong_ordering`. The same applies to the non-`comp` test IIUC. ldionne: We should also be testing with more than just `strong_ordering`. The same applies to the non…
				avogelsgesangAuthorUnsubmitted Done Reply Inline Actions We are now also testing `weak_ordering` and `partial_ordering` avogelsgesang: We are now also testing `weak_ordering` and `partial_ordering`
				return expected == result;
				}

				template <typename Iter1, typename Iter2>
				constexpr void test_iterator_types() {
				// Both inputs empty
				assert((test_lexicographical_compare<Iter1, Iter2>(
				array<int, 0>{}, array<int, 0>{}, std::strong_ordering::equal, compare_last_digit)));
				// Left input empty
				assert((test_lexicographical_compare<Iter1, Iter2>(
				array<int, 0>{}, array{0, 1}, std::strong_ordering::less, compare_last_digit)));
				// Right input empty
				assert((test_lexicographical_compare<Iter1, Iter2>(
				array{0, 1}, array<int, 0>{}, std::strong_ordering::greater, compare_last_digit)));

				var-constUnsubmitted Done Reply Inline Actions Nit: I'd put `expected` last so that all the algorithm inputs are next to each other. var-const: Nit: I'd put `expected` last so that all the algorithm inputs are next to each other.
				// Identical arrays
				assert((test_lexicographical_compare<Iter1, Iter2>(
				array{0, 1}, array{0, 1}, std::strong_ordering::equal, compare_last_digit)));
				// "Less" on 2nd element
				assert((test_lexicographical_compare<Iter1, Iter2>(
				array{0, 1}, array{0, 2}, std::strong_ordering::less, compare_last_digit)));
				// "Greater" on 2nd element
				assert((test_lexicographical_compare<Iter1, Iter2>(
				array{0, 2}, array{0, 1}, std::strong_ordering::greater, compare_last_digit)));
				// "Greater" on 2nd element, but "less" on first entry
				var-constUnsubmitted Done Reply Inline Actions Optional: create a local constant with a shorter name for the comparator to cut down on the boilerplate a little? var-const: Optional: create a local constant with a shorter name for the comparator to cut down on the…
				assert((test_lexicographical_compare<Iter1, Iter2>(
				array{0, 2}, array{1, 1}, std::strong_ordering::less, compare_last_digit)));
				// Identical elements, but longer
				assert((test_lexicographical_compare<Iter1, Iter2>(
				huixie90Unsubmitted Done Reply Inline Actions Please add a test where the input iterator only models c++17's `input_iterator` (I think you can use `cpp17_input_iterator`. Since the InputIterator is the minimum requirement huixie90: Please add a test where the input iterator only models c++17's `input_iterator` (I think you…
				array{0, 1}, array{0, 1, 2}, std::strong_ordering::less, compare_last_digit)));
				// Identical elements, but shorter
				assert((test_lexicographical_compare<Iter1, Iter2>(
				array{0, 1, 2}, array{0, 1}, std::strong_ordering::greater, compare_last_digit)));
				}

				constexpr bool test() {
				// Check with various iterator types
				test_iterator_types<const int, const int>();
				test_iterator_types<const int, forward_iterator<const int>>();
				test_iterator_types<cpp17_input_iterator<const int>, three_way_contiguous_iterator<const int>>();
				test_iterator_types<bidirectional_iterator<const int>, random_access_iterator<const int>>();
				test_iterator_types<contiguous_iterator<const int>, cpp20_random_access_iterator<const int>>();

				// Check for other comparison categories
				assert((test_lexicographical_compare<const int, const int>(
				array{0, 1}, array{10, 11}, std::weak_ordering::equivalent, compare_last_digit_weak)));
				assert((test_lexicographical_compare<const int, const int>(
				array{0, 1}, array{20, 11}, std::partial_ordering::equivalent, compare_last_digit_partial)));

				// Check for other comparison categories with arrays of different sizes
				assert((test_lexicographical_compare<const int, const int>(
				array{0}, array{0, 1}, std::weak_ordering::less, compare_last_digit_weak)));
				assert((test_lexicographical_compare<const int, const int>(
				array{0}, array{0, 1}, std::partial_ordering::less, compare_last_digit_partial)));

				// Test for "Complexity: At most N applications of comp."
				int compare_invocation_count = 0;
				auto compare_last_digit_counting = [&](int a, int b) -> std::strong_ordering {
				++compare_invocation_count;
				return (a % 10) <=> (b % 10);
				};
				// If one of both ranges is empty, the comparator must not be called at all
				compare_invocation_count = 0;
				assert((test_lexicographical_compare<const int, const int>(
				array{0, 1, 2, 3}, array<int, 0>{}, std::strong_ordering::greater, compare_last_digit_counting)));
				assert(compare_invocation_count == 0);
				// The comparator is invoked only `min(left.size(), right.size())` times
				assert((test_lexicographical_compare<const int, const int>(
				array{0, 1, 2}, array{0, 1, 2, 3}, std::strong_ordering::less, compare_last_digit_counting)));
				assert(compare_invocation_count == 3);

				return true;
				}

				int main(int, char**) {
				test();
				static_assert(test());

				return 0;
				}
				var-constUnsubmitted Done Reply Inline Actions Nit: `s/both/the/`. var-const: Nit: `s/both/the/`.

libcxx/test/std/algorithms/alg.sorting/alg.three.way/lexicographical_compare_three_way_comp.verify.cpp

This file was added.

				//===----------------------------------------------------------------------===//
				//
				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
				//
				//===----------------------------------------------------------------------===//
				// UNSUPPORTED: c++03, c++11, c++14, c++17

				// <algorithm>

				// template<class InputIterator1, class InputIterator2, class Cmp>
				// constexpr auto
				// lexicographical_compare_three_way(InputIterator1 first1, InputIterator1 last1,
				// InputIterator2 first2, InputIterator2 last2,
				// Cmp comp)
				// -> decltype(comp(b1, b2));

				#include <array>
				MordanteUnsubmitted Done Reply Inline Actions Can you also test with iterators with invalid difference types? Mordante: Can you also test with iterators with invalid difference types?
				avogelsgesangAuthorUnsubmitted Done Reply Inline Actions do we already have such a test utility somewhere? I couldn't find anything useful in `test_iterators.h` avogelsgesang: do we already have such a test utility somewhere? I couldn't find anything useful in…
				MordanteUnsubmitted Done Reply Inline Actions I think we don't. `almost_satisfies_types.h` seems to be the better place for such an iterator. Mordante: I think we don't. `almost_satisfies_types.h` seems to be the better place for such an iterator.
				#include <algorithm>
				#include <cassert>
				#include <compare>

				#include "test_macros.h"

				constexpr bool incorrect_comparator(int a, int b) { return a < b; }

				int main(int, char**) {
				std::array a{90, 81};
				MordanteUnsubmitted Done Reply Inline Actions In general we don't use `main` in our verify test, since the code isn't executed. Mordante: In general we don't use `main` in our verify test, since the code isn't executed.
				std::array b{10, 11};
				// expected-error-re@: {{{{(static_assert\|static assertion)}} failed{{.*}}The comparator passed to lexicographical_compare_three_way must return a comparison category type}}}}
				// expected-error@: {{no viable conversion}}
				// expected-error@: {{no viable conversion}}
				// expected-error@: {{no viable conversion}}
				// expected-error-re@: {{conversion function{{.*}}invokes a deleted function}}
				auto result = std::lexicographical_compare_three_way(a.begin(), a.end(), b.begin(), b.end(), incorrect_comparator);
				assert(result == std::strong_ordering::equal);

				return 0;
				}
				var-constUnsubmitted Done Reply Inline Actions Nit: please also check that passing `RandomAccessIteratorBadDifferenceType` iterators for the second range fails as well (i.e., calling `std::lexicographical_compare_three_way(c, d, a, b, std::compare_three_way())`). var-const: Nit: please also check that passing `RandomAccessIteratorBadDifferenceType` iterators for the…

This is an archive of the discontinued LLVM Phabricator instance.

[libc++] Implement `lexicographical_compare_three_way`ClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 452929

libcxx/docs/Status/Cxx2bIssues.csv

libcxx/docs/Status/SpaceshipProjects.csv

libcxx/include/CMakeLists.txt

libcxx/include/__algorithm/lexicographical_compare_three_way.h

libcxx/include/algorithm

libcxx/include/module.modulemap.in

libcxx/test/libcxx/private_headers.verify.cpp

libcxx/test/std/algorithms/alg.sorting/alg.three.way/lexicographical_compare_three_way.pass.cpp

libcxx/test/std/algorithms/alg.sorting/alg.three.way/lexicographical_compare_three_way_comp.pass.cpp

libcxx/test/std/algorithms/alg.sorting/alg.three.way/lexicographical_compare_three_way_comp.verify.cpp

[libc++] Implement `lexicographical_compare_three_way`
ClosedPublic