Download Raw Diff

Details

Reviewers

lebedev.ri
spatel

Commits

rOLDT369707: [MemFunctions] Add microbenchmarks for memory functions.
rL369707: [MemFunctions] Add microbenchmarks for memory functions.

Summary

Memory functions (memcmp, memcpy, ...) are typically recognized by the
compiler and expanded to specific asm patterns when the size is known at
compile time.

This will help catch any regressions in expansions.

Right now we're only testing memcmp (see context in D60318).

Diff Detail

Repository: rL LLVM

Event Timeline

courbet created this revision.Jul 2 2019, 8:51 AM

Herald added a project: Restricted Project. · View Herald TranscriptJul 2 2019, 8:51 AM

Herald added a subscriber: mgorny. · View Herald Transcript

Harbormaster completed remote builds in B34202: Diff 207570.Jul 2 2019, 8:53 AM

Thanks for working on this! I'm not familiar with how the benchmarking framework works, so someone else should definitely have a look.

Does the framework automatically account for and filter out noisy results? I'm guessing that tiny memcmp() will have a lot of run-to-run variation.

In D64082#1566955, @spatel wrote:

Thanks for working on this! I'm not familiar with how the benchmarking framework works, so someone else should definitely have a look.

That could be me, i suppose.
Does not look too wrong to me.

Does the framework automatically account for and filter out noisy results? I'm guessing that tiny memcmp() will have a lot of run-to-run variation.

The for (auto _ : state) loop will run for up to 0.5 sec, so the results should be good;
Plus at the test-suite/lnt level the test suite can be run several times so it's possible ensure that timing changes are meaningful (U Test, e.g.)

MicroBenchmarks/MemFunctions/main.cpp
34–38 ↗	(On Diff #207570)	So what about `q`? It's intentionally left all-zeros? That warrants a comment.
82–94 ↗	(On Diff #207570)	I'd do one or two macro levels here
111–118 ↗	(On Diff #207570)	BENCHMARK_MAIN();

Address review comments.

Thanks Roman.

MicroBenchmarks/MemFunctions/main.cpp
111–118 ↗	(On Diff #207570)	Thanks for the pointer.

In D64082#1566955, @spatel wrote:

Thanks for working on this! I'm not familiar with how the benchmarking framework works, so someone else should definitely have a look.

Does the framework automatically account for and filter out noisy results? I'm guessing that tiny memcmp() will have a lot of run-to-run variation.

The framework will grow number of iterations until measurements stabilize. This is usually sufficient. However it will not do statistical significance testing for you (which is what I've done in the attached PDF just to be sure).

Harbormaster completed remote builds in B34257: Diff 207732.Jul 3 2019, 2:36 AM

courbet added a reviewer: lebedev.ri.Jul 3 2019, 2:36 AM

lebedev.ri added inline comments.Jul 3 2019, 2:48 AM

MicroBenchmarks/MemFunctions/main.cpp
43 ↗	(On Diff #207732)	This may be paranoia, but i'm not sure this is sufficient to guarantee that compiler can't just look into `p`/`q`. I'd suggest adding this here: benchmark::DoNotOptimize(p); benchmark::DoNotOptimize(q); benchmark::ClobberMemory(p); benchmark::ClobberMemory(q); (i see that you do that for `std::vector<char>`'s already, but you have already acquired `_storage.data()`..)

courbet marked an inline comment as done.Jul 3 2019, 3:50 AM

courbet added inline comments.

MicroBenchmarks/MemFunctions/main.cpp
43 ↗	(On Diff #207732)	Sounds reasonable. I've even moved the ClobberMemory inside the call (and verified that benchmark numbers do not change).

Be evel less permissive as to what we allow the compiler to see.

Harbormaster completed remote builds in B34265: Diff 207750.Jul 3 2019, 3:50 AM

lebedev.ri added inline comments.Jul 3 2019, 5:08 AM

MicroBenchmarks/MemFunctions/main.cpp
29 ↗	(On Diff #207750)	Magical constant I'm guessing that by `4096` you limit the maximal size of `p` and `q` buffers, implying that they should fit into L1 cache? Do you want to use the actual L1 size instead? Otherwise, static constexpr size_t kMaxBufSizeBytes = 4096; constexpr size_t kNumElements = kMaxBufSizeBytes / kSize;

Name magical constant.

Harbormaster completed remote builds in B34271: Diff 207762.Jul 3 2019, 5:10 AM

Add comment for buffer size.

Harbormaster completed remote builds in B34272: Diff 207763.Jul 3 2019, 5:12 AM

courbet marked an inline comment as done.Jul 3 2019, 5:13 AM

courbet added inline comments.

MicroBenchmarks/MemFunctions/main.cpp
29 ↗	(On Diff #207750)	It's combination of things, among which caching. But you're right that this warrants a comment. Done.

Looks ok to me from benchmark perspective, but some more thoughts about the benchmark itself..

MicroBenchmarks/MemFunctions/main.cpp
50 ↗	(On Diff #207762)	I think i'm forgetting about some magic. All the predicates (`EqZero`, ...) take a single argument, how does this work if it passes two args?
58–66 ↗	(On Diff #207762)	To be noted, none of these is the actual `memcmp`, i think?
59 ↗	(On Diff #207762)	Does it matter that these take `int` while you always pass `char`?
43 ↗	(On Diff #207732)	Nice.

This revision is now accepted and ready to land.Jul 3 2019, 5:19 AM

Clarify top comment.

MicroBenchmarks/MemFunctions/main.cpp
50 ↗	(On Diff #207762)	I think you missed that the result of calling memcmp is passed to pred. `Pred` just defines which of `==`, `<` or `>` we're benching. I updated the bench comment to make that clearer.
58–66 ↗	(On Diff #207762)	See my comment above.
59 ↗	(On Diff #207762)	See my comment above.

Harbormaster completed remote builds in B34273: Diff 207765.Jul 3 2019, 5:29 AM

lebedev.ri marked an inline comment as done.Jul 3 2019, 5:33 AM

lebedev.ri added inline comments.

MicroBenchmarks/MemFunctions/main.cpp
50 ↗	(On Diff #207762)	Oh i see, that explains it, thanks!

lg too

Thanks!

Closed by commit rL369707: [MemFunctions] Add microbenchmarks for memory functions. (authored by courbet). · Explain WhyAug 22 2019, 2:24 PM

This revision was automatically updated to reflect the committed changes.

This is an archive of the discontinued LLVM Phabricator instance.

[MemFunctions] Add microbenchmarks for memory functions.
ClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 216718

test-suite/trunk/MicroBenchmarks/CMakeLists.txt

test-suite/trunk/MicroBenchmarks/MemFunctions/CMakeLists.txt

test-suite/trunk/MicroBenchmarks/MemFunctions/main.cpp

This is an archive of the discontinued LLVM Phabricator instance.

[MemFunctions] Add microbenchmarks for memory functions.ClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 216718

test-suite/trunk/MicroBenchmarks/CMakeLists.txt

test-suite/trunk/MicroBenchmarks/MemFunctions/CMakeLists.txt

test-suite/trunk/MicroBenchmarks/MemFunctions/main.cpp

[MemFunctions] Add microbenchmarks for memory functions.
ClosedPublic