Details

Reviewers

fhahn
paquette
t.p.northover

Commits

rT3b44b6bdd3e8: [MicroBenchmarks] Add benchmarks to check runtime of truncate or zero-extend…

Summary

Add benchmarks to check runtime of truncate or zero-extend vector operations in AArch64.
This patch adds an initial set of benchmarks to check runtime of vectorized truncate or zero-extend operations in a loop for different vector types over different vector widths.
The goal of this initial benchmark is to check the impact of D133495, D135229 and D120571.

Diff Detail

Repository: rT test-suite

Event Timeline

nilanjana_basu created this revision. Oct 19 2022, 10:59 AM

Herald added a project: Restricted Project. Oct 19 2022, 10:59 AM

Herald added a subscriber: kristof.beyls.

nilanjana_basu requested review of this revision. Oct 19 2022, 10:59 AM

Herald added a subscriber: pcwang-thead. Oct 19 2022, 10:59 AM

Removed redundant code

Harbormaster completed remote builds in B193048: Diff 468978. Oct 19 2022, 11:02 AM

Thanks for the patch! I think it would be good to move the benchmark to a different file, as it is unrelated to measuring runtime check performance.

MicroBenchmarks/LoopVectorization/RuntimeChecks.cpp
Line 7: It should be sufficient to use a much larger iteration count like 10000; the main benchmark loop will make sure the function is run long enough to collect stable data.
Line 135: I don't think this is doing what you want at the moment. Instead of truncating to `i8`, it is extending from `i8`. `B` and `A` should probably be flipped?
Line 144: It looks like this is missing the main benchmark loop that Google Benchmark requires: `for (auto _ : state) { ... }`
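
To make the direction issue raised in the line 135 comment concrete, here is a minimal sketch of the two conversions. It is not the code from the patch: the function names `truncToI8` and `zextFromI8`, the choice of `uint32_t` as the wide type, and the use of `std::vector` are assumptions made for illustration. In the benchmark itself, a kernel like this would presumably be called inside the `for (auto _ : state)` loop mentioned in the line 144 comment.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical kernel names; the patch's actual functions are not shown in
// this thread.

// Truncate: read wide elements from Src, write narrow elements to Dst.
void truncToI8(const std::vector<uint32_t> &Src, std::vector<uint8_t> &Dst) {
  assert(Dst.size() >= Src.size() && "destination too small");
  for (std::size_t I = 0; I < Src.size(); ++I)
    Dst[I] = static_cast<uint8_t>(Src[I]); // keeps only the low 8 bits
}

// Zero-extend: read narrow elements from Src, write wide elements to Dst.
// Swapping the source and destination types is what turns one operation
// into the other.
void zextFromI8(const std::vector<uint8_t> &Src, std::vector<uint32_t> &Dst) {
  assert(Dst.size() >= Src.size() && "destination too small");
  for (std::size_t I = 0; I < Src.size(); ++I)
    Dst[I] = Src[I]; // unsigned source, so the upper bits become zero
}
```

With the source and destination swapped by mistake, a loop intended as a truncate compiles to a zero-extend, so the benchmark would measure the wrong operation.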

This revision now requires changes to proceed. Oct 20 2022, 12:00 PM

Made a separate file for testing truncate or zero-extend vector operations. Added truncate tests for different data types, with different vectorization width settings.

Harbormaster completed remote builds in B193408: Diff 469452. Oct 20 2022, 7:25 PM

nilanjana_basu marked an inline comment as done. Oct 20 2022, 7:25 PM

Fixed a mistake where the same test was being run twice

Harbormaster completed remote builds in B193695: Diff 469836. Oct 21 2022, 5:58 PM

Extended it to be generic enough for both truncate & zero-extend vector operations

Harbormaster completed remote builds in B193702: Diff 469844. Oct 21 2022, 6:36 PM

nilanjana_basu set the repository for this revision to rT test-suite. Oct 25 2022, 5:12 PM

Removed two test cases whose related patches are not yet available.

Harbormaster completed remote builds in B195341: Diff 472112. Oct 31 2022, 1:07 PM

nilanjana_basu edited the summary of this revision. Oct 31 2022, 1:15 PM

nilanjana_basu added reviewers: paquette, t.p.northover.

nilanjana_basu edited the summary of this revision.

All the comments have been addressed in the latest patch.

Removed the addition operation to keep only the truncate or zero-extend operation for a more focused performance comparison

Harbormaster completed remote builds in B195605: Diff 472476. Nov 1 2022, 6:42 PM

I think this looks reasonable now.

Minor fix to comments

Harbormaster completed remote builds in B195760: Diff 472696. Nov 2 2022, 10:54 AM

This revision was not accepted when it landed; it landed in state Needs Review. Nov 2 2022, 2:05 PM

Closed by commit rT3b44b6bdd3e8: [MicroBenchmarks] Add benchmarks to check runtime of truncate or zero-extend… (authored by nilanjana_basu).

This revision was automatically updated to reflect the committed changes.

nilanjana_basu added a commit: rT3b44b6bdd3e8: [MicroBenchmarks] Add benchmarks to check runtime of truncate or zero-extend….

nilanjana_basu mentioned this in rG955c0f13cd70: [AArch64] Extending lowering of 'zext <Y x i8> %x to <Y x i8X>' to use tbl…. Dec 9 2022, 12:51 AM

nilanjana_basu mentioned this in rG02d09ffc1b09: [AArch64] Extending lowering of 'trunc <(8|16) x i64> %x to <(8|16) x i8>' to…. Dec 15 2022, 7:21 AM

This is an archive of the discontinued LLVM Phabricator instance.

Microbenchmark to test runtime of truncate or zero-extend vector operations in AArch64
ClosedPublic

Revision Contents

Diff 468974

MicroBenchmarks/LoopVectorization/RuntimeChecks.cpp
