This is an archive of the discontinued LLVM Phabricator instance.

[NFC][libc] rearrange aarch64 memset code to better match new implementation
ClosedPublic

Authored by gchatelet on Aug 18 2022, 2:41 AM.

Diff Detail

Event Timeline

gchatelet created this revision.Aug 18 2022, 2:41 AM
Herald added projects: Restricted Project, Restricted Project. · View Herald TranscriptAug 18 2022, 2:41 AM
gchatelet requested review of this revision.Aug 18 2022, 2:41 AM
courbet accepted this revision.Aug 19 2022, 6:54 AM
courbet added inline comments.
libc/src/string/memory_utils/memset_implementations.h
95

This is magical. Maybe at least spell it 64 + 32 ?

103–105

ditto.

This revision is now accepted and ready to land.Aug 19 2022, 6:54 AM
gchatelet updated this revision to Diff 453987.Aug 19 2022, 7:10 AM
gchatelet marked an inline comment as done.
  • address comment
gchatelet marked an inline comment as done.Aug 19 2022, 7:10 AM
gchatelet added a subscriber: avieira.
gchatelet added inline comments.
libc/src/string/memory_utils/memset_implementations.h
103–105

@avieira how did you come up with 448 here?

avieira added inline comments.Aug 19 2022, 7:17 AM
libc/src/string/memory_utils/memset_implementations.h
103–105

I believe I found this number through benchmarking a non ZVA version with ZVA to see at what sizes (more or less) ZVA started to beat non ZVA. So 'Magical' is as good a way to describe it as I can come up with now too.

This revision was landed with ongoing or failed builds.Aug 19 2022, 7:48 AM
This revision was automatically updated to reflect the committed changes.