Aligned copy used to be 'destination aligned' on x86, but this decision was reverted in D93457, where we noticed that 'source aligned' was better for ARM.
Further benchmarking confirmed that aligning the copy to the destination can be up to 30% faster on x86. This patch offers both implementations and switches x86 back to destination aligned.
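To make the distinction concrete, here is a minimal sketch of a block copy that aligns on either pointer. It is not the code from this patch: the names `offset_to_next_aligned`, `AlignOn` and `copy_aligned_blocks` are hypothetical, and `std::memcpy` stands in for the fixed-size block copies a real implementation would use.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <cstring>

// Hypothetical helper: number of bytes needed to advance `ptr` to the next
// kAlign boundary (0 if it is already aligned).
template <size_t kAlign>
static size_t offset_to_next_aligned(const void *ptr) {
  static_assert((kAlign & (kAlign - 1)) == 0, "kAlign must be a power of two");
  const uintptr_t addr = reinterpret_cast<uintptr_t>(ptr);
  return (kAlign - (addr & (kAlign - 1))) & (kAlign - 1);
}

// Which pointer the block loop is aligned on (illustrative, not the patch's API).
enum class AlignOn { kSource, kDestination };

// Copies `count` bytes. A short head copy brings the chosen pointer to a
// kBlock boundary, then the loop moves whole blocks, then a tail copy
// finishes the remainder.
template <size_t kBlock, AlignOn kAlignOn>
static void copy_aligned_blocks(char *dst, const char *src, size_t count) {
  if (count <= kBlock) { // Too small for alignment to matter.
    std::memcpy(dst, src, count);
    return;
  }
  const size_t head = kAlignOn == AlignOn::kDestination
                          ? offset_to_next_aligned<kBlock>(dst)
                          : offset_to_next_aligned<kBlock>(src);
  assert(head < kBlock);
  std::memcpy(dst, src, head); // Unaligned head.
  size_t offset = head;
  // From here on, the chosen pointer plus `offset` is kBlock-aligned.
  for (; offset + kBlock <= count; offset += kBlock)
    std::memcpy(dst + offset, src + offset, kBlock);
  std::memcpy(dst + offset, src + offset, count - offset); // Tail.
}
```

With `AlignOn::kDestination` the stores in the loop are aligned and the loads may not be; `AlignOn::kSource` is the reverse. Which one wins is a per-target benchmarking question, as the summary above describes.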
Details
Diff Detail
- Repository
- rG LLVM Github Monorepo
Event Timeline
Note that the code in this patch will be replaced in D100646 where we provide a cleaner and more general implementation.
libc/test/src/string/memory_utils/memcpy_utils_test.cpp
Line 163: Should we have a test for CopyDstAlignedBlocks?