It is advised to read the post motivating the creation of __builtin_memcpy_inline first.
The patch focuses on static library but allows creation of several implementations depending on cpu features. The default implementation will be optimized for the host capabilities.
Currently the use of rep movsb is disabled but we plan to unable it via CMake options.
This implementation is mainly tested on clang but should compile with GCC as well. For now it doesn't build on MSVC.
llvm-nm (the nm that ships with MacOS) for mach-o files prints just the symbol and not its value or type character so grepping for U wouldn't work here (in this very niche case that I just happened to test). If we're using --undefined-only we could just grep . perhaps?