arch64: Add SIMD version of memcpy

Create a new memcpy implementation for targets with the NEON extension.

__memcpy_aarch64_simd has been tested on a range of modern
microarchitectures. It turned out to be faster than __memcpy_aarch64 on
all of them, with a performance improvement of 3-11% depending on the
platform.
4 files changed
tree: e059586d3cfeb64d0424c856a0b0568c82cd6cc3
  1. math/
  2. string/
  3. .gitignore
  4. config.mk.dist
  5. LICENSE
  6. Makefile
  7. README