string: Improve SVE memset

Improve SVE memset by using predicated load for sizes < 16. Unaligned memsets
are improved by ~20% on average for size 128-1024 by using aligned stores
for the last 64 bytes.
1 file changed