pl/math: Add vector/Neon atanf

Successfully ran tests and benchmarks. New routine is accurate to 3 ulps.
8 files changed