fix compilation with clang 3.9, fix performance with pset1, use vector operators instead of intrinsics in some cases
3 files changed