Merge "Initial SSE2 function fdst4_sse2()." into nextgenv2