libOpus: fix OOB read in SSE4.1 correlation kernel

A few SIMD functions read 16 bytes at a time, which can
lead to out-of-bounds (OOB) reads on some stack buffers
allocated with ALLOC(). To avoid these OOB reads,
ALLOC() now allocates 16 additional bytes.
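
The bounds arithmetic behind the padding can be sketched as follows.
This is only an illustration, not the code in this change:
SIMD_LOAD_BYTES, padded_alloc_size() and load_in_bounds() are
hypothetical names, and the real ALLOC() macro is not reproduced here.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical constant: width in bytes of one SIMD load. */
#define SIMD_LOAD_BYTES 16

/* Hypothetical stand-in for the padded allocation size: room for n
 * elements plus one extra full-width load past the end. */
static size_t padded_alloc_size(size_t n, size_t elem_size)
{
    return n * elem_size + SIMD_LOAD_BYTES;
}

/* Returns nonzero if a 16-byte load starting at offset_bytes stays
 * inside an allocation of alloc_bytes. */
static int load_in_bounds(size_t alloc_bytes, size_t offset_bytes)
{
    return offset_bytes + SIMD_LOAD_BYTES <= alloc_bytes;
}
```

With the extra 16 bytes, a full-width load that starts at any valid
element offset ends inside the allocation, even when it starts at the
last element of the buffer.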

Bug: 191352053

Test: poc in bug description
Test: atest VtsHalMediaC2V1_0TargetAudioDecTest
Test: atest VtsHalMediaC2V1_0TargetAudioEncTest

Change-Id: I4da2840844d60f251dd7a222f51d508e4eb8749f
(cherry picked from commit 878bdeb38043407869c684fb73708b04e8fe0ce4)
2 files changed