Add support for shuffle vector intrinsic via Neon in ARM