use intel intrinsics in clib_memcpy64_x4