ip: improve csum fold on x86_64