Hi, Similar to aarch64, this patch replaces call to builtin by dereferencing __a in vld1_p64, vld1_s64 and vld1_u64. The patch changes code-gen for the intrinsic as follows: Before patch: vld1.64 {d16}, [r0:64] vmov r0, r1, d16 @ int bx lr After patch: ldrd r0, [r0] bx lr I assume the code-gen after patch is correct, since it loads two consecutive words from [r0] into r0 and r1 ? Bootstrapped+tested on arm-linux-gnueabihf. OK to commit ? Thanks, Prathamesh