On 15 April 2013 11:06, Måns Rullgård wrote:

Hi Måns,

>> Add a high performance memcpy routine optimized for Cortex-A15 with
>> variants for use in the presence of NEON and VFP hardware, selected
>> at runtime using indirect function support.
>
> How does this perform on Cortex-A9?

The code is also faster on A9, although the gains are not quite as
pronounced. A set of numbers is attached (they linewrap pretty
horribly inline).

--
Will Newton
Toolchain Working Group, Linaro