public inbox for libc-alpha@sourceware.org
* [PATCH 0/2] aarch64,falkor: memcpy/memmove performance improvements
@ 2018-05-03 17:52 Siddhesh Poyarekar
From: Siddhesh Poyarekar @ 2018-05-03 17:52 UTC (permalink / raw)
  To: libc-alpha

Hi,

Here are a couple of patches to improve performance of the falkor memcpy
and memmove implementations based on testing on the latest hardware.
The theme of the optimization is to avoid trying to train the hardware
prefetcher for smaller sizes and in the loop tail since that just
mis-trains the prefetcher.  Instead, use multiple registers to aid
reordering wherever possible.  Testing showed that these patches
resolve the regressions at these sizes relative to the generic memcpy.
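
To illustrate the "multiple registers" idea for small sizes, here is a
minimal C sketch.  It is not the code from these patches (those change
the assembly in memcpy_falkor.S and memmove_falkor.S); the helper name
copy_16_to_32 and the chunk16 type are hypothetical.  It assumes a copy
of 16 to 32 bytes and issues both loads into separate temporaries
before either store, so the two loads do not depend on each other and
the core is free to reorder them.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical 16-byte temporary, standing in for a register pair.  */
typedef struct { uint64_t lo, hi; } chunk16;

/* Hypothetical helper, not glibc code: copy n bytes, 16 <= n <= 32.
   The head and tail chunks may overlap in the middle when n < 32.  */
static void copy_16_to_32(void *dst, const void *src, size_t n)
{
    const unsigned char *s = src;
    unsigned char *d = dst;
    chunk16 head, tail;

    assert(n >= 16 && n <= 32);

    /* Both loads first, each into its own temporary.  */
    memcpy(&head, s, 16);
    memcpy(&tail, s + n - 16, 16);

    /* Then both stores.  All source bytes were read before any
       destination byte is written.  */
    memcpy(d, &head, 16);
    memcpy(d + n - 16, &tail, 16);
}
```

Because every load completes before the first store, the same sketch
also tolerates overlapping source and destination buffers, which is
the property a memmove tail needs.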

Siddhesh

Siddhesh Poyarekar (2):
  aarch64,falkor: Ignore prefetcher hints for memmove tail
  Ignore prefetcher tagging for smaller copies

 sysdeps/aarch64/multiarch/memcpy_falkor.S  | 68 ++++++++++++++++++------------
 sysdeps/aarch64/multiarch/memmove_falkor.S | 48 ++++++++++++---------
 2 files changed, 70 insertions(+), 46 deletions(-)

-- 
2.14.3


Thread overview: 6+ messages
2018-05-03 17:52 [PATCH 0/2] aarch64,falkor: memcpy/memmove performance improvements Siddhesh Poyarekar
2018-05-03 17:52 ` [PATCH 2/2] Ignore prefetcher tagging for smaller copies Siddhesh Poyarekar
2018-05-10 10:29   ` Szabolcs Nagy
2018-05-03 17:52 ` [PATCH 1/2] aarch64,falkor: Ignore prefetcher hints for memmove tail Siddhesh Poyarekar
2018-05-10 10:29   ` Szabolcs Nagy
2018-05-10  2:59 ` [PING][PATCH 0/2] aarch64,falkor: memcpy/memmove performance improvements Siddhesh Poyarekar
