public inbox for libc-alpha@sourceware.org
 help / color / mirror / Atom feed
* [PATCH 0/1] LoongArch: Add optimized functions.
@ 2022-09-02  8:39 dengjianbo
  2022-09-02  8:39 ` [PATCH 1/1] " dengjianbo
  0 siblings, 1 reply; 6+ messages in thread
From: dengjianbo @ 2022-09-02  8:39 UTC (permalink / raw)
  To: adhemerval.zanella, libc-alpha, i.swmail
  Cc: joseph_myers, carlos, xuchenghua, caiyinyu, dengjianbo

Tested on LoongArch machine 3A5000: gcc 12.1.0, Linux kernel 5.19.0 rc5, binutils 2.38.50, All cases are passed besides ifunc related tests, no new failed entry was introduced.

configure:
../configure  --prefix=/usr CFLAGS="-O2"

make bench, comparing with the improved generic version, test results can be found in following link:
 https://github.com/jiadengx/glibc/tree/main/make_bench
strchr
 About 30% faster than improved generic version, detailed info can be found from strchr_xls.png and graph-strchr.
 https://github.com/jiadengx/glibc/blob/main/make_bench/graph-strchr
 https://github.com/jiadengx/glibc/blob/main/make_bench/strchr_xls.png
strchrnul
 About 30% faster than improved generic version, detailed info can be found from strchrnul_xls.png and graph-strchrnul.
 https://github.com/jiadengx/glibc/blob/main/make_bench/graph-strchrnul
 https://github.com/jiadengx/glibc/blob/main/make_bench/strchrnul_xls.png
strcmp
 About 10% - 60% faster than improved generic version, detailed info can be found from strcmp_xls.png and graph-strcmp.
 https://github.com/jiadengx/glibc/blob/main/make_bench/graph-strcmp
 https://github.com/jiadengx/glibc/blob/main/make_bench/strcmp_xls.png
strncmp
 About 0% - 80% faster than generic version, detailed info can be found from strncmp_xls.png and graph-strncmp.
 https://github.com/jiadengx/glibc/blob/main/make_bench/graph-strncmp
 https://github.com/jiadengx/glibc/blob/main/make_bench/strncmp_xls.png
memmove
 About 5% - 60% faster than improved generic version, detailed info can be found from memmove_xls.png and graph-memmove.
 https://github.com/jiadengx/glibc/blob/main/make_bench/graph-memmove
 https://github.com/jiadengx/glibc/blob/main/make_bench/memmove_xls.png
dengjianbo (1):
  LoongArch: Add optimized functions.

 sysdeps/loongarch/lp64/memmove.S   | 491 +++++++++++++++++++++++++++++
 sysdeps/loongarch/lp64/strchr.S    | 145 +++++++++
 sysdeps/loongarch/lp64/strchrnul.S | 160 ++++++++++
 sysdeps/loongarch/lp64/strcmp.S    | 210 ++++++++++++
 sysdeps/loongarch/lp64/strncmp.S   | 281 +++++++++++++++++
 5 files changed, 1287 insertions(+)
 create mode 100644 sysdeps/loongarch/lp64/memmove.S
 create mode 100644 sysdeps/loongarch/lp64/strchr.S
 create mode 100644 sysdeps/loongarch/lp64/strchrnul.S
 create mode 100644 sysdeps/loongarch/lp64/strcmp.S
 create mode 100644 sysdeps/loongarch/lp64/strncmp.S

-- 
2.20.1


^ permalink raw reply	[flat|nested] 6+ messages in thread
* [PATCH 0/1] LoongArch: Add optimized functions.
@ 2022-09-16  7:16 dengjianbo
  0 siblings, 0 replies; 6+ messages in thread
From: dengjianbo @ 2022-09-16  7:16 UTC (permalink / raw)
  To: adhemerval.zanella, libc-alpha, i.swmail
  Cc: joseph_myers, carlos, xuchenghua, caiyinyu, huangpei, dengjianbo

Tested on LoongArch machine 3A5000: gcc 12.1.0, Linux kernel 5.19.0 rc5, binutils 2.38.50, All cases are passed besides ifunc related tests, no new failed entry was introduced.

configure:
../configure  --prefix=/usr CFLAGS="-O2"

make bench:
comparing with improved generic string version resent in https://sourceware.org/pipermail/libc-alpha/2022-September/141833.html , test results can be found in following link:
https://github.com/jiadengx/glibc/tree/main/make_bench_v2

strchr
 About 30% faster than improved generic version, detailed info can be found from strchr_xls.png and graph-strchr.
https://github.com/jiadengx/glibc/blob/main/make_bench_v2/graph-strchr
https://github.com/jiadengx/glibc/blob/main/make_bench_v2/strchr_xls.png

strchrnul
 About 30% faster than improved generic version, detailed info can be found from strchrnul_xls.png and graph-strchrnul.
https://github.com/jiadengx/glibc/blob/main/make_bench_v2/graph-strchrnul
https://github.com/jiadengx/glibc/blob/main/make_bench_v2/strchrnul_xls.png

strcmp
 About 10% - 60% faster than improved generic version, detailed info can be found from strcmp_xls.png and graph-strcmp.
https://github.com/jiadengx/glibc/blob/main/make_bench_v2/graph-strcmp
https://github.com/jiadengx/glibc/blob/main/make_bench_v2/strcmp_xls.png

strncmp
 About 0% - 80% faster than generic version, detailed info can be found from strncmp_xls.png and graph-strncmp.
https://github.com/jiadengx/glibc/blob/main/make_bench_v2/graph-strncmp
https://github.com/jiadengx/glibc/blob/main/make_bench_v2/strncmp_xls.png

memmove
 About 5% - 60% faster than improved generic version, detailed info can be found from memmove_xls.png and graph-memmove
https://github.com/jiadengx/glibc/blob/main/make_bench_v2/graph-memmove
https://github.com/jiadengx/glibc/blob/main/make_bench_v2/memmove_xls.png


dengjianbo (1):
  LoongArch: Add optimized functions

 sysdeps/loongarch/lp64/memmove.S   | 485 +++++++++++++++++++++++++++++
 sysdeps/loongarch/lp64/strchr.S    | 124 ++++++++
 sysdeps/loongarch/lp64/strchrnul.S | 133 ++++++++
 sysdeps/loongarch/lp64/strcmp.S    | 207 ++++++++++++
 sysdeps/loongarch/lp64/strncmp.S   | 278 +++++++++++++++++
 5 files changed, 1227 insertions(+)
 create mode 100644 sysdeps/loongarch/lp64/memmove.S
 create mode 100644 sysdeps/loongarch/lp64/strchr.S
 create mode 100644 sysdeps/loongarch/lp64/strchrnul.S
 create mode 100644 sysdeps/loongarch/lp64/strcmp.S
 create mode 100644 sysdeps/loongarch/lp64/strncmp.S

-- 
2.31.1


^ permalink raw reply	[flat|nested] 6+ messages in thread

end of thread, other threads:[~2022-09-16  7:34 UTC | newest]

Thread overview: 6+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2022-09-02  8:39 [PATCH 0/1] LoongArch: Add optimized functions dengjianbo
2022-09-02  8:39 ` [PATCH 1/1] " dengjianbo
2022-09-02  9:35   ` Xi Ruoyao
2022-09-02  9:52     ` Xi Ruoyao
2022-09-16  7:34     ` dengjianbo
2022-09-16  7:16 [PATCH 0/1] " dengjianbo

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).