From: "Paul A. Clarke" <pc@us.ibm.com>
To: gcc-patches@gcc.gnu.org
Cc: segher@kernel.crashing.org
Subject: Re: [PATCH v3 0/6] rs6000: Support more SSE4 intrinsics
Date: Mon, 4 Oct 2021 13:26:30 -0500
Message-ID: <20211004182630.GA2081132@li-24c3614c-2adc-11b2-a85c-85f334518bdb.ibm.com>
In-Reply-To: <20210916145939.GB4498@li-24c3614c-2adc-11b2-a85c-85f334518bdb.ibm.com>

Ping.

On Thu, Sep 16, 2021 at 09:59:39AM -0500, Paul A. Clarke via Gcc-patches wrote:
> Ping.
>
> On Mon, Aug 23, 2021 at 02:03:04PM -0500, Paul A. Clarke via Gcc-patches wrote:
> > v3: Add "nmmintrin.h". _mm_cmpgt_epi64 is part of SSE4.2
> > and users will expect to be able to include "nmmintrin.h",
> > even though "nmmintrin.h" just includes "smmintrin.h"
> > where all of the SSE4.2 implementations actually appear.
> >
> > Only patch 5/6 changed from v2.
> >
> > Tested ppc64le (POWER9) and ppc64/32 (POWER7).
> >
> > OK for trunk?
> >
> > Paul A. Clarke (6):
> > rs6000: Support SSE4.1 "round" intrinsics
> > rs6000: Support SSE4.1 "min" and "max" intrinsics
> > rs6000: Simplify some SSE4.1 "test" intrinsics
> > rs6000: Support SSE4.1 "cvt" intrinsics
> > rs6000: Support more SSE4 "cmp", "mul", "pack" intrinsics
> > rs6000: Guard some x86 intrinsics implementations
> >
> > gcc/config/rs6000/emmintrin.h | 12 +-
> > gcc/config/rs6000/nmmintrin.h | 40 ++
> > gcc/config/rs6000/pmmintrin.h | 4 +
> > gcc/config/rs6000/smmintrin.h | 427 ++++++++++++++++--
> > gcc/config/rs6000/tmmintrin.h | 12 +
> > gcc/testsuite/gcc.target/powerpc/pr78102.c | 23 +
> > .../gcc.target/powerpc/sse4_1-packusdw.c | 73 +++
> > .../gcc.target/powerpc/sse4_1-pcmpeqq.c | 46 ++
> > .../gcc.target/powerpc/sse4_1-pmaxsb.c | 46 ++
> > .../gcc.target/powerpc/sse4_1-pmaxsd.c | 46 ++
> > .../gcc.target/powerpc/sse4_1-pmaxud.c | 47 ++
> > .../gcc.target/powerpc/sse4_1-pmaxuw.c | 47 ++
> > .../gcc.target/powerpc/sse4_1-pminsb.c | 46 ++
> > .../gcc.target/powerpc/sse4_1-pminsd.c | 46 ++
> > .../gcc.target/powerpc/sse4_1-pminud.c | 47 ++
> > .../gcc.target/powerpc/sse4_1-pminuw.c | 47 ++
> > .../gcc.target/powerpc/sse4_1-pmovsxbd.c | 42 ++
> > .../gcc.target/powerpc/sse4_1-pmovsxbq.c | 42 ++
> > .../gcc.target/powerpc/sse4_1-pmovsxbw.c | 42 ++
> > .../gcc.target/powerpc/sse4_1-pmovsxdq.c | 42 ++
> > .../gcc.target/powerpc/sse4_1-pmovsxwd.c | 42 ++
> > .../gcc.target/powerpc/sse4_1-pmovsxwq.c | 42 ++
> > .../gcc.target/powerpc/sse4_1-pmovzxbd.c | 43 ++
> > .../gcc.target/powerpc/sse4_1-pmovzxbq.c | 43 ++
> > .../gcc.target/powerpc/sse4_1-pmovzxbw.c | 43 ++
> > .../gcc.target/powerpc/sse4_1-pmovzxdq.c | 43 ++
> > .../gcc.target/powerpc/sse4_1-pmovzxwd.c | 43 ++
> > .../gcc.target/powerpc/sse4_1-pmovzxwq.c | 43 ++
> > .../gcc.target/powerpc/sse4_1-pmuldq.c | 51 +++
> > .../gcc.target/powerpc/sse4_1-pmulld.c | 46 ++
> > .../gcc.target/powerpc/sse4_1-round3.h | 81 ++++
> > .../gcc.target/powerpc/sse4_1-roundpd.c | 143 ++++++
> > .../gcc.target/powerpc/sse4_1-roundps.c | 98 ++++
> > .../gcc.target/powerpc/sse4_1-roundsd.c | 256 +++++++++++
> > .../gcc.target/powerpc/sse4_1-roundss.c | 208 +++++++++
> > .../gcc.target/powerpc/sse4_2-check.h | 18 +
> > .../gcc.target/powerpc/sse4_2-pcmpgtq.c | 46 ++
> > 37 files changed, 2407 insertions(+), 59 deletions(-)
> > create mode 100644 gcc/config/rs6000/nmmintrin.h
> > create mode 100644 gcc/testsuite/gcc.target/powerpc/pr78102.c
> > create mode 100644 gcc/testsuite/gcc.target/powerpc/sse4_1-packusdw.c
> > create mode 100644 gcc/testsuite/gcc.target/powerpc/sse4_1-pcmpeqq.c
> > create mode 100644 gcc/testsuite/gcc.target/powerpc/sse4_1-pmaxsb.c
> > create mode 100644 gcc/testsuite/gcc.target/powerpc/sse4_1-pmaxsd.c
> > create mode 100644 gcc/testsuite/gcc.target/powerpc/sse4_1-pmaxud.c
> > create mode 100644 gcc/testsuite/gcc.target/powerpc/sse4_1-pmaxuw.c
> > create mode 100644 gcc/testsuite/gcc.target/powerpc/sse4_1-pminsb.c
> > create mode 100644 gcc/testsuite/gcc.target/powerpc/sse4_1-pminsd.c
> > create mode 100644 gcc/testsuite/gcc.target/powerpc/sse4_1-pminud.c
> > create mode 100644 gcc/testsuite/gcc.target/powerpc/sse4_1-pminuw.c
> > create mode 100644 gcc/testsuite/gcc.target/powerpc/sse4_1-pmovsxbd.c
> > create mode 100644 gcc/testsuite/gcc.target/powerpc/sse4_1-pmovsxbq.c
> > create mode 100644 gcc/testsuite/gcc.target/powerpc/sse4_1-pmovsxbw.c
> > create mode 100644 gcc/testsuite/gcc.target/powerpc/sse4_1-pmovsxdq.c
> > create mode 100644 gcc/testsuite/gcc.target/powerpc/sse4_1-pmovsxwd.c
> > create mode 100644 gcc/testsuite/gcc.target/powerpc/sse4_1-pmovsxwq.c
> > create mode 100644 gcc/testsuite/gcc.target/powerpc/sse4_1-pmovzxbd.c
> > create mode 100644 gcc/testsuite/gcc.target/powerpc/sse4_1-pmovzxbq.c
> > create mode 100644 gcc/testsuite/gcc.target/powerpc/sse4_1-pmovzxbw.c
> > create mode 100644 gcc/testsuite/gcc.target/powerpc/sse4_1-pmovzxdq.c
> > create mode 100644 gcc/testsuite/gcc.target/powerpc/sse4_1-pmovzxwd.c
> > create mode 100644 gcc/testsuite/gcc.target/powerpc/sse4_1-pmovzxwq.c
> > create mode 100644 gcc/testsuite/gcc.target/powerpc/sse4_1-pmuldq.c
> > create mode 100644 gcc/testsuite/gcc.target/powerpc/sse4_1-pmulld.c
> > create mode 100644 gcc/testsuite/gcc.target/powerpc/sse4_1-round3.h
> > create mode 100644 gcc/testsuite/gcc.target/powerpc/sse4_1-roundpd.c
> > create mode 100644 gcc/testsuite/gcc.target/powerpc/sse4_1-roundps.c
> > create mode 100644 gcc/testsuite/gcc.target/powerpc/sse4_1-roundsd.c
> > create mode 100644 gcc/testsuite/gcc.target/powerpc/sse4_1-roundss.c
> > create mode 100644 gcc/testsuite/gcc.target/powerpc/sse4_2-check.h
> > create mode 100644 gcc/testsuite/gcc.target/powerpc/sse4_2-pcmpgtq.c
> >
> > --
> > 2.27.0
> >
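
Note on the v3 cover letter above: it names _mm_cmpgt_epi64 as the SSE4.2
intrinsic that motivates adding "nmmintrin.h". As a rough illustration of
what that intrinsic is documented to compute on x86 (a signed greater-than
comparison on each of two 64-bit lanes, giving an all-ones or all-zero mask
per lane), here is a scalar sketch in plain C. It is not the implementation
from patch 5/6, which is not shown in this message. The names v2di_model and
model_cmpgt_epi64 are hypothetical and exist only for this example.

```c
#include <stdint.h>

/* Hypothetical stand-in for a 128-bit vector holding two signed
   64-bit lanes; the real intrinsic operates on __m128i. */
typedef struct {
    int64_t lane[2];
} v2di_model;

/* Scalar model of the documented x86 semantics of _mm_cmpgt_epi64:
   each result lane is all ones (-1) when a > b as signed 64-bit
   integers, and 0 otherwise.  This is an assumption-labeled sketch,
   not the rs6000 code from the patch series. */
static v2di_model model_cmpgt_epi64(v2di_model a, v2di_model b)
{
    v2di_model r;
    for (int i = 0; i < 2; i++)
        r.lane[i] = (a.lane[i] > b.lane[i]) ? -1 : 0;
    return r;
}
```

A model like this can be useful as a reference when checking a vector
implementation lane by lane, which is broadly how a test such as
sse4_2-pcmpgtq.c would be expected to work, although that test's contents
are not shown here.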
Thread overview: 47+ messages
2021-08-23 19:03 Paul A. Clarke
2021-08-23 19:03 ` [PATCH v3 1/6] rs6000: Support SSE4.1 "round" intrinsics Paul A. Clarke
2021-08-27 13:44 ` Bill Schmidt
2021-08-27 13:47 ` Bill Schmidt
2021-08-30 21:16 ` Paul A. Clarke
2021-08-30 21:24 ` Bill Schmidt
2021-10-07 23:08 ` Segher Boessenkool
2021-10-07 23:39 ` Segher Boessenkool
2021-10-08 1:04 ` Paul A. Clarke
2021-10-08 17:39 ` Segher Boessenkool
2021-10-08 19:27 ` Paul A. Clarke
2021-10-08 22:31 ` Segher Boessenkool
2021-10-11 13:46 ` Paul A. Clarke
2021-10-11 16:28 ` Segher Boessenkool
2021-10-11 17:31 ` Paul A. Clarke
2021-10-11 22:04 ` Segher Boessenkool
2021-10-12 19:35 ` Paul A. Clarke
2021-10-12 22:25 ` Segher Boessenkool
2021-10-19 0:36 ` Paul A. Clarke
2021-08-23 19:03 ` [PATCH v3 2/6] rs6000: Support SSE4.1 "min" and "max" intrinsics Paul A. Clarke
2021-08-27 13:47 ` Bill Schmidt
2021-10-11 19:28 ` Segher Boessenkool
2021-10-12 1:42 ` [COMMITTED v4 " Paul A. Clarke
2021-08-23 19:03 ` [PATCH v3 3/6] rs6000: Simplify some SSE4.1 "test" intrinsics Paul A. Clarke
2021-08-27 13:48 ` Bill Schmidt
2021-10-11 20:50 ` Segher Boessenkool
2021-10-12 1:47 ` [COMMITTED v4 " Paul A. Clarke
2021-08-23 19:03 ` [PATCH v3 4/6] rs6000: Support SSE4.1 "cvt" intrinsics Paul A. Clarke
2021-08-27 13:49 ` Bill Schmidt
2021-10-11 21:52 ` Segher Boessenkool
2021-10-12 1:51 ` [COMMITTED v4 " Paul A. Clarke
2021-08-23 19:03 ` [PATCH v3 5/6] rs6000: Support more SSE4 "cmp", "mul", "pack" intrinsics Paul A. Clarke
2021-08-27 15:21 ` Bill Schmidt
2021-08-27 18:52 ` Paul A. Clarke
2021-10-11 23:07 ` Segher Boessenkool
2021-10-12 1:55 ` [COMMITTED v4 " Paul A. Clarke
2021-08-23 19:03 ` [PATCH v3 6/6] rs6000: Guard some x86 intrinsics implementations Paul A. Clarke
2021-08-27 15:25 ` Bill Schmidt
2021-10-12 0:11 ` Segher Boessenkool
2021-10-13 17:04 ` Paul A. Clarke
2021-10-13 23:47 ` Segher Boessenkool
2021-10-19 0:26 ` Paul A. Clarke
2021-09-16 14:59 ` [PATCH v3 0/6] rs6000: Support more SSE4 intrinsics Paul A. Clarke
2021-10-04 18:26 ` Paul A. Clarke [this message]
2021-10-07 22:25 ` Segher Boessenkool
2021-10-08 0:29 ` Paul A. Clarke
2021-10-12 0:15 ` Segher Boessenkool