public inbox for gcc-patches@gcc.gnu.org
 help / color / mirror / Atom feed
* [PATCH v4 0/3] rs6000: Support more SSE4 intrinsics
@ 2021-10-19  1:15 Paul A. Clarke
  2021-10-19  1:15 ` [PATCH v4 1/3] rs6000: Add nmmintrin.h to extra_headers Paul A. Clarke
                   ` (2 more replies)
  0 siblings, 3 replies; 12+ messages in thread
From: Paul A. Clarke @ 2021-10-19  1:15 UTC (permalink / raw)
  To: segher; +Cc: gcc-patches, wschmidt

v4:
- Of original 6 patches in this series, I committed patches 2-5.
- Found an issue from v3. New file "nmmintrin.h" also needs to be added
to gcc/config.gcc "extra_headers".  Unfortunately, I discovered this
after committing the patch which added "nmmintrin.h", so I've added a
new patch here.
- Added scheduling "barriers" to patch 2 after review from Segher.
- Noted additional PR fixed by patch 3.

v3: Add "nmmintrin.h". _mm_cmpgt_epi64 is part of SSE4.2
and users will expect to be able to include "nmmintrin.h",
even though "nmmintrin.h" just includes "smmintrin.h"
where all of the SSE4.2 implementations actually appear.
Only patch 5/6 changed from v2.

Tested ppc64le (POWER9) and ppc64/32 (POWER7).

OK for trunk?

Paul A. Clarke (3):
  rs6000: Add nmmintrin.h to extra_headers
  rs6000: Support SSE4.1 "round" intrinsics
  rs6000: Guard some x86 intrinsics implementations

 gcc/config.gcc                                |   1 +
 gcc/config/rs6000/emmintrin.h                 |  12 +-
 gcc/config/rs6000/pmmintrin.h                 |   4 +
 gcc/config/rs6000/smmintrin.h                 | 296 ++++++++++++++----
 gcc/config/rs6000/tmmintrin.h                 |  12 +
 .../gcc.target/powerpc/sse4_1-round3.h        |  81 +++++
 .../gcc.target/powerpc/sse4_1-roundpd.c       | 143 +++++++++
 .../gcc.target/powerpc/sse4_1-roundps.c       |  98 ++++++
 .../gcc.target/powerpc/sse4_1-roundsd.c       | 256 +++++++++++++++
 .../gcc.target/powerpc/sse4_1-roundss.c       | 208 ++++++++++++
 .../gcc.target/powerpc/sse4_2-pcmpgtq.c       |   4 +-
 11 files changed, 1039 insertions(+), 76 deletions(-)
 create mode 100644 gcc/testsuite/gcc.target/powerpc/sse4_1-round3.h
 create mode 100644 gcc/testsuite/gcc.target/powerpc/sse4_1-roundpd.c
 create mode 100644 gcc/testsuite/gcc.target/powerpc/sse4_1-roundps.c
 create mode 100644 gcc/testsuite/gcc.target/powerpc/sse4_1-roundsd.c
 create mode 100644 gcc/testsuite/gcc.target/powerpc/sse4_1-roundss.c

-- 
2.27.0


^ permalink raw reply	[flat|nested] 12+ messages in thread

end of thread, other threads:[~2022-01-03 16:48 UTC | newest]

Thread overview: 12+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2021-10-19  1:15 [PATCH v4 0/3] rs6000: Support more SSE4 intrinsics Paul A. Clarke
2021-10-19  1:15 ` [PATCH v4 1/3] rs6000: Add nmmintrin.h to extra_headers Paul A. Clarke
2021-10-19 13:10   ` Bill Schmidt
2021-10-19 14:27     ` Segher Boessenkool
2021-10-19  1:15 ` [PATCH v4 2/3] rs6000: Support SSE4.1 "round" intrinsics Paul A. Clarke
2021-10-26 20:00   ` [PING PATCH " Paul A. Clarke
2021-11-08 17:40     ` [PING^2 " Paul A. Clarke
2021-11-19  2:24       ` [PING^3 " Paul A. Clarke
2022-01-03 16:48         ` [PING^4 " Paul A. Clarke
2021-10-19  1:15 ` [PATCH v4 3/3] rs6000: Guard some x86 intrinsics implementations Paul A. Clarke
2021-10-19 14:32   ` Segher Boessenkool
2021-10-19 15:23     ` Paul A. Clarke

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).