public inbox for gcc-patches@gcc.gnu.org
From: Kyrylo Tkachov <Kyrylo.Tkachov@arm.com>
To: Stam Markianos-Wright <Stam.Markianos-Wright@arm.com>,
	Andrea Corallo <Andrea.Corallo@arm.com>,
	"gcc-patches@gcc.gnu.org" <gcc-patches@gcc.gnu.org>
Cc: Richard Earnshaw <Richard.Earnshaw@arm.com>
Subject: RE: [PATCH 10/10] arm testsuite: Shifts and get_FPSCR ACLE optimisation fixes
Date: Wed, 3 May 2023 12:56:37 +0000
Message-ID: <PAXPR08MB69266207615B6BE7DD3E8EC6936C9@PAXPR08MB6926.eurprd08.prod.outlook.com>
In-Reply-To: <25eeca56-2d5e-07f6-704f-7163faebb5b1@arm.com>



> -----Original Message-----
> From: Stam Markianos-Wright <Stam.Markianos-Wright@arm.com>
> Sent: Wednesday, May 3, 2023 1:35 PM
> To: Kyrylo Tkachov <Kyrylo.Tkachov@arm.com>; Andrea Corallo
> <Andrea.Corallo@arm.com>; gcc-patches@gcc.gnu.org
> Cc: Richard Earnshaw <Richard.Earnshaw@arm.com>
> Subject: [PATCH 10/10] arm testsuite: Shifts and get_FPSCR ACLE optimisation
> fixes
> 
> Hi Kyrill,
> 
> On 28/04/2023 17:58, Kyrylo Tkachov wrote:
> >
> >> -----Original Message-----
> >> From: Andrea Corallo <andrea.corallo@arm.com>
> >> Sent: Friday, April 28, 2023 12:30 PM
> >> To: gcc-patches@gcc.gnu.org
> >> Cc: Kyrylo Tkachov <Kyrylo.Tkachov@arm.com>; Richard Earnshaw
> >> <Richard.Earnshaw@arm.com>; Stam Markianos-Wright
> >> <Stam.Markianos-Wright@arm.com>
> >> Subject: [PATCH 10/10] arm testsuite: Shifts and get_FPSCR ACLE
> >> optimisation fixes
> >>
> >> From: Stam Markianos-Wright <stam.markianos-wright@arm.com>
> >>
> >> These newly updated tests were rewritten by Andrea. Some of them
> >> needed further manual fixing as follows:
> >>
> >> * The #shift immediate value was not in the check-function-bodies as expected
> >> * Some shifts getting optimised to mov immediates, e.g.
> >>    `uqshll (1, 1);` -> movs    r0, #2; movs    r1, #0
> > Shouldn't this test be testing something that cannot be constant-folded
> > away? i.e. have non-constant arguments?
> > I think we should have conformance tests first and foremost, and follow-up
> > tests for such optimisations are welcome but should be added separately.
> 
> Ahh, good point! I think in that case I've removed these checks
> from here and put them into a new test (it's a bit trivial but I
> couldn't find anywhere else where we are doing this check with MVE
> instructions)
> 
> 
> Also, since this patch is the last one in this series, would the
> series be Ok for backporting to GCC13?

Yes, ok for backporting too.
Thanks,
Kyrill

> 
> Thank you!
> Stam
> 
> >
> >> * The ACLE was specifying sub-optimal code: lsr+and instead of ubfx. In
> >>    this case the test rewritten from the ACLE had the lsr+and pattern,
> >>    but the compiler was able to optimise to ubfx. Hence I've changed the
> >>    test to now match on ubfx.
> > That looks ok.
> > Thanks,
> > Kyrill
> >
> >> gcc/testsuite/ChangeLog:
> >>
> >>        * gcc.target/arm/mve/intrinsics/srshr.c: Update shift value.
> >>        * gcc.target/arm/mve/intrinsics/srshrl.c: Update shift value.
> >>        * gcc.target/arm/mve/intrinsics/uqshl.c: Update shift value and mov
> >> imm.
> >>        * gcc.target/arm/mve/intrinsics/uqshll.c: Update shift value and mov
> >> imm.
> >>        * gcc.target/arm/mve/intrinsics/urshr.c: Update shift value.
> >>        * gcc.target/arm/mve/intrinsics/urshrl.c: Update shift value.
> >>        * gcc.target/arm/mve/intrinsics/vadciq_m_s32.c: Update to ubfx.
> >>        * gcc.target/arm/mve/intrinsics/vadciq_m_u32.c: Update to ubfx.
> >>        * gcc.target/arm/mve/intrinsics/vadciq_s32.c: Update to ubfx.
> >>        * gcc.target/arm/mve/intrinsics/vadciq_u32.c: Update to ubfx.
> >>        * gcc.target/arm/mve/intrinsics/vadcq_m_s32.c: Update to ubfx.
> >>        * gcc.target/arm/mve/intrinsics/vadcq_m_u32.c: Update to ubfx.
> >>        * gcc.target/arm/mve/intrinsics/vadcq_s32.c: Update to ubfx.
> >>        * gcc.target/arm/mve/intrinsics/vadcq_u32.c: Update to ubfx.
> >>        * gcc.target/arm/mve/intrinsics/vsbciq_m_s32.c: Update to ubfx.
> >>        * gcc.target/arm/mve/intrinsics/vsbciq_m_u32.c: Update to ubfx.
> >>        * gcc.target/arm/mve/intrinsics/vsbciq_s32.c: Update to ubfx.
> >>        * gcc.target/arm/mve/intrinsics/vsbciq_u32.c: Update to ubfx.
> >>        * gcc.target/arm/mve/intrinsics/vsbcq_m_s32.c: Update to ubfx.
> >>        * gcc.target/arm/mve/intrinsics/vsbcq_m_u32.c: Update to ubfx.
> >>        * gcc.target/arm/mve/intrinsics/vsbcq_s32.c: Update to ubfx.
> >>        * gcc.target/arm/mve/intrinsics/vsbcq_u32.c: Update to ubfx.
> >> ---
> >>   gcc/testsuite/gcc.target/arm/mve/intrinsics/srshr.c       | 2 +-
> >>   gcc/testsuite/gcc.target/arm/mve/intrinsics/srshrl.c      | 2 +-
> >>   gcc/testsuite/gcc.target/arm/mve/intrinsics/uqshl.c       | 4 ++--
> >>   gcc/testsuite/gcc.target/arm/mve/intrinsics/uqshll.c      | 5 +++--
> >>   gcc/testsuite/gcc.target/arm/mve/intrinsics/urshr.c       | 4 ++--
> >>   gcc/testsuite/gcc.target/arm/mve/intrinsics/urshrl.c      | 4 ++--
> >>   .../gcc.target/arm/mve/intrinsics/vadciq_m_s32.c          | 8 ++------
> >>   .../gcc.target/arm/mve/intrinsics/vadciq_m_u32.c          | 8 ++------
> >>   gcc/testsuite/gcc.target/arm/mve/intrinsics/vadciq_s32.c  | 8 ++------
> >>   gcc/testsuite/gcc.target/arm/mve/intrinsics/vadciq_u32.c  | 8 ++------
> >>   gcc/testsuite/gcc.target/arm/mve/intrinsics/vadcq_m_s32.c | 8 ++------
> >>   gcc/testsuite/gcc.target/arm/mve/intrinsics/vadcq_m_u32.c | 8 ++------
> >>   gcc/testsuite/gcc.target/arm/mve/intrinsics/vadcq_s32.c   | 8 ++------
> >>   gcc/testsuite/gcc.target/arm/mve/intrinsics/vadcq_u32.c   | 8 ++------
> >>   .../gcc.target/arm/mve/intrinsics/vsbciq_m_s32.c          | 8 ++------
> >>   .../gcc.target/arm/mve/intrinsics/vsbciq_m_u32.c          | 8 ++------
> >>   gcc/testsuite/gcc.target/arm/mve/intrinsics/vsbciq_s32.c  | 8 ++------
> >>   gcc/testsuite/gcc.target/arm/mve/intrinsics/vsbciq_u32.c  | 8 ++------
> >>   gcc/testsuite/gcc.target/arm/mve/intrinsics/vsbcq_m_s32.c | 8 ++------
> >>   gcc/testsuite/gcc.target/arm/mve/intrinsics/vsbcq_m_u32.c | 8 ++------
> >>   gcc/testsuite/gcc.target/arm/mve/intrinsics/vsbcq_s32.c   | 8 ++------
> >>   gcc/testsuite/gcc.target/arm/mve/intrinsics/vsbcq_u32.c   | 8 ++------
> >>   22 files changed, 43 insertions(+), 106 deletions(-)
> >>
> >> diff --git a/gcc/testsuite/gcc.target/arm/mve/intrinsics/srshr.c
> >> b/gcc/testsuite/gcc.target/arm/mve/intrinsics/srshr.c
> >> index 94e3f42fd33..734375d58c0 100644
> >> --- a/gcc/testsuite/gcc.target/arm/mve/intrinsics/srshr.c
> >> +++ b/gcc/testsuite/gcc.target/arm/mve/intrinsics/srshr.c
> >> @@ -12,7 +12,7 @@ extern "C" {
> >>   /*
> >>   **foo:
> >>   **   ...
> >> -**   srshr   (?:ip|fp|r[0-9]+), #shift(?:    @.*|)
> >> +**   srshr   (?:ip|fp|r[0-9]+), #1(?:        @.*|)
> >>   **   ...
> >>   */
> >>   int32_t
> >> diff --git a/gcc/testsuite/gcc.target/arm/mve/intrinsics/srshrl.c
> >> b/gcc/testsuite/gcc.target/arm/mve/intrinsics/srshrl.c
> >> index 65f28ccbfde..a91943c38a0 100644
> >> --- a/gcc/testsuite/gcc.target/arm/mve/intrinsics/srshrl.c
> >> +++ b/gcc/testsuite/gcc.target/arm/mve/intrinsics/srshrl.c
> >> @@ -12,7 +12,7 @@ extern "C" {
> >>   /*
> >>   **foo:
> >>   **   ...
> >> -**   srshrl  (?:ip|fp|r[0-9]+), (?:ip|fp|r[0-9]+), #shift(?: @.*|)
> >> +**   srshrl  (?:ip|fp|r[0-9]+), (?:ip|fp|r[0-9]+), #1(?:     @.*|)
> >>   **   ...
> >>   */
> >>   int64_t
> >> diff --git a/gcc/testsuite/gcc.target/arm/mve/intrinsics/uqshl.c
> >> b/gcc/testsuite/gcc.target/arm/mve/intrinsics/uqshl.c
> >> index b23c9d97ba6..58aa7a61e42 100644
> >> --- a/gcc/testsuite/gcc.target/arm/mve/intrinsics/uqshl.c
> >> +++ b/gcc/testsuite/gcc.target/arm/mve/intrinsics/uqshl.c
> >> @@ -12,7 +12,7 @@ extern "C" {
> >>   /*
> >>   **foo:
> >>   **   ...
> >> -**   uqshl   (?:ip|fp|r[0-9]+), #shift(?:    @.*|)
> >> +**   uqshl   (?:ip|fp|r[0-9]+), #1(?:        @.*|)
> >>   **   ...
> >>   */
> >>   uint32_t
> >> @@ -24,7 +24,7 @@ foo (uint32_t value)
> >>   /*
> >>   **foo1:
> >>   **   ...
> >> -**   uqshl   (?:ip|fp|r[0-9]+), #shift(?:    @.*|)
> >> +**   movs    r0, #2
> >>   **   ...
> >>   */
> >>   uint32_t
> >> diff --git a/gcc/testsuite/gcc.target/arm/mve/intrinsics/uqshll.c
> >> b/gcc/testsuite/gcc.target/arm/mve/intrinsics/uqshll.c
> >> index 6a3d08eea75..5584544aaf7 100644
> >> --- a/gcc/testsuite/gcc.target/arm/mve/intrinsics/uqshll.c
> >> +++ b/gcc/testsuite/gcc.target/arm/mve/intrinsics/uqshll.c
> >> @@ -12,7 +12,7 @@ extern "C" {
> >>   /*
> >>   **foo:
> >>   **   ...
> >> -**   uqshll  (?:ip|fp|r[0-9]+), (?:ip|fp|r[0-9]+), #shift(?: @.*|)
> >> +**   uqshll  (?:ip|fp|r[0-9]+), (?:ip|fp|r[0-9]+), #1(?:     @.*|)
> >>   **   ...
> >>   */
> >>   uint64_t
> >> @@ -24,7 +24,8 @@ foo (uint64_t value)
> >>   /*
> >>   **foo1:
> >>   **   ...
> >> -**   uqshll  (?:ip|fp|r[0-9]+), (?:ip|fp|r[0-9]+), #shift(?: @.*|)
> >> +**   movs    r0, #2
> >> +**   movs    r1, #0
> >>   **   ...
> >>   */
> >>   uint64_t
> >> diff --git a/gcc/testsuite/gcc.target/arm/mve/intrinsics/urshr.c
> >> b/gcc/testsuite/gcc.target/arm/mve/intrinsics/urshr.c
> >> index 23afcb8da4c..ff97bf5c473 100644
> >> --- a/gcc/testsuite/gcc.target/arm/mve/intrinsics/urshr.c
> >> +++ b/gcc/testsuite/gcc.target/arm/mve/intrinsics/urshr.c
> >> @@ -12,7 +12,7 @@ extern "C" {
> >>   /*
> >>   **foo:
> >>   **   ...
> >> -**   urshr   (?:ip|fp|r[0-9]+), #shift(?:    @.*|)
> >> +**   urshr   (?:ip|fp|r[0-9]+), #1(?:        @.*|)
> >>   **   ...
> >>   */
> >>   uint32_t
> >> @@ -24,7 +24,7 @@ foo (uint32_t value)
> >>   /*
> >>   **foo1:
> >>   **   ...
> >> -**   urshr   (?:ip|fp|r[0-9]+), #shift(?:    @.*|)
> >> +**   urshr   (?:ip|fp|r[0-9]+), #1(?:        @.*|)
> >>   **   ...
> >>   */
> >>   uint32_t
> >> diff --git a/gcc/testsuite/gcc.target/arm/mve/intrinsics/urshrl.c
> >> b/gcc/testsuite/gcc.target/arm/mve/intrinsics/urshrl.c
> >> index 8014371f47f..ff6a69d300f 100644
> >> --- a/gcc/testsuite/gcc.target/arm/mve/intrinsics/urshrl.c
> >> +++ b/gcc/testsuite/gcc.target/arm/mve/intrinsics/urshrl.c
> >> @@ -12,7 +12,7 @@ extern "C" {
> >>   /*
> >>   **foo:
> >>   **   ...
> >> -**   urshrl  (?:ip|fp|r[0-9]+), (?:ip|fp|r[0-9]+), #shift(?: @.*|)
> >> +**   urshrl  (?:ip|fp|r[0-9]+), (?:ip|fp|r[0-9]+), #1(?:     @.*|)
> >>   **   ...
> >>   */
> >>   uint64_t
> >> @@ -24,7 +24,7 @@ foo (uint64_t value)
> >>   /*
> >>   **foo1:
> >>   **   ...
> >> -**   urshrl  (?:ip|fp|r[0-9]+), (?:ip|fp|r[0-9]+), #shift(?: @.*|)
> >> +**   urshrl  (?:ip|fp|r[0-9]+), (?:ip|fp|r[0-9]+), #1(?:     @.*|)
> >>   **   ...
> >>   */
> >>   uint64_t
> >> diff --git a/gcc/testsuite/gcc.target/arm/mve/intrinsics/vadciq_m_s32.c
> >> b/gcc/testsuite/gcc.target/arm/mve/intrinsics/vadciq_m_s32.c
> >> index b262bf94d39..a6a059a19e9 100644
> >> --- a/gcc/testsuite/gcc.target/arm/mve/intrinsics/vadciq_m_s32.c
> >> +++ b/gcc/testsuite/gcc.target/arm/mve/intrinsics/vadciq_m_s32.c
> >> @@ -20,9 +20,7 @@ extern "C" {
> >>   **   ...
> >>   **   vmrs    (?:ip|fp|r[0-9]+), FPSCR_nzcvqc(?:      @.*|)
> >>   **   ...
> >> -**   lsr     (?:ip|fp|r[0-9]+), #29(?:       @.*|)
> >> -**   ...
> >> -**   and     (?:ip|fp|r[0-9]+), #1(?:        @.*|)
> >> +**   ubfx    (?:ip|fp|r[0-9]+), (?:ip|fp|r[0-9]+), #29, #1(?:        @.*|)
> >>   **   ...
> >>   */
> >>   int32x4_t
> >> @@ -43,9 +41,7 @@ foo (int32x4_t inactive, int32x4_t a, int32x4_t b,
> >> unsigned *carry_out, mve_pred
> >>   **   ...
> >>   **   vmrs    (?:ip|fp|r[0-9]+), FPSCR_nzcvqc(?:      @.*|)
> >>   **   ...
> >> -**   lsr     (?:ip|fp|r[0-9]+), #29(?:       @.*|)
> >> -**   ...
> >> -**   and     (?:ip|fp|r[0-9]+), #1(?:        @.*|)
> >> +**   ubfx    (?:ip|fp|r[0-9]+), (?:ip|fp|r[0-9]+), #29, #1(?:        @.*|)
> >>   **   ...
> >>   */
> >>   int32x4_t
> >> diff --git a/gcc/testsuite/gcc.target/arm/mve/intrinsics/vadciq_m_u32.c
> >> b/gcc/testsuite/gcc.target/arm/mve/intrinsics/vadciq_m_u32.c
> >> index d349caed36a..942111339f0 100644
> >> --- a/gcc/testsuite/gcc.target/arm/mve/intrinsics/vadciq_m_u32.c
> >> +++ b/gcc/testsuite/gcc.target/arm/mve/intrinsics/vadciq_m_u32.c
> >> @@ -20,9 +20,7 @@ extern "C" {
> >>   **   ...
> >>   **   vmrs    (?:ip|fp|r[0-9]+), FPSCR_nzcvqc(?:      @.*|)
> >>   **   ...
> >> -**   lsr     (?:ip|fp|r[0-9]+), #29(?:       @.*|)
> >> -**   ...
> >> -**   and     (?:ip|fp|r[0-9]+), #1(?:        @.*|)
> >> +**   ubfx    (?:ip|fp|r[0-9]+), (?:ip|fp|r[0-9]+), #29, #1(?:        @.*|)
> >>   **   ...
> >>   */
> >>   uint32x4_t
> >> @@ -43,9 +41,7 @@ foo (uint32x4_t inactive, uint32x4_t a, uint32x4_t b,
> >> unsigned *carry_out, mve_p
> >>   **   ...
> >>   **   vmrs    (?:ip|fp|r[0-9]+), FPSCR_nzcvqc(?:      @.*|)
> >>   **   ...
> >> -**   lsr     (?:ip|fp|r[0-9]+), #29(?:       @.*|)
> >> -**   ...
> >> -**   and     (?:ip|fp|r[0-9]+), #1(?:        @.*|)
> >> +**   ubfx    (?:ip|fp|r[0-9]+), (?:ip|fp|r[0-9]+), #29, #1(?:        @.*|)
> >>   **   ...
> >>   */
> >>   uint32x4_t
> >> diff --git a/gcc/testsuite/gcc.target/arm/mve/intrinsics/vadciq_s32.c
> >> b/gcc/testsuite/gcc.target/arm/mve/intrinsics/vadciq_s32.c
> >> index 5166993a355..3b68bb6ac33 100644
> >> --- a/gcc/testsuite/gcc.target/arm/mve/intrinsics/vadciq_s32.c
> >> +++ b/gcc/testsuite/gcc.target/arm/mve/intrinsics/vadciq_s32.c
> >> @@ -16,9 +16,7 @@ extern "C" {
> >>   **   ...
> >>   **   vmrs    (?:ip|fp|r[0-9]+), FPSCR_nzcvqc(?:      @.*|)
> >>   **   ...
> >> -**   lsr     (?:ip|fp|r[0-9]+), #29(?:       @.*|)
> >> -**   ...
> >> -**   and     (?:ip|fp|r[0-9]+), #1(?:        @.*|)
> >> +**   ubfx    (?:ip|fp|r[0-9]+), (?:ip|fp|r[0-9]+), #29, #1(?:        @.*|)
> >>   **   ...
> >>   */
> >>   int32x4_t
> >> @@ -35,9 +33,7 @@ foo (int32x4_t a, int32x4_t b, unsigned *carry_out)
> >>   **   ...
> >>   **   vmrs    (?:ip|fp|r[0-9]+), FPSCR_nzcvqc(?:      @.*|)
> >>   **   ...
> >> -**   lsr     (?:ip|fp|r[0-9]+), #29(?:       @.*|)
> >> -**   ...
> >> -**   and     (?:ip|fp|r[0-9]+), #1(?:        @.*|)
> >> +**   ubfx    (?:ip|fp|r[0-9]+), (?:ip|fp|r[0-9]+), #29, #1(?:        @.*|)
> >>   **   ...
> >>   */
> >>   int32x4_t
> >> diff --git a/gcc/testsuite/gcc.target/arm/mve/intrinsics/vadciq_u32.c
> >> b/gcc/testsuite/gcc.target/arm/mve/intrinsics/vadciq_u32.c
> >> index 080bd61d238..82228491043 100644
> >> --- a/gcc/testsuite/gcc.target/arm/mve/intrinsics/vadciq_u32.c
> >> +++ b/gcc/testsuite/gcc.target/arm/mve/intrinsics/vadciq_u32.c
> >> @@ -16,9 +16,7 @@ extern "C" {
> >>   **   ...
> >>   **   vmrs    (?:ip|fp|r[0-9]+), FPSCR_nzcvqc(?:      @.*|)
> >>   **   ...
> >> -**   lsr     (?:ip|fp|r[0-9]+), #29(?:       @.*|)
> >> -**   ...
> >> -**   and     (?:ip|fp|r[0-9]+), #1(?:        @.*|)
> >> +**   ubfx    (?:ip|fp|r[0-9]+), (?:ip|fp|r[0-9]+), #29, #1(?:        @.*|)
> >>   **   ...
> >>   */
> >>   uint32x4_t
> >> @@ -35,9 +33,7 @@ foo (uint32x4_t a, uint32x4_t b, unsigned *carry_out)
> >>   **   ...
> >>   **   vmrs    (?:ip|fp|r[0-9]+), FPSCR_nzcvqc(?:      @.*|)
> >>   **   ...
> >> -**   lsr     (?:ip|fp|r[0-9]+), #29(?:       @.*|)
> >> -**   ...
> >> -**   and     (?:ip|fp|r[0-9]+), #1(?:        @.*|)
> >> +**   ubfx    (?:ip|fp|r[0-9]+), (?:ip|fp|r[0-9]+), #29, #1(?:        @.*|)
> >>   **   ...
> >>   */
> >>   uint32x4_t
> >> diff --git a/gcc/testsuite/gcc.target/arm/mve/intrinsics/vadcq_m_s32.c
> >> b/gcc/testsuite/gcc.target/arm/mve/intrinsics/vadcq_m_s32.c
> >> index 45e6ff03623..0d4cb779254 100644
> >> --- a/gcc/testsuite/gcc.target/arm/mve/intrinsics/vadcq_m_s32.c
> >> +++ b/gcc/testsuite/gcc.target/arm/mve/intrinsics/vadcq_m_s32.c
> >> @@ -26,9 +26,7 @@ extern "C" {
> >>   **   ...
> >>   **   vmrs    (?:ip|fp|r[0-9]+), FPSCR_nzcvqc(?:      @.*|)
> >>   **   ...
> >> -**   lsr     (?:ip|fp|r[0-9]+), #29(?:       @.*|)
> >> -**   ...
> >> -**   and     (?:ip|fp|r[0-9]+), #1(?:        @.*|)
> >> +**   ubfx    (?:ip|fp|r[0-9]+), (?:ip|fp|r[0-9]+), #29, #1(?:        @.*|)
> >>   **   ...
> >>   */
> >>   int32x4_t
> >> @@ -55,9 +53,7 @@ foo (int32x4_t inactive, int32x4_t a, int32x4_t b,
> >> unsigned *carry, mve_pred16_t
> >>   **   ...
> >>   **   vmrs    (?:ip|fp|r[0-9]+), FPSCR_nzcvqc(?:      @.*|)
> >>   **   ...
> >> -**   lsr     (?:ip|fp|r[0-9]+), #29(?:       @.*|)
> >> -**   ...
> >> -**   and     (?:ip|fp|r[0-9]+), #1(?:        @.*|)
> >> +**   ubfx    (?:ip|fp|r[0-9]+), (?:ip|fp|r[0-9]+), #29, #1(?:        @.*|)
> >>   **   ...
> >>   */
> >>   int32x4_t
> >> diff --git a/gcc/testsuite/gcc.target/arm/mve/intrinsics/vadcq_m_u32.c
> >> b/gcc/testsuite/gcc.target/arm/mve/intrinsics/vadcq_m_u32.c
> >> index 54f141b2093..a0ba6825d8c 100644
> >> --- a/gcc/testsuite/gcc.target/arm/mve/intrinsics/vadcq_m_u32.c
> >> +++ b/gcc/testsuite/gcc.target/arm/mve/intrinsics/vadcq_m_u32.c
> >> @@ -26,9 +26,7 @@ extern "C" {
> >>   **   ...
> >>   **   vmrs    (?:ip|fp|r[0-9]+), FPSCR_nzcvqc(?:      @.*|)
> >>   **   ...
> >> -**   lsr     (?:ip|fp|r[0-9]+), #29(?:       @.*|)
> >> -**   ...
> >> -**   and     (?:ip|fp|r[0-9]+), #1(?:        @.*|)
> >> +**   ubfx    (?:ip|fp|r[0-9]+), (?:ip|fp|r[0-9]+), #29, #1(?:        @.*|)
> >>   **   ...
> >>   */
> >>   uint32x4_t
> >> @@ -55,9 +53,7 @@ foo (uint32x4_t inactive, uint32x4_t a, uint32x4_t b,
> >> unsigned *carry, mve_pred1
> >>   **   ...
> >>   **   vmrs    (?:ip|fp|r[0-9]+), FPSCR_nzcvqc(?:      @.*|)
> >>   **   ...
> >> -**   lsr     (?:ip|fp|r[0-9]+), #29(?:       @.*|)
> >> -**   ...
> >> -**   and     (?:ip|fp|r[0-9]+), #1(?:        @.*|)
> >> +**   ubfx    (?:ip|fp|r[0-9]+), (?:ip|fp|r[0-9]+), #29, #1(?:        @.*|)
> >>   **   ...
> >>   */
> >>   uint32x4_t
> >> diff --git a/gcc/testsuite/gcc.target/arm/mve/intrinsics/vadcq_s32.c
> >> b/gcc/testsuite/gcc.target/arm/mve/intrinsics/vadcq_s32.c
> >> index 06d5bae09da..47f5f22dde9 100644
> >> --- a/gcc/testsuite/gcc.target/arm/mve/intrinsics/vadcq_s32.c
> >> +++ b/gcc/testsuite/gcc.target/arm/mve/intrinsics/vadcq_s32.c
> >> @@ -22,9 +22,7 @@ extern "C" {
> >>   **   ...
> >>   **   vmrs    (?:ip|fp|r[0-9]+), FPSCR_nzcvqc(?:      @.*|)
> >>   **   ...
> >> -**   lsr     (?:ip|fp|r[0-9]+), #29(?:       @.*|)
> >> -**   ...
> >> -**   and     (?:ip|fp|r[0-9]+), #1(?:        @.*|)
> >> +**   ubfx    (?:ip|fp|r[0-9]+), (?:ip|fp|r[0-9]+), #29, #1(?:        @.*|)
> >>   **   ...
> >>   */
> >>   int32x4_t
> >> @@ -47,9 +45,7 @@ foo (int32x4_t a, int32x4_t b, unsigned *carry)
> >>   **   ...
> >>   **   vmrs    (?:ip|fp|r[0-9]+), FPSCR_nzcvqc(?:      @.*|)
> >>   **   ...
> >> -**   lsr     (?:ip|fp|r[0-9]+), #29(?:       @.*|)
> >> -**   ...
> >> -**   and     (?:ip|fp|r[0-9]+), #1(?:        @.*|)
> >> +**   ubfx    (?:ip|fp|r[0-9]+), (?:ip|fp|r[0-9]+), #29, #1(?:        @.*|)
> >>   **   ...
> >>   */
> >>   int32x4_t
> >> diff --git a/gcc/testsuite/gcc.target/arm/mve/intrinsics/vadcq_u32.c
> >> b/gcc/testsuite/gcc.target/arm/mve/intrinsics/vadcq_u32.c
> >> index e2111cfd16a..55a961be217 100644
> >> --- a/gcc/testsuite/gcc.target/arm/mve/intrinsics/vadcq_u32.c
> >> +++ b/gcc/testsuite/gcc.target/arm/mve/intrinsics/vadcq_u32.c
> >> @@ -22,9 +22,7 @@ extern "C" {
> >>   **   ...
> >>   **   vmrs    (?:ip|fp|r[0-9]+), FPSCR_nzcvqc(?:      @.*|)
> >>   **   ...
> >> -**   lsr     (?:ip|fp|r[0-9]+), #29(?:       @.*|)
> >> -**   ...
> >> -**   and     (?:ip|fp|r[0-9]+), #1(?:        @.*|)
> >> +**   ubfx    (?:ip|fp|r[0-9]+), (?:ip|fp|r[0-9]+), #29, #1(?:        @.*|)
> >>   **   ...
> >>   */
> >>   uint32x4_t
> >> @@ -47,9 +45,7 @@ foo (uint32x4_t a, uint32x4_t b, unsigned *carry)
> >>   **   ...
> >>   **   vmrs    (?:ip|fp|r[0-9]+), FPSCR_nzcvqc(?:      @.*|)
> >>   **   ...
> >> -**   lsr     (?:ip|fp|r[0-9]+), #29(?:       @.*|)
> >> -**   ...
> >> -**   and     (?:ip|fp|r[0-9]+), #1(?:        @.*|)
> >> +**   ubfx    (?:ip|fp|r[0-9]+), (?:ip|fp|r[0-9]+), #29, #1(?:        @.*|)
> >>   **   ...
> >>   */
> >>   uint32x4_t
> >> diff --git a/gcc/testsuite/gcc.target/arm/mve/intrinsics/vsbciq_m_s32.c
> >> b/gcc/testsuite/gcc.target/arm/mve/intrinsics/vsbciq_m_s32.c
> >> index 66a5c4c9da3..dcbaef1a571 100644
> >> --- a/gcc/testsuite/gcc.target/arm/mve/intrinsics/vsbciq_m_s32.c
> >> +++ b/gcc/testsuite/gcc.target/arm/mve/intrinsics/vsbciq_m_s32.c
> >> @@ -20,9 +20,7 @@ extern "C" {
> >>   **   ...
> >>   **   vmrs    (?:ip|fp|r[0-9]+), FPSCR_nzcvqc(?:      @.*|)
> >>   **   ...
> >> -**   lsr     (?:ip|fp|r[0-9]+), #29(?:       @.*|)
> >> -**   ...
> >> -**   and     (?:ip|fp|r[0-9]+), #1(?:        @.*|)
> >> +**   ubfx    (?:ip|fp|r[0-9]+), (?:ip|fp|r[0-9]+), #29, #1(?:        @.*|)
> >>   **   ...
> >>   */
> >>   int32x4_t
> >> @@ -43,9 +41,7 @@ foo (int32x4_t inactive, int32x4_t a, int32x4_t b,
> >> unsigned *carry_out, mve_pred
> >>   **   ...
> >>   **   vmrs    (?:ip|fp|r[0-9]+), FPSCR_nzcvqc(?:      @.*|)
> >>   **   ...
> >> -**   lsr     (?:ip|fp|r[0-9]+), #29(?:       @.*|)
> >> -**   ...
> >> -**   and     (?:ip|fp|r[0-9]+), #1(?:        @.*|)
> >> +**   ubfx    (?:ip|fp|r[0-9]+), (?:ip|fp|r[0-9]+), #29, #1(?:        @.*|)
> >>   **   ...
> >>   */
> >>   int32x4_t
> >> diff --git a/gcc/testsuite/gcc.target/arm/mve/intrinsics/vsbciq_m_u32.c
> >> b/gcc/testsuite/gcc.target/arm/mve/intrinsics/vsbciq_m_u32.c
> >> index 9306f152cde..08f67f665c1 100644
> >> --- a/gcc/testsuite/gcc.target/arm/mve/intrinsics/vsbciq_m_u32.c
> >> +++ b/gcc/testsuite/gcc.target/arm/mve/intrinsics/vsbciq_m_u32.c
> >> @@ -20,9 +20,7 @@ extern "C" {
> >>   **   ...
> >>   **   vmrs    (?:ip|fp|r[0-9]+), FPSCR_nzcvqc(?:      @.*|)
> >>   **   ...
> >> -**   lsr     (?:ip|fp|r[0-9]+), #29(?:       @.*|)
> >> -**   ...
> >> -**   and     (?:ip|fp|r[0-9]+), #1(?:        @.*|)
> >> +**   ubfx    (?:ip|fp|r[0-9]+), (?:ip|fp|r[0-9]+), #29, #1(?:        @.*|)
> >>   **   ...
> >>   */
> >>   uint32x4_t
> >> @@ -43,9 +41,7 @@ foo (uint32x4_t inactive, uint32x4_t a, uint32x4_t b,
> >> unsigned *carry_out, mve_p
> >>   **   ...
> >>   **   vmrs    (?:ip|fp|r[0-9]+), FPSCR_nzcvqc(?:      @.*|)
> >>   **   ...
> >> -**   lsr     (?:ip|fp|r[0-9]+), #29(?:       @.*|)
> >> -**   ...
> >> -**   and     (?:ip|fp|r[0-9]+), #1(?:        @.*|)
> >> +**   ubfx    (?:ip|fp|r[0-9]+), (?:ip|fp|r[0-9]+), #29, #1(?:        @.*|)
> >>   **   ...
> >>   */
> >>   uint32x4_t
> >> diff --git a/gcc/testsuite/gcc.target/arm/mve/intrinsics/vsbciq_s32.c
> >> b/gcc/testsuite/gcc.target/arm/mve/intrinsics/vsbciq_s32.c
> >> index 0b5040f0b2a..803246c3235 100644
> >> --- a/gcc/testsuite/gcc.target/arm/mve/intrinsics/vsbciq_s32.c
> >> +++ b/gcc/testsuite/gcc.target/arm/mve/intrinsics/vsbciq_s32.c
> >> @@ -16,9 +16,7 @@ extern "C" {
> >>   **   ...
> >>   **   vmrs    (?:ip|fp|r[0-9]+), FPSCR_nzcvqc(?:      @.*|)
> >>   **   ...
> >> -**   lsr     (?:ip|fp|r[0-9]+), #29(?:       @.*|)
> >> -**   ...
> >> -**   and     (?:ip|fp|r[0-9]+), #1(?:        @.*|)
> >> +**   ubfx    (?:ip|fp|r[0-9]+), (?:ip|fp|r[0-9]+), #29, #1(?:        @.*|)
> >>   **   ...
> >>   */
> >>   int32x4_t
> >> @@ -35,9 +33,7 @@ foo (int32x4_t a, int32x4_t b, unsigned *carry_out)
> >>   **   ...
> >>   **   vmrs    (?:ip|fp|r[0-9]+), FPSCR_nzcvqc(?:      @.*|)
> >>   **   ...
> >> -**   lsr     (?:ip|fp|r[0-9]+), #29(?:       @.*|)
> >> -**   ...
> >> -**   and     (?:ip|fp|r[0-9]+), #1(?:        @.*|)
> >> +**   ubfx    (?:ip|fp|r[0-9]+), (?:ip|fp|r[0-9]+), #29, #1(?:        @.*|)
> >>   **   ...
> >>   */
> >>   int32x4_t
> >> diff --git a/gcc/testsuite/gcc.target/arm/mve/intrinsics/vsbciq_u32.c
> >> b/gcc/testsuite/gcc.target/arm/mve/intrinsics/vsbciq_u32.c
> >> index df211a64daa..22d2b4355bc 100644
> >> --- a/gcc/testsuite/gcc.target/arm/mve/intrinsics/vsbciq_u32.c
> >> +++ b/gcc/testsuite/gcc.target/arm/mve/intrinsics/vsbciq_u32.c
> >> @@ -16,9 +16,7 @@ extern "C" {
> >>   **   ...
> >>   **   vmrs    (?:ip|fp|r[0-9]+), FPSCR_nzcvqc(?:      @.*|)
> >>   **   ...
> >> -**   lsr     (?:ip|fp|r[0-9]+), #29(?:       @.*|)
> >> -**   ...
> >> -**   and     (?:ip|fp|r[0-9]+), #1(?:        @.*|)
> >> +**   ubfx    (?:ip|fp|r[0-9]+), (?:ip|fp|r[0-9]+), #29, #1(?:        @.*|)
> >>   **   ...
> >>   */
> >>   uint32x4_t
> >> @@ -35,9 +33,7 @@ foo (uint32x4_t a, uint32x4_t b, unsigned *carry_out)
> >>   **   ...
> >>   **   vmrs    (?:ip|fp|r[0-9]+), FPSCR_nzcvqc(?:      @.*|)
> >>   **   ...
> >> -**   lsr     (?:ip|fp|r[0-9]+), #29(?:       @.*|)
> >> -**   ...
> >> -**   and     (?:ip|fp|r[0-9]+), #1(?:        @.*|)
> >> +**   ubfx    (?:ip|fp|r[0-9]+), (?:ip|fp|r[0-9]+), #29, #1(?:        @.*|)
> >>   **   ...
> >>   */
> >>   uint32x4_t
> >> diff --git a/gcc/testsuite/gcc.target/arm/mve/intrinsics/vsbcq_m_s32.c
> >> b/gcc/testsuite/gcc.target/arm/mve/intrinsics/vsbcq_m_s32.c
> >> index 217cfa7ac21..7a332610c69 100644
> >> --- a/gcc/testsuite/gcc.target/arm/mve/intrinsics/vsbcq_m_s32.c
> >> +++ b/gcc/testsuite/gcc.target/arm/mve/intrinsics/vsbcq_m_s32.c
> >> @@ -26,9 +26,7 @@ extern "C" {
> >>   **   ...
> >>   **   vmrs    (?:ip|fp|r[0-9]+), FPSCR_nzcvqc(?:      @.*|)
> >>   **   ...
> >> -**   lsr     (?:ip|fp|r[0-9]+), #29(?:       @.*|)
> >> -**   ...
> >> -**   and     (?:ip|fp|r[0-9]+), #1(?:        @.*|)
> >> +**   ubfx    (?:ip|fp|r[0-9]+), (?:ip|fp|r[0-9]+), #29, #1(?:        @.*|)
> >>   **   ...
> >>   */
> >>   int32x4_t
> >> @@ -55,9 +53,7 @@ foo (int32x4_t inactive, int32x4_t a, int32x4_t b,
> >> unsigned *carry, mve_pred16_t
> >>   **   ...
> >>   **   vmrs    (?:ip|fp|r[0-9]+), FPSCR_nzcvqc(?:      @.*|)
> >>   **   ...
> >> -**   lsr     (?:ip|fp|r[0-9]+), #29(?:       @.*|)
> >> -**   ...
> >> -**   and     (?:ip|fp|r[0-9]+), #1(?:        @.*|)
> >> +**   ubfx    (?:ip|fp|r[0-9]+), (?:ip|fp|r[0-9]+), #29, #1(?:        @.*|)
> >>   **   ...
> >>   */
> >>   int32x4_t
> >> diff --git a/gcc/testsuite/gcc.target/arm/mve/intrinsics/vsbcq_m_u32.c
> >> b/gcc/testsuite/gcc.target/arm/mve/intrinsics/vsbcq_m_u32.c
> >> index dad04d05d68..60902196502 100644
> >> --- a/gcc/testsuite/gcc.target/arm/mve/intrinsics/vsbcq_m_u32.c
> >> +++ b/gcc/testsuite/gcc.target/arm/mve/intrinsics/vsbcq_m_u32.c
> >> @@ -26,9 +26,7 @@ extern "C" {
> >>   **   ...
> >>   **   vmrs    (?:ip|fp|r[0-9]+), FPSCR_nzcvqc(?:      @.*|)
> >>   **   ...
> >> -**   lsr     (?:ip|fp|r[0-9]+), #29(?:       @.*|)
> >> -**   ...
> >> -**   and     (?:ip|fp|r[0-9]+), #1(?:        @.*|)
> >> +**   ubfx    (?:ip|fp|r[0-9]+), (?:ip|fp|r[0-9]+), #29, #1(?:        @.*|)
> >>   **   ...
> >>   */
> >>   uint32x4_t
> >> @@ -55,9 +53,7 @@ foo (uint32x4_t inactive, uint32x4_t a, uint32x4_t b,
> >> unsigned *carry, mve_pred1
> >>   **   ...
> >>   **   vmrs    (?:ip|fp|r[0-9]+), FPSCR_nzcvqc(?:      @.*|)
> >>   **   ...
> >> -**   lsr     (?:ip|fp|r[0-9]+), #29(?:       @.*|)
> >> -**   ...
> >> -**   and     (?:ip|fp|r[0-9]+), #1(?:        @.*|)
> >> +**   ubfx    (?:ip|fp|r[0-9]+), (?:ip|fp|r[0-9]+), #29, #1(?:        @.*|)
> >>   **   ...
> >>   */
> >>   uint32x4_t
> >> diff --git a/gcc/testsuite/gcc.target/arm/mve/intrinsics/vsbcq_s32.c
> >> b/gcc/testsuite/gcc.target/arm/mve/intrinsics/vsbcq_s32.c
> >> index cd033640bcc..523fa32ee0d 100644
> >> --- a/gcc/testsuite/gcc.target/arm/mve/intrinsics/vsbcq_s32.c
> >> +++ b/gcc/testsuite/gcc.target/arm/mve/intrinsics/vsbcq_s32.c
> >> @@ -22,9 +22,7 @@ extern "C" {
> >>   **   ...
> >>   **   vmrs    (?:ip|fp|r[0-9]+), FPSCR_nzcvqc(?:      @.*|)
> >>   **   ...
> >> -**   lsr     (?:ip|fp|r[0-9]+), #29(?:       @.*|)
> >> -**   ...
> >> -**   and     (?:ip|fp|r[0-9]+), #1(?:        @.*|)
> >> +**   ubfx    (?:ip|fp|r[0-9]+), (?:ip|fp|r[0-9]+), #29, #1(?:        @.*|)
> >>   **   ...
> >>   */
> >>   int32x4_t
> >> @@ -47,9 +45,7 @@ foo (int32x4_t a, int32x4_t b, unsigned *carry)
> >>   **   ...
> >>   **   vmrs    (?:ip|fp|r[0-9]+), FPSCR_nzcvqc(?:      @.*|)
> >>   **   ...
> >> -**   lsr     (?:ip|fp|r[0-9]+), #29(?:       @.*|)
> >> -**   ...
> >> -**   and     (?:ip|fp|r[0-9]+), #1(?:        @.*|)
> >> +**   ubfx    (?:ip|fp|r[0-9]+), (?:ip|fp|r[0-9]+), #29, #1(?:        @.*|)
> >>   **   ...
> >>   */
> >>   int32x4_t
> >> diff --git a/gcc/testsuite/gcc.target/arm/mve/intrinsics/vsbcq_u32.c
> >> b/gcc/testsuite/gcc.target/arm/mve/intrinsics/vsbcq_u32.c
> >> index 6ca0c753b5e..ff720fd2df5 100644
> >> --- a/gcc/testsuite/gcc.target/arm/mve/intrinsics/vsbcq_u32.c
> >> +++ b/gcc/testsuite/gcc.target/arm/mve/intrinsics/vsbcq_u32.c
> >> @@ -22,9 +22,7 @@ extern "C" {
> >>   **   ...
> >>   **   vmrs    (?:ip|fp|r[0-9]+), FPSCR_nzcvqc(?:      @.*|)
> >>   **   ...
> >> -**   lsr     (?:ip|fp|r[0-9]+), #29(?:       @.*|)
> >> -**   ...
> >> -**   and     (?:ip|fp|r[0-9]+), #1(?:        @.*|)
> >> +**   ubfx    (?:ip|fp|r[0-9]+), (?:ip|fp|r[0-9]+), #29, #1(?:        @.*|)
> >>   **   ...
> >>   */
> >>   uint32x4_t
> >> @@ -47,9 +45,7 @@ foo (uint32x4_t a, uint32x4_t b, unsigned *carry)
> >>   **   ...
> >>   **   vmrs    (?:ip|fp|r[0-9]+), FPSCR_nzcvqc(?:      @.*|)
> >>   **   ...
> >> -**   lsr     (?:ip|fp|r[0-9]+), #29(?:       @.*|)
> >> -**   ...
> >> -**   and     (?:ip|fp|r[0-9]+), #1(?:        @.*|)
> >> +**   ubfx    (?:ip|fp|r[0-9]+), (?:ip|fp|r[0-9]+), #29, #1(?:        @.*|)
> >>   **   ...
> >>   */
> >>   uint32x4_t
> >> --
> >> 2.25.1
