public inbox for gcc-bugs@sourceware.org
help / color / mirror / Atom feed
* [Bug target/70321] [8/9/10 Regression] STV generates less optimized code
       [not found] <bug-70321-4@http.gcc.gnu.org/bugzilla/>
@ 2020-03-12 11:58 ` jakub at gcc dot gnu.org
  2021-06-01  8:07 ` [Bug target/70321] [9/10/11/12 " rguenth at gcc dot gnu.org
                   ` (5 subsequent siblings)
  6 siblings, 0 replies; 7+ messages in thread
From: jakub at gcc dot gnu.org @ 2020-03-12 11:58 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=70321

Jakub Jelinek <jakub at gcc dot gnu.org> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
   Target Milestone|9.3                         |9.4

--- Comment #21 from Jakub Jelinek <jakub at gcc dot gnu.org> ---
GCC 9.3.0 has been released, adjusting target milestone.

^ permalink raw reply	[flat|nested] 7+ messages in thread

* [Bug target/70321] [9/10/11/12 Regression] STV generates less optimized code
       [not found] <bug-70321-4@http.gcc.gnu.org/bugzilla/>
  2020-03-12 11:58 ` [Bug target/70321] [8/9/10 Regression] STV generates less optimized code jakub at gcc dot gnu.org
@ 2021-06-01  8:07 ` rguenth at gcc dot gnu.org
  2022-04-19 13:09 ` roger at nextmovesoftware dot com
                   ` (4 subsequent siblings)
  6 siblings, 0 replies; 7+ messages in thread
From: rguenth at gcc dot gnu.org @ 2021-06-01  8:07 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=70321

Richard Biener <rguenth at gcc dot gnu.org> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
   Target Milestone|9.4                         |9.5

--- Comment #22 from Richard Biener <rguenth at gcc dot gnu.org> ---
GCC 9.4 is being released, retargeting bugs to GCC 9.5.

^ permalink raw reply	[flat|nested] 7+ messages in thread

* [Bug target/70321] [9/10/11/12 Regression] STV generates less optimized code
       [not found] <bug-70321-4@http.gcc.gnu.org/bugzilla/>
  2020-03-12 11:58 ` [Bug target/70321] [8/9/10 Regression] STV generates less optimized code jakub at gcc dot gnu.org
  2021-06-01  8:07 ` [Bug target/70321] [9/10/11/12 " rguenth at gcc dot gnu.org
@ 2022-04-19 13:09 ` roger at nextmovesoftware dot com
  2022-05-27  9:36 ` [Bug target/70321] [10/11/12/13 " rguenth at gcc dot gnu.org
                   ` (3 subsequent siblings)
  6 siblings, 0 replies; 7+ messages in thread
From: roger at nextmovesoftware dot com @ 2022-04-19 13:09 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=70321

Roger Sayle <roger at nextmovesoftware dot com> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |roger at nextmovesoftware dot com

--- Comment #23 from Roger Sayle <roger at nextmovesoftware dot com> ---
Patch proposed at
https://gcc.gnu.org/pipermail/gcc-patches/2022-April/593174.html
(waiting for GCC 13 stage 1).

^ permalink raw reply	[flat|nested] 7+ messages in thread

* [Bug target/70321] [10/11/12/13 Regression] STV generates less optimized code
       [not found] <bug-70321-4@http.gcc.gnu.org/bugzilla/>
                   ` (2 preceding siblings ...)
  2022-04-19 13:09 ` roger at nextmovesoftware dot com
@ 2022-05-27  9:36 ` rguenth at gcc dot gnu.org
  2022-05-30 20:21 ` cvs-commit at gcc dot gnu.org
                   ` (2 subsequent siblings)
  6 siblings, 0 replies; 7+ messages in thread
From: rguenth at gcc dot gnu.org @ 2022-05-27  9:36 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=70321

Richard Biener <rguenth at gcc dot gnu.org> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
   Target Milestone|9.5                         |10.4

--- Comment #24 from Richard Biener <rguenth at gcc dot gnu.org> ---
GCC 9 branch is being closed

^ permalink raw reply	[flat|nested] 7+ messages in thread

* [Bug target/70321] [10/11/12/13 Regression] STV generates less optimized code
       [not found] <bug-70321-4@http.gcc.gnu.org/bugzilla/>
                   ` (3 preceding siblings ...)
  2022-05-27  9:36 ` [Bug target/70321] [10/11/12/13 " rguenth at gcc dot gnu.org
@ 2022-05-30 20:21 ` cvs-commit at gcc dot gnu.org
  2022-06-04  9:12 ` roger at nextmovesoftware dot com
  2024-02-01 11:49 ` cvs-commit at gcc dot gnu.org
  6 siblings, 0 replies; 7+ messages in thread
From: cvs-commit at gcc dot gnu.org @ 2022-05-30 20:21 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=70321

--- Comment #25 from CVS Commits <cvs-commit at gcc dot gnu.org> ---
The master branch has been updated by Roger Sayle <sayle@gcc.gnu.org>:

https://gcc.gnu.org/g:43201f2c2173894bf7c423cad6da1c21567e06c0

commit r13-855-g43201f2c2173894bf7c423cad6da1c21567e06c0
Author: Roger Sayle <roger@nextmovesoftware.com>
Date:   Mon May 30 21:20:09 2022 +0100

    PR target/70321: Split double word equality/inequality after STV on x86.

    This patch resolves the last piece of PR target/70321 a code quality
    (P2 regression) affecting mainline.  Currently, for HJ's testcase:

    void foo (long long ixi)
    {
      if (ixi != 14348907)
        __builtin_abort ();
    }

    GCC with -m32 -O2 generates four instructions for the comparison:

            movl    16(%esp), %eax
            movl    20(%esp), %edx
            xorl    $14348907, %eax
            orl     %eax, %edx

    but with this patch it now requires only three, making better use of
    x86's addressing modes:

            movl    16(%esp), %eax
            xorl    $14348907, %eax
            orl     20(%esp), %eax

    The solution is to expand "doubleword" equality/inequality expressions
    using flag setting COMPARE instructions for the early RTL passes, and
    then split them during split1, after STV and before reload.
    Hence on x86_64, we now see/allow things like:

    (insn 11 8 12 2 (set (reg:CCZ 17 flags)
            (compare:CCZ (reg/v:TI 84 [ x ])
                (reg:TI 96))) "cmpti.c":2:43 30 {*cmpti_doubleword}

    This allows the STV pass to decide whether it's preferrable to perform
    this comparison using vector operations, i.e. a pxor/ptest sequence,
    or as scalar integer operations, i.e. a xor/xor/or sequence.  Alas
    this required tweaking of the STV pass to recognize the "new" form of
    these comparisons and split out the pxor operation itself.  To confirm
    this still works as expected I've added a new STV test case:

    long long a[1024];
    long long b[1024];

    int foo()
    {
      for (int i=0; i<1024; i++)
      {
        long long t = (a[i]<<8) | (b[i]<<24);
        if (t == 0)
          return 1;
      }
      return 0;
    }

    where with -m32 -O2 -msse4.1 the above comparison with zero should look
    like:

            punpcklqdq      %xmm0, %xmm0
            ptest   %xmm0, %xmm0

    Although this patch includes one or two minor tweaks to provide all the
    necessary infrastructure to support conversion of TImode comparisons to
    V1TImode (and SImode comparisons to V4SImode), STV doesn't yet implement
    these transformations, but this is something that can be considered after
    stage 4.  Indeed the new convert_compare functionality is split out
    into a method to simplify its potential reuse by the timode_scalar_chain
    class.

    2022-05-30  Roger Sayle  <roger@nextmovesoftware.com>

    gcc/ChangeLog
            PR target/70321
            * config/i386/i386-expand.cc (ix86_expand_branch): Don't decompose
            DI mode equality/inequality using XOR here.  Instead generate a
            COMPARE for doubleword modes (DImode on !TARGET_64BIT or TImode).
            * config/i386/i386-features.cc (gen_gpr_to_xmm_move_src): Use
            gen_rtx_SUBREG when NUNITS is 1, i.e. for TImode to V1TImode.
            (general_scalar_chain::convert_compare): New function to convert
            scalar equality/inequality comparison into vector operations.
            (general_scalar_chain::convert_insn) [COMPARE]: Refactor. Call
            new convert_compare helper method.
            (convertible_comparion_p): Update to match doubleword COMPARE
            of two register, memory or integer constant operands.
            * config/i386/i386-features.h
(general_scalar_chain::convert_compare):
            Prototype/declare member function here.
            * config/i386/i386.md (cstore<mode>4): Change mode to SDWIM, but
            only allow new doubleword modes for EQ and NE operators.
            (*cmp<dwi>_doubleword): New define_insn_and_split, to split a
            doubleword comparison into a pair of XORs followed by an IOR to
            set the (zero) flags register, optimizing the XORs if possible.
            * config/i386/sse.md (V_AVX): Include V1TI and V2TI in mode
            iterator; V_AVX is (currently) only used by ptest.
            (sse4_1 mode attribute): Update to support V1TI and V2TI.

    gcc/testsuite/ChangeLog
            PR target/70321
            * gcc.target/i386/pr70321.c: New test case.
            * gcc.target/i386/sse4_1-stv-1.c: New test case.

^ permalink raw reply	[flat|nested] 7+ messages in thread

* [Bug target/70321] [10/11/12/13 Regression] STV generates less optimized code
       [not found] <bug-70321-4@http.gcc.gnu.org/bugzilla/>
                   ` (4 preceding siblings ...)
  2022-05-30 20:21 ` cvs-commit at gcc dot gnu.org
@ 2022-06-04  9:12 ` roger at nextmovesoftware dot com
  2024-02-01 11:49 ` cvs-commit at gcc dot gnu.org
  6 siblings, 0 replies; 7+ messages in thread
From: roger at nextmovesoftware dot com @ 2022-06-04  9:12 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=70321

Roger Sayle <roger at nextmovesoftware dot com> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
         Resolution|---                         |FIXED
             Status|NEW                         |RESOLVED
   Target Milestone|10.4                        |13.0

--- Comment #26 from Roger Sayle <roger at nextmovesoftware dot com> ---
This should now be fixed on mainline.

^ permalink raw reply	[flat|nested] 7+ messages in thread

* [Bug target/70321] [10/11/12/13 Regression] STV generates less optimized code
       [not found] <bug-70321-4@http.gcc.gnu.org/bugzilla/>
                   ` (5 preceding siblings ...)
  2022-06-04  9:12 ` roger at nextmovesoftware dot com
@ 2024-02-01 11:49 ` cvs-commit at gcc dot gnu.org
  6 siblings, 0 replies; 7+ messages in thread
From: cvs-commit at gcc dot gnu.org @ 2024-02-01 11:49 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=70321

--- Comment #27 from GCC Commits <cvs-commit at gcc dot gnu.org> ---
The master branch has been updated by Rainer Orth <ro@gcc.gnu.org>:

https://gcc.gnu.org/g:ee7011eab67d8272f88e954bbab2e42bf6353441

commit r14-8690-gee7011eab67d8272f88e954bbab2e42bf6353441
Author: Rainer Orth <ro@CeBiTec.Uni-Bielefeld.DE>
Date:   Thu Feb 1 12:48:57 2024 +0100

    testsuite: i386: Fix gcc.target/i386/pr70321.c on 32-bit Solaris/x86

    gcc.target/i386/pr70321.c FAILs on 32-bit Solaris/x86 since its
    introduction in

    commit 43201f2c2173894bf7c423cad6da1c21567e06c0
    Author: Roger Sayle <roger@nextmovesoftware.com>
    Date:   Mon May 30 21:20:09 2022 +0100

        PR target/70321: Split double word equality/inequality after STV on
x86.

    FAIL: gcc.target/i386/pr70321.c scan-assembler-times mov 1

    The failure happens because 32-bit Solaris/x86 defaults to
    -fno-omit-frame-pointer.

    Fixed by specifying -fomit-frame-pointer explicitly.

    Tested on i386-pc-solaris2.11 and i686-pc-linux-gnu.

    2024-01-23  Rainer Orth  <ro@CeBiTec.Uni-Bielefeld.DE>

            gcc/testsuite:
            * gcc.target/i386/pr70321.c: Add -fomit-frame-pointer to
            dg-options.

^ permalink raw reply	[flat|nested] 7+ messages in thread

end of thread, other threads:[~2024-02-01 11:49 UTC | newest]

Thread overview: 7+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
     [not found] <bug-70321-4@http.gcc.gnu.org/bugzilla/>
2020-03-12 11:58 ` [Bug target/70321] [8/9/10 Regression] STV generates less optimized code jakub at gcc dot gnu.org
2021-06-01  8:07 ` [Bug target/70321] [9/10/11/12 " rguenth at gcc dot gnu.org
2022-04-19 13:09 ` roger at nextmovesoftware dot com
2022-05-27  9:36 ` [Bug target/70321] [10/11/12/13 " rguenth at gcc dot gnu.org
2022-05-30 20:21 ` cvs-commit at gcc dot gnu.org
2022-06-04  9:12 ` roger at nextmovesoftware dot com
2024-02-01 11:49 ` cvs-commit at gcc dot gnu.org

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).