public inbox for gcc-bugs@sourceware.org
help / color / mirror / Atom feed
* [Bug target/104412] New: [Aarch64] Failure to optimize vector initialization from int64s
@ 2022-02-06 22:31 gabravier at gmail dot com
  2022-02-06 22:34 ` [Bug tree-optimization/104412] " pinskia at gcc dot gnu.org
                   ` (3 more replies)
  0 siblings, 4 replies; 5+ messages in thread
From: gabravier at gmail dot com @ 2022-02-06 22:31 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=104412

            Bug ID: 104412
           Summary: [Aarch64] Failure to optimize vector initialization
                    from int64s
           Product: gcc
           Version: 12.0
            Status: UNCONFIRMED
          Severity: normal
          Priority: P3
         Component: target
          Assignee: unassigned at gcc dot gnu.org
          Reporter: gabravier at gmail dot com
  Target Milestone: ---

#include <stdint.h>

typedef int64_t v2i64 __attribute__((vector_size(16)));

v2i64 _mm_set_epi64x(int64_t i1, int64_t i2)
{
    union {
        int64_t data[2];
        v2i64 v;
    } d = {.data = {i2, i1}};
    return d.v;
}

With -O3, AArch64 GCC outputs this:

_mm_set_epi64x(long, long):
  sub sp, sp, #16
  stp x1, x0, [sp]
  ldr q0, [sp]
  add sp, sp, 16
  ret

LLVM instead outputs this:

_mm_set_epi64x(long, long):
  fmov d0, x1
  mov v0.d[1], x0
  ret

^ permalink raw reply	[flat|nested] 5+ messages in thread

* [Bug tree-optimization/104412] [Aarch64] Failure to optimize vector initialization from int64s
  2022-02-06 22:31 [Bug target/104412] New: [Aarch64] Failure to optimize vector initialization from int64s gabravier at gmail dot com
@ 2022-02-06 22:34 ` pinskia at gcc dot gnu.org
  2022-02-06 22:37 ` [Bug target/104412] union initialization of a vector is not optimized due to vector cost model pinskia at gcc dot gnu.org
                   ` (2 subsequent siblings)
  3 siblings, 0 replies; 5+ messages in thread
From: pinskia at gcc dot gnu.org @ 2022-02-06 22:34 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=104412

Andrew Pinski <pinskia at gcc dot gnu.org> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
           Severity|normal                      |enhancement
          Component|target                      |tree-optimization

^ permalink raw reply	[flat|nested] 5+ messages in thread

* [Bug target/104412] union initialization of a vector is not optimized due to vector cost model
  2022-02-06 22:31 [Bug target/104412] New: [Aarch64] Failure to optimize vector initialization from int64s gabravier at gmail dot com
  2022-02-06 22:34 ` [Bug tree-optimization/104412] " pinskia at gcc dot gnu.org
@ 2022-02-06 22:37 ` pinskia at gcc dot gnu.org
  2022-02-06 22:38 ` pinskia at gcc dot gnu.org
  2022-02-07  9:02 ` rguenth at gcc dot gnu.org
  3 siblings, 0 replies; 5+ messages in thread
From: pinskia at gcc dot gnu.org @ 2022-02-06 22:37 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=104412

Andrew Pinski <pinskia at gcc dot gnu.org> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
          Component|tree-optimization           |target
   Last reconfirmed|                            |2022-02-06
            Summary|[Aarch64] Failure to        |union initialization of a
                   |optimize vector             |vector is not optimized due
                   |initialization from int64s  |to vector cost model
     Ever confirmed|0                           |1
             Status|UNCONFIRMED                 |NEW

^ permalink raw reply	[flat|nested] 5+ messages in thread

* [Bug target/104412] union initialization of a vector is not optimized due to vector cost model
  2022-02-06 22:31 [Bug target/104412] New: [Aarch64] Failure to optimize vector initialization from int64s gabravier at gmail dot com
  2022-02-06 22:34 ` [Bug tree-optimization/104412] " pinskia at gcc dot gnu.org
  2022-02-06 22:37 ` [Bug target/104412] union initialization of a vector is not optimized due to vector cost model pinskia at gcc dot gnu.org
@ 2022-02-06 22:38 ` pinskia at gcc dot gnu.org
  2022-02-07  9:02 ` rguenth at gcc dot gnu.org
  3 siblings, 0 replies; 5+ messages in thread
From: pinskia at gcc dot gnu.org @ 2022-02-06 22:38 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=104412

--- Comment #1 from Andrew Pinski <pinskia at gcc dot gnu.org> ---
Adding -fno-vect-cost-model allows it to be optimized. So a cost model issue
....

^ permalink raw reply	[flat|nested] 5+ messages in thread

* [Bug target/104412] union initialization of a vector is not optimized due to vector cost model
  2022-02-06 22:31 [Bug target/104412] New: [Aarch64] Failure to optimize vector initialization from int64s gabravier at gmail dot com
                   ` (2 preceding siblings ...)
  2022-02-06 22:38 ` pinskia at gcc dot gnu.org
@ 2022-02-07  9:02 ` rguenth at gcc dot gnu.org
  3 siblings, 0 replies; 5+ messages in thread
From: rguenth at gcc dot gnu.org @ 2022-02-07  9:02 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=104412

--- Comment #2 from Richard Biener <rguenth at gcc dot gnu.org> ---
On x86_64 we get

        movq    %rsi, %xmm0
        movq    %rdi, %xmm1
        punpcklqdq      %xmm1, %xmm0
        ret

> ./cc1 -quiet t.c -I include -O2 -fopt-info-vec 
t.c:10:7: optimized: basic block part vectorized using 16 byte vectors

and at -O1

        movq    %rsi, -24(%rsp)
        movq    %rdi, -16(%rsp)
        movdqa  -24(%rsp), %xmm0
        ret

costing is a bit difficult since we get

t.c:10:7: note: Cost model analysis:
i2_4(D) 1 times scalar_store costs 12 in body
i1_6(D) 1 times scalar_store costs 12 in body
i2_4(D) 1 times vector_store costs 12 in body
<unknown> 1 times vec_construct costs 8 in prologue
t.c:10:7: note: Cost model analysis for part in loop 0:
  Vector cost: 20
  Scalar cost: 24

as we do not have an idea how costly the construction is (depends on
calling conventions) or how the return d.v allows us to elide the store
and the load.

^ permalink raw reply	[flat|nested] 5+ messages in thread

end of thread, other threads:[~2022-02-07  9:02 UTC | newest]

Thread overview: 5+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2022-02-06 22:31 [Bug target/104412] New: [Aarch64] Failure to optimize vector initialization from int64s gabravier at gmail dot com
2022-02-06 22:34 ` [Bug tree-optimization/104412] " pinskia at gcc dot gnu.org
2022-02-06 22:37 ` [Bug target/104412] union initialization of a vector is not optimized due to vector cost model pinskia at gcc dot gnu.org
2022-02-06 22:38 ` pinskia at gcc dot gnu.org
2022-02-07  9:02 ` rguenth at gcc dot gnu.org

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).