public inbox for gcc-bugs@sourceware.org
help / color / mirror / Atom feed
* [Bug tree-optimization/31667]  New: Integer externsions aren't vectorized
@ 2007-04-23 14:27 hjl at lucon dot org
  2010-03-13  1:36 ` [Bug target/31667] Integer externsions vectorization could be improved pinskia at gcc dot gnu dot org
  0 siblings, 1 reply; 2+ messages in thread
From: hjl at lucon dot org @ 2007-04-23 14:27 UTC (permalink / raw)
  To: gcc-bugs

SSE4.1 has pmovzx and pmovsx. For code like:

[hjl@gnu-2 vect]$ cat pmovzxbw.c
typedef unsigned char vec_t;
typedef unsigned short vecx_t;

extern __attribute__((aligned(16))) vec_t x [64];
extern __attribute__((aligned(16))) vecx_t y [64];

void
foo ()
{
  int i;

  for (i = 0; i < 64; i++)
    y [i]  = x [i];
}

Icc generates

        pmovzxbw  x(%rip), %xmm0                                #13.14
        pmovzxbw  8+x(%rip), %xmm1                              #13.14
        pmovzxbw  16+x(%rip), %xmm2                             #13.14
        pmovzxbw  24+x(%rip), %xmm3                             #13.14
        pmovzxbw  32+x(%rip), %xmm4                             #13.14
        pmovzxbw  40+x(%rip), %xmm5                             #13.14
        pmovzxbw  48+x(%rip), %xmm6                             #13.14
        pmovzxbw  56+x(%rip), %xmm7                             #13.14
        movdqa    %xmm0, y(%rip)                                #13.5
        movdqa    %xmm1, 16+y(%rip)                             #13.5
        movdqa    %xmm2, 32+y(%rip)                             #13.5
        movdqa    %xmm3, 48+y(%rip)                             #13.5
        movdqa    %xmm4, 64+y(%rip)                             #13.5
        movdqa    %xmm5, 80+y(%rip)                             #13.5
        movdqa    %xmm6, 96+y(%rip)                             #13.5
        movdqa    %xmm7, 112+y(%rip)                            #13.5
        ret                                                     #14.1


-- 
           Summary: Integer externsions aren't vectorized
           Product: gcc
           Version: 4.3.0
            Status: UNCONFIRMED
          Severity: enhancement
          Priority: P3
         Component: tree-optimization
        AssignedTo: unassigned at gcc dot gnu dot org
        ReportedBy: hjl at lucon dot org


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=31667


^ permalink raw reply	[flat|nested] 2+ messages in thread

* [Bug target/31667] Integer externsions vectorization could be improved
  2007-04-23 14:27 [Bug tree-optimization/31667] New: Integer externsions aren't vectorized hjl at lucon dot org
@ 2010-03-13  1:36 ` pinskia at gcc dot gnu dot org
  0 siblings, 0 replies; 2+ messages in thread
From: pinskia at gcc dot gnu dot org @ 2010-03-13  1:36 UTC (permalink / raw)
  To: gcc-bugs



------- Comment #1 from pinskia at gcc dot gnu dot org  2010-03-13 01:35 -------
GCC 4.5 is able to produce pmovzxbw via sse4_1_zero_extendv8qiv8hi2 but it does
not accept a memory operand for operand 1.
        movdqa  x(%rip), %xmm0
        pmovzxbw        %xmm0, %xmm1
        psrldq  $8, %xmm0
        pmovzxbw        %xmm0, %xmm0
        movdqa  %xmm1, y(%rip)
        movdqa  %xmm0, y+16(%rip)
...
Is what GCC currently produces.


-- 

pinskia at gcc dot gnu dot org changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|UNCONFIRMED                 |NEW
          Component|tree-optimization           |target
     Ever Confirmed|0                           |1
 GCC target triplet|                            |i?86-*-* x86_64-*-*
           Keywords|                            |missed-optimization
   Last reconfirmed|0000-00-00 00:00:00         |2010-03-13 01:35:56
               date|                            |
            Summary|Integer externsions aren't  |Integer externsions
                   |vectorized                  |vectorization could be
                   |                            |improved


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=31667


^ permalink raw reply	[flat|nested] 2+ messages in thread

end of thread, other threads:[~2010-03-13  1:36 UTC | newest]

Thread overview: 2+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2007-04-23 14:27 [Bug tree-optimization/31667] New: Integer externsions aren't vectorized hjl at lucon dot org
2010-03-13  1:36 ` [Bug target/31667] Integer externsions vectorization could be improved pinskia at gcc dot gnu dot org

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).