public inbox for gcc-bugs@sourceware.org
help / color / mirror / Atom feed
* [Bug tree-optimization/31667] New: Integer externsions aren't vectorized
@ 2007-04-23 14:27 hjl at lucon dot org
2010-03-13 1:36 ` [Bug target/31667] Integer externsions vectorization could be improved pinskia at gcc dot gnu dot org
0 siblings, 1 reply; 2+ messages in thread
From: hjl at lucon dot org @ 2007-04-23 14:27 UTC (permalink / raw)
To: gcc-bugs
SSE4.1 has pmovzx and pmovsx. For code like:
[hjl@gnu-2 vect]$ cat pmovzxbw.c
typedef unsigned char vec_t;
typedef unsigned short vecx_t;
extern __attribute__((aligned(16))) vec_t x [64];
extern __attribute__((aligned(16))) vecx_t y [64];
void
foo ()
{
int i;
for (i = 0; i < 64; i++)
y [i] = x [i];
}
Icc generates
pmovzxbw x(%rip), %xmm0 #13.14
pmovzxbw 8+x(%rip), %xmm1 #13.14
pmovzxbw 16+x(%rip), %xmm2 #13.14
pmovzxbw 24+x(%rip), %xmm3 #13.14
pmovzxbw 32+x(%rip), %xmm4 #13.14
pmovzxbw 40+x(%rip), %xmm5 #13.14
pmovzxbw 48+x(%rip), %xmm6 #13.14
pmovzxbw 56+x(%rip), %xmm7 #13.14
movdqa %xmm0, y(%rip) #13.5
movdqa %xmm1, 16+y(%rip) #13.5
movdqa %xmm2, 32+y(%rip) #13.5
movdqa %xmm3, 48+y(%rip) #13.5
movdqa %xmm4, 64+y(%rip) #13.5
movdqa %xmm5, 80+y(%rip) #13.5
movdqa %xmm6, 96+y(%rip) #13.5
movdqa %xmm7, 112+y(%rip) #13.5
ret #14.1
--
Summary: Integer externsions aren't vectorized
Product: gcc
Version: 4.3.0
Status: UNCONFIRMED
Severity: enhancement
Priority: P3
Component: tree-optimization
AssignedTo: unassigned at gcc dot gnu dot org
ReportedBy: hjl at lucon dot org
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=31667
^ permalink raw reply [flat|nested] 2+ messages in thread
* [Bug target/31667] Integer externsions vectorization could be improved
2007-04-23 14:27 [Bug tree-optimization/31667] New: Integer externsions aren't vectorized hjl at lucon dot org
@ 2010-03-13 1:36 ` pinskia at gcc dot gnu dot org
0 siblings, 0 replies; 2+ messages in thread
From: pinskia at gcc dot gnu dot org @ 2010-03-13 1:36 UTC (permalink / raw)
To: gcc-bugs
------- Comment #1 from pinskia at gcc dot gnu dot org 2010-03-13 01:35 -------
GCC 4.5 is able to produce pmovzxbw via sse4_1_zero_extendv8qiv8hi2 but it does
not accept a memory operand for operand 1.
movdqa x(%rip), %xmm0
pmovzxbw %xmm0, %xmm1
psrldq $8, %xmm0
pmovzxbw %xmm0, %xmm0
movdqa %xmm1, y(%rip)
movdqa %xmm0, y+16(%rip)
...
Is what GCC currently produces.
--
pinskia at gcc dot gnu dot org changed:
What |Removed |Added
----------------------------------------------------------------------------
Status|UNCONFIRMED |NEW
Component|tree-optimization |target
Ever Confirmed|0 |1
GCC target triplet| |i?86-*-* x86_64-*-*
Keywords| |missed-optimization
Last reconfirmed|0000-00-00 00:00:00 |2010-03-13 01:35:56
date| |
Summary|Integer externsions aren't |Integer externsions
|vectorized |vectorization could be
| |improved
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=31667
^ permalink raw reply [flat|nested] 2+ messages in thread
end of thread, other threads:[~2010-03-13 1:36 UTC | newest]
Thread overview: 2+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2007-04-23 14:27 [Bug tree-optimization/31667] New: Integer externsions aren't vectorized hjl at lucon dot org
2010-03-13 1:36 ` [Bug target/31667] Integer externsions vectorization could be improved pinskia at gcc dot gnu dot org
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).