public inbox for gcc-bugs@sourceware.org
help / color / mirror / Atom feed
* [Bug target/43118]  New: vld4 and vst4 intrinsics are not handled correctly
@ 2010-02-19 10:52 samuel dot rodal at nokia dot com
  2010-02-19 11:08 ` [Bug target/43118] " rguenth at gcc dot gnu dot org
                   ` (6 more replies)
  0 siblings, 7 replies; 10+ messages in thread
From: samuel dot rodal at nokia dot com @ 2010-02-19 10:52 UTC (permalink / raw)
  To: gcc-bugs

The vldX and vstX variation of NEON intrinsics, where X > 1, seem to cause the
compiler to generate an obscene amount of code.

Example:

void blend1(uint8_t *src, uint8_t *dst)
{
        uint8x8_t temp = vld1_u8(src);
        vst1_u8(dst, temp);
}

generates the sensible

        vld1.8  {d16}, [r0]
        vst1.8  {d16}, [r1]
        bx      lr

Whereas:

void blend4(uint8_t *src, uint8_t *dst)
{
        uint8x8x4_t temp = vld4_u8(src);
        vst4_u8(dst, temp);
}

generates

        stmfd   sp!, {r4, r5, r6}
        .save {r4, r5, r6}
.LCFI4:
        .pad #132
        sub     sp, sp, #132
.LCFI5:
        vld4.8  {d16-d19}, [r0]
        add     r6, sp, #64
        vstmia  r6, {d16-d19}
        mov     r5, r1
        ldmia   r6!, {r0, r1, r2, r3}
        add     ip, sp, #96
        mov     r4, ip
        stmia   r4!, {r0, r1, r2, r3}
        ldmia   r6, {r0, r1, r2, r3}
        stmia   r4, {r0, r1, r2, r3}
        ldmia   ip!, {r0, r1, r2, r3}
        add     ip, sp, #32
        mov     r6, ip
        stmia   r6!, {r0, r1, r2, r3}
        ldmia   r4, {r0, r1, r2, r3}
        stmia   r6, {r0, r1, r2, r3}
        ldmia   ip!, {r0, r1, r2, r3}
        mov     r4, sp
        stmia   r4!, {r0, r1, r2, r3}
        ldmia   r6, {r0, r1, r2, r3}
        stmia   r4, {r0, r1, r2, r3}
        vldmia  sp, {d16-d19}
        vst4.8  {d16-d19}, [r5]
        add     sp, sp, #132
        ldmfd   sp!, {r4, r5, r6}
        bx      lr

Compile flags used were "-mfloat-abi=softfp -mfpu=neon -O3".


-- 
           Summary: vld4 and vst4 intrinsics are not handled correctly
           Product: gcc
           Version: 4.4.1
            Status: UNCONFIRMED
          Severity: normal
          Priority: P3
         Component: target
        AssignedTo: unassigned at gcc dot gnu dot org
        ReportedBy: samuel dot rodal at nokia dot com
 GCC build triplet: arm-none-linux-gnueabi-gcc
  GCC host triplet: arm-none-linux-gnueabi-gcc
GCC target triplet: arm-none-linux-gnueabi-gcc


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=43118


^ permalink raw reply	[flat|nested] 10+ messages in thread
[parent not found: <bug-43118-4@http.gcc.gnu.org/bugzilla/>]

end of thread, other threads:[~2011-07-19  8:42 UTC | newest]

Thread overview: 10+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2010-02-19 10:52 [Bug target/43118] New: vld4 and vst4 intrinsics are not handled correctly samuel dot rodal at nokia dot com
2010-02-19 11:08 ` [Bug target/43118] " rguenth at gcc dot gnu dot org
2010-02-19 13:46 ` ramana at gcc dot gnu dot org
2010-02-22 21:14 ` drow at false dot org
2010-02-23 10:42 ` rguenth at gcc dot gnu dot org
2010-03-20 14:53 ` ramana at gcc dot gnu dot org
2010-04-28 21:56 ` justin dot lebar+bug at gmail dot com
2010-09-15 20:54 ` generalruzzmo at gmail dot com
     [not found] <bug-43118-4@http.gcc.gnu.org/bugzilla/>
2011-07-08 11:58 ` ramana at gcc dot gnu.org
2011-07-19  8:42 ` rsandifo at gcc dot gnu.org

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).