From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (qmail 17622 invoked by alias); 18 Jun 2014 14:25:16 -0000 Mailing-List: contact gcc-bugs-help@gcc.gnu.org; run by ezmlm Precedence: bulk List-Id: List-Archive: List-Post: List-Help: Sender: gcc-bugs-owner@gcc.gnu.org Received: (qmail 17553 invoked by uid 48); 18 Jun 2014 14:25:08 -0000 From: "cbaylis at gcc dot gnu.org" To: gcc-bugs@gcc.gnu.org Subject: [Bug target/61551] New: [NEON] alter costs to allow use of post-indexed addressing modes for VLD{2..4}/VST{2..4} Date: Wed, 18 Jun 2014 14:25:00 -0000 X-Bugzilla-Reason: CC X-Bugzilla-Type: new X-Bugzilla-Watch-Reason: None X-Bugzilla-Product: gcc X-Bugzilla-Component: target X-Bugzilla-Version: 4.10.0 X-Bugzilla-Keywords: X-Bugzilla-Severity: enhancement X-Bugzilla-Who: cbaylis at gcc dot gnu.org X-Bugzilla-Status: UNCONFIRMED X-Bugzilla-Priority: P3 X-Bugzilla-Assigned-To: unassigned at gcc dot gnu.org X-Bugzilla-Target-Milestone: --- X-Bugzilla-Flags: X-Bugzilla-Changed-Fields: bug_id short_desc product version bug_status bug_severity priority component assigned_to reporter cc cf_gcctarget attachments.created Message-ID: Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: 7bit X-Bugzilla-URL: http://gcc.gnu.org/bugzilla/ Auto-Submitted: auto-generated MIME-Version: 1.0 X-SW-Source: 2014-06/txt/msg01572.txt.bz2 https://gcc.gnu.org/bugzilla/show_bug.cgi?id=61551 Bug ID: 61551 Summary: [NEON] alter costs to allow use of post-indexed addressing modes for VLD{2..4}/VST{2..4} Product: gcc Version: 4.10.0 Status: UNCONFIRMED Severity: enhancement Priority: P3 Component: target Assignee: unassigned at gcc dot gnu.org Reporter: cbaylis at gcc dot gnu.org CC: ramana.radhakrishnan at arm dot com Target: arm-unknown-linux-gnueabi Created attachment 32967 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=32967&action=edit test for NEON addressing modes The attached test case demonstrates that GCC does not exploit the post-indexed addressing mode for NEON structure loads and stores: VLDn, VSTn where n>=2. Generated code for VLD1/VST1 (using desired post-indexed addressing) test_ld1: @ args = 0, pretend = 0, frame = 0 @ frame_needed = 0, uses_anonymous_args = 0 @ link register save eliminated. vld1.8 {d16}, [r0], r1 vst1.8 {d16}, [r0], r1 vld1.8 {d16}, [r0], r1 vst1.8 {d16}, [r0], r1 vld1.8 {d16}, [r0], r1 vst1.8 {d16}, [r0] bx lr Generated code for VLD2: test_ld2: @ args = 0, pretend = 0, frame = 0 @ frame_needed = 0, uses_anonymous_args = 0 @ link register save eliminated. adds r3, r0, r1 vld2.8 {d16-d17}, [r0] adds r0, r3, r1 adds r2, r0, r1 vst2.8 {d16-d17}, [r3] adds r3, r2, r1 vld2.8 {d16-d17}, [r0] add r1, r1, r3 vst2.8 {d16-d17}, [r2] vld2.8 {d16-d17}, [r3] vst2.8 {d16-d17}, [r1] bx lr A proof of concept patch is posted at: https://gcc.gnu.org/ml/gcc-patches/2014-06/msg01361.html