From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (qmail 11687 invoked by alias); 18 May 2012 18:25:38 -0000 Received: (qmail 11675 invoked by uid 22791); 18 May 2012 18:25:37 -0000 X-SWARE-Spam-Status: No, hits=-4.2 required=5.0 tests=ALL_TRUSTED,AWL,BAYES_00,KHOP_THREADED,TW_SL,TW_ZJ X-Spam-Check-By: sourceware.org Received: from localhost (HELO gcc.gnu.org) (127.0.0.1) by sourceware.org (qpsmtpd/0.43rc1) with ESMTP; Fri, 18 May 2012 18:25:24 +0000 From: "ubizjak at gmail dot com" To: gcc-bugs@gcc.gnu.org Subject: [Bug tree-optimization/53346] [4.6/4.7/4.8 Regression] Bad vectorization in the proc cptrf2 of rnflow.f90 Date: Fri, 18 May 2012 18:27:00 -0000 X-Bugzilla-Reason: CC X-Bugzilla-Type: changed X-Bugzilla-Watch-Reason: None X-Bugzilla-Product: gcc X-Bugzilla-Component: tree-optimization X-Bugzilla-Keywords: missed-optimization X-Bugzilla-Severity: normal X-Bugzilla-Who: ubizjak at gmail dot com X-Bugzilla-Status: NEW X-Bugzilla-Priority: P3 X-Bugzilla-Assigned-To: rguenth at gcc dot gnu.org X-Bugzilla-Target-Milestone: 4.8.0 X-Bugzilla-Changed-Fields: Message-ID: In-Reply-To: References: X-Bugzilla-URL: http://gcc.gnu.org/bugzilla/ Auto-Submitted: auto-generated Content-Type: text/plain; charset="UTF-8" MIME-Version: 1.0 Mailing-List: contact gcc-bugs-help@gcc.gnu.org; run by ezmlm Precedence: bulk List-Id: List-Archive: List-Post: List-Help: Sender: gcc-bugs-owner@gcc.gnu.org X-SW-Source: 2012-05/txt/msg01858.txt.bz2 http://gcc.gnu.org/bugzilla/show_bug.cgi?id=53346 --- Comment #16 from Uros Bizjak 2012-05-18 18:24:43 UTC --- Perf confirms this findings, the first loop: 0.02 : 401e10: movslq %edx,%rbx 5.04 : 401e13: movss -0x4(%rdi,%rbx,4),%xmm0 24.97 : 401e19: ucomiss (%r9),%xmm0 14.66 : 401e1d: cmova %ecx,%edx 15.37 : 401e20: sub $0x1,%ecx 0.00 : 401e23: sub $0x4,%r9 0.00 : 401e27: cmp %r10d,%ecx 0.00 : 401e2a: jne 401e10 the second: 0.00 : 401e60: movslq %ecx,%r10 1.69 : 401e63: movss -0x4(%rdi,%r10,4),%xmm0 7.78 : 401e6a: ucomiss (%r9),%xmm0 4.75 : 401e6e: cmova %r11d,%ecx 4.52 : 401e72: sub $0x1,%r11d 0.00 : 401e76: sub $0x4,%r9 0.05 : 401e7a: cmp %eax,%r11d 0.00 : 401e7d: jne 401e60 the third: 0.00 : 401ff8: movslq %edx,%r10 0.78 : 401ffb: movss -0x4(%rdi,%r10,4),%xmm0 3.14 : 402002: ucomiss (%r9),%xmm0 2.04 : 402006: cmova %ecx,%edx 1.89 : 402009: sub $0x4,%r9 0.00 : 40200d: sub $0x1,%ecx 0.00 : 402010: jne 401ff8