From mboxrd@z Thu Jan 1 00:00:00 1970
Return-Path:
Received: (qmail 6326 invoked by alias); 8 Feb 2014 16:15:49 -0000
Mailing-List: contact gcc-bugs-help@gcc.gnu.org; run by ezmlm
Precedence: bulk
List-Id:
List-Archive:
List-Post:
List-Help:
Sender: gcc-bugs-owner@gcc.gnu.org
Received: (qmail 6295 invoked by uid 48); 8 Feb 2014 16:15:46 -0000
From: "tprince at computer dot org"
To: gcc-bugs@gcc.gnu.org
Subject: [Bug c/60117] New: simd reduction clause suppresses simd auto-vectorization when -fopenmp is set
Date: Sat, 08 Feb 2014 16:15:00 -0000
X-Bugzilla-Reason: CC
X-Bugzilla-Type: new
X-Bugzilla-Watch-Reason: None
X-Bugzilla-Product: gcc
X-Bugzilla-Component: c
X-Bugzilla-Version: 4.9.0
X-Bugzilla-Keywords:
X-Bugzilla-Severity: normal
X-Bugzilla-Who: tprince at computer dot org
X-Bugzilla-Status: UNCONFIRMED
X-Bugzilla-Priority: P3
X-Bugzilla-Assigned-To: unassigned at gcc dot gnu.org
X-Bugzilla-Target-Milestone: ---
X-Bugzilla-Flags:
X-Bugzilla-Changed-Fields: bug_id short_desc product version bug_status bug_severity priority component assigned_to reporter attachments.created
Message-ID:
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: 7bit
X-Bugzilla-URL: http://gcc.gnu.org/bugzilla/
Auto-Submitted: auto-generated
MIME-Version: 1.0
X-SW-Source: 2014-02/txt/msg00808.txt.bz2

http://gcc.gnu.org/bugzilla/show_bug.cgi?id=60117

            Bug ID: 60117
           Summary: simd reduction clause suppresses simd auto-vectorization when -fopenmp is set
           Product: gcc
           Version: 4.9.0
            Status: UNCONFIRMED
          Severity: normal
          Priority: P3
         Component: c
          Assignee: unassigned at gcc dot gnu.org
          Reporter: tprince at computer dot org

Created attachment 32082
  --> http://gcc.gnu.org/bugzilla/attachment.cgi?id=32082&action=edit
source code reproducer

gcc version 4.9.0 20140203
gcc -O2 -ftree-vectorize -std=c99 -march=core-avx2 -fopt-info -S -fopenmp s314.c
This uses vmaxss instruction in the main loop body, in spite of the fairly positive vectorization report.
-O3 makes no significant difference, so -O2 is used in practice for stability elsewhere in gcc source code. If omp simd is disabled by removing -fopenmp, excellent code is produced using vmaxps. A performance test shows a 10x speedup with max-unroll-times=2.
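
The attachment itself is not reproduced in this message. As a rough sketch only, assuming s314.c resembles the s314 kernel of the TSVC benchmark suite (a maximum reduction over a float array), the loop in question might look like the following. The function name max_reduce and the exact form of the pragma are illustrative assumptions, not the reporter's code.

```c
#include <stddef.h>

/* Hypothetical stand-in for the attached s314.c (assumption: a TSVC-style
   max reduction). Requires n >= 1.
   With -fopenmp the simd pragma below is honoured; without -fopenmp it is
   ignored and the loop is left to the auto-vectorizer. */
float max_reduce(const float *a, int n)
{
    float x = a[0];
#pragma omp simd reduction(max : x)
    for (int i = 1; i < n; i++) {
        if (a[i] > x)
            x = a[i];
    }
    return x;
}
```

Compiling such a file once with and once without -fopenmp, using the command line given above, and comparing the generated assembly for vmaxss versus vmaxps would be one way to check whether the behaviour described here is reproduced.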