From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (qmail 2187 invoked by alias); 8 May 2014 06:33:23 -0000 Mailing-List: contact gcc-bugs-help@gcc.gnu.org; run by ezmlm Precedence: bulk List-Id: List-Archive: List-Post: List-Help: Sender: gcc-bugs-owner@gcc.gnu.org Received: (qmail 2154 invoked by uid 48); 8 May 2014 06:33:19 -0000 From: "pinskia at gcc dot gnu.org" To: gcc-bugs@gcc.gnu.org Subject: [Bug tree-optimization/61108] gcc 4.8.2 generates incorrect integer arithmetic at O3 (ok at O2) Date: Thu, 08 May 2014 06:33:00 -0000 X-Bugzilla-Reason: CC X-Bugzilla-Type: changed X-Bugzilla-Watch-Reason: None X-Bugzilla-Product: gcc X-Bugzilla-Component: tree-optimization X-Bugzilla-Version: 4.8.2 X-Bugzilla-Keywords: X-Bugzilla-Severity: normal X-Bugzilla-Who: pinskia at gcc dot gnu.org X-Bugzilla-Status: UNCONFIRMED X-Bugzilla-Priority: P3 X-Bugzilla-Assigned-To: unassigned at gcc dot gnu.org X-Bugzilla-Target-Milestone: --- X-Bugzilla-Flags: X-Bugzilla-Changed-Fields: Message-ID: In-Reply-To: References: Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: 7bit X-Bugzilla-URL: http://gcc.gnu.org/bugzilla/ Auto-Submitted: auto-generated MIME-Version: 1.0 X-SW-Source: 2014-05/txt/msg00641.txt.bz2 http://gcc.gnu.org/bugzilla/show_bug.cgi?id=61108 --- Comment #1 from Andrew Pinski --- The tree level looks fine at least to me: vect_val_8.15_127 = MEM[(const uint32_t *)vectp_vals.14_85]; vect_patt_47.16_128 = WIDEN_MULT_EVEN_EXPR ; vect_patt_47.16_129 = WIDEN_MULT_ODD_EXPR ; vect__11.18_130 = vect_patt_47.16_128 >> 32; vect__11.18_131 = vect_patt_47.16_129 >> 32; vect_q_12.19_132 = VEC_PACK_TRUNC_EXPR ; vect__13.20_133 = vect_q_12.19_132 + vect_val_8.15_127; vect_t_14.21_134 = vect__13.20_133 >> 1; But the assembly code looks broken: leaq (%rdi,%r8,4), %r8 movdqa .LC0(%rip), %xmm1 cmpl $2, %r10d movdqa (%r8), %xmm2 movdqa %xmm2, %xmm0 movdqa %xmm2, %xmm3 pmuludq %xmm1, %xmm0 psrlq $32, %xmm0 psrlq $32, %xmm3 <---- where did this come from? pmuludq %xmm1, %xmm3 psrlq $32, %xmm3 shufps $136, %xmm3, %xmm0 paddd %xmm2, %xmm0 psrld $1, %xmm0