From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: by sourceware.org (Postfix, from userid 48) id EEE05385840E; Mon, 17 Jul 2023 08:30:17 +0000 (GMT) DKIM-Filter: OpenDKIM Filter v2.11.0 sourceware.org EEE05385840E DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gcc.gnu.org; s=default; t=1689582617; bh=2/TNEmt4LXd7w7TCl7X8kCPE2rAqdc6cftpR4178mDQ=; h=From:To:Subject:Date:In-Reply-To:References:From; b=nq8edIdfTidSLZi/YiALmMGBLaCuQ338ZWxEthkuk21ohl6H+xosStW+7Heptpesm pEARoesZ+xZ2EoSNuH5V9m/c0m4ngQ5yxUq/T6I9fglVv2uiYCnR5dP1zdQ6tUPhox rmN+qOmtvZghxoOIIrKZGO0DZohyyLeWdnflhHv0= From: "rguenth at gcc dot gnu.org" To: gcc-bugs@gcc.gnu.org Subject: [Bug target/29256] [11/12/13/14 regression] loop performance regression Date: Mon, 17 Jul 2023 08:29:50 +0000 X-Bugzilla-Reason: CC X-Bugzilla-Type: changed X-Bugzilla-Watch-Reason: None X-Bugzilla-Product: gcc X-Bugzilla-Component: target X-Bugzilla-Version: 4.2.0 X-Bugzilla-Keywords: missed-optimization X-Bugzilla-Severity: normal X-Bugzilla-Who: rguenth at gcc dot gnu.org X-Bugzilla-Status: RESOLVED X-Bugzilla-Resolution: FIXED X-Bugzilla-Priority: P2 X-Bugzilla-Assigned-To: unassigned at gcc dot gnu.org X-Bugzilla-Target-Milestone: 11.5 X-Bugzilla-Flags: X-Bugzilla-Changed-Fields: bug_status resolution Message-ID: In-Reply-To: References: Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://gcc.gnu.org/bugzilla/ Auto-Submitted: auto-generated MIME-Version: 1.0 List-Id: https://gcc.gnu.org/bugzilla/show_bug.cgi?id=3D29256 Richard Biener changed: What |Removed |Added ---------------------------------------------------------------------------- Status|NEW |RESOLVED Resolution|--- |FIXED --- Comment #76 from Richard Biener --- x86_64 has [local count: 536870800]: # ivtmp.13_3 =3D PHI vect__1.6_12 =3D MEM [(double *)&a + ivtmp.13_3 * 1]; MEM [(double *)&c + ivtmp.13_3 * 1] =3D vect__1.6_12; ivtmp.13_9 =3D ivtmp.13_3 + 16; if (ivtmp.13_9 !=3D 16000000) and .L2: movapd a(%rax), %xmm0 addq $16, %rax movaps %xmm0, c-16(%rax) cmpq $16000000, %rax jne .L2 which I think is optimal. With -fPIC we get .L2: movapd (%rax,%rdx), %xmm0 addq $16, %rax movaps %xmm0, -16(%rax,%rcx) cmpq $16000000, %rax jne .L2 let's close this.=