From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: by sourceware.org (Postfix, from userid 48) id C604C3844079; Thu, 9 Jul 2020 12:06:04 +0000 (GMT) DKIM-Filter: OpenDKIM Filter v2.11.0 sourceware.org C604C3844079 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gcc.gnu.org; s=default; t=1594296364; bh=iZ9g1N6ponprjXIRY2OuYtEjMVpbB2yNwc6Me97/oEQ=; h=From:To:Subject:Date:In-Reply-To:References:From; b=ydoyeKFHfxKhtlNdHVB4lO6OEcxWbvQJNZnt4rnrvza2ato2X6KJTfScmsNemdXQN emsFM2/WNgpJJsYeZAGko2juctcQ0Rm5HEXtt4H57tMpx8R9cpw587VU6a44ilElU7 GMWns1OPjP55aaIiOwZP4rPvpSLSPAYihovOwREY= From: "rguenth at gcc dot gnu.org" To: gcc-bugs@gcc.gnu.org Subject: [Bug c++/96133] [10/11 Regression] x86-64 gcc 10.1 using -O3 leads to wrong calculation since r10-1882-g831e688af50c5f77 Date: Thu, 09 Jul 2020 12:06:04 +0000 X-Bugzilla-Reason: CC X-Bugzilla-Type: changed X-Bugzilla-Watch-Reason: None X-Bugzilla-Product: gcc X-Bugzilla-Component: c++ X-Bugzilla-Version: 10.1.0 X-Bugzilla-Keywords: X-Bugzilla-Severity: normal X-Bugzilla-Who: rguenth at gcc dot gnu.org X-Bugzilla-Status: NEW X-Bugzilla-Resolution: X-Bugzilla-Priority: P3 X-Bugzilla-Assigned-To: unassigned at gcc dot gnu.org X-Bugzilla-Target-Milestone: --- X-Bugzilla-Flags: X-Bugzilla-Changed-Fields: Message-ID: In-Reply-To: References: Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://gcc.gnu.org/bugzilla/ Auto-Submitted: auto-generated MIME-Version: 1.0 X-BeenThere: gcc-bugs@gcc.gnu.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: Gcc-bugs mailing list List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Thu, 09 Jul 2020 12:06:04 -0000 https://gcc.gnu.org/bugzilla/show_bug.cgi?id=3D96133 --- Comment #2 from Richard Biener --- Confirmed. The i =3D=3D 1 lane is different. We're using standard interle= aving vectorization here, the innermost two loops are unrolled and rgb_cam is eli= ded. Note eventually we optimize the whole loop at compile-time to [local count: 89478486]: MEM [(double *)&xyz_cam] =3D { 2.97789709999999985257090884260833263397216796875e+0, 3.94211709999999992959374139900319278240203857421875e+0 }; MEM [(double *)&xyz_cam + 16B] =3D { 4.9063371000000000066165739553980529308319091796875e+0, 3.291832700000000055950977184693329036235809326171875e+0 }; MEM [(double *)&xyz_cam + 32B] =3D { 4.06932820000000017301999832852743566036224365234375e+0, 4.8468236999999998459998096222989261150360107421875e+0 }; MEM [(double *)&xyz_cam + 48B] =3D { 5.40156330000000028945805752300657331943511962890625e+0, 6.2267732999999996224005371914245188236236572265625e+0 }; xyz_cam[2][2] =3D 7.051983299999999843521436559967696666717529296875e+0;=