From: "linkw at gcc dot gnu.org"
To: gcc-bugs@gcc.gnu.org
Subject: [Bug target/106069] [12/13 Regression] wrong code with -O -fno-tree-forwprop -maltivec on ppc64le
Date: Wed, 03 Aug 2022 08:50:46 +0000

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=106069

--- Comment #23 from Kewen Lin ---
> Ideally we would avoid semantic difference of RTL depending on the target.
> If that's not avoidable there should be target macros/hooks that specify
> the desired semantics.

Not sure, IMHO it seems it doesn't depend on the target but on endianness
(BYTES_BIG_ENDIAN)?  Segher and Mike may have more insights on this.

> I assume the semantic difference is in
> vec_concat behavior but that's just documented as
>
> @findex vec_concat
> @item (vec_concat:@var{m} @var{x1} @var{x2})
> Describes a vector concat operation.  The result is a concatenation of the
> vectors or scalars @var{x1} and @var{x2}; its length is the sum of the
> lengths of the two inputs.
>
> which is a bit unspecific.  To me it implies that
> vec_select of a single lane N of the concat result can be distributed
> to the operands of the vec_concat in the obvious way (if N >=
> GET_MODE_NUNITS (x1) subtract GET_MODE_NUNITS and use x2)

Yeah, the documentation isn't clear, neither for vec_select.  I guess
vec_select also matters here, the indexes for vec_select would have the LE
ordering like subreg byte offset on LE?
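
For reference, the lane distribution described in the quoted text can be
written down as a small model.  This is only a sketch of that reading of the
documentation, not GCC code: struct lane_source and distribute_lane are
hypothetical names, and nunits_x1/nunits_x2 stand in for GET_MODE_NUNITS of
each operand (1 for a scalar).  It assumes lane numbering does not depend on
endianness, which is exactly the point in question here.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical model, not GCC code: says which operand of
   (vec_concat x1 x2) supplies a given lane of the result.  */
struct lane_source
{
  int operand;  /* 1 for x1, 2 for x2.  */
  size_t lane;  /* Lane index within that operand.  */
};

/* Map lane N of the concatenation onto an operand lane.
   NUNITS_X1 and NUNITS_X2 stand in for GET_MODE_NUNITS of the
   two operands (assumption: 1 for a scalar operand).  */
static struct lane_source
distribute_lane (size_t nunits_x1, size_t nunits_x2, size_t n)
{
  struct lane_source src;

  /* N must name a lane of the concatenated result.  */
  assert (n < nunits_x1 + nunits_x2);

  if (n >= nunits_x1)
    {
      /* Lane lies past x1: subtract x1's lane count and use x2.  */
      src.operand = 2;
      src.lane = n - nunits_x1;
    }
  else
    {
      src.operand = 1;
      src.lane = n;
    }
  return src;
}
```

With two 2-lane operands, distribute_lane (2, 2, 3) names lane 1 of x2 under
this model.  Whether the same mapping holds when BYTES_BIG_ENDIAN is set is
the open question.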