From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: by sourceware.org (Postfix, from userid 48) id 472163855006; Fri, 11 Jun 2021 20:28:02 +0000 (GMT) DKIM-Filter: OpenDKIM Filter v2.11.0 sourceware.org 472163855006 From: "segher at gcc dot gnu.org" To: gcc-bugs@gcc.gnu.org Subject: [Bug target/100085] Bad code for union transfer from __float128 to vector types Date: Fri, 11 Jun 2021 20:28:01 +0000 X-Bugzilla-Reason: CC X-Bugzilla-Type: changed X-Bugzilla-Watch-Reason: None X-Bugzilla-Product: gcc X-Bugzilla-Component: target X-Bugzilla-Version: 10.2.1 X-Bugzilla-Keywords: missed-optimization X-Bugzilla-Severity: normal X-Bugzilla-Who: segher at gcc dot gnu.org X-Bugzilla-Status: NEW X-Bugzilla-Resolution: X-Bugzilla-Priority: P3 X-Bugzilla-Assigned-To: unassigned at gcc dot gnu.org X-Bugzilla-Target-Milestone: --- X-Bugzilla-Flags: X-Bugzilla-Changed-Fields: Message-ID: In-Reply-To: References: Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://gcc.gnu.org/bugzilla/ Auto-Submitted: auto-generated MIME-Version: 1.0 X-BeenThere: gcc-bugs@gcc.gnu.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: Gcc-bugs mailing list List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Fri, 11 Jun 2021 20:28:02 -0000 https://gcc.gnu.org/bugzilla/show_bug.cgi?id=3D100085 --- Comment #14 from Segher Boessenkool --- We *have* TImode already, but most 128-bit scalars currently use V1TImode. This often leads to reduced performance because that is not a scalar mode, does not get all optimisations we have generically for all other integer scalars. We have to do a lot of it manually, which is a lot of (combine) patterns, and we still miss almost all cases. I am not saying we should remove V1TImode. I am saying we want to use plain TImode for scalars, on newer cpus. On p8 we had V1TImode so that we could reduce the traffic between the vector register files and the GPR register file, because that was very costly on p8 (mtvsr* and mfvsr* were 5 cycles, and mtvsrdd and mfvsrld didn't even exist yet). Using V1TImode for scalars on p8 was a pretty big win. It should be a win again to use TImode on later cpus though. > And I have grave reservations about the vague plans of small/fringe minor= ity to=20 > subset the PowerISA for their convenience. I don't have reservations about that. Instead, I battle that with all I ca= n.=