From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (qmail 81545 invoked by alias); 20 Oct 2015 14:27:34 -0000 Mailing-List: contact gcc-patches-help@gcc.gnu.org; run by ezmlm Precedence: bulk List-Id: List-Archive: List-Post: List-Help: Sender: gcc-patches-owner@gcc.gnu.org Received: (qmail 81531 invoked by uid 89); 20 Oct 2015 14:27:33 -0000 Authentication-Results: sourceware.org; auth=none X-Virus-Found: No X-Spam-SWARE-Status: No, score=-2.3 required=5.0 tests=AWL,BAYES_00,FREEMAIL_FROM,RCVD_IN_DNSWL_LOW,SPF_PASS autolearn=ham version=3.3.2 X-HELO: mail-lf0-f47.google.com Received: from mail-lf0-f47.google.com (HELO mail-lf0-f47.google.com) (209.85.215.47) by sourceware.org (qpsmtpd/0.93/v0.84-503-g423c35a) with (AES128-GCM-SHA256 encrypted) ESMTPS; Tue, 20 Oct 2015 14:27:18 +0000 Received: by lffy185 with SMTP id y185so6646821lff.2 for ; Tue, 20 Oct 2015 07:27:15 -0700 (PDT) MIME-Version: 1.0 X-Received: by 10.25.147.209 with SMTP id v200mr1337499lfd.22.1445351235227; Tue, 20 Oct 2015 07:27:15 -0700 (PDT) Received: by 10.25.160.73 with HTTP; Tue, 20 Oct 2015 07:27:15 -0700 (PDT) In-Reply-To: References: <56257F53.2000905@samsung.com> Date: Tue, 20 Oct 2015 14:46:00 -0000 Message-ID: Subject: Re: [PATCH][AArch64] Replace insn to zero up DF register From: Andrew Pinski To: Evandro Menezes Cc: GCC Patches , Marcus Shawcroft , Kyrill Tkachov Content-Type: text/plain; charset=UTF-8 X-IsSubscribed: yes X-SW-Source: 2015-10/txt/msg01889.txt.bz2 On Tue, Oct 20, 2015 at 7:59 AM, Andrew Pinski wrote: > On Tue, Oct 20, 2015 at 7:51 AM, Andrew Pinski wrote: >> On Tue, Oct 20, 2015 at 7:40 AM, Evandro Menezes wrote: >>> In the existing targets, it seems that it's always faster to zero up a DF >>> register with "movi %d0, #0" instead of "fmov %d0, xzr". >> >> I think for ThunderX 1, this change will not make a difference. So I >> am neutral on this change. > > Actually depending on fmov is decoded in our pipeline, this change > might actually be worse. Currently fmov with an immediate is 1 cycle > while movi is two cycles. Let me double check how internally on how > it is decoded and if it is 1 cycle or two. Ok, my objections are removed as I talked with the architectures here at Cavium and using movi is better in this case. Thanks, Andrew > > Thanks, > Andrew > >> >> Thanks, >> Andrew >> >>> >>> This patch modifies the respective pattern. >>> >>> Please, commit if it's alright. >>> >>> Thank you, >>> >>> -- >>> Evandro Menezes >>>