From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (qmail 62377 invoked by alias); 25 Nov 2016 11:30:39 -0000 Mailing-List: contact gcc-patches-help@gcc.gnu.org; run by ezmlm Precedence: bulk List-Id: List-Archive: List-Post: List-Help: Sender: gcc-patches-owner@gcc.gnu.org Received: (qmail 62223 invoked by uid 89); 25 Nov 2016 11:30:31 -0000 Authentication-Results: sourceware.org; auth=none X-Virus-Found: No X-Spam-SWARE-Status: No, score=-1.9 required=5.0 tests=AWL,BAYES_00,FREEMAIL_FROM,RCVD_IN_DNSWL_NONE,SPF_PASS autolearn=ham version=3.3.2 spammy=DImode, edlinger, dimode, optimal X-HELO: mail-wm0-f67.google.com Received: from mail-wm0-f67.google.com (HELO mail-wm0-f67.google.com) (74.125.82.67) by sourceware.org (qpsmtpd/0.93/v0.84-503-g423c35a) with ESMTP; Fri, 25 Nov 2016 11:30:20 +0000 Received: by mail-wm0-f67.google.com with SMTP id g23so7507273wme.1 for ; Fri, 25 Nov 2016 03:30:07 -0800 (PST) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20130820; h=x-gm-message-state:mime-version:in-reply-to:references:from:date :message-id:subject:to:cc; bh=R/c6FVqmvnmgvsk0Eb/xTEgHPicNA3ZBEiZXTT5xFMA=; b=VYcx/B3Qr2iXQ9c9O9/wiWjb263Y9yS70WK5rfbOsQYkXjUrmurj7FxbKepHWX2yK4 HCWo02TKhRkkG42f0bVhc+Vbn5005SlgfSiEPpUoSatytpcdR7Wm4VIHJjQgTakbGHOG sTQzi+QQFr8LT/z/JVMYd8BYTUgubQMZ2Sonpf80Wccu2X1CT05CmzJygaEDZNRMkAiQ ZqSWR/XKAO0w9bus0C7HpR2SLU79DciZcrw741sr+05KH68z2oOZHkuQIgH3XkKPV8Jv 9XD2BbMU/3WydNDjbSY6XyQXjyhO+Xvri4362L2XjB876S84TvytLrgekfQQqVl7g10l +e5w== X-Gm-Message-State: AKaTC03g5T9rm5LniF6HE6yjvHB0erT5tHB7gboI37E9puy4sh4r2f8YsrtlCM7QbH1OUe2ZsQZFO5b7KepN8g== X-Received: by 10.28.90.65 with SMTP id o62mr6776119wmb.80.1480073405608; Fri, 25 Nov 2016 03:30:05 -0800 (PST) MIME-Version: 1.0 Received: by 10.194.36.232 with HTTP; Fri, 25 Nov 2016 03:30:04 -0800 (PST) In-Reply-To: References: From: Ramana Radhakrishnan Date: Fri, 25 Nov 2016 11:30:00 -0000 Message-ID: Subject: Re: [PATCH, ARM] Further improve stack usage on sha512 (PR 77308) To: Bernd Edlinger Cc: GCC Patches , Kyrill Tkachov , Richard Earnshaw , Wilco Dijkstra Content-Type: text/plain; charset=UTF-8 X-IsSubscribed: yes X-SW-Source: 2016-11/txt/msg02625.txt.bz2 On Sun, Nov 6, 2016 at 2:18 PM, Bernd Edlinger wrote: > Hi! > > This improves the stack usage on the sha512 test case for the case > without hardware fpu and without iwmmxt by splitting all di-mode > patterns right while expanding which is similar to what the shift-pattern > does. It does nothing in the case iwmmxt and fpu=neon or vfp as well as > thumb1. > I would go further and do this in the absence of Neon, the VFP unit being there doesn't help with DImode operations i.e. we do not have 64 bit integer arithmetic instructions without Neon. The main reason why we have the DImode patterns split so late is to give a chance for folks who want to do 64 bit arithmetic in Neon a chance to make this work as well as support some of the 64 bit Neon intrinsics which IIRC map down to these instructions. Doing this just for soft-float doesn't improve the default case only. I don't usually test iwmmxt and I'm not sure who has the ability to do so, thus keeping this restriction for iwMMX is fine. > It reduces the stack usage from 2300 to near optimal 272 bytes (!). > > Note this also splits many ldrd/strd instructions and therefore I will > post a followup-patch that mitigates this effect by enabling the ldrd/strd > peephole optimization after the necessary reg-testing. > > > Bootstrapped and reg-tested on arm-linux-gnueabihf. What do you mean by arm-linux-gnueabihf - when folks say that I interpret it as --with-arch=armv7-a --with-float=hard --with-fpu=vfpv3-d16 or (--with-fpu=neon). If you've really bootstrapped and regtested it on armhf, doesn't this patch as it stand have no effect there i.e. no change ? arm-linux-gnueabihf usually means to me someone has configured with --with-float=hard, so there are no regressions in the hard float ABI case, Ramana > Is it OK for trunk? > > > Thanks > Bernd.