From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (qmail 13108 invoked by alias); 11 Dec 2012 09:49:19 -0000 Received: (qmail 13094 invoked by uid 22791); 11 Dec 2012 09:49:18 -0000 X-SWARE-Spam-Status: No, hits=-5.0 required=5.0 tests=AWL,BAYES_00,DKIM_SIGNED,DKIM_VALID,DKIM_VALID_AU,FREEMAIL_FROM,KHOP_RCVD_TRUST,KHOP_THREADED,RCVD_IN_DNSWL_LOW,RCVD_IN_HOSTKARMA_YE X-Spam-Check-By: sourceware.org Received: from mail-wi0-f169.google.com (HELO mail-wi0-f169.google.com) (209.85.212.169) by sourceware.org (qpsmtpd/0.43rc1) with ESMTP; Tue, 11 Dec 2012 09:49:13 +0000 Received: by mail-wi0-f169.google.com with SMTP id hq12so1727932wib.2 for ; Tue, 11 Dec 2012 01:49:09 -0800 (PST) MIME-Version: 1.0 Received: by 10.216.218.88 with SMTP id j66mr6206857wep.187.1355219348970; Tue, 11 Dec 2012 01:49:08 -0800 (PST) Received: by 10.194.179.130 with HTTP; Tue, 11 Dec 2012 01:49:08 -0800 (PST) In-Reply-To: <2679866A-C8BE-4F73-B417-C0A8140D6E99@comcast.net> References: <2679866A-C8BE-4F73-B417-C0A8140D6E99@comcast.net> Date: Tue, 11 Dec 2012 09:49:00 -0000 Message-ID: Subject: Re: [PATCH i386]: Enable push/pop in pro/epilogue for modern CPUs From: Richard Biener To: Mike Stump Cc: Xinliang David Li , Uros Bizjak , GCC Patches Content-Type: text/plain; charset=windows-1252 Content-Transfer-Encoding: quoted-printable X-IsSubscribed: yes Mailing-List: contact gcc-patches-help@gcc.gnu.org; run by ezmlm Precedence: bulk List-Id: List-Archive: List-Post: List-Help: Sender: gcc-patches-owner@gcc.gnu.org X-SW-Source: 2012-12/txt/msg00680.txt.bz2 On Mon, Dec 10, 2012 at 10:07 PM, Mike Stump wrote: > On Dec 10, 2012, at 12:42 PM, Xinliang David Li wrot= e: >> I have not measured the CFI size impact -- but conceivably it should >> be larger -- which is unfortunate. > > Code speed and size are preferable to optimizing dwarf size=85 :-) I'd = let dwarf 5 fix it! Well, different to debug info, CFI data has to be in memory to make unwinding work. These days most Linux distributions enable asyncronous unwind tables so any size savings due to shorter push/pop epilogue/prologue sequences has to be offsetted by the increase in CFI data. I'm not sure there is really a speed difference between both variants (well, maybe due to better icache footprint of the push/pop variant). That said - I'd prefer to have more data on this before making the switch f= or the generic model. What was your original motivation? Just "theory" or was it a real case? Thanks, Richard.