From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from mail2-relais-roc.national.inria.fr (mail2-relais-roc.national.inria.fr [192.134.164.83]) by sourceware.org (Postfix) with ESMTPS id 9F7B7385B835 for ; Fri, 10 Apr 2020 06:32:42 +0000 (GMT) DMARC-Filter: OpenDMARC Filter v1.3.2 sourceware.org 9F7B7385B835 Authentication-Results: sourceware.org; dmarc=none (p=none dis=none) header.from=inria.fr Authentication-Results: sourceware.org; spf=pass smtp.mailfrom=Paul.Zimmermann@inria.fr X-IronPort-AV: E=Sophos;i="5.72,364,1580770800"; d="scan'208";a="444657863" Received: from tomate.loria.fr (HELO tomate) ([152.81.10.51]) by mail2-relais-roc.national.inria.fr with ESMTP/TLS/DHE-RSA-AES256-GCM-SHA384; 10 Apr 2020 08:32:38 +0200 Date: Fri, 10 Apr 2020 08:32:38 +0200 Message-Id: From: paul zimmermann To: Adhemerval Zanella CC: libc-alpha@sourceware.org In-reply-to: <20200409195945.10393-2-adhemerval.zanella@linaro.org> (message from Adhemerval Zanella on Thu, 9 Apr 2020 16:59:42 -0300) Subject: Re: [PATCH 2/5] math: Optimized generic exp10f with wrappers References: <20200409195945.10393-1-adhemerval.zanella@linaro.org> <20200409195945.10393-2-adhemerval.zanella@linaro.org> X-Spam-Status: No, score=-1.8 required=5.0 tests=BAYES_00, KAM_DMARC_STATUS, SPF_HELO_NONE, SPF_PASS, TXREP autolearn=ham autolearn_force=no version=3.4.2 X-Spam-Checker-Version: SpamAssassin 3.4.2 (2018-09-13) on server2.sourceware.org X-BeenThere: libc-alpha@sourceware.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: Libc-alpha mailing list List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Fri, 10 Apr 2020 06:32:58 -0000 Dear Adhemerval, > Result for x86_64 (i7-4790K CPU @ 4.00GHz) are: > > Before new code: > "exp10f": { > "": { > "duration": 1.00923e+09, > "iterations": 5.4832e+07, > "max": 249.361, > "min": 12.5, > "mean": 18.4059 > } > > With new code: > "exp10f": { > "": { > "duration": 9.7639e+08, > "iterations": 8.1056e+07, > "max": 563.323, > "min": 11.54, > "mean": 12.0459 > } > > Result for aarch64 (A72 @ 2GHz) are: > > Before new code: > "exp10f": { > "": { > "duration": 1.00923e+09, > "iterations": 5.4832e+07, > "max": 249.361, > "min": 12.5, > "mean": 18.4059 > } > > With new code: > "exp10f": { > "": { > "duration": 9.7639e+08, > "iterations": 8.1056e+07, > "max": 563.323, > "min": 11.54, > "mean": 12.0459 > } the timings for x86_64 and aarch64 are identical. A copy/paste error? Paul