Date: Mon, 15 Apr 2013 10:49:00 -0000
Subject: Re: [PATCH] ARM: Add Cortex-A15 optimized NEON and VFP memcpy routines, with IFUNC.
References: <516BCEE5.9070809@linaro.org>
From: Will Newton
To: Måns Rullgård
Cc: libc-ports@sourceware.org, Patch Tracking

On 15 April 2013 11:45, Måns Rullgård wrote:
> Will Newton writes:
>
>> On 15 April 2013 11:06, Måns Rullgård wrote:
>>
>> Hi Måns,
>>
>>>> Add a high performance memcpy routine optimized for Cortex-A15 with
>>>> variants for use in the presence of NEON and VFP hardware, selected
>>>> at runtime using indirect function support.
>>>
>>> How does this perform on Cortex-A9?
>>
>> The code is also faster on A9 although the gains are not quite as
>> pronounced.
>
> That should be expected. As long as it's not slower, it's all well.
>
> Out of interest, did you compare this memcpy to any other NEON versions
> floating about, such as the one in bionic?

It's been tested against all the versions of memcpy in the
cortex-strings package on launchpad, which includes a version from
bionic, although I don't believe it uses NEON. AFAIK the latest code
in bionic for A15 is an ARM Ltd. copyright version similar to the one
in newlib, which this code is faster than in almost all cases.

--
Will Newton
Toolchain Working Group, Linaro