From: "Carlos O'Donell" <firstname.lastname@example.org>
To: "Ryan S. Arnold" <email@example.com>
Cc: "Will Newton" <firstname.lastname@example.org>,
"Patch Tracking" <email@example.com>,
"Ondřej Bílka" <firstname.lastname@example.org>,
"Siddhesh Poyarekar" <email@example.com>
Subject: Re: [PATCH] sysdeps/arm/armv7/multiarch/memcpy_impl.S: Improve performance.
Date: Tue, 03 Sep 2013 19:54:00 -0000 [thread overview]
Message-ID: <52263E63.firstname.lastname@example.org> (raw)
On 09/03/2013 03:31 PM, Ryan S. Arnold wrote:
> On Tue, Sep 3, 2013 at 11:18 AM, Carlos O'Donell <email@example.com> wrote:
>> We have one, it's the glibc microbenchmark, and we want to expand it,
>> otherwise when ACME comes with their patch for ARM and breaks performance
>> for targets that Linaro cares about I have no way to reject the patch
>> objectively :-)
> Can you be objective in analyzing performance when two different
> people have differing opinions on what performance preconditions
> should be coded against?
> There are some cases that are obvious.. we know that from pipeline
> analysis that certain instruction sequences can hinder performance.
> That is objective and can be measured by a benchmark, but saying that
> a particular change penalizes X sized copies but helps Y sized copies
> when there are no published performance preconditions isn't. It's a
> difference in opinion of what's important.
I agree. The project needs to adopt some set of performance preconditions
and document them and defend them.
If we can't defend these positions then we will never be able to evaluate
any performance patches. We will see-saw between several implementations
over the years.
The current set of performance preconditions are baked into the experience
of the core developers reviewing patches. I want the experts out of the
> PowerPC has had the luxury of not having their performance
> pre-conditions contested. PowerPC string performance is optimized
> based upon customer data-set analysis. So PowerPC's preconditions are
> pretty concrete... Optimize for aligned data in excess of 128-bytes
> (I believe).
We should be documenting this somewhere, preferably in a Power-specific
test that looks at just this kind of issue.
Documenting this statically is the first, in my opinion, stepping stone
to having something like dynamic feedback.
>> You need to statistically analyze the numbers, assign weights to ranges,
>> and come up with some kind of number that evaluates the results based
>> on *some* formula. That is the only way we are going to keep moving
>> performance forward (against some kind of criteria).
> This sounds like establishing preconditions (what types of data will
> be optimized for).
I agree. We need it.
> Unless technology evolves that you can statistically analyze data in
> real time and adjust the implementation based on what you find (an
> implementation with a different set of preconditions) to account for
> this you're going to end up with a lot of in-fighting over
Why do you assume we'll have a lot of in-fighting over performance?
At present we've split the performance intensive (or so we believe)
routines on a per-machine basis. The arguments are then going to be
had only on a per-machine basis, and even then for each hardware
variant can have an IFUNC resolver select the right routine at
Then we come upon the tunables that should allow some dynamic adjustment
of an algorithm based on realtime data.
> I've run into situations where I recommended that a customer code
> their own string function implementation because they continually
> encountered unaligned-data when copying-by-value in C++ functions and
> PowerPC's string function implementations penalized unaligned copies
> in preference for aligned copies.
Provide both in glibc and expose a tunable?
next prev parent reply other threads:[~2013-09-03 19:54 UTC|newest]
Thread overview: 37+ messages / expand[flat|nested] mbox.gz Atom feed top
2013-08-12 7:55 Will Newton
2013-08-27 7:46 ` Will Newton
2013-08-30 17:14 ` Carlos O'Donell
2013-08-30 18:48 ` Will Newton
2013-08-30 19:26 ` Carlos O'Donell
[not found] ` <CANu=DmhA9QvSe6RS72Db2P=yyjC72fsE8d4QZKHEcNiwqxNMvw@mail.gmail.com>
2013-09-02 14:18 ` benchmark improvements (Was: Re: [PATCH] sysdeps/arm/armv7/multiarch/memcpy_impl.S: Improve performance.) Siddhesh Poyarekar
2013-09-03 13:46 ` Will Newton
2013-09-03 17:48 ` Ondřej Bílka
2013-09-02 19:57 ` [PATCH] sysdeps/arm/armv7/multiarch/memcpy_impl.S: Improve performance Ondřej Bílka
2013-09-03 16:18 ` Carlos O'Donell
2013-09-03 17:37 ` Ondřej Bílka
2013-09-03 17:52 ` Carlos O'Donell
2013-09-03 18:57 ` Ondřej Bílka
2013-09-03 19:15 ` Carlos O'Donell
2013-09-04 7:27 ` Siddhesh Poyarekar
2013-09-04 11:03 ` Ondřej Bílka
2013-09-04 11:43 ` Siddhesh Poyarekar
2013-09-04 17:37 ` Ryan S. Arnold
2013-09-05 8:04 ` Ondřej Bílka
2013-09-04 15:30 ` Carlos O'Donell
2013-09-04 17:35 ` Ryan S. Arnold
2013-09-05 11:07 ` Ondřej Bílka
2013-09-05 11:54 ` Joseph S. Myers
2013-09-03 19:34 ` Ryan S. Arnold
2013-09-07 11:55 ` Ondřej Bílka
2013-09-03 19:31 ` Ryan S. Arnold
2013-09-03 19:54 ` Carlos O'Donell [this message]
2013-09-03 20:56 ` Ryan S. Arnold
2013-09-03 23:29 ` Ondřej Bílka
2013-09-03 23:31 ` Carlos O'Donell
2013-09-03 22:27 ` Ondřej Bílka
2013-09-02 14:18 ` Will Newton
2013-09-03 16:14 ` Carlos O'Donell
2013-08-29 23:58 ` Joseph S. Myers
2013-08-30 14:56 ` Will Newton
2013-08-30 15:18 ` Joseph S. Myers
2013-08-30 18:46 ` Will Newton
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).