public inbox for libc-stable@sourceware.org
 help / color / mirror / Atom feed
* Re: [PATCH] x86: Don't set Prefer_No_AVX512 for processors with AVX512 and AVX-VNNI
       [not found]                 ` <CAFUsyfLnBfseEOHEgZXTaywr3xnv-awUo+Bywdhr7+V4M7Vo9A@mail.gmail.com>
@ 2022-04-23  1:51                   ` Sunil Pandey
  0 siblings, 0 replies; only message in thread
From: Sunil Pandey @ 2022-04-23  1:51 UTC (permalink / raw)
  To: Noah Goldstein, libc-stable
  Cc: H.J. Lu, Florian Weimer, H.J. Lu via Libc-alpha, Hongyu Wang,
	Thiago Macieira, liuhongt, Arjan van de Ven

On Tue, Dec 7, 2021 at 11:33 AM Noah Goldstein via Libc-alpha
<libc-alpha@sourceware.org> wrote:
>
> On Tue, Dec 7, 2021 at 9:53 AM H.J. Lu via Libc-alpha
> <libc-alpha@sourceware.org> wrote:
> >
> > On Tue, Dec 7, 2021 at 7:48 AM Florian Weimer <fweimer@redhat.com> wrote:
> > >
> > > * H. J. Lu via Libc-alpha:
> > >
> > > > On Tue, Dec 7, 2021 at 6:05 AM Florian Weimer <fweimer@redhat.com> wrote:
> > > >>
> > > >> * H. J. Lu via Libc-alpha:
> > > >>
> > > >> > Hongtao, Hongyu,  can you find a Rocket Lake to test?
> > > >>
> > > >> I've found a lab machine with an i7-11700 CPU.  Is there something I
> > > >> could test for you?
> > > >
> > > > You can enable AVX512 in glibc with:
> > > >
> > > > $ export GLIBC_TUNABLES=glibc.cpu.hwcaps=-Prefer_No_AVX512
> > > >
> > > > While bootstrapping GCC with -j8, track CPU frequency with turbostat.  If
> > > > there is no CPU frequency drop and build time is less comparing against
> > > > without GLIBC_TUNABLES, we can enable AVX512.
> > > >
> > > >> (This could be non-production silicon, though.)
> > > >>
> > > >
> > > > The frequency behavior of non-production silicon can be different.
> > >
> > > With that caveat, it seems that frequencies drop further with
> > > GLIBC_TUNABLES set as above, and the build is also a little bit slower
> > > (5m31s vs 5m23s, the AVX-512 build was run first, and the systems was a
> > > little bit warmer for the second run).
> > >
> > > Would it make sense to run more extensive tests, or should we wait for
> > > someone with production silicon to show up?
> >
> > GCC is a heavy user of memcpy/memset, which is a good proxy of
> > ZMM load/store impact on CPU frequency.   We need to run the same
> > test on a production Rocket Lake.
>
> I would think a microbenchmark would be better for determining if
> rocketlake actually has throttling.
>
> Testing the full j8 GCC build will add a bunch of frequency "noise"
> due to thermal throttling.
>
> >
> > --
> > H.J.

I would like to backport this patch to release branches.
Any comments or objections?

--Sunil

^ permalink raw reply	[flat|nested] only message in thread

only message in thread, other threads:[~2022-04-23  1:52 UTC | newest]

Thread overview: (only message) (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
     [not found] <20211206032303.7159-1-hjl.tools@gmail.com>
     [not found] ` <CAFUsyfLjNsvhg9yAvyRr69c8M5Ra7RDbPgwFJoEMvXa-MJUjSw@mail.gmail.com>
     [not found]   ` <CAMe9rOquRNm6_a1LZ8+u0czifb8Gm9iVGu8jRPxYt5+RJjbsxQ@mail.gmail.com>
     [not found]     ` <3639bca9-e90d-d3ff-c758-d2d5c4c0a3d2@linux.intel.com>
     [not found]       ` <CAMe9rOqnJ9xgKbUiP10hPS33hpQF5g-cqHm3a6cgtwyk8RYoFw@mail.gmail.com>
     [not found]         ` <87r1aoqzlz.fsf@oldenburg.str.redhat.com>
     [not found]           ` <CAMe9rOqwUwEZjUF=wonTAAxJyrj5eD2wApe0bUHC+sam005y6Q@mail.gmail.com>
     [not found]             ` <878rwwquvk.fsf@oldenburg.str.redhat.com>
     [not found]               ` <CAMe9rOoZVBuTaT9XmmBZs-KUFFrNOw3R0_nroUH7n9JJe_J5nQ@mail.gmail.com>
     [not found]                 ` <CAFUsyfLnBfseEOHEgZXTaywr3xnv-awUo+Bywdhr7+V4M7Vo9A@mail.gmail.com>
2022-04-23  1:51                   ` [PATCH] x86: Don't set Prefer_No_AVX512 for processors with AVX512 and AVX-VNNI Sunil Pandey

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).