[Bug dynamic-link/30428] New: New AMD cache size computation logic does not work for some CPUs, hypervisors

public inbox for glibc-bugs@sourceware.org
help / color / mirror / Atom feed

* [Bug dynamic-link/30428] New: New AMD cache size computation logic does not work for some CPUs, hypervisors
@ 2023-05-08 16:42 fw at deneb dot enyo.de
  2023-05-08 17:03 ` [Bug dynamic-link/30428] " fweimer at redhat dot com
                   ` (7 more replies)
  0 siblings, 8 replies; 9+ messages in thread
From: fw at deneb dot enyo.de @ 2023-05-08 16:42 UTC (permalink / raw)
  To: glibc-bugs

https://sourceware.org/bugzilla/show_bug.cgi?id=30428

            Bug ID: 30428
           Summary: New AMD cache size computation logic does not work for
                    some CPUs, hypervisors
           Product: glibc
           Version: 2.38
            Status: NEW
          Severity: critical
          Priority: P1
         Component: dynamic-link
          Assignee: unassigned at sourceware dot org
          Reporter: fw at deneb dot enyo.de
  Target Milestone: ---
             Flags: security-

This commit:

commit 103a469dc7755fd9e8ccf362f3dd4c55dc761908
Author: Sajan Karumanchi <sajan.karumanchi@amd.com>
Date:   Wed Jan 18 18:29:04 2023 +0100

    x86: Cache computation for AMD architecture.

    All AMD architectures cache details will be computed based on
    __cpuid__ `0x8000_001D` and the reference to __cpuid__ `0x8000_0006` will
be
    zeroed out for future architectures.

    Reviewed-by: Premachandra Mallappa <premachandra.mallappa@amd.com>

changed cache size computation on the AMD architecture.

However, the new way of doing things is not supported by all AMD CPUs. This
CPU:

processor       : 0
vendor_id       : AuthenticAMD
cpu family      : 16
model           : 6
model name      : AMD Turion(tm) II Neo N40L Dual-Core Processor
stepping        : 3
microcode       : 0x10000c8
cpu MHz         : 800.000
cache size      : 1024 KB
physical id     : 0
siblings        : 2
core id         : 0
cpu cores       : 2
apicid          : 0
initial apicid  : 0
fpu             : yes
fpu_exception   : yes
cpuid level     : 5
wp              : yes
flags           : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov
pat pse36 clflush mmx fxsr sse sse2 ht syscall nx mmxext fxsr_opt pdpe1gb
rdtscp lm 3dnowext 3dnow constant_tsc rep_good nopl nonstop_tsc cpuid
extd_apicid pni monitor cx16 popcnt lahf_lm cmp_legacy svm extapic cr8_legacy
abm sse4a misalignsse 3dnowprefetch osvw ibs skinit wdt nodeid_msr hw_pstate
vmmcall npt lbrv svm_lock nrip_save
bugs            : tlb_mmatch apic_c1e fxsave_leak sysret_ss_attrs null_seg
amd_e400 spectre_v1 spectre_v2
bogomips        : 2995.32
TLB size        : 1024 4K pages
clflush size    : 64
cache_alignment : 64
address sizes   : 48 bits physical, 48 bits virtual
power management: ts ttp tm stc 100mhzsteps hwpstate

reports all zeros for its caches after this change (build from commit
cea74a4a24c36202309e8254f1f938e2166488f3, which includes commit mentioned
above):

$ ./ld.so --list-diagnostics | grep -E 'level|threshold'
x86.cpu_features.non_temporal_threshold=0x4040
x86.cpu_features.rep_movsb_threshold=0x800
x86.cpu_features.rep_movsb_stop_threshold=0x0
x86.cpu_features.rep_stosb_threshold=0x800
x86.cpu_features.level1_icache_size=0x0
x86.cpu_features.level1_icache_linesize=0x0
x86.cpu_features.level1_dcache_size=0x0
x86.cpu_features.level1_dcache_assoc=0x0
x86.cpu_features.level1_dcache_linesize=0x0
x86.cpu_features.level2_cache_size=0x0
x86.cpu_features.level2_cache_assoc=0x0
x86.cpu_features.level2_cache_linesize=0x0
x86.cpu_features.level3_cache_size=0x0
x86.cpu_features.level3_cache_assoc=0x0
x86.cpu_features.level3_cache_linesize=0x0
x86.cpu_features.level4_cache_size=0xffffffffffffffff

A build from the 2.36 branch (commit b7008a92f505632f32b313d1033d6d15c99a0b31)
yields this instead:

$ ./ld.so --list-diagnostics | grep -E 'level|threshold'
x86.cpu_features.non_temporal_threshold=0xc0000
x86.cpu_features.rep_movsb_threshold=0x800
x86.cpu_features.rep_movsb_stop_threshold=0x100000
x86.cpu_features.rep_stosb_threshold=0x800
x86.cpu_features.level1_icache_size=0x10000
x86.cpu_features.level1_icache_linesize=0x40
x86.cpu_features.level1_dcache_size=0x10000
x86.cpu_features.level1_dcache_assoc=0x2
x86.cpu_features.level1_dcache_linesize=0x40
x86.cpu_features.level2_cache_size=0x100000
x86.cpu_features.level2_cache_assoc=0x10
x86.cpu_features.level2_cache_linesize=0x40
x86.cpu_features.level3_cache_size=0x0
x86.cpu_features.level3_cache_assoc=0x0
x86.cpu_features.level3_cache_linesize=0x0
x86.cpu_features.level4_cache_size=0xffffffffffffffff

So it's a regression.

The CPU is probably old enough that we don't use temporal stores, so there is
probably not going to be a crash in glibc. But lack of accurate cache sizes
probably still causes performance regressions elsewhere (although no one is
going to use CPUs that old for their performance, admittedly).

Some hypervisors also fail to pass through these CPUID values even if they
identify the CPU as an AMD model:
https://bugzilla.redhat.com/show_bug.cgi?id=2196271

Addressing hypervisor compatibility might be the important part here.

-- 
You are receiving this mail because:
You are on the CC list for the bug.

^ permalink raw reply	[flat|nested] 9+ messages in thread

* [Bug dynamic-link/30428] New AMD cache size computation logic does not work for some CPUs, hypervisors
  2023-05-08 16:42 [Bug dynamic-link/30428] New: New AMD cache size computation logic does not work for some CPUs, hypervisors fw at deneb dot enyo.de
@ 2023-05-08 17:03 ` fweimer at redhat dot com
  2023-05-08 17:09 ` fweimer at redhat dot com
                   ` (6 subsequent siblings)
  7 siblings, 0 replies; 9+ messages in thread
From: fweimer at redhat dot com @ 2023-05-08 17:03 UTC (permalink / raw)
  To: glibc-bugs

https://sourceware.org/bugzilla/show_bug.cgi?id=30428

Florian Weimer <fweimer at redhat dot com> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |fweimer at redhat dot com

-- 
You are receiving this mail because:
You are on the CC list for the bug.

^ permalink raw reply	[flat|nested] 9+ messages in thread

* [Bug dynamic-link/30428] New AMD cache size computation logic does not work for some CPUs, hypervisors
  2023-05-08 16:42 [Bug dynamic-link/30428] New: New AMD cache size computation logic does not work for some CPUs, hypervisors fw at deneb dot enyo.de
  2023-05-08 17:03 ` [Bug dynamic-link/30428] " fweimer at redhat dot com
@ 2023-05-08 17:09 ` fweimer at redhat dot com
  2023-05-08 20:29 ` fweimer at redhat dot com
                   ` (5 subsequent siblings)
  7 siblings, 0 replies; 9+ messages in thread
From: fweimer at redhat dot com @ 2023-05-08 17:09 UTC (permalink / raw)
  To: glibc-bugs

https://sourceware.org/bugzilla/show_bug.cgi?id=30428

Florian Weimer <fweimer at redhat dot com> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
           See Also|                            |https://sourceware.org/bugz
                   |                            |illa/show_bug.cgi?id=29953

-- 
You are receiving this mail because:
You are on the CC list for the bug.

^ permalink raw reply	[flat|nested] 9+ messages in thread

* [Bug dynamic-link/30428] New AMD cache size computation logic does not work for some CPUs, hypervisors
  2023-05-08 16:42 [Bug dynamic-link/30428] New: New AMD cache size computation logic does not work for some CPUs, hypervisors fw at deneb dot enyo.de
  2023-05-08 17:03 ` [Bug dynamic-link/30428] " fweimer at redhat dot com
  2023-05-08 17:09 ` fweimer at redhat dot com
@ 2023-05-08 20:29 ` fweimer at redhat dot com
  2023-05-09 14:55 ` jamborm at gcc dot gnu.org
                   ` (4 subsequent siblings)
  7 siblings, 0 replies; 9+ messages in thread
From: fweimer at redhat dot com @ 2023-05-08 20:29 UTC (permalink / raw)
  To: glibc-bugs

https://sourceware.org/bugzilla/show_bug.cgi?id=30428

--- Comment #1 from Florian Weimer <fweimer at redhat dot com> ---
Just to clarify: this regression affects sysconf (_SC_LEVEL2_CACHE_SIZE) and
similar configuration values, so it impacts more than just glibc-internal
tuning decisions.

-- 
You are receiving this mail because:
You are on the CC list for the bug.

^ permalink raw reply	[flat|nested] 9+ messages in thread

* [Bug dynamic-link/30428] New AMD cache size computation logic does not work for some CPUs, hypervisors
  2023-05-08 16:42 [Bug dynamic-link/30428] New: New AMD cache size computation logic does not work for some CPUs, hypervisors fw at deneb dot enyo.de
                   ` (2 preceding siblings ...)
  2023-05-08 20:29 ` fweimer at redhat dot com
@ 2023-05-09 14:55 ` jamborm at gcc dot gnu.org
  2023-06-11 17:52 ` sam at gentoo dot org
                   ` (3 subsequent siblings)
  7 siblings, 0 replies; 9+ messages in thread
From: jamborm at gcc dot gnu.org @ 2023-05-09 14:55 UTC (permalink / raw)
  To: glibc-bugs

https://sourceware.org/bugzilla/show_bug.cgi?id=30428

Martin Jambor <jamborm at gcc dot gnu.org> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |jamborm at gcc dot gnu.org

-- 
You are receiving this mail because:
You are on the CC list for the bug.

^ permalink raw reply	[flat|nested] 9+ messages in thread

* [Bug dynamic-link/30428] New AMD cache size computation logic does not work for some CPUs, hypervisors
  2023-05-08 16:42 [Bug dynamic-link/30428] New: New AMD cache size computation logic does not work for some CPUs, hypervisors fw at deneb dot enyo.de
                   ` (3 preceding siblings ...)
  2023-05-09 14:55 ` jamborm at gcc dot gnu.org
@ 2023-06-11 17:52 ` sam at gentoo dot org
  2023-07-04 17:34 ` fweimer at redhat dot com
                   ` (2 subsequent siblings)
  7 siblings, 0 replies; 9+ messages in thread
From: sam at gentoo dot org @ 2023-06-11 17:52 UTC (permalink / raw)
  To: glibc-bugs

https://sourceware.org/bugzilla/show_bug.cgi?id=30428

Sam James <sam at gentoo dot org> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |sam at gentoo dot org

-- 
You are receiving this mail because:
You are on the CC list for the bug.

^ permalink raw reply	[flat|nested] 9+ messages in thread

* [Bug dynamic-link/30428] New AMD cache size computation logic does not work for some CPUs, hypervisors
  2023-05-08 16:42 [Bug dynamic-link/30428] New: New AMD cache size computation logic does not work for some CPUs, hypervisors fw at deneb dot enyo.de
                   ` (4 preceding siblings ...)
  2023-06-11 17:52 ` sam at gentoo dot org
@ 2023-07-04 17:34 ` fweimer at redhat dot com
  2023-09-14 20:33 ` jamborm at gcc dot gnu.org
  2023-09-14 20:48 ` fweimer at redhat dot com
  7 siblings, 0 replies; 9+ messages in thread
From: fweimer at redhat dot com @ 2023-07-04 17:34 UTC (permalink / raw)
  To: glibc-bugs

https://sourceware.org/bugzilla/show_bug.cgi?id=30428

--- Comment #2 from Florian Weimer <fweimer at redhat dot com> ---
Initial patch posted (still alters results compared to what we had before):

[PATCH] x86: Fix for cache computation on AMD legacy cpus.
<https://sourceware.org/pipermail/libc-alpha/2023-June/148763.html>

-- 
You are receiving this mail because:
You are on the CC list for the bug.

^ permalink raw reply	[flat|nested] 9+ messages in thread

* [Bug dynamic-link/30428] New AMD cache size computation logic does not work for some CPUs, hypervisors
  2023-05-08 16:42 [Bug dynamic-link/30428] New: New AMD cache size computation logic does not work for some CPUs, hypervisors fw at deneb dot enyo.de
                   ` (5 preceding siblings ...)
  2023-07-04 17:34 ` fweimer at redhat dot com
@ 2023-09-14 20:33 ` jamborm at gcc dot gnu.org
  2023-09-14 20:48 ` fweimer at redhat dot com
  7 siblings, 0 replies; 9+ messages in thread
From: jamborm at gcc dot gnu.org @ 2023-09-14 20:33 UTC (permalink / raw)
  To: glibc-bugs

https://sourceware.org/bugzilla/show_bug.cgi?id=30428

--- Comment #3 from Martin Jambor <jamborm at gcc dot gnu.org> ---
Has this been addressed(*)?  If so, could the respective commit ID's be posted
here for reference, please?


(*) https://sourceware.org/pipermail/libc-alpha/2023-June/148812.html would
suggest that it was but perhaps not entirely?

-- 
You are receiving this mail because:
You are on the CC list for the bug.

^ permalink raw reply	[flat|nested] 9+ messages in thread

* [Bug dynamic-link/30428] New AMD cache size computation logic does not work for some CPUs, hypervisors
  2023-05-08 16:42 [Bug dynamic-link/30428] New: New AMD cache size computation logic does not work for some CPUs, hypervisors fw at deneb dot enyo.de
                   ` (6 preceding siblings ...)
  2023-09-14 20:33 ` jamborm at gcc dot gnu.org
@ 2023-09-14 20:48 ` fweimer at redhat dot com
  7 siblings, 0 replies; 9+ messages in thread
From: fweimer at redhat dot com @ 2023-09-14 20:48 UTC (permalink / raw)
  To: glibc-bugs

https://sourceware.org/bugzilla/show_bug.cgi?id=30428

Florian Weimer <fweimer at redhat dot com> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
   Target Milestone|---                         |2.39

--- Comment #4 from Florian Weimer <fweimer at redhat dot com> ---
Fixed for 2.39 via:

commit dcad5c8578130dec7f35fd5b0885304b59f9f543
Author: Sajan Karumanchi <sajan.karumanchi@amd.com>
Date:   Tue Aug 1 15:20:55 2023 +0000

    x86: Fix for cache computation on AMD legacy cpus.

    Some legacy AMD CPUs and hypervisors have the _cpuid_ '0x8000_001D'
    set to Zero, thus resulting in zeroed-out computed cache values.
    This patch reintroduces the old way of cache computation as a
    fail-safe option to handle these exceptions.
    Fixed 'level4_cache_size' value through handle_amd().

    Reviewed-by: Premachandra Mallappa <premachandra.mallappa@amd.com>
    Tested-by: Florian Weimer <fweimer@redhat.com>

It seems that we are still missing the backport to 2.37.

-- 
You are receiving this mail because:
You are on the CC list for the bug.

^ permalink raw reply	[flat|nested] 9+ messages in thread

end of thread, other threads:[~2023-09-14 20:48 UTC | newest]

Thread overview: 9+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2023-05-08 16:42 [Bug dynamic-link/30428] New: New AMD cache size computation logic does not work for some CPUs, hypervisors fw at deneb dot enyo.de
2023-05-08 17:03 ` [Bug dynamic-link/30428] " fweimer at redhat dot com
2023-05-08 17:09 ` fweimer at redhat dot com
2023-05-08 20:29 ` fweimer at redhat dot com
2023-05-09 14:55 ` jamborm at gcc dot gnu.org
2023-06-11 17:52 ` sam at gentoo dot org
2023-07-04 17:34 ` fweimer at redhat dot com
2023-09-14 20:33 ` jamborm at gcc dot gnu.org
2023-09-14 20:48 ` fweimer at redhat dot com

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).