public inbox for overseers@sourceware.org
 help / color / mirror / Atom feed
* RAID errors on sdi
@ 2018-01-15 20:50 Joseph Myers
  2018-01-15 21:03 ` Frank Ch. Eigler
  0 siblings, 1 reply; 4+ messages in thread
From: Joseph Myers @ 2018-01-15 20:50 UTC (permalink / raw)
  To: overseers

sourceware is being very slow at present, with dmesg showing RAID errors 
on sdi that may well be responsible, e.g.:

[710171.863023] megaraid_sas 0000:15:00.0: 492931 (569364361s/0x0002/FATAL) - Unrecoverable medium error during recovery on PD 10(e0x12/s8) at 7e01c65
[710177.173822] megaraid_sas 0000:15:00.0: 492934 (569364366s/0x0002/FATAL) - Unrecoverable medium error during recovery on PD 10(e0x12/s8) at 7e01c65
[710182.442559] megaraid_sas 0000:15:00.0: 492937 (569364371s/0x0002/FATAL) - Unrecoverable medium error during recovery on PD 10(e0x12/s8) at 7e01c65
[710187.778135] megaraid_sas 0000:15:00.0: 492940 (569364377s/0x0002/FATAL) - Unrecoverable medium error during recovery on PD 10(e0x12/s8) at 7e01c65
[710192.979496] megaraid_sas 0000:15:00.0: 492943 (569364382s/0x0002/FATAL) - Unrecoverable medium error during recovery on PD 10(e0x12/s8) at 7e01c65
[710198.106849] megaraid_sas 0000:15:00.0: 492946 (569364387s/0x0002/FATAL) - Unrecoverable medium error during recovery on PD 10(e0x12/s8) at 7e01c65
[710198.117613] sd 0:2:8:0: [sdi]  Result: hostbyte=DID_ERROR driverbyte=DRIVER_OK
[710198.117978] sd 0:2:8:0: [sdi] CDB: Read(10): 28 00 07 e0 1c 60 00 00 08 00
[710198.118206] end_request: I/O error, dev sdi, sector 132127840
[710198.427902] md/raid:md3: read error corrected (8 sectors at 132125792 on sdi1)

(lots more similar errors before that).

-- 
Joseph S. Myers
joseph@codesourcery.com

^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: RAID errors on sdi
  2018-01-15 20:50 RAID errors on sdi Joseph Myers
@ 2018-01-15 21:03 ` Frank Ch. Eigler
  2018-01-18 20:36   ` Joseph Myers
  0 siblings, 1 reply; 4+ messages in thread
From: Frank Ch. Eigler @ 2018-01-15 21:03 UTC (permalink / raw)
  To: Joseph Myers; +Cc: overseers

Hi -

> sourceware is being very slow at present, with dmesg showing RAID errors 
> on sdi that may well be responsible, e.g.:

Thanks, I am engaging on-site sysadmins to swap that disk.

- FChE

^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: RAID errors on sdi
  2018-01-15 21:03 ` Frank Ch. Eigler
@ 2018-01-18 20:36   ` Joseph Myers
  2018-01-18 20:45     ` Frank Ch. Eigler
  0 siblings, 1 reply; 4+ messages in thread
From: Joseph Myers @ 2018-01-18 20:36 UTC (permalink / raw)
  To: Frank Ch. Eigler; +Cc: overseers

In case you didn't already notice it, there's a message shown by dmesg:

[86524.889900] megaraid_sas 0000:15:00.0: 95 (569544782s/0x0008/FATAL) - Battery has failed and cannot support data retention. Please replace the battery

(I don't know if that would affect performance at all, if e.g. it means 
any hardware write caching is disabled.)

-- 
Joseph S. Myers
joseph@codesourcery.com

^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: RAID errors on sdi
  2018-01-18 20:36   ` Joseph Myers
@ 2018-01-18 20:45     ` Frank Ch. Eigler
  0 siblings, 0 replies; 4+ messages in thread
From: Frank Ch. Eigler @ 2018-01-18 20:45 UTC (permalink / raw)
  To: Joseph Myers; +Cc: overseers

Hi -

> In case you didn't already notice it, there's a message shown by dmesg:
> 
> [86524.889900] megaraid_sas 0000:15:00.0: 95 (569544782s/0x0008/FATAL) - Battery has failed and cannot support data retention. Please replace the battery
> 
> (I don't know if that would affect performance at all, if e.g. it means 
> any hardware write caching is disabled.)

Yup, aware of it; nope, shouldn't really affect anything.
The NVMe SSD is the primary/gating target for I/O traffic.

- FChE

^ permalink raw reply	[flat|nested] 4+ messages in thread

end of thread, other threads:[~2018-01-18 20:45 UTC | newest]

Thread overview: 4+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2018-01-15 20:50 RAID errors on sdi Joseph Myers
2018-01-15 21:03 ` Frank Ch. Eigler
2018-01-18 20:36   ` Joseph Myers
2018-01-18 20:45     ` Frank Ch. Eigler

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).