Unsolved
46 Posts
0
1820
December 31st, 2020 02:00
X200 both boot drives wear_life threshold exceeded
Hi,
We are using a cluster of IQ6000X and X200 nodes.
Recently we had a power failure on site, our 30kVA UPS was holding but it went offline just before the power came back.
After powering on the cluster node no 8 (X200) reported 2 critical errors:
Drive at Internal J4/ad7 wear_life threshold exceeded: 100 (Threshold: 95). Please schedule drive replacement immediately.
Drive at Internal J3/ad4 wear_life threshold exceeded: 100 (Threshold: 95). Please schedule drive replacement immediately
I found documentation how to replace the boot drive but when I ran the commands to detect witch drives are failing I got that both drives are working:
gyar-8# atacontrol list
ATA channel 0:
Master: no device present
Slave: no device present
ATA channel 1:
Master: no device present
Slave: no device present
ATA channel 2:
Master: ad4 Serial ATA v1.0 II
Slave: no device present
ATA channel 3:
Master: no device present
Slave: ad7 Serial ATA v1.0 II
ATA channel 4:
Master: no device present
Slave: no device present
ATA channel 5:
Master: no device present
Slave: no device present
gyar-8# gmirror status
Name Status Components
mirror/root1 COMPLETE ad7p5
ad4p5
mirror/keystore COMPLETE ad7p12
ad4p11
mirror/mfg COMPLETE ad7p9
ad4p9
mirror/journal-backup COMPLETE ad7p8
ad4p8
mirror/var1 COMPLETE ad7p7
ad4p7
mirror/var0 COMPLETE ad7p6
ad4p6
mirror/root0 COMPLETE ad7p4
ad4p4
mirror/var-crash COMPLETE ad4p10
Could someone tell me if I really need to replace the drives? Do I need to replace both? What would be the procedure in this case when both drives are failing? (in the document there is no description for this case).
We do not have support anymore, so it's on me to solve this issue.
Many thx
events found

