1 Rookie

 • 

19 Posts

2686

September 14th, 2020 09:00

EqualLogic PS4100 - Faulted Beyond Recovery

Hello everyone,

I contact you and hope to get some help after a crash.

"uname -a" Command output

 

# uname -a
NetBSD  5.0_STABLE NetBSD 5.0_STABLE (EQL.PSS) #0: Fri Jun 19 20:27:07 EDT 2015  build@m64:/buildarea/V7.1.7__Fri_Jun_19_2015_20_20_05_EDT/bin/destdir.sbmips.64.release/EQL.PSS.64 sbmips

 

"raidtool" Command output

 

# raidtool
Driver Status: *Admin Intervention Requested*
RAID LUN 0 Faulted Beyond Recovery.
  10 Drives (0,2,4,6,8,10,11,?,f,9)
  RAID 50 (64KB sectPerSU)
  Capacity 4,608,948,699,136 bytes
Available Drives List: 1,3,7

 

"raidview" Command output

 

# raidview
Driver Status: *Admin Intervention Requested*
Reason: 0x2
Available List: 1,3,7
Failed List: NONE
Missing Drives: 196969554423508-1314432487-9

LUN R T  Drives                                                   Size    Status
--------------------------------------------------------------------------------
 0 50 H  [0,2,4,6,8][10,11,?,f,9]                                 4.2T Faulted Beyond Recovery

Checking for issues:
   1: Re-insert missing drives, and reboot.
      The LUNs must be recovered before orphaned cache can be written.
   2: Drive 3 has tripped SMART and should be replaced as soon as possible
   3: RAID LUN 0 is Faulted Beyond Recovery.  Reinsert missing drives.
      After reinserting the missing drives, a reboot may be required.

 

I tried to reboot the bay, get out and put the disk back in, change the disk, but nothing.

 

# diskview -j
Enc/Drive State      Write   Read    Power    Drive     Bad    ForceWrite  Reset   Read    Scan       Max      Max
                     Retrys  Retrys  Cycles  Timeouts  Blocks    Retrys    Fail   Timeout  Errors  Cominits  HrstMsecs
______________________________________________________________________________________________________________________
  0/ 0    Online         0       1       0       0        61       0           0      0        0        0        0
  0/ 1    Online         0       1       0       2        44       0           0      0        0        0        0
  0/ 2    Online         0       1       0       0        30       0           0      0        0        0        0
  0/ 3    Online         3       0       0       0        28       0           0      0        0        0        0
  0/ 4    Online         1       0       0       0        49       0           0      0        0        0        0
  0/ 5    Bad/Failed     0       0       0       0         0       0           0      0        0        0        0
  0/ 6    Online         0       2       0       0        39       0           0      0        0        0        0
  0/ 7    Online         0       1       0       0        54       0           0      0        0        0        0
  0/ 8    Online         0       0       0       0        14       0           0      0        0        0        0
  0/ 9    Online         0       1       0       0        24       0           0      0        0        0        0
  0/10    Online         0       1       0       0        32       0           0      0        0        0        0
  0/11    Online         0       0       0       0        31       0           0      0        0        0        0

 

In addition, Cli commands mainly return this message :

 

# Cli

The storage array is still initializing.  Limited commands will be available until the initialization is complete. Please try again later.

 

Regards,

4 Operator

 • 

1.5K Posts

September 14th, 2020 18:00

Hello, 

 Is this like a PS400?   12 bay array? 

 The bad blocks are not the problem.  The failed drives are.  RAID50 can't tolerate two failed drives in same RAIDset.  The last good drive is the critical one that has to be recovered in order to get the RAIDset back to degraded mode.   There is no other resolution to this condition. 

 If you have the data backed up, resetting and restoring with new drives is best choice.  Otherwise you will need to send all the failed/spare drives to a recovery house to see if they can get enough of the critical drive to bring back online    You might want to send them ALL the drives to be cloned, since you likely have other drives ready to fail. 

 Regards, 

 Don 

 

 

Moderator

 • 

9.3K Posts

September 14th, 2020 14:00

Hi,

 

First thing, do you have a backup of the data? What is the model of the device? It looks like multiple drives may have bad blocks and that is causing the whole RAID to fail.

1 Rookie

 • 

19 Posts

September 14th, 2020 23:00

Hello,

I actually have backups being restored on another storage array. The only sticker is :

 

Reg Model : E03J
Reg Type : E03J001
Equalogic PS4100

 

Is the problem the same as a PS400 with a RAID50 as said in the previous answer "RAID50 can't tolerate two failed drives in same RAIDset". ?

Regards,

1 Rookie

 • 

19 Posts

September 15th, 2020 04:00

Hello,

Thanks for your help, I got my answers. I'm going to juggle between my backups and my available storage bays.

Regards,

4 Operator

 • 

1.5K Posts

September 15th, 2020 04:00

Hello, 

 Re: RAID50   A RAID50 is two RAID5 striped together. So by the nature of the RAID level itself, it cannot tolerate two failures in the same stripe.  RAID6 is the level you want to run.to survive dual failures. 

Yes.  BTW, a 4100 array is not that old. 

 So the easiest option is reset this array, replace the failed drives and create it over again. 

 Regards, 

Don 

1 Rookie

 • 

19 Posts

September 15th, 2020 04:00

Hello,

Thank you for the answer.

Two questions:
- @DELL-Josh Cr : Is it possible to change the title of the discussion topic by replacing "Old" by "PS4100"?
@dwilliam62 : If I do a reset from the Cli, do I lose my data?

Regards,

 

4 Operator

 • 

1.5K Posts

September 15th, 2020 04:00

Hello, 

 Re: Reset. Yes. A reset removes all data and configuration information.  You have backups that you are restoring correct? 

 Otherwise you will most likely need to get the failed drives sent to a recovery service.  You will also need to purchase EQL compatible drives from a third party so they can transfer the recovered data to the new drives.  Then hopefully the array will come back up in degraded mode.  With an available spare it will start the rebuild process to get back to ready status. 

 Regards, 

Don 

 

4 Operator

 • 

1.5K Posts

September 15th, 2020 05:00

Hello, 

 You are very welcome!   Good luck.  If you have more questions about it, please let us know. 

Don 

 

No Events found!

Top