Unsolved

September 5th, 2013 03:00

DAE replacement

One drive slot has gone faulty in a DAE on a CX4-480, and EMC recommends replacing the DAE. The drives hold data and are linked to RAID groups in other DAEs. What is the procedure for this? Also, how do I back up the data in this case?

20.4K Posts

September 5th, 2013 05:00

Do you have free capacity in other RAID groups that do not reside on this DAE? Maybe you could use LUN Migration to move the LUNs around?

1.4K Posts

September 5th, 2013 06:00

If you go for the LUN Migration option, I reckon you may need to migrate at least 3TB+ of data, and that only works if you have free space in the storage unit. Moreover, the migration time will vary and is unpredictable, and, god forbid, you may stumble into another issue where the migration is stuck at 0%!

I would rather suggest the other way round!

- Schedule a Downtime.

- Power off the Storage Unit (Gracefully)

- Label each disk location before you remove it from the Slot!

- Remove and Replace the DAE

- Make sure you place the disks back in the same slots they came from.

- Try that magical Power On!

At times, it's better to leave it as it is!
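The labelling step in the list above can also be backed by a written inventory. Below is a minimal Python sketch, assuming you note each disk's bus, enclosure, slot and serial number by hand before power-off and again after reseating. The function names and the sample serials are hypothetical, and nothing here talks to the array.

```python
def record_slot_map(disks):
    """Build a {"bus_enclosure_slot": serial} map from (bus, enclosure, slot, serial) tuples."""
    return {f"{bus}_{encl}_{slot}": serial for bus, encl, slot, serial in disks}


def verify_slot_map(before, after):
    """Return (slot, expected_serial, found_serial) for every slot that does not match."""
    problems = []
    for slot, serial in sorted(before.items()):
        found = after.get(slot)  # None if the slot is empty after the swap
        if found != serial:
            problems.append((slot, serial, found))
    return problems


# Hypothetical inventory written down before the graceful power-off
before = record_slot_map([
    (1, 2, 0, "SN-A"),
    (1, 2, 1, "SN-B"),
    (1, 2, 2, "SN-C"),
])

# Hypothetical inventory read off the labels after reseating into the new DAE
after = record_slot_map([
    (1, 2, 0, "SN-A"),
    (1, 2, 1, "SN-C"),
    (1, 2, 2, "SN-B"),
])

mismatches = verify_slot_map(before, after)
for slot, expected, found in mismatches:
    print(f"slot {slot}: expected {expected}, found {found}")
```

An empty result from `verify_slot_map` means every disk is back where it was. Any mismatch should be fixed before powering on.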

1.4K Posts

September 5th, 2013 06:00

My intention was not to scare you but to make you aware of everything before you come to a conclusion.

Search on Powerlink: I reckon you will find KB articles for LUN Migration stuck at 0%, RAID Group/LUN Expansion stuck at 0%, etc.

Why go down a route that may cause a different issue when we can't afford one?!

Anyway, there are multiple options [expensive & time-consuming]:

- If you do not have an MV license [buy it]

- Perform a remote replication!

I know you won't do that, but well, the options are there!

20.4K Posts

September 5th, 2013 06:00

I guess you don't have many options then but to go with Ankit's suggestion. Make sure you have good backups of your data before you proceed.

17 Posts

September 5th, 2013 06:00

Yeah, I have even experienced it once; that's what scared me.

The data is critical and the client is sensitive enough to spare a few bucks, so we may. We also already have Replication Manager in the environment, though it is not integrated with this box.

17 Posts

September 5th, 2013 06:00

Unfortunately no, we don't have free space on the same RAID type or RAID group. Also, Ankit's reply below kind of scared me (migration stuck at 0%).

1.4K Posts

September 5th, 2013 06:00

rahul_verma wrote:

Ankit,

Can't go for migration; we neither have free space nor is the data 3TB+.

Further doubt:
Should I shut down the whole unit or just this DAE?

Another issue with the DAE is the failed drive on it, whose hot spare also gave me a recommended replacement just now!

- Try that magical Power On! -- huh ?

- Gracefully Shutdown EMC CLARiiON [Entire Storage System]

- Ignore Recommended replacement for now.

- Try that magical Power On >>> I meant cross your fingers and power on the EMC CLARiiON.

Addition

- Replace the Recommended Replacement Disk, if everything is all right!

About the above response:

I reckoned so. That was just a redundant option!

17 Posts

September 5th, 2013 06:00

Ankit,

Can't go for migration; we neither have free space nor is the data 3TB+.

Further doubt:
Should I shut down the whole unit or just this DAE?

Another issue with the DAE is the failed drive on it, whose hot spare also gave me a recommended replacement just now!

- Try that magical Power On! -- huh ?

1.4K Posts

September 5th, 2013 07:00

One more thing, in case you are not going to implement this in the next 24 hours and you are receiving multiple events regarding the disk with Recommended replacement, e.g. Soft Media Error [Bad Block], etc.

Little change:

- Replace the disk proactively and then follow the above.

17 Posts

September 5th, 2013 07:00

Drive A failed; when I tried replacing it, I found that the slot had gone bad and the DAE needs to be replaced. Drive B took over for A as hot spare, and now B is failing as well (recommended replacement; it is on a different DAE). The rebuild is complete.

1.4K Posts

September 5th, 2013 07:00

Huh?! Okay, let me know if I have understood correctly.

You have a DAE to replace which has 1 disk already failed, and 1 disk showing Recommended replacement which happens to be a spare, right?

Was that failing spare invoked for a failed disk? If yes, you will need to wait till the rebuild/reconstruction is complete.

17 Posts

September 5th, 2013 07:00

Is it safe to replace the failing hot spare? I am not sure where the data will go. A hot-spare to hot-spare jump?

474 Posts

September 5th, 2013 08:00

Is the array on recent code levels?

Has it been upgraded recently?

Has there been a hardware change recently?

Are you seeing soft media errors on multiple disks? Across all the disks in the same DAE? On lots of disks on the same bus but not on other buses?

Has EMC Support ruled out a backend issue like a faulty LCC?
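One way to answer the "same DAE or same bus?" questions above is to tally the error counts by location. This is a rough Python sketch, assuming you have already collected soft media error counts per disk by hand from the event log and keyed them by the usual Bus_Enclosure_Disk ID. The function name and the sample counts are made up for illustration.

```python
from collections import defaultdict


def summarize_errors(error_counts):
    """Group per-disk error counts ({"bus_enclosure_slot": count}) by bus and by enclosure."""
    per_bus = defaultdict(int)
    per_enclosure = defaultdict(lambda: {"disks_with_errors": 0, "total": 0})
    for disk_id, count in error_counts.items():
        bus, encl, _slot = disk_id.split("_")
        if count > 0:
            per_bus[bus] += count
            entry = per_enclosure[f"{bus}_{encl}"]
            entry["disks_with_errors"] += 1
            entry["total"] += count
    return dict(per_bus), dict(per_enclosure)


# Hypothetical counts, not taken from any real array
sample = {"0_0_3": 0, "1_2_4": 12, "1_2_7": 30, "1_3_1": 5}
per_bus, per_enclosure = summarize_errors(sample)
print(per_bus)
print(per_enclosure)
```

If the errors pile up on one enclosure or one bus rather than on single disks, that would point more towards a backend component than towards the drives themselves, which is a question for EMC Support to confirm.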

474 Posts

September 5th, 2013 08:00

How many attempts did you make to replace the failed drive originally? I’ve seen some drives show up DOA and then the DAE is suspected, but eventually we find that the replacement disk was also bad.

If the active hot spare is also going bad, you could just be reaching the point where all the drives are old and starting to log more errors.

17 Posts

September 5th, 2013 08:00

Hey Richard,

Something sounds familiar here: I have been seeing too many soft media errors since last week. I tried replacing drive A twice already, but no luck.
