Unsolved

1 Rookie

 • 

10 Posts

2461

July 20th, 2020 04:00

MD3200 Individual Physical Disk - Degraded Path and Degraded Physical Disk Channel

In Recovery Guru I have these summaries:

1.The specified physical disk channel is experiencing intermittent errors along the path to a single or several physical disks. The Recovery Guru Details area provides specific information you will need as you follow the recovery steps.

2.A physical disk channel status was set to Degraded because of excessive I/O errors or because a Technical Support Representative advised the storage array's administrator to manually set the physical disk channel status for diagnostic or other support reasons. The Recovery Guru Details area provides specific information you will need as you follow the recovery steps.

It appeared after replacing a failed Disk .The disk was properly detected and rebuilt automatically.

Both our atteched Dsik Pools are "optimal". klicking "Recheck" didn´t change anything.

what actions should I take to solve the warning?

In another post I read about manually setting the status to "optimal" through SMCLI command or to reboot the MDs.

would that be the way to go here as well?

July 20th, 2020 09:00

Hi mschreck,

Are any of the drive showing an error? Was a drive replaced? If so rebooting may help. Let us know if you have any additional questions.

1 Rookie

 • 

10 Posts

July 25th, 2020 11:00

hi

thanks for the reply. As mentioned in my post the errors showed up after replacing a failed disk.

now the pools are optimal again but the errors listed in the header are still showing in the summary guru.

July 27th, 2020 07:00

Have you rebooted?

1 Rookie

 • 

10 Posts

July 30th, 2020 11:00

actually not as I wanted to wait for a project to finish this weekend.
But today I got a new warning that confuses me even more. The newly replaced disk is giving me an impending failure warining. Usually I would manually fail the disk and replace it.
BUT in the guru it sais: "Service action (removal) allowed No X
Impending Physical Disk Failure (Medium Data Availability Risk)

I dont have a warning about the Hot spare but the removal still indicates i shouldn´t remove the disk.

What Caused the Problem?
A physical disk is reporting internal errors that could cause the
physical disk to fail. If this physical disk fails, the virtual disks in
the disk pool or disk group will become degraded. The Recovery Guru
Details area provides specific information you will need as you follow
the recovery steps.
If a Virtual Disk - Hot Spare in Use problem is also displayed in the
Recovery Guru Summary area, always fix the Impending Physical Disk
Failure problem first. Fixing the Virtual Disk - Hot Spare in Use"
failure before fixing the physical disk failure may result in data loss.
The affected virtual disks are RAID 1, 5, or 6. If the physical disk
fails, you may lose redundancy. If additional physical disks fail in
the same disk pool or disk group, you may lose data. You should
correct this problem as soon as possible.

replace it or NOT?
What´s the correct answer?

July 30th, 2020 11:00

It sounds like you have a bigger problem than just the one failed drive. You may have an array puncture with bad blocks on multiple drives that may be causing the array to fail. Do you have a backup of the data?

1 Rookie

 • 

10 Posts

July 30th, 2020 12:00

yes I´m backing up all the important data daily.

July 30th, 2020 12:00

In that case I still would reboot and then see what it shows.

1 Rookie

 • 

10 Posts

July 30th, 2020 13:00

okay thanks! I will do this on Saturday. I assume I just need to reboot the storage head or do i have to manually switch off the MDs as well?

Such a bad timing, our project finishes tomorrow. Hope will last that long...

thanks for your help

July 30th, 2020 13:00

If you can shut it all down it that would be best.

1 Rookie

 • 

10 Posts

August 17th, 2020 06:00

Just to close the subject:

luckily it was simply a faulty Drive. No punctured array or alike.

The fact that removal was NOT allowed was simply because the drive hadn´t failed YET - it only showed impending fail.

So after manually failing the drive I replaced it and all errors went away after the reconstruction of the array had completed.  

Top