January 6th, 2020 23:00

Unable to upgrade firmware on VNXe3200 possibly due to faulted SP

Hi, I'm working on trying to upgrade the firmware for a VNXe3200 device. When running the health check prior to the upgrade, I receive this error:

"The health check has failed. A problem has been detected with a backend disk processor enclosure. Ensure that the backend cabling is correct. Error code: platform::check_for_enclosure_faults_2."

Under system health for the DPE, I see this error:

"The Disk Processor Enclosure (DPE) has faulted. This may have occurred because of a faulted SP."

I've verified that the management ports for both SPs are connected to the network on the same VLAN, and I'm able to ping them both. Using the IPMI Tool, I logged into SP A and SP B and ran

svc_networkcheck -r

and all tests show "The interconnect is OK".

Verified that SP A is the master/primary SP.

I've cleared boot counters with 

svc_rescue_state -c

and rebooted both SPs. When the SPs boot, there's an error in the log in the web GUI that says:

System [storage] has experienced one or more problems that have had a major impact.

I re-imaged both SPs with the following:

svc_rescue_state -s
svc_shutdown -r
svc_reimage -r

Both the DPE error under System Health and the Health Check error are still showing.

Unfortunately, I'm not very familiar with these EMC storage devices. Thanks in advance!

January 22nd, 2020 14:00

After running

svc_diag --state=spinfo

I got the following error in the "BIOS and EPOS Errors" section:

  • ERROR: Type:2; Severity: 80; Class:0; Subclass:6; Operation:800b
      Type:2; EFI_ERROR_CODE
      Severity:80; EFI_ERROR_MAJOR
      Class:0; EFI_COMPUTING_UNIT
      Subclass:6; EFI_COMPUTING_UNIT_CHIPSET
      Operation:800b; DXE_SB_BAD_BATTERY (EFI_COMPUTING_UNIT_CHIPSET | AMI_CHIPSET_EC_BAD_BATTERY)
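
The spinfo output can be long, so filtering it for battery-related codes is a quick first pass. This is a minimal sketch, assuming the output of the command above was saved to a file and that a standard grep is available in the shell you are using; the file name spinfo.txt and the function name battery_errors are hypothetical.

```shell
# Hypothetical helper: print battery-related lines from saved svc_diag output.
# Assumed usage:
#   svc_diag --state=spinfo > spinfo.txt
#   battery_errors spinfo.txt
battery_errors() {
    # -i makes the match case-insensitive, so it catches codes such as
    # DXE_SB_BAD_BATTERY as well as plain-text "battery" messages.
    grep -i 'battery' "$1"
}
```

No output means no line mentioned a battery; it does not rule out other hardware faults.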

I reviewed the EMCSystemLogFile.log under /EMC/backend/log_shared and found the following errors for both SP A and B:

  • “SPA CPU low battery fault detected.” :: Category=System Component=espkg
  • “SPA BMC 0 BIST(Built In Self Test) Fault detected. Reason: Virtual UART0 Test Failure, Virtual UART1 Test Failure, UART4 Test Failure, Action: Replace SP.” :: Category=System Component=espkg
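
To see at a glance whether both SPs report these faults, the log can be searched per SP. This is a rough sketch, assuming the SP B entries use the same wording with an "SPB" prefix and that grep is available; the function name sp_faults is hypothetical, and the default path is the one mentioned above.

```shell
# Hypothetical helper: count low-battery and BMC self-test fault lines per SP.
# Optional first argument: path to the log file.
sp_faults() {
    log="${1:-/EMC/backend/log_shared/EMCSystemLogFile.log}"
    for sp in SPA SPB; do
        # grep -c prints the number of matching lines; "|| true" keeps the
        # function going when grep exits non-zero because the count is 0.
        count=$(grep -cE "$sp (CPU low battery fault|BMC [0-9]+ BIST)" "$log" || true)
        echo "$sp: $count"
    done
}
```

Running sp_faults with no argument reads the default log path; a non-zero count for both SPs is consistent with the same fault on each.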

The issue turned out to be that both SPs had failed CMOS batteries (CR2032). After replacing the CMOS battery in both SPs, the health check passed and we were able to upgrade the firmware.

4 Operator

8.6K Posts

January 9th, 2020 05:00

You need to fix the errors before you can upgrade.

I would suggest opening a service request and having support troubleshoot what's wrong.

Moderator

7.6K Posts

December 10th, 2020 12:00

Hello Colman,

The CMOS battery is located on each storage processor. You will need to remove the SP from the rear of your enclosure and then remove the cover on the SP to replace the battery.

1 Rookie

5 Posts

December 10th, 2020 12:00

@jerdub1993 Where is that CMOS battery located? We have the same issue, but I cannot find anything about where the battery is located without taking the whole unit apart.

Thank you!

Colman

1 Rookie

5 Posts

December 10th, 2020 13:00

OK, so I feel like a tool. I took the SP out again and looked it over, only to find the CMOS battery inserted vertically into a slot rather than lying flat on the circuit board as I was expecting... Oh boy, it has been that kind of day!

Thank you.

-Colman

Moderator

7.6K Posts

December 10th, 2020 13:00

Hello Colman,

Not a problem. Glad that you were able to find the battery.
