5 Posts
0
694
September 27th, 2022 21:00
R330 alternating fatal errors when installing OS
Recently went to install an OS (FreeBSD) on my R330, and would often get
"A fatal error was detected on a component at bus 7 device 0 function 0" or the same for bus 5.
Curiously, it would consistently alternate between the two for every boot. I haven't been able to figure out what device is actually on those buses, so any pointers there would be appreciated as well. I've tried a trio of HDDs in RAID 5 and a pair of SSDs in RAID 1.
MrTroyer
5 Posts
0
September 30th, 2022 17:00
It is definitely the RAID card. With it removed, the server boots normally.
DELL-Shine K
4 Operator
•
3K Posts
1
September 27th, 2022 21:00
If you have access to iDRAC, you can go to the System Inventory page and look for the device at "bus 7 device 0 function 0". Expand each device to check its bus, device, and function numbers.
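If the server can boot into an OS that has `lspci` (for example a Linux live image), the same lookup can be done from the command line. The sketch below converts the message text into the `bus:device.function` form that `lspci -s` accepts. The function name `message_to_bdf` is hypothetical, and it assumes the numbers in the message are decimal, which this thread does not confirm; `lspci` itself prints them in hexadecimal.

```python
import re

# Matches the "bus N device N function N" wording quoted in this thread.
_BDF_RE = re.compile(
    r"bus\s+(\d+)\s+device\s+(\d+)\s+function\s+(\d+)", re.IGNORECASE
)


def message_to_bdf(message: str) -> str:
    """Return an lspci-style address (hex bus:device.function) for a message.

    Assumption: the numbers in the message are decimal.
    """
    match = _BDF_RE.search(message)
    if match is None:
        raise ValueError("no bus/device/function triple found in message")
    bus, device, function = (int(group) for group in match.groups())
    # PCI limits: 256 buses, 32 devices per bus, 8 functions per device.
    if not (0 <= bus <= 255 and 0 <= device <= 31 and 0 <= function <= 7):
        raise ValueError("bus/device/function value out of PCI range")
    return f"{bus:02x}:{device:02x}.{function:x}"


if __name__ == "__main__":
    msg = ("A fatal error was detected on a component "
           "at bus 7 device 0 function 0")
    print(message_to_bdf(msg))
```

The resulting address can then be passed to `lspci -s <address> -vv` to show what sits at that location.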
DiegoLopez
4 Operator
•
2.7K Posts
0
September 28th, 2022 04:00
Hello @MrTroyer,
Also keep in mind that FreeBSD is not really supported on PowerEdge servers, so you may run into OS and hardware incompatibilities. You can also check the firmware levels to see if they're up to date, or use a minimum-to-POST troubleshooting approach: strip the server down to the minimum hardware, then add components back one at a time until the error returns.
Regards.
MrTroyer
5 Posts
0
September 28th, 2022 08:00
Bus 5 is labeled as Embedded P2P Bridge 1-1 - PCI Device (SH7758 PCIe Switch [PS])
Bus 7 is labeled as Embedded P2P Bridge 4-2 - PCI Device (SH7758 PCIe-PCI Bridge [PPB])
I made sure to update the firmware before attempting an install. I've also tried installing RHEL, which had the same issues. It only seems to happen when writing to the disk, which is curious as the faulty item in question is a PCIe bridge.
DELL-Chris H
Moderator
•
9.5K Posts
0
September 28th, 2022 13:00
MrTroyer,
Other than the RAID controller, do you have any other expansion cards installed? If so, have you tested with them removed? Also, you stated that you updated the firmware. Would you clarify exactly what was updated?
Let us know.
MrTroyer
5 Posts
0
September 30th, 2022 15:00
I've removed all expansion cards except the RAID controller and am still getting the same issue. Going to try removing the RAID card next.
Firmware was updated 9/29/22, via iDRAC using https://downloads.dell.com. Practically everything was updated, as the server had been decommissioned sometime during 2020.
DELL-Joey C
Moderator
•
3.9K Posts
0
October 3rd, 2022 01:00
Hi @MrTroyer,
If you have a spare controller, try swapping it in and check whether the issue still occurs. Either way, keep monitoring the server without the card to see whether the issue recurs.
MrTroyer
5 Posts
0
October 12th, 2022 15:00
I was able to procure a replacement card, which worked immediately.