Unsolved
1 Rookie
•
11 Posts
2
183
April 18th, 2025 18:21
PowerEdge R620 iDrac Failure
Well, hello world of iDrac tortured souls. We use two Dell PowerEdge R620s, and I might have bricked one without knowing it. I disconnected one of the power cables for maintenance on the UPS. When I did that, the LCD showed the message about the power supply not having AC power. I wanted to clear the message, so I pressed the checkmark, and nothing happened. Maybe you must push all three buttons simultaneously to clear the current error message. I pressed all three buttons (Left, Check, Right) next to the LCD, and the server rebooted with the fans on 100%.
I looked everywhere online to see if pressing the three console buttons together does anything but couldn't find an answer. I can only assume it reset the iDrac or put it in some strange mode that it won't recover from.
Wow, those fans are screeching loudly!
I have googled and read a million posts with people having the same iDrac failure experience. I can't believe Dell hasn't done anything about this other than "buy a new motherboard." I'm hoping that due to my "pressing all three buttons," there's a recovery process.
Let me tell you where the server is at...
1) No lights on the iDrac maintenance Ethernet port
2) No display on the front LCD. It will show up blue for a bit, and then turn orange, but no text
3) Holding or pressing the (i) on the rear or front does nothing
4) BIOS Setup shows iDrac as not being able to read the configuration
5) Booting takes a very long time as it waits for "Initializing iDrac" for several minutes before rebooting with the fans on 100% and prompting to keep hitting F1
I've tried these things based on the millions of posts experiencing iDrac failures.
1) Cleared NVRAM with internal jumper. Waited until the boot displayed a message about it, reseated the jumper, and rebooted
2) Set the BIOS to defaults
3) Updated the BIOS to the latest using EFI boot from USB thumb drive
4) Put the firmimg. d7 on the SD card in the front panel formatted FAT32 & FAT. Restarted withholding the (i) for 15 and 30 seconds. I don't think the front SD card is active when the iDrac is offline.
5) Did the same but used the SD card on the NVram board in the unit
6) Booted off the Linux SLI 3.0 rescue live image. I started the services and tried racadm, located in /opt/dell/sbin, but it kept giving errors that prevented it from succeeding with any of the commands. (i.e., getsysinfo)
7) Disconnect the power, remove power supplies, and hold down the power button for "flea power" drain.
8) Left the machine overnight without power
9) Left the machine overnight with power
Are there any other suggestions, or do I have to buy an entirely new motherboard?
BIOS Version:
<Private data removed from public view. DELL-Admin>
iDRAC Version from BIOS Setup:
iDRAC Boot Error:
Blank LCD Display
FixMyServer
1 Rookie
•
11 Posts
1
April 18th, 2025 19:21
I should add that this is the error message which I receive in the setup bios
SWC0700: iDRAC is not ready. The configuration values cannot be accessed.
Using the SLI boot iso, I get the following error...
FixMyServer
1 Rookie
•
11 Posts
1
April 18th, 2025 19:43
The result code of racadm is 23
FixMyServer
1 Rookie
•
11 Posts
1
April 19th, 2025 16:54
Bueller? Bueller? Any suggested feedback?
I read there’s an iso media I can boot from that will update the server. I’m not sure where to find it for the r620. The few links that I see reference it go to a Dell.TO domain, which doesn’t work.
I bought a motherboard but in worried this will happen again. I’d prefer to save the motherboard as a backup - but it seems so fragile now. What precautions can we take moving forward?
FixMyServer
1 Rookie
•
11 Posts
1
April 19th, 2025 18:10
Is there a way to access IDRAC via the serial port? As documented here, I can’t seem to reproduce the result: https://buildingtents.com/2014/04/24/idrac6-recovery-through-tftp-and-serial/
I set redirection to com1 because the server has only 1 com port. I rebooted and I can see the console output on my serial terminal. However, I am not seeing any IDRAC configuration menus or anything else.
FixMyServer
1 Rookie
•
11 Posts
1
April 19th, 2025 18:22
Can someone from Dell help with the dot TO links in the forums? I find all forum posts have suggestions for my issue but they link to dot TO Dell webserver that doesn’t work.
first, the certificate is expired, and second there’s an error on the site.
we really need to get this server back online. Also, we need to understand what happened to prevent it from happening again. Did pressing the three front lcd buttons actually brick an entire server?
FixMyServer
1 Rookie
•
11 Posts
1
April 20th, 2025 16:17
Uhm, I wrote a lengthy response on my laptop and hit save. It’s about how I used the uart to debug busybox and see the EMMC has degraded. Hitting save now gives me this…
FixMyServer
1 Rookie
•
11 Posts
1
April 20th, 2025 20:38
For whatever reason I keep getting access denied on my laptop for the forum here. So I have to use my phone to post details.
It appears Dell choose emmc with a low write count. Essentially Dell has put a hardware expiry on all IDRAC servers. Because IDRAC writes logs and abuses the emmc, the expiry of a server can pretty much be predicted.
DELL-Young E
Moderator
•
5.1K Posts
1
April 21st, 2025 02:03
Hello overall when you can't access iDRAC, we eventually see mainboard replacement in the end. You can either opt for it or choosing to not use iDRAC since your system is out of warranty. You can also get a second opinion by raising an official ticket through https://www.dell.com/support/incidents-online/en-us/contactus/dynamic
Respectfully,
FixMyServer
1 Rookie
•
11 Posts
1
April 21st, 2025 06:10
Or, I fix it by building my own IDRAC and lifecycle image, configure uboot to boot from the sd card, and not have to pay Dell for a new server. Dell should be ashamed of yourselves for adding a life expectancy to your servers via emmc write limits. I’ll publish a guide so people can continue using their hardware without needing to purchase additional Dell replacements.
FixMyServer
1 Rookie
•
11 Posts
2
April 22nd, 2025 00:43
Hopefully, this will be useful to the tens of thousands of people buying new motherboards or throwing out their PowerEdge servers after experiencing iDRAC EMMC failures. This allows running iDRAC from the SD card rather than the internal EMMC. This is how Dell should have shipped the servers: by running the iDRAC off the SD card.
This document outlines how to boot the iDRAC system from an SD card on a server where the onboard eMMC is corrupt. The process involves configuring U-Boot environment variables, flashing the SD card with a firmware image using TFTP, and modifying the squashfs filesystem to reference the SD card (mmcblk1) instead of the corrupted eMMC (mmcblk0). This ensures the iDRAC system can run entirely from the SD card and avoids any failed interactions with the broken eMMC.
UBoot is the boot loader that loads the iDRAC Linux from partition 2 of the EMMC or SD.
*Note: always need to reboot and drain the flea power before booting off new sd card
(until you hear a click from the rear of the server)
*Note: SD card that is 32gb seems to work best AND make sure you check uboot log that the capacity is not a negative number. A bad sd card or incorrect size gives a negative reading and it'll never work. Can't have any bad sectors so use a new and high quality card. i'm using samsung evo's.
*Note: If you don't edit the squashfs to swap the mmcblk0 references from the scripts, it'll still boot off the SD but the scripts will mount the mmcblk0 partitions. That may not work for you if the EMMC is really screwed
*Note: Some conversations on the internet show to use the uboot "recover" command. The recover command always seems to want to write to the internal EMMC even if the SD card is set as target, as it doesn't work as expected - so use my instructions below.
# ------------------------------------------------------------------------------------------------------------
# Boot into U-Boot console fro the UART at 115,200 N-8-1
#
# Environment variables. Set these for your network
# Connect ethernet to NIC1 (don't need to use the iDRAC port)
# ------------------------------------------------------------------------------------------------------------
setenv ipaddr '10.0.0.21'
setenv netmask '255.255.255.0'
setenv gatewayip '10.0.0.1'
setenv serverip '10.0.0.10'
setenv bootargs 'root=/dev/mmcblk1p2 rootwait rw rootfstype=squashfs mem=239616k console=ttyS2,115200 <NULL> mac1=F8:BC:12:50:D3:F6 mac2=F8:BC:12:50:D3:F7 mode=vhard reset_cause=board nmi_buf=0x83000000 quiet'
setenv bootcmd 'fwu dev 1; fwu mmc sd; sleep 5; fwu boot'
setenv verify 'no'
saveenv
# ------------------------------------------------------------------------------------------------------------
# Flash the SD card with the firmimg.d7 from the TFTP server
# Get the firmimg.d7 file from Dell, I'm using 1.65.65.0
# ------------------------------------------------------------------------------------------------------------
tftpboot firmimg.d7
fwu mmc sd
fwu check 0x81000000
fwu update 0x81000000
# ------------------------------------------------------------------------------------------------------------
# Boot the SD card to make sure proper copy, you should see endless messages about being unable to write to mmcblk0
# This is becuase while we are booting from the SD card, the configuration uses mmcblk0 partitions.
# ------------------------------------------------------------------------------------------------------------
boot
# ------------------------------------------------------------------------------------------------------------
# After you have successfully booted iDRAC, you know it works.
# Put the SD Card on a Linux machine and extract, edit, rebuild...
#
# extract the second partition (live filesystem)
# ------------------------------------------------------------------------------------------------------------
mkdir ~/idrac
cd ~/idrac
unsquashfs /dev/sda/sda2
# ------------------------------------------------------------------------------------------------------------
# find references to the mmcblk0. So far they are only found in /etc/sys_apps_scripts of the extracted partition
# edit any files that have references to mmcblk0 and change to mmcblk0
# ------------------------------------------------------------------------------------------------------------
cd ~/idrac/squashfs-root/etc
grep -rv 'mmcblk0'
# ------------------------------------------------------------------------------------------------------------
# Write the squashfs file system back to the SD card. There are two partitions that contain system data.
# ------------------------------------------------------------------------------------------------------------
cd ~/idrac
mksquashfs squashfs-root/ /dev/sda2 -noappend
mksquashfs squashfs-root/ /dev/sda6 -noappend
9. Completely power off the server, disconnect power etc... put in the SD card and there you go.
It'll take a while to boot because the startup scripts will create the empty configs and such
on the SD card.
(edited)
luigi034
1 Rookie
•
1 Message
1
May 1st, 2025 17:06
Hey @FixMyServer,
Massive thanks for sharing your experience, your post is honestly a lifesaver. I've been digging around for months and this is the first proper bit of info I’ve found on getting around an iDRAC failure like this.
I was wondering if you could share exactly how you managed to interrupt U-Boot? Like what you pressed, when you pressed it, or anything specific you had to do? I'm working on a T420 but the setup looks really similar so I’d love to know how you did it.
Really appreciate any tips you can share, you’ve already helped loads just by posting what you did.
FixMyServer
1 Rookie
•
11 Posts
2
May 1st, 2025 20:05
Hi @luigi034, I plan to make a short video on the complete process. We have several more of these servers and expect to see this happen to them all soon. I wish Dell had put the SD card on the MB instead of soldered EMMC. That would have solved so many issues, including de-bricking and easier updating. Imagine simply inserting an SD card with the latest update to lessen downtime.
Anyway, let me get a short video that will explain the process. I can squeeze it in next week when I return home.
tonschk
2 Intern
•
187 Posts
1
June 21st, 2025 08:04
@FixMyServer Hi there, did you progress or went ahead with this task or project or the video you was mentioning?, thank you
Enrico
(edited)
tonschk
2 Intern
•
187 Posts
0
June 21st, 2025 19:04
Well I keep researching about this problem and various methods to try to sort out, I copy/paste here below a Reddit thread in which the latest post also about a bricked iDRAC for T620 and the guy looks like arguing was successful
https://www.reddit.com/r/homelab/comments/zhegir/dell_12g_bricked_idrac_emmc_replacement/
Just fixed a bricked iDRAC 7 on a T620 that had a botched 2.60.60 to 2.65.65 update. Replaced the eMMC, used the UART pins and a TTY console to load 1.6 firmware from TFTP then booted it from RAM which gave me the iDRAC 1.66 web console, uploaded the full 1.66 Dell Windows executable to the we console as an update AND putting the 1.66 firmimg.d7 file on a FAT16-formatted 512MB SD-Card which triggers a full re-install of 1.66 in the UART console. From there, I gradually updated to iDRAC 2.65.65 using the Dell Windows Executables running in Windows Server 2016 for each individual version from 1.66 all the way to 2.65.65. Was a PITA and time consuming but ultimately saved a server from going in the trash and wasn't all "that" hard in the end. It even picked up the Service Tag on its own, but lost the enterprise license completely.