July 18th, 2022 09:00
MEM0001 Multi-bit memory error on DIMM_B2. Reseat memory.
My company uses a Dell PowerEdge 730 rack server with 24 cores, 64 GB RAM, and 2 GB swap. Today I was doing routine cleaning while a job was running on the server, using 40 GB+ of RAM and 475 MB+ of swap. I took the removable coolers out one by one and cleaned them outside the chassis. All of a sudden the machine froze and then rebooted on its own, showing this error on the display: "MEM0001 Multi-bit memory error on DIMM_B2. Reseat memory." The display backlight was blinking yellow.
After the automatic reboot finished, I submitted a small run and monitored it with the htop command. It showed RAM usage, but swap stayed at 0/2 GB while the program was running. Once the program finished, I switched the machine off and on again. After that, the yellow blinking stopped and there was no error message on the display. I then tried some simple runs while monitoring with htop; both swap and RAM usage showed up, and everything looked normal, as before. Then I submitted the computationally larger job again and it crashed again, with the same yellow blinking and the same error message.
P.S. - Over the past 20-30 days I submitted and completed larger systems and computationally heavier programs, and everything was absolutely fine.
How can I solve this problem?
DELL-Charles R
Moderator
July 18th, 2022 13:00
Hello roboloc,
You will want to replace a DIMM that reports multi-bit errors.
You can swap the DIMMs in B2 and A1 to see whether the error follows the DIMM or stays in the slot.
After you swap the DIMM locations, you can run the built-in diagnostics to test:
Press F11 at the Dell splash screen, then select Boot Manager -> System Utilities -> Launch Dell Diagnostics. Note any messages and continue testing.
I would also recommend making sure you are up to date on the iDRAC and BIOS firmware.
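The reasoning behind the swap test can be written down as a small decision helper. This is only an illustrative sketch: the function name `interpret_dimm_swap` and the wording of the conclusions are assumptions made for this example, not Dell tooling or an official procedure.

```python
from typing import Optional


def interpret_dimm_swap(original_slot: str,
                        swapped_with: str,
                        slot_after_swap: Optional[str]) -> str:
    """Interpret which slot reports the error after two DIMMs were swapped.

    original_slot   -- slot named in the first error (for example "B2")
    swapped_with    -- slot the suspect DIMM was moved to (for example "A1")
    slot_after_swap -- slot named in the error after the swap, or None
                       if the error has not come back yet
    """
    if slot_after_swap is None:
        return "No error reproduced yet; keep testing under load."
    if slot_after_swap == swapped_with:
        # The error moved together with the module.
        return "Error followed the DIMM; the DIMM is the likely fault."
    if slot_after_swap == original_slot:
        # The error stayed put even though a different module sits there now.
        return "Error stayed in the slot; suspect the slot or system board."
    return "Error appeared in an unrelated slot; investigate further."


if __name__ == "__main__":
    print(interpret_dimm_swap("B2", "A1", "A1"))
```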
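Since the server in this thread appears to run Linux (htop was used for monitoring), the kernel's EDAC counters can be a useful cross-check from the running OS alongside the built-in diagnostics. The sketch below assumes the EDAC driver for the platform is loaded and exposes `ce_count` (corrected errors) and `ue_count` (uncorrected errors) under `/sys/devices/system/edac/mc/`; if that directory is missing, the function simply returns an empty result. The function name `read_edac_counts` is made up for this example.

```python
from pathlib import Path


def read_edac_counts(root: str = "/sys/devices/system/edac/mc") -> dict:
    """Return {controller: {"ce_count": int|None, "ue_count": int|None}}."""
    counts = {}
    base = Path(root)
    if not base.is_dir():
        # EDAC not available (driver not loaded, or not a Linux host).
        return counts
    for mc in sorted(base.glob("mc[0-9]*")):
        entry = {}
        for name in ("ce_count", "ue_count"):
            try:
                entry[name] = int((mc / name).read_text().strip())
            except (OSError, ValueError):
                # Counter file missing or unreadable on this kernel.
                entry[name] = None
        counts[mc.name] = entry
    return counts


if __name__ == "__main__":
    result = read_edac_counts()
    if not result:
        print("No EDAC memory controllers found.")
    for controller, entry in result.items():
        print(f"{controller}: corrected={entry['ce_count']} "
              f"uncorrected={entry['ue_count']}")
```

A rising `ue_count` between runs would be consistent with the multi-bit error shown on the front panel, though the front-panel message and the system event log remain the authoritative record.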
https://dell.to/3PjW4aH
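Before updating, it can help to record which BIOS version is currently installed. On a Linux host this is usually readable from the DMI entries under `/sys/class/dmi/id/`. The sketch below assumes those files exist and are readable on the system; the helper name `read_dmi_field` is hypothetical. It does not cover the iDRAC firmware version, which has to be checked through the iDRAC itself.

```python
from pathlib import Path
from typing import Optional


def read_dmi_field(name: str, root: str = "/sys/class/dmi/id") -> Optional[str]:
    """Return the contents of one DMI field, or None if it is unavailable."""
    try:
        return (Path(root) / name).read_text().strip()
    except OSError:
        # File missing or not readable (some fields require root).
        return None


if __name__ == "__main__":
    # Print the system model plus the BIOS version and release date.
    for field in ("product_name", "bios_version", "bios_date"):
        print(f"{field}: {read_dmi_field(field)}")
```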