1 Rookie
•
6 Posts
October 3rd, 2024 12:51
Serious memory error or safe to ignore?
I have a PowerEdge R640 with System BIOS version 2.8.1 and a PERC H740P Mini. It has been running VMware ESXi perfectly fine since 2020-06-18, without even a hiccup. Today it randomly went down.
- Multi-bit memory errors detected on a memory device at locations DIMM_B2
- The system memory has faced an uncorrectable multi-bit memory errors in the non-execution path of a memory device at the location DIMM_B2
I powered it down through the iDRAC, then powered it back up, and 12 hours later all seems fine. My question is: what are the implications of this error? Should I replace the DIMM, or is it fine again?
DELL-Chris H
Moderator
•
9.4K Posts
October 3rd, 2024 17:00
Myname234234,
It may be OK, but if the error becomes persistent or repetitive, that indicates a problem. There are a couple of things you can do. One is to make sure the server is up to date on BIOS, iDRAC, PERC, etc. The other is to power down the server and swap the DIMM in B2 with another DIMM in the server (noting where you moved it to). The reason is that if the error returns at the location you moved the DIMM to, we know the issue is the DIMM itself; if the error returns and is still on B2, we know the issue is with the slot the DIMM was residing in.
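If it helps to keep track of the swap test, the decision logic can be written down as a small Python sketch. This is only an illustration, not a Dell tool: the function names `dimm_slot` and `diagnose` are made up, the slot `A2` in the usage note is just an example of where the DIMM might be moved, and the pattern assumes log messages name the location the way the two lines quoted above do (`DIMM_B2`).

```python
import re
from typing import Optional

# Assumption: locations appear as "DIMM_<letter><number>", as in the
# messages quoted in this thread (e.g. "DIMM_B2").
DIMM_RE = re.compile(r"\bDIMM_([A-Z]\d+)\b")


def dimm_slot(message: str) -> Optional[str]:
    """Return the slot label (e.g. 'B2') named in a log message, or None."""
    match = DIMM_RE.search(message)
    return match.group(1) if match else None


def diagnose(original_slot: str, moved_to: str,
             new_error_slot: Optional[str]) -> str:
    """Interpret where a memory error reappears after swapping two DIMMs.

    original_slot  -- slot that first reported the error (here 'B2')
    moved_to       -- slot the suspect DIMM was moved into
    new_error_slot -- slot named by a later error, or None if none occurred
    """
    if new_error_slot is None:
        return "no recurrence so far: keep monitoring"
    if new_error_slot == moved_to:
        return "error followed the DIMM: suspect the DIMM"
    if new_error_slot == original_slot:
        return "error stayed on the slot: suspect the slot"
    return "error at an unrelated location: inconclusive"
```

For example, if the B2 DIMM were moved to A2 and a later log line named `DIMM_A2`, then `diagnose("B2", "A2", dimm_slot(line))` would point at the DIMM; a line naming `DIMM_B2` again would point at the slot.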
Let me know what you see and if this helps.
myname234234
1 Rookie
•
6 Posts
October 3rd, 2024 18:08
Thanks for your response.