Unsolved
1 Rookie
•
72 Posts
0
126
September 17th, 2024 19:38
Dell R630 2 CPU E5-2699v4 SR2JS 2.20Ghz problem Memory Slot A1
Hi guys
i am having a strange issue with the CPUs E5-2699v4 SR2JS on Dell R630 servers..
1- got a 2nd hand R630 off ebay, it came with a single core E5-2640 V3 cpu.. and no memory slot dims.. só i bought 128GB ram DDR4 2111mhz ECC for the server and it worked out.
2- updated idrac, bios, raid controller etc.. to latest available drivers and firmwares on Dell site.. running bios 2.9.xx now..
3- bought 2nd and pair of E5-2699v4 cpus SR2JS 2.20Ghz as i saw online this would be compatible with the server..
after i input the cpus and it boot i got the following error on the screen bios boot..
attached on the picture below
so i tought.. hmmm DIMM1 A1 memory just died.. i went ahead and bought another dimm samsung 16GB DDR4 2111mhz ecc exact same model from the others.. and replaced it in the server, and after BOOT the exact same error..
At this stage i tought well the server might be damaged at some point.. because it was used 2nd hand cheap.. i went on ebay and bouth a second unit chassei again no cpus, no memory, no raid controller.. when it arrived.. i did a test placed the old e5-2640v3 on slot 1 cpu and inserted the "suposed" died damage memory dim alone on slot1 and voilla it worked like magic.. so i tought humm the memory is ok not dead.. so i removed all the other dimms and tested on the new chassi. all 128GB on 8x16 ddr4 2111Mhz samsung ECC worked like a charm..
so i went ahead and did all the firmware updates necessary also because this servers had really old BIOS version.. BIOS Update, idrac, raid etc.. up to the latest version.
next step i replaced the E5-2699v4 cpus that were on the other used R630 server and placed it in the second unit bought.. and soon as it rebooted the exact same error appeard allways mentioning memory DIMM Slot A1 error ...
after the error i press F1 and it boots.. but i allways loose the 32GB slot dims from A1 and B1 inserted.. out of the 128GB only 96GB showing...
Now i am lost server works.. but this annoying error is showing up..
can this be something related to the CPUs support? CPus might be damaged?
i lost a couple os days of trial and error on this one.. the only thing left out to test is to remove cpu2 and just boot with cpu1 on it.. just for testing
could this be a power supply issue also as its running 2 redundant 495watts power supplies.
i am kind of lost now because it says its an error on the memory slot A1 but with other cpus its working.. i even for the sake of it wen and bought another 64GB DDR4 ECC 2433Mhz slots different brand.. and they work with the E5-2540v3 but with the E5-2699v4 it allways throws out the error.
Any ideas? suggestions to narrow down the error ?
update1
ok so i have tested this withMicron MTA36ASF2G72PZ-2G6E1RI 16GB 2rx4 pc4-2666v-rb2-11 with the cpus also.. and same error allways on memory slot A1.. replace the cpus with E5-2640v3 and memorys work ok boot ok with no issue or warning message on A1 slot..
running out of ideas.. anyone has a clue of what might be the correct memory? or if there is any CPU drivers update maybe missing for the cpu? running out of ideas now
DELL-Joey C
Moderator
•
3.9K Posts
0
September 18th, 2024 03:19
Hi,
Since you have already done the firmware updates, I won't troubleshoot on firmware capabilities.
PowerEdge R630 do support XEON E5-2699v4 DPN# 8923G.
You also have narrowed down to mainboard failure, since you have 2 mainboard to troubleshoot with. My suggestion is to check on the memory specifications. The processor supports DDR4 1600/1866/2133/2400 Ref: https://dell.to/3MMJxwW and the server's memory specification is https://dell.to/4gIc6tp DDR4 registered DIMMs (RDIMMs) and load-reduced DIMMs (LRDIMMs).
Here's some Dell DPN for memory:
1R8CR DIMM,16GB,2133,2RX4,4G,DDR4,R
29GM8 DIMM,64GB,2400,4RX4,8G,DDR4,LR
Praveen.Singh
3 Apprentice
•
482 Posts
0
September 18th, 2024 03:56
As far i understand the issue is not with your memory or system board, can you check if the system has a configuration lock, you can clear it through motherboard Jumpers, or check BIOS Settings as per the system model.
PortalNET2
1 Rookie
•
72 Posts
0
January 13th, 2025 23:37
Hi guys the issue was with cpu2 faulty.. i removed it and server boot fine with CPU1.. so i thought, removed CPU1 and placed CPU2 in cpu1 socket just to test the physical cpu.. and soon as i placed in and booted it showed same memory A1 error thing.. so i removed the CPU.. and swapped with the other CPu E5299v4 i had and placed it back againa in cpu1 slot.. powered on the server back up and no more memory errors.
So i can confirm it was a CPU issue.