Unsolved
1 Rookie
•
15 Posts
0
891
January 19th, 2024 02:08
Poweredge R430 will not power on. CPU1 Thermal Trip event
The server attempts to power on, the fans rev up for a second then it shuts off. IDRAC displays CPU1 CPU0001 Thermal trip event. check fans and replace heatsink. I have checked and all the fans are fine. I replaced the heatsink. I checked there was plenty of thermal compound between the CPU and the heatsink. I swapped the two processors. the issue only remained on CPU1. so luckily its not a CPU issue. no other logs from IDRAC. Is there a temperature setting that is set to low? or should I just replace the motherboard or is there something else before.
No Events found!
DELL-Joey C
Moderator
•
3.9K Posts
0
January 19th, 2024 07:41
Hi,
For this particular error, it might not be the CPU issue. It could be memory issue. How many memory are installed? Can remove all of them and remain 1 in slot 1. Then check for the error. Swap with other memory if the error persist.
grimm1369
1 Rookie
•
15 Posts
0
January 19th, 2024 13:16
I have 12 16GB of RAM for a total of 192GB. I removed all but one from slot A1 and still received the error. I removed that stick of RAM and still had the error CPU 1 has a thermal trip (over-temperature) event.
DELL-Charles R
Moderator
•
4.4K Posts
0
January 19th, 2024 13:57
Hello grimm1369,
It sounds like you have done some good troubleshooting.
These are a few more troubleshooting steps you can try and some you have already done:
*Ensure the airflow to the machine is not blocked. Placing it in an enclosed area or blocking the vent holes can cause it to overheat. If installed in a rack, make sure the rack cooling system is working ok.
*Verify the ambient temperature is within acceptable levels.
*Check the internal system fans for obstructions and verify all fans are spinning properly. Swap any failing fans with a known-good fan for testing.
*Verify any required shroud or any required blanks are installed (power supply, hard drives, DIMM, riser, fan etc.).
*If all of the fans are spinning properly, verify that the heatsink is installed correctly and thermal grease is applied.
*For multi-processor servers, you can attempt to test each processor in the first position.
Also try Minimum to Post configuration:
The minimum components to allow the Dell™ PowerEdge™ R430 to complete POST are:
*System Board
*CPU1 with heat sink (minimum for troubleshooting)( also test with second CPU in socket 1)
*Single DIMM configuration with DIMM in socket A1 (also test with an alternate DIMM in A1)
*Control panel with cable (to the power system on)
*Power supply unit (and PDB/PIB)
Remove anything not on that list: DVD, Hard drives, PERC controller, backplane, network card, NIC cable, any pcie cards, keyboard, mouse, USB devices, …. anything not on the list remove.
If it does not post then the issue is with one of those components listed.
If you get successful POST, put things back a little at a time until you find the faulting component.
grimm1369
1 Rookie
•
15 Posts
0
January 20th, 2024 21:16
*Ensure the airflow to the machine is not blocked. Placing it in an enclosed area or blocking the vent holes can cause it to overheat. If installed in a rack, make sure the rack cooling system is working ok. The sever is on a table so Air flow is good
*Verify the ambient temperature is within acceptable levels. IDRAC is displaying the ambient is in acceptable range.
*Check the internal system fans for obstructions and verify all fans are spinning properly. Swap any failing fans with a known-good fan for testing. Fans rev up when starting up.
*Verify any required shroud or any required blanks are installed (power supply, hard drives, DIMM, riser, fan etc.). Verified blanks are installed
*If all of the fans are spinning properly, verify that the heatsink is installed correctly and thermal grease is applied. I verified the heatsink and thermal grease is applied correctly.
*For multi-processor servers, you can attempt to test each processor in the first position. I swapped the processors and the issue remain.
Also try Minimum to Post configuration:
The minimum components to allow the Dell™ PowerEdge™ R430 to complete POST are:
*System Board
*CPU1 with heat sink (minimum for troubleshooting)( also test with second CPU in socket 1) tested
*Single DIMM configuration with DIMM in socket A1 (also test with an alternate DIMM in A1) tested
*Control panel with cable (to the power system on)
*Power supply unit (and PDB/PIB) tested
Remove anything not on that list: DVD, Hard drives, PERC controller, backplane, network card, NIC cable, any pcie cards, keyboard, mouse, USB devices, …. anything not on the list remove. external devices removed. Only the IDRAC has a network cable plugged in.
If it does not post then the issue is with one of those components listed. does not post
If you get successful POST, put things back a little at a time until you find the faulting component.
grimm1369
1 Rookie
•
15 Posts
0
January 21st, 2024 04:21
I have removed the processor from CPU2 and still get the same error. at this point I am thinking to replace the motherboard.
grimm1369
1 Rookie
•
15 Posts
0
January 21st, 2024 04:46
I reseated everything. I inspected the CPU socket. i removed the riser card. i dont know what else to do.
DELL-Young E
Moderator
•
5.1K Posts
0
January 21st, 2024 23:24
Hello, I'm afraid it's time you may need to replace the motherboard. Could you please get in touch with the tech support?
https://dell.to/48IwuWF
Respectfully,
grimm1369
1 Rookie
•
15 Posts
0
January 22nd, 2024 00:08
Ill have to buy one off a popular website. My server is end of life and there are no support options available.