Difference between revisions of "Troubleshooting Unhealthy Nodes"

From Internet Computer Wiki
Jump to: navigation, search
m
m
Line 4: Line 4:
 
** If any link lights are off, check for failed cables by swapping them out for known good cables as needed.
 
** If any link lights are off, check for failed cables by swapping them out for known good cables as needed.
 
*Hook up a crash cart and check for errors on the screen, troubleshoot as needed
 
*Hook up a crash cart and check for errors on the screen, troubleshoot as needed
* Contact Dell if hardware issues are found or suspected. Dell can also tell you exactly which firmware updates should be done.
+
* Contact Dell if hardware issues are found or suspected. Dell can also tell you exactly which [[Updating Firmware|firmware updates]] should be done.
 
** If Dell requires a TSR log, see [[IDRAC access and TSR logs]]
 
** If Dell requires a TSR log, see [[IDRAC access and TSR logs]]
 
** [[Updating_Firmware|Updating the firmware]] might also resolve the issue.
 
** [[Updating_Firmware|Updating the firmware]] might also resolve the issue.
  
 
[[Node Provider Troubleshooting|All Node Provider Troubleshooting links]]
 
[[Node Provider Troubleshooting|All Node Provider Troubleshooting links]]

Revision as of 14:31, 5 April 2023

Steps to take when a server is unhealthy, but the connectivity in the data center is functioning correctly:

  • Ensure that the server is powered on.
  • Ensure that all link lights for active network interfaces are on.
    • If any link lights are off, check for failed cables by swapping them out for known good cables as needed.
  • Hook up a crash cart and check for errors on the screen, troubleshoot as needed
  • Contact Dell if hardware issues are found or suspected. Dell can also tell you exactly which firmware updates should be done.

All Node Provider Troubleshooting links