Difference between revisions of "Unhealthy Nodes"

From Internet Computer Wiki
Jump to: navigation, search
m
Line 1: Line 1:
 
Steps to take when a server is unhealthy, but the connectivity in the data center is functioning correctly:
 
Steps to take when a server is unhealthy, but the connectivity in the data center is functioning correctly:
* Check for indicator lights on the server that are no longer green.
+
* Ensure that the server is powered on.
** If the server lights indicate an issue, then check for failed cables by swapping them out for known good cables as needed.
+
* Ensure that all link lights for active network interfaces are on.
*Hook up a crash cart and check for errors on the screen
+
** If any link lights are off, check for failed cables by swapping them out for known good cables as needed.
** Contact Dell if hardware issues are found.
+
*Hook up a crash cart and check for errors on the screen, troubleshoot as needed
 +
* Contact Dell if hardware issues are found or suspected. Dell can also tell you exactly which firmware updates should be done.
 
** If Dell requires a TSR log, see [[IDRAC access and TSR logs]]
 
** If Dell requires a TSR log, see [[IDRAC access and TSR logs]]
 
** [[Updating_Firmware|Updating the firmware]] might also resolve the issue.
 
** [[Updating_Firmware|Updating the firmware]] might also resolve the issue.
  
 
[[Node Provider Troubleshooting|All Node Provider Troubleshooting links]]
 
[[Node Provider Troubleshooting|All Node Provider Troubleshooting links]]

Revision as of 14:30, 5 April 2023

Steps to take when a server is unhealthy, but the connectivity in the data center is functioning correctly:

  • Ensure that the server is powered on.
  • Ensure that all link lights for active network interfaces are on.
    • If any link lights are off, check for failed cables by swapping them out for known good cables as needed.
  • Hook up a crash cart and check for errors on the screen, troubleshoot as needed
  • Contact Dell if hardware issues are found or suspected. Dell can also tell you exactly which firmware updates should be done.

All Node Provider Troubleshooting links