Troubleshooting Unhealthy Nodes

From Internet Computer Wiki
Revision as of 17:21, 5 April 2023 by Katie.peters (talk | contribs) (Adding redeployment steps)
Jump to: navigation, search

Steps to take when a server is unhealthy, but the connectivity in the data center is functioning correctly:

  • Ensure that the server is powered on.
  • Ensure that all link lights for active network interfaces are on.
    • If any link lights are off, check for failed cables by swapping them out for known good cables as needed.
  • Hook up a crash cart and check for errors on the screen, troubleshoot as needed
  • Contact Dell if hardware issues are found or suspected.
  • If no known error is found, please redeploy the node with a fresh IC-OS image.
    • The deployment process identifies/fixes some issues.
    • At the end, obtain the new principal ID for the node from the crash cart screen. Then search for the node's principal on the IC dashboard to verify that the node is healthy.

All Node Provider Troubleshooting links