Difference between revisions of "Troubleshooting Unhealthy Nodes"
From Internet Computer Wiki
Katie.peters (talk | contribs) |
Katie.peters (talk | contribs) m |
||
Line 1: | Line 1: | ||
Steps to take when a server is unhealthy, but the connectivity in the data center is functioning correctly: | Steps to take when a server is unhealthy, but the connectivity in the data center is functioning correctly: | ||
− | * | + | * Ensure that the server is powered on. |
− | ** If | + | * Ensure that all link lights for active network interfaces are on. |
− | *Hook up a crash cart and check for errors on the screen | + | ** If any link lights are off, check for failed cables by swapping them out for known good cables as needed. |
− | + | *Hook up a crash cart and check for errors on the screen, troubleshoot as needed | |
+ | * Contact Dell if hardware issues are found or suspected. Dell can also tell you exactly which firmware updates should be done. | ||
** If Dell requires a TSR log, see [[IDRAC access and TSR logs]] | ** If Dell requires a TSR log, see [[IDRAC access and TSR logs]] | ||
** [[Updating_Firmware|Updating the firmware]] might also resolve the issue. | ** [[Updating_Firmware|Updating the firmware]] might also resolve the issue. | ||
[[Node Provider Troubleshooting|All Node Provider Troubleshooting links]] | [[Node Provider Troubleshooting|All Node Provider Troubleshooting links]] |
Revision as of 14:30, 5 April 2023
Steps to take when a server is unhealthy, but the connectivity in the data center is functioning correctly:
- Ensure that the server is powered on.
- Ensure that all link lights for active network interfaces are on.
- If any link lights are off, check for failed cables by swapping them out for known good cables as needed.
- Hook up a crash cart and check for errors on the screen, troubleshoot as needed
- Contact Dell if hardware issues are found or suspected. Dell can also tell you exactly which firmware updates should be done.
- If Dell requires a TSR log, see IDRAC access and TSR logs
- Updating the firmware might also resolve the issue.