When a Network Switch Goes Dark for No Reason- Here's What's Actually Happening
Issue Reported by Customer Device: AS9716-32D Problem: Device became unreachable — SSH and console access failed. Customer Action Taken Power cycle performed Device recovered and became reachable again Customer Concern Why did the device suddenly become unreachable? Why was a critical temperature alert triggered at 66°C while the platform shows higher thresholds? The logs told the story During log analysis, we identified thermal warnings followed by a critical shutdown event . Temperature Spike Logs Feb 3 18:55:27 WARNING pmon#thermalctld: Temperature of CPU Package Temp changed too fast, from 32.0 to 66.0 Feb 3 18:55:27 WARNING pmon#thermalctld: Temperature of CPU Core 0 Temp changed too fast, from 32.0 to 66.0 Feb 3 18:55:46 CRIT - Monitor CPU Temp, temperature is 66.0. Temperature is over 66.0. Need shutdown DUT. Immediately after this event: Device stopped responding SSH and console access unavailable After the reboot: all clear After the power cycle, system tempera...