Problematic Machines

From DISI
Revision as of 21:16, 7 May 2018 by Benrwong (talk | contribs) (Created problem machines log. have entries for n-5-23 and dalet)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to navigation Jump to search

This is a log of machines that are actively used on our cluster. If any machines encounter crashing/shutdown issues, we should log them and then search through the machine logs for further information regarding the trouble their experience. If there are repeat offenders (three crashes without any further information), we should consider dismantling the machine.

Problematic Events Log

n-5-23

2018-04-30 Found machine with power button in amber.  Booted upon seeing this.
2018-05-07 Similar issue to 4/30.  Found machine in amber power state.  

dalet

2018-05-05 machine was found to be off.  Logs indicate all processes received a TERM signal and the server entered a graceful shutdown.  I see no indication of any user initiating a shutdown.  /var/log/secure shows indication of receiving SIGNAL 15 (unexpected reboot).