HPCOutage of HPC services due to file system issues [SOLVED]
On Friday, February 9th @ 19:30, one of the network file server (NFS) stopped working. As a consequence, many login and compute nodes became unresponsive as a major file system ($WORK for NHR projects) could not be accessed. The NFS server (atuin) has been rebooted and operation seems to be stable again since Saturday, February […]On Friday, February 9th @ 19:30, one of the network file server (NFS) stopped working. As a consequence, many login and compute nodes became unresponsive as a major file system ($WORK for NHR projects) could not be accessed. The NFS server (atuin) has been rebooted and operation seems to be stable again since Saturday, February […]