Downtime of all our clusters on Thursday, February 13
There will be a scheduled downtime of all the HPC systems of NHR@FAU starting this
Thursday, 13.02. at 9:00 and expected to last until about 17:00.
As usual, Jobs that would collide with the downtime will be postponed until the downtime is over. Frontends and filesystems will be available most of the time, but there will be short interruptions on all clusters, and longer interruptions on Fritz and Alex.
Reason for the downtime is exchanging of defective hardware parts (part of which is urgent, hence the short notice), and software maintenance.
We will keep updating this post with status information.
- 12:50 – batch processing has been resumed on Woody; Slurm has been updated to 24.11
- 16:10 – batch processing has been resumed on Alex; Slurm has been updated to 24.11
- 16:15 – batch processing has been resumed on Helma
- 16:20 – batch processing has been resumed on Fritz; Slurm has been updated to 24.11
- 16:20 – batch processing has been resumed on Meggie
- 16:30 – batch processing has been resumed on TinyFAT and parts of TinyGPU
- 19:45 – GPU resources for Tier3-Jupyterhub are available again
- 14.02. 09:00 – batch processing has been resumed on the remaining parts of TinyGPU
- 14.02. 09:00 – we’re done. Please report remaining issues.