Scheduled downtime of all our HPC systems on Tuesday, August 26

There will be a scheduled downtime of all HPC systems of NHR@FAU on Tuesday, August 26, starting at 8:00 and lasting the whole day.

The reason for the downtime is maintenance on the central fileservers, in particular /home/atuin (which hosts $WORK for NHR and BayernKI projects) and /home/hpc (which hosts the main home directories for ALL users).

Jobs that would collide with the downtime will automatically be postponed until after the downtime. While the frontends and some of the fileservers will be available most of the time, you should expect frequent hangs and interruptions because the main fileservers will be unavailable.

We will keep this post updated with progress reports.

Update 17:05: Fileserver maintenance has finished. We will now reboot a few cluster frontends and csnhr. Batch processing will resume soon.

Update 17:45: Batch processing on TinyGPU and TinyFat has resumed.

Update 17:50: Batch processing on Alex and Woody has resumed.

Update 18:00: Batch processing on Fritz and Meggie has resumed. Apart from minor corrections, we’re done. Please report any remaining problems.