Scheduled downtime of all our HPC systems from Sunday, June 08 until at least the evening of Tuesday, June 10
There will be a scheduled downtime of all the HPC systems of NHR@FAU
starting on Sunday, June 08 (Pentecost), at 9:00,
and lasting at least until the evening of Tuesday, June 10.
Main reason for the unusually long downtime is part 1 of the reconfiguration of /home/atuin
(which hosts $WORK
for NHR and BayernKI projects) in the hopes of fixing at least some of its longstanding problems. In addition, there will also be some general maintenance work on a bunch of our other systems on Tuesday.
For most users, jobs that would collide with the downtime will automatically be postponed until after the downtime. Frontends and fileservers (except /home/atuin
) will be available on Sunday and Monday, but there will be some interruptions on Tuesday. /home/atuin
will be unavailable Sunday through Tuesday.
Unfortunately, there is a handful of users (less than 10) on /home/atuin
who misuse the filesystem so badly that copying their data will simply not be feasible by Tuesday. These users will be notified seperately, and their jobs will have to be cancelled and resubmitted manually once the copy for their files finishes, because we feel it would be unfair to let >1000 users wait for <10 filesystem misusers.
We will keep this post updated.