Systems

Since 20:10 on January 30, the parallel Lustre file system ($FASTTMP) on the Fritz cluster is unavailable due to repeated server crashes. The issue is being investigated. Update 2025-01-31 17:00: The issue turned out to be a faulty component in the Infiniband network of Fritz, the Lustre servers ...

Category: Allgemein, HPC, Systems

Since Wednesday (2025-01-22) evening, some nodes in TinyGPU and TinyFAT seem to have problems accessing NFS directories. This will lead to issues like Jobs mysteriously hanging, or interactive jobs starting but never opening a shell prompt. Update 2025-01-23 09:30: This issue should now be resolv...

Category: Allgemein, HPC, Systems

There will be a scheduled downtime of all the HPC systems of NHR@FAU starting Tuesday, 29.10. at 7:30, and expected to last until about 18:00. As usual, Jobs that would collide with the downtime will be postponed until the downtime is over. Reason for the downtime is general maintenance on cent...

Category: HPC, Systems

There will be a scheduled downtime of all the HPC systems of NHR@FAU on Tuesday, May 21, starting at 00:00 and lasting all day. (*) As usual, Jobs that would collide with the downtime will be postponed until the downtime is over. Most frontends and fileservers will be available most of the time, but...

Category: All, Allgemein, HPC, Systems