Systems

Since about 6:00 p.m. today, the parallel filesystem ($FASTTMP) has been partially down. Jobs on Fritz, Alex, and Helma may be affected. We are working on resolving the issue and apologize for the inconvenience. Update: Lustre is fully available again.

Category: HPC, Systems

The Fritz and Helma clusters are down today (May 14) because of urgent maintenance of the water cooling system. We apologize for the inconvenience and will keep you updated about the status. Update: About half of the Fritz cluster nodes are up and running again. Update: The Fritz and Helma clu...

Category: HPC, Systems

There will be a scheduled downtime of all the HPC systems of NHR@FAU starting on Monday, 17.03. at 9:00, expected to last until about 17:00. As usual, jobs that would collide with the downtime will automatically be postponed until the downtime is over. Frontends and filesystems will be avai...

Category: Allgemein, HPC, Systems

Since 20:10 on January 30, the parallel Lustre file system ($FASTTMP) on the Fritz cluster has been unavailable due to repeated server crashes. The issue is being investigated. Update 2025-01-31 17:00: The issue turned out to be a faulty component in the InfiniBand network of Fritz, the Lustre servers ...

Category: Allgemein, HPC, Systems

Since Wednesday evening (2025-01-22), some nodes in TinyGPU and TinyFAT seem to have had problems accessing NFS directories. This can lead to issues such as jobs mysteriously hanging, or interactive jobs starting but never opening a shell prompt. Update 2025-01-23 09:30: This issue should now be resolv...

Category: Allgemein, HPC, Systems