Author: Georg Hager

Since 20:10 on January 30, the parallel Lustre file system ($FASTTMP) on the Fritz cluster has been unavailable due to repeated server crashes. The issue is being investigated. Update 2025-01-31 17:00: The issue turned out to be a faulty component in the InfiniBand network of Fritz, the Lustre servers ...

Category: Allgemein, HPC, Systems

Today at about 10:45 a.m., a widespread power outage in Erlangen brought all our clusters down. All running jobs have been terminated. Some frontends have been switched off manually to reduce the load on the uninterruptible power supply infrastructure. We are working to get the systems running...

Category: Allgemein, HPC, Systems