HPC and Housing in L5|08: Downtime

for operations on the power infrastructure

2024/09/30 07:00-18:00

For the final repair of the 2000A power rail, the whole HPC cluster will be down.

After the power short 2024-02-08 , the main power rail in L5|08 could only be made operational again by a makeshift repair, eliminating affected elements and thus, unduly truncating the rail mechanically.

In the meantime, all spare parts have been delivered, and the too-short power rail will be extended to its original length.

As it cannot be worked on under current/power, we have to shut down the whole HPC cluster (compute and login nodes) on the 30th of September 2024 for a day.

Institute servers in the adjacent housing are only affected if not connected to the uninterruptible power supply.

As soon as the workings are finished, we will inform you on the [HPC-Nutzer] mailing list and on this HPC News page.

You do not need to do anything with respect to your (running or pending) batch jobs. The scheduler knows about the downtime and will

  • start pending jobs only if these will be safely finished before the downtime and
  • hold all others until after the downtime and recommencing of the normal scheduling.