
Pawsey Supercomputing Research Centre Status
Real-time updates of Pawsey Supercomputing Research Centre issues and outages
Pawsey Supercomputing Research Centre status is Partially Degraded Service
Pawsey Supercomputing Research Centre Setonix
Login nodes
Setonix highmem partition
Banksia
Active Incidents
Banksia – one of two tape copies unavailable (at risk) The Banksia service is currently in a degraded, “at risk” state as it is operating with only one tape library instead of the standard two. As a result, the secondary copy of files will be unavailable for staging or archiving until Library 2 is restored to service. The primary copy is still available for all data so this should not impact the service. If you experience any issues accessing data please let us know at [email protected]
The issue has been traced to a faulty tape drive in DBA 5. To resolve this, the field engineer will remove the drives from the DBA, replace the faulty unit, and reseat all components. Since DBA 5 is currently blocked, DBA 6 will need to be removed first to allow access for the repair work. This work will take some time but the engineer indicates that in the worst case it will take until Tuesday but they are hoping for resolution today.
Recently Resolved Incidents
There appears to be an issue will the Slingshot interfaces in the login nodes in Setonix. We appear to be down to 1 login node in the normal pool of login nodes.
We have had a case open with HPE for weeks, but they appear to be no closer to providing any kind of solution.
Please, please, please, please don't run any computational intensive operations on the login nodes. We have lovely compute nodes for that.
Please be aware that you can log into setonix-workflow.pawsey.org.au and get access to additional "workflow" nodes.
Pawsey Supercomputing Research Centre Outage Survival Guide
Pawsey Supercomputing Research Centre Components
Pawsey Supercomputing Research Centre Setonix
Login nodes
There appears to be an issue will the Slingshot interfaces in the login nodes in Setonix. We appear to be down to 1 login node in the normal pool of login nodes.
We have had a case open with HPE for weeks, but they appear to be no closer to providing any kind of solution.
Please, please, please, please don't run any computational intensive operations on the login nodes. We have lovely compute nodes for that.
Please be aware that you can log into setonix-workflow.pawsey.org.au and get access to additional "workflow" nodes.
Data-mover nodes
Slurm scheduler
Setonix work partition
Setonix debug partition
Setonix long partition
Setonix copy partition
Setonix askaprt partition
Setonix highmem partition
Setonix gpu partition
Setonix gpu high mem partition
Setonix gpu debug partition
Pawsey Supercomputing Research Centre Lustre filesystems
/scratch filesystem
/software filesystem
/askapbuffer filesystem
/askapingest filesystem
Pawsey Supercomputing Research Centre Storage Systems
Acacia Ingest
Acacia MWA
Acacia Projects
Banksia
Banksia – one of two tape copies unavailable (at risk) The Banksia service is currently in a degraded, “at risk” state as it is operating with only one tape library instead of the standard two. As a result, the secondary copy of files will be unavailable for staging or archiving until Library 2 is restored to service. The primary copy is still available for all data so this should not impact the service. If you experience any issues accessing data please let us know at [email protected]
The issue has been traced to a faulty tape drive in DBA 5. To resolve this, the field engineer will remove the drives from the DBA, replace the faulty unit, and reseat all components. Since DBA 5 is currently blocked, DBA 6 will need to be removed first to allow access for the repair work. This work will take some time but the engineer indicates that in the worst case it will take until Tuesday but they are hoping for resolution today.