Farm HPC cluster status is Operational

Fri 14
Sat 15
Sun 16
Mon 17
Tue 18
Wed 19
Thu 20
now

Farm HPC cluster Login

Fri 14
Sat 15
Sun 16
Mon 17
Tue 18
Wed 19
Thu 20
now

Farm HPC cluster Software

Fri 14
Sat 15
Sun 16
Mon 17
Tue 18
Wed 19
Thu 20
now
Last updated 1 minute ago from official status page. Learn more
Stay ahead of Farm HPC cluster outages
Sign up to create a custom dashboard to monitor the services you rely on. 3,000+ services supported.

Active Incidents

No active incidents

Recently Resolved Incidents

Some group and home directories unable to mount on the login node
Started 19 Feb 2025 22:26:54 (2 days ago), resolved 21 Feb 2025 01:35:08 (14 hours ago)
Minor Incident
Resolved
Login

Some group and home directories are unable to mount on the login node. An emergency reboot of the login node is scheduled for 5pm tomorrow (Thursday). This will not impact any sbatch jobs, though it will cause all srun jobs to fail.

Various software, including RStudio, still broken following maintenance window
Started 7 Jan 2025 18:32:13 (1 month ago), resolved 19 Feb 2025 22:19:15 (2 days ago)
Major Incident
Resolved
Software

Sysadmins are actively working to restore software that is no longer working since the maintenance window. This includes, but is not limited to:

  • RStudio (which also breaks Rstudio Server in OnDemand)
  • Maker
  • Braker
  • BUSCO
  • Spades
  • multiqc
  • nvhpc

Fixed:

  • The 52 pre-built conda environments
  • Julia
  • apptainer
  • OpenMPI (version 4.1.5)

Farm HPC cluster Outage Survival Guide

A step-by-step guide to help you survive a Farm HPC cluster outage
NaN%

    Farm HPC cluster Components

    Fri 14
    Sat 15
    Sun 16
    Mon 17
    Tue 18
    Wed 19
    Thu 20
    now

    Farm HPC cluster Login

    Fri 14
    Sat 15
    Sun 16
    Mon 17
    Tue 18
    Wed 19
    Thu 20
    now
    Some group and home directories unable to mount on the login node
    Started 19 Feb 2025 22:26:54 (2 days ago), resolved 21 Feb 2025 01:35:08 (14 hours ago)
    Minor Incident
    Resolved
    Login

    Some group and home directories are unable to mount on the login node. An emergency reboot of the login node is scheduled for 5pm tomorrow (Thursday). This will not impact any sbatch jobs, though it will cause all srun jobs to fail.

    Farm HPC cluster Storage

    Fri 14
    Sat 15
    Sun 16
    Mon 17
    Tue 18
    Wed 19
    Thu 20
    now

    Farm HPC cluster File transfer node

    Fri 14
    Sat 15
    Sun 16
    Mon 17
    Tue 18
    Wed 19
    Thu 20
    now

    Farm HPC cluster high2,med2,low2

    Fri 14
    Sat 15
    Sun 16
    Mon 17
    Tue 18
    Wed 19
    Thu 20
    now

    Farm HPC cluster high,med,low

    Fri 14
    Sat 15
    Sun 16
    Mon 17
    Tue 18
    Wed 19
    Thu 20
    now

    Farm HPC cluster bmh,bmm

    Fri 14
    Sat 15
    Sun 16
    Mon 17
    Tue 18
    Wed 19
    Thu 20
    now

    Farm HPC cluster bigmemh,bigmemm

    Fri 14
    Sat 15
    Sun 16
    Mon 17
    Tue 18
    Wed 19
    Thu 20
    now

    Farm HPC cluster bgpu

    Fri 14
    Sat 15
    Sun 16
    Mon 17
    Tue 18
    Wed 19
    Thu 20
    now

    Farm HPC cluster gpuh,gpum

    Fri 14
    Sat 15
    Sun 16
    Mon 17
    Tue 18
    Wed 19
    Thu 20
    now

    Farm HPC cluster Email

    Fri 14
    Sat 15
    Sun 16
    Mon 17
    Tue 18
    Wed 19
    Thu 20
    now

    Farm HPC cluster Virtualization

    Fri 14
    Sat 15
    Sun 16
    Mon 17
    Tue 18
    Wed 19
    Thu 20
    now
    Proxmox Virtualization Nodes
    Fri 14
    Sat 15
    Sun 16
    Mon 17
    Tue 18
    Wed 19
    Thu 20
    now
    Ganetti cluster
    Fri 14
    Sat 15
    Sun 16
    Mon 17
    Tue 18
    Wed 19
    Thu 20
    now

    Farm HPC cluster Slurm

    Fri 14
    Sat 15
    Sun 16
    Mon 17
    Tue 18
    Wed 19
    Thu 20
    now

    Farm HPC cluster Software

    Fri 14
    Sat 15
    Sun 16
    Mon 17
    Tue 18
    Wed 19
    Thu 20
    now
    Various software, including RStudio, still broken following maintenance window
    Started 7 Jan 2025 18:32:13 (1 month ago), resolved 19 Feb 2025 22:19:15 (2 days ago)
    Major Incident
    Resolved
    Software

    Sysadmins are actively working to restore software that is no longer working since the maintenance window. This includes, but is not limited to:

    • RStudio (which also breaks Rstudio Server in OnDemand)
    • Maker
    • Braker
    • BUSCO
    • Spades
    • multiqc
    • nvhpc

    Fixed:

    • The 52 pre-built conda environments
    • Julia
    • apptainer
    • OpenMPI (version 4.1.5)