Farm HPC cluster status is Minor Service Outage

Wed 26
Thu 27
Fri 28
Sat 29
Sun 30
Mon 31
Tue 1
now

Farm HPC cluster Login

Wed 26
Thu 27
Fri 28
Sat 29
Sun 30
Mon 31
Tue 1
now

Farm HPC cluster Storage

Wed 26
Thu 27
Fri 28
Sat 29
Sun 30
Mon 31
Tue 1
now
Last updated 1 minute ago from official status page. Learn more
Stay ahead of Farm HPC cluster outages
Sign up to create a custom dashboard to monitor the services you rely on. 3,000+ services supported.

Active Incidents

Farm: nas-12-2 down due to multiple disk failures
Started 1 Apr 2025 01:09:07 (1 day ago), still ongoing
Major Incident
Identified
Login
Storage

nas-12-2 has suffered from multiple disk failures. Admins are investigating the best path forward.

The following group directories are currently unavailable:

awhitehegrp millermrgrp millsgrp runciegrp weimergrp yujingrp

The following home directories are unavailable:

aavalos7 awhitehe barao bcbaikie bcweimer berdeja crice crios cschles dglemay djprince dkblaufu drbandoy eabernat ecgranad edkoch emmaluu eoziolor fengq hahudson hemstrow hxhu jagill jajpark jamcgirr jassim jcariute jdowen jenwash jmiller1 jroach jrwashab jxnliu katng23 ljcohen madarm11 mam12n mary363 millermr mlyjones mmosmond motch mtreiber namcnabb nmariano nreid pjseba profeta prvasque psbapat rsbrenna sakre saumyaw scsastry seboles sejoslin smhigdon spatel23 tmbolt vfbetsis vpdunne wolfie12 xmixu yoxue ytakim ywdong

Recently Resolved Incidents

nas-5-2 crashed
Started 27 Mar 2025 16:08:02 (6 days ago), resolved 27 Mar 2025 18:59:36 (6 days ago)
Major Incident
Resolved
Storage

nas-5-2 has crashed. Any home directories, or group directories, shared from there are currently hung. Admins are investigating.

Farm HPC cluster Outage Survival Guide

A step-by-step guide to help you survive a Farm HPC cluster outage
NaN%

    Farm HPC cluster Components

    Wed 26
    Thu 27
    Fri 28
    Sat 29
    Sun 30
    Mon 31
    Tue 1
    now

    Farm HPC cluster Login

    Wed 26
    Thu 27
    Fri 28
    Sat 29
    Sun 30
    Mon 31
    Tue 1
    now
    Farm: nas-12-2 down due to multiple disk failures
    Started 1 Apr 2025 01:09:07 (1 day ago), still ongoing
    Major Incident
    Identified
    Login
    Storage

    nas-12-2 has suffered from multiple disk failures. Admins are investigating the best path forward.

    The following group directories are currently unavailable:

    awhitehegrp millermrgrp millsgrp runciegrp weimergrp yujingrp

    The following home directories are unavailable:

    aavalos7 awhitehe barao bcbaikie bcweimer berdeja crice crios cschles dglemay djprince dkblaufu drbandoy eabernat ecgranad edkoch emmaluu eoziolor fengq hahudson hemstrow hxhu jagill jajpark jamcgirr jassim jcariute jdowen jenwash jmiller1 jroach jrwashab jxnliu katng23 ljcohen madarm11 mam12n mary363 millermr mlyjones mmosmond motch mtreiber namcnabb nmariano nreid pjseba profeta prvasque psbapat rsbrenna sakre saumyaw scsastry seboles sejoslin smhigdon spatel23 tmbolt vfbetsis vpdunne wolfie12 xmixu yoxue ytakim ywdong

    Farm HPC cluster Storage

    Wed 26
    Thu 27
    Fri 28
    Sat 29
    Sun 30
    Mon 31
    Tue 1
    now
    Farm: nas-12-2 down due to multiple disk failures
    Started 1 Apr 2025 01:09:07 (1 day ago), still ongoing
    Major Incident
    Identified
    Login
    Storage

    nas-12-2 has suffered from multiple disk failures. Admins are investigating the best path forward.

    The following group directories are currently unavailable:

    awhitehegrp millermrgrp millsgrp runciegrp weimergrp yujingrp

    The following home directories are unavailable:

    aavalos7 awhitehe barao bcbaikie bcweimer berdeja crice crios cschles dglemay djprince dkblaufu drbandoy eabernat ecgranad edkoch emmaluu eoziolor fengq hahudson hemstrow hxhu jagill jajpark jamcgirr jassim jcariute jdowen jenwash jmiller1 jroach jrwashab jxnliu katng23 ljcohen madarm11 mam12n mary363 millermr mlyjones mmosmond motch mtreiber namcnabb nmariano nreid pjseba profeta prvasque psbapat rsbrenna sakre saumyaw scsastry seboles sejoslin smhigdon spatel23 tmbolt vfbetsis vpdunne wolfie12 xmixu yoxue ytakim ywdong

    nas-5-2 crashed
    Started 27 Mar 2025 16:08:02 (6 days ago), resolved 27 Mar 2025 18:59:36 (6 days ago)
    Major Incident
    Resolved
    Storage

    nas-5-2 has crashed. Any home directories, or group directories, shared from there are currently hung. Admins are investigating.

    Farm HPC cluster File transfer node

    Wed 26
    Thu 27
    Fri 28
    Sat 29
    Sun 30
    Mon 31
    Tue 1
    now

    Farm HPC cluster high2,med2,low2

    Wed 26
    Thu 27
    Fri 28
    Sat 29
    Sun 30
    Mon 31
    Tue 1
    now

    Farm HPC cluster high,med,low

    Wed 26
    Thu 27
    Fri 28
    Sat 29
    Sun 30
    Mon 31
    Tue 1
    now

    Farm HPC cluster bmh,bmm

    Wed 26
    Thu 27
    Fri 28
    Sat 29
    Sun 30
    Mon 31
    Tue 1
    now

    Farm HPC cluster bigmemh,bigmemm

    Wed 26
    Thu 27
    Fri 28
    Sat 29
    Sun 30
    Mon 31
    Tue 1
    now

    Farm HPC cluster bgpu

    Wed 26
    Thu 27
    Fri 28
    Sat 29
    Sun 30
    Mon 31
    Tue 1
    now

    Farm HPC cluster gpuh,gpum

    Wed 26
    Thu 27
    Fri 28
    Sat 29
    Sun 30
    Mon 31
    Tue 1
    now

    Farm HPC cluster Email

    Wed 26
    Thu 27
    Fri 28
    Sat 29
    Sun 30
    Mon 31
    Tue 1
    now

    Farm HPC cluster Virtualization

    Wed 26
    Thu 27
    Fri 28
    Sat 29
    Sun 30
    Mon 31
    Tue 1
    now
    Proxmox Virtualization Nodes
    Wed 26
    Thu 27
    Fri 28
    Sat 29
    Sun 30
    Mon 31
    Tue 1
    now
    Ganetti cluster
    Wed 26
    Thu 27
    Fri 28
    Sat 29
    Sun 30
    Mon 31
    Tue 1
    now

    Farm HPC cluster Slurm

    Wed 26
    Thu 27
    Fri 28
    Sat 29
    Sun 30
    Mon 31
    Tue 1
    now

    Farm HPC cluster Software

    Wed 26
    Thu 27
    Fri 28
    Sat 29
    Sun 30
    Mon 31
    Tue 1
    now