Incident description

System Incident status Start Date End Date
Graham Closed
Created by Chou Khamkeuang on

Title


Planned Outage - Arrêt Planifié


Summary


Starting Tuesday, October 25th, 2022, at 9 a.m. EDT, the Graham cluster will be unavailable to all users as we perform cluster maintenance. All running jobs will be terminated and all queued jobs will be deleted. The work will be completed by Wednesday, October 26th, 2022 at 10 a.m. EDT.

During the outage a new home server will be installed; we will migrate all user data off the old server to the new. The cluster scheduler (Slurm) will be upgraded to a newer release. We will also update the compute node image and the CUDA driver version.

Please watch https://status.alliancecan.ca/ for updates on the availability of Graham and all other national systems.

This outage will impact the cluster, login nodes, visualization nodes (VDI) as well as data transfer nodes (DTN). There will be no impact to the Graham cloud.
 

Start Time : 9 a.m. EDT, Tuesday, October 25, 2022

Anticipated End Time : 10 a.m. EDT, Wednesday, October 26, 2022
 

Users will be notified by email when the cluster is up and running again.

For questions, please email support@tech.alliancecan.ca.


Updated by Kaizaad Bilimorya on