Updates


Event Date Summary

June 30 @ 5:36PM PST - We think we have sorted out the cooling issue. We are currently in the process of running some final cooling tests and expect to begin the process of bringing up Cedar nodes at 7PM PST.

We thank you for your continued patience!

June 30 @ 9:37AM: SFU Facilities Services is on-site and performing maintenance on the cooling system to try and determine why that unit are intermittently power cycling the fans and pumps. HPC services remain offline while we sort through the issue and will provide updates as they are available.

Cedar is now Available With Conditions

  • Globus services are now back online.
  • The SFU datacenter continues to experience cooling issues.
  • While temperatures have remained stable over the weekend, we are keeping all HPC services offline, with the exception of Cedar Cloud, dCache, and Globus services.
  • Access to login nodes is now available.
Important information
Please note that all of the file systems are now using the new storage infrastructure. If users encounter any issues, they should not hesitate to submit a support ticket. However, here is some important information...Project upgrade:
  • /project filesystem data was migrated to our new storage, if you find any missing data, or other concerns, such as quota, support is available to assist.
  • Benefit of this migration is the new Project Quota, rather than SGID of project; this means:
    • all of the files inside the parent project folder belong to the quota.
    • This eliminates issues often faced with tools, such as: git, mv, or tar
Possible reasons for out-of-quota issues:
  • The new project does not have file compression enabled, this is to be expected.
  • /project now uses Project Quota, so all files inside of the project apply against the PI's quota, whether they belong to the UID/GID of the PI or their users.
  • This will most likely affect projects that had files from old projects that they had moved over or left behind.

Thank you for your patience and understanding


Incident description

Service Incident status Start Date End Date
Cedar Closed
Created by Andre Poley on

Title


Planned Outage - Arrêt planifié


Summary




Cedar will be unavailable from 2025-06-16 to 2025-06-30. During this time, you will not be able to log in or run jobs on the cluster. Any jobs that are running at the time of the outage will be terminated and you will need to re-submit them once the cluster is back online.

======

Cedar ne sera pas disponible  2025-06-16 de 2025-06-30. Durant cette période, les utilisateurs ne pourront pas se connecter ni exécuter des tâches sur Cedar. Toute tâche en cours au moment de l’arrêt de service sera arrêtée et devra être soumise à nouveau une fois les travaux terminés.


Updated by James Peltier on