Investigating - We are investigating a networking issue in the us-east1-a region affecting connectivity to a subset of compute hosts. Affected virtual machines may appear in an unspecified state or be unreachable over the network. Managed Kubernetes clusters in the region may also report nodes as Not Ready.
Engineering is engaged and actively working to identify the cause and restore service. We will provide updates as we have more information.
May 20, 2026 - 01:37 UTC
GPU Virtual Machines
Operational
90 days ago
100.0
% uptime
Today
us-east1
Operational
90 days ago
100.0
% uptime
Today
us-northcentral1
Operational
90 days ago
100.0
% uptime
Today
us-southcentral1
Operational
90 days ago
100.0
% uptime
Today
eu-iceland1
Operational
90 days ago
100.0
% uptime
Today
Networking
Partial Outage
90 days ago
99.98
% uptime
Today
Wide Area Networking (WAN)
Operational
90 days ago
100.0
% uptime
Today
VPC Networking
Partial Outage
90 days ago
99.97
% uptime
Today
Infiniband Networks
Operational
90 days ago
100.0
% uptime
Today
Datacenter Networking
Operational
90 days ago
99.98
% uptime
Today
Storage
Operational
90 days ago
99.96
% uptime
Today
Shared Disks
Operational
90 days ago
99.95
% uptime
Today
Persistent Storage
Operational
90 days ago
99.98
% uptime
Today
Orchestration
Operational
90 days ago
99.95
% uptime
Today
Crusoe Managed Kubernetes (CMK)
Operational
90 days ago
99.95
% uptime
Today
API
Operational
90 days ago
99.97
% uptime
Today
API
Operational
90 days ago
99.97
% uptime
Today
UI
Operational
90 days ago
99.99
% uptime
Today
UI
Operational
90 days ago
99.99
% uptime
Today
Container Registry
Operational
90 days ago
99.88
% uptime
Today
Container Registry
Operational
90 days ago
99.88
% uptime
Today
Managed Inference
Operational
90 days ago
99.83
% uptime
Today
Managed Inference
Operational
90 days ago
99.83
% uptime
Today
Operational
Degraded Performance
Partial Outage
Major Outage
Maintenance
Major outage
Partial outage
No downtime recorded on this day.
No data exists for this day.
had a major outage.
had a partial outage.
Related
No incidents or maintenance related to this downtime.
Crusoe Cloud will be undergoing planned network maintenance in all the regions. During this maintenance window, no running workloads will be impacted during the time of Maintenance.
If you notice any issues, please reach out to support@crusoe.ai Posted on
May 20, 2026 - 02:43 UTC
Crusoe Cloud will be performing planned network maintenance to upgrade Front End (FE) Leaf switches across our global infrastructure. This critical update addresses known software bugs and standardizes our network command-set, significantly improving long-term stability and incident response.
During the maintenance window, all hypervisors in the us-southcentral1-a region will experience intermittent unavailability. We strongly recommend pausing or gracefully shutting down critical workloads during the window to prevent unexpected disruptions or data loss.
All other services outside the us-southcentral1-a region will remain operational. If you have questions regarding your specific workloads or notice persistent issues following the maintenance window, please reach out to support@crusoe.ai. Posted on
May 06, 2026 - 17:55 UTC
Crusoe Cloud will be performing planned network maintenance to upgrade Front End (FE) Leaf switches across our global infrastructure. This critical update addresses known software bugs and standardizes our network command-set, significantly improving long-term stability and incident response.
During the maintenance window, all hypervisors in the us-east1-a region will experience intermittent unavailability. We strongly recommend pausing or gracefully shutting down critical workloads during the window to prevent unexpected disruptions or data loss.
All other services outside the us-east1-a region will remain operational. If you have questions regarding your specific workloads or notice persistent issues following a maintenance window, please reach out to support@crusoe.ai. Posted on
May 06, 2026 - 17:57 UTC
Crusoe Cloud will be performing planned network maintenance to upgrade Front End (FE) Leaf switches across our global infrastructure. This critical update addresses known software bugs and standardizes our network command-set, significantly improving long-term stability and incident response.
During the maintenance window, all hypervisors in the eu-iceland1-a region will experience intermittent unavailability. We strongly recommend pausing or gracefully shutting down critical workloads during the window to prevent unexpected disruptions or data loss.
All other services outside the eu-iceland1-a region will remain operational. If you have questions regarding your specific workloads or notice persistent issues following a maintenance window, please reach out to support@crusoe.ai. Posted on
May 06, 2026 - 17:59 UTC
Past Incidents
May 20, 2026
Unresolved incident: Connectivity issues impacting compute instances in us-east1-a.
Resolved -
VM creation and start operations in the us-southcentral1-a and eu-iceland1-a regions have been functioning normally for several hours. Our Engineering teams have addressed the underlying issue and a permanent fix is being prepared for deployment to prevent recurrence. Existing running workloads and resources remained fully operational throughout the incident.
We apologize for any inconvenience this may have caused.
May 19, 01:19 UTC
Monitoring -
The issue affecting the ability to create and start Virtual Machines in the us-southcentral1-a and eu-iceland1-a regions has been mitigated. VM creation and start operations are functioning normally. No customer impact has been observed as of 14:00 UTC. Our Engineering teams continue to monitor both regions closely and are working on a permanent fix to prevent recurrence.
We apologize for any inconvenience this may have caused.
May 18, 14:34 UTC
Update -
We have identified the underlying issue affecting the ability to create and start Virtual Machines in the us-southcentral1-a and eu-iceland1-a regions. Our Engineering teams are actively working on recovery. Some customers may still be experiencing impact while this work continues. Existing running workloads and resources remain fully operational and are not impacted. We will provide a further update as soon as more information becomes available.
May 18, 11:27 UTC
Identified -
We are investigating an issue affecting the ability to create and start Virtual Machines in the us-southcentral1-a and eu-iceland1-a regions. Existing running workloads and resources remain fully operational and are not impacted. Our Engineering teams are actively working on recovery. We will provide an update when more information is available.
May 18, 08:24 UTC
Completed -
The scheduled maintenance has been completed.
May 16, 01:02 UTC
In progress -
Scheduled maintenance is currently in progress. We will provide updates as necessary.
May 16, 00:00 UTC
Scheduled -
Migrate active node to new high-RAM/fast-disk VM
Migrating the active Vault node impacts all CMK and Slurm customers performing provisioning, scaling, or kubeconfig-fetch operations during the maintenance window.
Existing running clusters and existing kubeconfigs are NOT impacted — Vault is not in the runtime data path.
Customer Impact: * Nodepool VM creation * Cluster creation & deletion * Kubeconfig download
May 15, 23:31 UTC
Resolved -
This incident has been resolved.
May 14, 00:09 UTC
Investigating -
Customers using CCR (Crusoe Container Registry) will be unable to push or pull images. We are working on resolving this matter as soon as possible. If you experience any other issues, please reach out to support@crusoe.ai
May 13, 21:34 UTC
Resolved -
Service has been fully restored. We have identified the root cause and are taking steps to prevent recurrence.
May 10, 18:19 UTC
Investigating -
We are investigating an issue affecting access to Crusoe Managed Inference, including Intelligence Foundry and Bring Your Own Model (BYOM). Customers may be unable to reach inference endpoints or send requests to the service. Engineering is actively working on recovery. We will provide an update when more information is available.
May 10, 14:41 UTC
Resolved -
This incident has been resolved.
May 9, 03:09 UTC
Update -
The issue has been mitigated and project creation requests are completing successfully.
May 9, 03:09 UTC
Investigating -
We're investigating failures when creating new projects. Existing workloads and resources remain fully operational. We will provide an update when more information is available.
May 8, 20:04 UTC
Resolved -
This incident has been resolved.
May 6, 23:41 UTC
Investigating -
We are currently experiencing an issue with storage used disk capacity metrics in our Iceland (eu-iceland1) region.
All data access, read/write operations, and service functionality continue to operate normally with no performance impact. Our team is actively working to restore capacity metrics. We will provide updates as restoration progresses.
May 6, 22:19 UTC