Major power outage at Hong Kong (HKG12)
Incident Report for Misaka Network, Inc.
Resolved
This incident has been resolved.
Posted May 18, 2024 - 07:21 UTC
Monitoring
All servers are now online, but the new rack is currently without CMI VRF. We're working on a fix and expect it to be fully operational soon.
Posted May 18, 2024 - 06:53 UTC
Update
All nodes except HKG12C0328 are up. We're still working to get it back online.
Posted May 18, 2024 - 06:16 UTC
Update
Our technician has arrived on site and discovered that the PDU has partially failed. We are now moving the affected servers to another rack to restore service. The estimated time to restoration (ETR) is 30 minutes.
Posted May 18, 2024 - 05:19 UTC
Update
Some of our computing nodes are now back online. Based on the metrics we collected, it appears that the circuit breaker for one of the two banks on the PDU tripped when the primary PDU lost power, even though the secondary PDU took over the load. We are waiting for the datacenter to provide more information on this issue and will post further updates as soon as we have it.
Posted May 18, 2024 - 04:46 UTC
Update
We've noticed that some of our equipment is now back online, but the ToR switch remains offline as it has dual PSUs and is connected to different PDUs. We will provide more updates once we arrive at the datacenter or receive more information from the datacenter via the trouble ticket.
Posted May 18, 2024 - 04:30 UTC
Identified
We are aware that two of our cabinets in Hong Kong unexpectedly lost power during maintenance. A technician has been dispatched, and we will provide updates as soon as he arrives at the datacenter.
Posted May 18, 2024 - 03:31 UTC
Investigating
We are currently investigating this issue.
Posted May 18, 2024 - 03:16 UTC
This incident affected: Datacenters (Hong Kong - (HKG12)).