Overheating data centre forces shutdown of all network, compute, and storage resources
UK South, one of Microsoft Azure’s two UK cloud regions, went offline on Monday after an outage triggered by a cooling system failure in a data centre.
The incident, between 14:54 BST on 14 Sep 2020 and 01:41 BST on 15 Sep 2020, left engineers scrambling to place the automatic cooling system into manual mode and reset affected pumps, after rising internal temperatures saw systems shut down all network, compute, and storage resources “to protect data durability”.
“Customers using multiple Availability Zones, or Zone Redundant services may have experienced minimal impact,” notes Microsoft in its incident report.
The outage dragged on because, after manually overriding the automatic cooling systems and resetting them, engineers had to phase in the return of power and bring infrastructure gradually back online. (A similar