We’ve all heard it: the cloud is highly available. And to be fair, it usually is. Until it isn’t.

If you’ve been around long enough, you’ve seen outages that go beyond a single server or application. Data centers go offline. Regions have issues. Services that everything depends on suddenly stop responding. Sometimes it’s a power problem. Sometimes it’s a bad patch. Sometimes it’s something no one saw coming.

The takeaway is pretty simple. No single location is immune to failure. If your architecture depends on one data center, one availability zone, or even one region always being available, you are taking on more risk than you might realize.

When you start thinking about disaster recovery in those terms, the conversation changes quickly.

Understanding what can actually fail