➀ A multi-day AWS outage, affecting millions of services, originated from a DNS configuration error in DynamoDB that cascaded to EC2 and Network Load Balancer systems.
➀ The root cause was a race condition in DNS updates: outdated automation processes overwrote critical DNS entries, leading to widespread service failures.
➂ Amazon implemented manual fixes and announced systemic safeguards, highlighting the risks of over-reliance on poorly decentralized automation in cloud infrastructure.