October 30, 2025

AWS Outage Disrupts Major Apps Worldwide

US-EAST-1 incident tied to DNS and load-balancer subsystem triggered cascading failures across the internet

Amazon Web Services experienced a global outage on October 20 that took down scores of apps and websites. Amazon said services later “returned to normal operations,” though some workloads faced backlogs as systems recovered.

Pittsburgh, PA – October 22, 2025 (Updated: 10/30) — A widespread AWS disruption that began shortly after midnight Pacific on Monday rippled across the web, knocking popular apps and business services offline before Amazon reported full restoration later in the day. Reporting indicates the incident centered on US-EAST-1 (N. Virginia) and was the most significant internet disruption since last year’s CrowdStrike event.

Initial status updates pointed to DNS resolution issues preventing applications from reaching the DynamoDB API in US-EAST-1. Later, AWS said the root cause was an internal subsystem that monitors the health of network load balancers within the EC2 internal network with DNS effects compounding the blast radius. By ~3:00 p.m. PT, Amazon said “all AWS services returned to normal operations,” while warning that some services would clear queued messages for several hours.

The outage impacted a broad range of consumer and enterprise brands. Reports and company statements cited disruptions at Snapchat, Reddit, Roblox, Venmo, Zoom, Coinbase, Robinhood, and even Amazon’s own retail, Prime Video, and Alexa services. Ookla said over 4 million users reported issues worldwide, and at least a thousand companies were affected.

All AWS services returned to normal operations… Some services… continue to have a backlog of messages that they will finish processing over the next few hours
said Amazon on Monday. 

Key facts

  • Timeline: Issues began shortly after midnight PT on Oct 20; AWS later reported full restoration the same day, with lingering backlogs.

  • Where: US-EAST-1 (N. Virginia)—a region with prior, high-profile incidents—was identified as the locus.

  • Root cause (AWS): Internal load-balancer health-monitor subsystem within the EC2 internal network; DNS problems impeded reaching DynamoDB endpoints.

  • Scale: Millions of outage reports; 1,000+ companies impacted, spanning communications, gaming, finance, and retail.

The event underscores the internet’s dependence on a few hyperscale providers. As one expert noted, the episode highlights how “relatively fragile infrastructures” can cascade through everyday digital services. 

Author:

Other articles

August 1, 2017
Hybrid vs Public Clouds; Which makes sense for you?

Private, public, and hybrid clouds offer different benefits for businesses. Each solution balances cost, performance, and security based on organizational needs.

More
Down arrow
September 30, 2025
Huawei Launches OceanDisk EX 560, SP 560, and LC 560 AI SSDs

Huawei unveiled EX/SP/LC 560 AI SSDs for training, inference, and capacity—EX up to 1.5M write IOPS; LC up to 245TB—and DiskBooster to pool HBM/DDR/SSD with up to 20× virtual memory.

More
Down arrow