October 30, 2025

AWS Outage Disrupts Major Apps Worldwide

US-EAST-1 incident tied to DNS and load-balancer subsystem triggered cascading failures across the internet

Pittsburgh, PA – October 22, 2025 (Updated: 10/30) – A widespread AWS disruption that began shortly after midnight Pacific on Monday, October 20, rippled across the web, knocking popular apps and business services offline before Amazon reported full restoration later in the day. Reporting indicates the incident centered on US-EAST-1 (N. Virginia) and was the most significant internet disruption since last year’s CrowdStrike event.

Initial status updates pointed to DNS resolution issues preventing applications from reaching the DynamoDB API in US-EAST-1. Later, AWS said the root cause was an internal subsystem that monitors the health of network load balancers within the EC2 internal network, with DNS effects compounding the blast radius. By about 3:00 p.m. PT, Amazon said “all AWS services returned to normal operations,” while warning that some services would need several hours to clear queued messages.

The outage impacted a broad range of consumer and enterprise brands. Reports and company statements cited disruptions at Snapchat, Reddit, Roblox, Venmo, Zoom, Coinbase, Robinhood, and even Amazon’s own retail, Prime Video, and Alexa services. Ookla said over 4 million users reported issues worldwide, and at least a thousand companies were affected.

“All AWS services returned to normal operations… Some services… continue to have a backlog of messages that they will finish processing over the next few hours,” Amazon said on Monday.

Key facts

  • Timeline: Issues began shortly after midnight PT on Oct 20; AWS later reported full restoration the same day, with lingering backlogs.

  • Where: US-EAST-1 (N. Virginia)—a region with prior, high-profile incidents—was identified as the locus.

  • Root cause (AWS): Internal load-balancer health-monitor subsystem within the EC2 internal network; DNS problems impeded reaching DynamoDB endpoints.

  • Scale: Millions of outage reports; 1,000+ companies impacted, spanning communications, gaming, finance, and retail.

The event underscores the internet’s dependence on a few hyperscale providers. As one expert noted, the episode highlights how “relatively fragile infrastructures” can cascade through everyday digital services. 
