Postmortem Index

Explore incident reports from various companies

Title Company Date Categories
Amazon SimpleDB US East Region Disruption on June 13 Amazon
EVE Online: Trinity installer deletes boot.ini CCP Games 2007-12-05 – 2007-12-06
Homebrew GitHub token leak from Jenkins Homebrew 2018-07-31
Honeycomb query performance and alerting incident (August 2022) Honeycomb
Intermittent downtime from repeated crashes incident.io 2022-11-18
rust-lang rust-lang
GitLab GitLab 2014-07-08
GitHub.com outage of December 2012 GitHub 2012-12-22 – 2012-12-23
Zerodha Order Management System overload on August 29, 2019 Zerodha 2019-08-29
Trading and hanging orders on 12th April 2018 Zerodha 2018-04-12
Heroku Heroku
Elastic Cloud AWS us-east-1 outage of February 2019 Elastic 2019-02-04
India NEW grid blackouts Indian Electricity Grid (POSOCO / CERC) 2012-07-30 – 2012-07-31
Google Cloud Networking and Load Balancing outage of November 2021 Google 2021-11-16
Razorpay RDS Multi-AZ Failover and Data Loss in December 2019 Razorpay
ARPANET network-wide outage of October 1980 ARPANET 1980-10-27
Subversion SHA1 Collision Affects WebKit Repository WebKit code repository
Google logged-in services outage due to incorrect configuration Google 2014-01-24
Flowdock outage and cross-organization data leak Broadcom (CA Technologies) 2020-04-21 – 2020-04-22
CircleCI Workflow Delay Incidents March 26 - April 10, 2019 CircleCI 2019-03-26 – 2019-04-10
High Filtering On SMS Towards AT&T Network In United States Twilio 2021-10-27
Joyent Joyent
Etsy site outage caused by multicast rsync Etsy
Cloudflare API and dashboard availability incident on 2020-11-02 Cloudflare 2020-11-02
Fastly global outage of June 8, 2021 Fastly 2021-06-08
Amazon Kinesis Data Streams US-EAST-1 Degradation July 2024 Amazon 2024-07-30 – 2024-07-31
Delay in starting Docker Jobs. Machine & remote Docker environments blocked CircleCI 2021-05-21 – 2021-05-22
Turso free tier data leak and loss Turso 2023-12-01 – 2023-12-04
PagerDuty PagerDuty
OpenAI OpenAI
Stack Exchange Network outage due to HAProxy iptables misconfiguration on August 25, 2014 Stack Overflow 2014-08-25
Cloudflare systemwide outage, November 18, 2025 Cloudflare 2025-11-18
Basecamp Basecamp
Yeller network partition causes processing delays Yeller 2014-07-29 – 2014-07-30
Salesforce Service Disruption September 20, 2023 Salesforce 2023-09-20
Roblox 73-hour outage due to Consul and BoltDB issues (October 2021) Roblox 2021-10-28 – 2021-10-31
GitHub background job system degraded availability October 2020 GitHub 2020-10-09 – 2020-10-10
MongoHQ security breach impacting CircleCI customer data CircleCI 2013-10-27 – 2013-10-30
Swedish warship Vasa sinks on maiden voyage Sweden 1628-08-10
Apollo 11 Lunar Landing Computer Overload NASA 1969-07-20
Netflix's response to October 2012 AWS EBS degradation Netflix 2012-10-22
Stackdriver Intelligent Monitoring application outage on October 23, 2013 Stackdriver 2013-10-23 – 2013-10-26
Sun Microsystems Enterprise server cache memory flaw Sun
Dropbox Dropbox
Partial Cloudflare outage on October 25, 2022 Cloudflare 2022-10-25
incident.io service disruption during AWS us-east-1 outage on October 20, 2025 incident.io 2025-10-20
Google Cloud Network Outage in Eastern USA, June 2019 Google 2019-06-02
Northeast blackout of 2003 FirstEnergy / General Electric 2003-08-14 – 2003-08-16
Cloudflare outage on June 21, 2022 Cloudflare 2022-06-21
incident.io intermittent database connection pool timeouts incident.io
Discord Connectivity Issues (March 2017) Discord 2017-03-20
Amazon DynamoDB US-EAST-1 outage of October 2025 Amazon 2025-10-20
Stack Exchange SQL Server bugcheck outage January 2017 Stack Exchange 2017-01-24
Google Cloud Networking issues in Europe and other regions on November 12, 2021 Google 2021-11-12
How I Broke `git push heroku main` Heroku
Fortnite service outages of February 3-4, 2018 Epic Games 2018-02-03 – 2018-02-05
GitHub network problems on November 30, 2012 GitHub 2012-11-30 – 2012-12-01
Reddit outage and degraded performance on August 11, 2016 Reddit 2016-08-11 – 2016-08-12
GoCardless API and Dashboard outage on 10 October 2017 GoCardless 2017-10-10
Malicious Packages Published to npm ESLint 2018-07-12
Allegro Allegro
Azure Storage service interruption Microsoft 2014-11-19
GitHub Actions and Codespaces outage of February 2026 GitHub 2026-02-02 – 2026-02-03
GitHub.com availability issues in September 2012 GitHub 2012-09-10
Okta third-party support engineer laptop compromise Okta 2022-01-16 – 2022-01-21
Heroku April 2022 security incident Heroku 2022-04-07 – 2022-04-14
LaunchDarkly service disruption due to AWS us-east-1 outage and internal cascading failures (October 2025) LaunchDarkly 2025-10-20 – 2025-10-21
Xubuntu.org download page compromised via WordPress vulnerability Xubuntu 2025-10-15 – 2025-10-19
Therac-25 radiation overdose accidents Atomic Energy of Canada Limited (AECL) 1985-06-03 – 1987-01-17
Cloudflare 1.1.1.1 lookup failures on October 4, 2023 Cloudflare 2023-10-04
GitHub February 2020 mysql1 service disruptions GitHub 2020-02-19 – 2020-02-27
Datadog US region infrastructure connectivity issue Datadog 2020-09-24 – 2020-09-25
Skyliner Skyliner
1990 AT&T Long Distance Network Collapse AT&T 1990-01-15
Gliffy Gliffy
Sentry hosted Postgres XID wraparound outage Sentry 2015-07-20 – 2015-07-21
GitHub October 2018 Service Degradation due to MySQL Failover GitHub 2018-10-21 – 2018-10-22
AWS Lambda Service Event in Northern Virginia (US-EAST-1) Region on June 13th, 2023 Amazon 2023-06-13
Twilio billing system incident of July 2013 Twilio 2013-07-18 – 2013-07-20
15 seconds of API downtime during PostgreSQL migration GoCardless
OWASA fluoride overfeed and water main break in Orange County, February 2017 OWASA 2017-02-02
Cloud Filestore ListInstances API failed with error code 429 globally Google 2022-09-13
Slack Slack
PagerDuty notification dispatch system outage of April 2013 PagerDuty 2013-04-13
King's College London Strand Data Centre storage failure King's College London 2016-10-17
Windows Azure Service Disruption on Feb 29th, 2012 Azure 2012-02-29 – 2012-03-01
Northeast blackout U.S.-Canada Power System Outage Task Force 2003-08-14 – 2003-08-18
Firefox Add-ons Outage due to Certificate Expiration Mozilla 2019-05-04
Linux kernel leap second futex timer issue Linux 2012-07-01
Leap second affected Cloudflare DNS Cloudflare 2017-01-01
Mandrill Postgres XID Wraparound Outage February 2019 Mandrill 2019-02-04 – 2019-02-05
GitHub January 28th, 2016 datacenter power disruption GitHub 2016-01-28
Strava upload outage Strava 2014-07-29 – 2014-07-30
Multiple Slack service disruptions in October 2014 Slack
Stripe Stripe
CircleCI DB performance issue CircleCI 2015-07-07 – 2015-07-08
Google Cloud Networking, Storage, and BigQuery reduced capacity for lower priority traffic Google 2022-07-15
Platform.sh EU region outage of August 2016 Platform.sh 2016-08-18
GitHub DNS Outage on January 8, 2014 GitHub 2014-01-08
Firefox HTTP/3 network stack outage Firefox 2022-01-13
Reddit Reddit
GitHub.com database configuration change causes 36-minute outage GitHub 2024-08-14
Slack Outage on January 4th 2021 Slack 2021-01-04
Amazon EC2, EBS, and RDS EU West Region Service Event Amazon
Allegro Allegro
Mars Climate Orbiter unit conversion failure NASA 1998-12-11 – 1999-09-23
AppNexus AppNexus
Google Compute Engine Persistent Disk issue in europe-west1-b Google 2015-08-13 – 2015-08-16
AWS Direct Connect disruption in Tokyo (AP-NORTHEAST-1) on September 2, 2021 Amazon 2021-09-01 – 2021-09-02
AWS US-East Region Service Event of October 22, 2012 Amazon 2012-10-22 – 2012-10-23
Supermarket Intermittent Unresponsiveness Chef.io
Elastic Elastic
Honeycomb Ingest System Outage: Shepherd Cache Delays Honeycomb 2022-09-08
Metrist Metrist
Healthcare.gov Healthcare.gov
Heroku Heroku
npm Fastly VCL misconfiguration outage on 2014-01-28 npm 2014-01-28
GLONASS broadcast ephemerides corruption Roscosmos / GLONASS 2014-04-01 – 2014-04-02
GitHub November 2021 Availability Incident due to MySQL Schema Migration GitHub 2021-11-27
Amazon S3 Availability Event: July 20, 2008 Amazon 2008-07-20
Salesforce Salesforce
Cloudflare global outage due to router configuration error Cloudflare 2013-03-03
OWASA OWASA
Engineering Archives Heroku
CircleCI jobs not starting due to Kubernetes networking failure CircleCI 2023-03-14 – 2023-03-15
Parity Parity
Basecamp Basecamp
Heroku Heroku
CircleCI workflows latency and failures on April 4, 2025 CircleCI 2025-04-04
Amazon Kinesis US-EAST-1 outage November 2020 Amazon 2020-11-25 – 2020-11-26
Facebook Facebook
Travis CI production database truncation Travis CI 2018-03-13
WebKit code repository WebKit code repository
PythonAnywhere storage volume failure on 7 July 2020 PythonAnywhere 2020-07-07
Cloudflare parser bug causes memory leak Cloudflare 2016-09-22 – 2017-02-18
Chrome SyncDataType parsing crash Google 2012-12-10 – 2012-12-11
Instapaper AWS RDS MySQL 2TB File Size Limit Outage Instapaper 2017-02-09 – 2017-02-14
Joyent Joyent
Knight Capital SMARS deployment incident Knight Capital 2012-08-01
Stack Exchange network outage due to StackEgg on March 31, 2015 Stack Exchange 2015-03-31
CircleCI security incident and data exfiltration (December 2022) CircleCI 2022-12-16 – 2022-12-22
Google Cloud GCVE deletion incident impacting UniSuper Google
Google Google
AWS Sydney Region EC2 and EBS power disruption Amazon 2016-06-05
Summary of the Amazon DynamoDB Service Disruption and Related Impacts in the US-East Region Amazon
incident.io incident ID sequence jump incident.io
rust-lang rust-lang
Global Google Cloud API outage due to Service Control null pointer exception Google 2025-06-12 – 2025-06-13
Bungie Bungie
Etsy Etsy
GitHub DNS infrastructure failure and service degradation on October 11, 2024 GitHub 2024-10-11 – 2024-10-12
Cloudflare global outage on July 2, 2019 Cloudflare 2019-07-02
trivago trivago
Amazon EC2 and Amazon RDS Service Disruption in US East Region Amazon 2011-04-21 – 2011-04-24
GitHub Copilot degradation on July 13, 2024 GitHub 2024-07-13
Basecamp Basecamp
Unavailable Guilds & Connection Issues Discord 2017-10-13
Slack’s Incident on 2-22-22 Slack 2022-02-22
Heroku Heroku
Facebook global backbone network outage of October 2021 Facebook
Authentication Latency on DUO1 Deployment Duo 2018-08-29
Google Compute Engine, Cloud VPN, and Network Load Balancer connectivity issues Google 2017-01-30
GitHub GitHub
GitHub availability incidents May 9-11, 2023 GitHub 2023-05-09
PagerDuty PagerDuty
Google Cloud europe-west2 outage due to cooling system failure Google 2022-07-19 – 2022-07-21
AWS US-EAST-1 Internal Network Congestion on December 7, 2021 Amazon 2021-12-07 – 2021-12-08
High queue times on OS X builds (.com and .org) Travis CI 2015-08-04 – 2015-08-06
GitHub mysql1 cluster repeated service disruptions (March 2022) GitHub 2022-03-16 – 2022-03-23
Google Meet Livestream degraded quality Google 2021-10-25 – 2021-10-26
Foursquare MongoDB memory exhaustion outage Foursquare
Honeycomb total outage on July 25th, 2023 Honeycomb 2023-07-25
Leaderboarded production database accidental deletion Keepthescore 2020-10-17
Atlassian April 2022 customer site deletion outage Atlassian 2022-04-05 – 2022-04-18
Stack Exchange Network outage of July 20, 2016 Stack Exchange 2016-07-20
GitHub DDoS attack of March 2014 GitHub 2014-03-11
EVE Online long downtime on July 15th, 2015 CCP Games 2015-07-15
Mailgun Website Intermittent Timeouts Mailgun 2017-01-12
Sentry security incident Sentry 2016-06-12 – 2016-06-14
Valve Valve
Intel Pentium FDIV Bug Intel 1994-06-01 – 1994-12-31
CircleCI UI and build capabilities disruption on April 4, 2025 CircleCI 2025-04-04
ShapeShift Cyberattack ShapeShift 2016-03-14 – 2016-04-09
Honeycomb operational burden and scaling issues in September and October Honeycomb
Google Cloud internal blob storage disruption March 2019 Google 2019-03-13
Google Cloud Networking packet loss May 2022 Google 2022-05-20
ARPANET network-wide outage due to corrupted routing updates ARPANET 1980-10-27
Buildkite outage of August 22nd, 2016 Buildkite 2016-08-22
Google Cloud HTTP(S) Load Balancer 502 errors on April 5, 2017 Google 2017-04-05
TUI reservation system miscalculates G-TAWG takeoff weight TUI
Datadog Infrastructure Connectivity Issue March 2023 Datadog 2023-03-08 – 2023-03-10
BigQuery Storage WriteAPI elevated error rates in US Multi-Region Google 2022-10-13 – 2022-10-14
Gentoo GitHub Organization compromise of June 2018 Gentoo 2018-06-28 – 2018-07-03
Google Google
Amazon ELB Service Event in US-East Region on December 24, 2012 Amazon 2012-12-24 – 2012-12-25
Medium editor bug preventing Polish 'Ś' character input Medium
Spotify Popcount service outage of April 2013 Spotify 2013-04-27
Netflix Netflix
Kickstarter MySQL replication failure Kickstarter 2013-03-07
CircleCI jobs stuck in "not running" state on November 8, 2021 CircleCI 2021-11-08
AWS SA-EAST-1 Availability Zone Power and Network Incident, December 2013 Amazon 2013-12-18
Square service disruption of March 16, 2017 Square 2017-03-16
Cloudflare Control Plane and Analytics Outage due to Flexential Power Failure Cloudflare 2023-11-02 – 2023-11-04
Google Compute Engine global connectivity loss April 2016 Google 2016-04-12
Travis CI database truncation and cross-account session exposure Travis CI 2018-03-13
EVE Online Stackless Python tasklet memory reuse bug CCP Games
Steam client recursively deleted user files on Linux Valve
CircleCI Linux build queue backing up October 2015 CircleCI 2015-10-14 – 2015-10-15
BrowserStack security incident due to Shellshock vulnerability on prototype machine BrowserStack 2014-11-09 – 2014-11-10
Ariane 5 Flight 501 launch failure of June 1996 European Space Agency 1996-06-04
Allegro Allegro
Travis CI container-based Linux builds outage due to worker rollback failure Travis CI 2017-02-02 – 2017-02-05
Cloudflare outage on July 17, 2020 Cloudflare 2020-07-17
Mars Pathfinder system resets due to priority inversion NASA
GitHub Actions and Pages impacted by scoped token INT32 overflow GitHub 2021-05-16
A fire in a Telstra exchange is causing flight delays and network outages Telstra 2017-02-02
Amazon EC2 and EBS Issues in Tokyo (AP-NORTHEAST-1) on August 23, 2019 Amazon 2019-08-23
GitHub availability incidents in February and March 2026 GitHub
Bitly Bitly
Tarsnap outage 2016-07-24 Tarsnap 2016-07-24
Cloudflare service token incident on January 24, 2023 Cloudflare 2023-01-24
Stack Exchange Stack Exchange
Amazon EC2 DNS Resolution Issues in AP-NORTHEAST-2 Amazon 2018-11-21 – 2018-11-22
incident.io GKE Dataplane V2 `anetd` CPU saturation causes connection timeouts incident.io
VZaar VZaar
Kickstarter Kickstarter
Malicious packages reported in JCenter Bintray 2017-07-01 – 2018-12-12
Knight Capital SMARS algorithmic trading incident of August 2012 Knight Capital 2012-08-01
Rule attribute selector causing flag targeting web interface to crash LaunchDarkly 2021-09-30
Travis CI GCE base image deletion Travis CI 2016-08-09
.COM/.NET SRS Production Environment Planned Outage Enom
Azure Storage service interruption November 2014 Microsoft 2014-11-19
Linux kernel leap second deadlock crash on New Year's 2008-2009 Linux 2009-01-01
AWS US East-1 power failure and service disruption in June 2012 Amazon 2012-06-30
Heroku Heroku
GitLab.com database outage of January 31, 2017 GitLab 2017-01-31 – 2017-02-01
Google Code Jam 2014 Repeated Email Incident Google
Healthcare.gov launch failure Centers for Medicare & Medicaid Services (CMS) 2013-10-01 – 2013-12-31
BBC Online outage on Saturday 19th July 2014 BBC Online 2014-07-19 – 2014-07-21
incident.io database outage due to PGAudit incident.io 2025-04-09
CrowdStrike Falcon Content Update Incident of July 2024 CrowdStrike 2024-07-19
Amazon S3 US-EAST-1 outage of February 2017 Amazon 2017-02-28