Postmortem Index

Explore incident reports from various companies


A 43-second network partition during maintenance triggered a MySQL master failover, but the new master was missing several seconds of writes that had not yet propagated to it because of cross-continent latency. Restoration work to maintain data integrity took 24+ hours.
