// a blog about systems that stagger but don't fall
lurch.dev
Notes on resilience engineering, fault tolerance, and the
endearing wobbliness of distributed systems
that stagger under load and somehow keep
walking forward.
v1.4 // unstable
build: passing*
uptime: 99.2%
+0px
post-mortem // 17
The 4-byte typo that ate Tuesday
2026-02-22 · incident write-up
We deployed a config change. The config change had a typo. The typo
looked like a flag. The flag was real. Reader, the flag was real.
full timeline →
+3px
post-mortem // 16
When DNS went on holiday
2026-01-30 · incident write-up
DNS didn't break. DNS just got slow. Slow enough that everything
that depended on DNS got slow. We learned a lot about our own
timeouts that day.
full timeline →
-4px
post-mortem // 15
A leap second walks into a bar
2025-12-31 · incident write-up
The bartender says, "we don't serve your kind here." Then he serves
it anyway, because the bar is a Linux box and Linux boxes have to
serve everyone, even the leap seconds.
full timeline →