Hacker News new | past | comments | ask | show | jobs | submit login
That one time Keygen went down for 5 hours (twice) (keygen.sh)
3 points by ezekg 10 months ago | hide | past | favorite | 1 comment



I see resolutions for protecting against the now-known error modes discussed, and better alerting to get the on-call engineer (aka always Zeke :D) looking into things quicker, but curious how they might approach preparing for "unknown-unknowns" that will come in the future.

Are there good ways for a small-team to proactively stress test a system without mucking up customers? Open question.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: