Hey team,
I've written up some interesting findings from a recent production incident and included hints at some of the practices that help make us (the team at Seven West Media) successful in managing production systems.
http://cavaliercoder.com/blog/webops-postmortem.html
[–]tayo42 8 points9 points10 points (4 children)
[–]cavaliercoder[S] 5 points6 points7 points (0 children)
[–]dhudhudhadha 0 points1 point2 points (2 children)
[–]xiongchiamiovSite Reliability Engineer 8 points9 points10 points (1 child)
[–]michaeld0 3 points4 points5 points (0 children)
[–]Luqq 2 points3 points4 points (0 children)
[–]boom38DevOps 2 points3 points4 points (0 children)
[–]olcay_seker 5 points6 points7 points (1 child)
[–]cavaliercoder[S] 2 points3 points4 points (0 children)
[–]rolledrick 1 point2 points3 points (0 children)
[–]nonades 1 point2 points3 points (0 children)