Action Item: Ban All Vacations
by J. Paul Reed (Netflix, USA) “I just got pager alert! Our service is reporting 100% CPU utilization; any ideas?” “Not sure, but let’s start […]
by J. Paul Reed (Netflix, USA) “I just got pager alert! Our service is reporting 100% CPU utilization; any ideas?” “Not sure, but let’s start […]
By Laura Maguire, PhD Critical digital infrastructure (CDI) is increasingly at the core of many high-risk domains that serve societal needs. Electronic health records, military […]
By Lorin Hochstein (USA) Unexpected events that happen in production are often learning opportunities. At Netflix, we encourage teams to write up operational surprises that […]
By: Beth Lay Asher Balkin is a Research Engineer at the Cognitive Systems Engineering Laboratory at Ohio State University, Columbus, Ohio. Asher has worked in […]
What knowledge, strategies and practices do we have at hand to respond in a flexible manner to COVID-19? Lessons learned from Resilience Engineering and Societal […]
Contribution from J. Paul Reed Presentation videos from this year’s REdeploy, a Resilience Engineering conference focused on the software development and operations industry, were recently posted. […]
It is everywhere, it is hot, and we have a lot to contribute In a 2018 special issue of the Harvard Business Review, provocatively entitled […]
By: Beth Lay Silence. We have Thai Wood on the phone. Asher looks at me, puzzled, then mouths “who is interviewing who?” Asher and I […]