Author Archives: John Laban

PagerDuty Wants You!

We’re hiring! Interested in working with a team reinventing the stagnant world of IT operations software? Want a job hacking on a product with a proven market and customers ranging in size from startups to Fortune 500s? Interested in working … Continue reading

Posted in Announcements | Leave a comment

Standing on the shoulders of giants and stumbling with them – the Amazon AWS outage’s “pain” statistics

Today, at around 1am Pacific Time, Amazon began having major problems with some of their cloud infrastructure: specifically with their EC2, EBS, and RDS offerings. We’d like to share some statistics on the alerts we sent out – via phone or SMS – during the outage. Continue reading

Posted in Announcements, Blog | 14 Comments

The ups and downs of Availability

This post is meant as a quick introduction to some concepts of system availability, so that subsequent posts in this series make sense. I’ll go over concepts like availability, SLA, mean time between failure, mean time to recovery, etc. Continue reading

Posted in Availability | Tagged , , , | 5 Comments

On-Call Best Practices: Part 1

This is Part 1 in a multi-part series dealing with tips for being on-call. Continue reading

Posted in Best Practices | Tagged , | 6 Comments