Author Archives: John Laban

Pressure Release Valves

This is the fourth in a series of posts on increasing overall availability of your service or system. Have you ever gotten paged, and known right away that this problem isn’t like the last 15 operations issues you’ve dealt with … Continue reading

Posted in Availability | Tagged , | Leave a comment

A Standard Operating Procedure for when s*IT hits the fan

This is the third in a series of posts on increasing overall availability of your service or system. In the first post of this series, we defined and introduced some concepts of system availability, including mean time between failure – MTBF … Continue reading

Posted in Availability | Tagged , | Leave a comment

More control over Optimistic Locking in Rails

Like pretty much everything else in Rails, optimistic locking is nice and easy to setup:  you simply add a “lock_version” column to your ActiveRecord model and you’re all set.  If a given Rails process is trying to update some record, … Continue reading

Posted in Code | Tagged , | Leave a comment

Availability lessons from shoe companies and ancient warlords

This is the second in a series of posts on increasing overall availability of your service or system. In the first post of this series, we defined and introduced some concepts of system availability, including mean time between failure – … Continue reading

Posted in Availability | Tagged , | 1 Comment

Getting the most out of PagerDuty: Incident De-Duping

Tired of getting a flood of PagerDuty incidents whenever a problem occurs with one of your systems?  Do many of the incidents seem identical?  Do you spend valuable time trying to fend off the seemingly never-ending PagerDuty phone calls and … Continue reading

Posted in Best Practices, Features | Leave a comment

Velocity Contest Winners

Velocity 2011 was a blast! Thanks to everyone who came by our booth to find more about PagerDuty, snag a t-shirt, and enter our contest. Continue reading

Posted in Announcements, Blog, Community | Leave a comment

New APIs Available Now

Have you ever said to yourself: “PagerDuty is great, but I wish I could better integrate it into the custom tools I already use.” Or maybe: “Why can’t I see more reports on the number of incidents each of my … Continue reading

Posted in Announcements, Features | Tagged | 7 Comments

See you at Velocity 2011

PagerDuty is excited to be attending the O’Reilly Velocity Conference 2011 next week in Santa Clara, CA. Velocity is a great venue that focuses on helping Web companies overcome the challenges of developing fast, scalable and reliable sites and services. … Continue reading

Posted in Announcements, Community | Tagged | 1 Comment

PagerDuty Wants You!

We’re hiring! Interested in working with a team reinventing the stagnant world of IT operations software? Want a job hacking on a product with a proven market and customers ranging in size from startups to Fortune 500s? Interested in working … Continue reading

Posted in Announcements | Leave a comment

Standing on the shoulders of giants and stumbling with them – the Amazon AWS outage’s “pain” statistics

Today, at around 1am Pacific Time, Amazon began having major problems with some of their cloud infrastructure: specifically with their EC2, EBS, and RDS offerings. We’d like to share some statistics on the alerts we sent out – via phone or SMS – during the outage. Continue reading

Posted in Announcements, Blog | 14 Comments