Category Archives: Best Practices

How Cascadeo Integrates PagerDuty Into Its NOC, Instant Messaging and Ops Support Platform

Over the past few years, PagerDuty has alerted thousands of users, letting them know when their systems are down. It’s what we do, and we’re proud to be seen as an integral part of their monitoring solution. Every once in … Continue reading

Posted in Best Practices, Blog, Customer | Tagged , , , | Leave a comment

Trading up Your Engine: How to Move Your IOPS-heavy MySQL/Rails Stack to Unicode Without Downtime

You’re a techie working for one of the multitude of startups that rushed to market, where the founders hastily glued a Rails app together with candy-bar wrappers and tinfoil.  Once it became obvious that enthusiasm was no substitute for raw coding power, developers … Continue reading

Posted in Best Practices, Blog, Code | Tagged , , | 2 Comments

Ensuring the Call Goes Out—Every Time

A few weeks ago I had the privilege of speaking at Surge 2012 in Baltimore, MD. The audience were of those whose focus was on better scaling their infrastructure to be prepared for the worse case scenario, as well as seeking … Continue reading

Posted in Best Practices, Events | Tagged , , , | Leave a comment

Growing a Rails Application: How We Made Deploy Fast Again

TL;DR; We brought our deploy time down from 10 minutes to 50 seconds. When I joined PagerDuty over a year ago, our application consisted of essentially a single Rails site. We’ve changed the architecture of our system since then to … Continue reading

Posted in Best Practices, Blog, Code | Tagged , , , , | 5 Comments

Approaching the Hiring of Engineers as a Machine Learning Problem

Hiring software engineers is hard.  We all know this.  If you get past the problem of sourcing and landing good candidates (which is hard in itself), the whole issue of “is this person I’m talking to ‘good enough’ to actually … Continue reading

Posted in Best Practices, Blog | Tagged , , , , , , | Leave a comment

4 Keys to a Website Monitoring Service

This is a guest post by Connie Quach, Sr. Product Manager, responsible for the web performance products at Neustar. In today’s competitive environment, website performance and availability are the keys to success.  When problems arise, you need to be notified … Continue reading

Posted in Best Practices, Guest Blog | Tagged , , | Leave a comment

How to provide 24×7 phone support using PagerDuty and RingCentral

Customers always expect great support from every business, and they ought to if they pay a premium for it. Providing awesome support is a lot of work and providing it 24×7 is even harder due to all the coordination that … Continue reading

Posted in Best Practices | Tagged , , | Leave a comment

How dotCloud, Instagram and One Crafty Systems Administrator are Using PagerDuty

Monitoring your infrastructure. It can be challenging, but that’s why you have all of the tools in place to make sure you don’t miss a beat when things go wrong. You’ve probably got Nagios monitoring your overall infrastructure, Pingdom or … Continue reading

Posted in Best Practices | Tagged , , , | 3 Comments

On-call best practices: Page your manager

Having one person on-call isn’t enough. What happens if your on-call engineer sleeps through their alert? What happens if their phone’s battery dies without them knowing, or if they get an alert at a really inconvenient time, like when stuck … Continue reading

Posted in Best Practices | Tagged , | 2 Comments

Not breaking your Google Analytics (like a pro)

As a general rule, whatever percentage you think your test coverage is, it isn’t. Whatever amount of the known surface area you’re covering, there’s going to be an exciting swath of things you didn’t realize that you need to test. … Continue reading

Posted in Best Practices | Tagged , , , | Leave a comment