Monthly Archives: October 2012

Trading up Your Engine: How to Move Your IOPS-heavy MySQL/Rails Stack to Unicode Without Downtime

You’re a techie working for one of the multitude of startups that rushed to market, where the founders hastily glued a Rails app together with candy-bar wrappers and tinfoil.  Once it became obvious that enthusiasm was no substitute for raw coding power, developers … Continue reading

Posted in Reliability | Tagged , , | 2 Comments

Ensuring the Call Goes Out—Every Time

A few weeks ago I had the privilege of speaking at Surge 2012 in Baltimore, MD. The audience were of those whose focus was on better scaling their infrastructure to be prepared for the worse case scenario, as well as seeking … Continue reading

Posted in Reliability | Tagged , , , | Leave a comment

Growing a Rails Application: How We Made Deploy Fast Again

TL;DR; We brought our deploy time down from 10 minutes to 50 seconds. When I joined PagerDuty over a year ago, our application consisted of essentially a single Rails site. We’ve changed the architecture of our system since then to … Continue reading

Posted in Operations Performance | Tagged , , , , | 9 Comments

Approaching the Hiring of Engineers as a Machine Learning Problem

Hiring software engineers is hard.  We all know this.  If you get past the problem of sourcing and landing good candidates (which is hard in itself), the whole issue of “is this person I’m talking to ‘good enough’ to actually … Continue reading

Posted in Operations Performance | Tagged , , , , , , | 2 Comments

4 Keys to a Website Monitoring Service

This is a guest post by Connie Quach, Sr. Product Manager, responsible for the web performance products at Neustar. In today’s competitive environment, website performance and availability are the keys to success.  When problems arise, you need to be notified … Continue reading

Posted in Reliability | Tagged , , | Leave a comment

Turn on Maintenance. Go Exploring. Break Stuff.

Sometimes you just have to tinker. Experimentation, trial and error are all part and parcel of the learning experience, and the gateway to bigger and better things. Reed Richards didn’t get into the Negative Zone overnight, and he may have … Continue reading

Posted in Reliability | Tagged , , , | 2 Comments