3 Major New Features – Part 2: The Nagios -> PagerDuty API

This is the second article in a three-part series about the latest improvements to PagerDuty. Be sure to check out Part 1 and Part 3.

We’ve just released a Nagios API for PagerDuty.  If you’re using Nagios to monitor your hosts, you no longer have to use PagerDuty’s email integration mechanism to get SMSes and phone calls from your Nagios installation.  Instead, you can bypass the email step completely and have Nagios communicate problem, acknowledgement, and recovery messages directly to PagerDuty via an HTTPS API.


The main benefit of the API over the email integration mechanism is that PagerDuty can now automatically close out incidents when Nagios reports that the problem has been fixed.  No more getting a call 30 minutes after fixing a problem because you forgot to mark the incident as resolved in PagerDuty!  Also, since the API allows us to distinguish between PROBLEM and RECOVERY messages, PagerDuty will no longer spuriously start the alerting process on a RECOVERY message.
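To make the difference concrete, here is a minimal Python sketch of how a receiver could treat each Nagios notification type. It is an illustration only, not PagerDuty's actual implementation: the `IncidentAction` enum, the `NOTIFICATION_ACTIONS` table, and both helper functions are hypothetical names invented for this example. The notification type strings are the standard values Nagios puts in its `$NOTIFICATIONTYPE$` macro.

```python
from enum import Enum


class IncidentAction(Enum):
    """Hypothetical incident actions, named for this example only."""
    TRIGGER = "trigger"
    ACKNOWLEDGE = "acknowledge"
    RESOLVE = "resolve"


# Assumed mapping from Nagios notification types to incident actions.
NOTIFICATION_ACTIONS = {
    "PROBLEM": IncidentAction.TRIGGER,
    "ACKNOWLEDGEMENT": IncidentAction.ACKNOWLEDGE,
    "RECOVERY": IncidentAction.RESOLVE,
}


def action_for(notification_type):
    """Return the action for a notification type, or None if it is not handled."""
    return NOTIFICATION_ACTIONS.get(notification_type.strip().upper())


def should_alert(notification_type):
    """Only a PROBLEM starts the alerting process; a RECOVERY never does."""
    return action_for(notification_type) is IncidentAction.TRIGGER
```

With plain email, every message looks the same, so a RECOVERY can only be treated as yet another alert. Once the type is known, a RECOVERY can close the incident instead of paging someone.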

Using the new Nagios API is simple: create a Nagios service within PagerDuty, copy a little Perl script to your Nagios server, and then add a “pseudo-contact” to your Nagios config corresponding to the new service.  For step-by-step details on how to do this, please take a look at our Nagios integration guide.

By switching your Nagios installation to use the API, you’ll be able to benefit from a number of new PagerDuty features we have planned.  One feature now in the works is the ability to have PagerDuty send out email and SMS alerts when an incident is resolved.  With this feature, you’ll be able to see at a glance whether an issue has resolved itself before crawling out of bed at 3am.

Another feature we’re now considering is the ability to assign Nagios alerts to different PagerDuty Escalation Policies based on Nagios variables such as the HOSTGROUP and SERVICEGROUP.  Let us know if this sounds useful to you — we’d love to know if this is something that your ops team would use.
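As a rough picture of what group-based routing might look like, here is a small Python sketch. None of it is an existing PagerDuty feature or API: the `POLICY_BY_GROUP` table, the group and policy names, and `choose_policy` are all hypothetical. It also assumes the group names arrive as comma-separated strings, which is how Nagios formats its `$HOSTGROUPNAMES$` and `$SERVICEGROUPNAMES$` macros.

```python
# Hypothetical fallback policy when no group matches.
DEFAULT_POLICY = "sysadmin"

# Hypothetical routing table: Nagios group name -> escalation policy name.
POLICY_BY_GROUP = {
    "database-servers": "dba",
    "mysql-checks": "dba",
}


def split_groups(raw):
    """Split a comma-separated group list into clean names, dropping blanks."""
    return [name.strip() for name in raw.split(",") if name.strip()]


def choose_policy(hostgroups="", servicegroups=""):
    """Pick a policy from the first matching group, service groups first."""
    for group in split_groups(servicegroups) + split_groups(hostgroups):
        if group in POLICY_BY_GROUP:
            return POLICY_BY_GROUP[group]
    return DEFAULT_POLICY
```

Checking service groups before host groups is a design choice made for this sketch, on the assumption that a service-level grouping is usually the more specific of the two.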


4 Responses to 3 Major New Features – Part 2: The Nagios -> PagerDuty API

  1. Nick says:

    +100 to using the HOSTGROUP and SERVICEGROUP to choose an escalation policy. 

  2. Inquiring Mind says:

    In the meantime, while HOSTGROUP/SERVICEGROUP support does not exist, how can we route specific Nagios alerts to specific rotations?  For example, if one has two rotations named ‘sysadmin’ and ‘dba’, and one wishes to send any database-related Nagios alerts to the ‘dba’ rotation, do you have to do something equivalent to:

    1. Create a PagerDuty service for each rotation
    2. Create two different PagerDuty users in Nagios (one for each API key)
    3. Assign the Nagios user associated with the ‘dba’ PagerDuty service’s API key as the contact for the database-related services?

    Or am I missing something here?  Or alternatively, if I’m not being dense, is there some spiffier way of accomplishing this?

  3. pagerdutyusolame says:

    +1000 for doing -something-…

    AUG 3 2010… what happened, guys?

  4. Octavian says:

    Any update on the different escalation policies based on HOSTGROUP/SERVICEGROUP?
