<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>PagerDuty Blog &#187; Uncategorized</title>
	<atom:link href="http://blog.pagerduty.com/category/uncategorized/feed/" rel="self" type="application/rss+xml" />
	<link>http://blog.pagerduty.com</link>
	<description></description>
	<lastBuildDate>Mon, 30 Jan 2012 21:09:45 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.2.1</generator>
		<item>
		<title>Assorted New Features</title>
		<link>http://blog.pagerduty.com/2010/06/14/assorted-new-features/</link>
		<comments>http://blog.pagerduty.com/2010/06/14/assorted-new-features/#comments</comments>
		<pubDate>Mon, 14 Jun 2010 19:42:38 +0000</pubDate>
		<dc:creator>Andrew Miklas</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://blog.pagerduty.com/?p=117</guid>
		<description><![CDATA[Set of new features to improve PagerDuty. <a href="http://blog.pagerduty.com/2010/06/14/assorted-new-features/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>Browsing through our UserVoice feature requests is a pretty humbling experience for all of us working on PagerDuty.  It seems that as far as we&#8217;ve come with PagerDuty in our first year, we have at least another ten years of work ahead!</p>
<p>We&#8217;re just putting the finishing touches on the Nagios integration API now.  We&#8217;re going to have the first version of this out in the next two weeks or so.  But in the mean time, we thought we should show you all some of the other smaller features we&#8217;ve recently launched.</p>
<h1>Looping on Escalation Chains</h1>
<p>Up until now, incidents ran exactly once through their escalation policies.  Thus, unanswered incidents remained assigned to the last person on the escalation chain. Needless to say, this caused some problems if an alert made it through the escalation process without anyone taking action.</p>
<p>To ensure that open incidents are <em>always </em>dealt with, the final rule of an escalation policy can now direct PagerDuty to reassign the incident to the first person in the chain, and begin the escalation process anew.</p>
<p>We&#8217;re especially curious if anyone needs additional flexibility in the escalation policies.  Would the ability to loop back to a rule other than the first be useful to anyone?</p>
<p><img class="aligncenter size-full wp-image-127" title="PagerDuty Escalation Policies" src="http://pagerduty.zippykid.com/wp-content/uploads/pagerduty_escalation.png" alt="PagerDuty Escalation Policies" width="530" height="301" /></p>
<h1>Better Regex Support</h1>
<p>We&#8217;ve made it possible to specify both &#8220;AND&#8221; and &#8220;OR&#8221; trigger message regex filters.  We&#8217;ve also added the option to filter incoming messages based on the &#8220;from&#8221; address.  If you&#8217;ve ever accidentally hit &#8220;reply-all&#8221; on a trigger message you&#8217;ve been CC&#8217;ed, you&#8217;ll know exactly why we&#8217;ve added the from filter option.</p>
<p><img class="aligncenter size-full wp-image-129" title="PagerDuty Service Email Filters" src="http://pagerduty.zippykid.com/wp-content/uploads/pagerduty_regex.png" alt="PagerDuty Service Email Filters" width="530" height="222" /></p>
<h1>SSL &amp; TLS</h1>
<p>Our customers often find themselves needing to log into their PagerDuty accounts while on open WiFi points at airports, coffee shops, and the like.  Up until now, this was sort of a dicey proposition, since only the PagerDuty login and billing pages were SSL protected.  By popular request, we&#8217;ve added the option to enable SSL across your entire PagerDuty account.  To enable this option, get your account owner to visit the &#8220;Account Settings&#8221; page and flip on the SSL option.</p>
<p><img class="aligncenter size-full wp-image-124" title="PagerDuty Account  Settings - SSL" src="http://pagerduty.zippykid.com/wp-content/uploads/pagerduty_ssl.png" alt="PagerDuty Account Settings - SSL" width="530" height="242" /></p>
<p>By the way, we&#8217;ve also configured our mail servers to accept TLS protected SMTP sessions&#8230; perfect in case you suspect your network operator or upstream provider has some <a title="BOFH" href="http://www.theregister.co.uk/odds/bofh/">BOFH</a> tendencies.  Simply configure your outbound mail servers to use TLS opportunistically, and you should be all set.  If you&#8217;d like to check to see if your mail is being received encrypted at our end, click &#8220;View Email&#8221; on an incident trigger and then use the &#8220;View Raw Message&#8221; link.  If the message is encrypted, the last hop listed in the receive headers will mention a TLS-enabled connection.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.pagerduty.com/2010/06/14/assorted-new-features/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>PagerDuty 2.0</title>
		<link>http://blog.pagerduty.com/2010/04/12/pagerduty-20/</link>
		<comments>http://blog.pagerduty.com/2010/04/12/pagerduty-20/#comments</comments>
		<pubDate>Mon, 12 Apr 2010 05:48:56 +0000</pubDate>
		<dc:creator>Alex Solomon</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://blog.pagerduty.com/?p=89</guid>
		<description><![CDATA[We're happy to announce that we've released the new version of PagerDuty with multi-incident support. <a href="http://blog.pagerduty.com/2010/04/12/pagerduty-20/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>We&#8217;re happy to announce we&#8217;ve released the new version of PagerDuty, which has multi-incident support. To try it out, just log into your PagerDuty account.</p>
<p>This new feature corrects an over-simplification in PagerDuty&#8217;s design: up to now, PD required you to create a new alarm for each type of problem that your monitoring systems are capable of detecting. Unfortunately, this doesn&#8217;t work very well if you&#8217;re using a monitoring tool like Nagios, which can monitor thousands of hosts and services at once. The new release can now handle multiple open incidents from a single monitoring system; we call this &#8220;multi-incident support&#8221;.</p>
<p>Here&#8217;s a quick summary of the changes in the new release:</p>
<ul>
<li>Alarms have been renamed to Services.</li>
<li>Alarm Groups have been renamed to Escalation Policies.</li>
<li>Services can now track multiple open incidents at once.</li>
<li>Incident &#8220;suppression&#8221; has been renamed to &#8220;acknowledgement&#8221;.</li>
<li>The amount of time an incident stays Acknowledged is now configurable on a service-by-service basis</li>
</ul>
<p>The new version of PD is 100% backwards compatible with the previous version. Yes, we&#8217;ve renamed a bunch of stuff, but we&#8217;ve been very careful to retain the same behavior as the old version for your existing services. Read on for more details.</p>
<h2>The big change: Multi-Incident Support</h2>
<p>PagerDuty is now capable of tracking multiple open concurrent incidents.  Put another way, your monitoring system can tell PagerDuty about 100 simultaneous and independent problems without you needing to create 100 PagerDuty alarms (as was the case in the old version of PD).</p>
<p>PagerDuty now uses “incidents” rather than “alarms” as the main object.  Your support team will be acknowledging, escalating, and resolving incidents, instead of alarms.  Incidents in PagerDuty are similar to tickets in a bug tracking system: they are created when a problem is detected, and are resolved or closed when the problem is fixed.</p>
<p>Since PagerDuty can now handle hundreds of open incidents at once, we’ve tried to carefully design PagerDuty’s interface to make it easy to work with large collections of incidents.  The new Incidents and Dashboard tabs feature tables that let you see all of the open incidents assigned to you at a glance.  You can also easily triage your incidents straight from these pages using the controls located at the top of the table.</p>
<p style="text-align: center;"><a href="http://pagerduty.zippykid.com/wp-content/uploads/incidents_tab2.png" target="_blank"><img class="aligncenter size-full wp-image-81" title="Incidents Tab" src="http://pagerduty.zippykid.com/wp-content/uploads/incidents_tab.png" alt="Incidents tab" width="477" height="271" /></a></p>
<h2 style="text-align: left;">Turning on multi-incident support for your PagerDuty services</h2>
<p style="text-align: left;">By default, the PagerDuty services still work the same way they&#8217;ve always worked: they can only have one incident open at once. The reason for this is to maintain backwards compatibility.</p>
<p>You can enable multi-incident support for any existing service. Here&#8217;s how:</p>
<ol>
<li>Click on the &#8220;Services&#8221; tab, and click the &#8220;Edit&#8221; link (under Actions) for the service you wish to modify.</li>
<li>Under the &#8220;Email integration settings&#8221; section, you&#8217;ll see 3 options:
<ul>
<li>Open a new incident for each trigger email</li>
<li>Open a new incident for each new trigger email subject</li>
<li>Open a new incident only if an open incident does not already exist</li>
</ul>
<p style="text-align: left;"><a href="http://pagerduty.zippykid.com/wp-content/uploads/service_email_incident_creation2.png" target="_blank"><img class="aligncenter size-full wp-image-77" title="service_email_incident_creation2" src="http://pagerduty.zippykid.com/wp-content/uploads/service_email_incident_creation.png" alt="Email integration settings" width="477" height="258" /></a><br />
The first option, if selected, will cause the service to open a new incident for each trigger email sent to the service&#8217;s email address.</p>
<p>The second option, if selected, will cause the service to open a new incident based on the email subject: if an open incident with the same subject already exists, the email is appended to this incident; if not, a new incident is created.</p>
<p>The third option, which should be selected by default for an existing service, allows a service to maintain the behavior of the old version of PagerDuty. It basically turns multi-incident support off: if selected, the service can only have one open incident at any one time. When the service receives a trigger eamil, it opens a new incident if the service doesn&#8217;t already have an open incident; otherwise, it appends the email to the open incident.</li>
<li>To turn multi-incident support on, select either the first or second option.</li>
<li>Click &#8220;Save changes&#8221; at the bottom of the page, and you&#8217;re done.</li>
</ol>
<h2>Alarms are now Services</h2>
<p>We’ve renamed “alarms” to “services”.  Services are now used only to represent an integration point between PagerDuty and your monitoring services. Currently, the PagerDuty services integrate with your monitoring systems via email integration (just like in the old version of PD). In the coming weeks, we will also add support for an HTTP-based API for the PagerDuty services. This will allow your monitoring systems to trigger/acknowledge/resolve incidents in PagerDuty via a synchronous API call.</p>
<p>For similar reasons, we’ve renamed “alarm groups” to “escalation policies”.  We think the new name better captures the use of these objects.</p>
<h2>Incident &#8220;suppression&#8221; is now incident &#8220;acknowledgement&#8221;</h2>
<p>We’ve also renamed incident “suppression” to “acknowledge”.  As before, this feature is used to temporarily prevent an incident from generating alerts.  We thought the word “acknowledge” better captured the purpose of the feature: “stop bothering me about this problem for now… I’m working on it!”.</p>
<p>We&#8217;ve also made the acknowledgement timeout configurable on a service-by-service basis. This means that you can set the amount of time that an incident stays in the Acknowledged state, before it reverts to back to Triggered and alerts you again. The timeout is set to 30 minutes by default for each service, but you can change it or even turn it off easily:</p>
<ol>
<li>Click on the &#8220;Services&#8221; tab, and click the &#8220;Edit&#8221; link (under Actions) for the service you wish to modify.</li>
<li>Under the &#8220;Incident settings&#8221; section, you&#8217;ll see an entry for the &#8220;Incident ack timeout&#8221;.
<p style="text-align: center;"><a href="http://pagerduty.zippykid.com/wp-content/uploads/incident_ack_timeout.png"><img class="size-full wp-image-106 aligncenter" style="border: 1px solid #666666;" title="incident_ack_timeout" src="http://pagerduty.zippykid.com/wp-content/uploads/incident_ack_timeout.png" alt="Incident ack timeout" width="477" height="155" /></a></p>
</li>
<li>By default, the timeout is set to &#8220;30 minutes&#8221;. To modify the timeout, click and change the value of this drop-down.You can also disable the timeout altogether, by unchecking the checkbox labeled &#8220;Enable a timeout for incidents left in the Acknowledged state for too long&#8221;. We recommend leaving the timeout enabled, to ensure you don&#8217;t forget incidents in the Acknowledged state.</li>
<li>Click &#8220;Save changes&#8221; at the bottom of the page, and you&#8217;re done.</li>
</ol>
<h2>What’s next?</h2>
<p>Next up is support for a PagerDuty API. This will make it easier to integrate PagerDuty with popular monitoring tools like Nagios, Zenoss, monit, Munin and many others. The API will allow your monitoring system to trigger, acknowledge and resolve incidents directly in PagerDuty, via a synchronous call to the API.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.pagerduty.com/2010/04/12/pagerduty-20/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>New Feature: Alarm Auto-resolution</title>
		<link>http://blog.pagerduty.com/2010/03/07/new-feature-alarm-auto-resolution/</link>
		<comments>http://blog.pagerduty.com/2010/03/07/new-feature-alarm-auto-resolution/#comments</comments>
		<pubDate>Sun, 07 Mar 2010 05:16:49 +0000</pubDate>
		<dc:creator>Alex Solomon</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://blog.pagerduty.com/?p=55</guid>
		<description><![CDATA[Announcing Alarm Auto-resolution <a href="http://blog.pagerduty.com/2010/03/07/new-feature-alarm-auto-resolution/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>We&#8217;d like to announce a new PagerDuty feature: auto-resolution of alarms. Auto-resolution is a setting on the PagerDuty alarms; if enabled, an alarm will automatically resolve itself after a specified amount of time.</p>
<p>Alarm auto-resolution is an important safety mechanism in case you forget an alarm in the Triggered state. This all makes perfect sense if you understand how the PagerDuty alarms work.</p>
<p>Alarms in PagerDuty are stateful. Each alarm starts out in the Idle state. Upon receiving a trigger email, the alarm transitions to the Triggered state and begins to alert your team based on the rules specified by the alarm&#8217;s alarm group. However, if an already Triggered alarm receives additional trigger emails, it logs them but *does not re-start the alerting process*. This can be dangerous, as I&#8217;ll explain below.</p>
<p>In the normal case, an alarm is triggered and notifies the person on-call. That person receives the phone/SMS/email alert, fixes the problem and resolves the alarm. In some cases, the person on-call does not receive the alert (this can happen if your cell runs out of batteries, or has no reception, or you forget your phone in another room and go to sleep). In these cases, the alarm is automatically escalated to a secondary person, who then picks up the alert and resolves the alarm. It&#8217;s also possible (and this has happened a few times to some of our customers) that an alarm triggers and contacts all of the people in the escalation chain, but nobody picks it up.</p>
<p>When an alarm runs out of people to notify, it stays in the Triggered state until someone resolves it. This is a dangerous state for an alarm to be in, because, as I mentioned above, any trigger emails to the alarm will not restart the alerting process. The alarm must be explicitly resolved to re-enable alerting.</p>
<p><img class="aligncenter size-full wp-image-58" title="autores_600" src="http://pagerduty.zippykid.com/wp-content/uploads/autores_600.png" alt="autores_600" width="600" height="140" /><br />
This is where auto-resolution comes in. We strongly recommend you turn it on for all of your alarms. Here&#8217;s how to enable auto-resolution for an alarm:</p>
<ol>
<li>Click on the Alarms tab, and click one of your alarms.</li>
<li>Near the top of the page, you&#8217;ll see &#8220;Auto resolve&#8221;. Click &#8220;change&#8221;.</li>
<li>Set the amount of time after which the alarm is auto-resolved. This should be set according to the amount of time an alarm would take to run out of people to notify (as specified by the rules set in your alarm group).</li>
</ol>
]]></content:encoded>
			<wfw:commentRss>http://blog.pagerduty.com/2010/03/07/new-feature-alarm-auto-resolution/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

<!-- Performance optimized by W3 Total Cache. Learn more: http://www.w3-edge.com/wordpress-plugins/

Minified using apc
Page Caching using apc
Database Caching 1/16 queries in 0.009 seconds using memcached
Content Delivery Network via pagerduty.zkimg.com

Served from: blog.pagerduty.com @ 2012-02-05 10:59:35 -->
