| In ITOps

In IT, teams are responsible for maintaining a vast number of what are known as IT assets. IT assets include just about every tangible and…

| In ITOps, Monitoring

The efficacy of detecting and proactively preventing downtime often hinges on how far your visibility expands across your IT environment and how up to date…

| In ITOps, ITOps & Modern Ops

As a broad umbrella term, IT Operations, or “ITOps” as it’s commonly known, is a term generally covering an organization’s IT workforce outside of software…

| In SecOps, Security

In securing the modern threat landscape, many organizations turn to Security Information and Event Management (SIEM) as their best practices solution for aggregating and analyzing…

| In Engineering

Chaos testing was created just over ten years ago thanks to the same company that gave us Tiger King and The Queen’s Gambit—Netflix. In 2010,…

| In Best Practices & Insights, Culture

After the unfortunate Commonwealth Bank of Australia outage last week, the powerful Payment Systems Board—whose members include the chairs of the RBA and APRA –…

| In Reliability

On June 3rd and 4th, PagerDuty’s Notification Pipeline suffered two large SEV-1 outages. On the 3rd, the outage resulted in a period of poor performance…

| In Reliability

On April 14th, PagerDuty suffered an outage that affected customers on both the mobile and web applications. During the period of the outage, customers may…

| In Reliability

On March 25th, PagerDuty suffered intermittent service degradation over a three hour span, which affected our customers in a variety of ways. During the service…

| In Reliability

At PagerDuty, our customers rely on us to be highly-available and reliable when their infrastructure may not be. Unfortunately, sometimes bugs may surface in our…

| In Reliability

At PagerDuty we offer transparency of any outage that negatively impacts PagerDuty customers. We are proud of PagerDuty’s superior reliability, but occasionally we may have…

| In Operations Performance, Reliability

At PagerDuty we’ve invested in superior reliability of our service. We strive for 100% uptime to ensure that any events detected by your monitoring tools…

| In Reliability

On Dec 11th, PagerDuty suffered an outage which affected a subset of customers and blocked access to all pagerduty.com addresses. First off, we deeply apologize…

| In Reliability

At PagerDuty, we usually get a front seat to anything that’s wrong with the internet. Last weekend, a derecho storm took out 7% of AWS…

| In Reliability

On the evening of Friday, June 29th, Amazon Web Services (AWS) experienced a major outage at its North Virginia location due to a loss of…