Learn from Salesforce.com

When a Power Surge Cost Millions:
The Salesforce NA14 Incident

In 2016, a power surge at Salesforce triggered a chain of failures resulting in data loss for thousands of customers. Incident Drill allows your team to practice responding to similar high-pressure situations, minimizing real-world damage and downtime.

Salesforce.com | 2016 | Outage & Data Loss (Cloud CRM)

The High Stakes of Infrastructure Failure

Cloud infrastructure is complex, and even robust systems can fail. The Salesforce NA14 incident highlights the devastating impact of a single point of failure compounded by software bugs. It underscores the critical need for thorough testing, incident response planning, and data recovery strategies.

PREPARE YOUR TEAM

How Incident Drill Prepares Your Team

Incident Drill provides a safe and realistic environment to practice incident response. Simulate scenarios like the Salesforce NA14 outage, allowing your engineers to hone their skills in diagnostics, communication, and recovery without risking real customer data.

🚨

Realistic Simulations

Experience incidents with realistic failure modes and cascading effects.

🧑‍💻

Hands-on Exercises

Engage in practical exercises to diagnose, mitigate, and resolve incidents.

💬

Collaborative Environment

Work together with your team to improve communication and coordination.

📈

Performance Metrics

Track individual and team performance to identify areas for improvement.

📚

Post-Incident Analysis

Conduct thorough post-incident reviews to extract valuable lessons.

🛠️

Customizable Scenarios

Tailor incident simulations to your specific infrastructure and environment.

WHY TEAMS PRACTICE THIS

Prepare for the Unthinkable

  • Reduce Mean Time to Resolution (MTTR)
  • Improve Incident Response Coordination
  • Identify System Vulnerabilities
  • Enhance Team Communication
  • Minimize Data Loss Risk
  • Build Confidence in Critical Systems

Salesforce NA14 Outage Timeline

15:43 PST
Power surge impacts primary data center.
15:45 PST
Storage array controller fails. Critical
15:48 PST
Firmware bug causes database corruption.
16:00 PST
Failover to secondary site fails.
21:00 PST
Restoration from 5-hour-old backup initiated. Partial Recovery

How It Works

1

Step 1: Understand the Incident

Review the details of the Salesforce NA14 outage and its impact.

2

Step 2: Simulate the Scenario

Run an Incident Drill simulation mirroring the key events of the outage.

3

Step 3: Analyze the Response

Evaluate your team's performance and identify areas for improvement.

4

Step 4: Implement Preventative Measures

Apply lessons learned to strengthen your infrastructure and processes.

Ready to Prevent Your Own NA14?

Join the Incident Drill waitlist and be among the first to experience realistic incident simulations. Prepare your team for anything.

Get Early Access
Founding client discounts Shape the roadmap Direct founder support

Join the Incident Drill waitlist

Drop your email and we'll reach out with private beta invites and roadmap updates.