Previous incidents

August 2025
Aug 28, 2025
1 incident

Deepgram Aura-2 TTS Performance Degradation

Resolved Aug 28 at 12:04pm PDT

Deepgram is investigating an issue where a subset of requests may return elevated rates of 5XX errors or experience significantly higher time to first byte

Aug 27, 2025
1 incident

Aura-2 TTS Performance Degradation

Degraded

Resolved Aug 27 at 02:00pm PDT

This incident has been resolved.

1 previous update

Aug 25, 2025
1 incident

Intermittent issues with the dashboard loading

Resolved Aug 25 at 05:04pm PDT

The issue was pinpointed and reverted quickly

2 previous updates

Aug 05, 2025
1 incident

Elevated error rates on ElevenLabs Voice requests

Degraded

Resolved Aug 05 at 09:14pm PDT

Elevenlabs released a fix and is fully operational now. We are also seeing normal levels, but will continue to monitor.

For impacted users, we recommend implementing Vapi Fallback Plan to automatically failover in the future
https://docs.vapi.ai/voice-fallback-plan

1 previous update

Aug 04, 2025
1 incident

API and calls were momentarily disrupted

Resolved Aug 05 at 12:12pm PDT

IR August 4th: Call Degradation due to Pod Evictions

TL;DR

On August 4th, an incident occurred due to aggressive pod consolidation by Karpenter, which caused Redis pods to be evicted and restarted. This led to API pod failures, triggering a failover to an outdated networking component, resulting in dropped calls. The incident caused a total of 393 calls to be dropped.

Timeline (PST)

August 4th

  • 11:02-11:27 AM - Core team identifies Karpenter pods in CrashLoopBackOff (OOMKi...

1 previous update

July 2025
Jul 28, 2025
1 incident

Deepgram is experiencing connection issues

Degraded

Resolved Jul 28 at 10:19pm PDT

We are no longer seeing signs of connection issues. The issue should be resolved now, but we will continue to monitor. We apologize for any inconvenience caused.

1 previous update

Jul 25, 2025
1 incident

Incident Report: Increased call failures on July 25 (PST)

Resolved Jul 25 at 02:46pm PDT

Incident Report:
Increased call failures on July 25 (PST)

Summary (TL;DR)
On July 25 between 7:00–7:15am PST, a spike in call volume caused some calls to fail with a worker-not-available error. The fallback service for short calls (our serverless workers) could not start because its image architecture didn’t match the configured runtime (image built for ARM64, runtime set to x86). We stabilized the platform by scaling primary workers and then corrected the configuration. Service is o...

Jul 24, 2025
1 incident

Elevenlabs is experiencing increased Text-to-Speech latency

Degraded

Resolved Jul 24 at 11:30am PDT

Elevenlabs released a hotfix and is fully operational now.

1 previous update

Jul 17, 2025
1 incident

Web Calls from Assistant Page are unavailable

Degraded

Resolved Jul 17 at 10:50am PDT

Dashboard calls should be back to normal.

1 previous update

Jul 15, 2025
1 incident

Deepgram transcription is degraded

Degraded

Resolved Jul 17 at 10:34am PDT

This was resolved

1 previous update

Jul 09, 2025
1 incident

Up to date calls not returned in API or dashboard

Degraded

Resolved Jul 09 at 09:24pm PDT

We have identified and resolved the issue. Apologies for the disruption.

1 previous update

Jul 06, 2025
1 incident

Playht voices are degraded

Degraded

Resolved Jul 07 at 11:18pm PDT

Resolved now.

1 previous update

June 2025
Jun 25, 2025
2 incidents

SIP is degraded

Degraded

Resolved Jun 30 at 09:43pm PDT

TLDR: A temporary slowdown caused by saturation in our API gateway layer increased response times until they exceeded the edge-network timeout, causing a 524 HTTP response for some API requests.

Timeline in PST
01:00 AM First elevated 524 error responses detected
06:35 AM Rolled back recent backend release (no improvement)
07:19 AM Rolled back related network changes (no improvement).
08:22 AM Scaled up API gateway
09:36 AM Scaled up API gateway further
10:00 AM Reverted the previous night's...

4 previous updates

DB High Latency

Degraded

Resolved Jun 25 at 11:54am PDT

The issue has been resolved: https://neonstatus.com/aws-us-west-oregon/incidents/01JYM23FB7HR82VPZR9DBKVPP8#01JYM4F8Y8MZARWJWRFKV01AP5.

1 previous update

Jun 20, 2025
1 incident

Increased database latency causing requests to fail

Degraded

Resolved Jun 20 at 01:43pm PDT

Our database provider has reported this issue as resolved from their end

1 previous update

Jun 17, 2025
1 incident

Breaking changes to Success Evaluation API Response

Degraded

Resolved Jun 18 at 07:42pm PDT

TL;DR

In response to hallucinations reports in the Success Evaluation feature, we updated our integration with Gemini LLM to use Structured Output. This inadvertently changed the type of the call.analysis.successEvaluation field from string | null to string | number | boolean | null, introducing a breaking change for customers with strict type validation and those using Vapi Server SDKs.

Timeline (all in PT)

  • June 12, 11:32pm: Enterprise and Startup users report hallucinations in Su...

1 previous update

Jun 12, 2025
1 incident

Sign-ups/Sign-ins are not working

Downtime

Resolved Jun 13 at 01:12am PDT

It is resolved.

2 previous updates

Jun 09, 2025
1 incident

Elevenlabs voice provider not working with custom API Key

Degraded

Resolved Jun 12 at 03:20am PDT

Summary:
We experienced an issue related to API key validation within our WebSockets implementation when sending the API key more than once.

Details:
The issue arose during API key validation within our WebSockets implementation. Our system validates that the API key provided during the initial message is the same as proceeding messages.

A recent change introduced during a release caused a mismatch in how API keys were compared. Specifically, the system was comparing hashed API keys agains...

2 previous updates

Jun 01, 2025
1 incident

API was down

Downtime

Resolved Jun 01 at 11:00pm PDT

API was down due to user error in routine maintenance. Service has since been restored

1 previous update