Jun 2025 to Aug 2025

August 2025

Aug 28, 2025

1 incident

Deepgram Aura-2 TTS Performance Degradation

Resolved Aug 28, 2025 at 7:04pm UTC

Deepgram is investigating an issue where a subset of requests may return elevated rates of 5XX errors or experience significantly higher time to first byte

Aug 27, 2025

1 incident

Aura-2 TTS Performance Degradation

Resolved

Resolved Aug 27, 2025 at 9:00pm UTC

This incident has been resolved.

1 previous update

Aug 25, 2025

1 incident

Intermittent issues with the dashboard loading

Resolved

Resolved Aug 26, 2025 at 12:04am UTC

The issue was pinpointed and reverted quickly

2 previous updates

Aug 05, 2025

1 incident

Elevated error rates on ElevenLabs Voice requests

Resolved

Resolved Aug 6, 2025 at 4:14am UTC

Elevenlabs released a fix and is fully operational now. We are also seeing normal levels, but will continue to monitor.

For impacted users, we recommend implementing Vapi Fallback Plan to automatically failover in the future
https://docs.vapi.ai/voice-fallback-plan

1 previous update

Aug 04, 2025

1 incident

API and calls were momentarily disrupted

Resolved

Resolved Aug 5, 2025 at 7:12pm UTC

IR August 4th: Call Degradation due to Pod Evictions

TL;DR

On August 4th, an incident occurred due to aggressive pod consolidation by Karpenter, which caused Redis pods to be evicted and restarted. This led to API pod failures, triggering a failover to an outdated networking component, resulting in dropped calls. The incident caused a total of 393 calls to be dropped.

Timeline (PST)

August 4th

11:02-11:27 AM - Core team identifies Karpenter pods in CrashLoopBackOff (OOMKi...

1 previous update

July 2025

Jul 28, 2025

1 incident

Deepgram is experiencing connection issues

Resolved

Resolved Jul 29, 2025 at 5:19am UTC

We are no longer seeing signs of connection issues. The issue should be resolved now, but we will continue to monitor. We apologize for any inconvenience caused.

1 previous update

Jul 25, 2025

1 incident

Incident Report: Increased call failures on July 25 (PST)

Resolved

Resolved Jul 25, 2025 at 9:46pm UTC

Incident Report:
Increased call failures on July 25 (PST)

Summary (TL;DR)
On July 25 between 7:00–7:15am PST, a spike in call volume caused some calls to fail with a worker-not-available error. The fallback service for short calls (our serverless workers) could not start because its image architecture didn’t match the configured runtime (image built for ARM64, runtime set to x86). We stabilized the platform by scaling primary workers and then corrected the configuration. Service is o...

Jul 24, 2025

1 incident

Elevenlabs is experiencing increased Text-to-Speech latency

Resolved

Resolved Jul 24, 2025 at 6:30pm UTC

Elevenlabs released a hotfix and is fully operational now.

1 previous update

Jul 17, 2025

1 incident

Web Calls from Assistant Page are unavailable

Resolved

Resolved Jul 17, 2025 at 5:50pm UTC

Dashboard calls should be back to normal.

1 previous update

Jul 15, 2025

1 incident

Deepgram transcription is degraded

Resolved

Resolved Jul 17, 2025 at 5:34pm UTC

This was resolved

1 previous update

Jul 09, 2025

1 incident

Up to date calls not returned in API or dashboard

Resolved

Resolved Jul 10, 2025 at 4:24am UTC

We have identified and resolved the issue. Apologies for the disruption.

1 previous update

Jul 06, 2025

1 incident

Playht voices are degraded

Resolved

Resolved Jul 8, 2025 at 6:18am UTC

Resolved now.

1 previous update

June 2025

Jun 25, 2025

2 incidents

SIP is degraded

Resolved

Resolved Jul 1, 2025 at 4:43am UTC

TLDR: A temporary slowdown caused by saturation in our API gateway layer increased response times until they exceeded the edge-network timeout, causing a 524 HTTP response for some API requests.

Timeline in PST
01:00 AM First elevated 524 error responses detected
06:35 AM Rolled back recent backend release (no improvement)
07:19 AM Rolled back related network changes (no improvement).
08:22 AM Scaled up API gateway
09:36 AM Scaled up API gateway further
10:00 AM Reverted the previous night's...

4 previous updates

DB High Latency

Resolved

Resolved Jun 25, 2025 at 6:54pm UTC

The issue has been resolved: https://neonstatus.com/aws-us-west-oregon/incidents/01JYM23FB7HR82VPZR9DBKVPP8#01JYM4F8Y8MZARWJWRFKV01AP5.

1 previous update

Jun 20, 2025

1 incident

Increased database latency causing requests to fail

Resolved

Resolved Jun 20, 2025 at 8:43pm UTC

Our database provider has reported this issue as resolved from their end

1 previous update

Jun 17, 2025

1 incident

Breaking changes to Success Evaluation API Response

Resolved

Resolved Jun 19, 2025 at 2:42am UTC

TL;DR

In response to hallucinations reports in the Success Evaluation feature, we updated our integration with Gemini LLM to use Structured Output. This inadvertently changed the type of the call.analysis.successEvaluation field from string | null to string | number | boolean | null, introducing a breaking change for customers with strict type validation and those using Vapi Server SDKs.

Timeline (all in PT)

June 12, 11:32pm: Enterprise and Startup users report hallucinations in Su...

1 previous update

Jun 12, 2025

1 incident

Sign-ups/Sign-ins are not working

Resolved

Resolved Jun 13, 2025 at 8:12am UTC

It is resolved.

2 previous updates

Jun 09, 2025

1 incident

Elevenlabs voice provider not working with custom API Key

Resolved

Resolved Jun 12, 2025 at 10:20am UTC

Summary:
We experienced an issue related to API key validation within our WebSockets implementation when sending the API key more than once.

Details:
The issue arose during API key validation within our WebSockets implementation. Our system validates that the API key provided during the initial message is the same as proceeding messages.

A recent change introduced during a release caused a mismatch in how API keys were compared. Specifically, the system was comparing hashed API keys agains...

2 previous updates

Jun 01, 2025

1 incident

API was down

Resolved

Resolved Jun 2, 2025 at 6:00am UTC

API was down due to user error in routine maintenance. Service has since been restored

1 previous update

Previous incidents

IR August 4th: Call Degradation due to Pod Evictions

TL;DR

Timeline (PST)

August 4th

TL;DR

Timeline (all in PT)