I Benchmarked 5 Voice AI Stacks. Only 2 Stayed Under 300ms.

I kept reading that voice AI agents respond in under 300ms. AssemblyAI says it, Vapi says it, every Realtime API launch post says it. So I built five stacks, dropped a stopwatch into each pipeline, and ran the same one-minute conversation through all of them.

Three of the five never came close.

The other two were the ones I had quietly assumed were "marketing numbers." Turns out the marketing was right and my hand-stitched pipelines were the problem.
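For anyone who wants to repeat the exercise, "dropping a stopwatch into a pipeline" can be sketched roughly as below. This is a minimal illustration, not the exact harness behind these numbers. The `Stopwatch` class, the `run_turn` helper, and the three stage names (`stt`, `llm`, `tts`) are all hypothetical, and it assumes each stage is a plain blocking call. A streaming stack would time the first audio byte instead.

```python
import time
from contextlib import contextmanager


class Stopwatch:
    """Collects wall-clock durations (in ms) for named pipeline stages."""

    def __init__(self, clock=time.perf_counter):
        # The clock is injectable so the timing logic can be checked with fake time.
        self._clock = clock
        self.stages = {}

    @contextmanager
    def stage(self, name):
        start = self._clock()
        try:
            yield
        finally:
            # Record the elapsed time even if the stage raises.
            self.stages[name] = (self._clock() - start) * 1000.0

    def total_ms(self):
        return sum(self.stages.values())


def run_turn(watch, stt, llm, tts, audio):
    """Run one conversational turn through three hypothetical blocking stages."""
    with watch.stage("stt"):
        text = stt(audio)
    with watch.stage("llm"):
        reply = llm(text)
    with watch.stage("tts"):
        speech = tts(reply)
    return speech
```

After a turn, `watch.stages` holds one duration per stage and `watch.total_ms()` gives their sum, which is the number to compare against the 300ms claim.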

The three cliffs nobody puts on the slide

Before the numbers, the perception model. Voice latency does not degrade smoothly. It falls off cliffs. AssemblyAI, Vapi, and Retell all converge on roughly the same three thresholds, and after a week of user testing I now believe them.
