Skip to Content Facebook Feature Image

Deepgram Launches Voice Agent API: World’s Only Enterprise-Ready, Real-Time, and Cost-Effective Conversational AI API

News

Deepgram Launches Voice Agent API: World’s Only Enterprise-Ready, Real-Time, and Cost-Effective Conversational AI API
News

News

Deepgram Launches Voice Agent API: World’s Only Enterprise-Ready, Real-Time, and Cost-Effective Conversational AI API

2025-06-16 20:31 Last Updated At:20:51

SAN FRANCISCO--(BUSINESS WIRE)--Jun 16, 2025--

Deepgram, the leading voice AI platform for enterprise use cases, today announced the general availability (GA) of its Voice Agent API, a single, unified voice-to-voice interface that gives developers full control to build context-aware voice agents that power natural, responsive conversations. Combining speech-to-text, text-to-speech, and large language model (LLM) orchestration with contextualized conversational logic into a unified architecture, the Voice Agent API gives developers the choice of using Deepgram’s fully integrated stack (leveraging industry-leading Nova-3 STT and Aura-2 TTS models) or bringing their own LLM and TTS models. It delivers the simplicity developers love and the controllability enterprises need to deploy real-time, intelligent voice agents at scale. Today, companies like Aircall, Jack in the Box, StreamIt, and OpenPhone are building voice agents with Deepgram to save costs, reduce wait times, and increase customer loyalty.

This press release features multimedia. View the full release here: https://www.businesswire.com/news/home/20250616867923/en/

In today’s market, teams building voice agents are often forced to choose between two extremes: rigid, low-code platforms that lack customization, or DIY toolchains that require stitching together STT, TTS, and LLMs with significant engineering effort. Deepgram’s Voice Agent API eliminates this tradeoff by providing a unified API that simplifies development without sacrificing control. Developers can build faster with less complexity, while enterprises retain full control over orchestration, deployment, and model behavior, without compromising on performance or reliability.

“The future of customer engagement is voice-first,” said Scott Stephenson, CEO of Deepgram. “But most voice systems today are rigid, fragmented, or too slow. With our Voice Agent API, we’re giving developers a powerful yet simple interface to build conversational agents that feel natural, respond instantly, and scale across use cases without compromise.”

“We believe the future of customer communication is intelligent, seamless, and deeply human—and that’s the vision behind Aircall’s AI Voice Agent,” said Scott Chancellor, Chief Executive Officer of Aircall. “To bring it to life, we needed a partner who could match our ambition, and Deepgram delivered. Their advanced Voice Agent API enabled us to build fast without compromising accuracy or reliability. From managing mid-sentence interruptions to enabling natural, human-like conversations, their service performed with precision. Just as importantly, their collaborative approach helped us iterate quickly and push the boundaries of what voice intelligence can deliver in modern business communications.”

“We believe that integrating AI voice agents will be one of the most impactful initiatives for our business operations over the next five years, driving unparalleled efficiency and elevating the quality of our service,” said Doug Cook, CTO of Jack in the Box. “Deepgram is a leader in the industry and will be a strategic partner as we embark on this transformative journey.”

Developer Simplicity and Faster Time to Market

For teams taking the DIY route, the challenge isn’t just connecting models but also building and operating the entire runtime layer that makes real-time conversations work. Teams must manage live audio streaming, accurately detect when a user has finished speaking, coordinate model responses, handle mid-sentence interruptions, and maintain a natural conversational cadence. While some platforms offer partial orchestration features, most APIs do not provide a fully integrated runtime. As a result, developers are often left to manage streaming, session state, and coordination logic across fragmented services, which adds complexity and delays time to production.

Deepgram’s Voice Agent API removes this burden by providing a single, unified API that integrates speech-to-text, LLM reasoning, and text-to-speech with built-in support for real-time conversational dynamics. Capabilities such as barge-in handling and turn-taking prediction are model-driven and managed natively within the platform. This eliminates the need to stitch together multiple vendors or maintain custom orchestration, enabling faster prototyping, reduced complexity, and more time focused on building high-quality experiences.

In addition to the Voice Agent API, organizations seeking broader integrations can leverage Deepgram’s extensive partner ecosystem, including Kore.ai, OneReach.ai, Twilio and others, to access comprehensive conversational AI solutions and services powered by Deepgram APIs.

Maximum Control and Flexibility

While the Voice Agent API streamlines development, it also gives teams deep control over performance, behavior, and scalability in production. Built on Deepgram’s Enterprise Runtime and full model ownership across the entire voice AI stack, the platform enables model-level optimization at every layer of the interaction loop. This allows for precise tuning of latency, barge-in handling, turn-taking, and domain-specific behavior in ways not possible with disconnected components.

Key capabilities include:

“Deepgram gives us the flexibility to bring our own models, voices, and customize behavior while controlling how we build and orchestrate our voice agents,” said Harshal Jethwa, Engineering Manager at OpenPhone. “Their system seamlessly handles the complexity of real-time voice coordination, letting us focus on creating exactly the experience we want.”

This tightly coordinated design translates directly into measurable performance gains. In recent benchmark testing using the Voice Agent Quality Index (VAQI), Deepgram achieved the highest overall score among all evaluated providers (see Figure 1). VAQI is a composite benchmark that measures the core elements of voice agent quality: latency (how quickly the agent responds), interruption rate (how often it cuts users off), and response coverage (how often it misses valid input).

Deepgram outperformed OpenAI by 6.4% and ElevenLabs by 29.3%, reflecting the advantage of its integrated architecture and model-driven turn-taking. The result is smooth, responsive conversations without missed inputs, premature responses, or unnatural delays.

Cost-Effectiveness at Scale

In addition to control and performance, the Voice Agent API is built for cost efficiency across large-scale deployments. When teams run entirely on Deepgram’s vertically integrated stack, pricing is fully consolidated at a flat rate of $4.50 per hour (see Figure 2). This provides predictable, all-in-one billing that simplifies planning and scales with usage. Deepgram’s vertically integrated runtime also delivers unmatched compute efficiency, optimizing every stage of the speech pipeline to minimize infrastructure costs while maintaining real-time responsiveness.

For teams that bring their own LLM or TTS models, Deepgram offers built-in rate reductions, enabling even lower total cost of ownership for production-scale deployments.

“Deepgram’s Voice Agent API stands out for its technical prowess, affordability, and flexibility, making it the smart bet for customer service voice AI,” said Bill French, Senior Solutions Engineer at StreamIt.

Start Building with the Voice Agent API

Experience how fast and flexible voice agents can be with Deepgram’s unified voice-to-voice API. Explore the API in our interactive playground, review documentation, or integrate in minutes using our SDK. New users receive $200 in free credits, enough to process over 40 hours of real-time voice agent usage. Start building natural, responsive conversations with infrastructure built for real-time performance and enterprise-scale.

Additional Resources:

About Deepgram

Deepgram is the leading voice AI platform for enterprise use cases, offering speech-to-text (STT), text-to-speech (TTS), and full speech-to-speech (STS) capabilities–all powered by our enterprise-grade runtime. 200,000+ developers build with Deepgram’s voice-native foundational models – accessed through cloud APIs or as self-hosted / on-premises APIs – due to our unmatched accuracy, low latency, and pricing. Customers include technology ISVs building voice products or platforms, co-sell partners working with large enterprises, and enterprises solving internal use cases. Having processed over 50,000 years of audio and transcribed over 1 trillion words, there is no organization in the world that understands voice better than Deepgram. To learn more, visit www.deepgram.com, read our developer docs, or follow @DeepgramAI on X and LinkedIn.

Figure 1

Figure 1

DUBAI, United Arab Emirates (AP) — Iran responded to U.S. President Donald Trump’s address to Americans on the war with new missile attacks targeting Israel and the Gulf Arab states Thursday, underlining Tehran’s insistence that it rejected Washington’s outreach for a ceasefire while maintaining its grip on the Strait of Hormuz.

Britain planned to hold a call Thursday with nearly three dozen countries about how to reopen the strait, through which 20% of all oil and natural gas traded passes in peacetime. The 35 countries, including all G7 industrialized democracies except the U.S., as well as the United Arab Emirates and Bahrain, signed a declaration last month demanding Iran stop blocking the strait. The call will discuss “diplomatic and political measures” that could restore shipping once the fighting is over.

Washington has insisted that Iran allow ships to freely transit the strait, but Trump this week has said it is not up to the U.S. to force it, and in his address encouraged countries that receive oil through Hormuz to “build some delayed courage” and go “take it.”

In his address, Trump said the U.S. would hit Iran “extremely hard over the next two to three weeks,” while also insisting American “core strategic objectives are nearing completion.”

Iran's military said defiantly on Thursday that its armament facilities are hidden and will never be reached by Israeli or American attacks.

“The centers you think you have targeted are insignificant,” said Lt. Col. Ebrahim Zolfaghari, a spokesman for the Iranian military’s Khatam Al-Anbiya Central Headquarters.

Just before Trump began his nearly 20-minute address on Wednesday, explosions were heard in Dubai as air defenses worked to intercept an Iranian missile barrage. Less than a half hour after the president was done, Israel said its military was working to intercept incoming missiles.

Sirens sounded in Bahrain, home to the U.S. Navy's 5th Fleet, immediately after the speech.

Following a joint statement in March condemning Iranian attacks on unarmed commercial vessels that called upon Iran to “cease immediately its threats, laying of mines, drone and missile attacks and other attempts to block the strait,” the 35 signatories were to hold a virtual meeting Thursday hosted by British Foreign Secretary Yvette Cooper.

Though the oil and gas that typically transits the Strait of Hormuz primarily is sold to Asian nations, Japan and South Korea were the only two countries from the region that were joining.

“Trump’s message was that the United States can sustain its own economic and energy ecosystem, while countries dependent on regional exports will either have to buy from the United States or manage the Strait themselves,” the New York-based Soufan Center think tank wrote after the address.

“While Trump explicitly thanked U.S. allies in the Persian Gulf for their cooperation and allyship, an expedited U.S. withdrawal without securing the strait will leave many of these countries, whose economies are dependent on energy exports, in the lurch.”

No country appears willing to try and open the strait by force while the war is raging. British Prime Minister Keir Starmer said the group “will assess all viable diplomatic and political measures we can take to restore freedom of navigation, guarantee the safety of trapped ships and seafarers and to resume the movement of vital commodities.”

Bahrain, which now holds the presidency of the United Nations Security Council, has been working to get the world body to address the crisis as well.

Though Iran has allowed a trickle of ships through the strait, it remains largely closed. Iran has also been repeatedly attacking Gulf Arab energy infrastructure, sending oil prices skyrocketing and giving rise to broader economic problems worldwide.

Following Trump's speech, Brent crude, the international standard, rose again and was at $108 in early spot trading, up nearly 50% from Feb. 28 when Israel and the U.S. started the war with their attacks on Iran.

The rising energy prices and stock market jitters have been putting increasing domestic pressure on Trump, who used his address to offer a defense of the war while also suggesting it was close to winding down.

He acknowledged American service members who had been killed and said: “We are going to finish the job, and we’re going to finish it very fast. We’re getting very close.”

The U.S. has presented Iran with a 15-point plan for a ceasefire, but Trump didn’t say anything about the diplomatic efforts or bring up his April 6 deadline for Iran to reopen the Strait of Hormuz or face severe retaliation from the U.S.

More than 1,900 people have been killed in Iran during the war, while 19 have been reported dead in Israel. More than two dozen people have died in Gulf states and the occupied West Bank, while 13 U.S. service members have been killed.

More than 1,200 people have been killed in Lebanon and more than 1 million displaced, according to authorities. Ten Israeli soldiers have also died there.

Weissert reported from Washington and Rising reported from Bangkok.

The Indian flagged LPG carrier Jag Vasant transporting liquefied petroleum gas, is seen at the Mumbai Port in Mumbai, India, after it arrived clearing the Strait of Hormuz, Wednesday, April 1, 2026. (AP Photo/Rafiq Maqbool)

The Indian flagged LPG carrier Jag Vasant transporting liquefied petroleum gas, is seen at the Mumbai Port in Mumbai, India, after it arrived clearing the Strait of Hormuz, Wednesday, April 1, 2026. (AP Photo/Rafiq Maqbool)

President Donald Trump speaks about the Iran war from the Cross Hall of the White House on Wednesday, April 1, 2026, in Washington. (AP Photo/Alex Brandon, Pool)

President Donald Trump speaks about the Iran war from the Cross Hall of the White House on Wednesday, April 1, 2026, in Washington. (AP Photo/Alex Brandon, Pool)

President Donald Trump walks from the Blue Room to speak about the Iran war from the Cross Hall of the White House on Wednesday, April 1, 2026, in Washington. (AP Photo/Alex Brandon, Pool)

President Donald Trump walks from the Blue Room to speak about the Iran war from the Cross Hall of the White House on Wednesday, April 1, 2026, in Washington. (AP Photo/Alex Brandon, Pool)

President Donald Trump speaks about the Iran war from the Cross Hall of the White House on Wednesday, April 1, 2026, in Washington. (AP Photo/Alex Brandon, Pool)

President Donald Trump speaks about the Iran war from the Cross Hall of the White House on Wednesday, April 1, 2026, in Washington. (AP Photo/Alex Brandon, Pool)

President Donald Trump speaks about the Iran war from the Cross Hall of the White House on Wednesday, April 1, 2026, in Washington. (AP Photo/Alex Brandon, Pool)

President Donald Trump speaks about the Iran war from the Cross Hall of the White House on Wednesday, April 1, 2026, in Washington. (AP Photo/Alex Brandon, Pool)

Recommended Articles