Gen-AI-Today

GenAI TODAY NEWS

Free eNews Subscription

Twilio Announces Enhanced Partnership with OpenAI to Improve Speech-to-Speech

By Tracey E. Schelmetic

In the past, customers have found that speaking with AI-driven chatbots has felt very much less like “chat” and more like “bot.” Responses, even if they were correct, rarely sounded particularly natural.

Now. technology has grown to ideally fix this problem; namely, through speech-to-speech. This is an emerging solution that allows for voice conversations by AI virtual agents to feel much more like real human dialogue.

In this vein, many contact center companies are turning to OpenAI’s Realtime API to reduce latency and improve key components like conversation pacing, interruption handling, tone and balance between speaking and listening – all critical user experience elements that make conversation with a virtual agent more human-like.

And so, customer engagement solutions provider Twilio recently announced an integration with OpenAI to bring the latter company’s new Realtime API to the Twilio platform. The integration of streaming speech-to-speech capabilities, which are part of the Realtime API, will enable 300,000+ Twilio customers and more than 10 million developers to build conversational AI virtual agents leveraging OpenAI’s flagship multilingual and multimodal GPT-4o model. The new integration builds on existing OpenAI and Twilio product integrations announced last year to bring the power of large language models (LLMs) to the customer engagement platform.

In the announcement, the companies noted that the combined technology "is especially relevant for customer service and sales, delivering both operational efficiency and exceptional customer outcomes." Speech-to-speech is also set to support social impact at scale, empowering nonprofit and public sector organizations to deploy novel use cases like voice translation in real time between constituents and staff members who speak different languages.

“Integrating OpenAI’s Realtime API with Twilio’s platform enables businesses to offer more natural, real-time AI voice interactions at scale,” said Inbal Shani, Chief Product Officer, Twilio Communications. “Businesses can use this to create voice experiences that feel more human and can reduce operational costs and drive higher customer satisfaction.”




Edited by Alex Passett
Get stories like this delivered straight to your inbox. [Free eNews Subscription]

GenAIToday Contributor

SHARE THIS ARTICLE
Related Articles

VoIP Provider Zadarma Integrates Three AI Voice Agents into its PBX Platform

By: Erik Linask    6/11/2025

London-based VoIP provider Zadarma integrated three AI-powered voice assistants directly into its PBX platform, a first in Europe, according to the co…

Read More

The Future of CX: Mosaicx Unveils AI-Native Engage Platform

By: Erik Linask    6/6/2025

Mosaicx has launched Engage, its next-gen AI-native CX platform to drive improvements in customer engagement and experiences.

Read More

Jabra Reviving Human Focus Amid AI Revolution in Customer Experience

By: Erik Linask    5/27/2025

Jabra looks to redefine how customer service teams make good on the promise of quality CX by combining the "what" of customer conversations, with "how…

Read More

When AI Ambitions are Dictated by Cloud Matters

By: Special Guest    5/27/2025

How are increasing AI workloads changing what we know about and how we design cloud architectures?

Read More

Rising AI-Driven Infrastructure Costs Expose Critical Weaknesses: NVMe SSDs & CXL Modules Redefine Scalability

By: Special Guest    5/7/2025

AI workloads are too demanding for their existing IT architecture. GPUs remain under-utilized, not because of faulty hardware, but because data can't …

Read More

-->