IBM and Deepgram team up for voice AI in enterprise automation

March 05, 2026

IBM and Deepgram have joined forces to enhance voice capabilities within enterprise AI workflows. Under the collaboration, Deepgram’s speech-to-text and text-to-speech technology will be integrated into IBM’s watsonx Orchestrate platform, making Deepgram IBM’s first dedicated voice partner. The partnership underscores the growing importance of voice as a core interface in enterprise automation. It also shows how major vendors are combining specialized technologies to address practical challenges such as latency, accuracy, and multilingual support at scale.

IBM plans to incorporate Deepgram’s transcription, real-time captioning, and speech synthesis capabilities directly into watsonx Orchestrate, its generative AI solution for building and managing digital agents and workflows. The goal is to give enterprise clients a more intuitive way to interact with AI systems: through spoken language, not only text.

The integration targets demanding scenarios where accuracy and performance are critical, such as automated customer service, call analytics, and voice-driven data entry. By using Deepgram’s technology, IBM aims to handle challenging audio environments, diverse accents, and natural conversational speech more reliably than conventional speech recognition systems.

Language coverage is another key focus. The combined solution supports a wide range of languages and dialects, including numerous Arabic and Indian language variants, and offers synthesized voices that reflect regional accents. Enterprises can also apply custom tuning and use real-time captioning, which is increasingly important for accessibility and compliance.

Scott Stephenson, CEO and Co-Founder of Deepgram, framed the partnership in terms of what enterprise deployments demand: “Voice is rapidly becoming the default interface between humans and technology, and enterprise deployments require a real-time platform that is accurate, low latency, and reliable at scale.” With Deepgram embedded in watsonx Orchestrate Agent Builder, IBM clients can build voice agents and voice-enabled workflows on a real-time foundation that has been refined over more than a decade.

From IBM’s perspective, the collaboration strengthens its open ecosystem strategy around watsonx and gives partners and customers more choice when adopting conversational AI. Nick Holda, Vice President of AI Technology Partnerships at IBM, noted, “Our watsonx Orchestrate integration powered by Deepgram APIs introduces new speech recognition and transcription capabilities to IBM clients, refining and modernizing their operations.” The partnership is also intended to help enterprise organizations accelerate their AI initiatives.

For Deepgram, being selected as IBM’s first dedicated voice partner provides access to a broad enterprise customer base and reinforces its positioning as a real-time, enterprise-grade voice AI platform. As voice interfaces move from experimental features to essential components, partnerships like this suggest that enterprises now treat speech as a primary input for AI systems rather than an optional add-on.
