OpenAI’s gpt-realtime Enables Production-Ready Voice Agents with End-to-End Speech Processing

OpenAI launched gpt-realtime and the Realtime API, enabling production-ready AI voice agents with end-to-end speech processing, lower latency, and natural speech delivery. New features include SIP phone support, image input, MCP server integration, and improved safeguards. Early adopters like Zillow and T-Mobile are testing real-time customer service and search use cases. By Hien Luu

2025-09-11 08:30 GMT · 10 months ago www.infoq.com

OpenAI launched gpt-realtime and the Realtime API, enabling production-ready AI voice agents with end-to-end speech processing, lower latency, and natural speech delivery. New features include SIP phone support, image input, MCP server integration, and improved safeguards. Early adopters like Zillow and T-Mobile are testing real-time customer service and search use cases. By Hien Luu

Original: https://www.infoq.com/news/2025/09/openai-gpt-realtime/?utm_campaign=infoq_content&utm_source=infoq&utm_medium=feed&utm_term=AI%2C+ML+%26+Data+Engineering