OpenAI Releases an Advanced Speech-to-Speech Model and New Realtime API Capabilities including MCP Server Support, Image Input, and SIP Phone Calling Support

OpenAI has officially launched Realtime API and gpt-realtime, its most advanced speech-to-speech model, moving the Realtime API out of beta with a suite of enterprise-focused features. While the announcement marks real progress in voice AI technology, a closer examination reveals both meaningful improvements and persistent challenges that temper any revolutionary claims. Technical Architecture and Performance […] The post OpenAI Releases an Advanced Speech-to-Speech Model and New Realtime API Capabilities including MCP Server Support, Image Input, and SIP Phone Calling Support appeared first on MarkTechPost.

2025-08-29 12:30 GMT · 7 months ago www.marktechpost.com

OpenAI has officially launched Realtime API and gpt-realtime, its most advanced speech-to-speech model, moving the Realtime API out of beta with a suite of enterprise-focused features. While the announcement marks real progress in voice AI technology, a closer examination reveals both meaningful improvements and persistent challenges that temper any revolutionary claims. Technical Architecture and Performance […] The post OpenAI Releases an Advanced Speech-to-Speech Model and New Realtime API Capabilities including MCP Server Support, Image Input, and SIP Phone Calling Support appeared first on MarkTechPost.

Original: https://www.marktechpost.com/2025/08/29/openai-releases-an-advanced-speech-to-speech-model-and-new-realtime-api-capabilities-including-mcp-server-support-image-input-and-sip-phone-calling-support/