pull down to refresh

OpenAI has released the Realtime API, which is now open to all developers. This new feature enables developers to build voice-driven agents with support for MCP, image uploads and SIP-based calls. The API also introduces two new voices to facilitate more natural interactions.
Alongside this, OpenAI unveiled GPT-Realtime, its most advanced speech model to date. According to the company, it follows developer instructions more accurately and handles complex tasks with greater reliability.
Developers can try GPT-Realtime here (payment method required): https://platform.openai.com/audio/realtime
How much does this cost?
reply