Back to Blog

OpenAI Realtime API GA and new `gpt-realtime` model, 20% cheaper than 4o

OpenAI Realtime API GA and new `gpt-realtime` model, 20% cheaper than 4o

OpenAI launched the gpt-realtime model and Realtime API to GA, featuring advanced speech-to-speech capabilities, new voices (Cedar, Marin), image input, SIP telephony, and a ~20% price cut. Benchmarks show improvements over gpt-4o-realtime on BigBench and ComplexFuncBench. xAI introduced Grok Code Fast 1, a speed-optimized coding model integrated with popular IDEs, while OpenAI Codex received major upgrades for local and cloud development workflows. Google’s Gemini CLI improved multi-editor support, and new models like Microsoft MAI-1-preview and MAI-Voice-1 were announced. *"The new all-in-one WebRTC API removes the ephemeral token step and supports video on the same connection,"* highlighting enhanced developer tooling.

Read original post

Turn insight into implementation

Want help turning this idea into a production system?

xAGI Labs helps teams scope, build, and deploy AI products, agent workflows, voice systems, and enterprise rollouts.

If this topic is relevant to your roadmap, we can translate "OpenAI Realtime API GA and new `gpt-realtime` model, 20% cheaper than 4o" into a concrete build plan and launch path.