Back to Blog

OpenAI Realtime API GA and new `gpt-realtime` model, 20% cheaper than 4o

OpenAI launched the gpt-realtime model and Realtime API to GA, featuring advanced speech-to-speech capabilities, new voices (Cedar, Marin), image input, SIP telephony, and a ~20% price cut. Benchmarks show improvements over gpt-4o-realtime on BigBench and ComplexFuncBench. xAI introduced Grok Code Fast 1, a speed-optimized coding model integrated with popular IDEs, while OpenAI Codex received major upgrades for local and cloud development workflows. Google’s Gemini CLI improved multi-editor support, and new models like Microsoft MAI-1-preview and MAI-Voice-1 were announced. *"The new all-in-one WebRTC API removes the ephemeral token step and supports video on the same connection,"* highlighting enhanced developer tooling.

Read original post