OpenAI releases realtime voice models with reasoning and translation capabilities
The company announced new voice intelligence models available in its API, featuring speech-to-text transcription, translation, and reasoning functions for conversational applications.
1 source · single source
- OpenAI has published new realtime voice models for developers to use through its API.
- The models support transcription, translation, and reasoning capabilities in voice interactions.
- The announcement does not specify version numbers, availability timelines, or performance benchmarks.
- This is a first-party product announcement from OpenAI's official news channel.
OpenAI announced new realtime voice models available through its API. The models are designed to handle multiple voice intelligence tasks including speech transcription, language translation, and reasoning over spoken input. The announcement positions these capabilities as enabling more natural conversational experiences for developers building voice applications.
The specific technical specifications, performance metrics, and availability details were not disclosed in the summary. OpenAI's official news channel serves as the primary source for product launches, though the absence of detailed benchmarks, model names, or rollout timelines limits immediate falsifiability of performance claims.
- May 20, 2026 · TechCrunch
Stability AI releases Stable Audio 3.0 with models capable of generating six-minute compositions
Trust52 - May 20, 2026 · Allen Institute / Hugging Face
Allen Institute releases OlmoEarth v1.1, a satellite imagery model that cuts inference costs threefold
Trust74 - May 19, 2026 · Google AI — Blog
Google's AI Mode search feature surpasses one billion monthly active users one year after U.S. launch
Trust67