Home / Technology / OpenAI's New AI Talks, Translates, Transcribes in Real-Time
OpenAI's New AI Talks, Translates, Transcribes in Real-Time
8 May
Summary
- New AI models enable real-time conversation, translation, and transcription.
- GPT-Realtime-2 uses GPT-5 class reasoning for complex user requests.
- Features support over 70 input languages and 13 output languages for translation.

OpenAI has unveiled a suite of new voice intelligence features integrated into its Realtime API, designed to equip developers with tools for creating conversational applications. These innovations allow apps to listen, reason, translate, transcribe, and act upon user interactions as conversations unfold.
The GPT‑Realtime‑2 model represents a significant advancement, built with GPT‑5-class reasoning to handle more complex user requests and simulate realistic vocal interactions. This builds upon previous iterations with enhanced conversational capabilities.
Complementing this is GPT‑Realtime‑Translate, a real-time translation service capable of conversing fluidly. It supports an extensive range of over 70 input languages and 13 output languages, facilitating global communication.
Furthermore, the new GPT‑Realtime‑Whisper feature introduces live speech-to-text transcription. This allows for immediate conversion of spoken words into text as interactions occur, enhancing accessibility and data capture.
OpenAI states these models aim to advance voice interfaces beyond simple responses, enabling them to perform tasks dynamically. Potential applications span customer service, education, media, events, and creator platforms, empowering businesses and creators alike.
To mitigate potential misuse, OpenAI has implemented guardrails against spam, fraud, and online abuse. The system includes triggers to halt conversations that violate harmful content guidelines, ensuring responsible deployment of the technology.