Home / Technology / Voice AI Breakthrough: Goodbye Latency, Hello Emotion!
Voice AI Breakthrough: Goodbye Latency, Hello Emotion!
23 Jan
Summary
- Voice AI models now achieve sub-120ms latency, eliminating awkward pauses.
- Full-duplex models handle interruptions and backchanneling like humans.
- AI now has emotional intelligence, moving beyond flat text responses.

The past week has seen a dramatic shift in voice AI capabilities, with companies like Nvidia, Inworld, and FlashLabs releasing models that solve long-standing challenges.
Latency in voice AI has been virtually eliminated, with new models achieving speeds under 120ms, faster than human perception. This resolves the 'thinking pause' and enables viseme-level synchronization for digital avatars.
Traditional voice bots, previously half-duplex like walkie-talkies, can now handle interruptions. Nvidia's PersonaPlex, for instance, uses a dual-stream design allowing it to listen and speak concurrently and understand human backchanneling.




