NVIDIA PersonaPlex: One Speech Model That Listens, Talks, and Clones Any Voice

NVIDIA’s PersonaPlex is a 7B-parameter speech-to-speech model that listens and speaks simultaneously, supports real-time interruptions, and can adopt any voice identity from a short audio prompt. It outperforms both open-source and commercial systems on conversational dynamics benchmarks.
artificial-intelligence
Author

Kabui, Charles

Published

2026-02-19

Keywords

speech-to-speech, full-duplex, voice-cloning, nvidia, personaplex, conversational-ai