All Topics
All Topics
Technology
Technology
Design
Design
Programming
Programming
Science
Science
News
News
Gaming
Gaming
Entertainment
Entertainment
Business
Business
Finance
Finance
Sports
Sports
Health
Health
Food
Food
Travel
Travel
Art
Art
Music
Music
Books
Books
Education
Education
Politics
Politics
Personal
Personal
No algorithm. No AI slop. No ads. Just RSS. Pro-human. Indie writers. Real journalism. Open web. Chronological. Hand toasted.

NVIDIA PersonaPlex 7B Enables Real-Time Speech-to-Speech on Apple Silicon via Swift/MLX Library

By

ipotapov

2mo ago· 8 min readenNews

Summary

The article announces the integration of NVIDIA's PersonaPlex 7B model into a Swift/MLX speech library for Apple Silicon, enabling full-duplex speech-to-speech capabilities. The system allows real-time conversation with a laptop where audio input and output happen simultaneously through a single model, eliminating the traditional three-step pipeline of transcription, processing, and synthesis. The implementation achieves faster-than-real-time performance with 68ms per step and a real-time factor of 0.87, running natively on Apple Silicon hardware.

Key quotes

· 3 pulled
What if you could talk to your laptop and it talked back — not through a three-step pipeline of transcribe-think-synthesize, but as a single model that listens and speaks at the same time, faster than real-time, streaming audio chunks back as it generates them?
Our speech-swift Swift/MLX speech library now handles full-duplex speech-to-speech with streaming via NVIDIA's PersonaPlex 7B — faster than real-time (~68ms/step, RTF 0.87), alongside ASR, TTS, and multilingual synthesis.
Audio in, audio out, native Swift
Snippet from the RSS feed
NVIDIA PersonaPlex 7B on Apple Silicon: Full-Duplex Speech-to-Speech in Native Swift with MLX What if you could talk to your laptop and it talked back — not through a three-step pipeline of …

You might also wanna read