Moonshine Voice: Open-Source On-Device Speech Recognition Toolkit for Edge Applications
By
petewarden
Hand-rolled, kettle-boiled, baked to perfection. Worth every minute at the bakery.
Summary
Moonshine Voice is an open-source AI toolkit for developers building real-time voice applications that runs entirely on-device, offering fast, private speech recognition without requiring accounts, credit cards, or API keys. The framework is optimized for live streaming with low latency, and its speech-to-text models are based on proprietary research, claiming higher accuracy than Whisper Large V3 while being compact enough for edge devices (down to 26MB models).
Key quotes
· 4 pulledMoonshine Voice is an open source AI toolkit for developers building real-time voice applications.
Everything runs on-device, so it's fast, private, and you don't need an account, credit card, or API keys.
The framework and models are optimized for live streaming applications, offering low latency responses by doing a lot of the work while the user is still talking.
All speech to text models are based on our cutting edge research and trained from scratch, so we can offer higher accuracy than Whisper Large V3 at the top end, down to tiny 26MB models.
You might also wanna read
Voice Anywhere: AI Speech-to-Text App with Floating Microphone Interface
Voice Anywhere is an AI-powered speech-to-text application that features a floating microphone interface that stays above all windows, allow
Whispering: An Open-Source, Local-First Transcription App for Privacy-Conscious Users
Whispering is an open-source, local-first transcription app that prioritizes privacy by keeping audio data on-device. It supports both local
Vogent Voicelab: Platform for Optimized Open-Source Voice Model Inference
Vogent Voicelab is a platform that optimizes and post-trains top open-source voice models like Sesame's CSM-1B, Dia, and Chatterbox to gener
VibeSonic: On-Device AI Voice Dictation Software for Mac with Privacy Focus
VibeSonic is an AI voice dictation software for Mac that runs entirely on-device without cloud dependency or subscriptions. It uses on-devic
Stet: Open-source macOS voice dictation app with local processing and AI refinement
A minimalist, open-source voice input app for macOS that processes speech locally, uses AI to refine dictation, and preserves the user's nat
MiniCPM 4.0: Ultra-Efficient Open-Source AI Models for On-Device Deployment
MiniCPM 4.0 is a family of ultra-efficient, open-source AI models designed for on-device deployment, offering significant speed improvements
