OWhisper: A Customizable Speech-to-Text Tool for Local and Cloud Deployment
By
yujonglee
A bagel you'd recommend to a friend without hedging.
Summary
OWhisper is a tool similar to Ollama but designed for Speech-to-Text (STT) applications, both real-time and batch. It originated from user feedback during the development of Hyprnote, where users wanted the ability to integrate custom STT endpoints, akin to openai-compatible LLM endpoints. OWhisper serves two primary use cases: locally hosting lightweight models for prototyping or personal use, and deploying larger or cloud-hosted models on custom infrastructure. The tool offers CLI for the first use case and Proxy for the second.
Key quotes
· 3 pulledOWhisper can be thought of as something like Ollama but for Speech-to-Text (both real-time and batch).
This came from our experience while building Hyprnote, where users consistently requested bringing a custom STT endpoint, just like plugging in an openai-compatible LLM endpoint.
OWhisper is intended for 2 use cases: Quickly serving a lightweight model locally for prototyping or personal use. Deploying larger models or connecting to other cloud-hosted models on your own infrastructure.
You might also wanna read
Whispering: An Open-Source, Local-First Transcription App for Privacy-Conscious Users
Whispering is an open-source, local-first transcription app that prioritizes privacy by keeping audio data on-device. It supports both local
Stenox: macOS Voice Dictation Tool with Local and Cloud Transcription Options
Stenox is a macOS voice dictation tool that enables transcription across all apps and browsers. It offers multiple transcription options inc
OpenWispr: A Local Open-Source AI Speech-to-Text Model
OpenWispr is an open-source AI speech-to-text model that operates entirely locally, offering 3-5x faster transcription than typing. It is de
Silkwave: Unified AI Workspace for Mac with BYOK Model Support and On-Device Transcription
Silkwave is a unified AI workspace application for Mac that consolidates multiple AI models into a single chat interface using a Bring Your
Whisper Snapper: Mac Transcription Tool with Local AI Processing and Export Options
Whisper Snapper is a Mac application that transcribes audio and video content using AI models, offering both local processing on Mac or clou
Thoth: A native macOS transcription app that runs AI locally for privacy
Thoth is a native macOS transcription app built by a Laser Physicist that runs AI models (Whisper & LLMs) entirely on-device for privacy. It
