Erm: A Local CLI Tool for Removing Speech Disfluencies from Audio Recordings
By
"Doug Calobrisi"
Baker's choice. Dense with flavour, light on filler.
Summary
A developer built "erm," a free, open-source, local CLI tool that removes disfluencies (ums, uhs, ers) from English speech recordings. It uses faster-whisper for transcription, audio-level detectors, and ffmpeg to produce cleaned audio files and JSON cut lists. The tool runs entirely locally, respects privacy, and is designed for podcasters, voiceover artists, and anyone editing spoken audio.
Key quotes
· 3 pulledLinguists have a word for the ums, uhs, ers, and elongated versions (ummmm, uhhhhh) that pad spoken English: disfluencies.
I don't record a lot of voice audio, but a few friends do, and they tell me editing those out by hand is miserable. So I built erm to do it.
That's the whole interface for the common case. It writes a cleaned .wav and a JSON cut list next to the
You might also wanna read
Stenox: macOS Voice Dictation Tool with Local and Cloud Transcription Options
Stenox is a macOS voice dictation tool that enables transcription across all apps and browsers. It offers multiple transcription options inc
AI Voice Note Tool for Transcription and Content Creation
AI voice note tool that transcribes, cleans, and structures voice recordings into various formats like transcripts, summaries, emails, and v
Monologue: Voice Dictation Tool That Automatically Formats Spoken Text
Monologue is a voice dictation tool that automatically formats spoken words into clean, structured text by removing fillers, adding punctuat
Whisper Snapper: Mac Transcription Tool with Local AI Processing and Export Options
Whisper Snapper is a Mac application that transcribes audio and video content using AI models, offering both local processing on Mac or clou
TalkMirror: Privacy-First Voice Practice Tool for Language Learning and Public Speaking
TalkMirror is a privacy-first voice practice tool that functions as an acoustic mirror for language learning, public speaking, and acting pr
Dictato: Local Voice-to-Text Software for Mac with On-Device Processing
Dictato is a local voice-to-text application for Mac that converts speech to text entirely on-device without requiring cloud services, inter
