Abogen: Convert EPUBs, PDFs, and Text to Audiobooks with Synced Subtitles
By mzehrer
Summary
Abogen is a text-to-speech conversion tool that turns EPUB, PDF, or plain-text files into high-quality audio with synchronized subtitles. It targets uses such as audiobooks and social media voiceovers, and relies on the Kokoro-82M model for natural-sounding speech. As an indication of speed, the project's demo, roughly one minute of audio with synced captions, was generated in about 5 seconds. Installation instructions are provided for Windows.
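The speed figure can be read as a rough real-time factor: seconds of audio produced per second of generation time. The sketch below only does that arithmetic with the two numbers quoted for the demo (about 60 seconds of audio in about 5 seconds). The function name is illustrative and is not part of Abogen, and actual throughput will depend on hardware and input, which the source does not specify.

```python
def realtime_factor(audio_seconds: float, generation_seconds: float) -> float:
    """Return how many seconds of audio are produced per second of compute.

    Hypothetical helper for illustration; not an Abogen API.
    """
    if generation_seconds <= 0:
        raise ValueError("generation_seconds must be positive")
    return audio_seconds / generation_seconds


# Figures quoted for the demo: ~1 minute of audio generated in ~5 seconds.
demo_factor = realtime_factor(audio_seconds=60.0, generation_seconds=5.0)
print(f"Demo ran at roughly {demo_factor:.0f}x real time")
```

A factor well above 1 means generation finishes faster than the audio would take to play back; whether a full-length book sustains the same rate is an open assumption.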
Key quotes
Abogen is a powerful text-to-speech conversion tool that makes it easy to turn ePub, PDF, or text files into high-quality audio with matching subtitles in seconds.
This demo was generated in just 5 seconds, producing ∼1 minute of audio with perfectly synced subtitles.
Use it for audiobooks, voiceovers for Instagram, YouTube, TikTok, or any project that needs natural-sounding text-to-speech, using Kokoro-82M.
