Listen to e-books you own when no commercial audiobook exists.
Create personal audiobooks in your native language from any book format.
Generate audiobooks with a custom voice by cloning your own voice or someone else's.
Convert image-based books (scanned PDFs) into spoken audio using OCR.
Requires downloading large model weights (XTTSv2/Bark) on first run, which can take 10-15 minutes depending on internet speed.
ebook2audiobook is a tool that converts e-books into full audiobook files, complete with chapters and metadata. If you have an EPUB, PDF, MOBI, or dozens of other book formats sitting on your device, this tool reads the text and turns it into spoken audio you can listen to like any audiobook. The conversion uses text-to-speech (TTS) engines, software that reads text aloud in a synthesized voice. The project supports multiple TTS engines including XTTSv2, Bark, and others, so you can trade off speed versus voice quality. One standout feature is voice cloning: you can supply a short recording of any voice, and the tool will narrate your book in that voice. It supports over 1,158 languages, runs on ordinary laptops (even without a powerful graphics card), and outputs common audio formats like MP3, M4B, FLAC, and WAV. You would use it when you want to listen to a book you legally own but no commercial audiobook version exists, or when you want a personal audiobook in your native language. It also includes OCR support, meaning it can read books where pages are stored as images rather than text. You can run it through a simple web browser interface powered by Gradio, via Docker, or through Google Colab in the cloud, no coding required for basic use. Written in Python and packaged for Mac, Linux, and Windows.
Generated 2026-05-18 · Model: sonnet-4-6 · Verify against the repo before relying on details.