abogen


Nameabogen JSON
Version 1.2.0 PyPI version JSON
download
home_pageNone
SummaryGenerate audiobooks from EPUBs, PDFs and text with synchronized captions.
upload_time2025-10-20 00:56:37
maintainerNone
docs_urlNone
authorNone
requires_python<3.13,>=3.10
licenseNone
keywords accessibility audiobook book-converter chapter-management content-creation epub kokoro media-generation multilingual pdf subtitle subtitles text-to-speech tts voice-synthesis
VCS
bugtrack_url
requirements No requirements were recorded.
Travis-CI No Travis.
coveralls test coverage No coveralls.
            # abogen <img width="40px" title="abogen icon" src="https://raw.githubusercontent.com/denizsafak/abogen/refs/heads/main/abogen/assets/icon.ico" align="right" style="padding-left: 10px; padding-top:5px;">

[![Build Status](https://github.com/denizsafak/abogen/actions/workflows/test_pip.yml/badge.svg)](https://github.com/denizsafak/abogen/actions)
[![GitHub Release](https://img.shields.io/github/v/release/denizsafak/abogen)](https://github.com/denizsafak/abogen/releases/latest)
[![Abogen PyPi Python Versions](https://img.shields.io/pypi/pyversions/abogen)](https://pypi.org/project/abogen/)
[![Operating Systems](https://img.shields.io/badge/os-windows%20%7C%20linux%20%7C%20macos%20-blue)](https://github.com/denizsafak/abogen/releases/latest)
[![Code style: black](https://img.shields.io/badge/code%20style-black-000000.svg)](https://github.com/psf/black)
[![License: MIT](https://img.shields.io/badge/License-MIT-maroon.svg)](https://opensource.org/licenses/MIT)

<a href="https://trendshift.io/repositories/14433" target="_blank"><img src="https://trendshift.io/api/badge/repositories/14433" alt="denizsafak%2Fabogen | Trendshift" style="width: 250px; height: 55px;" width="250" height="55"/></a>

Abogen is a powerful text-to-speech conversion tool that makes it easy to turn ePub, PDF, text or markdown files into high-quality audio with matching subtitles in seconds. Use it for audiobooks, voiceovers for Instagram, YouTube, TikTok, or any project that needs natural-sounding text-to-speech, using [Kokoro-82M](https://huggingface.co/hexgrad/Kokoro-82M).

<img title="Abogen Main" src='https://raw.githubusercontent.com/denizsafak/abogen/refs/heads/main/demo/abogen.png' width="380"> <img title="Abogen Processing" src='https://raw.githubusercontent.com/denizsafak/abogen/refs/heads/main/demo/abogen2.png' width="380">

## Demo

https://github.com/user-attachments/assets/094ba3df-7d66-494a-bc31-0e4b41d0b865

> This demo was generated in just 5 seconds, producing ∼1 minute of audio with perfectly synced subtitles. To create a similar video, see [the demo guide](https://github.com/denizsafak/abogen/tree/main/demo).

## `How to install?` <a href="https://pypi.org/project/abogen/" target="_blank"><img src="https://img.shields.io/pypi/pyversions/abogen" alt="Abogen Compatible PyPi Python Versions" align="right" style="margin-top:6px;"></a>

### Windows
Go to [espeak-ng latest release](https://github.com/espeak-ng/espeak-ng/releases/latest) download and run the *.msi file.

#### OPTION 1: Install using script
1. [Download](https://github.com/denizsafak/abogen/archive/refs/heads/main.zip) the repository
2. Extract the ZIP file
3. Run `WINDOWS_INSTALL.bat` by double-clicking it

This method handles everything automatically - installing all dependencies including CUDA in a self-contained environment without requiring a separate Python installation. (You still need to install [espeak-ng](https://github.com/espeak-ng/espeak-ng/releases/latest).)

> [!NOTE]
> You don't need to install Python separately. The script will install Python automatically.

#### OPTION 2: Install using pip
```bash
# Create a virtual environment (optional)
mkdir abogen && cd abogen
python -m venv venv
venv\Scripts\activate

# For NVIDIA GPUs:
pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu128

# For AMD GPUs:
# Not supported yet, because ROCm is not available on Windows. Use Linux if you have AMD GPU.

# Install abogen
pip install abogen
```

### Mac
```bash
# Install espeak-ng
brew install espeak-ng

# Create a virtual environment (recommended)
mkdir abogen && cd abogen
python3 -m venv venv
source venv/bin/activate

# Install abogen
pip3 install abogen

# For Silicon Mac (M1, M2 etc.)
# After installing abogen, we need to install Kokoro's development version which includes MPS support.
pip3 install git+https://github.com/hexgrad/kokoro.git
```
### Linux
```bash
# Install espeak-ng
sudo apt install espeak-ng # Ubuntu/Debian
sudo pacman -S espeak-ng # Arch Linux
sudo dnf install espeak-ng # Fedora

# Create a virtual environment (recommended)
mkdir abogen && cd abogen
python3 -m venv venv
source venv/bin/activate

# Install abogen
pip3 install abogen

# For NVIDIA GPUs:
# Already supported, no need to install CUDA separately.

# For AMD GPUs:
# After installing abogen, we need to uninstall the existing torch package
pip3 uninstall torch 
pip3 install --pre torch torchvision torchaudio --index-url https://download.pytorch.org/whl/nightly/rocm6.4
```

> See [How to fix "CUDA GPU is not available. Using CPU" warning?](#cuda-warning)

> See [How  to fix "WARNING: The script abogen-cli is installed in '/home/username/.local/bin' which is not on PATH" error in Linux?](#path-warning)

> See [How to fix "No matching distribution found" error?](#no-matching-distribution-found)

> See [How to fix "[WinError 1114] A dynamic link library (DLL) initialization routine failed" error?](#WinError-1114)

> See [How to use "uv" instead of "pip"?](#use-uv-instead-of-pip)

> Special thanks to [@hg000125](https://github.com/hg000125) for his contribution in [#23](https://github.com/denizsafak/abogen/issues/23). AMD GPU support is possible thanks to his work.

## `How to run?`
If you installed using pip, you can simply run the following command to start Abogen:

```bash
abogen
```
> [!TIP]
> If you installed Abogen using the Windows installer `(WINDOWS_INSTALL.bat)`, It should have created a shortcut in the same folder, or your desktop. You can run it from there. If you lost the shortcut, Abogen is located in `python_embedded/Scripts/abogen.exe`. You can run it from there directly.

## `How to use?`
1) Drag and drop any ePub, PDF, text or markdown file (or use the built-in text editor)
2) Configure the settings:
    - Set speech speed
    - Select a voice (or create a custom voice using voice mixer)
    - Select subtitle generation style (by sentence, word, etc.)
    - Select output format
    - Select where to save the output
3) Hit Start

## `In action`
<img title="Abogen in action" src='https://raw.githubusercontent.com/denizsafak/abogen/refs/heads/main/demo/abogen.gif'> 

Here’s Abogen in action: in this demo, it processes ∼3,000 characters of text in just 11 seconds and turns it into 3 minutes and 28 seconds of audio, and I have a low-end **RTX 2060 Mobile laptop GPU**. Your results may vary depending on your hardware.

## `Configuration`

| Options | Description |
|---------|-------------|
| **Input Box** | Drag and drop `ePub`, `PDF`, `.TXT` or `.MD` files (or use built-in text editor) |
| **Queue options** | Add multiple files to a queue and process them in batch, with individual settings for each file. See [Queue mode](#queue-mode) for more details. |
| **Speed** | Adjust speech rate from `0.1x` to `2.0x` |
| **Select Voice** | First letter of the language code (e.g., `a` for American English, `b` for British English, etc.), second letter is for `m` for male and `f` for female. |
| **Voice mixer** | Create custom voices by mixing different voice models with a profile system. See [Voice Mixer](#voice-mixer) for more details. |
| **Voice preview** | Listen to the selected voice before processing. |
| **Generate subtitles** | `Disabled`, `Line`, `Sentence`, `Sentence + Comma`, `Sentence + Highlighting`, `1 word`, `2 words`, `3 words`, etc. (Represents the number of words in each subtitle entry) |
| **Output voice format** | `.WAV`, `.FLAC`, `.MP3`, `.OPUS (best compression)` and `M4B (with chapters)` |
| **Output subtitle format** | Configures the subtitle format as `SRT (standard)`, `ASS (wide)`, `ASS (narrow)`, `ASS (centered wide)`, or `ASS (centered narrow)`. |
| **Replace single newlines with spaces** | Replaces single newlines with spaces in the text. This is useful for texts that have imaginary line breaks. |
| **Save location** | `Save next to input file`, `Save to desktop`, or `Choose output folder` |

> Special thanks to [@brianxiadong](https://github.com/brianxiadong) for adding markdown support in PR [#75](https://github.com/denizsafak/abogen/pull/75)

> Special thanks to [@jborza](https://github.com/jborza) for chapter support in PR [#10](https://github.com/denizsafak/abogen/pull/10)

> Special thanks to [@mleg](https://github.com/mleg) for adding `Line` option in subtitle generation in PR [#94](https://github.com/denizsafak/abogen/pull/94)

| Book handler options | Description |
|---------|-------------|
| **Chapter Control** | Select specific `chapters` from ePUBs or markdown files or `chapters + pages` from PDFs. |
| **Save each chapter separately** | Save each chapter in e-books as a separate audio file. |
| **Create a merged version** | Create a single audio file that combines all chapters. (If `Save each chapter separately` is disabled, this option will be the default behavior.) |
| **Save in a project folder with metadata** | Save the converted items in a project folder with available metadata files. |

| Menu options | Description |
|---------|-------------|
| **Theme** | Change the application's theme using `System`, `Light`, or `Dark` options. |
| **Configure max words per subtitle** | Configures the maximum number of words per subtitle entry. |
| **Configure silence between chapters** | Configures the duration of silence between chapters (in seconds). |
| **Configure max lines in log window** | Configures the maximum number of lines to display in the log window. |
| **Separate chapters audio format** | Configures the audio format for separate chapters as `wav`, `flac`, `mp3`, or `opus`. |
| **Create desktop shortcut** | Creates a shortcut on your desktop for easy access. |
| **Open config directory** | Opens the directory where the configuration file is stored. |
| **Open cache directory** | Opens the cache directory where converted text files are stored. |
| **Clear cache files** | Deletes cache files created during the conversion or preview. |
| **Check for updates at startup** | Automatically checks for updates when the program starts. |
| **Disable Kokoro's internet access** | Prevents Kokoro from downloading models or voices from HuggingFace Hub, useful for offline use. |
| **Reset to default settings** | Resets all settings to their default values. |

> Special thanks to [@robmckinnon](https://github.com/robmckinnon) for adding Sentence + Highlighting feature in PR [#65](https://github.com/denizsafak/abogen/pull/65)

## `Voice Mixer`
<img title="Abogen Voice Mixer" src='https://raw.githubusercontent.com/denizsafak/abogen/refs/heads/main/demo/voice_mixer.png'>

With voice mixer, you can create custom voices by mixing different voice models. You can adjust the weight of each voice and save your custom voice as a profile for future use. The voice mixer allows you to create unique and personalized voices.

> Special thanks to [@jborza](https://github.com/jborza) for making this possible through his contributions in [#5](https://github.com/denizsafak/abogen/pull/5)

## `Queue Mode`
<img title="Abogen queue mode" src='https://raw.githubusercontent.com/denizsafak/abogen/refs/heads/main/demo/queue.png'>

Abogen supports **queue mode**, allowing you to add multiple files to a processing queue. This is useful if you want to convert several files in one batch.

- You can add text files (`.txt`) directly using the **Add files** button in the Queue Manager. To add PDF, EPUB, or markdown files, use the input box in the main window and click the **Add to Queue** button.
- Each file in the queue keeps the configuration settings that were active when it was added. Changing the main window configuration afterward does **not** affect files already in the queue.
- You can view each file's configuration by hovering over them.

Abogen will process each item in the queue automatically, saving outputs as configured.

> Special thanks to [@jborza](https://github.com/jborza) for adding queue mode in PR [#35](https://github.com/denizsafak/abogen/pull/35)

## `About Chapter Markers`
When you process ePUB, PDF or markdown files, Abogen converts them into text files stored in your cache directory. When you click "Edit," you're actually modifying these converted text files. In these text files, you'll notice tags that look like this:

```
<<CHAPTER_MARKER:Chapter Title>>
```
These are chapter markers. They are automatically added when you process ePUB, PDF or markdown files, based on the chapters you select. They serve an important purpose:
-  Allow you to split the text into separate audio files for each chapter
-  Save time by letting you reprocess only specific chapters if errors occur, rather than the entire file

You can manually add these markers to plain text files for the same benefits. Simply include them in your text like this:

```
<<CHAPTER_MARKER:Introduction>>
This is the beginning of my text...  

<<CHAPTER_MARKER:Main Content>> 
Here's another part...  
```
When you process the text file, Abogen will detect these markers automatically and ask if you want to save each chapter separately and create a merged version.

![Abogen Chapter Marker](https://raw.githubusercontent.com/denizsafak/abogen/refs/heads/main/demo/chapter_marker.png)

## `About Metadata Tags`
Similar to chapter markers, it is possible to add metadata tags for `M4B` files. This is useful for audiobook players that support metadata, allowing you to add information like title, author, year, etc. Abogen automatically adds these tags when you process ePUB, PDF or markdown files, but you can also add them manually to your text files. Add metadata tags **at the beginning of your text file** like this:
```
<<METADATA_TITLE:Title>>
<<METADATA_ARTIST:Author>>
<<METADATA_ALBUM:Album Title>>
<<METADATA_YEAR:Year>>
<<METADATA_ALBUM_ARTIST:Album Artist>>
<<METADATA_COMPOSER:Narrator>>
<<METADATA_GENRE:Audiobook>>
```

## `Supported Languages`
```
# 🇺🇸 'a' => American English, 🇬🇧 'b' => British English
# 🇪🇸 'e' => Spanish es
# 🇫🇷 'f' => French fr-fr
# 🇮🇳 'h' => Hindi hi
# 🇮🇹 'i' => Italian it
# 🇯🇵 'j' => Japanese: pip install misaki[ja]
# 🇧🇷 'p' => Brazilian Portuguese pt-br
# 🇨🇳 'z' => Mandarin Chinese: pip install misaki[zh]
```
For a complete list of supported languages and voices, refer to Kokoro's [VOICES.md](https://huggingface.co/hexgrad/Kokoro-82M/blob/main/VOICES.md). To listen to sample audio outputs, see [SAMPLES.md](https://huggingface.co/hexgrad/Kokoro-82M/blob/main/SAMPLES.md).

> See [How to fix Japanese audio not working?](#japanese-audio-not-working)

## `MPV Config`
I highly recommend using [MPV](https://mpv.io/installation/) to play your audio files, as it supports displaying subtitles even without a video track. Here's my `mpv.conf`:
```
# --- MPV Settings ---
save-position-on-quit
keep-open=yes
# --- Subtitle ---
sub-ass-override=no
sub-margin-y=50
sub-margin-x=50
# --- Audio Quality ---
audio-spdif=ac3,dts,eac3,truehd,dts-hd
audio-channels=auto
audio-samplerate=48000
volume-max=200
```

## `Docker Guide`
If you want to run Abogen in a Docker container:
1) [Download the repository](https://github.com/denizsafak/abogen/archive/refs/heads/main.zip) and extract, or clone it using git.
2) Go to `abogen` folder. You should see `Dockerfile` there.
3) Open your termminal in that directory and run the following commands:

```bash
# Build the Docker image:
docker build --progress plain -t abogen .

# Note that building the image may take a while.
# After building is complete, run the Docker container:

# Windows
docker run --name abogen -v %cd%:/shared -p 5800:5800 -p 5900:5900 --gpus all abogen

# Linux
docker run --name abogen -v $(pwd):/shared -p 5800:5800 -p 5900:5900 --gpus all abogen

# MacOS
docker run --name abogen -v $(pwd):/shared -p 5800:5800 -p 5900:5900 abogen

# We expose port 5800 for use by a web browser, 5900 if you want to connect with a VNC client.
```

Abogen launches automatically inside the container. 
- You can access it via a web browser at [http://localhost:5800](http://localhost:5800) or connect to it using a VNC client at `localhost:5900`.
- You can use `/shared` directory to share files between your host and the container.
- For later use, start it with `docker start abogen` and stop it with `docker stop abogen`.
- Pass in `-e WEB_AUDIO="1"` for `docker run` to enable audio.

Known issues:
- Audio preview is not working inside container (ALSA error) if using a VNC client.
- `Open cache directory` and `Open configuration directory` options in settings not working. (Tried pcmanfm, did not work with Abogen).

> Special thanks to [@geo38](https://www.reddit.com/user/geo38/) from Reddit, who provided the Dockerfile and instructions in [this comment](https://www.reddit.com/r/selfhosted/comments/1k8x1yo/comment/mpe0bz8/).

## `Similar Projects`
Abogen is a standalone project, but it is inspired by and shares some similarities with other projects. Here are a few:
- [audiblez](https://github.com/santinic/audiblez): Generate audiobooks from e-books. **(Has CLI and GUI support)**
- [autiobooks](https://github.com/plusuncold/autiobooks): Automatically convert epubs to audiobooks
- [pdf-narrator](https://github.com/mateogon/pdf-narrator): Convert your PDFs and EPUBs into audiobooks effortlessly.
- [epub_to_audiobook](https://github.com/p0n1/epub_to_audiobook): EPUB to audiobook converter, optimized for Audiobookshelf
- [ebook2audiobook](https://github.com/DrewThomasson/ebook2audiobook): Convert ebooks to audiobooks with chapters and metadata using dynamic AI models and voice cloning

## `Roadmap`
- [ ] Add OCR scan feature for PDF files using docling/teserract.
- [x] Add chapter metadata for .m4a files. (Issue [#9](https://github.com/denizsafak/abogen/issues/9), PR [#10](https://github.com/denizsafak/abogen/pull/10))
- [ ] Add support for different languages in GUI.
- [x] Add voice formula feature that enables mixing different voice models. (Issue [#1](https://github.com/denizsafak/abogen/issues/1), PR [#5](https://github.com/denizsafak/abogen/pull/5))
- [ ] Add support for kokoro-onnx (If it's necessary).
- [x] Add dark mode.

## `Troubleshooting`
If you encounter any issues while running Abogen, try launching it from the command line with:
```
abogen-cli
```

If you installed using the Windows installer `(WINDOWS_INSTALL.bat)`, go to `python_embedded/Scripts` and run:
```
abogen-cli.exe
```

This will start Abogen in command-line mode and display detailed error messages. Please open a new issue on the [Issues](https://github.com/denizsafak/abogen/issues) page with the error message and a description of your problem.

## `Common Issues & Solutions`

<details><summary><b>
<a name="about-abogen">About the name "abogen"</a>
</b></summary>

> The name **"abogen"** comes from a shortened form of **"audiobook generator"**, which is the purpose of this project.  
>
> After releasing the project, I learned from [community feedback](https://news.ycombinator.com/item?id=44853064#44857237) that the prefix *"abo"* can unfortunately be understood as an ethnic slur in certain regions (particularly Australia and New Zealand). This was something I was not aware of when naming the project, as English is not my first language.  
>
> I want to make it clear that the name was chosen only for its technical meaning, with **no offensive intent**. I’m grateful to those who kindly pointed this out, as it helps ensure the project remains respectful and welcoming to everyone.  

</details>

<details><summary><b>
<a name="cuda-warning">How to fix "CUDA GPU is not available. Using CPU" warning?</a>
</b></summary>

> This message means PyTorch couldn't use your GPU. On Windows, Abogen supports NVIDIA GPUs with CUDA. AMD GPUs are supported only on Linux. Abogen will still run on the CPU, but it will be slower.
>
> If you have a compatible NVIDIA GPU on Windows and still see this warning:
> Open your terminal in the Abogen folder (the folder that contains `python_embedded`) and type:
> ```bash
> python_embedded\python.exe -m pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu128
> ```
> If you have an AMD GPU, use Linux and follow the Linux/ROCm [instructions](#how-to-install-). If you want to keep running on CPU, no action is required, but performance will just be reduced. See [#32](https://github.com/denizsafak/abogen/issues/32) for more details.

</details>

<details><summary><b>
<a name="path-warning">How to fix "WARNING: The script abogen-cli is installed in '/home/username/.local/bin' which is not on PATH" error in Linux?</a>
</b></summary>

> Run the following command to add Abogen to your PATH:
> ```bash
> echo "export PATH=\"/home/$USER/.local/bin:\$PATH\"" >> ~/.bashrc && source ~/.bashrc
> ```

</details>

<details><summary><b>
<a name="no-matching-distribution-found">How to fix "No matching distribution found" error?<a>
</b></summary>

> Try installing Abogen on supported Python (3.10 to 3.12) versions. You can use [pyenv](https://github.com/pyenv/pyenv) to manage multiple Python versions easily in Linux. Watch this [video](https://www.youtube.com/watch?v=MVyb-nI4KyI) by NetworkChuck for a quick guide.

</details>

<details><summary><b>
<a name="WinError-1114">How to fix "[WinError 1114] A dynamic link library (DLL) initialization routine failed" error?</a>
</b></summary>

> I faced this error when trying to run Abogen in a virtual Windows machine without GPU support. Here's how I fixed it:
> If you installed Abogen using the Windows installer `(WINDOWS_INSTALL.bat)`, go to Abogen's folder (that contains `python_embedded`), open your terminal there and run:
> ```bash
> python_embedded\python.exe -m pip install torch==2.8.0 torchaudio==2.8.0 torchvision==0.23.0
> ```
> If you installed Abogen using pip, open your terminal in the virtual environment and run:
> ```bash
> pip install torch==2.8.0 torchaudio==2.8.0 torchvision==0.23.0
> ```

</details>

<details><summary><b>
<a name="japanese-audio-not-working">How to fix Japanese audio not working?</a>
</b></summary>

> Japanese audio may require additional configuration. 
> I'm not sure about the exact solution, but it seems to be related to installing additional dependencies for Japanese support in Kokoro. Please check [#56](https://github.com/denizsafak/abogen/issues/56) for more information. 

</details>

<details><summary><b>
<a name="use-uv-instead-of-pip">How to use "uv" instead of "pip"?</a>
</b></summary>

> Abogen needs "pip", because Kokoro uses pip to download voice models from HuggingFace Hub. If you want to use "uv" instead of "pip", you can use the following command to run Abogen:
>
> ```bash
> uvx --with pip abogen
> ```

</details>

<details><summary><b>
<a name="use-uv-instead-of-pip">How to uninstall Abogen?</a>
</b></summary>

> - From the settings menu, go to `Open configuration directory` and delete the directory.
> - From the settings menu, go to `Open cache directory` and delete the directory.
> - If you installed Abogen using pip, type:
>```bash
>pip uninstall abogen # uninstalls abogen
>pip cache purge # removes pip cache
>```
> - If you installed Abogen using the Windows installer (WINDOWS_INSTALL.bat), just remove the folder that contains Abogen. It installs everything inside `python_embedded` folder, no other directories are created.
> - If you installed espeak-ng, you need to remove it separately.

</details>

## `Contributing`
I welcome contributions! If you have ideas for new features, improvements, or bug fixes, please fork the repository and submit a pull request.
### For developers and contributors
If you'd like to modify the code and contribute to development, you can [download the repository](https://github.com/denizsafak/abogen/archive/refs/heads/main.zip), extract it and run the following commands to build **or** install the package:
```bash
# Go to the directory where you extracted the repository and run:
pip install -e .      # Installs the package in editable mode
pip install build     # Install the build package
python -m build       # Builds the package in dist folder (optional)
abogen                # Opens the GUI
```
Feel free to explore the code and make any changes you like.

## `Credits`
- Abogen uses [Kokoro](https://github.com/hexgrad/kokoro) for its high-quality, natural-sounding text-to-speech synthesis. Huge thanks to the Kokoro team for making this possible.
- Thanks to [@wojiushixiaobai](https://github.com/wojiushixiaobai) for [Embedded Python](https://github.com/wojiushixiaobai/Python-Embed-Win64) packages. These modified packages include pip pre-installed, enabling Abogen to function as a standalone application without requiring users to separately install Python in Windows.
- Thanks to creators of [EbookLib](https://github.com/aerkalov/ebooklib), a Python library for reading and writing ePub files, which is used for extracting text from ePub files.
- Special thanks to the [PyQt](https://www.riverbankcomputing.com/software/pyqt/) team for providing the cross-platform GUI toolkit that powers Abogen's interface.
- Icons: [US](https://icons8.com/icon/aRiu1GGi6Aoe/usa), [Great Britain](https://icons8.com/icon/t3NE3BsOAQwq/great-britain), [Spain](https://icons8.com/icon/ly7tzANRt33n/spain), [France](https://icons8.com/icon/3muzEmi4dpD5/france), [India](https://icons8.com/icon/esGVrxg9VCJ1/india), [Italy](https://icons8.com/icon/PW8KZnP7qXzO/italy), [Japan](https://icons8.com/icon/McQbrq9qaQye/japan), [Brazil](https://icons8.com/icon/zHmH8HpOmM90/brazil), [China](https://icons8.com/icon/Ej50Oe3crXwF/china), [Female](https://icons8.com/icon/uI49hxbpxTkp/female), [Male](https://icons8.com/icon/12351/male), [Adjust](https://icons8.com/icon/21698/adjust) and [Voice Id](https://icons8.com/icon/GskSeVoroQ7u/voice-id) icons by [Icons8](https://icons8.com/).

## `License`
This project is available under the MIT License - see the [LICENSE](https://github.com/denizsafak/abogen/blob/main/LICENSE) file for details.
[Kokoro](https://github.com/hexgrad/kokoro) is licensed under [Apache-2.0](https://github.com/hexgrad/kokoro/blob/main/LICENSE) which allows commercial use, modification, distribution, and private use.

## `Star History`
[![Star History Chart](https://api.star-history.com/svg?repos=denizsafak/abogen&type=Date)](https://www.star-history.com/#denizsafak/abogen&Date)

> [!IMPORTANT]
> Subtitle generation currently works only for English. This is because Kokoro provides timestamp tokens only for English text. If you want subtitles in other languages, please request this feature in the [Kokoro project](https://github.com/hexgrad/kokoro). For more technical details, see [this line](https://github.com/hexgrad/kokoro/blob/6d87f4ae7abc2d14dbc4b3ef2e5f19852e861ac2/kokoro/pipeline.py#L383) in the Kokoro's code.

> Tags: audiobook, kokoro, text-to-speech, TTS, audiobook generator, audiobooks, text to speech, audiobook maker, audiobook creator, audiobook generator, voice-synthesis, text to audio, text to audio converter, text to speech converter, text to speech generator, text to speech software, text to speech app, epub to audio, pdf to audio, markdown to audio, content-creation, media-generation

            

Raw data

            {
    "_id": null,
    "home_page": null,
    "name": "abogen",
    "maintainer": null,
    "docs_url": null,
    "requires_python": "<3.13,>=3.10",
    "maintainer_email": null,
    "keywords": "accessibility, audiobook, book-converter, chapter-management, content-creation, epub, kokoro, media-generation, multilingual, pdf, subtitle, subtitles, text-to-speech, tts, voice-synthesis",
    "author": null,
    "author_email": "Deniz \u015eafak <denizsafak98@gmail.com>",
    "download_url": "https://files.pythonhosted.org/packages/9f/57/29cf1346aaa9e5d8f4bac9ff0a03247b1aa1e79facf46e8fe7a0068ac9bb/abogen-1.2.0.tar.gz",
    "platform": null,
    "description": "# abogen <img width=\"40px\" title=\"abogen icon\" src=\"https://raw.githubusercontent.com/denizsafak/abogen/refs/heads/main/abogen/assets/icon.ico\" align=\"right\" style=\"padding-left: 10px; padding-top:5px;\">\n\n[![Build Status](https://github.com/denizsafak/abogen/actions/workflows/test_pip.yml/badge.svg)](https://github.com/denizsafak/abogen/actions)\n[![GitHub Release](https://img.shields.io/github/v/release/denizsafak/abogen)](https://github.com/denizsafak/abogen/releases/latest)\n[![Abogen PyPi Python Versions](https://img.shields.io/pypi/pyversions/abogen)](https://pypi.org/project/abogen/)\n[![Operating Systems](https://img.shields.io/badge/os-windows%20%7C%20linux%20%7C%20macos%20-blue)](https://github.com/denizsafak/abogen/releases/latest)\n[![Code style: black](https://img.shields.io/badge/code%20style-black-000000.svg)](https://github.com/psf/black)\n[![License: MIT](https://img.shields.io/badge/License-MIT-maroon.svg)](https://opensource.org/licenses/MIT)\n\n<a href=\"https://trendshift.io/repositories/14433\" target=\"_blank\"><img src=\"https://trendshift.io/api/badge/repositories/14433\" alt=\"denizsafak%2Fabogen | Trendshift\" style=\"width: 250px; height: 55px;\" width=\"250\" height=\"55\"/></a>\n\nAbogen is a powerful text-to-speech conversion tool that makes it easy to turn ePub, PDF, text or markdown files into high-quality audio with matching subtitles in seconds. Use it for audiobooks, voiceovers for Instagram, YouTube, TikTok, or any project that needs natural-sounding text-to-speech, using [Kokoro-82M](https://huggingface.co/hexgrad/Kokoro-82M).\n\n<img title=\"Abogen Main\" src='https://raw.githubusercontent.com/denizsafak/abogen/refs/heads/main/demo/abogen.png' width=\"380\"> <img title=\"Abogen Processing\" src='https://raw.githubusercontent.com/denizsafak/abogen/refs/heads/main/demo/abogen2.png' width=\"380\">\n\n## Demo\n\nhttps://github.com/user-attachments/assets/094ba3df-7d66-494a-bc31-0e4b41d0b865\n\n> This demo was generated in just 5\u00a0seconds, producing \u223c1\u00a0minute of audio with perfectly synced subtitles. To create a similar video, see [the demo guide](https://github.com/denizsafak/abogen/tree/main/demo).\n\n## `How to install?` <a href=\"https://pypi.org/project/abogen/\" target=\"_blank\"><img src=\"https://img.shields.io/pypi/pyversions/abogen\" alt=\"Abogen Compatible PyPi Python Versions\" align=\"right\" style=\"margin-top:6px;\"></a>\n\n### Windows\nGo to [espeak-ng latest release](https://github.com/espeak-ng/espeak-ng/releases/latest) download and run the *.msi file.\n\n#### OPTION 1: Install using script\n1. [Download](https://github.com/denizsafak/abogen/archive/refs/heads/main.zip) the repository\n2. Extract the ZIP file\n3. Run `WINDOWS_INSTALL.bat` by double-clicking it\n\nThis method handles everything automatically - installing all dependencies including CUDA in a self-contained environment without requiring a separate Python installation. (You still need to install [espeak-ng](https://github.com/espeak-ng/espeak-ng/releases/latest).)\n\n> [!NOTE]\n> You don't need to install Python separately. The script will install Python automatically.\n\n#### OPTION 2: Install using pip\n```bash\n# Create a virtual environment (optional)\nmkdir abogen && cd abogen\npython -m venv venv\nvenv\\Scripts\\activate\n\n# For NVIDIA GPUs:\npip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu128\n\n# For AMD GPUs:\n# Not supported yet, because ROCm is not available on Windows. Use Linux if you have AMD GPU.\n\n# Install abogen\npip install abogen\n```\n\n### Mac\n```bash\n# Install espeak-ng\nbrew install espeak-ng\n\n# Create a virtual environment (recommended)\nmkdir abogen && cd abogen\npython3 -m venv venv\nsource venv/bin/activate\n\n# Install abogen\npip3 install abogen\n\n# For Silicon Mac (M1, M2 etc.)\n# After installing abogen, we need to install Kokoro's development version which includes MPS support.\npip3 install git+https://github.com/hexgrad/kokoro.git\n```\n### Linux\n```bash\n# Install espeak-ng\nsudo apt install espeak-ng # Ubuntu/Debian\nsudo pacman -S espeak-ng # Arch Linux\nsudo dnf install espeak-ng # Fedora\n\n# Create a virtual environment (recommended)\nmkdir abogen && cd abogen\npython3 -m venv venv\nsource venv/bin/activate\n\n# Install abogen\npip3 install abogen\n\n# For NVIDIA GPUs:\n# Already supported, no need to install CUDA separately.\n\n# For AMD GPUs:\n# After installing abogen, we need to uninstall the existing torch package\npip3 uninstall torch \npip3 install --pre torch torchvision torchaudio --index-url https://download.pytorch.org/whl/nightly/rocm6.4\n```\n\n> See [How to fix \"CUDA GPU is not available. Using CPU\" warning?](#cuda-warning)\n\n> See [How  to fix \"WARNING: The script abogen-cli is installed in '/home/username/.local/bin' which is not on PATH\" error in Linux?](#path-warning)\n\n> See [How to fix \"No matching distribution found\" error?](#no-matching-distribution-found)\n\n> See [How to fix \"[WinError 1114] A dynamic link library (DLL) initialization routine failed\" error?](#WinError-1114)\n\n> See [How to use \"uv\" instead of \"pip\"?](#use-uv-instead-of-pip)\n\n> Special thanks to [@hg000125](https://github.com/hg000125) for his contribution in [#23](https://github.com/denizsafak/abogen/issues/23). AMD GPU support is possible thanks to his work.\n\n## `How to run?`\nIf you installed using pip, you can simply run the following command to start Abogen:\n\n```bash\nabogen\n```\n> [!TIP]\n> If you installed Abogen using the Windows installer `(WINDOWS_INSTALL.bat)`, It should have created a shortcut in the same folder, or your desktop. You can run it from there. If you lost the shortcut, Abogen is located in `python_embedded/Scripts/abogen.exe`. You can run it from there directly.\n\n## `How to use?`\n1) Drag and drop any ePub, PDF, text or markdown file (or use the built-in text editor)\n2) Configure the settings:\n    - Set speech speed\n    - Select a voice (or create a custom voice using voice mixer)\n    - Select subtitle generation style (by sentence, word, etc.)\n    - Select output format\n    - Select where to save the output\n3) Hit Start\n\n## `In action`\n<img title=\"Abogen in action\" src='https://raw.githubusercontent.com/denizsafak/abogen/refs/heads/main/demo/abogen.gif'> \n\nHere\u2019s Abogen in action: in this demo, it processes \u223c3,000 characters of text in just 11 seconds and turns it into 3 minutes and 28 seconds of audio, and I have a low-end **RTX\u00a02060\u00a0Mobile laptop GPU**. Your results may vary depending on your hardware.\n\n## `Configuration`\n\n| Options | Description |\n|---------|-------------|\n| **Input Box** | Drag and drop `ePub`, `PDF`, `.TXT` or `.MD` files (or use built-in text editor) |\n| **Queue options** | Add multiple files to a queue and process them in batch, with individual settings for each file. See [Queue mode](#queue-mode) for more details. |\n| **Speed** | Adjust speech rate from `0.1x` to `2.0x` |\n| **Select Voice** | First letter of the language code (e.g., `a` for American English, `b` for British English, etc.), second letter is for `m` for male and `f` for female. |\n| **Voice mixer** | Create custom voices by mixing different voice models with a profile system. See [Voice Mixer](#voice-mixer) for more details. |\n| **Voice preview** | Listen to the selected voice before processing. |\n| **Generate subtitles** | `Disabled`, `Line`, `Sentence`, `Sentence + Comma`, `Sentence + Highlighting`, `1 word`, `2 words`, `3 words`, etc. (Represents the number of words in each subtitle entry) |\n| **Output voice format** | `.WAV`, `.FLAC`, `.MP3`, `.OPUS (best compression)` and `M4B (with chapters)` |\n| **Output subtitle format** | Configures the subtitle format as `SRT (standard)`, `ASS (wide)`, `ASS (narrow)`, `ASS (centered wide)`, or `ASS (centered narrow)`. |\n| **Replace single newlines with spaces** | Replaces single newlines with spaces in the text. This is useful for texts that have imaginary line breaks. |\n| **Save location** | `Save next to input file`, `Save to desktop`, or `Choose output folder` |\n\n> Special thanks to [@brianxiadong](https://github.com/brianxiadong) for adding markdown support in PR [#75](https://github.com/denizsafak/abogen/pull/75)\n\n> Special thanks to [@jborza](https://github.com/jborza) for chapter support in PR [#10](https://github.com/denizsafak/abogen/pull/10)\n\n> Special thanks to [@mleg](https://github.com/mleg) for adding `Line` option in subtitle generation in PR [#94](https://github.com/denizsafak/abogen/pull/94)\n\n| Book handler options | Description |\n|---------|-------------|\n| **Chapter Control** | Select specific `chapters` from ePUBs or markdown files or `chapters + pages` from PDFs. |\n| **Save each chapter separately** | Save each chapter in e-books as a separate audio file. |\n| **Create a merged version** | Create a single audio file that combines all chapters. (If `Save each chapter separately` is disabled, this option will be the default behavior.) |\n| **Save in a project folder with metadata** | Save the converted items in a project folder with available metadata files. |\n\n| Menu options | Description |\n|---------|-------------|\n| **Theme** | Change the application's theme using `System`, `Light`, or `Dark` options. |\n| **Configure max words per subtitle** | Configures the maximum number of words per subtitle entry. |\n| **Configure silence between chapters** | Configures the duration of silence between chapters (in seconds). |\n| **Configure max lines in log window** | Configures the maximum number of lines to display in the log window. |\n| **Separate chapters audio format** | Configures the audio format for separate chapters as `wav`, `flac`, `mp3`, or `opus`. |\n| **Create desktop shortcut** | Creates a shortcut on your desktop for easy access. |\n| **Open config directory** | Opens the directory where the configuration file is stored. |\n| **Open cache directory** | Opens the cache directory where converted text files are stored. |\n| **Clear cache files** | Deletes cache files created during the conversion or preview. |\n| **Check for updates at startup** | Automatically checks for updates when the program starts. |\n| **Disable Kokoro's internet access** | Prevents Kokoro from downloading models or voices from HuggingFace Hub, useful for offline use. |\n| **Reset to default settings** | Resets all settings to their default values. |\n\n> Special thanks to [@robmckinnon](https://github.com/robmckinnon) for adding Sentence + Highlighting feature in PR [#65](https://github.com/denizsafak/abogen/pull/65)\n\n## `Voice Mixer`\n<img title=\"Abogen Voice Mixer\" src='https://raw.githubusercontent.com/denizsafak/abogen/refs/heads/main/demo/voice_mixer.png'>\n\nWith voice mixer, you can create custom voices by mixing different voice models. You can adjust the weight of each voice and save your custom voice as a profile for future use. The voice mixer allows you to create unique and personalized voices.\n\n> Special thanks to [@jborza](https://github.com/jborza) for making this possible through his contributions in [#5](https://github.com/denizsafak/abogen/pull/5)\n\n## `Queue Mode`\n<img title=\"Abogen queue mode\" src='https://raw.githubusercontent.com/denizsafak/abogen/refs/heads/main/demo/queue.png'>\n\nAbogen supports **queue mode**, allowing you to add multiple files to a processing queue. This is useful if you want to convert several files in one batch.\n\n- You can add text files (`.txt`) directly using the **Add files** button in the Queue Manager. To add PDF, EPUB, or markdown files, use the input box in the main window and click the **Add to Queue** button.\n- Each file in the queue keeps the configuration settings that were active when it was added. Changing the main window configuration afterward does **not** affect files already in the queue.\n- You can view each file's configuration by hovering over them.\n\nAbogen will process each item in the queue automatically, saving outputs as configured.\n\n> Special thanks to [@jborza](https://github.com/jborza) for adding queue mode in PR [#35](https://github.com/denizsafak/abogen/pull/35)\n\n## `About Chapter Markers`\nWhen you process ePUB, PDF or markdown files, Abogen converts them into text files stored in your cache directory. When you click \"Edit,\" you're actually modifying these converted text files. In these text files, you'll notice tags that look like this:\n\n```\n<<CHAPTER_MARKER:Chapter Title>>\n```\nThese are chapter markers. They are automatically added when you process ePUB, PDF or markdown files, based on the chapters you select. They serve an important purpose:\n-  Allow you to split the text into separate audio files for each chapter\n-  Save time by letting you reprocess only specific chapters if errors occur, rather than the entire file\n\nYou can manually add these markers to plain text files for the same benefits. Simply include them in your text like this:\n\n```\n<<CHAPTER_MARKER:Introduction>>\nThis is the beginning of my text...  \n\n<<CHAPTER_MARKER:Main Content>> \nHere's another part...  \n```\nWhen you process the text file, Abogen will detect these markers automatically and ask if you want to save each chapter separately and create a merged version.\n\n![Abogen Chapter Marker](https://raw.githubusercontent.com/denizsafak/abogen/refs/heads/main/demo/chapter_marker.png)\n\n## `About Metadata Tags`\nSimilar to chapter markers, it is possible to add metadata tags for `M4B` files. This is useful for audiobook players that support metadata, allowing you to add information like title, author, year, etc. Abogen automatically adds these tags when you process ePUB, PDF or markdown files, but you can also add them manually to your text files. Add metadata tags **at the beginning of your text file** like this:\n```\n<<METADATA_TITLE:Title>>\n<<METADATA_ARTIST:Author>>\n<<METADATA_ALBUM:Album Title>>\n<<METADATA_YEAR:Year>>\n<<METADATA_ALBUM_ARTIST:Album Artist>>\n<<METADATA_COMPOSER:Narrator>>\n<<METADATA_GENRE:Audiobook>>\n```\n\n## `Supported Languages`\n```\n# \ud83c\uddfa\ud83c\uddf8 'a' => American English, \ud83c\uddec\ud83c\udde7 'b' => British English\n# \ud83c\uddea\ud83c\uddf8 'e' => Spanish es\n# \ud83c\uddeb\ud83c\uddf7 'f' => French fr-fr\n# \ud83c\uddee\ud83c\uddf3 'h' => Hindi hi\n# \ud83c\uddee\ud83c\uddf9 'i' => Italian it\n# \ud83c\uddef\ud83c\uddf5 'j' => Japanese: pip install misaki[ja]\n# \ud83c\udde7\ud83c\uddf7 'p' => Brazilian Portuguese pt-br\n# \ud83c\udde8\ud83c\uddf3 'z' => Mandarin Chinese: pip install misaki[zh]\n```\nFor a complete list of supported languages and voices, refer to Kokoro's [VOICES.md](https://huggingface.co/hexgrad/Kokoro-82M/blob/main/VOICES.md). To listen to sample audio outputs, see [SAMPLES.md](https://huggingface.co/hexgrad/Kokoro-82M/blob/main/SAMPLES.md).\n\n> See [How to fix Japanese audio not working?](#japanese-audio-not-working)\n\n## `MPV Config`\nI highly recommend using [MPV](https://mpv.io/installation/) to play your audio files, as it supports displaying subtitles even without a video track. Here's my `mpv.conf`:\n```\n# --- MPV Settings ---\nsave-position-on-quit\nkeep-open=yes\n# --- Subtitle ---\nsub-ass-override=no\nsub-margin-y=50\nsub-margin-x=50\n# --- Audio Quality ---\naudio-spdif=ac3,dts,eac3,truehd,dts-hd\naudio-channels=auto\naudio-samplerate=48000\nvolume-max=200\n```\n\n## `Docker Guide`\nIf you want to run Abogen in a Docker container:\n1) [Download the repository](https://github.com/denizsafak/abogen/archive/refs/heads/main.zip) and extract, or clone it using git.\n2) Go to `abogen` folder. You should see `Dockerfile` there.\n3) Open your termminal in that directory and run the following commands:\n\n```bash\n# Build the Docker image:\ndocker build --progress plain -t abogen .\n\n# Note that building the image may take a while.\n# After building is complete, run the Docker container:\n\n# Windows\ndocker run --name abogen -v %cd%:/shared -p 5800:5800 -p 5900:5900 --gpus all abogen\n\n# Linux\ndocker run --name abogen -v $(pwd):/shared -p 5800:5800 -p 5900:5900 --gpus all abogen\n\n# MacOS\ndocker run --name abogen -v $(pwd):/shared -p 5800:5800 -p 5900:5900 abogen\n\n# We expose port 5800 for use by a web browser, 5900 if you want to connect with a VNC client.\n```\n\nAbogen launches automatically inside the container. \n- You can access it via a web browser at [http://localhost:5800](http://localhost:5800) or connect to it using a VNC client at `localhost:5900`.\n- You can use `/shared` directory to share files between your host and the container.\n- For later use, start it with `docker start abogen` and stop it with `docker stop abogen`.\n- Pass in `-e WEB_AUDIO=\"1\"` for `docker run` to enable audio.\n\nKnown issues:\n- Audio preview is not working inside container (ALSA error) if using a VNC client.\n- `Open cache directory` and `Open configuration directory` options in settings not working. (Tried pcmanfm, did not work with Abogen).\n\n> Special thanks to [@geo38](https://www.reddit.com/user/geo38/) from Reddit, who provided the Dockerfile and instructions in [this comment](https://www.reddit.com/r/selfhosted/comments/1k8x1yo/comment/mpe0bz8/).\n\n## `Similar Projects`\nAbogen is a standalone project, but it is inspired by and shares some similarities with other projects. Here are a few:\n- [audiblez](https://github.com/santinic/audiblez): Generate audiobooks from e-books. **(Has CLI and GUI support)**\n- [autiobooks](https://github.com/plusuncold/autiobooks): Automatically convert epubs to audiobooks\n- [pdf-narrator](https://github.com/mateogon/pdf-narrator): Convert your PDFs and EPUBs into audiobooks effortlessly.\n- [epub_to_audiobook](https://github.com/p0n1/epub_to_audiobook): EPUB to audiobook converter, optimized for Audiobookshelf\n- [ebook2audiobook](https://github.com/DrewThomasson/ebook2audiobook): Convert ebooks to audiobooks with chapters and metadata using dynamic AI models and voice cloning\n\n## `Roadmap`\n- [ ] Add OCR scan feature for PDF files using docling/teserract.\n- [x] Add chapter metadata for .m4a files. (Issue [#9](https://github.com/denizsafak/abogen/issues/9), PR [#10](https://github.com/denizsafak/abogen/pull/10))\n- [ ] Add support for different languages in GUI.\n- [x] Add voice formula feature that enables mixing different voice models. (Issue [#1](https://github.com/denizsafak/abogen/issues/1), PR [#5](https://github.com/denizsafak/abogen/pull/5))\n- [ ] Add support for kokoro-onnx (If it's necessary).\n- [x] Add dark mode.\n\n## `Troubleshooting`\nIf you encounter any issues while running Abogen, try launching it from the command line with:\n```\nabogen-cli\n```\n\nIf you installed using the Windows installer `(WINDOWS_INSTALL.bat)`, go to `python_embedded/Scripts` and run:\n```\nabogen-cli.exe\n```\n\nThis will start Abogen in command-line mode and display detailed error messages. Please open a new issue on the [Issues](https://github.com/denizsafak/abogen/issues) page with the error message and a description of your problem.\n\n## `Common Issues & Solutions`\n\n<details><summary><b>\n<a name=\"about-abogen\">About the name \"abogen\"</a>\n</b></summary>\n\n> The name **\"abogen\"** comes from a shortened form of **\"audiobook generator\"**, which is the purpose of this project.  \n>\n> After releasing the project, I learned from [community feedback](https://news.ycombinator.com/item?id=44853064#44857237) that the prefix *\"abo\"* can unfortunately be understood as an ethnic slur in certain regions (particularly Australia and New Zealand). This was something I was not aware of when naming the project, as English is not my first language.  \n>\n> I want to make it clear that the name was chosen only for its technical meaning, with **no offensive intent**. I\u2019m grateful to those who kindly pointed this out, as it helps ensure the project remains respectful and welcoming to everyone.  \n\n</details>\n\n<details><summary><b>\n<a name=\"cuda-warning\">How to fix \"CUDA GPU is not available. Using CPU\" warning?</a>\n</b></summary>\n\n> This message means PyTorch couldn't use your GPU. On Windows, Abogen supports NVIDIA GPUs with CUDA. AMD GPUs are supported only on Linux. Abogen will still run on the CPU, but it will be slower.\n>\n> If you have a compatible NVIDIA GPU on Windows and still see this warning:\n> Open your terminal in the Abogen folder (the folder that contains `python_embedded`) and type:\n> ```bash\n> python_embedded\\python.exe -m pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu128\n> ```\n> If you have an AMD GPU, use Linux and follow the Linux/ROCm [instructions](#how-to-install-). If you want to keep running on CPU, no action is required, but performance will just be reduced. See [#32](https://github.com/denizsafak/abogen/issues/32) for more details.\n\n</details>\n\n<details><summary><b>\n<a name=\"path-warning\">How to fix \"WARNING: The script abogen-cli is installed in '/home/username/.local/bin' which is not on PATH\" error in Linux?</a>\n</b></summary>\n\n> Run the following command to add Abogen to your PATH:\n> ```bash\n> echo \"export PATH=\\\"/home/$USER/.local/bin:\\$PATH\\\"\" >> ~/.bashrc && source ~/.bashrc\n> ```\n\n</details>\n\n<details><summary><b>\n<a name=\"no-matching-distribution-found\">How to fix \"No matching distribution found\" error?<a>\n</b></summary>\n\n> Try installing Abogen on supported Python (3.10 to 3.12) versions. You can use [pyenv](https://github.com/pyenv/pyenv) to manage multiple Python versions easily in Linux. Watch this [video](https://www.youtube.com/watch?v=MVyb-nI4KyI) by NetworkChuck for a quick guide.\n\n</details>\n\n<details><summary><b>\n<a name=\"WinError-1114\">How to fix \"[WinError 1114] A dynamic link library (DLL) initialization routine failed\" error?</a>\n</b></summary>\n\n> I faced this error when trying to run Abogen in a virtual Windows machine without GPU support. Here's how I fixed it:\n> If you installed Abogen using the Windows installer `(WINDOWS_INSTALL.bat)`, go to Abogen's folder (that contains `python_embedded`), open your terminal there and run:\n> ```bash\n> python_embedded\\python.exe -m pip install torch==2.8.0 torchaudio==2.8.0 torchvision==0.23.0\n> ```\n> If you installed Abogen using pip, open your terminal in the virtual environment and run:\n> ```bash\n> pip install torch==2.8.0 torchaudio==2.8.0 torchvision==0.23.0\n> ```\n\n</details>\n\n<details><summary><b>\n<a name=\"japanese-audio-not-working\">How to fix Japanese audio not working?</a>\n</b></summary>\n\n> Japanese audio may require additional configuration. \n> I'm not sure about the exact solution, but it seems to be related to installing additional dependencies for Japanese support in Kokoro. Please check [#56](https://github.com/denizsafak/abogen/issues/56) for more information. \n\n</details>\n\n<details><summary><b>\n<a name=\"use-uv-instead-of-pip\">How to use \"uv\" instead of \"pip\"?</a>\n</b></summary>\n\n> Abogen needs \"pip\", because Kokoro uses pip to download voice models from HuggingFace Hub. If you want to use \"uv\" instead of \"pip\", you can use the following command to run Abogen:\n>\n> ```bash\n> uvx --with pip abogen\n> ```\n\n</details>\n\n<details><summary><b>\n<a name=\"use-uv-instead-of-pip\">How to uninstall Abogen?</a>\n</b></summary>\n\n> - From the settings menu, go to `Open configuration directory` and delete the directory.\n> - From the settings menu, go to `Open cache directory` and delete the directory.\n> - If you installed Abogen using pip, type:\n>```bash\n>pip uninstall abogen # uninstalls abogen\n>pip cache purge # removes pip cache\n>```\n> - If you installed Abogen using the Windows installer (WINDOWS_INSTALL.bat), just remove the folder that contains Abogen. It installs everything inside `python_embedded` folder, no other directories are created.\n> - If you installed espeak-ng, you need to remove it separately.\n\n</details>\n\n## `Contributing`\nI welcome contributions! If you have ideas for new features, improvements, or bug fixes, please fork the repository and submit a pull request.\n### For developers and contributors\nIf you'd like to modify the code and contribute to development, you can [download the repository](https://github.com/denizsafak/abogen/archive/refs/heads/main.zip), extract it and run the following commands to build **or** install the package:\n```bash\n# Go to the directory where you extracted the repository and run:\npip install -e .      # Installs the package in editable mode\npip install build     # Install the build package\npython -m build       # Builds the package in dist folder (optional)\nabogen                # Opens the GUI\n```\nFeel free to explore the code and make any changes you like.\n\n## `Credits`\n- Abogen uses [Kokoro](https://github.com/hexgrad/kokoro) for its high-quality, natural-sounding text-to-speech synthesis. Huge thanks to the Kokoro team for making this possible.\n- Thanks to [@wojiushixiaobai](https://github.com/wojiushixiaobai) for [Embedded Python](https://github.com/wojiushixiaobai/Python-Embed-Win64) packages. These modified packages include pip pre-installed, enabling Abogen to function as a standalone application without requiring users to separately install Python in Windows.\n- Thanks to creators of [EbookLib](https://github.com/aerkalov/ebooklib), a Python library for reading and writing ePub files, which is used for extracting text from ePub files.\n- Special thanks to the [PyQt](https://www.riverbankcomputing.com/software/pyqt/) team for providing the cross-platform GUI toolkit that powers Abogen's interface.\n- Icons: [US](https://icons8.com/icon/aRiu1GGi6Aoe/usa), [Great Britain](https://icons8.com/icon/t3NE3BsOAQwq/great-britain), [Spain](https://icons8.com/icon/ly7tzANRt33n/spain), [France](https://icons8.com/icon/3muzEmi4dpD5/france), [India](https://icons8.com/icon/esGVrxg9VCJ1/india), [Italy](https://icons8.com/icon/PW8KZnP7qXzO/italy), [Japan](https://icons8.com/icon/McQbrq9qaQye/japan), [Brazil](https://icons8.com/icon/zHmH8HpOmM90/brazil), [China](https://icons8.com/icon/Ej50Oe3crXwF/china), [Female](https://icons8.com/icon/uI49hxbpxTkp/female), [Male](https://icons8.com/icon/12351/male), [Adjust](https://icons8.com/icon/21698/adjust) and [Voice Id](https://icons8.com/icon/GskSeVoroQ7u/voice-id) icons by [Icons8](https://icons8.com/).\n\n## `License`\nThis project is available under the MIT License - see the [LICENSE](https://github.com/denizsafak/abogen/blob/main/LICENSE) file for details.\n[Kokoro](https://github.com/hexgrad/kokoro) is licensed under [Apache-2.0](https://github.com/hexgrad/kokoro/blob/main/LICENSE) which allows commercial use, modification, distribution, and private use.\n\n## `Star History`\n[![Star History Chart](https://api.star-history.com/svg?repos=denizsafak/abogen&type=Date)](https://www.star-history.com/#denizsafak/abogen&Date)\n\n> [!IMPORTANT]\n> Subtitle generation currently works only for English. This is because Kokoro provides timestamp tokens only for English text. If you want subtitles in other languages, please request this feature in the [Kokoro project](https://github.com/hexgrad/kokoro). For more technical details, see [this line](https://github.com/hexgrad/kokoro/blob/6d87f4ae7abc2d14dbc4b3ef2e5f19852e861ac2/kokoro/pipeline.py#L383) in the Kokoro's code.\n\n> Tags: audiobook, kokoro, text-to-speech, TTS, audiobook generator, audiobooks, text to speech, audiobook maker, audiobook creator, audiobook generator, voice-synthesis, text to audio, text to audio converter, text to speech converter, text to speech generator, text to speech software, text to speech app, epub to audio, pdf to audio, markdown to audio, content-creation, media-generation\n",
    "bugtrack_url": null,
    "license": null,
    "summary": "Generate audiobooks from EPUBs, PDFs and text with synchronized captions.",
    "version": "1.2.0",
    "project_urls": {
        "Documentation": "https://github.com/denizsafak/abogen",
        "Homepage": "https://github.com/denizsafak/abogen",
        "Issues": "https://github.com/denizsafak/abogen/issues",
        "Repository": "https://github.com/denizsafak/abogen"
    },
    "split_keywords": [
        "accessibility",
        " audiobook",
        " book-converter",
        " chapter-management",
        " content-creation",
        " epub",
        " kokoro",
        " media-generation",
        " multilingual",
        " pdf",
        " subtitle",
        " subtitles",
        " text-to-speech",
        " tts",
        " voice-synthesis"
    ],
    "urls": [
        {
            "comment_text": null,
            "digests": {
                "blake2b_256": "6c8f15e3bb0b40430eb416b8f6b06ba50023e0727e4aad4377a19fe077e9115f",
                "md5": "0a925bf1cebe5acd9e714759cbae096d",
                "sha256": "a97df9a07b671a3a8b48e012c1240e3bfa778803ccc07ee3a2f484c98757e02d"
            },
            "downloads": -1,
            "filename": "abogen-1.2.0-py3-none-any.whl",
            "has_sig": false,
            "md5_digest": "0a925bf1cebe5acd9e714759cbae096d",
            "packagetype": "bdist_wheel",
            "python_version": "py3",
            "requires_python": "<3.13,>=3.10",
            "size": 262042,
            "upload_time": "2025-10-20T00:56:36",
            "upload_time_iso_8601": "2025-10-20T00:56:36.162718Z",
            "url": "https://files.pythonhosted.org/packages/6c/8f/15e3bb0b40430eb416b8f6b06ba50023e0727e4aad4377a19fe077e9115f/abogen-1.2.0-py3-none-any.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": null,
            "digests": {
                "blake2b_256": "9f5729cf1346aaa9e5d8f4bac9ff0a03247b1aa1e79facf46e8fe7a0068ac9bb",
                "md5": "ce6c6cf31f71ec65ab134f0cddacae70",
                "sha256": "4f3de29190e3c657c369fe94ff5ba0af5750db3bb72988c0e2e10265503dc78d"
            },
            "downloads": -1,
            "filename": "abogen-1.2.0.tar.gz",
            "has_sig": false,
            "md5_digest": "ce6c6cf31f71ec65ab134f0cddacae70",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": "<3.13,>=3.10",
            "size": 259143,
            "upload_time": "2025-10-20T00:56:37",
            "upload_time_iso_8601": "2025-10-20T00:56:37.891265Z",
            "url": "https://files.pythonhosted.org/packages/9f/57/29cf1346aaa9e5d8f4bac9ff0a03247b1aa1e79facf46e8fe7a0068ac9bb/abogen-1.2.0.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2025-10-20 00:56:37",
    "github": true,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "github_user": "denizsafak",
    "github_project": "abogen",
    "travis_ci": false,
    "coveralls": false,
    "github_actions": true,
    "lcname": "abogen"
}
        
Elapsed time: 2.03431s