ttspod


Namettspod JSON
Version 0.1.437 PyPI version JSON
download
home_pageNone
SummaryConvert any content to a podcast
upload_time2024-12-09 22:58:14
maintainerNone
docs_urlNone
authorNone
requires_python==3.11.*
licenseMIT License
keywords podcast text-to-speech
VCS
bugtrack_url
requirements Requests Unidecode anyascii ffmpeg html2text instapaper ipython lxml lxml_html_clean nltk numpy pandoc paramiko pocket pocket_api pod2gen posix_ipc pydub pymupdf pypandoc pypandoc-binary python-dotenv python_magic trafilatura validators webdataset
Travis-CI No Travis.
coveralls test coverage No coveralls.
            # TTSPod

This is a command-line app that takes many types of content and turns them into audible speech as a podcast feed.

## Inputs 

* Your Wallabag feed
* Your Pocket feed
* Your Instapaper feed 
* An arbitrary URL
* An email (pipe the email into the script, or provide as command-line argument)
* A locally-stored HTML file
* A locally-stored text file
* Office documents/PDFs 

## Outputs

* A podcast RSS feed
* MP3 files corresponding to each input item

You'll need a web server to host the generated content in order to subscribe with your podcasting app.

## Text-to-Speech Engines

* [F5](https://github.com/SWivid/F5-TTS) (free, requires substantial compute resources and GPU)
* [Coqui](https://github.com/coqui-ai/TTS) (free, requires substantial compute resources and GPU): supports XTTS (fast) and Tortoise (slow, high quality) inference models
* [Whisper](https://github.com/collabora/WhisperSpeech) (free, requires substantial compute resources and GPU)
* OpenAI (paid, requires an [API key](https://platform.openai.com/api-keys))
* Eleven (limited free version or paid version, [requires an API key](https://elevenlabs.io/docs/api-reference/getting-started))

Depending on your hardware configuration, you may need to pull a more recent pytorch build to get maximum performance for your specific GPU. See [the PyTorch website](https://pytorch.org/get-started/locally/) for instructions on installing torch and torchaudio with pip for your platform. Coqui with XTTS runs reasonably fast on Linux, Mac, and Windows, although the Mac GPU (mps) support is limited compared to NVidia CUDA. Coqui with Tortoise is quite slow, especially on Mac.

## Get Started
This should work "out of the box" on Linux or MacOS.
```
mkdir ttspod
cd ttspod
curl -s https://raw.githubusercontent.com/ajkessel/ttspod/refs/heads/main/quickstart.sh -o quickstart.sh
bash quickstart.sh
```
Windows install from PowerShell, not extensively tested:
```
Invoke-WebRequest 'https://raw.githubusercontent.com/ajkessel/ttspod/refs/heads/main/quickstart.ps1' -OutFile 'quickstart.ps1'
& quickstart.ps1
```

You'll need to generate a config file (`ttspod -g`) and edit it before you can do anything. Minimal required settings include configuring your TTS speech engine preference and podcast URL.

You'll also need somewhere to host your RSS feed and MP3 audio files if you want to subscribe and listen with a podcast client. The application is set up to sync the podcast feed to a web server over ssh.

## Usage
```
usage: ttspod [-h] [-c [CONFIG]] [-g [GENERATE]] [-w [WALLABAG]] [-i [INSTA]] [-p [POCKET]] [-l [LOG]] [-q [QUIET]] [-d] [-r] [-f] [-t TITLE] [-e ENGINE] [-m MODEL] [-s] [-n] [--nogpu] [-u] [-v] [url ...]

Convert any content to a podcast feed.

positional arguments:
  url                   specify any number of URLs or local documents (plain text, HTML, PDF, Word documents, etc) to add to your podcast feed

options:
  -h, --help            show this help message and exit
  -c [CONFIG], --config [CONFIG]
                        specify path for config file (default ~/.config/ttspod.ini if it exists, otherwise .env in the current directory)
  -g [GENERATE], --generate [GENERATE]
                        generate a new config file(default ~/.config/ttspod.ini if ~/.config exists, otherwise .env in the current directory)
  -w [WALLABAG], --wallabag [WALLABAG]
                        add unprocessed items with specified tag (default audio) from your wallabag feed to your podcast feed
  -i [INSTA], --insta [INSTA]
                        add unprocessed items with specified tag (default audio) from your instapaper feed to your podcast feed, or use tag ALL for default inbox
  -p [POCKET], --pocket [POCKET]
                        add unprocessed items with specified tag (default audio) from your pocket feed to your podcast feed
  -l [LOG], --log [LOG]
                        log all output to specified filename (default ttspod.log)
  -q [QUIET], --quiet [QUIET]
                        no visible output (all output will go to log if specified)
  -d, --debug           include debug output
  -r, --restart         wipe state file clean and start new podcast feed
  -f, --force           force addition of podcast even if cache indicates it has already been added
  -t TITLE, --title TITLE
                        specify title for content provided via pipe
  -e ENGINE, --engine ENGINE
                        specify TTS engine for this session (whisper, coqui, openai, eleven)
  -m MODEL, --model MODEL
                        specify model to use with engine (for use with Coqui, OpenAI, or Eleven)
  -s, --sync            sync podcast episodes and state file
  -n, --dry-run         do not actually create or sync audio files
  --nogpu               disable GPU support (try this if you're having trouble on Mac)
  -u, --upgrade         upgrade to latest version
  -v, --version         print version number
```
### Examples
Add a URL to your podcast feed
```
# ttspod https://slashdot.org/story/24/09/24/2049204/human-reviewers-cant-keep-up-with-police-bodycam-videos-ai-now-gets-the-job
```
Update your podcast feed with all of your Wallabag items tagged "audio" that have not yet been processed
```
ttspod -w
```
Create a one-sentence podcast from the command-line
```
echo "This text will be turned into a podcast that I will be able to listen to later." | ttspod -t 'The Title of the Podcast'
```
Turn a Word document into a podcast 
```
ttspod my_document.docx
```

## Platforms
* Linux
* MacOS
* Windows

## procmail
The easiest way to feed emails to TTSPod is with a procmail recipe in `.procmailrc`. For example, this recipe will send emails from me@example.com or you@domain.com to myttsaddress@mydomain.com to this script, assuming you have a symlink to the script in `~/.local/bin`.
```
:0 Hc
* ^From:(.*\<(?)(me@example.com|you@domain.com)
* ^(To|X-Original-To):(.*\<(?)(myttsaddress@mydomain.com)
| ${HOME}/.local/bin/ttspod &> ${HOME}/log/tts.log 
```

## TODO
* Sanity checking on config settings
* Smooth migration of config settings with updates
* Command-line options for all configuration settings
* Interactive configuration
* Pocket interactive authentication workflow
* Instapaper interactive authentication workflow
* Process links received by email
* Process directly-emailed mp3s and links to mp3s
* Allow configuration of TTS models/voices/speeds/etc
* More customizations for podcast feed
* Add audio files directly from CLI via filesystem path or URL
* Use rsync where available, only remote_sync as fallback
* Language support - right now everything assumes English
* Graphical interface
* Detect paragraph breaks for slightly longer pauses
* Unit tests!

## License
[MIT](LICENSE)

Contributions welcome.

            

Raw data

            {
    "_id": null,
    "home_page": null,
    "name": "ttspod",
    "maintainer": null,
    "docs_url": null,
    "requires_python": "==3.11.*",
    "maintainer_email": "Adam Kessel <adam@rosi-kessel.org>",
    "keywords": "podcast, text-to-speech",
    "author": null,
    "author_email": "Adam Kessel <adam@rosi-kessel.org>",
    "download_url": "https://files.pythonhosted.org/packages/0d/41/06888c1182d8d84d189c770afb8f2d41d7d2e4dc36cf3181388664d488a4/ttspod-0.1.437.tar.gz",
    "platform": null,
    "description": "# TTSPod\n\nThis is a command-line app that takes many types of content and turns them into audible speech as a podcast feed.\n\n## Inputs \n\n* Your Wallabag feed\n* Your Pocket feed\n* Your Instapaper feed \n* An arbitrary URL\n* An email (pipe the email into the script, or provide as command-line argument)\n* A locally-stored HTML file\n* A locally-stored text file\n* Office documents/PDFs \n\n## Outputs\n\n* A podcast RSS feed\n* MP3 files corresponding to each input item\n\nYou'll need a web server to host the generated content in order to subscribe with your podcasting app.\n\n## Text-to-Speech Engines\n\n* [F5](https://github.com/SWivid/F5-TTS) (free, requires substantial compute resources and GPU)\n* [Coqui](https://github.com/coqui-ai/TTS) (free, requires substantial compute resources and GPU): supports XTTS (fast) and Tortoise (slow, high quality) inference models\n* [Whisper](https://github.com/collabora/WhisperSpeech) (free, requires substantial compute resources and GPU)\n* OpenAI (paid, requires an [API key](https://platform.openai.com/api-keys))\n* Eleven (limited free version or paid version, [requires an API key](https://elevenlabs.io/docs/api-reference/getting-started))\n\nDepending on your hardware configuration, you may need to pull a more recent pytorch build to get maximum performance for your specific GPU. See [the PyTorch website](https://pytorch.org/get-started/locally/) for instructions on installing torch and torchaudio with pip for your platform. Coqui with XTTS runs reasonably fast on Linux, Mac, and Windows, although the Mac GPU (mps) support is limited compared to NVidia CUDA. Coqui with Tortoise is quite slow, especially on Mac.\n\n## Get Started\nThis should work \"out of the box\" on Linux or MacOS.\n```\nmkdir ttspod\ncd ttspod\ncurl -s https://raw.githubusercontent.com/ajkessel/ttspod/refs/heads/main/quickstart.sh -o quickstart.sh\nbash quickstart.sh\n```\nWindows install from PowerShell, not extensively tested:\n```\nInvoke-WebRequest 'https://raw.githubusercontent.com/ajkessel/ttspod/refs/heads/main/quickstart.ps1' -OutFile 'quickstart.ps1'\n& quickstart.ps1\n```\n\nYou'll need to generate a config file (`ttspod -g`) and edit it before you can do anything. Minimal required settings include configuring your TTS speech engine preference and podcast URL.\n\nYou'll also need somewhere to host your RSS feed and MP3 audio files if you want to subscribe and listen with a podcast client. The application is set up to sync the podcast feed to a web server over ssh.\n\n## Usage\n```\nusage: ttspod [-h] [-c [CONFIG]] [-g [GENERATE]] [-w [WALLABAG]] [-i [INSTA]] [-p [POCKET]] [-l [LOG]] [-q [QUIET]] [-d] [-r] [-f] [-t TITLE] [-e ENGINE] [-m MODEL] [-s] [-n] [--nogpu] [-u] [-v] [url ...]\n\nConvert any content to a podcast feed.\n\npositional arguments:\n  url                   specify any number of URLs or local documents (plain text, HTML, PDF, Word documents, etc) to add to your podcast feed\n\noptions:\n  -h, --help            show this help message and exit\n  -c [CONFIG], --config [CONFIG]\n                        specify path for config file (default ~/.config/ttspod.ini if it exists, otherwise .env in the current directory)\n  -g [GENERATE], --generate [GENERATE]\n                        generate a new config file(default ~/.config/ttspod.ini if ~/.config exists, otherwise .env in the current directory)\n  -w [WALLABAG], --wallabag [WALLABAG]\n                        add unprocessed items with specified tag (default audio) from your wallabag feed to your podcast feed\n  -i [INSTA], --insta [INSTA]\n                        add unprocessed items with specified tag (default audio) from your instapaper feed to your podcast feed, or use tag ALL for default inbox\n  -p [POCKET], --pocket [POCKET]\n                        add unprocessed items with specified tag (default audio) from your pocket feed to your podcast feed\n  -l [LOG], --log [LOG]\n                        log all output to specified filename (default ttspod.log)\n  -q [QUIET], --quiet [QUIET]\n                        no visible output (all output will go to log if specified)\n  -d, --debug           include debug output\n  -r, --restart         wipe state file clean and start new podcast feed\n  -f, --force           force addition of podcast even if cache indicates it has already been added\n  -t TITLE, --title TITLE\n                        specify title for content provided via pipe\n  -e ENGINE, --engine ENGINE\n                        specify TTS engine for this session (whisper, coqui, openai, eleven)\n  -m MODEL, --model MODEL\n                        specify model to use with engine (for use with Coqui, OpenAI, or Eleven)\n  -s, --sync            sync podcast episodes and state file\n  -n, --dry-run         do not actually create or sync audio files\n  --nogpu               disable GPU support (try this if you're having trouble on Mac)\n  -u, --upgrade         upgrade to latest version\n  -v, --version         print version number\n```\n### Examples\nAdd a URL to your podcast feed\n```\n# ttspod https://slashdot.org/story/24/09/24/2049204/human-reviewers-cant-keep-up-with-police-bodycam-videos-ai-now-gets-the-job\n```\nUpdate your podcast feed with all of your Wallabag items tagged \"audio\" that have not yet been processed\n```\nttspod -w\n```\nCreate a one-sentence podcast from the command-line\n```\necho \"This text will be turned into a podcast that I will be able to listen to later.\" | ttspod -t 'The Title of the Podcast'\n```\nTurn a Word document into a podcast \n```\nttspod my_document.docx\n```\n\n## Platforms\n* Linux\n* MacOS\n* Windows\n\n## procmail\nThe easiest way to feed emails to TTSPod is with a procmail recipe in `.procmailrc`. For example, this recipe will send emails from me@example.com or you@domain.com to myttsaddress@mydomain.com to this script, assuming you have a symlink to the script in `~/.local/bin`.\n```\n:0 Hc\n* ^From:(.*\\<(?)(me@example.com|you@domain.com)\n* ^(To|X-Original-To):(.*\\<(?)(myttsaddress@mydomain.com)\n| ${HOME}/.local/bin/ttspod &> ${HOME}/log/tts.log \n```\n\n## TODO\n* Sanity checking on config settings\n* Smooth migration of config settings with updates\n* Command-line options for all configuration settings\n* Interactive configuration\n* Pocket interactive authentication workflow\n* Instapaper interactive authentication workflow\n* Process links received by email\n* Process directly-emailed mp3s and links to mp3s\n* Allow configuration of TTS models/voices/speeds/etc\n* More customizations for podcast feed\n* Add audio files directly from CLI via filesystem path or URL\n* Use rsync where available, only remote_sync as fallback\n* Language support - right now everything assumes English\n* Graphical interface\n* Detect paragraph breaks for slightly longer pauses\n* Unit tests!\n\n## License\n[MIT](LICENSE)\n\nContributions welcome.\n",
    "bugtrack_url": null,
    "license": "MIT License",
    "summary": "Convert any content to a podcast",
    "version": "0.1.437",
    "project_urls": {
        "Changelog": "https://github.com/ajkessel/ttspod/commits/main/",
        "Documentation": "https://github.com/ajkessel/ttspod/blob/main/README.md",
        "Homepage": "https://github.com/ajkessel/ttspod",
        "Issues": "https://github.com/ajkessel/ttspod/issues",
        "Repository": "https://github.com/ajkessel/ttspod"
    },
    "split_keywords": [
        "podcast",
        " text-to-speech"
    ],
    "urls": [
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "0d4106888c1182d8d84d189c770afb8f2d41d7d2e4dc36cf3181388664d488a4",
                "md5": "3fc19468cad808d4fc46c4de384ca28f",
                "sha256": "f91bfcdc96f8f931a3b8e583795d31bcc79820e35bc168bade14f9e2af340572"
            },
            "downloads": -1,
            "filename": "ttspod-0.1.437.tar.gz",
            "has_sig": false,
            "md5_digest": "3fc19468cad808d4fc46c4de384ca28f",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": "==3.11.*",
            "size": 5467977,
            "upload_time": "2024-12-09T22:58:14",
            "upload_time_iso_8601": "2024-12-09T22:58:14.708410Z",
            "url": "https://files.pythonhosted.org/packages/0d/41/06888c1182d8d84d189c770afb8f2d41d7d2e4dc36cf3181388664d488a4/ttspod-0.1.437.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2024-12-09 22:58:14",
    "github": true,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "github_user": "ajkessel",
    "github_project": "ttspod",
    "travis_ci": false,
    "coveralls": false,
    "github_actions": false,
    "requirements": [
        {
            "name": "Requests",
            "specs": [
                [
                    "==",
                    "2.32.3"
                ]
            ]
        },
        {
            "name": "Unidecode",
            "specs": [
                [
                    "==",
                    "1.3.8"
                ]
            ]
        },
        {
            "name": "anyascii",
            "specs": [
                [
                    "==",
                    "0.3.2"
                ]
            ]
        },
        {
            "name": "ffmpeg",
            "specs": []
        },
        {
            "name": "html2text",
            "specs": [
                [
                    "==",
                    "2024.2.26"
                ]
            ]
        },
        {
            "name": "instapaper",
            "specs": []
        },
        {
            "name": "ipython",
            "specs": []
        },
        {
            "name": "lxml",
            "specs": [
                [
                    "==",
                    "5.3.0"
                ]
            ]
        },
        {
            "name": "lxml_html_clean",
            "specs": []
        },
        {
            "name": "nltk",
            "specs": [
                [
                    "==",
                    "3.9.1"
                ]
            ]
        },
        {
            "name": "numpy",
            "specs": [
                [
                    "<",
                    "2"
                ]
            ]
        },
        {
            "name": "pandoc",
            "specs": [
                [
                    "==",
                    "2.4"
                ]
            ]
        },
        {
            "name": "paramiko",
            "specs": [
                [
                    "==",
                    "3.4.1"
                ]
            ]
        },
        {
            "name": "pocket",
            "specs": [
                [
                    "==",
                    "0.3.6"
                ]
            ]
        },
        {
            "name": "pocket_api",
            "specs": [
                [
                    "==",
                    "0.1.5"
                ]
            ]
        },
        {
            "name": "pod2gen",
            "specs": [
                [
                    "==",
                    "1.0.3"
                ]
            ]
        },
        {
            "name": "posix_ipc",
            "specs": []
        },
        {
            "name": "pydub",
            "specs": [
                [
                    "==",
                    "0.25.1"
                ]
            ]
        },
        {
            "name": "pymupdf",
            "specs": []
        },
        {
            "name": "pypandoc",
            "specs": []
        },
        {
            "name": "pypandoc-binary",
            "specs": []
        },
        {
            "name": "python-dotenv",
            "specs": [
                [
                    "==",
                    "1.0.1"
                ]
            ]
        },
        {
            "name": "python_magic",
            "specs": [
                [
                    "==",
                    "0.4.27"
                ]
            ]
        },
        {
            "name": "trafilatura",
            "specs": [
                [
                    "==",
                    "1.12.2"
                ]
            ]
        },
        {
            "name": "validators",
            "specs": [
                [
                    "==",
                    "0.34.0"
                ]
            ]
        },
        {
            "name": "webdataset",
            "specs": []
        }
    ],
    "lcname": "ttspod"
}
        
Elapsed time: 0.87528s