ted2zim


Nameted2zim JSON
Version 3.0.0 PyPI version JSON
download
home_pageNone
SummaryMake ZIM file from TED Talks
upload_time2024-04-19 07:34:18
maintainerNone
docs_urlNone
authorNone
requires_python<3.13,>=3.12
licenseGPL-3.0-or-later
keywords offline openzim ted zim
VCS
bugtrack_url
requirements No requirements were recorded.
Travis-CI No Travis.
coveralls test coverage No coveralls.
            # ted2zim

##### Get the best :bulb: TED videos offline :arrow_down:
An offliner to create ZIM :package: files from TED talks

[![CodeFactor](https://www.codefactor.io/repository/github/openzim/ted/badge)](https://www.codefactor.io/repository/github/openzim/ted)
[![License: GPL v3](https://img.shields.io/badge/License-GPLv3-blue.svg)](https://www.gnu.org/licenses/gpl-3.0)
[![codecov](https://codecov.io/gh/openzim/ted/branch/main/graph/badge.svg)](https://codecov.io/gh/openzim/ted)
[![PyPI version shields.io](https://img.shields.io/pypi/v/ted2zim.svg)](https://pypi.org/project/ted2zim/)
[![PyPI - Python Version](https://img.shields.io/pypi/pyversions/ted2zim.svg)](https://pypi.org/project/ted2zim)

TED (Technology, Entertainment, Design) is a global set of conferences under the slogan "ideas worth spreading". They address a wide range of topics within the research and practice of science and culture, often through storytelling. The speakers are given a maximum of 18 minutes to present their ideas in the most innovative and engaging ways they can. One can eaisly find all the TED videos [here](https://ted.com/talks).

This project is aimed at creating a sustainable solution to make TED accessible offline by creating ZIM files providing these videos in a similar manner like online.

`ted2zim` adheres to openZIM's [Contribution Guidelines](https://github.com/openzim/overview/wiki/Contributing).

`ted2zim` has implemented openZIM's [Python bootstrap, conventions and policies](https://github.com/openzim/_python-bootstrap/docs/Policy.md) **v1.0.1**.

## Getting started :rocket:

#### Install the dependencies
Make sure that you have `python3`, `unzip`, `ffmpeg`, `wget` and `curl` installed on your system before running the scraper (otherwise you'll get a warning to install them).

#### Setup the package
One can easily install the PyPI version but let's setup the source version.

First, clone this repository.

If you do not already have it on your system, install hatch to build the software and manage virtual environments (you might be interested by our detailed [Developer Setup](https://github.com/openzim/_python-bootstrap/wiki/Developer-Setup) as well).

```bash
pip3 install hatch
```

Start a hatch shell: this will install software including dependencies in an isolated virtual environment.

```bash
hatch shell
```

That's it. You can now run `ted2zim` from your terminal

```bash
ted2zim --topics [TOPICS] --name [NAME]
```

For the full list of arguments, see [this](ted2zim/entrypoint.py) file or run the following
```bash
ted2zim --help
```

Example usage
```bash
ted2zim --topics="augmented reality" --debug --name="augumented_reality" --format=mp4 --title="Augmented Reality" --description="TED videos in AR category" --creator="TED" --publisher="openzim" --output="output" --keep --low-quality
```

This project can also be run with docker. Use the provided [Dockerfile](Dockerfile) or [pre-build images](https://github.com/orgs/openzim/packages/container/package/ted) to run it with Docker. See steps [here](https://docs.docker.com/get-started/part2/).

## Features :robot:
You can create ZIMs for multiple topics (should be same as given [here](https://ted.com/talks)), choose between different video formats (webm/mp4), different compression rates, and even use an S3 based cache.

#### Want more flexibility? There's a multitool
`ted2zim-multi` is an extra command available that allows you to do much more with the scraper. It falls back to `ted2zim` if normal commands are passed. It supports creation of multiple ZIMs with single command for both playlists and topics and even getting metadata from a specified JSON file. It supports the following extra arguments -

- `--indiv-zims` - Allows you to create one zim/topic or one zim/playlist
- `--{name|description|zim-file|title}-format` - Allows you to add custom format for the equivalent `ted2zim` arguments. You can add `{identity}` as a placeholder in these values to get the playlist ID / topic name in it's place (spaces replaced by `-`). You can now also add `{slug}` to get the topic/playlist slug.
- `--metadata-from` - Path to a JSON file containing the metadata.

Should be of the following format:

```bash
{
    "<playlist-id/topic-name-with-underscores>": {
        "name": "sample_name_{identity}",
        "description": "Sample description",
        "title": "Custom title",
        "zim-file": "sample.zim",
        "tags": "tag",
        "creator": "Yourself",
        "build-dir": "/custom_build_dir"
    }
}
```

See `ted2zim-multi --help` for details.

## License :book:

[GPLv3](https://www.gnu.org/licenses/gpl-3.0) or later, see
[LICENSE](LICENSE) for more details.

            

Raw data

            {
    "_id": null,
    "home_page": null,
    "name": "ted2zim",
    "maintainer": null,
    "docs_url": null,
    "requires_python": "<3.13,>=3.12",
    "maintainer_email": null,
    "keywords": "offline, openzim, ted, zim",
    "author": null,
    "author_email": "openZIM <dev@openzim.org>",
    "download_url": "https://files.pythonhosted.org/packages/b7/24/d4e22b891c66d682e6ea4be4dea8f8c975db7a8fd176895dfe3f20136d07/ted2zim-3.0.0.tar.gz",
    "platform": null,
    "description": "# ted2zim\n\n##### Get the best :bulb: TED videos offline :arrow_down:\nAn offliner to create ZIM :package: files from TED talks\n\n[![CodeFactor](https://www.codefactor.io/repository/github/openzim/ted/badge)](https://www.codefactor.io/repository/github/openzim/ted)\n[![License: GPL v3](https://img.shields.io/badge/License-GPLv3-blue.svg)](https://www.gnu.org/licenses/gpl-3.0)\n[![codecov](https://codecov.io/gh/openzim/ted/branch/main/graph/badge.svg)](https://codecov.io/gh/openzim/ted)\n[![PyPI version shields.io](https://img.shields.io/pypi/v/ted2zim.svg)](https://pypi.org/project/ted2zim/)\n[![PyPI - Python Version](https://img.shields.io/pypi/pyversions/ted2zim.svg)](https://pypi.org/project/ted2zim)\n\nTED (Technology, Entertainment, Design) is a global set of conferences under the slogan \"ideas worth spreading\". They address a wide range of topics within the research and practice of science and culture, often through storytelling. The speakers are given a maximum of 18 minutes to present their ideas in the most innovative and engaging ways they can. One can eaisly find all the TED videos [here](https://ted.com/talks).\n\nThis project is aimed at creating a sustainable solution to make TED accessible offline by creating ZIM files providing these videos in a similar manner like online.\n\n`ted2zim` adheres to openZIM's [Contribution Guidelines](https://github.com/openzim/overview/wiki/Contributing).\n\n`ted2zim` has implemented openZIM's [Python bootstrap, conventions and policies](https://github.com/openzim/_python-bootstrap/docs/Policy.md) **v1.0.1**.\n\n## Getting started :rocket:\n\n#### Install the dependencies\nMake sure that you have `python3`, `unzip`, `ffmpeg`, `wget` and `curl` installed on your system before running the scraper (otherwise you'll get a warning to install them).\n\n#### Setup the package\nOne can easily install the PyPI version but let's setup the source version.\n\nFirst, clone this repository.\n\nIf you do not already have it on your system, install hatch to build the software and manage virtual environments (you might be interested by our detailed [Developer Setup](https://github.com/openzim/_python-bootstrap/wiki/Developer-Setup) as well).\n\n```bash\npip3 install hatch\n```\n\nStart a hatch shell: this will install software including dependencies in an isolated virtual environment.\n\n```bash\nhatch shell\n```\n\nThat's it. You can now run `ted2zim` from your terminal\n\n```bash\nted2zim --topics [TOPICS] --name [NAME]\n```\n\nFor the full list of arguments, see [this](ted2zim/entrypoint.py) file or run the following\n```bash\nted2zim --help\n```\n\nExample usage\n```bash\nted2zim --topics=\"augmented reality\" --debug --name=\"augumented_reality\" --format=mp4 --title=\"Augmented Reality\" --description=\"TED videos in AR category\" --creator=\"TED\" --publisher=\"openzim\" --output=\"output\" --keep --low-quality\n```\n\nThis project can also be run with docker. Use the provided [Dockerfile](Dockerfile) or [pre-build images](https://github.com/orgs/openzim/packages/container/package/ted) to run it with Docker. See steps [here](https://docs.docker.com/get-started/part2/).\n\n## Features :robot:\nYou can create ZIMs for multiple topics (should be same as given [here](https://ted.com/talks)), choose between different video formats (webm/mp4), different compression rates, and even use an S3 based cache.\n\n#### Want more flexibility? There's a multitool\n`ted2zim-multi` is an extra command available that allows you to do much more with the scraper. It falls back to `ted2zim` if normal commands are passed. It supports creation of multiple ZIMs with single command for both playlists and topics and even getting metadata from a specified JSON file. It supports the following extra arguments -\n\n- `--indiv-zims` - Allows you to create one zim/topic or one zim/playlist\n- `--{name|description|zim-file|title}-format` - Allows you to add custom format for the equivalent `ted2zim` arguments. You can add `{identity}` as a placeholder in these values to get the playlist ID / topic name in it's place (spaces replaced by `-`). You can now also add `{slug}` to get the topic/playlist slug.\n- `--metadata-from` - Path to a JSON file containing the metadata.\n\nShould be of the following format:\n\n```bash\n{\n    \"<playlist-id/topic-name-with-underscores>\": {\n        \"name\": \"sample_name_{identity}\",\n        \"description\": \"Sample description\",\n        \"title\": \"Custom title\",\n        \"zim-file\": \"sample.zim\",\n        \"tags\": \"tag\",\n        \"creator\": \"Yourself\",\n        \"build-dir\": \"/custom_build_dir\"\n    }\n}\n```\n\nSee `ted2zim-multi --help` for details.\n\n## License :book:\n\n[GPLv3](https://www.gnu.org/licenses/gpl-3.0) or later, see\n[LICENSE](LICENSE) for more details.\n",
    "bugtrack_url": null,
    "license": "GPL-3.0-or-later",
    "summary": "Make ZIM file from TED Talks",
    "version": "3.0.0",
    "project_urls": {
        "Donate": "https://www.kiwix.org/en/support-us/",
        "Homepage": "https://github.com/openzim/ted"
    },
    "split_keywords": [
        "offline",
        " openzim",
        " ted",
        " zim"
    ],
    "urls": [
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "4bb5bd92e197817e95a16897cc956eb6cbac06b2b9076e259f70a58de53c6e38",
                "md5": "c5e33d2f70f6b7eddd4eddf711166343",
                "sha256": "c13298b21efd27ed7562ae270b74014fc5df51d4d6c94df8ab6e07a70690ce6c"
            },
            "downloads": -1,
            "filename": "ted2zim-3.0.0-py3-none-any.whl",
            "has_sig": false,
            "md5_digest": "c5e33d2f70f6b7eddd4eddf711166343",
            "packagetype": "bdist_wheel",
            "python_version": "py3",
            "requires_python": "<3.13,>=3.12",
            "size": 3121411,
            "upload_time": "2024-04-19T07:34:16",
            "upload_time_iso_8601": "2024-04-19T07:34:16.264206Z",
            "url": "https://files.pythonhosted.org/packages/4b/b5/bd92e197817e95a16897cc956eb6cbac06b2b9076e259f70a58de53c6e38/ted2zim-3.0.0-py3-none-any.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "b724d4e22b891c66d682e6ea4be4dea8f8c975db7a8fd176895dfe3f20136d07",
                "md5": "7e25e14df88f278cd2043a2a4367fe48",
                "sha256": "cca98faea290755efe1e614039b366aa8d2b8a4d477ccedea0c9bb96fdf416dd"
            },
            "downloads": -1,
            "filename": "ted2zim-3.0.0.tar.gz",
            "has_sig": false,
            "md5_digest": "7e25e14df88f278cd2043a2a4367fe48",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": "<3.13,>=3.12",
            "size": 447542,
            "upload_time": "2024-04-19T07:34:18",
            "upload_time_iso_8601": "2024-04-19T07:34:18.601372Z",
            "url": "https://files.pythonhosted.org/packages/b7/24/d4e22b891c66d682e6ea4be4dea8f8c975db7a8fd176895dfe3f20136d07/ted2zim-3.0.0.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2024-04-19 07:34:18",
    "github": true,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "github_user": "openzim",
    "github_project": "ted",
    "travis_ci": false,
    "coveralls": false,
    "github_actions": true,
    "lcname": "ted2zim"
}
        
Elapsed time: 0.25282s