llama-index-readers-youtube-transcript


| Field | Value |
| --- | --- |
| Name | llama-index-readers-youtube-transcript |
| Version | 0.2.0 |
| home_page | None |
| Summary | llama-index readers youtube transcript integration |
| upload_time | 2024-08-22 03:16:45 |
| maintainer | ravi03071991 |
| docs_url | None |
| author | Your Name |
| requires_python | <4.0,>=3.8.1 |
| license | MIT |
| keywords | video |
| requirements | No requirements were recorded. |

# YouTube Transcript Loader

```bash
pip install youtube_transcript_api

pip install llama-index-readers-youtube-transcript
```

This loader fetches the text transcripts of YouTube videos using the `youtube_transcript_api` Python package.

## Usage

To use this loader, first install its dependency with `pip install youtube_transcript_api`.

Then pass a list of YouTube links to `load_data`:

```python
from llama_index.readers.youtube_transcript import YoutubeTranscriptReader

loader = YoutubeTranscriptReader()
documents = loader.load_data(
    ytlinks=["https://www.youtube.com/watch?v=i3OYlaoj-BM"]
)
```

Supported URL formats:

+ `youtube.com/watch?v={video_id}` (with or without `www.`)
+ `youtube.com/embed?v={video_id}` (with or without `www.`)
+ `youtu.be/{video_id}` (never includes the `www` subdomain)

To programmatically check if a URL is supported:

```python
from llama_index.readers.youtube_transcript.utils import is_youtube_video

is_youtube_video("https://youtube.com/watch?v=j83jrh2")  # => True
is_youtube_video("https://vimeo.com/272134160")  # => False
```
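The supported formats listed above can also be written as regular expressions. The following is a minimal standalone sketch, not the package's actual implementation: the pattern list and the helper name `extract_video_id` are hypothetical, and the patterns assume the three URL shapes exactly as this README describes them.

```python
import re
from typing import Optional

# Hypothetical patterns mirroring the three URL formats listed in this README.
# The package's own patterns may differ.
_SUPPORTED_PATTERNS = [
    re.compile(r"^https?://(?:www\.)?youtube\.com/watch\?v=([\w-]+)"),
    re.compile(r"^https?://(?:www\.)?youtube\.com/embed\?v=([\w-]+)"),
    re.compile(r"^https?://youtu\.be/([\w-]+)"),  # no www subdomain
]


def extract_video_id(url: str) -> Optional[str]:
    """Return the video ID if the URL matches a supported format, else None."""
    for pattern in _SUPPORTED_PATTERNS:
        match = pattern.match(url)
        if match:
            return match.group(1)
    return None


# Keep only the links that match one of the supported formats.
links = [
    "https://www.youtube.com/watch?v=i3OYlaoj-BM",
    "https://youtu.be/i3OYlaoj-BM",
    "https://vimeo.com/272134160",
]
supported = [link for link in links if extract_video_id(link) is not None]
```

Filtering a mixed list this way before calling `load_data` lets unsupported links be skipped up front.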

This loader is designed to load data into [LlamaIndex](https://github.com/run-llama/llama_index/).

            

Raw data

            {
    "_id": null,
    "home_page": null,
    "name": "llama-index-readers-youtube-transcript",
    "maintainer": "ravi03071991",
    "docs_url": null,
    "requires_python": "<4.0,>=3.8.1",
    "maintainer_email": null,
    "keywords": "video",
    "author": "Your Name",
    "author_email": "you@example.com",
    "download_url": "https://files.pythonhosted.org/packages/57/f5/7eb526d9fbe0d3f5b326e15aa285d73e67441e47762c4129854fe509f2ab/llama_index_readers_youtube_transcript-0.2.0.tar.gz",
    "platform": null,
    "description": "# Youtube Transcript Loader\n\n```bash\npip install llama-hub-youtube-transcript\n\npip install llama-index-readers-youtube-transcript\n```\n\nThis loader fetches the text transcript of Youtube videos using the `youtube_transcript_api` Python package.\n\n## Usage\n\nTo use this loader, you will need to first `pip install youtube_transcript_api`.\n\nThen, simply pass an array of YouTube links into `load_data`:\n\n```python\nfrom llama_index.readers.youtube_transcript import YoutubeTranscriptReader\n\nloader = YoutubeTranscriptReader()\ndocuments = loader.load_data(\n    ytlinks=[\"https://www.youtube.com/watch?v=i3OYlaoj-BM\"]\n)\n```\n\nSupported URL formats: + youtube.com/watch?v={video_id} (with or without 'www.') + youtube.com/embed?v={video_id} (with or without 'www.') + youtu.be/{video_id} (never includes www subdomain)\n\nTo programmatically check if a URL is supported:\n\n```python\nfrom llama_index.readers.youtube_transcript.utils import is_youtube_video\n\nis_youtube_video(\"https://youtube.com/watch?v=j83jrh2\")  # => True\nis_youtube_video(\"https://vimeo.com/272134160\")  # => False\n```\n\nThis loader is designed to be used as a way to load data into [LlamaIndex](https://github.com/run-llama/llama_index/).\n",
    "bugtrack_url": null,
    "license": "MIT",
    "summary": "llama-index readers youtube transcript integration",
    "version": "0.2.0",
    "project_urls": null,
    "split_keywords": [
        "video"
    ],
    "urls": [
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "b0320d92d664da683f41f2a856e7030bd4dba292e311f31bb4d198ece61057a5",
                "md5": "a620aa3693b9e0fa72234ea7d511f813",
                "sha256": "969a050c68703bd21e16baac70624ca89398ba5913134b9b90122c6eae695615"
            },
            "downloads": -1,
            "filename": "llama_index_readers_youtube_transcript-0.2.0-py3-none-any.whl",
            "has_sig": false,
            "md5_digest": "a620aa3693b9e0fa72234ea7d511f813",
            "packagetype": "bdist_wheel",
            "python_version": "py3",
            "requires_python": "<4.0,>=3.8.1",
            "size": 3629,
            "upload_time": "2024-08-22T03:16:44",
            "upload_time_iso_8601": "2024-08-22T03:16:44.461439Z",
            "url": "https://files.pythonhosted.org/packages/b0/32/0d92d664da683f41f2a856e7030bd4dba292e311f31bb4d198ece61057a5/llama_index_readers_youtube_transcript-0.2.0-py3-none-any.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "57f57eb526d9fbe0d3f5b326e15aa285d73e67441e47762c4129854fe509f2ab",
                "md5": "3e288c13d7904b19b99c1838b3405746",
                "sha256": "d18d38f24ce585da31c41c894d71c7137f2de51eb925467d714f9b22b83c6d85"
            },
            "downloads": -1,
            "filename": "llama_index_readers_youtube_transcript-0.2.0.tar.gz",
            "has_sig": false,
            "md5_digest": "3e288c13d7904b19b99c1838b3405746",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": "<4.0,>=3.8.1",
            "size": 2957,
            "upload_time": "2024-08-22T03:16:45",
            "upload_time_iso_8601": "2024-08-22T03:16:45.750689Z",
            "url": "https://files.pythonhosted.org/packages/57/f5/7eb526d9fbe0d3f5b326e15aa285d73e67441e47762c4129854fe509f2ab/llama_index_readers_youtube_transcript-0.2.0.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2024-08-22 03:16:45",
    "github": false,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "lcname": "llama-index-readers-youtube-transcript"
}
        