llama-index-readers-remote


| Field | Value |
| --- | --- |
| Name | llama-index-readers-remote |
| Version | 0.2.0 |
| Summary | llama-index readers remote integration |
| Upload time | 2024-08-22 11:17:56 |
| Maintainer | thejessezhang |
| Author | Your Name |
| Requires Python | <4.0,>=3.8.1 |
| License | MIT |
| Keywords | gutenberg, hosted, url |

# Remote Page/File Loader

```bash
pip install llama-index-readers-remote
```

This loader extracts the text from a remote page or file given only its URL. If the URL points to a file, the loader downloads it to a temporary location and parses it with `SimpleDirectoryReader`. It is an all-in-one tool for (almost) any URL.

As a result, any page or file type that `SimpleDirectoryReader` can parse is supported. For instance, if a `.txt` URL such as a [Project Gutenberg book](https://www.gutenberg.org/cache/epub/69994/pg69994.txt) is passed in, the text is parsed as is. If a hosted `.mp3` URL is passed in, the file is downloaded and parsed using `AudioTranscriber`.
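To make the download-then-parse flow concrete, here is a minimal standard-library sketch of the idea. It is not the loader's actual implementation: `download_to_temp` and `load_remote_text` are hypothetical names, and the sketch only handles plain-text files, whereas the real loader hands the downloaded file to `SimpleDirectoryReader`.

```python
import mimetypes
import tempfile
import urllib.request
from pathlib import Path
from urllib.parse import urlparse


def download_to_temp(url: str, directory: str) -> Path:
    """Download `url` into `directory`, keeping the URL's file extension."""
    suffix = Path(urlparse(url).path).suffix  # e.g. ".txt" or ".mp3"
    target = Path(directory) / f"download{suffix}"
    with urllib.request.urlopen(url) as response:
        target.write_bytes(response.read())
    return target


def load_remote_text(url: str) -> str:
    """Hypothetical helper: fetch a URL and return its text if it is a text file."""
    with tempfile.TemporaryDirectory() as tmp:
        path = download_to_temp(url, tmp)
        mime, _ = mimetypes.guess_type(path.name)
        if mime is not None and mime.startswith("text/"):
            # Text files are returned as is, like the .txt case described above.
            return path.read_text(encoding="utf-8")
        # A real loader would dispatch to a format-specific parser here.
        raise ValueError(f"no parser configured for {path.suffix or 'this file type'}")
```

The temporary directory is removed as soon as the text has been read, so nothing downloaded is left on disk. Keeping the URL's extension on the temporary file matters because the parser is chosen from that extension.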

## Usage

To use this loader, pass in the URL of the remote page or file. Optionally, you may specify a `file_extractor` for `SimpleDirectoryReader` to use in place of its default.

```python
from llama_index.readers.remote import RemoteReader

loader = RemoteReader()
documents = loader.load_data(
    url="https://en.wikipedia.org/wiki/File:Example.jpg"
)
```
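The `file_extractor` option mentioned above maps file extensions to the parser that should handle them. The sketch below mimics that lookup with plain functions so it runs without LlamaIndex installed. The names `parse_text`, `parse_csv`, and `parse_file` are hypothetical, and the real option expects reader objects rather than functions.

```python
import csv
from pathlib import Path
from typing import Callable, Dict

Parser = Callable[[Path], str]


def parse_text(path: Path) -> str:
    """Default parser: return the file contents unchanged."""
    return path.read_text(encoding="utf-8")


def parse_csv(path: Path) -> str:
    """Custom parser: join the cells of each CSV row with ' | '."""
    with path.open(newline="", encoding="utf-8") as handle:
        return "\n".join(" | ".join(row) for row in csv.reader(handle))


def parse_file(path: Path, file_extractor: Dict[str, Parser]) -> str:
    """Pick a parser by file extension, falling back to plain text."""
    parser = file_extractor.get(path.suffix.lower(), parse_text)
    return parser(path)
```

Calling `parse_file(path, {".csv": parse_csv})` overrides the handling of `.csv` files only; every other extension still goes through the default parser.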

This loader is designed to load data into [LlamaIndex](https://github.com/run-llama/llama_index/).

            

Raw data

            {
    "_id": null,
    "home_page": null,
    "name": "llama-index-readers-remote",
    "maintainer": "thejessezhang",
    "docs_url": null,
    "requires_python": "<4.0,>=3.8.1",
    "maintainer_email": null,
    "keywords": "gutenberg, hosted, url",
    "author": "Your Name",
    "author_email": "you@example.com",
    "download_url": "https://files.pythonhosted.org/packages/e0/57/cb1645fbd09784c2c55946954f8794e8f364a7624d34a151b8b2db3d5c9d/llama_index_readers_remote-0.2.0.tar.gz",
    "platform": null,
    "bugtrack_url": null,
    "license": "MIT",
    "summary": "llama-index readers remote integration",
    "version": "0.2.0",
    "project_urls": null,
    "split_keywords": [
        "gutenberg",
        " hosted",
        " url"
    ],
    "urls": [
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "b53de0fa235d6e89fecaa56396323d9f325b578ff7fd20ced83f564558bbb2db",
                "md5": "16eb18f70dd7de535ae027f7ab08d5ca",
                "sha256": "423d417b2557a03aa579d9dc4ec2385b1c654949841697ac8c2f5c44fa65f8b2"
            },
            "downloads": -1,
            "filename": "llama_index_readers_remote-0.2.0-py3-none-any.whl",
            "has_sig": false,
            "md5_digest": "16eb18f70dd7de535ae027f7ab08d5ca",
            "packagetype": "bdist_wheel",
            "python_version": "py3",
            "requires_python": "<4.0,>=3.8.1",
            "size": 3540,
            "upload_time": "2024-08-22T11:17:55",
            "upload_time_iso_8601": "2024-08-22T11:17:55.858029Z",
            "url": "https://files.pythonhosted.org/packages/b5/3d/e0fa235d6e89fecaa56396323d9f325b578ff7fd20ced83f564558bbb2db/llama_index_readers_remote-0.2.0-py3-none-any.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "e057cb1645fbd09784c2c55946954f8794e8f364a7624d34a151b8b2db3d5c9d",
                "md5": "7f1e574e5f949b299d525a492fc2a067",
                "sha256": "c57f8e3431b9cb8e6975f03454d5a318ef799ac23de20a18fa179456c00e1f32"
            },
            "downloads": -1,
            "filename": "llama_index_readers_remote-0.2.0.tar.gz",
            "has_sig": false,
            "md5_digest": "7f1e574e5f949b299d525a492fc2a067",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": "<4.0,>=3.8.1",
            "size": 3324,
            "upload_time": "2024-08-22T11:17:56",
            "upload_time_iso_8601": "2024-08-22T11:17:56.680104Z",
            "url": "https://files.pythonhosted.org/packages/e0/57/cb1645fbd09784c2c55946954f8794e8f364a7624d34a151b8b2db3d5c9d/llama_index_readers_remote-0.2.0.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2024-08-22 11:17:56",
    "github": false,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "lcname": "llama-index-readers-remote"
}
        