Name | llama-index-readers-remote JSON |
Version |
0.1.4
JSON |
| download |
home_page | |
Summary | llama-index readers remote integration |
upload_time | 2024-02-21 21:28:37 |
maintainer | thejessezhang |
docs_url | None |
author | Your Name |
requires_python | >=3.8.1,<4.0 |
license | MIT |
keywords |
gutenberg
hosted
url
|
VCS |
|
bugtrack_url |
|
requirements |
No requirements were recorded.
|
Travis-CI |
No Travis.
|
coveralls test coverage |
No coveralls.
|
# Remote Page/File Loader
This loader makes it easy to extract the text from any remote page or file using just its url. If there's a file at the url, this loader will download it temporarily and parse it using `SimpleDirectoryReader`. It is an all-in-one tool for (almost) any url.
As a result, any page or type of file is supported. For instance, if a `.txt` url such as a [Project Gutenberg book](https://www.gutenberg.org/cache/epub/69994/pg69994.txt) is passed in, the text will be parsed as is. On the other hand, if a hosted .mp3 url is passed in, it will be downloaded and parsed using `AudioTranscriber`.
## Usage
To use this loader, you need to pass in a `Path` to a local file. Optionally, you may specify a `file_extractor` for the `SimpleDirectoryReader` to use, other than the default one.
```python
from llama_index import download_loader
RemoteReader = download_loader("RemoteReader")
loader = RemoteReader()
documents = loader.load_data(
url="https://en.wikipedia.org/wiki/File:Example.jpg"
)
```
This loader is designed to be used as a way to load data into [LlamaIndex](https://github.com/run-llama/llama_index/tree/main/llama_index) and/or subsequently used as a Tool in a [LangChain](https://github.com/hwchase17/langchain) Agent. See [here](https://github.com/emptycrown/llama-hub/tree/main) for examples.
Raw data
{
"_id": null,
"home_page": "",
"name": "llama-index-readers-remote",
"maintainer": "thejessezhang",
"docs_url": null,
"requires_python": ">=3.8.1,<4.0",
"maintainer_email": "",
"keywords": "gutenberg,hosted,url",
"author": "Your Name",
"author_email": "you@example.com",
"download_url": "https://files.pythonhosted.org/packages/b4/72/b3392528a93b88a4e65da3a0def522f32235116495c631bf941317a9f733/llama_index_readers_remote-0.1.4.tar.gz",
"platform": null,
"description": "# Remote Page/File Loader\n\nThis loader makes it easy to extract the text from any remote page or file using just its url. If there's a file at the url, this loader will download it temporarily and parse it using `SimpleDirectoryReader`. It is an all-in-one tool for (almost) any url.\n\nAs a result, any page or type of file is supported. For instance, if a `.txt` url such as a [Project Gutenberg book](https://www.gutenberg.org/cache/epub/69994/pg69994.txt) is passed in, the text will be parsed as is. On the other hand, if a hosted .mp3 url is passed in, it will be downloaded and parsed using `AudioTranscriber`.\n\n## Usage\n\nTo use this loader, you need to pass in a `Path` to a local file. Optionally, you may specify a `file_extractor` for the `SimpleDirectoryReader` to use, other than the default one.\n\n```python\nfrom llama_index import download_loader\n\nRemoteReader = download_loader(\"RemoteReader\")\n\nloader = RemoteReader()\ndocuments = loader.load_data(\n url=\"https://en.wikipedia.org/wiki/File:Example.jpg\"\n)\n```\n\nThis loader is designed to be used as a way to load data into [LlamaIndex](https://github.com/run-llama/llama_index/tree/main/llama_index) and/or subsequently used as a Tool in a [LangChain](https://github.com/hwchase17/langchain) Agent. See [here](https://github.com/emptycrown/llama-hub/tree/main) for examples.\n",
"bugtrack_url": null,
"license": "MIT",
"summary": "llama-index readers remote integration",
"version": "0.1.4",
"project_urls": null,
"split_keywords": [
"gutenberg",
"hosted",
"url"
],
"urls": [
{
"comment_text": "",
"digests": {
"blake2b_256": "bd9d7d4007c02180fbe8c6d048d78b7dedfa963de449283fc2425042fb649c68",
"md5": "312501991143cf68ccac7fe5d4cfef3f",
"sha256": "f9a594170d1b186056f1c88757ec91c67704ba4a3e532f8502a311e871c9db20"
},
"downloads": -1,
"filename": "llama_index_readers_remote-0.1.4-py3-none-any.whl",
"has_sig": false,
"md5_digest": "312501991143cf68ccac7fe5d4cfef3f",
"packagetype": "bdist_wheel",
"python_version": "py3",
"requires_python": ">=3.8.1,<4.0",
"size": 3471,
"upload_time": "2024-02-21T21:28:36",
"upload_time_iso_8601": "2024-02-21T21:28:36.704286Z",
"url": "https://files.pythonhosted.org/packages/bd/9d/7d4007c02180fbe8c6d048d78b7dedfa963de449283fc2425042fb649c68/llama_index_readers_remote-0.1.4-py3-none-any.whl",
"yanked": false,
"yanked_reason": null
},
{
"comment_text": "",
"digests": {
"blake2b_256": "b472b3392528a93b88a4e65da3a0def522f32235116495c631bf941317a9f733",
"md5": "4c1a33fdae9e71e0eac6cccc3674a1be",
"sha256": "b46d90d8b948457f06850d43bfa2b24b1fc84a33513afeeaaa3d308d2ce5c88f"
},
"downloads": -1,
"filename": "llama_index_readers_remote-0.1.4.tar.gz",
"has_sig": false,
"md5_digest": "4c1a33fdae9e71e0eac6cccc3674a1be",
"packagetype": "sdist",
"python_version": "source",
"requires_python": ">=3.8.1,<4.0",
"size": 3247,
"upload_time": "2024-02-21T21:28:37",
"upload_time_iso_8601": "2024-02-21T21:28:37.693601Z",
"url": "https://files.pythonhosted.org/packages/b4/72/b3392528a93b88a4e65da3a0def522f32235116495c631bf941317a9f733/llama_index_readers_remote-0.1.4.tar.gz",
"yanked": false,
"yanked_reason": null
}
],
"upload_time": "2024-02-21 21:28:37",
"github": false,
"gitlab": false,
"bitbucket": false,
"codeberg": false,
"lcname": "llama-index-readers-remote"
}