Name | llama-index-readers-remote |
Version | 0.4.0 |
home_page | None |
Summary | llama-index readers remote integration |
upload_time | 2025-07-31 00:30:41 |
maintainer | thejessezhang |
docs_url | None |
author | None |
requires_python | <4.0,>=3.9 |
license | None |
keywords | gutenberg, hosted, url |
VCS | |
bugtrack_url | |
requirements | No requirements were recorded. |
# Remote Page/File Loader
```bash
pip install llama-index-readers-remote
```
This loader extracts the text from a remote page or file using just its URL. If there is a file at the URL, the loader downloads it temporarily and parses it using `SimpleDirectoryReader`. It is an all-in-one tool for (almost) any URL.
As a result, nearly any page or file type is supported. For instance, if a `.txt` URL such as a [Project Gutenberg book](https://www.gutenberg.org/cache/epub/69994/pg69994.txt) is passed in, the text is parsed as is. If a hosted `.mp3` URL is passed in, the file is downloaded and parsed using `AudioTranscriber`.
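The download-then-parse flow described above can be sketched with the standard library alone. This is a minimal illustration, not the loader's actual implementation: the function name `fetch_and_read_text` is hypothetical, and the final read is a plain-text stand-in for the step where the real loader would hand the temporary file to `SimpleDirectoryReader`.

```python
import tempfile
import urllib.request
from pathlib import Path
from urllib.parse import urlparse


def fetch_and_read_text(url: str) -> str:
    """Download `url` to a temporary file and return its contents as text.

    Hypothetical helper for illustration only.
    """
    # Keep the extension from the URL so a parser could be chosen by file type.
    suffix = Path(urlparse(url).path).suffix or ".html"
    with tempfile.TemporaryDirectory() as tmp_dir:
        target = Path(tmp_dir) / f"download{suffix}"
        with urllib.request.urlopen(url, timeout=30) as response:
            target.write_bytes(response.read())
        # Assumption: the real loader passes the file to SimpleDirectoryReader
        # at this point; reading it as UTF-8 text is a simplified stand-in.
        return target.read_text(encoding="utf-8", errors="replace")
    # The temporary directory and the downloaded file are removed on exit.
```

Because the file lives in a temporary directory, nothing is left on disk once the text has been extracted.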
## Usage
To use this loader, you need to pass in the URL of a remote page or file. Optionally, you may specify a `file_extractor` for the `SimpleDirectoryReader` to use instead of the default one.
```python
from llama_index.readers.remote import RemoteReader

loader = RemoteReader()
documents = loader.load_data(
    url="https://en.wikipedia.org/wiki/File:Example.jpg"
)
```
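To make the idea of a `file_extractor` concrete: in LlamaIndex it is a mapping from file extension to the reader that should handle that extension. The sketch below mimics that dispatch in plain Python so it runs without the library. All names in it (`parse_text`, `parse_json_keys`, `DEFAULT_EXTRACTORS`, `parse_file`) are hypothetical and are not part of the `RemoteReader` or `SimpleDirectoryReader` API.

```python
import json
from pathlib import Path
from typing import Callable, Dict, Optional

# A parser takes a file path and returns extracted text (hypothetical type).
Parser = Callable[[Path], str]


def parse_text(path: Path) -> str:
    """Default parser: return the file contents unchanged."""
    return path.read_text(encoding="utf-8")


def parse_json_keys(path: Path) -> str:
    """Custom parser: summarise a JSON object by its top-level keys."""
    data = json.loads(path.read_text(encoding="utf-8"))
    return ", ".join(sorted(data))


# Assumed defaults for this sketch only; the library ships its own defaults.
DEFAULT_EXTRACTORS: Dict[str, Parser] = {".txt": parse_text, ".json": parse_text}


def parse_file(path: Path, file_extractor: Optional[Dict[str, Parser]] = None) -> str:
    """Pick a parser by extension; entries in `file_extractor` override defaults."""
    extractors = {**DEFAULT_EXTRACTORS, **(file_extractor or {})}
    parser = extractors.get(path.suffix.lower(), parse_text)
    return parser(path)
```

Calling `parse_file(path)` uses the default for the file's extension, while `parse_file(path, {".json": parse_json_keys})` swaps in the custom parser for `.json` files only, which is the same override pattern a `file_extractor` provides.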
This loader is designed to be used as a way to load data into [LlamaIndex](https://github.com/run-llama/llama_index/).
Raw data
{
"_id": null,
"home_page": null,
"name": "llama-index-readers-remote",
"maintainer": "thejessezhang",
"docs_url": null,
"requires_python": "<4.0,>=3.9",
"maintainer_email": null,
"keywords": "gutenberg, hosted, url",
"author": null,
"author_email": "Your Name <you@example.com>",
"download_url": "https://files.pythonhosted.org/packages/67/34/e2fcef1fe57cf31c902bf04e4f27ec413857f302aa64cd31e426ebcfb098/llama_index_readers_remote-0.4.0.tar.gz",
"platform": null,
"description": "# Remote Page/File Loader\n\n```bash\npip install llama-index-readers-remote\n```\n\nThis loader makes it easy to extract the text from any remote page or file using just its url. If there's a file at the url, this loader will download it temporarily and parse it using `SimpleDirectoryReader`. It is an all-in-one tool for (almost) any url.\n\nAs a result, any page or type of file is supported. For instance, if a `.txt` url such as a [Project Gutenberg book](https://www.gutenberg.org/cache/epub/69994/pg69994.txt) is passed in, the text will be parsed as is. On the other hand, if a hosted .mp3 url is passed in, it will be downloaded and parsed using `AudioTranscriber`.\n\n## Usage\n\nTo use this loader, you need to pass in a `Path` to a local file. Optionally, you may specify a `file_extractor` for the `SimpleDirectoryReader` to use, other than the default one.\n\n```python\nfrom llama_index.readers.remote import RemoteReader\n\nloader = RemoteReader()\ndocuments = loader.load_data(\n url=\"https://en.wikipedia.org/wiki/File:Example.jpg\"\n)\n```\n\nThis loader is designed to be used as a way to load data into [LlamaIndex](https://github.com/run-llama/llama_index/).\n",
"bugtrack_url": null,
"license": null,
"summary": "llama-index readers remote integration",
"version": "0.4.0",
"project_urls": null,
"split_keywords": [
"gutenberg",
" hosted",
" url"
],
"urls": [
{
"comment_text": null,
"digests": {
"blake2b_256": "daa36edd9977c5a44dde47c81db91293d7a1ce65823ece8b58f9467d1fc0ce75",
"md5": "06c6ba42ff7e94a749d2a6dd8f56ef21",
"sha256": "25092fb4fc3c9acabf109b22f602f340e721302d6774a1d51cc34306a864950d"
},
"downloads": -1,
"filename": "llama_index_readers_remote-0.4.0-py3-none-any.whl",
"has_sig": false,
"md5_digest": "06c6ba42ff7e94a749d2a6dd8f56ef21",
"packagetype": "bdist_wheel",
"python_version": "py3",
"requires_python": "<4.0,>=3.9",
"size": 4336,
"upload_time": "2025-07-31T00:30:40",
"upload_time_iso_8601": "2025-07-31T00:30:40.823071Z",
"url": "https://files.pythonhosted.org/packages/da/a3/6edd9977c5a44dde47c81db91293d7a1ce65823ece8b58f9467d1fc0ce75/llama_index_readers_remote-0.4.0-py3-none-any.whl",
"yanked": false,
"yanked_reason": null
},
{
"comment_text": null,
"digests": {
"blake2b_256": "6734e2fcef1fe57cf31c902bf04e4f27ec413857f302aa64cd31e426ebcfb098",
"md5": "790a5a16a2eb1a5d500277912103ae32",
"sha256": "6be7c92065bbd901ac2cc6e11ca24c53438ba351635e855dcb35078bc92df6b9"
},
"downloads": -1,
"filename": "llama_index_readers_remote-0.4.0.tar.gz",
"has_sig": false,
"md5_digest": "790a5a16a2eb1a5d500277912103ae32",
"packagetype": "sdist",
"python_version": "source",
"requires_python": "<4.0,>=3.9",
"size": 4714,
"upload_time": "2025-07-31T00:30:41",
"upload_time_iso_8601": "2025-07-31T00:30:41.713959Z",
"url": "https://files.pythonhosted.org/packages/67/34/e2fcef1fe57cf31c902bf04e4f27ec413857f302aa64cd31e426ebcfb098/llama_index_readers_remote-0.4.0.tar.gz",
"yanked": false,
"yanked_reason": null
}
],
"upload_time": "2025-07-31 00:30:41",
"github": false,
"gitlab": false,
"bitbucket": false,
"codeberg": false,
"lcname": "llama-index-readers-remote"
}