llama-index-readers-papers


Namellama-index-readers-papers JSON
Version 0.1.6 PyPI version JSON
download
home_pageNone
Summaryllama-index readers papers integration
upload_time2024-05-19 03:57:43
maintainerthejessezhang
docs_urlNone
authorYour Name
requires_python<4.0,>=3.8.1
licenseMIT
keywords
VCS
bugtrack_url
requirements No requirements were recorded.
Travis-CI No Travis.
coveralls test coverage No coveralls.
            # Papers Loaders

```bash
pip install llama-index-readers-papers
```

## Arxiv Papers Loader

This loader fetches the text from the most relevant scientific papers on Arxiv specified by a search query (e.g. "Artificial Intelligence"). For each paper, the abstract is extracted and put in a separate document. The search query may be any string, Arxiv paper id, or a general Arxiv query string (see the full list of capabilities [here](https://info.arxiv.org/help/api/user-manual.html#query_details)).

## Usage

To use this loader, you need to pass in the search query. You may also optionally specify a local directory to temporarily store the paper PDFs (they are deleted automatically) and the maximum number of papers you want to parse for your search query (default is 10).

```python
from llama_index.readers.papers import ArxivReader

loader = ArxivReader()
documents = loader.load_data(search_query="au:Karpathy")
```

Alternatively, if you would like to load papers and abstracts separately:

```python
from llama_index.readers.papers import ArxivReader

loader = ArxivReader()
documents, abstracts = loader.load_papers_and_abstracts(
    search_query="au:Karpathy"
)
```

This loader is designed to be used as a way to load data into [LlamaIndex](https://github.com/run-llama/llama_index/).

## Pubmed Papers Loader

This loader fetches the text from the most relevant scientific papers on Pubmed specified by a search query (e.g. "Alzheimers"). For each paper, the abstract is included in the `Document`. The search query may be any string.

## Usage

To use this loader, you need to pass in the search query. You may also optionally specify the maximum number of papers you want to parse for your search query (default is 10).

```python
from llama_index.readers.papers import PubmedReader

loader = PubmedReader()
documents = loader.load_data(search_query="amyloidosis")
```

This loader is designed to be used as a way to load data into [LlamaIndex](https://github.com/run-llama/llama_index/).

            

Raw data

            {
    "_id": null,
    "home_page": null,
    "name": "llama-index-readers-papers",
    "maintainer": "thejessezhang",
    "docs_url": null,
    "requires_python": "<4.0,>=3.8.1",
    "maintainer_email": null,
    "keywords": null,
    "author": "Your Name",
    "author_email": "you@example.com",
    "download_url": "https://files.pythonhosted.org/packages/8c/d2/2d3911526d44d99b4445504fe94db4d931712635d273a6aeb9d27cbdc69d/llama_index_readers_papers-0.1.6.tar.gz",
    "platform": null,
    "description": "# Papers Loaders\n\n```bash\npip install llama-index-readers-papers\n```\n\n## Arxiv Papers Loader\n\nThis loader fetches the text from the most relevant scientific papers on Arxiv specified by a search query (e.g. \"Artificial Intelligence\"). For each paper, the abstract is extracted and put in a separate document. The search query may be any string, Arxiv paper id, or a general Arxiv query string (see the full list of capabilities [here](https://info.arxiv.org/help/api/user-manual.html#query_details)).\n\n## Usage\n\nTo use this loader, you need to pass in the search query. You may also optionally specify a local directory to temporarily store the paper PDFs (they are deleted automatically) and the maximum number of papers you want to parse for your search query (default is 10).\n\n```python\nfrom llama_index.readers.papers import ArxivReader\n\nloader = ArxivReader()\ndocuments = loader.load_data(search_query=\"au:Karpathy\")\n```\n\nAlternatively, if you would like to load papers and abstracts separately:\n\n```python\nfrom llama_index.readers.papers import ArxivReader\n\nloader = ArxivReader()\ndocuments, abstracts = loader.load_papers_and_abstracts(\n    search_query=\"au:Karpathy\"\n)\n```\n\nThis loader is designed to be used as a way to load data into [LlamaIndex](https://github.com/run-llama/llama_index/).\n\n## Pubmed Papers Loader\n\nThis loader fetches the text from the most relevant scientific papers on Pubmed specified by a search query (e.g. \"Alzheimers\"). For each paper, the abstract is included in the `Document`. The search query may be any string.\n\n## Usage\n\nTo use this loader, you need to pass in the search query. You may also optionally specify the maximum number of papers you want to parse for your search query (default is 10).\n\n```python\nfrom llama_index.readers.papers import PubmedReader\n\nloader = PubmedReader()\ndocuments = loader.load_data(search_query=\"amyloidosis\")\n```\n\nThis loader is designed to be used as a way to load data into [LlamaIndex](https://github.com/run-llama/llama_index/).\n",
    "bugtrack_url": null,
    "license": "MIT",
    "summary": "llama-index readers papers integration",
    "version": "0.1.6",
    "project_urls": null,
    "split_keywords": [],
    "urls": [
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "1b0ee4a71381d5f0481ac1a64d2736040185efc81ca1fe10a2c9fc56df8a069b",
                "md5": "7bc823ae758b940cf5ecf6cad315deb9",
                "sha256": "7c35eb3f181908a6e6108268721bb3944be10419ba1d04ead3e7c9e8a3bd0589"
            },
            "downloads": -1,
            "filename": "llama_index_readers_papers-0.1.6-py3-none-any.whl",
            "has_sig": false,
            "md5_digest": "7bc823ae758b940cf5ecf6cad315deb9",
            "packagetype": "bdist_wheel",
            "python_version": "py3",
            "requires_python": "<4.0,>=3.8.1",
            "size": 5885,
            "upload_time": "2024-05-19T03:57:42",
            "upload_time_iso_8601": "2024-05-19T03:57:42.261890Z",
            "url": "https://files.pythonhosted.org/packages/1b/0e/e4a71381d5f0481ac1a64d2736040185efc81ca1fe10a2c9fc56df8a069b/llama_index_readers_papers-0.1.6-py3-none-any.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "8cd22d3911526d44d99b4445504fe94db4d931712635d273a6aeb9d27cbdc69d",
                "md5": "4063718f9779130851c45b8318e4bd3c",
                "sha256": "11273a76c11df19088d6cd5d44f03da665f01a40160f8e459c7771ecdcbafd4f"
            },
            "downloads": -1,
            "filename": "llama_index_readers_papers-0.1.6.tar.gz",
            "has_sig": false,
            "md5_digest": "4063718f9779130851c45b8318e4bd3c",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": "<4.0,>=3.8.1",
            "size": 4533,
            "upload_time": "2024-05-19T03:57:43",
            "upload_time_iso_8601": "2024-05-19T03:57:43.732319Z",
            "url": "https://files.pythonhosted.org/packages/8c/d2/2d3911526d44d99b4445504fe94db4d931712635d273a6aeb9d27cbdc69d/llama_index_readers_papers-0.1.6.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2024-05-19 03:57:43",
    "github": false,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "lcname": "llama-index-readers-papers"
}
        
Elapsed time: 0.26627s