llama-index-readers-pdf-table


Namellama-index-readers-pdf-table JSON
Version 0.1.3 PyPI version JSON
download
home_page
Summaryllama-index readers pdf_table integration
upload_time2024-02-21 20:39:14
maintaineryy0867
docs_urlNone
authorYour Name
requires_python>=3.8.1,<4.0
licenseMIT
keywords pdf table pdf table
VCS
bugtrack_url
requirements No requirements were recorded.
Travis-CI No Travis.
coveralls test coverage No coveralls.
            # PDF Table Loader

This loader reads the tables included in the PDF.

Users can input the PDF `file` and the `pages` from which they want to extract tables, and they can read the tables included on those pages.

## Usage

Here's an example usage of the PDFTableReader.
`pages` parameter is the same as camelot's `pages`. Therefore, you can use patterns such as `all`, `1,2,3`, `10-20`, and so on.

```python
from llama_hub.pdf_table import PDFTableReader
from pathlib import Path

reader = PDFTableReader()
pdf_path = Path("/path/to/pdf")
documents = reader.load_data(file=pdf_path, pages="80-90")
```

## Example

This loader is designed to be used as a way to load data into [LlamaIndex](https://github.com/run-llama/llama_index/tree/main/llama_index) and/or subsequently used as a Tool in a [LangChain](https://github.com/hwchase17/langchain) Agent.

            

Raw data

            {
    "_id": null,
    "home_page": "",
    "name": "llama-index-readers-pdf-table",
    "maintainer": "yy0867",
    "docs_url": null,
    "requires_python": ">=3.8.1,<4.0",
    "maintainer_email": "",
    "keywords": "pdf table,pdf,table",
    "author": "Your Name",
    "author_email": "you@example.com",
    "download_url": "https://files.pythonhosted.org/packages/17/e1/ed6c1e9742146f7e79e5072ebd1e600d3825969e1d9f9351093e0bc90ea8/llama_index_readers_pdf_table-0.1.3.tar.gz",
    "platform": null,
    "description": "# PDF Table Loader\n\nThis loader reads the tables included in the PDF.\n\nUsers can input the PDF `file` and the `pages` from which they want to extract tables, and they can read the tables included on those pages.\n\n## Usage\n\nHere's an example usage of the PDFTableReader.\n`pages` parameter is the same as camelot's `pages`. Therefore, you can use patterns such as `all`, `1,2,3`, `10-20`, and so on.\n\n```python\nfrom llama_hub.pdf_table import PDFTableReader\nfrom pathlib import Path\n\nreader = PDFTableReader()\npdf_path = Path(\"/path/to/pdf\")\ndocuments = reader.load_data(file=pdf_path, pages=\"80-90\")\n```\n\n## Example\n\nThis loader is designed to be used as a way to load data into [LlamaIndex](https://github.com/run-llama/llama_index/tree/main/llama_index) and/or subsequently used as a Tool in a [LangChain](https://github.com/hwchase17/langchain) Agent.\n",
    "bugtrack_url": null,
    "license": "MIT",
    "summary": "llama-index readers pdf_table integration",
    "version": "0.1.3",
    "project_urls": null,
    "split_keywords": [
        "pdf table",
        "pdf",
        "table"
    ],
    "urls": [
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "a3dc8335701d2e99d3877a55713c86a489922c63b457a5b5b40fd327fb2367d1",
                "md5": "7f0ba9960cf15f11cd5c47541a11b7ed",
                "sha256": "0c7a3cad35062158bc498d48635c37bdb77ba81198b951ab03034ce96c8095a3"
            },
            "downloads": -1,
            "filename": "llama_index_readers_pdf_table-0.1.3-py3-none-any.whl",
            "has_sig": false,
            "md5_digest": "7f0ba9960cf15f11cd5c47541a11b7ed",
            "packagetype": "bdist_wheel",
            "python_version": "py3",
            "requires_python": ">=3.8.1,<4.0",
            "size": 2801,
            "upload_time": "2024-02-21T20:39:13",
            "upload_time_iso_8601": "2024-02-21T20:39:13.655473Z",
            "url": "https://files.pythonhosted.org/packages/a3/dc/8335701d2e99d3877a55713c86a489922c63b457a5b5b40fd327fb2367d1/llama_index_readers_pdf_table-0.1.3-py3-none-any.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "17e1ed6c1e9742146f7e79e5072ebd1e600d3825969e1d9f9351093e0bc90ea8",
                "md5": "ff71db5596d96305c44a5ddd6a9623f9",
                "sha256": "902c98e74e12fef068f60fbb022995606052eba220d02662b7383376ff4e28de"
            },
            "downloads": -1,
            "filename": "llama_index_readers_pdf_table-0.1.3.tar.gz",
            "has_sig": false,
            "md5_digest": "ff71db5596d96305c44a5ddd6a9623f9",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": ">=3.8.1,<4.0",
            "size": 2586,
            "upload_time": "2024-02-21T20:39:14",
            "upload_time_iso_8601": "2024-02-21T20:39:14.693266Z",
            "url": "https://files.pythonhosted.org/packages/17/e1/ed6c1e9742146f7e79e5072ebd1e600d3825969e1d9f9351093e0bc90ea8/llama_index_readers_pdf_table-0.1.3.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2024-02-21 20:39:14",
    "github": false,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "lcname": "llama-index-readers-pdf-table"
}
        
Elapsed time: 0.18783s