# PDF Table Loader
```bash
pip install llama-index-readers-pdf-table
```
This loader reads the tables included in the PDF.
Users can input the PDF `file` and the `pages` from which they want to extract tables, and they can read the tables included on those pages.
## Usage
Here's an example usage of the PDFTableReader.
`pages` parameter is the same as camelot's `pages`. Therefore, you can use patterns such as `all`, `1,2,3`, `10-20`, and so on.
```python
from llama_index.readers.pdf_table import PDFTableReader
from pathlib import Path
reader = PDFTableReader()
pdf_path = Path("/path/to/pdf")
documents = reader.load_data(file=pdf_path, pages="80-90")
```
## Example
This loader is designed to be used as a way to load data into [LlamaIndex](https://github.com/run-llama/llama_index/).
Raw data
{
"_id": null,
"home_page": null,
"name": "llama-index-readers-pdf-table",
"maintainer": "yy0867",
"docs_url": null,
"requires_python": "<4.0,>=3.9",
"maintainer_email": null,
"keywords": "pdf table, pdf, table",
"author": "Your Name",
"author_email": "you@example.com",
"download_url": "https://files.pythonhosted.org/packages/30/c4/230b960f66c6d2aca6ec9fa4d475e3d3174c98f624c6a4a12f92e65121ed/llama_index_readers_pdf_table-0.2.0.tar.gz",
"platform": null,
"description": "# PDF Table Loader\n\n```bash\npip install llama-index-readers-pdf-table\n```\n\nThis loader reads the tables included in the PDF.\n\nUsers can input the PDF `file` and the `pages` from which they want to extract tables, and they can read the tables included on those pages.\n\n## Usage\n\nHere's an example usage of the PDFTableReader.\n`pages` parameter is the same as camelot's `pages`. Therefore, you can use patterns such as `all`, `1,2,3`, `10-20`, and so on.\n\n```python\nfrom llama_index.readers.pdf_table import PDFTableReader\nfrom pathlib import Path\n\nreader = PDFTableReader()\npdf_path = Path(\"/path/to/pdf\")\ndocuments = reader.load_data(file=pdf_path, pages=\"80-90\")\n```\n\n## Example\n\nThis loader is designed to be used as a way to load data into [LlamaIndex](https://github.com/run-llama/llama_index/).\n",
"bugtrack_url": null,
"license": "MIT",
"summary": "llama-index readers pdf_table integration",
"version": "0.2.0",
"project_urls": null,
"split_keywords": [
"pdf table",
" pdf",
" table"
],
"urls": [
{
"comment_text": "",
"digests": {
"blake2b_256": "99e08afc3648b1a053e5c75b571abe870bc89a2b251b7851b2255bbd7f3c976e",
"md5": "73fbcfa91eedd9abfcf57fb428f99d30",
"sha256": "6029ddf828f5ac4c487f7fe2d3ac409548e79c5048519ea278e65f32188845a7"
},
"downloads": -1,
"filename": "llama_index_readers_pdf_table-0.2.0-py3-none-any.whl",
"has_sig": false,
"md5_digest": "73fbcfa91eedd9abfcf57fb428f99d30",
"packagetype": "bdist_wheel",
"python_version": "py3",
"requires_python": "<4.0,>=3.9",
"size": 2759,
"upload_time": "2024-08-22T10:40:47",
"upload_time_iso_8601": "2024-08-22T10:40:47.051394Z",
"url": "https://files.pythonhosted.org/packages/99/e0/8afc3648b1a053e5c75b571abe870bc89a2b251b7851b2255bbd7f3c976e/llama_index_readers_pdf_table-0.2.0-py3-none-any.whl",
"yanked": false,
"yanked_reason": null
},
{
"comment_text": "",
"digests": {
"blake2b_256": "30c4230b960f66c6d2aca6ec9fa4d475e3d3174c98f624c6a4a12f92e65121ed",
"md5": "bbc4119739797932cf9c807b7e54a07a",
"sha256": "63fa7eb1355c5d8a2b2b30f110cf967c57567ea0a711acf35b2e848e595088c6"
},
"downloads": -1,
"filename": "llama_index_readers_pdf_table-0.2.0.tar.gz",
"has_sig": false,
"md5_digest": "bbc4119739797932cf9c807b7e54a07a",
"packagetype": "sdist",
"python_version": "source",
"requires_python": "<4.0,>=3.9",
"size": 2562,
"upload_time": "2024-08-22T10:40:48",
"upload_time_iso_8601": "2024-08-22T10:40:48.302559Z",
"url": "https://files.pythonhosted.org/packages/30/c4/230b960f66c6d2aca6ec9fa4d475e3d3174c98f624c6a4a12f92e65121ed/llama_index_readers_pdf_table-0.2.0.tar.gz",
"yanked": false,
"yanked_reason": null
}
],
"upload_time": "2024-08-22 10:40:48",
"github": false,
"gitlab": false,
"bitbucket": false,
"codeberg": false,
"lcname": "llama-index-readers-pdf-table"
}