docling-parse


| Field | Value |
|---|---|
| Name | docling-parse |
| Version | 1.2.0 |
| Home page | https://github.com/DS4SD/docling-parse |
| Summary | Simple package to extract text with coordinates from programmatic PDFs |
| Upload time | 2024-09-09 12:33:30 |
| Maintainer | Peter Staar |
| Author | Peter Staar |
| Requires Python | `<4.0,>=3.9` |
| License | MIT |
| Keywords | docling, pdf, parser |
# Docling Parse

[![PyPI version](https://img.shields.io/pypi/v/docling-parse)](https://pypi.org/project/docling-parse/)
[![PyPI - Python Version](https://img.shields.io/pypi/pyversions/docling-parse)](https://pypi.org/project/docling-parse/)
[![Poetry](https://img.shields.io/endpoint?url=https://python-poetry.org/badge/v0.json)](https://python-poetry.org/)
[![Pybind11](https://img.shields.io/badge/build-pybind11-blue)](https://github.com/pybind/pybind11/)
[![Platforms](https://img.shields.io/badge/platform-macos%20|%20linux-blue)](https://github.com/DS4SD/docling-parse/)
[![License MIT](https://img.shields.io/github/license/DS4SD/docling-parse)](https://opensource.org/licenses/MIT)

Simple package to extract text with coordinates from programmatic PDFs.
This package is part of the [Docling](https://github.com/DS4SD/docling) document conversion project.


## Quick start

Install the package from PyPI:

```sh
pip install docling-parse
```

Convert a PDF

```python
from docling_parse.docling_parse import pdf_parser

# Do this only once to load fonts (avoid initialising it many times)
parser = pdf_parser()

# parser.set_loglevel(1)  # 1=error, 2=warning, 3=success, 4=info

doc_file = "my-doc.pdf"  # filename
doc_key = f"key={doc_file}"  # unique document key (e.g. hash, UUID, etc.)

# Load the document from file using filename doc_file. This only loads
# the QPDF document, but no extracted data
success = parser.load_document(doc_key, doc_file)

# Alternatively, open the file in binary mode and read its contents
# import io
# with open(doc_file, "rb") as file:
#     file_content = file.read()

# Create a BytesIO object from the file contents and load from it
# bytes_io = io.BytesIO(file_content)
# success = parser.load_document_from_bytesio(doc_key, bytes_io)

# Parse the entire document in one go: easier, but it could require
# a lot more memory than parsing page by page
# json_doc = parser.parse_pdf_from_key(doc_key)

# Get number of pages
num_pages = parser.number_of_pages(doc_key)

# Parse page by page to minimize memory footprint
for page in range(0, num_pages):

    # Internal memory for the page is auto-deleted after this call.
    # No need to unload a specific page
    json_doc = parser.parse_pdf_from_key_on_page(doc_key, page)

    if "pages" not in json_doc:  # page could not get parsed
        continue

    # The parsed page is the first one!
    json_page = json_doc["pages"][0]

    page_dimensions = [
        json_page["dimensions"]["width"],
        json_page["dimensions"]["height"],
    ]

    # find text cells
    cells = []
    for cell_id, cell in enumerate(json_page["cells"]):
        cells.append([
            page,
            cell_id,
            cell["content"]["rnormalized"],  # text
            cell["box"]["device"][0],  # x0 (lower left x)
            cell["box"]["device"][1],  # y0 (lower left y)
            cell["box"]["device"][2],  # x1 (upper right x)
            cell["box"]["device"][3],  # y1 (upper right y)
        ])

    # find bitmap images
    images = []
    for image_id, image in enumerate(json_page["images"]):
        images.append([
            page,
            image_id,
            image["box"][0],  # x0 (lower left x)
            image["box"][1],  # y0 (lower left y)
            image["box"][2],  # x1 (upper right x)
            image["box"][3],  # y1 (upper right y)
        ])

    # find paths
    paths = []
    for path_id, path in enumerate(json_page["paths"]):
        paths.append([
            page,
            path_id,
            path["x-values"],  # array of x values
            path["y-values"],  # array of y values
        ])

# Unload the (QPDF) document and buffers
parser.unload_document(doc_key)

# Unloads everything at once
# parser.unload_documents()
```
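
The per-page JSON handled in the loop above can also be flattened by a small helper. The following is a minimal sketch, not part of docling-parse: the function name `cells_to_records` and the sample page are hypothetical, and it assumes only the keys used in the example above (`dimensions`, `cells`, `content.rnormalized`, `box.device`) with boxes given as lower-left and upper-right corners, as the comments there indicate. It also converts each box to a top-left origin, which some image tools expect.

```python
def cells_to_records(json_page, page_no):
    """Flatten the text cells of one parsed page into plain dicts.

    Assumes the page layout shown in the example above: boxes are
    [x0, y0, x1, y1] with the origin in the lower-left corner.
    """
    height = json_page["dimensions"]["height"]
    records = []
    for cell_id, cell in enumerate(json_page["cells"]):
        x0, y0, x1, y1 = cell["box"]["device"][:4]
        records.append({
            "page": page_no,
            "cell_id": cell_id,
            "text": cell["content"]["rnormalized"],
            "bbox_bottom_left": (x0, y0, x1, y1),
            # Flip the y axis: top-left origin, y grows downwards.
            "bbox_top_left": (x0, height - y1, x1, height - y0),
        })
    return records


# Hypothetical, hand-written page in the shape used above (not real parser output).
sample_page = {
    "dimensions": {"width": 600.0, "height": 800.0},
    "cells": [
        {
            "content": {"rnormalized": "Hello"},
            "box": {"device": [50.0, 700.0, 120.0, 720.0]},
        },
    ],
}

records = cells_to_records(sample_page, page_no=0)
print(records[0]["text"], records[0]["bbox_top_left"])
# prints: Hello (50.0, 80.0, 120.0, 100.0)
```

Inside the page loop, `cells_to_records(json_page, page)` could replace the manual `cells.append(...)` calls if dictionaries are more convenient than positional lists.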

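The example above asks for a unique document key (a hash or UUID, for instance). One possible way to derive such a key is to hash the file contents, which also yields the bytes needed for the `load_document_from_bytesio` path. This is only a sketch under that assumption: `make_doc_key` and `prepare_document` are hypothetical helper names, not part of docling-parse, and only the Python standard library is used.

```python
import hashlib
import io


def make_doc_key(data: bytes) -> str:
    """Return a content-derived key: identical bytes give the same key."""
    return hashlib.sha256(data).hexdigest()


def prepare_document(data: bytes):
    """Return (doc_key, BytesIO) for a loader that accepts a byte stream."""
    return make_doc_key(data), io.BytesIO(data)


# Stand-in bytes; in practice these would come from open(doc_file, "rb").read().
pdf_bytes = b"%PDF-1.7 example bytes"
doc_key, bytes_io = prepare_document(pdf_bytes)

# With docling-parse installed, the pair could then be passed on as in the
# commented lines of the example above:
# success = parser.load_document_from_bytesio(doc_key, bytes_io)
print(len(doc_key))  # a SHA-256 hex digest has 64 characters
```

A content hash gives the same key for the same file regardless of its path, whereas a UUID gives a fresh key on every load; which is preferable depends on whether reloading the same document should reuse the key.
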
Use the CLI

```sh
$ docling-parse -h
usage: docling-parse [-h] -p PDF

Process a PDF file.

options:
  -h, --help         show this help message and exit
  -p PDF, --pdf PDF  Path to the PDF file
```
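
To call the CLI from a script, the command can be assembled and run with the standard library. The sketch below assumes the `docling-parse` executable is on `PATH` and uses only the `-p`/`--pdf` option shown in the help text; `build_command` and `run_cli` are hypothetical helper names, and nothing is assumed about what the tool prints.

```python
import shutil
import subprocess


def build_command(pdf_path: str) -> list:
    """Assemble the argument list for `docling-parse -p <pdf>`."""
    return ["docling-parse", "-p", pdf_path]


def run_cli(pdf_path: str) -> subprocess.CompletedProcess:
    """Run the CLI on one PDF and capture whatever it prints."""
    if shutil.which("docling-parse") is None:
        raise FileNotFoundError("docling-parse executable not found on PATH")
    return subprocess.run(
        build_command(pdf_path), capture_output=True, text=True, check=False
    )


print(build_command("my-doc.pdf"))
# prints: ['docling-parse', '-p', 'my-doc.pdf']
```

Passing the arguments as a list rather than a single shell string avoids quoting problems with file names that contain spaces.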

## Development

### CXX

To build the parser, run the following command in the root folder:

```sh
rm -rf build; cmake -B ./build; cd build; make
```

You can run the parser from your build folder with

```sh
./parse.exe <input-file> <optional-logging:true>
```

If you don't have an input file, a template input file will be printed to the terminal.


### Python

To build the package, make sure [poetry](https://python-poetry.org/) is [installed](https://python-poetry.org/docs/#installing-with-the-official-installer), then run:

```sh
poetry build
```

To test the package, run:

```sh
poetry run pytest ./tests/test_parse.py
```


## Contributing

Please read [Contributing to Docling Parse](https://github.com/DS4SD/docling-parse/blob/main/CONTRIBUTING.md) for details.


## References

If you use Docling in your projects, please consider citing the following:

```bib
@software{Docling,
author = {Deep Search Team},
month = {7},
title = {{Docling}},
url = {https://github.com/DS4SD/docling},
version = {main},
year = {2024}
}
```

## License

The Docling Parse codebase is released under the MIT license.
For individual model usage, please refer to the model licenses found in the original packages.

            

Raw data

{
    "_id": null,
    "home_page": "https://github.com/DS4SD/docling-parse",
    "name": "docling-parse",
    "maintainer": "Peter Staar",
    "docs_url": null,
    "requires_python": "<4.0,>=3.9",
    "maintainer_email": "taa@zurich.ibm.com",
    "keywords": "docling, pdf, parser",
    "author": "Peter Staar",
    "author_email": "taa@zurich.ibm.com",
    "download_url": null,
    "platform": null,
    "description": "# Docling Parse\n\n[![PyPI version](https://img.shields.io/pypi/v/docling-parse)](https://pypi.org/project/docling-parse/)\n[![PyPI - Python Version](https://img.shields.io/pypi/pyversions/docling-parse)](https://pypi.org/project/docling-parse/)\n[![Poetry](https://img.shields.io/endpoint?url=https://python-poetry.org/badge/v0.json)](https://python-poetry.org/)\n[![Pybind11](https://img.shields.io/badge/build-pybind11-blue)](https://github.com/pybind/pybind11/)\n[![Platforms](https://img.shields.io/badge/platform-macos%20|%20linux-blue)](https://github.com/DS4SD/docling-parse/)\n[![License MIT](https://img.shields.io/github/license/DS4SD/docling-parse)](https://opensource.org/licenses/MIT)\n\nSimple package to extract text with coordinates from programmatic PDFs.\nThis package is part of the [Docling](https://github.com/DS4SD/docling) conversion.\n\n\n## Quick start\n\nInstall the package from Pypi\n\n```sh\npip install docling-parse\n```\n\nConvert a PDF\n\n```python\nfrom docling_parse.docling_parse import pdf_parser\n\n# Do this only once to load fonts (avoid initialising it many times)\nparser = pdf_parser()\n\n# parser.set_loglevel(1) # 1=error, 2=warning, 3=success, 4=info\n\ndoc_file = \"my-doc.pdf\" # filename\ndoc_key = f\"key={pdf_doc}\" # unique document key (eg hash, UUID, etc)\n\n# Load the document from file using filename doc_file. 
This only loads\n# the QPDF document, but no extracted data\nsuccess = parser.load_document(doc_key, doc_file)\n\n# Open the file in binary mode and read its contents\n# with open(pdf_doc, \"rb\") as file:\n#      file_content = file.read()\n\n# Create a BytesIO object and write the file contents to it\n# bytes_io = io.BytesIO(file_content)\n# success = parser.load_document_from_bytesio(doc_key, bytes_io)\n\n# Parse the entire document in one go, easier, but could require\n# a lot (more) memory as parsing page-by-page\n# json_doc = parser.parse_pdf_from_key(doc_key)\t\n\n# Get number of pages\nnum_pages = parser.number_of_pages(doc_key)\n\n# Parse page by page to minimize memory footprint\nfor page in range(0, num_pages):\n\n    # Internal memory for page is auto-deleted after this call.\n    # No need to unload a specifc page \n    json_doc = parser.parse_pdf_from_key_on_page(doc_key, page)\n\n    if \"pages\" not in json_doc:  # page could not get parsed\n       continue\n\n    # parsed page is the first one!\t\t\t\t  \n    json_page = json_doc[\"pages\"][0] \n    \n    page_dimensions = [json_page[\"dimensions\"][\"width\"], json_page[\"dimensions\"][\"height\"]]\n\n    # find text cells\n    cells=[]\n    for cell_id,cell in enumerate(json_page[\"cells\"]):\n    \tcells.append([page,\n\t              cell_id,\n\t\t      cell[\"content\"][\"rnormalized\"], # text\n\t              cell[\"box\"][\"device\"][0], # x0 (lower left x)\n\t\t      cell[\"box\"][\"device\"][1], # y0 (lower left y)\n\t\t      cell[\"box\"][\"device\"][2], # x1 (upper right x)\n\t\t      cell[\"box\"][\"device\"][3], # y1 (upper right y)\t\n\t\t      ])\n\n    # find bitmap images\n    images=[]\n    for image_id,image in enumerate(json_page[\"images\"]):\n    \timages.append([page,\n\t               image_id,\n\t               image[\"box\"][0], # x0 (lower left x)\n\t\t       image[\"box\"][1], # y0 (lower left y)\n\t\t       image[\"box\"][2], # x1 (upper right x)\n\t\t       
image[\"box\"][3], # y1 (upper right y)\n\t\t       ])\n\n    # find paths\n    paths=[]\n    for path_id,path in enumerate(json_page[\"paths\"]):\n    \tpaths.append([page,\n\t              path_id,\n\t              path[\"x-values\"], # array of x values\n\t              path[\"y-values\"], # array of y values\n\t\t      ])\n\n# Unload the (QPDF) document and buffers\nparser.unload_document(doc_key)\n\n# Unloads everything at once\n# parser.unload_documents()\n```\n\nUse the CLI\n\n```sh\n$ docling-parse -h\nusage: docling-parse [-h] -p PDF\n\nProcess a PDF file.\n\noptions:\n  -h, --help         show this help message and exit\n  -p PDF, --pdf PDF  Path to the PDF file\n```\n\n## Development\n\n### CXX\n\nTo build the parse, simply run the following command in the root folder,\n\n```sh\nrm -rf build; cmake -B ./build; cd build; make\n```\n\nYou can run the parser from your build folder with\n\n```sh\n./parse.exe <input-file> <optional-logging:true>\n```\n\nIf you dont have an input file, then a template input file will be printed on the terminal.\n\n\n### Python\n\nTo build the package, simply run (make sure [poetry](https://python-poetry.org/) is [installed](https://python-poetry.org/docs/#installing-with-the-official-installer)),\n\n```\npoetry build\n```\n\nTo test the package, run,\n\n```\npoetry run pytest ./tests/test_parse.py\n```\n\n\n## Contributing\n\nPlease read [Contributing to Docling Parse](https://github.com/DS4SD/docling-parse/blob/main/CONTRIBUTING.md) for details.\n\n\n## References\n\nIf you use Docling in your projects, please consider citing the following:\n\n```bib\n@software{Docling,\nauthor = {Deep Search Team},\nmonth = {7},\ntitle = {{Docling}},\nurl = {https://github.com/DS4SD/docling},\nversion = {main},\nyear = {2024}\n}\n```\n\n## License\n\nThe Docling Parse codebase is under MIT license.\nFor individual model usage, please refer to the model licenses found in the original packages.\n",
    "bugtrack_url": null,
    "license": "MIT",
    "summary": "Simple package to extract text with coordinates from programmatic PDFs",
    "version": "1.2.0",
    "project_urls": {
        "Homepage": "https://github.com/DS4SD/docling-parse",
        "Repository": "https://github.com/DS4SD/docling-parse"
    },
    "split_keywords": [
        "docling",
        " pdf",
        " parser"
    ],
    "urls": [
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "2cb6a82bb3bb206cbb184ef82e7eff9723fb525a1333ac981667637850ff5f8f",
                "md5": "160acf32ea216ad0b4f9cf6d4f89be73",
                "sha256": "a85e5cc3e075d8628ced33595f2f4768e0dee40d1ed39cdec99b2dcff7eee596"
            },
            "downloads": -1,
            "filename": "docling_parse-1.2.0-cp310-cp310-macosx_13_0_arm64.whl",
            "has_sig": false,
            "md5_digest": "160acf32ea216ad0b4f9cf6d4f89be73",
            "packagetype": "bdist_wheel",
            "python_version": "cp310",
            "requires_python": "<4.0,>=3.9",
            "size": 7658054,
            "upload_time": "2024-09-09T12:33:30",
            "upload_time_iso_8601": "2024-09-09T12:33:30.233897Z",
            "url": "https://files.pythonhosted.org/packages/2c/b6/a82bb3bb206cbb184ef82e7eff9723fb525a1333ac981667637850ff5f8f/docling_parse-1.2.0-cp310-cp310-macosx_13_0_arm64.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "ba84f36bbaa30ec1640d1a37cc7a3459e4fcfb0036e7af5a2491271851b0948b",
                "md5": "ac16efc53eca223808a6aad87b60319c",
                "sha256": "e4fc3875207b837d9849a32f2c15a1fd0244cfdcc268e39858faf83b4bb1ff47"
            },
            "downloads": -1,
            "filename": "docling_parse-1.2.0-cp310-cp310-macosx_13_0_x86_64.whl",
            "has_sig": false,
            "md5_digest": "ac16efc53eca223808a6aad87b60319c",
            "packagetype": "bdist_wheel",
            "python_version": "cp310",
            "requires_python": "<4.0,>=3.9",
            "size": 7739490,
            "upload_time": "2024-09-09T12:34:41",
            "upload_time_iso_8601": "2024-09-09T12:34:41.192172Z",
            "url": "https://files.pythonhosted.org/packages/ba/84/f36bbaa30ec1640d1a37cc7a3459e4fcfb0036e7af5a2491271851b0948b/docling_parse-1.2.0-cp310-cp310-macosx_13_0_x86_64.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "9bcbc9ae3ad927c1705af38bdc5c5e2f1f1f02f20cdd7809b4148806df0a62cf",
                "md5": "ef77db00fbb21804c067f5eebc0935b1",
                "sha256": "0724b13fc6ff02bddacbc7b3b11a7f648bad39a5af9df039efd8724ff6a2cdac"
            },
            "downloads": -1,
            "filename": "docling_parse-1.2.0-cp310-cp310-macosx_14_0_arm64.whl",
            "has_sig": false,
            "md5_digest": "ef77db00fbb21804c067f5eebc0935b1",
            "packagetype": "bdist_wheel",
            "python_version": "cp310",
            "requires_python": "<4.0,>=3.9",
            "size": 7646993,
            "upload_time": "2024-09-09T12:26:57",
            "upload_time_iso_8601": "2024-09-09T12:26:57.596339Z",
            "url": "https://files.pythonhosted.org/packages/9b/cb/c9ae3ad927c1705af38bdc5c5e2f1f1f02f20cdd7809b4148806df0a62cf/docling_parse-1.2.0-cp310-cp310-macosx_14_0_arm64.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "d7ab1d904c5864364f36a993c757fc03c554fecf28a25f09df016efccb026043",
                "md5": "7437585adfe2e6f4fae4139bc2a0826b",
                "sha256": "c2ebaf97d9f6d8c50f4846ff8927d41c55087c8229d0e1b6944efad810fbe8e4"
            },
            "downloads": -1,
            "filename": "docling_parse-1.2.0-cp310-cp310-macosx_14_0_x86_64.whl",
            "has_sig": false,
            "md5_digest": "7437585adfe2e6f4fae4139bc2a0826b",
            "packagetype": "bdist_wheel",
            "python_version": "cp310",
            "requires_python": "<4.0,>=3.9",
            "size": 7725270,
            "upload_time": "2024-09-09T12:26:51",
            "upload_time_iso_8601": "2024-09-09T12:26:51.752477Z",
            "url": "https://files.pythonhosted.org/packages/d7/ab/1d904c5864364f36a993c757fc03c554fecf28a25f09df016efccb026043/docling_parse-1.2.0-cp310-cp310-macosx_14_0_x86_64.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "aa33d436f89d201e5001db46f852039815c91f10b5216ac215b8222d45abe438",
                "md5": "c459b6c17aa8831714b68afd7b595a72",
                "sha256": "63a0f5a1e06873470e252c3c09268cdf294fc3fff081f8c0aa2df0480b772faa"
            },
            "downloads": -1,
            "filename": "docling_parse-1.2.0-cp310-cp310-manylinux_2_17_aarch64.manylinux2014_aarch64.whl",
            "has_sig": false,
            "md5_digest": "c459b6c17aa8831714b68afd7b595a72",
            "packagetype": "bdist_wheel",
            "python_version": "cp310",
            "requires_python": "<4.0,>=3.9",
            "size": 8779344,
            "upload_time": "2024-09-09T13:14:24",
            "upload_time_iso_8601": "2024-09-09T13:14:24.479845Z",
            "url": "https://files.pythonhosted.org/packages/aa/33/d436f89d201e5001db46f852039815c91f10b5216ac215b8222d45abe438/docling_parse-1.2.0-cp310-cp310-manylinux_2_17_aarch64.manylinux2014_aarch64.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "703c28ef12df34a10adfaa8b8b7cab958ac840b14a3798d33e480f650daa33a0",
                "md5": "aba6d2729abe8757f235b6d84650c429",
                "sha256": "accefe923784147c11f13b1195fe8f6024bc3a895b63c2c0f7c0e2a6536252f5"
            },
            "downloads": -1,
            "filename": "docling_parse-1.2.0-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl",
            "has_sig": false,
            "md5_digest": "aba6d2729abe8757f235b6d84650c429",
            "packagetype": "bdist_wheel",
            "python_version": "cp310",
            "requires_python": "<4.0,>=3.9",
            "size": 8833346,
            "upload_time": "2024-09-09T13:14:26",
            "upload_time_iso_8601": "2024-09-09T13:14:26.750262Z",
            "url": "https://files.pythonhosted.org/packages/70/3c/28ef12df34a10adfaa8b8b7cab958ac840b14a3798d33e480f650daa33a0/docling_parse-1.2.0-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "b8706c7a05f8c7c0f6d306d6eaa62276120121fc7b35efae38eadf66fdd1755e",
                "md5": "fbd84d060a13908e2730c52add67b2f7",
                "sha256": "a2deac4e69d3de46647970df973189ee19dbe5939ec379c91bc67fa09641ac49"
            },
            "downloads": -1,
            "filename": "docling_parse-1.2.0-cp311-cp311-macosx_13_0_arm64.whl",
            "has_sig": false,
            "md5_digest": "fbd84d060a13908e2730c52add67b2f7",
            "packagetype": "bdist_wheel",
            "python_version": "cp311",
            "requires_python": "<4.0,>=3.9",
            "size": 7658016,
            "upload_time": "2024-09-09T12:33:36",
            "upload_time_iso_8601": "2024-09-09T12:33:36.601494Z",
            "url": "https://files.pythonhosted.org/packages/b8/70/6c7a05f8c7c0f6d306d6eaa62276120121fc7b35efae38eadf66fdd1755e/docling_parse-1.2.0-cp311-cp311-macosx_13_0_arm64.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "9b297fe639eda5f5bafdc04f9c2bb02a75ff1196759c7793db6ed2e989684397",
                "md5": "f5b982c3afa9e8477e173fcc0f1e99ac",
                "sha256": "c17516e76d57fe9ab36d139f7fa022517087b84249cc4d74fbf6d7e380725f3b"
            },
            "downloads": -1,
            "filename": "docling_parse-1.2.0-cp311-cp311-macosx_13_0_x86_64.whl",
            "has_sig": false,
            "md5_digest": "f5b982c3afa9e8477e173fcc0f1e99ac",
            "packagetype": "bdist_wheel",
            "python_version": "cp311",
            "requires_python": "<4.0,>=3.9",
            "size": 7739426,
            "upload_time": "2024-09-09T12:36:41",
            "upload_time_iso_8601": "2024-09-09T12:36:41.573523Z",
            "url": "https://files.pythonhosted.org/packages/9b/29/7fe639eda5f5bafdc04f9c2bb02a75ff1196759c7793db6ed2e989684397/docling_parse-1.2.0-cp311-cp311-macosx_13_0_x86_64.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "7e7007195f02e6b5770717e409e8584a5729aad7398ee5b58658e5263531dda4",
                "md5": "5a9e379d2c1ce3570a2d0d19a9dc3696",
                "sha256": "739adf1f3354864efc2b80bb684202c8bca687221e482be3798ce64e8c3d5c8f"
            },
            "downloads": -1,
            "filename": "docling_parse-1.2.0-cp311-cp311-macosx_14_0_arm64.whl",
            "has_sig": false,
            "md5_digest": "5a9e379d2c1ce3570a2d0d19a9dc3696",
            "packagetype": "bdist_wheel",
            "python_version": "cp311",
            "requires_python": "<4.0,>=3.9",
            "size": 7646974,
            "upload_time": "2024-09-09T12:36:53",
            "upload_time_iso_8601": "2024-09-09T12:36:53.565042Z",
            "url": "https://files.pythonhosted.org/packages/7e/70/07195f02e6b5770717e409e8584a5729aad7398ee5b58658e5263531dda4/docling_parse-1.2.0-cp311-cp311-macosx_14_0_arm64.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "fcaf2082cd0f7f9178475f2321f41c03d1f95c4e0ec628d9369938efe6396351",
                "md5": "d14bddf7c64df31fd666346ffd8d078d",
                "sha256": "709a3ade0a08661d4e2ceba0f5ad188f07d614114708c2458a7c00d15cb21164"
            },
            "downloads": -1,
            "filename": "docling_parse-1.2.0-cp311-cp311-macosx_14_0_x86_64.whl",
            "has_sig": false,
            "md5_digest": "d14bddf7c64df31fd666346ffd8d078d",
            "packagetype": "bdist_wheel",
            "python_version": "cp311",
            "requires_python": "<4.0,>=3.9",
            "size": 7725306,
            "upload_time": "2024-09-09T12:36:50",
            "upload_time_iso_8601": "2024-09-09T12:36:50.767332Z",
            "url": "https://files.pythonhosted.org/packages/fc/af/2082cd0f7f9178475f2321f41c03d1f95c4e0ec628d9369938efe6396351/docling_parse-1.2.0-cp311-cp311-macosx_14_0_x86_64.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "37a38c80d56e5eed402d432ee7eead89b7522524b6c1a20179c810e4be4ce776",
                "md5": "e218f2040ba356812993e920d92f0144",
                "sha256": "6d944c6789179df77f1fd35d26aadf451cae8e8ae1801ab6a17e8acf7e1c320c"
            },
            "downloads": -1,
            "filename": "docling_parse-1.2.0-cp311-cp311-manylinux_2_17_aarch64.manylinux2014_aarch64.whl",
            "has_sig": false,
            "md5_digest": "e218f2040ba356812993e920d92f0144",
            "packagetype": "bdist_wheel",
            "python_version": "cp311",
            "requires_python": "<4.0,>=3.9",
            "size": 8779474,
            "upload_time": "2024-09-09T13:14:57",
            "upload_time_iso_8601": "2024-09-09T13:14:57.179163Z",
            "url": "https://files.pythonhosted.org/packages/37/a3/8c80d56e5eed402d432ee7eead89b7522524b6c1a20179c810e4be4ce776/docling_parse-1.2.0-cp311-cp311-manylinux_2_17_aarch64.manylinux2014_aarch64.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "f20079b1b13e5633bf4de0acdfadbf352fbe01335f4fb87686f35f3f10a522a9",
                "md5": "5ddbcd718c3d467b3f332cc661bba9dd",
                "sha256": "a24c7bb0f7628a7174bc8ee94ab6004443b2908e91a6a9a8616ad3d57bce9256"
            },
            "downloads": -1,
            "filename": "docling_parse-1.2.0-cp311-cp311-manylinux_2_17_x86_64.manylinux2014_x86_64.whl",
            "has_sig": false,
            "md5_digest": "5ddbcd718c3d467b3f332cc661bba9dd",
            "packagetype": "bdist_wheel",
            "python_version": "cp311",
            "requires_python": "<4.0,>=3.9",
            "size": 8833297,
            "upload_time": "2024-09-09T13:14:59",
            "upload_time_iso_8601": "2024-09-09T13:14:59.226122Z",
            "url": "https://files.pythonhosted.org/packages/f2/00/79b1b13e5633bf4de0acdfadbf352fbe01335f4fb87686f35f3f10a522a9/docling_parse-1.2.0-cp311-cp311-manylinux_2_17_x86_64.manylinux2014_x86_64.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "2ddf4b1972719c88c3020d9c4cc8fde231ba3005a1db7f5afcc58d181440a3c1",
                "md5": "2df05c65a08512c4c9e24cb22fd7381b",
                "sha256": "b4218d6f3234a22e2de929e77d84fa940bdaef0e2644bd8e291f6f95930c1c27"
            },
            "downloads": -1,
            "filename": "docling_parse-1.2.0-cp312-cp312-macosx_13_0_arm64.whl",
            "has_sig": false,
            "md5_digest": "2df05c65a08512c4c9e24cb22fd7381b",
            "packagetype": "bdist_wheel",
            "python_version": "cp312",
            "requires_python": "<4.0,>=3.9",
            "size": 7658770,
            "upload_time": "2024-09-09T12:37:34",
            "upload_time_iso_8601": "2024-09-09T12:37:34.812114Z",
            "url": "https://files.pythonhosted.org/packages/2d/df/4b1972719c88c3020d9c4cc8fde231ba3005a1db7f5afcc58d181440a3c1/docling_parse-1.2.0-cp312-cp312-macosx_13_0_arm64.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "38872d9cf24242e0daa916631959477b4d32df24dbacfeb332c7dbbd4c2d5dd6",
                "md5": "c3ea51b3ee3432cfab442e9b67d8ba06",
                "sha256": "39444e9e758edf72d9a4f7e035872fd75159566a1f62ddafdba46c7c0ac92cc4"
            },
            "downloads": -1,
            "filename": "docling_parse-1.2.0-cp312-cp312-macosx_13_0_x86_64.whl",
            "has_sig": false,
            "md5_digest": "c3ea51b3ee3432cfab442e9b67d8ba06",
            "packagetype": "bdist_wheel",
            "python_version": "cp312",
            "requires_python": "<4.0,>=3.9",
            "size": 7740559,
            "upload_time": "2024-09-09T12:41:51",
            "upload_time_iso_8601": "2024-09-09T12:41:51.426091Z",
            "url": "https://files.pythonhosted.org/packages/38/87/2d9cf24242e0daa916631959477b4d32df24dbacfeb332c7dbbd4c2d5dd6/docling_parse-1.2.0-cp312-cp312-macosx_13_0_x86_64.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "0617856ca6ec7f0d5bea8cdcc871fcafbaaccbfcc247379e453cc93be4dad7a1",
                "md5": "ac63c57a72679de7bcb102bdc4415948",
                "sha256": "344f1f6296d5c6f5a2052d6490a7abc2b381b8861ee1ed23898db3a5d2801af3"
            },
            "downloads": -1,
            "filename": "docling_parse-1.2.0-cp312-cp312-macosx_14_0_arm64.whl",
            "has_sig": false,
            "md5_digest": "ac63c57a72679de7bcb102bdc4415948",
            "packagetype": "bdist_wheel",
            "python_version": "cp312",
            "requires_python": "<4.0,>=3.9",
            "size": 7647576,
            "upload_time": "2024-09-09T12:40:07",
            "upload_time_iso_8601": "2024-09-09T12:40:07.860448Z",
            "url": "https://files.pythonhosted.org/packages/06/17/856ca6ec7f0d5bea8cdcc871fcafbaaccbfcc247379e453cc93be4dad7a1/docling_parse-1.2.0-cp312-cp312-macosx_14_0_arm64.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "0db849cf9e5bfdd10f2ae6af01aebcef0997f8eb741883f0c6f6f421065885a1",
                "md5": "12ff639b5e50e4995328604e9cd028b2",
                "sha256": "0f71f0d8c8bdec16b847d031eb02aa9bb08b3a1f5a3bcc286b3f2ebc882eb038"
            },
            "downloads": -1,
            "filename": "docling_parse-1.2.0-cp312-cp312-macosx_14_0_x86_64.whl",
            "has_sig": false,
            "md5_digest": "12ff639b5e50e4995328604e9cd028b2",
            "packagetype": "bdist_wheel",
            "python_version": "cp312",
            "requires_python": "<4.0,>=3.9",
            "size": 7726566,
            "upload_time": "2024-09-09T12:40:31",
            "upload_time_iso_8601": "2024-09-09T12:40:31.795213Z",
            "url": "https://files.pythonhosted.org/packages/0d/b8/49cf9e5bfdd10f2ae6af01aebcef0997f8eb741883f0c6f6f421065885a1/docling_parse-1.2.0-cp312-cp312-macosx_14_0_x86_64.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "7689af37f65e886379e8cc4c4c4b4ea2d1ab15bc8f792873a7bcfae412c7e0b1",
                "md5": "97928ceda8e75790d8650ad68a7b5a8b",
                "sha256": "144900e82af7c3089c1c0f9b3f3f42b89a313cfba75faa2e08c8b705fe48a742"
            },
            "downloads": -1,
            "filename": "docling_parse-1.2.0-cp312-cp312-manylinux_2_17_aarch64.manylinux2014_aarch64.whl",
            "has_sig": false,
            "md5_digest": "97928ceda8e75790d8650ad68a7b5a8b",
            "packagetype": "bdist_wheel",
            "python_version": "cp312",
            "requires_python": "<4.0,>=3.9",
            "size": 8779360,
            "upload_time": "2024-09-09T13:15:52",
            "upload_time_iso_8601": "2024-09-09T13:15:52.128472Z",
            "url": "https://files.pythonhosted.org/packages/76/89/af37f65e886379e8cc4c4c4b4ea2d1ab15bc8f792873a7bcfae412c7e0b1/docling_parse-1.2.0-cp312-cp312-manylinux_2_17_aarch64.manylinux2014_aarch64.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "f2272c6796bc6c83b2ccaa1bc9c78c3c7ba13760e455e1d11695abe24064226d",
                "md5": "0f381edc537d8f3260fec1be90b7825b",
                "sha256": "78fd07f9a09e71c96f445ae8b951b5e6f69b6b79dd2011e88531dbf66d7f7944"
            },
            "downloads": -1,
            "filename": "docling_parse-1.2.0-cp312-cp312-manylinux_2_17_x86_64.manylinux2014_x86_64.whl",
            "has_sig": false,
            "md5_digest": "0f381edc537d8f3260fec1be90b7825b",
            "packagetype": "bdist_wheel",
            "python_version": "cp312",
            "requires_python": "<4.0,>=3.9",
            "size": 8833415,
            "upload_time": "2024-09-09T13:15:54",
            "upload_time_iso_8601": "2024-09-09T13:15:54.298337Z",
            "url": "https://files.pythonhosted.org/packages/f2/27/2c6796bc6c83b2ccaa1bc9c78c3c7ba13760e455e1d11695abe24064226d/docling_parse-1.2.0-cp312-cp312-manylinux_2_17_x86_64.manylinux2014_x86_64.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "5b5a6d7e64e5e4e46509c825ceb11311c3df9056ee52681b060e8a8c02bd23c6",
                "md5": "faf091d4ca0d88352453fbad04f5f230",
                "sha256": "2118ea097dfee3d082a640d14d69fc2ad4716ae297b97ecef9a2f3a34baacad9"
            },
            "downloads": -1,
            "filename": "docling_parse-1.2.0-cp39-cp39-macosx_13_0_arm64.whl",
            "has_sig": false,
            "md5_digest": "faf091d4ca0d88352453fbad04f5f230",
            "packagetype": "bdist_wheel",
            "python_version": "cp39",
            "requires_python": "<4.0,>=3.9",
            "size": 7658161,
            "upload_time": "2024-09-09T12:44:48",
            "upload_time_iso_8601": "2024-09-09T12:44:48.912461Z",
            "url": "https://files.pythonhosted.org/packages/5b/5a/6d7e64e5e4e46509c825ceb11311c3df9056ee52681b060e8a8c02bd23c6/docling_parse-1.2.0-cp39-cp39-macosx_13_0_arm64.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "9ead14c786e1d5be2fc070abbc101854ec53f2efac72bf6a03c48c602fcf0574",
                "md5": "de8d72804fdeefd6fff2d1a1765c9abd",
                "sha256": "38434455ddbca4e525154a3545d1ed71a80db48c0424cd6c3236e4590971bfca"
            },
            "downloads": -1,
            "filename": "docling_parse-1.2.0-cp39-cp39-macosx_13_0_x86_64.whl",
            "has_sig": false,
            "md5_digest": "de8d72804fdeefd6fff2d1a1765c9abd",
            "packagetype": "bdist_wheel",
            "python_version": "cp39",
            "requires_python": "<4.0,>=3.9",
            "size": 7739573,
            "upload_time": "2024-09-09T12:45:56",
            "upload_time_iso_8601": "2024-09-09T12:45:56.204244Z",
            "url": "https://files.pythonhosted.org/packages/9e/ad/14c786e1d5be2fc070abbc101854ec53f2efac72bf6a03c48c602fcf0574/docling_parse-1.2.0-cp39-cp39-macosx_13_0_x86_64.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "e43566d1e4a5cf22b9db8a9aecd772b9f76b833edc5e281e1f8b2579d2c73a9f",
                "md5": "f56a7b2980b2f21a1360988d90684018",
                "sha256": "34514dec15c6f2e462d48c5691f136ee1913a0927a15d2e13bacc38dc261c0d5"
            },
            "downloads": -1,
            "filename": "docling_parse-1.2.0-cp39-cp39-macosx_14_0_arm64.whl",
            "has_sig": false,
            "md5_digest": "f56a7b2980b2f21a1360988d90684018",
            "packagetype": "bdist_wheel",
            "python_version": "cp39",
            "requires_python": "<4.0,>=3.9",
            "size": 7647031,
            "upload_time": "2024-09-09T12:43:36",
            "upload_time_iso_8601": "2024-09-09T12:43:36.773367Z",
            "url": "https://files.pythonhosted.org/packages/e4/35/66d1e4a5cf22b9db8a9aecd772b9f76b833edc5e281e1f8b2579d2c73a9f/docling_parse-1.2.0-cp39-cp39-macosx_14_0_arm64.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "8b3eb22f323fb6fb81c9ec1d1f8003be90791a17ee3cdf501ba654770a30160b",
                "md5": "137884aa6132243bac7f950744464b9d",
                "sha256": "ea08b8c996234e18aaf65bff1e3ee954cff2960ef0d11414e5e14d674d343407"
            },
            "downloads": -1,
            "filename": "docling_parse-1.2.0-cp39-cp39-macosx_14_0_x86_64.whl",
            "has_sig": false,
            "md5_digest": "137884aa6132243bac7f950744464b9d",
            "packagetype": "bdist_wheel",
            "python_version": "cp39",
            "requires_python": "<4.0,>=3.9",
            "size": 7725395,
            "upload_time": "2024-09-09T12:43:43",
            "upload_time_iso_8601": "2024-09-09T12:43:43.899538Z",
            "url": "https://files.pythonhosted.org/packages/8b/3e/b22f323fb6fb81c9ec1d1f8003be90791a17ee3cdf501ba654770a30160b/docling_parse-1.2.0-cp39-cp39-macosx_14_0_x86_64.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "9391e09aa28878e09b983788e71c7f8d25dc80ab5c6e2bd9443b57f43145fd5b",
                "md5": "6fa6ab2ce3041cac81a9a740d4f36d55",
                "sha256": "205d15200ababa614e14f604e64cfeed39c188da982f34705735232f8960c23d"
            },
            "downloads": -1,
            "filename": "docling_parse-1.2.0-cp39-cp39-manylinux_2_17_aarch64.manylinux2014_aarch64.whl",
            "has_sig": false,
            "md5_digest": "6fa6ab2ce3041cac81a9a740d4f36d55",
            "packagetype": "bdist_wheel",
            "python_version": "cp39",
            "requires_python": "<4.0,>=3.9",
            "size": 8779603,
            "upload_time": "2024-09-09T13:14:32",
            "upload_time_iso_8601": "2024-09-09T13:14:32.848920Z",
            "url": "https://files.pythonhosted.org/packages/93/91/e09aa28878e09b983788e71c7f8d25dc80ab5c6e2bd9443b57f43145fd5b/docling_parse-1.2.0-cp39-cp39-manylinux_2_17_aarch64.manylinux2014_aarch64.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "8acc9217b33f89dc785f81b02903f1a0fbd053496d6155bc034be4f4b77b4309",
                "md5": "33b7840cec2a4c0a66caa19652382e5c",
                "sha256": "8fa3bfdb205662ab28d2b8c029b331be9bc8186153edc8ebc494042c1689b86f"
            },
            "downloads": -1,
            "filename": "docling_parse-1.2.0-cp39-cp39-manylinux_2_17_x86_64.manylinux2014_x86_64.whl",
            "has_sig": false,
            "md5_digest": "33b7840cec2a4c0a66caa19652382e5c",
            "packagetype": "bdist_wheel",
            "python_version": "cp39",
            "requires_python": "<4.0,>=3.9",
            "size": 8833913,
            "upload_time": "2024-09-09T13:14:35",
            "upload_time_iso_8601": "2024-09-09T13:14:35.389077Z",
            "url": "https://files.pythonhosted.org/packages/8a/cc/9217b33f89dc785f81b02903f1a0fbd053496d6155bc034be4f4b77b4309/docling_parse-1.2.0-cp39-cp39-manylinux_2_17_x86_64.manylinux2014_x86_64.whl",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2024-09-09 12:33:30",
    "github": true,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "github_user": "DS4SD",
    "github_project": "docling-parse",
    "travis_ci": false,
    "coveralls": false,
    "github_actions": true,
    "lcname": "docling-parse"
}
        