invoice2data

Name	invoice2data JSON
Version	0.4.5 JSON
	download
home_page	https://github.com/invoice-x/invoice2data
Summary	Python parser to extract data from pdf invoice
upload_time	2023-11-26 16:02:15
maintainer
docs_url	None
author	Manuel Riel
requires_python
license	MIT License
keywords	pdf invoicing
VCS
bugtrack_url
requirements	No requirements were recorded.
Travis-CI	No Travis.
coveralls test coverage	No coveralls.

            # Data extractor for PDF invoices - invoice2data

[![invoice2data build status on GitHub Actions](https://github.com/invoice-x/invoice2data/workflows/Test/badge.svg)](https://github.com/invoice-x/invoice2data/actions)
[![Version](https://img.shields.io/pypi/v/invoice2data.svg)](https://pypi.python.org/pypi/invoice2data)
[![Support Python versions](https://img.shields.io/pypi/pyversions/invoice2data.svg)](https://pypi.python.org/pypi/invoice2data)

A command line tool and Python library to support your accounting
process.

1. extracts text from PDF files using different techniques, like
   `pdftotext`, `text`, `ocrmypdf`, `pdfminer`, `pdfplumber` or OCR -- `tesseract`, or
   `gvision` (Google Cloud Vision).
2. searches for regex in the result using a YAML or JSON-based template system
3. saves results as CSV, JSON or XML or renames PDF files to match the content.

With the flexible template system you can:

- precisely match content PDF files
- plugins available to match line items and tables
- define static fields that are the same for every invoice
- define custom fields needed in your organisation or process
- have multiple regex per field (if layout or wording changes)
- define currency
- extract invoice-items using the `lines`-plugin developed by [Holger
  Brunn](https://github.com/hbrunn)

Go from PDF files to this:

    {'date': (2014, 5, 7), 'invoice_number': '30064443', 'amount': 34.73, 'desc': 'Invoice 30064443 from QualityHosting', 'lines': [{'price': 42.0, 'desc': u'Small Business StandardExchange 2010\nGrundgeb\xfchr pro Einheit\nDienst: OUDJQ_office\n01.05.14-31.05.14\n', 'pos': u'7', 'qty': 1.0}]}
    {'date': (2014, 6, 4), 'invoice_number': 'EUVINS1-OF5-DE-120725895', 'amount': 35.24, 'desc': 'Invoice EUVINS1-OF5-DE-120725895 from Amazon EU'}
    {'date': (2014, 8, 3), 'invoice_number': '42183017', 'amount': 4.11, 'desc': 'Invoice 42183017 from Amazon Web Services'}
    {'date': (2015, 1, 28), 'invoice_number': '12429647', 'amount': 101.0, 'desc': 'Invoice 12429647 from Envato'}

```mermaid
flowchart LR

    InvoiceFile[fa:fa-file-invoice Invoicefile\n\npdf\nimage\ntext] --> Input-module(Input Module\n\npdftotext\ntext\npdfminer\npdfplumber\ntesseract\ngvision)

    Input-module --> |Extracted Text| C{keyword\nmatching}

    Invoice-Templates[(fa:fa-file-lines Invoice Templates)] --> C{keyword\nmatching}

    C --> |Extracted Text + fa:fa-file-circle-check Template| E(Template Processing\n apply options from template\nremove accents, replaces etc...)

    E --> |Optimized String|Plugins&Parsers(Call plugins + parsers)

    subgraph Plugins&Parsers

      direction BT

        tables[fa:fa-table tables] ~~~ lines[fa:fa-grip-lines lines]

        lines ~~~ regex[fa:fa-code regex]

        regex ~~~ static[fa:fa-check static]

 

    end

    Plugins&Parsers --> |output| result[result\nfa:fa-file-csv,\njson,\nXML]

 

 click Invoice-Templates https://github.com/invoice-x/invoice2data/blob/master/TUTORIAL.md

 click result https://github.com/invoice-x/invoice2data#usage

 click Input-module https://github.com/invoice-x/invoice2data#installation-of-input-modules

 click E https://github.com/invoice-x/invoice2data/blob/master/TUTORIAL.md#options

 click tables https://github.com/invoice-x/invoice2data/blob/master/TUTORIAL.md#tables

 click lines https://github.com/invoice-x/invoice2data/blob/master/TUTORIAL.md#lines

 click regex https://github.com/invoice-x/invoice2data/blob/master/TUTORIAL.md#regex

 click static https://github.com/invoice-x/invoice2data/blob/master/TUTORIAL.md#parser-static

```

## Installation

1.  Install pdftotext

If possible get the latest
[xpdf/poppler-utils](https://poppler.freedesktop.org/) version. It's
included with macOS Homebrew, Debian and Ubuntu. Without it, `pdftotext`
won't parse tables in PDF correctly.

2.  Install `invoice2data` using pip

    pip install invoice2data

### Installation of input modules

An [tesseract](https://github.com/tesseract-ocr/tessdoc/blob/main/FAQ.md#how-do-i-get-tesseract) wrapper is included in auto language mode. It will test your input files against the languages installed on your system. To use it tesseract and imagemagick needs to be installed.
tesseract supports multiple OCR engine modes. By default the available engine installed on the system will be used.

Languages:
tesseract-ocr recognize more than [100 languages](https://github.com/tesseract-ocr/tessdata)
For Linux users, you can often find packages that provide language packs:

```
# Display a list of all Tesseract language packs
apt-cache search tesseract-ocr

# Debian/Ubuntu users
apt-get install tesseract-ocr-chi-sim  # Example: Install Chinese Simplified language pack

# Arch Linux users
pacman -S tesseract-data-eng tesseract-data-deu # Example: Install the English and German language packs

```

## Usage

Basic usage. Process PDF files and write result to CSV.

- `invoice2data invoice.pdf`
- `invoice2data invoice.txt`
- `invoice2data *.pdf`

Choose any of the following input readers:

- pdftotext `invoice2data --input-reader pdftotext invoice.pdf`
- pdftotext `invoice2data --input-reader text invoice.txt`
- tesseract `invoice2data --input-reader tesseract invoice.pdf`
- pdfminer.six `invoice2data --input-reader pdfminer invoice.pdf`
- pdfplumber `invoice2data --input-reader pdfplumber invoice.pdf`
- ocrmypdf `invoice2data --input-reader ocrmypdf invoice.pdf`
- gvision `invoice2data --input-reader gvision invoice.pdf` (needs `GOOGLE_APPLICATION_CREDENTIALS` env var)

Choose any of the following output formats:

- csv `invoice2data --output-format csv invoice.pdf`
- json `invoice2data --output-format json invoice.pdf`
- xml `invoice2data --output-format xml invoice.pdf`

Save output file with custom name or a specific folder

`invoice2data --output-format csv --output-name myinvoices/invoices.csv invoice.pdf`

**Note:** You must specify the `output-format` in order to create
`output-name`

Specify folder with yml templates. (e.g. your suppliers)

`invoice2data --template-folder ACME-templates invoice.pdf`

Only use your own templates and exclude built-ins

`invoice2data --exclude-built-in-templates --template-folder ACME-templates invoice.pdf`

Processes a folder of invoices and copies renamed invoices to new
folder.

`invoice2data --copy new_folder folder_with_invoices/*.pdf`

Processes a single file and dumps whole file for debugging (useful when
adding new templates in templates.py)

`invoice2data --debug my_invoice.pdf`

Recognize test invoices: `invoice2data invoice2data/test/pdfs/* --debug`

### Use as Python Library

You can easily add `invoice2data` to your own Python scripts as library.

    from invoice2data import extract_data
    result = extract_data('path/to/my/file.pdf')

Using in-house templates

    from invoice2data import extract_data
    from invoice2data.extract.loader import read_templates

    templates = read_templates('/path/to/your/templates/')
    result = extract_data(filename, templates=templates)


## Template system

See `invoice2data/extract/templates` for existing templates. Just extend
the list to add your own. If deployed by a bigger organisation, there
should be an interface to edit templates for new suppliers. 80-20 rule.
For a short tutorial on how to add new templates, see [TUTORIAL.md](TUTORIAL.md).

Templates are based on Yaml or JSON. They define one or more keywords to find
the right template, one or more exclude_keywords to further narrow it down
and regexp for fields to be extracted. They could also be a static value,
like the full company name.

Template files are tried in alphabetical order.

We may extend them to feature options to be used during invoice
processing.

Example:

    issuer: Amazon Web Services, Inc.
    keywords:
    - Amazon Web Services
    exclude_keywords:
    - San Jose
    fields:
      amount: TOTAL AMOUNT DUE ON.*\$(\d+\.\d+)
      amount_untaxed: TOTAL AMOUNT DUE ON.*\$(\d+\.\d+)
      date: Invoice Date:\s+([a-zA-Z]+ \d+ , \d+)
      invoice_number: Invoice Number:\s+(\d+)
      partner_name: (Amazon Web Services, Inc\.)
    options:
      remove_whitespace: false
      currency: HKD
      date_formats:
        - '%d/%m/%Y'
    lines:
        start: Detail
        end: \* May include estimated US sales tax
        first_line: ^    (?P<description>\w+.*)\$(?P<price_unit>\d+\.\d+)
        line: (.*)\$(\d+\.\d+)
        skip_line: Note
        last_line: VAT \*\*

The lines package has multiple settings:
- start > The pattern where the lines begin. This is typically the header row of the table. This row is not included in the line matching.
- end > The pattern denoting where the lines end. Typically some text at the very end or immediately below the table. Also not included in the line matching.
- first_line > Optional. This is the primary line item for each entry.
- line > If first_line is not provided, this will be used as the primary line pattern. If first_line is provided, this is the pattern for any sub-lines such as line item details.
- skip_line > Optional. If first_line is passed, this pattern indicates which sub-lines will be skipped and their data not recorded. This is useful if tables span multiple pages and you need to skip over page numbers or headers that appear mid-table.
- last_line > Optional. If first_line is passed, this pattern denotes the final line of the sub-lines and is included in the output data.

:warning: Invoice2data uses a yaml templating system. The yaml templates are loaded with [pyyaml](https://github.com/yaml/pyyaml) which is a pure python implementation. (thus rather slow)
As an alternative json templates can be used. Which are natively better supported by python.

The performance with yaml templates can be greatly increased **10x** by using [libyaml](https://github.com/yaml/libyaml)
It can be installed on most distributions by:
`sudo apt-get libyaml-dev`


## Development

If you are interested in improving this project, have a look at our
[developer guide](DEVELOP.md) to get you started quickly.

## Roadmap and open tasks

- integrate with online OCR?
- try to 'guess' parameters for new invoice formats.
- can apply machine learning to guess new parameters?
- advanced table parsing with [camelot](https://github.com/camelot-dev/camelot)

## Maintainers

- [Manuel Riel](https://github.com/m3nu)
- [Alexis de Lattre](https://github.com/alexis-via)
- [bosd](https://github.com/bosd)

## Contributors

-   [Harshit Joshi](https://github.com/duskybomb): As Google Summer of
    Code student.
-   [Holger Brunn](https://github.com/hbrunn): Add support for parsing
    invoice items.

## Used By
- Odoo, OCA module [account_invoice_import_invoice2data](https://github.com/OCA/edi)

## Related Projects

- [OCR-Invoice](https://github.com/robela/OCR-Invoice) (FOSS \| C\#)
- [DeepLogic AI](https://deeplogicai.tech/case_list/automatic-key-information-extraction-business-documents/) (Commercial \| SaaS)
- [Docparser](https://docparser.com/) (Commercial \| Web Service)
- [A-PDF](http://www.a-pdf.com/data-extractor/index.htm) (Commercial)
- [PDFdeconstruct](http://www.glyphandcog.com/PDFdeconstruct.html?g6)
  (Commercial)
- [CVision](http://www.cvisiontech.com/library/document-automation/forms-processing/extract-data-from-invoice.html)
  (Commercial)

Raw data

            {
    "_id": null,
    "home_page": "https://github.com/invoice-x/invoice2data",
    "name": "invoice2data",
    "maintainer": "",
    "docs_url": null,
    "requires_python": "",
    "maintainer_email": "",
    "keywords": "pdf,invoicing",
    "author": "Manuel Riel",
    "author_email": "",
    "download_url": "https://files.pythonhosted.org/packages/11/a3/e6d5080f27cb909560657720f0f919d40e605c9a6fff693580e78c5fbda3/invoice2data-0.4.5.tar.gz",
    "platform": null,
    "description": "# Data extractor for PDF invoices - invoice2data\n\n[![invoice2data build status on GitHub Actions](https://github.com/invoice-x/invoice2data/workflows/Test/badge.svg)](https://github.com/invoice-x/invoice2data/actions)\n[![Version](https://img.shields.io/pypi/v/invoice2data.svg)](https://pypi.python.org/pypi/invoice2data)\n[![Support Python versions](https://img.shields.io/pypi/pyversions/invoice2data.svg)](https://pypi.python.org/pypi/invoice2data)\n\nA command line tool and Python library to support your accounting\nprocess.\n\n1. extracts text from PDF files using different techniques, like\n   `pdftotext`, `text`, `ocrmypdf`, `pdfminer`, `pdfplumber` or OCR -- `tesseract`, or\n   `gvision` (Google Cloud Vision).\n2. searches for regex in the result using a YAML or JSON-based template system\n3. saves results as CSV, JSON or XML or renames PDF files to match the content.\n\nWith the flexible template system you can:\n\n- precisely match content PDF files\n- plugins available to match line items and tables\n- define static fields that are the same for every invoice\n- define custom fields needed in your organisation or process\n- have multiple regex per field (if layout or wording changes)\n- define currency\n- extract invoice-items using the `lines`-plugin developed by [Holger\n  Brunn](https://github.com/hbrunn)\n\nGo from PDF files to this:\n\n    {'date': (2014, 5, 7), 'invoice_number': '30064443', 'amount': 34.73, 'desc': 'Invoice 30064443 from QualityHosting', 'lines': [{'price': 42.0, 'desc': u'Small Business StandardExchange 2010\\nGrundgeb\\xfchr pro Einheit\\nDienst: OUDJQ_office\\n01.05.14-31.05.14\\n', 'pos': u'7', 'qty': 1.0}]}\n    {'date': (2014, 6, 4), 'invoice_number': 'EUVINS1-OF5-DE-120725895', 'amount': 35.24, 'desc': 'Invoice EUVINS1-OF5-DE-120725895 from Amazon EU'}\n    {'date': (2014, 8, 3), 'invoice_number': '42183017', 'amount': 4.11, 'desc': 'Invoice 42183017 from Amazon Web Services'}\n    {'date': (2015, 1, 28), 'invoice_number': '12429647', 'amount': 101.0, 'desc': 'Invoice 12429647 from Envato'}\n\n```mermaid\nflowchart LR\n\n    InvoiceFile[fa:fa-file-invoice Invoicefile\\n\\npdf\\nimage\\ntext] --> Input-module(Input Module\\n\\npdftotext\\ntext\\npdfminer\\npdfplumber\\ntesseract\\ngvision)\n\n    Input-module --> |Extracted Text| C{keyword\\nmatching}\n\n    Invoice-Templates[(fa:fa-file-lines Invoice Templates)] --> C{keyword\\nmatching}\n\n    C --> |Extracted Text + fa:fa-file-circle-check Template| E(Template Processing\\n apply options from template\\nremove accents, replaces etc...)\n\n    E --> |Optimized String|Plugins&Parsers(Call plugins + parsers)\n\n    subgraph Plugins&Parsers\n\n      direction BT\n\n        tables[fa:fa-table tables] ~~~ lines[fa:fa-grip-lines lines]\n\n        lines ~~~ regex[fa:fa-code regex]\n\n        regex ~~~ static[fa:fa-check static]\n\n \n\n    end\n\n    Plugins&Parsers --> |output| result[result\\nfa:fa-file-csv,\\njson,\\nXML]\n\n \n\n click Invoice-Templates https://github.com/invoice-x/invoice2data/blob/master/TUTORIAL.md\n\n click result https://github.com/invoice-x/invoice2data#usage\n\n click Input-module https://github.com/invoice-x/invoice2data#installation-of-input-modules\n\n click E https://github.com/invoice-x/invoice2data/blob/master/TUTORIAL.md#options\n\n click tables https://github.com/invoice-x/invoice2data/blob/master/TUTORIAL.md#tables\n\n click lines https://github.com/invoice-x/invoice2data/blob/master/TUTORIAL.md#lines\n\n click regex https://github.com/invoice-x/invoice2data/blob/master/TUTORIAL.md#regex\n\n click static https://github.com/invoice-x/invoice2data/blob/master/TUTORIAL.md#parser-static\n\n```\n\n## Installation\n\n1.  Install pdftotext\n\nIf possible get the latest\n[xpdf/poppler-utils](https://poppler.freedesktop.org/) version. It's\nincluded with macOS Homebrew, Debian and Ubuntu. Without it, `pdftotext`\nwon't parse tables in PDF correctly.\n\n2.  Install `invoice2data` using pip\n\n    pip install invoice2data\n\n### Installation of input modules\n\nAn [tesseract](https://github.com/tesseract-ocr/tessdoc/blob/main/FAQ.md#how-do-i-get-tesseract) wrapper is included in auto language mode. It will test your input files against the languages installed on your system. To use it tesseract and imagemagick needs to be installed.\ntesseract supports multiple OCR engine modes. By default the available engine installed on the system will be used.\n\nLanguages:\ntesseract-ocr recognize more than [100 languages](https://github.com/tesseract-ocr/tessdata)\nFor Linux users, you can often find packages that provide language packs:\n\n```\n# Display a list of all Tesseract language packs\napt-cache search tesseract-ocr\n\n# Debian/Ubuntu users\napt-get install tesseract-ocr-chi-sim  # Example: Install Chinese Simplified language pack\n\n# Arch Linux users\npacman -S tesseract-data-eng tesseract-data-deu # Example: Install the English and German language packs\n\n```\n\n## Usage\n\nBasic usage. Process PDF files and write result to CSV.\n\n- `invoice2data invoice.pdf`\n- `invoice2data invoice.txt`\n- `invoice2data *.pdf`\n\nChoose any of the following input readers:\n\n- pdftotext `invoice2data --input-reader pdftotext invoice.pdf`\n- pdftotext `invoice2data --input-reader text invoice.txt`\n- tesseract `invoice2data --input-reader tesseract invoice.pdf`\n- pdfminer.six `invoice2data --input-reader pdfminer invoice.pdf`\n- pdfplumber `invoice2data --input-reader pdfplumber invoice.pdf`\n- ocrmypdf `invoice2data --input-reader ocrmypdf invoice.pdf`\n- gvision `invoice2data --input-reader gvision invoice.pdf` (needs `GOOGLE_APPLICATION_CREDENTIALS` env var)\n\nChoose any of the following output formats:\n\n- csv `invoice2data --output-format csv invoice.pdf`\n- json `invoice2data --output-format json invoice.pdf`\n- xml `invoice2data --output-format xml invoice.pdf`\n\nSave output file with custom name or a specific folder\n\n`invoice2data --output-format csv --output-name myinvoices/invoices.csv invoice.pdf`\n\n**Note:** You must specify the `output-format` in order to create\n`output-name`\n\nSpecify folder with yml templates. (e.g. your suppliers)\n\n`invoice2data --template-folder ACME-templates invoice.pdf`\n\nOnly use your own templates and exclude built-ins\n\n`invoice2data --exclude-built-in-templates --template-folder ACME-templates invoice.pdf`\n\nProcesses a folder of invoices and copies renamed invoices to new\nfolder.\n\n`invoice2data --copy new_folder folder_with_invoices/*.pdf`\n\nProcesses a single file and dumps whole file for debugging (useful when\nadding new templates in templates.py)\n\n`invoice2data --debug my_invoice.pdf`\n\nRecognize test invoices: `invoice2data invoice2data/test/pdfs/* --debug`\n\n### Use as Python Library\n\nYou can easily add `invoice2data` to your own Python scripts as library.\n\n    from invoice2data import extract_data\n    result = extract_data('path/to/my/file.pdf')\n\nUsing in-house templates\n\n    from invoice2data import extract_data\n    from invoice2data.extract.loader import read_templates\n\n    templates = read_templates('/path/to/your/templates/')\n    result = extract_data(filename, templates=templates)\n\n\n## Template system\n\nSee `invoice2data/extract/templates` for existing templates. Just extend\nthe list to add your own. If deployed by a bigger organisation, there\nshould be an interface to edit templates for new suppliers. 80-20 rule.\nFor a short tutorial on how to add new templates, see [TUTORIAL.md](TUTORIAL.md).\n\nTemplates are based on Yaml or JSON. They define one or more keywords to find\nthe right template, one or more exclude_keywords to further narrow it down\nand regexp for fields to be extracted. They could also be a static value,\nlike the full company name.\n\nTemplate files are tried in alphabetical order.\n\nWe may extend them to feature options to be used during invoice\nprocessing.\n\nExample:\n\n    issuer: Amazon Web Services, Inc.\n    keywords:\n    - Amazon Web Services\n    exclude_keywords:\n    - San Jose\n    fields:\n      amount: TOTAL AMOUNT DUE ON.*\\$(\\d+\\.\\d+)\n      amount_untaxed: TOTAL AMOUNT DUE ON.*\\$(\\d+\\.\\d+)\n      date: Invoice Date:\\s+([a-zA-Z]+ \\d+ , \\d+)\n      invoice_number: Invoice Number:\\s+(\\d+)\n      partner_name: (Amazon Web Services, Inc\\.)\n    options:\n      remove_whitespace: false\n      currency: HKD\n      date_formats:\n        - '%d/%m/%Y'\n    lines:\n        start: Detail\n        end: \\* May include estimated US sales tax\n        first_line: ^    (?P<description>\\w+.*)\\$(?P<price_unit>\\d+\\.\\d+)\n        line: (.*)\\$(\\d+\\.\\d+)\n        skip_line: Note\n        last_line: VAT \\*\\*\n\nThe lines package has multiple settings:\n- start > The pattern where the lines begin. This is typically the header row of the table. This row is not included in the line matching.\n- end > The pattern denoting where the lines end. Typically some text at the very end or immediately below the table. Also not included in the line matching.\n- first_line > Optional. This is the primary line item for each entry.\n- line > If first_line is not provided, this will be used as the primary line pattern. If first_line is provided, this is the pattern for any sub-lines such as line item details.\n- skip_line > Optional. If first_line is passed, this pattern indicates which sub-lines will be skipped and their data not recorded. This is useful if tables span multiple pages and you need to skip over page numbers or headers that appear mid-table.\n- last_line > Optional. If first_line is passed, this pattern denotes the final line of the sub-lines and is included in the output data.\n\n:warning: Invoice2data uses a yaml templating system. The yaml templates are loaded with [pyyaml](https://github.com/yaml/pyyaml) which is a pure python implementation. (thus rather slow)\nAs an alternative json templates can be used. Which are natively better supported by python.\n\nThe performance with yaml templates can be greatly increased **10x** by using [libyaml](https://github.com/yaml/libyaml)\nIt can be installed on most distributions by:\n`sudo apt-get libyaml-dev`\n\n\n## Development\n\nIf you are interested in improving this project, have a look at our\n[developer guide](DEVELOP.md) to get you started quickly.\n\n## Roadmap and open tasks\n\n- integrate with online OCR?\n- try to 'guess' parameters for new invoice formats.\n- can apply machine learning to guess new parameters?\n- advanced table parsing with [camelot](https://github.com/camelot-dev/camelot)\n\n## Maintainers\n\n- [Manuel Riel](https://github.com/m3nu)\n- [Alexis de Lattre](https://github.com/alexis-via)\n- [bosd](https://github.com/bosd)\n\n## Contributors\n\n-   [Harshit Joshi](https://github.com/duskybomb): As Google Summer of\n    Code student.\n-   [Holger Brunn](https://github.com/hbrunn): Add support for parsing\n    invoice items.\n\n## Used By\n- Odoo, OCA module [account_invoice_import_invoice2data](https://github.com/OCA/edi)\n\n## Related Projects\n\n- [OCR-Invoice](https://github.com/robela/OCR-Invoice) (FOSS \\| C\\#)\n- [DeepLogic AI](https://deeplogicai.tech/case_list/automatic-key-information-extraction-business-documents/) (Commercial \\| SaaS)\n- [Docparser](https://docparser.com/) (Commercial \\| Web Service)\n- [A-PDF](http://www.a-pdf.com/data-extractor/index.htm) (Commercial)\n- [PDFdeconstruct](http://www.glyphandcog.com/PDFdeconstruct.html?g6)\n  (Commercial)\n- [CVision](http://www.cvisiontech.com/library/document-automation/forms-processing/extract-data-from-invoice.html)\n  (Commercial)\n",
    "bugtrack_url": null,
    "license": "MIT License",
    "summary": "Python parser to extract data from pdf invoice",
    "version": "0.4.5",
    "project_urls": {
        "Homepage": "https://github.com/invoice-x/invoice2data"
    },
    "split_keywords": [
        "pdf",
        "invoicing"
    ],
    "urls": [
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "e0718a294fbab7e86bf292aa35d55b120a210d7ce98154e230d208efb900491c",
                "md5": "080e7abbad4257c8a8786c3263c9d03d",
                "sha256": "bd309db210262e3667813c34cf22f29737b64bd8f5f0e1f1a77106b46309c552"
            },
            "downloads": -1,
            "filename": "invoice2data-0.4.5-py3-none-any.whl",
            "has_sig": false,
            "md5_digest": "080e7abbad4257c8a8786c3263c9d03d",
            "packagetype": "bdist_wheel",
            "python_version": "py3",
            "requires_python": null,
            "size": 149018,
            "upload_time": "2023-11-26T16:02:12",
            "upload_time_iso_8601": "2023-11-26T16:02:12.028832Z",
            "url": "https://files.pythonhosted.org/packages/e0/71/8a294fbab7e86bf292aa35d55b120a210d7ce98154e230d208efb900491c/invoice2data-0.4.5-py3-none-any.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "11a3e6d5080f27cb909560657720f0f919d40e605c9a6fff693580e78c5fbda3",
                "md5": "21350a12925fe4c556d911cc9ad8632b",
                "sha256": "fca780ebdae9bdca7a166275d7b9135717da2f82d7b56b47c89784e968fea64a"
            },
            "downloads": -1,
            "filename": "invoice2data-0.4.5.tar.gz",
            "has_sig": false,
            "md5_digest": "21350a12925fe4c556d911cc9ad8632b",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": null,
            "size": 76322,
            "upload_time": "2023-11-26T16:02:15",
            "upload_time_iso_8601": "2023-11-26T16:02:15.682930Z",
            "url": "https://files.pythonhosted.org/packages/11/a3/e6d5080f27cb909560657720f0f919d40e605c9a6fff693580e78c5fbda3/invoice2data-0.4.5.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2023-11-26 16:02:15",
    "github": true,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "github_user": "invoice-x",
    "github_project": "invoice2data",
    "travis_ci": false,
    "coveralls": false,
    "github_actions": true,
    "lcname": "invoice2data"
}

Manuel Riel