py-excel-form-extractor


Name: py-excel-form-extractor
Version: 0.3.33
Home page: https://github.com/adhadse/excelFormExtractor
Summary: Extract excel form content into structured data
Upload time: 2024-12-19 09:43:38
Maintainer: None
Docs URL: None
Author: Anurag Dhadse
Requires Python: >=3.8
License: MIT
Keywords: go, golang, python, excel, xlsx, form, extractor
Requirements: No requirements were recorded.
Travis-CI: No Travis.
Coveralls test coverage: No coveralls.

# Excel extractor
Extract Excel form content into structured data.

## Usage

1. SECCF extraction: Supplier Export Control Classification Form/declaration

```python
from py_excel_form_extractor import extractor, go

company_names = extractor.CompanyNameList()  # company names that may appear in the file
for company_name in ["Amazon", "Amazon Ltd"]:
    company_names.append(company_name)

extr = extractor.make_seccf_extractor("Example.xlsx", company_names)
extraction = extr.extract()

# convert to JSON string
extr_json = extr.to_json()
```
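
Because `to_json()` returns a JSON string, the result can be passed to Python's standard `json` module for inspection or storage. The following is a minimal sketch, assuming the string is a valid JSON document; the helper name `save_extraction_json` is hypothetical and not part of the package, and nothing here depends on the extractor's actual output schema.

```python
import json
from pathlib import Path
from typing import Any


def save_extraction_json(extr_json: str, out_path: str) -> Any:
    """Parse a JSON string, write it pretty-printed to out_path, return the parsed data.

    Hypothetical helper for illustration; it is not part of py_excel_form_extractor.
    """
    # json.loads raises json.JSONDecodeError if the string is not valid JSON.
    data = json.loads(extr_json)
    # ensure_ascii=False keeps non-ASCII characters (e.g. company names) readable.
    Path(out_path).write_text(
        json.dumps(data, indent=2, ensure_ascii=False), encoding="utf-8"
    )
    return data
```

With the usage example above, this could be called as `save_extraction_json(extr_json, "extraction.json")`.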

## Build

1. Building the Go binary
```bash
go build -o gobinary ./cmd/excelExtractor
```

2. Running the program without building the binary
```bash
go run ./cmd/excelFormExtractor/main.go
```
3. Building into `./bin` and running the binary:
```bash
go build -o ./bin/excel-extractor ./cmd/excelFormExtractor/main.go
./bin/excel-extractor
```

## Local Python bindings generation and installation

```bash
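# install the build prerequisites
pip3 install pybindgen wheel
# generate the Python bindings from the Go packages
gopy build --output=py_excel_form_extractor -vm=python3 ./pkg/*
# build the wheel; replace the placeholder with the version being released
RELEASE_VERSION=YOUR_UPDATED_PACKAGE_VERSION python3 setup.py bdist_wheel --force

# install the first wheel file found in dist/
wheel_file=$(ls dist/*.whl | head -n1); pip3 install "$wheel_file"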
```
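
After installing the wheel, a quick check that the bindings can be found by the import system helps catch a failed or mismatched install early. This is a minimal sketch using only the standard library; the helper name `is_importable` is hypothetical, and the module name `py_excel_form_extractor` is taken from the usage example.

```python
import importlib.util


def is_importable(module_name: str) -> bool:
    """Return True if the top-level module can be located by the import system."""
    # find_spec returns None when no module with this name is found.
    return importlib.util.find_spec(module_name) is not None


if __name__ == "__main__":
    name = "py_excel_form_extractor"
    print(f"{name} importable: {is_importable(name)}")
```

The check only locates the module; it does not load the compiled Go extension, so an actual `import` remains the definitive test.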

            

Raw data

            {
    "_id": null,
    "home_page": "https://github.com/adhadse/excelFormExtractor",
    "name": "py-excel-form-extractor",
    "maintainer": null,
    "docs_url": null,
    "requires_python": ">=3.8",
    "maintainer_email": null,
    "keywords": "go, golang, python, excel, xlsx, form, extractor",
    "author": "Anurag Dhadse",
    "author_email": "Anurag Dhadse <hello@adhadse.com>",
    "download_url": null,
    "platform": null,
    "description": "# Excel extractor\nExtract excel form content into structured data.\n\n## Usage\n\n1. SECCF extraction: supplier export control classification Form/declaration\n\n```python\nfrom py_excel_form_extractor import extractor, go\n\ncompany_names = extractor.CompanyNameList()  # the company name which can be mentioned in the file\nfor company_name in [\"Amazon\", \"Amazon Ltd\"]:\n    company_names.append(company_name)\n\nextr = extractor.make_seccf_extractor(\"Example.xlsx\", company_names)\nextraction = extr.extract()\n\n# convert to JSON string\nextr_json = extr.to_json()\n```\n\n## BUILD\n\n1. Building the go binary\n```bash\ngo build -o gobinary ./cmd/excelExtractor\n```\n\n2. Running the program without building the binary\n```bash\ngo build -o ./bin/excel-extrator ./cmd/excelFormExtractor/main.go\n```\n3. Run the binary:\n```bash\n./bin/excel-extrator\n```\n\n## Local Python bindings generation and installation\n\n```bash\npip3 install pybindgen wheel\ngopy build --output=py_excel_form_extractor -vm=python3 ./pkg/*\nRELEASE_VERSION=YOUR_UPDATED_PACKAGE_VERSION python3 setup.py bdist_wheel --force\n\n# install wheel file\nwheel_file=$(ls dist/*.whl | head -n1); pip3 install $wheel_file\n```\n",
    "bugtrack_url": null,
    "license": "MIT",
    "summary": "Extract excel form content into structured data",
    "version": "0.3.33",
    "project_urls": {
        "Homepage": "https://github.com/adhadse/excelFormExtractor"
    },
    "split_keywords": [
        "go",
        " golang",
        " python",
        " excel",
        " xlsx",
        " form",
        " extractor"
    ],
    "urls": [
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "3037ca230d97b000a6b4432e6fafc89bcbbeec0f17f1803e238e9b1409bb797b",
                "md5": "13b3a9d0f4e8ad4c1faf3d91464aedd0",
                "sha256": "f21a4347629f6c80a2cacfd450af4160dd24c2b4b77754228ae94e76064fdbb7"
            },
            "downloads": -1,
            "filename": "py_excel_form_extractor-0.3.33-cp311-cp311-manylinux_2_17_aarch64.manylinux2014_aarch64.whl",
            "has_sig": false,
            "md5_digest": "13b3a9d0f4e8ad4c1faf3d91464aedd0",
            "packagetype": "bdist_wheel",
            "python_version": "cp311",
            "requires_python": ">=3.8",
            "size": 5893886,
            "upload_time": "2024-12-19T09:43:38",
            "upload_time_iso_8601": "2024-12-19T09:43:38.971854Z",
            "url": "https://files.pythonhosted.org/packages/30/37/ca230d97b000a6b4432e6fafc89bcbbeec0f17f1803e238e9b1409bb797b/py_excel_form_extractor-0.3.33-cp311-cp311-manylinux_2_17_aarch64.manylinux2014_aarch64.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "0e2b8da096dba2e3cd6f347825b513eb0259ae1f7dec06bd3602a337c8b6d669",
                "md5": "1e0c6b38483a62e0b25ab5121c790094",
                "sha256": "4ec54fc9feaa28447d82d8ab0b8a6e4b7bcb6c3a6e7716ad99340b6e1412179c"
            },
            "downloads": -1,
            "filename": "py_excel_form_extractor-0.3.33-cp311-cp311-manylinux_2_5_x86_64.manylinux1_x86_64.manylinux_2_17_x86_64.manylinux2014_x86_64.whl",
            "has_sig": false,
            "md5_digest": "1e0c6b38483a62e0b25ab5121c790094",
            "packagetype": "bdist_wheel",
            "python_version": "cp311",
            "requires_python": ">=3.8",
            "size": 6271210,
            "upload_time": "2024-12-19T09:43:43",
            "upload_time_iso_8601": "2024-12-19T09:43:43.037494Z",
            "url": "https://files.pythonhosted.org/packages/0e/2b/8da096dba2e3cd6f347825b513eb0259ae1f7dec06bd3602a337c8b6d669/py_excel_form_extractor-0.3.33-cp311-cp311-manylinux_2_5_x86_64.manylinux1_x86_64.manylinux_2_17_x86_64.manylinux2014_x86_64.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "e5dc71391794448c3feec44a646dd8e550e5e2068c9d09f6f20e0b19c3e44162",
                "md5": "a2bdafee6e710a5451c3fab192d38ece",
                "sha256": "5daab385bb91735370791b8e444ae3072198dc6396a9874081aec02962b284d0"
            },
            "downloads": -1,
            "filename": "py_excel_form_extractor-0.3.33-cp312-cp312-manylinux_2_17_aarch64.manylinux2014_aarch64.whl",
            "has_sig": false,
            "md5_digest": "a2bdafee6e710a5451c3fab192d38ece",
            "packagetype": "bdist_wheel",
            "python_version": "cp312",
            "requires_python": ">=3.8",
            "size": 5891438,
            "upload_time": "2024-12-19T09:43:45",
            "upload_time_iso_8601": "2024-12-19T09:43:45.472114Z",
            "url": "https://files.pythonhosted.org/packages/e5/dc/71391794448c3feec44a646dd8e550e5e2068c9d09f6f20e0b19c3e44162/py_excel_form_extractor-0.3.33-cp312-cp312-manylinux_2_17_aarch64.manylinux2014_aarch64.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "b35aba268cd729b976e30a7f9dd5825c1bef5d5fc3a05a5f84808f1501a100ef",
                "md5": "672f65cf109db12b2232a6f70a9ec1a8",
                "sha256": "ee59d715d75fe9f79642d5b42b98e83875c69149ff2eaeb83e7965cf65996f53"
            },
            "downloads": -1,
            "filename": "py_excel_form_extractor-0.3.33-cp312-cp312-manylinux_2_5_x86_64.manylinux1_x86_64.manylinux_2_17_x86_64.manylinux2014_x86_64.whl",
            "has_sig": false,
            "md5_digest": "672f65cf109db12b2232a6f70a9ec1a8",
            "packagetype": "bdist_wheel",
            "python_version": "cp312",
            "requires_python": ">=3.8",
            "size": 6270378,
            "upload_time": "2024-12-19T09:43:48",
            "upload_time_iso_8601": "2024-12-19T09:43:48.761716Z",
            "url": "https://files.pythonhosted.org/packages/b3/5a/ba268cd729b976e30a7f9dd5825c1bef5d5fc3a05a5f84808f1501a100ef/py_excel_form_extractor-0.3.33-cp312-cp312-manylinux_2_5_x86_64.manylinux1_x86_64.manylinux_2_17_x86_64.manylinux2014_x86_64.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "feeaee4466640fffd1303bba72bd4ff37b028be832972fcf2529b947588b9ba3",
                "md5": "16840d8029953e607a64f37d8ed93471",
                "sha256": "cd469a512b787e44bfcbc9f1ea0c7de313018db667b35f3da2c87ed9750d288c"
            },
            "downloads": -1,
            "filename": "py_excel_form_extractor-0.3.33-cp313-cp313-manylinux_2_17_aarch64.manylinux2014_aarch64.whl",
            "has_sig": false,
            "md5_digest": "16840d8029953e607a64f37d8ed93471",
            "packagetype": "bdist_wheel",
            "python_version": "cp313",
            "requires_python": ">=3.8",
            "size": 5891960,
            "upload_time": "2024-12-19T09:43:51",
            "upload_time_iso_8601": "2024-12-19T09:43:51.662184Z",
            "url": "https://files.pythonhosted.org/packages/fe/ea/ee4466640fffd1303bba72bd4ff37b028be832972fcf2529b947588b9ba3/py_excel_form_extractor-0.3.33-cp313-cp313-manylinux_2_17_aarch64.manylinux2014_aarch64.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "077ab9417c2405c7a1d80475bccf6d4c7b3571951d1d8424408a63db8fea12cb",
                "md5": "2ee227f306da0872298ee9e94da665c9",
                "sha256": "55b2b4f065796dc35858a10802f4fe54b236380eba34843a5388cf716c22d947"
            },
            "downloads": -1,
            "filename": "py_excel_form_extractor-0.3.33-cp313-cp313-manylinux_2_5_x86_64.manylinux1_x86_64.manylinux_2_17_x86_64.manylinux2014_x86_64.whl",
            "has_sig": false,
            "md5_digest": "2ee227f306da0872298ee9e94da665c9",
            "packagetype": "bdist_wheel",
            "python_version": "cp313",
            "requires_python": ">=3.8",
            "size": 6270361,
            "upload_time": "2024-12-19T09:43:54",
            "upload_time_iso_8601": "2024-12-19T09:43:54.394834Z",
            "url": "https://files.pythonhosted.org/packages/07/7a/b9417c2405c7a1d80475bccf6d4c7b3571951d1d8424408a63db8fea12cb/py_excel_form_extractor-0.3.33-cp313-cp313-manylinux_2_5_x86_64.manylinux1_x86_64.manylinux_2_17_x86_64.manylinux2014_x86_64.whl",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2024-12-19 09:43:38",
    "github": true,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "github_user": "adhadse",
    "github_project": "excelFormExtractor",
    "travis_ci": false,
    "coveralls": false,
    "github_actions": true,
    "lcname": "py-excel-form-extractor"
}
        