# Excel extractor
Extract excel form content into structured data.
## Usage
1. SECCF extraction: supplier export control classification Form/declaration
```python
from py_excel_form_extractor import extractor, go
company_names = extractor.CompanyNameList() # the company name which can be mentioned in the file
for company_name in ["Amazon", "Amazon Ltd"]:
company_names.append(company_name)
extr = extractor.make_seccf_extractor("Example.xlsx", company_names)
extraction = extr.extract()
# convert to JSON string
extr_json = extr.to_json()
```
## BUILD
1. Building the go binary
```bash
go build -o gobinary ./cmd/excelExtractor
```
2. Running the program without building the binary
```bash
go build -o ./bin/excel-extrator ./cmd/excelFormExtractor/main.go
```
3. Run the binary:
```bash
./bin/excel-extrator
```
## Local Python bindings generation and installation
```bash
pip3 install pybindgen wheel
gopy build --output=py_excel_form_extractor -vm=python3 ./pkg/*
RELEASE_VERSION=YOUR_UPDATED_PACKAGE_VERSION python3 setup.py bdist_wheel --force
# install wheel file
wheel_file=$(ls dist/*.whl | head -n1); pip3 install $wheel_file
```
Raw data
{
"_id": null,
"home_page": "https://github.com/adhadse/excelFormExtractor",
"name": "py-excel-form-extractor",
"maintainer": null,
"docs_url": null,
"requires_python": ">=3.8",
"maintainer_email": null,
"keywords": "go, golang, python, excel, xlsx, form, extractor",
"author": "Anurag Dhadse",
"author_email": "Anurag Dhadse <hello@adhadse.com>",
"download_url": null,
"platform": null,
"description": "# Excel extractor\nExtract excel form content into structured data.\n\n## Usage\n\n1. SECCF extraction: supplier export control classification Form/declaration\n\n```python\nfrom py_excel_form_extractor import extractor, go\n\ncompany_names = extractor.CompanyNameList() # the company name which can be mentioned in the file\nfor company_name in [\"Amazon\", \"Amazon Ltd\"]:\n company_names.append(company_name)\n\nextr = extractor.make_seccf_extractor(\"Example.xlsx\", company_names)\nextraction = extr.extract()\n\n# convert to JSON string\nextr_json = extr.to_json()\n```\n\n## BUILD\n\n1. Building the go binary\n```bash\ngo build -o gobinary ./cmd/excelExtractor\n```\n\n2. Running the program without building the binary\n```bash\ngo build -o ./bin/excel-extrator ./cmd/excelFormExtractor/main.go\n```\n3. Run the binary:\n```bash\n./bin/excel-extrator\n```\n\n## Local Python bindings generation and installation\n\n```bash\npip3 install pybindgen wheel\ngopy build --output=py_excel_form_extractor -vm=python3 ./pkg/*\nRELEASE_VERSION=YOUR_UPDATED_PACKAGE_VERSION python3 setup.py bdist_wheel --force\n\n# install wheel file\nwheel_file=$(ls dist/*.whl | head -n1); pip3 install $wheel_file\n```\n",
"bugtrack_url": null,
"license": "MIT",
"summary": "Extract excel form content into structured data",
"version": "0.3.33",
"project_urls": {
"Homepage": "https://github.com/adhadse/excelFormExtractor"
},
"split_keywords": [
"go",
" golang",
" python",
" excel",
" xlsx",
" form",
" extractor"
],
"urls": [
{
"comment_text": "",
"digests": {
"blake2b_256": "3037ca230d97b000a6b4432e6fafc89bcbbeec0f17f1803e238e9b1409bb797b",
"md5": "13b3a9d0f4e8ad4c1faf3d91464aedd0",
"sha256": "f21a4347629f6c80a2cacfd450af4160dd24c2b4b77754228ae94e76064fdbb7"
},
"downloads": -1,
"filename": "py_excel_form_extractor-0.3.33-cp311-cp311-manylinux_2_17_aarch64.manylinux2014_aarch64.whl",
"has_sig": false,
"md5_digest": "13b3a9d0f4e8ad4c1faf3d91464aedd0",
"packagetype": "bdist_wheel",
"python_version": "cp311",
"requires_python": ">=3.8",
"size": 5893886,
"upload_time": "2024-12-19T09:43:38",
"upload_time_iso_8601": "2024-12-19T09:43:38.971854Z",
"url": "https://files.pythonhosted.org/packages/30/37/ca230d97b000a6b4432e6fafc89bcbbeec0f17f1803e238e9b1409bb797b/py_excel_form_extractor-0.3.33-cp311-cp311-manylinux_2_17_aarch64.manylinux2014_aarch64.whl",
"yanked": false,
"yanked_reason": null
},
{
"comment_text": "",
"digests": {
"blake2b_256": "0e2b8da096dba2e3cd6f347825b513eb0259ae1f7dec06bd3602a337c8b6d669",
"md5": "1e0c6b38483a62e0b25ab5121c790094",
"sha256": "4ec54fc9feaa28447d82d8ab0b8a6e4b7bcb6c3a6e7716ad99340b6e1412179c"
},
"downloads": -1,
"filename": "py_excel_form_extractor-0.3.33-cp311-cp311-manylinux_2_5_x86_64.manylinux1_x86_64.manylinux_2_17_x86_64.manylinux2014_x86_64.whl",
"has_sig": false,
"md5_digest": "1e0c6b38483a62e0b25ab5121c790094",
"packagetype": "bdist_wheel",
"python_version": "cp311",
"requires_python": ">=3.8",
"size": 6271210,
"upload_time": "2024-12-19T09:43:43",
"upload_time_iso_8601": "2024-12-19T09:43:43.037494Z",
"url": "https://files.pythonhosted.org/packages/0e/2b/8da096dba2e3cd6f347825b513eb0259ae1f7dec06bd3602a337c8b6d669/py_excel_form_extractor-0.3.33-cp311-cp311-manylinux_2_5_x86_64.manylinux1_x86_64.manylinux_2_17_x86_64.manylinux2014_x86_64.whl",
"yanked": false,
"yanked_reason": null
},
{
"comment_text": "",
"digests": {
"blake2b_256": "e5dc71391794448c3feec44a646dd8e550e5e2068c9d09f6f20e0b19c3e44162",
"md5": "a2bdafee6e710a5451c3fab192d38ece",
"sha256": "5daab385bb91735370791b8e444ae3072198dc6396a9874081aec02962b284d0"
},
"downloads": -1,
"filename": "py_excel_form_extractor-0.3.33-cp312-cp312-manylinux_2_17_aarch64.manylinux2014_aarch64.whl",
"has_sig": false,
"md5_digest": "a2bdafee6e710a5451c3fab192d38ece",
"packagetype": "bdist_wheel",
"python_version": "cp312",
"requires_python": ">=3.8",
"size": 5891438,
"upload_time": "2024-12-19T09:43:45",
"upload_time_iso_8601": "2024-12-19T09:43:45.472114Z",
"url": "https://files.pythonhosted.org/packages/e5/dc/71391794448c3feec44a646dd8e550e5e2068c9d09f6f20e0b19c3e44162/py_excel_form_extractor-0.3.33-cp312-cp312-manylinux_2_17_aarch64.manylinux2014_aarch64.whl",
"yanked": false,
"yanked_reason": null
},
{
"comment_text": "",
"digests": {
"blake2b_256": "b35aba268cd729b976e30a7f9dd5825c1bef5d5fc3a05a5f84808f1501a100ef",
"md5": "672f65cf109db12b2232a6f70a9ec1a8",
"sha256": "ee59d715d75fe9f79642d5b42b98e83875c69149ff2eaeb83e7965cf65996f53"
},
"downloads": -1,
"filename": "py_excel_form_extractor-0.3.33-cp312-cp312-manylinux_2_5_x86_64.manylinux1_x86_64.manylinux_2_17_x86_64.manylinux2014_x86_64.whl",
"has_sig": false,
"md5_digest": "672f65cf109db12b2232a6f70a9ec1a8",
"packagetype": "bdist_wheel",
"python_version": "cp312",
"requires_python": ">=3.8",
"size": 6270378,
"upload_time": "2024-12-19T09:43:48",
"upload_time_iso_8601": "2024-12-19T09:43:48.761716Z",
"url": "https://files.pythonhosted.org/packages/b3/5a/ba268cd729b976e30a7f9dd5825c1bef5d5fc3a05a5f84808f1501a100ef/py_excel_form_extractor-0.3.33-cp312-cp312-manylinux_2_5_x86_64.manylinux1_x86_64.manylinux_2_17_x86_64.manylinux2014_x86_64.whl",
"yanked": false,
"yanked_reason": null
},
{
"comment_text": "",
"digests": {
"blake2b_256": "feeaee4466640fffd1303bba72bd4ff37b028be832972fcf2529b947588b9ba3",
"md5": "16840d8029953e607a64f37d8ed93471",
"sha256": "cd469a512b787e44bfcbc9f1ea0c7de313018db667b35f3da2c87ed9750d288c"
},
"downloads": -1,
"filename": "py_excel_form_extractor-0.3.33-cp313-cp313-manylinux_2_17_aarch64.manylinux2014_aarch64.whl",
"has_sig": false,
"md5_digest": "16840d8029953e607a64f37d8ed93471",
"packagetype": "bdist_wheel",
"python_version": "cp313",
"requires_python": ">=3.8",
"size": 5891960,
"upload_time": "2024-12-19T09:43:51",
"upload_time_iso_8601": "2024-12-19T09:43:51.662184Z",
"url": "https://files.pythonhosted.org/packages/fe/ea/ee4466640fffd1303bba72bd4ff37b028be832972fcf2529b947588b9ba3/py_excel_form_extractor-0.3.33-cp313-cp313-manylinux_2_17_aarch64.manylinux2014_aarch64.whl",
"yanked": false,
"yanked_reason": null
},
{
"comment_text": "",
"digests": {
"blake2b_256": "077ab9417c2405c7a1d80475bccf6d4c7b3571951d1d8424408a63db8fea12cb",
"md5": "2ee227f306da0872298ee9e94da665c9",
"sha256": "55b2b4f065796dc35858a10802f4fe54b236380eba34843a5388cf716c22d947"
},
"downloads": -1,
"filename": "py_excel_form_extractor-0.3.33-cp313-cp313-manylinux_2_5_x86_64.manylinux1_x86_64.manylinux_2_17_x86_64.manylinux2014_x86_64.whl",
"has_sig": false,
"md5_digest": "2ee227f306da0872298ee9e94da665c9",
"packagetype": "bdist_wheel",
"python_version": "cp313",
"requires_python": ">=3.8",
"size": 6270361,
"upload_time": "2024-12-19T09:43:54",
"upload_time_iso_8601": "2024-12-19T09:43:54.394834Z",
"url": "https://files.pythonhosted.org/packages/07/7a/b9417c2405c7a1d80475bccf6d4c7b3571951d1d8424408a63db8fea12cb/py_excel_form_extractor-0.3.33-cp313-cp313-manylinux_2_5_x86_64.manylinux1_x86_64.manylinux_2_17_x86_64.manylinux2014_x86_64.whl",
"yanked": false,
"yanked_reason": null
}
],
"upload_time": "2024-12-19 09:43:38",
"github": true,
"gitlab": false,
"bitbucket": false,
"codeberg": false,
"github_user": "adhadse",
"github_project": "excelFormExtractor",
"travis_ci": false,
"coveralls": false,
"github_actions": true,
"lcname": "py-excel-form-extractor"
}