# Excel extractor
Extract excel form content into structured data.
## Usage
1. SECCF extraction: supplier export control classification Form/declaration
```python
from py_excel_form_extractor import extractor, go
company_names = extractor.CompanyNameList() # the company name which can be mentioned in the file
for company_name in ["Amazon", "Amazon Ltd"]:
company_names.append(company_name)
extr = extractor.make_seccf_extractor("Example.xlsx", company_names)
extraction = extr.extract()
# convert to JSON string
extr_json = extr.to_json()
```
## BUILD
1. Building the go binary
```bash
go build -o gobinary ./cmd/excelExtractor
```
2. Running the program without building the binary
```bash
go build -o ./bin/excel-extrator ./cmd/excelFormExtractor/main.go
```
3. Run the binary:
```bash
./bin/excel-extrator
```
## Local Python bindings generation and installation
```bash
pip3 install pybindgen wheel
gopy build --output=py_excel_form_extractor -vm=python3 ./pkg/*
RELEASE_VERSION=YOUR_UPDATED_PACKAGE_VERSION python3 setup.py bdist_wheel --force
# install wheel file
wheel_file=$(ls dist/*.whl | head -n1); pip3 install $wheel_file
```
Raw data
{
"_id": null,
"home_page": "https://github.com/adhadse/excelFormExtractor",
"name": "py-excel-form-extractor",
"maintainer": null,
"docs_url": null,
"requires_python": ">=3.8",
"maintainer_email": null,
"keywords": "go, golang, python, excel, xlsx, form, extractor",
"author": "Anurag Dhadse",
"author_email": "Anurag Dhadse <hello@adhadse.com>",
"download_url": null,
"platform": null,
"description": "# Excel extractor\nExtract excel form content into structured data.\n\n## Usage\n\n1. SECCF extraction: supplier export control classification Form/declaration\n\n```python\nfrom py_excel_form_extractor import extractor, go\n\ncompany_names = extractor.CompanyNameList() # the company name which can be mentioned in the file\nfor company_name in [\"Amazon\", \"Amazon Ltd\"]:\n company_names.append(company_name)\n\nextr = extractor.make_seccf_extractor(\"Example.xlsx\", company_names)\nextraction = extr.extract()\n\n# convert to JSON string\nextr_json = extr.to_json()\n```\n\n## BUILD\n\n1. Building the go binary\n```bash\ngo build -o gobinary ./cmd/excelExtractor\n```\n\n2. Running the program without building the binary\n```bash\ngo build -o ./bin/excel-extrator ./cmd/excelFormExtractor/main.go\n```\n3. Run the binary:\n```bash\n./bin/excel-extrator\n```\n\n## Local Python bindings generation and installation\n\n```bash\npip3 install pybindgen wheel\ngopy build --output=py_excel_form_extractor -vm=python3 ./pkg/*\nRELEASE_VERSION=YOUR_UPDATED_PACKAGE_VERSION python3 setup.py bdist_wheel --force\n\n# install wheel file\nwheel_file=$(ls dist/*.whl | head -n1); pip3 install $wheel_file\n```\n",
"bugtrack_url": null,
"license": "MIT",
"summary": "Extract excel form content into structured data",
"version": "0.3.41",
"project_urls": {
"Homepage": "https://github.com/adhadse/excelFormExtractor"
},
"split_keywords": [
"go",
" golang",
" python",
" excel",
" xlsx",
" form",
" extractor"
],
"urls": [
{
"comment_text": null,
"digests": {
"blake2b_256": "83da71cba22e2d85a9d29cb2b67084ef0e638fb97012dd5fe45188512815a9d7",
"md5": "92ea85735f0cd751b9f2ee2f19898d14",
"sha256": "ebc5d645729abfb451d54f27d80ffd33e26547320c6db8b24eb3ef240b88e3f2"
},
"downloads": -1,
"filename": "py_excel_form_extractor-0.3.41-cp311-cp311-macosx_11_0_arm64.whl",
"has_sig": false,
"md5_digest": "92ea85735f0cd751b9f2ee2f19898d14",
"packagetype": "bdist_wheel",
"python_version": "cp311",
"requires_python": ">=3.8",
"size": 3552539,
"upload_time": "2025-10-11T07:57:38",
"upload_time_iso_8601": "2025-10-11T07:57:38.149754Z",
"url": "https://files.pythonhosted.org/packages/83/da/71cba22e2d85a9d29cb2b67084ef0e638fb97012dd5fe45188512815a9d7/py_excel_form_extractor-0.3.41-cp311-cp311-macosx_11_0_arm64.whl",
"yanked": false,
"yanked_reason": null
},
{
"comment_text": null,
"digests": {
"blake2b_256": "885e4794500b578d8f7245ff3c090ebf6afc587e62540fb3a7f57833f8746197",
"md5": "7ed373e1639ec1a9106c38c5cbe8d248",
"sha256": "81023c07d1869f50bef2cb0d673d25067c08fae1a81d74325aaa949863d512e3"
},
"downloads": -1,
"filename": "py_excel_form_extractor-0.3.41-cp311-cp311-manylinux_2_17_aarch64.manylinux2014_aarch64.whl",
"has_sig": false,
"md5_digest": "7ed373e1639ec1a9106c38c5cbe8d248",
"packagetype": "bdist_wheel",
"python_version": "cp311",
"requires_python": ">=3.8",
"size": 5893722,
"upload_time": "2025-10-11T07:57:40",
"upload_time_iso_8601": "2025-10-11T07:57:40.295716Z",
"url": "https://files.pythonhosted.org/packages/88/5e/4794500b578d8f7245ff3c090ebf6afc587e62540fb3a7f57833f8746197/py_excel_form_extractor-0.3.41-cp311-cp311-manylinux_2_17_aarch64.manylinux2014_aarch64.whl",
"yanked": false,
"yanked_reason": null
},
{
"comment_text": null,
"digests": {
"blake2b_256": "f9a0b70f266ae10ca58f4a5d5d815951e3850858174c9b51c65b4ba0ae0605ba",
"md5": "d2213efe470d13aee6863b77df1c0421",
"sha256": "ccaba8ae23bb95f5b7b1b88c2e0b6bb72d331905fcade7fe0f09ebc98d8cd89f"
},
"downloads": -1,
"filename": "py_excel_form_extractor-0.3.41-cp311-cp311-manylinux_2_5_x86_64.manylinux1_x86_64.manylinux_2_17_x86_64.manylinux2014_x86_64.whl",
"has_sig": false,
"md5_digest": "d2213efe470d13aee6863b77df1c0421",
"packagetype": "bdist_wheel",
"python_version": "cp311",
"requires_python": ">=3.8",
"size": 6271621,
"upload_time": "2025-10-11T07:57:42",
"upload_time_iso_8601": "2025-10-11T07:57:42.133305Z",
"url": "https://files.pythonhosted.org/packages/f9/a0/b70f266ae10ca58f4a5d5d815951e3850858174c9b51c65b4ba0ae0605ba/py_excel_form_extractor-0.3.41-cp311-cp311-manylinux_2_5_x86_64.manylinux1_x86_64.manylinux_2_17_x86_64.manylinux2014_x86_64.whl",
"yanked": false,
"yanked_reason": null
},
{
"comment_text": null,
"digests": {
"blake2b_256": "ec03a964643654460335d20a223d6dfd1bdeea6267e5029d50b0954bcbc32610",
"md5": "306b90871660d8b0668c0e9cb36f2f74",
"sha256": "fb1810ae2898e7618e021ac065b7bb5ba5208787eaf31b8219896a7c7934703b"
},
"downloads": -1,
"filename": "py_excel_form_extractor-0.3.41-cp312-cp312-macosx_11_0_arm64.whl",
"has_sig": false,
"md5_digest": "306b90871660d8b0668c0e9cb36f2f74",
"packagetype": "bdist_wheel",
"python_version": "cp312",
"requires_python": ">=3.8",
"size": 3551134,
"upload_time": "2025-10-11T07:57:43",
"upload_time_iso_8601": "2025-10-11T07:57:43.881825Z",
"url": "https://files.pythonhosted.org/packages/ec/03/a964643654460335d20a223d6dfd1bdeea6267e5029d50b0954bcbc32610/py_excel_form_extractor-0.3.41-cp312-cp312-macosx_11_0_arm64.whl",
"yanked": false,
"yanked_reason": null
},
{
"comment_text": null,
"digests": {
"blake2b_256": "f9711928e26df1c2a4242720afed72b22378bb081bc1a00a3cb63c751832559c",
"md5": "66ddb701ce3a0a5345dbeb8b47f54ad7",
"sha256": "b630b3479c574fea4aa8a8d92f296dfca3598477e557d4679849bcea1fa4ae10"
},
"downloads": -1,
"filename": "py_excel_form_extractor-0.3.41-cp312-cp312-manylinux_2_17_aarch64.manylinux2014_aarch64.whl",
"has_sig": false,
"md5_digest": "66ddb701ce3a0a5345dbeb8b47f54ad7",
"packagetype": "bdist_wheel",
"python_version": "cp312",
"requires_python": ">=3.8",
"size": 5891819,
"upload_time": "2025-10-11T07:57:45",
"upload_time_iso_8601": "2025-10-11T07:57:45.309890Z",
"url": "https://files.pythonhosted.org/packages/f9/71/1928e26df1c2a4242720afed72b22378bb081bc1a00a3cb63c751832559c/py_excel_form_extractor-0.3.41-cp312-cp312-manylinux_2_17_aarch64.manylinux2014_aarch64.whl",
"yanked": false,
"yanked_reason": null
},
{
"comment_text": null,
"digests": {
"blake2b_256": "e186d25f68cf241b274cff1a772af60d750e032f7d4c323c99c979ddbc52e097",
"md5": "2ebbad40f6316e8173608488bcef7393",
"sha256": "6268bc8fe4b6605ff3170107bf1d46c4bce78c0a32ea98f96bd93c1b0d84b6bf"
},
"downloads": -1,
"filename": "py_excel_form_extractor-0.3.41-cp312-cp312-manylinux_2_5_x86_64.manylinux1_x86_64.manylinux_2_17_x86_64.manylinux2014_x86_64.whl",
"has_sig": false,
"md5_digest": "2ebbad40f6316e8173608488bcef7393",
"packagetype": "bdist_wheel",
"python_version": "cp312",
"requires_python": ">=3.8",
"size": 6270384,
"upload_time": "2025-10-11T07:57:46",
"upload_time_iso_8601": "2025-10-11T07:57:46.688918Z",
"url": "https://files.pythonhosted.org/packages/e1/86/d25f68cf241b274cff1a772af60d750e032f7d4c323c99c979ddbc52e097/py_excel_form_extractor-0.3.41-cp312-cp312-manylinux_2_5_x86_64.manylinux1_x86_64.manylinux_2_17_x86_64.manylinux2014_x86_64.whl",
"yanked": false,
"yanked_reason": null
},
{
"comment_text": null,
"digests": {
"blake2b_256": "60ed216ed3eca0e2e1e59e0980453bf8fc5f139424a9e72c5e3f4a50b933f02c",
"md5": "0280ee67fbd7fe4d17432a7f047b4180",
"sha256": "fc2df3e24c3070abe711c8286a0f19f0a821ecc5d786555b36da742da5884f0b"
},
"downloads": -1,
"filename": "py_excel_form_extractor-0.3.41-cp313-cp313-manylinux_2_17_aarch64.manylinux2014_aarch64.whl",
"has_sig": false,
"md5_digest": "0280ee67fbd7fe4d17432a7f047b4180",
"packagetype": "bdist_wheel",
"python_version": "cp313",
"requires_python": ">=3.8",
"size": 5891498,
"upload_time": "2025-10-11T07:57:48",
"upload_time_iso_8601": "2025-10-11T07:57:48.037048Z",
"url": "https://files.pythonhosted.org/packages/60/ed/216ed3eca0e2e1e59e0980453bf8fc5f139424a9e72c5e3f4a50b933f02c/py_excel_form_extractor-0.3.41-cp313-cp313-manylinux_2_17_aarch64.manylinux2014_aarch64.whl",
"yanked": false,
"yanked_reason": null
},
{
"comment_text": null,
"digests": {
"blake2b_256": "6810397ab6a1fe9fb33188754d9447baa66e96bc3d6d7369554a203f7f53bf3d",
"md5": "e1cac1866c5862f56e755d88d2dbf550",
"sha256": "2aebe71655031604a910d1f0148573cdf3f2e628a421291ed9854e9334a9d3d1"
},
"downloads": -1,
"filename": "py_excel_form_extractor-0.3.41-cp313-cp313-manylinux_2_5_x86_64.manylinux1_x86_64.manylinux_2_17_x86_64.manylinux2014_x86_64.whl",
"has_sig": false,
"md5_digest": "e1cac1866c5862f56e755d88d2dbf550",
"packagetype": "bdist_wheel",
"python_version": "cp313",
"requires_python": ">=3.8",
"size": 6270510,
"upload_time": "2025-10-11T07:57:50",
"upload_time_iso_8601": "2025-10-11T07:57:50.261164Z",
"url": "https://files.pythonhosted.org/packages/68/10/397ab6a1fe9fb33188754d9447baa66e96bc3d6d7369554a203f7f53bf3d/py_excel_form_extractor-0.3.41-cp313-cp313-manylinux_2_5_x86_64.manylinux1_x86_64.manylinux_2_17_x86_64.manylinux2014_x86_64.whl",
"yanked": false,
"yanked_reason": null
}
],
"upload_time": "2025-10-11 07:57:38",
"github": true,
"gitlab": false,
"bitbucket": false,
"codeberg": false,
"github_user": "adhadse",
"github_project": "excelFormExtractor",
"travis_ci": false,
"coveralls": false,
"github_actions": true,
"lcname": "py-excel-form-extractor"
}