Name | parquet-to-excel JSON |
Version |
0.3.0
JSON |
| download |
home_page | None |
Summary | a crate to convert parquet file(s) to an/a excel/csv file with constant memory in rust |
upload_time | 2025-02-16 12:54:26 |
maintainer | None |
docs_url | None |
author | None |
requires_python | >=3.8 |
license | MIT |
keywords |
csv
xlsx
excel
parquet
convert
|
VCS |
|
bugtrack_url |
|
requirements |
No requirements were recorded.
|
Travis-CI |
No Travis.
|
coveralls test coverage |
No coveralls.
|
# parquet_to_excel
A tool to convert parquet file to an/a excel/csv file in rust with constant memory, both a single parquet file and a folder of parquet files are supported.
You can also use python or rust to call it. The python package name is parquet_to_excel too. you can install it by `pip install parquet_to_excel`. If you could not install this package correctly, you can try to install rust and maturin (`pip install maturin`) first. Then you can try again.
# Functions
1. parquet_file_to_csv: convert a single parquet file to a csv file
2. parquet_files_to_csv: convert a folder of parquet files to a csv file
1. parquet_file_to_xlsx: convert a single parquet file to an excel file
2. parquet_files_to_xlsx: convert a folder of parquet files to an excel file
# Python Examples
1. parquet to csv
```python
from parquet_to_excel import parquet_file_to_csv, parquet_files_to_csv
parquet_file_to_csv(r"D:\Projects\TornadoSrv\data\result\qid=160\part0.parquet", r"D:\Felix\Desktop\out1.csv", header_labels={"ddbm": "地点编码"})
parquet_files_to_csv(r"D:\Projects\TornadoSrv\data\result\qid=160", r"D:\Felix\Desktop\out2.csv", header_labels={"ddbm": "地点编码"})
```
2. parquet to xlsx
```python
from parquet_to_excel import parquet_file_to_xlsx, parquet_files_to_xlsx
# write all data into one sheet
parquet_file_to_xlsx(r"D:\Projects\TornadoSrv\data\result\qid=160\part0.parquet", r"D:\Felix\Desktop\out1.xlsx", sheet_name="data", header_labels={"ddbm": "地点编码"})
# write all data into different sheets by the value of column "qyztmc"
parquet_files_to_xlsx(r"D:\Projects\TornadoSrv\data\result\qid=160", r"D:\Felix\Desktop\out2.xlsx", sheet_column = "qyztmc", header_labels={"ddbm": "地点编码"})
```
Raw data
{
"_id": null,
"home_page": null,
"name": "parquet-to-excel",
"maintainer": null,
"docs_url": null,
"requires_python": ">=3.8",
"maintainer_email": null,
"keywords": "csv, xlsx, excel, parquet, convert",
"author": null,
"author_email": null,
"download_url": "https://files.pythonhosted.org/packages/e9/39/59041143648f6303f92893b35a2429f392d7de2eec53dad44e55acb6803f/parquet_to_excel-0.3.0.tar.gz",
"platform": null,
"description": "# parquet_to_excel\r\nA tool to convert parquet file to an/a excel/csv file in rust with constant memory, both a single parquet file and a folder of parquet files are supported.\r\nYou can also use python or rust to call it. The python package name is parquet_to_excel too. you can install it by `pip install parquet_to_excel`. If you could not install this package correctly, you can try to install rust and maturin (`pip install maturin`) first. Then you can try again.\r\n\r\n# Functions\r\n1. parquet_file_to_csv: convert a single parquet file to a csv file\r\n2. parquet_files_to_csv: convert a folder of parquet files to a csv file\r\n1. parquet_file_to_xlsx: convert a single parquet file to an excel file\r\n2. parquet_files_to_xlsx: convert a folder of parquet files to an excel file\r\n\r\n# Python Examples\r\n1. parquet to csv\r\n```python\r\nfrom parquet_to_excel import parquet_file_to_csv, parquet_files_to_csv\r\n\r\nparquet_file_to_csv(r\"D:\\Projects\\TornadoSrv\\data\\result\\qid=160\\part0.parquet\", r\"D:\\Felix\\Desktop\\out1.csv\", header_labels={\"ddbm\": \"\u5730\u70b9\u7f16\u7801\"})\r\n\r\nparquet_files_to_csv(r\"D:\\Projects\\TornadoSrv\\data\\result\\qid=160\", r\"D:\\Felix\\Desktop\\out2.csv\", header_labels={\"ddbm\": \"\u5730\u70b9\u7f16\u7801\"})\r\n```\r\n\r\n2. parquet to xlsx\r\n```python\r\nfrom parquet_to_excel import parquet_file_to_xlsx, parquet_files_to_xlsx\r\n\r\n# write all data into one sheet\r\nparquet_file_to_xlsx(r\"D:\\Projects\\TornadoSrv\\data\\result\\qid=160\\part0.parquet\", r\"D:\\Felix\\Desktop\\out1.xlsx\", sheet_name=\"data\", header_labels={\"ddbm\": \"\u5730\u70b9\u7f16\u7801\"})\r\n\r\n# write all data into different sheets by the value of column \"qyztmc\"\r\nparquet_files_to_xlsx(r\"D:\\Projects\\TornadoSrv\\data\\result\\qid=160\", r\"D:\\Felix\\Desktop\\out2.xlsx\", sheet_column = \"qyztmc\", header_labels={\"ddbm\": \"\u5730\u70b9\u7f16\u7801\"})\r\n```\r\n\n",
"bugtrack_url": null,
"license": "MIT",
"summary": "a crate to convert parquet file(s) to an/a excel/csv file with constant memory in rust",
"version": "0.3.0",
"project_urls": {
"Source Code": "https://gitee.com/sone92/paruqet-to-xlsx.git"
},
"split_keywords": [
"csv",
" xlsx",
" excel",
" parquet",
" convert"
],
"urls": [
{
"comment_text": null,
"digests": {
"blake2b_256": "eaab0a4fe4e7e462fb7d1b33ed6b16be0b17f739bcc67a48896c2ac8ceadd66d",
"md5": "f773e0fbad8713943ffd322a6b5639a3",
"sha256": "786fe690f250f87c0bf10bd01c10890eeda42dc807bf6919cc2d5ca6bd75d9fd"
},
"downloads": -1,
"filename": "parquet_to_excel-0.3.0-cp312-cp312-win_amd64.whl",
"has_sig": false,
"md5_digest": "f773e0fbad8713943ffd322a6b5639a3",
"packagetype": "bdist_wheel",
"python_version": "cp312",
"requires_python": ">=3.8",
"size": 6403247,
"upload_time": "2025-02-16T12:54:20",
"upload_time_iso_8601": "2025-02-16T12:54:20.360805Z",
"url": "https://files.pythonhosted.org/packages/ea/ab/0a4fe4e7e462fb7d1b33ed6b16be0b17f739bcc67a48896c2ac8ceadd66d/parquet_to_excel-0.3.0-cp312-cp312-win_amd64.whl",
"yanked": false,
"yanked_reason": null
},
{
"comment_text": null,
"digests": {
"blake2b_256": "e93959041143648f6303f92893b35a2429f392d7de2eec53dad44e55acb6803f",
"md5": "d2df3fd6dc58fce13e1de1802400b8c4",
"sha256": "c128f32b393ac04979bb0df89fd6770bdf3097122a8cc7d1150a45847e005f64"
},
"downloads": -1,
"filename": "parquet_to_excel-0.3.0.tar.gz",
"has_sig": false,
"md5_digest": "d2df3fd6dc58fce13e1de1802400b8c4",
"packagetype": "sdist",
"python_version": "source",
"requires_python": ">=3.8",
"size": 14454,
"upload_time": "2025-02-16T12:54:26",
"upload_time_iso_8601": "2025-02-16T12:54:26.305602Z",
"url": "https://files.pythonhosted.org/packages/e9/39/59041143648f6303f92893b35a2429f392d7de2eec53dad44e55acb6803f/parquet_to_excel-0.3.0.tar.gz",
"yanked": false,
"yanked_reason": null
}
],
"upload_time": "2025-02-16 12:54:26",
"github": false,
"gitlab": false,
"bitbucket": false,
"codeberg": false,
"lcname": "parquet-to-excel"
}