Name | parquet-to-excel |
Version | 0.5.0 |
home_page | None |
Summary | A Rust crate that converts Parquet file(s) to an Excel or CSV file with constant memory |
upload_time | 2025-07-21 06:40:36 |
maintainer | None |
docs_url | None |
author | None |
requires_python | >=3.8 |
license | MIT |
keywords | csv, xlsx, excel, parquet, convert |
requirements | No requirements were recorded. |
# parquet_to_excel
A tool written in Rust that converts Parquet files to an Excel (xlsx) or CSV file with constant memory. Both a single Parquet file and a folder of Parquet files are supported.
It can be called from Python or Rust. The Python package is also named parquet_to_excel and can be installed with `pip install parquet_to_excel`. If the package does not install correctly, install Rust and maturin (`pip install maturin`) first, then try again.
# Functions
1. parquet_file_to_csv: convert a single parquet file to a csv file
2. parquet_files_to_csv: convert a folder of parquet files to a csv file
3. parquet_file_to_xlsx: convert a single parquet file to an excel file
4. parquet_files_to_xlsx: convert a folder of parquet files to an excel file
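The four functions follow a regular naming pattern (file or files, csv or xlsx), so a caller could pick one from the paths alone. The following is a minimal sketch of that idea, not part of the package: `pick_converter` and `convert` are hypothetical helper names, and the sketch assumes every function takes the source path and the output path as its first two arguments, as in the examples in this README.

```python
from pathlib import Path


def pick_converter(source: str, output: str) -> str:
    """Return the name of the parquet_to_excel function that fits the paths.

    A directory source maps to the "files" variants, anything else to the
    single-"file" variants; the output suffix selects csv or xlsx.
    """
    scope = "files" if Path(source).is_dir() else "file"
    suffix = Path(output).suffix.lower()
    if suffix not in (".csv", ".xlsx"):
        raise ValueError(f"unsupported output type: {suffix!r}")
    return f"parquet_{scope}_to_{suffix[1:]}"


def convert(source: str, output: str, **options):
    """Hypothetical wrapper: look up the chosen function and call it."""
    # Imported lazily so pick_converter can be used without the package.
    import parquet_to_excel

    func = getattr(parquet_to_excel, pick_converter(source, output))
    # Keyword options (e.g. header_labels, select_columns) are passed through.
    return func(source, output, **options)
```

With this helper, `convert(folder, "out.xlsx", sheet_column="sheet")` would resolve to `parquet_files_to_xlsx` when `folder` is a directory.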
# Python Examples
1. parquet to csv
```python
from parquet_to_excel import parquet_file_to_csv, parquet_files_to_csv

# convert a single parquet file to a csv file
parquet_file_to_csv(
    r"D:\Projects\RustTool\data\.duck\yo_dxzh\source=zzz.xlsx\data.parquet",
    r"D:\Felix\Desktop\out1.csv",
    # column name -> header label; "地点编码" means "location code"
    header_labels={"ddbm": "地点编码"},
    select_columns={"sheet", "yjkm", "yjkmsm"},
)

# convert a folder of parquet files to a csv file
parquet_files_to_csv(
    r"D:\Projects\RustTool\data\.duck\yo_dxzh",
    r"D:\Felix\Desktop\out2.csv",
    header_labels={"ddbm": "地点编码"},
    select_columns={"sheet", "yjkm", "yjkmsm"},
)
```
2. parquet to xlsx
```python
from parquet_to_excel import parquet_file_to_xlsx, parquet_files_to_xlsx

# write all data into one sheet
parquet_file_to_xlsx(
    r"D:\Projects\RustTool\data\.duck\yo_dxzh\source=合并报表公司主体及内部客商编码(管理责任人:刘露).xlsx\data.parquet",
    r"D:\Felix\Desktop\out1.xlsx",
    sheet_name="data",
    # column name -> header label; "地点编码" means "location code"
    header_labels={"ddbm": "地点编码"},
    select_columns={"sheet", "yjkm", "yjkmsm"},
)

# write the data into different sheets by the value of column "sheet"
parquet_files_to_xlsx(
    r"D:\Projects\RustTool\data\.duck\yo_dxzh",
    r"D:\Felix\Desktop\out2.xlsx",
    sheet_column="sheet",
    header_labels={"ddbm": "地点编码"},
    select_columns={"sheet", "yjkm", "yjkmsm"},
)
```
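To picture what splitting "by the value of column `sheet`" means, here is a pure-Python illustration of the grouping idea only. It is not the library's implementation (which is written in Rust and, per the description, runs with constant memory, whereas this sketch holds every row in memory). The helper name `split_rows_by_column` and the sample rows are made up for the example.

```python
from collections import defaultdict


def split_rows_by_column(rows, sheet_column):
    """Group row dicts by the value found in `sheet_column`.

    Each distinct value becomes one key, standing in for one worksheet.
    """
    sheets = defaultdict(list)
    for row in rows:
        sheets[str(row[sheet_column])].append(row)
    return dict(sheets)


# Made-up sample rows; the column names mirror the examples above.
rows = [
    {"sheet": "2023", "yjkm": "1001"},
    {"sheet": "2024", "yjkm": "1002"},
    {"sheet": "2023", "yjkm": "1003"},
]
sheets = split_rows_by_column(rows, "sheet")
for name, sheet_rows in sheets.items():
    print(name, len(sheet_rows))
```

Rows sharing a value in the chosen column end up together, which is the behavior the `sheet_column` comment in the example describes.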
Raw data
{
"_id": null,
"home_page": null,
"name": "parquet-to-excel",
"maintainer": null,
"docs_url": null,
"requires_python": ">=3.8",
"maintainer_email": null,
"keywords": "csv, xlsx, excel, parquet, convert",
"author": null,
"author_email": null,
"download_url": "https://files.pythonhosted.org/packages/1b/fd/2f3e779c5292fa298c0e6ef60b5827f7d00bfb3400e5f1fb93a186f05425/parquet_to_excel-0.5.0.tar.gz",
"platform": null,
"description": "# parquet_to_excel\r\nA tool to convert parquet file to an/a excel/csv file in rust with constant memory, both a single parquet file and a folder of parquet files are supported.\r\nYou can also use python or rust to call it. The python package name is parquet_to_excel too. you can install it by `pip install parquet_to_excel`. If you could not install this package correctly, you can try to install rust and maturin (`pip install maturin`) first. Then you can try again.\r\n\r\n# Functions\r\n1. parquet_file_to_csv: convert a single parquet file to a csv file\r\n2. parquet_files_to_csv: convert a folder of parquet files to a csv file\r\n1. parquet_file_to_xlsx: convert a single parquet file to an excel file\r\n2. parquet_files_to_xlsx: convert a folder of parquet files to an excel file\r\n\r\n# Python Examples\r\n1. parquet to csv\r\n```python\r\nfrom parquet_to_excel import parquet_file_to_csv, parquet_files_to_csv\r\n\r\nparquet_file_to_csv(\r\n r\"D:\\Projects\\RustTool\\data\\.duck\\yo_dxzh\\source=zzz.xlsx\\data.parquet\", \r\n r\"D:\\Felix\\Desktop\\out1.csv\", \r\n header_labels={\"ddbm\": \"\u5730\u70b9\u7f16\u7801\"},\r\n select_columns=set([\"sheet\", \"yjkm\", \"yjkmsm\"]))\r\n\r\nparquet_files_to_csv(\r\n r\"D:\\Projects\\RustTool\\data\\.duck\\yo_dxzh\", \r\n r\"D:\\Felix\\Desktop\\out2.csv\", \r\n header_labels={\"ddbm\": \"\u5730\u70b9\u7f16\u7801\"},\r\n select_columns=set([\"sheet\", \"yjkm\", \"yjkmsm\"]))\r\n```\r\n\r\n2. 
parquet to xlsx\r\n```python\r\nfrom parquet_to_excel import parquet_file_to_xlsx, parquet_files_to_xlsx\r\n\r\n# write all data into one sheet\r\nparquet_file_to_xlsx(\r\n r\"D:\\Projects\\RustTool\\data\\.duck\\yo_dxzh\\source=\u5408\u5e76\u62a5\u8868\u516c\u53f8\u4e3b\u4f53\u53ca\u5185\u90e8\u5ba2\u5546\u7f16\u7801\uff08\u7ba1\u7406\u8d23\u4efb\u4eba\uff1a\u5218\u9732\uff09.xlsx\\data.parquet\", \r\n r\"D:\\Felix\\Desktop\\out1.xlsx\", \r\n sheet_name=\"data\", \r\n header_labels={\"ddbm\": \"\u5730\u70b9\u7f16\u7801\"},\r\n select_columns=set([\"sheet\", \"yjkm\", \"yjkmsm\"]))\r\n\r\n# write all data into different sheets by the value of column \"sheet\"\r\nparquet_files_to_xlsx(\r\n r\"D:\\Projects\\RustTool\\data\\.duck\\yo_dxzh\", \r\n r\"D:\\Felix\\Desktop\\out2.xlsx\", \r\n sheet_column = \"sheet\", \r\n header_labels={\"ddbm\": \"\u5730\u70b9\u7f16\u7801\"},\r\n select_columns=set([\"sheet\", \"yjkm\", \"yjkmsm\"]))\r\n```\r\n\n",
"bugtrack_url": null,
"license": "MIT",
"summary": "a crate to convert parquet file(s) to an/a excel/csv file with constant memory in rust",
"version": "0.5.0",
"project_urls": {
"Source Code": "https://gitee.com/sone92/paruqet-to-xlsx.git"
},
"split_keywords": [
"csv",
" xlsx",
" excel",
" parquet",
" convert"
],
"urls": [
{
"comment_text": null,
"digests": {
"blake2b_256": "1b2c3705b6ba236ca040694e933c5217b888fddfc7deb989b82dbfd30eb8d550",
"md5": "4379a9c6881d0d02d423b62af6462547",
"sha256": "83d1bd7d8c9f116ae5a6e661731b9cc6b527a9346ed914daa36d318c5bef32ac"
},
"downloads": -1,
"filename": "parquet_to_excel-0.5.0-cp312-cp312-win_amd64.whl",
"has_sig": false,
"md5_digest": "4379a9c6881d0d02d423b62af6462547",
"packagetype": "bdist_wheel",
"python_version": "cp312",
"requires_python": ">=3.8",
"size": 11492174,
"upload_time": "2025-07-21T06:40:33",
"upload_time_iso_8601": "2025-07-21T06:40:33.924117Z",
"url": "https://files.pythonhosted.org/packages/1b/2c/3705b6ba236ca040694e933c5217b888fddfc7deb989b82dbfd30eb8d550/parquet_to_excel-0.5.0-cp312-cp312-win_amd64.whl",
"yanked": false,
"yanked_reason": null
},
{
"comment_text": null,
"digests": {
"blake2b_256": "1bfd2f3e779c5292fa298c0e6ef60b5827f7d00bfb3400e5f1fb93a186f05425",
"md5": "3dabe6da6f80791a17ebfcd816db6779",
"sha256": "ffdfb63dd217d29c374ab2448862750ea77db9adbf5d0a7fc0290b3c8eb4e803"
},
"downloads": -1,
"filename": "parquet_to_excel-0.5.0.tar.gz",
"has_sig": false,
"md5_digest": "3dabe6da6f80791a17ebfcd816db6779",
"packagetype": "sdist",
"python_version": "source",
"requires_python": ">=3.8",
"size": 24794,
"upload_time": "2025-07-21T06:40:36",
"upload_time_iso_8601": "2025-07-21T06:40:36.419483Z",
"url": "https://files.pythonhosted.org/packages/1b/fd/2f3e779c5292fa298c0e6ef60b5827f7d00bfb3400e5f1fb93a186f05425/parquet_to_excel-0.5.0.tar.gz",
"yanked": false,
"yanked_reason": null
}
],
"upload_time": "2025-07-21 06:40:36",
"github": false,
"gitlab": false,
"bitbucket": false,
"codeberg": false,
"lcname": "parquet-to-excel"
}