parquet-to-excel


Nameparquet-to-excel JSON
Version 0.5.0 PyPI version JSON
download
home_pageNone
Summarya crate to convert parquet file(s) to an/a excel/csv file with constant memory in rust
upload_time2025-07-21 06:40:36
maintainerNone
docs_urlNone
authorNone
requires_python>=3.8
licenseMIT
keywords csv xlsx excel parquet convert
VCS
bugtrack_url
requirements No requirements were recorded.
Travis-CI No Travis.
coveralls test coverage No coveralls.
            # parquet_to_excel
A tool to convert parquet file to an/a excel/csv file in rust with constant memory, both a single parquet file and a folder of parquet files are supported.
You can also use python or rust to call it. The python package name is parquet_to_excel too. you can install it by `pip install parquet_to_excel`. If you could not install this package correctly, you can try to install rust and maturin (`pip install maturin`) first. Then you can try again.

# Functions
1. parquet_file_to_csv: convert a single parquet file to a csv file
2. parquet_files_to_csv: convert a folder of parquet files to a csv file
1. parquet_file_to_xlsx: convert a single parquet file to an excel file
2. parquet_files_to_xlsx: convert a folder of parquet files to an excel file

# Python Examples
1. parquet to csv
```python
from parquet_to_excel import parquet_file_to_csv, parquet_files_to_csv

parquet_file_to_csv(
    r"D:\Projects\RustTool\data\.duck\yo_dxzh\source=zzz.xlsx\data.parquet", 
    r"D:\Felix\Desktop\out1.csv", 
    header_labels={"ddbm": "地点编码"},
    select_columns=set(["sheet", "yjkm", "yjkmsm"]))

parquet_files_to_csv(
    r"D:\Projects\RustTool\data\.duck\yo_dxzh", 
    r"D:\Felix\Desktop\out2.csv", 
    header_labels={"ddbm": "地点编码"},
    select_columns=set(["sheet", "yjkm", "yjkmsm"]))
```

2. parquet to xlsx
```python
from parquet_to_excel import parquet_file_to_xlsx, parquet_files_to_xlsx

# write all data into one sheet
parquet_file_to_xlsx(
    r"D:\Projects\RustTool\data\.duck\yo_dxzh\source=合并报表公司主体及内部客商编码(管理责任人:刘露).xlsx\data.parquet", 
    r"D:\Felix\Desktop\out1.xlsx", 
    sheet_name="data", 
    header_labels={"ddbm": "地点编码"},
    select_columns=set(["sheet", "yjkm", "yjkmsm"]))

# write all data into different sheets by the value of column "sheet"
parquet_files_to_xlsx(
    r"D:\Projects\RustTool\data\.duck\yo_dxzh", 
    r"D:\Felix\Desktop\out2.xlsx", 
    sheet_column = "sheet", 
    header_labels={"ddbm": "地点编码"},
    select_columns=set(["sheet", "yjkm", "yjkmsm"]))
```


            

Raw data

            {
    "_id": null,
    "home_page": null,
    "name": "parquet-to-excel",
    "maintainer": null,
    "docs_url": null,
    "requires_python": ">=3.8",
    "maintainer_email": null,
    "keywords": "csv, xlsx, excel, parquet, convert",
    "author": null,
    "author_email": null,
    "download_url": "https://files.pythonhosted.org/packages/1b/fd/2f3e779c5292fa298c0e6ef60b5827f7d00bfb3400e5f1fb93a186f05425/parquet_to_excel-0.5.0.tar.gz",
    "platform": null,
    "description": "# parquet_to_excel\r\nA tool to convert parquet file to an/a excel/csv file in rust with constant memory, both a single parquet file and a folder of parquet files are supported.\r\nYou can also use python or rust to call it. The python package name is parquet_to_excel too. you can install it by `pip install parquet_to_excel`. If you could not install this package correctly, you can try to install rust and maturin (`pip install maturin`) first. Then you can try again.\r\n\r\n# Functions\r\n1. parquet_file_to_csv: convert a single parquet file to a csv file\r\n2. parquet_files_to_csv: convert a folder of parquet files to a csv file\r\n1. parquet_file_to_xlsx: convert a single parquet file to an excel file\r\n2. parquet_files_to_xlsx: convert a folder of parquet files to an excel file\r\n\r\n# Python Examples\r\n1. parquet to csv\r\n```python\r\nfrom parquet_to_excel import parquet_file_to_csv, parquet_files_to_csv\r\n\r\nparquet_file_to_csv(\r\n    r\"D:\\Projects\\RustTool\\data\\.duck\\yo_dxzh\\source=zzz.xlsx\\data.parquet\", \r\n    r\"D:\\Felix\\Desktop\\out1.csv\", \r\n    header_labels={\"ddbm\": \"\u5730\u70b9\u7f16\u7801\"},\r\n    select_columns=set([\"sheet\", \"yjkm\", \"yjkmsm\"]))\r\n\r\nparquet_files_to_csv(\r\n    r\"D:\\Projects\\RustTool\\data\\.duck\\yo_dxzh\", \r\n    r\"D:\\Felix\\Desktop\\out2.csv\", \r\n    header_labels={\"ddbm\": \"\u5730\u70b9\u7f16\u7801\"},\r\n    select_columns=set([\"sheet\", \"yjkm\", \"yjkmsm\"]))\r\n```\r\n\r\n2. parquet to xlsx\r\n```python\r\nfrom parquet_to_excel import parquet_file_to_xlsx, parquet_files_to_xlsx\r\n\r\n# write all data into one sheet\r\nparquet_file_to_xlsx(\r\n    r\"D:\\Projects\\RustTool\\data\\.duck\\yo_dxzh\\source=\u5408\u5e76\u62a5\u8868\u516c\u53f8\u4e3b\u4f53\u53ca\u5185\u90e8\u5ba2\u5546\u7f16\u7801\uff08\u7ba1\u7406\u8d23\u4efb\u4eba\uff1a\u5218\u9732\uff09.xlsx\\data.parquet\", \r\n    r\"D:\\Felix\\Desktop\\out1.xlsx\", \r\n    sheet_name=\"data\", \r\n    header_labels={\"ddbm\": \"\u5730\u70b9\u7f16\u7801\"},\r\n    select_columns=set([\"sheet\", \"yjkm\", \"yjkmsm\"]))\r\n\r\n# write all data into different sheets by the value of column \"sheet\"\r\nparquet_files_to_xlsx(\r\n    r\"D:\\Projects\\RustTool\\data\\.duck\\yo_dxzh\", \r\n    r\"D:\\Felix\\Desktop\\out2.xlsx\", \r\n    sheet_column = \"sheet\", \r\n    header_labels={\"ddbm\": \"\u5730\u70b9\u7f16\u7801\"},\r\n    select_columns=set([\"sheet\", \"yjkm\", \"yjkmsm\"]))\r\n```\r\n\n",
    "bugtrack_url": null,
    "license": "MIT",
    "summary": "a crate to convert parquet file(s) to an/a excel/csv file with constant memory in rust",
    "version": "0.5.0",
    "project_urls": {
        "Source Code": "https://gitee.com/sone92/paruqet-to-xlsx.git"
    },
    "split_keywords": [
        "csv",
        " xlsx",
        " excel",
        " parquet",
        " convert"
    ],
    "urls": [
        {
            "comment_text": null,
            "digests": {
                "blake2b_256": "1b2c3705b6ba236ca040694e933c5217b888fddfc7deb989b82dbfd30eb8d550",
                "md5": "4379a9c6881d0d02d423b62af6462547",
                "sha256": "83d1bd7d8c9f116ae5a6e661731b9cc6b527a9346ed914daa36d318c5bef32ac"
            },
            "downloads": -1,
            "filename": "parquet_to_excel-0.5.0-cp312-cp312-win_amd64.whl",
            "has_sig": false,
            "md5_digest": "4379a9c6881d0d02d423b62af6462547",
            "packagetype": "bdist_wheel",
            "python_version": "cp312",
            "requires_python": ">=3.8",
            "size": 11492174,
            "upload_time": "2025-07-21T06:40:33",
            "upload_time_iso_8601": "2025-07-21T06:40:33.924117Z",
            "url": "https://files.pythonhosted.org/packages/1b/2c/3705b6ba236ca040694e933c5217b888fddfc7deb989b82dbfd30eb8d550/parquet_to_excel-0.5.0-cp312-cp312-win_amd64.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": null,
            "digests": {
                "blake2b_256": "1bfd2f3e779c5292fa298c0e6ef60b5827f7d00bfb3400e5f1fb93a186f05425",
                "md5": "3dabe6da6f80791a17ebfcd816db6779",
                "sha256": "ffdfb63dd217d29c374ab2448862750ea77db9adbf5d0a7fc0290b3c8eb4e803"
            },
            "downloads": -1,
            "filename": "parquet_to_excel-0.5.0.tar.gz",
            "has_sig": false,
            "md5_digest": "3dabe6da6f80791a17ebfcd816db6779",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": ">=3.8",
            "size": 24794,
            "upload_time": "2025-07-21T06:40:36",
            "upload_time_iso_8601": "2025-07-21T06:40:36.419483Z",
            "url": "https://files.pythonhosted.org/packages/1b/fd/2f3e779c5292fa298c0e6ef60b5827f7d00bfb3400e5f1fb93a186f05425/parquet_to_excel-0.5.0.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2025-07-21 06:40:36",
    "github": false,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "lcname": "parquet-to-excel"
}
        
Elapsed time: 0.61565s