| Name | Version | Summary | date |
| PyquetMS |
0.1.0 |
Memory-efficient mzML to Parquet converter for mass spectrometry files |
2025-08-31 23:51:16 |
| parquetconv |
0.2.1 |
A command-line tool for converting between Parquet and CSV file formats |
2025-08-25 20:58:42 |
| tidy-viewer-py |
0.3.0 |
A cross-platform data pretty printer that uses column styling to maximize viewer enjoyment. Supports CSV, Parquet, Pandas, and Polars DataFrames with automatic data type detection and display. |
2025-08-20 23:39:52 |
| aydie-dataset-cleaner |
1.0.0 |
A Python library to validate, profile, and clean datasets (CSV, Excel, Parquet) for machine learning workflows. |
2025-08-20 19:09:12 |
| mockingbird-cli |
0.5.0 |
A powerful CLI tool for generating realistic mock data with relationships and referential integrity |
2025-08-05 11:48:23 |
| fosho |
0.1.0 |
Data Signing & Quality - Offline data integrity with CRC32 file hashing and MD5 schema hashing |
2025-07-24 20:29:29 |
| data7 |
0.12.1 |
Data7 streams CSV/Parquet datasets over HTTP from SQL queries. |
2025-07-24 08:44:46 |
| hypersets |
0.0.2 |
Fast, efficient alternative to Hugging Face load_dataset using DuckDB for querying, sampling and transforming remote datasets |
2025-07-20 22:53:45 |
| pyforge-cli |
1.0.9 |
A powerful CLI tool for data format conversion and synthetic data generation |
2025-07-11 04:38:06 |
| atoti-client-gcp |
0.9.4 |
Code to interact with Google Cloud Platform |
2025-02-28 22:10:13 |
| atoti-client-azure |
0.9.4 |
Code to interact with Microsoft Azure |
2025-02-28 22:10:00 |
| atoti-client-aws |
0.9.4 |
Code to interact with AWS |
2025-02-28 22:09:59 |
| arff-format-converter |
1.1.1 |
Converts ARFF files to CSV, JSON, XML, XLSX, ORC, and parquet. |
2024-12-11 08:52:52 |
| dnd-firefly |
0.4.1 |
Programmatically drag-and-drop in IRSA Viewer (Firefly) tool via Upload feature |
2024-11-16 02:03:12 |
| skeem |
0.1.1 |
Infer SQL DDL statements from tabular data |
2024-10-22 07:37:52 |
| xdlake |
0.0.10 |
A loose implimentation of the deltalake spec focused on extensibility and distributed data. |
2024-10-12 19:53:28 |
| execsql |
1.130.1 |
Runs a SQL script against a PostgreSQL, SQLite, MariaDB/MySQL, DuckDB, Firebird, MS-Access, MS-SQL-Server, or Oracle database, or an ODBC DSN. Provides metacommands to import and export data, copy data between databases, conditionally execute SQL and metacommands, and dynamically alter SQL and metacommands with substitution variables. Data can be exported in 18 different formats, including CSV, TSV, ODS, HTML, JSON, LaTeX, and Markdown tables, and using custom templates. |
2024-09-28 23:12:07 |
| io-bench |
0.1.0 |
IO Bench is a library designed to benchmark the performance of standard flat file formats and partitioning schemes. |
2024-08-21 01:43:11 |
| parquet2lance |
0.4.3 |
The Python wrapper for the Rust parquet2lance |
2024-07-05 10:14:59 |
| atoti-gcp |
0.8.13 |
Plugin to load CSV and Parquet files from Google Cloud Storage into Atoti tables |
2024-06-04 23:20:50 |