PyDigger - unearthing stuff about Python


NameVersionSummarydate
PyquetMS 0.1.0 Memory-efficient mzML to Parquet converter for mass spectrometry files 2025-08-31 23:51:16
parquetconv 0.2.1 A command-line tool for converting between Parquet and CSV file formats 2025-08-25 20:58:42
tidy-viewer-py 0.3.0 A cross-platform data pretty printer that uses column styling to maximize viewer enjoyment. Supports CSV, Parquet, Pandas, and Polars DataFrames with automatic data type detection and display. 2025-08-20 23:39:52
aydie-dataset-cleaner 1.0.0 A Python library to validate, profile, and clean datasets (CSV, Excel, Parquet) for machine learning workflows. 2025-08-20 19:09:12
mockingbird-cli 0.5.0 A powerful CLI tool for generating realistic mock data with relationships and referential integrity 2025-08-05 11:48:23
fosho 0.1.0 Data Signing & Quality - Offline data integrity with CRC32 file hashing and MD5 schema hashing 2025-07-24 20:29:29
data7 0.12.1 Data7 streams CSV/Parquet datasets over HTTP from SQL queries. 2025-07-24 08:44:46
hypersets 0.0.2 Fast, efficient alternative to Hugging Face load_dataset using DuckDB for querying, sampling and transforming remote datasets 2025-07-20 22:53:45
pyforge-cli 1.0.9 A powerful CLI tool for data format conversion and synthetic data generation 2025-07-11 04:38:06
atoti-client-gcp 0.9.4 Code to interact with Google Cloud Platform 2025-02-28 22:10:13
atoti-client-azure 0.9.4 Code to interact with Microsoft Azure 2025-02-28 22:10:00
atoti-client-aws 0.9.4 Code to interact with AWS 2025-02-28 22:09:59
arff-format-converter 1.1.1 Converts ARFF files to CSV, JSON, XML, XLSX, ORC, and parquet. 2024-12-11 08:52:52
dnd-firefly 0.4.1 Programmatically drag-and-drop in IRSA Viewer (Firefly) tool via Upload feature 2024-11-16 02:03:12
skeem 0.1.1 Infer SQL DDL statements from tabular data 2024-10-22 07:37:52
xdlake 0.0.10 A loose implimentation of the deltalake spec focused on extensibility and distributed data. 2024-10-12 19:53:28
execsql 1.130.1 Runs a SQL script against a PostgreSQL, SQLite, MariaDB/MySQL, DuckDB, Firebird, MS-Access, MS-SQL-Server, or Oracle database, or an ODBC DSN. Provides metacommands to import and export data, copy data between databases, conditionally execute SQL and metacommands, and dynamically alter SQL and metacommands with substitution variables. Data can be exported in 18 different formats, including CSV, TSV, ODS, HTML, JSON, LaTeX, and Markdown tables, and using custom templates. 2024-09-28 23:12:07
io-bench 0.1.0 IO Bench is a library designed to benchmark the performance of standard flat file formats and partitioning schemes. 2024-08-21 01:43:11
parquet2lance 0.4.3 The Python wrapper for the Rust parquet2lance 2024-07-05 10:14:59
atoti-gcp 0.8.13 Plugin to load CSV and Parquet files from Google Cloud Storage into Atoti tables 2024-06-04 23:20:50
hourdayweektotal
7719078353335249
Elapsed time: 3.91946s