PyDigger - unearthing stuff about Python


Name Version Summary Date
sparkdq 0.11.0 A declarative PySpark framework for row- and aggregate-level data quality validation. 2025-08-09 16:03:40
canonmap 0.4.53 A data matching and canonicalization library with multiple database connector support 2025-08-09 14:25:13
laktory 0.8.6 An ETL and DataOps framework for building a lakehouse 2025-08-09 13:44:18
databathing 0.2.3 Convert SQL queries to PySpark DataFrame operations 2025-08-08 20:35:53
dg-sqlmesh 1.3.2 Seamless integration between Dagster and SQLMesh for modern data engineering workflows 2025-08-08 07:53:58
google-ads-reports 1.2.2 ETL module for Google Ads API v20 with database-optimized DataFrame processing 2025-08-07 13:52:13
google-sheets-helper 1.1.1 Helper module to parse data from GSheets into database-optimized DataFrames 2025-08-07 13:01:59
zut 3.1.0 Reusable Python utilities. 2025-08-06 19:19:34
FabricSync 2.2.10 Fabric BigQuery Data Sync Utility 2025-08-05 15:25:33
estat-api-dlt-helper 0.1.4 A helper library for fetching data with the e-Stat API and loading it with dlt 2025-08-04 01:50:15
sling 1.4.16 Slings data from a source to a target 2025-08-03 01:02:24
cloakdata 1.0.0 A lightweight library for anonymizing tabular datasets using Polars 2025-08-02 00:04:27
docling-core 2.44.1 A python library to define and validate data types in Docling. 2025-07-30 11:05:55
ll2cz 0.6.2 Transform LiteLLM database data into CloudZero AnyCost CBF format 2025-07-29 02:28:36
dataprobe 1.0.0 Advanced data pipeline debugging and profiling tools for Python 2025-07-28 18:26:25
pysetl 1.2.1 A PySpark ETL Framework 2025-07-27 18:35:31
rushdb 1.10.0 RushDB Python SDK 2025-07-27 12:13:16
milvus-ingest 0.1.2 High-performance data ingestion tool for Milvus vector database with vectorized operations 2025-07-25 11:39:04
parsons 5.2.0 None 2025-07-24 18:31:37
m9lib 1.0.1 m9 utility library 2025-07-20 17:49:29