Name | Version | Summary | Date |
dagster-kafka | 1.3.1 | Enterprise-grade Kafka integration for Dagster with Confluent Connect, comprehensive serialization support, DLQ handling, and production monitoring | 2025-08-21 21:48:43 |
databus | 0.1.0 | Python SDK and command-line toolkit for GTFS data processing, validation, and analysis. Provides programmatic access to Databús APIs, GTFS manipulation utilities, data conversion tools, and automated testing frameworks for transit data workflows and research applications. | 2025-08-20 18:53:55 |
pydpm-xl | 0.1.3 | Python library for DPM-XL data processing and analysis | 2025-08-20 16:25:44 |
cratedb-toolkit | 0.0.41 | CrateDB Toolkit | 2025-08-19 21:11:01 |
sheetwise | 2.2.0 | A Python package for encoding spreadsheets for Large Language Models, implementing the SpreadsheetLLM research framework | 2025-08-19 15:46:22 |
sortdx | 0.1.1 | Universal sorting tool for files, data structures, and large datasets | 2025-08-18 16:31:35 |
lineagentic-flow | 1.0.2 | Lineagentic-flow is an agentic AI approach for building data lineage across diverse data processing scripts, including Python, SQL, Java, Airflow, Spark, etc. | 2025-08-18 16:30:41 |
sortx-universal | 0.1.0 | Universal sorting tool for files, data structures, and large datasets | 2025-08-18 14:25:44 |
minispark | 0.1.10 | A lightweight Python library for reading data from multiple data sources and processing it efficiently on a local machine, with functionality similar to Apache Spark | 2025-08-18 12:14:19 |
fleetfluid | 0.1.3 | AI Agent Functions for ETL Processing | 2025-08-17 22:47:23 |
flagged-csv | 0.1.3 | Convert XLSX files to CSV with visual formatting preserved as inline flags | 2025-08-17 00:03:25 |
minispqrk | 0.1.9 | A lightweight Python library for reading data from multiple data sources and processing it efficiently on a local machine, with functionality similar to Apache Spark | 2025-08-16 12:41:38 |
pyjsonkit | 0.1.0 | A comprehensive Python toolkit for JSON processing with advanced AI-focused features for modern data workflows | 2025-08-15 17:12:21 |
px-processor | 0.2.3 | Process and validate JSON and CSV data with ease. | 2025-08-06 09:31:28 |
slipstream-async | 1.0.4 | Streamline your stream processing. | 2025-08-01 15:35:36 |
openforis-whisp | 2.0.0a5 | Whisp (What is in that plot) is an open-source solution which helps to produce relevant forest monitoring information and support compliance with deforestation-related regulations. | 2025-07-31 11:19:56 |
datamaster-mcp | 1.0.3 | DataMaster MCP - AI-powered data analysis tool with MCP protocol support | 2025-07-27 10:19:41 |
mlfcrafter | 0.1.1 | ML Pipeline Automation Framework - Chain together data processing, model training, and deployment with minimal code | 2025-07-26 10:48:34 |
laygo | 0.1.2 | A lightweight Python library for building resilient, in-memory data pipelines with elegant, chainable syntax | 2025-07-16 19:41:27 |
splurge-data-profiler | 0.1.2 | A data profiling tool for delimited and database sources. | 2025-07-11 10:27:59 |