| Name | Version | Summary | date |
| antchain |
0.0.7 |
一个函数式编程风格的数据处理管道库,支持链式调用和多种数据处理操作 |
2025-10-25 20:57:21 |
| table-toolkit |
2025.10.22.post1 |
A Python library for consistent preprocessing of tabular data with automatic type inference, caching, and stratified splitting |
2025-10-23 03:19:53 |
| buelon |
1.0.73 |
A scripting language to simply manage a very large amount of i/o heavy workloads. Such as API calls for your ETL, ELT or any program needing Python and/or SQL |
2025-10-14 17:36:06 |
| batch-data-test-tool |
1.1.1 |
一个用于批量处理数据并发送HTTP请求的Python工具包 |
2025-10-11 07:54:22 |
| 1cijferho |
0.1.0 |
Professional tools for processing Dutch higher educational data (1CijferHO / ROD) |
2025-10-10 08:31:41 |
| dataknobs-fsm |
0.1.1 |
Finite State Machine framework with data modes, resource management, and streaming support |
2025-10-08 22:26:12 |
| zephflow |
0.3.1 |
Python SDK for ZephFlow data processing pipelines |
2025-10-07 23:22:12 |
| scriptcraft-python |
1.6.3 |
Data processing and quality control tools for research workflows |
2025-09-08 17:36:02 |
| oxidize |
0.7.0 |
High-performance data processing tools for Python, built with Rust |
2025-09-07 01:03:44 |
| streamz-zmq |
0.1.5 |
ZeroMQ integration for streamz - high-performance streaming data processing |
2025-09-02 15:44:24 |
| raydp-nightly |
2025.7.14.dev0 |
RayDP: Distributed Data Processing on Ray |
2025-07-14 01:23:17 |
| raydp |
1.6.2 |
RayDP: Distributed Data Processing on Ray |
2025-03-14 08:43:42 |
| aws-s3-controller |
0.7.1 |
A collection of natural language-like utility functions to intuitively and easily control AWS's cloud object storage resource, S3. |
2025-02-13 01:59:49 |
| bulkflow |
0.1.4 |
A high-performance CSV to PostgreSQL data loader with chunked processing and error handling |
2024-11-12 05:54:22 |
| emrrunner |
1.0.9 |
A powerful CLI tool and API for managing Spark jobs on Amazon EMR clusters |
2024-11-03 16:44:04 |
| batch-dev |
0.0.3 |
Generic python module for handling dictionary-based batch data |
2024-03-07 12:00:08 |
| ror |
0.1.1 |
Simple pipelining framework in Python |
2024-01-13 20:25:05 |
| meltano-target-cratedb |
0.0.1 |
A Singer target for CrateDB, built with the Meltano SDK, and based on the Meltano PostgreSQL target. |
2023-12-08 20:50:19 |
| pystream-pipeline |
0.2.0 |
Python package to create and manage fast parallelized data processing pipeline for real-time application |
2023-11-04 14:50:23 |
| ini2csv |
1.0.0 |
A simple utility that converts and combines a folder of .ini files with identical keys into one csv file. |
2023-06-21 17:47:12 |