| Name | Version | Summary | date |
| iterabledata |
1.0.6 |
Iterable data processing Python library |
2025-11-01 08:35:51 |
| vivaa |
1.0.0 |
VIVA: Versatile Intelligent Visual Annotation Tool |
2025-10-31 05:36:35 |
| datagen-cli |
0.1.3 |
A colorful and interactive CLI tool to generate customizable synthetic datasets. |
2025-10-30 21:11:08 |
| wikisets |
0.1.0 |
Flexible Wikipedia dataset builder with sampling and pretraining support |
2025-10-27 22:41:33 |
| datafast |
0.0.28 |
A Python package for synthetic text dataset generation |
2025-10-26 17:32:32 |
| japanese-personal-name-dataset |
0.1.1 |
A comprehensive dataset of Japanese personal names (first names and last names) with hiragana readings, romaji, and kanji variations |
2025-10-25 10:27:08 |
| maze-dataset |
1.4.1 |
generating and working with datasets of mazes |
2025-10-17 12:31:39 |
| ds-format |
4.3.0 |
ds-format is an open source program, a Python package and a storage format which provides an interface for reading and writing NetCDF files, as well as its own data file format. |
2025-10-15 09:52:46 |
| detection-dataset-annotator |
0.1.5 |
A slight annotator of detection datasets |
2025-09-13 12:12:58 |
| webshart |
0.4.3 |
Fast and memory-efficient webdataset shard reader |
2025-09-06 15:31:56 |
| dfcon |
0.2.10 |
To make access to the database easier. |
2025-08-29 13:31:45 |
| smolvladataset |
0.1.1 |
Loader for SmolVLA robotics datasets with deterministic train/val/test splits; LeRobot‑compatible, cached locally, downloads precompiled bundles from the Hugging Face Hub or rebuilds from a CSV. |
2025-08-27 01:41:50 |
| aydie-dataset-cleaner |
1.0.0 |
A Python library to validate, profile, and clean datasets (CSV, Excel, Parquet) for machine learning workflows. |
2025-08-20 19:09:12 |
| tempdataset |
0.2.0 |
A lightweight Python library for generating realistic temporary datasets |
2025-08-12 11:37:49 |
| lero-core |
0.3.0 |
LERO - LeRobot dataset Operations toolkit for editing and managing LeRobot datasets |
2025-08-10 01:53:51 |
| lero |
0.3.0 |
LERO - LeRobot dataset Operations toolkit for editing and managing LeRobot datasets |
2025-08-09 23:42:27 |
| sapien |
3.0.0 |
['SAPIEN: A SimulAted Parted based Interactive ENvironment'] |
2025-07-25 22:51:37 |
| gitcode |
1.1.4 |
GitCode模型文件上传下载CLI工具 |
2025-07-23 06:08:00 |
| easy-dataset-share |
0.4.3 |
CLI tool to responsibly share datasets by gzipping, canarying, and tracking provenance. |
2025-07-22 03:58:21 |
| financial-dataset-loader |
0.2.3 |
A Python module for loading financial datasets from various sources |
2025-02-19 05:39:45 |