PyDigger - unearthing stuff about Python


NameVersionSummarydate
iterabledata 1.0.6 Iterable data processing Python library 2025-11-01 08:35:51
vivaa 1.0.0 VIVA: Versatile Intelligent Visual Annotation Tool 2025-10-31 05:36:35
datagen-cli 0.1.3 A colorful and interactive CLI tool to generate customizable synthetic datasets. 2025-10-30 21:11:08
wikisets 0.1.0 Flexible Wikipedia dataset builder with sampling and pretraining support 2025-10-27 22:41:33
datafast 0.0.28 A Python package for synthetic text dataset generation 2025-10-26 17:32:32
japanese-personal-name-dataset 0.1.1 A comprehensive dataset of Japanese personal names (first names and last names) with hiragana readings, romaji, and kanji variations 2025-10-25 10:27:08
maze-dataset 1.4.1 generating and working with datasets of mazes 2025-10-17 12:31:39
ds-format 4.3.0 ds-format is an open source program, a Python package and a storage format which provides an interface for reading and writing NetCDF files, as well as its own data file format. 2025-10-15 09:52:46
detection-dataset-annotator 0.1.5 A slight annotator of detection datasets 2025-09-13 12:12:58
webshart 0.4.3 Fast and memory-efficient webdataset shard reader 2025-09-06 15:31:56
dfcon 0.2.10 To make access to the database easier. 2025-08-29 13:31:45
smolvladataset 0.1.1 Loader for SmolVLA robotics datasets with deterministic train/val/test splits; LeRobot‑compatible, cached locally, downloads precompiled bundles from the Hugging Face Hub or rebuilds from a CSV. 2025-08-27 01:41:50
aydie-dataset-cleaner 1.0.0 A Python library to validate, profile, and clean datasets (CSV, Excel, Parquet) for machine learning workflows. 2025-08-20 19:09:12
tempdataset 0.2.0 A lightweight Python library for generating realistic temporary datasets 2025-08-12 11:37:49
lero-core 0.3.0 LERO - LeRobot dataset Operations toolkit for editing and managing LeRobot datasets 2025-08-10 01:53:51
lero 0.3.0 LERO - LeRobot dataset Operations toolkit for editing and managing LeRobot datasets 2025-08-09 23:42:27
sapien 3.0.0 ['SAPIEN: A SimulAted Parted based Interactive ENvironment'] 2025-07-25 22:51:37
gitcode 1.1.4 GitCode模型文件上传下载CLI工具 2025-07-23 06:08:00
easy-dataset-share 0.4.3 CLI tool to responsibly share datasets by gzipping, canarying, and tracking provenance. 2025-07-22 03:58:21
financial-dataset-loader 0.2.3 A Python module for loading financial datasets from various sources 2025-02-19 05:39:45
hourdayweektotal
826538011333472
Elapsed time: 5.45718s