Name | Version | Summary | Date |
caption-flow | 0.2.1 | Self-contained distributed community captioning system | 2025-08-19 01:29:33 |
data4ai | 0.2.2 | Production-ready AI-powered dataset generation for instruction tuning and model fine-tuning | 2025-08-18 14:34:27 |
clip-retrieval | 2.45.0 | Easily computing clip embeddings and building a clip retrieval system with them | 2025-08-15 22:45:08 |
rdata | 1.0.0 | Read R datasets from Python. | 2025-08-15 17:17:10 |
ytfetcher | 0.4.1 | YTFetcher lets you fetch YouTube transcripts in bulk with metadata like titles, publish dates, and thumbnails. Great for ML, NLP, and dataset generation. | 2025-08-13 16:24:39 |
dataset-with-logits | 1.0.3 | PyTorch datasets with pre-computed model logits for efficient research | 2025-08-12 15:23:56 |
vggsounder | 0.1.3 | A Python package for accessing VGGSounder dataset labels and metadata | 2025-08-11 22:08:12 |
img2dataset | 1.47.0 | Easily turn a set of image urls to an image dataset | 2025-08-09 22:07:23 |
atptools | 0.1.31 | Dataset manipulation tool. | 2025-08-08 12:49:59 |
agilab | 0.5.1 | AGILAB a datascience IDE for engineering to explore AI | 2025-08-07 16:16:42 |
agi-core | 0.5.1 | AGI core aggregation of agi-env, agi-cluster, agi-node | 2025-08-07 16:16:14 |
nnja-ai | 1.0.0 | Find and load data from the Brightband AI-ready mirror of the NOAA NASA Joint Archive (NNJA) of Observations for Earth System Reanalysis | 2025-08-05 22:40:58 |
re-cdp-patches | 0.9.1 | Patching CDP (Chrome DevTools Protocol) leaks on OS level. Easy to use with Playwright/Patchright. | 2025-08-04 10:27:35 |
dataset-iterator | 0.5.3 | Keras-style data iterator for images contained in dataset files such as hdf5 or PIL readable files. Images can be contained in several files. | 2025-08-01 16:18:50 |
iso3166-2 | 1.7.2 | A lightweight Python package, and accompanying RESTful API, used to access all of the world's most up-to-date and accurate ISO 3166-2 subdivision data, including: name, local/other name, code, parent code, type, latitude/longitude, flag and history. | 2025-07-30 20:33:02 |
xarray-dataclass | 3.0.0 | xarray data creation by data classes | 2025-07-30 20:26:31 |
rdetoolkit | 1.3.3 | A module that supports the workflow of the RDE dataset construction program | 2025-07-29 01:37:03 |
dataset-cat | 0.0.6 | A tool for fetching and organizing anime datasets for training. | 2025-07-24 09:34:14 |
data7 | 0.12.1 | Data7 streams CSV/Parquet datasets over HTTP from SQL queries. | 2025-07-24 08:44:46 |
audb | 1.11.4 | Load and publish databases in audformat | 2025-07-22 07:32:05 |