| Name | Version | Summary | date |
| barecat |
0.2.7 |
Scalable archive format for storing millions of small files with random access and SQLite indexing. |
2025-09-17 16:34:13 |
| caption-flow |
0.4.1 |
Self-contained distributed community captioning system |
2025-09-11 11:44:36 |
| iso3166-2 |
1.8.0 |
A lightweight Python package, and accompanying RESTful API, used to access all of the world's most up-to-date and accurate ISO 3166-2 subdivision data, including: name, local/other name, code, parent code, type, latitude/longitude, flag and history. |
2025-09-07 16:12:20 |
| revoxx |
1.0.2 |
Speech recording application for creating high-quality speech datasets |
2025-09-03 16:28:57 |
| kn-dataset-tools |
0.66.dev0 |
A Simple Viewer for EXIF and AI Metadata. |
2025-09-03 00:37:15 |
| zos-ftp-mcp |
0.1.0 |
MCP server for z/OS mainframe FTP operations |
2025-09-02 22:13:29 |
| fastdatasets-llm |
0.1.3 |
Generate high-quality LLM training datasets from documents with distillation and augmentation. |
2025-08-31 05:51:14 |
| yolococo |
0.2.0 |
YOLO <-> COCO conversion tools with COCO dataset merging |
2025-08-27 23:08:21 |
| annotex |
2.0.7 |
Annotation Tool for Computer Vision Datasets |
2025-08-27 16:27:44 |
| kitti-odom-2012-dataloader |
0.1.0 |
Efficient and user-friendly point cloud data loader for the Kitti Odometry 2012 dataset, supporting multiple coordinate systems and numpy compatibility. |
2025-08-24 06:31:47 |
| tgmix |
0.3.0 |
A tool to process Telegram chat exports into an AI-friendly format, inspired by Repomix. |
2025-08-24 06:19:17 |
| kaist-dataloader |
0.1.1 |
Efficient and user-friendly point cloud data loader for the KAIST dataset, supporting multiple coordinate systems and numpy compatibility. |
2025-08-23 03:49:49 |
| data4ai |
0.3.0 |
Production-ready AI-powered dataset generation for instruction tuning and model fine-tuning |
2025-08-22 00:52:23 |
| endofactory |
0.1.4 |
Revolutionary EndoVQA dataset construction tool for rapid dataset mixing and configuration |
2025-08-21 08:08:17 |
| clip-retrieval |
2.45.0 |
Easily computing clip embeddings and building a clip retrieval system with them |
2025-08-15 22:45:08 |
| rdata |
1.0.0 |
Read R datasets from Python. |
2025-08-15 17:17:10 |
| dataset-with-logits |
1.0.3 |
PyTorch datasets with pre-computed model logits for efficient research |
2025-08-12 15:23:56 |
| vggsounder |
0.1.3 |
A Python package for accessing VGGSounder dataset labels and metadata |
2025-08-11 22:08:12 |
| img2dataset |
1.47.0 |
Easily turn a set of image urls to an image dataset |
2025-08-09 22:07:23 |
| atptools |
0.1.31 |
Dataset manipulation tool. |
2025-08-08 12:49:59 |