PyDigger - unearthing stuff about Python


NameVersionSummarydate
chunklet 1.1.0.post1 A smart multilingual text chunker for LLMs, RAG, and beyond. 2025-08-13 17:34:55
semchunk 3.2.3 A fast, lightweight and easy-to-use Python library for splitting text into semantically meaningful chunks. 2025-08-13 03:31:59
rag-document-viewer 1.1.1 RAG Document Viewer 2025-08-11 16:21:26
chunkwrap 2.4.1 Command-line tool to select code/docs for LLMs with secret masking and a hard cap on final output size. 2025-08-11 11:37:09
smartchunkllm 0.1.7 Advanced Legal Document Semantic Chunking System 2025-08-10 21:52:24
chunkipy 1.0.0.post1 Chunkipy is an easy-to-use library for chunking text based on the size estimator function you provide. 2025-08-08 12:37:03
llama-index-packs-node-parser-semantic-chunking 0.4.0 llama-index packs node_parser integration 2025-07-30 21:33:16
docling-analysis-framework 1.1.0 AI-ready analysis framework for PDF and Office documents using Docling for content extraction 2025-07-29 14:34:10
llm-text-splitter 0.2.0 A lightweight, rule-based text splitter for LLM context window management, handles multiple file formats and enriches chunks with metadata. 2025-07-24 12:21:01
llm-agent-toolkit 0.0.32.8 LLM Agent Toolkit provides minimal, modular interfaces for core components in LLM-based applications. 2025-04-21 07:43:48
ai-chunking 0.1.4 A powerful Python library for semantic document chunking and enrichment using AI 2025-03-16 20:44:19
betterhtmlchunking 0.9.1 A Python library for intelligent HTML segmentation and ROI extraction. It builds a DOM tree from raw HTML and extracts content-rich regions for efficient web scraping and analysis. 2025-02-14 08:21:28
alphacodings 0.2.0 base26 ([A-Z]) and base52 ([A-Za-z]) encodings 2024-12-09 03:04:43
quackling 0.4.1 Quackling enables document-native generative AI applications 2024-09-11 13:26:57
llama-index-readers-preprocess 0.2.0 llama-index readers preprocess integration 2024-08-22 06:50:57
pypreprocess 1.4.3 Preprocess SDK 2024-08-11 08:00:57
semantic-chunker 0.1.0 Semantic Chunker 2024-07-17 22:05:34
fastcdc 1.7.0 FastCDC (content defined chunking) in pure Python. 2024-06-27 15:55:34
hourdayweektotal
86230110517310860
Elapsed time: 2.10903s