PyDigger - unearthing stuff about Python


NameVersionSummarydate
unstructured-ingest 0.5.8 A library that prepares raw documents for downstream ML tasks. 2025-02-21 18:12:37
marker-pdf 1.5.5 Convert PDF to markdown with high speed and accuracy. 2025-02-19 23:06:12
BabelDOC 0.1.10 Yet Another Document Translator 2025-02-19 12:10:32
imio.email.parser 0.3.0 This parser extracts forwarded attached email, embedded images and attachments. It also generates a PDF from the email. 2025-02-18 12:55:00
pyvisionai 0.3.0 A Python library for extracting and describing content from documents using Vision LLMs 2025-02-18 10:47:11
diffpy.pdfgui 3.1.0 GUI for PDF simulation and structure refinement. 2025-02-17 23:42:16
plakativ 0.5.3 Convert a PDF into a large poster that can be printed on multiple smaller pages. 2025-02-16 17:47:23
img2pdf 0.6.0 Convert images to PDF via direct JPEG inclusion. 2025-02-15 14:09:53
pdftitle 0.18 pdftitle is a small utility to extract the title from a PDF file 2025-02-15 11:36:51
smart-llm-loader 0.1.0 A powerful PDF processing toolkit that seamlessly integrates with LLMs for intelligent document chunking and RAG applications. Features smart context-aware segmentation, multi-LLM support, and optimized content extraction for enhanced RAG performance. 2025-02-14 12:42:55
endesive 2.18.1 Library for digital signing and verification of digital signatures in mail, PDF and XML documents. 2025-02-10 07:00:58
pdf-ansh 0.1.0 A Python package for various PDF operations. 2025-02-09 17:24:08
pdfslash 0.5.0 Crop pdf margins from interactive interpreter. 2025-02-09 17:06:17
tosixinch 0.10.0 Browser to e-reader in a few minutes 2025-02-09 16:49:54
diffpy.pdffit2 1.5.1 PDFfit2 - real space structure refinement program. 2025-02-07 20:44:29
unstructured 0.16.20 A library that prepares raw documents for downstream ML tasks. 2025-02-06 06:13:48
unstructured-inference 0.8.7 A library for performing inference using trained models. 2025-02-03 19:41:02
yadt 0.1.2 Yet Another Document Translator 2025-02-03 16:20:00
markdrop 0.3.1.3 A comprehensive PDF processing toolkit that converts PDFs to markdown with advanced AI-powered features for image and table analysis. Supports local files and URLs, preserves document structure, extracts high-quality images, detects tables using advanced ML models, and generates detailed content descriptions using multiple LLM providers including OpenAI and Google's Gemini. 2025-01-29 22:37:58
pdftext 0.5.1 Extract structured text from pdfs quickly 2025-01-28 17:10:43
hourdayweektotal
5811106355288835
Elapsed time: 1.08327s