Name | Version | Summary | date |
unstructured-ingest |
0.5.8 |
A library that prepares raw documents for downstream ML tasks. |
2025-02-21 18:12:37 |
marker-pdf |
1.5.5 |
Convert PDF to markdown with high speed and accuracy. |
2025-02-19 23:06:12 |
BabelDOC |
0.1.10 |
Yet Another Document Translator |
2025-02-19 12:10:32 |
imio.email.parser |
0.3.0 |
This parser extracts forwarded attached email, embedded images and attachments. It also generates a PDF from the email. |
2025-02-18 12:55:00 |
pyvisionai |
0.3.0 |
A Python library for extracting and describing content from documents using Vision LLMs |
2025-02-18 10:47:11 |
diffpy.pdfgui |
3.1.0 |
GUI for PDF simulation and structure refinement. |
2025-02-17 23:42:16 |
plakativ |
0.5.3 |
Convert a PDF into a large poster that can be printed on multiple smaller pages. |
2025-02-16 17:47:23 |
img2pdf |
0.6.0 |
Convert images to PDF via direct JPEG inclusion. |
2025-02-15 14:09:53 |
pdftitle |
0.18 |
pdftitle is a small utility to extract the title from a PDF file |
2025-02-15 11:36:51 |
smart-llm-loader |
0.1.0 |
A powerful PDF processing toolkit that seamlessly integrates with LLMs for intelligent document chunking and RAG applications. Features smart context-aware segmentation, multi-LLM support, and optimized content extraction for enhanced RAG performance. |
2025-02-14 12:42:55 |
endesive |
2.18.1 |
Library for digital signing and verification of digital signatures in mail, PDF and XML documents. |
2025-02-10 07:00:58 |
pdf-ansh |
0.1.0 |
A Python package for various PDF operations. |
2025-02-09 17:24:08 |
pdfslash |
0.5.0 |
Crop pdf margins from interactive interpreter. |
2025-02-09 17:06:17 |
tosixinch |
0.10.0 |
Browser to e-reader in a few minutes |
2025-02-09 16:49:54 |
diffpy.pdffit2 |
1.5.1 |
PDFfit2 - real space structure refinement program. |
2025-02-07 20:44:29 |
unstructured |
0.16.20 |
A library that prepares raw documents for downstream ML tasks. |
2025-02-06 06:13:48 |
unstructured-inference |
0.8.7 |
A library for performing inference using trained models. |
2025-02-03 19:41:02 |
yadt |
0.1.2 |
Yet Another Document Translator |
2025-02-03 16:20:00 |
markdrop |
0.3.1.3 |
A comprehensive PDF processing toolkit that converts PDFs to markdown with advanced AI-powered features for image and table analysis. Supports local files and URLs, preserves document structure, extracts high-quality images, detects tables using advanced ML models, and generates detailed content descriptions using multiple LLM providers including OpenAI and Google's Gemini. |
2025-01-29 22:37:58 |
pdftext |
0.5.1 |
Extract structured text from pdfs quickly |
2025-01-28 17:10:43 |