Name | Version | Summary | date |
BabelDOC |
0.2.29 |
Yet Another Document Translator |
2025-04-01 14:06:39 |
marker-pdf |
1.6.0 |
Convert documents to markdown with high speed and accuracy. |
2025-02-28 23:57:04 |
atai-pdf-tool |
0.1.0 |
A tool for parsing and extracting text from PDF files with OCR capabilities |
2025-02-27 11:15:46 |
xhtml2pdf |
0.2.17 |
PDF generator using HTML and CSS |
2025-02-23 23:17:02 |
pyvisionai |
0.3.1 |
A Python library for extracting and describing content from documents using Vision LLMs |
2025-02-22 22:21:47 |
unstructured-ingest |
0.5.8 |
A library that prepares raw documents for downstream ML tasks. |
2025-02-21 18:12:37 |
imio.email.parser |
0.3.0 |
This parser extracts forwarded attached email, embedded images and attachments. It also generates a PDF from the email. |
2025-02-18 12:55:00 |
diffpy.pdfgui |
3.1.0 |
GUI for PDF simulation and structure refinement. |
2025-02-17 23:42:16 |
plakativ |
0.5.3 |
Convert a PDF into a large poster that can be printed on multiple smaller pages. |
2025-02-16 17:47:23 |
img2pdf |
0.6.0 |
Convert images to PDF via direct JPEG inclusion. |
2025-02-15 14:09:53 |
pdftitle |
0.18 |
pdftitle is a small utility to extract the title from a PDF file |
2025-02-15 11:36:51 |
smart-llm-loader |
0.1.0 |
A powerful PDF processing toolkit that seamlessly integrates with LLMs for intelligent document chunking and RAG applications. Features smart context-aware segmentation, multi-LLM support, and optimized content extraction for enhanced RAG performance. |
2025-02-14 12:42:55 |
endesive |
2.18.1 |
Library for digital signing and verification of digital signatures in mail, PDF and XML documents. |
2025-02-10 07:00:58 |
pdf-ansh |
0.1.0 |
A Python package for various PDF operations. |
2025-02-09 17:24:08 |
pdfslash |
0.5.0 |
Crop pdf margins from interactive interpreter. |
2025-02-09 17:06:17 |
tosixinch |
0.10.0 |
Browser to e-reader in a few minutes |
2025-02-09 16:49:54 |
diffpy.pdffit2 |
1.5.1 |
PDFfit2 - real space structure refinement program. |
2025-02-07 20:44:29 |
unstructured |
0.16.20 |
A library that prepares raw documents for downstream ML tasks. |
2025-02-06 06:13:48 |
unstructured-inference |
0.8.7 |
A library for performing inference using trained models. |
2025-02-03 19:41:02 |
yadt |
0.1.2 |
Yet Another Document Translator |
2025-02-03 16:20:00 |