Name | Version | Summary | date |
docling |
2.14.0 |
SDK and CLI for parsing PDF, DOCX, HTML, and more, to a unified document representation for powering downstream workflows such as gen AI applications. |
2024-12-18 07:05:40 |
docling-ibm-models |
3.1.0 |
This package contains the AI models used by the Docling PDF conversion package |
2024-12-13 13:25:34 |
chunknorris |
1.0.3 |
A package for chunking documents from various formats |
2024-12-12 14:03:57 |
pyzerox-impacte |
0.0.8 |
ocr documents using vision models from all popular providers like OpenAI, Azure OpenAI, Anthropic, AWS Bedrock etc |
2024-12-10 19:39:55 |
limberer |
0.9.1 |
A flexible document generator based on weasyprint, mustache templates, and pandoc. |
2024-12-07 20:26:20 |
llama-index-packs-multi-document-agents |
0.4.0 |
llama-index packs multi_document_agents integration |
2024-11-18 02:01:23 |
llama-index-packs-multidoc-autoretrieval |
0.3.0 |
llama-index packs multidoc_autoretrieval integration |
2024-11-18 01:30:35 |
aspose-words |
24.11.0 |
Aspose.Words for Python is a Document Processing library that allows developers to work with documents in many popular formats without needing Office Automation. |
2024-11-13 10:02:16 |
groupdocs-signature-net |
24.11.0 |
File converter for the most commonly used formats, including DOCX, PDF, CAD, and many more. |
2024-11-12 18:42:47 |
smartloop |
1.1.8 |
Smartloop Command Line interface to process documents using LLM |
2024-11-11 21:29:15 |
groupdocs-conversion-net |
24.11 |
File converter for the most commonly used formats, including DOCX, PDF, CAD, and many more. |
2024-11-11 14:59:24 |
papis |
0.14 |
Powerful and highly extensible command-line based document and bibliography manager |
2024-11-08 19:07:52 |
yattag |
1.16.1 |
Generate HTML or XML in a pythonic way. Pure python alternative to web template engines.Can fill HTML forms with default values and error messages. |
2024-11-02 22:38:30 |
leadtools |
23.0.0.3 |
Powered by patented artificial intelligence and machine learning algorithms, LEADTOOLS is a collection of comprehensive toolkits to integrate recognition, document, medical, imaging, and multimedia technologies into desktop, server, tablet, web and mobile solutions. |
2024-10-11 20:55:19 |
groupdocs-watermark-net |
24.9.1 |
GroupDocs.Watermark is a powerful document watermarking API that allows to add image and text watermarks. Additionally, it can search and remove the watermarks which were added to the documents by other third-party software. |
2024-09-14 10:50:36 |
brutils |
2.2.0 |
Utils library for specific Brazilian businesses |
2024-09-12 19:35:13 |
office-integrator-sdk |
1.0.0b2 |
Zoho Office Integrator Python SDK |
2024-08-26 06:22:52 |
aetherdb |
0.0.1b1 |
AetherDB is a lightweight document-like database for Python. |
2024-07-05 07:20:57 |
palimpzest |
0.2.0 |
Palimpzest is a system which enables anyone to process AI-powered analytical queries simply by defining them in a declarative language |
2024-05-30 03:03:32 |
fedfold |
0.0.2 |
None |
2024-05-11 08:39:39 |