Name | Version | Summary | Date |
unstructured-expanded | 0.16.11.post2 | Expansion to the unstructured package, adding support for image extraction. | 2024-12-21 22:43:01 |
docling | 2.14.0 | SDK and CLI for parsing PDF, DOCX, HTML, and more, to a unified document representation for powering downstream workflows such as gen AI applications. | 2024-12-18 07:05:40 |
aspose-cells-gridjs-net-python | 24.12.0 | A lightweight, scalable, and customizable toolkit that provides cross-platform web applications, enables convenient development for editing or viewing Excel/Spreadsheet files, offers simple deployment, and provides easy-to-use APIs. | 2024-12-16 09:33:48 |
groupdocs-comparison-net | 24.12 | GroupDocs.Comparison for Python via .NET is a powerful API to compare over 50 types of documents and images, including all Microsoft Office and OpenDocument file formats, PDF documents, raster images (TIFF, JPEG, GIF, PNG, BMP). Retrieve the list of changes in the desired format with a line-by-line comparison of content, paragraphs, characters, styles, shapes, and position. | 2024-12-13 13:20:22 |
aspose-cells | 24.12.1 | A powerful library for manipulating and converting Excel (XLS, XLSX, XLSB), ODS, CSV and HTML files. | 2024-12-12 04:33:14 |
aspose-words-cloud | 24.12.0 | Python Cloud SDK wraps Aspose.Words Cloud API so you could seamlessly integrate Microsoft Word file generation, manipulation, conversion & inspection features into your own Python applications. | 2024-12-10 09:10:08 |
wdoc | 2.4.16 | A perfect AI powered RAG for document query and summary. Supports ~all LLM and ~all filetypes (url, pdf, epub, youtube (incl playlist), audio, anki, md, docx, pptx, or any combination!) | 2024-12-05 19:02:38 |
pylibreoffice | 0.1.3 | A Python library for handling Microsoft Office documents, built with LibreOfficeKit. | 2024-12-02 03:08:47 |
aspose-html-net | 24.11.0 | Aspose.HTML for Python via .NET is a powerful API for Python that provides a headless browser functionality, allowing you to work with HTML documents in a variety of ways. With this API, you can easily create new HTML documents or open existing ones from different sources. Once you have the document, you can perform various manipulation operations, such as removing and replacing HTML nodes. | 2024-11-30 08:11:44 |
asposepdfcloud | 24.11.0 | Aspose.PDF Cloud | 2024-11-22 08:12:03 |
aspose-cells-python | 24.11.0 | A powerful library for manipulating and converting Excel (XLS, XLSX, XLSB), CSV, ODS, PDF, JSON, JPG, PNG, BMP, EMF, SVG and HTML files. | 2024-11-13 12:13:34 |
groupdocs-signature-net | 24.11.0 | File converter for the most commonly used formats, including DOCX, PDF, CAD, and many more. | 2024-11-12 18:42:47 |
groupdocs-conversion-net | 24.11 | File converter for the most commonly used formats, including DOCX, PDF, CAD, and many more. | 2024-11-11 14:59:24 |
doc-master | 0.0.2 | Paper - Pytorch | 2024-11-07 20:52:35 |
leadtools | 23.0.0.3 | Powered by patented artificial intelligence and machine learning algorithms, LEADTOOLS is a collection of comprehensive toolkits to integrate recognition, document, medical, imaging, and multimedia technologies into desktop, server, tablet, web and mobile solutions. | 2024-10-11 20:55:19 |
aspose-total-net | 24.9.0 | Aspose.Total for Python via .NET is a Document Processing Python class library that allows developers to work with Microsoft Word®, Microsoft PowerPoint®, Microsoft Outlook®, OpenOffice®, & 3D file formats without needing Office Automation. | 2024-09-30 13:50:27 |
groupdocs-watermark-net | 24.9.1 | GroupDocs.Watermark is a powerful document watermarking API that allows you to add image and text watermarks. Additionally, it can search for and remove watermarks that were added to the documents by third-party software. | 2024-09-14 10:50:36 |
groupdocs-metadata-net | 24.9 | GroupDocs.Metadata for Python via .NET is a robust document API that supports over 170 file types and enables developers to easily render files to various formats, such as PDF, HTML, JPG, or PNG. With this API, you can seamlessly render a wide range of file types, including popular OpenDocument and Microsoft Office formats like DOCX, XLSX, and PPTX, as well as specialized CAD and graphic editor files like DWG, DXF, PSD, AI, and CDR. | 2024-09-06 12:38:20 |
docxedit | 1.0.4 | Edit Word documents but keep original formatting | 2024-09-04 22:57:28 |
llama-index-readers-docugami | 0.2.0 | llama-index readers docugami integration | 2024-08-22 03:12:43 |