PyDigger - unearthing stuff about Python


NameVersionSummarydate
unstructured-expanded 0.16.11.post2 Expansion to the unstructured package, adding support for image extraction. 2024-12-21 22:43:01
docling 2.14.0 SDK and CLI for parsing PDF, DOCX, HTML, and more, to a unified document representation for powering downstream workflows such as gen AI applications. 2024-12-18 07:05:40
aspose-cells-gridjs-net-python 24.12.0 a lightweight, scalable, and customizable toolkit that provides cross-platform web applications, enables convenient development for editing or viewing Excel/Spreadsheet files, offers simple deployment, and provides easy-to-use APIs. 2024-12-16 09:33:48
groupdocs-comparison-net 24.12 GroupDocs.Comparison for Python via .NET is a powerful API to compare over 50 types of documents and images, including all Microsoft Office and OpenDocument file formats, PDF documents, raster images (TIFF, JPEG, GIF, PNG, BMP). Retrieve the list of changes in the desired format with a line-by-line comparison of content, paragraphs, characters, styles, shapes, and position. 2024-12-13 13:20:22
aspose-cells 24.12.1 A powerful library for manipulating and converting Excel (XLS, XLSX, XLSB), ODS, CSV and HTML files. 2024-12-12 04:33:14
aspose-words-cloud 24.12.0 Python Cloud SDK wraps Aspose.Words Cloud API so you could seamlessly integrate Microsoft Word file generation, manipulation, conversion & inspection features into your own python applications. 2024-12-10 09:10:08
wdoc 2.4.16 A perfect AI powered RAG for document query and summary. Supports ~all LLM and ~all filetypes (url, pdf, epub, youtube (incl playlist), audio, anki, md, docx, pptx, oe any combination!) 2024-12-05 19:02:38
pylibreoffice 0.1.3 A Python library for handling Microsoft Office documents, built with LibreOfficeKit. 2024-12-02 03:08:47
aspose-html-net 24.11.0 Aspose.HTML for Python via .NET is a powerful API for Python that provides a headless browser functionality, allowing you to work with HTML documents in a variety of ways. With this API, you can easily create new HTML documents or open existing ones from different sources. Once you have the document, you can perform various manipulation operations, such as removing and replacing HTML nodes. 2024-11-30 08:11:44
asposepdfcloud 24.11.0 Aspose.PDF Cloud 2024-11-22 08:12:03
aspose-cells-python 24.11.0 A powerful library for manipulating and converting Excel (XLS, XLSX, XLSB), CSV, ODS, PDF, JSON, JPG, PNG, BMP, EMF, SVG and HTML files. 2024-11-13 12:13:34
groupdocs-signature-net 24.11.0 File converter for the most commonly used formats, including DOCX, PDF, CAD, and many more. 2024-11-12 18:42:47
groupdocs-conversion-net 24.11 File converter for the most commonly used formats, including DOCX, PDF, CAD, and many more. 2024-11-11 14:59:24
doc-master 0.0.2 Paper - Pytorch 2024-11-07 20:52:35
leadtools 23.0.0.3 Powered by patented artificial intelligence and machine learning algorithms, LEADTOOLS is a collection of comprehensive toolkits to integrate recognition, document, medical, imaging, and multimedia technologies into desktop, server, tablet, web and mobile solutions. 2024-10-11 20:55:19
aspose-total-net 24.9.0 Aspose.Total for Python via .NET is a Document Processing python class library that allows developers to work with Microsoft Word®, Microsoft PowerPoint®, Microsoft Outlook®, OpenOffice®, & 3D file formats without needing Office Automation. 2024-09-30 13:50:27
groupdocs-watermark-net 24.9.1 GroupDocs.Watermark is a powerful document watermarking API that allows to add image and text watermarks. Additionally, it can search and remove the watermarks which were added to the documents by other third-party software. 2024-09-14 10:50:36
groupdocs-metadata-net 24.9 GroupDocs.Metadata for Python via .NET is a robust document API that supports over 170 file types and enables developers to easily render files to various formats, such as PDF, HTML, JPG, or PNG. With this API, you can seamlessly render a wide range of file types, including popular OpenDocument and Microsoft Office formats like DOCS, XLSX, and PPTX, as well as specialized CAD and graphic editor files like DWG, DXF, PSD, AI, and CDR. 2024-09-06 12:38:20
docxedit 1.0.4 Edit Word documents but keep original formatting 2024-09-04 22:57:28
llama-index-readers-docugami 0.2.0 llama-index readers docugami integration 2024-08-22 03:12:43
hourdayweektotal
3311009535274549
Elapsed time: 1.43745s