Name | Version | Summary | date |
docling |
2.45.0 |
SDK and CLI for parsing PDF, DOCX, HTML, and more, to a unified document representation for powering downstream workflows such as gen AI applications. |
2025-08-18 10:27:01 |
aspose-cells |
25.8.0 |
Aspose.Cells for Python via Java is a high-performance library that unleashes the full potential of Excel in your Python projects. It can be used to efficiently manipulate and convert Excel and spreadsheet formats including XLS, XLSX, XLSB, ODS, CSV, and HTML - all from your Python code. Amazingly, it also offers free support. |
2025-08-14 02:30:43 |
contextgem |
0.15.0 |
Effortless LLM extraction from documents |
2025-08-13 22:25:52 |
aspose-words-cloud |
25.8.0 |
Python Cloud SDK wraps Aspose.Words Cloud API so you could seamlessly integrate Microsoft Word file generation, manipulation, conversion & inspection features into your own python applications. |
2025-08-13 12:45:20 |
aerospot-autoreport |
1.1.3 |
AeroSpot自动化报告生成工具 |
2025-08-12 05:49:57 |
md-server |
0.1.2 |
HTTP API server for converting documents, web pages, and media to markdown |
2025-08-10 17:41:22 |
docuver |
0.1.0 |
A meta tool for version control of Office documents (docx, xlsx, pptx, odt, ods, odp) |
2025-08-10 17:05:51 |
docx-image-extractor-mcp |
1.2.1 |
A powerful DOC/DOCX image extractor with MCP protocol support for Claude Desktop integration |
2025-08-06 13:40:07 |
noteparser |
1.0.0 |
A comprehensive document parser for converting academic materials to Markdown and LaTeX |
2025-08-06 08:20:40 |
aspose-total-net |
25.7.0 |
Aspose.Total for Python via .NET is a Document Processing python class library that allows developers to work with Microsoft Word®, Microsoft PowerPoint®, Microsoft Outlook®, OpenOffice®, & 3D file formats without needing Office Automation. |
2025-08-05 23:32:27 |
leverparser |
0.1.0 |
A fast, standalone Python library for parsing resumes with high accuracy and zero external dependencies |
2025-08-03 03:29:14 |
aspose-cells-python |
25.7.4 |
Aspose.Cells for Python via .NET is a high-performance library that unleashes the full potential of Excel in your Python projects. It can be used to efficiently manipulate and convert Excel and spreadsheet formats including XLS, XLSX, XLSB, ODS, CSV, and HTML - all from your Python code. Amazingly, it also offers free support. |
2025-07-28 13:54:39 |
wdoc |
3.3.1 |
A perfect AI powered RAG for document query and summary. Supports ~all LLM and ~all filetypes (url, pdf, epub, youtube (incl playlist), audio, anki, md, docx, pptx, oe any combination!) |
2025-07-26 08:24:30 |
docx-mcp |
0.1.1 |
A MCP (Model Context Protocol) service for Word document processing, providing document structure extraction, content modification and file management capabilities. |
2025-07-24 12:13:03 |
aspose-html-net |
25.7.0 |
Aspose.HTML for Python via .NET is a powerful API for Python that provides a headless browser functionality, allowing you to work with HTML documents in a variety of ways. With this API, you can easily create new HTML documents or open existing ones from different sources. Once you have the document, you can perform various manipulation operations, such as removing and replacing HTML nodes. |
2025-07-23 15:49:34 |
asposepdfcloud |
25.7.0 |
Aspose.PDF Cloud |
2025-07-22 16:44:38 |
code-formatter-cli |
1.1.0 |
CLI tool to execute and format code output in txt, docx, or LaTeX formats |
2025-07-20 19:47:03 |
html-for-docx |
1.0.9 |
Convert HTML to Docx easily and fastly |
2025-07-18 15:34:23 |
unidoc-agent |
0.2.5 |
Universal Document Agent for extracting and analyzing various documents with Ollama support. |
2025-07-14 03:39:33 |
filemac |
1.1.9 |
Open source Python CLI toolkit for conversion, manipulation, analysis of files (All major file operations) |
2025-07-13 11:56:59 |