PyDigger - unearthing stuff about Python


NameVersionSummarydate
ocrmypdf 16.11.0 OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched 2025-09-12 08:36:53
surya-ocr 0.16.7 OCR, layout, reading order, and table recognition in 90+ languages 2025-09-08 16:41:25
ocr-tamil 0.4.1 Python Tamil OCR package 2025-09-06 21:55:45
clown_sort 1.13.4 Sort screenshots based on rules or through individual review. 2025-09-06 21:43:41
rapidocr 3.4.0 Awesome OCR Library 2025-09-06 10:05:32
aspose-ocr-python-net 25.8.0 Aspose.OCR for Python is a powerful yet easy-to-use and cost-effective API for extracting text from scanned images, photos, screenshots, PDF documents, and other files. 2025-09-02 19:53:32
dddocr-py 0.1.0 Python client for the 3DOCR.com OCR API 2025-08-30 19:16:43
ddddocr-unofficial 1.6.0 Unofficial PyPI release of ddddocr v1.6.0. This package provides an up-to-date, installable version of the original work by sml2h3. 2025-08-29 23:02:38
kraken 6.0.0 OCR/HTR engine for all the languages 2025-08-29 21:11:39
johnsnowlabs-for-databricks 6.1.0 The John Snow Labs Library gives you access to all of John Snow Labs Enterprise And Open Source products in an easy and simple manner. Access 10000+ state-of-the-art NLP and OCR models for Finance, Legal and Medical domains. Easily scalable to Spark Cluster 2025-08-28 00:17:07
johnsnowlabs 6.1.0 The John Snow Labs Library gives you access to all of John Snow Labs Enterprise And Open Source products in an easy and simple manner. Access 10000+ state-of-the-art NLP and OCR models for Finance, Legal and Medical domains. Easily scalable to Spark Cluster 2025-08-28 00:17:06
actscene-ocr 0.1.5 Actscene OCR: 日本語書類向けの包括的OCRパイプライン (PaddleOCRベース) 2025-08-19 18:51:14
upspawn-ocr-cli 0.1.0b3 Modern, polished CLI to extract text from PDFs using the Mistral OCR API. 2025-08-15 23:24:29
hashub-docapp 1.0.0 Professional Python SDK for the HashubDocApp API - Advanced OCR, document conversion, and text extraction service 2025-08-15 12:09:58
doctr-synth-generator 0.0.1 A synthetic data generator for training OCR models 2025-08-15 08:48:29
onnxtr 0.8.0 Onnx Text Recognition (OnnxTR): docTR Onnx-Wrapper for high-performance OCR on documents. 2025-08-13 08:18:25
rapidocr-web 1.0.0 The Web version of RapidOCR 2025-07-30 14:50:38
ocr-document-converter 3.1.0 Enterprise-grade OCR and document conversion tool with dual OCR engines 2025-07-22 15:19:03
phocr 1.0.2 High-Performance OCR Toolkit 2025-07-17 07:41:32
paddleocr-convert 0.1.0 Tool for converting the PaddleOCR model to onnx format. 2025-07-15 14:25:04
hourdayweektotal
6923025937325331
Elapsed time: 3.49133s