PyDigger - unearthing stuff about Python


Name | Version | Summary | Date
llm-data-converter | 2.2.0 | Best open-source document to markdown converter for LLM training data. Convert PDF, Word, PowerPoint, Excel, images, URLs to clean markdown, JSON, HTML locally. Alternative to Unstructured, Docling, Marker, MarkItDown, MinerU, PaddleOCR, Tesseract | 2025-07-25 13:32:07
markitdown-pdf-separators | 0.4.2 | MarkItDown with PDF page separators - convert PDFs to Markdown with page boundary markers | 2025-07-23 20:31:15
hour | day | week | total
93 | 1586 | 10317 | 303327