PyDigger - unearthing stuff about Python


Name Version Summary Date
llama-index-tools-scrapegraphai 0.2.1 llama-index tools integrating ScrapegraphAI 2025-09-08 20:47:59
juriscraper 2.6.89 An API to scrape American court websites for metadata. 2025-09-02 19:52:24
socio4health 0.1.6 Socio4health is a Python package for gathering and consolidating socio-demographic data. 2025-08-25 03:12:22
indo-scraper 1.0.0 Python library for easily scraping Indonesian websites 2025-08-03 14:06:33
pydantic-scrape 0.2.2 Advanced web automation framework with AI-powered agents, Chawan terminal browser integration, and geographic search targeting 2025-07-31 20:38:32
cloudflare-peek 0.1.0 A Python utility for scraping Cloudflare-protected websites using screenshot + OCR fallback 2025-07-27 16:41:12
journ4list 0.11.0 A powerful async news content extraction library with modern API for web scraping and article analysis 2025-07-24 21:57:44
hget-audio 2025.7.24a0 Comprehensive audio scraping tool for websites. 2025-07-24 12:47:05
hs-scraper-toolkit 1.1.0 A comprehensive toolkit for scraping high school data with school-specific modules 2025-07-22 15:45:25
Scrapinger 1.0.48 Scraping Support 2025-02-15 01:39:31
exfil-kit 0.1.0 A stealthy data extraction toolkit 2025-02-12 05:27:17
scrapeready 0.1.0 A Python client for the Scrapeready.com v1 API 2025-01-30 17:50:19
scrapfly-sdk 0.8.21 Scrapfly SDK for Scrapfly 2025-01-29 14:33:40
artix-news 0.3.0 Artix News scraper 2024-12-21 23:55:09
async-scrape 0.1.20 A package designed to scrape webpages using aiohttp and asyncio. Has some error handling to overcome common issues such as sites blocking you after n requests over a short period. 2024-12-08 21:29:56
undetected-geckodriver 1.0.7 A Firefox Selenium WebDriver that patches the browser to avoid detection. Bypasses services such as Cloudflare, Distil Networks, and more. Ideal for web scraping, automated testing, and bot development without getting detected. 2024-11-20 19:39:27
hext 1.0.12 A module and command-line utility to extract structured data from HTML 2024-11-03 09:50:56
woob 3.7 Woob, Web Outside Of Browsers 2024-10-29 17:08:17
web2vec 0.1.3 Website to vector representation library 2024-10-22 05:48:03
subweb 2.0.1 A package for scanning subdomains and collecting website information. 2024-10-16 17:26:46