PyDigger - unearthing stuff about Python


Name Version Summary Date
journ4list 0.11.0 A powerful async news content extraction library with modern API for web scraping and article analysis 2025-07-24 21:57:44
hget-audio 2025.7.24a0 Comprehensive audio scraping tool for websites. 2025-07-24 12:47:05
pydantic-scrape 0.1.2 A modular web scraping framework using pydantic-ai and pydantic-graph with intelligent caching 2025-07-23 11:55:16
hs-scraper-toolkit 1.1.0 A comprehensive toolkit for scraping high school data with school-specific modules 2025-07-22 15:45:25
juriscraper 2.6.80 An API to scrape American court websites for metadata. 2025-07-16 18:33:45
socio4health 0.1.3 Socio4health is a Python package for gathering and consolidating socio-demographic data. 2025-07-14 17:46:04
Scrapinger 1.0.48 Scraping Support 2025-02-15 01:39:31
exfil-kit 0.1.0 A stealthy data extraction toolkit 2025-02-12 05:27:17
llama-index-tools-scrapegraphai 0.1.1 llama-index tools integrating ScrapegraphAI 2025-02-05 20:59:30
scrapeready 0.1.0 A Python client for the Scrapeready.com v1 API 2025-01-30 17:50:19
scrapfly-sdk 0.8.21 Scrapfly SDK for Scrapfly 2025-01-29 14:33:40
raggy 0.2.6 scraping stuff 2024-12-22 00:28:35
artix-news 0.3.0 Artix News scraper 2024-12-21 23:55:09
async-scrape 0.1.20 A package designed to scrape webpages using aiohttp and asyncio. Has some error handling to overcome common issues such as sites blocking you after n requests over a short period. 2024-12-08 21:29:56
undetected-geckodriver 1.0.7 A Firefox Selenium WebDriver that patches the browser to avoid detection. Bypasses services such as Cloudflare, Distil Networks, and more. Ideal for web scraping, automated testing, and bot development without getting detected. 2024-11-20 19:39:27
hext 1.0.12 A module and command-line utility to extract structured data from HTML 2024-11-03 09:50:56
woob 3.7 Woob, Web Outside Of Browsers 2024-10-29 17:08:17
web2vec 0.1.3 Website to vector representation library 2024-10-22 05:48:03
subweb 2.0.1 A package for scanning subdomains and collecting website information. 2024-10-16 17:26:46
torsel 0.4.31 A Python module for managing Tor instances with Selenium 2024-09-08 19:20:46