Name | Version | Summary | date |
llama-index-tools-scrapegraphai |
0.2.1 |
llama-index tools integrating ScrapegraphAI |
2025-09-08 20:47:59 |
juriscraper |
2.6.89 |
An API to scrape American court websites for metadata. |
2025-09-02 19:52:24 |
socio4health |
0.1.6 |
Socio4health is a Python package for gathering and consolidating socio-demographic data. |
2025-08-25 03:12:22 |
indo-scraper |
1.0.0 |
Library Python untuk scraping website Indonesia dengan mudah |
2025-08-03 14:06:33 |
pydantic-scrape |
0.2.2 |
Advanced web automation framework with AI-powered agents, Chawan terminal browser integration, and geographic search targeting |
2025-07-31 20:38:32 |
cloudflare-peek |
0.1.0 |
A Python utility for scraping Cloudflare-protected websites using screenshot + OCR fallback |
2025-07-27 16:41:12 |
journ4list |
0.11.0 |
A powerful async news content extraction library with modern API for web scraping and article analysis |
2025-07-24 21:57:44 |
hget-audio |
2025.7.24a0 |
Comprehensive audio scraping tool for websites. |
2025-07-24 12:47:05 |
hs-scraper-toolkit |
1.1.0 |
A comprehensive toolkit for scraping high school data with school-specific modules |
2025-07-22 15:45:25 |
Scrapinger |
1.0.48 |
Scraping Support |
2025-02-15 01:39:31 |
exfil-kit |
0.1.0 |
A stealthy data extraction toolkit |
2025-02-12 05:27:17 |
scrapeready |
0.1.0 |
A Python client for the Scrapeready.com v1 API |
2025-01-30 17:50:19 |
scrapfly-sdk |
0.8.21 |
Scrapfly SDK for Scrapfly |
2025-01-29 14:33:40 |
artix-news |
0.3.0 |
Artix News scraper |
2024-12-21 23:55:09 |
async-scrape |
0.1.20 |
A package designed to scrape webpages using aiohttp and asyncio. Has some error handling to overcome common issues such as sites blocking you after n requests over a short period. |
2024-12-08 21:29:56 |
undetected-geckodriver |
1.0.7 |
A Firefox Selenium WebDriver that patches the browser to avoid detection. Bypasses services such as Cloudflare, Distil Networks, and more. Ideal for web scraping, automated testing, and bot development without getting detected. |
2024-11-20 19:39:27 |
hext |
1.0.12 |
A module and command-line utility to extract structured data from HTML |
2024-11-03 09:50:56 |
woob |
3.7 |
Woob, Web Outside Of Browsers |
2024-10-29 17:08:17 |
web2vec |
0.1.3 |
Website to vector representation library |
2024-10-22 05:48:03 |
subweb |
2.0.1 |
A package for scanning subdomains and collecting website information. |
2024-10-16 17:26:46 |