ML-based HTML scraper that learns extraction rules from examples
Simple Python framework for building multithreaded web crawlers
Intelligent proxy pool for collecting and managing public proxies
Instagram profile crawler that extracts posts, tags, and stats
Automated mobile app crawler and testing tool built on Appium
Ferret is a web scraping system
Fast and flexible C# framework for building customizable web crawlers
Gospider - Fast web spider written in Go
Polite concurrent web crawler library for Go with flexible hooks
Educational Python web scraping case collection for many sites
AST-based JavaScript reverse engineering and variable tracing toolkit
Async Python framework for fast and flexible web scraping spiders
Guide and resources for accessing and using the U3C3 BitTorrent site
Proxy crawler that aggregates, tests, and serves usable proxy nodes
Python tool for scraping search engine results from many providers
Collection of Python ecommerce and website crawler examples projects
The next web scraper, see through the <html> noise
WebExtractServer use with WebExtractLte for use with web browsers
SEO Macroscope is a website scanning tool, to check your website
Creating Scrapy scrapers via the Django admin interface
Python crawler that downloads image galleries and analyzes titles
Twitter Intelligence OSINT project performs tracking and analysis
Ever wanted to download only a part of a Git repository.
Python library to crawl and retrieve data from WeChat accounts
A powerful Spider(Web Crawler) system in Python