Fast and flexible C# framework for building customizable web crawlers
Gospider - Fast web spider written in Go
Polite concurrent web crawler library for Go with flexible hooks
Educational Python web scraping case collection for many sites
AST-based JavaScript reverse engineering and variable tracing toolkit
Async Python framework for fast and flexible web scraping spiders
Guide and resources for accessing and using the U3C3 BitTorrent site
Proxy crawler that aggregates, tests, and serves usable proxy nodes
Python tool for scraping search engine results from many providers
Collection of Python ecommerce and website crawler example projects
The next web scraper. See through the <html> noise
WebExtractServer, for use with WebExtractLte in web browsers
SEO Macroscope is a website scanning tool for checking your website
Creating Scrapy scrapers via the Django admin interface
Python crawler that downloads image galleries and analyzes titles
Twitter Intelligence, an OSINT project that performs tracking and analysis
Ever wanted to download only a part of a Git repository?
Python library to crawl and retrieve data from WeChat accounts
A powerful spider (web crawler) system in Python
Open source web crawler for Java
Distributed proxy IP pool for web crawlers using Scrapy and Redis
Asyncio-based Python framework for building fast web crawling spiders
Convert websites into structured APIs automatically with Python tool
Lightweight Java web crawler framework with jQuery-style extraction
Perl Web Scraping Project