High-level web crawling and scraping framework for Elixir apps
Easy Spider is a distributed Perl Web Crawler Project from 2006
Goutte, a simple PHP Web Scraper
Headless Chrome crawler for collecting URLs for vulnerability scans
Free Extracts Emails, Phones and custom text from Web using JAVA Regex
Python library providing APIs for automated website login workflows
Web crawler for archiving and backing up sites into WARC archives
A Smart, Automatic, Fast and Lightweight Web Scraper for Python
ML-based HTML scraper that learns extraction rules from examples
Simple Python framework for building multithreaded web crawlers
Intelligent proxy pool for collecting and managing public proxies
Instagram profile crawler that extracts posts, tags, and stats
Automated mobile app crawler and testing tool built on Appium
Ferret is a web scraping system
Fast and flexible C# framework for building customizable web crawlers
Gospider - Fast web spider written in Go
Polite concurrent web crawler library for Go with flexible hooks
Educational Python web scraping case collection for many sites
AST-based JavaScript reverse engineering and variable tracing toolkit
Async Python framework for fast and flexible web scraping spiders
Guide and resources for accessing and using the U3C3 BitTorrent site
Proxy crawler that aggregates, tests, and serves usable proxy nodes
Python tool for scraping search engine results from many providers
Shows the complex connection between musicians and their pupils
An Android rich text class library that supports graphic & text mixing