Open source file indexing & storage analytics powered by Elasticsearch
Open source enterprise search server for websites, files, and data
Lightweight .NET framework for fast web crawling and data scraping
Library for extracting streaming site data without official APIs
A fast, high-level web crawling and web scraping framework
The unix-way web crawler
Python HTTP client with TLS and HTTP/2 fingerprint emulation support
Cross platform GUI tool for downloading videos from Bilibili sites
Python tool for crawling and extracting structured data from news site
Remote isolated browser API for security
A tool to scrape images from SimpCity
Movie metadata scraper and organizer for media libraries and NFO
Open source Douyin crawler for collecting and downloading public data
Turn entire websites into LLM-ready markdown or structured data
Python crawler for collecting and downloading Sina Weibo user data
All-in-one Python web reconnaissance tool for fast target analysis
Realtime crawler for COVID-19 outbreak statistics from DXY data
Desktop tool for collecting and exporting Xiaohongshu post data
Open source web scraping system for automated data collection tasks
AI-first Ruby framework for building fast, flexible web scraping spide
AI-ready web crawler that extracts and structures website content
Fast CLI tool for cloning entire websites for local browsing offline
High-performance Rust web crawler and scraper for large-scale data
Collection of Python web scraping scripts for data extraction tasks
Lighter, faster browser kernel of blink to integrate HTML UI in apps