Open source file indexing & storage analytics powered by Elasticsearch
Lightweight .NET framework for fast web crawling and data scraping
Open source enterprise search server for websites, files, and data
A fast, high-level web crawling and web scraping framework
Cross platform GUI tool for downloading videos from Bilibili sites
The Unix-way web crawler
Python HTTP client with TLS and HTTP/2 fingerprint emulation support
Remote isolated browser API for security
Python tool for crawling and extracting structured data from news sites
Movie metadata scraper and organizer for media libraries and NFO
Open source Douyin crawler for collecting and downloading public data
Turn entire websites into LLM-ready markdown or structured data
Python crawler for collecting and downloading Sina Weibo user data
Realtime crawler for COVID-19 outbreak statistics from DXY data
Open source web scraping system for automated data collection tasks
AI-first Ruby framework for building fast, flexible web scraping spiders
AI-ready web crawler that extracts and structures website content
High-performance Rust web crawler and scraper for large-scale data
Fast CLI tool for cloning entire websites for local browsing offline
Collection of Python web scraping scripts for data extraction tasks
Lighter, faster Blink-based browser kernel for integrating HTML UI into apps
A powerful web scraper powered by LLMs (OpenAI, Gemini, and Ollama)
Lightweight Ruby DSL for scraping structured data from web pages
Declarative web scraping
Small event-delegation library for decoupling event binding and handling