Open Source OCR Engine
A pure Javascript Multilingual OCR
OCR software, free and offline
Accurate × Fast × Comprehensive
Enhances Tesseract OCR output using LLMs (local or API)
Contexts Optical Compression
Visual Causal Flow
OCRmyPDF adds an OCR text layer to scanned PDF files
Fast and efficient unstructured data extraction
Awesome multilingual OCR toolkits based on PaddlePaddle
OCR offline image text recognition command line windows program
Screenshots, word marking, OCR, AI, translation software
A community-supported supercharged version of paperless
System tool for beginners wanting agentic engineering capabilities
Ready-to-use OCR with 80+ supported languages
A high-quality tool for convert PDF to Markdown and JSON
Library for OCR-related tasks powered by Deep Learning
A cross-platform software for text translation and recognition
Free OCR Software: No internet required, easy to use.
JavaScript OCR and text extraction for images and PDFs
A GUI tool for extracting hard-coded subtitle (hardsub) from videos
Deep Learning API and Server in C++14 support for Caffe, PyTorch
OCR expert VLM powered by Hunyuan's native multimodal architecture
Use LLMs and LLM Vision (OCR) to handle paperless-ngx