OCRmyPDF adds an OCR text layer to scanned PDF files
A GUI tool for extracting hard-coded subtitle (hardsub) from videos
Ready-to-use OCR with 80+ supported languages
Open Source Document Management System for Digital Archives
e-Dokyumento is web-based Document Management System (DMS)
A supercharged version of paperless, scan, index and archive docs
Typeface from Ming Dynasty woodblock printed books
Easy-OCR solution and Tesseract trainer for GNU/Linux
The tool supports template-based parsing, allowing structured output i