JavaScript OCR and text extraction for images and PDFs
Video-based AI memory library. Store millions of text chunks in MP4
Reading book source
The book documenting the curl project, the curl tool, libcurl
Revolutionizing Database Interactions with Private LLM Technology
PDF Indexing Script: Searches PDF for words, records page numbers
Document Index for Vectorless, Reasoning-based RAG
MD/.JSON Document OCR and structured data extraction API
Open source semantic search and text analytics for large document sets
A Powerful Desktop Full-Text Search Engine, Just Like Local Google.
Hypertext-infused philosophy personal database software
Search text or a regular expression in multiple documents
Modify a manual filterwheel and add stepper motor and Arduino
AI-powered semantic indexing: automating the creation of book indexes
a light OPDS/HTML server indexing EPUB and PDF files
This is a simple GUI for the command line tool grep and pdfgrep
Node.js module for rendering pdf pages to images, svgs and HTML files
Elasticsearch File System Crawler (FS Crawler)
A supercharged version of paperless, scan, index and archive docs
An open source search engine with RESTFul API and crawlers
C# class library for processing OpenStreetMap data
The study environment of ancient languages (Coptic, Greek, Latin)
IFile, PHP based framework for indexing and search in the documents
Personalized Search Engine for Your Files