Showing 142 open source projects for "file text search"

  • 1
    Open Semantic Search

    Open source semantic search and text analytics for large document sets

    Open Semantic Search is an open source research and analytics platform designed for searching, analyzing, and exploring large collections of documents using semantic search technologies. It provides an integrated search server combined with a document processing pipeline that supports crawling, text extraction, and automated analysis of content from many different sources.
    Downloads: 5 This Week
  • 2
    Text Embeddings Inference

    High-performance inference server for text embedding models

    Text Embeddings Inference is a high-performance server designed to serve text embedding models efficiently in production environments. It focuses on delivering fast and scalable embedding generation by leveraging optimized inference techniques and modern hardware acceleration. It is built to support transformer-based embedding models, making it suitable for tasks such as semantic search, clustering, and retrieval-augmented systems.
    Downloads: 5 This Week
  • 3
    fess

    Open source enterprise search server for websites, files, and data

    Fess is an open source enterprise search server designed to provide powerful full-text search capabilities across multiple data sources. It enables organizations to quickly deploy a scalable search environment without requiring deep knowledge of underlying search technologies. Fess is built on top of OpenSearch and offers an integrated solution for crawling, indexing, and searching documents from websites, file systems, and various data stores. ...
    Downloads: 9 This Week
  • 4
    Databend

    Cloud-native open source data warehouse for analytics and AI queries

    ...This architecture enables cost-efficient storage and elastic scaling for workloads that involve large datasets and complex queries. Databend provides a unified engine capable of handling analytics, vector search, and full-text search within a single platform. Databend supports SQL-based workflows and enables real-time data ingestion, transformation, and analysis through streaming and task orchestration features. With its cloud-native design and distributed architecture, Databend can run either as a self-hosted system or within managed environments to power data analytics, AI workloads, and large-scale data.
    Downloads: 13 This Week
  • 5
    GooFuzz

    OSINT fuzzing tool using Google dorks to find exposed resources

    ...This method enables the discovery of potentially sensitive files, directories, subdomains, and parameters that are already exposed on the web. By combining wordlists, search operators, and file extension filters, the tool helps security professionals locate misconfigured or unintentionally exposed resources. GooFuzz is commonly used in penetration testing, reconnaissance, and bug bounty research where passive information gathering is important.
    Downloads: 5 This Week
  • 6
    Desktop Commander MCP

    AI-powered MCP server for desktop file and terminal automation

    Desktop Commander MCP is an advanced Model Context Protocol server designed to extend AI assistants with direct control over a user’s local machine, including the file system and terminal. It integrates with clients like Claude Desktop to enable AI-driven workflows such as editing files, executing commands, and automating development tasks from a single conversational interface. Desktop Commander MCP builds on top of an MCP filesystem server and enhances it with powerful search, replace, and code editing capabilities tailored for real-world development environments. ...
    Downloads: 8 This Week
  • 7
    yek

    Serialize repositories into LLM-ready context with smart prioritization

    ...Yek supports multiple directories, individual files, and glob patterns, making it flexible for different workflows. It can stream output when piped or save results to a temporary file, depending on usage. Configuration is handled through a yek.yaml file, allowing users to define ignore rules and priority settings. By consolidating code and documents into a single, ordered format, Yek simplifies preparing repositories for AI-driven analysis, debugging, or automation tasks.
    Downloads: 13 This Week
  • 8
    Generative AI for Beginners (Version 3)

    21 Lessons, Get Started Building with Generative AI

    Generative AI for Beginners is a 21-lesson course by Microsoft Cloud Advocates that teaches the fundamentals of building generative AI applications in a practical, project-oriented way. Lessons are split into “Learn” modules for core concepts and “Build” modules with hands-on code in Python and TypeScript, so you can jump in at any point that matches your goals. The course covers everything from model selection, prompt engineering, and chat/text/image app patterns to secure development...
    Downloads: 2 This Week
  • 9
    todo.txt-cli

    A simple and extensible shell script for managing your todo.txt file

    ...Todo.txt apps are minimal, todo.txt-focused editors which help you manage your tasks with as few keystrokes and taps as possible. Countless productivity apps and sites store your tasks in their own proprietary database and file format. But you can work with your todo.txt file in every text editor ever made, regardless of operating system or vendor. With a simple but powerful shell script called todo.sh, you can interact with todo.txt at the command line for quick and easy, Unix-y access.
    Downloads: 2 This Week
  • 10
    Rig

    Rust framework for building modular and scalable LLM-powered apps

    Rig is an open source Rust framework designed to help developers build modular and scalable applications powered by large language models. It provides a unified set of abstractions that allow applications to interact with many AI model providers and vector databases through a single interface. Its architecture emphasizes modularity, enabling developers to integrate only the components and integrations they need for a specific application. Rig includes built-in support for agent workflows,...
    Downloads: 6 This Week
  • 11
    Everywhere

    Context-aware desktop AI assistant that understands screen content

    Everywhere is a context-aware desktop AI assistant designed to interact directly with the content displayed on a user’s screen. It distinguishes itself from traditional AI tools by eliminating the need for manual input methods such as copying text or taking screenshots, instead allowing users to invoke assistance instantly through a shortcut. It can analyze on-screen information in real time and provide contextual responses, making it useful for tasks like troubleshooting errors, summarizing...
    Downloads: 4 This Week
  • 12
    IPRanges

    Daily updated lists of cloud, bot, and service IP ranges

    ...It also tracks IP ranges used by search engine bots and automated agents including Googlebot, Bingbot, and OpenAI’s GPTBot. Lists are published in both IPv4 and IPv6 formats and are regularly updated through automated processes to keep the data current. In addition to provider-specific lists, the project also offers merged datasets that aggregate ranges from multiple sources into a single file.
    Downloads: 1 This Week
  • 13
    goenv

    Like pyenv and rbenv, but for Go

    ...This project was cloned from pyenv and modified for Go. goenv lets you change the global Go version on a per-user basis, provides support for per-project Go versions, allows you to override the Go version with an environment variable, and searches commands from multiple versions of Go at a time. At a high level, goenv intercepts Go commands using shim executables injected into your PATH, determines which Go version has been specified by your application, and passes your commands along to the correct Go installation. When you run any Go command using go, your operating system searches through a list of directories to find an executable file with that name. ...
    Downloads: 1 This Week
  • 14
    AI File Sorter

    Local AI file organization with categorization and rename suggestions

    ...It can also analyze document text to improve categorization and renaming. Supported formats include PDF, DOCX, XLSX, PPTX, ODT, ODS, ODP, and common text files. For supported audio and video files, AI File Sorter can read embedded metadata (such as ID3, Vorbis, and MP4 tags) to suggest normalized names like year_artist_album_title.ext. AI analysis runs read-only, and all suggestions must be reviewed before being applied.
    Downloads: 247 This Week
  • 15
    RuoYi AI

    Enterprise AI platform for building, deploying, and managing apps

    RuoYi AI is a full-stack enterprise-oriented AI development platform designed to help developers rapidly build, deploy, and manage intelligent applications using modern large language models and AI ecosystems. It provides a unified framework for integrating multiple AI models from different providers, allowing teams to switch or combine models through a consistent interface without vendor lock-in. RuoYi AI includes built-in support for retrieval-augmented generation, enabling organizations...
    Downloads: 5 This Week
  • 16
    lynis

    Security auditing tool for Linux, macOS, and UNIX-based systems

    Lynis is a battle-tested security tool for systems running Linux, macOS, or Unix-based operating systems. It performs an extensive health scan of your systems to support system hardening and compliance testing. The project is open source software released under the GPL license and has been available since 2007. Since Lynis is flexible, it is used for several different purposes. Typical use cases for Lynis include security auditing, compliance testing (e.g. PCI, HIPAA, SOx), penetration testing, vulnerability...
    Downloads: 6 This Week
  • 17
    PasteMD

    Paste Markdown and AI responses into Word and Excel instantly

    PasteMD is a lightweight desktop utility designed to streamline the process of transferring formatted content from the clipboard into office applications such as Word, WPS, and Excel. It primarily targets users who frequently copy content from AI chat tools or web pages and encounter formatting issues, especially with Markdown, tables, and LaTeX formulas. PasteMD operates from the system tray and monitors clipboard content, automatically converting Markdown or HTML into properly formatted...
    Downloads: 10 This Week
  • 18
    DocFetcher

    Desktop search application

    DocFetcher is an open source desktop search application that lets you search the contents of files on your computer. You can think of it as Google for your local files. The application runs on Windows, Linux, and Mac OS X.
    Downloads: 2,903 This Week
  • 19
    GitGot

    Semi-automated tool for discovering exposed secrets in GitHub data

    ...During a search session, users review results and provide feedback that allows GitGot to filter out irrelevant or repetitive findings. This feedback is used to build blacklists that eliminate results based on repository names, file names, user names, or fuzzy matches of file content. The approach helps reduce noise while guiding the search process toward more relevant results.
    Downloads: 2 This Week
  • 20
    Kubetail

    Bash script to tail Kubernetes logs from multiple pods

    ...If you're on macOS, I recommend using iTerm2, which allows for continuous highlighting of search terms, good scrolling capabilities, and multitab arrangements. Another useful feature of iTerm2 is the "timeline" (cmd + shift + e), which lets you display a timeline in your own local timezone next to the logs (which are typically in UTC).
    Downloads: 0 This Week
  • 21
    yellowcard

    Address List program for tracking personal info in a plain text file

    Yellowcard is an address list maintenance program for keeping personal information in a plain text file. The reviews for the octfont project serve as a question and answer area for yellowcard and other josephms SourceForge projects. The author checks those reviews regularly as a method of support for yellowcard.
    Downloads: 5 This Week
  • 22
    UniversalTextExtractor

    Command-line toolset for extracting text from files

    Command-line toolset for extracting text from files (documents, images, archives) into SQLite with OCR support. Simple, expandable, one shell script only.
    Downloads: 0 This Week
  • 23
    RefDB

    RefDB is a reference database and bibliography tool for SGML, XML, and LaTeX documents, sort of a Reference Manager or BibTeX for markup languages. It is portable and known to run on Linux, Free/NetBSD, OSX, Solaris, and Windows/Cygwin.
    Downloads: 10 This Week
  • 24
    Script Echo Color

    Colorizes terminal text and simplifies script coding.

    ScriptEchoColor simplifies Linux terminal text colorizing, formatting and several steps of script coding.
    Downloads: 2 This Week
  • 25
    Nostalgic Photo DataBase (platform)

    Active repository of JPEG photos with tags, suitable for personal needs

    ...Powered by an embedded SQL database, NPDB delivers lightning-fast search results, ensuring that users can access the images they need almost instantaneously. Whether searching by tags or entering free-text queries, NPDB executes SQL queries efficiently, providing accurate results in no time. Developed primarily in C++, NPDB ensures seamless operation on Linux, Windows (via WSL), and Chrome OS.
    Downloads: 0 This Week