1938 projects for "file text search" with 1 filter applied:

  • GR4VY: Payment Orchestration Platform Icon
    GR4VY: Payment Orchestration Platform

    Payment orchestration platform that connects PSPs, methods, and tools in one layer, streamlining payments and increasing success rates.

    Gr4vy’s payment orchestration platform empowers enterprise merchants and platforms to optimize their stack and create bespoke checkout experiences, giving you full control over your payment strategy.
    Learn More
  • Share your screen instantly while on a phone call with CrankWheel for an engaging presentation. Icon
    Share your screen instantly while on a phone call with CrankWheel for an engaging presentation.

    For salespeople and customer service agents who want to compliment their phone calls with visual elements.

    Our 10x simpler screen sharing tool is designed for you if you spend your days on the phone with clients, and need to add a visual presentation to close sales. No more scheduling a follow-up meeting, or teaching them to use a complex tool. Send them a text message or email, and they see your screen in seconds.
    Learn More
  • 1
    Kilo

    Kilo

    A text editor in less than 1000 LOC with syntax highlight and search

    Kilo is a minimalistic terminal text editor written in C, famous for fitting its full implementation into fewer than 1,000 lines of code in a single source file. It was created by Salvatore Sanfilippo (antirez, also known for Redis) as an exercise in writing a small, self-contained editor that others can study and extend. Despite its tiny size, Kilo supports core editor features like opening and saving files, incremental search, and basic syntax highlighting. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    Open Semantic Search

    Open Semantic Search

    Open source semantic search and text analytics for large document sets

    Open Semantic Search is an open source research and analytics platform designed for searching, analyzing, and exploring large collections of documents using semantic search technologies. It provides an integrated search server combined with a document processing pipeline that supports crawling, text extraction, and automated analysis of content from many different sources.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 3
    Text Embeddings Inference

    Text Embeddings Inference

    High-performance inference server for text embeddings models API layer

    Text Embeddings Inference is a high-performance server designed to serve text embedding models efficiently in production environments. It focuses on delivering fast and scalable embedding generation by leveraging optimized inference techniques and modern hardware acceleration. It is built to support transformer-based embedding models, making it suitable for tasks such as semantic search, clustering, and retrieval-augmented systems.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 4
    text-extract-api

    text-extract-api

    Document (PDF, Word, PPTX ...) extraction and parse API

    ...Instead of requiring developers to integrate multiple document parsing libraries individually, the system centralizes text extraction capabilities into a unified API that standardizes the output. The platform supports automated processing pipelines that detect file types and apply the appropriate extraction method to obtain the most accurate text representation possible. It can be integrated into document analysis systems, knowledge retrieval tools, and AI pipelines that rely on clean textual data. ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • A privacy-first API that predicts global consumer preferences Icon
    A privacy-first API that predicts global consumer preferences

    Qloo AI adds value to a wide range of Fortune 500 companies in the media, technology, CPG, hospitality, and automotive sectors.

    Through our API, we provide contextualized personalization and insights based on a deep understanding of consumer behavior and more than 575 million people, places, and things.
    Learn More
  • 5
    fess

    fess

    Open source enterprise search server for websites, files, and data

    Fess is an open source enterprise search server designed to provide powerful full-text search capabilities across multiple data sources. It enables organizations to quickly deploy a scalable search environment without requiring deep knowledge of underlying search technologies. Fess is built on top of OpenSearch and offers an integrated solution for crawling, indexing, and searching documents from websites, file systems, and various data stores. ...
    Downloads: 9 This Week
    Last Update:
    See Project
  • 6
    Swiss File Knife

    Swiss File Knife

    One hundred command line tools in a small and portable binary.

    Create zip files, extract zip files, replace text in files, search in files using expressions, stream text editor, instant command line ftp and http server, send folder via network, copy folder excluding sub folders and files, find duplicate files, run a command on all files of a folder, split and join large files, make md5 checksum lists of files, remove tab characters, convert CR/LF, list newest or biggest files of a folder, compare folders, treesize, show first or last lines of a file, find filenames fast using index files, rename many files using expressions, copy part of a file, change times of a file, set file time from filename, print colored text to terminal, convert csv to tab separated, download files from web, send http or udp requests, print tcp or udp traffic, create hexdump of files, join many text files into one, list nested .zip .tar .tar.gz .tar.bz2 archive contents. ...
    Leader badge
    Downloads: 495 This Week
    Last Update:
    See Project
  • 7
    clip-retrieval

    clip-retrieval

    Easily compute clip embeddings and build a clip retrieval system

    clip-retrieval is an open-source toolkit designed to build large-scale semantic search systems for images and text by leveraging CLIP embeddings to enable multimodal retrieval. It allows developers to compute embeddings for both images and text efficiently and then index them for fast similarity search across massive datasets. The system is optimized for performance and scalability, capable of processing tens or even hundreds of millions of embeddings using GPU acceleration. ...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 8
    OpenGrok

    OpenGrok

    Fast and usable source code search and cross reference engine

    OpenGrok is a fast and usable source code search and cross reference engine. It helps you search, cross-reference and navigate your source tree. It understands various program file formats and history from many Source Code Management systems. In other words it lets you grok (profoundly understand) source code and is developed in the open, hence the name OpenGrok. It is written in Java.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 9
    yt-fts

    yt-fts

    Search all of YouTube from the command line

    yt-fts, short for YouTube Full Text Search, is an open-source command-line tool that enables users to search the spoken content of YouTube videos by indexing their subtitles. The program automatically downloads subtitles from a specified YouTube channel using the yt-dlp utility and stores them in a local SQLite database. Once indexed, users can perform full-text searches across all transcripts to quickly locate keywords or phrases mentioned within the videos. ...
    Downloads: 15 This Week
    Last Update:
    See Project
  • Zendesk: The Complete Customer Service Solution Icon
    Zendesk: The Complete Customer Service Solution

    Discover AI-powered, award-winning customer service software trusted by 200k customers

    Equip your agents with powerful AI tools and workflows that boost efficiency and elevate customer experiences across every channel.
    Learn More
  • 10
    QMD

    QMD

    mini cli search engine for your docs, knowledge bases, etc.

    QMD is a powerful and lightweight command-line tool that acts as an on-device search engine for your personal knowledge base, allowing you to index and search files like Markdown notes, meeting transcripts, technical documentation, and other text collections without depending on cloud services. Designed to keep all search activity local, it combines classic full-text search techniques with modern semantic features such as vector similarity and hybrid ranking so that queries return not just literal matches but conceptually relevant results. ...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 11
    Sync Server

    Sync Server

    Secure, open-source platform for file storage, sharing, collaboration

    ...Sync-in supports real-time collaboration through integrations with office editors and activity tracking, and it enhances productivity with deep content search across a variety of file types and comprehensive document management capabilities. The platform is built with TypeScript and Node.js and is suitable for self-hosting on your own infrastructure using Docker or standard Node deployments.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 12
    tinysearch

    tinysearch

    Tiny, full-text search engine for static websites built with Rust

    tinysearch is a lightweight full-text search engine designed specifically for static websites, offering fast and efficient search capabilities without requiring a backend server. It is written in Rust and compiled to WebAssembly, allowing it to run entirely in the browser while maintaining a very small footprint. The engine uses compact data structures such as XOR filters to efficiently index and query text, significantly reducing memory usage compared to traditional search libraries. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    OpenArchiver

    OpenArchiver

    An open-source platform for legally compliant email archiving

    OpenArchiver is a comprehensive, self-hosted email archiving and compliance platform built to help organizations ingest, index, store, and search email communication data across diverse sources like Gmail, Microsoft 365, IMAP, PST, and more. It’s designed for scenarios where reliable, tamper-proof archiving and full-text search across both emails and attachments are essential for legal discovery, compliance, or long-term records retention. The platform combines a modern web UI with powerful backend services, including fast indexing, deduplication, encryption at rest, and asynchronous ingestion workflows, making it suitable for both small teams and enterprise deployments. ...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 14
    LLPlayer

    LLPlayer

    The media player for language learning, with dual subtitles

    ...Additional tools such as instant word lookup, contextual translation, and subtitle search allow learners to interact with the text while watching videos.
    Downloads: 43 This Week
    Last Update:
    See Project
  • 15
    Humanizer Skill

    Humanizer Skill

    Claude Code skill that removes signs of AI-generated writing from text

    Humanizer Skill is a utility library focused on transforming technical or machine-oriented text into expressions that are more natural, readable, and “human-friendly.” It provides a suite of algorithms that convert timestamps, identifiers, file sizes, code tokens, and structured data into phrases that resemble typical human phrasing rather than compact machine output. For example, date and time values can be expressed as relative terms (“two hours ago”), and file sizes can be shown in intuitive units instead of raw bytes. ...
    Downloads: 134 This Week
    Last Update:
    See Project
  • 16
    Alexandrie

    Alexandrie

    Web application for Markdown note taking

    ...It offers a structured note-taking experience with support for workspaces and categories, making it easy to organize large repositories of information intuitively. The application runs as a responsive web interface that works online or offline, with search and export features that help users retrieve and reuse their content effectively. Built with a clear focus on productivity, Alexandrie includes support for custom storage backends like S3-compatible services or local file systems via Docker Compose setups. Its extended Markdown syntax lets users embed rich content, multimedia references, and structured blocks that go beyond plain text, while the sleek UI encourages distraction-free writing and review.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Laravel Scout

    Laravel Scout

    Provides a driver based solution to searching your Eloquent models

    Laravel Scout provides a simple, driver-based solution for adding full-text search to your Eloquent models. Once Scout is installed and configured, it will automatically sync your model changes to your search indexes.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 18
    SemTools

    SemTools

    Semantic search and document parsing tools for the command line

    SemTools is an open-source command-line toolkit designed for document parsing, semantic indexing, and semantic search workflows. The project focuses on enabling developers and AI agents to process large document collections and extract meaningful semantic representations that can be searched efficiently. Built with Rust for performance and reliability, the toolchain provides fast processing of text and structured documents while maintaining low system overhead.
    Downloads: 13 This Week
    Last Update:
    See Project
  • 19
    FlagEmbedding

    FlagEmbedding

    Retrieval and Retrieval-augmented LLMs

    FlagEmbedding is an open-source toolkit for building and deploying high-performance text embedding models used in information retrieval and retrieval-augmented generation systems. The project is part of the BAAI FlagOpen ecosystem and focuses on creating embedding models that transform text into dense vector representations suitable for semantic search and large language model pipelines. FlagEmbedding includes a family of models known as BGE (BAAI General Embedding), which are designed to achieve strong performance across multilingual and cross-lingual retrieval benchmarks. ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 20
    Pixeltable

    Pixeltable

    Data Infrastructure providing an approach to multimodal AI workloads

    ...Developers define data transformations and AI operations using computed columns on tables, allowing pipelines to evolve incrementally as new data or models are added. The framework supports multimodal content including images, video, text, and audio, enabling applications such as retrieval-augmented generation systems, semantic search, and multimedia analytics.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 21
    react-highlight-words

    react-highlight-words

    React component to highlight words within a larger body of text

    react-highlight-words renders text with specified substrings highlighted, useful for search results, find-in-page UIs, and autocomplete menus. You pass a string (or node tree) and an array of search terms, and the component splits content into matched and unmatched chunks with minimal overhead. It supports case-sensitive or insensitive matching and optional auto-escaping so user-entered queries won’t be treated as regex.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    kg-gen

    kg-gen

    Knowledge Graph Generation from Any Text

    kg-gen is an open-source framework developed by the STAIR Lab that automatically generates knowledge graphs from unstructured text using large language models. The system is designed to transform plain text sources such as documents, articles, or conversation transcripts into structured graphs composed of entities and relationships. Instead of relying on traditional rule-based extraction techniques, KG-Gen uses language models to identify entities and their relationships, producing higher-quality graph structures from raw text.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    MemU

    MemU

    MemU is an open-source memory framework for AI companions

    MemU is an agentic memory layer for LLM applications, specifically designed for AI companions. Transform your memory into an intelligent file system that automatically organizes, connects, and evolves with your memories. Simple, fast, and reliable memory infrastructure for AI applications. Powerful tools and dedicated support to scale your AI applications with confidence. Full proprietary features, commercial usage rights, and white-labeling options for your enterprise needs. SSO/RBAC...
    Downloads: 26 This Week
    Last Update:
    See Project
  • 24
    Engram

    Engram

    A New Axis of Sparsity for Large Language Models

    Engram is a high-performance embedding and similarity search library focused on making retrieval-augmented workflows efficient, scalable, and easy to adopt by developers building search, recommendation, or semantic matching systems. It provides utilities to generate embeddings from text or other structured data, index them using efficient approximate nearest neighbor algorithms, and perform real-time similarity queries even on large corpora.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    OOO Extras

    OOO Extras

    Extra tools for OpenOffice under weak copyleft or other licenses

    A space to store classic OOo dependencies that cannot be easily redistributed in Apache OpenOffice's Source tree. Initially this was meant for copyleft tarballs only but it is also pretty handy to mirror other file dependencies.
    Downloads: 87 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • 5
  • Next
MongoDB Logo MongoDB