Showing 1822 open source projects for "sandbox:/mnt/data/project_plan.pod"

View related business solutions
  • Securden Privileged Account Manager Icon
    Securden Privileged Account Manager

    Unified Privileged Access Management

    Discover and manage administrator, service, and web app passwords, keys, and identities. Automate management with approval workflows. Centrally control, audit, monitor, and record all access to critical IT assets.
    Learn More
  • Network Management Software and Tools for Businesses and Organizations | Auvik Networks Icon
    Network Management Software and Tools for Businesses and Organizations | Auvik Networks

    Mapping, inventory, config backup, and more.

    Reduce IT headaches and save time with a proven solution for automated network discovery, documentation, and performance monitoring. Choose Auvik because you'll see value in minutes, and stay with us to improve your IT for years to come.
    Learn More
  • 1
    VibeKit

    VibeKit

    Run Claude Code, Gemini, Codex in a clean, isolated sandbox

    ...It provides a set of abstractions and utilities that let developers connect generative models to UI frameworks, sensors, event streams, and external services without having to build plumbing from scratch. Instead of treating AI models as black boxes behind simple prompts, Vibekit encourages developers to define declarative behaviors, reactive rules, and data flows that make the outputs of models part of living application logic. This can include things like dynamic content generation, live adaptation based on user interaction, and connectors to external APIs for enriched grounding. The toolkit also supports testing and local iteration, with utilities that simulate event streams and mock model responses to make development predictable.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    OCRmyPDF

    OCRmyPDF

    OCRmyPDF adds an OCR text layer to scanned PDF files

    OCRmyPDF adds an optical character recognition (OCR) text layer to scanned PDF files, allowing them to be searched. PDF is the best format for storing and exchanging scanned documents. Unfortunately, PDFs can be difficult to modify. OCRmyPDF makes it easy to apply image processing and OCR (recognized, searchable text) to existing PDFs.
    Downloads: 102 This Week
    Last Update:
    See Project
  • 3
    Arize Phoenix

    Arize Phoenix

    Uncover insights, surface problems, monitor, and fine tune your LLM

    Phoenix provides ML insights at lightning speed with zero-config observability for model drift, performance, and data quality. Phoenix is an Open Source ML Observability library designed for the Notebook. The toolset is designed to ingest model inference data for LLMs, CV, NLP and tabular datasets. It allows Data Scientists to quickly visualize their model data, monitor performance, track down issues & insights, and easily export to improve. Deep Learning Models (CV, LLM, and Generative) are an amazing technology that will power many of future ML use cases. ...
    Downloads: 14 This Week
    Last Update:
    See Project
  • 4
    n8n

    n8n

    Free and source-available fair-code licensed workflow automation tool

    n8n is an extendable workflow automation tool. With a fair-code distribution model, n8n will always have visible source code, be available to self-host, and allow you to add your own custom functions, logic and apps. n8n's node-based approach makes it highly versatile, enabling you to connect anything to everything. n8n has 200+ different nodes to automate workflows.
    Downloads: 835 This Week
    Last Update:
    See Project
  • Get full visibility and control over your tasks and projects with Wrike. Icon
    Get full visibility and control over your tasks and projects with Wrike.

    A cloud-based collaboration, work management, and project management software

    Wrike offers world-class features that empower cross-functional, distributed, or growing teams take their projects from the initial request stage all the way to tracking work progress and reporting results.
    Learn More
  • 5
    MiroFish

    MiroFish

    A Simple and Universal Swarm Intelligence Engine

    MiroFish is a next-generation artificial intelligence prediction engine that leverages multi-agent technology and swarm-intelligence simulation to model, simulate, and forecast complex real-world scenarios. The system extracts “seed” information from sources such as breaking news, policy documents, and market signals to construct a high-fidelity digital parallel world populated by thousands of virtual agents with independent memory and behavior rules. Users can inject variables or conditions...
    Downloads: 943 This Week
    Last Update:
    See Project
  • 6
    Quadratic

    Quadratic

    Data science spreadsheet with Python & SQL

    Quadratic enables your team to work together on data analysis to deliver better results, faster. You already know how to use a spreadsheet, but you’ve never had this much power before. Quadratic is a Web-based spreadsheet application that runs in the browser and as a native app (via Electron). Our goal is to build a spreadsheet that enables you to pull your data from its source (SaaS, Database, CSV, API, etc) and then work with that data using the most popular data science tools today (Python, Pandas, SQL, JS, Excel Formulas, etc). ...
    Downloads: 6 This Week
    Last Update:
    See Project
  • 7
    SDGym

    SDGym

    Benchmarking synthetic data generation methods

    The Synthetic Data Gym (SDGym) is a benchmarking framework for modeling and generating synthetic data. Measure performance and memory usage across different synthetic data modeling techniques – classical statistics, deep learning and more! The SDGym library integrates with the Synthetic Data Vault ecosystem. You can use any of its synthesizers, datasets or metrics for benchmarking.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 8
    FinMind

    FinMind

    Open Data, more than 50 financial data

    In the era of big data, data is the foundation of everything. We collect more than 50 kinds of Taiwan stock related information and provide download, online analysis, and backtesting. Regardless of the program, you can download data through the api provided by FinMind, or you can download data directly from the website. After data is available, statistical analysis, regression analysis, time series analysis, machine learning, and deep learning can be performed. ...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 9
    CTGAN

    CTGAN

    Conditional GAN for generating synthetic tabular data

    CTGAN is a collection of Deep Learning based synthetic data generators for single table data, which are able to learn from real data and generate synthetic data with high fidelity. If you're just getting started with synthetic data, we recommend installing the SDV library which provides user-friendly APIs for accessing CTGAN. The SDV library provides wrappers for preprocessing your data as well as additional usability features like constraints. ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • Data management solutions for confident marketing Icon
    Data management solutions for confident marketing

    For companies wanting a complete Data Management solution that is native to Salesforce

    Verify, deduplicate, manipulate, and assign records automatically to keep your CRM data accurate, complete, and ready for business.
    Learn More
  • 10
    pgvector

    pgvector

    Open-source vector similarity search for Postgres

    ...It has better query performance than IVFFlat (in terms of speed-recall tradeoff), but has slower build times and uses more memory. Also, an index can be created without any data in the table since there isn’t a training step like IVFFlat.
    Downloads: 75 This Week
    Last Update:
    See Project
  • 11
    Groq Desktop

    Groq Desktop

    Local Groq Desktop chat app with MCP support

    Groq Desktop is a cross-platform (Windows / macOS / Linux) local desktop application that provides a graphical chat interface for interacting with Groq-hosted, function-call-capable models. It bundles a built-in MCP (Model Context Protocol) server enabling smart function calling, letting users chat with an AI, send images, or interact with richer multimodal inputs — all from a friendly desktop UI. The app is built with modern web technologies and packaged for native distribution, making it...
    Downloads: 12 This Week
    Last Update:
    See Project
  • 12
    graphify

    graphify

    AI coding assistant skill (Claude Code, Codex, OpenCode, OpenClaw)

    ...Overall, graphify serves as a bridge between raw data and visual insight.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 13
    Dexter

    Dexter

    An autonomous agent for deep financial research

    Dexter is an autonomous agent tailored for deep financial research: you pose complex financial questions (for example, about a company’s revenue growth or financial ratios) and Dexter breaks them down into structured research tasks, fetches relevant real-time data (e.g. income statements, cash flows), performs analysis, and returns data-backed answers. It uses a multi-agent architecture with components such as a planning agent (to decompose queries), an action agent (to run tasks & fetch data), and self-validation mechanisms: after getting results, Dexter checks its own outputs and refines them until it is confident about its answer. ...
    Downloads: 24 This Week
    Last Update:
    See Project
  • 14
    Claude Code

    Claude Code

    Claude Code is an agentic coding tool that lives in your terminal

    ...Claude Code integrates seamlessly into your terminal, IDE, or GitHub by tagging @claude to interact with your code context. The tool is designed to simplify development by automating repetitive work and providing instant clarifications on code behavior. User feedback and usage data are collected responsibly, with strict privacy safeguards and limited retention, ensuring no feedback is used to train generative models. Claude Code is open and actively maintained with community-driven bug reporting and feature requests. Its natural language interface makes advanced coding workflows accessible without leaving your coding environment.
    Downloads: 216 This Week
    Last Update:
    See Project
  • 15
    ExtractThinker

    ExtractThinker

    ExtractThinker is a Document Intelligence library for LLMs

    ExtractThinker is a tool designed to facilitate the extraction and analysis of information from various data sources, aiding in data processing and knowledge discovery.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 16
    Diffgram

    Diffgram

    Training data (data labeling, annotation, workflow) for all data types

    From ingesting data to exploring it, annotating it, and managing workflows. Diffgram is a single application that will improve your data labeling and bring all aspects of training data under a single roof. Diffgram is world’s first truly open source training data platform that focuses on giving its users an unlimited experience. This is aimed to reduce your data labeling bills and increase your Training Data Quality.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 17
    Databend

    Databend

    Cloud-native open source data warehouse for analytics and AI queries

    Databend is an open source cloud-native data warehouse designed for large-scale analytics and modern data workloads. Built in Rust, the system focuses on high performance, scalability, and efficient data processing for analytical queries. It is designed with a separation of compute and storage, allowing compute nodes to scale independently while storing data in object storage systems.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 18
    ROOT

    ROOT

    Analyzing, storing and visualizing big data, scientifically

    ROOT is a unified software package for the storage, processing, and analysis of scientific data: from its acquisition to the final visualization in the form of highly customizable, publication-ready plots. It is reliable, performant and well supported, easy to use and obtain, and strives to maximize the quantity and impact of scientific results obtained per unit cost, both of human effort and computing resources. ROOT provides a very efficient storage system for data models, that demonstrated to scale at the Large Hadron Collider experiments: Exabytes of scientific data are written in columnar ROOT format. ...
    Downloads: 6 This Week
    Last Update:
    See Project
  • 19
    Memvid

    Memvid

    Video-based AI memory library. Store millions of text chunks in MP4

    Memvid encodes text chunks as QR codes within MP4 frames to build a portable “video memory” for AI systems. This innovative approach uses standard video containers and offers millisecond-level semantic search across large corpora with dramatically less storage than vector DBs. It's self-contained—no DB needed—and supports features like PDF indexing, chat integration, and cloud dashboards.
    Downloads: 28 This Week
    Last Update:
    See Project
  • 20
    Rust Port

    Rust Port

    The Rust workspace under rust/ is the current systems-language port

    ...The project emphasizes agent-based execution, where tasks are broken down into steps and handled iteratively, simulating how modern AI coding tools operate in production environments. It is often used as a sandbox for exploring how large-scale coding agents behave, including their decision-making processes, tool usage, and workflow orchestration. The system likely includes abstractions for handling file systems, executing commands, and maintaining context across sessions, allowing for more persistent and intelligent coding interactions.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 21
    GPT-SoVITS

    GPT-SoVITS

    1 min voice data can also be used to train a good TTS model

    GPT‑SoVITS is a state-of-the-art voice conversion and TTS system that enables zero‑shot and few‑shot synthesis based on a short vocal sample (e.g., 5 seconds). It supports cross‑lingual speech synthesis across English, Chinese, Japanese, Korean, Cantonese, and more. It's powered by VITS architecture enhanced for few‑sample adaptation and real‑time usability.
    Downloads: 60 This Week
    Last Update:
    See Project
  • 22
    Vespa

    Vespa

    The open big data serving engine

    Make AI-driven decisions using your data, in real-time. At any scale, with unbeatable performance. Vespa is a full-featured text search engine and supports both regular text search and fast approximate vector search (ANN). This makes it easy to create high-performing search applications at any scale, whether you want to use traditional techniques or a modern vector-based approach.
    Downloads: 9 This Week
    Last Update:
    See Project
  • 23
    Data Annotator for Machine Learning

    Data Annotator for Machine Learning

    Data annotator for machine learning

    Data annotator for machine learning allows you to centrally create, manage and administer annotation projects for machine learning. Data Annotator for Machine Learning (DAML) is an application that helps machine learning teams facilitate the creation and management of annotations. Active learning with uncertain sampling to query unlabeled data.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    DataChain

    DataChain

    AI-data warehouse to enrich, transform and analyze unstructured data

    ...The resulting datasets can be saved, versioned, and sent directly to PyTorch and TensorFlow for training. Datachain can persist features of Python objects returned by AI models, and enables vectorized analytical operations over them. The typical use cases are data curation, LLM analytics and validation, image segmentation, pose detection, and GenAI alignment. Datachain is especially helpful if batch operations can be optimized – for instance, when synchronous API calls can be parallelized or where an LLM API offers batch processing.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 25
    NannyML

    NannyML

    Detecting silent model failure. NannyML estimates performance

    NannyML is an open-source python library that allows you to estimate post-deployment model performance (without access to targets), detect data drift, and intelligently link data drift alerts back to changes in model performance. Built for data scientists, NannyML has an easy-to-use interface, and interactive visualizations, is completely model-agnostic, and currently supports all tabular classification use cases. NannyML closes the loop with performance monitoring and post deployment data science, empowering data scientist to quickly understand and automatically detect silent model failure. ...
    Downloads: 3 This Week
    Last Update:
    See Project