Run Claude Code, Gemini, Codex in a clean, isolated sandbox
OCRmyPDF adds an OCR text layer to scanned PDF files
Uncover insights, surface problems, monitor, and fine tune your LLM
Free and source-available fair-code licensed workflow automation tool
A Simple and Universal Swarm Intelligence Engine
Data science spreadsheet with Python & SQL
Benchmarking synthetic data generation methods
Open Data, more than 50 financial data
Conditional GAN for generating synthetic tabular data
Open-source vector similarity search for Postgres
Local Groq Desktop chat app with MCP support
AI coding assistant skill (Claude Code, Codex, OpenCode, OpenClaw)
An autonomous agent for deep financial research
Claude Code is an agentic coding tool that lives in your terminal
ExtractThinker is a Document Intelligence library for LLMs
Training data (data labeling, annotation, workflow) for all data types
Cloud-native open source data warehouse for analytics and AI queries
Analyzing, storing and visualizing big data, scientifically
Video-based AI memory library. Store millions of text chunks in MP4
The Rust workspace under rust/ is the current systems-language port
1 min voice data can also be used to train a good TTS model
The open big data serving engine
Data annotator for machine learning
AI-data warehouse to enrich, transform and analyze unstructured data
Detecting silent model failure. NannyML estimates performance