Agentic, Reasoning, and Coding (ARC) foundation models
Claude Code skill for generating production-quality SVG+PNG technical
Advanced language and coding AI model
Kimi Code CLI is your next CLI agent
LLM-based agent for general purpose software engineering tasks
Open-Sora: Democratizing Efficient Video Production for All
A nearly-live implementation of OpenAI's Whisper
Code for running inference with the SAM 3D Body Model 3DB
Contexts Optical Compression
Implementation of TurboQuant (ICLR 2026)
AI video generator optimized for low-VRAM and older GPUs
Multilingual speech recognition and audio understanding model
A robust, efficient, low-latency speech-to-text library
Audiocraft is a library for audio processing and generation
Python observability platform for tracing apps, metrics, and logs
Python library and CLI tool to interface with Google Translate
Long-term memory OS for AI with structured recall and context awareness
Open-source AI agent framework
Framework for building real-time voice and multimodal AI agents
Fast backend for long-term AI user memory via structured profiles
The Python code to reproduce illustrations from Machine Learning Book
Tokenizer-Free TTS for Multilingual Speech Generation
AI-powered Jupyter spreadsheet that converts workflows into Python
Anomaly detection related books, papers, videos, and toolboxes
A TTS that fits in your CPU (and pocket)