A high-performance image compression microservice based on MCP
Deep learning optimization library: makes distributed training easy
Neural Network Compression Framework for enhanced OpenVINO
Wan2.2: Open and Advanced Large-Scale Video Generative Model
Technical principles related to large models
The highest-scoring AI memory system ever benchmarked
48khz stereo neural audio codec for general audio
Unified KV Cache Compression Methods for Auto-Regressive Models
Implementation of TurboQuant (ICLR 2026)
Lets make video diffusion practical
Redundancy-aware KV Cache Compression for Reasoning Models
Claude Code plugin that automatically captures everything Claude does
LMDeploy is a toolkit for compressing, deploying, and serving LLMs
14-stage Fusion Pipeline for LLM token compression
AIMET is a library that provides advanced quantization and compression
Build your chatbot within minutes on your favorite device
Koog is the official Kotlin framework for building AI agents
Awesome multilingual OCR toolkits based on PaddlePaddle
SOTA discrete acoustic codec models with 40/75 tokens per second
Data and tools for generating and inspecting OLMo pre-training data
Running large language models on a single GPU
A tension reasoning engine over 131 S-class problems
AI gateway with token compression for Claude Code, Codex, and more
Data Lake for Deep Learning. Build, manage, and query datasets
iOS/Android image picker with support for camera, video, etc.