Motion-controllable Video Generation via Latent Trajectory Guidance
A tool to use the Ai2 Open Coding Agents Soft-Verified Agents
Hunyuan Translation Model Version 1.5
Persistent context and multi-instance coordination
Multimodal embedding and reranking models built on Qwen3-VL
SimpleMem: Efficient Lifelong Memory for LLM Agents
A New Axis of Sparsity for Large Language Models
"Big Model" trains a visual multimodal VLM with 26M parameters
Collection of Gemma 3 variants that are trained for performance
Language model reinforcement learning environments framework
Collection of reference environments for offline reinforcement learning
A simple, secure MCP-to-OpenAPI proxy server
Implementation of "MobileCLIP" (CVPR 2024)
Code release for Cut and Learn for Unsupervised Object Detection
High-resolution models for human tasks
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Documentation for Google's Gen AI site - including Gemini API & Gemma
Open-source MCP server that gives your coding agent
Free Tailwind CSS UI component library for modern web interfaces
14-stage Fusion Pipeline for LLM token compression
Open multimodal web agent built by Ai2
An MCP server for interacting with Google Colab
AI agent microservice
Open speech-to-speech models and pipelines by Hugging Face
A Personalized LLM-powered Agent Framework