PyTorch code and models for the DINOv2 self-supervised learning
Pokee Deep Research Model Open Source Repo
Tooling for the Common Objects In 3D dataset
Renderer for the harmony response format to be used with gpt-oss
Stable Diffusion with Core ML on Apple Silicon
Chat & pretrained large audio language model proposed by Alibaba Cloud
Collection of Gemma 3 variants that are trained for performance
LTX-Video Support for ComfyUI
Official code base for LeWorldModel: Stable End-to-End Joint-Embedding
The official PyTorch implementation of Google's Gemma models
Open Source Speech Language Model
Tool for exploring and debugging transformer model behaviors
Ling-V2 is a MoE LLM provided and open-sourced by InclusionAI
State-of-the-art Image & Video CLIP, Multimodal Large Language Models
MapAnything: Universal Feed-Forward Metric 3D Reconstruction
OCR expert VLM powered by Hunyuan's native multimodal architecture
StudioOllamaUI is a local, portable interface for Ollama
Release for Improved Denoising Diffusion Probabilistic Models
AI Suite for upscaling, interpolating & restoring images/videos
AI-powered tool to quickly remove watermarks from images flawlessly
Open Multilingual Multimodal Chat LMs
Chinese LLaMA-2 & Alpaca-2 Large Model Phase II Project
Official repo for consistency models
800,000 step-level correctness labels on LLM solutions to MATH problems
Chinese LLaMA & Alpaca large language model + local CPU/GPU training