Open Multilingual Multimodal Chat LMs
Encoder of greater-than-word length text trained on a variety of data
Chinese LLaMA-2 & Alpaca-2 Large Model Phase II Project
Official repo for consistency models
800,000 step-level correctness labels on LLM solutions to MATH problem
Repo for external large-scale work
Learning to Act by Watching Unlabeled Online Videos
PyTorch implementation of MAE
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)
Generate embeddings from large-scale graph-structured data
A library for Multilingual Unsupervised or Supervised word Embeddings
Dual LSTM Encoder for Dialog Response Generation
CLIP ViT-bigG/14: Zero-shot image-text model trained on LAION-2B
Vision-language-action model for robot control via images and text
Tiny pre-trained IBM model for multivariate time series forecasting
Metric monocular depth estimation (vision model)
CLIP model fine-tuned for zero-shot fashion product classification
VaultGemma: 1B DP-trained Gemma variant for private NLP tasks
Qwen3-Next: 80B instruct LLM with ultra-long context up to 1M tokens
Large-scale xAI model for local inference with SGLang, Grok-2.5
Powerful 14B LLM with strong instruction and long-text handling
Robust BERT-based model for English with improved MLM training
Qwen2.5-VL-3B-Instruct: Multimodal model for chat, vision & video
Portuguese ASR model fine-tuned on XLSR-53 for 16kHz audio input
High-performance MoE model with MLA, MTP, and multilingual reasoning