800,000 step-level correctness labels on LLM solutions to MATH problems
Chinese LLaMA & Alpaca large language models with local CPU/GPU training
Repo for external large-scale work
PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech)
Locally run an Instruction-Tuned Chat-Style LLM
Learning to Act by Watching Unlabeled Online Videos
PyTorch implementation of MAE
An implementation of model parallel GPT-2 and GPT-3-style models
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)
Large-scale autoregressive pixel model for image generation by OpenAI
Generate embeddings from large-scale graph-structured data
A library for Multilingual Unsupervised or Supervised word Embeddings
Dual LSTM Encoder for Dialog Response Generation
CLIP ViT-bigG/14: Zero-shot image-text model trained on LAION-2B
Vision-language-action model for robot control via images and text
Tiny pre-trained IBM model for multivariate time series forecasting
Metric monocular depth estimation (vision model)
CLIP model fine-tuned for zero-shot fashion product classification
VaultGemma: 1B DP-trained Gemma variant for private NLP tasks
Qwen3-Next: 80B instruct LLM with ultra-long context up to 1M tokens
Grok-2.5: Large-scale xAI model for local inference with SGLang
Powerful 14B LLM with strong instruction and long-text handling
Robust BERT-based model for English with improved MLM training
Qwen2.5-VL-3B-Instruct: Multimodal model for chat, vision & video
Portuguese ASR model fine-tuned from XLSR-53 for 16 kHz audio input