Strong, Economical, and Efficient Mixture-of-Experts Language Model
Production-tested AI infrastructure tools
Open-Source Financial Large Language Models
A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming
Flux 2 image generation model pure C inference
Qwen2.5-VL is the multimodal large language model series
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training
PyTorch implementation of JiT
Mixture-of-Experts Vision-Language Models for Advanced Multimodal
Scaling Reinforcement Learning with LLMs
RGBD video generation model conditioned on camera input
An experimental version of DeepSeek model
Reference PyTorch implementation and models for DINOv3
Towards Real-World Vision-Language Understanding
Foundation Models for Time Series
FAIR Sequence Modeling Toolkit 2
An AI-powered security review GitHub Action using Claude
The ChatGPT Retrieval Plugin lets you easily find personal documents
Qwen3-Coder is the code version of Qwen3
My personal Claude Code configuration
NVIDIA Isaac GR00T N1.5 is the world's first open foundation model
Models for object and human mesh reconstruction
Contexts Optical Compression
State of the art LLM and coding model
LTX-Video Support for ComfyUI