From nobody to big model (LLM) hero
MoBA: Mixture of Block Attention for Long-Context LLMs
Mastering Applied AI, One Concept at a Time
the terminal client for Ollama
NeurIPS2025 Spotlight] Quantized Attention
An open-source, modern-design AI training tracking and visualization
Open-source evaluation toolkit of large multi-modality models (LMMs)
General technology for enabling AI capabilities w/ LLMs and MLLMs
Maimaibot, a (more focused) multi-platform intelligent agent
Open-source model for program synthesis
Llama Chinese community, real-time aggregation
Production-grade platform for building agentic IM bots
Large Language Model Principles and Practice Tutorial from Scratch
Large-Scale Agentic RL for High-Performance CUDA Kernel Generation
Ready-to-run cloud templates for RAG
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm
Building an Intelligent Agent from Scratch
Cube Studio open source cloud native one-stop machine learning
Run LLM prompts from your shell
Analyzing Hacker News discussions from a decade ago in hindsight
Making RAG Simpler with Small and Open-Sourced Language Models
Marrying Grounding DINO with Segment Anything & Stable Diffusion
Fast-stable-diffusion + DreamBooth
Ultimate meta-skill for generating best-in-class Claude Code skills
End-to-end pipeline converting generative videos