The SOTA Open-Source Browser Agent
Collection of awesome LLM apps with AI Agents and RAG using OpenAI
Structured outputs for LLMs
Trainable, memory-efficient, and GPU-friendly PyTorch reproduction
Large Multimodal Models for Video Understanding and Editing
Concatenate a directory full of files into a single prompt
OCR expert VLM powered by Hunyuan's native multimodal architecture
Dealing with all unstructured data, such as reverse image search
Making ALL Software Agent-Native
Helps scientists define testable, modular, self-documenting dataflow
Scalable machine learning for time series forecasting
ktrain is a Python library that makes deep learning AI more accessible
Large-Scale Agentic RL for High-Performance CUDA Kernel Generation
Definitions for AI/ML tasks like dataset creation
Long-form streaming TTS system for multi-speaker dialogue generation
Collection of Gemma 3 variants that are trained for performance
LTX-Video Support for ComfyUI
LLM training in simple, raw C/CUDA
A high-performance ML model serving framework that offers dynamic batching
Generate high-definition short story videos with one click using AI
Python package for AutoML on Tabular Data with Feature Engineering
Solve end-to-end problems using the Llama model family
Framework to easily create LLM-powered bots over any dataset
An opinionated CLI to transcribe audio files with Whisper on-device
Your Personal Research Multi-Tool