LLM framework for document understanding and semantic retrieval
One API call, pull Claude agent, completely sandboxed
A python tool that uses GPT-4, FFmpeg, and OpenCV
Models for object and human mesh reconstruction
Fast Multimodal LLM on Mobile Devices
Benchmarking Multimodal Agents for Open-Ended Tasks
From Paper to Presentation in One Click
Code-first tutorials covering every layer of GenAI agents
Intelligent automation and multi-agent orchestration for Claude Code
Full System Prompts, Internal Tools & AI Models
Multilingual speech recognition and audio understanding model
This repository is for helping those interested in machine learning
Mac app for Ollama
Deploy your agentic worfklows to production
Make videos programmatically with React
Curated list of datasets and tools for post-training
The NVIDIA AgentIQ toolkit is an open-source library
A CLI that writes your git commit messages for you with AI
Committed to building an open, public welfare
Deep learning concepts in an approachable style
A collection of open-source skills for AI coding agents
An agentless approach to automatically solve software development
Contexts Optical Compression
Audiocraft is a library for audio processing and generation
AI Agent Builder and Runtime by Docker Engineering