A TTS model capable of generating ultra-realistic dialogue
Open source codebase for Scale Agentex
Course to get into Large Language Models (LLMs)
Code for Mesh R-CNN, ICCV 2019
PyTorch code and models for VJEPA2 self-supervised learning from video
This is a simple demonstration of more advanced, agentic patterns
The ChatGPT Retrieval Plugin lets you easily find personal documents
Implementation of the Surya Foundation Model for Heliophysics
Resources for deep learning with satellite & aerial imagery
This repo contains the code for 1D tokenizer and generator
A Universal Customization Method for Single and Multi Conditioning
A Unified Framework for Image Customization
Flexible Photo Recrafting While Preserving Your Identity
Learn AI and LLMs from scratch using free resources
Lightweight Python library for adding real-time multi-object tracking
OpenAI Swift async text-to-image for SwiftUI apps using OpenAI
A Python library for self-supervised learning on images
Hub of ready-to-use datasets for ML models
Deep learning library
Enabling web apps to be accessed by AI agents
Official repo for Sa2VA: Marrying SAM2 with LLaVA
Implementation of 'lightweight' GAN, proposed in ICLR 2021
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
An advanced paper search agent powered by large language models
Swirl queries any number of data sources with APIs