Create UIs for your machine learning model in Python in 3 minutes
Focus on prompting and generating
A single Gradio + React WebUI with extensions for ACE-Step
Comprehensive Gradio WebUI for audio processing
A simple, high-quality voice conversion tool focused on ease of use
Synchronized Translation for Videos
A fast TTS architecture with conditional flow matching
The python library for real-time communication
ChatGLM3 series: Open Bilingual Chat LLMs | Open Source Bilingual Chat
Diffusion Transformer with Fine-Grained Chinese Understanding
One-click deployment (including offline integration package)
Easy-to-use LLM fine-tuning framework (LLaMA-2, BLOOM, Falcon
Speech-AI-Forge is a project developed around TTS generation model
Unified Multimodal Understanding and Generation Models
Real-time voice interactive digital human
An open-source RAG-based tool for chatting with your documents
Stable Diffusion web UI
SoTA open-source TTS
Time-lapse Video Generation Models as Metamorphic Simulators
Oobabooga - The definitive Web UI for local AI, with powerful features
A Web UI for easy subtitle using whisper model
From Images to High-Fidelity 3D Assets
Text and image to video generation: CogVideoX and CogVideo
TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
SOTA Open Source TTS