Create UIs for your machine learning model in Python in 3 minutes
Focus on prompting and generating
A single Gradio + React WebUI with extensions for ACE-Step
Comprehensive Gradio WebUI for audio processing
A simple, high-quality voice conversion tool focused on ease of use
Stable Diffusion WebUI optimized for AMD GPUs with editing tools
Synchronized Translation for Videos
EPUB to audiobook converter, optimized for Audiobookshelf
A fast TTS architecture with conditional flow matching
The python library for real-time communication
ChatGLM3 series: Open Bilingual Chat LLMs | Open Source Bilingual Chat
Diffusion Transformer with Fine-Grained Chinese Understanding
One-click deployment (including offline integration package)
Easy-to-use LLM fine-tuning framework (LLaMA-2, BLOOM, Falcon
Speech-AI-Forge is a project developed around TTS generation model
Unified Multimodal Understanding and Generation Models
An open-source RAG-based tool for chatting with your documents
Stable Diffusion web UI
Real-time voice interactive digital human
SoTA open-source TTS
Time-lapse Video Generation Models as Metamorphic Simulators
Oobabooga - The definitive Web UI for local AI, with powerful features
A Web UI for easy subtitle using whisper model
From Images to High-Fidelity 3D Assets
Text and image to video generation: CogVideoX and CogVideo