Speech to Text to Speech, sends text as OSC messages
Comprehensive Gradio WebUI for audio processing
Qwen3-TTS is an open-source series of TTS models
A lightweight text-to-speech model with zero-shot voice cloning
State-of-the-art TTS model under 25MB
Spark-TTS Inference Code
Controllable & emotion-expressive zero-shot TTS
Use Microsoft Edge's online text-to-speech service from Python
A single Gradio + React WebUI with extensions for ACE-Step
1 min voice data can also be used to train a good TTS model
Real-time voice interactive digital human
Towards Human-Sounding Speech
A sound cloning tool with a web interface, using your voice
Free, high-quality text-to-speech API endpoint to replace OpenAI
Foundational model for human-like, expressive TTS
Industrial-level controllable zero-shot text-to-speech system
Bailing is a voice dialogue robot similar to GPT-4o
Generate audiobooks from e-books, voice cloning & 1107+ languages
Virtual AI anchor that combines state-of-the-art technology
The open-source voice synthesis studio powered by Qwen3-TTS
SoTA open-source TTS
Conversational voice AI agents
A high-quality rapid TTS voice cloning model
Official PyTorch Implementation
Open-source framework for intelligent speech interaction