Reading book source
Industrial-level controllable zero-shot text-to-speech system
TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
A high-quality rapid TTS voice cloning model
Controllable & emotion-expressive zero-shot TTS
Controllable and fast Text-to-Speech for over 7000 languages
Instant voice cloning by MIT and MyShell. Audio foundation model
A lightweight text-to-speech model with zero-shot voice cloning
Spark-TTS Inference Code
Comprehensive Gradio WebUI for audio processing
High-Quality Voice Cloning TTS for 600+ Languages
Multilingual large voice generation model, providing inference
State-of-the-art TTS model under 25MB
Towards Human-Sounding Speech
Python library and CLI tool to interface with Google Translate
Use Microsoft Edge's online text-to-speech service from Python
SOTA Open Source TTS
TTS with Kokoro and ONNX Runtime
EPUB to audiobook converter, optimized for Audiobookshelf
Offline text-to-speech synthesis for Python
Build Vision Agents quickly with any model or video provider
Automatically translates the text of a video based on a subtitle file
A nearly-live implementation of OpenAI's Whisper
End-to-end speech processing toolkit
SoTA open-source TTS