The open-source voice synthesis studio powered by Qwen3-TTS
End-to-end speech processing toolkit
Speech Note Linux app. Note taking, reading and translating
A generative speech model for daily dialogue
Offline inference engine for art, real-time voice conversations
Scalable generative AI framework built for researchers and developers
Toolkit for conversational AI
A high-quality rapid TTS voice cloning model
Foundational model for human-like, expressive TTS
Python library and CLI tool to interface with Google Translate
An open-source text-to-speech system built by inverting Whisper
Towards Human-Sounding Speech
VITS2 backbone with multilingual-bert
Multi-Voice and Prompt-Controlled TTS Engine
A list of accessible speech corpora for ASR, TTS
WaveRNN Vocoder + TTS
Implementation of a Transformer-based neural network
Free and open source text-to-speech software
ColdFusion SDK for the VoiceShot API
PHP SDK for processing phone calls and SMS through the VoiceShot API
.NET SDK for processing phone calls and SMS through the VoiceShot API
ASP SDK for processing phone calls and SMS through the VoiceShot API
Toolkit for efficient experimentation with Speech Recognition
TensorFlow Implementation of DC-TTS: yet another text-to-speech model