A generative speech model for daily dialogue
A fast TTS architecture with conditional flow matching
Toolkit for conversational AI
Scalable generative AI framework built for researchers and developers
Offline inference engine for art, real-time voice conversations
Generate audiobooks from e-books
Foundational model for human-like, expressive TTS
Official MiniMax Model Context Protocol (MCP) server
An Open Source text-to-speech system built by inverting Whisper
Framework for building neural networks
Towards Human-Level Text-to-Speech through Style Diffusion
Offline desktop app to convert EPUB to MP3 using Kokoro-82M neural TTS
VITS2 backbone with multilingual-bert
Text to Speech Utility
Open source implementation of Microsoft's VALL-E X zero-shot TTS model
Multi-Voice and Prompt-Controlled TTS Engine
Txt-2-Mp3 6.3 Mark 2 [Improved.Simplified.Alternative]
WaveRNN Vocoder + TTS
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Implementation of a Transformer based neural network
Toolkit for efficient experimentation with Speech Recognition
TensorFlow Implementation of DC-TTS: yet another text-to-speech model
Just Another Speech Recognition and Text to Speech software.