Toolkit for conversational AI
Scalable generative AI framework built for researchers and developers
Video translation and dubbing tool powered by LLMs
Offline inference engine for art, real-time voice conversations
Generate audiobooks from e-books
Foundational model for human-like, expressive TTS
Open source text-to-speech tool, supports extra-long text
An Open Source text-to-speech system built by inverting Whisper
A fast TTS architecture with conditional flow matching
Official MiniMax Model Context Protocol (MCP) server
Framework for building neural networks
Towards Human-Level Text-to-Speech through Style Diffusion
Free & Easy AI Voice Accounting Software For Blind & Speechless People
Easy AI Softwares for Blind, Deaf, Handicapped, Disabled People
VITS2 backbone with multilingual-bert
Open source implementation of Microsoft's VALL-E X zero-shot TTS model
Multi-Voice and Prompt-Controlled TTS Engine
Microsoft speech synthesis tool, built with Electron
Chinese text-to-speech engine
A list of accessible speech corpora for ASR, TTS
WaveRNN Vocoder + TTS
Tool that can record speech synthesis
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Implementation of a Transformer based neural network