Open-source framework for intelligent speech interaction
A text-to-speech, speech-to-text and speech-to-speech library
Audio server, programming language, and IDE for sound synthesis
Large Audio Language Model built for natural interactions
Multi-modal large language model designed for audio understanding
The open-source voice synthesis studio powered by Qwen3-TTS
Software synthesizer based on the SoundFont 2 specifications
Sonic Pi is your free code-based music creation and performance tool
A multi-system chiptune tracker compatible with DefleMask modules
Collaborative programmable music
Functional programming language for signal processing
Tokenizer-Free TTS for Multilingual Speech Generation
Open speech-to-speech models and pipelines by Hugging Face toolkit AI
Controllable & emotion-expressive zero-shot TTS
Transforming Multimodal Content into Captivating Multilingual Audio
Framework for building real-time voice and multimodal AI agents
Free open source speech synthesizer for Russian and other languages
Capable of understanding text, audio, vision, video
Offline Text To Speech synthesis for python
Translate the video from one language to another and embed dubbing
Open Source Speech Language Model
TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
A fast TTS architecture with conditional flow matching
A Systematic Framework for Interactive World Modeling
Industrial-level controllable zero-shot text-to-speech system