Repo of Qwen2-Audio chat & pretrained large audio language model
Offline Text To Speech synthesis for python
Use Microsoft Edge's online text-to-speech service from Python
Open-source abilities for OpenHome agents
Speakr is a personal, self-hosted web application
A text-to-speech, speech-to-text and speech-to-speech library
Robust Speech Recognition via Large-Scale Weak Supervision
AI assistant based on large models that can actively think and plan
Official MiniMax Model Context Protocol (MCP) server
Run a full local LLM stack with one command using Docker
A specialized Claude Code workspace for creating long-form
Flowly is 100x faster than OpenClaw
Offline inference engine for art, real-time voice conversations
Context-aware desktop AI assistant that understands screen content
Management of Yandex Station and other smart home devices
MARS5 speech model (TTS) from CAMB.AI
Open speech-to-speech models and pipelines by Hugging Face toolkit AI
Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles
AI Slack bot for reading, summarizing, and chatting with content
Framework for building realtime multimodal voice AI agents apps
Curated collection of Amazing Python scripts
A simple native web interface that uses ChatTTS to synthesize text
TTS with kokoro and onnx runtime
Framework for building AI-powered interactive digital humans and agent
Generate audiobooks from e-books