Translate the video from one language to another and embed dubbing
Video translation and dubbing tool powered by LLMs
Hardware-accelerated video transcoding using Android MediaCodec APIs
Capable of understanding text, audio, vision, video
Synchronized Translation for Videos
Automagically synchronize subtitles with video
The python library for real-time communication
Framework for building real-time voice and multimodal AI agents
Subtitle Creation Assistant
Qwen3-omni is a natively end-to-end, omni-modal LLM
Uses Qwen3-ASR, local LLM, Whisper, TEN-VAD
Official Python inference and LoRA trainer package
A suite of advanced multi-modal LLMs
Voice Recognition to Text Tool
Official MiniMax Model Context Protocol (MCP) server
AI-powered tool for generating, optimizing, and translating subtitles
Use Microsoft Edge's online text-to-speech service from Python
Textream is a free macOS teleprompter app for streamers, interviewers
Open source text-to-speech tool, supports extra-long text
A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming
Workflow and speech recognition app
One-stop AI digital human system with video voice synthesis tools
Build Vision Agents quickly with any model or video provider
Instantly generate AI-powered subtitles on your device
The media player for language learning, with dual subtitles