Open speech-to-speech models and pipelines by Hugging Face
Robust Speech Recognition via Large-Scale Weak Supervision
Speech-to-text, text-to-speech, and speaker recognition
Speech recognition module for Python
Multilingual speech recognition and audio understanding model
Audio foundation model excelling in audio understanding
Open-source industrial-grade ASR models
A PyTorch-based Speech Toolkit
kaldi-asr/kaldi is the official location of the Kaldi project
Speech recognition for your site
A free, open source, and extensible speech-to-text application
Captcha solver extension for humans
Fast and accurate automatic speech recognition (ASR) for edge devices
Multilingual Automatic Speech Recognition with word-level timestamps
Cross-platform AI language practice app
Port of OpenAI's Whisper model in C/C++
StreamSpeech is a seamless model for offline speech recognition
Automatic Speech Recognition with Word-level Timestamps
Uses Qwen3-ASR, local LLM, Whisper, TEN-VAD
Toolkit for conversational AI
OpenVINO™ Toolkit repository
Faster Whisper transcription with CTranslate2
A cross-platform software for text translation and recognition
Fast multimodal LLM for real-time voice interaction and AI apps
Run local LLMs like Llama, DeepSeek, Kokoro, etc. inside your browser