transcribe free download

Showing 36 open source projects for "transcribe"

View related business solutions

Mac Clear Filters & Widen Search

Get full visibility and control over your tasks and projects with Wrike.
A cloud-based collaboration, work management, and project management software

Wrike offers world-class features that empower cross-functional, distributed, or growing teams take their projects from the initial request stage all the way to tracking work progress and reporting results.

Learn More
A privacy-first API that predicts global consumer preferences
Qloo AI adds value to a wide range of Fortune 500 companies in the media, technology, CPG, hospitality, and automotive sectors.

Through our API, we provide contextualized personalization and insights based on a deep understanding of consumer behavior and more than 575 million people, places, and things.

Learn More
1

Concordia

Crowdsourcing platform for full text transcription and tagging

Concordia is a platform for crowdsourcing transcription and tagging of text in digitized images. It was developed by the Library of Congress so that volunteers of all backgrounds could transcribe and tag digitized images of manuscripts and typed materials from the Library’s collections that could not otherwise be done by optical character recognition.

Downloads: 6 This Week

Last Update: 2026-03-23
See Project
2

Handy STT

A free, open source, and extensible speech-to-text application

...Developed using Tauri (Rust + React/TypeScript), it runs natively across Windows, macOS, and Linux while performing local speech recognition without sending any audio to cloud servers. Handy allows users to start transcription instantly using a configurable keyboard shortcut—press to record, release to transcribe—and automatically pastes the resulting text into any active text field. Its backend leverages OpenAI’s Whisper models for GPU-accelerated speech recognition and Parakeet V3 for efficient CPU-only transcription with automatic language detection. To further refine accuracy and responsiveness, Handy integrates Silero’s Voice Activity Detection (VAD) for silence filtering, ensuring only speech segments are processed.

Downloads: 89 This Week

Last Update: 2026-04-02
See Project
3

Vibe

Transcribe on your own

Vibe is an open-source project by thewh1teagle designed to deliver a collaborative and interactive social application experience, though its specifics depend on its evolving community scope; its development often focuses on connecting users through dynamic features that can include chat, shared spaces, and immersive interactions. The repository typically includes backend logic, frontend integration, and real-time communication stacks to support live user engagement, performance...

Downloads: 25 This Week

Last Update: 2026-03-13
See Project
4

pyVideoTrans

Translate the video from one language to another and embed dubbing

pyVideoTrans is an ambitious open-source multimedia processing project that assembles speech recognition, subtitle generation, AI translation, voice synthesis, and video assembly into a unified pipeline for converting videos from one language to another with embedded dubbing and captions. At its core it runs speech-to-text models to transcribe audio tracks, translates the resulting text into a target language using local or cloud-based translation engines, synthesizes new speech to match the translated subtitles, and then merges that speech back into the video, creating a fully localized media file. The tool supports both command-line and GUI modes, making it accessible to developers and creatives needing batch or automated processing.

Downloads: 21 This Week

Last Update: 3 days ago
See Project
Striven | All In One Business Management Software
Striven is an all-in-one business management software suite with everything your organization needs for success.

Striven is the all-in-one business management software that lowers your costs, improves your operations, and makes work easier. Make your company’s data coherent, connected, and relevant.

Learn More
5

AutoSubs

Instantly generate AI-powered subtitles on your device

AutoSubs is an open-source, AI-powered subtitle generation tool that enables users to automatically transcribe audio and video content into accurate, editable subtitles directly on their device. It supports both standalone usage and integration with professional video editing software such as DaVinci Resolve, allowing creators to generate and edit subtitles within their existing workflows. The tool leverages speech-to-text models, including OpenAI Whisper, to produce high-quality transcriptions and can differentiate between speakers using diarization techniques. ...

Downloads: 14 This Week

Last Update: 2026-03-18
See Project
6

VideoCaptioner

AI-powered tool for generating, optimizing, and translating subtitles

...It integrates speech recognition, language processing, and translation technologies to automatically generate and refine subtitles from video or audio sources. VideoCaptioner uses speech-to-text engines such as Whisper variants to transcribe spoken content and convert it into subtitle text with accurate timestamps. After transcription, large language models are used to intelligently restructure subtitles into natural sentences, correct wording, and improve readability for viewers. It can also translate subtitles into other languages while preserving the original timing, making it suitable for multilingual video publishing and accessibility. ...

Downloads: 14 This Week

Last Update: 2026-03-28
See Project
7

ChatGPT Telegram Bot

A Telegram bot that integrates with OpenAI's official ChatGPT APIs

A Telegram bot that integrates with OpenAI's official ChatGPT, DALL·E and Whisper APIs to provide answers. Ready to use with minimal configuration required.

Downloads: 1 This Week

Last Update: 2024-12-28
See Project
8

stt

Voice Recognition to Text Tool

stt is a standalone speech recognition tool that locally converts spoken content in audio or video files into textual formats without requiring internet access, giving users control over their data and reducing reliance on external APIs. It leverages open-source speech models such as Faster-Whisper to recognize and transcribe human speech into plain text, structured JSON objects, or subtitle files with time codes, making it suitable for both personal and professional transcription tasks. The project is designed to be easy to deploy: you can run a local Python server that exposes an HTTP API for uploading audio/video files and retrieving transcriptions in different formats. ...

Downloads: 3 This Week

Last Update: 2026-02-17
See Project
9

SpeechRecognition

Speech recognition module for Python

Library for performing speech recognition, with support for several engines and APIs, online and offline. Recognize speech input from the microphone, transcribe an audio file, save audio data to an audio file. Show extended recognition results, calibrate the recognizer energy threshold for ambient noise levels (see recognizer_instance.energy_threshold for details). Listening to a microphone in the background, various other useful recognizer features. The easiest way to install this is using pip install SpeechRecognition. ...

Downloads: 11 This Week

Last Update: 2026-04-05
See Project
CloudZero: The Cloud Cost Optimization Platform
CloudZero automates the collection, allocation, and analysis of your infrastructure and AI spend to uncover waste and improve unit economics.

CloudZero is the leader in proactive cloud cost efficiency. We enable engineers to build cost-efficient software without slowing down innovation. CloudZero's next-generation cloud cost optimization platform automates the collection, allocation, and analysis of cloud costs to uncover savings opportunities and improve unit economics. We are the only platform that enables companies to understand 100% of their operational cloud spend and take an engineering-led approach to optimizing that spend. CloudZero is used by industry leaders worldwide, such as Coinbase, Klaviyo, Miro, Nubank, and Rapid7.

Learn More
10

Amical

Open Source AI Dictation App

Amical is an open source, AI-powered desktop dictation and note-taking application that enables users to dictate hands-free, transcribe meetings, and capture notes effortlessly with unmatched speed, accuracy, and privacy. It leverages both local and cloud-based AI models, letting users seamlessly switch between providers for the ideal balance of speed, precision, and control, and understands the context of each app in use to automatically format text in a tone and style appropriate to the platform. ...

Downloads: 16 This Week

Last Update: 3 days ago
See Project
11

Whishper

Transcribe any audio to text, translate and edit subtitles 100% locall

Open-source, local-first audio transcription and subtitling suite with a simple web UI. Thanks to open-source technologies, Whishper can run 100% offline. Your data never leaves your computer. Whishper allows you to translate your transcriptions to and from more than 60 languages thanks to Argos Translate and LibreTranslate. Download the transcriptions in many formats (json, txt, vtt, srt). Easily edit your subtitles right in the Web-UI.

Downloads: 15 This Week

Last Update: 2024-09-10
See Project
12

BasedHardware

Open source AI wearable platform for recording and summarizing speech

...It combines hardware, firmware, mobile applications, and backend services to create a complete ecosystem for voice-driven interaction. Users can connect the wearable device to a mobile phone and automatically record and transcribe meetings, conversations, and voice memos. Omi includes firmware for wearable hardware, a Flutter-based mobile companion application, backend services built with Python and FastAPI, and various SDKs for developers. These components work together to process audio, perform speech recognition, and integrate AI features such as summaries and automated actions. ...

Downloads: 8 This Week

Last Update: 6 hours ago
See Project
13

ShortGPT

AI framework for automated short video creation and editing tools

ShortGPT is an experimental AI-powered framework designed to automate the creation of short-form and long-form video content. It provides a structured system that handles multiple stages of the content creation workflow, including script generation, asset sourcing, voiceover synthesis, and video editing. ShortGPT uses large language models to generate scripts and prompts that guide the automated editing and production process. ShortGPT includes specialized content engines that manage...

Downloads: 5 This Week

Last Update: 2026-03-13
See Project
14

Groq TypeScript / Node.s

The official Node.js / Typescript library for the Groq API

...The library also supports passing different input types (file streams, blobs, fetch responses) for media-related endpoints, making it flexible for diverse environments (backend, browser, serverless). With this SDK, developers can call Groq’s models, transcribe audio, perform file uploads — all with minimal boilerplate — which streamlines creation of AI-enabled applications in the JavaScript/TypeScript ecosystem.

Downloads: 6 This Week

Last Update: 2026-03-25
See Project
15

Buzz

Transcribe and translate audio offline on your personal computer

Buzz transcribes and translates audio to text offline using OpenAI's Whisper. Import audio and video files into Buzz and export them as TXT, SRT, or VTT files. Buzz supports Whisper, Whisper.cpp, Faster Whisper, Whisper-compatible models from the Hugging Face repository, and the OpenAI Whisper API. Get linux versions from: - https://flathub.org/apps/io.github.chidiwilliams.Buzz - https://snapcraft.io/buzz Home page of Buzz https://github.com/chidiwilliams/buzz Note for...

1 Review

Downloads: 4,913 This Week

Last Update: 2026-03-14
See Project
16

Bootleg Text Slicer

Text transcription & slicing tool with visual timeline and WAV output.

- Transcribe an audio file into individual words. - Display and interact with each word’s start and end positions on a timeline or within the "Review Dashboard." - Adjust timing offsets for the beginning and end of each word either globally or individually. - Play full audio or specific words directly from within the app. - Export words as separate `.wav` audio files

Downloads: 1 This Week

Last Update: 2026-01-31
See Project
17

DeepSeek AIO

Access and use all DeepSeek AI models in one program.

DeepSeek AIO is a simple program that allows you to interact with all DeepSeek large language models in one place. It supports text-based chats, data analysis, code generation, language translation, and more. The program is designed to make it easy for users to use DeepSeek's AI tools for different purposes without switching between multiple platforms.

Downloads: 23 This Week

Last Update: 2025-11-26
See Project
18

The Hear

The Hear program is made for journalists.

To transcribe audio, the app uses the built-in speech recognition features of macOS. Turn your audio and video files into text You can change the font size by pressing the command and +/- keys. The font size is saved during further use. A folder "Hear" with text files is created on the desktop. The program is universal - arm64/x86_64 You can ask questions here https://sourceforge.net/p/the-hear/discussion

Downloads: 0 This Week

Last Update: 2026-02-24
See Project
19

Cheetah

AI macOS app for real-time coding interview coaching assistance

...It integrates audio transcription and AI-generated responses to help users navigate technical interview questions as they happen. Cheetah uses a local speech-to-text engine based on Whisper to capture and transcribe conversations in real time, enabling it to understand interviewer prompts. It then leverages language models to generate suggested answers, refinements, or explanations tailored to the ongoing discussion. Cheetah also connects with live coding environments through a browser extension, allowing it to analyze code and logs directly from supported platforms. ...

Downloads: 1 This Week

Last Update: 2026-03-18
See Project
20

OpenAI Web Application

A web application that allows users to interact with OpenAI's models

...Utilize Whisper Model to transcribe audio into text.

Downloads: 0 This Week

Last Update: 2023-03-23
See Project
21

WhisperBatchRun

This batch file will run OpenAi's whisper to transcribe (or translate)

Currently, Version #1 of the batch file does the following: (1) checks of whisper is installed, and if so, starts to run; (2) Asks if you want to process sub-folders, and if an answer is not provided in 10 seconds, defaults to "N"; (3) Applies the following command to each mp3 or wav file in the folder/sub-folders: "whisper "FILENAME" --model large-v2 --output_format vtt" (4) Creates a log file in the active directory, but only if there were any errors; (5) ends. To use, simply place...

Downloads: 2 This Week

Last Update: 2023-03-21
See Project
22

VoiceOver

VoiceOver is a web application that allows you to transcribe audio

VoiceOver is a web application that allows you to transcribe English audio and listen to it in another voice. Choose a source, an audio file (.wav) in English only. Transcribe audio, several algorithms will take care of it. Listen to the generated transcription, a man or a woman, it's up to you!

1 Review

Downloads: 1 This Week

Last Update: 2023-03-24
See Project
23

Piano transcription

Task of transcribing piano recordings into MIDI files

...The authors used this system to build a large-scale classical piano MIDI dataset (see next project), but as a standalone tool it enables researchers, musicians, or hobbyists to transcribe their own piano recordings automatically.

Downloads: 1 This Week

Last Update: 2025-12-02
See Project
24

Live Transcribe Speech Engine

Live Transcribe is an Android application

Live Transcribe Speech Engine provides on-device speech recognition components that power real-time transcription for accessibility and everyday voice-first experiences. Its design prioritizes latency and robustness in noisy, far-field environments, enabling continuous transcription with low delay on mobile hardware. The engine manages audio front-end processing—such as noise suppression and voice activity detection—before feeding audio into compact, accurate acoustic and language models. ...

Downloads: 0 This Week

Last Update: 2025-10-10
See Project
25

From PEG to a practical parser

Transcribe Parsing Expression Grammar into a parser written in Java.

Tool to transcribe Parsing Expression Grammar into a parser written in Java. Parsing Expression Grammar (PEG) is a way to specify recursive-descent parsers with limited backtracking. The use of backtracking lifts the LL(1) restriction usually imposed by top-down parsers. In addition, PEG can define parsers with integrated lexing. Unlike some existing parser generators for PEG, the tool does not produce a complex and storage-hungry "packrat parser", but a collection of transparent recursive procedures. ...

Downloads: 1 This Week

Last Update: 2022-08-06
See Project