Showing 198 open source projects for "voice"

  • 1
    Voice

    Minimalistic audiobook player

    This is my digital playground where I am learning. I'm integrating and validating new technologies and ideas here, playing around with new UI / UX components, and developing with the best coding standard I have come up with. At the same time, I want to provide an audiobook player which is really easy to use and a joy to work with.
    Downloads: 3 This Week
  • 2
    Moonshine Voice

    Fast and accurate automatic speech recognition (ASR) for edge devices

    moonshine is an open-source automatic speech recognition toolkit optimized for fast and accurate transcription on edge devices and local environments. The project is designed to enable real-time voice applications such as live transcription, voice commands, and embedded speech interfaces without requiring heavy cloud infrastructure. Its architecture emphasizes low latency and flexible input handling, allowing audio streams of varying durations rather than relying on fixed processing windows. Moonshine supports multiple platforms including mobile, desktop, and embedded systems, and provides example projects to accelerate integration into real-world products. ...
    Downloads: 16 This Week
  • 3
    Kaset

    The missing YouTube Music macOS app

    Kaset is a social audio platform framework that allows users to host, share, and interact with audio content in community-oriented spaces, combining elements of podcasting, voice rooms, and feedback-driven discovery. It provides an interface where creators can upload episodes, host live or scheduled voice sessions, and cultivate listener communities through comments, reactions, and follow systems. The platform emphasizes audio discovery with playlists, curated channels, and trending audio feeds, helping users find relevant voice content without sifting through noise. ...
    Downloads: 7 This Week
  • 4
    PersonaPlex

    PersonaPlex code

    ...This architectural approach eliminates awkward pauses and makes conversations feel much more human-like, with natural behaviors such as overlapping speech, interruptions, and fluent turn-taking, traits that traditional AI assistants typically lack. PersonaPlex also supports persona and voice control, allowing developers to define the role and speaking style of the agent using text prompts and voice conditioning, making it suitable for applications like customized voice assistants, interactive character agents, or domain-specific conversational tools. Internally, it processes continuous audio streams in a hybrid input format so that speech understanding and generation occur jointly.
    Downloads: 5 This Week
  • 5
    FFsubsync

    Automagically synchronize subtitles with video

    ...In this case, you can use the correctly synchronized srt file directly as a reference for synchronization, instead of using the video as the reference. ffsubsync uses the file extension to decide whether to perform voice activity detection on the audio or to directly extract speech from an srt file. ffsubsync usually finishes in 20 to 30 seconds, depending on the length of the video.
    Downloads: 51 This Week
  • 6
    Textream

    Textream is a free macOS teleprompter app for streamers, interviewers

    ...It runs natively on macOS and leverages on-device speech recognition to highlight each word in real time as you speak, keeping your focus where it belongs — on delivery rather than memorization. The interface supports multiple modes of use, such as classic constant-scroll auto-scrolling, voice-activated scrolling that pauses when you’re silent, and direct word tracking that syncs the displayed script to your spoken pace. Textream also includes flexible display modes, from a sleek notch-anchored overlay on laptops to an invisible-to-the-audience floating window or fullscreen experience.
    Downloads: 17 This Week
  • 7
    VCClient

    Software that uses AI to perform real-time voice conversion

    VCClient is a real-time voice conversion system that uses machine learning models to transform a speaker’s voice into another voice with minimal latency. It is designed for live applications such as streaming, gaming, and virtual communication, where immediate feedback is essential. The system supports multiple voice conversion models, including RVC and other neural network-based approaches, allowing users to switch between different voices or customize their output. ...
    Downloads: 20 This Week
  • 8
    OpenVINO AI Plugins for Audacity

    A set of AI-enabled effects, generators, and analyzers for Audacity

    A set of AI-enabled effects, generators, and analyzers for Audacity. These AI features run 100% locally on your PC; no internet connection is necessary. OpenVINO™ is used to run AI models on supported accelerators found on the user's system such as CPU, GPU, and NPU.
    Downloads: 167 This Week
  • 9
    Speakr

    Speakr is a personal, self-hosted web application

    Speakr is an open-source, real-time text-to-speech (TTS) web application that allows users to convert written text into natural-sounding speech in just a few clicks. It provides a clean, user-friendly interface where users can input text, choose a voice style or language, and immediately hear the output, making it ideal for accessibility, content creation, and learning applications. Behind the scenes, Speakr leverages modern TTS engines and streaming audio technologies to deliver smooth and responsive speech generation without noticeable delay. The project is built with extensibility in mind, enabling developers to add custom voices, integrate additional languages, and tailor the backend for different hardware or cloud environments. ...
    Downloads: 0 This Week
  • 10
    Translate-Subtitle-File

    Subtitle Creation Assistant

    Machine translation assistant for subtitle groups. Function 1: translate subtitle files (.srt, .ass, .vtt). Function 2: voice to text (drag in a video or audio file to recognize subtitles). The latest version is v4.1.0 (update time 2021 2 May 23). 12 translation service providers can be configured, such as Google, Baidu, Tencent, Caiyun, IBM, Azure, Amazon, etc. 6 voice service providers can be configured: Alibaba Cloud, Xunfei, Tencent Cloud, IBM, Azure, Amazon. Advantages: 1.
    Downloads: 3 This Week
  • 11
    Record You

    Privacy focused recorder app built with MD3

    Privacy-focused voice and screen recorder app built with MD3.
    Downloads: 8 This Week
  • 12
    RHVoice

    Free open source speech synthesizer for Russian and other languages

    RHVoice is a free and open-source multilingual speech synthesizer. Its developers hope to give more visually impaired people the ability to use a good free synthesis voice reading in their native language with their screen reader. We are especially interested in supporting those languages for which there are currently no good voices that could be used with a screen reader. The creator of RHVoice, Olga Yakovleva, is blind herself. Many of the contributors to the RHVoice project, both programmers and non-programmers, are blind or partially sighted.
    Downloads: 43 This Week
  • 13
    Echo Voice Recorder

    Simple Voice Recorder Application for Debian Based Linux Systems

    This is a good audio recording app that's extremely lightweight and requires minimal dependencies. It records your microphone and, if desired, your system sounds. https://github.com/shampuan/Echo-Voice-Recorder 5 audio formats: wav, mp3, ogg, flac, aac. 7 language options: Azerbaijani, German, English, Spanish, French, Turkish (default), Russian.
    Downloads: 41 This Week
  • 14
    Lenstube

    Decentralized video-sharing social media platform

    Decentralized video-sharing social media platform, built using Lens protocol. This turborepo uses Yarn as a package manager. For a place to have open discussions on features, voice your ideas, or get help with general questions, please visit our community on Discord.
    Downloads: 3 This Week
  • 15
    MediaDevices

    Go implementation of the MediaDevices API

    ...It provides a cross-platform, unified API for capturing and manipulating media streams and is often used in combination with Pion WebRTC for peer-to-peer communications. Its support for device enumeration, media constraints, and frame processing makes it a powerful building block for custom voice and video solutions in Go.
    Downloads: 5 This Week
  • 16
    Voxal voice changer

    Transform your voice in real time with Voxal Voice Changer

    Voxal Voice Changer is a program that allows you to modify your voice by applying various effects (e.g. pitch change, echo, etc.) in real-time. Effects can be added in any sequence and in any combination, allowing you to distort your voice beyond recognition. Take your audio to the next level! Our powerful Voice Changer software lets you morph your voice in real-time with stunning AI-powered quality.
    Downloads: 9 This Week
  • 17
    Voice-Changer-MeowVoice

    Meow Voice Changer is a lightweight, real-time voice modulation tool

    Meow Voice Changer is a lightweight, real-time voice modulation tool for Windows, purrrfect for streaming and voice chats. Inspired by Battlestar Galactica Cylon and Viper comms but capable of many more effects, it offers pitch, tempo, wobble, ring modulation, flanging, echo, EQ, and radio effects like distortion and static. Process microphone input with a customizable effect chain.
    Downloads: 7 This Week
  • 18
    Recorder

    HTML5 JavaScript recording in mp3, wav, ogg, webm, and amr formats

    ...Rich audio visualization, variable speed and pitch processing, speech recognition, audio stream playback, etc.; with powerful real-time processing support, it can be used in various web applications: everything from simple recording to complex real-time speech recognition (ASR), and even audio-related games, is handled with ease.
    Downloads: 6 This Week
  • 19
    AudioNotes

    Extract audio and video content and organize it into a Markdown note

    AudioNotes is an application (or proof-of-concept) that likely combines audio recording or playback with note-taking or annotation functionality — enabling users to record voice or audio and attach textual or timestamped notes, making it ideal for lectures, interviews, meetings, or personal memos. Such a tool offers a more expressive and flexible way to capture and revisit information: instead of just typed notes or raw audio, users get both audio context and structured notes. As an open-source repository, AudioNotes provides developers or power users the opportunity to customize how audio is captured, stored, annotated, and replayed — e.g. adding playback speed control, export to standard formats, or synchronization between notes and audio timeline. ...
    Downloads: 2 This Week
  • 20
    annyang!

    Speech recognition for your site

    annyang is a tiny JavaScript library that lets your visitors control your site with voice commands. annyang supports multiple languages, has no dependencies, weighs just 2 KB and is free to use. annyang understands commands with named variables, splats, and optional words. Use named variables for one-word arguments in your command. Use splats to capture multi-word text at the end of your command (greedy). Use optional words or phrases to define a part of the command as optional. annyang plays nicely with all browsers, progressively enhancing browsers that support SpeechRecognition, while leaving users with older browsers unaffected. ...
    Downloads: 1 This Week
  • 21
    AzioVoice Recorder

    AzioVoice Recorder is an app designed to record audio

    Starting from version 1.3.1.0, the project has been renamed to AzioVoice Recorder and is officially published in the Microsoft Store (https://apps.microsoft.com/detail/9PP795T0KSFP). The app supports recording in WAV format with adjustable settings like sample rate, bit depth, and channels. It includes several audio filters for basic enhancement and features a simple file explorer for playback and management. Themes can be switched between dark and light, with settings...
    Downloads: 1 This Week
  • 22
    byzorgan

    Specialized sound synthesizer with Byzantine Church music scales

    This software integrates a small, specialized synthesizer and vocal processor. It can be used to learn Byzantine Church singing. You can play from the keyboard, mouse or touch screen. MIDI input is also available. Voice functions include pitch highlighting, synthesizer control by voice, pitch correction and voice-to-ison conversion. On the screen there are labels with symbols of Byzantine notes. There is a metronome. The program is based on the Chrysanthos tuning of the diatonic scale: 9/8 - 12/11 - 88/81. The octave is divided into 17 (or 19) unequal parts, which makes it possible to obtain all the Byzantine modes.
    Downloads: 23 This Week
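
The three ratios quoted in the byzorgan description can be checked with a little arithmetic. The sketch below is only an illustration and is not taken from byzorgan's source; the helper names `cents` and `stack` are hypothetical. It multiplies the three steps together and converts each one to cents. Reading the three steps as spanning one tetrachord, and the octave as two tetrachords joined by a 9/8 tone, is an assumption the description does not state.

```python
from fractions import Fraction
from math import log2

# Interval ratios quoted in the byzorgan description (Chrysanthos diatonic tuning).
STEPS = [Fraction(9, 8), Fraction(12, 11), Fraction(88, 81)]


def cents(ratio: Fraction) -> float:
    """Size of a frequency ratio in cents (1200 cents per octave)."""
    return 1200 * log2(ratio)


def stack(ratios) -> Fraction:
    """Multiply ratios together, i.e. stack the intervals end to end."""
    total = Fraction(1)
    for r in ratios:
        total *= r
    return total


# The three steps multiply out to exactly 4/3, a perfect fourth.
tetrachord = stack(STEPS)
print(tetrachord)  # 4/3

# Assumption: two such tetrachords joined by a 9/8 tone fill the octave.
octave = stack([tetrachord, Fraction(9, 8), tetrachord])
print(octave)  # 2

# Size of each step in cents, for comparison with equal-tempered steps.
for step in STEPS:
    print(step, round(cents(step), 1))
```

Exact fractions are used so that the products 4/3 and 2 come out exactly rather than as rounded floats.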
  • 23
    WO Mic

    Transform your smartphone into a PC microphone

    WO Mic is a free utility that turns your smartphone into a functional microphone for your Windows PC. It eliminates the need to buy a separate microphone, offering a convenient and cost-effective solution for voice chat, recording, or wireless voice control. The app supports multiple connection types including Wi-Fi, Bluetooth, and USB, giving users flexible options to suit their setup. Setup involves installing the mobile app and the PC client with drivers, which is straightforward and fast. While USB connections provide the best audio quality and lowest latency, Wi-Fi and Bluetooth offer greater wireless freedom. ...
    Downloads: 652 This Week
  • 24
    Mechvibes

    Play mechanical keyboard sounds, typewriter sound as you type

    This is a side project that I made for myself: when I use my own mechanical keyboard late at night or in the office, my parents and my coworkers hate it because the sound of the keyboard is very loud (especially in quiet places). So if you are facing this issue just like me, then this product is for you! - Plays typewriter sound - You can add more keyboard sound sets by recording any sound you like and adding it to this app in a few easy steps. - Use your laptop keyboard...
    Downloads: 1,957 This Week
  • 25
    Audio Satanifier 666

    Easily apply cool gnarly voice filters to your audio files

    ...It's more focused on the "satan voice" filter, while the larger default version has 16 different filters you can mix and match. (Source code is included within the HTML file itself.)
    Downloads: 1 This Week