Showing 118 open source projects for "voice"

View related business solutions
  • Comet Backup - Fast, Secure Backup Software for MSPs Icon
    Comet Backup - Fast, Secure Backup Software for MSPs

    Fast, Secure Backup Software for Businesses and IT Providers

    Comet is a flexible backup platform, giving you total control over your backup environment and storage destinations.
    Learn More
  • The AI-powered unified PSA-RMM platform for modern MSPs. Icon
    The AI-powered unified PSA-RMM platform for modern MSPs.

    Trusted PSA-RMM partner of MSPs worldwide

    SuperOps.ai is the only PSA-RMM platform powered by intelligent automation and thoughtfully crafted for the new-age MSP. The platform also helps MSPs manage their projects, clients, and IT documents from a single place.
    Learn More
  • 1
    Moonshine Voice

    Moonshine Voice

    Fast and accurate automatic speech recognition (ASR) for edge devices

    moonshine is an open-source automatic speech recognition toolkit optimized for fast and accurate transcription on edge devices and local environments. The project is designed to enable real-time voice applications such as live transcription, voice commands, and embedded speech interfaces without requiring heavy cloud infrastructure. Its architecture emphasizes low latency and flexible input handling, allowing audio streams of varying durations rather than relying on fixed processing windows. Moonshine supports multiple platforms including mobile, desktop, and embedded systems, and provides example projects to accelerate integration into real-world products. ...
    Downloads: 16 This Week
    Last Update:
    See Project
  • 2
    Kaset

    Kaset

    The missing YouTube Music macOS app

    Kaset is a social audio platform framework that allows users to host, share, and interact with audio content in community-oriented spaces, combining elements of podcasting, voice rooms, and feedback-driven discovery. It provides an interface where creators can upload episodes, host live or scheduled voice sessions, and cultivate listener communities through comments, reactions, and follow systems. The platform emphasizes audio discovery with playlists, curated channels, and trending audio feeds, helping users find relevant voice content without sifting through noise. ...
    Downloads: 7 This Week
    Last Update:
    See Project
  • 3
    PersonaPlex

    PersonaPlex

    PersonaPlex code

    ...This architectural approach eliminates awkward pauses and makes conversations feel much more human-like, with natural behaviors such as overlapping speech, interruptions, and fluent turn-taking, traits that traditional AI assistants typically lack. PersonaPlex also supports persona and voice control, allowing developers to define the role and speaking style of the agent using text prompts and voice conditioning, making it suitable for applications like customized voice assistants, interactive character agents, or domain-specific conversational tools. Internally, it processes continuous audio streams in a hybrid input format so that speech understanding and generation occur jointly.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 4
    Textream

    Textream

    Textream is a free macOS teleprompter app for streamers, interviewers

    ...It runs natively on macOS and leverages on-device speech recognition to highlight each word in real time as you speak, keeping your focus where it belongs — on delivery rather than memorization. The interface supports multiple modes of use, such as classic constant-scroll auto-scrolling, voice-activated scrolling that pauses when you’re silent, and direct word tracking that syncs the displayed script to your spoken pace. Textream also includes flexible display modes, from a sleek notch-anchored overlay on laptops to an invisible-to-the-audience floating window or fullscreen experience.
    Downloads: 17 This Week
    Last Update:
    See Project
  • Outbound sales software Icon
    Outbound sales software

    Unified cloud-based platform for dialing, emailing, appointment scheduling, lead management and much more.

    Adversus is an outbound dialing solution that helps you streamline your call strategies, automate manual processes, and provide valuable insights to improve your outbound workflows and efficiency.
    Learn More
  • 5
    VCClient

    VCClient

    Software that uses AI to perform real-time voice conversion

    VCClient is a real-time voice conversion system that uses machine learning models to transform a speaker’s voice into another voice with minimal latency. It is designed for live applications such as streaming, gaming, and virtual communication, where immediate feedback is essential. The system supports multiple voice conversion models, including RVC and other neural network-based approaches, allowing users to switch between different voices or customize their output. ...
    Downloads: 20 This Week
    Last Update:
    See Project
  • 6
    OpenVINO AI Plugins for Audacity

    OpenVINO AI Plugins for Audacity

    A set of AI-enabled effects, generators, and analyzers for Audacity

    A set of AI-enabled effects, generators, and analyzers for Audacity. These AI features run 100% locally on your PC, no internet connection is necessary. OpenVINO™ is used to run AI models on supported accelerators found on the user's system such as CPU, GPU, and NPU.
    Downloads: 167 This Week
    Last Update:
    See Project
  • 7
    Speakr

    Speakr

    Speakr is a personal, self-hosted web application

    Speakr is an open-source, real-time text-to-speech (TTS) web application that allows users to convert written text into natural-sounding speech in just a few clicks. It provides a clean, user-friendly interface where users can input text, choose a voice style or language, and immediately hear the output, making it ideal for accessibility, content creation, and learning applications. Behind the scenes, Speakr leverages modern TTS engines and streaming audio technologies to deliver smooth and responsive speech generation without noticeable delay. The project is built with extensibility in mind, enabling developers to add custom voices, integrate additional languages, and tailor the backend for different hardware or cloud environments. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    Translate-Subtitle-File

    Translate-Subtitle-File

    Subtitle Creation Assistant

    Subtitle group machine translation assistant - [Function 1: Translate subtitle file] .srt .ass .vtt [Function 2: Voice to text] (Drag in video or audio to recognize subtitles) (The latest version v4.1.0 Update time 2021 2 May 23) 12 translation service providers can be configured, such as Google, Baidu, Tencent, Caiyun, IBM, Azure, Amazon, etc. (6 voice service providers can be configured: Alibaba Cloud, Xunfei, Tencent Cloud, IBM, Azure, Amazon ) Advantages: 1.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 9
    RHVoice

    RHVoice

    Free open source speech synthesizer for Russian and other languages

    RHVoice is a free and open-source multilingual speech synthesizer. Its developers hope to give more visually impaired people the ability to use a good free synthesis voice reading in their native language with their screen reader. We are especially interested in supporting those languages for which there are currently no good voices that could be used with a screen reader. The creator of RHVoice, Olga Yakovleva, is blind herself. Many of the contributors to the RHVoice project, both programmers and non-programmers, are blind or partially sighted.
    Downloads: 43 This Week
    Last Update:
    See Project
  • Securden Privileged Account Manager Icon
    Securden Privileged Account Manager

    Unified Privileged Access Management

    Discover and manage administrator, service, and web app passwords, keys, and identities. Automate management with approval workflows. Centrally control, audit, monitor, and record all access to critical IT assets.
    Learn More
  • 10
    Echo Voice Recorder

    Echo Voice Recorder

    Simple Voice Recorder Application for Debian Based Linux Systems

    This is a good audio recording app that's extremely lightweight and requires minimal dependencies. It records your microphone and, if desired, your system sounds. https://github.com/shampuan/Echo-Voice-Recorder 5 audio formats: wav, mp3, ogg, flac, aac. 7 language options: Azerbaijani, German, English, Spanish, French, Turkish (default), Russian.
    Downloads: 41 This Week
    Last Update:
    See Project
  • 11
    Lenstube

    Lenstube

    Decentralized video-sharing social media platform

    Decentralized video-sharing social media platform, built using Lens protocol. This turborepo uses Yarn as a package manager. For a place to have open discussions on features, voice your ideas, or get help with general questions please visit our community at Discord.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 12
    MediaDevices

    MediaDevices

    Go implementation of the MediaDevices API

    ...It provides a cross-platform, unified API for capturing and manipulating media streams and is often used in combination with Pion WebRTC for peer-to-peer communications. Its support for device enumeration, media constraints, and frame processing makes it a powerful building block for custom voice and video solutions in Go.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 13
    Voxal voice changer

    Voxal voice changer

    Transform your voice in real-time voxal voice changer

    Voxal Voice Changer is a program that allows you to modify your voice by applying various effects (e.g. pitch change, echo, etc.) in real-time. Effects can be added in any sequence and in any combination, allowing you to distort your voice beyond recognition. Take your audio to the next level! Our powerful Voice Changer software lets you morph your voice in real-time with stunning AI-powered quality.
    Leader badge
    Downloads: 9 This Week
    Last Update:
    See Project
  • 14
    AudioNotes

    AudioNotes

    Extract audio and video content and organize it into a Markdown note

    AudioNotes is an application (or proof-of-concept) that likely combines audio recording or playback with note-taking or annotation functionality — enabling users to record voice or audio and attach textual or timestamped notes, making it ideal for lectures, interviews, meetings, or personal memos. Such a tool offers a more expressive and flexible way to capture and revisit information: instead of just typed notes or raw audio, users get both audio context and structured notes. As an open-source repository, AudioNotes provides developers or power users the opportunity to customize how audio is captured, stored, annotated, and replayed — e.g. adding playback speed control, export to standard formats, or synchronization between notes and audio timeline. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 15
    annyang!

    annyang!

    Speech recognition for your site

    annyang is a tiny javascript library that lets your visitors control your site with voice commands. annyang supports multiple languages, has no dependencies, weighs just 2kb and is free to use. annyang understands commands with named variables, splats, and optional words. Use named variables for one word arguments in your command. Use splats to capture multi-word text at the end of your command (greedy). Use optional words or phrases to define a part of the command as optional. annyang plays nicely with all browsers, progressively enhancing browsers that support SpeechRecognition, while leaving users with older browsers unaffected. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 16
    byzorgan

    byzorgan

    Specialized sound synthesizer with Byzantine Church music scales

    This software integrates a small, specialized synthesizer and vocal processor. It can be used to learn Byzantine Church singing. You can play from the keyboard, mouse or touch screen. MIDI input is also available. Voice functions include: pitch highlighting, synthesizer control by voice, pitch correction and voice-to-ison conversion. On the screen there are labels with symbols of Byzantine notes. There is a metronome. The program is oriented on the Chrysanthos tuning of the diatonic scale: 9/8 - 12/11 - 88/81. The octave is divided into 17 (or 19) non-equal parts, it allows you to get all the Byzantine modes.
    Downloads: 23 This Week
    Last Update:
    See Project
  • 17
    Mechvibes

    Mechvibes

    Play mechanical keyboard sounds, typewriter sound as you type

    It's a side project that I made for myself since when I use my own mechanical keyboard at late night or in the office and my parents and my coworkers hate that a lot because the sound of the keyboard is very loud (especially in quiet places). So if you are facing this issue just like me then this product is for you! - Plays typewriter sound - You can add more keyboard sound set by recording any sound you like and add them to this app with some easy steps. - Use your laptop keyboard...
    Leader badge
    Downloads: 1,957 This Week
    Last Update:
    See Project
  • 18
    Audio Satanifier 666

    Audio Satanifier 666

    Easily apply cool gnarly voice filters to your audio files

    ...Its more focused on the "satan voice" filter, while the larger default version has 16 different filters you can mix and match. (Source code is included within the html file itself.)
    Downloads: 1 This Week
    Last Update:
    See Project
  • 19
    eGuideDog free software for the blind
    eGuideDog project develops free software for the blind. Currently, we focus on WebSpeech, Ekho TTS and WebAnywhere.
    Leader badge
    Downloads: 199 This Week
    Last Update:
    See Project
  • 20
    Internet DJ Console

    Internet DJ Console

    A feature packed DJ console and internet radio client for Linux users

    Conceived as an internet radio Shoutcast/Icecast client and DJ console IDJC has two main media players, a background track player, effects buttons, crossfader, webm, aac, ogg, and mp3 streaming, stream automation timers, aux input, voice and VoIP integration. Media file formats include: mp3, ogg, flac, wma, wav, m4a, m3u, xspf, pls, and cue sheet support, IRC track and station announcements, uses jack audio connection kit to provide a flexible audio chain. This list of features is by no means complete. More info at the project homepage.
    Downloads: 11 This Week
    Last Update:
    See Project
  • 21
    JavaComm

    JavaComm

    Desktop communication tool: chat, telephony, multimedia

    JavaComm is a client–server communication platform offering chat, telephony, screen sharing, conferencing, a media player, and configurable media streams. Users can define and manage their own streams for flexible playback. All server-based communication runs over HTTPS (SSL/TLS), while file transfers are handled directly between users without passing through the server. The JavaComm Server runs on any Jakarta EE Web Profile 10 platform with the required extensions, using Tomcat 10.1.52 as...
    Downloads: 11 This Week
    Last Update:
    See Project
  • 22

    speech intonator

    The purpose of the project is to develop audio processing algorithms

    ...Detailed documentation in 'read' signature files. Anyone who is interested in this project, please contact. My mail: alekbo@yahoo.com. The side branch of the 2020 project contains data on voice intonation
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    InstrumentalMusic

    InstrumentalMusic

    Application which detects musical notes from the microphone.

    ...It allows listening to the microphone and play the detected notes to output (in midi). Multilanguage support. Zoom Dark mode option JDK-17 compatibility With v1.2 it includes a pitch shifter (making voice lower or sharper through a slider) There is a demo video which shows how it works (the demo video can be visited from Help menu of the application) You can also see the pitch-shifter demo version here: https://frojasg1.com:8443/resource_counter/resourceCounter?operation=countAndForward&url=https%3A%2F%2Ffrojasg1.com%2Fdemos%2Faplicaciones%2FInstrumentalMusic%2Fpitch.shifter.mp4%3Forigin%3Dsourceforge&origin=web You can find more about it at this web site: https://www.frojasg1.com:8443/downloads_web/web/html/instrumentalMusic.html?...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    RemoteTTS

    RemoteTTS

    Tool to remotely activate Text-To-Speech (TTS) on a server

    The tool provides a simple TCP/UDP interface to let a remote machine perform TTS outputs.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Lyra

    Lyra

    A Very Low-Bitrate Codec for Speech Compression

    ...The codebase includes encoder and decoder components, along with tools for data preparation and evaluation. By pushing bitrates down to just a few kilobits per second while retaining quality, lyra expands access to voice calls where bandwidth is scarce.
    Downloads: 1 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • 5
  • Next
MongoDB Logo MongoDB