Showing 44 open source projects for "speech text"

View related business solutions
  • Get full visibility and control over your tasks and projects with Wrike. Icon
    Get full visibility and control over your tasks and projects with Wrike.

    A cloud-based collaboration, work management, and project management software

    Wrike offers world-class features that empower cross-functional, distributed, or growing teams take their projects from the initial request stage all the way to tracking work progress and reporting results.
    Learn More
  • Professional Email Hosting for Small Business | Greatmail Icon
    Professional Email Hosting for Small Business | Greatmail

    Ready to switch to a more reliable and secure email hosting solution?

    Dependable cloud based email hosting with spam filtering, antivirus protection, generous storage and webmail. Compatible with Outlook and all other POP3/IMAP clients. High volume SMTP service for responsible senders. Outbound relay service for transactional messages, email marketing campaigns, newsletters and other applications. Dedicated email servers, clustering and multiple IP load balancing for high volume senders. Fixed monthly cost with unlimited sending and reputation monitoring. Greatmail is an email service provider (ESP) specializing in business class email hosting, SMTP hosting and email servers. For ISPs, application programmers and cloud developers, we also provide custom solutions including dedicated IP servers and process specific, load balanced configurations with multiple servers.
    Learn More
  • 1
    DragonianVoice

    DragonianVoice

    C++ inference library for multiple SVC/TTS

    DragonianVoice is a C++ inference library that unifies multiple speech synthesis, voice conversion, and singing voice synthesis models under a single, high-performance ONNX-based framework. It focuses on being a reusable native library rather than a full UI product, with bindings for C, C++, and C# so it can be embedded into other applications or engines. The project supports a wide range of model families: TTS models such as Tacotron2, VITS, EmotionalVITS, BERTVits2, GPT-SoVITS, SVC systems...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    Seamless Communication

    Seamless Communication

    Foundational Models for State-of-the-Art Speech and Text Translation

    Seamless Communication is a research project focused on building more integrated, low-latency multimodal communication between humans and AI agents. The motivation is to move beyond “text in, text out” and enable direct, live, multi-turn exchange involving language, gesture, gaze, vision, and modality switching without user friction. The system architecture includes a real-time multimodal signal pipeline for audio, video, and sensor data, a dialog manager that can decide when to act (speak,...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    fairseq2

    fairseq2

    FAIR Sequence Modeling Toolkit 2

    fairseq2 is a modern, modular sequence modeling framework developed by Meta AI Research as a complete redesign of the original fairseq library. Built from the ground up for scalability, composability, and research flexibility, fairseq2 supports a broad range of language, speech, and multimodal content generation tasks, including instruction fine-tuning, reinforcement learning from human feedback (RLHF), and large-scale multilingual modeling. Unlike the original fairseq—which evolved into a...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 4
    openctp

    openctp

    Provides CTP stock options and Zhongtai Securities XTP

    openctp is a technical service platform built around the CTP trading ecosystem that provides CTP compatible interfaces for a wide range of brokerage backends and markets. Its core idea is to wrap heterogeneous stock and derivatives trading gateways such as Zhongtai XTP, Huaxin Qidian TORA, and others with CTPAPI compatible interfaces, so existing CTP programs can connect simply by swapping dynamic libraries rather than rewriting code. The project offers a comprehensive simulation environment...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Assembled is the only unified platform for staffing and managing your human and AI support team. Icon
    Assembled is the only unified platform for staffing and managing your human and AI support team.

    AI for world-class support operations

    Assembled is the only platform that unifies AI agents and intelligent workforce management to power fast and flexible support operations. Built for scale, we help teams automate over 50% of customer interactions, forecast with 90%+ accuracy, and optimize staffing across in-house and BPO teams. Orchestrate every chat, email, or call, balancing workloads between human and AI agents in real time — without sacrificing quality or control. Trusted by Stripe, Canva, and Robinhood, Assembled transforms support from a cost center into a strategic advantage. Our Workforce and Vendor Management tools connect forecasting, scheduling, and performance for smarter staffing decisions. AI Agents automate conversations across channels with your workflows and brand voice. AI Copilot empowers agents with real-time guidance, suggested replies, and one-click actions for faster, higher-quality resolutions.
    Learn More
  • 5
    Cactus

    Cactus

    Low-latency AI inference engine optimized for mobile devices

    ...Cactus emphasizes efficient memory usage through techniques such as zero-copy computation graphs and quantized model formats, allowing large models to run within the constraints of mobile hardware. It supports a wide range of AI tasks including text generation, speech-to-text, vision processing, and retrieval-augmented workflows through a unified API interface. A notable feature of Cactus is its hybrid execution model, which can dynamically route tasks between on-device processing and cloud services when additional compute is required.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 6
    eGuideDog free software for the blind
    eGuideDog project develops free software for the blind. Currently, we focus on WebSpeech, Ekho TTS and WebAnywhere.
    Leader badge
    Downloads: 187 This Week
    Last Update:
    See Project
  • 7
    A series of open source files and programs available to use for developing programs to work with the WowWee Robotics RSMedia Robot. These include a USB serial console, a cross-compiler, a firmware dump program, text-to-speech and source code.
    Leader badge
    Downloads: 1 This Week
    Last Update:
    See Project
  • 8
    Cheetah

    Cheetah

    AI macOS app for real-time coding interview coaching assistance

    Cheetah is an AI-powered macOS application designed to assist users during software engineering interview practice through real-time coaching capabilities. It integrates audio transcription and AI-generated responses to help users navigate technical interview questions as they happen. Cheetah uses a local speech-to-text engine based on Whisper to capture and transcribe conversations in real time, enabling it to understand interviewer prompts. It then leverages language models to generate suggested answers, refinements, or explanations tailored to the ongoing discussion. Cheetah also connects with live coding environments through a browser extension, allowing it to analyze code and logs directly from supported platforms. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 9

    AhoTTS - TTS for Basque and Spanish

    Text-to-Speech for Basque and Spanish

    Text-to-Speech conversor for Basque and Spanish. It includes linguistic processing and built voices for the languages aforementioned. Its acoustic engine is based on hts_engine and it uses a high quality vocoder called AhoCoder. Developed by Aholab Signal Processing Laboratory: https://aholab.ehu.es/aholab/ http://aholab.ehu.es/ahocoder/
    Downloads: 1 This Week
    Last Update:
    See Project
  • Ganttic is an online resource planning software, that you can use for managing project portfolios while planning resources with maximum efficiency. Icon
    Ganttic is an online resource planning software, that you can use for managing project portfolios while planning resources with maximum efficiency.

    Create clear and comprehensive visual plans that give you an instant overview of all your resources and projects.

    Ganttic is a resource management software that excels in high-level resource planning and managing multiple project portfolios at once. In Ganttic, anything and anyone you need to schedule can be a resource – people, rooms, machinery – you name it! The software scales with your business, and you can introduce department after department to Ganttic to utilize all of your resources in the most effective way possible. Using Ganttic, you will have a good grasp on both the allocation and utilization of your resources.
    Free Demo
  • 10
    PNotes
    PNotes is light-weight, flexible, skinnable manager of virtual notes on your desktop. It supports multiple languages, individual note's settings, transparency and scheduling. Absolutely portable as well - no traces in registry. PNotes.NET edition requires .NET framework 4 Client Profile
    Leader badge
    Downloads: 216 This Week
    Last Update:
    See Project
  • 11
    Speech Recognition in English & Polish

    Speech Recognition in English & Polish

    Speech recognition software for English & Polish languages

    Software for speech recognition in English & Polish languages. Basic versions of SkryBot: 1. SkryBot Home Speech (English Language) - https://sourceforge.net/projects/skrybotdomowy/files/ReleasesEnglish/InstalatorSkryBotHomeSpeechDemo-2.6.9.18117.exe/download 2. SkryBot DoMowy (Polish Language) - https://sourceforge.net/projects/skrybotdomowy/files/ReleasesPolish/InstalatorSkryBotDoMowyDemo-2.4.9.18117.exe/download More...
    Downloads: 6 This Week
    Last Update:
    See Project
  • 12

    AhoTTS Multilingual, a Multilingual TTS

    Text-to-Speech TTS for Basque, Spanish, Catalan, Galician and English

    Text-to-Speech conversor for Basque, Spanish, Catalan, Galician and English. It includes linguistic processing and built voices for all the languages aforementioned. Its acoustic engine is based on hts_engine and it uses a high quality vocoder called AhoCoder. Developed by Aholab Signal Processing Laboratory: https://aholab.ehu.es/aholab/ http://aholab.ehu.es/ahocoder/
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Open JTalk is a Japanese text-to-speech synthesis system. This software is released under the Modified BSD license.
    Leader badge
    Downloads: 1,858 This Week
    Last Update:
    See Project
  • 14
    Winboard 4.5 Accessible Chess

    Winboard 4.5 Accessible Chess

    Chess for the Blind for the JAWS or NVDA Screen Readers

    Winboard 4.5 32-bit is a free Windows accessible Chess program that works automatically with the JAWS or the free NVDA screen reader. It is for the blind, low sighted or those who can not use a mouse. It provides vocal announcements of position changes and other selectable board conditions. Blind players also use a separate "tactile chess board". Winboard 4.5 has full keyboard access to move pieces and run menu items. Partial sighted players use high contrast mode and adjust board, piece,...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 15
    Xiphos (see github)
    A cross-platform Bible application for the Linux & Windows communities. Xiphos is no longer developed at SF; the SVN tree has been locked, the mailing lists are disabled, and we have moved to GitHub. Please see https://github.com/crosswire/xiphos. Releases up to 4.0.4 are present here. From 4.0.5 onward, see GitHub.
    Downloads: 13 This Week
    Last Update:
    See Project
  • 16
    vinuxproject

    vinuxproject

    Vinux is an Ubuntu derived distribution for blind & visually impaired.

    Vinux supports software text to speech and Braille support from boot-up to shutdown. Users can use installation medium to install independently with no sighted assistance required. Vinux supports command line environment speech, Desktop environment speech and magnification features. Vinux comes with an accessible suite of software and has an excellent mailing list support group.
    Leader badge
    Downloads: 20 This Week
    Last Update:
    See Project
  • 17
    Part-of-speech tagging is the task of assigning symbols from a particular set to words in a natural language text. ACOPOST implements and extends well-known machine learning techniques and provides a uniform environment for testing.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Dhvani is Text-to-Speech System for Indic Languages. Current C- GNU/Linux implementation supports Hindi, Kannada, Marathi, Malayalam, Gujarati, Bengali, Telugu, Panjabi, Tamil and Oriya.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Speect
    Speect is a multilingual TTS system. It offers a full text-to-speech system with various API's, as well as an environment for research and development of TTS systems and voices. It is written in ANSI C and uses a plug-in mechanism for extensions. Speect also includes an extensive set of Python bindings for quick implementation of new ideas, these bindings are derived from SWIG interface files and can easily be extended for other languages supported by SWIG.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    terminalmath

    terminalmath

    A talking math practice game.

    terminalmath is a textmode math practice game for school-level children. Its main distinguishing feature is that it talks via the eSpeak text-to-speech engine.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    The AK toolkit is another kit for building and use Hidden Markov Models (HMMs). Originally developed for handwritten text recognition (HTR) using Bernoulli HMMs, it also implements diagonal Gaussians and can be used for any other purpose.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Quh is another audio player that cultivates many insane/bloated API's into a very simple and file operations inspired framework. Quh aims to play everything that makes noise (including reading different text formats using speech synthesis).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    This project aims to build a suite of Natural Language Processing tools. Modules will include corpus indexing and access tools, a part-of-speech tagger, tokenisers, text classification software, etc.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    A phrase to phoneme code converter for the SpeakJet chip by Magnevation. Speakalator runs on Unix type operating systems.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    app_swift provides an interface into the Cepstral TTS libraries to stream text-to-speech audio from the Asterisk PBX without creating files.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next
MongoDB Logo MongoDB