Open Source Speech Recognition Software - Page 3

Speech Recognition Software

1

ASR for Medical Reporting

Automatic speech recognition system for medical reporting in Spanish.

This is a functional prototype of an automatic speech recognition system for medical reporting in Spanish, built with the CMU Sphinx4 ASR toolkit. It uses a pre-trained acoustic model and a context-dependent language model for nuclear medicine diagnostics.

Downloads: 0 This Week

Last Update: 2020-07-15
See Project
2

ASRT Speech Recognition

A Deep-Learning-Based Chinese Speech Recognition System

ASRT is an end-to-end deep-learning Chinese ASR system built with TensorFlow/Keras, using convolution + CTC and a Max-Entropy HMM language model. It provides a REST/gRPC server backend and client SDKs for multiple languages and platforms (Python, Java, Go, Windows). Notably lightweight, it performs well without GPU acceleration and runs across platforms, targeting developers and researchers building Chinese voice interfaces.

Downloads: 0 This Week

Last Update: 2025-07-03
See Project
3

Arabic Phonetic Platform using VoiceXML

This project is intended to be the core engine for voice-based platforms. It can be integrated into your projects, websites, etc. to provide an Arabic speech service in which your servers interact with clients through Arabic speech recognition.

Downloads: 0 This Week

Last Update: 2013-04-01
See Project
4

Arabisc

Arabisc is a speaker-independent, large-vocabulary continuous speech recognizer for the Arabic language, released under a GNU license. It is also a collection of open source tools that allows researchers and developers to build speech recognition systems for Arab

1 Review

Downloads: 0 This Week

Last Update: 2013-04-26
See Project
5

Awesome Recurrent Neural Networks

A curated list of resources dedicated to RNN

A curated list of resources dedicated to recurrent neural networks (closely related to deep learning). Provides a wide range of works and resources such as a Recurrent Neural Network Tutorial, a Sequence-to-Sequence Model Tutorial, Tutorials by nlintz, Notebook examples by aymericdamien, Scikit Flow (skflow) - Simplified Scikit-learn like Interface for TensorFlow, Keras (TensorFlow / Theano)-based modular deep learning library similar to Torch, char-rnn-tensorflow by sherjilozair, char-rnn in TensorFlow, and much more. Covers code, theory, applications, and datasets for natural language processing, robotics, computer vision, and more.

Downloads: 0 This Week

Last Update: 2021-09-22
See Project
6

AzioSpeech Recognition and Translation

Starting from version 1.2.1.0, the project has been renamed to AzioSpeech Recognition and Translation and is officially published in the Microsoft Store at: https://apps.microsoft.com/detail/9PFV5DG73198

A desktop application built with Avalonia UI that provides real-time speech recognition and translation using Azure Speech Services. Convert spoken words into text and translate them into multiple languages with professional-grade accuracy.

Important Setup Requirements

Before using this application, you MUST have:

1. Azure Account Setup
   Active Azure Subscription - Create a free account at portal.azure.com
   Azure Speech Service Resource - You must create your own Speech Service within your Azure subscription
   Valid API Key & Region - Obtain these credentials from your Azure Speech Service resource

2. Windows Privacy Settings
   CRITICAL: Microphone Access Required - You must grant microphone access through Windows settings

Downloads: 0 This Week

Last Update: 2026-02-13
See Project
7

CALL-SLT

A project which uses existing speech recognition and speech translation resources to build conversation partners for beginning language students, based on the idea of a "translation game".

Downloads: 0 This Week

Last Update: 2019-06-17
See Project
8

CJ7

CJ7 is an open-source speech recognition engine.

Downloads: 0 This Week

Last Update: 2016-10-23
See Project
9

CSLU_KALDI

Speech recognition using Kaldi

Adapting Kaldi speech recognition to a new corpus.

Downloads: 0 This Week

Last Update: 2015-05-03
See Project
10

Centauri Voice Interface

Provides a voice interface for applications via a plug-in system, so that voice recognition can be added to an application with minimal effort.

Downloads: 0 This Week

Last Update: 2016-03-11
See Project
11

Cheery

A smartphone-PC interface for controlling your computer remotely.

Cheery is a smartphone-PC interface for controlling your computer remotely. It uses speech recognition to capture commands and sends them to a Java server that performs the actions. Cheery will soon also be a Swiss Army knife for Android.

Downloads: 0 This Week

Last Update: 2016-10-17
See Project
12

ComTalk

ComTalk uses MSAgent technology, created by Microsoft, to provide an easy-to-use interface to speech recognition and text-to-speech technologies. The MSAgent system is the same one used to produce the assistants in Microsoft Word and other Microsoft programs.

Downloads: 0 This Week

Last Update: 2014-07-11
See Project
13

Commander

Commander.exe is a speech recognition engine for Polaris.

Commander.exe is a speech recognition engine for Polaris. What is Polaris? Polaris is a plugin for the Eclipse IDE that lets you incorporate speech into programming. Using this plugin in Eclipse shows not only that it is possible to provide an environment for programming by voice, but also that programming by voice is part of the natural evolution of programming tools. The current version supports simple but powerful commands such as opening search forms, changing the workspace, and copying and pasting code. Work continues daily to extend the range of functionality that can be controlled by voice. Commander recognizes speech and sends it to Polaris, which triggers actions in the Eclipse IDE.

Downloads: 0 This Week

Last Update: 2019-05-12
See Project
14

Deep Learning Drizzle

Drench yourself in Deep Learning, Reinforcement Learning

Drench yourself in Deep Learning, Reinforcement Learning, Machine Learning, Computer Vision, and NLP by learning from these exciting lectures! Optimization courses which form the foundation for ML, DL, RL. Computer Vision courses which are DL & ML heavy. Speech recognition courses which are DL heavy. Structured Courses on Geometric, Graph Neural Networks. Section on Autonomous Vehicles. Section on Computer Graphics with ML/DL focus.

Downloads: 0 This Week

Last Update: 2022-07-29
See Project
15

Deep Learning with PyTorch

Latest techniques in deep learning and representation learning

This course concerns the latest techniques in deep learning and representation learning, focusing on supervised and unsupervised deep learning, embedding methods, metric learning, convolutional and recurrent nets, with applications to computer vision, natural language understanding, and speech recognition. The prerequisites include DS-GA 1001 Intro to Data Science or a graduate-level machine learning course. To follow the exercises, you will need a laptop with Miniconda (a minimal version of Anaconda) and several Python packages installed. The following instructions work as-is for Mac or Ubuntu Linux users; Windows users need to install and work in the Git BASH terminal. JupyterLab has a built-in selectable dark theme, so you only need to install something if you want to use the classic notebook interface.

Downloads: 0 This Week

Last Update: 2021-10-12
See Project
16

DesktopAgent

DesktopAgent is a freeware application for Microsoft Agent. It uses the interactive capabilities of Microsoft Agent to display a character (agent) which can interact with you using animations, speech and voice recognition.

Downloads: 0 This Week

Last Update: 2013-02-22
See Project
17

Distant Speech Recognition

Beamforming and Speech Recognition Toolkit

BTK contains C++ and Python libraries that implement speech processing and microphone array techniques such as speech feature extraction, speech enhancement, speaker tracking, beamforming, dereverberation and echo cancellation algorithms. The Millennium ASR provides C++ and Python libraries for automatic speech recognition. It implements a weighted finite state transducer (WFST) decoder along with training and adaptation methods. These toolkits are meant to facilitate research and development of automatic distant speech recognition.

Downloads: 0 This Week

Last Update: 2019-08-21
See Project
18

Domotic Speech-recognition interface

Speech-recognition interface for a domotic system.

This product recognizes oral commands and translates them into domotic orders for a domotic system. It does not implement a domotic system itself; it is an interface to be plugged into one. The speech recognition is done by an Arduino UNO board and an EasyVR shield. Available oral commands are generated from a house description file in XML format. The oral commands have to be trained for a specific user. For this purpose, two interfaces are provided: a command line interface and a web application. These interfaces let you view oral commands, train them, and delete trainings.

Downloads: 0 This Week

Last Update: 2015-12-29
See Project
19

EasyGradeXL

Uses speech recognition to enter grades in an Excel workbook

This application simplifies the tedious task of entering grades in an Excel workbook by using the Google text-to-speech API. This API currently supports 137 languages and a number of dialects. The application keeps a log of the grades in the order that they are entered and provides a readback function to easily check whether the grades were entered correctly. This application was developed using Microsoft Excel Version 2108. It currently only runs under Microsoft Windows.

Downloads: 0 This Week

Last Update: 2021-12-21
See Project
20

Ebba

EBBA is a project aiming to develop an advanced chatbot by combining AIML, 3D facial expressions, a speech synthesizer, speech recognition, and IQ-test-solving functionality.

2 Reviews

Downloads: 0 This Week

Last Update: 2016-06-01
See Project
21

FireRedASR

Open-source industrial-grade ASR models

FireRedASR is an industrial-grade family of open-source automatic speech recognition models designed to provide high-precision speech-to-text performance across languages including Mandarin, English, and various Chinese dialects, achieving new state-of-the-art benchmarks on public test sets. The project includes multiple model variants to meet different application needs, such as high-accuracy end-to-end interaction using an encoder-adapter-LLM framework and efficient real-time recognition using attention-based encoder-decoder architectures, giving developers flexibility in balancing performance and resource constraints. FireRedASR not only excels in traditional speech recognition tasks but also demonstrates strong capability in challenging scenarios like singing lyrics recognition, where accurate transcription is often difficult for conventional models.

Downloads: 0 This Week

Last Update: 2026-02-25
See Project
22

G.A.S.I.

Webcam Gesture and Voice Recognition OS proof of concept

Inspired by interfaces from sci-fi movies like Iron Man, Gesture Analytical Sonic Interface (GASI) is a proof of concept of a computer interface based on webcam gesture recognition (Kinect-like) and voice recognition, restricted to components found in an average laptop (a simple webcam and microphone, no Kinect).

Downloads: 0 This Week

Last Update: 2016-11-18
See Project
23

GoMad

GoMad is a speech recognition system that allows you to control Windows-based applications using your voice as input, instead of your mouse and keyboard.

Downloads: 0 This Week

Last Update: 2013-03-22
See Project
24

Grapheme to Phoneme Forge

Use our tools to hand-edit phonetic word dictionaries for speech recognition engines. The new G2P4J format, which supports SAMPA and Kirshenbaum IPA, is portable to Sphinx, Julius and others. Demo medical, legal and technical dictionaries are featured.

Downloads: 0 This Week

Last Update: 2013-04-03
See Project
25

HMM Speech Recognition in Java

Downloads: 0 This Week

Last Update: 2013-09-21
See Project