Search Results for "tamil speech recognition"

Showing 37 open source projects for "tamil speech recognition"

1

Vosk Speech Recognition Toolkit

Offline speech recognition API for Android, iOS, Raspberry Pi

Speech recognition bindings are implemented for various programming languages like Python, Java, Node.JS, C#, C++, Rust, Go and others. Vosk supplies speech recognition for chatbots, smart home appliances, and virtual assistants. It can also create subtitles for movies, and transcription for lectures and interviews. Vosk scales from small devices like Raspberry Pi or Android smartphones to big clusters.

Downloads: 103 This Week

Last Update: 2024-04-22
2

sherpa-onnx

Speech-to-text, text-to-speech, and speaker recognition

Speech-to-text, text-to-speech, and speaker recognition using next-gen Kaldi with onnxruntime, without an Internet connection. Supports embedded systems, Android, iOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go, NodeJS, Java, Swift, Dart, JavaScript, Flutter.

Downloads: 200 This Week

Last Update: 3 hours ago
3

Kaldi

kaldi-asr/kaldi is the official location of the Kaldi project

Kaldi is an open source toolkit for speech recognition research. It provides a powerful framework for building state-of-the-art automatic speech recognition (ASR) systems, with support for deep neural networks, Gaussian mixture models, hidden Markov models, and other advanced techniques. The toolkit is widely used in both academia and industry due to its flexibility, extensibility, and strong community support.

Downloads: 1 This Week

Last Update: 7 days ago
4

whisper.cpp

Port of OpenAI's Whisper model in C/C++

whisper.cpp is a lightweight, C/C++ reimplementation of OpenAI’s Whisper automatic speech recognition (ASR) model—designed for efficient, standalone transcription without external dependencies. The entire high-level implementation of the model is contained in whisper.h and whisper.cpp. The rest of the code is part of the ggml machine learning library. The command downloads the base.en model converted to custom ggml format and runs the inference on all .wav samples in the folder samples. whisper.cpp supports integer quantization of the Whisper ggml models. ...

Downloads: 365 This Week

Last Update: 2026-03-19
5

OpenVINO

OpenVINO™ Toolkit repository

OpenVINO™ is an open-source toolkit for optimizing and deploying AI inference. Boost deep learning performance in computer vision, automatic speech recognition, natural language processing and other common tasks. Use models trained with popular frameworks like TensorFlow, PyTorch and more. Reduce resource demands and efficiently deploy on a range of Intel® platforms from edge to cloud. This open-source version includes several components: Model Optimizer, OpenVINO™ Runtime, and Post-Training Optimization Tool, as well as CPU, GPU, MYRIAD, multi-device and heterogeneous plugins to accelerate deep learning inferencing on Intel® CPUs and Intel® Processor Graphics. ...

Downloads: 40 This Week

Last Update: 2026-03-25
6

XiaoZhi AI Chatbot

Build your own AI friend

xiaozhi-esp32 is an open-source project that guides users in building their own AI-powered conversational companion using the ESP32 microcontroller. The project provides detailed instructions on assembling the hardware, setting up the software, and integrating AI models to enable natural language interactions. This DIY approach offers an accessible entry point into AI and hardware development.

Downloads: 339 This Week

Last Update: 2026-03-09
7

RunAnywhere

Production ready toolkit to run AI locally

...It also includes integrated pipelines that combine speech-to-text, large language models, and text-to-speech into a complete conversational system.

Downloads: 0 This Week

Last Update: 2026-03-20
8

Operit AI

Powerful Android AI agent with tools, automation, and Linux shell

Operit is a full-featured AI assistant and agent platform designed specifically for Android devices, aiming to go far beyond traditional chat-based interfaces. It integrates deep system-level capabilities with a wide range of tools, allowing the AI to perform real tasks such as file management, automation, and system control directly on the device. A standout aspect of the project is its built-in Ubuntu 24 environment, which enables users to run Linux commands, scripts, and development tools...

Downloads: 8 This Week

Last Update: 2026-03-18
9

FAY

Framework for building AI-powered interactive digital humans and agents

...Fay supports various types of digital humans, including 2.5D and 3D avatars, and can be integrated with applications running on mobile devices, PCs, web platforms, and embedded systems. Its architecture allows developers to combine different AI components such as speech recognition, text-to-speech, and large language models to create conversational digital agents. Fay provides multiple interfaces for text, voice, and digital human control, enabling developers to build interactive assistants, virtual presenters, or automated service agents. It also supports custom knowledge bases and configurable behaviors so developers can tailor the personality and responses of the digital human.

Downloads: 3 This Week

Last Update: 2026-04-10
10

TensorRT

C++ library for high performance inference on NVIDIA GPUs

NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference. It includes a deep learning inference optimizer and runtime that delivers low latency and high throughput for deep learning inference applications. TensorRT-based applications perform up to 40X faster than CPU-only platforms during inference. With TensorRT, you can optimize neural network models trained in all major frameworks, calibrate for lower precision with high accuracy, and deploy to hyperscale data centers,...

Downloads: 22 This Week

Last Update: 2026-03-25
11

KoboldCpp

Run GGUF models easily with a UI or API. One File. Zero Install.

KoboldCpp is an easy-to-use AI text-generation software for GGML and GGUF models, inspired by the original KoboldAI. It's a single self-contained distributable that builds off llama.cpp and adds many additional powerful features.

Downloads: 476 This Week

Last Update: 6 hours ago
12

Flashlight library

A C++ standalone library for machine learning

Flashlight is a fast, flexible machine learning library written entirely in C++ by Facebook AI Research and the creators of Torch, TensorFlow, Eigen, and Deep Speech. Native support in C++ and simple extensibility make Flashlight a powerful research framework that is hackable to its core and enables fast iteration on new experimental setups and algorithms, while remaining unopinionated and without sacrificing performance. In a single repository, Flashlight provides apps for research across...

Downloads: 2 This Week

Last Update: 2022-05-27
13

wav2letter++

Facebook AI research's automatic speech recognition toolkit

First, install Flashlight (using the 0.3 branch is required) with the ASR application. This repository includes recipes to reproduce the following research papers as well as pre-trained models. All results reproduction must use Flashlight <= 0.3.2 for exact reproducibility. At least one of LZMA, BZip2, or Z is required for LM compression with KenLM. It is highly recommended to build KenLM with position-independent code (-fPIC) enabled, to enable python compatibility. After installing, run...

Downloads: 0 This Week

Last Update: 2022-05-27
14

Speech Recognition in English & Polish

Speech recognition software for English & Polish languages

Software for speech recognition in English & Polish languages.
Basic versions of SkryBot:
1. SkryBot Home Speech (English Language) - https://sourceforge.net/projects/skrybotdomowy/files/ReleasesEnglish/InstalatorSkryBotHomeSpeechDemo-2.6.9.18117.exe/download
2. SkryBot DoMowy (Polish Language) - https://sourceforge.net/projects/skrybotdomowy/files/ReleasesPolish/InstalatorSkryBotDoMowyDemo-2.4.9.18117.exe/download
More help: https://sourceforge.net/p/skrybotdomowy/wiki/
Domain advanced versions (Polish Language) 1. ...

2 Reviews

Downloads: 6 This Week

Last Update: 2020-03-15
15

JuliusModels

Open source speech models for Julius in English and other languages.

Open source speech models for the Julius speech decoder. Its aim is to give a wider community of speech recognition enthusiasts access to quality models, which they can use in their own projects on different OS platforms (Unix, Windows, etc.). All of the models are based on HTK modelling software and data sets freely available on the Internet.

Downloads: 4 This Week

Last Update: 2018-05-11
16

Caffe Framework

Caffe, a fast open framework for deep learning

Caffe is a deep learning framework made with expression, speed, and modularity in mind. It is developed by Berkeley AI Research (BAIR) and by community contributors. Yangqing Jia created the project during his PhD at UC Berkeley. Caffe is released under the BSD 2-Clause license. Expressive architecture encourages application and innovation. Models and optimization are defined by configuration without hard-coding. Switch between CPU and GPU by setting a single flag to train on a GPU machine...

Downloads: 0 This Week

Last Update: 2021-12-09
17

Distant Speech Recognition

Beamforming and Speech Recognition Toolkit

...These toolkits are meant for facilitating research and development of automatic distant speech recognition.

Downloads: 0 This Week

Last Update: 2019-08-21
18

Kaldi

Speech recognition research toolkit

13 Reviews

Downloads: 23 This Week

Last Update: 2016-02-19
19

Specimen Photography for Canon Powershot

SpecimenPhoto controls a Canon Powershot camera for specimen archival photography. Each photograph is assigned a case number, labeled and stored. Identification is manual or "hands free" using separately available barcode and speech recognition modules.

Downloads: 0 This Week

Last Update: 2015-04-08
20

jaivox

Speech recognition application builder and library

Java library and tools to create open source speech recognition applications. Generates dialogs for conversational interfaces. Works with a popular open source speech recognition library.

Downloads: 0 This Week

Last Update: 2015-03-26
21

Voce

A speech synthesis and recognition library that is cross-platform, accessible from Java and C++, and has a very small API. Uses CMU Sphinx4 and FreeTTS internally.

3 Reviews

Downloads: 0 This Week

Last Update: 2013-10-03
22

RNNLIB

RNNLIB is a recurrent neural network library for sequence learning problems. Applicable to most types of spatiotemporal data, it has proven particularly effective for speech and handwriting recognition. Full installation and usage instructions are given at http://sourceforge.net/p/rnnl/wiki/Home/

2 Reviews

Downloads: 0 This Week

Last Update: 2016-11-28
23

openSMILE

SMILE = Speech & Music Interpretation by Large Space Extraction. openSMILE is a fast, real-time (audio) feature extraction utility for automatic speech, music, and paralinguistic recognition research. It was originally developed at TUM in the scope of the EU project SEMAINE and is now maintained and supported by audEERING.

Downloads: 3 This Week

Last Update: 2014-11-27
24

World Voice Recognition

World Voice Recognition is an open source voice recognition program whose goal is to link together modules created by any developer (a microphone module, a voice recognition module, a module that makes the computer speak, or plugins such as a weather plugin). The SDK is compatible with any programming language (ASM, C++, Ada, Java...) on all platforms (Windows, Mac, and Linux). All developers can program a...

Downloads: 0 This Week

Last Update: 2012-06-14
25

CJ7

CJ7 is an open-source speech recognition engine.

Downloads: 0 This Week

Last Update: 2016-10-23