Search Results for "/storage/emulated/0/android/data/net.sourceforge.uiq3.fx603p/files" - Page 2

Showing 93 open source projects for "/storage/emulated/0/android/data/net.sourceforge.uiq3.fx603p/files"

1

Chatterbox

SoTA open-source TTS

Chatterbox is Resemble AI's first production-grade open source TTS model. Licensed under MIT, Chatterbox has been benchmarked against leading closed-source systems like ElevenLabs and is consistently preferred in side-by-side evaluations. Whether you're working on memes, videos, games, or AI agents, Chatterbox brings your content to life. It's also the first open source TTS model to support emotion exaggeration control, a powerful feature that makes your voices stand out. Try it now on our...

Downloads: 15 This Week

Last Update: 2025-06-25
See Project
2

Open Vision Agents by Stream

Build Vision Agents quickly with any model or video provider

...Developers work with an agent abstraction that connects video edge providers, LLMs, and processors into pipelines, making it easier to orchestrate tasks like object detection, pose estimation, and conversational guidance. The project includes SDKs for React, Android, iOS, Flutter, React Native, and Unity, enabling integration into a wide variety of client environments such as mobile apps, web apps, and games.

Downloads: 0 This Week

Last Update: 5 days ago
See Project
3

ESPnet

End-to-end speech processing toolkit

ESPnet is a comprehensive end-to-end speech processing toolkit covering a wide spectrum of tasks, including automatic speech recognition (ASR), text-to-speech (TTS), speech translation (ST), speech enhancement, speaker diarization, and spoken language understanding. It uses PyTorch as its deep learning engine and adopts a Kaldi-style data processing pipeline for features, data formats, and experimental recipes. This combination allows researchers to leverage modern neural architectures while still benefiting from the robust data preparation practices developed in the speech community. ESPnet provides many ready-to-run recipes for popular academic benchmarks, making it straightforward to reproduce published results or serve as baselines for new research. ...

Downloads: 0 This Week

Last Update: 2026-04-07
See Project
4

ChatTTS

A generative speech model for daily dialogue

ChatTTS is an open-source conversational text-to-speech model optimized for dialogue, developed by 2Noise. Trained on 100,000+ hours of English and Chinese conversation data, it excels at generating expressive prosody—pauses, interjections, laughter—for more natural-sounding speech synthesis in assistant and chatbot applications.

Downloads: 4 This Week

Last Update: 2026-04-10
See Project
5

NVIDIA NeMo

Toolkit for conversational AI

...NeMo has separate collections for Automatic Speech Recognition (ASR), Natural Language Processing (NLP), and Text-to-Speech (TTS) models. Each collection consists of prebuilt modules that include everything needed to train on your data. Every module can easily be customized, extended, and composed to create new conversational AI model architectures. Conversational AI architectures are typically large and require a lot of data and compute for training. NeMo uses PyTorch Lightning for easy and performant multi-GPU/multi-node mixed-precision training. Supported models: Jasper, QuartzNet, CitriNet, Conformer-CTC, Conformer-Transducer, Squeezeformer-CTC, Squeezeformer-Transducer, ContextNet, LSTM-Transducer (RNNT), LSTM-CTC. ...

Downloads: 3 This Week

Last Update: 2026-03-23
See Project
6

KrillinAI

Video translation and dubbing tool powered by LLMs

KrillinAI is an end-to-end content localization, translation, and dubbing tool aimed at helping creators transform videos into multiple languages with minimal manual effort. It integrates several stages of the pipeline: video acquisition (either from local files or remote via download tools), speech recognition (ASR), subtitle segmentation and alignment, machine translation (with context-aware translation to preserve semantics), and voice cloning + text-to-speech (TTS) to produce dubbed audio tracks. KrillinAI supports both landscape and portrait videos, which makes it suitable for a wide range of platforms — from YouTube to TikTok or other vertical-video sites — and ensures correct formatting and layout for the final video. ...

Downloads: 7 This Week

Last Update: 2025-11-28
See Project
7

NVIDIA NeMo Framework

Scalable generative AI framework built for researchers and developers

...NeMo 2.0 introduces a Python-based configuration system, replacing YAML with more flexible, programmable configs that can be versioned and composed for different experiments. The framework builds on PyTorch Lightning–style modular abstractions, so training scripts are composed from reusable components for data loading, models, optimizers, and schedulers, which simplifies experimentation and adaptation. NeMo is designed to scale: with tools like NeMo-Run, users can orchestrate large-scale experiments across thousands of GPUs.

Downloads: 2 This Week

Last Update: 2026-03-23
See Project
8

AI Runner

Offline inference engine for art, real-time voice conversations

...It is implemented as a desktop-oriented Python application and emphasizes privacy and self-hosting, allowing users to work with text-to-speech, speech-to-text, text-to-image and multimodal models without sending data to external services. At the core of its LLM stack is a mode-based architecture with specialized “modes” such as Author, Code, Research, QA and General, and a workflow manager that automatically routes user requests to the right agent based on the task. The project has a strong focus on developer ergonomics, with thorough development guidelines, environment configuration using .env variables, and a clear structure for tests, tools and agents.

Downloads: 10 This Week

Last Update: 2025-12-11
See Project
9

Audiblez

Generate audiobooks from e-books

...It focuses on making audiobook creation easy and fast: from a single command, the tool splits an e-book into chapters, synthesizes audio for each section, and then merges the results into a structured audiobook with chapter-based WAV files and a final .m4b container. The Kokoro-82M model it uses is compact (82M parameters) yet natural sounding, trained on under 100 hours of audio, and supports multiple languages, including English (US/UK), Spanish, French, Hindi, Italian, Japanese, Brazilian Portuguese, and Mandarin Chinese. Audiblez can run entirely from the command line via a PyPI package or through a simple cross-platform GUI built on wxPython, giving both advanced users and non-technical users an accessible workflow.

Downloads: 3 This Week

Last Update: 2025-11-30
See Project
10

MetaVoice-1B

Foundational model for human-like, expressive TTS

...Specifically, the base model (MetaVoice-1B) uses around 1.2 billion parameters and has been trained on a massive dataset — reportedly around 100,000 hours of speech data. The goal is to provide human-like, expressive, and flexible TTS: able to generate natural-sounding speech that can handle diverse inputs and likely generalize over voice styles, intonation, prosody, and perhaps multiple languages or accents. With that scale and dataset volume, MetaVoice aims to push the boundary of what open-source TTS models can achieve: high fidelity, natural prosody, and robustness even for edge cases. ...

Downloads: 0 This Week

Last Update: 2025-11-28
See Project
11

EasyVoice

Open source text-to-speech tool, supports extra-long text

...The system supports multi-role voice acting, letting users assign different neural voices to different characters or narrative roles and configure parameters such as rate, pitch, and volume per role. It offers streaming playback so audio starts almost immediately, even for very long inputs, and automatically generates subtitle files suitable for video production or translation workflows. Under the hood, easyVoice uses a modern stack with Vue 3 and Element Plus on the front end, Node.js and Express on the back end, and TTS engines such as Microsoft Azure TTS and OpenAI-compatible APIs, orchestrated through ffmpeg.

Downloads: 2 This Week

Last Update: 2026-01-26
See Project
12

Matcha-TTS

A fast TTS architecture with conditional flow matching

...The model is fully probabilistic, so it can generate diverse realizations of the same text while still sounding stable and intelligible. The repository provides an end-to-end TTS pipeline: a PyTorch/Lightning training stack, configuration files, pre-trained checkpoints, a command-line interface, and a Gradio app for interactive testing. Users can train on standard datasets like LJSpeech or plug in their own corpora, with helper tools for computing dataset statistics, extracting phoneme durations, and running multi-GPU training.

Downloads: 1 This Week

Last Update: 2025-11-28
See Project
13

WhisperSpeech

An Open Source text-to-speech system built by inverting Whisper

...The project aims to be for speech what Stable Diffusion is for images: powerful, hackable, and safe for commercial use, with code under Apache-2.0/MIT and models trained only on properly licensed data. Its architecture follows a token-based, multi-stage pipeline inspired by AudioLM and SPEAR-TTS: Whisper is used to produce semantic tokens, EnCodec compresses the waveform into acoustic tokens, and Vocos reconstructs high-fidelity audio from those tokens. The repository includes notebooks and scripts for inference, long-form synthesis, and finetuning, as well as pre-trained models and converted datasets hosted on Hugging Face. ...

Downloads: 2 This Week

Last Update: 2025-11-28
See Project
14

MiniMax-MCP

Official MiniMax Model Context Protocol (MCP) server

...The server is written in Python and distributed under the MIT license, with a pyproject.toml and uv-based workflow that makes installation and execution reproducible. Configuration is handled through JSON files that tell MCP clients how to launch the server (typically via uvx minimax-mcp) and which environment variables to use for the API key, host, and output directory. The README carefully explains region-specific API hosts for global and mainland users to avoid invalid-key errors, and documents both local stdio transport and SSE-based network transport modes.

Downloads: 1 This Week

Last Update: 2026-01-07
See Project
15

Lingvo

Framework for building neural networks

...Lingvo includes reference models and configurations for domains like machine translation, automatic speech recognition, language modeling, image understanding, and 3D object detection. Centralized hyperparameter configuration files allow researchers to share exact experiment setups so others can retrain and compare results reliably.

Downloads: 0 This Week

Last Update: 2025-11-28
See Project
16

StyleTTS 2

Towards Human-Level Text-to-Speech through Style Diffusion

...StyleTTS2 supports both single-speaker and multi-speaker configurations, with the ability to sample or transfer styles from reference audio, making it powerful for expressive TTS and character voices. The repository includes training scripts, configuration files, and pre-trained auxiliary modules such as a text aligner, pitch extractor, and PL-BERT-based linguistic encoder.

Downloads: 3 This Week

Last Update: 2025-11-28
See Project
17

QChartist

Free and Open Source Technical Analysis Charting Software

QChartist is free and open source technical analysis charting software. Its purpose is to provide a complete set of tools for performing technical analysis on charts and data. It helps make forecasts, mainly for markets, but can also be used for weather or any other quantifiable data. The program is flexible, and its functionality can be easily extended. You can draw geometrical shapes on your charts or plot programmable indicators from your data. It is also possible to filter or merge data. The design is loosely inspired by MT4, which makes it fairly easy to port programmed indicators from MT4 to QChartist. ...

1 Review

Downloads: 11 This Week

Last Update: 2026-04-12
See Project
18

AudioBC

Offline desktop app to convert EPUB to MP3 using Kokoro-82M neural TTS

AudioBC is a powerful desktop application designed to turn your digital library into a personal audiobook collection. Unlike most Text-to-Speech (TTS) tools that require expensive cloud API subscriptions or an active internet connection, AudioBC runs entirely on your local machine. Powered by the state-of-the-art Kokoro-82M neural engine, AudioBC produces natural, human-like speech that rivals premium cloud services. It is built with a focus on privacy and simplicity, offering a...

Downloads: 4 This Week

Last Update: 2026-03-22
See Project
19

Bert-VITS2

VITS2 backbone with multilingual-bert

...The core idea is to use BERT-style contextual embeddings for text encoding while relying on a refined VITS2 architecture for acoustic generation and vocoding. The repository includes everything needed to train, fine-tune, and run the model, from configuration files to preprocessing scripts, spectrogram utilities, and training entrypoints for multi-GPU and multi-node setups. It provides emotional modeling through “emo embeddings,” allowing voices to be conditioned on different affective states during synthesis. Releases include optimizations for Japanese and English alignment, expanded training data, spec caching and pre-generation tools, as well as ONNX export for more lightweight inference deployments.

Downloads: 1 This Week

Last Update: 2025-11-28
See Project
20

Voice Accounting For Blind & Mute People

Free & Easy AI Voice Accounting Software For Blind & Speechless People

Download the zip file above, extract it, and then open the index.html file in a web browser such as Firefox (preferred) or Google Chrome. My full collection of software for people with disabilities, which also includes the Voice Accounting Software, can be viewed and downloaded here: https://sourceforge.net/projects/softwares-for-disabled-people/

Downloads: 0 This Week

Last Update: 2024-04-30
See Project
21

Softwares For Blind, Deaf, Handicap

Easy AI Softwares for Blind, Deaf, Handicapped, Disabled People

Download the zip file above, extract it first, and then open the index.html file in a web browser such as Firefox (preferred) or Google Chrome. Keep NumLock ON while using the numeric keypad of any keyboard. You can also attach an external USB keyboard with a separate numeric keypad if required. I have added some general guidelines for students using this software on the Wiki page of this website. Please refer to them for more instructions.

Downloads: 0 This Week

Last Update: 2026-01-18
See Project
22

EasyTTS

Text to Speech Utility

EasyTTS is a text-to-speech app for 64-bit Windows that offers online and offline synthesis, with a setting for voice speed. Languages other than English are supported only while you are connected to the Internet: Spanish, Portuguese, Russian, French, and Mandarin (?) Chinese.

1 Review

Downloads: 2 This Week

Last Update: 2024-05-01
See Project
23

RS Media Robot Development Kit

A series of open source files and programs available to use for developing programs to work with the WowWee Robotics RSMedia Robot. These include a USB serial console, a cross-compiler, a firmware dump program, text-to-speech and source code.

Downloads: 6 This Week

Last Update: 2026-01-14
See Project
24

SpeakFlow-TTS

Multilingual Text-to-Speech (TTS)

SpeakFlow is an intuitive desktop application for Text-to-Speech (TTS) conversion. It lets you transform entered text into high-quality audio files, using natural voices in many languages. Key features of SpeakFlow: Multilingual support: choose from a wide range of languages and voices (Ukrainian, English, German, Russian, Polish, French, Italian, Spanish, Portuguese, and more). Simple and intuitive interface: designed for quick and convenient audio generation. Audio playback: instantly listen to the generated audio and download it in MP3 format.

Downloads: 0 This Week

Last Update: 2025-07-13
See Project
25

EmotiVoice

Multi-Voice and Prompt-Controlled TTS Engine

...EmotiVoice provides multiple ways to interact with it, including a web interface, a Docker image, an HTTP API (including an OpenAI-compatible TTS API), and Python scripts for batch synthesis. It also supports voice cloning with your own data, backed by recipes for popular datasets like DataBaker and LJSpeech, so you can train or adapt voices to custom personas.

Downloads: 5 This Week

Last Update: 2025-11-30
See Project
