Search Results for "delphi audio components" - Page 2

Showing 517 open source projects for "delphi audio components"

1

YuE

Open source AI model for generating full songs from lyrics prompts

...It includes inference scripts, prompt examples, evaluation tools, and training components that enable researchers and developers to experiment with AI-based music.

Downloads: 3 This Week

Last Update: 2 days ago
See Project
2

BasedHardware

Open source AI wearable platform for recording and summarizing speech

...Omi includes firmware for wearable hardware, a Flutter-based mobile companion application, backend services built with Python and FastAPI, and various SDKs for developers. These components work together to process audio, perform speech recognition, and integrate AI features such as summaries and automated actions. Developers can extend the platform by building plugins, integrations, and custom applications using provided SDKs and APIs. The repository also supports experimental hardware implementations.

Downloads: 8 This Week

Last Update: 12 hours ago
See Project
3

VibeVoice ComfyUI

ComfyUI integration for Microsoft's VibeVoice text-to-speech model

VibeVoice ComfyUI is a comprehensive wrapper that integrates Microsoft’s VibeVoice text-to-speech models directly into ComfyUI workflows. It exposes VibeVoice as a set of custom nodes so you can build single-speaker and multi-speaker voice generation pipelines visually, combining TTS with other audio or generative components. The integration supports high-quality single-speaker synthesis as well as scripted multi-speaker conversations, with optional voice cloning from audio samples for each speaker. It includes advanced control over generation parameters like attention backend, diffusion steps, sampling temperature, guidance scale, and quantization settings, allowing users to tune the trade-offs between quality, VRAM usage, and speed. ...

Downloads: 4 This Week

Last Update: 2025-11-28
See Project
4

Membrane Core

The core of Membrane Framework, multimedia processing framework

membrane_core is the foundation of the Membrane multimedia framework for Elixir, providing the abstractions and runtime needed to build real-time audio and video pipelines. It models media processing as a graph of lightweight, supervised OTP processes—elements connected by links—so work is isolated, fault-tolerant, and easy to scale or reconfigure at runtime. The core defines a clear lifecycle and callback API for elements, plus concepts like buffers, events, and capabilities/format negotiation to keep components interoperable and type-safe. ...

Downloads: 8 This Week

Last Update: 2025-12-22
See Project
5

ThumbmarkJS

World's best free browser fingerprinting library

ThumbmarkJS is an MIT-licensed browser fingerprinting library that produces stable fingerprints with 90% uniqueness. It works with normal and private browsing. ThumbmarkJS is a free, open‑source browser fingerprinting JavaScript library, designed as an alternative to FingerprintJS. It generates distinct, persistent device fingerprints using web APIs like canvas, audio, fonts, WebGL, and more, enabling identification of browsers across sessions, even in incognito or cleared-cache...

Downloads: 2 This Week

Last Update: 2026-04-08
See Project
6

OmAgent

Build multimodal language agents for fast prototyping and production

OmAgent is an open-source Python framework designed to simplify the development of multimodal language agents that can reason, plan, and interact with different types of data sources. The framework provides abstractions and infrastructure for building AI agents that operate on text, images, video, and audio while maintaining a relatively simple interface for developers. Instead of forcing developers to implement complex orchestration logic manually, the system manages task scheduling, worker...

Downloads: 3 This Week

Last Update: 2026-03-05
See Project
7

Descent 3

Descent 3 by Outrage Entertainment

...It provides the full C and C++ engine source code, including the historically significant “1.5” patch that was previously created by developers and later stabilized by fans. The codebase covers the game’s rendering, physics, audio, networking, tools, and editor components, allowing enthusiasts to build, run, and modify the classic 6-degrees-of-freedom space shooter on modern systems. To actually play the game, users must supply their own original game assets, following instructions in the repository’s usage documentation. The project uses CMake and related modern tooling for cross-platform builds, with support for Linux and Windows among other environments. ...

Downloads: 8 This Week

Last Update: 2025-12-01
See Project
8

Clock Signal

A latency-hating emulator of: the Acorn Electron, BBC Micro

Clock Signal is a highly sophisticated multi-system emulator designed with a strong emphasis on minimizing latency and maximizing authenticity in signal reproduction rather than relying on post-processing shortcuts. Its defining philosophy is to make emulation “invisible” to the user, meaning software can be launched directly without requiring manual configuration of machines, disks, or hardware profiles. The emulator supports a wide range of classic systems,...

Downloads: 2 This Week

Last Update: 2026-04-07
See Project
9

MLT Multimedia Framework

MLT Multimedia Framework

...The functionality of the system is provided via an assortment of ready-to-use tools, XML authoring components, and an extensible plug-in-based API.

Downloads: 7 This Week

Last Update: 2025-12-31
See Project
10

LTX-Video

Official repository for LTX-Video

LTX-Video is a sophisticated multimedia processing framework from Lightricks designed to handle high-quality video editing, compositing, and transformation tasks with performance and scalability. It provides runtime components that efficiently decode, encode, and manipulate video streams, frame buffers, and audio tracks while exposing a rich API for building customized editing features like transitions, effects, color grading, and keyframe automation. The toolkit is built with both real-time and offline workflows in mind, enabling applications from consumer editing to professional content creation and batch processing. ...

Downloads: 10 This Week

Last Update: 2026-01-11
See Project
11

MediaPipe Solutions

Cross-platform, customizable ML solutions

MediaPipe is an open-source framework developed by Google for building cross-platform machine learning pipelines that process audio, video, and other streaming data in real time. The system provides developers with tools and reusable components that allow them to combine multiple machine learning models with preprocessing and postprocessing logic into efficient perception pipelines. These pipelines can run on a wide variety of platforms including mobile devices, desktop systems, web browsers, and embedded edge devices. ...

Downloads: 1 This Week

Last Update: 2026-03-15
See Project
12

Wire iOS

Wire for iOS (iPhone and iPad)

...It is the client-side layer that processes all the data that is displayed in the mobile app. It handles network communication and authentication with the backend, push notifications, local caching of data, client-side business logic, signaling with the audio-video libraries, encryption and decryption (using encryption libraries from a lower level) and other bits and pieces. The user interface layer of the mobile app is built on top of the sync engine, which provides the data to display to the UI. The sync engine itself is built on top of a few third-party frameworks, and uses Wire components that are shared between platforms for cryptography (Proteus/Cryptobox) and audio-video signaling (AVS).

Downloads: 0 This Week

Last Update: 3 days ago
See Project
13

Delphi IDE explorer expert

Expert for Embarcadero Delphi that gives access to internal components

This is an expert for the Embarcadero Delphi XE IDEs that gives access to the internal components. It is based on David Hoyle's IDE expert for Delphi 3/4/5 but has been extended to access more properties and to filter on component types.

Downloads: 0 This Week

Last Update: 6 days ago
See Project
14

Instill Core

Instill Core is a full-stack AI infrastructure tool for data

...It provides an end-to-end solution that enables developers to build, deploy, and manage AI-powered applications without needing to manually stitch together multiple tools across the data and model lifecycle. The platform focuses heavily on handling unstructured data such as documents, images, audio, and video, transforming them into AI-ready formats through integrated ETL pipelines and processing workflows. Instill Core includes modular components such as pipelines, artifacts, and model services, which work together to enable flexible and scalable AI system design. It also supports retrieval-augmented generation workflows and model deployment without requiring complex GPU infrastructure management.

Downloads: 6 This Week

Last Update: 2026-03-19
See Project
15

Seamless Communication

Foundational Models for State-of-the-Art Speech and Text Translation

...The motivation is to move beyond “text in, text out” and enable direct, live, multi-turn exchange involving language, gesture, gaze, vision, and modality switching without user friction. The system architecture includes a real-time multimodal signal pipeline for audio, video, and sensor data, a dialog manager that can decide when to act (speak, gesture, point) or query, and a cross-modal reasoning layer that fuses perception with semantic context. The research prototype includes components for visual grounding (understanding when a user references something in view), gesture recognition and synthesis, and turn-taking mechanisms that mirror human conversational timing. ...

Downloads: 0 This Week

Last Update: 2025-10-06
See Project
16

Qwen3-TTS

Qwen3-TTS is an open-source series of TTS models

...Developers can customize voice output parameters like speed, pitch, and volume, and combine the TTS stack with other AI components.

Downloads: 7 This Week

Last Update: 2026-03-17
See Project
17

UDP SuperComponents

Available today for recent Delphi; coming soon to older Delphi and Lazarus

Component for direct access to data via UDP/TCP, developed by me for general use; the demo includes a remote access project for everyone. It is available today for recent Delphi versions and is coming soon to older Delphi versions and Lazarus, as it is being ported to them.

1 Review

Downloads: 0 This Week

Last Update: 2025-07-03
See Project
18

Rig

Rust framework for building modular and scalable LLM-powered apps

...It provides a unified set of abstractions that allow applications to interact with many AI model providers and vector databases through a single interface. Its architecture emphasizes modularity, enabling developers to integrate only the components and integrations they need for a specific application. Rig includes built-in support for agent workflows, allowing systems to perform multi-turn reasoning, tool calling, and retrieval-based tasks within structured pipelines. It also supports capabilities such as text generation, embeddings, transcription, image generation, and audio generation depending on the provider used. ...

Downloads: 6 This Week

Last Update: 4 days ago
See Project
19

Delphi Wrapper for Android Debug Bridge

Delphi Wrapper to copy files to/from phone

Implements a Delphi wrapper for Android Debug Bridge, along with a test program created to exercise the features of the Android Debug Bridge class. It was adapted from an Ada "library" used to copy music and audiobook files to the phone (and delete them later). Android Debug Bridge must be installed on your computer for this to work. Note: you will have to change adbExeLocation in AndroidDebugBridge.pas so that it contains the correct path to adb.exe. This has been tested using ADB version 1.0.41, Delphi 11.3 Community Edition, and a Moto G Stylus 5G (2022) phone running Android 13.

Downloads: 2 This Week

Last Update: 2024-09-09
See Project
20

WhatsApp MCP Server

WhatsApp MCP server enabling AI access to chats and messaging

...It acts as a bridge between WhatsApp and large language models, allowing controlled access to messages, chats, and contacts. whatsapp-mcp is composed of two main components: a Go-based bridge that connects to the WhatsApp Web API and stores data locally, and a Python-based MCP server that exposes tools for AI interaction. All message data is stored in a local SQLite database and is only accessed when explicitly requested through defined tools, giving users control over how their data is used. It supports both sending and receiving messages, including various media types such as images, audio, videos, and documents. ...

Downloads: 2 This Week

Last Update: 2026-03-17
See Project
21

DDabLib

Library of Delphi components and units by DelphiDabbler.

A library of useful and reusable Delphi components, units and IDE extensions published on https://delphidabbler.com/codelib. NOTE: The project has now moved to https://github.com/ddablib, but releases are posted here, in the Files section. Many of the components and classes are stable and have been in development for a number of years. A complete list of library contents and links to documentation is available at https://github.com/delphidabbler/ddab-lib-docs. Releases of each sub-project within the library are made separately. ...

Downloads: 14 This Week

Last Update: 2025-11-29
See Project
22

Jina-Serve

Build multimodal AI applications with cloud-native stack

Jina Serve is an open-source framework designed for building, deploying, and scaling AI services and machine learning pipelines in production environments. The framework allows developers to create microservices that expose machine learning models through APIs that communicate using protocols such as HTTP, gRPC, and WebSockets. It is built with a cloud-native architecture that supports deployment on local machines, containerized environments, or large orchestration platforms such as...

Downloads: 0 This Week

Last Update: 2026-03-10
See Project
23

CosyVoice

Multi-lingual large voice generation model, providing inference

CosyVoice is a multilingual large voice generation model that offers a full-stack solution for training, inference, and deployment of high-quality TTS systems. The model supports multiple languages, including Chinese, English, Japanese, Korean, and a range of Chinese dialects such as Cantonese, Sichuanese, Shanghainese, Tianjinese, and Wuhanese. It is designed for zero-shot voice cloning and cross-lingual or mix-lingual scenarios, so a single reference voice can be used to synthesize speech...

Downloads: 1 This Week

Last Update: 2025-11-30
See Project
24

Omnilingual ASR

Omnilingual ASR: open-source multilingual speech recognition

Omnilingual-ASR is a research codebase exploring automatic speech recognition that generalizes across a very large number of languages using shared modeling and training recipes. It focuses on leveraging self-supervised audio pretraining and scalable fine-tuning so low-resource languages can benefit from high-resource data. The project provides data preparation pipelines, training scripts, decoding utilities, and evaluation tools so researchers can reproduce results and extend to new...

Downloads: 0 This Week

Last Update: 2025-12-12
See Project
25

Multimodal

TorchMultimodal is a PyTorch library

...The library provides modular building blocks such as encoders, fusion modules, loss functions, and transformations that support combining modalities (vision, text, audio, etc.) in unified architectures. It includes a collection of ready model classes—like ALBEF, CLIP, BLIP-2, COCA, FLAVA, MDETR, and Omnivore—that serve as reference implementations you can adopt or adapt. The design emphasizes composability: you can mix and match encoder, fusion, and decoder components rather than starting from monolithic models. ...

Downloads: 0 This Week

Last Update: 2026-01-12
See Project