Search Results for "input-output model" - Page 5

Sort By:

Showing 665 open source projects for "input-output model"

View related business solutions

Outplacement, Executive Coaching and Career Development | Careerminds
Careerminds outplacement includes personalized coaching and a high-tech approach to help transition employees back to work faster.

By helping to avoid the potential risks of RIFs or layoffs through our global outplacement services, companies can move forward with their goals while preserving their internal culture, employer brand, and bottom lines.

Learn More
Pylon is an All-in-one B2B Support Platform for modern B2B businesses.
Pylon is a modern support system that integrates with all B2B channels like Slack and Team.

We bring together everything a post-sales teams team needs including a ticketing system, B2B omnichannel integrations (Slack Connect, Microsoft Teams), modern chat widget, knowledge base, AI support bot, account management, customer marketing, and more.

Learn More
1

Paint.NET

Downloads for Paint.NET, such as installer EXEs and portable ZIPs

...Starting the app is nearly instantaneous, and every feature has been thoroughly optimized to take advantage of the latest multicore CPUs, GPUs, and NVMe SSDs. The use of DXGI Flip Model ensures low input latency and reduced power consumption. Whether you have a power-conscious laptop or a monstrous desktop with a gigantic GPU, you can expect it to start up immediately, respond quickly to every mouse click, and take full advantage of all of your hardware.

Downloads: 139 This Week

Last Update: 2026-03-08
See Project
2

kubectl-neat

Clean up Kubernetes yaml and json output to make it readable

Remove clutter from Kubernetes manifests to make them more readable.

Downloads: 7 This Week

Last Update: 2024-07-12
See Project
3

node-llama-cpp

Run AI models locally on your machine with node.js bindings for llama

...The library enables applications built with Node.js to interact directly with local LLM models without requiring a remote API or external service. By using native bindings and optimized model execution, the framework allows developers to integrate advanced language model capabilities into desktop applications, server software, and command-line tools. The system automatically detects the available hardware on a machine and selects the most appropriate compute backend, including CPU or GPU acceleration. Developers can use the library to perform tasks such as text generation, conversational chat, embedding generation, and structured output generation. ...

Downloads: 11 This Week

Last Update: 2026-03-17
See Project
4

SimpleLLM

950 line, minimal, extensible LLM inference engine built from scratch

...Designed to run efficiently on high-end GPUs like NVIDIA H100 with support for models such as OpenAI/gpt-oss-120b, Simple-LLM implements continuous batching and event-driven inference loops to maximize hardware utilization and throughput. Its straightforward code structure allows anyone experimenting with custom kernels, new batching strategies, or inference optimizations to trace execution from input to output with minimal cognitive overhead.

Downloads: 0 This Week

Last Update: 2026-01-28
See Project
Instant Remote Support Software. Unattended Remote Access Software.
Zoho Assist, your all-in-one remote access solution, helps you to access and manage remote devices.

Zoho Assist is cloud-based remote support and remote access software that helps you support customers from a distance through web-based, on-demand remote support sessions. Set up unattended remote access and manage remote PCs, laptops, mobile devices, and servers effortlessly. A few seconds is all you need to establish secure connections to offer your customers remote support solutions.

Learn More
5

Mesh R-CNN

code for Mesh R-CNN, ICCV 2019

...The system combines 2D detection from Mask R-CNN with 3D reasoning modules that output full mesh reconstructions aligned with the input image. It has been evaluated on datasets such as Pix3D, where it demonstrates state-of-the-art performance in reconstructing real-world object geometry.

Downloads: 0 This Week

Last Update: 5 days ago
See Project
6

Starter Applets

Google AI Studio Starter Apps

starter-applets is a collection of minimal, sandboxed example “applets” that demonstrate how to compose Gemini-powered microapps (chat widgets, image generation, workflows) that can be embedded in other applications or used standalone. The applets are structured with a focus on simplicity: each presents a prompt input, minimal UI logic, and inline display of the resulting output or widget (e.g. generated text, images). They are built to illustrate best practices (e.g. safety guards, prompt templates, streaming UI updates) rather than production feature sets. The repo supplies a CLI or script to scaffold new applet templates, letting developers spin up small Gemini-powered components quickly. ...

Downloads: 0 This Week

Last Update: 2025-10-07
See Project
7

whylogs

The open standard for data logging

whylogs is an open-source library for logging any kind of data. With whylogs, users are able to generate summaries of their datasets (called whylogs profiles) which they can use to track changes in their dataset Create data constraints to know whether their data looks the way it should. Quickly visualize key summary statistics about their datasets. whylogs profiles are the core of the whylogs library. They capture key statistical properties of data, such as the distribution (far beyond...

Downloads: 5 This Week

Last Update: 2024-12-03
See Project
8

Gemini-API

Reverse-engineered Python API for Google Gemini web app

Gemini-API is a community-created asynchronous Python wrapper for the web interface of Google’s Gemini models (formerly Bard). It is the result of reverse-engineering the Gemini web app and exposing its functionality through a programmatic API. This enables developers to incorporate Gemini into Python applications, scripts, bots, or tools without relying solely on official SDKs. The wrapper supports streaming responses, model selection, and handling of the web-based authentication/session...

Downloads: 10 This Week

Last Update: 2026-04-06
See Project
9

KVision

Object oriented web framework for Kotlin/JS

...Choose yourself the preferred way. Almost 100 ready-to-use GUI components are used as building blocks for the application UI. Sophisticated layout containers. Forms with type safe data model and built-in validation. Different text input components including rich text, typeahead, and input mask support. Buttons, checkboxes, radios and switches. Date and time picker. Spinner and range numeric input components. Advanced select box with ajax support. File upload with preview and multi-selection. Advanced charts. Reactive tables. ...

Downloads: 3 This Week

Last Update: 2026-03-28
See Project
End-To-End Document Management Software
UnForm is ideal for businesses focusing on distribution, manufacturing ERP solutions, and general accounting.

UnForm® is a platform-independent software product that creates, delivers, stores and retrieves graphically enhanced documents from ERP application printing. A complete, end-to-end document management solution, UnForm interfaces at the point of printing to produce documents in various formats for printing and electronic delivery.

Learn More
10

Chandra

OCR model for complex documents with layout-aware structured outputs

Chandra is an advanced OCR model designed to extract and structure information from complex documents such as tables, forms, handwritten notes, and mathematical content. It focuses on preserving full document layout, meaning that extracted text is accompanied by positional metadata like bounding boxes for each element. Chandra supports multiple output formats including Markdown, HTML, and JSON, making it suitable for downstream processing and integration into data pipelines. ...

Downloads: 1 This Week

Last Update: 2026-03-18
See Project
11

HunyuanVideo-Foley

Multimodal Diffusion with Representation Alignment

HunyuanVideo-Foley is a multimodal diffusion model from Tencent Hunyuan for high-fidelity Foley (sound effects) audio generation synchronized to video scenes. It is designed to generate audio that matches both visual content and textual semantic cues, for use in video production, film, advertising, games, etc. The model architecture aligns audio, video, and text representations to produce realistic synchronized soundtracks.

Downloads: 1 This Week

Last Update: 2025-09-28
See Project
12

Whisper-WebUI

A Web UI for easy subtitle using whisper model

Whisper WebUI is an open-source browser-based interface that simplifies the use of Whisper speech recognition models by providing an intuitive graphical environment for transcription, translation, and subtitle generation. Built with Gradio, it allows users to upload audio or video files, process them locally, and generate accurate text outputs without relying on command-line tools. The platform integrates optimized implementations such as faster-whisper, significantly improving transcription...

Downloads: 18 This Week

Last Update: 2026-03-18
See Project
13

Easy Diffusion

An easy 1-click way to create beautiful artwork on your PC using AI

...It provides a browser-based user interface that runs locally, allowing users to type text prompts and immediately generate images directly within their web browser, democratizing access to powerful text-to-image models for artists and hobbyists alike. The project abstracts away environment setup, dependencies, and model installation — tasks that can be daunting to beginners — and instead lets users focus on creative experimentation with prompt phrasing, model parameters, and image output settings. Because it’s designed to be easy to install and use, EasyDiffusion’s interface includes options for queuing multiple jobs, applying modifiers like upscaling or face correction, and adjusting generation parameters like guidance scale and resolution.

Downloads: 42 This Week

Last Update: 2026-03-31
See Project
14

RealtimeTTS

Converts text to speech in realtime

RealtimeTTS is a low-latency text-to-speech library built for real-time applications such as voice chat with LLMs, assistants, and interactive tools. It is designed around a streaming model: you can feed it text incrementally (for example, as an LLM responds) and get audio output almost immediately, which keeps end-to-end latency very low. The library is engine-agnostic and plugs into a wide range of cloud and local TTS systems, including OpenAI, ElevenLabs, Azure, Coqui, Piper, StyleTTS2, Edge TTS, Google TTS, system TTS and others, so you can swap providers without rewriting your pipeline. ...

Downloads: 5 This Week

Last Update: 2026-03-28
See Project
15

Hermes Agent

The agent that grows with you

Hermes Agent is a fully open-source autonomous AI agent designed to run persistently on your own machine or server, becoming more capable the longer it operates by learning from experience and building reusable procedural skills. Rather than functioning as a stateless chatbot, it maintains long-term memory across sessions and can generate searchable “Skill Documents” that capture how it solved complex tasks so it doesn’t start from scratch each time. The agent interfaces with messaging...

Downloads: 151 This Week

Last Update: 4 days ago
See Project
16

WorkAny

Desktop Agent for Any Task

WorkAny is an open-source desktop AI agent application that executes generic tasks through natural language commands, effectively bringing intelligent automation into everyday workflows without needing to write code manually. It acts as a unified environment where users can ask the AI to generate documents, presentations, websites, spreadsheets, organize files, or write code — all with real-time streaming outputs directly in the app, so you see results as the AI produces them. Powered by a...

Downloads: 8 This Week

Last Update: 2026-03-07
See Project
17

Qwen3 Embedding

Designed for text embedding and ranking tasks

Qwen3-Embedding is a model series from the Qwen family designed specifically for text embedding and ranking tasks. It builds upon the Qwen3 base/dense models and offers several sizes (0.6B, 4B, 8B parameters), for both embedding and reranking, with high multilingual capability, long‐context understanding, and reasoning. It achieves state-of-the-art performance on benchmarks like MTEB (Multilingual Text Embedding Benchmark) and supports instruction-aware embedding (i.e. embedding task...

Downloads: 0 This Week

Last Update: 2025-09-30
See Project
18

Prompt Declaration Language

Prompt Declaration Language is a declarative prompt programming lang

LLMs will continue to change the way we build software systems. They are not only useful as coding assistants, providing snipets of code, explanations, and code transformations, but they can also help replace components that could only previously be achieved with rule-based systems. Whether LLMs are used as coding assistants or software components, reliability remains an important concern. LLMs have a textual interface and the structure of useful prompts is not captured formally. Programming...

Downloads: 2 This Week

Last Update: 2026-02-26
See Project
19

MetaVoice-1B

Foundational model for human-like, expressive TTS

MetaVoice — in the form of its source repository “metavoice-src” — is a large-scale text-to-speech (TTS) model. Specifically, the base model (MetaVoice-1B) uses around 1.2 billion parameters and has been trained on a massive dataset — reportedly around 100,000 hours of speech data. The goal is to provide human-like, expressive, and flexible TTS: able to generate natural-sounding speech that can handle diverse inputs and likely generalize over voice styles, intonation, prosody, and perhaps...

Downloads: 0 This Week

Last Update: 2025-11-28
See Project
20

AWS MCP Servers

Helping you get the most out of AWS, wherever you use MCP

AWS MCP Servers are a collection of remotely hosted, fully-managed Model Context Protocol (MCP) servers by AWS, providing AI applications with real-time access to AWS documentation, API references, best practices, and infrastructure-management capabilities via natural-language workflows. An MCP Server is a lightweight program that exposes specific capabilities through the standardized Model Context Protocol. Host applications (such as chatbots, IDEs, and other AI tools) have MCP clients that...

Downloads: 5 This Week

Last Update: 3 days ago
See Project
21

Ring

Ring is a reasoning MoE LLM provided and open-sourced by InclusionAI

Ring is a reasoning Mixture-of-Experts (MoE) large language model (LLM) developed by inclusionAI. It is built from or derived from Ling. Its design emphasizes reasoning, efficiency, and modular expert activation. In its “flash” variant (Ring-flash-2.0), it optimizes inference by activating only a subset of experts. It applies reinforcement learning/reasoning optimization techniques. Its architectures and training approaches are tuned to enable efficient and capable reasoning performance....

Downloads: 0 This Week

Last Update: 2025-09-30
See Project
22

Bolt NLP

Bolt is a deep learning library with high performance

Bolt is a high-performance deep learning inference framework developed by Huawei Noah's Ark Lab. It is designed to optimize and accelerate the deployment of deep learning models across various hardware platforms. Bolt is a light-weight library for deep learning. Bolt, as a universal deployment tool for all kinds of neural networks, aims to automate the deployment pipeline and achieve extreme acceleration. Bolt has been widely deployed and used in many departments of HUAWEI company, such as...

Downloads: 7 This Week

Last Update: 2025-01-30
See Project
23

Asciidoctor PDF

Asciidoctor PDF: A native PDF converter for AsciiDoc

A fast text processor & publishing toolchain for converting AsciiDoc to HTML5, DocBook & more. Asciidoctor is a fast, open source, Ruby-based text processor for parsing AsciiDoc® into a document model and converting it to output formats such as HTML 5, DocBook 5, manual pages, PDF, EPUB 3, and other formats. Asciidoctor also has an ecosystem of extensions, converters, build plugins, and tools to help you author and publish content written in AsciiDoc.

Downloads: 6 This Week

Last Update: 2025-11-15
See Project
24

Step1X-3D

High-Fidelity and Controllable Generation of Textured 3D Assets

Step1X-3D is an open-source framework for generating high-fidelity textured 3D assets from scratch — both their geometry and surface textures — using modern generative AI techniques. It combines a hybrid architecture: a geometry generation stage using a VAE-DiT model to output a watertight 3D representation (e.g. TSDF surface), and a texture synthesis stage that conditions on geometry and optionally reference input (or prompts) to produce view-consistent textures using a diffusion-based texture module. The result is fully 3D assets — meshes + textures — which can be rendered from any viewpoint, textured consistently, and used in 3D applications. ...

Downloads: 1 This Week

Last Update: 2025-12-01
See Project
25

HunyuanWorld-Voyager

RGBD video generation model conditioned on camera input

HunyuanWorld-Voyager is a next-generation video diffusion framework developed by Tencent-Hunyuan for generating world-consistent 3D scene videos from a single input image. By leveraging user-defined camera paths, it enables immersive scene exploration and supports controllable video synthesis with high realism. The system jointly produces aligned RGB and depth video sequences, making it directly applicable to 3D reconstruction tasks. At its core, Voyager integrates a world-consistent video diffusion model with an efficient long-range world exploration engine powered by auto-regressive inference. ...

Downloads: 8 This Week

Last Update: 2025-12-17
See Project