Page 2 | gradio free download

Showing 47 open source projects for "gradio"

View related business solutions

A privacy-first API that predicts global consumer preferences
Qloo AI adds value to a wide range of Fortune 500 companies in the media, technology, CPG, hospitality, and automotive sectors.

Through our API, we provide contextualized personalization and insights based on a deep understanding of consumer behavior and more than 575 million people, places, and things.

Learn More
Ditto Edge Server is a lightweight standalone server for resource-constrained edge environments, based on the core Ditto Edge SDK.
With Ditto Edge Server, you can join devices as small as a Raspberry Pi to a local mesh network and synchronize data across edge environments.

Ditto's Edge SDK is the only thing your edge devices need to ensure your application is operational in any environment, regardless of network conditions.

Learn More
1

VoxCPM

TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning

VoxCPM is a tokenizer-free text-to-speech system that models speech in a continuous space, aiming for extremely realistic, context-aware synthesis and true-to-life zero-shot voice cloning. Instead of converting speech into discrete tokens, it uses an end-to-end diffusion-autoregressive architecture built on the MiniCPM-4 backbone, combining hierarchical language modeling, finite scalar quantization (FSQ), and local Diffusion Transformers. This design helps decouple semantic and acoustic...

Downloads: 44 This Week

Last Update: 2026-04-08
See Project
2

Fish Speech

SOTA Open Source TTS

Fish Speech is a state-of-the-art open-source text-to-speech project that has evolved into the OpenAudio series of advanced TTS models. The repository hosts the code and tooling for training, fine-tuning, and serving high-quality TTS, while the current flagship models (OpenAudio-S1 and S1-mini) are distributed via Fish Audio’s playground and Hugging Face. The models are evaluated with Seed TTS metrics and achieve exceptionally low word and character error rates, indicating strong...

Downloads: 19 This Week

Last Update: 2025-11-28
See Project
3

HunyuanVideo-Foley

Multimodal Diffusion with Representation Alignment

HunyuanVideo-Foley is a multimodal diffusion model from Tencent Hunyuan for high-fidelity Foley (sound effects) audio generation synchronized to video scenes. It is designed to generate audio that matches both visual content and textual semantic cues, for use in video production, film, advertising, games, etc. The model architecture aligns audio, video, and text representations to produce realistic synchronized soundtracks. Produces high-quality 48 kHz audio output suitable for professional...

Downloads: 1 This Week

Last Update: 2025-09-28
See Project
4

HunyuanWorld-Voyager

RGBD video generation model conditioned on camera input

HunyuanWorld-Voyager is a next-generation video diffusion framework developed by Tencent-Hunyuan for generating world-consistent 3D scene videos from a single input image. By leveraging user-defined camera paths, it enables immersive scene exploration and supports controllable video synthesis with high realism. The system jointly produces aligned RGB and depth video sequences, making it directly applicable to 3D reconstruction tasks. At its core, Voyager integrates a world-consistent video...

Downloads: 8 This Week

Last Update: 2025-12-17
See Project
Contractor Foreman is the most affordable all-in-one construction management software for contractors and is trusted by contractors in more than 75 countries.
For Residential, Commercial and Public Works Contractors

Starting at $49/m for the WHOLE company, Contractor Foreman is the most affordable all-in-one construction management system for contractors. Our customers in 75+ countries and industry awards back it up. And it's all backed by a 100 day guarantee.

Learn More
5

MedicalGPT

MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training

MedicalGPT training medical GPT model with ChatGPT training pipeline, implementation of Pretraining, Supervised Finetuning, Reward Modeling and Reinforcement Learning. MedicalGPT trains large medical models, including secondary pre-training, supervised fine-tuning, reward modeling, and reinforcement learning training.

Downloads: 11 This Week

Last Update: 4 days ago
See Project
6

Sygil WebUI

Stable Diffusion web UI

Sygil WebUI is a browser-based interface for running Stable Diffusion image generation locally or on a server, wrapping common text-to-image and image-to-image workflows into a practical UI. It provides multiple UI modes (including a legacy Gradio interface) and focuses on making iterative prompting, parameter tuning, and post-processing accessible without writing code. The UI exposes core generation controls like resolution, CFG guidance, sampling steps, samplers, seeds, and batch generation so users can reproduce results and refine outputs systematically. It also supports jumping between workflows, such as sending an output directly into Image2Image for variations or into an “Image Lab” style area for enhancement and upscaling. ...

Downloads: 1 This Week

Last Update: 2026-02-03
See Project
7

Qwen-Agent

Agent framework and applications built upon Qwen>=3.0

Qwen-Agent is a framework for building applications / agents using Qwen models (version 3.0+). It provides components for instruction following, tool usage (function calling), planning, memory, RAG (retrieval augmented generation), code interpreter, etc. It ships with example applications (Browser Assistant, Code Interpreter, Custom Assistant), supports GUI front-ends, backends, server setups. Agent workflow can maintain context / memory to perform multi-turn or more complex logic over time....

Downloads: 4 This Week

Last Update: 2025-09-23
See Project
8

ChatGLM2-6B

ChatGLM2-6B: An Open Bilingual Chat LLM

ChatGLM2-6B is the second-gen Chinese-English conversational LLM from ZhipuAI/Tsinghua. It upgrades the base model with GLM’s hybrid pretraining objective, 1.4 TB bilingual data, and preference alignment—delivering big gains on MMLU, CEval, GSM8K, and BBH. The context window extends up to 32K (FlashAttention), and Multi-Query Attention improves speed and memory use. The repo includes Python APIs, CLI & web demos, OpenAI-style/FASTAPI servers, and quantized checkpoints for lightweight local...

Downloads: 0 This Week

Last Update: 7 days ago
See Project
9

Fragments by E2B

Open source template for AI-powered code generation apps w/ sandboxes

...Fragments is built with modern web technologies such as Next.js, TailwindCSS, and shadcn/ui, providing a structured foundation for developers creating AI-driven development tools. It supports multiple programming frameworks including Python interpreters, Next.js, Vue.js, Streamlit, and Gradio, allowing generated projects to span from simple scripts to full web applications. It integrates with a wide range of large language model providers and supports streaming responses.

Downloads: 1 This Week

Last Update: 3 days ago
See Project
ThreatLocker Cybersecurity Software
Giving you complete control to help you manage your applications and better protect your endpoints.

The Zero Trust security solution that offers a unified approach to protecting users, devices, and networks against the exploitation of zero day vulnerabilities.

Learn More
10

Fast3R

Fast3R: Towards 3D Reconstruction of 1000+ Images in One Forward Pass

...It outputs high-quality 3D scene representations from unordered or sequential views, scaling to large datasets and varied camera intrinsics. The repository includes pretrained models, Gradio-based demos, and modular APIs for direct integration into research or production workflows.

Downloads: 0 This Week

Last Update: 7 days ago
See Project
11

h2oGPT

Private chat with local GPT with document, images, video, etc.

h2oGPT is an open-source platform that allows users to interact with local GPT models in a completely private environment. It supports a variety of document types, including PDFs, Word files, images, video frames, and even audio, enabling users to query and analyze their documents or engage in a private chat with AI. The platform is designed to be secure and offline, ensuring that all data remains private and under the user's control. h2oGPT supports several AI models, including oLLaMa and...

Downloads: 0 This Week

Last Update: 2025-02-22
See Project
12

VGGSfM

VGGSfM: Visual Geometry Grounded Deep Structure From Motion

VGGSfM is an advanced structure-from-motion (SfM) framework jointly developed by Meta AI Research (GenAI) and the University of Oxford’s Visual Geometry Group (VGG). It reconstructs 3D geometry, dense depth, and camera poses directly from unordered or sequential images and videos. The system combines learned feature matching and geometric optimization to generate high-quality camera calibrations, sparse/dense point clouds, and depth maps in standard COLMAP format. Version 2.0 adds support...

Downloads: 0 This Week

Last Update: 7 days ago
See Project
13

CogVLM2

GPT4V-level open-source multi-modal model based on Llama3-8B

CogVLM2 is the second generation of the CogVLM vision-language model series, developed by ZhipuAI and released in 2024. Built on Meta-Llama-3-8B-Instruct, CogVLM2 significantly improves over its predecessor by providing stronger performance across multimodal benchmarks such as TextVQA, DocVQA, and ChartQA, while introducing extended context length support of up to 8K tokens and high-resolution image input up to 1344×1344. The series includes models for both image understanding and video...

Downloads: 0 This Week

Last Update: 7 days ago
See Project
14

CogVLM

A state-of-the-art open visual language model

CogVLM is an open-source visual–language model suite—and its GUI-oriented sibling CogAgent—aimed at image understanding, grounding, and multi-turn dialogue, with optional agent actions on real UI screenshots. The flagship CogVLM-17B combines ~10B visual parameters with ~7B language parameters and supports 490×490 inputs; CogAgent-18B extends this to 1120×1120 and adds plan/next-action outputs plus grounded operation coordinates for GUI tasks. The repo provides multiple ways to run models...

Downloads: 0 This Week

Last Update: 7 days ago
See Project
15

LLaVA

Visual Instruction Tuning: Large Language-and-Vision Assistant

Visual instruction tuning towards large language and vision models with GPT-4 level capabilities.

Downloads: 5 This Week

Last Update: 2024-02-04
See Project
16

text-generation-webui-colab

A colab gradio web UI for running Large Language Models

text-generation-webui-colab is a repository that provides Google Colab notebooks designed to simplify the process of running large language models through the popular text-generation-webui interface. The project automates the setup and deployment of AI models in cloud-based notebook environments, allowing users to experiment with text generation systems without configuring complex local environments. By leveraging Google Colab, the repository enables users to run open-source models such as...

Downloads: 0 This Week

Last Update: 2026-03-06
See Project
17

llama2-webui

Run any Llama 2 locally with gradio UI on GPU or CPU from anywhere

Running Llama 2 with gradio web UI on GPU or CPU from anywhere (Linux/Windows/Mac).

Downloads: 0 This Week

Last Update: 2023-10-04
See Project
18

Audio Webui

A webui for different audio related Neural Networks

...For more advanced users, it exposes a rich set of command-line flags to control behavior such as skipping installation, disabling venv, changing model cache directories, sharing Gradio links, setting passwords, and specifying themes or ports.

Downloads: 0 This Week

Last Update: 2025-11-28
See Project
19

Stable Diffusion v 2.1 web UI

Lightweight Stable Diffusion v 2.1 web UI: txt2img, img2img, depth2img

Lightweight Stable Diffusion v 2.1 web UI: txt2img, img2img, depth2img, in paint and upscale4x. Gradio app for Stable Diffusion 2 by Stability AI. It uses Hugging Face Diffusers implementation. Currently supported pipelines are text-to-image, image-to-image, inpainting, upscaling and depth-to-image.

Downloads: 1 This Week

Last Update: 2023-03-22
See Project
20

hebrew-gpt_neo

Hebrew text generation models based on EleutherAI's gpt-neo

Hebrew text generation models based on EleutherAI's gpt-neo. Each was trained on a TPUv3-8 which was made available to me via the TPU Research Cloud Program. The Open Super-large Crawled ALMAnaCH coRpus is a huge multilingual corpus obtained by language classification and filtering of the Common Crawl corpus using the goclassy architecture.

Downloads: 0 This Week

Last Update: 2023-03-23
See Project
21

twlinux

...Quick link Icon in LinuxGold for fast help https://github.com/tjw0lf/twlinux LinuxGold Respin based on Linux Lite and XUbuntu 16.04 LIVEPASSWD:twlinux for live root access Software installed google-chrome steam spotify clementine systemback gradio xfdashboard for quick search functionally one click drop down terminal software updater install/remove software packages easy conky for appearance great look bleachbit spotify conky (spotter) cover one click easy use listen (credit deserved to erikdubois) https://github.com/erikdubois/Aureola And more awesome features check it out and try for your self. enjoy Hope you like it. review if you wish and theme to your hearts desire. ...

Downloads: 0 This Week

Last Update: 2017-05-12
See Project
22

GRadio

GRadio Greek Radio Stations

Downloads: 0 This Week

Last Update: 2013-04-03
See Project