LLM-based Reinforcement Learning audio edit model
Transforming Multimodal Content into Captivating Multilingual Audio
Multi-modal large language model designed for audio understanding
Generate blog articles from video or audio
AI tool converting video/audio into structured documents instantly
Convert files and web content into clean, usable Markdown easily
Your Personal Streaming Service
Download videos from websites like YouTube and many others
FineTune, a macOS menu bar app to control volume for each app
Cross-platform GUI tool for downloading videos from Bilibili sites
AudioMuse-AI is an open-source Dockerized environment
A tool to download whole playlists, channels or single videos
A Python tool that uses GPT-4, FFmpeg, and OpenCV
A NetEase Cloud Music-based UI
Uses Qwen3-ASR, local LLM, Whisper, TEN-VAD
The open LMS by Instructure, Inc.
Fast multimodal LLM for real-time voice interaction and AI apps
Download videos from almost any website
A Web UI for easy subtitling using the Whisper model
Code and models for the ICML 2024 paper NExT-GPT
Synchronized Translation for Videos
Taming Stable Diffusion for Lip Sync
Workflow and speech recognition app
One-click deployment (including offline integration package)
Instantly generate AI-powered subtitles on your device