MARS5 speech model (TTS) from CAMB.AI
Diversity-driven optimization and large-model reasoning ability
This repository provides an advanced RAG
LLM powered fuzzing via OSS-Fuzz
Agent framework and applications built upon Qwen>=3.0
Repo of Qwen2-Audio chat & pretrained large audio language model
Tongyi Deep Research, the Leading Open-source Deep Research Agent
No-code multi-agent framework to build LLM Agents, workflows
Multi-Modal Neural Networks for Semantic Search, based on Mid-Fusion
Tensor search for humans
The data structure for multimodal data
Hub of ready-to-use datasets for ML models
Build cross-modal and multimodal applications on the cloud
Build AI-powered semantic search applications
A library for deep learning end-to-end dialog systems and chatbots
A trainable PyTorch reproduction of AlphaFold 3
Official Repo For "Sa2VA: Marrying SAM2 with LLaVA
Multi-modal large language model designed for audio understanding
Large Multimodal Models for Video Understanding and Editing
Scalable machine learning for time series forecasting
On-device Speech-to-Intent engine powered by deep learning
Making Enterprise Data Intelligent and Responsive for AI
Implementation of 'lightweight' GAN, proposed in ICLR 2021
Powering Amazon custom machine learning chips
Pushing the Limits of Mathematical Reasoning in Open Language Models