Open-source, high-performance AI model with advanced reasoning
Universal LLM Deployment Engine with ML Compilation
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
A modular graph-based Retrieval-Augmented Generation (RAG) system
Uses Qwen3-ASR, local LLM, Whisper, TEN-VAD
All-in-one WebUI for AI generative image and video creation
Operating LLMs in production
A lightweight vLLM implementation built from scratch
Multilingual sentence & image embeddings with BERT
AI-Powered Data Processing: Use LOTUS to process all of your datasets
ChatGLM3 series: Open Bilingual Chat LLMs | Open Source Bilingual Chat
AirLLM 70B inference with single 4GB GPU
The official Meta Llama 3 GitHub site
Language-model investigation agent with a terminal UI
The official repo of Qwen chat & pretrained large language model
A guidance language for controlling large language models
Inference code for CodeLlama models
PandasAI is a Python library that integrates generative AI
State-of-the-art Parameter-Efficient Fine-Tuning
Modular AI runtime for robots
ChatGLM-6B: An Open Bilingual Dialogue Language Model
Uncertainty Quantification for Language Models, is a Python package
Official Repo for ICML 2024 paper
Make your agents learn from experience
Build a large language model from 0 only with Python foundation