20+ high-performance LLMs with recipes to pretrain, finetune at scale
MNN is a blazing fast, lightweight deep learning framework
A Unified Library for Parameter-Efficient Learning
User-friendly AI Interface
Run Local LLMs on Any Device. Open-source
LLM.swift is a simple and readable library
Library for serving Transformers models on Amazon SageMaker
A RWKV management and startup tool, full automation, only 8MB
A set of Docker images for training and serving models in TensorFlow
A high-performance ML model serving framework, offers dynamic batching
AI interface for tinkerers (Ollama, Haystack RAG, Python)
Openai style api for open large language models
Lightweight Python library for adding real-time multi-object tracking
Tensor search for humans
An easy-to-use LLMs quantization package with user-friendly apis
A graphical manager for ollama that can manage your LLMs
High quality, fast, modular reference implementation of SSD in PyTorch
A real time inference engine for temporal logical specifications
Framework that is dedicated to making neural data processing
A computer vision framework to create and deploy apps in minutes
Database system for building simpler and faster AI-powered application
Prem provides a unified environment to develop AI applications
CPU/GPU inference server for Hugging Face transformer models
Fast and user-friendly runtime for transformer inference