How to optimize some algorithm in cuda
Cybersecurity AI (CAI), the framework for AI Security
A list of free LLM inference resources accessible via API
Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible
Framework and no-code GUI for fine-tuning LLMs
Inference Llama 2 in one file of pure C
A Gym environment for web task automation
Easy token price estimates for 400+ LLMs. TokenOps
Unified framework for building enterprise RAG pipelines
A high-performance ML model serving framework, offers dynamic batching
BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)
An LLM-powered knowledge curation system that researches topics
Large-scale Self-supervised Pre-training Across Tasks, Languages, etc.
AI Agent Evaluator & Red Team Platform
MemoryOS is designed to provide a memory operating system
Real-time multi-AI collaboration: Claude, Codex & Gemini
Document (PDF, Word, PPTX ...) extraction and parse API
A dataset consists of 15,140 ChatGPT prompts from Reddit
CogView4, CogView3-Plus and CogView3(ECCV 2024)
Qwen2.5-VL is the multimodal large language model series
Streamlines and simplifies prompt design for both developers
LangChain powered shell command generator and runner CLI
local-first semantic code search engine
Take control of your AI agents
Data Infrastructure providing an approach to multimodal AI workloads