Designed for training LLM/VLM agents via RL
Agent framework for tool-use tasks
A large language model for medical consultation in Chinese
LongBench v2 and LongBench (ACL '25 & '24)
On the Structural Pruning of Large Language Models
SQL-Driven RAG Engine
AI-powered code assistant for Vim. OpenAI and ChatGPT plugin for Vim
A.S.E (AICGSecEval) is a repository-level AI-generated code security
Hypernetworks that adapt LLMs for specific benchmark tasks
Towards Efficient Self-Evolving Agent System
Chat with any codebase in under two minutes | Fully local
E2M converts various file types (doc, docx, epub, html, htm, url
Your Personal Research Multi-Tool
Unified KV Cache Compression Methods for Auto-Regressive Models
Learning to Reason with Search for LLMs via Reinforcement Learning
Advanced LLM-powered brute-force tool combining AI intelligence
Traditional Mandarin LLMs for Taiwan
Benchmark LLMs by fighting in Street Fighter 3
Cache-Augmented Generation: A Simple, Efficient Alternative to RAG
Recipes to train reward models for RLHF
A tension reasoning engine over 131 S-class problems
Constrained Value Alignment via Safe Reinforcement Learning
Unleashing 10,000+ Word Generation from Long Context LLMs
Autoregressive Model Beats Diffusion
An agentless approach to automatically solve software development