RL research on Android devices
Powerful AI language model (MoE) optimized for efficiency/performance
Open-source, high-performance AI model with advanced reasoning
Volcano Engine Reinforcement Learning for LLMs
Benchmarking Multimodal Agents for Open-Ended Tasks
Cosmos-RL is a flexible and scalable Reinforcement Learning framework
The most simple, flexible, and comprehensive OpenAI Gym trading
Jupyter Notebook tutorials for REINVENT 3.2
Reinforced Recommendation toolkit built around pytorch 1.7
Implement AlphaZero/AlphaGo Zero methods on Chinese chess