sandbox:/mnt/data/project_plan.pod free download

verl

Volcano Engine Reinforcement Learning for LLMs

Data pipelines treat human feedback, simulated environments, and synthetic preferences as interchangeable sources, which helps with rapid experimentation. VERL is meant for both research and production hardening: logging, checkpointing, and evaluation suites are built in so you can track learning dynamics and regressions over time.

Downloads: 3 This Week

Last Update: 2026-03-16

See Project

ReinventCommunity

Jupyter Notebook tutorials for REINVENT 3.2

This repository is a collection of useful jupyter notebooks, code snippets and example JSON files illustrating the use of Reinvent 3.2.

Downloads: 0 This Week

Last Update: 2023-12-23

See Project

RecNN

Reinforced Recommendation toolkit built around pytorch 1.7

This is my school project. It focuses on Reinforcement Learning for personalized news recommendation. The main distinction is that it tries to solve online off-policy learning with dynamically generated item embeddings. I want to create a library with SOTA algorithms for reinforcement learning recommendation, providing the level of abstraction you like.

Downloads: 0 This Week

Last Update: 2024-06-04

See Project

CCZero (中国象棋Zero)

Implement AlphaZero/AlphaGo Zero methods on Chinese chess

ChineseChess-AlphaZero is a project that implements the AlphaZero algorithm for the game of Chinese Chess (Xiangqi). It adapts DeepMind’s AlphaZero method—combining neural networks and Monte Carlo Tree Search (MCTS)—to learn and play Chinese Chess without prior human data. The system includes self-play, training, and evaluation pipelines tailored to Xiangqi's unique game mechanics.

Downloads: 0 This Week

Last Update: 2025-03-13

See Project

PIQLE

PIQLE is a Platform Implementing Q-LEarning (and other Reinforcement Learning) algorithms in JAVA. Version 2 is a major refactoring. The core data structures and algorithms are in piqle-coreVersion2. Examples are in piqle-examplesVersion2. A complete doc

Downloads: 0 This Week

Last Update: 2013-04-22

See Project

Search Results for "sandbox:/mnt/data/project_plan.pod"

5 projects for "sandbox:/mnt/data/project_plan.pod" with 2 filters applied: