Join/Login
Business Software
Open Source Software
For Vendors
Blog
About
More

For Vendors Help Create Join Login

Business Software

Open Source Software

SourceForge Podcast

Resources

Articles
Case Studies
Blog

Menu

Help
Create
Join
Login

Home
Open Source Software
Artificial Intelligence
Reinforcement Learning Frameworks
Search Results

Search Results for "python::module" - Page 2

x

Sort By:

Relevance

Clear All Filters

OS

Windows 94
Linux 92
Mac 92
More...
BSD 34
ChromeOS 34
Mobile Operating Systems 3

Category

Artificial Intelligence 94
Software Development 12
Games 4
Scientific/Engineering 4
Education 3
Business 2
Database 1
Formats and Protocols 1
System 1

License

OSI-Approved Open Source 89
Creative Commons Attribution License 1

Programming Language

Python 87
C++ 7
C# 1
Java 1

Status

Alpha 2
Pre-Alpha 1
Beta 1

Showing 94 open source projects for "python::module"

View related business solutions

Reinforcement Learning Frameworks Windows Clear Filters & Widen Search

Outbound sales software
Unified cloud-based platform for dialing, emailing, appointment scheduling, lead management and much more.

Adversus is an outbound dialing solution that helps you streamline your call strategies, automate manual processes, and provide valuable insights to improve your outbound workflows and efficiency.

Learn More
The AI-powered unified PSA-RMM platform for modern MSPs.
Trusted PSA-RMM partner of MSPs worldwide

SuperOps.ai is the only PSA-RMM platform powered by intelligent automation and thoughtfully crafted for the new-age MSP. The platform also helps MSPs manage their projects, clients, and IT documents from a single place.

Learn More
1

VectorizedMultiAgentSimulator (VMAS)

VMAS is a vectorized differentiable simulator

VectorizedMultiAgentSimulator is a high-performance, vectorized simulator for multi-agent systems, focusing on large-scale agent interactions in shared environments. It is designed for research in multi-agent reinforcement learning, robotics, and autonomous systems where thousands of agents need to be simulated efficiently.

Downloads: 3 This Week

Last Update: 2025-11-10
See Project
2

OSWorld

Benchmarking Multimodal Agents for Open-Ended Tasks

OSWorld is an open-source synthetic world environment designed for embodied AI research and multi-agent learning. It provides a richly simulated 3D world where multiple agents can interact, perform tasks, and learn complex behaviors. OSWorld emphasizes multi-modal interaction, enabling agents to process visual, auditory, and symbolic data for grounded learning in a simulated world.

Downloads: 2 This Week

Last Update: 2025-03-13
See Project
3

PaLM + RLHF - Pytorch

Implementation of RLHF (Reinforcement Learning with Human Feedback)

PaLM-rlhf-pytorch is a PyTorch implementation of Pathways Language Model (PaLM) with Reinforcement Learning from Human Feedback (RLHF). It is designed for fine-tuning large-scale language models with human preference alignment, similar to OpenAI’s approach for training models like ChatGPT.

Downloads: 2 This Week

Last Update: 2025-09-19
See Project
4

Stable Baselines3

PyTorch version of Stable Baselines

Stable Baselines3 (SB3) is a set of reliable implementations of reinforcement learning algorithms in PyTorch. It is the next major version of Stable Baselines. You can read a detailed presentation of Stable Baselines3 in the v1.0 blog post or our JMLR paper. These algorithms will make it easier for the research community and industry to replicate, refine, and identify new ideas, and will create good baselines to build projects on top of. We expect these tools will be used as a base around...

Downloads: 4 This Week

Last Update: 2026-04-01
See Project
Network Management Software and Tools for Businesses and Organizations | Auvik Networks
Mapping, inventory, config backup, and more.

Reduce IT headaches and save time with a proven solution for automated network discovery, documentation, and performance monitoring. Choose Auvik because you'll see value in minutes, and stay with us to improve your IT for years to come.

Learn More
5

RL Baselines3 Zoo

Training framework for Stable Baselines3 reinforcement learning agents

rl-baselines3-zoo is a collection of pre-trained models, benchmarks, and hyperparameter tuning tools built on top of Stable Baselines3, a reinforcement learning library. It provides an easy way to test, evaluate, and train RL agents across a wide variety of environments.

Downloads: 1 This Week

Last Update: 2026-04-01
See Project
6

RWARE

MuA multi-agent reinforcement learning environment

robotic-warehouse is a simulation environment and framework for robotic warehouse automation, enabling research and development of AI and robotic agents to manage warehouse logistics, such as item picking and transport.

Downloads: 1 This Week

Last Update: 2025-03-13
See Project
7

Multi-Agent Orchestrator

Flexible and powerful framework for managing multiple AI agents

Multi-Agent Orchestrator is an AI coordination framework that enables multiple intelligent agents to work together to complete complex, multi-step workflows.

Downloads: 2 This Week

Last Update: 2025-06-24
See Project
8

SLM Lab

Modular Deep Reinforcement Learning framework in PyTorch

SLM Lab is a modular and extensible deep reinforcement learning framework designed for research and practical applications. It provides implementations of various state-of-the-art RL algorithms and emphasizes reproducibility, scalability, and detailed experiment tracking. SLM Lab is structured around a flexible experiment management system, allowing users to define, run, and analyze RL experiments efficiently.

Downloads: 1 This Week

Last Update: 2026-03-04
See Project
9

Gymnasium

An API standard for single-agent reinforcement learning environments

Gymnasium is a fork of OpenAI Gym, maintained by the Farama Foundation, that provides a standardized API for reinforcement learning environments. It improves upon Gym with better support, maintenance, and additional features while maintaining backward compatibility.

Downloads: 1 This Week

Last Update: 2025-12-18
See Project
The full-stack observability platform that protects your dataLayer, tags and conversion data
Stop losing revenue to bad data today. and protect your marketing data with Code-Cube.io.

Code-Cube.io detects issues instantly, alerts you in real time and helps you resolve them fast. No manual QA. No unreliable data. Just data you can trust and act on.

Learn More
10

H2O LLM Studio

Framework and no-code GUI for fine-tuning LLMs

Welcome to H2O LLM Studio, a framework and no-code GUI designed for fine-tuning state-of-the-art large language models (LLMs). You can also use H2O LLM Studio with the command line interface (CLI) and specify the configuration file that contains all the experiment parameters. To finetune using H2O LLM Studio with CLI, activate the pipenv environment by running make shell. With H2O LLM Studio, training your large language model is easy and intuitive. First, upload your dataset and then start...

Downloads: 5 This Week

Last Update: 2026-04-07
See Project
11

MedicalGPT

MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training

MedicalGPT training medical GPT model with ChatGPT training pipeline, implementation of Pretraining, Supervised Finetuning, Reward Modeling and Reinforcement Learning. MedicalGPT trains large medical models, including secondary pre-training, supervised fine-tuning, reward modeling, and reinforcement learning training.

Downloads: 7 This Week

Last Update: 7 days ago
See Project
12

Atropos

Language Model Reinforcement Learning Environments frameworks

Atropos is a comprehensive open-source framework for reinforcement learning (RL) environments tailored specifically to work with large language models (LLMs). Designed as a scalable ecosystem of environment microservices, Atropos allows researchers and developers to collect, evaluate, and manage trajectories (sequences of actions and outcomes) generated by LLMs across a variety of tasks—from static dataset benchmarks to dynamic interactive games and real-world scenario environments. It...

Downloads: 2 This Week

Last Update: 2026-03-10
See Project
13

verl

Volcano Engine Reinforcement Learning for LLMs

VERL is a reinforcement-learning–oriented toolkit designed to train and align modern AI systems, from language models to decision-making agents. It brings together supervised fine-tuning, preference modeling, and online RL into one coherent training stack so teams can move from raw data to aligned policies with minimal glue code. The library focuses on scalability and efficiency, offering distributed training loops, mixed precision, and replay/buffering utilities that keep accelerators busy....

Downloads: 3 This Week

Last Update: 2026-03-16
See Project
14

PettingZoo

An API standard for multi-agent reinforcement learning environments

PettingZoo is a standardized API and library for multi-agent reinforcement learning (MARL) environments. It provides a broad set of environments and tools to facilitate the development and evaluation of multi-agent algorithms.

Downloads: 0 This Week

Last Update: 2025-04-18
See Project
15

Mctx

Monte Carlo tree search in JAX

mctx is a Monte Carlo Tree Search (MCTS) library developed by Google DeepMind for reinforcement learning research. It enables efficient and flexible implementation of MCTS algorithms, including those used in AlphaZero and MuZero.

Downloads: 0 This Week

Last Update: 2025-09-02
See Project
16

Tensorforce

A TensorFlow library for applied reinforcement learning

Tensorforce is an open-source deep reinforcement learning framework built on TensorFlow, emphasizing modularized design and straightforward usability for applied research and practice.

Downloads: 0 This Week

Last Update: 12 hours ago
See Project
17

Cosmos-RL

Cosmos-RL is a flexible and scalable Reinforcement Learning framework

Cosmos-RL is a scalable reinforcement learning framework designed specifically for physical AI systems such as robotics, autonomous agents, and multimodal models. It provides a distributed training architecture that separates policy learning and environment rollout processes, enabling efficient and asynchronous reinforcement learning at scale. The framework supports multiple parallelism strategies, including tensor, pipeline, and data parallelism, allowing it to leverage large GPU clusters...

Downloads: 1 This Week

Last Update: 6 days ago
See Project
18

EvoTorch

Advanced evolutionary computation library built on top of PyTorch

EvoTorch is an evolutionary optimization framework built on top of PyTorch, developed by NNAISENSE. It is designed for large-scale optimization problems, particularly those that require evolutionary algorithms rather than gradient-based methods.

Downloads: 0 This Week

Last Update: 2025-05-14
See Project
19

Habitat-Lab

A modular high-level library to train embodied AI agents

Habitat-Lab is a modular high-level library for end-to-end development in embodied AI. It is designed to train agents to perform a wide variety of embodied AI tasks in indoor environments, as well as develop agents that can interact with humans in performing these tasks. Allowing users to train agents in a wide variety of single and multi-agent tasks (e.g. navigation, rearrangement, instruction following, question answering, human following), as well as define novel tasks. Configuring and...

Downloads: 1 This Week

Last Update: 2025-01-27
See Project
20

OpenSpiel

Environments and algorithms for research in general reinforcement

...OpenSpiel also includes tools to analyze learning dynamics and other common evaluation metrics. Games are represented as procedural extensive-form games, with some natural extensions. The core API and games are implemented in C++ and exposed to Python. Algorithms and tools are written both in C++ and Python. To try OpenSpiel in Google Colaboratory, please refer to open_spiel/colabs subdirectory.

Downloads: 0 This Week

Last Update: 2026-03-16
See Project
21

AI4U

Multi-engine plugin to specify agents with reinforcement learning

...Train using multiple concurrent Unity/Godot environment instances. Unity/Godot environment partial control from Python. Wrap Unity/Godot learning environments as a gym.

Downloads: 0 This Week

Last Update: 2025-10-21
See Project
22

TaskWeaver

A code-first agent framework for seamlessly planning analytics tasks

TaskWeaver is a multi-agent AI framework designed for orchestrating autonomous agents that collaborate to complete complex tasks.

Downloads: 0 This Week

Last Update: 2025-01-29
See Project
23

ViZDoom

Doom-based AI research platform for reinforcement learning

ViZDoom allows developing AI bots that play Doom using only the visual information (the screen buffer). It is primarily intended for research in machine visual learning, and deep reinforcement learning, in particular. ViZDoom is based on ZDOOM, the most popular modern source-port of DOOM. This means compatibility with a huge range of tools and resources that can be used to create custom scenarios, availability of detailed documentation of the engine and tools and support of Doom community....

Downloads: 1 This Week

Last Update: 2026-02-11
See Project
24

TensorHouse

A collection of reference Jupyter notebooks and demo AI/ML application

TensorHouse is a scalable reinforcement learning (RL) platform that focuses on high-throughput experience generation and distributed training. It is designed to efficiently train agents across multiple environments and compute resources. TensorHouse enables flexible experiment management, making it suitable for large-scale RL experiments in both research and applied settings.

Downloads: 6 This Week

Last Update: 2025-03-13
See Project
25

Astrape

Optical-packet node transceiver frequency allocation

In an optical network scenario which consists of multiple nodes (whiteboxes) at its edges and ROADMs in-between, the coherent transceiver average laser configuration time is improved. The process is evaluated according to a testbed setup. This is facilitated in the appropriate lab equipment (or via simulation when required). For that purpose, a software agent (Netconf server) residing at the whiteboxes, is developed receiving input from the Software-Defined Networking (SDN) packet...

Downloads: 1 This Week

Last Update: 2025-03-14
See Project

Previous
1
You're on page 2
3
4
Next

Related Searches

doom

llm

lab

coding

Related Categories

Artificial Intelligence

Software Development

Games

Scientific/Engineering

Education

SourceForge

Create a Project
Open Source Software
Business Software
Top Downloaded Projects

Company

About
Team
SourceForge Headquarters
1320 Columbia Street Suite 310
San Diego, CA 92101
+1 (858) 422-6466

Resources

Support
Site Documentation
Site Status
SourceForge Reviews

© 2026 Slashdot Media. All Rights Reserved.

Terms Privacy Privacy Choices Advertise