Awesome Open Source

Programming Languages

Search results for ppo

344 search results found

Baselines ⭐ 14,949

OpenAI Baselines: high-quality implementations of reinforcement learning algorithms

Reinforcement Learning With Tensorflow ⭐ 8,174

Simple Reinforcement learning tutorials, 莫烦Python 中文AI教学

Easy Rl ⭐ 7,643

强化学习中文教程（蘑菇书），在线阅读地址：https://datawhalechina.github

Tianshou ⭐ 7,125

An elegant PyTorch deep reinforcement learning library.

Deep Reinforcement Learning ⭐ 4,635

Repo for the Deep Reinforcement Learning Nanodegree program

Reinforcement Learning ⭐ 4,097

Learn Deep Reinforcement Learning in 60 days! Lectures & Code in Python. Reinforcement Learning + Deep Learning

Cleanrl ⭐ 3,947

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Deep_reinforcement_learning_course ⭐ 3,581

Implementations from the free course Deep Reinforcement Learning with Tensorflow and PyTorch

Pytorch A2c Ppo Acktr Gail ⭐ 3,450

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).

Elegantrl ⭐ 3,229

Massively Parallel Deep Reinforcement Learning. 🔥

Football ⭐ 3,177

Check out the new game server:

Deeprl ⭐ 2,834

Modularized Implementation of Deep RL Algorithms in PyTorch

Deep Reinforcement Learning With Pytorch ⭐ 2,741

PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....

Rl Stock ⭐ 2,419

📈 如何用深度强化学习自动炒股

Minimalrl ⭐ 2,417

Implementations of basic RL algorithms with minimal lines of codes! (pytorch based)

Finrl Trading ⭐ 1,858

For trading. Please star.

Rl Baselines3 Zoo ⭐ 1,640

A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.

Ppo Pytorch ⭐ 1,270

Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch

Slm Lab ⭐ 1,052

Modular Deep Reinforcement Learning framework in PyTorch. Companion library of the book "Foundations of Deep Reinforcement Learning".

Rl Baselines Zoo ⭐ 1,025

A collection of 100+ pre-trained RL agents using Stable Baselines, training and hyperparameter optimization included.

On Policy ⭐ 990

This is the official implementation of Multi-Agent PPO (MAPPO).

Batch Ppo ⭐ 919

Efficient Batched Reinforcement Learning in TensorFlow

Pytorch A3c ⭐ 768

PyTorch implementation of Asynchronous Advantage Actor Critic (A3C) from "Asynchronous Methods for Deep Reinforcement Learning".

Deeprl Tutorials ⭐ 726

Contains high quality implementations of Deep Reinforcement Learning algorithms written in PyTorch

Super Mario Bros Ppo Pytorch ⭐ 692

Proximal Policy Optimization (PPO) algorithm for Super Mario Bros

Pytorch Rl ⭐ 638

PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.

Autonomous Learning Library ⭐ 616

A PyTorch library for building deep reinforcement learning agents.

Rl_games ⭐ 603

RL implementations

Hands On Reinforcement Learning With Python ⭐ 596

Master Reinforcement and Deep Reinforcement Learning using OpenAI Gym and TensorFlow

Rl Starter Files ⭐ 571

RL starter files in order to immediatly train, visualize and evaluate an agent without writing any line of code

Modular_rl ⭐ 523

Implementation of TRPO and related algorithms

Purejaxrl ⭐ 460

Really Fast End-to-End Jax RL Implementations

Ppo For Beginners ⭐ 427

A simple and well styled PPO implementation. Based on my Medium series: https://medium.com/@eyyu/coding-ppo-from-scratch-w

Huskarl ⭐ 417

Deep Reinforcement Learning Framework + Algorithms

Reinforcement Learning Algorithms ⭐ 407

This repository contains most of pytorch implementation based classic deep reinforcement learning algorithms, including - DQN, DDQN, Dueling Network, DDPG, SAC, A2C, PPO, TRPO. (More algorithms are still in progress)

Long-Term Evolution Project of Reinforcement Learning

Pytorch Drl ⭐ 387

PyTorch implementations of various Deep Reinforcement Learning (DRL) algorithms for both single agent and multi-agent.

Reinforcement Implementation ⭐ 380

Implementation of benchmark RL algorithms

Deep_rl ⭐ 372

PyTorch implementations of deep reinforcement learning algorithms

lagom: A PyTorch infrastructure for rapid prototyping of reinforcement learning algorithms.

Imitation Learning ⭐ 344

Imitation learning algorithms

XuanCe: A Comprehensive and Unified Deep Reinforcement Learning Library

A library with extensible implementations of DPO, KTO, PPO, and other human-centered loss functions (HALOs).

Machine Learning Is All You Need ⭐ 337

🔥🌟《Machine Learning 格物志》: ML + DL + RL basic codes and notes by sklearn, PyTorch, TensorFlow, Keras & the most important, from scratch!💪 This repository is ALL You Need!

Pytorch Cpp Rl ⭐ 308

PyTorch C++ Reinforcement Learning

Allenact ⭐ 288

An open source framework for research in Embodied-AI from AI2.

Xingtian ⭐ 282

xingtian is a componentized library for the development and verification of reinforcement learning algorithms

RAD: Reinforcement Learning with Augmented Data

Lets Do Irl ⭐ 269

Inverse RL algorithms (APP, MaxEnt, GAIL, VAIL)

Reinforcement_learning ⭐ 250

Reinforcement learning tutorials

Pg_travel ⭐ 243

Policy Gradient algorithms (REINFORCE, NPG, TRPO, PPO)

Rlgraph ⭐ 241

RLgraph: Modular computation graphs for deep reinforcement learning

Deep Reinforcement Learning Algorithms ⭐ 235

32 projects in the framework of Deep Reinforcement Learning algorithms: Q-learning, DQN, PPO, DDPG, TD3, SAC, A2C and others. Each project is provided with a detailed training log.

Llm Rlhf Tuning ⭐ 225

LLM Tuning with PEFT (SFT+RM+PPO+DPO with LoRA)

Stable Baselines ⭐ 221

Mirror of Stable-Baselines: a fork of OpenAI Baselines, implementations of reinforcement learning algorithms

Reinforcement learning library(framework) designed for PyTorch, implements DQN, DDPG, A2C, PPO, SAC, MADDPG, A3C, APEX, IMPALA ...

Landing A Spacex Falcon Heavy Rocket ⭐ 195

This is the code for "Landing a SpaceX Falcon Heavy Rocket" By Siraj Raval on Youtube

Tf_deep_rl_trader ⭐ 186

Trading Environment(OpenAI Gym) + PPO(TensorForce)

Reinforcement learning algorithms implemented for Tensorflow 2.0+ [DQN, DDPG, AE-DDPG, SAC, PPO, Primal-Dual DDPG]

Phasic Policy Gradient ⭐ 175

Code for the paper "Phasic Policy Gradient"

Deep_rl_with_pytorch ⭐ 174

A pytorch tutorial for DRL(Deep Reinforcement Learning)

Episodic Curiosity ⭐ 142

Tensorflow/Keras code and trained models for Episodic Curiosity Through Reachability

Chatglm Maths ⭐ 142

chatglm-6b微调/LORA/PPO/推理, 样本为自动生成的整数/小数加减乘除运算, 可gpu/cpu

Deeprl Tensorflow2 ⭐ 139

🐋 Simple implementations of various popular Deep Reinforcement Learning algorithms using TensorFlow2

Torch Ac ⭐ 134

Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO

Rl Experiments ⭐ 134

Keeping track of RL experiments

Train Procgen ⭐ 134

Code for the paper "Leveraging Procedural Generation to Benchmark Reinforcement Learning"

Pytorch Dppo ⭐ 129

Pytorch implementation of Distributed Proximal Policy Optimization: https://arxiv.org/abs/1707.02286

Torchrl ⭐ 127

Pytorch Implementation of Reinforcement Learning Algorithms ( Soft Actor Critic(SAC)/ DDPG / TD3 /DQN / A2C/ PPO / TRPO)

Doom Net Pytorch ⭐ 121

Reinforcement learning models in ViZDoom environment

🚀 A fast safe reinforcement learning library in PyTorch

Deeprl_algorithms ⭐ 112

DeepRL algorithms implementation easy for understanding and reading with Pytorch and Tensorflow 2(DQN, REINFORCE, VPG, A2C, TRPO, PPO, DDPG, TD3, SAC)

Rl Collision Avoidance ⭐ 112

Implementation of the paper "Towards Optimally Decentralized Multi-Robot Collision Avoidance via Deep Reinforcement Learning"

Awesome Rl ⭐ 110

Awesome RL: Papers, Books, Codes, Benchmarks

Samsung Drl Code ⭐ 105

A repository for implementations of deep reinforcement learning lectured at Samsung

Openai_five_vs_dota2_explained ⭐ 101

This is the code for "OpenAI Five vs DOTA 2 Explained" By Siraj Raval on Youtube

Episodic Transformer Memory Ppo ⭐ 99

Clean baseline implementation of PPO using an episodic TransformerXL memory

Ros2learn ⭐ 98

ROS 2 enabled Machine Learning algorithms

Deep Reinforcement Learning With Python ⭐ 94

Master classic RL, deep RL, distributional RL, inverse RL, and more using OpenAI Gym and TensorFlow with extensive Math

Rl Examples ⭐ 88

Examples of published reinforcement learning algorithms in recent literature implemented in TensorFlow

Gym Continuousdoubleauction ⭐ 87

A custom MARL (multi-agent reinforcement learning) environment where multiple agents trade against one another (self-play) in a zero-sum continuous double auction. Ray [RLlib] is used for training.

Safety Starter Agents ⭐ 86

Basic constrained RL agents used in experiments for the "Benchmarking Safe Exploration in Deep Reinforcement Learning" paper.

Carla Ppo ⭐ 86

This repository hosts a customized PPO based agent for Carla. The goal of this project is to make it easier to interact with and experiment in Carla with reinforcement learning based agents -- this, by wrapping Carla in a gym like environment that can handle custom reward functions, custom debug output, etc.

Simple A2c Ppo ⭐ 84

Actor-critic trained w PPO on OpenAI's Procgen Benchmark (PyTorch). Built from scratch.

Sc2aibot ⭐ 84

Implementing reinforcement-learning algorithms for pysc2 -environment

Human_aware_rl ⭐ 84

Code for "On the Utility of Learning about Humans for Human-AI Coordination"

Recurrent Ppo Truncated Bptt ⭐ 82

Baseline implementation of recurrent PPO using truncated BPTT

Run Skeleton Run ⭐ 81

Reason8.ai PyTorch solution for NIPS RL 2017 challenge

Db Football ⭐ 80

A Simple, Distributed and Asynchronous Multi-Agent Reinforcement Learning Framework for Google Research Football AI.

Mujoco Benchmark ⭐ 78

Provide full reinforcement learning benchmark on mujoco environments, including ddpg, sac, td3, pg, a2c, ppo, library

Gail_gym ⭐ 75

Implementation of Generatve Adversarial Imitation Learning (GAIL) for classic environments from OpenAI Gym.

Drl_local_planner_ros_stable_baselines ⭐ 73

Proximal Policy Optimization implementation with TensorFlow

Gail_ppo_tf ⭐ 72

Tensorflow implementation of Generative Adversarial Imitation Learning(GAIL) with discrete action

The Differentiable Cross-Entropy Method

Rl Workshop ⭐ 69

Reinforcement Learning Workshop for Data Science BKK

Tensorswarm ⭐ 69

TensorSwarm: A framework for reinforcement learning of robot swarms.

Q1physrl ⭐ 68

Quake 1 movement physics reinforcement learning

Chatglm Rlhf ⭐ 68

对ChatGLM直接使用RLHF提升或降低目标输出概率|Modify ChatGLM output with only RLHF

Related Searches

Python Ppo (446)

Reinforcement Learning Ppo (221)

1-100 of 344 search results

Privacy | About | Terms | Follow Us On Twitter

Copyright 2018-2024 Awesome Open Source. All rights reserved.