Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for python ppo
ppo
x
python
x
215 search results found
Baselines
⭐
14,949
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
Reinforcement Learning With Tensorflow
⭐
8,174
Simple Reinforcement learning tutorials, 莫烦Python 中文AI教学
Tianshou
⭐
7,125
An elegant PyTorch deep reinforcement learning library.
Cleanrl
⭐
3,947
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
Football
⭐
3,177
Check out the new game server:
Deeprl
⭐
2,834
Modularized Implementation of Deep RL Algorithms in PyTorch
Deep Reinforcement Learning With Pytorch
⭐
2,741
PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....
Minimalrl
⭐
2,417
Implementations of basic RL algorithms with minimal lines of codes! (pytorch based)
Rl Baselines3 Zoo
⭐
1,640
A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.
Ppo Pytorch
⭐
1,270
Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch
Slm Lab
⭐
1,052
Modular Deep Reinforcement Learning framework in PyTorch. Companion library of the book "Foundations of Deep Reinforcement Learning".
On Policy
⭐
990
This is the official implementation of Multi-Agent PPO (MAPPO).
Batch Ppo
⭐
919
Efficient Batched Reinforcement Learning in TensorFlow
Pytorch A3c
⭐
768
PyTorch implementation of Asynchronous Advantage Actor Critic (A3C) from "Asynchronous Methods for Deep Reinforcement Learning".
Deeprl Tutorials
⭐
726
Contains high quality implementations of Deep Reinforcement Learning algorithms written in PyTorch
Super Mario Bros Ppo Pytorch
⭐
692
Proximal Policy Optimization (PPO) algorithm for Super Mario Bros
Pytorch Rl
⭐
638
PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.
Autonomous Learning Library
⭐
616
A PyTorch library for building deep reinforcement learning agents.
Rlcode
⭐
560
Modular_rl
⭐
523
Implementation of TRPO and related algorithms
Purejaxrl
⭐
460
Really Fast End-to-End Jax RL Implementations
Ppo For Beginners
⭐
427
A simple and well styled PPO implementation. Based on my Medium series: https://medium.com/@eyyu/coding-ppo-from-scratch-w
Huskarl
⭐
417
Deep Reinforcement Learning Framework + Algorithms
Reinforcement Learning Algorithms
⭐
407
This repository contains most of pytorch implementation based classic deep reinforcement learning algorithms, including - DQN, DDQN, Dueling Network, DDPG, SAC, A2C, PPO, TRPO. (More algorithms are still in progress)
Rllte
⭐
403
Long-Term Evolution Project of Reinforcement Learning
Reinforcement Implementation
⭐
380
Implementation of benchmark RL algorithms
Deep_rl
⭐
372
PyTorch implementations of deep reinforcement learning algorithms
Lagom
⭐
365
lagom: A PyTorch infrastructure for rapid prototyping of reinforcement learning algorithms.
Imitation Learning
⭐
344
Imitation learning algorithms
Xuance
⭐
339
XuanCe: A Comprehensive and Unified Deep Reinforcement Learning Library
Halos
⭐
339
A library with extensible implementations of DPO, KTO, PPO, and other human-centered loss functions (HALOs).
Machine Learning Is All You Need
⭐
337
🔥🌟《Machine Learning 格物志》: ML + DL + RL basic codes and notes by sklearn, PyTorch, TensorFlow, Keras & the most important, from scratch!💪 This repository is ALL You Need!
Allenact
⭐
288
An open source framework for research in Embodied-AI from AI2.
Xingtian
⭐
282
xingtian is a componentized library for the development and verification of reinforcement learning algorithms
Lets Do Irl
⭐
269
Inverse RL algorithms (APP, MaxEnt, GAIL, VAIL)
Reinforcement_learning
⭐
250
Reinforcement learning tutorials
Pg_travel
⭐
243
Policy Gradient algorithms (REINFORCE, NPG, TRPO, PPO)
Rlgraph
⭐
241
RLgraph: Modular computation graphs for deep reinforcement learning
Stable Baselines
⭐
221
Mirror of Stable-Baselines: a fork of OpenAI Baselines, implementations of reinforcement learning algorithms
Machin
⭐
206
Reinforcement learning library(framework) designed for PyTorch, implements DQN, DDPG, A2C, PPO, SAC, MADDPG, A3C, APEX, IMPALA ...
Tf_deep_rl_trader
⭐
186
Trading Environment(OpenAI Gym) + PPO(TensorForce)
Tf2 Rl
⭐
178
Reinforcement learning algorithms implemented for Tensorflow 2.0+ [DQN, DDPG, AE-DDPG, SAC, PPO, Primal-Dual DDPG]
Phasic Policy Gradient
⭐
175
Code for the paper "Phasic Policy Gradient"
Deeprl Tensorflow2
⭐
139
🐋 Simple implementations of various popular Deep Reinforcement Learning algorithms using TensorFlow2
Train Procgen
⭐
134
Code for the paper "Leveraging Procedural Generation to Benchmark Reinforcement Learning"
Torch Ac
⭐
134
Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO
Pytorch Dppo
⭐
129
Pytorch implementation of Distributed Proximal Policy Optimization: https://arxiv.org/abs/1707.02286
Torchrl
⭐
127
Pytorch Implementation of Reinforcement Learning Algorithms ( Soft Actor Critic(SAC)/ DDPG / TD3 /DQN / A2C/ PPO / TRPO)
Doom Net Pytorch
⭐
121
Reinforcement learning models in ViZDoom environment
Deeprl_algorithms
⭐
112
DeepRL algorithms implementation easy for understanding and reading with Pytorch and Tensorflow 2(DQN, REINFORCE, VPG, A2C, TRPO, PPO, DDPG, TD3, SAC)
Samsung Drl Code
⭐
105
A repository for implementations of deep reinforcement learning lectured at Samsung
Episodic Transformer Memory Ppo
⭐
99
Clean baseline implementation of PPO using an episodic TransformerXL memory
Ros2learn
⭐
98
ROS 2 enabled Machine Learning algorithms
Rl Examples
⭐
88
Examples of published reinforcement learning algorithms in recent literature implemented in TensorFlow
Safety Starter Agents
⭐
86
Basic constrained RL agents used in experiments for the "Benchmarking Safe Exploration in Deep Reinforcement Learning" paper.
Sc2aibot
⭐
84
Implementing reinforcement-learning algorithms for pysc2 -environment
Human_aware_rl
⭐
84
Code for "On the Utility of Learning about Humans for Human-AI Coordination"
Run Skeleton Run
⭐
81
Reason8.ai PyTorch solution for NIPS RL 2017 challenge
Drl_local_planner_ros_stable_baselines
⭐
73
Gail_ppo_tf
⭐
72
Tensorflow implementation of Generative Adversarial Imitation Learning(GAIL) with discrete action
Ppo
⭐
72
Proximal Policy Optimization implementation with TensorFlow
Explorer
⭐
68
Explorer is a PyTorch reinforcement learning framework for exploring new ideas.
Deep_rl_zoo
⭐
65
A collection of Deep Reinforcement Learning algorithms implemented with PyTorch to solve Atari games and classic control tasks like CartPole, LunarLander, and MountainCar.
Code For Paper
⭐
59
Mario_rl
⭐
57
Imitation_learning
⭐
56
PyTorch implementation of some reinforcement learning algorithms: A2C, PPO, Behavioral Cloning from Observation (BCO), GAIL.
Tensorflow_rl
⭐
54
Learning2run
⭐
53
Our NIPS 2017: Learning to Run source code
Model Free Algorithms
⭐
52
TD3, SAC, IQN, Rainbow, PPO, Ape-X and etc. in TF1.x
Learninghumanoidwalking
⭐
50
Training a humanoid robot for locomotion using Reinforcement Learning
Reinforcementlearning
⭐
48
Reinforcing Your Learning of Reinforcement Learning
Wu Uct
⭐
48
A novel parallel UCT algorithm with linear speedup and negligible performance loss.
Rl Experiments
⭐
47
High-quality implementations of deep reinforcement learning algorithms for experiments
Stanford Osrl
⭐
46
NIPS2017 challenge
Pytorch Learn Reinforcement Learning
⭐
42
A collection of various RL algorithms like policy gradients, DQN and PPO. The goal of this repo will be to make it a go-to resource for learning about RL. How to visualize, debug and solve RL problems. I've additionally included playground.py for learning more about OpenAI gym, etc.
Minimal Isaac Gym
⭐
42
A Minimal Example of Isaac Gym with DQN and PPO.
Relational_deep_reinforcement_learning
⭐
41
Ppo
⭐
40
PyTorch implementation of Proximal Policy Optimization
Ppo Pytorch
⭐
35
Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)
Realworldrl_suite
⭐
35
Real-World RL Benchmark Suite
Redco
⭐
35
MLSys Workshop NeurIPS 2023 - Redco: A Lightweight Tool to Automate Distributed Training and Inference
Pop3d
⭐
35
Policy Optimization with Penalized Point Probability Distance: an Alternative to Proximal Policy Optimization
Pytorch Ppo
⭐
33
Proximal Policy Optimization in PyTorch
Ppo Clip And Ppo Penalty On Atari Domain
⭐
33
Implement PPO-clip and PPO-penalty on Atari, which is the only open source of PPO-penalty
Ppo_jax
⭐
33
Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights on all environments.
Llm4rl
⭐
32
A RL approach to enable cost-effective, intelligent interactions between a local agent and a remote LLM
Ppo Pytorch
⭐
32
Proximal policy optimization in PyTorch. Easy to read and understand.
Level Replay
⭐
32
This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the fact that not all levels are equally useful for agents to learn from during training.
Deeprl Baselines
⭐
31
Deep reinforcement learning baselines base on OpenAI. More algorithms are included, such as Rainbow: Combining Improvements in Deep Reinforcement Learning
Sebulba
⭐
31
🪐 The Sebulba architecture to scale reinforcement learning on Cloud TPUs in JAX
Dehrl
⭐
31
Diversity−Driven Extensible Hierarchical Reinforcement Learning. AAAI 2019.
Ppo Stein Control Variate
⭐
29
Proximal Policy Optimization with Stein Control Variates:
Pyrl
⭐
29
PyRL - Reinforcement Learning Framework in Pytorch (Policy Gradient, DQN, DDPG, TD3, PPO, SAC, etc.)
Drlkit
⭐
29
A High Level Python Deep Reinforcement Learning library. Great for beginners, prototyping and quickly comparing algorithms
Rainy
⭐
28
☔ Deep RL agents with PyTorch☔
Reinforcement_learning_with_pytorch
⭐
27
Implement some algorithms of RL
Apex
⭐
27
A continuous deep reinforcement learning framework for robotics
Trading_gym
⭐
25
a unified environment for supervised learning and reinforcement learning in the context of quantitative trading
General
⭐
25
国内第一个基于TensorFlow2.0、支持非gym环境训练、支持可视化配置的强化学习应用编程框架
Ppo Rnd
⭐
24
Random network distillation on Montezuma's Revenge and Super Mario Bros.
Related Searches
Python Machine Learning (20,195)
Python Tensorflow (13,736)
Python Deep Learning (13,092)
Python Jupyter Notebook (12,976)
Python Network (11,495)
Python Video Game (10,254)
Python Algorithms (10,033)
Python Testing (9,339)
Python Artificial Intelligence (8,580)
Python Pytorch (7,877)
1-100 of 215 search results
Next >
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.