Pg_travel

Policy Gradient algorithms (REINFORCE, NPG, TRPO, PPO)
Alternatives To Pg_travel
Project NameStarsDownloadsRepos Using ThisPackages Using ThisMost Recent CommitTotal ReleasesLatest ReleaseOpen IssuesLicenseLanguage
Tianshou7,125103 months ago33August 22, 202397mitPython
An elegant PyTorch deep reinforcement learning library.
Deep Reinforcement Learning With Pytorch2,741
a year ago26mitPython
PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....
Rl Baselines Zoo1,025
2 years ago5mitPython
A collection of 100+ pre-trained RL agents using Stable Baselines, training and hyperparameter optimization included.
Pytorch Rl638
3 years ago6mitPython
PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.
Hands On Reinforcement Learning With Python596
4 years ago2Jupyter Notebook
Master Reinforcement and Deep Reinforcement Learning using OpenAI Gym and TensorFlow
Modular_rl523
6 years ago10mitPython
Implementation of TRPO and related algorithms
Reinforcement Learning Algorithms407
3 years ago4Python
This repository contains most of pytorch implementation based classic deep reinforcement learning algorithms, including - DQN, DDQN, Dueling Network, DDPG, SAC, A2C, PPO, TRPO. (More algorithms are still in progress)
Reinforcement Implementation380
2 years ago1Python
Implementation of benchmark RL algorithms
Deep_rl372
3 years ago1mitPython
PyTorch implementations of deep reinforcement learning algorithms
Machine Learning Is All You Need337
8 months agoPython
🔥🌟《Machine Learning 格物志》: ML + DL + RL basic codes and notes by sklearn, PyTorch, TensorFlow, Keras & the most important, from scratch!💪 This repository is ALL You Need!
Alternatives To Pg_travel
Select To Compare


Alternative Project Comparisons
Popular Ppo Projects
Popular Trpo Projects
Popular Machine Learning Categories

Get A Weekly Email With Trending Projects For These Categories
No Spam. Unsubscribe easily at any time.
Python
Gradient
Ppo
Trpo