Pytorch Cpo

PyTorch implementation of Constrained Policy Optimization
Alternatives To Pytorch Cpo
Tianshou
An elegant PyTorch deep reinforcement learning library.
Stars: 7,125; Downloads, repos, or packages using this: 10; Most recent commit: 3 months ago; Total releases: 33; Latest release: August 22, 2023; Open issues: 97; License: MIT; Language: Python

Deep Reinforcement Learning With Pytorch
PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....
Stars: 2,741; Most recent commit: a year ago; Open issues: 26; License: MIT; Language: Python

Minimalrl
Implementations of basic RL algorithms with minimal lines of codes! (pytorch based)
Stars: 2,417; Most recent commit: a year ago; Open issues: 21; License: MIT; Language: Python

Ppo Pytorch
Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch
Stars: 1,270; Most recent commit: 4 months ago; Open issues: 10; License: MIT; Language: Python

Slm Lab
Modular Deep Reinforcement Learning framework in PyTorch. Companion library of the book "Foundations of Deep Reinforcement Learning".
Stars: 1,052; Most recent commit: 2 years ago; Open issues: 7; License: MIT; Language: Python

Pytorch Rl
PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.
Stars: 638; Most recent commit: 3 years ago; Open issues: 6; License: MIT; Language: Python

Rl_algorithms
Structural implementation of RL key algorithms
Stars: 470; Most recent commit: a year ago; Open issues: 15; License: MIT; Language: Python

Lagom
lagom: A PyTorch infrastructure for rapid prototyping of reinforcement learning algorithms.
Stars: 365; Most recent commit: 4 years ago; Open issues: 5; License: MIT; Language: Jupyter Notebook

Pytorch Rl
This repository contains model-free deep reinforcement learning algorithms implemented in Pytorch
Stars: 356; Most recent commit: 5 years ago; Total releases: 2; Latest release: August 20, 2018; Open issues: 3; Language: Python

Handyrl
HandyRL is a handy and simple framework based on Python and PyTorch for distributed reinforcement learning that is applicable to your own environments.
Stars: 278; Most recent commit: 3 months ago; Open issues: 40; License: MIT; Language: Python