Awesome Open Source
Awesome Open Source

pytorch-madrl

This project includes PyTorch implementations of various Deep Reinforcement Learning algorithms for both single agent and multi-agent.

  • [ ] A2C
  • [ ] ACKTR
  • [ ] DQN
  • [ ] DDPG
  • [ ] PPO

It is written in a modular way to allow for sharing code between different algorithms. In specific, each algorithm is represented as a learning agent with a unified interface including the following components:

  • [ ] interact: interact with the environment to collect experience. Taking one step forward and n steps forward are both supported (see _take_one_step_ and _take_n_steps, respectively)
  • [ ] train: train on a sample batch
  • [ ] exploration_action: choose an action based on state with random noise added for exploration in training
  • [ ] action: choose an action based on state for execution
  • [ ] value: evaluate value for a state-action pair
  • [ ] evaluation: evaluation the learned agent

Requirements

  • gym
  • python 3.6
  • pytorch

Usage

To train a model:

$ python run_a2c.py

Results

It's extremely difficult to reproduce results for Reinforcement Learning algorithms. Due to different settings, e.g., random seed and hyper parameters etc, you might get different results compared with the followings.

A2C

CartPole-v0

ACKTR

CartPole-v0

DDPG

Pendulum-v0

DQN

CartPole-v0

PPO

CartPole-v0

TODO

  • [ ] TRPO
  • [ ] LOLA
  • [ ] Parameter noise

Acknowledgments

This project gets inspirations from the following projects:

License

MIT

Alternatives To Pytorch Drl
Select To Compare


Alternative Project Comparisons
Related Awesome Lists
Top Programming Languages
Top Projects

Get A Weekly Email With Trending Projects For These Topics
No Spam. Unsubscribe easily at any time.
Pytorch (20,745
Reinforcement Learning (4,362
Rl (2,338
Dqn (1,410
Deep Reinforcement Learning (1,007
Ppo (606
Ddpg (540
Actor Critic (201
Multi Agent (116
Deep Q Network (110
A2c (101
Proximal Policy Optimization (72
Drl (46
Deep Deterministic Policy Gradient (29
Advantage Actor Critic (23
Acktr (10