Awesome Open Source

Programming Languages

Search results for policy gradient

policy-gradient x

173 search results found

Reinforcement Learning With Tensorflow ⭐ 8,174

Simple Reinforcement learning tutorials, 莫烦Python 中文AI教学

Easy Rl ⭐ 7,643

强化学习中文教程（蘑菇书），在线阅读地址：https://datawhalechina.github

Tianshou ⭐ 7,125

An elegant PyTorch deep reinforcement learning library.

Reinforcement Learning ⭐ 4,115

Learn Deep Reinforcement Learning in 60 days! Lectures & Code in Python. Reinforcement Learning + Deep Learning

Reinforcement Learning ⭐ 3,119

Minimal and Clean Reinforcement Learning Examples

Deep Reinforcement Learning With Pytorch ⭐ 2,741

PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....

Minimalrl ⭐ 2,417

Implementations of basic RL algorithms with minimal lines of codes! (pytorch based)

Seqgan ⭐ 1,801

Implementation of Sequence Generative Adversarial Nets with Policy Gradient

Deep Reinforcement Learning

Ppo Pytorch ⭐ 1,270

Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch

Slm Lab ⭐ 1,052

Modular Deep Reinforcement Learning framework in PyTorch. Companion library of the book "Foundations of Deep Reinforcement Learning".

Scalable, event-driven, deep-learning-friendly backtesting library

Pytorch Rl ⭐ 638

PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.

Stock_market_reinforcement_learning ⭐ 630

This project provides a stock market environment using OpenGym with Deep Q-learning and Policy Gradient.

Rlseq2seq ⭐ 610

Deep Reinforcement Learning For Sequence to Sequence Models

Hands On Reinforcement Learning With Python ⭐ 596

Master Reinforcement and Deep Reinforcement Learning using OpenAI Gym and TensorFlow

Ddpg Keras Torcs ⭐ 595

Using Keras and Deep Deterministic Policy Gradient to play TORCS

Ml Compiler Opt ⭐ 553

Infrastructure for Machine Learning Guided Optimization (MLGO) in LLVM.

Awesome Monte Carlo Tree Search Papers ⭐ 537

A curated list of Monte Carlo tree search papers with implementations.

Deep Rl Keras ⭐ 521

Keras Implementation of popular Deep RL Algorithms (A3C, DDQN, DDPG, Dueling DDQN)

DEEp Reinforcement learning framework

Tensorflow Reinforce ⭐ 477

Implementations of Reinforcement Learning Models in Tensorflow

Rl_algorithms ⭐ 470

Structural implementation of RL key algorithms

A simplified PyTorch implementation of "SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient." (Yu, Lantao, et al.)

lagom: A PyTorch infrastructure for rapid prototyping of reinforcement learning algorithms.

Reinforcement_learning_tutorial_with_demo ⭐ 357

Reinforcement Learning Tutorial with Demo: DP (Policy and Value Iteration), Monte Carlo, TD Learning (SARSA, QLearning), Function Approximation, Policy Gradient, DQN, Imitation, Meta Learning, Papers, Courses, etc..

Pytorch Rl ⭐ 356

This repository contains model-free deep reinforcement learning algorithms implemented in Pytorch

Trust Region Policy Optimization with TensorFlow and OpenAI Gym

Openai_lab ⭐ 314

An experimentation framework for Reinforcement Learning using OpenAI Gym, Tensorflow, and Keras.

Text_summurization_abstractive_methods ⭐ 310

Multiple implementations for abstractive text summurization , using google colab

Handyrl ⭐ 278

HandyRL is a handy and simple framework based on Python and PyTorch for distributed reinforcement learning that is applicable to your own environments.

Multihopkg ⭐ 252

Multi-hop knowledge graph reasoning learned via policy gradient with reward shaping and action dropout

Reinforcement_learning ⭐ 250

Reinforcement learning tutorials

Reinforcement Learning Kr ⭐ 238

[파이썬과 케라스로 배우는 강화학습] 예제

Pytorch Maddpg ⭐ 231

A pytorch implementation of MADDPG (multi-agent deep deterministic policy gradient)

Reinforcementzerotoall ⭐ 221

Pytorch Ddpg ⭐ 207

Implementation of the Deep Deterministic Policy Gradient (DDPG) using PyTorch

Seqgan Pytorch ⭐ 178

A implementation of SeqGAN in PyTorch, following the implementation in tensorflow.

Phasic Policy Gradient ⭐ 175

Code for the paper "Phasic Policy Gradient"

A Clearer and Simpler Synchronous Advantage Actor Critic (A2C) Implementation in TensorFlow

Question Generation ⭐ 148

Neural text-to-text question generation

Deep Algotrading ⭐ 145

A resource for learning about deep learning techniques from regression to LSTM and Reinforcement Learning using financial data and the fitness functions of algorithmic trading

Show Adapt And Tell ⭐ 142

Code for "Show, Adapt and Tell: Adversarial Training of Cross-domain Image Captioner" in ICCV 2017

Openai Cartpole ⭐ 129

random search, hill climbing, policy gradient

Deeprl_algorithms ⭐ 112

DeepRL algorithms implementation easy for understanding and reading with Pytorch and Tensorflow 2(DQN, REINFORCE, VPG, A2C, TRPO, PPO, DDPG, TD3, SAC)

Paddle Rlbooks ⭐ 108

Paddle-RLBooks is a reinforcement learning code study guide based on pure PaddlePaddle.

Torchrl ⭐ 107

Highly Modular and Scalable Reinforcement Learning

Mlds2018spring ⭐ 106

Machine Learning and having it Deep and Structured (MLDS) in 2018 spring

Episodic Transformer Memory Ppo ⭐ 99

Clean baseline implementation of PPO using an episodic TransformerXL memory

Reinforcement Learning ⭐ 95

🤖 Implements of Reinforcement Learning algorithms.

Deep Reinforcement Learning With Python ⭐ 94

Master classic RL, deep RL, distributional RL, inverse RL, and more using OpenAI Gym and TensorFlow with extensive Math

Reinforcement_learning ⭐ 89

강화학습에 대한 기본적인 알고리즘 구현

Openai Gym Policy Gradient ⭐ 88

Reinforcement Learning using Policy Gradient to solve OpenAI Gym games

Combining deep learning and reinforcement learning.

This is a framework for the research on multi-agent reinforcement learning and the implementation of the experiments in the paper titled by ''Shapley Q-value: A Local Reward Approach to Solve Global Reward Games''.

Recurrent Ppo Truncated Bptt ⭐ 82

Baseline implementation of recurrent PPO using truncated BPTT

[Deprecated] Source Code Generation using Sequence Generative Adversarial Networks :octocat:

Rl Intro ⭐ 73

Rl Course Experiments ⭐ 73

Explorer ⭐ 68

Explorer is a PyTorch reinforcement learning framework for exploring new ideas.

Pytorch Rl ⭐ 64

Tutorials for reinforcement learning in PyTorch and Gym by implementing a few of the popular algorithms. [IN PROGRESS]

Reinforcement_learning ⭐ 62

Deep Reinforcement Learning Algorithms implemented with Tensorflow 2.3

Reinforcement Learning Algorithms And Dynamic Programming ⭐ 60

Reinforcement learning Algorithms such as SARSA, Q learning, Actor-Critic Policy Gradient and Value Function Approximation were applied to stabilize an inverted pendulum system and achieve optimal control. So essentially, the concept of Reinforcement Learning Controllers has been established. The Reinforcement Learning Controllers have been compared on the basis of performance and efficiency and they are separately compared with the classical Linear Quadratic Regulator Controller. Each of the RL

Imitation_learning ⭐ 56

PyTorch implementation of some reinforcement learning algorithms: A2C, PPO, Behavioral Cloning from Observation (BCO), GAIL.

Drl_in_cv ⭐ 54

A course on Deep Reinforcement Learning in Computer Vision. Visit Website:

Spinning Up A Pong Ai With Deep Rl ⭐ 53

Code for "Spinning Up a Pong AI With Deep RL" on FloydHub.

Fruit Api ⭐ 50

A Universal Deep Reinforcement Learning Framework

Tensorflow Rl Pong ⭐ 49

Pong AI trained using policy gradient-based reinforcement learning

Cst_captioning ⭐ 48

PyTorch Implementation of Consensus-based Sequence Training for Video Captioning

Reinforcementlearning ⭐ 48

Reinforcing Your Learning of Reinforcement Learning

Photo Editing Tensorflow ⭐ 47

Photo Optimizing Adversarial Net with Policy Gradient Method

Torch Policy Gradient ⭐ 44

Deterministic Policy Gradient using torch7

Pytorch Learn Reinforcement Learning ⭐ 42

A collection of various RL algorithms like policy gradients, DQN and PPO. The goal of this repo will be to make it a go-to resource for learning about RL. How to visualize, debug and solve RL problems. I've additionally included playground.py for learning more about OpenAI gym, etc.

Reinforcement Learning ⭐ 39

Personal experiments on Reinforcement Learning

Policy Gradient Methods ⭐ 35

Implementation of Algorithms from the Policy Gradient Family. Currently includes: A2C, A3C, DDPG, TD3, SAC

Chainer Seqgan ⭐ 34

implementation of SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient

Reinforcement Learning ⭐ 33

Reinforcement Learning Examples Of Policy Gradients, PPO+GAE, and DDQN Using OpenAI Gym and PyTorch

Reinforcementlearning ⭐ 32

强化学习算法库，包含了目前主流的强化学习算法(Value based and Policy basd)的代码，代码都经过调试并可以运行

On The Fly Fgsbir ⭐ 30

[CVPR 2020, Oral] "Sketch Less for More: On-the-Fly Fine-Grained Sketch Based Image Retrieval”, IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), 2020. .

Seqgan Pytorch ⭐ 30

Implementation of Sequence Generative Adversarial Nets with Policy Gradient in PyTorch

Deep_rl_acrobot ⭐ 30

TensorFlow A2C to solve Acrobot, with synchronized parallel environments

Sinkhorn Policy Gradient.pytorch ⭐ 29

Code accompanying the paper "Learning Permutations with Sinkhorn Policy Gradient"

Connect4 ⭐ 29

Solving board games like Connect4 using Deep Reinforcement Learning

Optimization_of_image_description_metrics_using_policy_gradient_methods ⭐ 29

Tensorflow implement of paper: Optimization of image description metrics using policy gradient methods

Rl Implementation Impala ⭐ 28

A Test-Implementation of the IMPALA algorithm (by deepmind 2018)

Policy Gradient Pong ⭐ 27

tensorflow implementation of Andrej Karpathy's blog about reinforcement learning. http://karpathy.github.io/2016/05/31/rl/

Tutorial4rl ⭐ 25

Tutorial4RL: Tutorial for Reinforcement Learning. 强化学习入门教程.

Practical_rl ⭐ 24

My solutions to Yandex Practical Reinforcement Learning course in PyTorch and Tensorflow

Deep Reinforcement Learning ⭐ 23

A collection of several Deep Reinforcement Learning techniques (Deep Q Learning, Policy Gradients, ...), gets updated over time.

Tic Tac Toe ⭐ 23

Train agents to play Tic-Tac-Toe using Policy Gradient

A set of RL experiments. Currently including: (1) the MDP rank experiment, based on policy gradient algorithm

Ddpg Pytorch ⭐ 21

Deep Deterministic Policy Gradient implemented in PyTorch for DeepMind Control Suite

Parl Sample ⭐ 20

Deep reinforcement learning using baidu PARL(maze,flappy bird and so on)

Policy Gradient Importance Sampling ⭐ 20

Policy gradient reinforcement learning algorithm with importance sampling

Interpolated Policy Gradient With Ppo For Robotics Control ⭐ 20

Reinforcement Learning for robotics continuous control, mainly based on Proximal Policy Optimization, extending to Interpolated Policy Gradient and Hindsight Experience Replay (HER)

Stock_market_reinforcement_learning ⭐ 19

Applied Deep Learning (2019 Spring) @ NTU

Rl Short Course ⭐ 18

Reinforcement Learning Short Course

1-100 of 173 search results

Privacy | About | Terms | Follow Us On Twitter

Copyright 2018-2024 Awesome Open Source. All rights reserved.