Awesome Open Source
Search results for policy gradient
173 search results found
Reinforcement Learning With Tensorflow
⭐
8,174
Simple reinforcement learning tutorials from 莫烦Python (Chinese-language AI teaching)
Easy Rl
⭐
7,643
Reinforcement learning tutorial in Chinese (the "Mushroom Book"); read online at: https://datawhalechina.github
Tianshou
⭐
7,125
An elegant PyTorch deep reinforcement learning library.
Reinforcement Learning
⭐
4,115
Learn Deep Reinforcement Learning in 60 days! Lectures & Code in Python. Reinforcement Learning + Deep Learning
Reinforcement Learning
⭐
3,119
Minimal and Clean Reinforcement Learning Examples
Deep Reinforcement Learning With Pytorch
⭐
2,741
PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....
Minimalrl
⭐
2,417
Implementations of basic RL algorithms with minimal lines of code! (PyTorch based)
Seqgan
⭐
1,801
Implementation of Sequence Generative Adversarial Nets with Policy Gradient
Drl
⭐
1,542
Deep Reinforcement Learning
Ppo Pytorch
⭐
1,270
Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch
Slm Lab
⭐
1,052
Modular Deep Reinforcement Learning framework in PyTorch. Companion library of the book "Foundations of Deep Reinforcement Learning".
Btgym
⭐
825
Scalable, event-driven, deep-learning-friendly backtesting library
Pytorch Rl
⭐
638
PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.
Stock_market_reinforcement_learning
⭐
630
This project provides a stock market environment using OpenGym with Deep Q-learning and Policy Gradient.
Rlseq2seq
⭐
610
Deep Reinforcement Learning For Sequence to Sequence Models
Hands On Reinforcement Learning With Python
⭐
596
Master Reinforcement and Deep Reinforcement Learning using OpenAI Gym and TensorFlow
Ddpg Keras Torcs
⭐
595
Using Keras and Deep Deterministic Policy Gradient to play TORCS
Ml Compiler Opt
⭐
553
Infrastructure for Machine Learning Guided Optimization (MLGO) in LLVM.
Awesome Monte Carlo Tree Search Papers
⭐
537
A curated list of Monte Carlo tree search papers with implementations.
Deep Rl Keras
⭐
521
Keras Implementation of popular Deep RL Algorithms (A3C, DDQN, DDPG, Dueling DDQN)
Deer
⭐
481
DEEp Reinforcement learning framework
Tensorflow Reinforce
⭐
477
Implementations of Reinforcement Learning Models in Tensorflow
Rl_algorithms
⭐
470
Structural implementation of RL key algorithms
Seqgan
⭐
441
A simplified PyTorch implementation of "SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient." (Yu, Lantao, et al.)
Lagom
⭐
365
lagom: A PyTorch infrastructure for rapid prototyping of reinforcement learning algorithms.
Reinforcement_learning_tutorial_with_demo
⭐
357
Reinforcement Learning Tutorial with Demo: DP (Policy and Value Iteration), Monte Carlo, TD Learning (SARSA, QLearning), Function Approximation, Policy Gradient, DQN, Imitation, Meta Learning, Papers, Courses, etc..
Pytorch Rl
⭐
356
This repository contains model-free deep reinforcement learning algorithms implemented in Pytorch
Trpo
⭐
315
Trust Region Policy Optimization with TensorFlow and OpenAI Gym
Openai_lab
⭐
314
An experimentation framework for Reinforcement Learning using OpenAI Gym, Tensorflow, and Keras.
Text_summurization_abstractive_methods
⭐
310
Multiple implementations of abstractive text summarization, using Google Colab
Handyrl
⭐
278
HandyRL is a handy and simple framework based on Python and PyTorch for distributed reinforcement learning that is applicable to your own environments.
Multihopkg
⭐
252
Multi-hop knowledge graph reasoning learned via policy gradient with reward shaping and action dropout
Reinforcement_learning
⭐
250
Reinforcement learning tutorials
Reinforcement Learning Kr
⭐
238
Examples for the book [Reinforcement Learning with Python and Keras] (파이썬과 케라스로 배우는 강화학습)
Pytorch Maddpg
⭐
231
A pytorch implementation of MADDPG (multi-agent deep deterministic policy gradient)
Reinforcementzerotoall
⭐
221
Pytorch Ddpg
⭐
207
Implementation of the Deep Deterministic Policy Gradient (DDPG) using PyTorch
Seqgan Pytorch
⭐
178
An implementation of SeqGAN in PyTorch, following the implementation in TensorFlow.
Phasic Policy Gradient
⭐
175
Code for the paper "Phasic Policy Gradient"
A2c
⭐
159
A Clearer and Simpler Synchronous Advantage Actor Critic (A2C) Implementation in TensorFlow
Question Generation
⭐
148
Neural text-to-text question generation
Deep Algotrading
⭐
145
A resource for learning about deep learning techniques from regression to LSTM and Reinforcement Learning using financial data and the fitness functions of algorithmic trading
Show Adapt And Tell
⭐
142
Code for "Show, Adapt and Tell: Adversarial Training of Cross-domain Image Captioner" in ICCV 2017
Openai Cartpole
⭐
129
random search, hill climbing, policy gradient
Deeprl_algorithms
⭐
112
Deep RL algorithm implementations that are easy to read and understand, in PyTorch and TensorFlow 2 (DQN, REINFORCE, VPG, A2C, TRPO, PPO, DDPG, TD3, SAC)
Paddle Rlbooks
⭐
108
Paddle-RLBooks is a reinforcement learning code study guide based on pure PaddlePaddle.
Torchrl
⭐
107
Highly Modular and Scalable Reinforcement Learning
Mlds2018spring
⭐
106
Machine Learning and having it Deep and Structured (MLDS) in 2018 spring
Episodic Transformer Memory Ppo
⭐
99
Clean baseline implementation of PPO using an episodic TransformerXL memory
Reinforcement Learning
⭐
95
🤖 Implementations of reinforcement learning algorithms.
Deep Reinforcement Learning With Python
⭐
94
Master classic RL, deep RL, distributional RL, inverse RL, and more using OpenAI Gym and TensorFlow with extensive Math
Reinforcement_learning
⭐
89
Implementations of basic reinforcement learning algorithms
Openai Gym Policy Gradient
⭐
88
Reinforcement Learning using Policy Gradient to solve OpenAI Gym games
Yarll
⭐
84
Combining deep learning and reinforcement learning.
Sqddpg
⭐
84
A framework for research on multi-agent reinforcement learning, including the implementation of the experiments in the paper "Shapley Q-value: A Local Reward Approach to Solve Global Reward Games".
Recurrent Ppo Truncated Bptt
⭐
82
Baseline implementation of recurrent PPO using truncated BPTT
Codegan
⭐
74
[Deprecated] Source Code Generation using Sequence Generative Adversarial Networks
Rl Intro
⭐
73
Rl Course Experiments
⭐
73
Explorer
⭐
68
Explorer is a PyTorch reinforcement learning framework for exploring new ideas.
Pytorch Rl
⭐
64
Tutorials for reinforcement learning in PyTorch and Gym by implementing a few of the popular algorithms. [IN PROGRESS]
Reinforcement_learning
⭐
62
Deep Reinforcement Learning Algorithms implemented with Tensorflow 2.3
Reinforcement Learning Algorithms And Dynamic Programming
⭐
60
Reinforcement learning Algorithms such as SARSA, Q learning, Actor-Critic Policy Gradient and Value Function Approximation were applied to stabilize an inverted pendulum system and achieve optimal control. So essentially, the concept of Reinforcement Learning Controllers has been established. The Reinforcement Learning Controllers have been compared on the basis of performance and efficiency and they are separately compared with the classical Linear Quadratic Regulator Controller. Each of the RL
Imitation_learning
⭐
56
PyTorch implementation of some reinforcement learning algorithms: A2C, PPO, Behavioral Cloning from Observation (BCO), GAIL.
Drl_in_cv
⭐
54
A course on Deep Reinforcement Learning in Computer Vision. Visit Website:
Spinning Up A Pong Ai With Deep Rl
⭐
53
Code for "Spinning Up a Pong AI With Deep RL" on FloydHub.
Fruit Api
⭐
50
A Universal Deep Reinforcement Learning Framework
Tensorflow Rl Pong
⭐
49
Pong AI trained using policy gradient-based reinforcement learning
Cst_captioning
⭐
48
PyTorch Implementation of Consensus-based Sequence Training for Video Captioning
Reinforcementlearning
⭐
48
Reinforcing Your Learning of Reinforcement Learning
Photo Editing Tensorflow
⭐
47
Photo Optimizing Adversarial Net with Policy Gradient Method
Torch Policy Gradient
⭐
44
Deterministic Policy Gradient using torch7
Pytorch Learn Reinforcement Learning
⭐
42
A collection of various RL algorithms like policy gradients, DQN and PPO. The goal of this repo will be to make it a go-to resource for learning about RL. How to visualize, debug and solve RL problems. I've additionally included playground.py for learning more about OpenAI gym, etc.
Reinforcement Learning
⭐
39
Personal experiments on Reinforcement Learning
Policy Gradient Methods
⭐
35
Implementation of Algorithms from the Policy Gradient Family. Currently includes: A2C, A3C, DDPG, TD3, SAC
Chainer Seqgan
⭐
34
implementation of SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient
Reinforcement Learning
⭐
33
Reinforcement Learning Examples Of Policy Gradients, PPO+GAE, and DDQN Using OpenAI Gym and PyTorch
Reinforcementlearning
⭐
32
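A reinforcement learning algorithm library containing code for the current mainstream reinforcement learning algorithms (value-based and policy-based); all code has been debugged and runs.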
On The Fly Fgsbir
⭐
30
[CVPR 2020, Oral] "Sketch Less for More: On-the-Fly Fine-Grained Sketch Based Image Retrieval", IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), 2020.
Seqgan Pytorch
⭐
30
Implementation of Sequence Generative Adversarial Nets with Policy Gradient in PyTorch
Deep_rl_acrobot
⭐
30
TensorFlow A2C to solve Acrobot, with synchronized parallel environments
Sinkhorn Policy Gradient.pytorch
⭐
29
Code accompanying the paper "Learning Permutations with Sinkhorn Policy Gradient"
Connect4
⭐
29
Solving board games like Connect4 using Deep Reinforcement Learning
Optimization_of_image_description_metrics_using_policy_gradient_methods
⭐
29
TensorFlow implementation of the paper: Optimization of image description metrics using policy gradient methods
Lirpg
⭐
29
Rl Implementation Impala
⭐
28
A Test-Implementation of the IMPALA algorithm (by deepmind 2018)
Policy Gradient Pong
⭐
27
TensorFlow implementation of Andrej Karpathy's blog post about reinforcement learning. http://karpathy.github.io/2016/05/31/rl/
Tutorial4rl
⭐
25
Tutorial4RL: Tutorial for Reinforcement Learning. An introductory reinforcement learning tutorial.
Practical_rl
⭐
24
My solutions to Yandex Practical Reinforcement Learning course in PyTorch and Tensorflow
Python
⭐
23
Deep Reinforcement Learning
⭐
23
A collection of several Deep Reinforcement Learning techniques (Deep Q Learning, Policy Gradients, ...), gets updated over time.
Tic Tac Toe
⭐
23
Train agents to play Tic-Tac-Toe using Policy Gradient
Rl
⭐
22
A set of RL experiments. Currently including: (1) the MDP rank experiment, based on policy gradient algorithm
Ddpg Pytorch
⭐
21
Deep Deterministic Policy Gradient implemented in PyTorch for DeepMind Control Suite
Parl Sample
⭐
20
Deep reinforcement learning using Baidu PARL (maze, Flappy Bird, and so on)
Policy Gradient Importance Sampling
⭐
20
Policy gradient reinforcement learning algorithm with importance sampling
Interpolated Policy Gradient With Ppo For Robotics Control
⭐
20
Reinforcement Learning for robotics continuous control, mainly based on Proximal Policy Optimization, extending to Interpolated Policy Gradient and Hindsight Experience Replay (HER)
Stock_market_reinforcement_learning
⭐
19
Adl2019
⭐
19
Applied Deep Learning (2019 Spring) @ NTU
Rl Short Course
⭐
18
Reinforcement Learning Short Course
1-100 of 173 search results
Copyright 2018-2024 Awesome Open Source. All rights reserved.