Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for proximal policy optimization
proximal-policy-optimization
x
54 search results found
Reinforcement Learning With Tensorflow
⭐
8,174
Simple Reinforcement learning tutorials, 莫烦Python 中文AI教学
Cleanrl
⭐
3,947
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
Pytorch A2c Ppo Acktr Gail
⭐
3,484
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
Ppo Pytorch
⭐
1,270
Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch
Super Mario Bros Ppo Pytorch
⭐
692
Proximal Policy Optimization (PPO) algorithm for Super Mario Bros
Pytorch Rl
⭐
638
PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.
Autonomous Learning Library
⭐
616
A PyTorch library for building deep reinforcement learning agents.
Reinforcement Learning Algorithms
⭐
407
This repository contains most of pytorch implementation based classic deep reinforcement learning algorithms, including - DQN, DDQN, Dueling Network, DDPG, SAC, A2C, PPO, TRPO. (More algorithms are still in progress)
Pytorch Drl
⭐
387
PyTorch implementations of various Deep Reinforcement Learning (DRL) algorithms for both single agent and multi-agent.
Lagom
⭐
365
lagom: A PyTorch infrastructure for rapid prototyping of reinforcement learning algorithms.
Pytorch Cpp Rl
⭐
308
PyTorch C++ Reinforcement Learning
Tf_deep_rl_trader
⭐
186
Trading Environment(OpenAI Gym) + PPO(TensorForce)
Torch Ac
⭐
134
Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO
Episodic Transformer Memory Ppo
⭐
99
Clean baseline implementation of PPO using an episodic TransformerXL memory
Curiosity Driven Exploration Pytorch
⭐
98
Curiosity-driven Exploration by Self-supervised Prediction
Sc2aibot
⭐
84
Implementing reinforcement-learning algorithms for pysc2 -environment
Recurrent Ppo Truncated Bptt
⭐
82
Baseline implementation of recurrent PPO using truncated BPTT
Gail_gym
⭐
75
Implementation of Generatve Adversarial Imitation Learning (GAIL) for classic environments from OpenAI Gym.
Autonomous Driving In Carla Using Deep Reinforcement Learning
⭐
61
Deep Reinforcement Learning (PPO) in Autonomous Driving (Carla) [from scratch]
Imitation_learning
⭐
56
PyTorch implementation of some reinforcement learning algorithms: A2C, PPO, Behavioral Cloning from Observation (BCO), GAIL.
Relational_deep_reinforcement_learning
⭐
41
Ppo
⭐
40
PyTorch implementation of Proximal Policy Optimization
Carla Driving Rl Agent
⭐
39
Code for the paper "Reinforced Curriculum Learning for Autonomous Driving in CARLA" (ICIP 2021)
Rl_matrix
⭐
36
Reinforcement Learning Agents in .NET
Ppo Pytorch
⭐
35
Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)
Pop3d
⭐
35
Policy Optimization with Penalized Point Probability Distance: an Alternative to Proximal Policy Optimization
Reinforcement Learning
⭐
33
Reinforcement Learning Examples Of Policy Gradients, PPO+GAE, and DDQN Using OpenAI Gym and PyTorch
Ppo_jax
⭐
33
Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights on all environments.
Reinforcementlearning
⭐
32
强化学习算法库,包含了目前主流的强化学习算法(Value based and Policy basd)的代码,代码都经过调试并可以运行
Distributed Ppo
⭐
23
This is an pytorch implementation of Distributed Proximal Policy Optimization(DPPO).
Walk_the_blocks
⭐
22
Implementation of Scheduled Policy Optimization for task-oriented language grouding
Tradernet Crv2
⭐
18
TraderNet-CRv2 - Combining Deep Reinforcement Learning with Technical Analysis and Trend Monitoring on Cryptocurrency Markets
Google Football Pytorch
⭐
15
It's the pytorch implementation of google research football.
Reinforcement_learning_ppo_rnd
⭐
14
Deep Reinforcement Learning by using Proximal Policy Optimization and Random Network Distillation in Tensorflow 2 and Pytorch with some explanation
Btc_rl_trading_bot
⭐
14
A trading bitcoin agent was created with deep reinforcement learning implementations.
Reinforcement_learning_phasic_policy_gradient
⭐
10
Deep Reinforcement Learning by using Phasic Policy Gradient in Pytorch & Tensorflow
Hospitalbot Path Planning
⭐
10
This repository contains an application using ROS2 Humble, Gazebo, OpenAI Gym and Stable Baselines3 to train reinforcement learning agents for a path planning problem.
Pysc2_rl
⭐
9
Spacefortress
⭐
9
OpenAI Gym compatible reinforcement learning environment for Space Fortress https://arxiv.org/abs/1809.02206
Rlbox
⭐
8
RLbox: Solving OpenAI Gym with TensorFlow
Ppo Tensorflow 2.0
⭐
8
Proximal Policy Optimization with Tensorflow 2.0
Protorl
⭐
8
A Torch Based RL Framework for Rapid Prototyping of Research Papers
Deep Rl
⭐
8
You can see a reference for Books, Articles, Courses and Educational Materials in this field. Implementation of Reinforcement Learning Algorithms and Environments. Python, OpenAI Gym, Tensorflow.
Icm Ppo Implementation
⭐
8
Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM) on Pyramid env, Unity ML
Transportation Routes Optimization By Rl
⭐
7
Application of reinforcement learning to Optimize transportation routes using reinforcement learning
Sb3 Jax Haiku
⭐
7
stable-baselines with JAX & Haiku
Generative_adversarial_imitation_learning
⭐
6
Machinelearning
⭐
6
Various machine learning implementations and tools
Spinning_up_kr
⭐
6
Neurips2018 Aiforprosthetics
⭐
6
Reinforcement learning with musculoskeletal models
Soccer Ppo
⭐
5
Udacity Deep Reinforcement Learning Nanodegree Program
Mappo Competitive Reinforcement
⭐
5
A Multi-Agent Proximal Policy Optimization approach to a competitive reinforcement learning problem.
Rl_integrated Updraft Exploitation
⭐
5
This repository includes a reinforcement learning framework for end-to-end type integrated thermal updraft localization and exploitation.
Car Racing Ppo
⭐
5
Implementation of a Deep Reinforcement Learning algorithm, Proximal Policy Optimization (SOTA), on a continuous action space openai gym (Box2D/Car Racing v0)
1-54 of 54 search results
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.