Awesome Open Source
Search results for ppo
344 search results found
Baselines
⭐
14,949
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
Reinforcement Learning With Tensorflow
⭐
8,174
Simple reinforcement learning tutorials, 莫烦Python (Chinese-language AI tutorials)
Easy Rl
⭐
7,643
Chinese-language reinforcement learning tutorial (the "Mushroom Book"); read online at: https://datawhalechina.github
Tianshou
⭐
7,125
An elegant PyTorch deep reinforcement learning library.
Deep Reinforcement Learning
⭐
4,635
Repo for the Deep Reinforcement Learning Nanodegree program
Reinforcement Learning
⭐
4,097
Learn Deep Reinforcement Learning in 60 days! Lectures & Code in Python. Reinforcement Learning + Deep Learning
Cleanrl
⭐
3,947
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
Deep_reinforcement_learning_course
⭐
3,581
Implementations from the free course Deep Reinforcement Learning with Tensorflow and PyTorch
Pytorch A2c Ppo Acktr Gail
⭐
3,450
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
Elegantrl
⭐
3,229
Massively Parallel Deep Reinforcement Learning. 🔥
Football
⭐
3,177
Check out the new game server:
Deeprl
⭐
2,834
Modularized Implementation of Deep RL Algorithms in PyTorch
Deep Reinforcement Learning With Pytorch
⭐
2,741
PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....
Rl Stock
⭐
2,419
📈 How to trade stocks automatically with deep reinforcement learning
Minimalrl
⭐
2,417
Implementations of basic RL algorithms with minimal lines of code! (PyTorch based)
Finrl Trading
⭐
1,858
For trading. Please star.
Rl Baselines3 Zoo
⭐
1,640
A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.
Ppo Pytorch
⭐
1,270
Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch
Slm Lab
⭐
1,052
Modular Deep Reinforcement Learning framework in PyTorch. Companion library of the book "Foundations of Deep Reinforcement Learning".
Rl Baselines Zoo
⭐
1,025
A collection of 100+ pre-trained RL agents using Stable Baselines, training and hyperparameter optimization included.
On Policy
⭐
990
This is the official implementation of Multi-Agent PPO (MAPPO).
Batch Ppo
⭐
919
Efficient Batched Reinforcement Learning in TensorFlow
Pytorch A3c
⭐
768
PyTorch implementation of Asynchronous Advantage Actor Critic (A3C) from "Asynchronous Methods for Deep Reinforcement Learning".
Deeprl Tutorials
⭐
726
Contains high quality implementations of Deep Reinforcement Learning algorithms written in PyTorch
Super Mario Bros Ppo Pytorch
⭐
692
Proximal Policy Optimization (PPO) algorithm for Super Mario Bros
Pytorch Rl
⭐
638
PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.
Autonomous Learning Library
⭐
616
A PyTorch library for building deep reinforcement learning agents.
Rl_games
⭐
603
RL implementations
Hands On Reinforcement Learning With Python
⭐
596
Master Reinforcement and Deep Reinforcement Learning using OpenAI Gym and TensorFlow
Rl Starter Files
⭐
571
RL starter files to immediately train, visualize and evaluate an agent without writing a single line of code
Rlcode
⭐
560
Modular_rl
⭐
523
Implementation of TRPO and related algorithms
Purejaxrl
⭐
460
Really Fast End-to-End Jax RL Implementations
Ppo For Beginners
⭐
427
A simple and well styled PPO implementation. Based on my Medium series: https://medium.com/@eyyu/coding-ppo-from-scratch-w
Huskarl
⭐
417
Deep Reinforcement Learning Framework + Algorithms
Reinforcement Learning Algorithms
⭐
407
This repository contains PyTorch implementations of most classic deep reinforcement learning algorithms, including DQN, DDQN, Dueling Network, DDPG, SAC, A2C, PPO, and TRPO. (More algorithms are still in progress.)
Rllte
⭐
403
Long-Term Evolution Project of Reinforcement Learning
Pytorch Drl
⭐
387
PyTorch implementations of various Deep Reinforcement Learning (DRL) algorithms for both single agent and multi-agent.
Reinforcement Implementation
⭐
380
Implementation of benchmark RL algorithms
Deep_rl
⭐
372
PyTorch implementations of deep reinforcement learning algorithms
Lagom
⭐
365
lagom: A PyTorch infrastructure for rapid prototyping of reinforcement learning algorithms.
Imitation Learning
⭐
344
Imitation learning algorithms
Xuance
⭐
339
XuanCe: A Comprehensive and Unified Deep Reinforcement Learning Library
Halos
⭐
339
A library with extensible implementations of DPO, KTO, PPO, and other human-centered loss functions (HALOs).
Machine Learning Is All You Need
⭐
337
🔥🌟《Machine Learning 格物志》: ML + DL + RL basic codes and notes by sklearn, PyTorch, TensorFlow, Keras & the most important, from scratch!💪 This repository is ALL You Need!
Pytorch Cpp Rl
⭐
308
PyTorch C++ Reinforcement Learning
Allenact
⭐
288
An open source framework for research in Embodied-AI from AI2.
Xingtian
⭐
282
xingtian is a componentized library for the development and verification of reinforcement learning algorithms
Rad
⭐
273
RAD: Reinforcement Learning with Augmented Data
Lets Do Irl
⭐
269
Inverse RL algorithms (APP, MaxEnt, GAIL, VAIL)
Reinforcement_learning
⭐
250
Reinforcement learning tutorials
Pg_travel
⭐
243
Policy Gradient algorithms (REINFORCE, NPG, TRPO, PPO)
Rlgraph
⭐
241
RLgraph: Modular computation graphs for deep reinforcement learning
Deep Reinforcement Learning Algorithms
⭐
235
32 projects in the framework of Deep Reinforcement Learning algorithms: Q-learning, DQN, PPO, DDPG, TD3, SAC, A2C and others. Each project is provided with a detailed training log.
Llm Rlhf Tuning
⭐
225
LLM Tuning with PEFT (SFT+RM+PPO+DPO with LoRA)
Stable Baselines
⭐
221
Mirror of Stable-Baselines: a fork of OpenAI Baselines, implementations of reinforcement learning algorithms
Machin
⭐
206
Reinforcement learning library (framework) designed for PyTorch, implements DQN, DDPG, A2C, PPO, SAC, MADDPG, A3C, APEX, IMPALA ...
Landing A Spacex Falcon Heavy Rocket
⭐
195
This is the code for "Landing a SpaceX Falcon Heavy Rocket" By Siraj Raval on Youtube
Tf_deep_rl_trader
⭐
186
Trading Environment(OpenAI Gym) + PPO(TensorForce)
Tf2 Rl
⭐
178
Reinforcement learning algorithms implemented for Tensorflow 2.0+ [DQN, DDPG, AE-DDPG, SAC, PPO, Primal-Dual DDPG]
Phasic Policy Gradient
⭐
175
Code for the paper "Phasic Policy Gradient"
Deep_rl_with_pytorch
⭐
174
A pytorch tutorial for DRL(Deep Reinforcement Learning)
Episodic Curiosity
⭐
142
Tensorflow/Keras code and trained models for Episodic Curiosity Through Reachability
Chatglm Maths
⭐
142
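chatglm-6b fine-tuning/LoRA/PPO/inference; samples are automatically generated integer/decimal arithmetic problems (addition, subtraction, multiplication, division); runs on GPU or CPU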
Deeprl Tensorflow2
⭐
139
🐋 Simple implementations of various popular Deep Reinforcement Learning algorithms using TensorFlow2
Torch Ac
⭐
134
Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO
Rl Experiments
⭐
134
Keeping track of RL experiments
Train Procgen
⭐
134
Code for the paper "Leveraging Procedural Generation to Benchmark Reinforcement Learning"
Pytorch Dppo
⭐
129
Pytorch implementation of Distributed Proximal Policy Optimization: https://arxiv.org/abs/1707.02286
Torchrl
⭐
127
PyTorch implementation of reinforcement learning algorithms (Soft Actor Critic (SAC) / DDPG / TD3 / DQN / A2C / PPO / TRPO)
Doom Net Pytorch
⭐
121
Reinforcement learning models in ViZDoom environment
Fsrl
⭐
121
🚀 A fast safe reinforcement learning library in PyTorch
Deeprl_algorithms
⭐
112
DeepRL algorithm implementations that are easy to read and understand, in PyTorch and TensorFlow 2 (DQN, REINFORCE, VPG, A2C, TRPO, PPO, DDPG, TD3, SAC)
Rl Collision Avoidance
⭐
112
Implementation of the paper "Towards Optimally Decentralized Multi-Robot Collision Avoidance via Deep Reinforcement Learning"
Awesome Rl
⭐
110
Awesome RL: Papers, Books, Codes, Benchmarks
Samsung Drl Code
⭐
105
A repository for implementations of deep reinforcement learning lectured at Samsung
Openai_five_vs_dota2_explained
⭐
101
This is the code for "OpenAI Five vs DOTA 2 Explained" By Siraj Raval on Youtube
Episodic Transformer Memory Ppo
⭐
99
Clean baseline implementation of PPO using an episodic TransformerXL memory
Ros2learn
⭐
98
ROS 2 enabled Machine Learning algorithms
Deep Reinforcement Learning With Python
⭐
94
Master classic RL, deep RL, distributional RL, inverse RL, and more using OpenAI Gym and TensorFlow with extensive Math
Rl Examples
⭐
88
Examples of published reinforcement learning algorithms in recent literature implemented in TensorFlow
Gym Continuousdoubleauction
⭐
87
A custom MARL (multi-agent reinforcement learning) environment where multiple agents trade against one another (self-play) in a zero-sum continuous double auction. Ray [RLlib] is used for training.
Safety Starter Agents
⭐
86
Basic constrained RL agents used in experiments for the "Benchmarking Safe Exploration in Deep Reinforcement Learning" paper.
Carla Ppo
⭐
86
This repository hosts a customized PPO based agent for Carla. The goal of this project is to make it easier to interact with and experiment in Carla with reinforcement learning based agents -- this, by wrapping Carla in a gym like environment that can handle custom reward functions, custom debug output, etc.
Simple A2c Ppo
⭐
84
Actor-critic trained with PPO on OpenAI's Procgen Benchmark (PyTorch). Built from scratch.
Sc2aibot
⭐
84
Implementing reinforcement learning algorithms for the pysc2 environment
Human_aware_rl
⭐
84
Code for "On the Utility of Learning about Humans for Human-AI Coordination"
Recurrent Ppo Truncated Bptt
⭐
82
Baseline implementation of recurrent PPO using truncated BPTT
Run Skeleton Run
⭐
81
Reason8.ai PyTorch solution for NIPS RL 2017 challenge
Db Football
⭐
80
A Simple, Distributed and Asynchronous Multi-Agent Reinforcement Learning Framework for Google Research Football AI.
Mujoco Benchmark
⭐
78
Provide full reinforcement learning benchmark on mujoco environments, including ddpg, sac, td3, pg, a2c, ppo, library
Gail_gym
⭐
75
Implementation of Generative Adversarial Imitation Learning (GAIL) for classic environments from OpenAI Gym.
Drl_local_planner_ros_stable_baselines
⭐
73
Ppo
⭐
72
Proximal Policy Optimization implementation with TensorFlow
Gail_ppo_tf
⭐
72
Tensorflow implementation of Generative Adversarial Imitation Learning(GAIL) with discrete action
Dcem
⭐
70
The Differentiable Cross-Entropy Method
Rl Workshop
⭐
69
Reinforcement Learning Workshop for Data Science BKK
Tensorswarm
⭐
69
TensorSwarm: A framework for reinforcement learning of robot swarms.
Q1physrl
⭐
68
Quake 1 movement physics reinforcement learning
Chatglm Rlhf
⭐
68
Apply RLHF directly to ChatGLM to raise or lower the probability of target outputs | Modify ChatGLM output with only RLHF
Related Searches
Python Ppo (446)
Reinforcement Learning Ppo (221)
1-100 of 344 search results
Copyright 2018-2024 Awesome Open Source. All rights reserved.