Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for python ppo
ppo
x
python
x
260 search results found
Baselines
⭐
14,949
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
Reinforcement Learning With Tensorflow
⭐
8,174
Simple Reinforcement learning tutorials, 莫烦Python 中文AI教学
Tianshou
⭐
7,125
An elegant PyTorch deep reinforcement learning library.
Cleanrl
⭐
3,947
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
Football
⭐
3,177
Check out the new game server:
Deeprl
⭐
2,834
Modularized Implementation of Deep RL Algorithms in PyTorch
Deep Reinforcement Learning With Pytorch
⭐
2,741
PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....
Minimalrl
⭐
2,417
Implementations of basic RL algorithms with minimal lines of codes! (pytorch based)
Rl Baselines3 Zoo
⭐
1,640
A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.
Ppo Pytorch
⭐
1,270
Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch
Slm Lab
⭐
1,052
Modular Deep Reinforcement Learning framework in PyTorch. Companion library of the book "Foundations of Deep Reinforcement Learning".
Rl Baselines Zoo
⭐
1,025
A collection of 100+ pre-trained RL agents using Stable Baselines, training and hyperparameter optimization included.
On Policy
⭐
990
This is the official implementation of Multi-Agent PPO (MAPPO).
Batch Ppo
⭐
919
Efficient Batched Reinforcement Learning in TensorFlow
Pytorch A3c
⭐
768
PyTorch implementation of Asynchronous Advantage Actor Critic (A3C) from "Asynchronous Methods for Deep Reinforcement Learning".
Deeprl Tutorials
⭐
726
Contains high quality implementations of Deep Reinforcement Learning algorithms written in PyTorch
Super Mario Bros Ppo Pytorch
⭐
692
Proximal Policy Optimization (PPO) algorithm for Super Mario Bros
Pytorch Rl
⭐
638
PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.
Autonomous Learning Library
⭐
616
A PyTorch library for building deep reinforcement learning agents.
Rl Starter Files
⭐
571
RL starter files in order to immediatly train, visualize and evaluate an agent without writing any line of code
Rlcode
⭐
560
Modular_rl
⭐
523
Implementation of TRPO and related algorithms
Purejaxrl
⭐
460
Really Fast End-to-End Jax RL Implementations
Ppo For Beginners
⭐
427
A simple and well styled PPO implementation. Based on my Medium series: https://medium.com/@eyyu/coding-ppo-from-scratch-w
Huskarl
⭐
417
Deep Reinforcement Learning Framework + Algorithms
Reinforcement Learning Algorithms
⭐
407
This repository contains most of pytorch implementation based classic deep reinforcement learning algorithms, including - DQN, DDQN, Dueling Network, DDPG, SAC, A2C, PPO, TRPO. (More algorithms are still in progress)
Rllte
⭐
403
Long-Term Evolution Project of Reinforcement Learning
Reinforcement Implementation
⭐
380
Implementation of benchmark RL algorithms
Deep_rl
⭐
372
PyTorch implementations of deep reinforcement learning algorithms
Lagom
⭐
365
lagom: A PyTorch infrastructure for rapid prototyping of reinforcement learning algorithms.
Imitation Learning
⭐
344
Imitation learning algorithms
Xuance
⭐
339
XuanCe: A Comprehensive and Unified Deep Reinforcement Learning Library
Halos
⭐
339
A library with extensible implementations of DPO, KTO, PPO, and other human-centered loss functions (HALOs).
Machine Learning Is All You Need
⭐
337
🔥🌟《Machine Learning 格物志》: ML + DL + RL basic codes and notes by sklearn, PyTorch, TensorFlow, Keras & the most important, from scratch!💪 This repository is ALL You Need!
Allenact
⭐
288
An open source framework for research in Embodied-AI from AI2.
Xingtian
⭐
282
xingtian is a componentized library for the development and verification of reinforcement learning algorithms
Lets Do Irl
⭐
269
Inverse RL algorithms (APP, MaxEnt, GAIL, VAIL)
Reinforcement_learning
⭐
250
Reinforcement learning tutorials
Pg_travel
⭐
243
Policy Gradient algorithms (REINFORCE, NPG, TRPO, PPO)
Rlgraph
⭐
241
RLgraph: Modular computation graphs for deep reinforcement learning
Llm Rlhf Tuning
⭐
225
LLM Tuning with PEFT (SFT+RM+PPO+DPO with LoRA)
Stable Baselines
⭐
221
Mirror of Stable-Baselines: a fork of OpenAI Baselines, implementations of reinforcement learning algorithms
Machin
⭐
206
Reinforcement learning library(framework) designed for PyTorch, implements DQN, DDPG, A2C, PPO, SAC, MADDPG, A3C, APEX, IMPALA ...
Landing A Spacex Falcon Heavy Rocket
⭐
195
This is the code for "Landing a SpaceX Falcon Heavy Rocket" By Siraj Raval on Youtube
Tf_deep_rl_trader
⭐
186
Trading Environment(OpenAI Gym) + PPO(TensorForce)
Tf2 Rl
⭐
178
Reinforcement learning algorithms implemented for Tensorflow 2.0+ [DQN, DDPG, AE-DDPG, SAC, PPO, Primal-Dual DDPG]
Phasic Policy Gradient
⭐
175
Code for the paper "Phasic Policy Gradient"
Chatglm Maths
⭐
142
chatglm-6b微调/LORA/PPO/推理, 样本为自动生成的整数/小数加减乘除运算, 可gpu/cpu
Deeprl Tensorflow2
⭐
139
🐋 Simple implementations of various popular Deep Reinforcement Learning algorithms using TensorFlow2
Train Procgen
⭐
134
Code for the paper "Leveraging Procedural Generation to Benchmark Reinforcement Learning"
Torch Ac
⭐
134
Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO
Pytorch Dppo
⭐
129
Pytorch implementation of Distributed Proximal Policy Optimization: https://arxiv.org/abs/1707.02286
Torchrl
⭐
127
Pytorch Implementation of Reinforcement Learning Algorithms ( Soft Actor Critic(SAC)/ DDPG / TD3 /DQN / A2C/ PPO / TRPO)
Doom Net Pytorch
⭐
121
Reinforcement learning models in ViZDoom environment
Deeprl_algorithms
⭐
112
DeepRL algorithms implementation easy for understanding and reading with Pytorch and Tensorflow 2(DQN, REINFORCE, VPG, A2C, TRPO, PPO, DDPG, TD3, SAC)
Rl Collision Avoidance
⭐
112
Implementation of the paper "Towards Optimally Decentralized Multi-Robot Collision Avoidance via Deep Reinforcement Learning"
Samsung Drl Code
⭐
105
A repository for implementations of deep reinforcement learning lectured at Samsung
Openai_five_vs_dota2_explained
⭐
101
This is the code for "OpenAI Five vs DOTA 2 Explained" By Siraj Raval on Youtube
Episodic Transformer Memory Ppo
⭐
99
Clean baseline implementation of PPO using an episodic TransformerXL memory
Ros2learn
⭐
98
ROS 2 enabled Machine Learning algorithms
Rl Examples
⭐
88
Examples of published reinforcement learning algorithms in recent literature implemented in TensorFlow
Safety Starter Agents
⭐
86
Basic constrained RL agents used in experiments for the "Benchmarking Safe Exploration in Deep Reinforcement Learning" paper.
Carla Ppo
⭐
86
This repository hosts a customized PPO based agent for Carla. The goal of this project is to make it easier to interact with and experiment in Carla with reinforcement learning based agents -- this, by wrapping Carla in a gym like environment that can handle custom reward functions, custom debug output, etc.
Fsrl
⭐
85
🚀 A fast safe reinforcement learning library in PyTorch
Human_aware_rl
⭐
84
Code for "On the Utility of Learning about Humans for Human-AI Coordination"
Sc2aibot
⭐
84
Implementing reinforcement-learning algorithms for pysc2 -environment
Run Skeleton Run
⭐
81
Reason8.ai PyTorch solution for NIPS RL 2017 challenge
Db Football
⭐
80
A Simple, Distributed and Asynchronous Multi-Agent Reinforcement Learning Framework for Google Research Football AI.
Drl_local_planner_ros_stable_baselines
⭐
73
Gail_ppo_tf
⭐
72
Tensorflow implementation of Generative Adversarial Imitation Learning(GAIL) with discrete action
Ppo
⭐
72
Proximal Policy Optimization implementation with TensorFlow
Tensorswarm
⭐
69
TensorSwarm: A framework for reinforcement learning of robot swarms.
Explorer
⭐
68
Explorer is a PyTorch reinforcement learning framework for exploring new ideas.
Chatglm Rlhf
⭐
68
对ChatGLM直接使用RLHF提升或降低目标输出概率|Modify ChatGLM output with only RLHF
Open Chatgpt
⭐
66
The open source implementation of ChatGPT, Alpaca, Vicuna and RLHF Pipeline. 从0开始实现一个ChatGPT.
Deep_rl_zoo
⭐
65
A collection of Deep Reinforcement Learning algorithms implemented with PyTorch to solve Atari games and classic control tasks like CartPole, LunarLander, and MountainCar.
Autonomous Driving In Carla Using Deep Reinforcement Learning
⭐
61
Deep Reinforcement Learning (PPO) in Autonomous Driving (Carla) [from scratch]
Code For Paper
⭐
59
Auto Drac
⭐
59
Automatic Data-Regularized Actor-Critic (Auto-DrAC)
Mario_rl
⭐
57
Imitation_learning
⭐
56
PyTorch implementation of some reinforcement learning algorithms: A2C, PPO, Behavioral Cloning from Observation (BCO), GAIL.
Tensorflow_rl
⭐
54
Rl Bot Football
⭐
54
An RL agent for the Google Football environment
Learning2run
⭐
53
Our NIPS 2017: Learning to Run source code
Model Free Algorithms
⭐
52
TD3, SAC, IQN, Rainbow, PPO, Ape-X and etc. in TF1.x
Occupancyanticipation
⭐
51
This repository contains code for our publication "Occupancy Anticipation for Efficient Exploration and Navigation" in ECCV 2020.
Learninghumanoidwalking
⭐
50
Training a humanoid robot for locomotion using Reinforcement Learning
Reinforcementlearning
⭐
48
Reinforcing Your Learning of Reinforcement Learning
Wu Uct
⭐
48
A novel parallel UCT algorithm with linear speedup and negligible performance loss.
Rl Experiments
⭐
47
High-quality implementations of deep reinforcement learning algorithms for experiments
Stanford Osrl
⭐
46
NIPS2017 challenge
Pytorch Learn Reinforcement Learning
⭐
42
A collection of various RL algorithms like policy gradients, DQN and PPO. The goal of this repo will be to make it a go-to resource for learning about RL. How to visualize, debug and solve RL problems. I've additionally included playground.py for learning more about OpenAI gym, etc.
Minimal Isaac Gym
⭐
42
A Minimal Example of Isaac Gym with DQN and PPO.
Relational_deep_reinforcement_learning
⭐
41
Ppo
⭐
40
PyTorch implementation of Proximal Policy Optimization
Ppocma
⭐
38
Llama Trl
⭐
38
LLaMA-TRL: Fine-tuning LLaMA with PPO and LoRA
Deepcomp
⭐
36
Dynamic multi-cell selection for cooperative multipoint (CoMP) using (multi-agent) deep reinforcement learning
Tstarbot1
⭐
36
Ppo Pytorch
⭐
35
Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)
Related Searches
Python Machine Learning (20,195)
Python Tensorflow (13,736)
Python Deep Learning (13,092)
Python Jupyter Notebook (12,976)
Python Network (11,495)
Python Video Game (10,254)
Python Algorithms (10,033)
Python Testing (9,339)
Python Artificial Intelligence (8,580)
Python Pytorch (7,877)
1-100 of 260 search results
Next >
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.