Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for ppo
ppo
x
344 search results found
Explorer
⭐
68
Explorer is a PyTorch reinforcement learning framework for exploring new ideas.
Open Chatgpt
⭐
66
The open source implementation of ChatGPT, Alpaca, Vicuna and RLHF Pipeline. 从0开始实现一个ChatGPT.
Deep_rl_zoo
⭐
65
A collection of Deep Reinforcement Learning algorithms implemented with PyTorch to solve Atari games and classic control tasks like CartPole, LunarLander, and MountainCar.
Stateadvdrl
⭐
63
[NeurIPS 2020, Spotlight] Code for "Robust Deep Reinforcement Learning against Adversarial Perturbations on Observations"
Reinforcement_learning
⭐
62
Deep Reinforcement Learning Algorithms implemented with Tensorflow 2.3
Autonomous Driving In Carla Using Deep Reinforcement Learning
⭐
61
Deep Reinforcement Learning (PPO) in Autonomous Driving (Carla) [from scratch]
Code For Paper
⭐
59
Auto Drac
⭐
59
Automatic Data-Regularized Actor-Critic (Auto-DrAC)
Super Mario Bros Rl
⭐
57
This project explores deep reinforcement learning, hybrid actor-critic approach with A3C/PPO combined with curiosity for playing Super Mario Bros
Deep Reinforcement Learning Applied To Doom
⭐
57
DQN, DDDQN, A3C, PPO, Curiosity applied to the game DOOM
Mario_rl
⭐
57
Imitation_learning
⭐
56
PyTorch implementation of some reinforcement learning algorithms: A2C, PPO, Behavioral Cloning from Observation (BCO), GAIL.
Tensorflow_rl
⭐
54
Rl Bot Football
⭐
54
An RL agent for the Google Football environment
Pytorch Rl
⭐
53
Learning2run
⭐
53
Our NIPS 2017: Learning to Run source code
Model Free Algorithms
⭐
52
TD3, SAC, IQN, Rainbow, PPO, Ape-X and etc. in TF1.x
Pensieve Ppo
⭐
51
The simplest implementation of Pensieve (SIGCOMM' 17) via state-of-the-art RL algorithms, including PPO, DQN, and SAC
Occupancyanticipation
⭐
51
This repository contains code for our publication "Occupancy Anticipation for Efficient Exploration and Navigation" in ECCV 2020.
Learninghumanoidwalking
⭐
50
Training a humanoid robot for locomotion using Reinforcement Learning
Wu Uct
⭐
48
A novel parallel UCT algorithm with linear speedup and negligible performance loss.
Reinforcementlearning
⭐
48
Reinforcing Your Learning of Reinforcement Learning
Rl Experiments
⭐
47
High-quality implementations of deep reinforcement learning algorithms for experiments
Stanford Osrl
⭐
46
NIPS2017 challenge
Reinforcementlearningzoo.jl
⭐
45
Unitytensorflowkeras
⭐
43
Unity In Editor Deep Learning Tools. Using KerasSharp, TensorflowSharp, Unity MLAgent. In-Editor training and no python needed.
Minimal Isaac Gym
⭐
42
A Minimal Example of Isaac Gym with DQN and PPO.
Pytorch Learn Reinforcement Learning
⭐
42
A collection of various RL algorithms like policy gradients, DQN and PPO. The goal of this repo will be to make it a go-to resource for learning about RL. How to visualize, debug and solve RL problems. I've additionally included playground.py for learning more about OpenAI gym, etc.
Relational_deep_reinforcement_learning
⭐
41
Ppo
⭐
40
PyTorch implementation of Proximal Policy Optimization
Drl_shape_optimization
⭐
40
Deep reinforcement learning to perform shape optimization
Ppocma
⭐
38
Ppo Pytorch
⭐
38
Implementation of PPO in Pytorch
Llama Trl
⭐
38
LLaMA-TRL: Fine-tuning LLaMA with PPO and LoRA
Ppo Lstm Parallel
⭐
37
ppo-lstm-parallel
Rl_matrix
⭐
36
Reinforcement Learning Agents in .NET
Deepcomp
⭐
36
Dynamic multi-cell selection for cooperative multipoint (CoMP) using (multi-agent) deep reinforcement learning
Tstarbot1
⭐
36
Redco
⭐
35
MLSys Workshop NeurIPS 2023 - Redco: A Lightweight Tool to Automate Distributed Training and Inference
Pop3d
⭐
35
Policy Optimization with Penalized Point Probability Distance: an Alternative to Proximal Policy Optimization
Ppo Pytorch
⭐
35
Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)
Realworldrl_suite
⭐
35
Real-World RL Benchmark Suite
Ucmec_commag
⭐
34
Simulation code and mathematic details of our paper in IEEE Communications Magazine: ''When the User-Centric Network Meets Mobile Edge Computing: Challenges and Optimization''
Ppo_jax
⭐
33
Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights on all environments.
Planetplanet
⭐
33
A general photodynamical code for exoplanet light curves
Ppo Clip And Ppo Penalty On Atari Domain
⭐
33
Implement PPO-clip and PPO-penalty on Atari, which is the only open source of PPO-penalty
Pytorch Ppo
⭐
33
Proximal Policy Optimization in PyTorch
Ppo Pytorch
⭐
32
Proximal policy optimization in PyTorch. Easy to read and understand.
Level Replay
⭐
32
This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the fact that not all levels are equally useful for agents to learn from during training.
Llm4rl
⭐
32
A RL approach to enable cost-effective, intelligent interactions between a local agent and a remote LLM
Sebulba
⭐
31
🪐 The Sebulba architecture to scale reinforcement learning on Cloud TPUs in JAX
Dehrl
⭐
31
Diversity−Driven Extensible Hierarchical Reinforcement Learning. AAAI 2019.
Deeprl Baselines
⭐
31
Deep reinforcement learning baselines base on OpenAI. More algorithms are included, such as Rainbow: Combining Improvements in Deep Reinforcement Learning
Ppo Stein Control Variate
⭐
29
Proximal Policy Optimization with Stein Control Variates:
Pyrl
⭐
29
PyRL - Reinforcement Learning Framework in Pytorch (Policy Gradient, DQN, DDPG, TD3, PPO, SAC, etc.)
Drlkit
⭐
29
A High Level Python Deep Reinforcement Learning library. Great for beginners, prototyping and quickly comparing algorithms
Rainy
⭐
28
☔ Deep RL agents with PyTorch☔
Reinforcement_learning_with_pytorch
⭐
27
Implement some algorithms of RL
Apex
⭐
27
A continuous deep reinforcement learning framework for robotics
Rl_pytorch
⭐
27
Deep Reinforcement Learning Algorithms Implementation in PyTorch
Meta Reinforcement Learning
⭐
26
Code snippets of Meta Reinforcement Learning algorithms
Rl Policies Attacks Defenses
⭐
26
Adversarial attacks on Deep Reinforcement Learning (RL)
Retro_contest_agent
⭐
26
Trading_gym
⭐
25
a unified environment for supervised learning and reinforcement learning in the context of quantitative trading
Paramnoise
⭐
25
A comparison of parameter space noise methods for exploration in deep reinforcement learning
General
⭐
25
国内第一个基于TensorFlow2.0、支持非gym环境训练、支持可视化配置的强化学习应用编程框架
Ppo Rnd
⭐
24
Random network distillation on Montezuma's Revenge and Super Mario Bros.
Core Rl
⭐
24
Code implementing the CORE-RL algorithm with DDPG, PPO, and TRPO. See the paper "Control Regularization for Reduced Variance Reinforcement Learning" for additional details.
Hybrid Cp Rl Solver
⭐
24
Combining Reinforcement Learning and Constraint Programming for Combinatorial Optimization
Distributed Ppo
⭐
23
This is an pytorch implementation of Distributed Proximal Policy Optimization(DPPO).
Sonic_contest
⭐
23
Source code for OpenAI Retro Contest for Sonic the Hedgehog
Stadium
⭐
23
A graphical inteface for reinforcement learning and gym-based environments. Integrates tensorboard and various configuration utilities for ease of usage.
Football Paris
⭐
23
The exact codes used by the team "liveinparis" at the kaggle football competition ranked 8th/1141
Angela
⭐
22
A modular deep reinforcement learning framework that supports a variety of algorithms, environments and models.
Curiosity Bottleneck
⭐
22
Repository for our ICML 2019 paper: Curiosity-Bottleneck
Ai Traineree
⭐
21
PyTorch agents and tools for (Deep) Reinforcement Learning
Pytorch_ppo_rl
⭐
21
Chatglm Lora Rlhf Pytorch
⭐
21
A full pipeline to finetune ChatGLM LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the ChatGLM architecture. Basically ChatGPT but with ChatGLM
Interpolated Policy Gradient With Ppo For Robotics Control
⭐
20
Reinforcement Learning for robotics continuous control, mainly based on Proximal Policy Optimization, extending to Interpolated Policy Gradient and Hindsight Experience Replay (HER)
Hrl4in
⭐
20
Code for CoRL 2019 paper: HRL4IN: Hierarchical Reinforcement Learning for Interactive Navigation with Mobile Manipulators
Rl Pytorch
⭐
20
A beginner-friendly repository on Deep Reinforcement Learning (RL), written in PyTorch.
Agnes
⭐
20
Flexible Reinforcement Learning Framework with PyTorch
Neural Dynamic Policies
⭐
19
Gym Microrts Paper Sb3
⭐
19
RL agent to play μRTS with Stable-Baselines3 and PyTorch
Pbrl
⭐
19
A Population Based Reinforcement Learning Library based on PyTorch
Capg
⭐
19
Implementation of clipped action policy gradient (CAPG) with PPO and TRPO
Ppo Pytorch
⭐
19
Netrand
⭐
19
Network Randomization: A Simple Technique for Generalization in Deep Reinforcement Learning / ICLR 2020
Arp
⭐
19
Autoregressive policies for continuous control reinforcement learning
Deeprl Ppo Tutorial
⭐
18
This repository contains tutorial material on Doing DeepRL with PPO in GDG DevFest 2017 Seoul.
Ppo Pytorch
⭐
18
Pytorch Implementation of Proximal Policy Optimization Algorithm
Deep Reinforcement Learning Algorithm Collection
⭐
17
Collection of Deep Reinforcement Learning Algorithms implemented in PyTorch.
Vicuna Lora Rlhf Pytorch
⭐
17
A full pipeline to finetune Vicuna LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the Vicuna architecture. Basically ChatGPT but with Vicuna
Rl_gym
⭐
17
Solving several OpenAI Gym and custom gazebo environments using reinforcement learning techniques.
Marathonenvsbaselines
⭐
17
Experimental - using OpenAI baselines with MarathonEnvs (ML-Agents)
Pop Spiking Deep Rl
⭐
17
DRL with population coded spiking neural network for optimal and energy-efficient continuous control.
Stove
⭐
16
Structured Object-Aware Physics Prediction for Video Modeling and Planning
Supervised_policy_update
⭐
16
Code to reproduce Supervised Policy Update (ICLR 2019)
Implementation Matters
⭐
15
Tf_practice
⭐
15
TensorFlow 1.x Practice
Related Searches
Python Ppo (446)
Reinforcement Learning Ppo (221)
101-200 of 344 search results
< Previous
Next >
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.