Awesome Open Source

Programming Languages

Search results for ppo

344 search results found

Explorer ⭐ 68

Explorer is a PyTorch reinforcement learning framework for exploring new ideas.

Open Chatgpt ⭐ 66

The open source implementation of ChatGPT, Alpaca, Vicuna and RLHF Pipeline. 从0开始实现一个ChatGPT.

Deep_rl_zoo ⭐ 65

A collection of Deep Reinforcement Learning algorithms implemented with PyTorch to solve Atari games and classic control tasks like CartPole, LunarLander, and MountainCar.

Stateadvdrl ⭐ 63

[NeurIPS 2020, Spotlight] Code for "Robust Deep Reinforcement Learning against Adversarial Perturbations on Observations"

Reinforcement_learning ⭐ 62

Deep Reinforcement Learning Algorithms implemented with Tensorflow 2.3

Autonomous Driving In Carla Using Deep Reinforcement Learning ⭐ 61

Deep Reinforcement Learning (PPO) in Autonomous Driving (Carla) [from scratch]

Code For Paper ⭐ 59

Auto Drac ⭐ 59

Automatic Data-Regularized Actor-Critic (Auto-DrAC)

Super Mario Bros Rl ⭐ 57

This project explores deep reinforcement learning, hybrid actor-critic approach with A3C/PPO combined with curiosity for playing Super Mario Bros

Deep Reinforcement Learning Applied To Doom ⭐ 57

DQN, DDDQN, A3C, PPO, Curiosity applied to the game DOOM

Mario_rl ⭐ 57

Imitation_learning ⭐ 56

PyTorch implementation of some reinforcement learning algorithms: A2C, PPO, Behavioral Cloning from Observation (BCO), GAIL.

Tensorflow_rl ⭐ 54

Rl Bot Football ⭐ 54

An RL agent for the Google Football environment

Pytorch Rl ⭐ 53

Learning2run ⭐ 53

Our NIPS 2017: Learning to Run source code

Model Free Algorithms ⭐ 52

TD3, SAC, IQN, Rainbow, PPO, Ape-X and etc. in TF1.x

Pensieve Ppo ⭐ 51

The simplest implementation of Pensieve (SIGCOMM' 17) via state-of-the-art RL algorithms, including PPO, DQN, and SAC

Occupancyanticipation ⭐ 51

This repository contains code for our publication "Occupancy Anticipation for Efficient Exploration and Navigation" in ECCV 2020.

Learninghumanoidwalking ⭐ 50

Training a humanoid robot for locomotion using Reinforcement Learning

A novel parallel UCT algorithm with linear speedup and negligible performance loss.

Reinforcementlearning ⭐ 48

Reinforcing Your Learning of Reinforcement Learning

Rl Experiments ⭐ 47

High-quality implementations of deep reinforcement learning algorithms for experiments

Stanford Osrl ⭐ 46

NIPS2017 challenge

Reinforcementlearningzoo.jl ⭐ 45

Unitytensorflowkeras ⭐ 43

Unity In Editor Deep Learning Tools. Using KerasSharp, TensorflowSharp, Unity MLAgent. In-Editor training and no python needed.

Minimal Isaac Gym ⭐ 42

A Minimal Example of Isaac Gym with DQN and PPO.

Pytorch Learn Reinforcement Learning ⭐ 42

A collection of various RL algorithms like policy gradients, DQN and PPO. The goal of this repo will be to make it a go-to resource for learning about RL. How to visualize, debug and solve RL problems. I've additionally included playground.py for learning more about OpenAI gym, etc.

Relational_deep_reinforcement_learning ⭐ 41

PyTorch implementation of Proximal Policy Optimization

Drl_shape_optimization ⭐ 40

Deep reinforcement learning to perform shape optimization

Ppo Pytorch ⭐ 38

Implementation of PPO in Pytorch

Llama Trl ⭐ 38

LLaMA-TRL: Fine-tuning LLaMA with PPO and LoRA

Ppo Lstm Parallel ⭐ 37

ppo-lstm-parallel

Rl_matrix ⭐ 36

Reinforcement Learning Agents in .NET

Deepcomp ⭐ 36

Dynamic multi-cell selection for cooperative multipoint (CoMP) using (multi-agent) deep reinforcement learning

Tstarbot1 ⭐ 36

MLSys Workshop NeurIPS 2023 - Redco: A Lightweight Tool to Automate Distributed Training and Inference

Policy Optimization with Penalized Point Probability Distance: an Alternative to Proximal Policy Optimization

Ppo Pytorch ⭐ 35

Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)

Realworldrl_suite ⭐ 35

Real-World RL Benchmark Suite

Ucmec_commag ⭐ 34

Simulation code and mathematic details of our paper in IEEE Communications Magazine: ''When the User-Centric Network Meets Mobile Edge Computing: Challenges and Optimization''

Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights on all environments.

Planetplanet ⭐ 33

A general photodynamical code for exoplanet light curves

Ppo Clip And Ppo Penalty On Atari Domain ⭐ 33

Implement PPO-clip and PPO-penalty on Atari, which is the only open source of PPO-penalty

Pytorch Ppo ⭐ 33

Proximal Policy Optimization in PyTorch

Ppo Pytorch ⭐ 32

Proximal policy optimization in PyTorch. Easy to read and understand.

Level Replay ⭐ 32

This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the fact that not all levels are equally useful for agents to learn from during training.

A RL approach to enable cost-effective, intelligent interactions between a local agent and a remote LLM

🪐 The Sebulba architecture to scale reinforcement learning on Cloud TPUs in JAX

Diversity−Driven Extensible Hierarchical Reinforcement Learning. AAAI 2019.

Deeprl Baselines ⭐ 31

Deep reinforcement learning baselines base on OpenAI. More algorithms are included, such as Rainbow: Combining Improvements in Deep Reinforcement Learning

Ppo Stein Control Variate ⭐ 29

Proximal Policy Optimization with Stein Control Variates:

PyRL - Reinforcement Learning Framework in Pytorch (Policy Gradient, DQN, DDPG, TD3, PPO, SAC, etc.)

A High Level Python Deep Reinforcement Learning library. Great for beginners, prototyping and quickly comparing algorithms

☔ Deep RL agents with PyTorch☔

Reinforcement_learning_with_pytorch ⭐ 27

Implement some algorithms of RL

A continuous deep reinforcement learning framework for robotics

Rl_pytorch ⭐ 27

Deep Reinforcement Learning Algorithms Implementation in PyTorch

Meta Reinforcement Learning ⭐ 26

Code snippets of Meta Reinforcement Learning algorithms

Rl Policies Attacks Defenses ⭐ 26

Adversarial attacks on Deep Reinforcement Learning (RL)

Retro_contest_agent ⭐ 26

Trading_gym ⭐ 25

a unified environment for supervised learning and reinforcement learning in the context of quantitative trading

Paramnoise ⭐ 25

A comparison of parameter space noise methods for exploration in deep reinforcement learning

国内第一个基于TensorFlow2.0、支持非gym环境训练、支持可视化配置的强化学习应用编程框架

Random network distillation on Montezuma's Revenge and Super Mario Bros.

Code implementing the CORE-RL algorithm with DDPG, PPO, and TRPO. See the paper "Control Regularization for Reduced Variance Reinforcement Learning" for additional details.

Hybrid Cp Rl Solver ⭐ 24

Combining Reinforcement Learning and Constraint Programming for Combinatorial Optimization

Distributed Ppo ⭐ 23

This is an pytorch implementation of Distributed Proximal Policy Optimization(DPPO).

Sonic_contest ⭐ 23

Source code for OpenAI Retro Contest for Sonic the Hedgehog

A graphical inteface for reinforcement learning and gym-based environments. Integrates tensorboard and various configuration utilities for ease of usage.

Football Paris ⭐ 23

The exact codes used by the team "liveinparis" at the kaggle football competition ranked 8th/1141

A modular deep reinforcement learning framework that supports a variety of algorithms, environments and models.

Curiosity Bottleneck ⭐ 22

Repository for our ICML 2019 paper: Curiosity-Bottleneck

Ai Traineree ⭐ 21

PyTorch agents and tools for (Deep) Reinforcement Learning

Pytorch_ppo_rl ⭐ 21

Chatglm Lora Rlhf Pytorch ⭐ 21

A full pipeline to finetune ChatGLM LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the ChatGLM architecture. Basically ChatGPT but with ChatGLM

Interpolated Policy Gradient With Ppo For Robotics Control ⭐ 20

Reinforcement Learning for robotics continuous control, mainly based on Proximal Policy Optimization, extending to Interpolated Policy Gradient and Hindsight Experience Replay (HER)

Code for CoRL 2019 paper: HRL4IN: Hierarchical Reinforcement Learning for Interactive Navigation with Mobile Manipulators

Rl Pytorch ⭐ 20

A beginner-friendly repository on Deep Reinforcement Learning (RL), written in PyTorch.

Flexible Reinforcement Learning Framework with PyTorch

Neural Dynamic Policies ⭐ 19

Gym Microrts Paper Sb3 ⭐ 19

RL agent to play μRTS with Stable-Baselines3 and PyTorch

A Population Based Reinforcement Learning Library based on PyTorch

Implementation of clipped action policy gradient (CAPG) with PPO and TRPO

Ppo Pytorch ⭐ 19

Network Randomization: A Simple Technique for Generalization in Deep Reinforcement Learning / ICLR 2020

Autoregressive policies for continuous control reinforcement learning

Deeprl Ppo Tutorial ⭐ 18

This repository contains tutorial material on Doing DeepRL with PPO in GDG DevFest 2017 Seoul.

Ppo Pytorch ⭐ 18

Pytorch Implementation of Proximal Policy Optimization Algorithm

Deep Reinforcement Learning Algorithm Collection ⭐ 17

Collection of Deep Reinforcement Learning Algorithms implemented in PyTorch.

Vicuna Lora Rlhf Pytorch ⭐ 17

A full pipeline to finetune Vicuna LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the Vicuna architecture. Basically ChatGPT but with Vicuna

Solving several OpenAI Gym and custom gazebo environments using reinforcement learning techniques.

Marathonenvsbaselines ⭐ 17

Experimental - using OpenAI baselines with MarathonEnvs (ML-Agents)

Pop Spiking Deep Rl ⭐ 17

DRL with population coded spiking neural network for optimal and energy-efficient continuous control.

Structured Object-Aware Physics Prediction for Video Modeling and Planning

Supervised_policy_update ⭐ 16

Code to reproduce Supervised Policy Update (ICLR 2019)

Implementation Matters ⭐ 15

Tf_practice ⭐ 15

TensorFlow 1.x Practice

Related Searches

Python Ppo (446)

Reinforcement Learning Ppo (221)

101-200 of 344 search results

Privacy | About | Terms | Follow Us On Twitter

Copyright 2018-2024 Awesome Open Source. All rights reserved.