Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for python offline reinforcement learning
offline-reinforcement-learning
x
python
x
25 search results found
Corl
⭐
769
High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC
Offlinerl Kit
⭐
165
An elegant PyTorch offline reinforcement learning library for researchers.
Offlinerl
⭐
124
A collection of offline reinforcement learning algorithms. This is a mirror repo from https://agit.ai/Polixir/OfflineRL
Cql
⭐
72
PyTorch implementation of the Offline Reinforcement Learning algorithm CQL. Includes the versions DQN-CQL and SAC-CQL for discrete and continuous action spaces.
Neorl
⭐
70
This is a mirror of https://agit.ai/Polixir/NeoRL.git
Latentplan
⭐
60
Code release for Efficient Planning in a Compact Latent Action Space (ICLR2023) https://arxiv.org/abs/2208.10291.
Og Marl
⭐
59
Datasets with baselines for offline multi-agent reinforcement learning 🤖
Min Decision Transformer
⭐
47
Minimal implementation of Decision Transformer: Reinforcement Learning via Sequence Modeling in PyTorch for mujoco control tasks in OpenAI gym
Focal Iclr
⭐
46
Code for FOCAL Paper Published at ICLR 2021
Por
⭐
41
Author's implementation of POR in "A Policy-Guided Imitation Approach for Offline Reinforcement Learning"
Edac
⭐
38
Official PyTorch implementation of "Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble" (NeurIPS'21)
Sac N Jax
⭐
36
Single-file SAC-N implementation on jax with flax and equinox. 10x faster than pytorch
Loopquest
⭐
26
A Production Tool for Embodied AI
Rewardshifting
⭐
20
Code for paper Exploiting Reward Shifting in Value-Based Deep RL
Maple
⭐
20
The Official Code for Offline Model-based Adaptable Policy Learning (NeurIPS'21 & TPAMI)
Fisor
⭐
19
[ICLR 2024] The official implementation of "Safe Offline Reinforcement Learning with Feasibility-Guided Diffusion Model"
Dwbc
⭐
19
Author's implementation of DWBC in "Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations"
Rosmo
⭐
19
Code for "Efficient Offline Policy Optimization with a Learned Model", ICLR2023
Stochastic Muzero
⭐
14
Pytorch Implementation of Stochastic MuZero for gym environment. This algorithm is capable of supporting a wide range of action and observation spaces, including both discrete and continuous variations.
Omiga
⭐
13
The official implementation of "Offline Multi-Agent Reinforcement Learning with Implicit Global-to-Local Value Regularization" (NeurIPS 2023)
Lb Sac
⭐
11
Official implementation for "Q-Ensemble for Offline RL: Don't Scale the Ensemble, Scale the Batch Size", NeurIPS 2022, Offline RL Workshop
Iql Pytorch
⭐
9
Unofficial PyTorch implementation (replicating paper results) of Implicit Q-Learning (In-sample Q-Learning) for offline RL
Rorl
⭐
8
Code for NeurIPS 2022 paper "Robust offline Reinforcement Learning via Conservative Smoothing"
Dppo
⭐
7
Official implementation of "Direct Preference-based Policy Optimization without Reward Modeling" (NeurIPS 2023)
Awgcsl
⭐
5
Code for ICLR 2022 paper Rethinking Goal-Conditioned Supervised Learning and Its Connection to Offline RL.
Related Searches
Python Django (28,897)
Python Machine Learning (20,195)
Python Flask (17,643)
Python Dataset (14,792)
Python Docker (14,113)
Python Tensorflow (13,736)
Python Command Line (13,351)
Python Deep Learning (13,092)
Python Jupyter Notebook (12,976)
Python Network (11,495)
1-25 of 25 search results
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.