Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for pytorch reinforcement learning
pytorch
x
reinforcement-learning
x
486 search results found
Annotated_deep_learning_paper_implementations
โญย
41,877
๐งโ๐ซ 60 Implementations/tutorials of deep learning papers with side-by-side notes ๐; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), ๐ฎ reinforcement learning (ppo, dqn), capsnet, distillation, ... ๐ง
Ray
โญย
29,596
Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
D2l En
โญย
20,613
Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.
Fingpt
โญย
10,376
Data-Centric FinGPT. Open-source for open finance! Revolutionize ๐ฅ We release the trained model on HuggingFace.
Wandb
โญย
8,204
๐ฅ A tool for visualizing and tracking your machine learning experiments. This repo contains the CLI and Python API.
Pytorch Tutorial
โญย
7,372
Build your neural network easy and fast, ่ซ็ฆPythonไธญๆๆๅญฆ
Stable Baselines3
โญย
7,292
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
Practical_rl
โญย
5,572
A course in reinforcement learning in the wild
Deep Reinforcement Learning
โญย
4,635
Repo for the Deep Reinforcement Learning Nanodegree program
Trlx
โญย
4,155
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
Cleanrl
โญย
3,947
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
Alpha Zero General
โญย
3,497
A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more
Pytorch A2c Ppo Acktr Gail
โญย
3,450
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
Polyaxon
โญย
3,438
MLOps Tools For Managing & Orchestrating The Machine Learning LifeCycle
Elegantrl
โญย
3,229
Massively Parallel Deep Reinforcement Learning. ๐ฅ
Catalyst
โญย
3,151
Accelerated deep learning R&D
Alphazero_gomoku
โญย
2,427
An implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row)
Minimalrl
โญย
2,417
Implementations of basic RL algorithms with minimal lines of codes! (pytorch based)
Muzero General
โญย
2,203
MuZero
Iccv2019 Learningtopaint
โญย
2,125
ICCV2019 - Learning to Paint With Model-based Deep Reinforcement Learning
Ml Course
โญย
1,936
Open Machine Learning course
Rlpyt
โญย
1,709
Reinforcement Learning in PyTorch
Rl
โญย
1,658
A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.
Rl Baselines3 Zoo
โญย
1,640
A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.
Rainbow Is All You Need
โญย
1,637
Rainbow is all you need! A step-by-step tutorial from DQN to Rainbow
Tnt
โญย
1,598
A lightweight library for PyTorch training tools and utilities
Bindsnet
โญย
1,422
Simulation of spiking neural networks (SNNs) using PyTorch.
Andrew Ng Notes
โญย
1,367
This is Andrew NG Coursera Handwritten Notes.
Ppo Pytorch
โญย
1,270
Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch
Machine Learning Curriculum
โญย
1,065
๐ป Learn to make machines learn so that you don't have to struggle to program them; The ultimate list
Slm Lab
โญย
1,052
Modular Deep Reinforcement Learning framework in PyTorch. Companion library of the book "Foundations of Deep Reinforcement Learning".
Evotorch
โญย
941
Advanced evolutionary computation library built directly on top of PyTorch, created at NNAISENSE.
Trademaster
โญย
912
TradeMaster is an open-source platform for quantitative trading empowered by reinforcement learning ๐ฅ โก ๐
Omnisafe
โญย
831
OmniSafe is an infrastructural framework for accelerating SafeRL research.
Rl Book
โญย
794
Source codes for the book "Reinforcement Learning: Theory and Python Implementation"
Pytorch A3c
โญย
768
PyTorch implementation of Asynchronous Advantage Actor Critic (A3C) from "Asynchronous Methods for Deep Reinforcement Learning".
Lightzero
โญย
767
[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios
Mushroom Rl
โญย
765
Python library for Reinforcement Learning.
Super Mario Bros A3c Pytorch
โญย
735
Asynchronous Advantage Actor-Critic (A3C) algorithm for Super Mario Bros
Deeprl Tutorials
โญย
726
Contains high quality implementations of Deep Reinforcement Learning algorithms written in PyTorch
Autokernel
โญย
724
AutoKernel ๆฏไธไธช็ฎๅๆ็จ๏ผไฝ้จๆง็่ชๅจ็ฎๅญไผๅๅทฅๅ ท๏ผๆ้ซๆทฑๅบฆๅญฆไน ็ฎๆณ้จ็ฝฒๆ็ใ
Pytorch Rl
โญย
703
Deep Reinforcement Learning with pytorch & visdom
Super Mario Bros Ppo Pytorch
โญย
692
Proximal Policy Optimization (PPO) algorithm for Super Mario Bros
Pytorch Rl
โญย
638
PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.
Fast_abs_rl
โญย
622
Code for ACL 2018 paper: "Fast Abstractive Summarization with Reinforce-Selected Sentence Rewriting. Chen and Bansal"
Autonomous Learning Library
โญย
616
A PyTorch library for building deep reinforcement learning agents.
Rl_games
โญย
603
RL implementations
Overcooked_ai
โญย
593
A benchmark environment for fully cooperative human-AI performance.
Rl_a3c_pytorch
โญย
535
A3C LSTM Atari with Pytorch plus A3G design
Torch Light
โญย
532
Deep-learning by using Pytorch. Basic nns like Logistic, CNN, RNN, LSTM and some examples are implemented by complex model.
Textrl
โญย
513
Implementation of ChatGPT RLHF (Reinforcement Learning with Human Feedback) on any generation model in huggingface's transformer (blommz-176B/bloom/gpt/bart/T5/MetaICL)
Pytorch Blender
โญย
510
๐ฆ Seamless, distributed, real-time integration of Blender into PyTorch data pipelines
Complete Life Cycle Of A Data Science Project
โญย
499
Complete-Life-Cycle-of-a-Data-Science-Project
Di Drive
โญย
498
Decision Intelligence Platform for Autonomous Driving simulation.
Openrl
โญย
496
Unified Reinforcement Learning Framework
Deep Reinforcement Learning Algorithms With Pytorch
โญย
484
Clean, Robust, and Unified PyTorch implementation of popular DRL Algorithms (Q-learning, Duel DDQN, PER, C51, PPO, DDPG, TD3, SAC, ASL)
Awesome Deep Rl
โญย
477
A curated list of awesome Deep Reinforcement Learning resources.
Rl_algorithms
โญย
470
Structural implementation of RL key algorithms
Agilerl
โญย
457
Streamlining reinforcement learning with RLOps
Robotics Rl Srl
โญย
455
S-RL Toolbox: Reinforcement Learning (RL) and State Representation Learning (SRL) for Robotics
Warp Drive
โญย
427
Extremely Fast End-to-End Deep Multi-Agent Reinforcement Learning Framework on a GPU (JMLR 2022)
Ppo For Beginners
โญย
427
A simple and well styled PPO implementation. Based on my Medium series: https://medium.com/@eyyu/coding-ppo-from-scratch-w
Gdrl
โญย
422
Grokking Deep Reinforcement Learning
Pytorch Soft Actor Critic
โญย
414
PyTorch implementation of soft actor critic
Flappy Bird Deep Q Learning Pytorch
โญย
407
Deep Q-learning for playing flappy bird game
Tetris Deep Q Learning Pytorch
โญย
404
Deep Q-learning for playing tetris game
Papers In 100 Lines Of Code
โญย
395
Implementation of papers in 100 lines of code.
Pytorch Drl
โญย
387
PyTorch implementations of various Deep Reinforcement Learning (DRL) algorithms for both single agent and multi-agent.
Stable Baselines3 Contrib
โญย
384
Contrib package for Stable-Baselines3 - Experimental reinforcement learning (RL) code
Genrl
โญย
375
A PyTorch reinforcement learning library for generalizable and reproducible algorithm implementations with an aim to improve accessibility in RL
Drq
โญย
369
DrQ: Data regularized Q
Lagom
โญย
365
lagom: A PyTorch infrastructure for rapid prototyping of reinforcement learning algorithms.
Tmrl
โญย
361
Reinforcement Learning for real-time applications - host of the TrackMania Roborace League
Pytorch Rl
โญย
356
This repository contains model-free deep reinforcement learning algorithms implemented in Pytorch
World Models
โญย
347
Reimplementation of World-Models (Ha and Schmidhuber 2018) in pytorch
Pytorch Vsumm Reinforce
โญย
342
Unsupervised video summarization with deep reinforcement learning. AAAI'18.
Xuance
โญย
339
XuanCe: A Comprehensive and Unified Deep Reinforcement Learning Library
Jorldy
โญย
333
Repository for Open Source Reinforcement Learning Framework JORLDY
Rls
โญย
330
Reinforcement Learning Algorithms Based on PyTorch
Rltrader
โญย
324
ํ์ด์ฌ๊ณผ ์ผ๋ผ์ค๋ฅผ ์ด์ฉํ ๋ฅ๋ฌ๋/๊ฐํํ์ต ์ฃผ์ํฌ์ - ํํธ ํฌ์, ์๊ณ ๋ฆฌ์ฆ ํธ๋ ์ด๋ฉ์ ์ํ ์ต์ฒจ๋จ ํด๋ฒ ์ ๋ฌธ (๊ฐ์ ํ)
Betty
โญย
316
Betty: an automatic differentiation library for generalized meta-learning and multilevel optimization
Recnn
โญย
313
Reinforced Recommendation toolkit build around pytorch 1.7
Pytorch Cpp Rl
โญย
308
PyTorch C++ Reinforcement Learning
Pytorch Ddpg Naf
โญย
304
Implementation of algorithms for continuous control (DDPG and NAF).
Rl Exploration Baselines
โญย
279
RLeXplore provides stable baselines of exploration methods in reinforcement learning, such as intrinsic curiosity module (ICM), random network distillation (RND) and rewarding impact-driven exploration (RIDE).
Handyrl
โญย
278
HandyRL is a handy and simple framework based on Python and PyTorch for distributed reinforcement learning that is applicable to your own environments.
Vel
โญย
273
Velocity in deep-learning research
Self Driving Truck
โญย
271
Self-Driving Truck in Euro Truck Simulator 2, trained via Reinforcement Learning
Skrl
โญย
263
Modular reinforcement learning library (on PyTorch and JAX) with support for NVIDIA Isaac Gym, Isaac Orbit and Omniverse Isaac Gym
Pytorch Reinforce
โญย
261
PyTorch Implementation of REINFORCE for both discrete & continuous control
Seq2seq Summarizer
โญย
259
Pointer-generator reinforced seq2seq summarization in PyTorch
Neural Combinatorial Rl Pytorch
โญย
257
PyTorch implementation of Neural Combinatorial Optimization with Reinforcement Learning https://arxiv.org/abs/1611.09940
Gam
โญย
257
A PyTorch implementation of "Graph Classification Using Structural Attention" (KDD 2018).
Dreamerv3 Torch
โญย
254
Implementation of Dreamer v3 in pytorch.
Multihopkg
โญย
252
Multi-hop knowledge graph reasoning learned via policy gradient with reward shaping and action dropout
Pytorch Trpo
โญย
252
PyTorch implementation of Trust Region Policy Optimization
Rlcycle
โญย
242
A library for ready-made reinforcement learning agents and reusable components for neat prototyping
Rlgraph
โญย
241
RLgraph: Modular computation graphs for deep reinforcement learning
Pytorch_sac
โญย
232
PyTorch implementation of Soft Actor-Critic (SAC)
Thought Cloning
โญย
217
[NeurIPS '23 Spotlight] Thought Cloning: Learning to Think while Acting by Imitating Human Thinking
Related Searches
Python Pytorch (16,596)
Deep Learning Pytorch (6,390)
Jupyter Notebook Pytorch (4,892)
Machine Learning Pytorch (2,934)
Python Reinforcement Learning (2,612)
Dataset Pytorch (1,848)
Pytorch Convolutional Neural Networks (1,777)
Pytorch Neural Network (1,391)
Pytorch Computer Vision (1,361)
Tensorflow Pytorch (1,312)
1-100 of 486 search results
Next >
Privacy
ย |ย
About
ย |ย
Terms
ย |ย
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source.ย All rights reserved.