Awesome Open Source

Search results for pytorch reinforcement learning

486 search results found

Annotated_deep_learning_paper_implementations ⭐ 41,877

🧑‍🏫 60 implementations/tutorials of deep learning papers with side-by-side notes 📝, including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans (cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠

Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

D2l En ⭐ 20,613

Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.

Fingpt ⭐ 10,376

Data-Centric FinGPT. Open-source for open finance! Revolutionize 🔥 We release the trained model on HuggingFace.

Wandb ⭐ 8,204

🔥 A tool for visualizing and tracking your machine learning experiments. This repo contains the CLI and Python API.

Pytorch Tutorial ⭐ 7,372

Build your neural network easily and quickly. Chinese-language tutorials from 莫烦Python (Morvan Python).

Stable Baselines3 ⭐ 7,292

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

Practical_rl ⭐ 5,572

A course in reinforcement learning in the wild

Deep Reinforcement Learning ⭐ 4,635

Repo for the Deep Reinforcement Learning Nanodegree program

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

Cleanrl ⭐ 3,947

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Alpha Zero General ⭐ 3,497

A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more

Pytorch A2c Ppo Acktr Gail ⭐ 3,450

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).

Polyaxon ⭐ 3,438

MLOps Tools For Managing & Orchestrating The Machine Learning LifeCycle

Elegantrl ⭐ 3,229

Massively Parallel Deep Reinforcement Learning. 🔥

Catalyst ⭐ 3,151

Accelerated deep learning R&D

Alphazero_gomoku ⭐ 2,427

An implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row)

Minimalrl ⭐ 2,417

Implementations of basic RL algorithms with minimal lines of code! (PyTorch-based)

Muzero General ⭐ 2,203

Iccv2019 Learningtopaint ⭐ 2,125

ICCV2019 - Learning to Paint With Model-based Deep Reinforcement Learning

Ml Course ⭐ 1,936

Open Machine Learning course

Rlpyt ⭐ 1,709

Reinforcement Learning in PyTorch

A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.

Rl Baselines3 Zoo ⭐ 1,640

A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.

Rainbow Is All You Need ⭐ 1,637

Rainbow is all you need! A step-by-step tutorial from DQN to Rainbow

A lightweight library for PyTorch training tools and utilities

Bindsnet ⭐ 1,422

Simulation of spiking neural networks (SNNs) using PyTorch.

Andrew Ng Notes ⭐ 1,367

Andrew Ng's Coursera handwritten notes.

Ppo Pytorch ⭐ 1,270

Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch

Machine Learning Curriculum ⭐ 1,065

💻 Learn to make machines learn so that you don't have to struggle to program them; The ultimate list

Slm Lab ⭐ 1,052

Modular Deep Reinforcement Learning framework in PyTorch. Companion library of the book "Foundations of Deep Reinforcement Learning".

Evotorch ⭐ 941

Advanced evolutionary computation library built directly on top of PyTorch, created at NNAISENSE.

Trademaster ⭐ 912

TradeMaster is an open-source platform for quantitative trading empowered by reinforcement learning 🔥 ⚡ 🌈

Omnisafe ⭐ 831

OmniSafe is an infrastructural framework for accelerating SafeRL research.

Rl Book ⭐ 794

Source code for the book "Reinforcement Learning: Theory and Python Implementation"

Pytorch A3c ⭐ 768

PyTorch implementation of Asynchronous Advantage Actor Critic (A3C) from "Asynchronous Methods for Deep Reinforcement Learning".

Lightzero ⭐ 767

[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios

Mushroom Rl ⭐ 765

Python library for Reinforcement Learning.

Super Mario Bros A3c Pytorch ⭐ 735

Asynchronous Advantage Actor-Critic (A3C) algorithm for Super Mario Bros

Deeprl Tutorials ⭐ 726

Contains high quality implementations of Deep Reinforcement Learning algorithms written in PyTorch

Autokernel ⭐ 724

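AutoKernel is a simple, easy-to-use, low-barrier automatic operator optimization tool that improves the deployment efficiency of deep learning algorithms.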

Pytorch Rl ⭐ 703

Deep Reinforcement Learning with PyTorch & visdom

Super Mario Bros Ppo Pytorch ⭐ 692

Proximal Policy Optimization (PPO) algorithm for Super Mario Bros

Pytorch Rl ⭐ 638

PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.

Fast_abs_rl ⭐ 622

Code for ACL 2018 paper: "Fast Abstractive Summarization with Reinforce-Selected Sentence Rewriting. Chen and Bansal"

Autonomous Learning Library ⭐ 616

A PyTorch library for building deep reinforcement learning agents.

Rl_games ⭐ 603

RL implementations

Overcooked_ai ⭐ 593

A benchmark environment for fully cooperative human-AI performance.

Rl_a3c_pytorch ⭐ 535

A3C LSTM Atari with Pytorch plus A3G design

Torch Light ⭐ 532

Deep learning with PyTorch. Basic neural networks such as logistic regression, CNN, RNN, and LSTM, plus some examples implemented with more complex models.

Implementation of ChatGPT RLHF (Reinforcement Learning with Human Feedback) on any generation model in huggingface's transformer (bloomz-176B/bloom/gpt/bart/T5/MetaICL)

Pytorch Blender ⭐ 510

💦 Seamless, distributed, real-time integration of Blender into PyTorch data pipelines

Complete Life Cycle Of A Data Science Project ⭐ 499

Complete-Life-Cycle-of-a-Data-Science-Project

Di Drive ⭐ 498

Decision Intelligence Platform for Autonomous Driving simulation.

Unified Reinforcement Learning Framework

Deep Reinforcement Learning Algorithms With Pytorch ⭐ 484

Clean, Robust, and Unified PyTorch implementation of popular DRL Algorithms (Q-learning, Duel DDQN, PER, C51, PPO, DDPG, TD3, SAC, ASL)

Awesome Deep Rl ⭐ 477

A curated list of awesome Deep Reinforcement Learning resources.

Rl_algorithms ⭐ 470

Structural implementation of RL key algorithms

Agilerl ⭐ 457

Streamlining reinforcement learning with RLOps

Robotics Rl Srl ⭐ 455

S-RL Toolbox: Reinforcement Learning (RL) and State Representation Learning (SRL) for Robotics

Warp Drive ⭐ 427

Extremely Fast End-to-End Deep Multi-Agent Reinforcement Learning Framework on a GPU (JMLR 2022)

Ppo For Beginners ⭐ 427

A simple and well styled PPO implementation. Based on my Medium series: https://medium.com/@eyyu/coding-ppo-from-scratch-w

Grokking Deep Reinforcement Learning

Pytorch Soft Actor Critic ⭐ 414

PyTorch implementation of soft actor critic

Flappy Bird Deep Q Learning Pytorch ⭐ 407

Deep Q-learning for playing the Flappy Bird game

Tetris Deep Q Learning Pytorch ⭐ 404

Deep Q-learning for playing the Tetris game

Papers In 100 Lines Of Code ⭐ 395

Implementation of papers in 100 lines of code.

Pytorch Drl ⭐ 387

PyTorch implementations of various Deep Reinforcement Learning (DRL) algorithms for both single agent and multi-agent.

Stable Baselines3 Contrib ⭐ 384

Contrib package for Stable-Baselines3 - Experimental reinforcement learning (RL) code

A PyTorch reinforcement learning library for generalizable and reproducible algorithm implementations with an aim to improve accessibility in RL

DrQ: Data regularized Q

lagom: A PyTorch infrastructure for rapid prototyping of reinforcement learning algorithms.

Reinforcement Learning for real-time applications - host of the TrackMania Roborace League

Pytorch Rl ⭐ 356

This repository contains model-free deep reinforcement learning algorithms implemented in PyTorch

World Models ⭐ 347

Reimplementation of World Models (Ha and Schmidhuber 2018) in PyTorch

Pytorch Vsumm Reinforce ⭐ 342

Unsupervised video summarization with deep reinforcement learning. AAAI'18.

XuanCe: A Comprehensive and Unified Deep Reinforcement Learning Library

Repository for Open Source Reinforcement Learning Framework JORLDY

Reinforcement Learning Algorithms Based on PyTorch

Rltrader ⭐ 324

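Deep learning and reinforcement learning stock investing with Python and Keras: an introduction to cutting-edge solutions for quant investing and algorithmic trading (revised edition)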

Betty: an automatic differentiation library for generalized meta-learning and multilevel optimization

Reinforced Recommendation toolkit built around PyTorch 1.7

Pytorch Cpp Rl ⭐ 308

PyTorch C++ Reinforcement Learning

Pytorch Ddpg Naf ⭐ 304

Implementation of algorithms for continuous control (DDPG and NAF).

Rl Exploration Baselines ⭐ 279

RLeXplore provides stable baselines of exploration methods in reinforcement learning, such as intrinsic curiosity module (ICM), random network distillation (RND) and rewarding impact-driven exploration (RIDE).

Handyrl ⭐ 278

HandyRL is a handy and simple framework based on Python and PyTorch for distributed reinforcement learning that is applicable to your own environments.

Velocity in deep-learning research

Self Driving Truck ⭐ 271

Self-Driving Truck in Euro Truck Simulator 2, trained via Reinforcement Learning

Modular reinforcement learning library (on PyTorch and JAX) with support for NVIDIA Isaac Gym, Isaac Orbit and Omniverse Isaac Gym

Pytorch Reinforce ⭐ 261

PyTorch Implementation of REINFORCE for both discrete & continuous control

Seq2seq Summarizer ⭐ 259

Pointer-generator reinforced seq2seq summarization in PyTorch

Neural Combinatorial Rl Pytorch ⭐ 257

PyTorch implementation of Neural Combinatorial Optimization with Reinforcement Learning https://arxiv.org/abs/1611.09940

A PyTorch implementation of "Graph Classification Using Structural Attention" (KDD 2018).

Dreamerv3 Torch ⭐ 254

Implementation of Dreamer v3 in PyTorch.

Multihopkg ⭐ 252

Multi-hop knowledge graph reasoning learned via policy gradient with reward shaping and action dropout

Pytorch Trpo ⭐ 252

PyTorch implementation of Trust Region Policy Optimization

Rlcycle ⭐ 242

A library for ready-made reinforcement learning agents and reusable components for neat prototyping

Rlgraph ⭐ 241

RLgraph: Modular computation graphs for deep reinforcement learning

Pytorch_sac ⭐ 232

PyTorch implementation of Soft Actor-Critic (SAC)

Thought Cloning ⭐ 217

[NeurIPS '23 Spotlight] Thought Cloning: Learning to Think while Acting by Imitating Human Thinking

Related Searches

Python Pytorch (16,596)

Deep Learning Pytorch (6,390)

Jupyter Notebook Pytorch (4,892)

Machine Learning Pytorch (2,934)

Python Reinforcement Learning (2,612)

Dataset Pytorch (1,848)

Pytorch Convolutional Neural Networks (1,777)

Pytorch Neural Network (1,391)

Pytorch Computer Vision (1,361)

Tensorflow Pytorch (1,312)

1-100 of 486 search results

Copyright 2018-2024 Awesome Open Source. All rights reserved.