Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for python reinforcement learning
python
x
reinforcement-learning
x
1,699 search results found
Ray
⭐
29,596
Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
D2l En
⭐
21,912
Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.
Numpy Ml
⭐
14,162
Machine learning, in numpy
Tensor2tensor
⭐
13,701
Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.
Reinforcement Learning An Introduction
⭐
12,490
Python Implementation of Reinforcement Learning: An Introduction
Tensorflow Tutorials
⭐
8,644
TensorFlow Tutorials with YouTube Videos
Reinforcement Learning With Tensorflow
⭐
8,174
Simple Reinforcement learning tutorials, 莫烦Python 中文AI教学
Machine_learning_examples
⭐
7,861
A collection of machine learning examples and tutorials.
Trax
⭐
7,818
Trax — Deep Learning with Clear Code and Speed
Palm Rlhf Pytorch
⭐
7,496
Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM
Pytorch Tutorial
⭐
7,372
Build your neural network easy and fast, 莫烦Python中文教学
Tensorlayer
⭐
7,348
Deep Learning and Reinforcement Learning Library for Scientists and Engineers
Stable Baselines3
⭐
7,292
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
Keras Rl
⭐
5,348
Deep Reinforcement Learning for Keras.
Gymnasium
⭐
4,828
An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)
Open_spiel
⭐
4,446
OpenSpiel is a collection of environments and algorithms for research in general reinforcement learning and search/planning in games.
Trlx
⭐
4,155
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
Cleanrl
⭐
3,947
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
Tensorwatch
⭐
3,333
Debugging, monitoring and visualization for Python Machine Learning and Data Science
Douzero
⭐
3,241
[ICML 2021] DouZero: Mastering DouDizhu with Self-Play Deep Reinforcement Learning | 斗地主AI
Football
⭐
3,177
Check out the new game server:
Catalyst
⭐
3,151
Accelerated deep learning R&D
Deep Learning Roadmap
⭐
3,139
📡 Organized Resources for Deep Learning Researchers and Developers
Reinforcement Learning
⭐
3,119
Minimal and Clean Reinforcement Learning Examples
Trfl
⭐
3,076
TensorFlow Reinforcement Learning
Stable Baselines
⭐
3,064
A fork of OpenAI Baselines, implementations of reinforcement learning algorithms
Easy Tensorflow
⭐
2,875
Simple and comprehensive tutorials in TensorFlow
Ai Optimizer
⭐
2,733
The next generation deep reinforcement learning tookit
Data Science Best Resources
⭐
2,718
Carefully curated resource links for data science in one place
Agents
⭐
2,658
TF-Agents: A reliable, scalable and easy to use TensorFlow library for Contextual Bandits and Reinforcement Learning.
Ml Course
⭐
2,447
Open Machine Learning course
Alphazero_gomoku
⭐
2,427
An implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row)
Minimalrl
⭐
2,417
Implementations of basic RL algorithms with minimal lines of codes! (pytorch based)
Rlcard
⭐
2,260
Reinforcement Learning / AI Bots in Card (Poker) Games - Blackjack, Leduc, Texas, DouDizhu, Mahjong, UNO.
Muzero General
⭐
2,203
MuZero
Iccv2019 Learningtopaint
⭐
2,125
ICCV2019 - Learning to Paint With Model-based Deep Reinforcement Learning
Dqn Tensorflow
⭐
2,114
Tensorflow implementation of Human-Level Control through Deep Reinforcement Learning
Ppoxfamily
⭐
1,979
PPO x Family DRL Tutorial Course(决策智能入门级公开课:8节课帮你盘清算法理论,理顺代码逻辑,玩转决策AI应用实践 )
Rl4lms
⭐
1,911
A modular RL library to fine-tune language models to human preferences
Habitat Lab
⭐
1,910
A modular high-level library to train embodied AI agents across a variety of tasks and environments.
Reco Papers
⭐
1,753
Classic papers and resources on recommendation
Gym Anytrading
⭐
1,751
The most simple, flexible, and comprehensive OpenAI Gym trading environment (Approved by OpenAI Gym)
Rlpyt
⭐
1,709
Reinforcement Learning in PyTorch
Physo
⭐
1,669
Physical Symbolic Optimization
Rl
⭐
1,658
A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.
Rl Baselines3 Zoo
⭐
1,640
A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.
Tnt
⭐
1,598
A lightweight library for PyTorch training tools and utilities
Snake
⭐
1,596
Artificial intelligence for the Snake game.
Advanced Deep Learning With Keras
⭐
1,534
Advanced Deep Learning with Keras, published by Packt
Magent
⭐
1,502
A Platform for Many-agent Reinforcement Learning
Andrew Ng Notes
⭐
1,367
This is Andrew NG Coursera Handwritten Notes.
Noreward Rl
⭐
1,315
[ICML 2017] TensorFlow code for Curiosity-driven Exploration for Deep Reinforcement Learning
Rainbow
⭐
1,272
Rainbow: Combining Improvements in Deep Reinforcement Learning
Ppo Pytorch
⭐
1,270
Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch
Tradinggym
⭐
1,190
Trading and Backtesting environment for training reinforcement learning agent or simple rule base algo.
Awesome Deep Learning Papers For Search Recommendation Advertising
⭐
1,167
Awesome Deep Learning papers for industrial Search, Recommendation and Advertising. They focus on Embedding, Matching, Ranking (CTR and CVR prediction), Post Ranking, Multi-task Learning, Graph Neural Networks, Transfer Learning, Reinforcement Learning, Self-supervised Learning and so on.
Evolutionary Algorithm
⭐
1,152
Evolutionary Algorithm using Python, 莫烦Python 中文AI教学
Softlearning
⭐
1,108
Softlearning is a reinforcement learning framework for training maximum entropy policies in continuous domains. Includes the official implementation of the Soft Actor-Critic algorithm.
Machine Learning Curriculum
⭐
1,065
💻 Learn to make machines learn so that you don't have to struggle to program them; The ultimate list
Slm Lab
⭐
1,052
Modular Deep Reinforcement Learning framework in PyTorch. Companion library of the book "Foundations of Deep Reinforcement Learning".
Safe Rlhf
⭐
1,040
Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback
Evotorch
⭐
941
Advanced evolutionary computation library built directly on top of PyTorch, created at NNAISENSE.
Batch Ppo
⭐
919
Efficient Batched Reinforcement Learning in TensorFlow
Deep Q Learning
⭐
894
Minimal Deep Q Learning (DQN & DDQN) implementations in Keras
Flow
⭐
890
Computational framework for reinforcement learning in traffic control
Smac
⭐
881
SMAC: The StarCraft Multi-Agent Challenge
Reinforcement_learning_course_materials
⭐
857
Lecture notes, tutorial tasks including solutions as well as online videos for the reinforcement learning course hosted by Paderborn University
Smarts
⭐
853
Scalable Multi-Agent RL Training School for Autonomous Driving
Deepdrive
⭐
845
Deepdrive is a simulator that allows anyone with a PC to push the state-of-the-art in self-driving
Btgym
⭐
825
Scalable, event-driven, deep-learning-friendly backtesting library
Mbrl Lib
⭐
821
Library for Model Based RL
Recsyspapers
⭐
801
推荐/广告/搜索领域工业界经典以及最前沿论文集合。A collection of industry classics and cutting-edge papers in the field of recommendation/advertising/search.
Rl Book
⭐
794
Source codes for the book "Reinforcement Learning: Theory and Python Implementation"
Corl
⭐
769
High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC
Pytorch A3c
⭐
768
PyTorch implementation of Asynchronous Advantage Actor Critic (A3C) from "Asynchronous Methods for Deep Reinforcement Learning".
Lightzero
⭐
767
[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios
Maro
⭐
765
Multi-Agent Resource Optimization (MARO) platform is an instance of Reinforcement Learning as a Service (RaaS) for real-world resource optimization problems.
Rex Gym
⭐
759
OpenAI Gym environments for an open-source quadruped robot (SpotMicro)
Inverse Reinforcement Learning
⭐
753
Implementations of selected inverse reinforcement learning algorithms.
Dreamerv3
⭐
749
Mastering Diverse Domains through World Models
Super Mario Bros A3c Pytorch
⭐
735
Asynchronous Advantage Actor-Critic (A3C) algorithm for Super Mario Bros
Pygame Learning Environment
⭐
727
PyGame Learning Environment (PLE) -- Reinforcement Learning Environment in Python.
Deeprl Tutorials
⭐
726
Contains high quality implementations of Deep Reinforcement Learning algorithms written in PyTorch
Pysc2 Examples
⭐
721
StarCraft II - pysc2 Deep Reinforcement Learning Examples
Pytorch Rl
⭐
703
Deep Reinforcement Learning with pytorch & visdom
Dreamerv2
⭐
700
Mastering Atari with Discrete World Models
Osim Rl
⭐
695
Reinforcement learning environments with musculoskeletal models
Sample Factory
⭐
692
High throughput synchronous and asynchronous reinforcement learning
Super Mario Bros Ppo Pytorch
⭐
692
Proximal Policy Optimization (PPO) algorithm for Super Mario Bros
Youtube Code Repository
⭐
687
Repository for most of the code from my YouTube channel
Iris
⭐
681
Transformers are Sample-Efficient World Models. ICLR 2023, notable top 5%.
Contextualbandits
⭐
647
Python implementations of contextual bandits algorithms
Ai_all_resources
⭐
647
A curated list of Best Artificial Intelligence Resources
Pytorch Rl
⭐
638
PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.
Reversi Alpha Zero
⭐
634
Reversi reinforcement learning by AlphaGo Zero methods.
Autonomous Learning Library
⭐
616
A PyTorch library for building deep reinforcement learning agents.
Ai Toolbox
⭐
611
A C++ framework for MDPs and POMDPs with Python bindings
Rlseq2seq
⭐
610
Deep Reinforcement Learning For Sequence to Sequence Models
Mava
⭐
593
🦁 A research-friendly codebase for fast experimentation of multi-agent reinforcement learning in JAX
Courses
⭐
590
Answers for Quizzes & Assignments that I have taken
Related Searches
Python Django (28,897)
Python Flask (17,643)
Python Pytorch (16,596)
Python Dataset (14,792)
Python Tensorflow (14,225)
Python Docker (14,113)
Python Machine Learning (14,099)
Python Command Line (13,351)
Python Deep Learning (13,092)
Python Jupyter Notebook (12,976)
1-100 of 1,699 search results
Next >
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2025 Awesome Open Source. All rights reserved.