Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Annotated_deep_learning_paper_implementations | 36,223 | 1 | 6 days ago | 78 | September 24, 2022 | 27 | mit | Jupyter Notebook | ||
🧑🏫 60 Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠 | ||||||||||
Ray | 27,844 | 80 | 298 | 5 hours ago | 87 | July 24, 2023 | 3,423 | apache-2.0 | Python | |
Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads. | ||||||||||
D2l En | 18,967 | a month ago | 2 | November 13, 2022 | 95 | other | Python | |||
Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge. | ||||||||||
Fingpt | 8,566 | a day ago | 1 | May 28, 2023 | 33 | mit | Jupyter Notebook | |||
Data-Centric FinGPT. Open-source for open finance! Revolutionize 🔥 We release the trained model on HuggingFace. | ||||||||||
Pytorch Tutorial | 7,372 | 6 months ago | 28 | mit | Jupyter Notebook | |||||
Build your neural network easy and fast, 莫烦Python中文教学 | ||||||||||
Wandb | 7,263 | 39 | 460 | 5 hours ago | 259 | August 01, 2023 | 995 | mit | Python | |
🔥 A tool for visualizing and tracking your machine learning experiments. This repo contains the CLI and Python API. | ||||||||||
Stable Baselines3 | 6,514 | 60 | 11 hours ago | 74 | June 14, 2022 | 83 | mit | Python | ||
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms. | ||||||||||
Practical_rl | 5,450 | 2 days ago | 40 | unlicense | Jupyter Notebook | |||||
A course in reinforcement learning in the wild | ||||||||||
Deep Reinforcement Learning | 4,521 | 4 months ago | 2 | mit | Jupyter Notebook | |||||
Repo for the Deep Reinforcement Learning Nanodegree program | ||||||||||
Trlx | 3,907 | 8 hours ago | 72 | mit | Python | |||||
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF) |
This project includes PyTorch implementations of various Deep Reinforcement Learning algorithms for both single agent and multi-agent.
It is written in a modular way to allow for sharing code between different algorithms. In specific, each algorithm is represented as a learning agent with a unified interface including the following components:
_take_one_step_
and _take_n_steps
, respectively)To train a model:
$ python run_a2c.py
It's extremely difficult to reproduce results for Reinforcement Learning algorithms. Due to different settings, e.g., random seed and hyper parameters etc, you might get different results compared with the followings.
This project gets inspirations from the following projects:
MIT