Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for reinforcement learning multi armed bandit
multi-armed-bandit
x
reinforcement-learning
x
7 search results found
Agents
⭐
2,658
TF-Agents: A reliable, scalable and easy to use TensorFlow library for Contextual Bandits and Reinforcement Learning.
Dissecting Reinforcement Learning
⭐
585
Python code, PDFs and resources for the series of posts on Reinforcement Learning which I published on my personal blog
Rlberry
⭐
147
An easy-to-use reinforcement learning library for research and education.
Mabalgs
⭐
100
👤 Multi-Armed Bandit Algorithms Library (MAB) 👮
Contextual
⭐
47
Contextual Bandits in R - simulation and evaluation of Multi-Armed Bandit Policies
Personalized News Recommendation
⭐
34
Multi Armed Bandits implementation using the Yahoo! Front Page Today Module User Click Log Dataset
Bayesianbandits
⭐
19
A Pythonic microframework for multi-armed bandit problems
Machine Learning Summer Schools
⭐
14
Curated materials for different machine learning related summer schools
Python Ranker
⭐
14
Ranking, Scoring, Decisions, and Optimization with XGBoost
Knnbandit
⭐
14
Software for the experiments reported in the RecSys 2019 paper "A Simple Multi-Armed Nearest-Neighbor Bandit for Interactive Recommendation"
Multi Armed Bandit Example
⭐
12
Learning Multi-Armed Bandits by Examples. Currently covering MAB, UCB, Boltzmann Exploration, Thompson Sampling, Contextual MAB.
Swift Ranker
⭐
10
Easily Score & Rank Codable Objects with ML
Reinforcement Learning
⭐
10
Implementations of basic concepts dealt under the Reinforcement Learning umbrella. This project is collection of assignments in CS747: Foundations of Intelligent and Learning Agents (Autumn 2017) at IIT Bombay
Spgd Search Party Gradient Descent Algorithm
⭐
7
SPGD: Search Party Gradient Descent algorithm, a Simple Gradient-Based Parallel Algorithm for Bound-Constrained Optimization. Link: https://www.mdpi.com/2227-7390/10/5/800
Mabby
⭐
5
A multi-armed bandit (MAB) simulation library in Python
Multiplayer Bandits
⭐
5
Multi-Player Bandits Revisited [L. Besson & É. Kaufmann]
Contextual Gaussian Process Bandit Optimization
⭐
5
Simple implementation of CGP-UCB algorithm.
Related Searches
Python Reinforcement Learning (2,612)
Jupyter Notebook Reinforcement Learning (761)
Pytorch Reinforcement Learning (681)
Reinforcement Learning Rl (649)
Artificial Intelligence Reinforcement Learning (623)
Reinforcement Learning Gym (496)
Tensorflow Reinforcement Learning (448)
Reinforcement Learning Dqn (439)
Reinforcement Learning Openai Gym (363)
Reinforcement Learning Openai (333)
1-7 of 7 search results
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.