Awesome Open Source

Search results for reinforcement learning multi armed bandit

7 search results found

Agents ⭐ 2,658

TF-Agents: A reliable, scalable and easy to use TensorFlow library for Contextual Bandits and Reinforcement Learning.

Dissecting Reinforcement Learning ⭐ 585

Python code, PDFs and resources for the series of posts on Reinforcement Learning which I published on my personal blog

Rlberry ⭐ 147

An easy-to-use reinforcement learning library for research and education.

Mabalgs ⭐ 100

👤 Multi-Armed Bandit Algorithms Library (MAB) 👮

Contextual ⭐ 47

Contextual Bandits in R - simulation and evaluation of Multi-Armed Bandit Policies

Personalized News Recommendation ⭐ 34

Multi Armed Bandits implementation using the Yahoo! Front Page Today Module User Click Log Dataset

Bayesianbandits ⭐ 19

A Pythonic microframework for multi-armed bandit problems

Machine Learning Summer Schools ⭐ 14

Curated materials for different machine learning related summer schools

Python Ranker ⭐ 14

Ranking, Scoring, Decisions, and Optimization with XGBoost

Knnbandit ⭐ 14

Software for the experiments reported in the RecSys 2019 paper "A Simple Multi-Armed Nearest-Neighbor Bandit for Interactive Recommendation"

Multi Armed Bandit Example ⭐ 12

Learning Multi-Armed Bandits by Examples. Currently covering MAB, UCB, Boltzmann Exploration, Thompson Sampling, Contextual MAB.

Swift Ranker ⭐ 10

Easily Score & Rank Codable Objects with ML

Reinforcement Learning ⭐ 10

Implementations of basic concepts that fall under the Reinforcement Learning umbrella. This project is a collection of assignments from CS747: Foundations of Intelligent and Learning Agents (Autumn 2017) at IIT Bombay.

Spgd Search Party Gradient Descent Algorithm ⭐ 7

SPGD: Search Party Gradient Descent algorithm, a Simple Gradient-Based Parallel Algorithm for Bound-Constrained Optimization. Link: https://www.mdpi.com/2227-7390/10/5/800

A multi-armed bandit (MAB) simulation library in Python

Multiplayer Bandits ⭐ 5

Multi-Player Bandits Revisited [L. Besson & É. Kaufmann]

Contextual Gaussian Process Bandit Optimization ⭐ 5

Simple implementation of CGP-UCB algorithm.

Copyright 2018-2024 Awesome Open Source. All rights reserved.