Search results for reinforcement learning bandit algorithms