Solutions and figures for problems from Reinforcement Learning: An Introduction by Sutton & Barto
Updated Jul 16, 2019 - Python
Solutions to the Stanford CS234 Reinforcement Learning 2022 course assignments.
A small collection of bandit algorithms: Explore-Then-Commit (ETC), Epsilon-Greedy, Elimination, UCB, Exp3, LinearUCB, and Thompson Sampling.