Reinforcement Learning: The K-armed Bandit Problem
Domino Data Lab
DECEMBER 21, 2021
In a previous blog post we talked about the foundations of reinforcement learning. In it, we present the k-armed bandit problem - a very simple setting that enables us to introduce the interaction between some of the key components of reinforcement learning. This entry is a continuation of the series.
Let's personalize your content