Semi-uniform strategies for solving K-armed bandits
Domino Data Lab
JANUARY 31, 2022
In a previous blog post we introduced the K-armed bandit problem, a simple example of allocating a limited set of resources over time and under uncertainty. We saw how a stochastic bandit behaves and demonstrated that pulling arms at random yields an average reward close to the expected value of the reward distribution.
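As a quick refresher, the random-pull baseline can be sketched in a few lines of Python. This is a minimal illustration and not the code from the earlier post: the function name `pull_random_arms`, the three arm means, and the unit-variance Gaussian rewards are all assumptions made for the example. With arms chosen uniformly at random, the expected reward per pull is the mean of the arms' expected rewards, so the empirical average should settle near that value.

```python
import random
from statistics import fmean


def pull_random_arms(true_means, n_steps, seed=0):
    """Pull a uniformly random arm at each step and return the observed rewards."""
    rng = random.Random(seed)
    rewards = []
    for _ in range(n_steps):
        # Pick one of the K arms uniformly at random (no learning involved).
        arm = rng.randrange(len(true_means))
        # Assumption: each arm pays a Gaussian reward with unit variance
        # centred on its true mean.
        rewards.append(rng.gauss(true_means[arm], 1.0))
    return rewards


# Hypothetical 3-armed bandit; these means are illustrative only.
true_means = [0.1, 0.5, 0.9]
rewards = pull_random_arms(true_means, n_steps=100_000)

print("average reward from random pulls:", fmean(rewards))
print("mean of the arms' expected rewards:", fmean(true_means))
```

The two printed values should be close to each other, which is the baseline that any smarter strategy needs to beat.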