Multi armed bandits resources

I am trying to get a better grasp of multi armed bandit algorithms. I have got a decent handle on basic reinforcement learning, and I started reading Bandit Algorithms by Lattimore and but its heavy for me right now. Like way too heavy

Anyone know of some simpler or more intuitive resources to start with? Maybe blog posts, YouTube videos, or lecture notes that explain things like epsilon-greedy, UCB, Thompson Sampling in a more easy way? I saw some nptel courses on youtube but its way too stretched.

Would really appreciate any recs. Thanks!

submitted by /u/Upbeat-Stand1560 to r/learnmachinelearning
[link] [comments]


Commentaires

Laisser un commentaire

Votre adresse e-mail ne sera pas publiée. Les champs obligatoires sont indiqués avec *