RT1 Report: The Multi-Armed Bandit Problem and Thompson Sampling

March 27, 2023 James Neill Leave a comment

The first of two reports I have written this year for STOR601 is on the multi-armed bandit problem, supervised by James Grant.

This report focuses on using Thompson sampling to minimise regret for the multi-armed bandit problem, including approximations to Thompson sampling when the method cannot be used directly. These methods are compared empirically using simulated data.

View the report here:

RT1 Report

Blog

RT1 Report: The Multi-Armed Bandit Problem and Thompson Sampling

Leave a Reply Cancel reply