You are an octopus at a casino. You want to make as much money as possible by pulling levers on 8 different slot machines. Each time you pull a lever, it costs you $5.
Click on the buttons below to pull a lever.
How much fictional money can you make in...
Reflection
Which button was the most profitable?
How much money did you have to lose before you figured this out (and started to come out ahead)?
What was your strategy for balancing Exploitation (i.e., earning the most money possible from buttons that were returning high values) and Exploration (i.e., trying new buttons)? When did you move on, and when did you stay?
Are you certain that you couldn't have made more money if you had explored more? Why? (The best algorithms can net around $60 in 100 clicks.)
If you were to write instructions to systematically choose the best solution in another scenario, what process would you follow?