2 votes
1 answer
942 views

Testing Multi-Arm Bandits on Historical Data

Suppose I want to test a multi-armed bandit algorithm in the contextual setting on a set of historical data. For simplicity, let's assume there are only two arms, A and B, and suppose the rewards are ...
Pavan Sangha
