Type S Errors in Multi-Armed Bandits

4 Pages Posted: 7 Oct 2019

See all articles by Markus Loecher

Markus Loecher

Berlin School of Economics and Law

Date Written: October 1, 2019

Abstract

A standard method to evaluate new features and changes to e.g. websites is A/B testing. A common pitfall in performing A/B testing is the habit of looking at a test while it’s running, then stopping early. Due to the implicit multiple testing, the p-values are no longer trustworthy and usually overly optimistic. We investigate the claim that Bayesian methods, unlike frequentist tests, are immune to this “peeking” problem. We demonstrate that two frequently used measures, namely posterior probability and value remaining, are severely affected by repeated testing. We further show a strong dependence on the prior probability of the parameters of interest.

Keywords: multi-armed bandits, sequential testing, A/B testing

JEL Classification: C1, C11, C12

Suggested Citation

Loecher, Markus, Type S Errors in Multi-Armed Bandits (October 1, 2019). Available at SSRN: https://ssrn.com/abstract=3464959 or http://dx.doi.org/10.2139/ssrn.3464959

Markus Loecher (Contact Author)

Berlin School of Economics and Law ( email )

Badensche Strasse 50-51
Berlin, D-10825
Germany

Do you have a job opening that you would like to promote on SSRN?

Paper statistics

Downloads
17
Abstract Views
236
PlumX Metrics