and Athey, S. Confidence intervals for policy eval- uation in adaptive experiments. arXiv preprint arXiv:1911.02768, 2019. • Hahn, J., Hirano, K., and Karlan, D. Adaptive exper- imental design using the propensity score. Journal of Business and Economic Statistics, 29(1):96–108, 2011. • Zhao, S., Zhou, E., Sabharwal, A., and Ermon, S. Adaptive concentration inequalities for sequential decision problems. In NeurIPS, pp. 1343–1351. Cur- ran Associates, Inc., 2016. • Chernozhukov, V., Chetverikov, D., Demirer, M., Du- flo, E., Hansen, C., Newey, W., and Robins, J. Dou- ble/debiased machine learning for treatment and structural parameters. Econometrics Journal, 21: C1–C68, 2018. • Kaufmann, E., Cappé, O., and Garivier, A. On the com- plexity of best-arm identification in multi-armed bandit models. JMLR, 2016. Reference of Adaptive Experimental Design 58