Kohavi, Ron, et al. "Trustworthy online controlled experiments: Five puzzling outcomes explained." Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining. 2012. [2] 永田靖. "サンプルサイズの決め方". 朝倉書店, 2003年. [3] なぜAAテストにおけるp値は一様分布になるのか?. Zenn [4] Microsoft. "p-Values for Your p-Values: Validating Metric Trustworthiness by Simulated A/A Tests". 2020. 20 Appendix