MAAIA:一种面向A/B实验效果精准评估的多维异动归因方法
MAAIA: A Multidimensional Anomaly Attribution Method for Accurate Evaluation of A/B Testing Effects
摘要: 在A/B实验中,有时会出现指标异常波动的问题,这里的异常波动是指无法从报表中直接分析出实验期指标显著上升或下降的原因,或者出现了无法从单一维度定位根原因的波动。这类异常波动会极大增加分析成本,数据分析师们通常需要手动拆解和组合大量维度的数据,过程繁琐且准确度低。本文提出了MAAIA方法(多维异动归因智能方法,Multi-dimensional Anomaly Attribution Intelligent Approach),并将该方法应用到A/B实验中,该方法能够快速高效得识别出造成指标波动的根原因,并且能更深入地下钻到交叉维度对指标异常波动的影响。目前已通过已有的实验验证了这个方法的有效性以及效率,并成功将其应用到百度某业务的报表分析中,结果表明,MAAIA方法可以快速高效地定位根原因,节省了大量分析成本,增加了分析结果的准确率。
Abstract: In A/B experiments, there are sometimes issues with abnormal fluctuations in indicators. Such abnormal fluctuations refer to situations where the reasons for significant increases or decreases in indicators during the experimental period cannot be directly analyzed from the reports, or where the root cause cannot be identified from a single dimension. Such abnormal fluctuations can greatly increase the cost of analysis, as data analysts often need to manually disassemble and combine data from numerous dimensions, a process that is cumbersome and has low accuracy. This paper proposes the MAAIA method (Multi-dimensional Anomaly Attribution Intelligent Approach), and applies this method to A/B experiments. This method can quickly and efficiently identify the root cause of indicator fluctuations, and can delve deeper into the impact of cross-dimensional factors on abnormal indicator fluctuations. The effectiveness and efficiency of this method have been verified through existing experiments, and it has been successfully applied to report analysis for a certain business at Baidu. The results show that the MAAIA method can quickly and efficiently locate the root cause, save a significant amount of analysis cost, and increase the accuracy of analysis results.
参考文献
|
[1]
|
Lin, Q., Lou, J., Zhang, H. and Zhang, D. (2016) iDice: Problem Identification for Emerging Issues. Proceedings of the 38th International Conference on Software Engineering, Austin, 14-22 May 2016, 214-224. [Google Scholar] [CrossRef]
|
|
[2]
|
Kim, H. (2016) Statistical Notes for Clinical Researchers: Sample Size Calculation 1. Comparison of Two Independent Sample Means. Restorative Dentistry & Endodontics, 41, 74-78. [Google Scholar] [CrossRef]
|
|
[3]
|
Burk, S. (2006) A Better Statistical Method for A/B Testing in Marketing Campaigns. Marketing Bulletin, 17, Technical Note 3.
|
|
[4]
|
Alvi, M.H. (2016) A Manual for Selecting Sampling Techniques in Research. Munich Personal RePEc Archive.
|
|
[5]
|
Zhou, J., Lu, J. and Shallah, A. (2023) All about Sample-Size Calculations for A/B Testing: Novel Extensions & Practical Guide. Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, Birmingham, 21-25 October 2023, 3574-3583. [Google Scholar] [CrossRef]
|
|
[6]
|
Arndt, C. (2001) Information Measures: Information and Its Description in Science and Engineering. Springer.
|
|
[7]
|
van Erven, T. and Harremoes, P. (2014) Rényi Divergence and Kullback-Leibler Divergence. IEEE Transactions on Information Theory, 60, 3797-3820. [Google Scholar] [CrossRef]
|
|
[8]
|
ICBU象树中心, 阿里云开发者社区. 异动归因分析技术解决方案-异动归因指标拆解[EB/OL].
https://developer.aliyun.com/article/880405, 2022-03-30.
|
|
[9]
|
好好的分析师, 知乎. 波动解读——指标拆解的加减乘除双因素[EB/OL].
https://zhuanlan.zhihu.com/p/412117828, 2023-06-02.
|
|
[10]
|
Abadie, A. (2005) Semiparametric Difference-in-Differences Estimators. The Review of Economic Studies, 72, 1-19. [Google Scholar] [CrossRef]
|
|
[11]
|
Sant’Anna, P.H.C. and Zhao, J. (2020) Doubly Robust Difference-In-Differences Estimators. Journal of Econometrics, 219, 101-122. [Google Scholar] [CrossRef]
|