因果发现算法研究综述

doi:10.12677/csa.2026.164112

期刊菜单

因果发现算法研究综述
A Survey on Causal Discovery Algorithms

DOI: 10.12677/csa.2026.164112, PDF, 科研立项经费支持
作者: 祁慧英：伊犁师范大学电子工程学院，新疆伊宁；苏恒, 苗志轩^*：伊犁师范大学网络安全与信息技术学院，新疆伊宁；王浩莲：伊犁河谷智能计算研究与应用重点实验室，新疆伊宁
关键词: 因果发现；结构因果模型；有向无环图；基于约束的方法；基于评分的方法；基于混合的方法；Causal Discovery； Structural Causal Models； Directed Acyclic Graphs； Constraint-Based Methods； Score-Based Methods； Hybrid Methods

摘要: 因果发现旨在从观测数据或实验数据中自动化地辨识变量之间的因果关联结构，是因果推理研究领域的核心议题之一。本文系统性地梳理了因果发现的理论根基、算法分类体系及其演进历程。首先，阐明因果发现问题的基本形式化框架，涵盖结构因果模型、有向无环图、因果马尔可夫条件与忠实性假设等核心理论要素；其次，依据方法论范式将现有算法划分为基于约束的方法、基于评分的方法、基于函数因果模型的方法、混合策略方法以及基于深度学习的新兴方法五大类别，逐一剖析其理论动机、代表性算法、技术优势与固有局限；最后，对该领域面临的开放性挑战与未来发展趋向进行展望。

Abstract: Causal discovery aims to automatically identify the underlying causal relationships among variables from observational or experimental data, and has become a central topic in the field of causal inference. This paper provides a systematic review of the theoretical foundations, algorithmic taxonomy, and evolutionary development of causal discovery methods. First, the fundamental formal framework of the causal discovery problem is introduced, including core theoretical components such as structural causal models, directed acyclic graphs, the causal Markov condition, and the faithfulness assumption. Second, according to methodological paradigms, existing algorithms are categorized into five major classes: constraint-based methods, score-based methods, functional causal model-based methods, hybrid approaches, and emerging deep learning-based methods. For each category, the theoretical motivation, representative algorithms, technical advantages, and inherent limitations are systematically analyzed. Finally, the paper discusses the open challenges faced by the field and outlines potential directions for future research.

文章引用：祁慧英, 苏恒, 王浩莲, 苗志轩. 因果发现算法研究综述[J]. 计算机科学与应用, 2026, 16(4): 90-100. https://doi.org/10.12677/csa.2026.164112

参考文献

[1]	McDonald, R.P. (2002) Judea Pearl. Causality: Models, Reasoning, and Inference. Cambridge University Press. [Google Scholar] [CrossRef]
[2]	Jonas, P., Janzing, D. and Scholkopf, B. (2017) Elements of Causal Inference: Founda-tions and Learning Algorithms. MIT Press.
[3]	Sewall, W. (1921) Correlation and Causation. Journal of Agricultural Research, 20, Article 557.
[4]	Spirtes, P., Glymour, C. and Scheines, R. (2001) Causation, Prediction, and Search. 2nd Edition, The MIT Press. [Google Scholar] [CrossRef]
[5]	Judea, P. (2014) Probabilistic Reasoning in Intelligent Systems: Networks of Plausible Inference. Elsevier.
[6]	Daphne, K. and Friedman, N. (2009) Probabilistic Graphical Models: Principles and Techniques. MIT Press.
[7]	Judea, P. (2009) Causality. Cambridge University Press.
[8]	Ioannis, T. and Aliferis, C.F. (2003) Towards Princi-pled Feature Selection: Relevancy, Filters and Wrappers. International Workshop on Artificial Intelligence and Statistics, Florida, 3-6 January 2003.
[9]	Spirtes, P. (2010) Introduction to Causal Inference. Journal of Machine Learning Research, 11, 1643-1662.
[10]	Verma, T. and Pearl, J. (2022) Equivalence and Synthesis of Causal Models. In: Probabilistic and Causal Inference, ACM, 221-236. [Google Scholar] [CrossRef]
[11]	Spirtes, P. and Glymour, C. (1991) An Algorithm for Fast Re-covery of Sparse Causal Graphs. Social Science Computer Review, 9, 62-72. [Google Scholar] [CrossRef]
[12]	Spirtes, P.L., Meek, C. and Richardson, T.S. (2013) Causal Inference in the Presence of Latent Variables and Selection Bias.
[13]	Colombo, D., Maathuis, M.H., Kalisch, M. and Richardson, T.S. (2012) Learn-ing High-Dimensional Directed Acyclic Graphs with Latent and Selection Variables. The Annals of Statistics, 40, 294-321. [Google Scholar] [CrossRef]
[14]	Judea, P. and Verma, T.S. (1995) A Theory of Inferred Causation. In: Studies in Logic and the Foundations of Mathematics. Vol. 134, Elsevier, 789-811.
[15]	Tsamardinos, I., Aliferis, C.F. and Statnikov, A. (2003) Algo-rithms for Large Scale Markov Blanket Discovery. American Association for Artificial Intelligence.
[16]	Yaramakala, S. and Mar-garitis, D. (2005) Speculative Markov Blanket Discovery for Optimal Feature Selection. 15th IEEE International Conference on Data Mining (ICDM’05), Houston, 27-30 November 2005, 1-4.
[17]	Yang, X., Wang, Y., Ou, Y. and Tong, Y. (2019) Three-Fast-Inter Incremental Association Markov Blanket Learning Algorithm. Pattern Recognition Letters, 122, 73-78. [Google Scholar] [CrossRef]
[18]	Margaritis, D. (2009) Toward Provably Correct Feature Selection in Arbitrary Domains. Advances in Neural Information Processing Systems, 22, 1-9.
[19]	Tsamardinos, I., Aliferis, C.F. and Statnikov, A. (2003) Time and Sample Efficient Discovery of Markov Blankets and Direct Causal Relations. Proceedings of the Ninth ACM SIGKDD Inter-national Conference on Knowledge Discovery and Data Mining, Washington, 24-27 August 2003, 673-678. [Google Scholar] [CrossRef]
[20]	Aliferis, C.F., Tsamardinos, I. and Statnikov, A. (2003) HITON: A Novel Markov Blanket Algorithm for Optimal Variable Selection. AMIA Annual Symposium Proceedings, Washington, 8-12 November 2003, 21-25.
[21]	Peña, J.M., Björkegren, J. and Tegnér, J. (2005) Scalable, Efficient and Correct Learning of Markov Boundaries under the Faithfulness Assumption. In: Lecture Notes in Computer Science, Springer, 136-147. [Google Scholar] [CrossRef]
[22]	Peña, J.M., Nilsson, R., Björkegren, J. and Tegnér, J. (2007) Towards Scalable and Data Efficient Learning of Markov Boundaries. International Journal of Approximate Reasoning, 45, 211-232. [Google Scholar] [CrossRef]
[23]	Gao, T. and Ji, Q. (2016) Efficient Markov Blanket Discovery and Its Application. IEEE Transactions on Cybernetics, 47, 1169-1179. [Google Scholar] [CrossRef]
[24]	Strobl, E.V. (2017) Causal Discovery under Non-Stationary Feedback. University of Pittsburgh.
[25]	Heckerman, D., Geiger, D. and Chickering, D.M. (1995) Learning Bayesian Networks: The Combination of Knowledge and Statistical Data. Machine Learning, 20, 197-243. [Google Scholar] [CrossRef]
[26]	Heckerman, D. (1998) A Tutorial on Learning with Bayesian Networks. In: Learning in Graphical Models, Springer, 301-354. [Google Scholar] [CrossRef]
[27]	Chickering, D.M. (2002) Optimal Structure Identification with Greedy Search. Journal of Machine Learning Research, 3, 507-554.
[28]	Aragam, B. (2024) Greedy Equivalence Search for Nonparametric Graphical Models. arXiv:2406.17228.
[29]	Ramsey, J., Glymour, M., Sanchez-Romero, R. and Glymour, C. (2017) A Million Variables and More: The Fast Greedy Equivalence Search Algorithm for Learning High-Dimensional Graphical Causal Models, with an Application to Functional Magnetic Resonance Images. International Journal of Data Science and Analytics, 3, 121-129. [Google Scholar] [CrossRef]
[30]	Chickering, D.M. and Meek, C. (2015) Selective Greedy Equivalence Search: Finding Optimal Bayesian Networks Using a Polynomial Number of Score Evaluations.
[31]	Gao, T., Fadnis, K. and Campbell, M. (2017) Local-to-Global Bayesian Network Structure Learning. International Conference on Machine Learning, Sydney, 6-11 August 2017.
[32]	Ramsey, J.D., Hanson, S.J., Hanson, C., Halchenko, Y.O., Poldrack, R.A. and Glymour, C. (2010) Six Problems for Causal Inference from fMRI. NeuroImage, 49, 1545-1558. [Google Scholar] [CrossRef]
[33]	Hauser, A. and Bühlmann, P. (2012) Characterization and Greedy Learning of Interventional Markov Equivalence Classes of Directed Acyclic Graphs. The Journal of Machine Learning Research, 13, 2409-2464.
[34]	Lachapelle, S., Brouillard, P., Deleu, T., et al. (2019) Gra-dient-Based Neural DAG Learning.
[35]	Zheng, X., Aragam, B., Ravikumar, P., et al. (2018) DAGs with No Tears: Continuous Op-timization for Structure Learning. Advances in Neural Information Processing Systems, 31, 1-22.
[36]	Tsamardinos, I., Brown, L.E. and Aliferis, C.F. (2006) The Max-Min Hill-Climbing Bayesian Network Structure Learning Algorithm. Machine Learning, 65, 31-78. [Google Scholar] [CrossRef]
[37]	Scanagatta, M., Salmerón, A. and Stella, F. (2019) A Survey on Bayesian Net-work Structure Learning from Data. Progress in Artificial Intelligence, 8, 425-439. [Google Scholar] [CrossRef]
[38]	Scutari, M. (2010) Learning Bayesian Networks with Thebnlearnrpackage. Journal of Statistical Software, 35, 1-22. [Google Scholar] [CrossRef]
[39]	Gasse, M., Aussem, A. and Elghazel, H. (2014) A Hybrid Algorithm for Bayesian Network Structure Learning with Application to Multi-Label Learning. Expert Systems with Applications, 41, 6755-6772. [Google Scholar] [CrossRef]
[40]	Ling, Z.L., Peng, H.H., Zhang, Y.W., et al. (2024) Hybrid Local Causal Discov-ery. arXiv:2412.19507.
[41]	Nandy, P., Hauser, A. and Maathuis, M.H. (2018) High-Dimensional Consistency in Score-Based and Hybrid Structure Learning. The Annals of Statistics, 46, 3151-3183. [Google Scholar] [CrossRef]
[42]	Wang, N., Liu, H., Zhang, L., Cai, Y. and Shi, Q. (2024) An Efficient Skeleton Learning Approach-Based Hybrid Algorithm for Identifying Bayesian Network Structure. Engineering Applications of Artificial Intelligence, 133, Article 108105. [Google Scholar] [CrossRef]
[43]	Liang, J., Wang, J., Yu, G., Domeniconi, C., Zhang, X. and Guo, M. (2023) Gradient-Based Local Causal Structure Learning. IEEE Transactions on Cybernetics, 54, 486-495. [Google Scholar] [CrossRef]
[44]	Huang, X., Guo, X., Li, Y. and Yu, K. (2023) A Novel Data Enhancement Ap-proach to DAG Learning with Small Data Samples. Applied Intelligence, 53, 27589-27607. [Google Scholar] [CrossRef]

为你推荐

友情链接