机器学习在材料性质预测中的运用
Application of Machine Learning in Material Property Prediction
DOI: 10.12677/CMP.2020.92002, PDF,  被引量    国家自然科学基金支持
作者: 提 磊, 宋紫菀, 缪子隆:南京邮电大学电子与光学工程学院,江苏 南京;吴思璇, 石宇衡:南京邮电大学理学院,江苏 南京;李 斌*:南京邮电大学理学院,江苏 南京;江苏省新能源技术工程实验室,江苏 南京;施智祥:东南大学物理学院,江苏 南京
关键词: 晶格结构机器学习高通量计算Crystal Structure Machine Learning High-Throughput Calculation
摘要: 本文采用机器学习算法对晶体的生成焓的预测进行了研究。利用由多个神经层组成的深度神经网络(DNN)模型学习数据间的关系,对深度学习模型在材料结生成焓的预测做了深入的研究和讨论。通过对开放量子材料数据库(OQMD)中275,778种化合物的生成焓参数进行学习,建立了深度学习多层全连接网络,并用来预测未知晶体材料的生成焓。优化后预测模型的精度达到了0.075 eV/atom,达到了量子力学软件的计算精度。
Abstract: The prediction of enthalpy of formation of the crystal is studied by the machine learning algo-rithm. Based on the relationship between the learning data of the deep neural network (DNN) model composed of multiple neural layers, the prediction of enthalpy of formation of the deep learning model in the material junction is studied and discussed in depth. By learning the en-thalpy of formation parameters of 275,778 compounds in the open quantum materials database (OQMD), a deep learning multi-layer fully connected network was established to predict the en-thalpy of formation of unknown crystal materials. The accuracy of the optimized prediction model reached 0.075 eV/atom, which reached the computational accuracy of the quantum me-chanics software.
文章引用:提磊, 吴思璇, 李斌, 石宇衡, 宋紫菀, 缪子隆, 施智祥. 机器学习在材料性质预测中的运用[J]. 凝聚态物理学进展, 2020, 9(2): 11-19. https://doi.org/10.12677/CMP.2020.92002

参考文献

[1] Curtarolo, S., Kolmogorov, A.N. and Cocks, F.H. (2005) High-Throughput ab Initio Analysis of the Bi-In, Bi-Mg, Bi-Sb, In-Mg, In-Sb, and Mg-Sb systems. Calphad, 29, 155-161. [Google Scholar] [CrossRef
[2] Oganov, A.R. and Glass, C.W. (2006) Crystal Structure Prediction Using ab Initio Evolutionary Techniques: Principles and Applications. The Journal of Chemical Physics, 124, Article ID: 244704. [Google Scholar] [CrossRef] [PubMed]
[3] Hautier, G., Jain, A., Mueller, T., et al. (2013) Designing Multielectron Lithium-Ion Phosphate Cathodes by Mixing Transition Metals. Chemistry of Materials, 25, 2064-2074. [Google Scholar] [CrossRef
[4] Rupp, M., Tkatchenko, A., Müller, K.R., et al. (2012) Reply to Comment on Fast and Accurate Modeling of Molecular Atomization Energies with Machine Learning. Physical Review Letters, 109, Article ID: 059802. [Google Scholar] [CrossRef
[5] Snyder, J.C., Rupp, M., Hansen, K., et al. (2012) Finding Density Functionals with Machine Learning. Physical Review Letters, 108, Article ID: 253002. [Google Scholar] [CrossRef
[6] Wolverton, C., Yan, X.Y., Vijayaraghavan, R., et al. (2002) Incorporating First-Principles Energetics in Computational Thermodynamics Approaches. Acta Materialia, 50, 2187-2197. [Google Scholar] [CrossRef
[7] Ceder, G. (1998) Predicting Properties from Scratch. Science, 280, 1099-1100. [Google Scholar] [CrossRef
[8] Moreels, P. and Perona, P. (2007) Evaluation of Features Detectors and Descriptors Based on 3D Objects. International Journal of Computer Vision, 73, 263-284. [Google Scholar] [CrossRef
[9] Krizhevsky, A., Sutskever, I. and Hinton, G.E. (2012) Imagenet Classification with Deep Convolutional Neural Networks. In: Advances in Neural Information Processing Systems, Springer, New York, 1097-1105.
[10] Deng, L., Li, J., Huang, J.T., et al. (2013) Recent Advances in Deep Learning for Speech Research at Microsoft. 2013 IEEE International Conference on Acoustics, Speech and Signal Processing, Vancouver, 26-31 May 2013, 8604-8608. [Google Scholar] [CrossRef
[11] Braun, M.L., Buhmann, J.M. and MÞller, K.R. (2008) On Relevant Dimensions in Kernel Feature Spaces. Journal of Machine Learning Research, 9, 1875-1908.
[12] Geman, S., Bienenstock, E. and Doursat, R. (1992) Neural Networks and the Bias/Variance Di-lemma. Neural Computation, 4, 1-58. [Google Scholar] [CrossRef
[13] Vapnik, V. (2013) The Nature of Statistical Learning Theory. Springer Science & Business Media, New York.
[14] Cuingnet, R., Rosso, C., Chupin, M., et al. (2011) Spatial Regularization of SVM for the Detection of Diffusion Alterations Associated with Stroke Outcome. Medical Image Analysis, 15, 729-737. [Google Scholar] [CrossRef] [PubMed]
[15] Gaonkar, B. and Davatzikos, C. (2013) Analytic Estimation of Statistical Significance Maps for Support Vector Machine Based Multi-Variate Image Analysis and Classification. NeuroImage, 78, 270-283. [Google Scholar] [CrossRef] [PubMed]
[16] Boser, B.E., Guyon, I.M. and Vapnik, V.N. (1992) Proceedings of the Fifth Annual Workshop on Computational Learning Theory.
[17] Rosasco, L., Vito, E.D., Ca-ponnetto, A., et al. (2004) Are Loss Functions All the Same? Neural Computation, 16, 1063-1076. [Google Scholar] [CrossRef] [PubMed]
[18] Hastie, T. and Tibshirani, R. (1998) Classification by Pairwise Coupling. In: Advances in Neural Information Processing Systems, Springer, New York, 507-513. [Google Scholar] [CrossRef
[19] Hawkins, D.M. (2004) The Problem of Overfitting. Journal of Chemical Information and Computer Sciences, 44, 1-12. [Google Scholar] [CrossRef] [PubMed]
[20] Nair, V. and Hinton, G.E. (2010) Rectified Linear Units Improve Re-stricted Boltzmann Machines. Proceedings of the 27th International Conference on Machine Learning (ICML-10), Haifa, Israel, 21-24 June 2010, 807-814.