CSPTH:基于天河二号的晶体结构预测软件框架
CSPTH: A Crystal Structure Prediction Framework on Tianhe-2 Supercomputer
DOI: 10.12677/CSA.2022.124088, PDF,  被引量    科研立项经费支持
作者: 刘瑾瑜:中山大学,计算机学院,广东 广州;陈 品, 卢宇彤*:中山大学,计算机学院,广东 广州;中山大学国家超级计算中心,广东 广州
关键词: 晶体结构预测遗传算法高性能计算Crystal Structure Prediction Genetic Algorithm High-Performance Computing
摘要: 晶体结构是深入理解材料的物理及化学性质的重要信息,发展可以从理论上预测晶体结构的方法具有重要意义。通过高性能计算集群甚至利用超级计算机来加速晶体结构预测已逐步成为趋势。本文中,我们基于天河二号超级计算机开发了一套开源的晶体结构预测软件框架,命名为CSPTH。在算法层面,我们基于当前效率最高的遗传算法进行晶体结构预测,并采用了多种技术提升结构预测效率,包括并行化生成种群结构;引入空间群限定,减少自由度搜索,提升结构多样性;引入晶体指纹进行相似性算法,排除相似结构干扰,避免“基因漂变”的问题。特别地,我们针对晶体结构预测算法的应用特点以及天河二号的系统环境,从任务以及数据管理两个方面做了优化。在任务管理上,我们设计了多层任务调度管理模块,根据计算任务的规模大小的分发细粒度作业(节点内)以及粗粒度作业(跨节点),提升计算资源的高效使用;在数据管理上,我们将每个计算任务的数据都临时储存于计算节点的RAMDISK,提取有效信息后再存储于MongoDB数据库,避免大量小文件存储于公共存储。CSPTH已在15种已知一元、二元以及三元体系上进行了结构预测,实验结果表明CSPTH能根据给定的组分及外部压力条件下全部预测出相应的稳定结构。
Abstract: Crystal structure is the critical information for understanding the physical and chemical properties of materials. Therefore, theoretical prediction of crystal structures only with chemical composition and external conditions is significant. It has become a trend to design new materials through high-performance computing clusters or even using supercomputers. In this paper, we developed an open source framework for crystal structure prediction (CSP) based on Tianhe-2 supercomputer, named CSPTH. When designing the algorithm in CSP, we chose the most efficient genetic algorithm in our framework and adopted numerous technologies to improve prediction accuracy. Specifically, we used a multi-process parallel method to generate the trying structures. The space group restriction is introduced to reduce the searching space and improve the structural diversity in population. We utilized a crystal fingerprint to eliminate the similar structures, which can avoid the problem of “gene drift”. In particular, considering the characteristics of crystal structure algorithm and the system environment of Tianhe-2, we optimize the crystal structure prediction algorithm from two aspects: task management and data management. In terms of task management, we designed a multi-layer task scheduling management module to distribute fine-grained tasks (within a node) and coarse-grained tasks (multi-nodes) according to the system size of tasks to improve the efficiency of employing resources. In our data management module, the data of each computing task is temporarily cached in the RAMDISK of the computing node, and the useful information is extracted and later stored in the MongoDB database, which can avoid a large number of small files stored in the public storage. CSPTH has been used to predict the structures of 15 known element, binary and ternary systems. Experimental results show that CSPTH can predict all the correspond-ing stable structures with the only known of chemical composition and external pressure.
文章引用:刘瑾瑜, 陈品, 卢宇彤. CSPTH:基于天河二号的晶体结构预测软件框架[J]. 计算机科学与应用, 2022, 12(4): 866-878. https://doi.org/10.12677/CSA.2022.124088

参考文献

[1] Liu, Y., Wang, R., Wang, Z., Li, D. and Cui, T. (2022) Formation of Twelve-Fold Iodine Coordination at High Pressure. Nature Communications, 13, Article No. 412. [Google Scholar] [CrossRef] [PubMed]
[2] Luo, D., Qiao, X. and Dronskowski, R. (2021) Predicting Nitrogen-Based Families of Compounds: Transition-Metal Guanidinates TCN3 (T=V, Nb, Ta) and Ortho-Nitrido Carbonates T’2CN4 (T’=Ti, Zr, Hf). Angewandte Chemie International Edition, 60, 486-492. [Google Scholar] [CrossRef] [PubMed]
[3] Luo, W., Nakamura, Y., Park, J. and Yoon, M. (2021) Co-balt-Based Magnetic Weyl Semimetals with High-Thermodynamic Stabilities. npj Computational Materials, 7, Article No. 2. [Google Scholar] [CrossRef
[4] Liu, X., Niu, H. and Oganov, A.R. (2021) COPEX: Co-Evolutionary Crystal Structure Prediction Algorithm for Complex Systems. npj Computational Materials, 7, Article No. 199. [Google Scholar] [CrossRef
[5] Kvashnin, A.G., Tantardini, C., Zakaryan, H.A., Kvashnina, Y.A. and Oganov, A.R. (2020) Computational Search for New W-Mo-B Compounds. Chemistry of Materi-als, 32, 7028-7035. [Google Scholar] [CrossRef
[6] Pannetier, J., Bassas-Alsina, J., Rodri-guez-Carvajal, J. and Caignaert, V. (1990) Prediction of Crystal Structures from Crystal Chemistry Rules by Simulated Annealing. Nature, 346, 343-345. [Google Scholar] [CrossRef
[7] Schön, J.C. and Jansen, M. (1996) First Step towards Planning of Syntheses in Solid-State Chemistry: Determination of Promising Structure Candidates by Global Optimization. Angewandte Chemie International Edition in English, 35, 1286-1304. [Google Scholar] [CrossRef
[8] Wales, D.J. and Doye, J.P. (1997) Global Optimization by Ba-sin-Hopping and the Lowest Energy Structures of Lennard-Jones Clusters Containing up to 110 Atoms. The Journal of Physical Chemistry A, 101, 5111-5116. [Google Scholar] [CrossRef
[9] Goedecke, S. (2004) Minima Hopping: An Efficient Search Method for the Global Minimum of the Potential Energy Surface of Complex Molecular Systems. Journal of Chemical Physics, 120, 9911-9917. [Google Scholar] [CrossRef] [PubMed]
[10] Pickard, C.J. and Needs, R.J. (2006) High-Pressure Phases of Silane. Physical Review Letters, 97, Article ID: 045504. [Google Scholar] [CrossRef
[11] Pickard, C.J. and Needs, R.J. (2011) Ab Initio Random Structure Searching. Journal of Physics: Condensed Matter, 23, Article ID: 053201. [Google Scholar] [CrossRef] [PubMed]
[12] Woodley, S., Battle, P., Gale, J. and Catlow, C.A. (1999) The Prediction of Inorganic Crystal Structures Using a Genetic Algorithm and Energy Minimisation. Physical Chemistry Chemical Physics, 1, 2535-2542. [Google Scholar] [CrossRef
[13] Abraham, N.L. and Probert, M.I. (2006) A Periodic Genetic Algorithm with Real-Space Representation for Crystal Structure and Polymorph Prediction. Physical Review B, 73, Article ID: 224104. [Google Scholar] [CrossRef
[14] Oganov, A.R. and Glass, C.W. (2006) Crystal Structure Pre-diction Using Ab Initio Evolutionary Techniques: Principles and Applications. Journal of Chemical Physics, 124, 201-419. [Google Scholar] [CrossRef] [PubMed]
[15] Trimarchi, G. and Zunger, A. (2007) Global Space-Group Opti-mization Problem: Finding the Stablest Crystal Structure without Constraints. Physical Review B, 75, Article ID: 104113. [Google Scholar] [CrossRef
[16] Martoňák, R., Laio, A. and Parrinello, M. (2003) Predicting Crystal Structures: The Parrinello-Rahman Method Revisited. Physical Review Letters, 90, Article ID: 075503. [Google Scholar] [CrossRef
[17] Call, S.T., Zubarev, D.Y. and Boldyrev, A.I. (2007) Global Minimum Structure Searches via Particle Swarm Optimization. Journal of Computational Chemistry, 28, 1177-1186. [Google Scholar] [CrossRef] [PubMed]
[18] Laio, A. and Parrinello, M. (2002) Escaping Free-Energy Minima. Pro-ceedings of the National Academy of Sciences of the United States of America, 99, 12562-12566. [Google Scholar] [CrossRef] [PubMed]
[19] Glass, C.W., Oganov, A.R. and Hansen, N. (2006) USPEX—Evolutionary Crystal Structure Prediction. Computer Physics Communications, 175, 713-720. [Google Scholar] [CrossRef
[20] Oganov, A., Lyakhov, A. and Valle, M. (2011) How Evolutionary Crystal Structure Prediction Works—And Why. Accounts of Chemical Research, 44, 227-237. [Google Scholar] [CrossRef] [PubMed]
[21] Wang, Y., Lv, J., Zhu, L. and Ma, Y. (2010) Crystal Structure Prediction via Particle Swarm Optimization. Physics Review B, 82, Article ID: 094116. [Google Scholar] [CrossRef
[22] Wang, Y., Lv, J., Zhu, L. and Ma, Y. (2012) CALYPSO: A Method for Crystal Structure Prediction. Computer Physics Communications, 183, 2063-2070. [Google Scholar] [CrossRef
[23] Zhang, Y., Wang, H., Wang, Y., Zhang, L. and Ma, Y. (2016) Computer-Assisted Inverse Design of Inorganic Electrides. Physical Review X, 7, Article ID: 019903. [Google Scholar] [CrossRef
[24] Van Eijck, B., Mooij, W.T. and Kroon, J. (1995) Attempted Prediction of the Crystal Structures of Six Monosaccharides. Acta Crystallographica Section B: Structural Science, B51, 99-103. [Google Scholar] [CrossRef
[25] Lonie, D.C. and Zurek, E. (2011) XTALOPT: An Open-Source Evolutionary Algorithm for Crystal Structure Prediction. Computer Physics Communications, 182, 372-387. [Google Scholar] [CrossRef
[26] Price, S.L., Leslie, M., Welch, G.W., Habgood, M., Price, L.S., Karamertzanis, P.G. and Day, G.M. (2010) Modelling Organic Crystal Structures Using Distributed Multipole and Polarizability-Based Model Intermolecular Potentials. Physical Chemistry Chemical Physics, 12, 8478-8490. [Google Scholar] [CrossRef] [PubMed]
[27] Tipton, W.W. and Hennig, R.G. (2013) A Grand Canonical Genetic Algo-rithm for the Prediction of Multi-Component Phase Diagrams and Testing of Empirical Potentials. Journal of Physics: Condensed Matter, 25, Article ID: 495401. [Google Scholar] [CrossRef] [PubMed]
[28] Neumann, M., van de Streek, J., Fabbiani, F., Hidber, P. and Grassmann, O. (2015) Combined Crystal Structure Prediction and High-Pressure Crystallization in Rational Phar-maceutical Polymorph Screening. Nature Communications, 6, Article No. 7793. [Google Scholar] [CrossRef] [PubMed]
[29] Neumann, M. and van de Streek, J. (2018) How Many Ritonavir Cases Are There Still out There? Faraday Discussions, 211, 441-458. [Google Scholar] [CrossRef
[30] Mor-tazavi, M., Hoja, J., Aerts, L., Quéré, L., van de Streek, J., Neumann, M.A. and Tkatchenko, A. (2019) Computational Polymorph Screening Reveals Late-Appearing and Poorly-Soluble form of Rotigotine. Communications Chemistry, 2, Article No. 70. [Google Scholar] [CrossRef
[31] Zhang, Y.Y., Gao, W., Chen, S., Xiang, H. and Gong, X.G. (2015) Inverse Design of Materials by Multi-Objective Differential Evolution. Computational Materials Science, 98, 51-55. [Google Scholar] [CrossRef
[32] Curtis, F., Li, X., Rose, T., Vazquez-Mayagoitia, A., Bhattacharya, S., Ghiringhelli, L.M. and Marom, N. (2018) GAtor: A First-Principles Genetic Algorithm for Molecular Crystal Structure Prediction. Journal of Chemical Theory and Computation, 14, 2246-2264. [Google Scholar] [CrossRef] [PubMed]
[33] Hajinazar, S., Thorn, A., Sandoval, E.D., Kharabadze, S. and Kol-mogorov, A.N. (2021) MAISE: Construction of Neural Network Interatomic Models and Evolutionary Structure Opti-mization. Computer Physics Communications, 259, Article ID: 107679. [Google Scholar] [CrossRef
[34] Yamashita, T., Kanehira, S., Sato, N., Kino, H., Terayama, K., Sawahata, H., et al. (2021) CrySPY: A Crystal Structure Prediction Tool Accelerated by Machine Learning. Science and Technology of Advanced Materials: Methods, 1, 87-97. [Google Scholar] [CrossRef
[35] Mosley, J.W., Operskalski, E.A., Tobler, L.H., Andrews, W.W., Phelps, B., Dockter, J., et al. (1987) Genetic Algorithms and Their Applications. Proceedings of the 2nd Interna-tional Conference on Genetic Algorithms, L. Erlbaum Associates Inc., 1-8.
[36] Seko, A., Togo, A. and Tanaka, I. (2018) Descriptors for Machine Learning of Materials Data. In: Tanaka, I., Ed., Nanoinformatics, Springer, Singapore, 3-23. [Google Scholar] [CrossRef
[37] Thompson, A.P., Aktulga, H.M., Berger, R., Bolintineanu, D.S., Brown, W.M., Crozier, P.S., et al. (2022) LAMMPS—A Flexible Simulation Tool for Particle-Based Materials Modeling at the Atomic, Meso, and Continuum Scales. Computer Physics Communications, 271, Article ID: 108171. [Google Scholar] [CrossRef
[38] Kresse, G. and Furthmüller, J. (1996) Efficient Iterative Schemes for Ab Initio Total-Energy Calculations Using a Plane-Wave Basis Set. Physical Review B, 54, 11169-11186. [Google Scholar] [CrossRef
[39] Frisch, M.J., Trucks, G.W., Schlegel, H.B., Scuseria, G.E., Robb, M.A., Cheeseman, J.R., et al. (2016) Gaussian-16 Revision B.01. Gaussian Inc., Wallingford, CT.
[40] Yoo, A.B., Jette, M.A. and Grondona, M. (2003) SLURM: Simple Linux Utility for Resource Management. Workshop on Job Scheduling Strategies for Parallel Processing, Seattle, 24 June 2004, 44-60. [Google Scholar] [CrossRef
[41] Bode, B., Halstead, D.M., Kendall, R., Lei, Z. and Jackson, D. (2000) The Portable Batch Scheduler and the Maui Scheduler on Linux Clusters. 4th Annual Linux Showcase & Conference (ALS 2000), Atlanta, 10-14 October 2000, 27-34.
[42] Gentzsch, W. (2001) Sun Grid Engine: Towards Creating a Compute Power Grid. Proceedings First IEEE/ACM International Symposium on Cluster Computing and the Grid, Brisbane, 15-18 May 2001, 35-36. [Google Scholar] [CrossRef
[43] Thain, D., Tannenbaum, T. and Livny, M. (2005) Distributed Computing in Practice: The Condor Experience. Concurrency and Computation: Practice and Experience, 17, 323-356. [Google Scholar] [CrossRef
[44] Banker, K., Garrett, D., Bakkum, P. and Verch, S. (2016) Mongodb in Action: Covers Mongodb Version 3.0. Simon & Schuster, New York.
[45] Barrett, C.S. (1956) X-Ray Study of the Alkali Metals at Low Temperatures. Acta Crystallographica, 9, 671-677. [Google Scholar] [CrossRef
[46] Trucano, P. and Chen, R. (1975) Structure of Graphite by Neutron Diffraction. Nature, 258, 136-137. [Google Scholar] [CrossRef
[47] Raynor, G. and Hume-Rothery, W. (1939) A Technique for the X-Ray Powder Photography of Reactive Metals and Alloys with Special Reference to the Lattice Spacing of mg at High Tem-peratures. Journal of the Institute of Metals, 65, 477-485.
[48] Wu, H., Hartman, M., Udovic, T., Rush, J., Zhou, W., Bowman, R. and Vajo, J. (2007) Structure of the Novel Ternary Hydrides Li4Tt2D (Tt =Si and Ge). Acta Crystallo-graphica. Section B, Structural Science, B63, 63-68. [Google Scholar] [CrossRef
[49] Elliot, A. (2010) Structure of Pyrrhotite 5C (Fe9S10) Acta Crys-tallographica. Section B, Structural Science, B66, 271-279. [Google Scholar] [CrossRef
[50] H. d’Amour, W.D. and Schulz, H. (1979) Structure Determination of α-Quartz up to 68 x 108 Pa. Acta Crystallographica. Section B, Structural Science, B35, 550-555. [Google Scholar] [CrossRef
[51] Kuwayama, Y., Hirose, K., Sata, N. and Ohishi, Y. (2005) The Pyrite-Type High-Pressure Form of Silica. Science, 309, 923-925. [Google Scholar] [CrossRef] [PubMed]
[52] Irving, P.E. and Beevers, C.J. (1971) Some Metallographie and Lat-tice Parameter Observations on Titanium Hydride. Metallurgical Transactions, 2, 613-615. [Google Scholar] [CrossRef
[53] Pauling, L. and Hendricks, S.B. (1925) The Crystal Structures of Hema-tite and Corundum. Journal of the American Chemical Society, 47, 781-790. [Google Scholar] [CrossRef
[54] Duan, W., Wentzcovitch, R.M. and Thomson, K.T. (1998) First-Principles Study of High Pressure Alumina Polymorphs. Physical Review B, 57, 10363-10369. [Google Scholar] [CrossRef
[55] Oganov, A.R. and Ono, S. (2004) Theoretical and Experimental Evidence for a Post-Perovskite Phase of MgSiO3 in Earth’s D’’ Layer. Nature, 430, 445-448. [Google Scholar] [CrossRef] [PubMed]
[56] Fang, C.M. and de With, G. (2002) Crystal Structure and Chemical Bonding of the High-Pressure Phase of mgal2o4 from First-Principles Calculations. Philosophical Magazine A, 82, 2885-2894. [Google Scholar] [CrossRef