极限学习机和一类深度学习方法的对比研究
A Comparative Study of Extreme Learning Machine and a Class of Deep Learning Methods
摘要: 本文针对神经网络在微分方程求解中的先验估计问题,将极限学习机(Extreme Learning Machine, ELM)嵌入粒子群优化算法和一类使用梯度更新的深度学习方法进行比较,将他们应用于一类微分方程求解并进行对比。与使用梯度信息优化损失函数的深度学习方法不同,ELM使用Moore-Penrose广义逆替代了梯度信息更新权重以达到损失函数最小化。本文通过实验证明了ELM对于方程的区间没有要求,训练时间短,但劣势是若解是非连续的,对间断点类型有要求等,而此类深度学习算法加深了网络层数,优势是学习到更抽象的特征,且解可以是非连续的,劣势对方程区间有较严格要求,否则梯度消失,并通过实验结果进行了验证,且训练时间长此类深度学习算法有时会伴随着一些剪枝算法压缩网络。本文的结论可以用于先验的算法选择,提高求解效率。
Abstract: In this paper, aiming at the priori estimation problem of neural network in differential equation solving, the Extreme Learning Machine (ELM) was embedded in particle swarm optimization algo-rithm and a class of deep learning algorithms using gradient update are compared, and they are applied to a class of differential equation solving and compared. Unlike the deep learning algo-rithms using gradient information to optimize the loss function, ELM used Moore-Penrose gener-alized inverse instead of gradient information to update the weight to minimize the loss function. This paper proved through experiments that ELM has no requirements for the interval of the equation, and the training time is short, but the disadvantages are that if the solution is discon-tinuous, it has requirements for the type of breakpoints, etc. This kind of deep learning algorithm deepens the number of network layers, and the advantages are learning more abstract features, and the solution can be discontinuous. While the disadvantages are strict requirements for the in-terval of the equation, otherwise the gradient is disappear, which is verified by the experimental results, and the training time is long. Such deep learning algorithms sometimes are accompanied by some pruning algorithms to compress the network. The conclusion of this paper can be used for prior algorithm selection to improve the efficiency of solution.
文章引用:徐翊铭, 范馨月. 极限学习机和一类深度学习方法的对比研究[J]. 应用数学进展, 2022, 11(12): 8740-8749. https://doi.org/10.12677/AAM.2022.1112921

参考文献

[1] Milligen, B.V., Tribaldos, V. and Jiménez, J.A. (1995) Neural Network Differential Equation and Plasma Equilibrium Solver. Physical Review Letters, 75, 3594-3597. [Google Scholar] [CrossRef
[2] Kim, S., Ji, W., Deng, S., Ma, Y. and Rackauckas, C. (2021) Stiff Neural Ordinary Differential Equations. Chaos: An Interdisciplinary Journal of Nonlinear Science, 31, Article No. 093122. [Google Scholar] [CrossRef] [PubMed]
[3] Khoo, Y., Lu, J. and Ying, L. (2021) Solving Parametric PDE Problems with Artificial Neural Networks. European Journal of Applied Mathematics, 32, 421-435. [Google Scholar] [CrossRef
[4] Chaharborj, S.S. and Yaghoub, M. (2017) Study of Fractional Order Integro-Differential Equations by Using Chebyshev Neural Network. Journal of Mathematics and Statistics, 13, 1-13. [Google Scholar] [CrossRef
[5] Liu, Y., Kutz, J.N. and Brunton, S.L. (2022) Hierarchical Deep Learning of Multiscale Differential Equation Time- Steppers. Philosophical Transactions of the Royal Society A, 380, Article ID: 20210200. [Google Scholar] [CrossRef] [PubMed]
[6] Mo, Y., Ling, L. and Zeng, D. (2022) Data-Driven Vector Soliton Solutions of Coupled Nonlinear Schrödinger Equation Using a Deep Learning Algorithm. Physics Letters A, 421, Article ID: 127739. [Google Scholar] [CrossRef
[7] Yang, Y., Hou, M., Sun, H., et al. (2020) Neural Network Algorithm Based on Legendre Improved Extreme Learning Machine for Solving Elliptic Partial Differential Equations. Soft Computing, 24, 1083-1096. [Google Scholar] [CrossRef
[8] Lu, Y., Chen, G., Yin, Q., Sun, H. and Hou, M. (2020) Solving the Ruin Probabilities of Some Risk Models with Legendre Neural Network Algorithm. Digital Signal Processing, 99, Article ID: 102634. [Google Scholar] [CrossRef
[9] Huang, G.B., Zhu, Q.Y. and Siew, C.K. (2006) Extreme Learning Machine: Theory and Applications. Neurocomputing, 70, 489-501. [Google Scholar] [CrossRef
[10] Yang, Y., Hou, M., Luo, J. and Tian, Z. (2020) Numerical Solution of Several Kinds of Differential Equations Using Block Neural Network Method with Improved Extreme Learning Machine Algorithm. Journal of Intelligent & Fuzzy Systems, 38, 3445-3461. [Google Scholar] [CrossRef
[11] Chen, Y., Yi, C., Xie, X., Hou, M. and Cheng, Y. (2020) Solution of Ruin Probability for Continuous Time Model Based on Block Trigonometric Exponential Neural Network. Symmetry, 12, Article No. 876. [Google Scholar] [CrossRef
[12] Raissi, M., Perdikaris, P. and Karniadakis, G.E. (2017) Physics In-formed Deep Learning (Part I): Data-Driven Solutions of Nonlinear Partial Diferential Equations. ArXiv: 1711.10561.
[13] Raissi, M., Perdikaris, P. and Karniadakis, G.E. (2017) Physics Informed Deep Learning (Part II): Data-Driven Discovery of Nonlinear Partial Differential Equations. ArXiv: 1711.10566.
[14] Lu, L., Meng, X., Mao, Z. and Karniadakis, G.E. (2021) DeepXDE: A Deep Learning Library for Solving Differential Equations. SIAM Review, 63, 208-228. [Google Scholar] [CrossRef
[15] Wang, L. and Yan, Z. (2021) Data-Driven Rogue Waves and Parameter Discovery in the Defocusing Nonlinear Schrödinger Equation with a Potential Using the PINN Deep Learning. Physics Letters A, 404, Article ID: 127408. [Google Scholar] [CrossRef
[16] Chen, J., Du, R. and Wu, K. (2020) A Comparison Study of Deep Galerkin Method and Deep Ritz Method for Elliptic Problems with Different Boundary Conditions. Communica-tions in Mathematical Research, 36, 354-376. [Google Scholar] [CrossRef
[17] Baydin, A.G., Pearlmutter, B.A., Radul, A.A. and Siskind, J.M. (2015) Automatic Differentiation in Machine Learning: A Survey.
[18] Zhou, T., Liu, X., Hou, M. and Liu, C. (2019) Numerical Solution for Ruin Probability of Continuous Time Model Based on Neural Network Algorithm. Neurocomputing, 331, 67-76. [Google Scholar] [CrossRef
[19] Sengupta, S., Basak, S., Peters, R.A. (2018) Particle Swarm Optimization: A Survey of Historical and Recent Developments with Hybridization Per-spectives. Machine Learning and Knowledge Extraction, 1, 157-191. [Google Scholar] [CrossRef
[20] Kingma, D.P. and Ba, J. (2014) Adam: A Method for Stochastic Op-timization. Computer Science.
[21] You, H., Guo, J. and Jiang, J. (2019) Interval Estimation of the Ruin Probability in the Classical Compound Poisson Risk Model. Computational Statistics & Data Analysis, 144, Article ID: 106890. [Google Scholar] [CrossRef
[22] Wang, G. (2001) A Decomposition of the Ruin Probability for the Risk Process Perturbed by Diffusion. Insurance: Mathematics and Economics, 28, 49-59. [Google Scholar] [CrossRef
[23] Mao, Z., Jagtap, A.D. and Karniadakis, G.E. (2020) Phys-ics-Informed Neural Networks for High-Speed Flows. Computer Methods in Applied Mechanics and Engineering, 360, Article ID: 112789. [Google Scholar] [CrossRef