层次聚类和长短期记忆网络(LSTM)混合机器学习模型的数据资产估值模型
A Data Asset Valuation Model of a Hybrid Machine Learning Model of Hierarchical Clustering and Long- and Short-Term Memory Networks
DOI: 10.12677/csa.2025.156156, PDF,   
作者: 彭守斌:上海大学智慧城市研究院,上海;上海荟宸信息科技有限公司研究中心,上海;杨学军:上海荟宸信息科技有限公司研究中心,上海
关键词: 层次聚类长短期记忆网络(LSTM)AgglomerativeClustering (层次聚类算法)门控机制(LSTM)梯度消失/爆炸(RNN缺陷)泛化能力经典估值模型(成本法/市场法/收益法)单一估值模型(K-Means/MLP/RNN)Hierarchical Clustering Long and Short Term Memory Network AgglomerativeClustering (Hierarchical Clustering Algorithm) Gated Mechanism (LSTM) Gradient Disappearance/Explosion (RNN Defect) Generalization Ability Classical Valuation Model (Cost Method/Market Method/Benefit Method) Single Valuation Model (K-Means/MLP/RNN)
摘要: 研究数据资产估值在宏观层面能为数字经济发展、资源配置优化、行业规范和国家竞争力提升等方面提供支撑,对推动社会经济数字化转型意义重大。在数据资产估值过程中,会遇到诸多复杂问题,如数据异质性问题、时间序列特征挖掘问题、数据维度高和复杂性问题、缺乏通用估值标准问题和突发外部事件影响问题。层次聚类和长短期记忆网络(LSTM)混合机器学习模型将层次聚类划分异质数据成簇,LSTM挖掘各簇时间序列特征,应对高维复杂数据,结合制定估值标准,快速适应突发变化;可有效应对以上诸多复杂问题。
Abstract: Research on data asset valuation can support the development of the digital economy, optimize resource allocation, standardize industries, and enhance national competitiveness at a macro level, playing a significant role in promoting the digital transformation of society and the economy. In the process of data asset valuation, numerous complex issues arise, such as data heterogeneity, time series feature mining, high-dimensional and complex data, lack of universal valuation standards, and the impact of sudden external events. A hybrid machine learning model combining hierarchical clustering and long short-term memory (LSTM) clusters heterogeneous data into groups through hierarchical clustering, and LSTM mines temporal features within each cluster to handle high-dimensional and complex data. By formulating valuation standards, it quickly adapts to sudden changes, effectively addressing these various complex issues.
文章引用:彭守斌, 杨学军. 层次聚类和长短期记忆网络(LSTM)混合机器学习模型的数据资产估值模型[J]. 计算机科学与应用, 2025, 15(6): 45-55. https://doi.org/10.12677/csa.2025.156156

参考文献

[1] International Valuation Standards Council (IVSC) (2020) Valuation of Data and Intangible Assets.
https://www.markables.net/international-valuation-standard-for-intangible-assets-ivs-210
[2] European Commission (2020) Guidance on the Regulation on a Framework for the Free Flow of Non-Personal Data in the European Union.
https://ec.europa.eu/commission/presscorner/detail/en/ip_19_2749
[3] (2019) ISO/IEC 20546:2019. Information Technology-Big Data-Overview and Vocabulary.
https://www.en-standard.eu/bs-iso-iec-20546-2019-information-technology-big-data-overview-and-vocabulary
[4] OECD (2019) Guidelines for Measuring the Value of Data as an Asset.
https://www.oecd.org/en/publications/measuring-the-digital-transformation_9789264311992-en.html
[5] Goodfellow, I., Bengio, Y. and Courville, A. (2016) Deep Learning. MIT Press.
[6] Hochreiter, S. and Schmidhuber, J. (1997) Long Short-Term Memory. Neural Computation, 9, 1735-1780. [Google Scholar] [CrossRef] [PubMed]
[7] Murtagh, F. and Legendre, P. (2014) Ward’s Hierarchical Agglomerative Clustering Method: Which Algorithms Implement Ward’s Criterion? Journal of Classification, 31, 274-295. [Google Scholar] [CrossRef
[8] Taleb, N.N. (2007) The Black Swan: The Impact of the Highly Improbable. Random House.
[9] Gers, F.A., Schmidhuber, J. and Cummins, F. (2000) Learning to Forget: Continual Prediction with LSTM. Neural Computation, 12, 2451-2471. [Google Scholar] [CrossRef] [PubMed]
[10] Zhang, T., Ramakrishnan, R. and Livny, M. (1996) BIRCH: An Efficient Data Clustering Method for Very Large Databases. ACM SIGMOD Record, 25, 103-114. [Google Scholar] [CrossRef
[11] Zhang, G.P. (2003) Time Series Forecasting Using a Hybrid ARIMA and Neural Network Model. Neurocomputing, 50, 159-175. [Google Scholar] [CrossRef
[12] Krizhevsky, A., Sutskever, I. and Hinton, G.E. (2012) ImageNet Classification with Deep Convolutional Neural Networks.
https://papers.nips.cc/paper_files/paper/2012/hash/c399862d3b9d6b76c8436e924a68c45b-Abstract.html
[13] Hastie, T., Tibshirani, R. and Friedman, J. (2009) The Elements of Statistical Learning. 2nd Edition, Springer.
[14] Chen, T. and Guestrin, C. (2016) XGBoost: A Scalable Tree Boosting System. Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Francisco, 13-17 August 2016, 785-794. [Google Scholar] [CrossRef
[15] Raschka, S. and Mirjalili, V. (2019) Python Machine Learning. 3rd Edition, Packt Publishing.
[16] Chollet, F. (2021) Deep Learning with Python. 2nd Edition, Manning Publications.
[17] Han, J., Kamber, M. and Pei, J. (2011) Data Mining: Concepts and Techniques. 3rd Edition, Morgan Kaufmann.
[18] Shiller, R.J. (2015) Irrational Exuberance. 3rd Edition, Princeton University Press.