基于深度学习的地址分割算法在电力业务辅助录入系统上的应用研究
Research on the Application of Address Seg-mentation Algorithm Based on Deep Learning in Power Service Auxiliary Input System
DOI: 10.12677/CSA.2023.139164, PDF,    科研立项经费支持
作者: 郭凤婵, 吴毅良*, 罗序良, 陆庭辉, 刘翠媚:广东电网有限责任公司江门供电局,广东 江门
关键词: 深度学习地址分割辅助录入LSTM网络Deep Learning Address Segmentation Auxiliary Input LSTM Network
摘要: 现代电力行业中的信息系统通常需要对地址信息进行特定的格式处理以提高数据的统计与分析能力。本文提出一种基于深度学习的地址分割算法,对地址信息按行政区级进行分割。采用ELU激活层对LSTM网络进行优化以提高网络的整体性能,另外采用GAN网络对数据集进行了增强,进一步降低了训练过程中过拟合情况的发生。实验结果表明,算法对于地址数据的分割平均准确率达到99%,平均运算时间为0.1秒,满足辅助录入系统需求。算法利用分割地址信息关联对应的供电局(所),有效提升了电力业务的办理效率,具有较好的应用前景。
Abstract: Information systems in the modern power industry often require specific formatting of address information to improve data analysis and statistics capabilities. This paper proposes a deep learning-based address segmentation algorithm that separates address information into administrative divisions. The LSTM network is optimized using the ELU activation layer to enhance overall network performance. Additionally, GAN networks are used to augment the dataset, further reducing over-fitting during the training process. Experimental results show that the algorithm achieves an average accuracy of 99% in address segmentation with an average computation time of 0.1 seconds, meeting the requirements of auxiliary input systems. The algorithm utilizes the segmentation of address information to associate it with the corresponding power supply bureaus, thereby further enhancing the efficiency of electricity service processing. It has great potential for application in the power industry.
文章引用:郭凤婵, 吴毅良, 罗序良, 陆庭辉, 刘翠媚. 基于深度学习的地址分割算法在电力业务辅助录入系统上的应用研究[J]. 计算机科学与应用, 2023, 13(9): 1655-1664. https://doi.org/10.12677/CSA.2023.139164

参考文献

[1] Sarawagi, S. and Cohen, W. (2004) Semi-Markov Conditional Random Fields for Information Extraction. Proceedings of the 20th Conference on Uncertainty in Artificial Intelligence, Banff, 7-11 July 2004, 344-351.
[2] Navarro, G. (2000) A Guided Tour to Approximate String Matching. ACM Computing Surveys, 33, 31-88. [Google Scholar] [CrossRef
[3] Kipf, T.N. and Welling, M. (2016) Semi-Supervised Classification with Graph Convolutional Networks. arXiv preprint arXiv:1609.02907. [Google Scholar] [CrossRef
[4] Lafferty, J., McCallum, A. and Pereira, F. (2001) Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data. Proceedings of the 18th International Conference on Machine Learning (ICML), Berkshires, Massachusetts, 28 June-1 July 2001, 282-289.
[5] 何晓明, 洪亲, 蔡坚勇, 等. 基于n-gram中英文字符串分割算法实现[J]. 电脑知识与技术, 2012, 8(23): 5530-5533. [Google Scholar] [CrossRef
[6] Yin, W., Kann, K., Yu, M., et al. (2017) Comparative Study of CNN and RNN for Natural Language Processing. arXiv preprint arXiv: 1702.01923. [Google Scholar] [CrossRef
[7] Tarwani, K.M. and Edem, S. (2017) Survey on Recurrent Neural Network in Natural Language Processing. International Journal of Engineering Trends and Technology (IJETT), 48, 301-304. [Google Scholar] [CrossRef
[8] Hochreiter, S. and Schmidhuber, J. (1997) Long Short-Term Memory. Neural Computation, 9, 1735-1780. [Google Scholar] [CrossRef] [PubMed]
[9] Cho, K., Van Merriënboer, B., Gulcehre, C., Bahdanau, D., Bougares, F., Schwenk, H. and Bengio, Y. (2014) Learning Phrase Representations Using RNN Encoder-Decoder for Statistical Machine Translation. arXiv preprint arXiv: 1406.1078. [Google Scholar] [CrossRef
[10] Goodfellow, I., Pouget-Abadie, J., Mirza, M., Xu, B., Warde-Farley, D., Ozair, S. and Bengio, Y. (2014) Generative Adversarial Nets. Advances in neural information pro-cessing systems (NIPS), Montreal, 8-13 December 2014, 2672-2680.
[11] Vaswani, A., Shazeer, N., Parmar, N., Usz-koreit, J., Jones, L., Gomez, A.N. and Polosukhin, I. (2017) Attention is All You Need. Advances in Neural Information Processing Systems (Neur IPS), Long Beach, 4-9 December 2017, 5998-6008.
[12] He, K., Zhang, X., Ren, S. and Sun, J. (2016) Deep Residual Learning for Image Recognition. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, 27-30 June 2016, 770-778.
[13] Pan, S.J. and Yang, Q. (2009) A Survey on Transfer Learning. IEEE Transactions on Knowledge and Data Engineering, 22, 1345-1359. [Google Scholar] [CrossRef
[14] Chowdhury, G.G. (2003) Natural Language Processing. Annual Re-view of Information Science and Technology (ARIST), 37, 51-89. [Google Scholar] [CrossRef
[15] Simard, P.Y., Steinkraus, D. and Platt, J.C. (2003) Best Practices for Convolutional Neural Networks Applied to Visual Document Analysis. 7th International Conference on Document Analysis and Recognition (ICDAR 2003), Edinburgh, 6 August 2003, 958-962. [Google Scholar] [CrossRef