深度学习在大数据分析中的应用研究
Research of Deep Learning Applications in Big Data Analytics
DOI: 10.12677/SEA.2022.113057, PDF,  被引量    科研立项经费支持
作者: 贾美娟, 李 欣, 孔 靓, 刘 春, 邵国强:大庆师范学院计算机科学与信息技术学院,黑龙江 大庆
关键词: 大数据深度学习数据表示语义索引区分任务Big Data Deep Learning Data Representation Semantic Index Discriminative Task
摘要: 深度学习算法通过分层学习过程提取高级、复杂的抽象作为数据表示。本文通过对现有深度学习在大数据分析中的所取得的成果,探讨了如何利用深度学习解决大数据分析中的一些典型问题,如从海量数据中提取复杂模式、语义索引、数据标记、快速信息检索及简化区分任务等问题,重点讨论的是在音视频方面的应用情况。最后,就目前研究存在的问题对未来相关工作提出见解。
Abstract: Deep Learning Algorithm extracts high-level and complex abstractions as data representation through hierarchical learning process. Based on the achievements of existing Deep Learning in big data analysis, this paper discusses how to use Deep Learning to solve some typical problems in big data analysis, such as extracting complex patterns from massive data, semantic indexing, data labeling, fast information retrieval and simplifying differentiated tasks. The focus is on the application in audio and video. Finally, some opinions on the future related work are put forward based on the problems existing in the current research.
文章引用:贾美娟, 李欣, 孔靓, 刘春, 邵国强. 深度学习在大数据分析中的应用研究[J]. 软件工程与应用, 2022, 11(3): 549-557. https://doi.org/10.12677/SEA.2022.113057

参考文献

[1] 袁波. 大数据领域的反垄断问题研究[D]: [博士学位论文]. 上海: 上海交通大学, 2019.
[2] 孟小峰, 慈祥. 大数据管理: 概念、技术与挑战[J]. 计算机研究与发展, 2013, 50(1): 146-169.
[3] Bengio, Y. (2020) Deep Learning of Representations: Looking Forward. In: Proceedings of the 1st International Conference on Statistical Language and Speech Processing, Springer, Tarragona, 1-37. [Google Scholar] [CrossRef
[4] 张菊, 郭永峰. 深度学习研究综述[J]. 教学研究, 2021, 44(3): 6-13.
[5] Bengio, Y. and LeCun, Y. (2007) Scaling Learning Algorithms towards AI. In: Bottou, L., Chapelle, O., DeCoste, D. and Weston, J., Eds., Large Scale Kernel Machines, MIT Press, Cambridge, Vol. 34, 321-360.
[6] Dumbill, E. (2012) What Is Big Data? An Introduction to the Big Data Landscape. O’Reilly Strata Making Data Work Conference, Santa Clara, 28 February-1 March 2012, 315-450.
[7] Garshol, L.M. (2013) Introduction to Big Data/Machine Learning. Online Slide Show. http://www.slideshare.net/larsgaintroduction-to-big-datamachine-learning
[8] Grobelnik, M. (2013) Big Data Tutorial. European Data Forum. https://european-big-data-value-forum.eu/
[9] Salton, G. and Buckley, C. (1988) Term-Weighting Approaches in Automatic Text Retrieval. Information Processing & Management, 24, 513-523. [Google Scholar] [CrossRef
[10] Robertson, S.E. and Walker, S. (1994) Some Simple Effective Approximations to the 2-Poisson Model for Probabilistic Weighted Retrieval. In: Proceedings of the 17th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Springer-Verlag, New York, 232-241. [Google Scholar] [CrossRef
[11] Hinton, G. and Salakhutdinov, R. (2011) Discovering Binary Codes for Documents by Learning Deep Generative Models. Topics in Cognitive Science, 3, 74-91. [Google Scholar] [CrossRef] [PubMed]
[12] 刘芳名, 张鸿. 基于多级语义的判别式跨模态哈希检索算法[J]. 计算机应用, 2021, 41(8): 48-56.
[13] Ranzato, M. and Szummer, M. (2008) Semi-Supervised Learning of Compact Document Representations with Deep Networks. In: Proceedings of the 25th International Conference on Machine Learning, ACM, New York, 792-799. [Google Scholar] [CrossRef
[14] Mikolov, T., Chen, K. and Dean, J. (2013) Efficient Estimation of Word Representations in Vector Space. CoRR: Computing Research Repository, 1-12.
[15] Dean, J., Corrado, G., Monga, R., et al. (2012) Large Scale Distributed Deep Networks. In: Bartlett, P., Pereira, F.C.N., Burges, C.J.C., Bottou, L. and Weinberger, K.Q., Eds., Advances in Neural Information Processing Systems, Vol. 25, Curran Associates, Inc., Red Hook, 1232-1240.
[16] Mikolov, T., Le, Q.V. and Sutskever, I. (2013) Exploiting Similarities among Languages for Machine Translation. CoRR: Comput Res Repository, 1-10.
[17] Li, G., Zhu, H., Cheng, G., Thambiratnam, K., et al. (2012) Context-Dependent Deep Neural Networks for Audio Indexing of Real-Life Data. IEEE Spoken Language Technology Workshop (SLT), Miami, 2-5 December 2012, 143-148. [Google Scholar] [CrossRef
[18] 马腾腾, 赵宇翔, 朱庆华. 国外移动视觉搜索产品的比较分析研究[J]. 图书馆杂志, 2016, 35(9): 12-18.
[19] 魏正曦, 邱玲, 赵攀. 基于灰度分类的图像搜索引擎[J]. 四川理工学院学报(自然科学版), 2021, 27(1): 68-77.
[20] Krizhevsky, A., Sutskever, I. and Hinton, G. (2012) Imagenet Classification with Deep Convolutional Neural Networks. In: Advances in Neural Information Processing Systems, Curran Associates, Inc., Red Hook, Vol. 25, 1106-1114.
[21] Hinton, G.E., Osindero, S. and The, Y.-W. (2006) A Fast Learning Algorithm for Deep Belief Nets. Neural Computation, 18, 1527-1554. [Google Scholar] [CrossRef] [PubMed]
[22] Hinton, G.E. and Salakhutdinov, R.R. (2006) Reducing the Dimensionality of Data with Neural Networks. Science, 313, 504-507. [Google Scholar] [CrossRef] [PubMed]
[23] Lee, H., Battle, A., Raina, R. and Ng, A. (2006) Efficient Sparse Coding Algorithms. In: Advances in Neural Information Processing Systems, MIT Press, Cambridge, 801-808.
[24] Le, Q., Ranzato, M., Monga, R., Devin, M., Chen, K., Corrado, G., Dean, J. and Ng, A. (2012) Building High-Level Features Using Large Scale Unsupervised Learning. Proceeding of the 29th International Conference on Machine Learning, Edinburgh, 26 June-1 July 2012, 8595-8598. [Google Scholar] [CrossRef
[25] 朱均安. 基于深度学习的视觉目标跟踪算法研究[D]: [博士学位论文]. 北京: 中国科学院大学(中国科学院长春光学精密机械与物理研究所), 2020.
[26] Socher, R., Lin, C.C., Ng, A. and Manning, C. (2011) Parsing Natural Scenes and Natural Language with Recursive Neural Networks. In: Proceedings of the 28th International Conference on Machine Learning, Omnipress, Madison, 129-136.
[27] Kumar, R., Talton, J.O., Ahmad, S. and Klemmer, S.R. (2012) Data-Driven Web Design. Proceedings of the 29th International Conference on Machine Learning, Edinburgh, 26 June-1 July 2012, 119-130.
[28] Le, Q.V., Zou, W.Y., Yeung, S.Y. and Ng, A.Y. (2011) Learning Hierarchical Invariant Spatio-Temporal Features for Action Recognition with Independent Subspace Analysis. IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Colorado, 20-25 June 2011, 3361-3368. [Google Scholar] [CrossRef