基于深度学习的通用本地图像检索系统设计

doi:10.12677/CSA.2024.141014

期刊菜单

基于深度学习的通用本地图像检索系统设计
Design of a General Local Image Retrieval System Based on Deep Learning

DOI: 10.12677/CSA.2024.141014, PDF,
作者: 张浩东, 田春岐：同济大学电子与信息工程学院，上海
关键词: 图像检索；深度学习；磁盘向量检索；检索方法；Image Retrieval； Deep Learning； Disk Vector Retrieval； Retrieval Measure

摘要: 随着大量数字图像数据的产生，高效准确的图像检索技术变得尤为重要。本文提出了一种结合深度学习和磁盘向量检索技术的通用本地图像检索系统，采用了深度神经网络模型作为特征提取的主要工具，通过深层网络结构捕获图像的高层语义信息，实现对图像内容的精细描述，旨在提升检索的准确性和效率，图像数据库的容量。由具体的实例数据验证说明了系统可用性，证明了其在实际应用中的广泛适用性，文中研究可对图像检索系统的进一步发展起到积极的参考作用。

Abstract: With the generation of a massive amount of digital image data, efficient and accurate image retriev-al technology has become particularly important. This paper proposes a universal local image re-trieval system that combines deep learning and disk vector retrieval technology, utilizing deep neural network models as the main tool for feature extraction. By capturing the high-level semantic information of images through deep network structures, the system achieves a fine- grained de-scription of image content, aiming to enhance the accuracy and efficiency of retrieval, as well as the capacity of the image database. The usability of the system is demonstrated through specific instance data, proving its wide applicability in practical applications. The research presented in this paper can play a positive role in the further development of image retrieval systems.

文章引用：张浩东, 田春岐. 基于深度学习的通用本地图像检索系统设计[J]. 计算机科学与应用, 2024, 14(1): 123-133. https://doi.org/10.12677/CSA.2024.141014

参考文献

[1]	Zheng, L., Yang, Y. and Tian, Q. (2017) SIFT Meets CNN: A Decade Survey of Instance Retrieval. IEEE Transactions on Pattern Analysis and Machine Intelligence, 40, 1224-1244. [Google Scholar] [CrossRef]
[2]	Csurka, G., et al. (2004) Visual Categorization with Bags of Keypoints. Workshop on Statistical Learning in Computer Vision, ECCV, 1. 1-22.
[3]	Hervé, J., et al. (2010) Aggre-gating Local Descriptors into a Compact Image Representation. 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, San Francisco, CA, USA, 13-18 June 2010. [Google Scholar] [CrossRef]
[4]	Perronnin, F., Sánchez, J. and Mensink, T. (2010) Improving the Fisher Kernel for Large-Scale Image Classification. In: Daniilidis, K., Maragos, P. and Paragios, N., eds., Computer Vision—ECCV 2010, Springer, Berlin, Heidelberg. [Google Scholar] [CrossRef]
[5]	Urdinez, F. and Cruz, A. (2021) R for Political Data Science: A Practical Guide. Chapman and Hall/CRC, Boca Raton, FL, 375-393.
[6]	Vaswani, A., et al. (2017) Attention Is All You Need. https://arxiv.org/abs/1706.03762
[7]	Dosovitskiy, A., et al. (2020) An Image Is Worth 16x16 Words: Trans-formers for Image Recognition at Scale. https://arxiv.org/abs/2010.11929
[8]	Bentley, J.L. (1975) Multidimensional Binary Search Trees Used for Asso-ciative Searching. Communications of the ACM, 18, 509-517. [Google Scholar] [CrossRef]
[9]	Omohundro, S.M. (1989) Five Balltree Construction Algorithms. In-ternational Computer Science Institute, Berkeley, 1-22.
[10]	Gionis, A., Indyk, P. and Motwani, R. (1999) Similarity Search in High Dimensions via Hashing. Very Large Data Bases Conference (VLDB), 99, 518-529.
[11]	Malkov, Y.A. and Yashunin, D.A. (2018) Efficient and Robust Approximate Nearest Neighbor Search Using Hierarchical Navigable Small World Graphs. IEEE Transactions on Pattern Analysis and Machine Intelligence, 42, 824-836. [Google Scholar] [CrossRef]
[12]	Subramanya, S.J., et al. (2019) DiskANN: Fast Accurate Bil-lion-Point Nearest Neighbor Search on a Single Node. 33rd Conference on Neural Information Processing Systems (NeurIPS 2019), Vancouver, Canada, 31 August 2020.
[13]	He, K.M., et al. (2016) Deep Residual Learning for Image Recognition. 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA, 27-30 June 2016. [Google Scholar] [CrossRef]
[14]	El-Nouby, A., et al. (2021) Training Vision Transformers for Image Retrieval. https://arxiv.org/abs/2102.05644

为你推荐

友情链接