一种结合LATCH与多索引哈希的闭环检测方法
Loop Closure Detection Method Combining LATCH and Multi Index Hashing
摘要: RatSLAM是一个受海马体启发的同时定位与地图构建(SLAM)解决方案。针对RatSLAM由于一般的视觉处理方法而在复杂的室内环境中表现不佳的情况。本文首先提出了一种改进的RatSLAM,这是一种基于特征的单目SLAM系统,可以实时运行。其次,本文引入了多索引哈希(MIH)算法和三块区域代码(LATCH)的学习排列,用于评估当前场景与先前场景的相似性。最后,通过贝叶斯滤波和滑动窗口筛选出最相似的视觉模板,完成回环检测(LCD)。实验结果表明,改进后的系统在室内环境中的位置识别精度更高,体验地图的构建性能也更好,最大召回率达到60%。
Abstract: RatSLAM is a simultaneous localization and mapping (SLAM) solution inspired by the hippocampus. Regarding the poor performance of RatSLAM in complex indoor environments due to general visual processing methods. This article first proposes an improved RatSLAM, which is a feature-based monocular SLAM system that can run in real-time. Secondly, this article introduces the Multi Index Hash (MIH) algorithm and the Learning Arrangement of Three Block Region Code (LATCH) to evaluate the similarity between the current scene and the previous scene. Finally, the most similar visual template is selected through Bayesian filtering and sliding window filtering to complete loop closure detection (LCD). The experimental results show that the improved system has higher position recognition accuracy in indoor environments, better performance in constructing experiential maps, and a maximum recall rate of 60%.
文章引用:李瑞华. 一种结合LATCH与多索引哈希的闭环检测方法[J]. 建模与仿真, 2025, 14(2): 417-424. https://doi.org/10.12677/mos.2025.142163

参考文献

[1] 章陈常, 朱华, 由韶泽. 基于视觉的同时定位与地图构建的研究进展[J]. 计算机应用研究, 2018, 35(3): 641-647.
[2] 黄凤堂. 基于视觉惯性SLAM的自动驾驶车辆导航算法研究[D]: [硕士学位论文]. 桂林: 桂林电子科技大学, 2023.
[3] Milford, M.J., Wyeth, G.F. and Prasser, D. (2004) RatSLAM: A Hippocampal Model for Simultaneous Localization and Mapping. IEEE International Conference on Robotics and Automation, 2004. Proceedings. ICRA’04. 2004, New Orleans, 26 April-1 May 2004, 403-408. [Google Scholar] [CrossRef
[4] Milford, M.J. and Wyeth, G.F. (2008) Mapping a Suburb with a Single Camera Using a Biologically Inspired SLAM System. IEEE Transactions on Robotics, 24, 1038-1053. [Google Scholar] [CrossRef
[5] 李昂, 阮晓钢, 黄静, 等. 融合卷积神经网络与VLAD的闭环检测方法[J]. 计算机应用与软件, 2021, 38(1): 135-142.
[6] Rublee, E., Rabaud, V., Konolige, K. and Bradski, G. (2011) ORB: An Efficient Alternative to SIFT or Surf. 2011 International Conference on Computer Vision, Barcelona, 6-13 November 2011, 2564-2571. [Google Scholar] [CrossRef
[7] Levi, G. and Hassner, T. (2016) LATCH: Learned Arrangements of Three Patch Codes. 2016 IEEE Winter Conference on Applications of Computer Vision (WACV), Lake Placid, 7-10 March 2016, 1-9. [Google Scholar] [CrossRef
[8] Lowe, D.G. (2004) Distinctive Image Features from Scale-Invariant Keypoints. International Journal of Computer Vision, 60, 91-110. [Google Scholar] [CrossRef
[9] Greene, D., Parnas, M. and Yao, F. (1994) Multi-Index Hashing for Information Retrieval. Proceedings 35th Annual Symposium on Foundations of Computer Science, Santa Fe, 20-22 November 1994, 722-731. [Google Scholar] [CrossRef
[10] O’Keefe, J. and Conway, D.H. (1978) Hippocampal Place Units in the Freely Moving Rat: Why They Fire Where They Fire. Experimental Brain Research, 31, 573-590. [Google Scholar] [CrossRef] [PubMed]
[11] Ranck, J.B. (1985) Head Direction Cells in the Deep Cell Layer of Dorsolateral Pre-Subiculum in Freely Moving Rats. Electrical Activity of the Archicortex.
[12] Ball, D., Heath, S., Wiles, J., Wyeth, G., Corke, P. and Milford, M. (2013) Openratslam: An Open Source Brain-Based SLAM System. Autonomous Robots, 34, 149-176. [Google Scholar] [CrossRef
[13] Zhou, S., Yan, R., Li, J., Chen, Y. and Tang, H. (2017) A Brain-Inspired SLAM System Based on ORB Features. International Journal of Automation and Computing, 14, 564-575. [Google Scholar] [CrossRef
[14] Galvez-Lopez, D. and Tardos, J.D. (2011) Real-Time Loop Detection with Bags of Binary Words. 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems, San Francisco, 25-30 September 2011, 51-58. [Google Scholar] [CrossRef
[15] Garcia-Fidalgo, E. and Ortiz, A. (2018) iBoW-LCD: An Appearance-Based Loop Closure Detection Approach Using Incremental Bags of Binary Words. IEEE Robotics and Automation Letters, 3, 3051-3057.
[16] Angeli, A., Doncieux, S., Meyer, J. and Filliat, D. (2009) Visual topological SLAM and global localization. 2009 IEEE International Conference on Robotics and Automation, Kobe, 12-17 May 2009, 4300-4305. [Google Scholar] [CrossRef
[17] Shahbazi, H. and Zhang, H. (2011) Application of Locality Sensitive Hashing to Realtime Loop Closure Detection. 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems, San Francisco, 25-30 September 2011, 1228-1233. [Google Scholar] [CrossRef
[18] Gionis, A., Indyk, P., Motwani, R., et al. (1999) Similarity Search in High Dimensions via Hashing. Proceedings of the 25th International Conference on Very Large Data Bases, Edinburgh, 7-10 September 1999, 518-529.
[19] Han, L. and Fang, L. (2017) MILD: Multi-Index Hashing for Appearance Based Loop Closure Detection. 2017 IEEE International Conference on Multimedia and Expo (ICME), Hong Kong, 10-14 July 2017, 139-144. [Google Scholar] [CrossRef
[20] 魏俐华, 陈刚. 考虑信任度的Pythagorean 模糊两阶段交互多属性群决策方法[J]. 计算机应用研究, 2021, 38(10): 2954-2960.