基于主导方向校正的无人机航拍图像匹配算法研究
Research on UAV Aerial Image Matching Algorithm Based on Dominant Direction Correction
DOI: 10.12677/airr.2026.152038, PDF,   
作者: 张新生:中国人民解放军93160部队,北京;李伟幸:航天恒星科技有限公司,北京;空间信息体系与融合应用全国重点实验室,北京
关键词: 图像匹配深度学习旋转不变性主导方向校正匹配增强Image Matching Deep Learning Rotation Invariance Dominant Direction Correction Matching Enhancement
摘要: 图像匹配是导航定位、地图构建、三维场景重建等计算机视觉下游任务的核心支撑,但大角度旋转等几何变换常导致其匹配性能显著退化。当前主流深度学习匹配模型,由于特征表示环节缺乏显式旋转不变性设计,在未知的大角度旋转场景中的匹配精度大幅下降。针对这一局限,本文提出一种基于主导方向校正的图像匹配算法IMA-DDC (Image Matching Algorithm Based on Dominant Direction Correction),该算法具有较强的旋转不变性,且模型推理实时性较高。在特征点匹配阶段,根据初始的特征匹配对,构建能够表示旋转特性的主导方向;同时在匹配推理环节嵌入旋转一致性约束,优化SuperGlue的匹配过程。实验结果显示,在构建的无人机航拍数据集上,本文提出的IMA-DDC方法在四种地形场景中均取得了最高的定位精度,且与对比方法存在显著性能优势。如在城镇地形中,相较SuperGlue和EfficientLoFtr,IMA-DDC分别提升5.98%和2.72%。该方法有效强化了现有匹配模型的旋转不变性,拓展了其在复杂几何变换场景中的应用可靠性。
Abstract: Image matching is the core support for downstream computer vision tasks such as navigation and positioning, map construction, and 3D scene reconstruction. However, geometric transformations like large-angle rotation often lead to significant degradation of its matching performance. Current mainstream deep learning matching models, due to the lack of explicit rotation invariance design in the feature representation stage, suffer a sharp drop in matching accuracy in unknown large-angle rotation scenarios. To address this limitation, this paper proposes an image matching algorithm based on dominant direction correction, namely IMA-DDC (Image Matching Algorithm Based on Dominant Direction Correction). This algorithm possesses strong rotation invariance and high real-time performance in model inference. In the feature point matching stage, a dominant direction that can characterize rotational properties is constructed based on initial feature matching pairs; meanwhile, rotation consistency constraints are embedded into the matching inference process to optimize the matching procedure of SuperGlue. Experimental results show that on the constructed UAV aerial image dataset, the proposed IMA-DDC method achieves the highest positioning accuracy in all four terrain scenarios and exhibits significant performance advantages over the comparison methods. For example, in urban terrain, compared with SuperGlue and EfficientLoFtr, IMA-DDC improves the accuracy by 5.98% and 2.72% respectively. This method effectively enhances the rotation invariance of existing matching models and expands their application reliability in complex geometric transformation scenarios.
文章引用:张新生, 李伟幸. 基于主导方向校正的无人机航拍图像匹配算法研究[J]. 人工智能与机器人研究, 2026, 15(2): 395-402. https://doi.org/10.12677/airr.2026.152038

参考文献

[1] 高双猛, 王诗薇, 胡俊伟. 异源图像匹配算法研究综述[J]. 计算机工程与应用, 1-23.
https://kns.cnki.net/kcms2/article/abstract?v=8witAuWEzbOkU3Ldrw99tF6qzM3pmtsIIiTKjLbd06D77t3KFCgOkzyHAii6Fm15vckofYe72g6UadIfoipedA1dEvhU1gdDFj7QIfMeLpoT2CyYnODNbh1ZXtdHR9UUS-_c3uTUschXOSobTiraWRyWaOWNmAqtSic9IA3PFYY=&uniplatform=NZKPT&language=CHS, 2025-11-21.
[2] 黄开基, 杨华. 基于深度学习特征的二维图像匹配算法综述[J]. 计算机工程, 2024, 50(10): 16-34.
[3] 谷美颖, 李航, 张家伟, 等. 基于视觉的无人机定位与导航方法研究综述[J]. 电子学报, 2025, 53(3): 651-685.
[4] DeTone, D., Malisiewicz, T. and Rabinovich, A. (2018) SuperPoint: Self-Supervised Interest Point Detection and Description. 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), Salt Lake City, 18-22 June 2018, 337-349. [Google Scholar] [CrossRef
[5] Dai, M., Hu, J., Zhuang, J. and Zheng, E. (2022) A Transformer-Based Feature Segmentation and Region Alignment Method for UAV-View Geo-Localization. IEEE Transactions on Circuits and Systems for Video Technology, 32, 4376-4389. [Google Scholar] [CrossRef
[6] Lee, J., Kim, B., Kim, S. and Cho, M. (2023) Learning Rotation-Equivariant Features for Visual Correspondence. 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Vancouver, 17-24 June 2023, 21887-21897. [Google Scholar] [CrossRef
[7] 冯浩臻. 亚像素精度快速视觉导航方法的研究与实现[D]: [硕士学位论文]. 大连: 大连理工大学, 2022.
[8] Sun, J.M., Shen, Z.H., Wang, Y., Bao, H. and Zhou, X. (2021) LoFTR: Detector-Free Local Feature Matching with Transformers. 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Nashville, 20-25 June 2021, 8918-8927. [Google Scholar] [CrossRef
[9] 刘庚辰, 姜梁, 吴国强, 等. 基于改进SuperPoint的空天异源图像匹配算法[J]. 电子学报, 2025, 53(4): 1201-1211.
[10] Wang, Y., He, X., Peng, S., Tan, D. and Zhou, X. (2024) Efficient LoFTR: Semi-Dense Local Feature Matching with Sparse-Like Speed. 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, 16-22 June 2024, 21666-21675. [Google Scholar] [CrossRef
[11] Edstedt, J., Sun, Q., Bökman, G., Wadenbäck, M. and Felsberg, M. (2024) RoMa: Robust Dense Feature Matching. 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, 16-22 June 2024, 19790-19800. [Google Scholar] [CrossRef
[12] 刘奇霏, 吴国强, 黄坤, 等. 空天影像匹配技术研究[J]. 遥测遥控, 2026, 47(1): 1-11.
[13] 陈娜, 白佳佳, 周祺胤, 等. 基于注意力与分层特征的图像特征点匹配算法[J]. 系统仿真学报, 2025, 37(11): 2839-2852.
[14] Liu, Y., Xia, C., Zhu, X. and Xu, S. (2022) Two-Stage Copy-Move Forgery Detection with Self Deep Matching and Proposal Superglue. IEEE Transactions on Image Processing, 31, 541-555. [Google Scholar] [CrossRef] [PubMed]
[15] 张永显, 薛源, 徐梦珍, 等. 融合多尺度学习型特征与注意力机制的多源遥感图像匹配[J]. 测绘科学技术学报, 2025, 41(2): 163-172.
[16] 杨云皓. 基于深度学习的大视角变换及大位移图像匹配算法研究[D]: [硕士学位论文]. 成都: 电子科技大学, 2025.
[17] 周嘉星, 陈志高, 高登巍, 等. 基于图像匹配的无人机侦察目标定位方法[J]. 弹箭与制导学报, 2025, 45(3): 287-294.
[18] 熊子恒, 张轩雄. 基于深度学习的红外与可见光图像匹配[J]. 电子技, 2025, 38(9): 1-8.