基于单目视觉的高精度三维场景重建技术研究
High Precision 3D Scene Reconstruction Based on Monocular Vision
摘要: 近年来,随着计算机硬件的不断快速更新,计算机的处理能力也不断变强。同时场景三维模型的获取技术越来越成熟,我们获得场景的三维模型数据的方式更多也更加方便了。而目前在基于单目和双目的三维重建技术中,单目技术较双目操作简单、而且取材方面更有利于推向市场。本文主要讲述基于单相机的三维重建,然后通过基于累积图的快速NCC匹配的种子扩张算法来进行高精度的三维场景重建。本章对经典的NCC相似度量函数进行优化,以此减少计算时间。而种子像素扩张算法即先选择初始的种子像素,利用视差图进行窗口比较从而获得高置信度的种子像素,因此大大降低了视差图的误匹配点。试验表明,该方法能够得到高质量的三维场景模型。
Abstract: In recent years, along with the rapid updating of computer hardware, the processing capability of computer is also increasing. At the same time, 3d scene reconstruction technique has become more and more mature and we can get 3d model data for scenarios more easily than ever before. Now, in the 3d reconstruction technology based on monocular and binocular, monocular technology is simpler to operate than binocular technology and more convenient to acquire materials and more favorable to the market. This paper focuses on monocular based 3D reconstruction, the algorithm is used to reconstruct the 3d scene with a fast NCC algorithm based on the cumulative diagram. This paper improves the classic NCC similarity measures to reduce the computation time. Seed pixel expansion algorithm is presented to choose the initial seed pixels, use parallax to make window comparisons to obtain high confidence seed pixels, therefore, the mismatches of the parallax figure are greatly reduced. Experiments show that the method can reconstruct precise and clear 3d scenarios.
文章引用:金家梁, 朱孟飞, 姚拓中, 宋加涛. 基于单目视觉的高精度三维场景重建技术研究[J]. 人工智能与机器人研究, 2018, 7(3): 112-121. https://doi.org/10.12677/AIRR.2018.73013

参考文献

[1] Cyberware (2008) Cyberware Web Page.
http://www.cyberware.com
[2] Konica (2008) Konica Minolta 3-D Digitizer Web Page.
http://www.minolta3d.com
[3] Scharstein, D. and Szeliski, R. (2003) High-Accuracy Stereo Depth Maps Using Structured Light. 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Madison, WI, 18-20 June 2003, 195-202. [Google Scholar] [CrossRef
[4] Neugebauer, P. and Klein, K. (1999) Texturing 3D Models of Real World Objects from Multiple Unregistered Photographic Views. Eurographics, 18, 245-256.
[5] Scharstein, D. and Szeliski, R. (2002) A Taxonomy and Evaluation of Dense Two-Frame Stereo Correspondence Algorithms. International Journal of Computer Vision, 47, 7-42. [Google Scholar] [CrossRef
[6] Ogale, A. and Aloimonos, Y. (2007) A Roadmap to the Integration of Early Visual Modules. International Journal of Computer Vision, 72, 9-25. [Google Scholar] [CrossRef
[7] Pollefeys, M., Van Gool, L. and Vergauwen, M. (2004) Visual Modeling with a Hand-Held Camera. International Journal of Computer Vision, 59, 207-232. [Google Scholar] [CrossRef
[8] Snavely, N., Seitz, S.M. and Szeliski, R. (2008) Modeling the World from Internet Photo Collections. International Journal of Computer Vision, 80, 189-210. [Google Scholar] [CrossRef
[9] Geiger, A., Ziegler, J. and Stiller, C. (2011) StereoScan: Dense 3d Reconstruc-tion in Real-Time. Intelligent Vehicles Symposium, Baden-Baden, 5-9 June 2011, 963-968. [Google Scholar] [CrossRef
[10] Shen, S.H. (2013) Accurate Multiple View 3D Reconstruction Using Patch-Based Stereo for Large-Scale Scenes. IEEE Transactions on Image Processing, 22, 1901-1914. [Google Scholar] [CrossRef
[11] Newcombe, R.A., Izadi, S. and Hilliges, O. (2013) Kinect Fusion: Real-Time Dense Surface Mapping and Tracking. Mixed and Augmented Reality, 233, 430-436.
[12] Zhang, Z. (2000) A Flexible New Technique for Camera Calibration. IEEE Transactions on Pattern Analysis and Machine Intelligence, 22, 1330-1334. [Google Scholar] [CrossRef
[13] Lowe, D. (2004) Distinctive Image Features from Scale-Invariant Key points. Interna-tional Journal of Computer Vision, 60, 91-110. [Google Scholar] [CrossRef
[14] Fischler, M. and Bolles, R. (1981) Random Sample Consensus: A Paradigm for Model Fitting with Applications to Image Analysis and Automated Cartography. Communications, 24, 381-395. [Google Scholar] [CrossRef