针对高亮物体的高效神经隐式表面重建
Efficient Neural Implicit Surface Reconstruction for Glossy Objects
摘要: 神经隐式表面(Neural Implicit Surface)一直是近年来计算机视觉的热门研究方向。许多工作通过扩展神经辐射场的体渲染管线实现了从二维图像和相机位姿作为输入,在无需三维监督信息下重建高质量三维物体形状。但是,由于利用二维图像进行训练监督,这些方法难以对高亮物体的形状进行合理的推理,其原因是因为物体材质和环境光照所产生的模糊性。本文提出一种针对高亮物体的高效表面重建方法,通过权重插值辐射场(Radiance Field)与反射场(Reflection Field)的方式使得可以更好地表达高亮物体的外观。同时,本文引入了一种渲染损失的方法来缓解高光反射带来的多视角不一致问题,并且引入了两种针对物体法向量的正则化来缓解混合神经场梯度噪声的问题。本工作通过渐进式的训练范式分别对三种数据集进行了实验,实验表明,本方法在多视角合成和高亮物体表面重建任务上都超越了基准模型,并且在训练速度上比基准模型快一个量级。
Abstract: Neural implicit surface has been a popular research direction in computer vision in recent years. Many approaches have extended the volume rendering pipeline of neural radiance field to reconstruct high quality 3D object shapes from 2D images and camera poses as input without any 3D supervision. However, due to the use of two-dimensional images for training supervision, it is difficult for these methods to rationally reason about the shape of glossy objects, because of the ambiguity caused by the material and environment lighting. In this paper, we propose an efficient surface reconstruction method specifically designed for glossy objects, which better represents the appearance of such objects through the interpolation of a weighted radiance field and reflection field. Additionally, we introduce a relax rendering loss to alleviate the issue of multi-view inconsistency caused by specular reflections, and two types of regularization for object normal to reduce the gradient noise of the hybrid neural field. Experiments on three datasets using a progressive training paradigm demonstrate that the proposed method outperforms baseline models in novel view synthesis and surface reconstruction tasks, while achieving training speeds approximately one order of magnitude faster than baseline models.
文章引用:何思源, 刘兴林. 针对高亮物体的高效神经隐式表面重建[J]. 计算机科学与应用, 2024, 14(5): 265-276. https://doi.org/10.12677/csa.2024.145135

参考文献

[1] Wang, P., Liu, L., Liu, Y., et al. (2021) Neus: Learning Neural Implicit Surfaces by Volume Rendering for Multi-View Reconstruction. Advances in Neural Information Processing Systems, 34, 27171-27183.
[2] Yariv, L., Gu, J., Kasten, Y., et al. (2021) Volume Rendering of Neural Implicit Surfaces. Advances in Neural Information Processing Systems, 34, 4805-4815.
[3] Oechsle, M., Peng, S. and Geiger, A. (2021) Unisurf: Unifying Neural Implicit Surfaces and Radiance Fields for Multi-View Reconstruction. Proceedings of the IEEE/CVF International Conference on Computer Vision, Montreal, 10-17 October 2021, 5589-5599. [Google Scholar] [CrossRef
[4] Yariv, L., Kasten, Y., Moran, D., et al. (2020) Multiview Neural Surface Reconstruction by Disentangling Geometry and Appearance. Advances in Neural Information Processing Systems, 33, 2492-2502.
[5] Verbin, D., Hedman, P., Mildenhall, B., et al. (2022) Ref-Nerf: Structured View-Dependent Appearance for Neural Radiance Fields. 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), New Orleans, 18-24 June 2022, 5481-5490. [Google Scholar] [CrossRef
[6] Müller, T., Evans, A., Schied, C., et al. (2022) Instant Neural Graphics Primitives with a Multiresolution Hash Encoding. ACM Transactions on Graphics (ToG), 41, 1-15. [Google Scholar] [CrossRef
[7] Park, J.J., Florence, P., Straub, J., et al. (2019) Deepsdf: Learning Continuous Signed Distance Functions for Shape Representation. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Long Beach, 15-20 June 2019, 165-174. [Google Scholar] [CrossRef
[8] Mescheder, L., Oechsle, M., Niemeyer, M., et al. (2019) Occupancy Networks: Learning 3d Reconstruction in Function Space. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Long Beach, 15-20 June 2019, 4460-4470. [Google Scholar] [CrossRef
[9] Mildenhall, B., Srinivasan, P.P., Tancik, M., et al. (2021) Nerf: Representing Scenes as Neural Radiance Fields for View Synthesis. Communications of the ACM, 65, 99-106. [Google Scholar] [CrossRef
[10] Ge, W., Hu, T., Zhao, H., et al. (2023) Ref-Neus: Ambiguity-Reduced Neural Implicit Surface Learning for Multi-View Reconstruction with Reflection. Proceedings of the IEEE/CVF International Conference on Computer Vision, Paris, 01-06 October 2023, 4251-4260. [Google Scholar] [CrossRef
[11] Liang, R., Chen, H., Li, C., et al. (2023) Envidr: Implicit Differentiable Renderer with Neural Environment Lighting. Proceedings of the IEEE/CVF International Conference on Computer Vision, Paris, 01-06 October 2023, 79-89. [Google Scholar] [CrossRef
[12] Munkberg, J., Hasselgren, J., Shen, T., et al. (2022) Extracting Triangular 3d Models, Materials, and Lighting from Images. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, New Orleans, 18-24 June 2022, 8280-8290. [Google Scholar] [CrossRef
[13] Hasselgren, J., Hofmann, N. and Munkberg, J. (2022) Shape, Light, and Material Decomposition from Images Using Monte Carlo Rendering and Denoising. Advances in Neural Information Processing Systems, 35, 22856-22869.
[14] Liu, Y., Wang, P., Lin, C., et al. (2023) Nero: Neural Geometry and Brdf Reconstruction of Reflective Objects from Multiview Images. ACM Transactions on Graphics (TOG), 42, 1-22. [Google Scholar] [CrossRef
[15] Li, Z., Müller, T., Evans, A., et al. (2023) Neuralangelo: High-fidelity neural surface reconstruction. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Vancouver, 17-24 June 2023, 8456-8465. [Google Scholar] [CrossRef