基于隐式神经表示与物理约束的室内声场重建方法
A Method for Indoor Sound Field Reconstruction Based on Implicit Neural Representations and Physical Constraints
DOI: 10.12677/csa.2026.162041, PDF,    科研立项经费支持
作者: 胡泽宁, 王金戈:北京印刷学院信息工程学院,北京;田益民*:北京印刷学院基础部,北京
关键词: 室内声场稀疏采样物理约束隐式神经表示声场重建Indoor Sound Field Sparse Sampling Physical Constraints Implicit Neural Representation Sound Field Reconstruction
摘要: 在实际室内声学应用中,受测点数量、布置条件及测量成本等因素限制,往往只能获得稀疏采样条件下的声场观测数据,给声场空间分布的准确重建带来挑战。针对这一问题,本文将室内稳态声场重建视为由空间坐标到声压幅值的函数拟合问题,提出一种融合隐式神经表示与物理约束的室内声场重建方法。该方法以多层感知机为基础,引入位置编码与正弦激活函数以增强网络对复杂空间振荡特征的表达能力,并进一步结合Helmholtz方程构建物理一致性约束,引导网络在稀疏采样条件下学习符合声波传播规律的声场空间分布。在不同采样率与频率条件下对所提方法进行了系统验证。实验结果表明,与传统MLP及仅引入位置编码的基线模型相比,该方法在稀疏采样条件下能够显著提升声场重建精度与稳定性,有效抑制未观测区域中的非物理振荡现象。相关研究为有限观测条件下的室内声场重建提供了一种可行的建模思路。
Abstract: In practical indoor acoustics applications, the number of measurement points is often limited by measurement conditions, sensor deployment, and acquisition costs, resulting in sparsely sampled sound field observations and posing challenges to accurate reconstruction of spatial sound field distributions. To address this problem, indoor steady-state sound field reconstruction is formulated as a function approximation task that maps spatial coordinates to sound pressure amplitudes. Based on this formulation, a sound field reconstruction method integrating implicit neural representations and physical constraints is proposed. The proposed method is built upon a multilayer perceptron architecture and incorporates positional encoding and sinusoidal activation functions to enhance the network’s capability in representing complex spatial oscillatory patterns. Furthermore, a physics-consistent constraint derived from the Helmholtz equation is introduced to guide the network toward learning sound field distributions that conform to acoustic wave propagation characteristics under sparse sampling conditions. The proposed method was systematically validated under different sampling rates and frequency conditions. The results demonstrate that, compared with conventional multilayer perceptron models and baseline models employing only positional encoding, the proposed method achieves higher reconstruction accuracy and improved stability under sparse sampling, while effectively suppressing non-physical oscillations in unobserved regions. The proposed approach provides a feasible modeling framework for indoor sound field reconstruction under limited observation conditions.
文章引用:胡泽宁, 田益民, 王金戈. 基于隐式神经表示与物理约束的室内声场重建方法[J]. 计算机科学与应用, 2026, 16(2): 90-101. https://doi.org/10.12677/csa.2026.162041

参考文献

[1] Hu, D., Li, H., Hu, Y. and Fang, Y. (2018) Sound Field Reconstruction with Sparse Sampling and the Equivalent Source Method. Mechanical Systems and Signal Processing, 108, 317-325. [Google Scholar] [CrossRef
[2] Talagala, D.S., Zhang, W. and Abhayapala, T.D. (2014) Efficient Multi-Channel Adaptive Room Compensation for Spatial Soundfield Reproduction Using a Modal Decomposition. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 22, 1522-1532. [Google Scholar] [CrossRef
[3] Wang, Y. and Chen, K. (2017) Sound Field Reconstruction within an Entire Cavity by Plane Wave Expansions Using a Spherical Microphone Array. The Journal of the Acoustical Society of America, 142, 1858-1870. [Google Scholar] [CrossRef] [PubMed]
[4] Hu, D., Liu, X., Xiao, Y. and Fang, Y. (2019) Fast Sparse Reconstruction of Sound Field via Bayesian Compressive Sensing. Journal of Vibration and Acoustics, 141, Article ID: 041017. [Google Scholar] [CrossRef
[5] Caviedes-Nozal, D., Riis, N.A.B., Heuchel, F.M., Brunskog, J., Gerstoft, P. and Fernandez-Grande, E. (2021) Gaussian Processes for Sound Field Reconstruction. The Journal of the Acoustical Society of America, 149, 1107-1119. [Google Scholar] [CrossRef] [PubMed]
[6] Horiuchi, R., Koyama, S., Ribeiro, J.G.C., Ueno, N. and Saruwatari, H. (2021) Kernel Learning for Sound Field Estimation with L1 and L2 Regularizations. 2021 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), New Paltz, 17-20 October 2021, 261-265. [Google Scholar] [CrossRef
[7] Ribeiro, J.G.C., Koyama, S. and Saruwatari, H. (2023) Kernel Interpolation of Acoustic Transfer Functions with Adaptive Kernel for Directed and Residual Reverberations. ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Rhodes Island, 4-10 June 2023, 1-5. [Google Scholar] [CrossRef
[8] Lluís, F., Martínez-Nuevo, P., Bo Møller, M. and Ewan Shepstone, S. (2020) Sound Field Reconstruction in Rooms: Inpainting Meets Super-resolution. The Journal of the Acoustical Society of America, 148, 649-659. [Google Scholar] [CrossRef] [PubMed]
[9] Kristoffersen, M.S., Møller, M.B., Martínez-Nuevo, P., et al. (2021) Deep Sound Field Reconstruction in Real Rooms: Introducing the Isobel Sound Field Dataset.
[10] Pezzoli, M., Perini, D., Bernardini, A., Borra, F., Antonacci, F. and Sarti, A. (2022) Deep Prior Approach for Room Impulse Response Reconstruction. Sensors, 22, Article No. 2710. [Google Scholar] [CrossRef] [PubMed]
[11] 刁威宇. 基于组合神经网络的混响水池声场重建技术研究[D]: [硕士学位论文]. 哈尔滨: 哈尔滨工程大学, 2023.
[12] Fernandez-Grande, E., Karakonstantis, X., Caviedes-Nozal, D. and Gerstoft, P. (2023) Generative Models for Sound Field Reconstruction. The Journal of the Acoustical Society of America, 153, 1179-1190. [Google Scholar] [CrossRef] [PubMed]
[13] Karakonstantis, X. and Fernandez-Grande, E. (2023) Generative Adversarial Networks with Physical Sound Field Priors. The Journal of the Acoustical Society of America, 154, 1226-1238. [Google Scholar] [CrossRef] [PubMed]
[14] Karakonstantis, X., Caviedes-Nozal, D., Richard, A. and Fernandez-Grande, E. (2024) Room Impulse Response Reconstruction with Physics-Informed Deep Learning. The Journal of the Acoustical Society of America, 155, 1048-1059. [Google Scholar] [CrossRef] [PubMed]
[15] Li, Y., Wang, L. and Reiss, J. (2025) Binaural Room Impulse Responses Interpolation Using Physics-Informed Neural Networks in Three Dimensions. 51st German Annual Conference on Acoustics (DAGA), Copenhagen, 17-20 March 2025. [Google Scholar] [CrossRef
[16] Damiano, S. and Van Waterschoot, T. (2025) Sound Field Reconstruction Using Physics-Informed Boundary Integral Networks. 2025 33rd European Signal Processing Conference (EUSIPCO), Palermo, 8-12 September 2025, 76-80. [Google Scholar] [CrossRef
[17] Chen, F., Xiao, Y., Yu, L. and Jiang, L. (2026) Frequency-Domain Physics-Informed Neural Network for Accurate Reconstruction of 3D Acoustic Fields under Sparse and Multi-Frequency Measurements. Neural Networks, 197, Article ID: 108476. [Google Scholar] [CrossRef