Dynamic Holographic Acoustic Field Generation Method Based on Self-Attention Mechanism
DOI: 10.12677/sea.2024.133030. Supported by research project funding.
Authors: Yang Liu (杨柳), You Fucheng (游福成): School of Information, Beijing Institute of Graphic Communication, Beijing
Keywords: Acoustic Holography, Phased Array Transducer, Deep Learning, Self-Attention Mechanism
Abstract: Acoustic field control is critical in applications as diverse as loudspeaker design, ultrasonic imaging, and acoustic particle manipulation. The need to manipulate micro- and nanoscale objects precisely has driven the development of contactless manipulation methods. However, few studies address the reverse manipulation of a given holographic acoustic field. In this paper, we propose a method based on a self-attention Transformer model (VS3D-Transformer), in the context of phased array technology (PAT), to generate holographic acoustic fields quickly and accurately. Our method addresses the shortcomings of traditional CNNs, which consider only the local receptive field and achieve low training accuracy. It also reduces the iterative complexity of traditional physical methods. To simulate acoustic field generation, we use a simulation method based on the piston model. In simulation studies, our model trains faster and reaches higher accuracy than both the traditional IB iterative algorithm and the deep learning Acousnet algorithm. Results across several measures (acoustic field phase optimization accuracy, loss rate, and training speed) indicate that our model could serve as an efficient alternative.
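The abstract states that the holographic acoustic field is simulated with a piston model but gives no formula. As a rough illustration only, the sketch below uses the commonly cited far-field piston approximation, in which each transducer contributes a complex pressure scaled by a directivity term and attenuated with distance. The array geometry, frequency, transducer radius, and all function names here are assumptions chosen for the example, not values or code from the paper.

```python
import cmath
import math

def bessel_j1(x, terms=30):
    """Bessel function J1 from its power series (adequate for small |x|)."""
    total = 0.0
    term = x / 2.0  # m = 0 term of the series
    for m in range(terms):
        total += term
        # Ratio between consecutive series terms.
        term *= -((x / 2.0) ** 2) / ((m + 1) * (m + 2))
    return total

def directivity(k, radius, sin_theta):
    """Far-field directivity of a circular piston: 2*J1(x)/x, x = k*a*sin(theta)."""
    x = k * radius * sin_theta
    if abs(x) < 1e-9:
        return 1.0  # limit of 2*J1(x)/x as x -> 0
    return 2.0 * bessel_j1(x) / x

def pressure_at(point, transducers, phases, k, radius, p0=1.0):
    """Sum the complex pressure of every transducer at one field point.

    Transducers are assumed to lie in the z = 0 plane and face +z.
    """
    total = 0j
    for (tx, ty, tz), phi in zip(transducers, phases):
        dx, dy, dz = point[0] - tx, point[1] - ty, point[2] - tz
        d = math.sqrt(dx * dx + dy * dy + dz * dz)
        sin_theta = math.sqrt(dx * dx + dy * dy) / d
        total += p0 * directivity(k, radius, sin_theta) / d * cmath.exp(1j * (k * d + phi))
    return total

# Hypothetical setup: 4 x 4 array, 10 mm pitch, 40 kHz in air (all assumed).
SPEED_OF_SOUND = 343.0   # m/s
FREQUENCY = 40_000.0     # Hz
RADIUS = 0.0045          # m, assumed piston radius
k = 2.0 * math.pi * FREQUENCY / SPEED_OF_SOUND

transducers = [((i - 1.5) * 0.01, (j - 1.5) * 0.01, 0.0)
               for i in range(4) for j in range(4)]
focus = (0.0, 0.0, 0.1)

# Phases that cancel each path's propagation delay, so contributions add in phase.
focus_phases = [-k * math.dist(focus, t) for t in transducers]
zero_phases = [0.0] * len(transducers)

focused = abs(pressure_at(focus, transducers, focus_phases, k, RADIUS))
unfocused = abs(pressure_at(focus, transducers, zero_phases, k, RADIUS))
print(f"focused amplitude:   {focused:.3f}")
print(f"unfocused amplitude: {unfocused:.3f}")
```

A forward model of this kind maps transducer phases to a pressure field. The method described in the abstract works in the opposite direction, predicting phases from a target field, so a simulator like this would mainly be useful for generating training data or checking predicted phases.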
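Cite this article: Yang, L. and You, F. (2024) Dynamic Holographic Acoustic Field Generation Method Based on Self-Attention Mechanism. Software Engineering and Applications, 13(3), 302-311. https://doi.org/10.12677/sea.2024.133030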
