基于多头注意力和对抗网络的晶圆图缺陷生成方法
Wafer Map Defect Generation Model Based on Multi-Head Attention and Generative Adversarial Networks
摘要: 为了解决深度学习分类模型对少数样本的晶圆图缺陷模式的分类准确率低的问题。该研究提出了一种融合多头注意力机制和对抗网络模型的生成算法Multi-SAGAN。多头注意力机制拥有多个特征子空间,可以生成更丰富的图像细节和全局特征,利用Multi-SAGAN生成的晶圆图来扩充数量较少的晶圆图缺陷模式,能够提高分类器的分类准确率。为了比较DCGAN、SAGAN、数据增强模型和Multi-SAGAN的生成性能,分别把生成的图像和原始数据集组合成新的数据集,放入同一个分类模型中比较分类准确率。最终实验结果表明由Multi-SAGAN生成的图像组成的数据集准确率比原始数据集准确率高18.9%,比数据增强和DCGAN的扩充数据集准确率分别高7.4%和6.4%。比SAGAN的扩充数据集准确率高2.2%。
Abstract: To address the issue of low classification accuracy of deep learning models on wafer map defect patterns with scarce quantities, this study proposes a novel generative algorithm named Multi-SAGAN, integrating multi-head-attention and SAGAN. Generating multiple feature subspaces through multi-attention enables the creation of richer image details and global features. Ultimately, utilizing this generative algorithm to augment the limited quantity of wafer map defect patterns aims to enhance the classifier’s classification accuracy. The images generated by DCGAN, SAGAN, data augmentation, and Multi-SAGAN are respectively mixed with the original dataset to create new datasets, which are then input into the same classification model to compare classification accuracy. The final experimental results show that the dataset generated by Multi-SAGAN achieves an accuracy 18.9% higher than the original dataset, surpassing the accuracy of the datasets generated by data augmentation and DCGAN by 7.4% and 6.4% respectively. Additionally, it outperforms the accuracy of the dataset generated by SAGAN by 2.2%.
文章引用:王先旺. 基于多头注意力和对抗网络的晶圆图缺陷生成方法[J]. 建模与仿真, 2025, 14(2): 304-317. https://doi.org/10.12677/mos.2025.142153

参考文献

[1] Hsu, C., Chen, W. and Chien, J. (2020) Similarity Matching of Wafer Bin Maps for Manufacturing Intelligence to Empower Industry 3.5 for Semiconductor Manufacturing. Computers & Industrial Engineering, 142, Article 106358. [Google Scholar] [CrossRef
[2] Wang, C., Kuo, W. and Bensmail, H. (2006) Detection and Classification of Defect Patterns on Semiconductor Wafers. IIE Transactions, 38, 1059-1068. [Google Scholar] [CrossRef
[3] 卢笑蕾, 余建波. 基于混合模型与流形调节的晶圆表面缺陷识别[J]. 计算机集成制造系统, 2018, 24(2): 302-308.
[4] Krizhevsky, A., Sutskever, I. and Hinton, G.E. (2012) ImageNet Classification with Deep Convolutional Neural Networks. Communications of the ACM, 60, 84-90.
[5] Yu, N., Xu, Q. and Wang, H. (2019) Wafer Defect Pattern Recognition and Analysis Based on Convolutional Neural Network. IEEE Transactions on Semiconductor Manufacturing, 32, 566-573. [Google Scholar] [CrossRef
[6] [CrossRef
[7] Batool, U., Shapiai, M.I., Fauzi, H. and Fong, J.X. (2020) Convolutional Neural Network for Imbalanced Data Classification of Silicon Wafer Defects. 2020 16th IEEE International Colloquium on Signal Processing & Its Applications (CSPA), Langkawi, 28-29 February 2020, 230-235. [Google Scholar] [CrossRef
[8] 张玉彦, 张永奇, 孙春亚, 王昊琪, 文笑雨, 乔东平, 闫新宇, 李浩. 不平衡样本下基于生成式对抗网络的风机叶片开裂状态识别[J]. 计算机集成制造系统, 2023, 29(2): 532-543.
[9] Ian, J.G. and Jean, P.A. (2014) Generative Adversarial Networks. arXiv: 1406.2661. [Google Scholar] [CrossRef
[10] Ji, Y. and Lee, J. (2020) Using GAN to Improve CNN Performance of Wafer Map Defect Type Classification: Yield Enhancement. 2020 31st Annual SEMI Advanced Semiconductor Manufacturing Conference (ASMC), Saratoga Springs, 24-26 August 2020, 1-6. [Google Scholar] [CrossRef
[11] 丁满, 袁云磊, 张新新, 孙鸣宇. 基于深度学习的产品色彩情感化设计[J]. 计算机集成制造系统, 2023, 29(5): 1647-1656.
[12] Zhang, H., Goodfellow, I., Metaxas, D. and Odena, A. (2018) Self-Attention Generative Adversarial Networks. arXiv: 1805.08318. [Google Scholar] [CrossRef
[13] Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A.N., et al. (2017) Attention Is All You Need. Proceedings of the 31st International Conference on Neural Information Processing Systems, Long Beach, 4-9 December 2017, 6000-6010.
[14] Cordonnier, J.-B., Loukas, A. and Jaggi, M. (2019) On the Relationship between Self-Attention and Convolutional Layers. arXiv: 1911.03584. [Google Scholar] [CrossRef
[15] Wu, M.-J., Jang, J.R. and Chen, J.-L. (2015) Wafer Map Failure Pattern Recognition and Similarity Ranking for Large-Scale Data Sets. IEEE Transactions on Semiconductor Manufacturing, 28, 1-12. [Google Scholar] [CrossRef
[16] Gulrajani, I., Ahmed, F., Arjovsky, M., Dumoulin, V. and Courville, A. (2017) Improved Training of Wasserstein GANs. arXiv: 1704.00028. [Google Scholar] [CrossRef
[17] He, K., Zhang, X., Ren, S. and Sun, J. (2016) Deep Residual Learning for Image Recognition. 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, 27-30 June 2016, 770-778. [Google Scholar] [CrossRef