改进AlexNet的细粒度汉字手势识别研究
Fine-Grained Chinese Character Gesture Recognition Using Improved AlexNet
DOI: 10.12677/csa.2026.161014, PDF,    科研立项经费支持
作者: 王聪聪, 左 洋*, 葛宝泉:新疆理工职业大学人工智能学院,新疆 喀什;陈兰兰:新疆理工职业大学通识学院,新疆 喀什;张亚军:新疆大学软件学院,新疆 乌鲁木齐
关键词: RFID细粒度手势识别AlexNetSE注意力模块轻量化模型人机交互RFID Fine-Grained Gesture Recognition AlexNet SE Attention Module Lightweight Model Human-Computer Interaction
摘要: 射频识别在人机交互中的应用不断拓展,但现有方法在细粒度手势区分、跨场景适应性和模型轻量化方面仍存在不足。为此,提出一种改进AlexNet融合注意力机制的细粒度汉字手势识别方法。该方法通过标签矩阵与双天线融合增强信号采集完整性,结合非视距信号基线修正与马尔可夫转移场映射强化时序特征表达,并在轻量化卷积神经网络中嵌入注意力模块,在降低计算复杂度的同时保持较高识别准确率。实验结果显示,系统总体识别准确率为98.88%,跨用户与跨场景平均准确率分别为97.49%和97.74%。研究表明,该方法在细粒度汉字手势识别中具有较强鲁棒性与泛化能力,对智慧课堂人机交互及教育信息化发展具有应用价值。
Abstract: Radio Frequency Identification (RFID) has been increasingly applied in human-computer interaction (HCI), yet existing approaches still face challenges in fine-grained gesture differentiation, cross-scenario adaptability, and model lightweighting. To address these issues, this study proposes a fine-grained Chinese character gesture recognition method based on an improved AlexNet integrated with a Squeeze-and-Excitation (SE) attention mechanism. The method enhances signal acquisition integrity by combining label matrices with dual-antenna fusion, applies non-line-of-sight baseline correction and Markov Transition Field mapping to strengthen temporal feature representation, and embeds an attention module into a lightweight convolutional neural network to reduce computational complexity while maintaining high recognition accuracy. Experimental results demonstrate an overall recognition accuracy of 98.88%, with average accuracies of 97.49% and 97.74% under cross-user and cross-scenario conditions, respectively. These findings confirm that the proposed approach exhibits strong robustness and generalization in fine-grained Chinese character gesture recognition, and provides a reliable solution for smart classroom interaction and the advancement of educational informatization.
文章引用:王聪聪, 左洋, 陈兰兰, 葛宝泉, 张亚军. 改进AlexNet的细粒度汉字手势识别研究[J]. 计算机科学与应用, 2026, 16(1): 169-181. https://doi.org/10.12677/csa.2026.161014

参考文献

[1] 程时伟, 沈泽鹏, 张章伟. 基于眼动跟踪的移动智慧课堂可视交互系统[J]. 浙江工业大学学报, 2023, 51(4): 362-371.
[2] 陈威宇. 基于RFID的人体动作识别研究[D]: [硕士学位论文]. 南京: 南京信息工程大学, 2024.
[3] Lin, M., Wu, J., Zou, J., Meng, J., Wang, W. and Wu, J. (2024) Retired Battery Screening Based on Markov Transition Field and Swin Transformer. IEEE Transactions on Transportation Electrification, 10, 4217-4227. [Google Scholar] [CrossRef
[4] Fang, T., Zhang, J., Qi, D. and Gao, M. (2024) BLSENet: A Novel Lightweight Bilinear Convolutional Neural Network Based on Attention Mechanism and Feature Fusion Strategy for Apple Leaf Disease Classification. Journal of Food Quality, 2024, 1-11. [Google Scholar] [CrossRef
[5] Yuan, Y. and Lian, Y.H. (2025) Design Analysis of Human-Computer Interaction and Information Communication in Artificial Intelligence Environments. International Journal of E-Collaboration, 21, 1-13. [Google Scholar] [CrossRef
[6] Qiu, Q., Wang, T., Chen, F. and Wang, C. (2024) LD-Recognition: Classroom Action Recognition Based on Passive RFID. IEEE Transactions on Computational Social Systems, 11, 1182-1191. [Google Scholar] [CrossRef
[7] Pu, Y., Chen, J., Sheng, X., Wang, Z., Li, J. and Tang, H. (2025) Data-Driven Fault Diagnosis with Markov Transition Field and Deep Learning for Water Management of Automotive PEMFC Power System. International Journal of Hydrogen Energy, 148, Article 150022. [Google Scholar] [CrossRef
[8] Huang, M., Wan, N. and Zhu, H. (2025) Reconstruction of Structural Acceleration Response Based on CNN-BiGRU with Squeeze-and-Excitation under Environmental Temperature Effects. Journal of Civil Structural Health Monitoring, 15, 985-1003. [Google Scholar] [CrossRef
[9] Qiu, J., Lu, X., Wang, X., Chen, C., Chen, Y. and Yang, Y. (2024) Research on Image Recognition of Tomato Leaf Diseases Based on Improved Alexnet Model. Heliyon, 10, e33555. [Google Scholar] [CrossRef] [PubMed]
[10] Aliferis, C. and Simon, G. (2024) Overfitting, Underfitting and General Model Overconfidence and Under-Performance Pitfalls and Best Practices in Machine Learning and AI. In: Health Informatics, Springer, 477-524. [Google Scholar] [CrossRef