基于多模型融合的高精度实时摔倒检测系统
A High-Precision Real-Time Fall Detection System Based on Multi-Model Integration
摘要: 随着对老年人及其他有障碍群体的重视,视频识别技术在预防摔倒中发挥了越来越重要的作用。本项目旨在提高行人摔倒预防的精度和实时性,通过优化卷积神经网络结构、改进训练数据的方式、构建更合理的损失函数,最终形成一种能够精确识别和报警的智能监控体系。本文介绍了一种基于Faster R-CNN和Mediapipe框架的摔倒检测方法,结合改进的时空图卷积神经网络(DCST-GCN)模型,对多种行为进行了识别和分类。通过NTU-RGB+D60数据集和公共数据库Kinetics-400的实验验证,该方法在摔倒检测中表现出较高的准确性和鲁棒性。本项目为摔倒预防与控制方法的发展奠定了坚实基础,并提供了新的预防摔倒事故的途径和方法。
Abstract: As attention to the elderly and other disabled groups increases, video recognition technology plays an increasingly important role in fall prevention. This project aims to enhance the accuracy and real-time performance of pedestrian fall prevention by optimizing the convolutional neural network structure, improving the training data methods, and constructing more reasonable loss functions, ultimately forming an intelligent monitoring system capable of precise identification and alerting. This paper introduces a fall detection method based on Faster R-CNN and the Mediapipe framework, combined with an improved spatiotemporal graph convolutional network (DCST-GCN) model for recognizing and classifying various behaviors. Experiments conducted on the NTU-RGB+D60 dataset and the public Kinetics-400 database demonstrate that the proposed method shows high accuracy and robustness in fall detection. This project lays a solid foundation for the development of fall prevention and control methods and provides new approaches for preventing fall accidents.
文章引用:昝超, 蔡帅, 万心怡, 谭康龙. 基于多模型融合的高精度实时摔倒检测系统[J]. 计算机科学与应用, 2024, 14(9): 121-129. https://doi.org/10.12677/csa.2024.149193

参考文献

[1] Debard, G., Baldewijns, G., Goedeme, T., Tuytelaars, T. and Vanrumste, B. (2015) Camera-Based Fall Detection Using a Particle Filter. 2015 37th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), Milan, 25-29 August 2015, 6947-6950. [Google Scholar] [CrossRef] [PubMed]
[2] Lie, W., Le, A.T. and Lin, G. (2018) Human Fall-Down Event Detection Based on 2D Skeletons and Deep Learning Approach. 2018 International Workshop on Advanced Image Technology (IWAIT), Chiang Mai, 7-9 January 2018, 1-4. [Google Scholar] [CrossRef
[3] Rougier, C., Meunier, J., St-Arnaud, A. and Rousseau, J. (2011) Robust Video Surveillance for Fall Detection Based on Human Shape Deformation. IEEE Transactions on Circuits and Systems for Video Technology, 21, 611-622. [Google Scholar] [CrossRef
[4] Charfi, I., Miteran, J., Dubois, J., Atri, M. and Tourki, R. (2012) Definition and Performance Evaluation of a Robust SVM Based Fall Detection Solution. 2012 Eighth International Conference on Signal Image Technology and Internet Based Systems, Sorrento, 25-29 November 2012, 218-224. [Google Scholar] [CrossRef
[5] Musci, M., De Martini, D., Blago, N., Facchinetti, T. and Piastra, M. (2021) Online Fall Detection Using Recurrent Neural Networks on Smart Wearable Devices. IEEE Transactions on Emerging Topics in Computing, 9, 1276-1289. [Google Scholar] [CrossRef
[6] Li, S., Xiong, H. and Diao, X. (2019) Pre-Impact Fall Detection Using 3D Convolutional Neural Network. 2019 IEEE 16th International Conference on Rehabilitation Robotics (ICORR), Toronto, 24-28 June 2019, 1173-1178. [Google Scholar] [CrossRef] [PubMed]
[7] 黄万里. 基于改进YOLOv7的跌倒检测算法研究[D]: [硕士学位论文]. 武汉: 江汉大学, 2023.
[8] 张振亚, 何明艳, 王萍. 改进后的YOLOv5用于跌倒行为检测[J]. 安庆师范大学学报(自然科学版), 2023, 29(1): 72-78.
[9] Wang, X. and Jia, K. (2020) Human Fall Detection Algorithm Based on YOLOv3. 2020 IEEE 5th International Conference on Image, Vision and Computing (ICIVC), Beijing, 10-12 July 2020, 50-54. [Google Scholar] [CrossRef
[10] Yin, Y., Lei, L., Liang, M., Li, X., He, Y. and Qin, L. (2021) Research on Fall Detection Algorithm for the Elderly Living Alone Based on YOLO. 2021 IEEE International Conference on Emergency Science and Information Technology (ICESIT), Chongqing, 22-24 November 2021, 403-408. [Google Scholar] [CrossRef
[11] Yan, S., Xiong, Y. and Lin, D. (2018) Spatial Temporal Graph Convolutional Networks for Skeleton-Based Action Recognition. Proceedings of the AAAI Conference on Artificial Intelligence, 32, 7444-7452. [Google Scholar] [CrossRef
[12] Girshick, R. (2015) Fast R-CNN. 2015 IEEE International Conference on Computer Vision (ICCV), Santiago, 7-13 December 2015, 1440-1448. [Google Scholar] [CrossRef
[13] Google Team. Media Pipe Solutions Guide.
https://chuoling.github.io/mediapipe/
[14] Google Team Face Landmark Detection Guide.
https://mediapipe.readthedocs.io/en/latest/index.html
[15] Chen, B., Li, H., Luo, W. and Huang, J. (2020) Image Processing Operations Identification via Convolutional Neural Network. Science China Information Sciences, 63, Article No. 139109. [Google Scholar] [CrossRef
[16] Shahroudy, A., Liu, J., Ng, T. and Wang, G. (2016) NTU RGB+D: A Large Scale Dataset for 3D Human Activity Analysis. 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, 27-30 June 2016, 1010-1019. [Google Scholar] [CrossRef
[17] Injila, G.R.B. (2023) In Band Full Duplex (IBFD) Technology for Next Generation Wireless Networks: A Survey in Cellular Networks. China Communications, 20, 20-39. [Google Scholar] [CrossRef