基于可拆分结构的中层属性行为识别
Activity Recognition of Middle-Level Attribution Based on Detachable Structure
DOI: 10.12677/CSA.2018.84050, PDF,    国家自然科学基金支持
作者: 时少艳*, 孙永宣, 吴克伟, 谢 昭:合肥工业大学计算机与信息学院,安徽 合肥
关键词: 时序结构中层属性模板匹配行为识别Temporal Structure Middle-Level Attribution Template Matching Activity Recognition
摘要: 随着行为识别研究的深入,复杂行为识别受到了越来越多研究者的关注。然而复杂行为中存在的大量冗余信息以及噪声严重降低了行为识别的准确性。针对这一问题,本文提出了基于可拆分结构的复杂行为中层属性时序一致性结构,并采用动态时间规整算法进行行为时序模式匹配。通过对视频行为时序进行拆分,学习行为的中层属性表达,构建行为中层属性的一致时序关系,有效去除了视频中的噪声和冗余信息,提高行为识别的准确率,并且能够从行为时序发展角度解释行为,增强了行为分析的可解释性。在Olympics数据集上的实验结果表明,该算法能够有效提高行为识别的准确率。
Abstract: With the deeper research of activity recognition, more researchers pay attention to complex activity recognition. However, a large number of redundant information and noise reduce the accuracy of activity recognition. In order to solve this problem, the paper proposes temporally consistent middle attributes structure and matching patterns of activity by dynamic time warping algorithm. Through splitting videos and learning middle attribute of segments, we build the consistent temporal structure of each activity category effectively solving noise and redundancy. And it explains the process of activity from the perspective of timing and enhances the interpretability of activity analysis. The experiment results on Olympics dataset show that the algorithm can improve the ac-curacy of activity recognition effectively.
文章引用:时少艳, 孙永宣, 吴克伟, 谢昭. 基于可拆分结构的中层属性行为识别[J]. 计算机科学与应用, 2018, 8(4): 455-463. https://doi.org/10.12677/CSA.2018.84050

参考文献

[1] 雷庆, 陈锻生, 李绍滋. 复杂场景下的人体行为识别研究新进展[J]. 计算机科学, 2014, 41(12): 1-7.
[2] 胡琼, 秦磊, 黄庆明. 基于视觉的人体动作识别综述[J]. 计算机学报, 2013, 36(12): 2512-2524.
[3] Aggarwal, J.K. and Ryoo, M.S. (2011) Human Activity Analysis: A Review. ACM Computing Surveys, 43, 1-43. [Google Scholar] [CrossRef
[4] Tang, K., Li, F.F. and Koller, D. (2012) Learning Latent Temporal Structure for Complex Event Detection. Computer Vision and Pattern Recognition, IEEE, 1250-1257.
[5] Bhattacharya, S., Kalayeh, M.M., Suk-thankar, R., et al. (2015) Recognition of Complex Events: Exploiting Temporal Dynamics between Underlying Concepts. Computer Vision and Pattern Recognition, 2243-2250.
[6] Ma, S., Sigal, L. and Sclaroff, S. (2016) Learning Activity Progression in LSTMs for Activity Detection and Early Detection. Computer Vision and Pattern Recognition, IEEE, 1942-1950.
[7] Liu, C., Wu, X. and Jia, Y. (2016) A Hierarchical Video Description for Complex Activity Understanding. International Journal of Computer Vision, 118, 240-255. [Google Scholar] [CrossRef
[8] Grenander, U. (1993) General Pattern Theory: A Mathematical Study of Regular Structures. Clarendon Press, Oxford.
[9] Souza, F.D.M.D., Sarkar, S., Srivastava, A., et al. (2016) Spatially Coherent Interpretations of Videos Using Pattern Theory. International Journal of Computer Vision, 121, 5-25. [Google Scholar] [CrossRef
[10] Li, K., Hu, J. and Fu, Y. (2012) Modeling Complex Temporal Composition of Actionlets for Activity Prediction. European Conference on Computer Vision, 286-299.
[11] Wang, H. and Schmid, C. (2014) Action Recognition with Improved Trajectories. IEEE International Conference on Computer Vision, IEEE, 3551-3558.
[12] Kuehne, H., Gall, J. and Serre, T. (2016) An End-to-End Generative Framework for Video Segmentation and Recognition. Applications of Computer Vision, IEEE, 1-8.
[13] Sun, C. and Nevatia, R. (2013) ACTIVE: Activity Concept Transitions in Video Event Classification. IEEE International Conference on Computer Vision, IEEE, 913-920.
[14] Lan, T., Zhu, Y., Zamir, A.R., et al. (2015) Action Recognition by Hierarchical Mid-Level Action Elements. IEEE International Conference on Computer Vision, IEEE, 4552-4560.
[15] Jones, S. and Shao, L. (2014) A Multigraph Representation for Improved Unsupervised/Semi-Supervised Learning of Human Actions. Computer Vision and Pattern Recognition, IEEE, 820-826.