Abstract:
To address the high computational cost of human action recognition methods based on two-dimensional convolutional neural networks (2D CNN) and three-dimensional convolutional neural networks (3D CNN), a human action recognition method based on joint-point spatio-temporal information fusion (Joint-trajectory) is proposed. Firstly, a high-resolution network (HigherHRNet) is used to extract the spatial coordinates of each human joint point in every video frame, and a row vector of the joint spatial information is constructed for each frame. Secondly, the row vectors of all frames in the video are vertically concatenated along the time dimension to obtain the spatio-temporal information fusion matrix of the video. Finally, a residual network is used to learn and classify this joint-point spatio-temporal fusion matrix. Experimental results on the KTH dataset show that the proposed method effectively reduces the complexity of human action recognition while achieving a higher recognition rate and strong robustness.
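The fusion-matrix construction described above can be sketched as follows. This is a minimal illustration, not the authors' code: the function name `build_fusion_matrix` and the assumption of 17 joints (the COCO keypoint layout commonly used with HigherHRNet) are hypothetical choices for the example.

```python
import numpy as np

def build_fusion_matrix(joint_coords):
    """Flatten each frame's (x, y) joint coordinates into a row vector,
    then stack the rows along the time axis into a T x (2J) matrix."""
    rows = [np.asarray(frame, dtype=np.float32).reshape(-1)
            for frame in joint_coords]
    return np.vstack(rows)

# Illustrative input: T = 4 frames, J = 17 joints, 2 coordinates per joint.
T, J = 4, 17
coords = np.random.rand(T, J, 2)

M = build_fusion_matrix(coords)
print(M.shape)  # (4, 34): one row per frame, 2J columns per row
```

The resulting matrix can then be fed to a 2D classifier such as a residual network, which is the source of the claimed computational saving relative to running a 3D CNN over raw video frames.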