Work place: Software College, Shenyang Normal University, Shenyang 110034, China
E-mail:
Website:
Research Interests: Information-Theoretic Security, Information Theory, Information Security
Biography
Dan Zheng graduated with a Bachelor of Engineering from Shenyang Normal University in 2017. In her college, after completing the learning task, she interests in exploring her professional knowledge. During graduate, under the guidance of his master instructor, she researches information security theory and technology.
By Dan zheng Hang Li Shoulin Yin
DOI: https://doi.org/10.5815/ijmsc.2020.06.03, Pub. Date: 8 Dec. 2020
Human action recognition is an important research direction in computer vision areas. Its main content is to simulate human brain to analyze and recognize human action in video. It usually includes individual actions, interactions between people and the external environment. Space-time dual-channel neural network can represent the features of video from both spatial and temporal perspectives. Compared with other neural network models, it has more advantages in human action recognition. In this paper, a action recognition method based on improved space-time two-channel convolutional neural network is proposed. First, the video is divided into several equal length non-overlapping segments, and a frame image representing the static feature of the video and a stacked optical flow image representing the motion feature are sampled at random part from each segment. Then these two kinds of images are input into the spatial domain and the temporal domain convolutional neural network respectively for feature extraction, and then the segmented features of each video are fused in the two channels respectively to obtain the category prediction features of the spatial domain and the temporal domain. Finally, the video action recognition results are obtained by integrating the predictive features of the two channels. Through experiments, various data enhancement methods and transfer learning schemes are discussed to solve the over-fitting problem caused by insufficient training samples, and the effects of different segmental number, pre-training network, segmental feature fusion scheme and dual-channel integration strategy on action recognition performance are analyzed. The experiment results show that the proposed model can better learn the human action features in a complex video and better recognize the action.
[...] Read more.Subscribe to receive issue release notifications and newsletters from MECS Press journals