Idioma:

MEST: An Action Recognition Network with Motion Encoder and Spatio-Temporal Module

Zhang, Yi

Sensors (Basel, Switzerland), 2022-09, Vol.22 (17), p.6595 [Periódico revisado por pares]

Basel: MDPI AG

Texto completo disponível

Citações Citado por

Enviar para

Título:
MEST: An Action Recognition Network with Motion Encoder and Spatio-Temporal Module
Autor: Zhang, Yi
Assuntos: action recognition ; Activity recognition ; Coders ; Computing costs ; Content analysis ; Deep learning ; key frame ; Methods ; Modules ; Neural networks ; spatio-temporal information ; temporal modeling ; Video
É parte de: Sensors (Basel, Switzerland), 2022-09, Vol.22 (17), p.6595
Notas: ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
Descrição: As a sub-field of video content analysis, action recognition has received extensive attention in recent years, which aims to recognize human actions in videos. Compared with a single image, video has a temporal dimension. Therefore, it is of great significance to extract the spatio-temporal information from videos for action recognition. In this paper, an efficient network to extract spatio-temporal information with relatively low computational load (dubbed MEST) is proposed. Firstly, a motion encoder to capture short-term motion cues between consecutive frames is developed, followed by a channel-wise spatio-temporal module to model long-term feature information. Moreover, the weight standardization method is applied to the convolution layers followed by batch normalization layers to expedite the training process and facilitate convergence. Experiments are conducted on five public datasets of action recognition, Something-Something-V1 and -V2, Jester, UCF101 and HMDB51, where MEST exhibits competitive performance compared to other popular methods. The results demonstrate the effectiveness of our network in terms of accuracy, computational cost and network scales.
Editor: Basel: MDPI AG
Idioma: Inglês

Voltar para lista de resultados

Realização: Logos de Redes Sociais:

MEST: An Action Recognition Network with Motion Encoder and Spatio-Temporal Module

Zhang, Yi

Basel: MDPI AG

Buscando em bases de dados remotas. Favor aguardar.