
MEST: An Action Recognition Network with Motion Encoder and Spatio-Temporal Module

Zhang, Yi

Sensors (Basel, Switzerland), 2022-09, Vol.22 (17), p.6595 [Peer-reviewed journal]

Basel: MDPI AG

Full text available

  • Title:
    MEST: An Action Recognition Network with Motion Encoder and Spatio-Temporal Module
  • Author: Zhang, Yi
  • Subjects: action recognition ; Activity recognition ; Coders ; Computing costs ; Content analysis ; Deep learning ; key frame ; Methods ; Modules ; Neural networks ; spatio-temporal information ; temporal modeling ; Video
  • Is part of: Sensors (Basel, Switzerland), 2022-09, Vol.22 (17), p.6595
  • Description: Action recognition, a sub-field of video content analysis that aims to recognize human actions in videos, has received extensive attention in recent years. Unlike a single image, a video has a temporal dimension, so extracting spatio-temporal information from videos is essential for action recognition. This paper proposes MEST, an efficient network that extracts spatio-temporal information at relatively low computational cost. First, a motion encoder captures short-term motion cues between consecutive frames; it is followed by a channel-wise spatio-temporal module that models long-term feature information. In addition, weight standardization is applied to the convolution layers that are followed by batch normalization layers, to speed up training and facilitate convergence. Experiments are conducted on five public action recognition datasets (Something-Something-V1 and -V2, Jester, UCF101 and HMDB51), where MEST performs competitively with other popular methods. The results demonstrate the effectiveness of the network in terms of accuracy, computational cost and network scale.
  • Publisher: Basel: MDPI AG
  • Language: English
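
The description mentions weight standardization on convolution layers that feed batch normalization. As a rough illustration of that general technique, the sketch below rescales each output-channel filter to zero mean and unit variance before it would be used in a convolution. It is not taken from the MEST paper: the function name `standardize_weights`, the flattened-list filter layout, and the `eps` value are all assumptions made for this example.

```python
import math


def standardize_weights(filters, eps=1e-5):
    """Standardize each output-channel filter to zero mean and unit variance.

    `filters` is a list with one entry per output channel; each entry is a
    flat list of floats (in_channels * kernel_h * kernel_w values).
    The layout and the eps default are assumptions for this sketch.
    """
    standardized = []
    for w in filters:
        n = len(w)
        mean = sum(w) / n
        # Population variance over all weights of this one filter.
        var = sum((x - mean) ** 2 for x in w) / n
        scale = 1.0 / math.sqrt(var + eps)
        standardized.append([(x - mean) * scale for x in w])
    return standardized


# Two hypothetical filters with very different scales.
filters = [[1.0, 2.0, 3.0, 4.0], [10.0, 10.0, 10.0, 14.0]]
w_hat = standardize_weights(filters)
```

After this step both filters have mean close to 0 and variance close to 1 regardless of their original scale. In a real network the standardization would typically be recomputed from the learnable weights on every forward pass, so that gradients flow through it.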
