Dynamic gesture recognition method based on Improved R(2+1)D

Yupeng Huo, Jie Shen, Sheng Zhang, Li Wang

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Scopus citations

Abstract

Currently, methods based on 3D convolutional neural networks have made significant progress in the field of dynamic gesture recognition. Dynamic gestures are highly redundant in both the temporal and spatial dimensions, and the complex environment during the recognition process can easily affect the final recognition results. Therefore, it is crucial to make the model focus on the important moments and regions of gesture movements and extract relevant salient spatiotemporal features to further improve model performance. To address this issue, this paper proposes a lightweight Temporal-Spatial-Channel attention (TSCA) module based on the R(2+1)D network. The module consists of two sub-modules: a Temporal-Channel attention (TCA) module and a Temporal-Spatial attention (TSA) module, with the goal of enabling the model to focus on important information along the spatial, channel, and temporal dimensions during gesture movements. Finally, the TSCA attention module is integrated into the R(2+1)D network, resulting in only a 2.8M increase in parameters, and achieves good performance on the IPN-Hand and NvGesture datasets.

Original languageEnglish
Title of host publication2023 4th International Conference on Computer Vision, Image and Deep Learning, CVIDL 2023
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages323-328
Number of pages6
ISBN (Electronic)9798350326444
DOIs
StatePublished - 2023
Event4th International Conference on Computer Vision, Image and Deep Learning, CVIDL 2023 - Zhuhai, China
Duration: 12 May 202314 May 2023

Publication series

Name2023 4th International Conference on Computer Vision, Image and Deep Learning, CVIDL 2023

Conference

Conference4th International Conference on Computer Vision, Image and Deep Learning, CVIDL 2023
Country/TerritoryChina
CityZhuhai
Period12/05/2314/05/23

Keywords

  • Gesture Recognition
  • R(2+1)D Convolution
  • Self-Attention

Fingerprint

Dive into the research topics of 'Dynamic gesture recognition method based on Improved R(2+1)D'. Together they form a unique fingerprint.

Cite this