TY - JOUR
T1 - A multi-head pseudo nodes based spatial–temporal graph convolutional network for emotion perception from GAIT
AU - Chai, Shurong
AU - Liu, Jiaqing
AU - Jain, Rahul Kumar
AU - Tateyama, Tomoko
AU - Iwamoto, Yutaro
AU - Lin, Lanfen
AU - Chen, Yen Wei
N1 - Publisher Copyright:
© 2022 Elsevier B.V.
PY - 2022/10/28
Y1 - 2022/10/28
N2 - In the recent years, the emotion recognition task has attracted a lot of attention in the field of human–computer interaction. Most existing research typically uses aural-visual analysis, which has been an effective approach to capturing emotional features. However, aural-visual signals are difficult to notice, in comparison to other human representations such as gait in remote situations. Recently, human gait can be effectively recognized in more complex backgrounds because of the advancement of Graph Convolutional Networks (GCNs). According to the anatomy of the human body, central torso joints play a key role in GCNs-based human gait recognition systems, instead of the body's marginal limb joints. As a result, there is a major issue of receptive field imbalance. In this study, we propose a method for perceiving emotions based on the human gait skeleton. We present a multi-head pseudo nodes strategy to alleviate the receptive field imbalance problem and capture the non-local dependencies among different joints. The strategy employs a series of extra nodes that link to all physical human body joints and obtain global information from different feature spaces. The results of the experiments on a public emotion-gait dataset demonstrate that our proposed method outperforms existing skeleton-based methods. Further, to verify the effectiveness of our method, we use publicly available human action recognition datasets. Our results show that our method significantly improves performance in comparison to other baseline methods.
AB - In the recent years, the emotion recognition task has attracted a lot of attention in the field of human–computer interaction. Most existing research typically uses aural-visual analysis, which has been an effective approach to capturing emotional features. However, aural-visual signals are difficult to notice, in comparison to other human representations such as gait in remote situations. Recently, human gait can be effectively recognized in more complex backgrounds because of the advancement of Graph Convolutional Networks (GCNs). According to the anatomy of the human body, central torso joints play a key role in GCNs-based human gait recognition systems, instead of the body's marginal limb joints. As a result, there is a major issue of receptive field imbalance. In this study, we propose a method for perceiving emotions based on the human gait skeleton. We present a multi-head pseudo nodes strategy to alleviate the receptive field imbalance problem and capture the non-local dependencies among different joints. The strategy employs a series of extra nodes that link to all physical human body joints and obtain global information from different feature spaces. The results of the experiments on a public emotion-gait dataset demonstrate that our proposed method outperforms existing skeleton-based methods. Further, to verify the effectiveness of our method, we use publicly available human action recognition datasets. Our results show that our method significantly improves performance in comparison to other baseline methods.
UR - http://www.scopus.com/inward/record.url?scp=85138189799&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85138189799&partnerID=8YFLogxK
U2 - 10.1016/j.neucom.2022.09.061
DO - 10.1016/j.neucom.2022.09.061
M3 - Article
AN - SCOPUS:85138189799
SN - 0925-2312
VL - 511
SP - 437
EP - 447
JO - Neurocomputing
JF - Neurocomputing
ER -