Applied Sciences, Vol. 13, No. 5 (2023)
ARTICLE
TITLE

Human Pose Estimation Based on a Spatial Temporal Graph Convolutional Network

Meng Wu and Pudong Shi    

Abstract

To address poor detection performance and the under-utilization of spatial relationships between nodes in human pose estimation, this work proposes a method based on an improved spatial temporal graph convolutional network (ST-GCN). First, upsampling and segmented random sampling are used to handle class imbalance and the long sequences in the dataset. Second, an improved detection transformer (DETR) structure is added to eliminate the need for non-maximum suppression (NMS) and anchor generation; a multi-head attention (M-ATT) module is introduced into each ST-GCN cell to capture richer feature information; and a residual module is introduced into the 9th ST-GCN cell to avoid the network degradation that can occur in deep networks. In addition, warmup, regularization, the loss function, and the optimizer are configured to improve the model's performance. Experimental results show that the method's average percentage of correct keypoints (PCK) is 93.2% on the FSD dataset and 92.7% on the MPII dataset, which is 1.9% and 1.7% higher, respectively, than the average PCK of the original ST-GCN. The confusion matrix for the method also indicates high recognition accuracy. Finally, comparison experiments with ST-GCN and other methods show that the model requires about 1.7 GFLOPs of computation and about 6.4 GMACs, which the authors regard as good performance.
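For readers unfamiliar with the PCK metric reported in the abstract, the following is a minimal sketch of how a percentage of correct keypoints is typically computed. The abstract does not state the distance threshold or the normalization length the authors used, so the function name `pck`, the `ref_lengths` argument, and the default `alpha=0.5` are illustrative assumptions and not the paper's evaluation code.

```python
import math


def pck(pred, truth, ref_lengths, alpha=0.5):
    """Fraction of keypoints predicted within alpha * ref_length of the ground truth.

    pred, truth: lists of poses, each pose a list of (x, y) keypoints.
    ref_lengths: one normalization length per pose (assumed here; for example
                 a head or torso size, since the abstract does not say which).
    alpha:       threshold factor (0.5 is an assumed, commonly used value).
    """
    correct = 0
    total = 0
    for p_pose, t_pose, ref in zip(pred, truth, ref_lengths):
        threshold = alpha * ref
        for (px, py), (tx, ty) in zip(p_pose, t_pose):
            total += 1
            # A keypoint counts as correct if its Euclidean error is within the threshold.
            if math.hypot(px - tx, py - ty) <= threshold:
                correct += 1
    return correct / total if total else 0.0


# Toy data: two poses with two keypoints each (made-up coordinates).
truth = [[(0, 0), (10, 0)], [(0, 0), (0, 10)]]
pred = [[(1, 0), (10, 6)], [(0, 2), (0, 10)]]
print(pck(pred, truth, ref_lengths=[10, 10]))  # 3 of 4 keypoints fall within 5 units
```

Under this definition, a reported average PCK of 93.2% would mean that about 93 of every 100 predicted keypoints fall within the chosen threshold of their ground-truth positions.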

Similar articles

       
 
Xinjing Zhang and Qixun Zhou    
Human pose estimation, as the basis of advanced computer vision, has wide application prospects. In existing studies, the high-capacity model based on the heatmap method can achieve accurate recognition results, but it encounters many difficulties wh...
Journal: Applied Sciences

 
Yaxin Mao, Lamei Yan, Hongyu Guo, Yujie Hong, Xiaocheng Huang and Youwei Yuan    
Inertial measurement unit (IMU) technology has gained popularity in human activity recognition (HAR) due to its ability to identify human activity by measuring acceleration, angular velocity, and magnetic flux in key body areas like the wrist and knee. I...
Journal: Applied Sciences

 
Yisha Wang, Yanjun Zhao, Xu Han, Jiashuo Wang, Chuandong Wu, Yuan Zhuang, Jiemin Liu and Wenhui Li    
Organophosphate esters (OPEs) are increasingly used as flame retardants and plasticizers in various products. Most of them are physically mixed with, rather than chemically bonded to, the polymeric products, leading to OPEs being readily released into the surroun...
Journal: Water

 
Rytis Maskeliunas, Audrius Kulikajevas, Robertas Damaševicius, Julius Griškevicius and Aušra Adomaviciene
The research introduces a unique deep-learning-based technique for remote rehabilitative analysis of image-captured human movements and postures. We present a polynomial Pareto-optimized deep-learning architecture for processing inverse kinematics for s...
Journal: Applied Sciences

 
Tianlei Wang, Fei Ding and Zhenxing Sun    
Human intelligence has the advantage in making high-level decisions in the remote control of underwater vehicles, while autonomous control is superior for accurate and fast close-range pose adjustment. Combining the advantages of both remote and autonom...