Inicio  /  Applied Sciences  /  Vol: 13 Par: 11 (2023)  /  Artículo
ARTÍCULO
TITULO

Action Recognition Network Based on Local Spatiotemporal Features and Global Temporal Excitation

Shukai Li    
Xiaofang Wang    
Dongri Shan and Peng Zhang    

Resumen

Temporal modeling is a key problem in action recognition, and it remains difficult to accurately model temporal information of videos. In this paper, we present a local spatiotemporal extraction module (LSTE) and a channel time excitation module (CTE), which are specially designed to accurately model temporal information in video sequences. The LSTE module first obtains difference features by computing the pixel-wise differences between adjacent frames within each video segment and then obtains local motion features by stressing the effect of the feature channels sensitive to difference information. The local motion features are merged with the spatial features to represent local spatiotemporal features of each segment. The CTE module adaptively excites time-sensitive channels by modeling the interdependencies of channels in terms of time to enhance the global temporal information. Further, the above two modules are embedded into the existing 2DCNN baseline methods to build an action recognition network based on local spatiotemporal features and global temporal excitation (LSCT). We conduct experiments on the temporal-dependent Something-Something V1 and V2 datasets. We compare the recognition results with those obtained by the current methods, which proves the effectiveness of our methods.

 Artículos similares

       
 
Hui-Jun Kim, Jung-Soon Kim and Sung-Hee Kim    
The existing question-and-answer screening test has a limitation in that test accuracy varies due to a high learning effect and based on the inspector?s competency, which can have consequences for rapid-onset cognitive-related diseases. To solve this pro... ver más
Revista: Applied Sciences

 
Abdorreza Alavigharahbagh, Vahid Hajihashemi, José J. M. Machado and João Manuel R. S. Tavares    
In this article, a hierarchical method for action recognition based on temporal and spatial features is proposed. In current HAR methods, camera movement, sensor movement, sudden scene changes, and scene movement can increase motion feature errors and de... ver más
Revista: Information

 
Miao Feng and Jean Meunier    
Recognizing human actions can help in numerous ways, such as health monitoring, intelligent surveillance, virtual reality and human?computer interaction. A quick and accurate detection algorithm is required for daily real-time detection. This paper first... ver más
Revista: Algorithms

 
Hayat Ullah and Arslan Munir    
The recognition of human activities using vision-based techniques has become a crucial research field in video analytics. Over the last decade, there have been numerous advancements in deep learning algorithms aimed at accurately detecting complex human ... ver más
Revista: Algorithms

 
Ji Zhang, Xiangze Jia, Zhen Wang, Yonglong Luo, Fulong Chen, Gaoming Yang and Lihui Zhao    
Skeleton-based action recognition depends on skeleton sequences to detect categories of human actions. In skeleton-based action recognition, the recognition of action scenes with more than one subject is named as interaction recognition. Different from t... ver más
Revista: Algorithms