Redirigiendo al acceso original de articulo en 17 segundos...
Inicio  /  Algorithms  /  Vol: 16 Par: 8 (2023)  /  Artículo
ARTÍCULO
TITULO

Human Action Representation Learning Using an Attention-Driven Residual 3DCNN Network

Hayat Ullah and Arslan Munir    

Resumen

The recognition of human activities using vision-based techniques has become a crucial research field in video analytics. Over the last decade, there have been numerous advancements in deep learning algorithms aimed at accurately detecting complex human actions in video streams. While these algorithms have demonstrated impressive performance in activity recognition, they often exhibit a bias towards either model performance or computational efficiency. This biased trade-off between robustness and efficiency poses challenges when addressing complex human activity recognition problems. To address this issue, this paper presents a computationally efficient yet robust approach, exploiting saliency-aware spatial and temporal features for human action recognition in videos. To achieve effective representation of human actions, we propose an efficient approach called the dual-attentional Residual 3D Convolutional Neural Network (DA-R3DCNN). Our proposed method utilizes a unified channel-spatial attention mechanism, allowing it to efficiently extract significant human-centric features from video frames. By combining dual channel-spatial attention layers with residual 3D convolution layers, the network becomes more discerning in capturing spatial receptive fields containing objects within the feature maps. To assess the effectiveness and robustness of our proposed method, we have conducted extensive experiments on four well-established benchmark datasets for human action recognition. The quantitative results obtained validate the efficiency of our method, showcasing significant improvements in accuracy of up to 11% as compared to state-of-the-art human action recognition methods. Additionally, our evaluation of inference time reveals that the proposed method achieves up to a 74× improvement in frames per second (FPS) compared to existing approaches, thus showing the suitability and effectiveness of the proposed DA-R3DCNN for real-time human activity recognition.

 Artículos similares

       
 
Michiel van der Vlag, Lionel Kusch, Alain Destexhe, Viktor Jirsa, Sandra Diaz-Pier and Jennifer S. Goldman    
Global neural dynamics emerge from multi-scale brain structures, with nodes dynamically communicating to form transient ensembles that may represent neural information. Neural activity can be measured empirically at scales spanning proteins and subcellul... ver más
Revista: Applied Sciences

 
Miao Feng and Jean Meunier    
Recognizing human actions can help in numerous ways, such as health monitoring, intelligent surveillance, virtual reality and human?computer interaction. A quick and accurate detection algorithm is required for daily real-time detection. This paper first... ver más
Revista: Algorithms

 
Abdorreza Alavigharahbagh, Vahid Hajihashemi, José J. M. Machado and João Manuel R. S. Tavares    
In this article, a hierarchical method for action recognition based on temporal and spatial features is proposed. In current HAR methods, camera movement, sensor movement, sudden scene changes, and scene movement can increase motion feature errors and de... ver más
Revista: Information

 
Shiqi Yue and Yuanwu Shi    
With the rapid development of computer and artificial intelligence technology, robots have been widely used in assembly, sorting, and other work scenarios, gradually changing the human-oriented mechanical assembly line working mode. Traditional robot con... ver más
Revista: Applied Sciences

 
Chen Huang, Yimin Chen, Weiqin Tong, Tao Feng and Mingxing Deng    
As an effective solution for data visualization and analysis, the large-scale high-resolution display wall system has been widely used in various scientific research fields. On the basis of investigating existing system cases and research results, this p... ver más
Revista: Applied Sciences