Inicio  /  Applied Sciences  /  Vol: 12 Par: 2 (2022)  /  Artículo
ARTÍCULO
TITULO

Human Activity Classification Using the 3DCNN Architecture

Roberta Vrskova    
Robert Hudec    
Patrik Kamencay and Peter Sykora    

Resumen

Interest in utilizing neural networks in a variety of scientific and academic studies and in industrial applications is increasing. In addition to the growing interest in neural networks, there is also a rising interest in video classification. Object detection from an image is used as a tool for various applications and is the basis for video classification. Identifying objects in videos is more difficult than for single images, as the information in videos has a time continuity constraint. Common neural networks such as ConvLSTM (Convolutional Long Short-Term Memory) and 3DCNN (3D Convolutional Neural Network), as well as many others, have been used to detect objects from video. Here, we propose a 3DCNN for the detection of human activity from video data. The experimental results show that the optimized proposed 3DCNN provides better results than neural network architectures for motion, static and hybrid features. The proposed 3DCNN obtains the highest recognition precision of the methods considered, 87.4%. In contrast, the neural network architectures for motion, static and hybrid features achieve precisions of 65.4%, 63.1% and 71.2%, respectively. We also compare results with previous research. Previous 3DCNN architecture on database UCF Youtube Action worked worse than the architecture we proposed in this article, where the achieved result was 29%. The experimental results on the UCF YouTube Action dataset demonstrate the effectiveness of the proposed 3DCNN for recognition of human activity. For a more complex comparison of the proposed neural network, the modified UCF101 dataset, full UCF50 dataset and full UCF101 dataset were compared. An overall precision of 82.7% using modified UCF101 dataset was obtained. On the other hand, the precision using full UCF50 dataset and full UCF101 dataset was 80.6% and 78.5%, respectively.

 Artículos similares

       
 
María Zamarreño Suárez, Juan Marín Martínez, Francisco Pérez Moreno, Raquel Delgado-Aguilera Jurado, Patricia María López de Frutos and Rosa María Arnaldo Valdés    
The use of electroencephalography (EEG) techniques has many advantages in the study of human performance in air traffic control (ATC). At present, these are non-intrusive techniques that allow large volumes of data to be recorded on a continuous basis us... ver más
Revista: Aerospace

 
Varsha S. Lalapura, Veerender Reddy Bhimavarapu, J. Amudha and Hariram Selvamurugan Satheesh    
The Recurrent Neural Networks (RNNs) are an essential class of supervised learning algorithms. Complex tasks like speech recognition, machine translation, sentiment classification, weather prediction, etc., are now performed by well-trained RNNs. Local o... ver más
Revista: Algorithms

 
Vijeta Sharma, Manjari Gupta, Ajai Kumar and Deepti Mishra    
The video camera is essential for reliable activity monitoring, and a robust analysis helps in efficient interpretation. The systematic assessment of classroom activity through videos can help understand engagement levels from the perspective of both stu... ver más
Revista: Information

 
Shuyu Liang, Tianqi Yan, Hongshan Gao, Chuantao Jing, Fei He and Meiqin Han    
Revista: Water

 
Néstor J. Jarque-Bou, Margarita Vergara and Joaquín L. Sancho-Bru    
Revista: Applied Sciences