Redirigiendo al acceso original de articulo en 17 segundos...
Inicio  /  Applied Sciences  /  Vol: 10 Par: 4 (2020)  /  Artículo
ARTÍCULO
TITULO

Modelling a Spatial-Motion Deep Learning Framework to Classify Dynamic Patterns of Videos

Sandeli Priyanwada Kasthuri Arachchi    
Timothy K. Shih and Noorkholis Luthfil Hakim    

Resumen

Video classification is an essential process for analyzing the pervasive semantic information of video content in computer vision. Traditional hand-crafted features are insufficient when classifying complex video information due to the similarity of visual contents with different illumination conditions. Prior studies of video classifications focused on the relationship between the standalone streams themselves. In this paper, by leveraging the effects of deep learning methodologies, we propose a two-stream neural network concept, named state-exchanging long short-term memory (SE-LSTM). With the model of spatial motion state-exchanging, the SE-LSTM can classify dynamic patterns of videos using appearance and motion features. The SE-LSTM extends the general purpose of LSTM by exchanging the information with previous cell states of both appearance and motion stream. We propose a novel two-stream model Dual-CNNSELSTM utilizing the SE-LSTM concept combined with a Convolutional Neural Network, and use various video datasets to validate the proposed architecture. The experimental results demonstrate that the performance of the proposed two-stream Dual-CNNSELSTM architecture significantly outperforms other datasets, achieving accuracies of 81.62%, 79.87%, and 69.86% with hand gestures, fireworks displays, and HMDB51 datasets, respectively. Furthermore, the overall results signify that the proposed model is most suited to static background dynamic patterns classifications.

 Artículos similares

       
 
Laura Guimarães, António Paulo Carvalho, Pedro Ribeiro, Cláudia Teixeira, Nuno Silva, André Pereira, João Amorim and Luís Oliva-Teles    
Triops longicaudatus is a crustacean typically inhabiting temporary freshwater bodies in regions with a Mediterranean climate. These crustaceans are easily maintained in the laboratory and show a set of biological features that make them good candidates ... ver más
Revista: Water

 
Kasun Moolikagedara, Minh Nguyen, Weiqi Yan and Xuejun Li    
In the digital age, where the Internet of Things (IoT) permeates every facet of our lives, the safeguarding of data privacy, especially video data, emerges as a paramount concern. The ubiquity of IoT devices, capable of capturing and disseminating vast q... ver más
Revista: Information

 
Vivian W. H. Wong and Kincho H. Law    
Crowd congestion is one of the main causes of modern public safety issues such as stampedes. Conventional crowd congestion monitoring using closed-circuit television (CCTV) video surveillance relies on manual observation, which is tedious and often error... ver más
Revista: Algorithms

 
Shukai Li, Xiaofang Wang, Dongri Shan and Peng Zhang    
Temporal modeling is a key problem in action recognition, and it remains difficult to accurately model temporal information of videos. In this paper, we present a local spatiotemporal extraction module (LSTE) and a channel time excitation module (CTE), w... ver más
Revista: Applied Sciences

 
Kolja Hedrich, Lennart Hinz and Eduard Reithmeier    
The automation of inspections in aircraft engines is an ever-increasing growing field of research. In particular, the inspection and quantification of coating damages in confined spaces, usually performed manually with handheld endoscopes, comprise tasks... ver más
Revista: Aerospace