Inicio  /  Information  /  Vol: 15 Par: 4 (2024)  /  Artículo
ARTÍCULO
TITULO

S-YOLOv5: A Lightweight Model for Detecting Objects Thrown from Tall Buildings in Communities

Yuntao Shi    
Qi Luo    
Meng Zhou    
Wei Guo    
Jie Li    
Shuqin Li and Yu Ding    

Resumen

Objects thrown from tall buildings in communities are characterized by their small size, inconspicuous features, and high speed. Existing algorithms for detecting such objects face challenges, including excessive parameters, overly complex models that are difficult to implement, and insufficient detection accuracy. This study proposes a lightweight detection model for objects thrown from tall buildings in communities, named S-YOLOv5, to address these issues. The model is based on the YOLOv5 algorithm, and a lightweight convolutional neural network, Enhanced ShuffleNet (ESNet), is chosen as the backbone network to extract image features. On this basis, the initial stage of the backbone network is enhanced and the simplified attention module (SimAM) attention mechanism is added to utilize the rich position information and contour information in the shallow feature map to improve the detection of small targets. For feature fusion, the sparsely connected Path Aggregation Network (SCPANet) module is designed to use sparsely connected convolution (SCConv) instead of the regular convolution of the Path Aggregation Network (PANet) to fuse features efficiently. In addition, the model uses the normalized Wasserstein distance (NWD) loss function to reduce the sensitivity of positional bias. The accuracy of the model is further improved. Test results from the self-built objects thrown from tall buildings dataset show that S-YOLOv5 can detect objects thrown from tall buildings quickly and accurately, with an accuracy of 90.2% and a detection rate of 34.1 Fps/s. Compared with the original YOLOv5 model, the parameters are reduced by 87.3%, and the accuracy and rate are improved by 0.8% and 63%, respectively.

 Artículos similares

       
 
Hao Liu, Bo Yang and Zhiwen Yu    
Multimodal sarcasm detection is a developing research field in social Internet of Things, which is the foundation of artificial intelligence and human psychology research. Sarcastic comments issued on social media often imply people?s real attitudes towa... ver más
Revista: Applied Sciences

 
Xuesong Jin, Huiyuan Sun and Yuhang Zhang    
This research is based on an intra-frame rate control algorithm based on the Versatile Video Coding (VVC) standard, considering that there is the phenomenon of over-allocating the bitrate of the end coding tree units (CTUs) in the bit allocation process,... ver más
Revista: Applied Sciences

 
Mohammad Alhumaid and Ayman G. Fayoumi    
Paranasal sinus pathologies, particularly those affecting the maxillary sinuses, pose significant challenges in diagnosis and treatment due to the complex anatomical structures and diverse disease manifestations. The aim of this study is to investigate t... ver más
Revista: Applied Sciences

 
Evangelos Filippou, Spyridon Kilimtzidis, Athanasios Kotzakolios and Vassilis Kostopoulos    
The pursuit of more efficient transport has led engineers to develop a wide variety of aircraft configurations with the aim of reducing fuel consumption and emissions. However, these innovative designs introduce significant aeroelastic couplings that can... ver más
Revista: Aerospace

 
Luis A. Fletscher, Alejandra Zuleta, Alexander Galvis, David Quintero, Juan Felipe Botero and Natalia Gaviria    
While 5G has become a reality in several places around the world, some countries are still in the process of assigning frequency bands and deploying networks. In this context, there is a significant opportunity to explore new market models for the manage... ver más
Revista: Information