Applied Sciences, Vol. 12, Issue 18 (2022)
Title

ReSTiNet: On Improving the Performance of Tiny-YOLO-Based CNN Architecture for Applications in Human Detection

Shahriar Shakir Sumit    
Dayang Rohaya Awang Rambli    
Seyedali Mirjalili    
Muhammad Mudassir Ejaz and M. Saef Ullah Miah    

Abstract

Human detection is a specialized application of object recognition and is considered one of the most challenging problems in computer vision. It is the starting point for a number of applications, including public safety and security surveillance worldwide. Human detection technologies have advanced significantly in recent years owing to the rapid development of deep learning techniques. Despite these advances, there is still a need for network-design practices that enable compact model sizes, deep designs, and fast training times while maintaining high accuracy. In this article, we propose ReSTiNet, a novel compressed convolutional neural network that addresses the issues of size, detection speed, and accuracy. Following SqueezeNet, ReSTiNet adopts fire modules; the number of fire modules and their placement within the model are examined in order to reduce the number of parameters and thus the model size. Residual connections are incorporated within the fire modules and carefully constructed to improve feature propagation and maximize information flow through the model, with the goal of further improving ReSTiNet's detection speed and accuracy. The proposed algorithm downsizes the previously popular Tiny-YOLO model and offers the following improvements: (1) faster detection speed; (2) a more compact model size; (3) resolution of the overfitting problem; and (4) better performance than other lightweight models such as MobileNet and SqueezeNet in terms of mAP. The proposed model was trained and tested on the MS COCO and PASCAL VOC datasets. The resulting ReSTiNet model is 10.7 MB in size (almost five times smaller than Tiny-YOLO) and achieves an mAP of 63.74% on PASCAL VOC and 27.3% on MS COCO using a Tesla K80 GPU.
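To illustrate why fire modules reduce the parameter count, the following minimal sketch compares a SqueezeNet-style fire module (a 1x1 squeeze layer followed by parallel 1x1 and 3x3 expand layers whose outputs are concatenated) with a plain 3x3 convolution of the same input and output width. The channel sizes used here are hypothetical and are not the ReSTiNet configuration, which the abstract does not specify. The sketch also checks the usual condition for an identity residual connection around a fire module, namely that input and output channel counts match; such a skip connection adds no parameters.

```python
def conv_params(c_in, c_out, k):
    """Weights plus biases of a k x k convolution layer."""
    return k * k * c_in * c_out + c_out


def fire_module_params(c_in, squeeze, expand1x1, expand3x3):
    """Parameter count of a SqueezeNet-style fire module.

    squeeze:   number of 1x1 filters in the squeeze layer
    expand1x1: number of 1x1 filters in the expand layer
    expand3x3: number of 3x3 filters in the expand layer
    The module output has expand1x1 + expand3x3 channels (concatenation).
    """
    squeeze_layer = conv_params(c_in, squeeze, 1)
    expand_1x1 = conv_params(squeeze, expand1x1, 1)
    expand_3x3 = conv_params(squeeze, expand3x3, 3)
    return squeeze_layer + expand_1x1 + expand_3x3


def can_use_identity_skip(c_in, expand1x1, expand3x3):
    """An identity residual connection needs matching channel counts."""
    return c_in == expand1x1 + expand3x3


if __name__ == "__main__":
    # Hypothetical channel sizes chosen for illustration only.
    c_in, squeeze, e1, e3 = 128, 16, 64, 64
    fire = fire_module_params(c_in, squeeze, e1, e3)
    plain = conv_params(c_in, e1 + e3, 3)
    print("fire module parameters:", fire)
    print("plain 3x3 conv parameters:", plain)
    print("reduction factor: %.1fx" % (plain / fire))
    print("identity skip possible:", can_use_identity_skip(c_in, e1, e3))
```

Under these assumed sizes, the fire module needs roughly an order of magnitude fewer parameters than the plain convolution. The overall size reduction of a full model depends on how many fire modules are used and where they are placed, which is what the abstract says was examined.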
