Inicio  /  Information  /  Vol: 14 Par: 11 (2023)  /  Artículo
ARTÍCULO
TITULO

POSS-CNN: An Automatically Generated Convolutional Neural Network with Precision and Operation Separable Structure Aiming at Target Recognition and Detection

Jia Hou    
Jingyu Zhang    
Qi Chen    
Siwei Xiang    
Yishuo Meng    
Jianfei Wang    
Cimang Lu and Chen Yang    

Resumen

Artificial intelligence is changing and influencing our world. As one of the main algorithms in the field of artificial intelligence, convolutional neural networks (CNNs) have developed rapidly in recent years. Especially after the emergence of NASNet, CNNs have gradually pushed the idea of AutoML to the public?s attention, and large numbers of new structures designed by automatic searches are appearing. These networks are usually based on reinforcement learning and evolutionary learning algorithms. However, sometimes, the blocks of these networks are complex, and there is no small model for simpler tasks. Therefore, this paper proposes POSS-CNN aiming at target recognition and detection, which employs a multi-branch CNN structure with PSNC and a method of automatic parallel selection for super parameters based on a multi-branch CNN structure. Moreover, POSS-CNN can be broken up. By choosing a single branch or the combination of two branches as the ?benchmark?, as well as the overall POSS-CNN, we can achieve seven models with different precision and operations. The test accuracy of POSS-CNN for a recognition task tested on a CIFAR10 dataset can reach 86.4%, which is equivalent to AlexNet and VggNet, but the operation and parameters of the whole model in this paper are 45.9% and 45.8% of AlexNet, and 29.5% and 29.4% of VggNet. The mAP of POSS-CNN for a detection task tested on the LSVH dataset is 45.8, inferior to the 62.3 of YOLOv3. However, compared with YOLOv3, the operation and parameters of the model in this paper are reduced by 57.4% and 15.6%, respectively. After being accelerated by WRA, POSS-CNN for a detection task tested on an LSVH dataset can achieve 27 fps, and the energy efficiency is 0.42 J/f, which is 5 times and 96.6 times better than GPU 2080Ti in performance and energy efficiency, respectively.

 Artículos similares

       
 
Yu Tang, Zhiqin He, Qinmu Wu, Xiao Wang and Yuhang Wang    
The scoliosis report is a diagnosis made by the clinician looking at X-ray images of the spine. However, with numerous images, writing the report can be time-consuming and error-prone. Therefore, this paper proposes an automatic generation model of the e... ver más
Revista: Applied Sciences

 
Oscar Ondeng, Heywood Ouma and Peter Akuon    
Visual understanding is a research area that bridges the gap between computer vision and natural language processing. Image captioning is a visual understanding task in which natural language descriptions of images are automatically generated using visio... ver más
Revista: Applied Sciences

 
Min-Kyu Kim, Jong-Hwa Kim and Hyun Yang    
In this study, basic research was conducted regarding the era of autonomous vessels and artificial intelligence (deep learning, big data, etc.). When a vessel is navigating autonomously, it must determine the optimal route by itself and accurately follow... ver más

 
Mingyuan Huang, Dawei Cheng, Jia Zhou and Zhong Lu    
Traditional reliability analysis methods such as Reliability Block Diagram, Fault Tree Analysis, and Markov Analysis are all subjective methods whose results significantly depend on the analysts? skills and experiences. A model-based reliability method i... ver más
Revista: Aerospace

 
Fahim Sufi    
Utilizing social media data is imperative in comprehending critical insights on the Russia?Ukraine cyber conflict due to their unparalleled capacity to provide real-time information dissemination, thereby enabling the timely tracking and analysis of cybe... ver más
Revista: Information