Home  /  Future Internet  /  Vol. 13, Issue 12 (2021)  /  Article
ARTICLE
TITLE

An Efficient Deep Convolutional Neural Network Approach for Object Detection and Recognition Using a Multi-Scale Anchor Box in Real-Time

Vijayakumar Varadarajan, Dweepna Garg and Ketan Kotecha

Abstract

Deep learning is a relatively new branch of machine learning in which computers are taught to recognize patterns in massive volumes of data. It primarily describes learning at various levels of representation, which aids in understanding data that includes text, voice, and visuals. Convolutional neural networks have been used to solve computer vision challenges, including object detection, image classification, semantic segmentation, and more. Object detection in videos involves confirming the presence of an object in the image or video and then locating it accurately for recognition. Video modelling techniques suffer from high computation and memory costs, which can degrade performance measures such as accuracy and efficiency when identifying objects in real time. Current object detection techniques based on deep convolutional neural networks must execute multilevel convolution and pooling operations over the entire image to extract deep semantic properties from it. For large objects, detection models can provide superior results; however, they fail to detect objects of varying sizes that have low resolution and are heavily influenced by noise, because the features produced by the repeated convolution operations of existing models do not fully represent the essential characteristics of the objects in real time. With the help of a multi-scale anchor box, the approach proposed in this paper enhances detection accuracy by extracting features of the object at multiple convolution levels. The major contribution of this paper is a model designed to better understand the parameters and hyper-parameters that affect the detection and recognition of objects of varying sizes and shapes, and to achieve real-time object detection and recognition speeds with improved accuracy.
The proposed model has achieved 84.49 mAP on the test set of the Pascal VOC-2007 dataset at 11 FPS, which is comparatively better than other real-time object detection models.
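The multi-scale anchor-box idea described in the abstract can be sketched as follows. This is an illustrative reconstruction, not the authors' code: the function name, feature-map sizes, strides, scales, and aspect ratios are all assumptions chosen for the example. Each feature-map level contributes anchors of a different base size, so objects of varying scales have at least one level whose anchors roughly match them.

```python
def generate_anchors(feature_sizes, strides, scales, ratios):
    """Generate multi-scale anchor boxes (illustrative sketch only).

    feature_sizes: list of (H, W) per feature-map level
    strides: stride of each level relative to the input image (pixels)
    scales: base anchor side length per level (pixels)
    ratios: aspect ratios shared across all levels
    Returns a list of (x1, y1, x2, y2) boxes in input-image coordinates.
    """
    boxes = []
    for (h, w), stride, scale in zip(feature_sizes, strides, scales):
        for i in range(h):
            for j in range(w):
                # Anchor centres sit at the middle of each feature-map cell.
                cy = (i + 0.5) * stride
                cx = (j + 0.5) * stride
                for r in ratios:
                    # Keep the anchor area equal to scale**2 while
                    # varying the width/height aspect ratio.
                    bw = scale * r ** 0.5
                    bh = scale / r ** 0.5
                    boxes.append((cx - bw / 2, cy - bh / 2,
                                  cx + bw / 2, cy + bh / 2))
    return boxes

# Two hypothetical pyramid levels of a 32x32 input image.
anchors = generate_anchors(
    feature_sizes=[(4, 4), (2, 2)],
    strides=[8, 16],
    scales=[16, 32],
    ratios=[0.5, 1.0, 2.0],
)
print(len(anchors))  # 60 anchors: (16 + 4 cells) x 3 ratios
```

During training, each anchor would be matched to ground-truth boxes by IoU, so small objects are assigned to the fine-stride level and large objects to the coarse one; the detector then only has to regress small offsets from the best-matching anchor.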

Similar articles

Mingze Li, Bing Li, Zhigang Qi, Jiashuai Li and Jiawei Wu    
Predicting ship trajectories plays a vital role in ensuring navigational safety, preventing collision incidents, and enhancing vessel management efficiency. The integration of advanced machine learning technology for precise trajectory prediction is emer...

 
Jianlong Ye, Hongchuan Yu, Gaoyang Liu, Jiong Zhou and Jiangpeng Shu    
Component identification and depth estimation are important for detecting the integrity of post-disaster structures. However, traditional manual methods might be time-consuming, labor-intensive, and influenced by subjective judgments of inspectors. Deep-...
Journal: Buildings

 
Binita Kusum Dhamala, Babu R. Dawadi, Pietro Manzoni and Baikuntha Kumar Acharya    
Graph representation is recognized as an efficient method for modeling networks, precisely illustrating intricate, dynamic interactions within various entities of networks by representing entities as nodes and their relationships as edges. Leveraging the...
Journal: Future Internet

 
Priyank Kalgaonkar and Mohamed El-Sharkawy    
Accurate perception is crucial for autonomous vehicles (AVs) to navigate safely, especially in adverse weather and lighting conditions where single-sensor networks (e.g., cameras or radar) struggle with reduced maneuverability and unrecognizable targets....
Journal: Future Internet

 
Yuhwan Kim, Chang-Ho Choi, Chang-Young Park and Seonghyun Park    
In today's society, where people spend over 90% of their time indoors, indoor air quality (IAQ) is crucial for sustaining human life. However, as various indoor activities such as cooking generate diverse types of pollutants in indoor spaces, IAQ has eme...
Journal: Buildings