Inicio  /  Information  /  Vol: 14 Par: 2 (2023)  /  Artículo
ARTÍCULO
TITULO

Improved Feature Extraction and Similarity Algorithm for Video Object Detection

Haotian You    
Yufang Lu and Haihua Tang    

Resumen

Video object detection is an important research direction of computer vision. The task of video object detection is to detect and classify moving objects in a sequence of images. Based on the static image object detector, most of the existing video object detection methods use the unique temporal correlation of video to solve the problem of missed detection and false detection caused by moving object occlusion and blur. Another video object detection model guided by an optical flow network is widely used. Feature aggregation of adjacent frames is performed by estimating the optical flow field. However, there are many redundant computations for feature aggregation of adjacent frames. To begin with, this paper improved Faster RCNN by Feature Pyramid and Dynamic Region Aware Convolution. Then the S-SELSA module is proposed from the perspective of semantic and feature similarity. Feature similarity is obtained by a modified SSIM algorithm. The module can aggregate the features of frames globally to avoid redundancy. Finally, the experimental results on the ImageNet VID and DET datasets show that the mAP of the method proposed in this paper is 83.55%, which is higher than the existing methods.

 Artículos similares

       
 
Dacheng Yu, Mingjun Zhang, Feng Yao and Jitao Li    
Variational Mode Decomposition (VMD) has typically been used in weak fault feature extraction in recent years. The problem analyzed in this study is weak fault feature extraction and the enhancement of AUV thrusters based on Artificial Rabbits Optimizati... ver más

 
Qiyan Li, Zhi Weng, Zhiqiang Zheng and Lixin Wang    
The decrease in lake area has garnered significant attention within the global ecological community, prompting extensive research in remote sensing and computer vision to accurately segment lake areas from satellite images. However, existing image segmen... ver más
Revista: Applied Sciences

 
Yimin Ma, Yi Xu, Yunqing Liu, Fei Yan, Qiong Zhang, Qi Li and Quanyang Liu    
In recent years, deep convolutional neural networks with multi-scale features have been widely used in image super-resolution reconstruction (ISR), and the quality of the generated images has been significantly improved compared with traditional methods.... ver más
Revista: Applied Sciences

 
Wenqi Lyu, Wei Ke, Hao Sheng, Xiao Ma and Huayun Zhang    
In response to the challenge of handling large-scale 3D point cloud data, downsampling is a common approach, yet it often leads to the problem of feature loss. We present a dynamic downsampling algorithm for 3D point cloud maps based on an improved voxel... ver más
Revista: Applied Sciences

 
Zheng Zhao, Jialing Yuan and Luhao Chen    
Air Traffic Flow Management (ATFM) delay can quantitatively reflect the congestion caused by the imbalance between capacity and demand in an airspace network. Furthermore, it is an important parameter for the ex-post analysis of airspace congestion and t... ver más
Revista: Aerospace