Inicio  /  Applied Sciences  /  Vol: 13 Par: 17 (2023)  /  Artículo
ARTÍCULO
TITULO

Attention Mechanism Used in Monocular Depth Estimation: An Overview

Yundong Li    
Xiaokun Wei and Hanlu Fan    

Resumen

Monocular depth estimation (MDE), as one of the fundamental tasks of computer vision, plays important roles in downstream applications such as virtual reality, 3D reconstruction, and robotic navigation. Convolutional neural networks (CNN)-based methods gained remarkable progress compared with traditional methods using visual cues. However, recent researches reveal that the performance of MDE using CNN could be degraded due to the local receptive field of CNN. To bridge the gap, various attention mechanisms were proposed to model the long-range dependency. Although reviews of MDE algorithms based on CNN were reported, a comprehensive outline of how attention boosts MDE performance is not explored yet. In this paper, we firstly categorize recent attention-related works into CNN-based, Transformer-based, and hybrid (CNN?Transformer-based) approaches in the light of how the attention mechanism impacts the extraction of global features. Secondly, we discuss the details and contributions of attention-based MDE methods published from 2020 to 2022. Then, we compare the performance of the typical attention-based methods. Finally, the challenges and trends of the attention mechanism used in MDE are discussed.

 Artículos similares

       
 
Haiyang Yao, Tian Gao, Yong Wang, Haiyan Wang and Xiao Chen    
To overcome the challenges of inadequate representation and ineffective information exchange stemming from feature homogenization in underwater acoustic target recognition, we introduce a hybrid network named Mobile_ViT, which synergizes MobileNet and Tr... ver más

 
Changhong Liu, Jiawen Wen, Jinshan Huang, Weiren Lin, Bochun Wu, Ning Xie and Tao Zou    
Underwater object detection is crucial in marine exploration, presenting a challenging problem in computer vision due to factors like light attenuation, scattering, and background interference. Existing underwater object detection models face challenges ... ver más

 
Zheng Zhao, Jialing Yuan and Luhao Chen    
Air Traffic Flow Management (ATFM) delay can quantitatively reflect the congestion caused by the imbalance between capacity and demand in an airspace network. Furthermore, it is an important parameter for the ex-post analysis of airspace congestion and t... ver más
Revista: Aerospace

 
Ping Huang and Yafeng Wu    
Airborne speech enhancement is always a major challenge for the security of airborne systems. Recently, multi-objective learning technology has become one of the mainstream methods of monaural speech enhancement. In this paper, we propose a novel multi-o... ver más
Revista: Aerospace

 
Yong Liu, Jialin Zhou, Dong Zhang, Shaoyu Wei, Mingshun Yang and Xinqin Gao    
To solve the problem of low diagnostic accuracy caused by the scarcity of fault samples and class imbalance in the fault diagnosis task of box-type substations, a fault diagnosis method based on self-attention improvement of conditional tabular generativ... ver más
Revista: Applied Sciences