Inicio  /  Applied Sciences  /  Vol: 13 Par: 17 (2023)  /  Artículo
ARTÍCULO
TITULO

Attention Mechanism Used in Monocular Depth Estimation: An Overview

Yundong Li    
Xiaokun Wei and Hanlu Fan    

Resumen

Monocular depth estimation (MDE), as one of the fundamental tasks of computer vision, plays important roles in downstream applications such as virtual reality, 3D reconstruction, and robotic navigation. Convolutional neural networks (CNN)-based methods gained remarkable progress compared with traditional methods using visual cues. However, recent researches reveal that the performance of MDE using CNN could be degraded due to the local receptive field of CNN. To bridge the gap, various attention mechanisms were proposed to model the long-range dependency. Although reviews of MDE algorithms based on CNN were reported, a comprehensive outline of how attention boosts MDE performance is not explored yet. In this paper, we firstly categorize recent attention-related works into CNN-based, Transformer-based, and hybrid (CNN?Transformer-based) approaches in the light of how the attention mechanism impacts the extraction of global features. Secondly, we discuss the details and contributions of attention-based MDE methods published from 2020 to 2022. Then, we compare the performance of the typical attention-based methods. Finally, the challenges and trends of the attention mechanism used in MDE are discussed.

 Artículos similares

       
 
Jing Luo, Yuhang Zhang, Jiayuan Zhuang and Yumin Su    
The development of intelligent task allocation and path planning algorithms for unmanned surface vehicles (USVs) is gaining significant interest, particularly in supporting complex ocean operations. This paper proposes an intelligent hybrid algorithm tha... ver más

 
Chenhong Yan, Shefeng Yan, Tianyi Yao, Yang Yu, Guang Pan, Lu Liu, Mou Wang and Jisheng Bai    
Ship-radiated noise classification is critical in ocean acoustics. Recently, the feature extraction method combined with time?frequency spectrograms and convolutional neural networks (CNNs) has effectively described the differences between various underw... ver más

 
Ruoyang Li, Shuping Xiong, Yinchao Che, Lei Shi, Xinming Ma and Lei Xi    
Semantic segmentation algorithms leveraging deep convolutional neural networks often encounter challenges due to their extensive parameters, high computational complexity, and slow execution. To address these issues, we introduce a semantic segmentation ... ver más
Revista: Algorithms

 
Yuhuan Wu and Yonghong Wu    
Salient object detection (SOD) aims to identify the most visually striking objects in a scene, simulating the function of the biological visual attention system. The attention mechanism in deep learning is commonly used as an enhancement strategy which e... ver más
Revista: Algorithms

 
Jiao Su, Yi An, Jialin Wu and Kai Zhang    
Pedestrian detection has always been a difficult and hot spot in computer vision research. At the same time, pedestrian detection technology plays an important role in many applications, such as intelligent transportation and security monitoring. In comp... ver más
Revista: Algorithms