Inicio  /  Information  /  Vol: 10 Par: 4 (2019)  /  Artículo
ARTÍCULO
TITULO

Deep Image Similarity Measurement Based on the Improved Triplet Network with Spatial Pyramid Pooling

Xinpan Yuan    
Qunfeng Liu    
Jun Long    
Lei Hu and Yulou Wang    

Resumen

Image similarity measurement is a fundamental problem in the field of computer vision. It is widely used in image classification, object detection, image retrieval, and other fields, mostly through Siamese or triplet networks. These networks consist of two or three identical branches of convolutional neural network (CNN) and share their weights to obtain the high-level image feature representations so that similar images are mapped close to each other in the feature space, and dissimilar image pairs are mapped far from each other. Especially, the triplet network is known as the state-of-the-art method on image similarity measurement. However, the basic CNN can only handle fixed-size images. If we obtain a fixed size image via cutting or scaling, the information of the image will be lost and the recognition accuracy will be reduced. To solve the problem, this paper has proposed the triplet spatial pyramid pooling network (TSPP-Net) through combing the triplet convolution neural network with the spatial pyramid pooling. Additionally, we propose an improved triplet loss function, so that the network model can realize twice distance learning by only inputting three samples at one time. Through the theoretical analysis and experiments, it is proved that the TSPP-Net model and the improved triple loss function can improve the generalization ability and the accuracy of image similarity measurement algorithm.

 Artículos similares

       
 
Jingxiong Lei, Xuzhi Liu, Haolang Yang, Zeyu Zeng and Jun Feng    
High-resolution remote sensing images (HRRSI) have important theoretical and practical value in urban planning. However, current segmentation methods often struggle with issues like blurred edges and loss of detailed information due to the intricate back... ver más
Revista: Applied Sciences

 
Ryota Higashimoto, Soh Yoshida and Mitsuji Muneyasu    
This paper addresses the performance degradation of deep neural networks caused by learning with noisy labels. Recent research on this topic has exploited the memorization effect: networks fit data with clean labels during the early stages of learning an... ver más
Revista: Applied Sciences

 
Jin-Woo Kong, Byoung-Doo Oh, Chulho Kim and Yu-Seop Kim    
Intracerebral hemorrhage (ICH) is a severe cerebrovascular disorder that poses a life-threatening risk, necessitating swift diagnosis and treatment. While CT scans are the most effective diagnostic tool for detecting cerebral hemorrhage, their interpreta... ver más
Revista: Applied Sciences

 
Tianhao Gao, Meng Zhang, Yifan Zhu, Youjian Zhang, Xiangsheng Pang, Jing Ying and Wenming Liu    
Classifying sports videos is complex due to their dynamic nature. Traditional methods, like optical flow and the Histogram of Oriented Gradient (HOG), are limited by their need for expertise and lack of universality. Deep learning, particularly Convoluti... ver más
Revista: Applied Sciences

 
Yanjun Li, Takaaki Yoshimura, Yuto Horima and Hiroyuki Sugimori    
The detection of coronary artery stenosis is one of the most important indicators for the diagnosis of coronary artery disease. However, stenosis in branch vessels is often difficult to detect using computer-aided systems and even radiologists because of... ver más
Revista: Algorithms