Inicio  /  Applied Sciences  /  Vol: 13 Par: 15 (2023)  /  Artículo
ARTÍCULO
TITULO

Efficient Information-Theoretic Large-Scale Semi-Supervised Metric Learning via Proxies

Peng Chen and Huibing Wang    

Resumen

Semi-supervised metric learning intends to learn a distance function from the limited labeled data as well as a large amount of unlabeled data to better gauge the similarities of any two instances than using a general distance function. However, most existing semi-supervised metric learning methods rely on the manifold assumptions to mine the rich discriminant information of the unlabeled data, which breaks the intrinsic connection between the manifold regularizer-building process and the subsequent metric learning. Moreover, these methods usually encounter high computational or memory overhead. To solve these issues, we develop a novel method entitled Information-Theoretic Large-Scale Semi-Supervised Metric Learning via Proxies (ISMLP). ISMLP aims to simultaneously learn multiple proxy vectors as well as a Mahalanobis matrix and forms the semi-supervised metric learning as the probability distribution optimization parameterized by the Mahalanobis distance between the instance and each proxy vector. ISMLP maximizes the entropy of the labeled data and minimizes that of the unlabeled data to follow the entropy regularization, in this way, the labeled part and unlabeled part can be integrated in a meaningful way. Furthermore, the time complexity of the proposed method has a linear dependency concerning the number of instances, thereby, can be extended to the large-scale dataset without incurring too much time. Experiments on multiple datasets demonstrate the superiority of the proposed method over the compared methods used in the experiments.

 Artículos similares

       
 
Pin-Hung Juan and Ja-Ling Wu    
In this study, we present a federated learning approach that combines a multi-branch network and the Oort client selection algorithm to improve the performance of federated learning systems. This method successfully addresses the significant issue of non... ver más
Revista: Algorithms

 
Kai Sun, Ziyin Wu, Mingwei Wang, Jihong Shang, Zhihao Liu, Dineng Zhao and Xiaowen Luo    
Polymetallic nodules are spherical or ellipsoidal mineral aggregates formed naturally in deep-sea environments. They contain a variety of metallic elements and are important solid mineral resources on the seabed. How best to quickly and accurately identi... ver más

 
Shangcong Zhang, Yongfang Li, Xuefei Chen, Ruyi Zhou, Ziran Wu and Taha Zarhmouti    
Fire pumps are the key components of water supply in a firefighting system. At present, there is a lack of fire water pump testing methods that intelligently detect faulty states. Existing testing approaches require manual operation, which leads to low e... ver más
Revista: Water

 
Diego Sánchez-Moreno, Vivian F. López Batista, María Dolores Muñoz Vicente, Ángel Luis Sánchez Lázaro and María N. Moreno-García    
Information from social networks is currently being widely used in many application domains, although in the music recommendation area, its use is less common because of the limited availability of social data. However, most streaming platforms allow for... ver más
Revista: Information

 
Yongen Lin, Dagang Wang, Tao Jiang and Aiqing Kang    
Reliable streamflow forecasting is a determining factor for water resource planning and flood control. To better understand the strengths and weaknesses of newly proposed methods in streamflow forecasting and facilitate comparisons of different research ... ver más
Revista: Water