Informatics, Vol. 6, Part 2 (2019)
ARTICLE
TITLE

The Effect of Evidence Transfer on Latent Feature Relevance for Clustering

Athanasios Davvetas    
Iraklis A. Klampanos    
Spiros Skiadopoulos
Vangelis Karkaletsis

Abstract

Evidence transfer for clustering is a deep learning method that manipulates the latent representations of an autoencoder according to external categorical evidence in order to improve a clustering outcome. Evidence transfer's application to clustering is designed to be robust when the evidence is of low quality, while increasing clustering accuracy when the evidence is relevant. We interpret the effects of evidence transfer on the latent representation of an autoencoder by comparing our method to the information bottleneck method. Information bottleneck is the optimisation problem of finding the best tradeoff between maximising the mutual information between data representations and a task outcome and, at the same time, compressing the original data source effectively. We posit that the evidence transfer method has essentially the same objective regarding the latent representations produced by an autoencoder. We verify our hypothesis using information-theoretic metrics from feature selection to perform an empirical analysis of the information that is carried through the bottleneck of the latent space. We use the relevance metric to compare the overall mutual information between the latent representations and the ground truth labels before and after their incremental manipulation, as well as to study the effects of evidence transfer on the significance of each latent feature.
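
The abstract does not spell out how the relevance metric is computed. In the feature selection literature, relevance is commonly taken to be the mutual information between a single feature and the class label. The sketch below illustrates that reading under stated assumptions: the equal-width binning, the plug-in estimator, the function names (`mutual_information`, `discretise`, `feature_relevance`) and the toy data are all hypothetical and are not taken from the paper.

```python
import math
import random
from collections import Counter


def mutual_information(xs, ys):
    """Plug-in estimate of I(X; Y) in nats for two discrete sequences."""
    n = len(xs)
    joint = Counter(zip(xs, ys))
    px = Counter(xs)
    py = Counter(ys)
    mi = 0.0
    for (x, y), count in joint.items():
        # p(x, y) * log(p(x, y) / (p(x) * p(y))), written with raw counts
        mi += (count / n) * math.log(count * n / (px[x] * py[y]))
    return mi


def discretise(values, bins=10):
    """Equal-width binning of one continuous latent feature (an assumption)."""
    lo, hi = min(values), max(values)
    width = (hi - lo) / bins or 1.0  # guard against a constant feature
    return [min(int((v - lo) / width), bins - 1) for v in values]


def feature_relevance(latents, labels, bins=10):
    """Relevance of each latent dimension j, estimated as I(z_j; y)."""
    n_features = len(latents[0])
    return [
        mutual_information(discretise([row[j] for row in latents], bins), labels)
        for j in range(n_features)
    ]


# Hypothetical toy data: feature 0 tracks the label, feature 1 is pure noise.
rng = random.Random(0)
labels = [rng.randrange(2) for _ in range(2000)]
latents = [[y + rng.gauss(0, 0.3), rng.gauss(0, 1.0)] for y in labels]

relevance = feature_relevance(latents, labels)
print(relevance)  # feature 0 should score far higher than feature 1
```

Running the same function on latent codes extracted before and after a manipulation step would give the kind of per-feature, before-and-after comparison the abstract describes. A histogram estimator is used here only because it needs no external libraries; its values depend on the number of bins.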
