Inicio  /  Algorithms  /  Vol: 14 Par: 2 (2021)  /  Artículo
ARTÍCULO
TITULO

Detection of Representative Variables in Complex Systems with Interpretable Rules Using Core-Clusters

Camille Champion    
Anne-Claire Brunet    
Rémy Burcelin    
Jean-Michel Loubes and Laurent Risser    

Resumen

In this paper, we present a new framework dedicated to the robust detection of representative variables in high dimensional spaces with a potentially limited number of observations. Representative variables are selected by using an original regularization strategy: they are the center of specific variable clusters, denoted CORE-clusters, which respect fully interpretable constraints. Each CORE-cluster indeed contains more than a predefined amount of variables and each pair of its variables has a coherent behavior in the observed data. The key advantage of our regularization strategy is therefore that it only requires to tune two intuitive parameters: the minimal dimension of the CORE-clusters and the minimum level of similarity which gathers their variables. Interpreting the role played by a selected representative variable is additionally obvious as it has a similar observed behaviour as a controlled number of other variables. After introducing and justifying this variable selection formalism, we propose two algorithmic strategies to detect the CORE-clusters, one of them scaling particularly well to high-dimensional data. Results obtained on synthetic as well as real data are finally presented.

 Artículos similares

       
 
Zeyulin Zhang, Yanshuang Ba, Dazheng Chen, Pengru Yan, Qingwen Song, Yuming Zhang, Weidong Zhu, Chunfu Zhang and Yue Hao    
All-inorganic perovskites, with their low-cost, simple processes and superior heat stability, have become potential candidate materials for photodetectors (PDs). However, they have no representative responsivity in the deep-ultraviolet (UV) wavelength re... ver más
Revista: Applied Sciences

 
Rongrong Wu, Mingdong Dong and Lei Liu    
The unique nano?bio interfacial phenomena play a crucial role in the biosafety and bioapplications of nanomaterials. As a representative two-dimensional (2D) nanomaterial, molybdenum disulfide (MoS2) has shown great potential in biological applications d... ver más
Revista: Coatings

 
Jian Huang and Yijun Gu    
Community detection is an important task in the analysis of complex networks, which is significant for mining and analyzing the organization and function of networks. As an unsupervised learning algorithm based on the particle competition mechanism, stoc... ver más
Revista: Applied Sciences

 
Bakht Zaman, Dusica Marijan and Tetyana Kholodna    
The availability of automatic identification system (AIS) data for tracking vessels has paved the way for improvements in maritime safety and efficiency. However, one of the main challenges in using AIS data is often the low quality of the data. Practica... ver más

 
Andreas Döring, Markus Vogelbacher, Oliver Schneider, Jacob Müller, Stefan Hinz and Jörg Matthes    
Prestressed concrete bridges built between 1960 and 1990 no longer meet today?s requirements due to loads and increasing mileage of higher loads that have increased since the bridges were designed. Prestressed concrete bridges are representative of Germa... ver más
Revista: Infrastructures