Inicio  /  Applied Sciences  /  Vol: 12 Par: 24 (2022)  /  Artículo
ARTÍCULO
TITULO

Dichotomization of Multilevel Variables to Detect Hidden Associations

Asdrúbal López-Chau    
Lisbeth Rodriguez-Mazahua    
Farid García-Lamont    
Maricela Quintana-López and Carlos A. Rojas-Hernández    

Resumen

A test of independence is commonly used to determine differences (or associations) between samples in a nominal level measurement. Fisher?s exact test and Chi-square test are two of the most widely applied tests of independence used in the data analyses in different areas such as information technologies, biostatistics, psychology and health sciences. In some cases, contingency tables with null entries (also called random zeros) arise, particularly if the number of samples is small, and the variables analyzed are multilevel. This situation becomes a problem because if one or more entries in a contingency table are zero or have small values, then the tests of independence produce unreliable results. In this paper, we propose a method to address that issue. The method merges one or more levels of the variables analyzed to create contingency tables with only one degree of freedom, avoiding applying a test of independence on contingency tables with random zeros. The source code (Python) of the method is publicly available for use. The results obtained using our method give a complete panorama of the associations between the variables of a data set. To show the effectiveness of our approach to find dependencies between variables, we use four data sets publicly available on the Internet.

 Artículos similares

       
 
Camil Bancioiu and Remus Brad    
This article proposes the usage of the d-separation criterion in Markov Boundary Discovery algorithms, instead of or alongside the statistical tests of conditional independence these algorithms usually rely on. This is a methodological improvement applic... ver más
Revista: Algorithms

 
Tatjana Bolic, Lorenzo Castelli, Andrea De Lorenzo and Fulvio Vascotto    
Availability of different types of data and advances in data-driven techniques open the path to more detailed analyses of various phenomena. Here, we examine the insights that can be gained through the analysis of historical flight trajectories, using da... ver más
Revista: Aerospace

 
Laith R. Sultan, Theodore W. Cary, Maryam Al-Hasani, Mrigendra B. Karmacharya, Santosh S. Venkatesh, Charles-Antoine Assenmacher, Enrico Radaelli and Chandra M. Sehgal    
Machine learning for medical imaging not only requires sufficient amounts of data for training and testing but also that the data be independent. It is common to see highly interdependent data whenever there are inherent correlations between observations... ver más
Revista: AI

 
Ram M. Narayanan, Michael J. Harner, John R. Jendzurski and Nicholas G. Paulter    
Through-wall and through-barrier motion-sensing systems are becoming increasingly important tools to locate humans concealed behind barriers and under rubble. The sensing performance of these systems is best determined with appropriately designed calibra... ver más
Revista: Instruments

 
Daehee Park and Cheoljun Lee    
Because smartphones support various functions, they are carried by users everywhere. Whenever a user believes that a moment is interesting, important, or meaningful to them, they can record a video to preserve such memories. The main problem with video r... ver más
Revista: Applied Sciences