Inicio  /  Applied Sciences  /  Vol: 10 Par: 9 (2020)  /  Artículo
ARTÍCULO
TITULO

Predictor Selection for Bacterial Vaginosis Diagnosis Using Decision Tree and Relief Algorithms

Jesús F. Pérez-Gómez    
Juana Canul-Reich    
José Hernández-Torruco and Betania Hernández-Ocaña    

Resumen

Requiring only a few relevant characteristics from patients when diagnosing bacterial vaginosis is highly useful for physicians as it makes it less time consuming to collect these data. This would result in having a dataset of patients that can be more accurately diagnosed using only a subset of informative or relevant features in contrast to using the entire set of features. As such, this is a feature selection (FS) problem. In this work, decision tree and Relief algorithms were used as feature selectors. Experiments were conducted on a real dataset for bacterial vaginosis with 396 instances and 252 features/attributes. The dataset was obtained from universities located in Baltimore and Atlanta. The FS algorithms utilized feature rankings, from which the top fifteen features formed a new dataset that was used as input for both support vector machine (SVM) and logistic regression (LR) algorithms for classification. For performance evaluation, averages of 30 runs of 10-fold cross-validation were reported, along with balanced accuracy, sensitivity, and specificity as performance measures. A performance comparison of the results was made between using the total number of features against using the top fifteen. These results found similar attributes from our rankings compared to those reported in the literature. This study is part of ongoing research that is investigating a range of feature selection and classification methods.

 Artículos similares

       
 
Hoang-Long Nguyen, Thanh-Hai Le, Cao-Thang Pham, Tien-Thinh Le, Lanh Si Ho, Vuong Minh Le, Binh Thai Pham and Hai-Bang Ly    
The main objective of this study is to develop and compare hybrid Artificial Intelligence (AI) approaches, namely Adaptive Network-based Fuzzy Inference System (ANFIS) optimized by Genetic Algorithm (GAANFIS) and Particle Swarm Optimization (PSOANFIS) an... ver más
Revista: Applied Sciences

 
Arsalan Shahid,Muhammad Fahad,Ravi Reddy,Alexey Lastovetsky     Pág. 50 - 65
Performance events or performance monitoring counters (PMCs) are now the dominant predictor variables for modeling energy consumption. Modern hardware processors provide a large set of PMCs. Determination of the best subset of PMCs for energy predictive ... ver más