ARTÍCULO
TITULO

Combination of Reduction Detection Using TOPSIS for Gene Expression Data Analysis

Jogeswar Tripathy    
Rasmita Dash    
Binod Kumar Pattanayak    
Sambit Kumar Mishra    
Tapas Kumar Mishra and Deepak Puthal    

Resumen

In high-dimensional data analysis, Feature Selection (FS) is one of the most fundamental issues in machine learning and requires the attention of researchers. These datasets are characterized by huge space due to a high number of features, out of which only a few are significant for analysis. Thus, significant feature extraction is crucial. There are various techniques available for feature selection; among them, the filter techniques are significant in this community, as they can be used with any type of learning algorithm and drastically lower the running time of optimization algorithms and improve the performance of the model. Furthermore, the application of a filter approach depends on the characteristics of the dataset as well as on the machine learning model. Thus, to avoid these issues in this research, a combination of feature reduction (CFR) is considered designing a pipeline of filter approaches for high-dimensional microarray data classification. Considering four filter approaches, sixteen combinations of pipelines are generated. The feature subset is reduced in different levels, and ultimately, the significant feature set is evaluated. The pipelined filter techniques are Correlation-Based Feature Selection (CBFS), Chi-Square Test (CST), Information Gain (InG), and Relief Feature Selection (RFS), and the classification techniques are Decision Tree (DT), Logistic Regression (LR), Random Forest (RF), and k-Nearest Neighbor (k-NN). The performance of CFR depends highly on the datasets as well as on the classifiers. Thereafter, the Technique for Order of Preference by Similarity to Ideal Solution (TOPSIS) method is used for ranking all reduction combinations and evaluating the superior filter combination among all.

 Artículos similares

       
 
Gary Reyes, Vivian Estrada, Roberto Tolozano-Benites and Victor Maquilón    
The steady increase in data generation by GPS systems poses storage challenges. Previous studies show the need to address trajectory compression. The demand for accuracy and the magnitude of data require effective compression strategies to reduce storage... ver más

 
Xavier Flete, Nicolas Binder, Yannick Bousquet and Sandrine Cros    
In the current study, full-stage unsteady simulations were performed to investigate rotating instability inception mechanisms in a particularly large tip clearance centrifugal compressor with a vaneless diffuser and a volute. Four operating points along ... ver más

 
Vincent Oriez, Nga Thi-Thanh Pham, Jérôme Peydecastaing, Philippe Behra and Pierre-Yves Pontalier    
Sugarcane bagasse (SCB), a by-product of the sugar industry, is composed mainly of cellulose, hemicelluloses, and lignin, and can be used to replace petrochemical polymers in various applications. In this work, SCB was treated under mild alkaline conditi... ver más

 
Nurgali Kadyrbek, Madina Mansurova, Adai Shomanov and Gaukhar Makharova    
This study is devoted to the transcription of human speech in the Kazakh language in dynamically changing conditions. It discusses key aspects related to the phonetic structure of the Kazakh language, technical considerations in collecting the transcribe... ver más

 
Temple Chimuanya Odimegwu, A. B. M. A. Kaish, Maslina Jamil, M. F. M. Zain, Asset Turlanbekov and Ahmed W. Al Zand    
This study evaluated the effect of alum sludge as an alternative to fly ash in fabricating geopolymer paste and mortar. The blending of this industrial waste (alum sludge and fly ash) is not only for the benefit of sustainable construction and disposal o... ver más
Revista: Buildings