ARTÍCULO
TITULO

Data Mining Approach for Breast Cancer Patient Recovery

Tresna Maulana Fahrudin    
Iwan Syarif    
Ali Ridho Barakbah    

Resumen

Breast cancer is the second highest cancer type which attacked Indonesian women. There are several factors known related to encourage an increased risk of breast cancer, but especially in Indonesia that factors often depends on the treatment routinely. This research examines the determinant factors of breast cancer and measures the breast cancer patient data to build the useful classification model using data mining approach.The dataset was originally taken from one of Oncology Hospital in East Java, Indonesia, which consists of 1097 samples, 21 attributes and 2 classes. We used three different feature selection algorithms which are Information Gain, Fisherâ??s Discriminant Ratio and Chi-square to select the best attributes that have great contribution to the data. We applied Hierarchical K-means Clustering to remove attributes which have lowest contribution. Our experiment showed that only 14 of 21 original attributes have the highest contribution factor of the breast cancer data. The clustering algorithmdecreased the error ratio from 44.48% (using 21 original attributes) to 18.32% (using 14 most important attributes).We also applied the classification algorithm to build the classification model and measure the precision of breast cancer patient data. The comparison of classification algorithms between Naïve Bayes and Decision Tree were both given precision reach 92.76% and 92.99% respectively by leave-one-out cross validation. The information based on our data research, the breast cancer patient in Indonesia especially in East Java must be improved by the treatment routinely in the hospital to get early recover of breast cancer which it is related with adherence of patient.

 Artículos similares

       
 
Yilei Wang, Yuelin Hu, Wenliang Xu and Futai Zou    
Dark web vendor identification can be seen as an authorship aliasing problem, aiming to determine whether different accounts on different markets belong to the same real-world vendor, in order to locate cybercriminals involved in dark web market transact... ver más
Revista: Applied Sciences

 
Pablo Caballero, Luis Gonzalez-Abril, Juan A. Ortega and Áurea Simon-Soro    
Endometriosis (EM) is a chronic inflammatory estrogen-dependent disorder that affects 10% of women worldwide. It affects the female reproductive tract and its resident microbiota, as well as distal body sites that can serve as surrogate markers of EM. Cu... ver más
Revista: Algorithms

 
Dimitris C. Gkikas, Marios C. Gkikas and John A. Theodorou    
The specific application of this work involves the development of an intelligent system for diagnosing and treating fish diseases in Greek fish farming. The project aims to enhance the competitiveness of Greek fish farming by addressing the increasing mo... ver más
Revista: Applied Sciences

 
Fangling Leng, Fan Li, Yubin Bao, Tiancheng Zhang and Ge Yu    
As graph models become increasingly prevalent in the processing of scientific data, the exploration of effective methods for the mining of meaningful patterns from large-scale graphs has garnered significant research attention. This paper delves into the... ver más
Revista: Applied Sciences

 
Morag Hunter, D. H. Nimalika Perera, Eustace P. G. Barnes, Hugo V. Lepage, Elias Escobedo-Pacheco, Noorhayati Idros, David Arvidsson-Shukur, Peter J. Newton, Luis de los Santos Valladares, Patrick A. Byrne and Crispin H. W. Barnes    
The expansion of copper mining on the hyper-arid pacific slope of Southern Peru has precipitated growing concern for scarce water resources in the region. Located in the headwaters of the Torata river, in the department of Moquegua, the Cuajone mine, own... ver más
Revista: Water